Naïve Bayes Classifier: A MapReduce Approach

Zheng, Songtao

dc.contributor.author	Zheng, Songtao
dc.description.abstract	Machine learning algorithms have the advantage of making use of the powerful Hadoop distributed computing platform and the MapReduce programming model to process data in parallel. Many machine learning algorithms have been investigated to be transformed to the MapReduce paradigm in order to make use of the Hadoop Distributed File System (HDFS). Naïve Bayes classifier is one of the supervised learning classification algorithm that can be programmed in form of MapReduce. In our study, we build a Naïve Bayes MapReduce model and evaluate the classifier on five datasets based on the prediction accuracy. Also, a scalability analysis is conducted to see the speedup of the data processing time with the increasing number of nodes in the cluster. Results show that running the Naïve Bayes MapReduce model across multiple nodes can save considerate amount of time compared with running the model against a single node, without sacrificing the classification accuracy.	en_US
dc.publisher	North Dakota State University	en_US
dc.rights	NDSU Policy 190.6.2
dc.title	Naïve Bayes Classifier: A MapReduce Approach	en_US
dc.type	Master's paper	en_US
dc.date.accessioned	2014-12-23T14:55:46Z
dc.date.available	2014-12-23T14:55:46Z
dc.date.issued	2014
dc.identifier.uri	http://hdl.handle.net/10365/24752
dc.subject.lcsh	Big data.	en_US
dc.subject.lcsh	Machine learning.	en_US
dc.subject.lcsh	Apache Hadoop.	en_US
dc.subject.lcsh	MapReduce (Computer file)	en_US
dc.subject.lcsh	Bayesian statistical decision theory.	en_US
dc.rights.uri	https://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degree	Master of Science (MS)	en_US
ndsu.college	Engineering	en_US
ndsu.department	Computer Science	en_US
ndsu.program	Computer Science	en_US
ndsu.advisor	Ludwig, Simone

Files in this item

Name:: Naïve Bayes Classifier - A ...
Size:: 995.0Kb
Format:: PDF
Description:: Naïve Bayes Classifier: A Mapreduce ...

View/Open

This item appears in the following Collection(s)

Computer Science Masters Papers

Show simple item record