A Map Reduce Approach of K-Means++ Algorithm with Initial Equidistant Centers

Bhattacharyya, Krittika

dc.contributor.author	Bhattacharyya, Krittika
dc.description.abstract	Data clustering has received considerable attention in many applications, such as data mining, document retrieval, image segmentation, and pattern classification. The growing volume of information produced by advances in technology makes clustering very large-scale data a challenging task. To deal with this problem, many researchers have tried to design efficient parallel clustering algorithms. In this paper, we propose a parallel k-means++ clustering algorithm based on MapReduce, which is as simple as traditional k-means yet more powerful because the initial centroid selection is not random. It follows a formula to place the initial centroids at equal distances and then iterates like k-means until it converges and produces the final clusters. This choice of initial centroids makes the algorithm faster, and parallelization makes it more scalable. The experimental results demonstrate that the proposed algorithm scales well and processes large datasets efficiently.	en_US
dc.publisher	North Dakota State University	en_US
dc.rights	NDSU Policy 190.6.2
dc.title	A Map Reduce Approach of K-Means++ Algorithm with Initial Equidistant Centers	en_US
dc.type	Master's paper	en_US
dc.date.accessioned	2015-11-09T15:23:18Z
dc.date.available	2015-11-09T15:23:18Z
dc.date.issued	2015
dc.identifier.uri	http://hdl.handle.net/10365/25350
dc.subject.lcsh	Cluster analysis.	en_US
dc.subject.lcsh	Big data.	en_US
dc.subject.lcsh	Computer algorithms.	en_US
dc.rights.uri	https://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degree	Master of Science (MS)	en_US
ndsu.college	Engineering	en_US
ndsu.department	Computer Science	en_US
ndsu.program	Computer Science	en_US
ndsu.advisor	Ludwig, Simone

Files in this item

Name: A Map Reduce Approach of K-Means++ ...
Size: 901.2Kb
Format: PDF
Description: A Map Reduce Approach of K-Means++ ...

This item appears in the following Collection(s)

Computer Science Masters Papers
