A Map Reduce Approach of K-Means++ Algorithm with Initial Equidistant Centers
(North Dakota State University, 2015)
Data clustering has received considerable attention in many applications, such as data mining, document retrieval, image segmentation and pattern classification. The enlarging volumes of information emerging by the ...
Performance Comparison of Apache Spark MLlib
(North Dakota State University, 2018)
This study attempts to understand the performance of Apache Spark and the MLlib platform. To this end, the cluster computing system of Apache Spark is set up and five supervised machine learning algorithms (Naïve-Bayes, ...
Study of Similarity Coefficients Using MapReduce Programming Model
(North Dakota State University, 2013)
MapReduce is a programming model for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that ...