Search

Now showing items 1-3 of 3

A Map Reduce Approach of K-Means++ Algorithm with Initial Equidistant Centers

Bhattacharyya, Krittika (North Dakota State University, 2015)

Data clustering has received considerable attention in many applications, such as data mining, document retrieval, image segmentation and pattern classification. The enlarging volumes of information emerging by the ...

Performance Comparison of Apache Spark MLlib

Sharma, Pallavi (North Dakota State University, 2018)

This study attempts to understand the performance of Apache Spark and the MLlib platform. To this end, the cluster computing system of Apache Spark is set up and five supervised machine learning algorithms (Naïve-Bayes, ...

Study of Similarity Coefficients Using MapReduce Programming Model

Nayakam, GhanaShyam Nath (North Dakota State University, 2013)

MapReduce is a programming model for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that ...
