Show simple item record

dc.contributor.author: Nelson, Joshua
dc.description.abstract: A problem that arises quite frequently in statistics is that of identifying groups, or clusters, of data within a population or sample. The most widely used procedure for identifying clusters in a set of observations is known as K-Means. The main limitation of this algorithm is that it uses the Euclidean distance metric to assign points to clusters. Hence, the algorithm performs well only if the covariance structures of the clusters are nearly spherical and homogeneous. To remedy this shortfall in the K-Means algorithm, the Mahalanobis distance metric was used to capture the variance structure of the clusters. The issue with using Mahalanobis distances is that the accuracy of the distance is sensitive to initialization. If this method serves as a significant improvement over its competitors, then it will provide a useful tool for analyzing clusters. (en_US)
dc.publisher: North Dakota State University (en_US)
dc.rights: NDSU Policy 190.6.2
dc.title: On K-Means Clustering Using Mahalanobis Distance (en_US)
dc.type: Thesis (en_US)
dc.date.accessioned: 2017-11-05T23:06:44Z
dc.date.available: 2017-11-05T23:06:44Z
dc.date.issued: 2012
dc.identifier.uri: https://hdl.handle.net/10365/26766
dc.rights.uri: https://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degree: Master of Science (MS) (en_US)
ndsu.college: Science and Mathematics (en_US)
ndsu.department: Statistics (en_US)
ndsu.program: Statistics (en_US)
ndsu.advisor: Melnykov, Volodymyr
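
The abstract above describes swapping the Euclidean metric in K-Means for per-cluster Mahalanobis distances so that elongated, non-spherical covariance structures can be captured. What follows is a minimal sketch of that idea, assuming NumPy; the function name mahalanobis_kmeans, the identity-covariance initialization, and the small regularization term are illustrative choices, not the specific procedure developed in the thesis.

    # A minimal sketch of K-Means-style clustering with Mahalanobis distances.
    import numpy as np


    def mahalanobis_kmeans(X, k, n_iter=50, seed=0):
        """Cluster the rows of X into k groups using per-cluster Mahalanobis distances."""
        rng = np.random.default_rng(seed)
        n, d = X.shape

        # Initialization (illustrative): random cluster means and identity covariances,
        # so the first assignment step reduces to ordinary Euclidean K-Means.
        means = X[rng.choice(n, size=k, replace=False)]
        covs = np.stack([np.eye(d) for _ in range(k)])

        for _ in range(n_iter):
            # Assignment step: squared Mahalanobis distance from every point to every cluster.
            dist2 = np.empty((n, k))
            for j in range(k):
                diff = X - means[j]
                inv_cov = np.linalg.inv(covs[j])
                dist2[:, j] = np.einsum("ni,ij,nj->n", diff, inv_cov, diff)
            labels = dist2.argmin(axis=1)

            # Update step: recompute each cluster's mean and covariance from its members.
            for j in range(k):
                members = X[labels == j]
                if len(members) > d:  # need enough points for a usable covariance estimate
                    means[j] = members.mean(axis=0)
                    covs[j] = np.cov(members, rowvar=False) + 1e-6 * np.eye(d)

        return labels, means, covs


    if __name__ == "__main__":
        # Two elongated, non-spherical clusters of the kind that trouble Euclidean K-Means.
        rng = np.random.default_rng(1)
        a = rng.multivariate_normal([0, 0], [[4.0, 3.5], [3.5, 4.0]], size=200)
        b = rng.multivariate_normal([6, 0], [[4.0, -3.5], [-3.5, 4.0]], size=200)
        labels, means, covs = mahalanobis_kmeans(np.vstack([a, b]), k=2)
        print(np.bincount(labels))

As the abstract notes, a procedure like this is sensitive to initialization: the covariance estimates are only as good as the initial assignments, so in practice the sketch would typically be run from several random starts.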

