On K-Means Clustering Using Mahalanobis Distance

dc.contributor.authorNelson, Joshua
dc.date.accessioned2017-11-05T23:06:44Z
dc.date.available2017-11-05T23:06:44Z
dc.date.issued2012
dc.description.abstractA problem that arises quite frequently in statistics is that of identifying groups, or clusters, of data within a population or sample. The most widely used procedure to identify clusters in a set of observations is known as K-Means. The main limitation of this algorithm is that it uses the Euclidean distance metric to assign points to clusters. Hence, this algorithm operates well only if the covariance structures of the clusters are nearly spherical and homogeneous in nature. To remedy this shortfall in the K-Means algorithm the Mahalanobis distance metric was used to capture the variance structure of the clusters. The issue with using Mahalanobis distances is that the accuracy of the distance is sensitive to initialization. If this method serves as a signicant improvement over its competitors, then it will provide a useful tool for analyzing clusters.en_US
dc.identifier.urihttps://hdl.handle.net/10365/26766
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU Policy 190.6.2
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdf
dc.titleOn K-Means Clustering Using Mahalanobis Distanceen_US
dc.typeThesisen_US
ndsu.advisorMelnykov, Volodymyr
ndsu.collegeScience and Mathematicsen_US
ndsu.degreeMaster of Science (MS)en_US
ndsu.departmentStatisticsen_US
ndsu.programStatisticsen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
On K-Means Clustering Using Mahalanobis Distance.pdf
Size:
334.04 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.63 KB
Format:
Item-specific license agreed to upon submission
Description: