Show simple item record

dc.contributor.authorOlson, Christopher
dc.description.abstractEntropy is a measure of the randomness of a system state. This quantity gives us a measure of uncertainty that is associated with each particular observation belonging to a specific cluster. We examine this property and its potential use in analyzing high dimension datasets. Entropy proves most interesting in identifying possible dimensions that do not contribute meaningful classification to the clusters present. We can remove the dimension(s) found which are the least important and generalize this idea to a procedure. After identifying all the dimensions that should be eliminated from the dataset, we then compare its ability in recovering the true classification of the observations versus the estimated classification of the data. From the results obtained and shown in this paper, it is clear that entropy is a good candidate for a criterion in variable reduction.en_US
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU Policy 190.6.2
dc.titleEntropy as a Criterion for Variable Reduction in Cluster Dataen_US
dc.typeThesisen_US
dc.date.accessioned2017-11-05T17:32:21Z
dc.date.available2017-11-05T17:32:21Z
dc.date.issued2012
dc.identifier.urihttps://hdl.handle.net/10365/26760
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degreeMaster of Science (MS)en_US
ndsu.collegeScience and Mathematicsen_US
ndsu.departmentStatisticsen_US
ndsu.programApplied Statisticsen_US
ndsu.advisorMelnykov, Volodymyr


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record