Entropy as a Criterion for Variable Reduction in Cluster Data

dc.contributor.authorOlson, Christopher
dc.date.accessioned2017-11-05T17:32:21Z
dc.date.available2017-11-05T17:32:21Z
dc.date.issued2012
dc.description.abstractEntropy is a measure of the randomness of a system state. This quantity gives us a measure of uncertainty that is associated with each particular observation belonging to a specific cluster. We examine this property and its potential use in analyzing high dimension datasets. Entropy proves most interesting in identifying possible dimensions that do not contribute meaningful classification to the clusters present. We can remove the dimension(s) found which are the least important and generalize this idea to a procedure. After identifying all the dimensions that should be eliminated from the dataset, we then compare its ability in recovering the true classification of the observations versus the estimated classification of the data. From the results obtained and shown in this paper, it is clear that entropy is a good candidate for a criterion in variable reduction.en_US
dc.identifier.urihttps://hdl.handle.net/10365/26760
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU Policy 190.6.2
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdf
dc.titleEntropy as a Criterion for Variable Reduction in Cluster Dataen_US
dc.typeThesisen_US
ndsu.advisorMelnykov, Volodymyr
ndsu.collegeScience and Mathematicsen_US
ndsu.degreeMaster of Science (MS)en_US
ndsu.departmentStatisticsen_US
ndsu.programApplied Statisticsen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Entropy as a Criterion for Variable Reduction in Cluster Data.pdf
Size:
402.82 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.63 KB
Format:
Item-specific license agreed to upon submission
Description: