Show simple item record

dc.contributor.authorLi, Hanzhe
dc.description.abstractTwitter are a new source of information for data mining techniques. Messages posted through Twitter provide a major information source to gauge public sentiment on topics ranging from politics to fashion trends. The purpose of this paper is to analyze the Twitter tweets to discern the opinions of users regarding Genetically Modified Organisms (GMOs). We examine the effectiveness of several classifiers, Multinomial Naïve Bayes, Bernoulli Naïve Bayes, Logistic Regression and Linear Support Vector Classifier (SVC) in identifying a positive, negative or neutral category on a tweet corpus. Additionally, we use three datasets in this experiment to examine which dataset has the best score. Comparing the classifiers, we discovered that GMO_NDSU has the highest score in each classifier of my experiment among three datasets, and Linear SVC had the highest consistent accuracy by using bigrams as feature extraction and Term Frequency, Chi Square as feature selection.en_US
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU Policy 190.6.2
dc.titleSentiment Analysis and Opinion Mining on Twitter with GMO Keyworden_US
dc.typeThesisen_US
dc.date.accessioned2016-07-25T20:22:20Z
dc.date.available2016-07-25T20:22:20Z
dc.date.issued2016
dc.identifier.urihttp://hdl.handle.net/10365/25787
dc.subject.lcshData mining.
dc.subject.lcshGenetically modified foods -- Public opinion.
dc.subject.lcshTwitter.
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degreeMaster of Science (MS)en_US
ndsu.collegeEngineeringen_US
ndsu.departmentComputer Scienceen_US
ndsu.programComputer Scienceen_US
ndsu.advisorNygard, Kendall


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record