dc.contributor.author: Upadhyaya, Sudhi
dc.description.abstract: Statistical models such as Logistic Regression (LR), Neural Network (NN) and Support Vector Machines (SVM) are often fit to datasets with missing values when making inferences about a population. Missing data can severely skew such inferences and reduce the efficiency of the model. Our objective was to identify which of LR, NN and SVM is most robust in the presence of missing data. Observations were simulated using Monte Carlo methods, and missing data were introduced at random at the 10% level. Missing values were imputed using single mode imputation. Simple random samples of 120, 240 and 500 observations were drawn, and the three models were fit under two scenarios. Results showed that SVM performed far better than the LR and NN models. However, the classification accuracy of SVM gradually decreased as sample size increased. [en_US]
dc.publisher: North Dakota State University [en_US]
dc.rights: NDSU Policy 190.6.2
dc.title: Comparison of Classification Rates among Logistic Regression, Neural Network and Support Vector Machines in the Presence of Missing Data [en_US]
dc.type: Master's paper [en_US]
dc.date.accessioned: 2014-08-26T20:40:09Z
dc.date.available: 2014-08-26T20:40:09Z
dc.date.issued: 2014
dc.identifier.uri: http://hdl.handle.net/10365/23948
dc.subject.lcsh: Mathematical statistics.
dc.subject.lcsh: Missing observations (Statistics)
dc.subject.lcsh: Logistic regression analysis.
dc.subject.lcsh: Neural networks (Computer science)
dc.subject.lcsh: Support vector machines.
dc.rights.uri: https://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degree: Master of Science (MS) [en_US]
ndsu.college: Science and Mathematics [en_US]
ndsu.department: Statistics [en_US]
ndsu.program: Statistics [en_US]
ndsu.advisor: Magel, Rhonda
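
The abstract describes introducing missing values at random at the 10% level and filling them by single mode imputation. The following is only a minimal sketch of those two steps, not the author's code. It assumes a single categorical variable with three hypothetical levels, and it assumes "randomly at 10% level" means blanking exactly 10% of the entries chosen uniformly at random. The function names `introduce_missing` and `mode_impute` are illustrative.

```python
import random
from collections import Counter


def introduce_missing(values, rate=0.10, seed=0):
    """Return a copy of `values` with round(rate * n) entries set to None.

    Assumption: positions are chosen uniformly at random, without replacement.
    """
    rng = random.Random(seed)
    n_missing = round(rate * len(values))
    missing_idx = set(rng.sample(range(len(values)), n_missing))
    return [None if i in missing_idx else v for i, v in enumerate(values)]


def mode_impute(values):
    """Replace every None with the most frequent observed value (the mode)."""
    observed = [v for v in values if v is not None]
    mode = Counter(observed).most_common(1)[0][0]
    return [mode if v is None else v for v in values]


# Hypothetical categorical sample of 120 observations (the smallest
# sample size mentioned in the abstract); the levels are made up.
rng = random.Random(42)
sample = [rng.choice(["a", "b", "c"]) for _ in range(120)]

with_missing = introduce_missing(sample, rate=0.10, seed=1)
imputed = mode_impute(with_missing)
```

Mode imputation pulls every missing entry toward the most common level, which may be one reason a classifier's accuracy shifts after imputation. The sketch does not attempt to reproduce the study's simulation design or its two scenarios.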

