Show simple item record

dc.contributor.authorChakraborty, Ushashi
dc.description.abstractClassification can be used to predict unknown functions of proteins by using known function information. In some cases, multiple sets of data are available for classification where prediction is only part of the problem, and knowing the most reliable source for prediction is also relevant. Our goal is to develop classification techniques to find the most predictive of the multiple data sets that we have in this project. We use existing classification techniques like linear and quadratic classifications and statistical relevance measures like posterior and log p analysis in our proposed algorithm, which is able to find the data set that is expected to give the best prediction. The proposed algorithm is used on experimental readings during cell cycle of yeast and it predicts the genes that participate in cell-cycle regulation and the type of experiment that provides evidence of cell cycle involvement for any particular gene.en_US
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU Policy 190.6.2
dc.titleFinding the Most Predictive Data Source in Biological Dataen_US
dc.typeThesisen_US
dc.date.accessioned2017-10-12T14:35:30Z
dc.date.available2017-10-12T14:35:30Z
dc.date.issued2013
dc.identifier.urihttps://hdl.handle.net/10365/26567
dc.subject.lcshData miningen_US
dc.subject.lcshCell cycleen_US
dc.subject.lcshYeasten_US
dc.subject.lcshScience -- Methodologyen_US
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degreeMaster of Science (MS)en_US
ndsu.collegeEngineeringen_US
ndsu.departmentComputer Scienceen_US
ndsu.programComputer Scienceen_US
ndsu.advisorDenton, Anne M.


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record