Show simple item record

dc.contributor.authorLiu, Yuan
dc.description.abstractBreast cancer is the most common cancer in women worldwide, and accurate and early detection of breast cancer is vital in characterizing the disease. Transcriptomic expression is embedded abundant tumor and cell state information. However, selecting a good pipeline in applying mRNA expression is critical in downstream characteristics prediction. We designed a study that focused on determining the best combinations of preprocessing processes in predictions. We tested six normalization methods, two gene selection methods, and over ten machine learning algorithms. By using appropriate evaluation metrics, we recommend using FPKM normalization method combined with either gene selection method and employing RF for the purpose of breast cancer downstream prediction.en_US
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU policy 190.6.2en_US
dc.titleOptimizing Prediction Power of RNA-seq on Intrinsic Characteristics in Breast Canceren_US
dc.typeMaster's Paperen_US
dc.date.accessioned2022-07-14T20:55:31Z
dc.date.available2022-07-14T20:55:31Z
dc.date.issued2022
dc.identifier.urihttps://hdl.handle.net/10365/32793
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdfen_US
ndsu.degreeMaster of Science (MS)en_US
ndsu.collegeScience and Mathematicsen_US
ndsu.departmentStatisticsen_US
ndsu.programStatisticsen_US
ndsu.advisorOrr, Megan


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record