Optimizing Prediction Power of RNA-seq on Intrinsic Characteristics in Breast Cancer

dc.contributor.authorLiu, Yuan
dc.date.accessioned2022-07-14T20:55:31Z
dc.date.available2022-07-14T20:55:31Z
dc.date.issued2022
dc.description.abstractBreast cancer is the most common cancer in women worldwide, and accurate and early detection of breast cancer is vital in characterizing the disease. Transcriptomic expression is embedded abundant tumor and cell state information. However, selecting a good pipeline in applying mRNA expression is critical in downstream characteristics prediction. We designed a study that focused on determining the best combinations of preprocessing processes in predictions. We tested six normalization methods, two gene selection methods, and over ten machine learning algorithms. By using appropriate evaluation metrics, we recommend using FPKM normalization method combined with either gene selection method and employing RF for the purpose of breast cancer downstream prediction.en_US
dc.identifier.urihttps://hdl.handle.net/10365/32793
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU policy 190.6.2en_US
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdfen_US
dc.titleOptimizing Prediction Power of RNA-seq on Intrinsic Characteristics in Breast Canceren_US
dc.typeMaster's Paperen_US
ndsu.advisorOrr, Megan
ndsu.collegeScience and Mathematicsen_US
ndsu.degreeMaster of Science (MS)en_US
ndsu.departmentStatisticsen_US
ndsu.programStatisticsen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Optimizing Prediction Power of RNA-seq on Intrinsic Characteristics in Breast Cancer.pdf
Size:
453.89 KB
Format:
Adobe Portable Document Format
Description:
Optimizing Prediction Power of RNA-seq on Intrinsic Characteristics in Breast Cancer

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.63 KB
Format:
Item-specific license agreed to upon submission
Description: