Mining Novel Knowledge from Biomedical Literature using Statistical Measures and Domain Knowledge
dc.contributor.author | Jha, Kishlay | |
dc.date.accessioned | 2018-05-07T17:56:45Z | |
dc.date.available | 2018-05-07T17:56:45Z | |
dc.date.issued | 2016 | en_US |
dc.description.abstract | The problem of inferring novel knowledge from implicit facts by logically connecting independent fragments of literature is known as Literature Based Discovery (LBD). In LBD, discovering hidden links requires determining the relevance between concepts using appropriate information measures. In this study, to discover interesting and inherent links latent in large corpora, nine distinct methods, comprising variants of statistical information measures and semantic knowledge derived from a domain ontology, are designed and compared. A series of experiments is performed and analyzed for the proposed methods. In addition, a new preprocessing strategy is proposed that removes terms with little chance of constituting a new discovery. Finally, an organized list of final concepts deemed worthy of scientific investigation is provided to the user. Overall, our research presents a comprehensive analysis of how different statistical information measures and semantic knowledge affect the knowledge discovery procedure. | en_US |
dc.identifier.uri | https://hdl.handle.net/10365/28085 | |
dc.publisher | North Dakota State University | en_US |
dc.rights | NDSU Policy 190.6.2 | |
dc.rights.uri | https://www.ndsu.edu/fileadmin/policy/190.pdf | |
dc.title | Mining Novel Knowledge from Biomedical Literature using Statistical Measures and Domain Knowledge | en_US |
dc.type | Thesis | en_US |
ndsu.advisor | Jin, Wei | |
ndsu.college | Engineering | en_US |
ndsu.degree | Master of Science (MS) | en_US |
ndsu.department | Computer Science | en_US |
ndsu.program | Computer Science | en_US |
Files
Original bundle
- Name: Mining Novel Knowledge from Biomedical Literature using Statistical Measures and Domain Knowledge.pdf
- Size: 860.32 KB
- Format: Adobe Portable Document Format