Show simple item record

dc.contributor.authorEl Radie, Eihab Salah
dc.description.abstractFrequent graph mining has received considerable attention from researchers. Existing algorithms for frequent subgraph mining do not scale for large networks, and take hours to finish. Mining multiple gene coexpressions networks allows for identifying context-specific modules. Frequent subnetworks represent essential biological modules. In this thesis, we propose two algorithms for mining frequent subgraphs. In the first algorithm, we propose a parallel algorithm for mining maximal frequent subgraphs from gene coexpression networks. Despite the algorithm’s parallelization, it takes much time and it does not allow relaxation. This inspired us to develop a second algorithm that solves those problems. In the second algorithm, we propose a greedy approach for mining approximate frequent subgraphs. Experiments on real tissue-specific RNA-seq expression networks and synthetic data demonstrate the effectiveness of the proposed algorithms. Moreover, biological enrichment analysis shows that the reported patterns are biologically relevant and enriched with known biological processes and KEGG pathways.en_US
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU Policy 190.6.2
dc.titleScalable Algorithms for Mining Maximal Quasi Frequent Subnetworksen_US
dc.typeThesisen_US
dc.date.accessioned2018-07-30T19:12:06Z
dc.date.available2018-07-30T19:12:06Z
dc.date.issued2018en_US
dc.identifier.urihttps://hdl.handle.net/10365/28735
dc.identifier.orcid0000-0002-1473-5070
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degreeMaster of Science (MS)en_US
ndsu.collegeEngineeringen_US
ndsu.departmentComputer Scienceen_US
ndsu.programComputer Scienceen_US
ndsu.advisorSalem, Saeed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record