Show simple item record

dc.contributor.authorNan, Yehong
dc.description.abstractMany real-world network data can be formulated as graphs, where a binary relation exists between nodes. One of the fundamental problems in network data analysis is community detection, clustering the nodes into different groups. Statistically, this problem can be formulated as hypothesis testing: under the null hypothesis, there is no community structure, while under the alternative hypothesis, community structure exists. One is of the method is to use the largest eigenvalues of the scaled adjacency matrix proposed by Bickel and Sarkar (2016), which works for dense graph. Another one is the subgraph counting method proposed by Gao and Lafferty (2017a), valid for sparse network. In this paper, firstly, we empirically study the BS or GL methods to see whether either of them works for moderately sparse network; secondly, we propose a subsampling method to reduce the computation of the BS method and run simulations to evaluate the performance.en_US
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU policy 190.6.2en_US
dc.titleEmpirical Study of Two Hypothesis Test Methods for Community Structure in Networksen_US
dc.typeThesisen_US
dc.date.accessioned2020-11-09T22:07:41Z
dc.date.available2020-11-09T22:07:41Z
dc.date.issued2019
dc.identifier.urihttps://hdl.handle.net/10365/31640
dc.subjectBickel and Sarkar methoden_US
dc.subjectcommunity networken_US
dc.subjectdegree-corrected SBMen_US
dc.subjectErdős–Rényi modelen_US
dc.subjectGao and Lafferty methoden_US
dc.subjectsparse networken_US
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdfen_US
ndsu.degreeMaster of Science (MS)en_US
ndsu.collegeScience and Mathematicsen_US
ndsu.departmentStatisticsen_US
ndsu.programApplied Statisticsen_US
ndsu.advisorYuan, Mingao


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record