Distributed Inference for Degenerate U-Statistics with Application to One and Two Sample Test
dc.contributor.author | Atta-Asiamah, Ernest | |
dc.date.accessioned | 2021-03-04T21:17:04Z | |
dc.date.available | 2021-03-04T21:17:04Z | |
dc.date.issued | 2020 | |
dc.description.abstract | In many hypothesis testing problems such as one-sample and two-sample test problems, the test statistics are degenerate U-statistics. One of the challenges in practice is the computation of U-statistics for a large sample size. Besides, for degenerate U-statistics, the limiting distribution is a mixture of weighted chi-squares, involving the eigenvalues of the kernel of the U-statistics. As a result, it’s not straightforward to construct the rejection region based on this asymptotic distribution. In this research, we aim to reduce the computation complexity of degenerate U-statistics and propose an easy-to-calibrate test statistic by using the divide-and-conquer method. Specifically, we randomly partition the full n data points into kn even disjoint groups, and compute U-statistics on each group and combine them by averaging to get a statistic Tn. We proved that the statistic Tn has the standard normal distribution as the limiting distribution. In this way, the running time is reduced from O(n^m) to O( n^m/km_n), where m is the order of the one sample U-statistics. Besides, for a given significance level , it’s easy to construct the rejection region. We apply our method to the goodness of fit test and two-sample test. The simulation and real data analysis show that the proposed test can achieve high power and fast running time for both one and two-sample tests. | en_US |
dc.identifier.uri | https://hdl.handle.net/10365/31777 | |
dc.publisher | North Dakota State University | en_US |
dc.rights | NDSU policy 190.6.2 | en_US |
dc.rights.uri | https://www.ndsu.edu/fileadmin/policy/190.pdf | en_US |
dc.subject | degenerate and non degenerate | en_US |
dc.subject | divide-and-conquer | en_US |
dc.subject | goodness-of-fit test | en_US |
dc.subject | hypothesis testing | en_US |
dc.subject | maximum mean discrepancy | en_US |
dc.subject | U-statistics | en_US |
dc.title | Distributed Inference for Degenerate U-Statistics with Application to One and Two Sample Test | en_US |
dc.type | Dissertation | en_US |
ndsu.advisor | Yuan, Mingao | |
ndsu.college | Science and Mathematics | en_US |
ndsu.degree | Doctor of Philosophy (PhD) | en_US |
ndsu.department | Statistics | en_US |
ndsu.program | Statistics | en_US |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Distributed Inference for Degenerate U-Statistics with Application to One and Two Sample Test.pdf
- Size:
- 1.5 MB
- Format:
- Adobe Portable Document Format
- Description:
- Distributed Inference for Degenerate U-Statistics with Application to One and Two Sample Test
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.63 KB
- Format:
- Item-specific license agreed to upon submission
- Description: