Statistical Modeling of Large-scale Network Data
dc.contributor.author | Zhang, Yuhua | |
dc.date.accessioned | 2023-09-22T15:23:45Z | |
dc.date.available | 2023-09-22T15:23:45Z | |
dc.date.issued | 2023 | |
dc.date.submitted | 2023 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/177798 | |
dc.description.abstract | The explosive growth of network data in various domains has spurred extensive research on statistical modeling techniques to unravel the underlying patterns of complex interaction processes. This thesis addresses the challenges and opportunities presented by large-scale network data. In particular, the focus is on the development of novel statistical models to gain insights into community structures and predict future links. The research comprises three key projects, namely community detection within edge exchangeable model, community detection in the scenario where interactions exhibit edge clustering, and conformal link prediction. This work advances the existing research in several directions. First, it extends conventional node-centric models for network analysis to interaction process-based models using the concept of edge exchangeability. Second, scalable algorithms are implemented and applied to real-world data that are composed of millions of interactions. Third, it transforms the elucidation of the network structures into a predictive tool for future interactions using conformal prediction. Throughout this thesis, extensive experiments are conducted on a series of simulation data and real-world datasets, such as TalkLife and Enron data, to evaluate the performance and effectiveness of the proposed methodologies. The results demonstrate the superiority of the developed statistical models in capturing the complexities of large-scale network data and providing valuable insights into community structures and link prediction. | |
dc.language.iso | en_US | |
dc.subject | network data | |
dc.subject | statistical modeling | |
dc.subject | community detection | |
dc.subject | link prediction | |
dc.title | Statistical Modeling of Large-scale Network Data | |
dc.type | Thesis | |
dc.description.thesisdegreename | PhD | en_US |
dc.description.thesisdegreediscipline | Biostatistics | |
dc.description.thesisdegreegrantor | University of Michigan, Horace H. Rackham School of Graduate Studies | |
dc.contributor.committeemember | Dempsey, Walter | |
dc.contributor.committeemember | Zoellner, Sebastian K | |
dc.contributor.committeemember | Zhu, Ji | |
dc.contributor.committeemember | Wen, Xiaoquan William | |
dc.subject.hlbsecondlevel | Statistics and Numeric Data | |
dc.subject.hlbtoplevel | Health Sciences | |
dc.subject.hlbtoplevel | Science | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/177798/1/zyuhua_1.pdf | |
dc.identifier.doi | https://dx.doi.org/10.7302/8255 | |
dc.identifier.orcid | 0000-0002-5006-4366 | |
dc.identifier.name-orcid | Zhang, Yuhua; 0000-0002-5006-4366 | en_US |
dc.working.doi | 10.7302/8255 | en |
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.