Show simple item record

Statistical Modeling of Large-scale Network Data

dc.contributor.authorZhang, Yuhua
dc.date.accessioned2023-09-22T15:23:45Z
dc.date.available2023-09-22T15:23:45Z
dc.date.issued2023
dc.date.submitted2023
dc.identifier.urihttps://hdl.handle.net/2027.42/177798
dc.description.abstractThe explosive growth of network data in various domains has spurred extensive research on statistical modeling techniques to unravel the underlying patterns of complex interaction processes. This thesis addresses the challenges and opportunities presented by large-scale network data. In particular, the focus is on the development of novel statistical models to gain insights into community structures and predict future links. The research comprises three key projects, namely community detection within edge exchangeable model, community detection in the scenario where interactions exhibit edge clustering, and conformal link prediction. This work advances the existing research in several directions. First, it extends conventional node-centric models for network analysis to interaction process-based models using the concept of edge exchangeability. Second, scalable algorithms are implemented and applied to real-world data that are composed of millions of interactions. Third, it transforms the elucidation of the network structures into a predictive tool for future interactions using conformal prediction. Throughout this thesis, extensive experiments are conducted on a series of simulation data and real-world datasets, such as TalkLife and Enron data, to evaluate the performance and effectiveness of the proposed methodologies. The results demonstrate the superiority of the developed statistical models in capturing the complexities of large-scale network data and providing valuable insights into community structures and link prediction.
dc.language.isoen_US
dc.subjectnetwork data
dc.subjectstatistical modeling
dc.subjectcommunity detection
dc.subjectlink prediction
dc.titleStatistical Modeling of Large-scale Network Data
dc.typeThesis
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineBiostatistics
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeememberDempsey, Walter
dc.contributor.committeememberZoellner, Sebastian K
dc.contributor.committeememberZhu, Ji
dc.contributor.committeememberWen, Xiaoquan William
dc.subject.hlbsecondlevelStatistics and Numeric Data
dc.subject.hlbtoplevelHealth Sciences
dc.subject.hlbtoplevelScience
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/177798/1/zyuhua_1.pdf
dc.identifier.doihttps://dx.doi.org/10.7302/8255
dc.identifier.orcid0000-0002-5006-4366
dc.identifier.name-orcidZhang, Yuhua; 0000-0002-5006-4366en_US
dc.working.doi10.7302/8255en
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.