Show simple item record

Capturing Political Communication Online Using Image and Text Data: A Deep Learning Approach

dc.contributor.authorPineda, Alejandro
dc.date.accessioned2023-05-25T14:47:49Z
dc.date.available2023-05-25T14:47:49Z
dc.date.issued2023
dc.date.submitted2023
dc.identifier.urihttps://hdl.handle.net/2027.42/176652
dc.description.abstractSocial media data enables political scientists to observe phenomena that have been otherwise difficult to capture. The scale and structure of such data is problematic, however, as sorting social media posts by hand is a prohibitively costly endeavor. For instance, there are over 500 million tweets posted per day, consisting of text, image, gif, and video content. This has created a technology gap between what social scientists want to do, conceptually, and what they can do, computationally. This study develops text- and multimodal (text and image) classification technology. Such methods are used to investigate questions in algorithmic bias, election experiences, and Black Lives Matter protest activity. Multiple machine learning algorithms -- called convolutional neural networks or textit{deep learning} models -- were developed. These models were trained on facial images and tweet text. Results indicate that deep learning achieves high accuracy on training data; performance declines when the machine attempts to predict the previously unseen validation set. These algorithms can lack predictive power. Deep learning shows promise for automated content analysis, but more work must be done to curate theoretically motivated training data. Social scientists should focus on features in the data that best differentiate categories of interest. This study contributes to larger trends in computational social science that seek to apply machine learning methods to problems in political science. Even the most advanced methodology, however, must be wrapped in strong theory and substantively interesting questions.
dc.language.isoen_US
dc.subjectmachine learning
dc.subjectcomputational social science
dc.subjectimage and text analysis
dc.subjectmultimodal deep learning
dc.subjectblack lives matter
dc.subjectalgorithmic bias
dc.titleCapturing Political Communication Online Using Image and Text Data: A Deep Learning Approach
dc.typeThesis
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplinePolitical Science
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeememberMebane Jr, Walter R
dc.contributor.committeememberPasek, Josh
dc.contributor.committeememberFariss, Christopher Jennings
dc.contributor.committeememberHutchings, Vincent L
dc.contributor.committeememberOstfeld, Mara Cecilia
dc.subject.hlbsecondlevelCommunications
dc.subject.hlbsecondlevelPolitical Science
dc.subject.hlbsecondlevelStatistics and Numeric Data
dc.subject.hlbtoplevelSocial Sciences
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/176652/1/ajpineda_1.pdf
dc.identifier.doihttps://dx.doi.org/10.7302/7501
dc.identifier.orcid0000-0003-0162-3042
dc.identifier.name-orcidPineda, Alejandro; 0000-0003-0162-3042en_US
dc.working.doi10.7302/7501en
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.