Capturing Political Communication Online Using Image and Text Data: A Deep Learning Approach

Pineda, Alejandro

Capturing Political Communication Online Using Image and Text Data: A Deep Learning Approach

dc.contributor.author	Pineda, Alejandro
dc.date.accessioned	2023-05-25T14:47:49Z
dc.date.available	2023-05-25T14:47:49Z
dc.date.issued	2023
dc.date.submitted	2023
dc.identifier.uri	https://hdl.handle.net/2027.42/176652
dc.description.abstract	Social media data enables political scientists to observe phenomena that have been otherwise difficult to capture. The scale and structure of such data is problematic, however, as sorting social media posts by hand is a prohibitively costly endeavor. For instance, there are over 500 million tweets posted per day, consisting of text, image, gif, and video content. This has created a technology gap between what social scientists want to do, conceptually, and what they can do, computationally. This study develops text- and multimodal (text and image) classification technology. Such methods are used to investigate questions in algorithmic bias, election experiences, and Black Lives Matter protest activity. Multiple machine learning algorithms -- called convolutional neural networks or textit{deep learning} models -- were developed. These models were trained on facial images and tweet text. Results indicate that deep learning achieves high accuracy on training data; performance declines when the machine attempts to predict the previously unseen validation set. These algorithms can lack predictive power. Deep learning shows promise for automated content analysis, but more work must be done to curate theoretically motivated training data. Social scientists should focus on features in the data that best differentiate categories of interest. This study contributes to larger trends in computational social science that seek to apply machine learning methods to problems in political science. Even the most advanced methodology, however, must be wrapped in strong theory and substantively interesting questions.
dc.language.iso	en_US
dc.subject	machine learning
dc.subject	computational social science
dc.subject	image and text analysis
dc.subject	multimodal deep learning
dc.subject	black lives matter
dc.subject	algorithmic bias
dc.title	Capturing Political Communication Online Using Image and Text Data: A Deep Learning Approach
dc.type	Thesis
dc.description.thesisdegreename	PhD	en_US
dc.description.thesisdegreediscipline	Political Science
dc.description.thesisdegreegrantor	University of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeemember	Mebane Jr, Walter R
dc.contributor.committeemember	Pasek, Josh
dc.contributor.committeemember	Fariss, Christopher Jennings
dc.contributor.committeemember	Hutchings, Vincent L
dc.contributor.committeemember	Ostfeld, Mara Cecilia
dc.subject.hlbsecondlevel	Communications
dc.subject.hlbsecondlevel	Political Science
dc.subject.hlbsecondlevel	Statistics and Numeric Data
dc.subject.hlbtoplevel	Social Sciences
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/176652/1/ajpineda_1.pdf
dc.identifier.doi	https://dx.doi.org/10.7302/7501
dc.identifier.orcid	0000-0003-0162-3042
dc.identifier.name-orcid	Pineda, Alejandro; 0000-0003-0162-3042	en_US
dc.working.doi	10.7302/7501	en
dc.owningcollname	Dissertations and Theses (Ph.D. and Master's)

Files in this item

Name:: ajpineda_1.pdf
Size:: 2.119MB
Format:: PDF

View/Open

Dissertations and Theses (Ph.D. and Master's)

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.