Using Natural Language Processing to Mine Multiple Perspectives from Social Media and Scientific Literature.
dc.contributor.author | Abu Jbara, Amjad | en_US |
dc.date.accessioned | 2013-09-24T16:02:33Z | |
dc.date.available | NO_RESTRICTION | en_US |
dc.date.available | 2013-09-24T16:02:33Z | |
dc.date.issued | 2013 | en_US |
dc.date.submitted | 2013 | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/99934 | |
dc.description.abstract | This thesis studies how Natural Language Processing techniques can be used to mine perspectives from textual data. The first part of the thesis focuses on analyzing the text exchanged by people who participate in discussions on social media sites. We particularly focus on threaded discussions that discuss ideological and political topics. The goal is to identify the different viewpoints that the discussants have with respect to the discussion topic. We use subjectivity and sentiment analysis techniques to identify the attitudes that the participants carry toward one another and toward the different aspects of the discussion topic. This involves identifying opinion expressions and their polarities, and identifying the targets of opinion. We use this information to represent discussions in one of two representations: discussant attitude vectors or signed attitude networks. We use data mining and network analysis techniques to analyze these representations to detect rifts in discussion groups and study how the discussants split into subgroups with contrasting opinions. In the second part of the thesis, we use linguistic analysis to mine scholars perspectives from scientific literature through the lens of citations. We analyze the text adjacent to reference anchors in scientific articles as a means to identify researchers' viewpoints toward previously published work. We propose methods for identifying, extracting, and cleaning citation text. We analyze this text to identify the purpose (author's intention) and polarity (author's sentiment) of citation. Finally, we present several applications that can benefit from this analysis such as generating multi-perspective summaries of scientific articles and predicting future prominence of publications. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Natural Language Processing | en_US |
dc.subject | Computational Linguistics | en_US |
dc.subject | Sentiment Analysis | en_US |
dc.subject | Opinion Mining | en_US |
dc.subject | Citation Analysis | en_US |
dc.title | Using Natural Language Processing to Mine Multiple Perspectives from Social Media and Scientific Literature. | en_US |
dc.type | Thesis | en_US |
dc.description.thesisdegreename | PhD | en_US |
dc.description.thesisdegreediscipline | Computer Science & Engineering | en_US |
dc.description.thesisdegreegrantor | University of Michigan, Horace H. Rackham School of Graduate Studies | en_US |
dc.contributor.committeemember | Radev, Dragomir Radkov | en_US |
dc.contributor.committeemember | Abney, Steven P. | en_US |
dc.contributor.committeemember | Mei, Qiaozhu | en_US |
dc.contributor.committeemember | Adar, Eytan | en_US |
dc.contributor.committeemember | Provost, Emily Kaplan Mower | en_US |
dc.subject.hlbsecondlevel | Computer Science | en_US |
dc.subject.hlbtoplevel | Engineering | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/99934/1/amjbara_1.pdf | |
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.