Using Collective Discourse to Generate Surveys of Scientific Paradigms.
dc.contributor.author | Qazvinian, Vahed | en_US |
dc.date.accessioned | 2013-02-04T18:03:54Z | |
dc.date.available | NO_RESTRICTION | en_US |
dc.date.available | 2013-02-04T18:03:54Z | |
dc.date.issued | 2012 | en_US |
dc.date.submitted | 2012 | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/95960 | |
dc.description.abstract | This thesis is focused on understanding collective discourse and employing its properties to build better decision support systems. We first define collective discourse as a collective human behavior in content generation. In social media, collective discourse is often a collective reaction to an event. A collective reaction to a well-defined subject emerges in response to an event (a movie release, a breaking story, a newly published paper) in the form of independent writings (movie reviews, news headlines, citation sentences) by many individuals. In order to understand collective discourse, we perform our analysis on a wide range of real-world datasets, from citations to movie reviews. We show that all these datasets exhibit diversity of perspective, a property seen in other collective systems and a criterion in wise crowds. Our experiments also confirm that the network of perspective co-occurrences exhibits the small-world property, with high clustering of different perspectives. Finally, we show that non-expert contributions in collective discourse can be used to answer simple questions that are otherwise hard to answer. As a concrete example of collective discourse, we discuss citations to scholarly work. We show how they contain important information that conveys the key features and basic underpinnings of a particular field, early and late developments, important contributions, and basic definitions and examples that enable rapid understanding of a field by non-experts. We then present C-LexRank, a system that exploits scientific collective discourse to produce automatically generated, readily consumable technical surveys. Finally, we further extend our experiments to summarize an entire scientific topic. We generate extractive surveys of a set of Question Answering (QA) and Dependency Parsing (DP) papers, their abstracts, and their citation sentences, and show that citations carry unique survey-worthy information. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Automatic Text Summarization | en_US |
dc.subject | Collective Intelligence | en_US |
dc.subject | Natural Language Processing | en_US |
dc.title | Using Collective Discourse to Generate Surveys of Scientific Paradigms. | en_US |
dc.type | Thesis | en_US |
dc.description.thesisdegreename | PhD | en_US |
dc.description.thesisdegreediscipline | Computer Science & Engineering | en_US |
dc.description.thesisdegreegrantor | University of Michigan, Horace H. Rackham School of Graduate Studies | en_US |
dc.contributor.committeemember | Radev, Dragomir Radkov | en_US |
dc.contributor.committeemember | Adamic, Lada A. | en_US |
dc.contributor.committeemember | Cafarella, Michael John | en_US |
dc.contributor.committeemember | Mei, Qiaozhu | en_US |
dc.subject.hlbsecondlevel | Computer Science | en_US |
dc.subject.hlbtoplevel | Engineering | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/95960/1/vahed_1.pdf | |
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's) |