Flexible query facilities for heterogeneous semi-structured data.

Li, Yunyao

Flexible query facilities for heterogeneous semi-structured data.

dc.contributor.author	Li, Yunyao
dc.contributor.advisor	Jagadish, Hosagrahar V.
dc.date.accessioned	2016-08-30T16:14:56Z
dc.date.available	2016-08-30T16:14:56Z
dc.date.issued	2007
dc.identifier.uri	http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqm&rft_dat=xri:pqdiss:3253332
dc.identifier.uri	https://hdl.handle.net/2027.42/126485
dc.description.abstract	This dissertation studies flexible query facilities for semi-structured data in a heterogeneous environment, with a focus on XML databases. The popularity of XML naturally follows from the needs of querying XML documents from a wide spectrum of users. Although formal database query languages such as XQuery can provide precise access to XML data, the challenges of querying XML using such rigid formal database languages---i.e., requiring users to have perfect knowledge of database schema, query language syntax and query semantics---leads to the requests for flexible yet accurate query facilities over XML documents. This dissertation discusses a two-part solution for supporting flexible queries over XML documents: (a) Schema-Free XQuery that allows database queries to be specified with limited or even no schema knowledge and (b) Natural Language Interface for Querying XML (NaLIX) that can translate database queries with complex semantics in plain English into Schema-Free XQuery expressions. NaLIX enables users to pose complex database queries in plain English without knowing any formal query language or underlying database schema. Iterative user search is also supported in NaLIX by allowing queries to be stated with respect to previous queries. NaLIX does not depend on any domain knowledge. However, it can be improved further by automatically learning domain information. We also present a novel stack-based algorithm and cost-based optimization techniques to allow these techniques, Schema-Free XQuery and NaLIX, to be implemented efficiently. In addition, we report experimental results that validate the proposed solution. Finally, we discuss how our solution improves the state-of-art through comparison with previous work.
dc.format.extent	163 p.
dc.language	English
dc.language.iso	EN
dc.subject	Data
dc.subject	Facilities
dc.subject	Flexible
dc.subject	Heterogeneous
dc.subject	Natural Language Interface
dc.subject	Query
dc.subject	Schema-free
dc.subject	Semi
dc.subject	Semistructured
dc.subject	Structured
dc.title	Flexible query facilities for heterogeneous semi-structured data.
dc.type	Thesis
dc.description.thesisdegreename	PhD	en_US
dc.description.thesisdegreediscipline	Applied Sciences
dc.description.thesisdegreediscipline	Computer science
dc.description.thesisdegreediscipline	Experimental psychology
dc.description.thesisdegreediscipline	Language, Literature and Linguistics
dc.description.thesisdegreediscipline	Linguistics
dc.description.thesisdegreediscipline	Psychology
dc.description.thesisdegreegrantor	University of Michigan, Horace H. Rackham School of Graduate Studies
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/126485/2/3253332.pdf
dc.owningcollname	Dissertations and Theses (Ph.D. and Master's)

Files in this item

Name:: 3253332.pdf
Size:: 4.446MB
Format:: PDF
Description:: Access Restricted to UM users only.

View/Open

Dissertations and Theses (Ph.D. and Master's)

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.