LinkWiper – A System For Data Quality in Linked Open Data

dc.contributor.author | Gade, Srivalli
dc.contributor.advisor | Medjahed, Brahim
dc.date.accessioned | 2017-02-09T02:00:57Z
dc.date.available | NO_RESTRICTION | en_US
dc.date.available | 2017-02-09T02:00:57Z
dc.date.issued | 2016-12-17
dc.date.submitted | 2016
dc.identifier.uri | https://hdl.handle.net/2027.42/136065
dc.description.abstract | Linked Open Data (LOD) provides access to large amounts of data on the Web. These data sets range from high-quality curated data sets to low-quality ones. LOD sources often need strategies for cleaning up data and a methodology for assessing the quality of linked data. LOD allows interlinking and integrating any kind of data on the Web. Links between various data sources enable software applications to operate over the aggregated data space as if it were a single local database. However, such links may be broken, leading to data quality problems. In this thesis we present LinkWiper, an automated system for cleaning data in LOD. While this thesis focuses on problems related to dereferenced links, LinkWiper can be used to tackle other data quality problems such as duplication and consistency. The proposed system includes two major phases. The first phase uses information retrieval-like search techniques to recommend sets of alternative links. The second phase adopts crowdsourcing mechanisms to involve workers (or users) in improving the quality of the LOD sources. We provide an implementation of LinkWiper over DBPedia, a community effort to extract structured information from Wikipedia and make this information available using LOD principles. We also conduct extensive experiments to illustrate the efficiency and high precision of the proposed approach. | en_US
dc.language.iso | en_US | en_US
dc.subject | Data quality | en_US
dc.subject | RDF | en_US
dc.subject | Linked open data | en_US
dc.subject | Crowdsourcing | en_US
dc.subject | Dereferenced links | en_US
dc.subject.other | Computer Science | en_US
dc.title | LinkWiper – A System For Data Quality in Linked Open Data | en_US
dc.type | Thesis | en_US
dc.description.thesisdegreename | Master of Science (MS) | en_US
dc.description.thesisdegreediscipline | Computer and Information Science, College of Engineering and Computer Science | en_US
dc.description.thesisdegreegrantor | University of Michigan-Dearborn | en_US
dc.contributor.committeemember | Kessentini, Marouane
dc.contributor.committeemember | Zhu, Qiang
dc.identifier.uniqname | 34089270 | en_US
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/136065/1/LinkWiper – A System For Data Quality in Linked Open Data.pdf
dc.identifier.orcid | 0000-0002-2820-190X
dc.description.filedescription | Description of LinkWiper – A System For Data Quality in Linked Open Data.pdf : Master of Science Thesis
dc.identifier.name-orcid | Inamanamelluri, Srivalli; 0000-0002-2820-190X | en_US
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's)


