Show simple item record

Measuring Content Quality in a Preservation Repository: HathiTrust and Large-Scale Book Digitization

dc.contributor.authorConway, Paul
dc.date.accessioned2011-07-19T16:18:21Z
dc.date.available2011-07-19T16:18:21Z
dc.date.issued2010
dc.identifier.citationProceedings of 7th International Conference on Preservation of Digital Objects, iPres 2010, 19-24 Sept. 2010, Vienna, Austria, pp. 95-102 <http://hdl.handle.net/2027.42/85227>en_US
dc.identifier.urihttps://hdl.handle.net/2027.42/85227
dc.description.abstractAs mechanisms emerge to certify the trustworthiness of digital preservation repositories, no systematic efforts have been devoted to assessing the quality and usefulness of the preserved content itself. With generous support from the Andrew W. Mellon Foundation, the University of Michigan’s School of Information, in close collaboration with the University of Michigan Library and HathiTrust, is developing new methods to measure the visual and textual qualities of books from university libraries digitized by Google, Internet Archive, and others and then deposited for preservation. This paper describes a new approach to measuring quality in largescale digitization; namely, the absence of error relative to the expected uses of the deposited content. The paper specifies the design of a research project to develop and test statistically valid methods of measuring error. The design includes a model of understanding and recording errors observed through manual inspection of sample volumes, and strategies to validate the outcomes of the research through open evaluation by stakeholders and users. The research project will utilize content deposited in HathiTrust – a large-scale digital preservation repository that presently contains over five million digitized volumes – to develop broadly applicable quality assessment strategies for preservation repositories.en_US
dc.description.sponsorshipAndrew W. Mellon Foundationen_US
dc.language.isoen_USen_US
dc.publisherAustrian Computer Societyen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.subjectDigitization Qualityen_US
dc.subjectResearch Designen_US
dc.titleMeasuring Content Quality in a Preservation Repository: HathiTrust and Large-Scale Book Digitizationen_US
dc.typePreprinten_US
dc.subject.hlbsecondlevelInformation and Library Science
dc.subject.hlbtoplevelSocial Sciences
dc.description.peerreviewedPeer Revieweden_US
dc.contributor.affiliationumInformation, School ofen_US
dc.contributor.affiliationumcampusAnn Arboren_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/85227/1/C06 Conway Measuring Content Quality iPres 2010.pdf
dc.identifier.sourceProceedings of 7th International Conference on Preservation of Digital Objectsen_US
dc.owningcollnameInformation, School of (SI)


Files in this item

Show simple item record

http://creativecommons.org/licenses/by-nc-sa/3.0/
Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by-nc-sa/3.0/

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.