Measuring Content Quality in a Preservation Repository: HathiTrust and Large-Scale Book Digitization
dc.contributor.author | Conway, Paul | |
dc.date.accessioned | 2011-07-19T16:18:21Z | |
dc.date.available | 2011-07-19T16:18:21Z | |
dc.date.issued | 2010 | |
dc.identifier.citation | Proceedings of 7th International Conference on Preservation of Digital Objects, iPres 2010, 19-24 Sept. 2010, Vienna, Austria, pp. 95-102 <http://hdl.handle.net/2027.42/85227> | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/85227 | |
dc.description.abstract | As mechanisms emerge to certify the trustworthiness of digital preservation repositories, no systematic efforts have been devoted to assessing the quality and usefulness of the preserved content itself. With generous support from the Andrew W. Mellon Foundation, the University of Michigan’s School of Information, in close collaboration with the University of Michigan Library and HathiTrust, is developing new methods to measure the visual and textual qualities of books from university libraries digitized by Google, Internet Archive, and others and then deposited for preservation. This paper describes a new approach to measuring quality in large-scale digitization; namely, the absence of error relative to the expected uses of the deposited content. The paper specifies the design of a research project to develop and test statistically valid methods of measuring error. The design includes a model of understanding and recording errors observed through manual inspection of sample volumes, and strategies to validate the outcomes of the research through open evaluation by stakeholders and users. The research project will utilize content deposited in HathiTrust – a large-scale digital preservation repository that presently contains over five million digitized volumes – to develop broadly applicable quality assessment strategies for preservation repositories. | en_US |
dc.description.sponsorship | Andrew W. Mellon Foundation | en_US |
dc.language.iso | en_US | en_US |
dc.publisher | Austrian Computer Society | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ | en_US |
dc.subject | Digitization Quality | en_US |
dc.subject | Research Design | en_US |
dc.title | Measuring Content Quality in a Preservation Repository: HathiTrust and Large-Scale Book Digitization | en_US |
dc.type | Preprint | en_US |
dc.subject.hlbsecondlevel | Information and Library Science | |
dc.subject.hlbtoplevel | Social Sciences | |
dc.description.peerreviewed | Peer Reviewed | en_US |
dc.contributor.affiliationum | Information, School of | en_US |
dc.contributor.affiliationumcampus | Ann Arbor | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/85227/1/C06 Conway Measuring Content Quality iPres 2010.pdf | |
dc.identifier.source | Proceedings of 7th International Conference on Preservation of Digital Objects | en_US |
dc.owningcollname | Information, School of (SI) |