Adding New Content Types to a Large-Scale Shared Digital Repository


Beers, Shane; York, Jeremy; Mardesich, Andrew
2011-03-17T17:15:25Z 2011-03-17T17:15:25Z 2011-03-17T17:15:25Z en_US 2010-09
dc.identifier.isbn 978-3-85403-262-5
dc.description.abstract As a digital repository for the nation’s great research libraries, HathiTrust brings together the immense collections of partner institutions. Initially, the Submission Information Packages (SIPs) deposited into HathiTrust were extremely uniform, being constituted primarily of books digitized by Google. HathiTrust’s ingest validation processes were correspondingly highly regular, designed to ensure that these SIPs met agreed-upon qualities and specifications. As HathiTrust has expanded to include materials digitized from other sources, SIPs have become more varied in their content and specifications, introducing the need to make adjustments to ingest and validation routines. One of the primary sources of new SIPs is the Internet Archive, which has digitized a large number of public domain materials owned by HathiTrust partners. Many of the technical, structural, and descriptive characteristics of materials digitized by the Internet Archive did not match previously developed standards for materials in HathiTrust. A variety of solutions were developed to transform these materials into HathiTrust-compatible AIPs and ingest them into the repository. The process of developing these solutions provides an example to other organizations that would like to add new types of materials to their repository, but are uncertain of the issues that may arise, or how these issues can be addressed. en_US
dc.language.iso en_US en_US
dc.publisher Austrian Computer Society (OCG) en_US
dc.rights.uri en_US
dc.subject Digital Repositories en_US
dc.subject HathiTrust en_US
dc.subject Internet Archive en_US
dc.title Adding New Content Types to a Large-Scale Shared Digital Repository en_US
dc.type Article en_US
dc.subject.hlbsecondlevel Information and Library Science
dc.subject.hlbtoplevel Social Sciences
dc.contributor.affiliationum Library, University of Michigan en_US
dc.contributor.affiliationother California Digital Library, University of California en_US
dc.contributor.affiliationumcampus Ann Arbor en_US
dc.identifier.source Proceedings of the 7th International Conference on Preservation of Digital Objects en_US
dc.identifier.orcid 0000-0001-8225-9291 York, Jeremy; 0000-0001-8225-9291 en_US
dc.owningcollname Library (University of Michigan Library)