Show simple item record

Adding New Content Types to a Large-Scale Shared Digital Repository

dc.contributor.authorBeers, Shane
dc.contributor.authorYork, Jeremy
dc.contributor.authorMardesich, Andrew
dc.date.accessioned2011-03-17T17:15:25Z
dc.date.accessioned2011-03-17T17:15:25Z
dc.date.available2011-03-17T17:15:25Zen_US
dc.date.issued2010-09
dc.identifier.isbn978-3-85403-262-5
dc.identifier.urihttps://hdl.handle.net/2027.42/83275
dc.description.abstractAs a digital repository for the nation’s great research libraries, HathiTrust brings together the immense collections of partner institutions. Initially, the Submission Information Packages (SIPs) deposited into HathiTrust were extremely uniform, being constituted primarily of books digitized by Google. HathiTrust’s ingest validation processes were correspondingly highly regular, designed to ensure that these SIPs met agreed-upon qualities and specifications. As HathiTrust has expanded to include materials digitized from other sources, SIPs have become more varied in their content and specifications, introducing the need to make adjustments to ingest and validation routines. One of the primary sources of new SIPs is the Internet Archive, which has digitized a large number of public domain materials owned by HathiTrust partners. Many of the technical, structural, and descriptive characteristics of materials digitized by the Internet Archive did not match previously developed standards for materials in HathiTrust. A variety of solutions were developed to transform these materials into HathiTrust-compatible AIPs and ingest them into the repository. The process of developing these solutions provides an example to other organizations that would like to add new types of materials to their repository, but are uncertain of the issues that may arise, or how these issues can be addressed.en_US
dc.language.isoen_USen_US
dc.publisherAustrian Computer Society (OCG)en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/en_US
dc.subjectDigital Repositoriesen_US
dc.subjectHathiTrusten_US
dc.subjectInternet Archiveen_US
dc.titleAdding New Content Types to a Large-Scale Shared Digital Repositoryen_US
dc.typeArticleen_US
dc.subject.hlbsecondlevelInformation and Library Science
dc.subject.hlbtoplevelSocial Sciences
dc.contributor.affiliationumLibrary, University of Michiganen_US
dc.contributor.affiliationotherCalifornia Digital Library, University of Californiaen_US
dc.contributor.affiliationumcampusAnn Arboren_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/83275/1/mainpaper.pdf
dc.identifier.sourceProceedings of the 7th International Conference on Preservation of Digital Objectsen_US
dc.identifier.orcid0000-0001-8225-9291
dc.identifier.name-orcidYork, Jeremy; 0000-0001-8225-9291en_US
dc.owningcollnameLibrary (University of Michigan Library)


Files in this item

Show simple item record

http://creativecommons.org/licenses/by-nc-nd/3.0/
Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by-nc-nd/3.0/

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.