Adding New Content Types to a Large-Scale Shared Digital Repository
dc.contributor.author | Beers, Shane | |
dc.contributor.author | York, Jeremy | |
dc.contributor.author | Mardesich, Andrew | |
dc.date.accessioned | 2011-03-17T17:15:25Z | |
dc.date.accessioned | 2011-03-17T17:15:25Z | |
dc.date.available | 2011-03-17T17:15:25Z | en_US |
dc.date.issued | 2010-09 | |
dc.identifier.isbn | 978-3-85403-262-5 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/83275 | |
dc.description.abstract | As a digital repository for the nation’s great research libraries, HathiTrust brings together the immense collections of partner institutions. Initially, the Submission Information Packages (SIPs) deposited into HathiTrust were extremely uniform, being constituted primarily of books digitized by Google. HathiTrust’s ingest validation processes were correspondingly highly regular, designed to ensure that these SIPs met agreed-upon qualities and specifications. As HathiTrust has expanded to include materials digitized from other sources, SIPs have become more varied in their content and specifications, introducing the need to make adjustments to ingest and validation routines. One of the primary sources of new SIPs is the Internet Archive, which has digitized a large number of public domain materials owned by HathiTrust partners. Many of the technical, structural, and descriptive characteristics of materials digitized by the Internet Archive did not match previously developed standards for materials in HathiTrust. A variety of solutions were developed to transform these materials into HathiTrust-compatible AIPs and ingest them into the repository. The process of developing these solutions provides an example to other organizations that would like to add new types of materials to their repository, but are uncertain of the issues that may arise, or how these issues can be addressed. | en_US |
dc.language.iso | en_US | en_US |
dc.publisher | Austrian Computer Society (OCG) | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/ | en_US |
dc.subject | Digital Repositories | en_US |
dc.subject | HathiTrust | en_US |
dc.subject | Internet Archive | en_US |
dc.title | Adding New Content Types to a Large-Scale Shared Digital Repository | en_US |
dc.type | Article | en_US |
dc.subject.hlbsecondlevel | Information and Library Science | |
dc.subject.hlbtoplevel | Social Sciences | |
dc.contributor.affiliationum | Library, University of Michigan | en_US |
dc.contributor.affiliationother | California Digital Library, University of California | en_US |
dc.contributor.affiliationumcampus | Ann Arbor | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/83275/1/mainpaper.pdf | |
dc.identifier.source | Proceedings of the 7th International Conference on Preservation of Digital Objects | en_US |
dc.identifier.orcid | 0000-0001-8225-9291 | |
dc.identifier.name-orcid | York, Jeremy; 0000-0001-8225-9291 | en_US |
dc.owningcollname | Library (University of Michigan Library) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.