Show simple item record

Supporting the long‐term curation and migration of natural history museum collections databases

dc.contributor.authorThomer, Andrea K.
dc.contributor.authorWeber, Nicholas M.
dc.contributor.authorTwidale, Michael B.
dc.date.accessioned2019-02-12T20:23:18Z
dc.date.available2019-02-12T20:23:18Z
dc.date.issued2018
dc.identifier.citationThomer, Andrea K.; Weber, Nicholas M.; Twidale, Michael B. (2018). "Supporting the long‐term curation and migration of natural history museum collections databases." Proceedings of the Association for Information Science and Technology 55(1): 504-513.
dc.identifier.issn2373-9231
dc.identifier.issn2373-9231
dc.identifier.urihttps://hdl.handle.net/2027.42/147782
dc.description.abstractMigration of data collections from one platform to another is an important component of data curation – yet, there is surprisingly little guidance for information professionals faced with this task. Data migration may be particularly challenging when these data collections are housed in relational databases, due to the complex ways that data, data schemas, and relational database management software become intertwined over time. Here we present results from a study of the maintenance, evolution and migration of research databases housed in Natural History Museums. We find that database migration is an on‐going – rather than occasional – process for many Collection managers, and that they creatively appropriate and innovate on many existing technologies in their migration work. This paper contributes descriptions of a preliminary set of common adaptations and “migration patterns” in the practices of database curators. It also outlines the strategies they use when facing collection‐level data migration and describes the limitations of existing tools in supporting LAM and “small science” research database migration. We conclude by outlining future research directions for the maintenance and migration of collections and complex digital objects.
dc.publisherWiley
dc.subject.othermuseum informatics
dc.subject.othernatural history museums
dc.subject.otherdatabase migration
dc.subject.otherData curation
dc.titleSupporting the long‐term curation and migration of natural history museum collections databases
dc.typeArticleen_US
dc.rights.robotsIndexNoFollow
dc.subject.hlbsecondlevelInformation Science
dc.subject.hlbtoplevelSocial Sciences
dc.description.peerreviewedPeer Reviewed
dc.description.bitstreamurlhttps://deepblue.lib.umich.edu/bitstream/2027.42/147782/1/pra214505501055.pdf
dc.identifier.doi10.1002/pra2.2018.14505501055
dc.identifier.sourceProceedings of the Association for Information Science and Technology
dc.identifier.citedreferenceSarasan, L., Neuner, A. M., & Association of Systematics Collections (Eds.). ( 1983 ). Museum collections and computers: report of an ASC survey. Lawrence, Kan., U.S.A: Association of Systematics Collections.
dc.identifier.citedreferenceLi, Q., & Lochovsky, F. H. ( 1996 ). Advanced database support facilities for CSCW systems. Journal of Organizational Computing and Electronic Commerce, 6 ( 2 ), 191 – 210. https://doi.org/10.1080/10919399609540276
dc.identifier.citedreferenceMello, J. F. ( 1975 ). The Use of the Selgem System in Support of Systematics. In Computers in Botanical Collections (pp. 125 – 138 ). Springer, Boston, MA. https://doi.org/10.1007/978-1-4684-2157-6_17
dc.identifier.citedreferencemigration. (n.d.). Glossary of Archival and Records Terminology. Society of American Archivists. Retrieved from https://www2.archivists.org/glossary/terms/m/migration
dc.identifier.citedreferenceNational Science and Technology Council, & Scientific collections: mission‐critical infrastructure for federal science agencies: a report of the Interagency Working Group on Scientific Collections (Eds.). ( 2009 ). Scientific collections: mission‐critical infrastructure for federal science agencies: a report of the Interagency Working Group on Scientific Collections. Washington DC: Office of Science and Technology Policy.
dc.identifier.citedreferenceOlson, J. E. ( 2009 ). Database archiving: how to keep lots of data for a very long time. San Francisco, Calif.: Oxford: Morgan Kaufmann; Elsevier Science [distributor].
dc.identifier.citedreferencePadilla, T. G. ( 2018 ). Collections as data: Implications for enclosure. College & Research Libraries News, 79 ( 6 ). Retrieved from https://crln.acrl.org/index.php/crlnews/article/view/17003/18751
dc.identifier.citedreferencePalmer, C. L., Zavalina, O. L., & Fenlon, K. ( 2010 ). Beyond size and search: Building contextual mass in digital aggregations for scholarly use. Proceedings of the American Society for Information Science and Technology, 47(1), 1 – 10. https://doi.org/10.1002/meet.14504701213
dc.identifier.citedreferencePlantin, J.‐C., Lagoze, C., Edwards, P. N., & Sandvig, C. ( 2016 ). Infrastructure studies meet platform studies in the age of Google and Facebook. New Media & Society, 1461444816661553. https://doi.org/10.1177/1461444816661553
dc.identifier.citedreferenceRobertson, T., Döring, M., Guralnick, R., Bloom, D., Wieczorek, J., Braak, K., … Desmet, P. ( 2014 ). The GBIF Integrated Publishing Toolkit: Facilitating the Efficient Publishing of Biodiversity Data on the Internet. PLoS ONE, 9 ( 8 ), e102623. https://doi.org/10.1371/journal.pone.0102623
dc.identifier.citedreferenceStein, B. R., & Wieczorek, J. R. ( 2004 ). Mammals of the World: MaNIS as an example of data integration in a distributed network environment. Biodiversity Informatics, 1 ( 0 ). https://doi.org/10.17161/bi.v1i0.7
dc.identifier.citedreferenceTate. ( 2018 ). collection: Tate Collection metadata. Python, Tate Modern. Retrieved from https://github.com/tategallery/collection (Original work published 2013)
dc.identifier.citedreferenceThibodeau, K. ( 2002 ). Overview of Technological Approaches to Digital Preservation and Challenges in Coming Years (The State of Digital preservation: An International Perspective). Washington, D.C.: CLIR and the Library of Congress. Retrieved from https://www.clir.org/pubs/reports/pub107/thibodeau/
dc.identifier.citedreferenceVassiliadis, P. ( 2009 ). A Survey of Extract–Transform–Load Technology: International Journal of Data Warehousing and Mining, 5 ( 3 ), 1–27. https://doi.org/10.4018/jdwm.2009070101
dc.identifier.citedreferenceVieglais, D., Wiley, E. O., Robins, R., & Peterson, T. ( 2000 ). Harnessing Museum Resources for the Census of Marine Life: The FISHNET Project. Oceanography, 13 ( 3 ), 10 – 13. https://doi.org/10.5670/oceanog.2000.02
dc.identifier.citedreferenceVoida, A., Harmon, E., & Al‐Ani, B. ( 2011 ). Homebrew databases: complexities of everyday information management in nonprofit organizations (p. 915). ACM Press. https://doi.org/10.1145/1978942.1979078
dc.identifier.citedreferenceWickett, K. M. (in press). A logic‐based framework for collection/item metadata relationships. Journal of Documentation.
dc.identifier.citedreferenceWickett, K. M., Renear, A. H., & Urban, R. J. ( 2010 ). Rule categories for collection/item metadata relationships. Proceedings of the American Society for Information Science and Technology, 47(1), 1 – 10. https://doi.org/10.1002/meet.14504701218
dc.identifier.citedreferenceYin, R. K. ( 2009 ). Case study research: Design and methods. ( 4th ed. ). SAGE Publications Ltd.
dc.identifier.citedreferenceZavalina, O. L., Palmer, C. L., Jackson, A. S., & Han, M.‐J. ( 2009 ). Evaluating Descriptive Richness in Collection‐Level Metadata. Journal of Library Metadata, 8 ( 4 ), 263 – 292. https://doi.org/10.1080/19386380802627109
dc.identifier.citedreferenceAriño, A. H. ( 2010 ). Approaches to estimating the universe of natural history collections data. Biodiversity Informatics, 7 ( 2 ). https://doi.org/10.17161/bi.v7i2.3991
dc.identifier.citedreferenceAtzeni, P., Jensen, C. S., Orsi, G., Ram, S., Tanca, L., & Torlone, R. ( 2013 ). The relational model is dead, SQL is dead, and I don’t feel so good myself. ACM SIGMOD Record, 42 ( 1 ), 64. https://doi.org/10.1145/2503792.2503808
dc.identifier.citedreferenceBeaman, R., & Cellinese, N. ( 2012 ). Mass digitization of scientific collections: New opportunities to transform the use of biological specimens and underwrite biodiversity science. ZooKeys, 209, 7 – 17. https://doi.org/10.3897/zookeys.209.3313
dc.identifier.citedreferenceBrodie, M. L., & Stonebraker, M. ( 1995 ). Migrating legacy systems: gateways, interfaces & the incremental approach. San Francisco, Calif.: [S.l.]: Morgan Kaufmann Publishers; IT/Information Technology.
dc.identifier.citedreferenceBuneman, P., Chapman, A., & Cheney, J. ( 2006 ). Provenance management in curated databases (p. 539). ACM Press. https://doi.org/10.1145/1142473.1142534
dc.identifier.citedreferenceBuneman, P., Cheney, J., Tan, W.‐C., & Vansummeren, S. ( 2008 ). Curated Databases. In Proceedings of the Twenty‐seventh ACM SIGMOD‐SIGACT‐SIGART Symposium on Principles of Database Systems (pp. 1 – 12 ). New York, NY, USA: ACM. https://doi.org/10.1145/1376916.1376918
dc.identifier.citedreferenceBuneman, P., Müller, H., & Rusbridge, C. ( 2009 ). Curating the CIA World Factbook. International Journal of Digital Curation, 4 ( 3 ), 29 – 43. https://doi.org/10.2218/ijdc.v4i3.126
dc.identifier.citedreferenceChan, S. ( 2014, November 7). The API at the center of the museum. Retrieved June 8, 2018, from https://labs.cooperhewitt.org/2014/the-api-at-the-center-of-the-museum/
dc.identifier.citedreferenceCodd, E. F. ( 1970 ). A relational model of data for large shared data banks. Communications of the ACM, 13 ( 6 ), 377 – 387. https://doi.org/10.1145/362384.362685
dc.identifier.citedreferenceCodd, E. F. ( 1971 ). Normalized data base structure: a brief tutorial (p. 1). ACM Press. https://doi.org/10.1145/1734714.1734716
dc.identifier.citedreferenceTsichritzis, D., & Klug, A. ( 1978 ). The ANSI/X3/SPARC DBMS framework report of the study group on database management systems. Information Systems, 3 ( 3 ), 173 – 191. https://doi.org/10.1016/0306-4379(78)90001-7
dc.identifier.citedreferenceCragin, M. H., Palmer, C. L., Carlson, J. R., & Witt, M. ( 2010 ). Data sharing, small science and institutional repositories. Philosophical Transactions. Series A, Mathematical, Physical, and Engineering Sciences, 368 ( 1926 ), 4023 – 38. https://doi.org/10.1098/rsta.2010.0165
dc.identifier.citedreferenceDourish, P., & Edwards, W. K. ( 2000 ). A Tale of Two Toolkits: Relating Infrastructure and Use in Flexible CSCW Toolkits. Computer Supported Cooperative Work (CSCW), 9 ( 1 ), 33 – 51. https://doi.org/10.1023/A:1008709725729
dc.identifier.citedreferenceDPLA API Codex. (n.d.). Retrieved June 8, 2018, from https://pro.dp.la/developers/api-codex
dc.identifier.citedreferenceEglash, R. (Ed.). ( 2004 ). Appropriating technology: An introduction. In Appropriating technology: vernacular science and social power (pp. vii – xxi ). Minneapolis: University of Minnesota Press.
dc.identifier.citedreferenceHenry, S., Hoon, S., Hwang, M., Lee, D., & DeVore, M. D. ( 2005 ). Engineering trade study: extract, transform, load tools for data migration. In 2005 IEEE Design Symposium, Systems and Information Engineering (pp. 1 – 8 ). https://doi.org/10.1109/SIEDS.2005.193231
dc.identifier.citedreferenceHerrmann, K., Voigt, H., Rausch, J., Behrend, A., & Lehner, W. ( 2017 ). Robust and simple database evolution. Information Systems Frontiers, 20, 45 – 61. https://doi.org/10.1007/s10796-016-9730-2
dc.identifier.citedreferenceHine, C. ( 2006 ). Databases as Scientific Instruments and Their Role in the Ordering of Scientific Work. Social Studies of Science, 36 ( 2 ), 269 – 298. https://doi.org/10.1177/0306312706054047
dc.identifier.citedreferenceHudson, L. W., Dutton, R. D., Reynolds, M. M., & Walden, W. E. ( 1971 ). TAXIR‐A biologically oriented information retrieval system as an aid to plant introduction. Economic Botany, 25 ( 4 ), 401 – 406. https://doi.org/10.1007/BF02985207
dc.identifier.citedreferenceImporting External Data into Specify 6. ( 2013, August 17). Specify Software Proejct. Retrieved from http://www.sustain.specifysoftware.org/wp-content/uploads/2017/03/Importing-External-Data-into-Specify-6.pdf
dc.identifier.citedreferenceJagadish, H. V., Chapman, A., Elkiss, A., Jayapandian, M., Li, Y., Nandi, A., & Yu, C. ( 2007 ). Making database systems usable (p. 13). ACM Press. https://doi.org/10.1145/1247480.1247483
dc.identifier.citedreferenceKerr, S. T. ( 1990 ). Wayfinding in an electronic database: The relative importance of navigational cues vs. mental models. Information Processing & Management, 26 ( 4 ), 511 – 523. https://doi.org/10.1016/0306-4573(90)90071-9
dc.owningcollnameInterdisciplinary and Peer-Reviewed


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.