Supporting the long‐term curation and migration of natural history museum collections databases
dc.contributor.author | Thomer, Andrea K. | |
dc.contributor.author | Weber, Nicholas M. | |
dc.contributor.author | Twidale, Michael B. | |
dc.date.accessioned | 2019-02-12T20:23:18Z | |
dc.date.available | 2019-02-12T20:23:18Z | |
dc.date.issued | 2018 | |
dc.identifier.citation | Thomer, Andrea K.; Weber, Nicholas M.; Twidale, Michael B. (2018). "Supporting the long‐term curation and migration of natural history museum collections databases." Proceedings of the Association for Information Science and Technology 55(1): 504-513. | |
dc.identifier.issn | 2373-9231 | |
dc.identifier.issn | 2373-9231 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/147782 | |
dc.description.abstract | Migration of data collections from one platform to another is an important component of data curation – yet, there is surprisingly little guidance for information professionals faced with this task. Data migration may be particularly challenging when these data collections are housed in relational databases, due to the complex ways that data, data schemas, and relational database management software become intertwined over time. Here we present results from a study of the maintenance, evolution and migration of research databases housed in Natural History Museums. We find that database migration is an on‐going – rather than occasional – process for many Collection managers, and that they creatively appropriate and innovate on many existing technologies in their migration work. This paper contributes descriptions of a preliminary set of common adaptations and “migration patterns” in the practices of database curators. It also outlines the strategies they use when facing collection‐level data migration and describes the limitations of existing tools in supporting LAM and “small science” research database migration. We conclude by outlining future research directions for the maintenance and migration of collections and complex digital objects. | |
dc.publisher | Wiley | |
dc.subject.other | museum informatics | |
dc.subject.other | natural history museums | |
dc.subject.other | database migration | |
dc.subject.other | Data curation | |
dc.title | Supporting the long‐term curation and migration of natural history museum collections databases | |
dc.type | Article | en_US |
dc.rights.robots | IndexNoFollow | |
dc.subject.hlbsecondlevel | Information Science | |
dc.subject.hlbtoplevel | Social Sciences | |
dc.description.peerreviewed | Peer Reviewed | |
dc.description.bitstreamurl | https://deepblue.lib.umich.edu/bitstream/2027.42/147782/1/pra214505501055.pdf | |
dc.identifier.doi | 10.1002/pra2.2018.14505501055 | |
dc.identifier.source | Proceedings of the Association for Information Science and Technology | |
dc.identifier.citedreference | Sarasan, L., Neuner, A. M., & Association of Systematics Collections (Eds.). ( 1983 ). Museum collections and computers: report of an ASC survey. Lawrence, Kan., U.S.A: Association of Systematics Collections. | |
dc.identifier.citedreference | Li, Q., & Lochovsky, F. H. ( 1996 ). Advanced database support facilities for CSCW systems. Journal of Organizational Computing and Electronic Commerce, 6 ( 2 ), 191 – 210. https://doi.org/10.1080/10919399609540276 | |
dc.identifier.citedreference | Mello, J. F. ( 1975 ). The Use of the Selgem System in Support of Systematics. In Computers in Botanical Collections (pp. 125 – 138 ). Springer, Boston, MA. https://doi.org/10.1007/978-1-4684-2157-6_17 | |
dc.identifier.citedreference | migration. (n.d.). Glossary of Archival and Records Terminology. Society of American Archivists. Retrieved from https://www2.archivists.org/glossary/terms/m/migration | |
dc.identifier.citedreference | National Science and Technology Council, & Scientific collections: mission‐critical infrastructure for federal science agencies: a report of the Interagency Working Group on Scientific Collections (Eds.). ( 2009 ). Scientific collections: mission‐critical infrastructure for federal science agencies: a report of the Interagency Working Group on Scientific Collections. Washington DC: Office of Science and Technology Policy. | |
dc.identifier.citedreference | Olson, J. E. ( 2009 ). Database archiving: how to keep lots of data for a very long time. San Francisco, Calif.: Oxford: Morgan Kaufmann; Elsevier Science [distributor]. | |
dc.identifier.citedreference | Padilla, T. G. ( 2018 ). Collections as data: Implications for enclosure. College & Research Libraries News, 79 ( 6 ). Retrieved from https://crln.acrl.org/index.php/crlnews/article/view/17003/18751 | |
dc.identifier.citedreference | Palmer, C. L., Zavalina, O. L., & Fenlon, K. ( 2010 ). Beyond size and search: Building contextual mass in digital aggregations for scholarly use. Proceedings of the American Society for Information Science and Technology, 47(1), 1 – 10. https://doi.org/10.1002/meet.14504701213 | |
dc.identifier.citedreference | Plantin, J.‐C., Lagoze, C., Edwards, P. N., & Sandvig, C. ( 2016 ). Infrastructure studies meet platform studies in the age of Google and Facebook. New Media & Society, 1461444816661553. https://doi.org/10.1177/1461444816661553 | |
dc.identifier.citedreference | Robertson, T., Döring, M., Guralnick, R., Bloom, D., Wieczorek, J., Braak, K., … Desmet, P. ( 2014 ). The GBIF Integrated Publishing Toolkit: Facilitating the Efficient Publishing of Biodiversity Data on the Internet. PLoS ONE, 9 ( 8 ), e102623. https://doi.org/10.1371/journal.pone.0102623 | |
dc.identifier.citedreference | Stein, B. R., & Wieczorek, J. R. ( 2004 ). Mammals of the World: MaNIS as an example of data integration in a distributed network environment. Biodiversity Informatics, 1 ( 0 ). https://doi.org/10.17161/bi.v1i0.7 | |
dc.identifier.citedreference | Tate. ( 2018 ). collection: Tate Collection metadata. Python, Tate Modern. Retrieved from https://github.com/tategallery/collection (Original work published 2013) | |
dc.identifier.citedreference | Thibodeau, K. ( 2002 ). Overview of Technological Approaches to Digital Preservation and Challenges in Coming Years (The State of Digital preservation: An International Perspective). Washington, D.C.: CLIR and the Library of Congress. Retrieved from https://www.clir.org/pubs/reports/pub107/thibodeau/ | |
dc.identifier.citedreference | Vassiliadis, P. ( 2009 ). A Survey of Extract–Transform–Load Technology: International Journal of Data Warehousing and Mining, 5 ( 3 ), 1–27. https://doi.org/10.4018/jdwm.2009070101 | |
dc.identifier.citedreference | Vieglais, D., Wiley, E. O., Robins, R., & Peterson, T. ( 2000 ). Harnessing Museum Resources for the Census of Marine Life: The FISHNET Project. Oceanography, 13 ( 3 ), 10 – 13. https://doi.org/10.5670/oceanog.2000.02 | |
dc.identifier.citedreference | Voida, A., Harmon, E., & Al‐Ani, B. ( 2011 ). Homebrew databases: complexities of everyday information management in nonprofit organizations (p. 915). ACM Press. https://doi.org/10.1145/1978942.1979078 | |
dc.identifier.citedreference | Wickett, K. M. (in press). A logic‐based framework for collection/item metadata relationships. Journal of Documentation. | |
dc.identifier.citedreference | Wickett, K. M., Renear, A. H., & Urban, R. J. ( 2010 ). Rule categories for collection/item metadata relationships. Proceedings of the American Society for Information Science and Technology, 47(1), 1 – 10. https://doi.org/10.1002/meet.14504701218 | |
dc.identifier.citedreference | Yin, R. K. ( 2009 ). Case study research: Design and methods. ( 4th ed. ). SAGE Publications Ltd. | |
dc.identifier.citedreference | Zavalina, O. L., Palmer, C. L., Jackson, A. S., & Han, M.‐J. ( 2009 ). Evaluating Descriptive Richness in Collection‐Level Metadata. Journal of Library Metadata, 8 ( 4 ), 263 – 292. https://doi.org/10.1080/19386380802627109 | |
dc.identifier.citedreference | Ariño, A. H. ( 2010 ). Approaches to estimating the universe of natural history collections data. Biodiversity Informatics, 7 ( 2 ). https://doi.org/10.17161/bi.v7i2.3991 | |
dc.identifier.citedreference | Atzeni, P., Jensen, C. S., Orsi, G., Ram, S., Tanca, L., & Torlone, R. ( 2013 ). The relational model is dead, SQL is dead, and I don’t feel so good myself. ACM SIGMOD Record, 42 ( 1 ), 64. https://doi.org/10.1145/2503792.2503808 | |
dc.identifier.citedreference | Beaman, R., & Cellinese, N. ( 2012 ). Mass digitization of scientific collections: New opportunities to transform the use of biological specimens and underwrite biodiversity science. ZooKeys, 209, 7 – 17. https://doi.org/10.3897/zookeys.209.3313 | |
dc.identifier.citedreference | Brodie, M. L., & Stonebraker, M. ( 1995 ). Migrating legacy systems: gateways, interfaces & the incremental approach. San Francisco, Calif.: [S.l.]: Morgan Kaufmann Publishers; IT/Information Technology. | |
dc.identifier.citedreference | Buneman, P., Chapman, A., & Cheney, J. ( 2006 ). Provenance management in curated databases (p. 539). ACM Press. https://doi.org/10.1145/1142473.1142534 | |
dc.identifier.citedreference | Buneman, P., Cheney, J., Tan, W.‐C., & Vansummeren, S. ( 2008 ). Curated Databases. In Proceedings of the Twenty‐seventh ACM SIGMOD‐SIGACT‐SIGART Symposium on Principles of Database Systems (pp. 1 – 12 ). New York, NY, USA: ACM. https://doi.org/10.1145/1376916.1376918 | |
dc.identifier.citedreference | Buneman, P., Müller, H., & Rusbridge, C. ( 2009 ). Curating the CIA World Factbook. International Journal of Digital Curation, 4 ( 3 ), 29 – 43. https://doi.org/10.2218/ijdc.v4i3.126 | |
dc.identifier.citedreference | Chan, S. ( 2014, November 7). The API at the center of the museum. Retrieved June 8, 2018, from https://labs.cooperhewitt.org/2014/the-api-at-the-center-of-the-museum/ | |
dc.identifier.citedreference | Codd, E. F. ( 1970 ). A relational model of data for large shared data banks. Communications of the ACM, 13 ( 6 ), 377 – 387. https://doi.org/10.1145/362384.362685 | |
dc.identifier.citedreference | Codd, E. F. ( 1971 ). Normalized data base structure: a brief tutorial (p. 1). ACM Press. https://doi.org/10.1145/1734714.1734716 | |
dc.identifier.citedreference | Tsichritzis, D., & Klug, A. ( 1978 ). The ANSI/X3/SPARC DBMS framework report of the study group on database management systems. Information Systems, 3 ( 3 ), 173 – 191. https://doi.org/10.1016/0306-4379(78)90001-7 | |
dc.identifier.citedreference | Cragin, M. H., Palmer, C. L., Carlson, J. R., & Witt, M. ( 2010 ). Data sharing, small science and institutional repositories. Philosophical Transactions. Series A, Mathematical, Physical, and Engineering Sciences, 368 ( 1926 ), 4023 – 38. https://doi.org/10.1098/rsta.2010.0165 | |
dc.identifier.citedreference | Dourish, P., & Edwards, W. K. ( 2000 ). A Tale of Two Toolkits: Relating Infrastructure and Use in Flexible CSCW Toolkits. Computer Supported Cooperative Work (CSCW), 9 ( 1 ), 33 – 51. https://doi.org/10.1023/A:1008709725729 | |
dc.identifier.citedreference | DPLA API Codex. (n.d.). Retrieved June 8, 2018, from https://pro.dp.la/developers/api-codex | |
dc.identifier.citedreference | Eglash, R. (Ed.). ( 2004 ). Appropriating technology: An introduction. In Appropriating technology: vernacular science and social power (pp. vii – xxi ). Minneapolis: University of Minnesota Press. | |
dc.identifier.citedreference | Henry, S., Hoon, S., Hwang, M., Lee, D., & DeVore, M. D. ( 2005 ). Engineering trade study: extract, transform, load tools for data migration. In 2005 IEEE Design Symposium, Systems and Information Engineering (pp. 1 – 8 ). https://doi.org/10.1109/SIEDS.2005.193231 | |
dc.identifier.citedreference | Herrmann, K., Voigt, H., Rausch, J., Behrend, A., & Lehner, W. ( 2017 ). Robust and simple database evolution. Information Systems Frontiers, 20, 45 – 61. https://doi.org/10.1007/s10796-016-9730-2 | |
dc.identifier.citedreference | Hine, C. ( 2006 ). Databases as Scientific Instruments and Their Role in the Ordering of Scientific Work. Social Studies of Science, 36 ( 2 ), 269 – 298. https://doi.org/10.1177/0306312706054047 | |
dc.identifier.citedreference | Hudson, L. W., Dutton, R. D., Reynolds, M. M., & Walden, W. E. ( 1971 ). TAXIR‐A biologically oriented information retrieval system as an aid to plant introduction. Economic Botany, 25 ( 4 ), 401 – 406. https://doi.org/10.1007/BF02985207 | |
dc.identifier.citedreference | Importing External Data into Specify 6. ( 2013, August 17). Specify Software Proejct. Retrieved from http://www.sustain.specifysoftware.org/wp-content/uploads/2017/03/Importing-External-Data-into-Specify-6.pdf | |
dc.identifier.citedreference | Jagadish, H. V., Chapman, A., Elkiss, A., Jayapandian, M., Li, Y., Nandi, A., & Yu, C. ( 2007 ). Making database systems usable (p. 13). ACM Press. https://doi.org/10.1145/1247480.1247483 | |
dc.identifier.citedreference | Kerr, S. T. ( 1990 ). Wayfinding in an electronic database: The relative importance of navigational cues vs. mental models. Information Processing & Management, 26 ( 4 ), 511 – 523. https://doi.org/10.1016/0306-4573(90)90071-9 | |
dc.owningcollname | Interdisciplinary and Peer-Reviewed |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.