Provenance Metadata for Statistical Data: An Introduction to Structured Data Transformation Language (SDTL)


dc.contributor.author Alter, George
dc.contributor.author Donakowski, Darrell
dc.contributor.author Gager, Jack
dc.contributor.author Heus, Pascal
dc.contributor.author Hunter, Carson
dc.contributor.author Ionescu, Sanda
dc.contributor.author Iverson, Jeremy
dc.contributor.author Jagadish, H V
dc.contributor.author Lagoze, Carl
dc.contributor.author Lyle, Jared
dc.contributor.author Mueller, Alexander
dc.contributor.author Revheim, Sigbjørn
dc.contributor.author Richardson, Matthew
dc.contributor.author Risnes, Ørnulf
dc.contributor.author Seelam, Karunakara
dc.contributor.author Smith, Dan
dc.contributor.author Smith, Tom
dc.contributor.author Song, Jie
dc.contributor.author Vaidya, Yashas Jaydeep
dc.contributor.author Voldsater, Ole
dc.date.accessioned 2020-07-06T18:58:58Z
dc.date.available 2020-07-06T18:58:58Z
dc.date.issued 2020-07-06
dc.identifier.uri http://hdl.handle.net/2027.42/156015
dc.description.abstract Structured Data Transformation Language (SDTL) provides structured, machine actionable representations of data transformation commands found in statistical analysis software. The Continuous Capture of Metadata for Statistical Data Project (C2Metadata) created SDTL as part of an automated system that captures provenance metadata from data transformation scripts and adds variable derivations to standard metadata files. SDTL also has potential for auditing scripts and for translating scripts between languages. SDTL is expressed in a set of JSON schemas, which are machine actionable and easily serialized to other formats. Statistical software languages have a number of special features that have been carried into SDTL. We explain how SDTL handles differences among statistical languages and complex operations, such as merging files and reshaping data tables from “wide” to “long”. en_US
dc.description.sponsorship National Science Foundation grant ACI-1640575 en_US
dc.language.iso en_US en_US
dc.subject metadata, data sharing, statistical analysis en_US
dc.title Provenance Metadata for Statistical Data: An Introduction to Structured Data Transformation Language (SDTL) en_US
dc.type Article en_US
dc.subject.hlbsecondlevel Statistics and Numeric Data
dc.subject.hlbtoplevel Social Sciences
dc.contributor.affiliationum Inter-university Consortium for Political and Social Research en_US
dc.contributor.affiliationum Center for Political Studies en_US
dc.contributor.affiliationum Computer Science and Engineering en_US
dc.contributor.affiliationother Colectica Inc. en_US
dc.contributor.affiliationother Metadata Technology North America Inc. en_US
dc.contributor.affiliationother Norwegian Centre for Research Data en_US
dc.contributor.affiliationother NORC en_US
dc.contributor.affiliationumcampus Ann Arbor en_US
dc.description.bitstreamurl https://deepblue.lib.umich.edu/bitstream/2027.42/156015/1/SDTL_Intro_v14.pdf
dc.identifier.orcid 0000-0003-3823-4972 en_US
dc.description.filedescription Description of SDTL_Intro_v14.pdf : Main article
dc.identifier.name-orcid Alter, George; 0000-0003-3823-4972 en_US
dc.owningcollname Inter-university Consortium for Political and Social Research (ICPSR)