Show simple item record

Provenance Metadata for Statistical Data: An Introduction to Structured Data Transformation Language (SDTL)

dc.contributor.authorAlter, George
dc.contributor.authorDonakowski, Darrell
dc.contributor.authorGager, Jack
dc.contributor.authorHeus, Pascal
dc.contributor.authorHunter, Carson
dc.contributor.authorIonescu, Sanda
dc.contributor.authorIverson, Jeremy
dc.contributor.authorJagadish, H V
dc.contributor.authorLagoze, Carl
dc.contributor.authorLyle, Jared
dc.contributor.authorMueller, Alexander
dc.contributor.authorRevheim, Sigbjørn
dc.contributor.authorRichardson, Matthew
dc.contributor.authorRisnes, Ørnulf
dc.contributor.authorSeelam, Karunakara
dc.contributor.authorSmith, Dan
dc.contributor.authorSmith, Tom
dc.contributor.authorSong, Jie
dc.contributor.authorVaidya, Yashas Jaydeep
dc.contributor.authorVoldsater, Ole
dc.date.accessioned2020-07-06T18:58:58Z
dc.date.available2020-07-06T18:58:58Z
dc.date.issued2020-07-06
dc.identifier.urihttps://hdl.handle.net/2027.42/156015
dc.description.abstractStructured Data Transformation Language (SDTL) provides structured, machine actionable representations of data transformation commands found in statistical analysis software. The Continuous Capture of Metadata for Statistical Data Project (C2Metadata) created SDTL as part of an automated system that captures provenance metadata from data transformation scripts and adds variable derivations to standard metadata files. SDTL also has potential for auditing scripts and for translating scripts between languages. SDTL is expressed in a set of JSON schemas, which are machine actionable and easily serialized to other formats. Statistical software languages have a number of special features that have been carried into SDTL. We explain how SDTL handles differences among statistical languages and complex operations, such as merging files and reshaping data tables from “wide” to “long”.en_US
dc.description.sponsorshipNational Science Foundation grant ACI-1640575en_US
dc.language.isoen_USen_US
dc.subjectmetadata, data sharing, statistical analysisen_US
dc.titleProvenance Metadata for Statistical Data: An Introduction to Structured Data Transformation Language (SDTL)en_US
dc.typeArticleen_US
dc.subject.hlbsecondlevelStatistics and Numeric Data
dc.subject.hlbtoplevelSocial Sciences
dc.contributor.affiliationumInter-university Consortium for Political and Social Researchen_US
dc.contributor.affiliationumCenter for Political Studiesen_US
dc.contributor.affiliationumComputer Science and Engineeringen_US
dc.contributor.affiliationotherColectica Inc.en_US
dc.contributor.affiliationotherMetadata Technology North America Inc.en_US
dc.contributor.affiliationotherNorwegian Centre for Research Dataen_US
dc.contributor.affiliationotherNORCen_US
dc.contributor.affiliationumcampusAnn Arboren_US
dc.description.bitstreamurlhttps://deepblue.lib.umich.edu/bitstream/2027.42/156015/1/SDTL_Intro_v14.pdf
dc.identifier.orcid0000-0003-3823-4972en_US
dc.description.filedescriptionDescription of SDTL_Intro_v14.pdf : Main article
dc.identifier.name-orcidAlter, George; 0000-0003-3823-4972en_US
dc.owningcollnameInter-university Consortium for Political and Social Research (ICPSR)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.