Show simple item record

Detecting Informal Data References in Academic Literature

dc.contributor.authorLafia, Sara
dc.contributor.authorKo, Jeong-Woo
dc.contributor.authorMoss, Elizabeth
dc.contributor.authorKim, Jinseok
dc.contributor.authorThomer, Andrea
dc.contributor.authorHemphill, Libby
dc.date.accessioned2021-07-22T21:09:38Z
dc.date.available2021-07-22T21:09:38Z
dc.date.issued2021-07-22
dc.identifier.urihttps://hdl.handle.net/2027.42/168392en
dc.description.abstractThe Inter-university Consortium for Political and Social Research (ICPSR) is developing a machine learning approach using natural language processing (NLP) to assist in the detection of informal data references. Formal data citations that reference unique identifiers are readily discoverable; however, informal references indicating research data reuse are challenging to infer and detect. We contribute a model that uses a combination of cues, such as the presence of indicator terms and syntactical patterns, to assign a likelihood score to dataset mentions and extract candidate data citations from academic text. In production, the model will support the evaluation of candidate documents for ingest into the ICPSR Bibliography of Data-related Literature. This work supports a larger effort to measure the impact of research data.en_US
dc.language.isoen_USen_US
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectdata citationen_US
dc.subjectdata referenceen_US
dc.subjectmachine learningen_US
dc.subjectresearch data metricsen_US
dc.titleDetecting Informal Data References in Academic Literatureen_US
dc.typePreprinten_US
dc.subject.hlbsecondlevelSocial Sciences (General)
dc.subject.hlbtoplevelSocial Sciences
dc.contributor.affiliationumInter-university Consortium for Political and Social Research (ICPSR)en_US
dc.contributor.affiliationumInstitute for Social Research (ISR)en_US
dc.contributor.affiliationumSchool of Information (UMSI)en_US
dc.contributor.affiliationumcampusAnn Arboren_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/168392/1/Detecting_Informal_Data_Refs.pdf
dc.identifier.doihttps://dx.doi.org/10.7302/1671
dc.identifier.doihttps://doi.org/10.1002/asi.24646en_US
dc.identifier.orcid0000-0002-5896-7295en_US
dc.identifier.orcid0000-0001-5464-8716en_US
dc.identifier.orcid0000-0001-6481-2065en_US
dc.identifier.orcid0000-0001-6238-3498en_US
dc.identifier.orcid0000-0002-3793-7281en_US
dc.description.filedescriptionDescription of Detecting_Informal_Data_Refs.pdf : Preprint
dc.description.depositorSELFen_US
dc.identifier.name-orcidLafia, Sara; 0000-0002-5896-7295en_US
dc.identifier.name-orcidMoss, Elizabeth; 0000-0001-5464-8716en_US
dc.identifier.name-orcidKim, Jinseok; 0000-0001-6481-2065en_US
dc.identifier.name-orcidThomer, Andrea; 0000-0001-6238-3498en_US
dc.identifier.name-orcidHemphill, Libby; 0000-0002-3793-7281en_US
dc.working.doi10.7302/1671en_US
dc.owningcollnameInstitute for Social Research (ISR)


Files in this item

Show simple item record

Attribution-NonCommercial-NoDerivatives 4.0 International
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.