The Value of Early Disclosure Risk Decisions
dc.contributor.author | O'Rourke, JoAnne McFarland | |
dc.date.accessioned | 2012-03-20T19:48:25Z | |
dc.date.available | 2012-03-20T19:48:25Z | |
dc.date.issued | 2012-03-19 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/90426 | |
dc.description.abstract | When data are made available to others to analyze for their purposes, steps must be taken to ensure confidentiality, that is to prevent the identities of the persons or institutions that were studied are not disclosed and cannot be deduced. Disclosure risk analysis is conducted in order to create a public-use file (PUF) from confidential, or restricted-use, data. Based on this analysis of disclosure risks, statistical disclosure limitation (SDL) methodologies are applied to the data to create the PUF. The public-use file (PUF) is the only version of the microdata to which most researchers ever have access and the version from which much of the utility of the data is extracted. Therefore, decisions made to create the PUF, in terms of variable changes (e.g., deletions, recodes) and the selection of statistical disclosure limitation (SDL) methods (e.g., data swapping, imputation collapsing categories) are very important and must match the key intended purposes of the data collection and the disclosure risk. Typically, decisions regarding disclosure risk are made after data collection is completed. This article will describe a new model for conducting disclosure risk analysis for the creation of PUFs that moves decisions regarding disclosure risk to the beginning of the survey research process. Early thinking and decision-making regarding disclosure risk can lead to a more analytically useful PUF and the most optimal set of data products that can be developed (tables, maps, online analysis, and so on, in addition to the PUF). Efficiencies created between the various stages of the research process by the model will shorten the time between data collection and data release, thus increasing the value of the shared data to secondary analysts and to science. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Disclosure Risk | en_US |
dc.subject | Statistical Disclosure Limitation | en_US |
dc.subject | Disclosure Analysis | en_US |
dc.subject | Public-use Data Files | en_US |
dc.subject | Survey Data | en_US |
dc.title | The Value of Early Disclosure Risk Decisions | en_US |
dc.type | Article | en_US |
dc.subject.hlbsecondlevel | Social Sciences (General) | |
dc.subject.hlbtoplevel | Social Sciences | |
dc.contributor.affiliationum | ISR | en_US |
dc.contributor.affiliationumcampus | Ann Arbor | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/90426/4/Next Steps in Advancing Disclosure Risk Analysis.pdf | |
dc.owningcollname | Institute for Social Research (ISR) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.