Feasibility and Reliability of Automated Coding of Occupation in the Health and Retirement Study
dc.contributor.author | Helppie-McFall, Brooke | |
dc.contributor.author | Sonnega, Amanda | |
dc.date.accessioned | 2019-02-20T14:48:35Z | |
dc.date.available | 2019-02-20T14:48:35Z | |
dc.date.issued | 2018-12 | |
dc.identifier.citation | Helppie-McFall, Brooke and Amanda Sonnega. 2018. “Feasibility and Reliability of Automated Coding of Occupation in the Health and Retirement Study.” Ann Arbor MI: University of Michigan Retirement Research Center (MRRC) Working Paper, WP 2018-392. https://mrdrc.isr.umich.edu/publications/papers/pdf/wp392.pdf | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/148129 | |
dc.description.abstract | Due to advances in computing power and the increase in coverage of longitudinal datasets in the Health and Retirement Study (HRS) that provide information about detailed occupations, demand has increased among researchers for improved occupation and industry data. The detailed data are currently hard to use because they were coded at different times, and the codeframes are, therefore, not consistent over time. Additionally, the HRS gathers new occupation and industry information from respondents every two years, and coding of new data at each wave is costly and time-consuming. In this project, we tested the NIOSH Industry and Occupation Computerized Coding System (NIOCCS) to see if it could improve processes for coding data from the HRS. We tested results from NIOCCS against results from a human coder for multiple datasets. NIOCCs does reasonably well compared to coding results from a highly-trained, professional occupation and industry coder, with kappa inter-rater reliability on detailed codes of just under 70 percent and agreement rates on broader codes of around 80 percent; however, code rates for NIOCCS for the datasets tested ranged from 60 percent to 72 percent, as compared to a professional coder’s ability to code those same datasets that ranged from 95 percent to 100 percent. In its current form, we find that NIOCCS is a tool that might be best used to reduce the number of cases human coders must code, either in coding historical data to a consistent codeframe or in coding data from future HRS waves. However, it is not yet ready to fully replace human coders. | en_US |
dc.description.sponsorship | U.S. Social Security Administration, Award number RRC08098401-10, R-UM18-06 | en_US |
dc.language.iso | en_US | en_US |
dc.publisher | Michigan Retirement Research Center, University of Michigan, P.O. Box 1248, Ann Arbor, MI 48104 | en_US |
dc.relation.ispartofseries | WP 2018-392 | en_US |
dc.subject | Health and Retirement Study, older workers, occupation | en_US |
dc.title | Feasibility and Reliability of Automated Coding of Occupation in the Health and Retirement Study | en_US |
dc.title.alternative | WP 2018-392 | en_US |
dc.type | Working Paper | en_US |
dc.subject.hlbsecondlevel | Population and Demography | |
dc.subject.hlbtoplevel | Social Sciences | |
dc.contributor.affiliationum | University of Michigan, Institute for Social Research | en_US |
dc.contributor.affiliationum | University of Michigan, Institute for Social Research | en_US |
dc.contributor.affiliationumcampus | Ann Arbor | en_US |
dc.description.bitstreamurl | https://deepblue.lib.umich.edu/bitstream/2027.42/148129/1/wp392.pdf | |
dc.owningcollname | Retirement and Disability Research Center, Michigan (MRDRC) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.