Exploratory and Directed Search Strategies at a Social Science Data Archive
dc.contributor.author | Lafia, Sara | |
dc.contributor.author | Million, A.J. | |
dc.contributor.author | Hemphill, Libby | |
dc.date.accessioned | 2023-05-01T15:42:55Z | |
dc.date.available | 2023-05-01T15:42:55Z | |
dc.date.issued | 2023-05-01 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/176239 | en |
dc.description.abstract | Researchers need to be able to find, access, and use data to participate in open science. To understand how users search for research data, we analyzed textual queries issued at a large social science data archive, the Inter-university Consortium for Political and Social Research (ICPSR). We collected unique user queries from 988,475 user search sessions over four years (2012-16). Overall, we found that only 30% of site visitors entered search terms into the ICPSR website. We analyzed search strategies within these sessions by extending existing dataset search taxonomies to classify a subset of the 1,554 most popular queries. We identified five categories of commonly-issued queries: keyword-based (e.g., date, place, topic); name (e.g., study, series); identifier (e.g., study, series); author (e.g., institutional, individual); and type (e.g., file, format). While the dominant search strategy used short keywords to explore topics, directed searches for known items using study and series names were also common. We further distinguished exploratory browsing from directed search queries based on their page views, refinements, search depth, duration, and length. Directed queries were longer (i.e., they had more words), while sessions with exploratory queries had more refinements and associated page views. By comparing search interactions at ICPSR to other natural language interactions in similar web search contexts, we conclude that dataset search at ICPSR is underutilized. We envision how alternative search paradigms, such as those enabled by recommender systems, can enhance dataset search. | en_US |
dc.description.sponsorship | This material is based upon work supported by the National Science Foundation under grant 2121789. | en_US |
dc.language.iso | en_US | en_US |
dc.rights | Attribution-NonCommercial 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/4.0/ | * |
dc.subject | research data | en_US |
dc.subject | information search | en_US |
dc.subject | query log analysis | en_US |
dc.subject | user behavior | en_US |
dc.subject | web analytics | en_US |
dc.title | Exploratory and Directed Search Strategies at a Social Science Data Archive | en_US |
dc.type | Conference Paper | en_US |
dc.subject.hlbsecondlevel | Social Sciences (General) | |
dc.subject.hlbtoplevel | Social Sciences | |
dc.contributor.affiliationum | ICPSR | en_US |
dc.contributor.affiliationum | UMSI | en_US |
dc.contributor.affiliationumcampus | Ann Arbor | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/176239/1/Exploratory and Directed Search Strategies at a Social Science Data Archive.pdf | |
dc.identifier.doi | https://dx.doi.org/10.7302/7178 | |
dc.identifier.orcid | 0000-0002-5896-7295 | en_US |
dc.identifier.orcid | 0000-0002-3793-7281 | en_US |
dc.identifier.orcid | 0000-0002-8909-153X | en_US |
dc.description.filedescription | Description of Exploratory and Directed Search Strategies at a Social Science Data Archive.pdf : Main article | |
dc.description.depositor | SELF | en_US |
dc.identifier.name-orcid | Lafia, Sara; 0000-0002-5896-7295 | en_US |
dc.identifier.name-orcid | Hemphill, Libby; 0000-0002-3793-7281 | en_US |
dc.identifier.name-orcid | Million, Anthony; 0000-0002-8909-153X | en_US |
dc.working.doi | 10.7302/7178 | en_US |
dc.owningcollname | Institute for Social Research (ISR) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.