Index Catalog // Deep Blue Data

Downloaded IAAF Sprint Results in all Heats for 2004 - 2016 Olympics for both Men and Women

Creator:: Mirshams Shahshahani, Payam
Description:: Investigating minimum human reaction times is often confounded by the motivation, training, and state of arousal of the subjects. We used the reaction times of athletes competing in the shorter sprint events in the Athletics competitions in recent Olympics (2004-2016) to determine minimum human reaction times because there's little question as to their motivation, training, or state of arousal. The reaction times of sprinters however are only available on the IAAF web page for each individual heat, in each event, at each Olympic. Therefore we compiled all these data into two separate excel sheets which can be used for further analyses.
Keyword:: minimum reaction time, sprinter, Olympics, Athletics, sex difference, starting block, and false start
Discipline:: Engineering, Health Sciences, Science, Other, and General Information Sources

Detecting Machine-obfuscated Plagiarism

Creator:: Foltynek, Tomas, Ruas, Terry, Scharpf, Philipp , Meuschke, Norman, Schubotz, Moritz , Grosky, William , and Gipp, Bela
Description:: This data set is comprised of multiple folders. The corpus folder contains raw text used for training and testing in two splits, "document" and "paragraph". The Spun documents and paragraphs are generated using the SpinBot tool ( https://spinbot.com/API). The paragraph split is generated by only selecting paragraphs with 3 or more sentences in the document split. Each folder is divided in mg (i.e., machine generated through SpinBot) and og (i.e., original generated file), The human judgement folder contains the human evaluation between original and spun documents (sample). It also contains the answers (keys) and survey results. , The models folder contains the machine learning classifier models for each word embedding technique used (only for document split training). The models were exported using pickle (Python 3.6). The grid search for hyperparameter adjustments is described in the paper. , and The vector folders (train and test) contains the average of all word vectors for each document and paragraph. Each line has the number of dimensions of the word embeddings technique used (see paper for more details) followed by its respective class (i.e, label mg or og). Each file belong to one class, either "mg" or "og". The values are comma-separated (.csv). The extension is .arff can be read as a normal .txt file.
Keyword:: paraphrase detection, plagiarism detection, document classification, and word embeddings
Citation to related publication:: Foltýnek, T. & Ruas, T. & Scharpf, P. & Meuschke, N. & Schubotz, M. & Grosky, W. & Gipp, B., “Detecting Machine-obfuscated Plagiarism,” in Sustainable Digital Communities, vol. 12051 LNCS, Springer, 2020, pp. 816–827. https://doi.org/10.1007/978-3-030-43687-2_68
Discipline:: General Information Sources

Data for Evaluating Individual Participant Data Plans for ICMJE Compliance: A Case Study at University of Michigan

Creator:: Samuel, Sara M, Wilson, Diane L, and Fleming, Emily K
Description:: The International Committee of Medical Journal Editors (ICMJE) requires researchers to post individual participant data (IPD) plans for interventional clinical trials with registration in order to be eligible for publication in its member journals. This study looked at how researchers interpret the ICMJE requirements and the related prompts for information used by ClinicalTrials.gov. This data consists of the analyzed contents of the IPD plans that researchers at the University of Michigan (U-M) submitted with trial registrations for the first 27 months that the 2019 requirement was in effect.
Keyword:: research data sharing, research data policy, research data, clinical trials, ClinicalTrials.gov, individual participant data, IPD, data sharing plan, and compliance
Citation to related publication:: Samuel, S. M. & Wilson, D. L. & Fleming, E., (2023) “Evaluating individual participant data plans for ICMJE compliance: A case study at University of Michigan”, Journal of the Society for Clinical Data Management 3(4). doi: https://doi.org/10.47912/jscdm.257
Discipline:: Health Sciences, Social Sciences, and General Information Sources

Downloaded IAAF Sprint Results in all Heats for 2004 - 2016 Olympics for both Men and Women

Detecting Machine-obfuscated Plagiarism

Data for Evaluating Individual Participant Data Plans for ICMJE Compliance: A Case Study at University of Michigan

Limit your search

Resource type

Creator

Discipline

Language

Search Results

Search Constraints

Search Results

Downloaded IAAF Sprint Results in all Heats for 2004 - 2016 Olympics for both Men and Women

Detecting Machine-obfuscated Plagiarism

Data for Evaluating Individual Participant Data Plans for ICMJE Compliance: A Case Study at University of Michigan

Limit your search

Resource type

Creator

Discipline

Language