Search Constraints
Number of results to display per page
View results as:
Search Results
-
- Creator:
- Foltynek, Tomas, Ruas, Terry, Scharpf, Philipp , Meuschke, Norman, Schubotz, Moritz , Grosky, William , and Gipp, Bela
- Description:
- This data set is comprised of multiple folders. The corpus folder contains raw text used for training and testing in two splits, "document" and "paragraph". The Spun documents and paragraphs are generated using the SpinBot tool ( https://spinbot.com/API). The paragraph split is generated by only selecting paragraphs with 3 or more sentences in the document split. Each folder is divided in mg (i.e., machine generated through SpinBot) and og (i.e., original generated file), The human judgement folder contains the human evaluation between original and spun documents (sample). It also contains the answers (keys) and survey results. , The models folder contains the machine learning classifier models for each word embedding technique used (only for document split training). The models were exported using pickle (Python 3.6). The grid search for hyperparameter adjustments is described in the paper. , and The vector folders (train and test) contains the average of all word vectors for each document and paragraph. Each line has the number of dimensions of the word embeddings technique used (see paper for more details) followed by its respective class (i.e, label mg or og). Each file belong to one class, either "mg" or "og". The values are comma-separated (.csv). The extension is .arff can be read as a normal .txt file.
- Keyword:
- paraphrase detection, plagiarism detection, document classification, and word embeddings
- Citation to related publication:
- Foltýnek, T. & Ruas, T. & Scharpf, P. & Meuschke, N. & Schubotz, M. & Grosky, W. & Gipp, B., “Detecting Machine-obfuscated Plagiarism,” in Sustainable Digital Communities, vol. 12051 LNCS, Springer, 2020, pp. 816–827. https://doi.org/10.1007/978-3-030-43687-2_68
- Discipline:
- General Information Sources
-
Geotechnical observations of weathered rock across a tectonic and climatic gradient in Central Nepal
- Creator:
- Medwedeff, William, G (University of Michigan Earth & Environmental Science), Clark, Marin, K (University of Michigan Earth & Environmental Science), Zekkos, Dimitrios (University of California, Berkeley), West, A., Joshua (University of Southern California), and Chamlagain, Deepak (Tribhuvan University, Kathmandu Nepal)
- Description:
- These datasets support the findings of Medwedeff et al. (2021) in JGR: Earth Surface. In this article, we present seismic and geotechnical characterizations of the shallow subsurface across a 200 km by 50 km swath of the central Himalayan Range, in Nepal. By pairing widely-distributed 1D shear wave velocity surveys and engineering outcrop descriptions per the Geological Strength Index classification system, we evaluate landscape-scale patterns in near-surface mechanical characteristics and their relation to environmental factors known to affect rock strength. We find that near-surface strength is more dependent on the degree of weathering, rather than the mineral and textural differences between the metamorphic lithologies found in the central Himalaya. Furthermore, weathering varies systematically with topography. Bedrock ridge top sites are highly weathered and have S-wave seismic velocities and shear strength characteristics that are more typical of engineering soils, whereas sites near the bedrock channel bottom tend to be less weathered and characterized by high S-wave velocities and shear strength estimates typical of hard rock. Weathering of bedrock on hillslopes is significantly more variable, resulting in S-wave velocities that range between the ridge and channel endmembers. We hypothesize variability in the hillslope environment may be partly explained by the stochastic nature of mass wasting, which clears away weathered material where landslide scars are recent. These results underscore the mechanical heterogeneity in the shallow subsurface and highlight the need to account for bedrock weathering when estimating strength parameters for regional landslide hazard analysis.
- Keyword:
- rock strength, critical zone, shallow seismic, and chemical weathering
- Discipline:
- Science
-
- Creator:
- Gorchov Negron, Alan M., Kort, Eric A., Conley, Stephen A., and Smith, Mackenzie L.
- Description:
- This data-set contains data used in the publication "Airborne Assessment of Methane Emissions from Offshore Platforms in the U.S. Gulf of Mexico" by Gorchov Negron et al. (2020). There are 46,032 rows and 45 columns in the data. and The aircraft sampled offshore facilities with two unique sampling strategies: facility-level samples and regional box samples. Gorchov Negron et al. used facility-level samples to calculate facility-level fluxes and regional box samples, in conjunction with vertical profiles, to calculate regional-level fluxes. Meteorological parameters in the data were evaluated to discern when assumptions for each method were met. The facility-level fluxes were used to generate a facility-level aerial measurement-based inventory that was scaled up for comparison with regional-level fluxes.
- Keyword:
- Methane Emissions, Offshore Oil and Gas Platforms, Airborne Measurements, Greenhouse Gas Mitigation, and Gulf of Mexico
- Citation to related publication:
- Alan M. Gorchov Negron, Eric A. Kort, Stephen A. Conley, Mackenzie L. Smith. "Airborne Assessment of Methane Emissions from Offshore Platforms in the U.S. Gulf of Mexico". Environ. Sci. Technol. 2020. http://dx.doi.org/10.1021/acs.est.0c00179
- Discipline:
- Science
-
- Creator:
- Yiwen, Mei
- Description:
- The datasets of this archive are produced for a research project on the development of an advanced hydrologic modeling system for the St. Lawrence river basin. The outputted datasets from model simulations are in Netcdf 4 format. The author recommend using the netCDF Operators (NCO) program for data processing. For visualization and plotting, the author recommend using software like MATLAB, Python or R.
- Keyword:
- Hydrologic modeling, reanalysis product, St. Lawrence river, water balance, WRF-Hydro
- Discipline:
- Science
-
- Creator:
- Dewaraja, Yuni K and Van, Benjamin J
- Description:
- This publication contains anonymized time integrated activity maps for two patients. SPECT/CT scans were taken at 4 different time points in the week following a therapeutic dose of Lu-177 DOTATATE and combined into a single activity map for each patient. All images are in DICOM format.
- Keyword:
- Lu-177, Dosimetry, Radionuclide, SPECT, and CT
- Discipline:
- Health Sciences
-
- Creator:
- McCuen, Brett A.
- Description:
- These data are TLA events identified in MACCS magnetometer data throughout 2015. These events are short-timescale (< 60 s), large -amplitude (> 6 nT/s) magnetic disturbances measured at Earth's surface that are analyzed for space weather research purposes. and The events were identified in a year's worth of magnetic field data using an algorithm developed in the MATLAB platform. The algorithm dBdt_main.m can be run using the associated scripts (clean_maccs.m, simple_dbdt.m, extremes1.m, newdbdt.m) to return the events in the 2015_AllEvents.csv file. The substorm onset delays of each event are determined with the onset_delays.m script and the substorm event list 20191127-15-56-substorms.csv (both included).
- Keyword:
- space weather impacts, geomagnetically induced currents, GIC, transient induced currents, transient large amplitude, dB/dt search algorithm, and TLA
- Citation to related publication:
- Engebretson, M. J., Pilipenko, V. A., Ahmed, L. Y., Posch, J. L., Steinmetz, E. S., Moldwin, M. B., … Vorobev, A. V. (2019). Nighttime Magnetic Perturbation Events Observed in Arctic Canada: 1. Survey and Statistical Analysis. Journal of Geophysical Research: Space Physics, 124(9), 7442–7458. https://doi.org/10.1029/2019JA026794
- Discipline:
- Science
-
- Creator:
- Li, Jieming, Zhang, Leyou, Johnson-Buck, Alexander, and Walter, Nils G.
- Description:
- Traces from single-molecule fluorescence microscopy (SMFM) experiments exhibit photophysical artifacts that typically necessitate human expert screening, which is time-consuming and introduces potential for user-dependent expectation bias. Here, we have used deep learning to develop a rapid, automatic SMFM trace selector, termed AutoSiM, that improves the sensitivity and specificity of an assay for a DNA point mutation based on single-molecule recognition through equilibrium Poisson sampling (SiMREPS). The improved performance of AutoSiM is based on accepting both more true positives and fewer false positives than the conventional approach of hidden Markov modeling (HMM) followed by thresholding. As a second application, the selector was used for automated screening of single-molecule Förster resonance energy transfer (smFRET) data to identify high-quality traces for further analysis, and achieves ~90% concordance with manual selection while requiring less processing time. AutoSiM can be adapted readily to novel datasets, requiring only modest Transfer Learning.
- Keyword:
- deep learning, single-molecule fluorescence, total internal reflection microscopy, SiMREPS, smFRET, and Forster resonance energy transfer
- Citation to related publication:
- Li, J., Zhang, L., Johnson-Buck, A., & Walter, N. G. (2020). Automatic classification and segmentation of single-molecule fluorescence time traces with deep learning. Nature Communications, 11(1), 5833. https://doi.org/10.1038/s41467-020-19673-1 and Hayward, S., Lund, P., Kang, Q., Johnson-Buck, A., Tewari, M., Walter, N. (2018). Single-molecule microscopy image data and analysis files for "Ultra-specific and Amplification-free Quantification of Mutant DNA by Single-molecule Kinetic Fingerprinting" [Data set]. University of Michigan - Deep Blue. https://doi.org/10.7302/Z2CZ35DF
- Discipline:
- Science
-
- Creator:
- University of Michigan Museum of Paleontology and CTEES
- Description:
- Reconstructed CT slices for a right proximal metatarsal 1 of the Cantius trigonodus (University of Michigan Museum of Paleontology catalog number UMMP VP 81822), as a series of TIFF images. Raw projections are not included in this dataset.
- Keyword:
- Paleontology, Fossil, CT, Primates, Notharctidae, UMMP, University of Michigan Museum of Paleontology, Eocene, and CTEES
- Discipline:
- Science
-
- Creator:
- Chun-Seok Cho, Jingyue Xi, Hyun Min Kang, and Jun Hee Lee
- Description:
- There are three experimental outputs from Seq-Scope. (1) High definition map coordinate identifier (HDMI) sequence, tile and spatial coordinate information from 1st-Seq, (2) HDMI sequence, coupled with cDNA sequence from 2nd-Seq, and (3) Histological image obtained from Hematoxylin and Eosin (H&E) staining of the tissue slice. (1) and (2) were uploaded to GEO ( https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE169706). (3) is deposited here. In addition, this deposit includes the processed RDS (single R object) data files.
- Keyword:
- Seq-Scope, scRNA-seq, and spatial transcriptomics
- Citation to related publication:
- Chun-Seok Cho, Jingyue Xi, Sung-Rye Park, Jer-En Hsu, Myungjin Kim, Goo Jun, Hyun-Min Kang, Jun Hee Lee “Seq-Scope: Submicrometer-resolution spatial transcriptomics for single cell and subcellular studies” (preprint) bioRxiv https://doi.org/10.1101/2021.01.25.427807 and Related data sets in NCBI’s Gene Expression Omnibus (GEO) repository: Cho C, Xi J, Si Y, Lee JH, Kang HM, Park S, Hsu J, Kim M, Jun G “Seq-Scope: Submicrometer-resolution spatial barcoding technology that enables microscopic examination of tissue transcriptome at single cell and subcellular levels” https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE169706
- Discipline:
- Health Sciences
-
- Creator:
- Xian Li
- Description:
- Low-velocity accretionary wedges and sedimentary layers overlaying continental plates are widely observed in the subduction zones where historical large earthquakes have occurred. It was observed that rupture of the 2011 Mw 9.0 Tohoku-Oki earthquake propagated to the trench with large coseismic slip on the shallow fault, but what caused the huge shallow slip remains a prominent problem., Here we explore how the two low-velocity structures, accretionary wedge and sedimentary layer, affect the coseismic slip and near-fault ground motions during the 2011 Tohoku-Oki earthquake. Constrained by the observed seafloor deformation, we present a 2-D dynamic rupture model of the 2011 Tohoku-Oki earthquake with an accretionary wedge and a sedimentary layer. Compared to a homogeneous model with the same friction and stress parameters on the fault, we find that the co-existence of the accretionary wedge and sedimentary layer significantly enhances the shallow coseismic slip and amplifies ground accelerations near the accretionary wedge. We then investigate a plausible scenario of a smaller Tohoku-Oki earthquake when its rupture does not reach the accretionary wedge. The sedimentary layer slightly enhances the coseismic slip while the accretionary wedge has almost no influence for the smaller earthquake scenario, but both structures significantly amplify the ground accelerations on the overriding plate. , and By simulating a suite of earthquake scenarios, we suggest that the co-existence of an accretionary wedge and sedimentary layers tend to enhance coseismic slip, but the enhancement effect decreases as the up-dip limit of rupture zones terminates at a larger depth. The numerical simulations were solved using SEM2DPACK _2.3.8 ( http://www.sourceforge.net/projects/sem2d/), and simulation results were visualized by Matlab. This folder includes the input files to reproduce our simulation results and plot scripts.
- Keyword:
- 2011 Tohoku-Oki earthquake, Dynamic rupture simulation, Accretionary wedge, and Sedimentary layer
- Citation to related publication:
- Li, X., & Huang, Y. (2021). The enhancement of coseismic slip and ground motion due to the accretionary wedge and sedimentary layer in the 2011 Tohoku-Oki earthquake (world) [Preprint]. Earth and Space Science Open Archive. https://doi.org/10.1002/essoar.10506336.1
- Discipline:
- Science