Search Constraints
Filtering by:
Language
English
Remove constraint Language: English
Resource type
Dataset
Remove constraint Resource type: Dataset
Discipline
Science
Remove constraint Discipline: Science
Number of results to display per page
View results as:
Search Results
-
- Creator:
- Gliske, Stephen V and Stacey, William C
- Description:
- This data is part of a large program to translate detection and interpretation of HFOs into clinical use. A zip file is included which contains hfo detections, metadata, and Matlab scripts. The matlab scripts analyze this input data and produce figures as in the referenced paper (note: the blind source separation method is stochastic, and so the figures may not be exactly the same). A file "README.txt" provides more detail about each individual file within the zip file.
- Keyword:
- hfo, high frequency oscillation, ripple, fast ripple, blind source separation, non-negative matrix factorization, and temporal variability
- Discipline:
- Science, Engineering, and Health Sciences
-
- Creator:
- Ward, Jamie L ., Flanner, Mark G., Bergin, Mike, Dibb, Jack E., Polashenski, Chris M., Soja, Amber J., and Thomas, Jennie L.
- Description:
- Biomass burning produces smoke aerosols that are emitted into the atmosphere. Some smoke constituents, notably black carbon (BC), are highly effective light-absorbing aerosols (LAA). Emitted LAA can be transported to high albedo regions like the Greenland Ice Sheet (GrIS) and affect local snowmelt. In the summer, the effects of LAA in Greenland are uncertain. To explore how LAA affect GrIS snowmelt and surface energy flux in the summer, we conduct idealized global climate model simulations with perturbed aerosol amounts and properties in the GrIS snow and overlying atmosphere. The in-snow and atmospheric aerosol burdens we select range from background values measured on the GrIS to unrealistically high values. This helps us explore the linearity of snowmelt response and to achieve high signal-to-noise ratios. With LAA operating only in the atmosphere, we find no significant change in snowmelt due to the competing effects of surface dimming and tropospheric warming. Regardless of atmospheric LAA presence, in-snow BC-equivalent mixing ratios greater than ~60 ng/g produce statistically significant snowmelt increases over much of the GrIS. We find that net surface energy flux changes correspond well to snowmelt changes for all cases. The dominant component of surface energy flux change is solar energy flux, but sensible and longwave energy fluxes respond to temperature changes. Atmospheric LAA dampen the magnitude of solar radiation absorbed by in-snow LAA when both varieties are simulated. In general, the significant melt and surface energy flux changes we simulate occur with LAA quantities that have never been recorded in Greenland.
- Keyword:
- climate, Greenland Ice Sheet, black carbon, biomass burning, snowmelt, and surface energy balance
- Citation to related publication:
- Ward, J.L., et al. (2018). Modeled Response of Greenland Snowmelt to the Presence of Biomass Burning-Based Absorbing Aerosols in the Atmosphere and Snow. Journal of Geophysical Research: Atmospheres. 123, 6122– 6141. https://doi.org/10.1029/2017JD027878
- Discipline:
- Science
-
- Creator:
- Bemmels, Jordan B. and Dick, Christopher W.
- Description:
- Raw SNP genotypes are provided in STRUCTURE format, with a maximum of one SNP reported per ddRAD locus. The files "caryco_SNP.str" and "caryov_SNP.str" are genotypes for Carya cordiformis and Carya ovata, respectively. The first column of each file is the individual name, the second column is the population (see original publication for information on population locations), and the remaining columns are genotypes of individual SNPs. Rows represent individuals, with the diploid genotypes contained on two lines per individual. Missing data are entered as "0" (zero). The first row is a header with a unique identifier for each SNP. and Occurrence records for each species are provided in the file "occs_carya.csv" and contain the latitude and longitude of each record.
- Keyword:
- eastern North America, glacial refugia, phylogeography, temperate trees, and single nucleotide polymorphisms
- Citation to related publication:
- Bemmels, J.B., and C.W. Dick. 2018. Genomic evidence of a widespread southern distribution during the Last Glacial Maximum for two North American hickory species. Journal of Biogeography, 45: 1739– 1750. https://doi.org/10.1111/jbi.13358
- Discipline:
- Science
-
- Creator:
- Mirshams Shahshahani, Payam
- Description:
- Investigating minimum human reaction times is often confounded by the motivation, training, and state of arousal of the subjects. We used the reaction times of athletes competing in the shorter sprint events in the Athletics competitions in recent Olympics (2004-2016) to determine minimum human reaction times because there's little question as to their motivation, training, or state of arousal. The reaction times of sprinters however are only available on the IAAF web page for each individual heat, in each event, at each Olympic. Therefore we compiled all these data into two separate excel sheets which can be used for further analyses.
- Keyword:
- minimum reaction time, sprinter, Olympics, Athletics, sex difference, starting block, and false start
- Citation to related publication:
- Mirshams Shahshahani P, Lipps DB, Galecki AT, Ashton-Miller JA (2018) On the apparent decrease in Olympic sprinter reaction times. PLoS ONE 13(6): e0198633. https://doi.org/10.1371/journal.pone.0198633
- Discipline:
- Engineering, Health Sciences, Science, Other, and General Information Sources
-
- Creator:
- Adam Schneider and Mark Flanner
- Description:
- This dataset contains all data used to generate the figures in The Cryosphere manuscript “Measuring Snow Specific Surface Area with 1.30 and 1.55 micro-meter Bidirectional Reflectance Factors,” by Adam Schneider, Mark Flanner, and Roger De Roo. These data support the theory, calibration, and application of the Near-Infrared Emitting and Reflectance Monitoring Dome (NERD), an instrument engineered to rapidly retrieve surface snow specific surface area in the field. Note that this deposit includes a microCT scan database for natural snowfall samples collected in New Hampshire during 2015-2017, comprised of raw tiff files as well as reconstructions, binarized reconstructions, and some 3D model reconstructions. and Running python scripts generally require that the following packages are installed: NumPy, SciPy, Matplotlib, Pandas, and ipdb (for debugging).
- Keyword:
- Snow specific surface area, Monte Carlo, X-ray micro-computed tomography, SNICAR, Near-Infrared Emitting and Reflectance-Monitoring Dome, Bidirectional reflectance factor, Cryosphere, and 3D
- Discipline:
- Science
-
- Creator:
- Hayward, Stephen L. , Lund, Paul E., Kang, Qing, Johnson-Buck, Alexander , Tewari, Muneesh, and Walter, Nils G.
- Description:
- This work contains the experimental data and associated analysis that are described in the research publication entitled "Ultra-specific and Amplification-free Quantification of Mutant DNA by Single-molecule Kinetic Fingerprinting". This work contains multiple zip files, each of which represents one of the principal experiment groups presented in the publication. Each experiment group contains movie and analysis files corresponding to various experimental conditions related to that experiment group.
- Keyword:
- Single Molecule Fluorescence, Super-Resolution Microscopy, Nucleic Acid Hybridization, T790M Mutation, Cytosine Deamination, SiMREPS, and single molecule kinetic fingerprinting
- Citation to related publication:
- https://pubs.acs.org/doi/10.1021/jacs.8b06685
- Discipline:
- Science
-
- Creator:
- Alsip, Peter
- Description:
- Percent Weight Change Data: The model was run continuously on a daily time step for seasonal intervals (Spring: March thru May; Summer: June thru August; Fall: September thru November) as well as contiguously from Spring to Fall to assess total growth over the likely growing season (March thru November). CSV files represent the simulated weight change (%) of Bighead and Silver Carp for the respective time periods associated with the file name. Initial fish mass for each seasonal interval and growing season was 4350 g for Silver Carp and 5480 g for Bighead Carp. Maximum and mean total weight change (%) was determined for three depth ranges (near surface depths [NS]: 0 – 10 m; deep chlorophyll layer depths [DCL]: 10 - 50 m; and whole water column [WC]). Coordinates are in decimal degrees. File naming convention: speciesSeasonWtChange (e.g. bigheadFallWtChange = % weight change of Bighead Carp from September through November) , Monthly Habitat Quality Data: Rdata files contain matrices of Bighead or Silver carp growth rate potential as represented as a mass-proportional growth rate (gram of carp/gram of carp/day [g/g/d]) for the 15th day of each month. Habitats with growth rate potential >= 0 g/g/d were deemed suitable. Matrix attributes: Rows: Row numbers refer to the spatial node with 20 equally-spaced vertical layers. Columns: Columns 1-20 refer to the growth rate potential value for each vertical layer of each node. Vertical layers are evenly spaced based on the total depth of the water column for each node. Depth for each node can be found in the grid attributes data file. Columns 21 ("meanG") and 22 ("Gmax") represent the average and maximum growth rate potential, respectively, of the fish across the whole water column for the corresponding node. File naming convention: species_MonthNumber (e.g. silver_06 = Silver carp growth rate potential in June) Spatial coordinates for each node can be found in the grid attributes data files., Grid attributes data: This Rdata file provides the spatial reference data and other grid attributes. Coordinates are provided in UTM (x & y) and latitude and longitude (decimal degrees). Depth (meters) for each node is listed in this file. , GRP Model code: Details bioenergetics equations, foraging equation, functions for running the model on a monthly time-step and daily time step, and functions for basic analyses. Model is coded in R., and The simulated input data (prey and temperature) used to run our model is not included in this data set. Instead we provide the model code, grid attributes, and outputs of the model. The readRDS() function (R Base Package v.3.5.1) is required to read in .Rdata files in R.
- Keyword:
- Asian Carp, Laurentian Great Lakes, Habitat Suitability, Invasive Species, Lake Michigan, and Ecological Modeling
- Discipline:
- Science
-
- Creator:
- Thomaz, Andréa T. (UMICH) and Knowles, L. Lacey (UMICH)
- Description:
- The eastern coastal basins of Brazil are a series of small and isolated rivers that drain directly into the Atlantic Ocean. During the Pleistocene, sea-level retreat caused by glaciations exposed the continental shelf, resulting in enlarged paleodrainages that connected rivers that are isolated today. Using Geographic Information System (GIS), we infer the distribution of these paleodrainages, and their properties for the east Brazilian coast. Here, we publicly make available the shapefiles that demonstrate the paleodrainage structure along the Brazilian coast during the largest sea-level retreats in the Pleistocene, the riverine vectors during the same period and the coastal line for a drop of -125m in the sea.
- Keyword:
- Paleodrainages, Glaciations, Pleistocene, Brazil, Neotropical, and Sea-level retreat
- Discipline:
- Science
-
- Creator:
- Kort, EA, Gvakharia, A, Smith, ML, and Conley, S
- Description:
- Data is collected from research flights based in West Memphis, Arkansas, covering the Mississippi River Valley. The data file contains all merged flight data from each flight day.
- Keyword:
- Greenhouse gas
- Citation to related publication:
- Gvakharia, A., Kort, E.A., Smith, M.L., Conley, S., 2018. Testing and evaluation of a new airborne system for continuous N2O, CO2, CO, and H2O measurements: the Frequent Calibration High-performance Airborne Observation System (FCHAOS). Atmospheric Measurement Techniques; Katlenburg-Lindau 11, 6059. https://doi.org/10.5194/amt-11-6059-2018
- Discipline:
- Science
-
- Creator:
- Liemohn, Michael W, McCollough, James P, Engel, Miles A, Jordanova, Vania K, and Morley, Steven K
- Description:
- There is a directory tree inside this zipped file. The main directory has the Adobe Illustrator plots of the figures in the paper, Space Weather journal manuscript # 2018SW002067, "Model evaluation guidelines for geomagnetic index predictions" by M. W. Liemohn and coauthors. The three subdirectories have the files for the individual models, the data to which they are compared, and the IDL code used to create the figure plots and metrics calculations. and Date coverage is specific to each model. The RAMSCB model covers January 2005, the WINDMI model all of 2014, and the UPOS model 1.5 solar cycles, from 1 October 2001 through 29 July 2013.
- Keyword:
- space weather, model assessment, time series metrics, and geomagnetic indices
- Discipline:
- Science
-
- Creator:
- R Paul Drake
- Description:
- The specific focus of the project was radiative shocks, which develop when shock waves become so fast and hot that the radiation from the shocked matter dominates the energy transport. This in turn leads to changes in the shock structure. Radiative shocks are challenging to simulate, as they include phenomena on a range of spatial and temporal scales and involve two types of nonlinear physics Ð- hydrodynamics and radiation transport. Even so, the range of physics involved is narrow enough that one can hope to model all of it with sufficient fidelity to reproduce the data. CRASH was focused on developing predictions for a sequence of experiments performed in Project Year 5, in which those experiments represented an extrapolation from all previously available data. The previous data involved driving radiative shocks within cylindrical structures, and mainly straight tubes. The Year 5 experiments drove a radiative shock down an elliptical tube. Our long-stated goal for these predictions was that the distribution of predicted values would overlap significantly with the observed distribution. We achieved this goal. Achieving our goal required the conversion of an established space-weather code to model radiative shocks at high energy density. To obtain reasonable fidelity with respect to the experimental data required implementing a laser absorption package, in addition to a hydrodynamic solver, electron physics and heat conduction, and multigroup diffusive radiation transport. The dedicated experiments provided evidence of experimental variability, validation of the calculation of initial shock wave behavior, and validation data at many observation times using cylindrical shock tubes. Following this were preparatory experiments for and finally the execution of the Year 5 experiments. The predictive science research included a wide range of sensitivity studies to determine which variables were important and a sequence of predictive studies focused on specific issues and sets of data. This led ultimately to predictions of shock location for the Year 5 experiments. A conclusion from this project is that the serious quantification of uncertainty in simulations is a dauntingly difficult and expensive prospect. Pre-existing codes are unlikely to have been built with attention to what will be needed to quantify their uncertainty. Pre-existing experimental results are even more unlikely to include a sufficiently detailed analysis of the experimental uncertainties. And this will also be true of most experiments that might be used to validate components of the simulation. The analysis of uncertainty in any one of the physical processes (and related physical constants) is a major effort. And addressing model form uncertainty is an even bigger challenge, that may in principle require development of complete, alternative simulation models. We made a start at all of this, and completed almost none of it. But by the end of a project, we finally had all the pieces in place and working that would have enabled a range of important studies and advances in relatively near-term years. But the sponsor terminated the program after only five years. For most of the participants this was a relatively minor development, although for a few of them it proved to be enormously disruptive. We believe that the cost to the nation, in work that was ready be done but now will not be, was much much larger. The sketch of the target was produced using a drawing program based on the experimental dimensions. The annotated photograph of the target was obtained using a visible-light camera. The colorized radiographs were obtained via backilit-pinhole radiography of a radiative shock propagating down an elliptical tube, at 26 ns after the lasers driving the shock tube fired. The graph showing lines and circles was produced by running many computer models, analyzing their statistical distribution, and measuring actual shock positions in the experiment.
- Keyword:
- Radiative shock
- Discipline:
- Science
-
- Creator:
- Ramasubramani, Vyas
- Description:
- The goal of the work is to elucidate the stability of a complex experimentally observed structure of proteins. We found that supercharged GFP molecules spontaneously assemble into a complex 16-mer structure that we term a protomer, and that under the right conditions an even larger assembly is observed. The protomer structure is very well defined, and we performed simulations to try and understand the mechanics underlying its behavior. In particular, we focused on understanding the role of electrostatics in this system and how varying salt concentrations would alter the stability of the structure, with the ultimate goal of predicting the effects of various mutations on the stability of the structure. There are two separate projects included in this repository, but the two are closely linked. One, the candidate_structures folder, contains the atomistic outputs used to generate coarse-grained configurations. The actual coarse-grained simulations are in the rigid_protein folder, which pulls the atomistic coordinates from the other folder. All data is managed by signac and lives in the workspace directories, which contain various folders corresponding to different parameter combinations. The parameters associated with a given folder are stored in the signac_statepoint.json files within each subdirectory. The atomistic data uses experimentally determined protein structures as a starting point; all of these are stored in the ConfigFiles folder. The primary output is the topology files generated from the PDBs by GROMACS; these topologies are then used to parametrize the Monte Carlo simulations. In some cases, atomistic simulations were actually run as well, and the outputs are stored alongside the topology files. In the rigid_protein folder, the ConfigFiles folder contains MSMS, the software used to generate polyhedral representations of proteins from the PDBs in the candidate_structures folder. All of the actual polyhedral structures are also stored in the ConfigFiles folder. The actual simulation trajectories are stored as general simulation data (GSD) files within each subdirectory of the workspace, along with a single .pos file that contains the shape definition of the (nonconvex) polyhedron used to represent a protein. The logged quantities, such as energies and MC move sizes, are stored in .log files. The logic for the simulations in the candidate_structures project is in the Python scripts project.py, operations.py, and scripts/init.py. The rigid_protein folder also includes the notebooks directory, which contains Jupyter notebooks used to perform analyses, as well as the Python scripts used to actually perform the simulations and manage the data space. In particular, the project.py, operations.py and scripts/init.py scripts contain most of the logic associated with the simulations.
- Keyword:
- Protein assembly, Cryo TEM, Hierarchical Assembly, Monte Carlo simulation, and Coarse-grained simulation
- Discipline:
- Science and Engineering
-
- Creator:
- Thomaz, Andréa T. (UMICH), Carvalho, Tiago P. (UFRGS), Malabarba, Luiz R. (UFRGS), and Knowles, L. Lacey (UMICH)
- Description:
- Estimated phylogenetic relationships based on more than 18,000 loci in 93 individuals (full data) or 21 individuals (subset data) representing 19 described species and two putative undescribed species. Nine files are part of this dataset, including all input files to infer the phylogenetic reconstructions and the outputs obtained, in addition to a pruned tree used to infer the ancestral state reconstructions.
- Keyword:
- dusky millions poeciliids, sexual selection, South America, and ddRADseq
- Citation to related publication:
- Andréa T. Thomaz, Tiago P. Carvalho, Luiz R. Malabarba, L. Lacey Knowles, Geographic distributions, phenotypes, and phylogenetic relationships of Phalloceros (Cyprinodontiformes: Poeciliidae): insights about diversification among sympatric species pools, Molecular Phylogenetics and Evolution, 2018, ISSN 1055-7903, https://doi.org/10.1016/j.ympev.2018.12.008
- Discipline:
- Science
-
- Creator:
- Johnson, JE and Molnar, PH
- Description:
- This IF compilation was assembled from the existing literature to understand if preservation biases affected the record of iron formations.
- Keyword:
- Archean ocean chemistry, temporal record of iron formations, and early Earth iron cycle
- Citation to related publication:
- Johnson, J. E., & Molnar, P. H. ( 2019). Widespread and persistent deposition of iron formations for two billion years. Geophysical Research Letters, 46, 3327– 3339. https://doi.org/10.1029/2019GL081970
- Discipline:
- Science
-
- Creator:
- Bougher, Stephen W. (CLaSP Department, U. of Michigan) and Roeten, Kali J. (CLaSP Department, U. of Michigan)
- Description:
- The NASA MAVEN (Mars Atmosphere and Volatile Evolution) spacecraft, which is currently in orbit around Mars, has been taking monthly measurements of the speed and direction of the winds in the upper atmosphere of Mars between about 140 to 240 km above the surface. The observed wind speeds and directions change with time and location, and sometimes fluctuate quickly. These measurements are compared to simulations from a computer model of the Mars atmosphere called M-GITM (Mars Global Ionosphere-Thermosphere Model), developed at U. of Michigan. This is the first comparison between direct measurements of the winds in the upper atmosphere of Mars and simulated winds and is important because it can help to inform us what physical processes are acting on the observed winds. Some wind measurements have similar wind speeds or directions to those predicted by the M-GITM model, but sometimes, there are large differences between the simulated and measured winds. The disagreements between wind observations and model simulations suggest that processes other than normal solar forcing may become relatively more important during these observations and alter the expected circulation pattern. Since the global circulation plays a role in the structure, variability, and evolution of the atmosphere, understanding the processes that drive the winds in the upper atmosphere of Mars provides key context for understanding how the atmosphere behaves as a whole system. A basic version of the M-GITM code can be found on Github as follows: https:/github.com/dpawlows/MGITM and About 30 Neutral Gas and Ion Mass Spectrometer (NGIMS) wind campaigns (of 5 to 10 orbits each) have been conducted by the MAVEN team (Benna et al., 2019). Five of these campaigns are selected for detailed study (Roeten et al. 2019). The Mars conditions for these five campaigns have been used to launch corresponding M-GITM code simulations, yielding 3-D neutral wind fields for comparison to these NGIMS wind observations. The M-GITM datacubes used to extract the zonal and meridional neutral winds, along the trajectory of each orbit path between 140 and 240 km, are provided in this Deep Blue Data archive. README files are provided for each datacube, detailing the contents of each file. A general README file is also provided that summarizes the inputs and outputs of the M-GITM code simulations for this study.
- Keyword:
- Mars, MAVEN spacecraft, Mars thermosphere, and Mars global upper atmosphere winds
- Citation to related publication:
- Roeten, K. J., Bougher, S. W., Benna, M., Mahaffy, P. R., Lee, Y., Pawlowski, D., et al. (2019). MAVEN/NGIMS thermospheric neutral wind observations: Interpretation using the M‐GITM general circulation model. Journal of Geophysical Research: Planets, 124, 3283– 3303. https://doi.org/10.1029/2019JE005957
- Discipline:
- Science and Engineering
-
- Creator:
- Crisp, Dakota N., Saggio, Maria L., Scott, Jared, Stacey, William C., Nakatani, Mitsuyoshi, Gliske, Stephen V., and Lin, Jack
- Description:
- This data and scripts are meant to test and show seizure differentiation based on bifurcation theory. A zip file is included which contains real and simulated seizure waveforms, Matlab scripts, and metadata. The matlab scripts allow for visual review validation and objective feature analysis. The file “README.txt” provides more detail about each individual file within the zip file. and Data citation: Crisp, D.N., Saggio, M.L., Scott, J., Stacey, W.C., Nakatani, M., Gliske, S.F., Lin, J. (2019). Epidynamics: Navigating the map of seizure dynamics - Code & Data [Data set]. University of Michigan Deep Blue Data Repository. https://doi.org/10.7302/ejhy-5h41
- Keyword:
- Bifurcation, Epilepsy, Seizure, and Divergence
- Citation to related publication:
- Saggio, M.L., Crisp, D., Scott, J., Karoly, P.J., Kuhlmann, L., Nakatani, M., Murai, T., Dümpelmann, M., Schulze-Bonhage, A., Ikeda, A., Cook, M., Gliske, S.V., Lin, J., Bernard, C., Jirsa, V., Stacey, W., 2020. In pre-print. Epidynamics characterize and navigate the map of seizure dynamics. bioRxiv 2020.02.08.940072. https://doi.org/10.1101/2020.02.08.940072
- Discipline:
- Engineering, Science, and Health Sciences
-
- Creator:
- Crisp, Dakota N., Cheung, Warwick, Gliske, Stephen V., Lai, Alan, Freestone, Dean R., Grayden, David B., Cook, Mark J., and Stacey, William C.
- Description:
- The data and the scripts are to show that seizure onset dynamics and evoked responses change over the progression of epileptogenesis defined in this intrahippocampal tetanus toxin rat model. All tests explored in this study can be repeated with the data and scripts included in this repository. and Dataset citation: Crisp, D.N., Cheung, W., Gliske, S.V., Lai, A., Freestone, D.R., Grayden, D.B., Cook, MJ., Stacey, W.C. (2019). Epileptogenesis modulates spontaneous and responsive brain state dynamics [Data set]. University of Michigan Deep Blue Data Repository. https://doi.org/10.7302/r6vg-9658
- Keyword:
- evoked response, stimulation, bifurcation, epilepsy, seizure, divergence, and dynamics
- Citation to related publication:
- Crisp, D. N., Cheung, W., Gliske, S. V., Lai, A., Freestone, D. R., Grayden, D. B., Cook, M. J., & Stacey, W. C. (2020). Quantifying epileptogenesis in rats with spontaneous and responsive brain state dynamics. Brain Communications, 2(1). https://doi.org/10.1093/braincomms/fcaa048
- Discipline:
- Science, Engineering, and Health Sciences
-
- Creator:
- Nasser, Ahmad and Gumise, Wonder
- Description:
- The work on accelerating authenticated boot for embedded system resulted in designing an algorithm in python to perform the random address generation and cryptographic MAC calculation. The Sampled Boot schemes implemented in this package allow a significant reduction of the time needed to authenticate firmware images during startup, while still retaining a high degree of trust. This is particularly useful for automotive applications in which startup time constraints make secure boot a time prohibitive process. and Citation for this dataset: Nasser, A., Gumise, W. (2019). Authenticated Boot Acceleration Algorithm [Code and data]. University of Michigan Deep Blue Data Repository. https://doi.org/10.7302/yeh1-1x17
- Keyword:
- Trusted Computing, IOT security, Embedded Security, and Cyber Physical Systems
- Citation to related publication:
- Nasser, A., Gumise, W., and Ma, D., "Accelerated Secure Boot for Real-Time Embedded Safety Systems," SAE Int. J. Transp. Cyber. & Privacy 2(1) : 35-48, 2019, https://doi.org/10.4271/11-02-01-0003
- Discipline:
- Science
-
- Creator:
- Ruas, Terry, Ferreira, Charles H. P., Grosky, William, França, Fabrício O., and Medeiros, Débora M. R,
- Description:
- The relationship between words in a sentence often tell us more about the underlying semantic content of a document than its actual words, individually. Recent publications in the natural language processing arena, more specifically using word embeddings, try to incorporate semantic aspects into their word vector representation by considering the context of words and how they are distributed in a document collection. In this work, we propose two novel algorithms, called Flexible Lexical Chain II and Fixed Lexical Chain II that combine the semantic relations derived from lexical chains, prior knowledge from lexical databases, and the robustness of the distributional hypothesis in word embeddings into a single decoupled system. In short, our approach has three main contributions: (i) unsupervised techniques that fully integrate word embeddings and lexical chains; (ii) a more solid semantic representation that considers the latent relation between words in a document; and (iii) lightweight word embeddings models that can be extended to any natural language task. Knowledge-based systems that use natural language text can benefit from our approach to mitigate ambiguous semantic representations provided by traditional statistical approaches. The proposed techniques are tested against seven word embeddings algorithms using five different machine learning classifiers over six scenarios in the document classification task. Our results show that the integration between lexical chains and word embeddings representations sustain state-of-the-art results, even against more complex systems. Github: https://github.com/truas/LexicalChain_Builder
- Keyword:
- document classification, lexical chains, word embeddings, synset embeddings, chain2vec, and natural language processing
- Citation to related publication:
- Terry Ruas, Charles Henrique Porto Ferreira, William Grosky, Fabrício Olivetti de França, Débora Maria Rossi de Medeiros, "Enhanced word embeddings using multi-semantic representation through lexical chains", Information Sciences, 2020, https://doi.org/10.1016/j.ins.2020.04.048
- Discipline:
- Other, Science, and Engineering
-
- Creator:
- Johnson, Jena E., Webb, Samuel M., Condit, Cailey B., Beukes, Nicolas J., and Fischer, Woodward W.
- Description:
- Manganese in the sedimentary record has been interpreted by many as a powerful redox proxy for paleoenvironments, and yet very little work has been done to ensure that the manganese-rich minerals in the rock record are actually recording primary signals. In the accompanying manuscript, we present an in-depth characterization of the manganese mineralogy from two correlated regions recording the Transvaal Supergroup in South Africa with markedly different alteration histories to investigate if there can be post-depositional emplacement of manganese-rich minerals. The data uploaded here are X-ray absorption spectra of (1) manganese standard minerals that were useful in our analyses and (2) minerals from an important well-characterized sample that may be useful as comparative standards in future studies.
- Keyword:
- manganese and X-ray absorption spectroscopy
- Citation to related publication:
- J.E. Johnson, S.M. Webb, C.B. Condit, N.J. Beukes, W.W. Fischer; Effects of metamorphism and metasomatism on manganese mineralogy: Examples from the Transvaal Supergroup. South African Journal of Geology doi: https://doi.org/10.25131/sajg.122.0034
- Discipline:
- Science