Work Description
Title: Database of PWM scores across the D. melanogaster genome Open Access Deposited
Attribute | Value |
---|---|
Methodology |
|
Description |
|
Creator | |
Depositor |
|
Contact information | |
Discipline | |
ORSP grant number |
|
Keyword | |
Date coverage |
|
Citations to related material |
|
Resource type | |
Last modified |
|
Published |
|
DOI |
|
License |
(2022). Database of PWM scores across the D. melanogaster genome [Data set], University of Michigan - Deep Blue Data. https://doi.org/10.7302/yb9e-aw67
Relationships
- This work is not a member of any user collections.
Files (Count: 2; Size: 978 MB)
Thumbnailthumbnail-column | Title | Original Upload | Last Modified | File Size | Access | Actions |
---|---|---|---|---|---|---|
|
README | 2019-02-05 | 2021-08-11 | 1.24 KB | Open Access |
|
![]() |
tf_binding_database.tbz | 2019-02-06 | 2021-03-16 | 978 MB | Open Access |
|
The included file contains information on all predicted binding sites for annotated transcription factors in the D. melanogaster genome. Detailed methods are provided in https://doi.org/10.1101/516500
In brief, position weight matrices (PWMs) were downloaded from the CIS-BP database or constructed from ChIP experiments in the modENCODE database, and scanned against the genome using the FIMO program from the MEME software suite. Scores against all possible genomic positions were obtained, and then normalized via calculation of robust z-scores.
The data are provided as a GNU tar archive that has been compressed using the bzip2 program. A separate .dat file is present for each chromosome (as indicated by the chr*** portion of the file name). Each file consists of a tab-delimited table with the following columns:
TF_name -- the protein corresponding to the PWM being checked
chr -- the name of the chromosome of a given binding site
start -- the starting position of a particular binding site
end -- the ending location of a particular binding site
rzscore -- the robust z-score for the match
Only sites with robust z scores of at least 2.3364 (corresponding to roughly the 99th percentile of a standard normal distribution) are included in the table.