Joint between-sample normalization and differential expression detection through ℓ0-regularized regression

Liu, Kefei; Shen, Li; Jiang, Hui

Joint between-sample normalization and differential expression detection through ℓ0-regularized regression

dc.contributor.author	Liu, Kefei
dc.contributor.author	Shen, Li
dc.contributor.author	Jiang, Hui
dc.date.accessioned	2022-08-10T17:58:40Z
dc.date.available	2022-08-10T17:58:40Z
dc.date.issued	2019-12-02
dc.identifier.citation	BMC Bioinformatics. 2019 Dec 02;20(Suppl 16):593
dc.identifier.uri	https://doi.org/10.1186/s12859-019-3070-4
dc.identifier.uri	https://hdl.handle.net/2027.42/173427	en
dc.description.abstract	Abstract Background A fundamental problem in RNA-seq data analysis is to identify genes or exons that are differentially expressed with varying experimental conditions based on the read counts. The relativeness of RNA-seq measurements makes the between-sample normalization of read counts an essential step in differential expression (DE) analysis. In most existing methods, the normalization step is performed prior to the DE analysis. Recently, Jiang and Zhan proposed a statistical method which introduces sample-specific normalization parameters into a joint model, which allows for simultaneous normalization and differential expression analysis from log-transformed RNA-seq data. Furthermore, an ℓ0 penalty is used to yield a sparse solution which selects a subset of DE genes. The experimental conditions are restricted to be categorical in their work. Results In this paper, we generalize Jiang and Zhan’s method to handle experimental conditions that are measured in continuous variables. As a result, genes with expression levels associated with a single or multiple covariates can be detected. As the problem being high-dimensional, non-differentiable and non-convex, we develop an efficient algorithm for model fitting. Conclusions Experiments on synthetic data demonstrate that the proposed method outperforms existing methods in terms of detection accuracy when a large fraction of genes are differentially expressed in an asymmetric manner, and the performance gain becomes more substantial for larger sample sizes. We also apply our method to a real prostate cancer RNA-seq dataset to identify genes associated with pre-operative prostate-specific antigen (PSA) levels in patients.
dc.title	Joint between-sample normalization and differential expression detection through ℓ0-regularized regression
dc.type	Journal Article
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/173427/1/12859_2019_Article_3070.pdf
dc.identifier.doi	https://dx.doi.org/10.7302/5158
dc.language.rfc3066	en
dc.rights.holder	The Author(s)
dc.date.updated	2022-08-10T17:58:39Z
dc.owningcollname	Interdisciplinary and Peer-Reviewed

Files in this item

Name:: 12859_2019_Article_3070.pdf
Size:: 1.630MB
Format:: PDF

View/Open

Interdisciplinary and Peer-Reviewed

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.