Show simple item record

A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data

dc.contributor.authorSun, Shiquan
dc.contributor.authorChen, Yabo
dc.contributor.authorLiu, Yang
dc.contributor.authorShang, Xuequn
dc.date.accessioned2019-04-07T03:19:50Z
dc.date.available2019-04-07T03:19:50Z
dc.date.issued2019-04-05
dc.identifier.citationBMC Systems Biology. 2019 Apr 05;13(Suppl 2):28
dc.identifier.urihttps://doi.org/10.1186/s12918-019-0699-6
dc.identifier.urihttps://hdl.handle.net/2027.42/148526
dc.description.abstractAbstract Background Single-cell RNA sequencing (scRNAseq) data always involves various unwanted variables, which would be able to mask the true signal to identify cell-types. More efficient way of dealing with this issue is to extract low dimension information from high dimensional gene expression data to represent cell-type structure. In the past two years, several powerful matrix factorization tools were developed for scRNAseq data, such as NMF, ZIFA, pCMF and ZINB-WaVE. But the existing approaches either are unable to directly model the raw count of scRNAseq data or are really time-consuming when handling a large number of cells (e.g. n>500). Results In this paper, we developed a fast and efficient count-based matrix factorization method (single-cell negative binomial matrix factorization, scNBMF) based on the TensorFlow framework to infer the low dimensional structure of cell types. To make our method scalable, we conducted a series of experiments on three public scRNAseq data sets, brain, embryonic stem, and pancreatic islet. The experimental results show that scNBMF is more powerful to detect cell types and 10 - 100 folds faster than the scRNAseq bespoke tools. Conclusions In this paper, we proposed a fast and efficient count-based matrix factorization method, scNBMF, which is more powerful for detecting cell type purposes. A series of experiments were performed on three public scRNAseq data sets. The results show that scNBMF is a more powerful tool in large-scale scRNAseq data analysis. scNBMF was implemented in R and Python, and the source code are freely available at https://github.com/sqsun .
dc.titleA fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data
dc.typeArticleen_US
dc.description.bitstreamurlhttps://deepblue.lib.umich.edu/bitstream/2027.42/148526/1/12918_2019_Article_699.pdf
dc.language.rfc3066en
dc.rights.holderThe Author(s)
dc.date.updated2019-04-07T03:19:51Z
dc.owningcollnameInterdisciplinary and Peer-Reviewed


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.