About this item:

48 Views | 21 Downloads

Author Notes:

Peng Qiu, Email: peng.qiu@bme.gatech.edu

P.Q. designed the research, performed the research, developed the algorithm, analyzed the data, and wrote the paper.

The author gratefully acknowledges funding from the Chan Zuckerberg Initiative, the Helmsley Charitable Trust, and the National Science Foundation (CCF1552784). P.Q. is an ISAC Marylou Ingram Scholar and a Carol Ann and David D. Flanagan Faculty Fellow.

The author declares no competing interests.

Subject:

Keywords:

  • Biotechnology
  • Computational biology and bioinformatics

Embracing the dropouts in single-cell RNA-seq analysis

Tools:

Journal Title:

Nature Communications

Volume:

Volume 11

Publisher:

Type of Work:

Article | Final Publisher PDF

Abstract:

One primary reason that makes single-cell RNA-seq analysis challenging is dropouts, where the data only captures a small fraction of the transcriptome of each cell. Almost all computational algorithms developed for single-cell RNA-seq adopted gene selection, dimension reduction or imputation to address the dropouts. Here, an opposite view is explored. Instead of treating dropouts as a problem to be fixed, we embrace it as a useful signal. We represent the dropout pattern by binarizing single-cell RNA-seq count data, and present a co-occurrence clustering algorithm to cluster cells based on the dropout pattern. We demonstrate in multiple published datasets that the binary dropout pattern is as informative as the quantitative expression of highly variable genes for the purpose of identifying cell types. We expect that recognizing the utility of dropouts provides an alternative direction for developing computational algorithms for single-cell RNA-seq analysis.

Copyright information:

© The Author(s) 2020

This is an Open Access work distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/rdf).
Export to EndNote