Published April 20, 2020 | Version v1
Journal article Open

Characterizing and inferring quantitative cell cycle phase in single-cell RNA-seq data analysis

Description

Cellular heterogeneity in gene expression is driven by cellular processes, such as cell cycle and cell-type identity, and cellular environment such as spatial location. The cell cycle, in particular, is thought to be a key driver of cell-to-cell heterogeneity in gene expression, even in otherwise homogeneous cell populations. Recent advances in single-cell RNA-sequencing (scRNA-seq) facilitate detailed characterization of gene expression heterogeneity and can thus shed new light on the processes driving heterogeneity. Here, we combined fluorescence imaging with scRNA-seq to measure cell cycle phase and gene expression levels in human induced pluripotent stem cells (iPSCs). By using these data, we developed a novel approach to characterize cell cycle progression. Although standard methods assign cells to discrete cell cycle stages, our method goes beyond this and quantifies cell cycle progression on a continuum. We found that, on average, scRNA-seq data from only five genes predicted a cell's position on the cell cycle continuum to within 14% of the entire cycle and that using more genes did not improve this accuracy. Our data and predictor of cell cycle phase can directly help future studies to account for cell cycle-related heterogeneity in iPSCs. Our results and methods also provide a foundation for future work to characterize the effects of the cell cycle on expression heterogeneity in other cell types.

Data availability

All rawand processed sequencing data generated in this study have been submitted to the NCBI Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/geo/) (Edgar et al. 2002) under accession number GSE121265. We also make the processed data available at GitHub (https://github.com/jhsiao999/peco-paper) and https://giladlab.uchicago.edu/wp-content/uploads/2019/02/ Hsiao_et_al_2019.tar.gz. All analysis results, scripts, and data required to reproduce this work are available at https://gilad.com/ jhsiao999/peco-paper as well as in the Supplemental Code (Supplemental File S3). The source code is available in an R/Bioconductor package peco (the development version of peco is available at https://github.com/jhsiao999/peco).

Files

Characterizing-and-inferring-quantitative-cell-cycle-phase-in-single-cell-RNA-seq-data-analysis.pdf

Files (34.7 MB)

Name Size Download all
Article
md5:54c0ec6ea586016b4b9a04ce07092df5
3.6 MB Preview Download
Supplemental material file 1
md5:da03e9e0d141cff2d294552e74ce6c78
29.5 kB Download
Supplemental material file 2
md5:4cadb907f4ac891c9fc7986bf5b4cbb1
1.1 MB Download
Supplemental material file 3
md5:9837a626455ace9f1566cd25cab330ed
8.1 MB Preview Download
Supplemental material file 4
md5:40a86f9c92c859dd077c8948fefb4363
21.9 MB Download

Additional details

Identifiers

DOI
10.1101/gr.247759.118
Other
oai:uchicago.tind.io:5692

Funding

National Institutes of Health
HG002585
National Institutes of Health
GM122930

UChicago Information

Division(s)
Biological Sciences Division, Physical Sciences Division
Department(s)
Human Genetics, Medicine, Statistics