Published April 20, 2020
| Version v1
Journal article
Open
Characterizing and inferring quantitative cell cycle phase in single-cell RNA-seq data analysis
Creators
- 1. University of Chicago
- 2. Harvard University
Description
Cellular heterogeneity in gene expression is driven by cellular processes, such as cell cycle and cell-type identity, and cellular environment such as spatial location. The cell cycle, in particular, is thought to be a key driver of cell-to-cell heterogeneity in gene expression, even in otherwise homogeneous cell populations. Recent advances in single-cell RNA-sequencing (scRNA-seq) facilitate detailed characterization of gene expression heterogeneity and can thus shed new light on the processes driving heterogeneity. Here, we combined fluorescence imaging with scRNA-seq to measure cell cycle phase and gene expression levels in human induced pluripotent stem cells (iPSCs). By using these data, we developed a novel approach to characterize cell cycle progression. Although standard methods assign cells to discrete cell cycle stages, our method goes beyond this and quantifies cell cycle progression on a continuum. We found that, on average, scRNA-seq data from only five genes predicted a cell's position on the cell cycle continuum to within 14% of the entire cycle and that using more genes did not improve this accuracy. Our data and predictor of cell cycle phase can directly help future studies to account for cell cycle-related heterogeneity in iPSCs. Our results and methods also provide a foundation for future work to characterize the effects of the cell cycle on expression heterogeneity in other cell types.
Data availability
All rawand processed sequencing data generated in this study have been submitted to the NCBI Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/geo/) (Edgar et al. 2002) under accession number GSE121265. We also make the processed data available at GitHub (https://github.com/jhsiao999/peco-paper) and https://giladlab.uchicago.edu/wp-content/uploads/2019/02/ Hsiao_et_al_2019.tar.gz. All analysis results, scripts, and data required to reproduce this work are available at https://gilad.com/ jhsiao999/peco-paper as well as in the Supplemental Code (Supplemental File S3). The source code is available in an R/Bioconductor package peco (the development version of peco is available at https://github.com/jhsiao999/peco).Files
Characterizing-and-inferring-quantitative-cell-cycle-phase-in-single-cell-RNA-seq-data-analysis.pdf
Files
(34.7 MB)
| Name | Size | Download all |
|---|---|---|
|
Article md5:54c0ec6ea586016b4b9a04ce07092df5 |
3.6 MB | Preview Download |
|
Supplemental material file 1 md5:da03e9e0d141cff2d294552e74ce6c78 |
29.5 kB | Download |
|
Supplemental material file 2 md5:4cadb907f4ac891c9fc7986bf5b4cbb1 |
1.1 MB | Download |
|
Supplemental material file 3 md5:9837a626455ace9f1566cd25cab330ed |
8.1 MB | Preview Download |
|
Supplemental material file 4 md5:40a86f9c92c859dd077c8948fefb4363 |
21.9 MB | Download |
Additional details
Identifiers
- DOI
- 10.1101/gr.247759.118
- Other
- oai:uchicago.tind.io:5692
Funding
- National Institutes of Health
- HG002585
- National Institutes of Health
- GM122930