Published September 28, 2023
| Version v1
Journal article
Open
A new Bayesian factor analysis method improves detection of genes and biological processes affected by perturbations in single-cell CRISPR screening
Description
Clustered regularly interspaced short palindromic repeats (CRISPR) screening coupled with single-cell RNA sequencing has emerged as a powerful tool to characterize the effects of genetic perturbations on the whole transcriptome at a single-cell level. However, due to its sparsity and complex structure, analysis of single-cell CRISPR screening data is challenging. In particular, standard differential expression analysis methods are often underpowered to detect genes affected by CRISPR perturbations. We developed a statistical method for such data, called guided sparse factor analysis (GSFA). GSFA infers latent factors that represent coregulated genes or gene modules; by borrowing information from these factors, it infers the effects of genetic perturbations on individual genes. We demonstrated through extensive simulation studies that GSFA detects perturbation effects with much higher power than state-of-the-art methods. Using single-cell CRISPR data from human CD8+ T cells and neural progenitor cells, we showed that GSFA identified biologically relevant gene modules and specific genes affected by CRISPR perturbations, many of which were missed by existing methods, providing new insights into the functions of genes involved in T cell activation and neurodevelopment.
Data availability
Both CROP-seq datasets used in this study are publicly available and were downloaded from the GEO (accession nos. GSE119450 and GSE142078). Source data are provided with this paper.
The R package implementing the GSFA is freely available at https://github.com/xinhe-lab/GSFA. The source code used in our study is deposited at https://github.com/xinhe-lab/GSFA_paper.
Files
New-Bayesian-factor-analysis-method.pdf
Additional details
Identifiers
- DOI
- 10.1038/s41592-023-02017-4
- Other
- oai:uchicago.tind.io:8351
Funding
- National Institutes of Health
- R01MH110531
- National Institutes of Health
- R01HG010773
- National Institutes of Health
- R01MH116281
- National Institutes of Health
- R01 GM126553
- National Institutes of Health
- R01 HG011883
- National Science Foundation
- 2016307
- Sloan Research Fellowship