Published September 28, 2023 | Version v1
Journal article Open

A new Bayesian factor analysis method improves detection of genes and biological processes affected by perturbations in single-cell CRISPR screening

  • 1. University of Chicago

Description

Clustered regularly interspaced short palindromic repeats (CRISPR) screening coupled with single-cell RNA sequencing has emerged as a powerful tool to characterize the effects of genetic perturbations on the whole transcriptome at a single-cell level. However, due to its sparsity and complex structure, analysis of single-cell CRISPR screening data is challenging. In particular, standard differential expression analysis methods are often underpowered to detect genes affected by CRISPR perturbations. We developed a statistical method for such data, called guided sparse factor analysis (GSFA). GSFA infers latent factors that represent coregulated genes or gene modules; by borrowing information from these factors, it infers the effects of genetic perturbations on individual genes. We demonstrated through extensive simulation studies that GSFA detects perturbation effects with much higher power than state-of-the-art methods. Using single-cell CRISPR data from human CD8+ T cells and neural progenitor cells, we showed that GSFA identified biologically relevant gene modules and specific genes affected by CRISPR perturbations, many of which were missed by existing methods, providing new insights into the functions of genes involved in T cell activation and neurodevelopment.

Data availability

Both CROP-seq datasets used in this study are publicly available and were downloaded from the GEO (accession nos. GSE119450 and GSE142078). Source data are provided with this paper.

The R package implementing the GSFA is freely available at https://github.com/xinhe-lab/GSFA. The source code used in our study is deposited at https://github.com/xinhe-lab/GSFA_paper.

Files

New-Bayesian-factor-analysis-method.pdf

Files (16.1 MB)

Name Size Download all
Article
md5:3d1f7512a98e38f7dd0ac991657b533b
4.5 MB Preview Download
md5:da67a732b38771e31a3854615ff03f5c
4.1 MB Preview Download
Supplemental information files
md5:1d8d8505c3182a1d0e73d81d5227dbf5
7.5 MB Preview Download

Additional details

Identifiers

DOI
10.1038/s41592-023-02017-4
Other
oai:uchicago.tind.io:8351

Funding

National Institutes of Health
R01MH110531
National Institutes of Health
R01HG010773
National Institutes of Health
R01MH116281
National Institutes of Health
R01 GM126553
National Institutes of Health
R01 HG011883
National Science Foundation
2016307
Sloan Research Fellowship

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Biophysical Sciences, Human Genetics, Medicine