Published May 13, 2023 | Version v1
Journal article Open

Immune gene variation associated with chromosome-scale differences among individual zebrafish genomes

  • 1. University of Chicago

Description

Immune genes have evolved to maintain exceptional diversity, offering robust defense against pathogens. We performed genomic assembly to examine immune gene variation in zebrafish. Gene pathway analysis identified immune genes as significantly enriched among genes with evidence of positive selection. A large subset of genes was absent from analysis of coding sequences due to apparent lack of reads, prompting us to examine genes overlapping zero coverage regions (ZCRs), defined as 2 kb stretches without mapped reads. Immune genes were identified as highly enriched within ZCRs, including over 60% of major histocompatibility complex (MHC) genes and NOD-like receptor (NLR) genes, mediators of direct and indirect pathogen recognition. This variation was most highly concentrated throughout one arm of chromosome 4 carrying a large cluster of NLR genes, associated with large-scale structural variation covering more than half of the chromosome. Our genomic assemblies uncovered alternative haplotypes and distinct complements of immune genes among individual zebrafish, including the MHC Class II locus on chromosome 8 and the NLR gene cluster on chromosome 4. While previous studies have shown marked variation in NLR genes between vertebrate species, our study highlights extensive variation in NLR gene regions between individuals of the same species. Taken together, these findings provide evidence of immune gene variation on a scale previously unknown in other vertebrate species and raise questions about potential impact on immune function.
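The zero coverage region (ZCR) definition above can be illustrated with a minimal sketch. This is not the authors' pipeline. It assumes per-base read depth for one chromosome is already available as a sequence of integers, and it reads "2 kb stretches without mapped reads" as runs of zero depth at least 2,000 bases long. The function names `zero_coverage_regions` and `overlaps`, and the `min_len` parameter, are hypothetical.

```python
from typing import Iterable, List, Tuple

Interval = Tuple[int, int]  # half-open, 0-based: (start, end)


def zero_coverage_regions(depth: Iterable[int], min_len: int = 2000) -> List[Interval]:
    """Return intervals where depth is 0 for at least min_len consecutive bases.

    Assumption: a ZCR is any run of zero depth that is >= min_len long.
    """
    regions: List[Interval] = []
    run_start = None  # start of the current zero-depth run, if any
    pos = -1          # stays -1 if depth is empty
    for pos, d in enumerate(depth):
        if d == 0:
            if run_start is None:
                run_start = pos
        else:
            # A covered base ends the current run; keep it if long enough.
            if run_start is not None and pos - run_start >= min_len:
                regions.append((run_start, pos))
            run_start = None
    # Close a run that extends to the end of the sequence.
    end = pos + 1
    if run_start is not None and end - run_start >= min_len:
        regions.append((run_start, end))
    return regions


def overlaps(a: Interval, b: Interval) -> bool:
    """True if two half-open intervals share at least one base."""
    return a[0] < b[1] and b[0] < a[1]


# Toy depth track: one 2,000-base gap, one 1,999-base gap, one 2,500-base gap.
depth = [5] * 100 + [0] * 2000 + [3] * 50 + [0] * 1999 + [1] + [0] * 2500
zcrs = zero_coverage_regions(depth)

# Hypothetical gene coordinates, used only to show the overlap test.
gene = (2050, 2300)
gene_in_zcr = any(overlaps(z, gene) for z in zcrs)
```

In this toy track the 1,999-base gap falls below the threshold and is not reported, while the gene at 2050-2300 counts as overlapping a ZCR because its first 50 bases lie inside the first gap.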

Data availability

The datasets generated and/or analyzed during the current study are included in this published article (and its supplementary information files) or are available from the following repositories. Genomic assembly data generated in this study have been submitted to the NCBI BioProject database (https://www.ncbi.nlm.nih.gov/bioproject/) under accession numbers PRJNA292113, LKPD02000000 (CG2); PRJNA454110, JALCZS000000000 (CG1); and PRJNA454111, JALCZT000000000 (AB3). Raw sequence data have been deposited in the NCBI Sequence Read Archive (SRA) with accession numbers SRR7080552, SRR7081528, and SRR7081557. Supplemental data files, including bigWig and BED files, have been published in the CyVerse Data Commons under https://de.cyverse.org/data/ds/iplant/home/shared/commons_repo/curated/McConnell_ZeroCoverageRegions_2022.

Files

Immune-gene-variation.pdf

Files (23.0 MB)

Name Size
Article
md5:7b43726dbf3a3951190a101ca39f3f3c
22.0 MB
md5:8f71b526069507ebefd30c3da05e41a2
941.1 kB

Additional details

Identifiers

DOI
10.1038/s41598-023-34467-3
Other
oai:uchicago.tind.io:5931

Funding

Chicago Community Trust
Searle Funds
University of Chicago
Cancer Research Foundation Auxiliary Board
University of Chicago Medicine Comprehensive Cancer Center
Support Grant
Unknown funder
Chicago Biomedical Consortium Postdoctoral Research Grant

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Medicine, Pediatrics
Center(s) or Institute(s)
Center for Research Informatics, Center for Translational Data Science