Published May 31, 2022 | Version v1
Journal article Open

Genetic characterization of outbred Sprague Dawley rats and utility for genome-wide association studies

Description

Sprague Dawley (SD) rats are among the most widely used outbred laboratory rat populations. Despite this, the genetic characteristics of SD rats have not been clearly described, and SD rats are rarely used for experiments aimed at exploring genotype-phenotype relationships. In order to use SD rats to perform a genome-wide association study (GWAS), we collected behavioral data from 4,625 SD rats that were predominantly obtained from two commercial vendors, Charles River Laboratories and Harlan Sprague Dawley Inc. Using double-digest genotyping-by-sequencing (ddGBS), we obtained dense, high-quality genotypes at 291,438 SNPs across 4,061 rats. This genetic data allowed us to characterize the variation present in Charles River vs. Harlan SD rats. We found that the two populations are highly diverged (FST > 0.4). Furthermore, even for rats obtained from the same vendor, there was strong population structure across breeding facilities and even between rooms at the same facility. We performed multiple separate GWAS by fitting a linear mixed model that accounted for population structure and using meta-analysis to jointly analyze all cohorts. Our study examined Pavlovian conditioned approach (PavCA) behavior, which assesses the propensity for rats to attribute incentive salience to reward-associated cues. We identified 46 significant associations for the various metrics used to define PavCA. The surprising degree of population structure among SD rats from different sources has important implications for their use in both genetic and non-genetic studies.

Data availability

Data files too large to be included as supplemental content were uploaded to a central online repository through the UCSD Library System: https://doi.org/10.6075/J0XS5V8K. This DOI can be followed to access (1) a VCF file containing all raw SNP calls with dosage data in the original set of 4,625 Sprague Dawley rats, (2) filtered variants in BIM/BED/FAM format for each of the seven SD subgroups, and (3) summary files for the GWAS results for all 55 metric x day combinations for all analyses, individual subgroups and meta-analyses. In addition, FASTQ files from the original ddGBS and light WGS sequencing runs were submitted to the NCBI Sequence Read Archive (SRA) under BioProject identifier PRJNA779552 (https://www.ncbi.nlm.nih.gov/sra/PRJNA779552). Instructions on how to download the data can be found here: https://www.ncbi.nlm.nih.gov/sra/docs/sradownload/#download-metadata-associated-wit.

Files

journal.pgen.1010234.pdf

Files (27.8 MB)

Name Size Download all
Supporting information
md5:d01df2d35f2fc878dd92756afd291b6b
25.2 MB Preview Download
Article
md5:51448445c49c52eb4068eeef57a3693f
2.6 MB Preview Download

Additional details

Identifiers

DOI
10.1371/journal.pgen.1010234
Other
oai:uchicago.tind.io:6324

Funding

National Institute of Drug Addiction
R21 DA036672
National Institute of Drug Addiction
P50 DA037844
National Institute of Drug Addiction
F31 DA039638-02
National Institute of General Medical Sciences
T32 GM007197

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Human Genetics