Published October 3, 2016 | Version v1
Journal article Open

CERAMIC: Case-Control Association Testing in Samples with Related Individuals, Based on Retrospective Mixed Model Analysis with Adjustment for Covariates

  • 1. University of Chicago

Description

We consider the problem of genetic association testing of a binary trait in a sample that contains related individuals, where we adjust for relevant covariates and allow for missing data. We propose CERAMIC, an estimating equation approach that can be viewed as a hybrid of logistic regression and linear mixed-effects model (LMM) approaches. CERAMIC extends the recently proposed CARAT method to allow samples with related individuals and to incorporate partially missing data. In simulations, we show that CERAMIC outperforms existing LMM and generalized LMM approaches, maintaining high power and correct type 1 error across a wider range of scenarios. CERAMIC results in a particularly large power increase over existing methods when the sample includes related individuals with some missing data (e.g., when some individuals with phenotype and covariate information have missing genotype), because CERAMIC is able to make use of the relationship information to incorporate partially missing data in the analysis while correcting for dependence. Because CERAMIC is based on a retrospective analysis, it is robust to misspecification of the phenotype model, resulting in better control of type 1 error and higher power than that of prospective methods, such as GMMAT, when the phenotype model is misspecified. CERAMIC is computationally efficient for genomewide analysis in samples of related individuals of almost any configuration, including small families, unrelated individuals and even large, complex pedigrees. We apply CERAMIC to data on type 2 diabetes (T2D) from the Framingham Heart Study. In a genome scan, 9 of the 10 smallest CERAMIC p-values occur in or near either known T2D susceptibility loci or plausible candidates, verifying that CERAMIC is able to home in on the important loci in a genome scan.

Data availability

All Framingham Heart Study files are available from the dbGaP database (accession number phs000007.v25.p9).

Files

journal.pgen.1006329.pdf

Files (4.0 MB)

Name Size Download all
Article
md5:f3c5eb00c9296d3e7aed5b3b74cb4ebc
3.4 MB Preview Download
md5:c4f413d032e89c04a31514380c7d2fd6
661.4 kB Preview Download

Additional details

Identifiers

DOI
10.1371/journal.pgen.1006329
Other
oai:uchicago.tind.io:6752

Funding

National Institutes of Health
R01 HG001645
National Heart, Lung, and Blood Institute
N01-HC-25195
National Heart, Lung, and Blood Institute
N02-HL-64278

UChicago Information

Division(s)
Physical Sciences Division
Department(s)
Statistics