Published September 28, 2017 | Version v1
Journal article Open

Trans-ethnic predicted expression genome-wide association analysis identifies a gene for estrogen receptor-negative breast cancer

Description

Genome-wide association studies (GWAS) have identified more than 90 susceptibility loci for breast cancer, but the underlying biology of those associations needs to be further elucidated. More genetic factors for breast cancer are yet to be identified but sample size constraints preclude the identification of individual genetic variants with weak effects using traditional GWAS methods. To address this challenge, we utilized a gene-level expression-based method, implemented in the MetaXcan software, to predict gene expression levels for 11,536 genes using expression quantitative trait loci and examine the genetically-predicted expression of specific genes for association with overall breast cancer risk and estrogen receptor (ER)-negative breast cancer risk. Using GWAS datasets from a Challenge launched by National Cancer Institute, we identified TP53INP2 (tumor protein p53-inducible nuclear protein 2) at 20q11.22 to be significantly associated with ER-negative breast cancer (Z = -5.013, p = 5.35×10−7, Bonferroni threshold = 4.33×10−6). The association was consistent across four GWAS datasets, representing European, African and Asian ancestry populations. There are 6 single nucleotide polymorphisms (SNPs) included in the prediction of TP53INP2 expression and five of them were associated with estrogen-receptor negative breast cancer, although none of the SNP-level associations reached genome-wide significance. We conducted a replication study using a dataset outside of the Challenge, and found the association between TP53INP2 and ER-negative breast cancer was significant (p = 5.07x10-3). Expression of HP (16q22.2) showed a suggestive association with ER-negative breast cancer in the discovery phase (Z = 4.30, p = 1.70x10-5) although the association was not significant after Bonferroni adjustment. Of the 249 genes that are 250 kb within known breast cancer susceptibility loci identified from previous GWAS, 20 genes (8.0%) were statistically significant associated with ER-negative breast cancer (p<0.05), compared to 582 (5.2%) of 11,287 genes that are not close to previous GWAS loci. This study demonstrated that expression-based gene mapping is a promising approach for identifying cancer susceptibility genes.

Data availability

Data are available upon request from dbGaP to researchers who meet the criteria for access to data (accession numbers: phs000851.v1.p1, phs000812.v1.p1, phs000383.v1.p1, phs000147.v3.p1, and phs000799.v1.p1). African American Breast Cancer Consortium (AABC) data was provided via dbGaP project phs000851.v1.p1: https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000851.v1.p1 The BPC3 data was provided via dbGaP project phs000812.v1.p1: https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000812.v1.p1 Cancer Genetic Markers of Susceptibility Breast Cancer (CGEMS) data was provided via dbGaP project phs000147.v3.p1: https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000147.v3.p1 Genome-Wide Association Study of Breast Cancer in the African Diaspora - the ROOT study was provided via dbGaP project phs000383.v1.p1: https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000383.v1.p1 Shanghai Breast Cancer Genetic Study (SBCGS) data was provided via dbGaP project phs000799.v1.p1: https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000799.v1.p1

Files

journal.pgen.1006727.pdf

Files (3.0 MB)

Name Size Download all
Article
md5:4af6c319d7be3814fa8f2740c483ca7a
2.9 MB Preview Download
md5:f809e5b48a4ab68dd9464e49d53c84c9
109.2 kB Preview Download

Additional details

Identifiers

DOI
10.1371/journal.pgen.1006727
Other
oai:uchicago.tind.io:6589

Funding

National Cancer Institute
P50-CA125183
National Cancer Institute
R01-CA89085
National Cancer Institute
U01-CA161032
National Institute of Mental Health
R01-MH107666
National Institute of Diabetes and Digestive and Kidney Diseases
P30 DK020595
American Cancer Society
MRSG-13-063-01-TBG
American Cancer Society
CRP-10-119-01-CCE
National Institutes of Health
R01 MH107666
National Institutes of Health
P30 DK020595

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Human Genetics, Medicine, Public Health Sciences