Published July 22, 2025 | Version v1
Journal article Open

A novel expectation-maximization approach to infer general diploid selection from time-series genetic data

  • 1. University of Chicago

Description

Detecting and quantifying the strength of selection is a major objective in population genetics. Since selection acts over multiple generations, many approaches have been developed to detect and quantify selection using genetic data sampled at multiple points in time. Such time-series genetic data is commonly analyzed using Hidden Markov Models, but in most cases, under the assumption of additive selection. However, many examples of genetic variation exhibiting non-additive mechanisms exist, making it critical to develop methods that can characterize selection in more general scenarios. Here, we extend a previously introduced expectation-maximization algorithm for the inference of additive selection coefficients to the case of general diploid selection, in which the heterozygote and homozygote fitness are parameterized independently. We furthermore introduce a framework to identify bespoke modes of diploid selection from given data, a heuristic to account for variable population size, and a procedure for aggregating data across linked loci to increase power and robustness. Using extensive simulation studies, we find that our method accurately and efficiently estimates selection coefficients for different modes of diploid selection across a wide range of scenarios; however, power to classify the mode of selection is low unless selection is very strong. We apply our method to ancient DNA samples from Great Britain in the last 4,450 years and detect evidence for selection in six genomic regions, including the well-characterized LCT locus. Our work is the first genome-wide scan characterizing signals of general diploid selection.

Data availability

All code to generate the results is available at https://github.com/steinrue/EMSel. The ancient DNA data for the analysis was downloaded from the Allen Ancient DNA Resource (AADR) at https://doi.org/10.7910/DVN/FFIDCW (Version 7.0).

Files

journal.pgen.1011769.pdf

Files (10.8 MB)

Name Size Download all
Article
md5:c868f7b575e0766986d9fca53949afa8
7.1 MB Preview Download
Supporting information
md5:36cfeae2a3b28063204545754d069a91
3.7 MB Preview Download

Additional details

Identifiers

DOI
10.1371/journal.pgen.1011769
Other
oai:uchicago.tind.io:15892

Funding

Department of Education
Graduate Assistance in Areas of National Need
National Institute of General Medical Sciences
R01GM146051

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Biophysical Sciences, Ecology and Evolution, Human Genetics