Published June 24, 2022 | Version v1
Journal article Open

Inferring couplings in networks across order-disorder phase transitions

  • 1. City University of New York
  • 2. University of Chicago
  • 3. Institute of Science and Technology Austria

Description

Statistical inference is central to many scientific endeavors, yet how it works remains unresolved. Answering this requires a quantitative understanding of the intrinsic interplay between statistical models, inference methods, and the structure in the data. To this end, we characterize the efficacy of direct coupling analysis (DCA)—a highly successful method for analyzing amino acid sequence data—in inferring pairwise interactions from samples of ferromagnetic Ising models on random graphs. Our approach allows for physically motivated exploration of qualitatively distinct data regimes separated by phase transitions. We show that inference quality depends strongly on the nature of data-generating distributions: optimal accuracy occurs at an intermediate temperature where the detrimental effects from macroscopic order and thermal noise are minimal. Importantly our results indicate that DCA does not always outperform its local-statistics-based predecessors; while DCA excels at low temperatures, it becomes inferior to simple correlation thresholding at virtually all temperatures when data are limited. Our findings offer insights into the regime in which DCA operates so successfully, and more broadly, how inference interacts with the structure in the data.

Files

PhysRevResearch.4.023240.pdf

Files (1.4 MB)

Name Size Download all
md5:ed6fdc2a3a096df785fa5f7b17b716c6
1.4 MB Preview Download

Additional details

Identifiers

DOI
10.1103/physrevresearch.4.023240
Other
oai:uchicago.tind.io:11690

Funding

National Science Foundation
PHY-1734030
National Institutes of Health
R01EB026943

UChicago Information

Division(s)
Biological Sciences Division, Physical Sciences Division
Department(s)
Organismal Biology and Anatomy, Physics