Published May 20, 2025 | Version v1
Journal article Open

Can AI weather models predict out-of-distribution gray swan tropical cyclones?

  • 1. University of Chicago
  • 2. University of California, Santa Cruz
  • 3. New York University

Description

Predicting gray swan weather extremes, which are possible but so rare that they are absent from the training dataset, is a major concern for AI weather models and long-term climate emulators. An important open question is whether AI models can extrapolate from weaker weather events present in the training set to stronger, unseen weather extremes. To test this, we train independent versions of the AI weather model FourCastNet on the 1979–2015 ERA5 dataset with all data, or with Category 3–5 tropical cyclones (TCs) removed, either globally or only over the North Atlantic or Western Pacific basin. We then test these versions of FourCastNet on 2018–2023 Category 5 TCs (gray swans). All versions yield similar accuracy for global weather, but the one trained without Category 3–5 TCs cannot accurately forecast Category 5 TCs, indicating that these models cannot extrapolate from weaker storms. The versions trained without Category 3–5 TCs in one basin show some skill forecasting Category 5 TCs in that basin, suggesting that FourCastNet can generalize across tropical basins. This is encouraging and surprising because regional information is implicitly encoded in inputs. Given that current state-of-the-art AI weather and climate models have similar learning strategies, we expect our findings to apply to other models. Other types of weather extremes need to be similarly investigated. Our work demonstrates that novel learning strategies are needed for AI models to reliably provide early warning or estimated statistics for the rarest, most impactful TCs, and, possibly, other weather extremes.

Data availability

Some study data are available. Due to size limitations, some data cannot be uploaded to the repository. But we will share all our codes and provide all the details required for the readers to reproduce all the data used in this study themselves. Here are more details: We use the original FourCastNet with modifications for our customized training sets. These codes are publicly available at https://github.com/envfluids/FourCastNet (68). The necessary data to reproduce the results, including the weights of the 25 trained models and indices of dates that are removed in each training dataset, can be found on Zenodo at https://zenodo.org/uploads/13835657 (69) and https://zenodo.org/uploads/13834149 (70).

Files

sun-et-al-can-ai-weather-models-predict-out-of-distribution-gray-swan-tropical-cyclones.pdf

Files (8.2 MB)

Name Size Download all
Supporting information
md5:823532f810f9c08c6497917847e4b06f
3.0 MB Preview Download
Article
md5:e328921add029721e2b1f961ecb7ebb5
5.2 MB Preview Download

Additional details

Identifiers

DOI
10.1073/pnas.2420914122
Other
oai:uchicago.tind.io:15273

Funding

ONR
N000142012722
ARO
W911NF-22-2- 0124
National Science Foundation
AGS-2046309
National Science Foundation
ATM170020

UChicago Information

Division(s)
Physical Sciences Division
Department(s)
Computational and Applied Mathematics, Geophysical Sciences