Published July 1, 2025
| Version v1
Journal article
Open
A machine learning model using clinical notes to identify physician fatigue
Creators
- 1. University of Chicago
- 2. University of California, Berkeley
Description
Clinical notes should capture important information from a physician-patient encounter, but they may also contain signals indicative of physician fatigue. Using data from 129,228 emergency department (ED) visits, we train a model to identify notes written by physicians who are likely to be tired: those who worked ED shifts on at least 5 of the prior 7 days. In a hold-out set, the model accurately identifies notes written by such high-workload physicians. It also flags notes written in other settings with high fatigue: overnight shifts and high patient volumes. When the model identifies signs of fatigue in a note, physician decision-making for that patient appears worse: yield of testing for heart attack is 19% lower with each standard deviation increase in model-predicted fatigue. A key feature of notes written by fatigued doctors is the predictability of the next word, given the preceding context. Perhaps unsurprisingly, because word prediction is the core of how large language models (LLMs) work, we find that predicted fatigue of LLM-written notes is 74% higher than that of physician-written ones, highlighting the possibility that LLMs may introduce distortions in generated text that are not yet fully understood.
Data availability
Data supporting the findings of this study are available in the article and its Supplementary information. Source data are provided as Source Data file and may be obtained from the corresponding authors upon request. The data used for the primary analysis consist of individual patient records, including free-text physician notes, which are challenging to fully deidentify. As a result, the IRB did not approve public data sharing. External validation was performed using the publicly available MIMIC-III dataset (https://physionet.org/content/mimiciii/1.4/). Source data are provided with this paper.
Code that supports the main findings of this study are available on GitHub: https://github.com/ChicagoHAI/physician-fatigue.
Files
Machine-learning-model-using-clinical-notes-to-identify-physician-fatigue.pdf
Additional details
Identifiers
- DOI
- 10.1038/s41467-025-60865-4
- Other
- oai:uchicago.tind.io:15613
Funding
- Unknown funder
- IIS-2126602
- Unknown funder
- Sloan Research Fellowship