A machine learning model using clinical notes to identify physician fatigue

Hsu, Chao-Chun; Obermeyer, Ziad; Tan, Chenhao

doi:10.6082/exh2q-heb07

Published July 1, 2025 | Version v1

Journal article Open

A machine learning model using clinical notes to identify physician fatigue

1. University of Chicago
2. University of California, Berkeley

Clinical notes should capture important information from a physician-patient encounter, but they may also contain signals indicative of physician fatigue. Using data from 129,228 emergency department (ED) visits, we train a model to identify notes written by physicians who are likely to be tired: those who worked ED shifts on at least 5 of the prior 7 days. In a hold-out set, the model accurately identifies notes written by such high-workload physicians. It also flags notes written in other settings with high fatigue: overnight shifts and high patient volumes. When the model identifies signs of fatigue in a note, physician decision-making for that patient appears worse: yield of testing for heart attack is 19% lower with each standard deviation increase in model-predicted fatigue. A key feature of notes written by fatigued doctors is the predictability of the next word, given the preceding context. Perhaps unsurprisingly, because word prediction is the core of how large language models (LLMs) work, we find that predicted fatigue of LLM-written notes is 74% higher than that of physician-written ones, highlighting the possibility that LLMs may introduce distortions in generated text that are not yet fully understood.

Data availability

Data supporting the findings of this study are available in the article and its Supplementary information. Source data are provided as Source Data file and may be obtained from the corresponding authors upon request. The data used for the primary analysis consist of individual patient records, including free-text physician notes, which are challenging to fully deidentify. As a result, the IRB did not approve public data sharing. External validation was performed using the publicly available MIMIC-III dataset (https://physionet.org/content/mimiciii/1.4/). Source data are provided with this paper.

Code that supports the main findings of this study are available on GitHub: https://github.com/ChicagoHAI/physician-fatigue.

Files

Machine-learning-model-using-clinical-notes-to-identify-physician-fatigue.pdf

Files (3.8 MB)

Name	Size	Download all
41467_2025_60865_MOESM4_ESM.xlsx Source data md5:da11bf7b02a9f9d1585ab10ae2faa88c	1.6 MB	Download
Machine-learning-model-using-clinical-notes-to-identify-physician-fatigue.pdf Article md5:e99bf09d31201d232dc9a603eff185d7	731.5 kB	Preview Download
Supplementary-information.zip Supplementary information files md5:6acf7c59d7d4052e8f7c6ccd9e85ed7c	1.4 MB	Preview Download

Additional details

DOI: 10.1038/s41467-025-60865-4
Other: oai:uchicago.tind.io:15613

Unknown funder
IIS-2126602
Unknown funder
Sloan Research Fellowship

Division(s): Harris School of Public Policy Studies, Physical Sciences Division
Department(s): Computer Science, Harris School of Public Policy Studies Research Publications

Views

Downloads

Show more details

	All versions	This version
Views	9	9
Downloads	21	21
Data volume	26.5 MB	26.5 MB

More info on how stats are collected....

DOI

Resource type

Journal article

Publisher

University of Chicago

Published in

Nature Communications, 2025.

Languages

English

License

Creative Commons Attribution Non Commercial No Derivatives 4.0 International

No further description. Read more
Distribution License

No further description.

Copyrights

© The Author(s) 2025 This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Technical metadata

Created: May 29, 2026
Modified: May 29, 2026

A machine learning model using clinical notes to identify physician fatigue

Data availability

Files

Machine-learning-model-using-clinical-notes-to-identify-physician-fatigue.pdf

Files (3.8 MB)

Additional details

Identifiers

Funding

UChicago Information

A machine learning model using clinical notes to identify physician fatigue

Creators

Description

Data availability

Files

Machine-learning-model-using-clinical-notes-to-identify-physician-fatigue.pdf

Files (3.8 MB)

Additional details

Identifiers

Funding

UChicago Information