If At First You Don't Succeed, Try, Try, Again...? Insights and LLM-informed Tooling for Detecting Retry Bugs in Software Systems

Stoica, Bogdan Alexandru; Sethi, Utsav; Su, Yiming; Zhou, Cyrus; Lu, Shan; Mace, Jonathan; Musuvathi, Madanlal; Nath, Suman

doi:10.6082/8r23k-fas64

Published November 15, 2024 | Version v1

Journal article | Open access

1. University of Chicago
2. Microsoft Research

Retry, the re-execution of a task on failure, is a common mechanism to enable resilient software systems. Yet, despite its commonality and long history, retry remains difficult to implement and test.

Guided by our study of real-world retry issues, we propose a novel suite of static and dynamic techniques to detect retry problems in software. We find that the ad-hoc nature of retry implementation poses challenges for traditional program analysis but can be well suited for large language models; and that carefully repurposing existing unit tests can, along with fault injection, expose various types of retry problems.
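As a rough illustration of the idea of pairing an existing test with fault injection, the sketch below wraps a task so that its first few calls fail, then checks that a retry helper both recovers and respects its attempt cap. It is not the authors' tooling: `TransientError`, `retry`, and `FaultInjector` are hypothetical names introduced here, and the exponential backoff policy is an assumption chosen for the example.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class TransientError(Exception):
    """Hypothetical error type standing in for a recoverable failure."""


def retry(
    task: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 0.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Re-execute `task` on TransientError, at most `max_attempts` times.

    The delay doubles after each failed attempt (assumed policy).
    `sleep` is injectable so tests can record delays instead of waiting.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return task()
        except TransientError:
            if attempt == max_attempts:
                # Cap reached: surface the failure instead of retrying forever.
                raise
            sleep(base_delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")


class FaultInjector:
    """Wraps a task and forces its first `failures` calls to raise."""

    def __init__(self, task: Callable[[], T], failures: int) -> None:
        self.task = task
        self.failures = failures
        self.calls = 0

    def __call__(self) -> T:
        self.calls += 1
        if self.calls <= self.failures:
            raise TransientError(f"injected fault #{self.calls}")
        return self.task()
```

In a test, one might wrap the operation under test as `flaky = FaultInjector(operation, failures=2)`, call `retry(flaky, max_attempts=3, sleep=recorded.append)`, and then assert on `flaky.calls` and on the recorded delays. A retry loop with no cap or no delay would show up as an unexpected call count or an empty delay list.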

Files

Name	Size
If-At-First-You-Dont-Succeed-Try-Try-Again-Insights-and-LLM-informed-Tooling.pdf (md5:ee0c646525e8b3b9d3719605f4aca99d)	978.8 kB

Additional details

DOI: 10.1145/3694715.3695971
Other: oai:uchicago.tind.io:14028

National Science Foundation: CNS-2313190
National Science Foundation: CCF-2119184
National Science Foundation: CNS-1956180
Chameleon Cloud Project
Eckhardt Fellowship
University of Chicago
Quad Undergraduate Research grants

Division(s): Physical Sciences Division
Department(s): Computer Science

	All versions	This version
Views	22	22
Downloads	286	286
Data volume	24.5 MB	24.5 MB

Resource type

Journal article

Publisher

University of Chicago

Published in

SOSP: Proceedings of the ACM SIGOPS Symposium on Operating Systems Principles, 2024.

Languages

English

License

Creative Commons Attribution Non Commercial Share Alike 4.0 International

Technical metadata

Created: May 22, 2026
Modified: May 22, 2026
