If At First You Don’t Succeed, Try, Try, Again...? Insights and LLM-informed Tooling for Detecting Retry Bugs in Software Systems

Stoica, Bogdan Alexandru; Sethi, Utsav; Su, Yiming; Zhou, Cyrus; Lu, Shan; Mace, Jonathan; Musuvathi, Madanlal; Nath, Suman

2024

Abstract

Retry, the re-execution of a task on failure, is a common mechanism to enable resilient software systems. Yet, despite its commonality and long history, retry remains difficult to implement and test.

Guided by our study of real-world retry issues, we propose a novel suite of static and dynamic techniques to detect retry problems in software. We find that the ad-hoc nature of retry implementation poses challenges for traditional program analysis but can be well suited for large language models; and that carefully repurposing existing unit tests can, along with fault injection, expose various types of retry problems.
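As a rough illustration of the idea of combining fault injection with an existing test, the following minimal Python sketch wraps a callable so that its first few calls fail, then checks that a simple retry loop both recovers and respects its attempt cap. It is not the authors' tooling: `TransientError`, `retry`, `FaultInjector`, and all parameter names are hypothetical, chosen only for this example.

```python
import time


class TransientError(Exception):
    """Hypothetical error type standing in for a recoverable failure."""


def retry(task, max_attempts=3, base_delay=0.01, sleep=time.sleep):
    """Run task(), retrying on TransientError with a cap and exponential backoff."""
    for attempt in range(1, max_attempts + 1):
        try:
            return task()
        except TransientError:
            if attempt == max_attempts:
                raise  # give up: the cap prevents unbounded retry
            # Back off before the next attempt: base_delay, 2x, 4x, ...
            sleep(base_delay * 2 ** (attempt - 1))


class FaultInjector:
    """Wrap a callable so that its first `failures` calls raise TransientError."""

    def __init__(self, func, failures):
        self.func = func
        self.failures = failures
        self.calls = 0

    def __call__(self, *args, **kwargs):
        self.calls += 1
        if self.calls <= self.failures:
            raise TransientError(f"injected fault on call {self.calls}")
        return self.func(*args, **kwargs)


if __name__ == "__main__":
    # Record requested delays instead of sleeping, so the check runs instantly.
    delays = []
    flaky = FaultInjector(lambda: "ok", failures=2)
    print(retry(flaky, max_attempts=3, sleep=delays.append), flaky.calls, delays)
```

Passing `sleep` as a parameter lets a test observe the backoff schedule without waiting; a test that injects more faults than `max_attempts` can likewise confirm that the loop stops retrying instead of looping indefinitely.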

Details

Title

If At First You Don’t Succeed, Try, Try, Again...? Insights and LLM-informed Tooling for Detecting Retry Bugs in Software Systems

Author

Stoica, Bogdan Alexandru : University of Chicago : (https://orcid.org/0000-0002-0130-2065)
Sethi, Utsav : University of Chicago : (https://orcid.org/0009-0002-5865-6187)
Su, Yiming : University of Chicago : (https://orcid.org/0009-0004-0128-8664)
Zhou, Cyrus : University of Chicago : (https://orcid.org/0000-0001-8768-0659)
Lu, Shan : Microsoft Research : (https://orcid.org/0000-0002-3701-9296)
Mace, Jonathan : Microsoft Research : (https://orcid.org/0000-0002-3701-9296)
Musuvathi, Madanlal : Microsoft Research : (https://orcid.org/0000-0002-2482-7892)
Nath, Suman : Microsoft Research : (https://orcid.org/0000-0001-7813-9756)

Content Type

Article

Published in

SOSP: Proceedings of the ACM SIGOPS Symposium on Operating Systems Principles

Identifier(s)

DOI: https://doi.org/10.1145/3694715.3695971

Funding Information

National Science Foundation, CNS-2313190
National Science Foundation, CCF-2119184
National Science Foundation, CNS-1956180
Chameleon Cloud Project
Eckhardt Fellowship
University of Chicago (ROR: https://ror.org/024mw5h28), Quad Undergraduate Research grants

Publication Date

2024-11-15

Language

English

Copyright Statement

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Licensing

CC BY-NC-SA

Record Appears in

Physical Sciences Division > Computer Science
All

Record Created

2024-11-17