Published May 11, 2024 | Version v1
Journal article Open

JupyterLab in Retrograde: Contextual Notifications That Highlight Fairness and Bias Issues for Data Scientists

Description

Current algorithmic fairness tools focus on auditing completed models, neglecting the potential downstream impacts of iterative decisions about cleaning data and training machine learning models. In response, we developed Retrograde, a JupyterLab environment extension for Python that generates real-time, contextual notifications for data scientists about decisions they are making regarding protected classes, proxy variables, missing data, and demographic differences in model performance. Our novel framework uses automated code analysis to trace data provenance in JupyterLab, enabling these notifications. In a between-subjects online experiment, 51 data scientists constructed loan-decision models with Retrograde providing notifications continuously throughout the process, only at the end, or never. Retrograde's notifications successfully nudged participants to account for missing data, avoid using protected classes as predictors, minimize demographic differences in model performance, and exhibit healthy skepticism about their models.

Files

JupyterLab-in-Retrograde.pdf

Files (1.8 MB)

Name Size Download all
md5:6f8df972ff6fbe0cc04137ab44b197a2
1.8 MB Preview Download

Additional details

Identifiers

DOI
10.1145/3613904.3642755
Other
oai:uchicago.tind.io:11805

UChicago Information

Division(s)
Physical Sciences Division
Department(s)
Computer Science