Addressing discretization-induced bias in demographic prediction
- 1. Cornell University
- 2. University of Chicago
- 3. University of Michigan
- 4. Cornell Tech
Description
Data availability
The replication dataset in SI Appendix C.1 is publicly available at Barber and Argyle (57) and is the result of work by Argyle and Barber (21). The replication dataset in SI Appendix C.4 is publicly available at Greengard and Gelman (58) and is the result of work by Greengard and Gelman (18). The code and Jupyter notebook used for these analyses are available at https://github.com/evan-dong/demographic-prediction-argmax-bias. A more general code repository, with a Jupyter notebook that other researchers and practitioners can use to discretize and analyze their own model outputs, is at https://github.com/evan-dong/demographic-discretization. The commercial dataset used in our analysis is privately owned by TargetSmart, a political data and analytics company; we accessed a copy under a research license from PredictWise, a campaign analytics firm. We are unable to provide public access to this proprietary dataset. Researchers can apply for access to TargetSmart data by contacting TargetSmart at https://targetsmart.com/contact-us/.
Files (20.2 MB)
pgaf027.pdf
| Name | Size |
|---|---|
| Article (md5:9e771b8a7740dbe0f3c7168860a9a409) | 7.1 MB |
| Supplementary material (md5:033c24611b40ec4eb5778a096ef701d9) | 13.1 MB |
Additional details
Identifiers
- DOI: 10.1093/pnasnexus/pgaf027
- Other: oai:uchicago.tind.io:14756