Explaining Gendered Language Through Embedding Approximation: How “Stupid” Became a Man’s Word

Zhu, Yutong

doi:10.6082/uchicago.15448

2025

Abstract

Recent advances in word embedding and digitized text have reinvigorated macro-historical cultural analysis based on empirical data. However, static word2vec embedding, a distance-based measurement of latent semantic structures, becomes less interpretable when moved beyond survey-validated categories. Often, there is a one-to-many mapping between low-level semantic distance and high-level meanings, which obscures the pragmatic language use that drives latent semantic shifts in the embedding space. Focusing on “unintelligence” words in 20th-century American discourse, I critique the stereotype-based interpretation of their changing gender alignment in historical word2vec embeddings of American literature. While the gender axis reliably measures word associations, low-level associations do not neatly map onto well-defined gender stereotypes. Applying an adapted A La Carte approximation of individual context words and sentences, I suggest another possible interpretation: that the masculinization of “unintelligence” words could have stemmed from an increasing harshness in tone (a spurious variable), not from judgments of male intelligence or stupidity. This study highlights the interpretive gap in embedding-based methods and calls for more rigorous approaches to making intersubjectively valid interpretations of cultural change.
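The two measurements named in the abstract, projection onto a gender axis and approximation of a usage from its context words, can be illustrated with a minimal sketch. This is not the thesis's implementation: the toy vectors, the anchor word lists, and the function names (gender_axis, axis_projection, context_average) are all hypothetical, and the optional transform argument only stands in for the learned linear map that an A La Carte-style approximation applies to the averaged context vectors.

```python
import numpy as np

def unit(v):
    """Scale a vector to unit length."""
    v = np.asarray(v, dtype=float)
    return v / np.linalg.norm(v)

def gender_axis(emb, male_words, female_words):
    """Axis pointing from the female anchor words toward the male ones."""
    male = np.mean([unit(emb[w]) for w in male_words], axis=0)
    female = np.mean([unit(emb[w]) for w in female_words], axis=0)
    return unit(male - female)

def axis_projection(vec, axis):
    """Cosine of vec with the axis: positive leans male, negative female."""
    return float(unit(vec) @ axis)

def context_average(emb, context_words, transform=None):
    """Approximate one usage by averaging its context word vectors.

    `transform` is an optional matrix (assumed to be fit elsewhere);
    when it is None the plain average is returned.
    """
    known = [np.asarray(emb[w], dtype=float) for w in context_words if w in emb]
    avg = np.mean(known, axis=0)
    return avg if transform is None else transform @ avg

# Hypothetical 3-dimensional embeddings, invented for illustration only.
toy = {
    "he": [1.0, 0.0, 0.2], "man": [0.9, 0.1, 0.1],
    "she": [-1.0, 0.0, 0.2], "woman": [-0.9, 0.1, 0.1],
    "damn": [0.5, 0.8, 0.0], "fool": [0.3, 0.6, 0.1],
    "dear": [-0.4, 0.7, 0.0], "silly": [-0.3, 0.6, 0.1],
}

axis = gender_axis(toy, ["he", "man"], ["she", "woman"])
harsh = axis_projection(context_average(toy, ["damn", "fool"]), axis)
mild = axis_projection(context_average(toy, ["dear", "silly"]), axis)
print(f"harsh context: {harsh:+.2f}, mild context: {mild:+.2f}")
```

In this toy setup the harsher context words lean toward the male end of the axis and the milder ones toward the female end, although none of them refers to gender or intelligence directly. That is the kind of low-level association the abstract argues cannot be read straightforwardly as a stereotype.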

Details

Title

Explaining Gendered Language Through Embedding Approximation: How “Stupid” Became a Man’s Word

Author

Zhu, Yutong (University of Chicago; ORCID: https://orcid.org/0009-0009-1014-4132)

Degree Type

M.A.

Content Type

Thesis

Academic Advisor

John Levi Martin
Rochelle Layla Terman
David A. Peterson

Keywords

culture; gender; language; computational text analysis; word embedding

Digital Object Identifier

https://doi.org/10.6082/uchicago.15448

Publication Date

2025-06

Language

English

Licensing

CC BY-NC-ND

Record Appears in

Social Sciences Division > Computational Social Sciences (MACSS)
Social Sciences Division > MA Thesis Archive
All

Record Created

2025-06-02
