Princeton University users: to view a senior thesis while away from campus, connect to the campus network via the GlobalProtect virtual private network (VPN). Unaffiliated researchers: please note that requests for copies are handled manually by staff and take time to process.
 

Publication:

r/LinguisticPolarization: Lexical and Semantic Variation between Political Communities on Reddit

Files

EMThesis.pdf (1.62 MB)

Date

2025-04-10

Abstract

Political polarization is a growing problem in the US and undermines the stability of our democracy. Linguistic polarization is the manifestation of political polarization in the language used by ideological groups, and it can deepen ideological divides. In this thesis, we investigate two forms of linguistic polarization: lexical and semantic. Lexical polarization concerns vocabulary differences between ideological groups, while semantic polarization captures shifts in the meanings of words. We examine four corpora of Reddit data, collected from r/democrats and r/Republican in 2019 and 2023. We use frequency-based and embedding-based analysis methods to characterize the linguistic polarization in these datasets. These analyses allow us to identify polarizing issues and political figures, as well as communication gaps between the two sides of the ideological spectrum that may be exacerbating overall polarization.
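
To give a concrete sense of what a frequency-based comparison between two community corpora can look like, here is a minimal sketch in Python. It is not taken from the thesis: the smoothed log-odds ratio, the function name `log_odds`, the whitespace tokenization, and the toy sentences are all assumptions chosen for illustration, and the thesis may well use a different statistic.

```python
import math
from collections import Counter


def log_odds(corpus_a, corpus_b, alpha=1.0):
    """Smoothed log-odds ratio per word (hypothetical helper, not from the thesis).

    Positive scores mark words over-represented in corpus_a,
    negative scores mark words over-represented in corpus_b.
    Assumes the combined vocabulary has at least two words.
    """
    # Naive tokenization: lowercase and split on whitespace (an assumption).
    counts_a = Counter(w for doc in corpus_a for w in doc.lower().split())
    counts_b = Counter(w for doc in corpus_b for w in doc.lower().split())
    vocab = set(counts_a) | set(counts_b)

    # Additive smoothing so words absent from one corpus keep a nonzero probability.
    total_a = sum(counts_a.values()) + alpha * len(vocab)
    total_b = sum(counts_b.values()) + alpha * len(vocab)

    scores = {}
    for w in vocab:
        p_a = (counts_a[w] + alpha) / total_a
        p_b = (counts_b[w] + alpha) / total_b
        scores[w] = math.log(p_a / (1 - p_a)) - math.log(p_b / (1 - p_b))
    return scores


# Toy, made-up posts standing in for two community corpora.
posts_a = ["tax cuts now", "tax relief"]
posts_b = ["health care now", "health reform"]

scores = log_odds(posts_a, posts_b)
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked[0], ranked[-1])
```

Words at the two ends of the ranking are candidates for lexically polarized vocabulary, while words with scores near zero are used at similar rates by both groups. A semantic comparison would need a different tool, such as word embeddings trained separately on each corpus.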
