Court v. Classifier: A Data-Driven Evaluation of Language and Decision-Making on the U.S. Supreme Court

Lee, Erin

Publication:
Court v. Classifier: A Data-Driven Evaluation of Language and Decision-Making on the U.S. Supreme Court

datacite.rights	restricted
dc.contributor.advisor	Kernighan, Brian W.
dc.contributor.author	Lee, Erin
dc.date.accessioned	2025-08-06T15:32:49Z
dc.date.available	2025-08-06T15:32:49Z
dc.date.issued	2025-04-10
dc.description.abstract	This thesis investigates the language, behavior, and decision-making of U.S. Supreme Court justices through a computational lens. Grounding my study in structured and curated datasets—including justice- and case-level variables, authored opinions, and over 1,600 transcribed oral arguments—I analyze how justices speak, write, and vote. I begin with an empirical study of voting patterns, opinion authorship, and judicial trends across natural court eras. I then turn to oral argument behavior, quantifying the participation of justices across alignments and outcomes. Building on these insights, I implement a series of predictive classifiers, replicating and extending a previous statistical model to include oral argument features. While the inclusion of these features yields modest and at times inconclusive improvements in accuracy, they underscore the complexity of predicting voting patterns based on oral argument behavior, given the distinct rhetorical styles and engagement patterns of individual justices. Nonetheless, the findings allude to promising directions for future modeling of case outcomes using alternative features derived from oral arguments. Finally, I experiment with prompting large language models (LLMs) to classify tones of judicial questioning due to the limitations of more traditional natural language processing techniques. I also simulate justice voting behavior with LLMs on unseen cases, assessing the capabilities of generative AI for legal reasoning. Through our experimentation, the LLMs proved to be limited in their capacity for legal judgement, though they also demonstrate opportunity to be better leveraged when provided additional guidance through fine-tuning. Altogether, this study offers a data-driven portrait of the Supreme Court and its justices, rooted in empirical data and powered by modern machine learning methods.
dc.identifier.uri	https://theses-dissertations.princeton.edu/handle/88435/dsp01cf95jf920
dc.language.iso	en_US
dc.title	Court v. Classifier: A Data-Driven Evaluation of Language and Decision-Making on the U.S. Supreme Court
dc.type	Princeton University Senior Theses
dspace.entity.type	Publication
dspace.workflow.startDateTime	2025-04-20T19:16:50.542Z
pu.contributor.authorid	920291112
pu.date.classyear	2025
pu.department	Computer Science

Files

Original bundle

Now showing 1 - 1 of 1

Name:: written_final_report_final.pdf
Size:: 5.49 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 100 B
Format:: Item-specific license agreed to upon submission
Description:

Download

Collections

Computer Science, 1987-2025

Publication: Court v. Classifier: A Data-Driven Evaluation of Language and Decision-Making on the U.S. Supreme Court

Files

Original bundle

License bundle

Collections

Publication:
Court v. Classifier: A Data-Driven Evaluation of Language and Decision-Making on the U.S. Supreme Court