Princeton University users: to view a senior thesis while away from campus, connect to the campus network via the Global Protect virtual private network (VPN). Unaffiliated researchers: please note that requests for copies are handled manually by staff and require time to process.
 

Publication:

Tracking Topics in Earnings Call Transcripts Using Natural Language Processing

datacite.rightsrestricted
dc.contributor.advisorHolen, Margaret
dc.contributor.authorMaldacena, Marcos
dc.date.accessioned2025-08-06T14:39:35Z
dc.date.available2025-08-06T14:39:35Z
dc.date.issued2025-04-10
dc.description.abstractThis thesis explores the incremental value that Large Language Models (LLMs) can provide compared to existing bag-of-words methodologies to automate the reading of earnings call transcripts and create topic mappings that help executives, investors, and policymakers make more informed decisions. We run several small-scale experiments to assess the abilities of LLMs to classify texts and concluded that using standalone LLMs to classify portions of text is not the most optimal approach. Instead, we propose a hybrid approach that leverages LLMs to generate keyword lists, which are subsequently applied within a bag-of-words framework, enabling us to effectively map 30 distinct topics across both the presentation section and the questions earnings calls. We also explore how structural and contextual knowledge could be applied to enhance both LLM and bag-of-words methodologies for topic mapping. Using a dataset comprised of earnings calls from S&P 500 companies from 2011 to 2025, we analyze trends in specific topics mentioned on calls, particularly focusing on tariffs. Furthermore, we present a detailed case study of Dollar Tree to illustrate how our topic mapping approach can effectively inform decision-makers.
dc.identifier.urihttps://theses-dissertations.princeton.edu/handle/88435/dsp01qz20sw95b
dc.language.isoen
dc.titleTracking Topics in Earnings Call Transcripts Using Natural Language Processing
dc.typePrinceton University Senior Theses
dspace.entity.typePublication
dspace.workflow.startDateTime2025-04-10T18:41:45.414Z
pu.contributor.authorid920304635
pu.date.classyear2025
pu.departmentOps Research & Financial Engr
pu.minorStatistics and Machine Learning
pu.minorComputer Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Marcos Maldacena ORFE Thesis .pdf
Size:
5.56 MB
Format:
Adobe Portable Document Format
Download

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
100 B
Format:
Item-specific license agreed to upon submission
Description:
Download