
Publication:

Fact or Fiction? Evaluating the Ability of Large Language Models to Detect Legal Hallucinations

datacite.rights: restricted
dc.contributor.advisor: Henderson, Peter
dc.contributor.author: Ma, Leyuan
dc.date.accessioned: 2026-01-05T19:32:04Z
dc.date.available: 2026-01-05T19:32:04Z
dc.date.issued: 2025
dc.description.abstract: As large language models (LLMs) become increasingly integrated into legal research tools, concerns about their tendency to “hallucinate” (generate factually incorrect or unsupported content) have grown. This paper investigates whether LLMs can also serve as factual consistency checkers in legal question-answering: given a legal query, an AI-generated answer, and its cited sources, can the model assess whether the response contains hallucinated information? To evaluate this approach, we construct two datasets: one comprising AI-generated question–answer pairs with controlled hallucinations, and another based on real outputs from Westlaw’s AI-Assisted Research (AI-AR) tool. We assess five models (GPT-4o, DeepSeek-R1, and three LLaMA variants) on their ability to detect and classify hallucinations. Results show that larger models, particularly GPT-4o and DeepSeek-R1, significantly outperform smaller alternatives and can reliably serve as automated evaluators of legal content. Although Westlaw AI-AR has improved since prior benchmarks, hallucinations remain a recurring issue. These findings suggest that LLMs hold promise not only as content generators, but also as scalable evaluators for legal AI systems.
dc.identifier.uri: https://theses-dissertations.princeton.edu/handle/88435/dsp011n79h777m
dc.language.iso: en_US
dc.title: Fact or Fiction? Evaluating the Ability of Large Language Models to Detect Legal Hallucinations
dc.type: Princeton University Senior Theses
dspace.entity.type: Publication
dspace.workflow.startDateTime: 2025-12-15T16:58:01.074Z
pu.contributor.authorid: 920291705
pu.date.classyear: 2025
pu.department: Computer Science

Files

Original bundle

Name: lm8183_written_final_report-3.pdf
Size: 1.64 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 100 B
Format: Item-specific license agreed to upon submission