Princeton University Users: If you would like to view a senior thesis while you are away from campus, you will need to connect to the campus network remotely via the Global Protect virtual private network (VPN). If you are not part of the University requesting a copy of a thesis, please note, all requests are handled manually by staff and will require additional time to process.
 

Publication:

Uncertainty-Aware Transformers: Conformal Prediction for LLMs

dc.contributor.advisorJha, Niraj Kumar
dc.contributor.authorVellore, Abhiram
dc.date.accessioned2025-08-12T13:24:34Z
dc.date.available2025-08-12T13:24:34Z
dc.date.issued2025-04-14
dc.description.abstractThis study extends the CONFINE algorithm as a framework for uncertainty quantification onto transformer-based language models. CONFIDE (CONformal prediction for FIne-tuned DEep language models) applies conformal prediction to the internal embeddings of BERT and RoBERTa architectures, introducing new hyperparameters such as distance metrics and PCA. CONFIDE uses either [CLS] token embeddings or flattened hidden states to construct class-conditional nonconformity scores, enabling statistically valid prediction sets with instance-level explanations. Empirically, CONFIDE improves test accuracy by up to 4.09% on BERT-TINY and achieves greater correct efficiency compared to prior methods, including NM2 and VanillaNN. We show that early and intermediate transformer layers often yield better-calibrated and more semantically meaningful representations for conformal prediction. In resource-constrained models and high-stakes tasks with ambiguous labels, CONFIDE offers robustness and interpretability where softmax-based uncertainty fails.
dc.identifier.urihttps://theses-dissertations.princeton.edu/handle/88435/dsp018623j2203
dc.language.isoen_US
dc.titleUncertainty-Aware Transformers: Conformal Prediction for LLMs
dc.typePrinceton University Senior Theses
dspace.entity.typePublication
dspace.workflow.startDateTime2025-04-15T01:49:46.210Z
pu.certificateOptimization and Quantitative Decision Science
pu.contributor.authorid920272710
pu.date.classyear2025
pu.departmentElectrical and Computer Engineering
pu.minorRobotics

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Vellore, Abhiram.pdf
Size:
10.17 MB
Format:
Adobe Portable Document Format
Download

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
100 B
Format:
Item-specific license agreed to upon submission
Description:
Download