Princeton University Users: If you would like to view a senior thesis while you are away from campus, you will need to connect to the campus network remotely via the Global Protect virtual private network (VPN).
 

Publication:

Beyond the Stats: Quantifying Intangible Qualities in NFL Draft Prospects

dc.contributor.advisorAkrotirianakis, Ioannis
dc.contributor.authorBeyene, Jonathan
dc.date.accessioned2025-08-06T17:14:15Z
dc.date.available2025-08-06T17:14:15Z
dc.date.issued2025-04-10
dc.description.abstractNFL teams invest heavily in the scouting and evaluation of college players before the draft, however many high draft picks underperform while later round picks emerge as stars. This thesis investigates whether intangible traits, such as leadership, competitiveness, and work ethic can be quantified from scouting reports and used to better predict a player’s success in the NFL. Using a combination of zero-shot classification and sentiment analysis, trait-specific sentiment scores are computed across multiple positions. These scores are then incorporated alongside quantitative combine data and college career statistics for K-Means clustering to group players with similar profiles. For each position where the incorporation of intangible traits was more explanatory than using strictly quantitative statistics, six regression models were trained to predict a custom-defined career success metric based on positional performance and Approximate Value (AV). Cluster assignments were one-hot encoded to determine their predictive impact on career success. Clusters with high cluster coefficients and late average draft positions were identified as “undervalued,” while clusters with early average draft positions and lower coefficients were considered “overvalued.” Results indicated that qualitative clustering often yielded higher explanatory power than models based purely on quantitative features. In several cases, we observed higher cluster coefficients with a later average draft pick, suggesting that NFL teams may be systematically overlooking certain high-potential players. This thesis demonstrates the potential of utilizing Natural Language Processing and qualitative data in the evaluation of professional football scouting, and how it can prove more effective than the traditional quantitative approach.
dc.identifier.urihttps://theses-dissertations.princeton.edu/handle/88435/dsp01kw52jc525
dc.language.isoen_US
dc.titleBeyond the Stats: Quantifying Intangible Qualities in NFL Draft Prospects
dc.typePrinceton University Senior Theses
dspace.entity.typePublication
dspace.workflow.startDateTime2025-04-10T04:18:33.615Z
pu.contributor.authorid920250169
pu.date.classyear2025
pu.departmentOps Research & Financial Engr
pu.minorStatistics and Machine Learning

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Beyene_ORFE_SeniorThesis.pdf
Size:
7.41 MB
Format:
Adobe Portable Document Format
Download

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
100 B
Format:
Item-specific license agreed to upon submission
Description:
Download