Princeton University users: to view a senior thesis while away from campus, connect to the campus network via the Global Protect virtual private network (VPN). Unaffiliated researchers: please note that requests for copies are handled manually by staff and require time to process.
 

Publication:

ChangeIn: A Dynamic Camera Intrinsics Benchmark for Robust Computer Vision Systems

datacite.rightsrestricted
dc.contributor.advisorDeng, Jia
dc.contributor.authorBhattacharjee, Roma
dc.date.accessioned2025-08-06T14:15:12Z
dc.date.available2025-08-06T14:15:12Z
dc.date.issued2025-04-10
dc.description.abstractModern Simultaneous Localization and Mapping (SLAM) systems assume that camera intrinsics remain fixed throughout a video. This assumption breaks down in real-world, “in-the-wild” scenarios where zoom and focus can vary dynamically, meaning existing SLAM systems do not work on real-world videos. To address this, we introduce ChangeIn, a benchmark tailored for per-frame dynamic intrinsics prediction, laying the groundwork for more robust SLAM and other vision systems that can handle dynamic intrinsics. To properly label ChangeIn’s dataset, we recorded a set of calibration videos using traditional calibration boards and resorting to drone-based methods for wider FOVs. Then, a comprehensive lookup table (LUT) was built that interpolates between our collected calibration data to map any lens focal length and focus distance to intrinsic parameters. Using this table, we produced ground truth intrinsics for a diverse collection of 389 real-world videos (126 indoor, 263 outdoor), featuring focal lengths from 17mm to 250mm and captured across varied environments—from Princeton’s campuses to urban centers. In addition to featuring changing zooms and focus distances, these videos include both static and moving cameras, as well as dynamic scene elements like people, vehicles, and interaction with objects, adding to the realism and complexity of the dataset. Evaluation of three existing intrinsic prediction methods on this benchmark demonstrates much room for improvement. We hope ChangeIn will serve as a valuable resource for developing and evaluating models that estimate intrinsics on a per-frame basis and ultimately foster new research into SLAM and other computer vision applications on real-world videos.
dc.identifier.urihttps://theses-dissertations.princeton.edu/handle/88435/dsp01xk81jp830
dc.language.isoen_US
dc.titleChangeIn: A Dynamic Camera Intrinsics Benchmark for Robust Computer Vision Systems
dc.typePrinceton University Senior Theses
dspace.entity.typePublication
dspace.workflow.startDateTime2025-04-10T20:37:11.309Z
dspace.workflow.startDateTime2025-04-11T03:34:51.081Z
pu.certificateOptimization and Quantitative Decision Science
pu.contributor.authorid920279101
pu.date.classyear2025
pu.departmentComputer Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
written_final_report.pdf
Size:
9.33 MB
Format:
Adobe Portable Document Format
Download

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
100 B
Format:
Item-specific license agreed to upon submission
Description:
Download