Princeton University users: to view a senior thesis while away from campus, connect to the campus network via the Global Protect virtual private network (VPN). Unaffiliated researchers: please note that requests for copies are handled manually by staff and require time to process.
 

Publication:

Weak Convergence of L2-Regularized Two-Layer Neural Networks under SGD via Mean Field Theory

datacite.rightsrestricted
dc.contributor.advisorHanin, Boris
dc.contributor.authorNguyen, Vincent Vinh Huu
dc.date.accessioned2025-08-07T17:12:08Z
dc.date.available2025-08-07T17:12:08Z
dc.date.issued2025-04-28
dc.description.abstractExtending the work of Sirignano and Spiliopoulos (2020), we use mean field theory to study two-layer neural networks with $l_2$-regularization trained under single-sample online stochastic gradient descent. We prove that in the asymptotic regime of both infinite training steps and infinite hidden layer width, such a neural network weakly converges to a deterministic and unique solution that satisfies a partial differential equation of the gradient flow form seen elsewhere in optimal transport and physics. Moreover, we show that the parameters of said neural network, despite being interdependent throughout training, asymptotically become independent. These results are only subject to loose moment bounds at initialization. Our proofs utilize a probabilistic approach on the network's training evolution instead of studying the geometry of the loss surface. We also provide numerical simulation results consistent with our theoretical guarantees.
dc.identifier.urihttps://theses-dissertations.princeton.edu/handle/88435/dsp014b29b945n
dc.language.isoen_US
dc.titleWeak Convergence of L2-Regularized Two-Layer Neural Networks under SGD via Mean Field Theory
dc.typePrinceton University Senior Theses
dspace.entity.typePublication
dspace.workflow.startDateTime2025-04-28T06:36:43.796Z
dspace.workflow.startDateTime2025-05-02T16:50:27.507Z
dspace.workflow.startDateTime2025-05-09T02:59:32.952Z
pu.certificateOptimization and Quantitative Decision Science
pu.contributor.authorid920294015
pu.date.classyear2025
pu.departmentMathematics
pu.minorStatistics and Machine Learning
pu.minorComputer Science
pu.minorValues and Public Life

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
NGUYEN_VINCENT_THESIS.pdf
Size:
2.39 MB
Format:
Adobe Portable Document Format
Download

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
100 B
Format:
Item-specific license agreed to upon submission
Description:
Download