Publication: Weak Convergence of L2-Regularized Two-Layer Neural Networks under SGD via Mean Field Theory
datacite.rights | restricted | |
dc.contributor.advisor | Hanin, Boris | |
dc.contributor.author | Nguyen, Vincent Vinh Huu | |
dc.date.accessioned | 2025-08-07T17:12:08Z | |
dc.date.available | 2025-08-07T17:12:08Z | |
dc.date.issued | 2025-04-28 | |
dc.description.abstract | Extending the work of Sirignano and Spiliopoulos (2020), we use mean field theory to study two-layer neural networks with $l_2$-regularization trained under single-sample online stochastic gradient descent. We prove that in the asymptotic regime of both infinite training steps and infinite hidden layer width, such a neural network weakly converges to a deterministic and unique solution that satisfies a partial differential equation of the gradient flow form seen elsewhere in optimal transport and physics. Moreover, we show that the parameters of said neural network, despite being interdependent throughout training, asymptotically become independent. These results are only subject to loose moment bounds at initialization. Our proofs utilize a probabilistic approach on the network's training evolution instead of studying the geometry of the loss surface. We also provide numerical simulation results consistent with our theoretical guarantees. | |
dc.identifier.uri | https://theses-dissertations.princeton.edu/handle/88435/dsp014b29b945n | |
dc.language.iso | en_US | |
dc.title | Weak Convergence of L2-Regularized Two-Layer Neural Networks under SGD via Mean Field Theory | |
dc.type | Princeton University Senior Theses | |
dspace.entity.type | Publication | |
dspace.workflow.startDateTime | 2025-04-28T06:36:43.796Z | |
dspace.workflow.startDateTime | 2025-05-02T16:50:27.507Z | |
dspace.workflow.startDateTime | 2025-05-09T02:59:32.952Z | |
pu.certificate | Optimization and Quantitative Decision Science | |
pu.contributor.authorid | 920294015 | |
pu.date.classyear | 2025 | |
pu.department | Mathematics | |
pu.minor | Statistics and Machine Learning | |
pu.minor | Computer Science | |
pu.minor | Values and Public Life |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- NGUYEN_VINCENT_THESIS.pdf
- Size:
- 2.39 MB
- Format:
- Adobe Portable Document Format
Download
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 100 B
- Format:
- Item-specific license agreed to upon submission
- Description:
Download