Publication: SpectraLDS: Distilling Spectral Filters into Constant-Time Recurrent Models
| datacite.rights | restricted | |
| dc.contributor.advisor | Hazan, Elad | |
| dc.contributor.author | Fortgang, Shlomo T. | |
| dc.date.accessioned | 2025-08-06T15:43:45Z | |
| dc.date.available | 2025-08-06T15:43:45Z | |
| dc.date.issued | 2025-04-10 | |
| dc.description.abstract | We introduce the first provable method for learning a symmetric linear dynamical system of arbitrarily high effective memory. This allows us to distill the convolutional layers in a leading hybrid state space model, FlashSTU, into O(1) linear dynamical systems, merging Transformer and RNN architectures in a manner suitable for scaling and with application to language modeling and other sequential processing tasks. | |
| dc.identifier.uri | https://theses-dissertations.princeton.edu/handle/88435/dsp01z890rx705 | |
| dc.language.iso | en_US | |
| dc.title | SpectraLDS: Distilling Spectral Filters into Constant-Time Recurrent Models | |
| dc.type | Princeton University Senior Theses | |
| dspace.entity.type | Publication | |
| dspace.workflow.startDateTime | 2025-04-10T22:36:05.888Z | |
| pu.certificate | Optimization and Quantitative Decision Science | |
| pu.contributor.authorid | 920245534 | |
| pu.date.classyear | 2025 | |
| pu.department | Computer Science |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- written_final_report.pdf
- Size:
- 1.07 MB
- Format:
- Adobe Portable Document Format
Download
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 100 B
- Format:
- Item-specific license agreed to upon submission
- Description:
Download