A Supervised Learning Framework for Generating DJ Transitions

Hein, Michael

datacite.rights	restricted
dc.contributor.advisor	Hubert, Emma
dc.contributor.author	Hein, Michael
dc.date.accessioned	2025-08-07T12:40:01Z
dc.date.available	2025-08-07T12:40:01Z
dc.date.issued	2025-04-10
dc.description.abstract	A disc jockey (DJ) curates a seamless auditory experience by skillfully transitioning between tracks. While these transitions can sometimes involve complex loops and sound effects, their most fundamental components often involve manipulating volume and adjusting frequency ranges to blend two songs. Prior work on automating DJ transitions has largely relied on heuristics or unsupervised learning approaches such as generative adversarial networks (GANs). In this thesis, we present a unique supervised learning framework for generating DJ transitions between two tracks, providing an interpretable, data-driven alternative to previous methods. Using a dataset from 1001Tracklists containing real DJ mixes and their source tracks, we extract mel-spectrograms of the audio and train a convolutional neural network (CNN) to predict control signals that specify how volume and equalizer (EQ) bands should change over time. These predicted control signals are then applied to the source tracks to produce a transition, which is compared to the original transition from the DJ mix. To generate labeled input-output training pairs, we developed a full preprocessing pipeline that includes track-to-mix alignment using dynamic time warping (DTW), supported by both theoretical and empirical analyses of feature selection. While inspired by differentiable digital signal processing (DDSP), our learning phase operates entirely in the mel-spectrogram domain for simplicity and interpretability. We trained the model on a single example and found that it was able to replicate the corresponding ground truth transition with reasonable accuracy, offering early evidence that the task is learnable and that our framework has the capacity to produce non-trivial transitions. This work demonstrates the potential of supervised learning in generating realistic DJ transitions and lays the foundation for future research that trains on more data.
dc.identifier.uri	https://theses-dissertations.princeton.edu/handle/88435/dsp01x633f448f
dc.language.iso	en_US
dc.title	A Supervised Learning Framework for Generating DJ Transitions
dc.type	Princeton University Senior Theses
dspace.entity.type	Publication
dspace.workflow.startDateTime	2025-04-10T05:05:27.033Z
pu.certificate	Optimization and Quantitative Decision Science
pu.contributor.authorid	920284472
pu.date.classyear	2025
pu.department	Ops Research & Financial Engr
pu.minor	Statistics and Machine Learning
pu.minor	Computer Science
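
The abstract mentions track-to-mix alignment using dynamic time warping (DTW). As a rough illustration only, the following is a minimal sketch of classic DTW on toy one-dimensional sequences. The function name `dtw`, the absolute-difference cost, and the scalar inputs are assumptions made for this example; this record does not state which audio features or DTW variant the thesis actually uses.

```python
def dtw(a, b):
    """Classic DTW between two 1-D sequences (illustrative sketch only).

    Returns (total_cost, path), where path is a list of (i, j) index pairs
    aligning a[i] with b[j]. Scalars stand in for per-frame audio features.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # assumed local distance
            cost[i][j] = d + min(
                cost[i - 1][j - 1],  # advance both sequences
                cost[i - 1][j],      # advance a only
                cost[i][j - 1],      # advance b only
            )
    # Backtrack from the end to recover one optimal warping path.
    i, j = n, m
    path = []
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return cost[n][m], path


# Toy usage: the second sequence repeats one value, as if a track segment
# were stretched inside a mix.
total, path = dtw([0, 1, 2, 3], [0, 1, 1, 2, 3])
print(total, path)
```

In this toy case the path maps index 1 of the first sequence to both indices 1 and 2 of the second, which is the kind of time-stretch correspondence an alignment step would need to recover.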

Files

Original bundle

Name: Michael Hein Final Thesis.pdf
Size: 2.65 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 100 B
Format: Item-specific license agreed to upon submission

Collections

Operations Research and Financial Engineering, 2000-2025
