MIDI-DDSP: Hierarchical modeling of music for detailed control

Yusong Wu; Ethan Manilow; Yi Deng; Rigel Jacob Swavely; Kyle Kastner; TIm Cooijmans; Aaron Courville; Anna Huang; Jesse Engel

MIDI-DDSP: Hierarchical modeling of music for detailed control

Yusong Wu

Ethan Manilow

Yi Deng

Rigel Jacob Swavely

Kyle Kastner

TIm Cooijmans

Aaron Courville

Anna Huang

Jesse Engel

ICLR 2022 (2022) (to appear)

Download Google Scholar

Abstract

Musical expression requires control of both \textit{what} notes that are played, and \textit{how} they are performed. Conventional audio synthesizers provide detailed expressive controls, but at the cost of realism. Black-box neural audio synthesis and concatenative samplers can produce realistic audio, but have few mechanisms for control. In this work, we introduce MIDI-DDSP a hierarchical model of musical instruments that enables both realistic neural audio synthesis and detailed user control. Starting from interpretable Differentiable Digital Signal Processing (DDSP) synthesis parameters, we infer musical notes and high-level properties of their expressive performance (such as timbre, vibrato, dynamics, and articulation). This creates a 3-level hierarchy (notes, performance, synthesis) that affords individuals the option to intervene at each level, or utilize trained priors (performance given notes, synthesis given performance) for creative assistance. Through quantitative experiments and listening tests, we demonstrate that this hierarchy can reconstruct high-fidelity audio, accurately predict performance attributes for a note sequence, independently manipulate the attributes of a given performance, and as a complete system, generate realistic audio from a novel note sequence. By utilizing an interpretable hierarchy, with multiple levels of granularity, MIDI-DDSP opens the door to assistive tools to empower individuals across a diverse range of musical experience.

Research Areas

Machine intelligence

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

MIDI-DDSP: Hierarchical modeling of music for detailed control

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs