Diffusion-Based Sound Synthesis in Music Production (FARM 2024)

Mon 2 - Sat 7 September 2024 Milan, Italy

Who

Pierre-Louis Suckrow, Christoph Johannes Weber, Sylvia Rothe

Track

FARM 2024

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 2 Sep 2024 14:22 - 14:45 at Meeting 3 - Music Generation

Abstract

In this paper, we explore the usability of generative artificial intelligence in music production through the development of a digital instrument that incorporates diffusion-based sound synthesis in its sound generation. Current text-to-audio models offer a novel method of defining sounds, which we aim to render utilizable in a music-production environment. Selected pretrained latent diffusion models, enable the synthesis of playable sounds through textual descriptions, which we incorporated into a digital instrument that integrates with standard music production tools. The resultant user interface not only allows generating but also modifying the sounds by editing model and instrument-specific parameters. We evaluated the applicability of current diffusion models with their parameters as well as the fitness of possible prompts for music production scenarios. Adapting published diffusion model pipelines for integration into the instrument, we facilitate experimentation and exploration of this innovative sound synthesis method. Our findings show that despite facing some limitations in the models’ responsiveness to specific music production contexts and the instrument’s functionality, the tool allows the development of novel and intriguing soundscapes. The instrument and code is published under https://anonymous.4open.science/r/WaveGenSynth/

Pierre-Louis Suckrow

Berlin University of the Arts, Technical University of Berlin

Christoph Johannes Weber

University of Television and Film Munich, LMU Munich

Sylvia Rothe

University of Television and Film Munich