Improving Synthesizer Programming From Variational Autoencoders Latent Space

Gwendal Le Vaillant; Thierry Dutoit; Sebastien Dekeyser

doi:10.23919/dafx51585.2021.9768218

Year: 2021 Pages: 276-283

Abstract

Deep neural networks have recently been applied to automatic synthesizer programming, i.e., finding the values of sound synthesis parameters that best reproduce a given input sound. This paper focuses on generative models, which can infer parameters, generate new sets of parameters, and perform smooth morphing between sounds. We introduce new models that ensure scalability and increase performance by representing parameters heterogeneously, as numerical and categorical random variables. Moreover, we propose a spectral variational autoencoder architecture with multi-channel input to improve inference of parameters related to the pitch and intensity of input sounds. Model performance was evaluated according to several criteria, including parameter estimation error and audio reconstruction accuracy. Training and evaluation were performed using a dataset of 30k presets, which is published with this paper. The results demonstrate significant improvements in parameter inference and audio accuracy, and show that the presented models can be used with subsets or full sets of synthesizer parameters.
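
To make the idea of heterogeneous parameter representations more concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes numerical parameters are normalized to [0, 1] and scored with a mean squared error, while each categorical parameter is scored with a cross-entropy over its own set of classes. It also shows linear interpolation between two latent codes as one simple way to picture morphing. The function names `heterogeneous_param_loss` and `lerp_latent`, the unweighted sum of the two loss terms, and the array shapes are all assumptions made for illustration.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    z = logits - np.max(logits, axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)


def heterogeneous_param_loss(num_pred, num_true, cat_logits, cat_true):
    """Hypothetical loss mixing numerical and categorical parameters.

    num_pred, num_true: (batch, n_num) arrays, assumed scaled to [0, 1].
    cat_logits: list of (batch, n_classes_k) arrays, one per categorical parameter.
    cat_true: (batch, n_cat) integer array of class indices.
    """
    # Numerical parameters: mean squared error.
    num_loss = float(np.mean((num_pred - num_true) ** 2))

    # Categorical parameters: cross-entropy, summed over parameters.
    cat_loss = 0.0
    for k, logits in enumerate(cat_logits):
        probs = softmax(logits)
        picked = probs[np.arange(len(probs)), cat_true[:, k]]
        cat_loss += float(np.mean(-np.log(picked + 1e-12)))

    # Assumption: both terms are simply added without weighting.
    return num_loss + cat_loss


def lerp_latent(z_a, z_b, steps):
    """Linearly interpolate between two latent vectors (illustrative morphing path)."""
    t = np.linspace(0.0, 1.0, steps)[:, None]
    return (1.0 - t) * z_a + t * z_b


if __name__ == "__main__":
    num_true = np.array([[0.2, 0.8]])
    num_pred = np.array([[0.25, 0.7]])
    cat_true = np.array([[1]])                      # e.g. index of a waveform type
    cat_logits = [np.array([[0.1, 2.0, -1.0]])]     # one categorical parameter, 3 classes
    print(heterogeneous_param_loss(num_pred, num_true, cat_logits, cat_true))
    print(lerp_latent(np.zeros(4), np.ones(4), steps=5).shape)
```

In such a setup, each point along the interpolated latent path would be passed to a decoder to obtain a set of synthesizer parameters; the decoder itself is outside the scope of this sketch.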

Keywords:

Computer science; Autoencoder; Scalability; Inference; Artificial neural network; Categorical variable; Artificial intelligence; Representation (politics); Machine learning; Speech recognition; Algorithm

Metrics

FWCI (Field Weighted Citation Impact): 1.87

Citation Normalized Percentile: 0.87

Topics

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music Technology and Sound Studies

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Related Documents

Creating Latent Representations of Synthesizer Patches using Variational Autoencoders

Adaptive Compression of the Latent Space in Variational Autoencoders

Facial Attribute Editing by Latent Space Adversarial Variational Autoencoders

Disentangling the Latent Space of (Variational) Autoencoders for NLP