Abstract

In this paper, we propose a data-driven approach to train a Generative Adversarial Network (GAN) conditioned on "soft-labels" distilled from the penultimate layer of an audio classifier trained on a target set of audio texture classes. We demonstrate that interpolation between such conditions or control vectors provide smooth morphing between the generated audio textures, and show similar or better audio texture morphing capability compared to the state-of-the-art methods. The proposed approach results in a well-organized latent space that generates novel audio outputs while remaining consistent with the semantics of the conditioning parameters. This is a step towards a general data-driven approach to designing generative audio models with customized controls capable of traversing out-of-distribution regions for novel sound synthesis.

Keywords:
Morphing Computer science Texture synthesis Audio signal Artificial intelligence Audio signal processing Generative grammar Interpolation (computer graphics) Speech recognition Pattern recognition (psychology) Computer vision Image texture Speech coding Image processing Image (mathematics)

Metrics

7
Cited By
1.27
FWCI (Field Weighted Citation Impact)
34
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music Technology and Sound Studies
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

DISSERTATION

All Organic Polymers Based Morphing Skin with Controllable Surface Texture

Natanael Bolson

University:   King Abdullah University of Science and Technology Repository (King Abdullah University of Science and Technology) Year: 2018
BOOK-CHAPTER

Fast Spatially Controllable Multi-dimensional Exemplar-Based Texture Synthesis and Morphing

Felix MankeBurkhard C. Wünsche

Communications in computer and information science Year: 2010 Pages: 21-34
JOURNAL ARTICLE

Automatic audio morphing

Malcolm SlaneyMichele CovellB. Lassiter

Year: 2002 Vol: 2 Pages: 1001-1004
© 2026 ScienceGate Book Chapters — All rights reserved.