Audio Source Separation Using Variational Autoencoders and Weak Class Supervision
Loading...
Date
2019
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
In this letter, we propose a source separation method that is trained by observing the mixtures and the class labels of the sources present in the mixture without any access to isolated sources. Since our method does not require source class labels for every time-frequency bin but only a single label for each source constituting the mixture signal, we call this scenario as weak class supervision. We associate a variational autoencoder (VAE) with each source class within a non negative (compositional) model. Each VAE provides a prior model to identify the signal from its associated class in a sound mixture. After training the model on mixtures, we obtain a generative model for each source class and demonstrate our method on one-second mixtures of utterances of digits from 0 to 9. We show that the separation performance obtained by source class supervision is as good as the performance obtained by source signal supervision.
Description
Keywords
Weak supervision, Source separation, Variational autoencoders
Turkish CoHE Thesis Center URL
Fields of Science
Citation
Karamatli, E., Cemgil, AT., & Kırbız, S. (2019). Audio source separation using variational autoencoders and weak class supervision. IEEE Signal Processing Letters. 26(9), 1349-1353.
WoS Q
Q2
Scopus Q
Q1
Source
IEEE Signal Processing Letters
Volume
26
Issue
9
Start Page
1349
End Page
1353
Web of Science™ Citations
21
checked on Dec 06, 2025
Page Views
234
checked on Dec 06, 2025
Downloads
8
checked on Dec 06, 2025