Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.11779/1128
Title: Audio source separation using variational autoencoders and weak class supervision
Authors: Karamatlı, Ertuğ
Cemgil, Ali Taylan
Kırbız, Serap
Keywords: Weak Supervision
Source Separation
Variational Autoencoders
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Source: Karamatli, E., Cemgil, AT., & Kırbız, S. (2019). Audio source separation using variational autoencoders and weak class supervision. IEEE Signal Processing Letters. 26(9), 1349-1353.
Abstract: In this letter, we propose a source separation method that is trained by observing the mixtures and the class labels of the sources present in the mixture without any access to isolated sources. Since our method does not require source class labels for every time-frequency bin but only a single label for each source constituting the mixture signal, we call this scenario as weak class supervision. We associate a variational autoencoder (VAE) with each source class within a non negative (compositional) model. Each VAE provides a prior model to identify the signal from its associated class in a sound mixture. After training the model on mixtures, we obtain a generative model for each source class and demonstrate our method on one-second mixtures of utterances of digits from 0 to 9. We show that the separation performance obtained by source class supervision is as good as the performance obtained by source signal supervision.
URI: https://hdl.handle.net/20.500.11779/1128
ISSN: 1070-9908
1558-2361
Appears in Collections:Elektrik Elektronik Mühendisliği Bölümü koleksiyonu
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection

Files in This Item:
File Description SizeFormat 
08769885.pdf
  Until 2089-08-23
Yayıncı Sürümü - Makale506.35 kBAdobe PDFView/Open    Request a copy
Show full item record



CORE Recommender

WEB OF SCIENCETM
Citations

18
checked on Jun 23, 2024

Page view(s)

2
checked on Jun 26, 2024

Google ScholarTM

Check





Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.