Audio Source Separation Using Variational Autoencoders and Weak Class Supervision

Kırbız, Serap; Karamatlı, Ertuğ; Cemgil, Ali Taylan

Files

08769885.pdf (506.35 KB)

Date

2019

Authors

Kırbız, Serap

Karamatlı, Ertuğ

Cemgil, Ali Taylan

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Abstract

In this letter, we propose a source separation method that is trained by observing only the mixtures and the class labels of the sources present in each mixture, without any access to isolated sources. Since our method does not require source class labels for every time-frequency bin but only a single label for each source constituting the mixture signal, we call this scenario weak class supervision. We associate a variational autoencoder (VAE) with each source class within a nonnegative (compositional) model. Each VAE provides a prior model to identify the signal from its associated class in a sound mixture. After training the model on mixtures, we obtain a generative model for each source class and demonstrate our method on one-second mixtures of utterances of digits from 0 to 9. We show that the separation performance obtained by source class supervision is as good as the performance obtained by source signal supervision.
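
The compositional idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the decoder here is a hypothetical stand-in (a random linear map followed by softplus), and the names `toy_decoder`, `compose_mixture`, and all array sizes are assumptions chosen only for illustration. It shows one reading of the abstract: each source class has its own generative model with nonnegative output, and the weak label (which classes are present) selects which class models are summed to explain the mixture.

```python
import numpy as np

def softplus(x):
    # Numerically stable softplus; keeps every output nonnegative.
    return np.logaddexp(0.0, x)

def toy_decoder(weights, z):
    # Hypothetical stand-in for one class's VAE decoder:
    # latent codes (frames x latent) -> nonnegative spectrogram (frames x bins).
    return softplus(z @ weights)

def compose_mixture(decoders, latents, present):
    # Use only the class models named by the weak label and sum their outputs.
    parts = {c: toy_decoder(decoders[c], latents[c]) for c in present}
    return parts, sum(parts.values())

rng = np.random.default_rng(0)
n_classes, n_frames, n_latent, n_bins = 10, 8, 4, 16  # illustrative sizes

# One random "decoder" and one set of latent codes per digit class (0 to 9).
decoders = {c: rng.normal(size=(n_latent, n_bins)) for c in range(n_classes)}
latents = {c: rng.normal(size=(n_frames, n_latent)) for c in range(n_classes)}

present = [3, 7]  # weak label: the mixture contains the digits 3 and 7
parts, mixture_hat = compose_mixture(decoders, latents, present)
```

In this sketch, `parts[3]` and `parts[7]` play the role of per-class source estimates and `mixture_hat` is their nonnegative sum. In a trained system the decoders and latent codes would be learned from mixtures rather than drawn at random.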

ORCID

Ertuğ Karamatlı

Serap Kırbız

Keywords

Weak supervision, Source separation, Variational autoencoders

Citation

Karamatlı, E., Cemgil, A. T., & Kırbız, S. (2019). Audio source separation using variational autoencoders and weak class supervision. IEEE Signal Processing Letters, 26(9), 1349-1353.

WoS Q

Q2

Scopus Q

Q1

Source

IEEE Signal Processing Letters

Volume

26

Issue

9

Start Page

1349

End Page

1353

URI

https://hdl.handle.net/20.500.11779/1128

Collections

WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection
Elektrik Elektronik Mühendisliği Bölümü Koleksiyonu / Department of Electrical and Electronics Engineering Collection

Web of Science™ Citations

22

checked on Mar 02, 2026

Page Views

238

checked on Mar 02, 2026

Downloads

44

checked on Mar 02, 2026
