Audio Source Separation Using Variational Autoencoders and Weak Class Supervision

dc.contributor.author Kırbız, Serap
dc.contributor.author Karamatlı, Ertuğ
dc.contributor.author Cemgil, Ali Taylan
dc.contributor.author Karamatli, Ertug
dc.contributor.author Cemgil, Ali Taylan
dc.date.accessioned 2019-08-23T05:48:05Z
dc.date.available 2019-08-23T05:48:05Z
dc.date.issued 2019-09-01
dc.description.abstract In this letter, we propose a source separation method that is trained by observing the mixtures and the class labels of the sources present in the mixture without any access to isolated sources. Since our method does not require source class labels for every time-frequency bin but only a single label for each source constituting the mixture signal, we call this scenario as weak class supervision. We associate a variational autoencoder (VAE) with each source class within a non negative (compositional) model. Each VAE provides a prior model to identify the signal from its associated class in a sound mixture. After training the model on mixtures, we obtain a generative model for each source class and demonstrate our method on one-second mixtures of utterances of digits from 0 to 9. We show that the separation performance obtained by source class supervision is as good as the performance obtained by source signal supervision.
dc.description.sponsorship This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) under Grant 215E076.
dc.description.sponsorship Scientific and Technological Research Council of Turkey (TUBITAK) [215E076]
dc.identifier.citation Karamatli, E., Cemgil, AT., & Kırbız, S. (2019). Audio source separation using variational autoencoders and weak class supervision. IEEE Signal Processing Letters. 26(9), 1349-1353.
dc.identifier.doi 10.1109/LSP.2019.2929440
dc.identifier.issn 1070-9908
dc.identifier.issn 1558-2361
dc.identifier.uri https://hdl.handle.net/20.500.11779/1128
dc.language.iso en
dc.publisher Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.ispartof IEEE Signal Processing Letters
dc.rights info:eu-repo/semantics/closedAccess
dc.subject Weak supervision
dc.subject Source separation
dc.subject Variational autoencoders
dc.subject Variational Autoencoders
dc.subject Weak Supervision
dc.subject Source Separation
dc.title Audio Source Separation Using Variational Autoencoders and Weak Class Supervision
dc.type Article
dspace.entity.type Publication
gdc.author.id Ertuğ Karamatlı / 0000-0001-8839-0821
gdc.author.id Serap Kırbız / 0000-0001-7718-3683
gdc.author.institutional Kırbız, Serap
gdc.author.wosid Kırbız, Serap/LPP-8018-2024
gdc.author.wosid Cemgil, Ali/A-3068-2016
gdc.coar.access metadata only access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department Mühendislik Fakültesi, Elektrik Elektronik Mühendisliği Bölümü
gdc.description.endpage 1353
gdc.description.issue 9
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
gdc.description.scopusquality Q1
gdc.description.startpage 1349
gdc.description.volume 26
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q2
gdc.identifier.openalex W2963933499
gdc.identifier.wos WOS:000480311900003
gdc.index.type WoS
gdc.openalex.collaboration National
gdc.openalex.fwci 2.50
gdc.openalex.normalizedpercentile 0.90
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 22
gdc.plumx.crossrefcites 9
gdc.plumx.mendeley 27
gdc.publishedmonth Eylül
gdc.wos.citedcount 22
gdc.wos.collaboration Uluslararası işbirliği ile yapılmayan - HAYIR
gdc.wos.documenttype Article
gdc.wos.indexdate 2019
gdc.wos.publishedmonth Eylül
gdc.yokperiod YÖK - 2019-20
relation.isAuthorOfPublication.latestForDiscovery 552e4b0c-955f-4b93-925b-08cb2e6c5cc0
relation.isOrgUnitOfPublication.latestForDiscovery de19334f-6a5b-4f7b-9410-9433c48d1e5a

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
08769885.pdf
Size:
483.26 KB
Format:
Adobe Portable Document Format
Description:
Watermarked PDF

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: