Dialogue Enhancement Using Kernel Additive Modelling

Loading...
Thumbnail Image

Date

2015

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Institute of Electrical and Electronics Engineers Inc.

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

Research Projects

Journal Issue

Abstract

It is a major problem for the sound engineers to find the right balance between the dialogue signals and the ambient sources. This problem also makes one of the main causes of the audience concerns. The audience wants to arrange the sound balance based on their personal preferences, listening environment and their hearing. In this work, a method is proposed for enhancing the dialogue signals in stereo recordings that consist of more than one source. The kernel additive modelling that has been used successfully in sound source separation is used to extract the dialogues and the ambient sources from the movie sounds. The separated dialogue and ambient sources can later be upmixed by the user to make a personal mix. The separation performance of the proposed method is evaluated on the sounds generated by mixing the sources which were taken from the only dialogue and only music parts of the movies. It has been shown that the Kernel Additive Modelling (KAM) based method can be successfully used for dialogue enhancement. © 2015 IEEE.

Description

Keywords

Sound source separation, Kernel additive modelling, Additives, Source separation, Upmixing, Separation performance, Audition, Dialogue enhancement, Stereophonic recordings, Separation

Turkish CoHE Thesis Center URL

Fields of Science

03 medical and health sciences, 0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology, 0305 other medical science

Citation

Kırbız, S., Liutkus, A., & Cemgil, A. T. (May, 2015). Dialogue enhancement using kernel additive modelling. In 2015 23nd Signal Processing and Communications Applications Conference (SIU).IEEE. pp. 2242-2245.

WoS Q

N/A

Scopus Q

N/A
OpenCitations Logo
OpenCitations Citation Count
1

Source

2015 23nd Signal Processing and Communications Applications Conference (SIU)

Volume

Issue

Start Page

2242

End Page

2245
PlumX Metrics
Citations

Scopus : 0

Captures

Mendeley Readers : 1

Page Views

251

checked on Feb 03, 2026

Downloads

34

checked on Feb 03, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.0

Sustainable Development Goals

SDG data is not available