Improving the Usage of Subword-Based Units for Turkish Speech Recognition
Loading...
Date
2020
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
IEEE
Open Access Color
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Subword units are often utilized to achieve better performance in speech recognition because of the high number of observed words in agglutinative languages. In this study, the proper use of subword units is explored in recognition by a reconsideration of details such as silence modeling and position-dependent phones. A modified lexicon by finite-state transducers is implemented to represent the subword units correctly. Also, we experiment with different types of word boundary markers and achieve the best performance by adding a marker both to the left and right side of a subword unit. In our experiments on a Turkish broadcast news dataset, the subword models do outperform word-based models and naive subword implementations. Results show that using proper subword units leads to a relative word error rate (WER) reductions, which is 2.4%, compared with the word level automatic speech recognition (ASR) system for Turkish.
Description
ORCID
Keywords
Konuşma tanıma, Language modelling, Acoustic modelling, Speech recognition, Akustik modelleme, Dil modelleme
Turkish CoHE Thesis Center URL
Fields of Science
03 medical and health sciences, 0305 other medical science
Citation
G. Çetinkaya, E. Arısoy and M. Saraçlar, (5-7 Oct. 2020). Improving the Usage of Subword-Based Units for Turkish Speech Recognition, 2020 28th Signal Processing and Communications Applications Conference (SIU), pp. 1-4, doi: 10.1109/SIU49456.2020.9302043.
WoS Q
N/A
Scopus Q
N/A

OpenCitations Citation Count
1
Source
2020 28th Signal Processing and Communications Applications Conference (SIU)
Volume
Issue
Start Page
1-4
End Page
4
PlumX Metrics
Citations
CrossRef : 1
Scopus : 2
Captures
Mendeley Readers : 3
SCOPUS™ Citations
2
checked on Feb 03, 2026
Web of Science™ Citations
1
checked on Feb 03, 2026
Page Views
200
checked on Feb 03, 2026
Downloads
27
checked on Feb 03, 2026
Google Scholar™


