Developing an Automatic Transcription and Retrieval System for Spoken Lectures in Turkish

dc.contributor.author Arısoy, Ebru
dc.date.accessioned 2019-02-28T13:04:26Z
dc.date.accessioned 2019-02-28T11:08:19Z
dc.date.available 2019-02-28T13:04:26Z
dc.date.available 2019-02-28T11:08:19Z
dc.date.issued 2017
dc.description ##nofulltext##
dc.description Ebru Arısoy (MEF Author)
dc.description.WoSDocumentType Proceedings Paper
dc.description.WoSIndexDate 2017
dc.description.abstract With the increase of online video lectures, using speech and language processing technologies for education has become quite important. This paper presents an automatic transcription and retrieval system developed for processing spoken lectures in Turkish. The main steps in the system are automatic transcription of Turkish video lectures using a large vocabulary continuous speech recognition (LVCSR) system and finding keywords on the lattices obtained from the LVCSR system using a speech retrieval system based on keyword search. While developing this system, first a state-of-the-art LVCSR system was developed for Turkish using advance acoustic modeling methods, then keywords were extracted automatically front word sequences in the reference transcriptions of video lectures, and a speech retrieval system was developed for searching these keywords in the lattice output of the LVCSR system. The spoken lecture processing system yields 14.2% word error rate and 0.86 maximum term weighted value on the test data.
dc.identifier.citation Arisoy, E., (2017). Developing an Automatic Transcription and Retrieval System for Spoken Lectures in Turkish. Conference: 25th Signal Processing and Communications Applications Conference (SIU) Location: Antalya, TURKEY
dc.identifier.issn 2165-0608
dc.identifier.scopus 2-s2.0-85026293796
dc.identifier.uri https://hdl.handle.net/20.500.11779/695
dc.language.iso en
dc.relation.ispartof Conference: 25th Signal Processing and Communications Applications Conference (SIU) Location: Antalya, TURKEY Date: MAY 15-18, 2017
dc.rights info:eu-repo/semantics/closedAccess
dc.subject Large vocabulary continuous speech recognition
dc.subject Speech retrieval
dc.subject Speech and language processing for educational technologies
dc.title Developing an Automatic Transcription and Retrieval System for Spoken Lectures in Turkish
dc.title.alternative Türkçe sözlü ders anlatımları için otomatik yazılandırma ve geri getirim sistemi geliştirilmesi
dc.type Conference Object
dspace.entity.type Publication
gdc.author.id Ebru Arısoy / 0000-0002-8311-3611
gdc.author.institutional Arısoy, Ebru
gdc.author.institutional Arısoy Saraçlar, Ebru
gdc.coar.access metadata only access
gdc.coar.type text::conference output
gdc.description.department Mühendislik Fakültesi, Elektrik Elektronik Mühendisliği Bölümü
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
gdc.description.woscitationindex Conference Proceedings Citation Index - Science
gdc.identifier.wos WOS:000413813100238
gdc.publishedmonth Mayıs
gdc.scopus.citedcount 5
gdc.wos.citedcount 3
gdc.wos.publishedmonth Mayıs
gdc.wos.yokperiod YÖK - 2016-17
relation.isAuthorOfPublication 0b895153-5793-4e46-bc2f-06a28b30f531
relation.isAuthorOfPublication.latestForDiscovery 0b895153-5793-4e46-bc2f-06a28b30f531
relation.isOrgUnitOfPublication de19334f-6a5b-4f7b-9410-9433c48d1e5a
relation.isOrgUnitOfPublication 0d54cd31-4133-46d5-b5cc-280b2c077ac3
relation.isOrgUnitOfPublication a6e60d5c-b0c7-474a-b49b-284dc710c078
relation.isOrgUnitOfPublication.latestForDiscovery de19334f-6a5b-4f7b-9410-9433c48d1e5a

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Developing an automatic transcription and retrieval system for spoken lectures in Turkish.pdf
Size:
159.57 KB
Format:
Adobe Portable Document Format
Description:
Konferans Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: