Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.11779/1134
Title: An XML parser for Turkish wikipedia
Other Titles: Türkçe vikipedi için bir XML ayrıştırıcı
Authors: Vardar, Uluç Furkan
Devran, İlkay Tevfik
Demir, Şeniz
Keywords: XML
Dogs
Encyclopedias
Electronic Publishing
Internet
Natural Language Processing
Publisher: IEEE
Source: Vardar, U. F., Devran, I. T., Demir, S., & 2019 27th Signal Processing and Communications Applications Conference (SIU). (April 01, 2019). An XML Parser for Turkish Wikipedia. (Sivas; Turkey) 1-4.
Abstract: Nowadays, visual and written data that can be easily accessed over the internet has enabled the development of research in many different fields. However, the availability of data is not sufficient by itself. It is of great importance that these data can be effectively utilized and interpreted in accordance with the requirements. Access to written content in the Wikipedia encyclopedia, which is becoming increasingly common in Turkish natural language processing, can be done via XML dumps. In this study, our aim is to develop and demonstrate the applicability of an XML parser for the processing of Turkish Wikipedia dumps. The use of the open-source parser, which allows information extraction at different levels of granularity, is reported on pages containing biography infoboxes and textual contents.
URI: https://hdl.handle.net/20.500.11779/1134
ISBN: 9781728119045
ISSN: 2165-0608
Appears in Collections:Bilgisayar Mühendisliği Bölümü koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection

Files in This Item:
File Description SizeFormat 
An-XML.pdf
  Until 2030-09-16
Konferans Dosyası1.31 MBAdobe PDFView/Open    Request a copy
Show full item record



CORE Recommender

SCOPUSTM   
Citations

3
checked on Aug 1, 2024

Page view(s)

6
checked on Jun 26, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.