An Xml Parser for Turkish Wikipedia

dc.contributor.author Demir, Şeniz
dc.contributor.author Vardar, Uluç Furkan
dc.contributor.author Devran, İlkay Tevfik
dc.contributor.other 02.02. Department of Computer Engineering
dc.contributor.other 02. Faculty of Engineering
dc.contributor.other 01. MEF University
dc.date.accessioned 2019-09-16T09:37:04Z
dc.date.available 2019-09-16T09:37:04Z
dc.date.issued 2019
dc.description.WoSDocumentType Proceedings Paper
dc.description.WoSIndexDate 2019
dc.description.abstract Nowadays, visual and written data that can be easily accessed over the internet has enabled the development of research in many different fields. However, the availability of data is not sufficient by itself. It is of great importance that these data can be effectively utilized and interpreted in accordance with the requirements. Access to written content in the Wikipedia encyclopedia, which is becoming increasingly common in Turkish natural language processing, can be done via XML dumps. In this study, our aim is to develop and demonstrate the applicability of an XML parser for the processing of Turkish Wikipedia dumps. The use of the open-source parser, which allows information extraction at different levels of granularity, is reported on pages containing biography infoboxes and textual contents.
dc.identifier.citation Vardar, U. F., Devran, I. T., Demir, S., & 2019 27th Signal Processing and Communications Applications Conference (SIU). (April 01, 2019). An XML Parser for Turkish Wikipedia. (Sivas; Turkey) 1-4.
dc.identifier.isbn 9781728119045
dc.identifier.issn 2165-0608
dc.identifier.scopus 2-s2.0-85071983888
dc.identifier.uri https://hdl.handle.net/20.500.11779/1134
dc.language.iso en
dc.publisher IEEE
dc.relation.ispartof 27th Signal Processing and Communications Applications Conference, SIU 2019
dc.rights info:eu-repo/semantics/closedAccess
dc.subject Xml
dc.subject Encyclopedias
dc.subject Natural language processing
dc.subject Dogs
dc.subject Electronic publishing
dc.subject Internet
dc.title An Xml Parser for Turkish Wikipedia
dc.title.alternative Türkçe vikipedi için bir XML ayrıştırıcı
dc.type Conference Object
dspace.entity.type Publication
gdc.author.institutional Demir, Şeniz
gdc.author.institutional Demir, Şeniz
gdc.coar.access metadata only access
gdc.coar.type text::conference output
gdc.description.department Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
gdc.description.endpage 4
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
gdc.description.scopusquality N/A
gdc.description.startpage 1
gdc.description.woscitationindex Conference Proceedings Citation Index - Science
gdc.description.wosquality N/A
gdc.identifier.wos WOS:000518994300096
gdc.publishedmonth Nisan
gdc.scopus.citedcount 3
gdc.wos.citedcount 0
gdc.wos.publishedmonth Nisan
gdc.wos.yokperiod YÖK - 2018-19
relation.isAuthorOfPublication 93fa0200-13f7-446a-bdc2-118401cab062
relation.isAuthorOfPublication.latestForDiscovery 93fa0200-13f7-446a-bdc2-118401cab062
relation.isOrgUnitOfPublication 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3
relation.isOrgUnitOfPublication 0d54cd31-4133-46d5-b5cc-280b2c077ac3
relation.isOrgUnitOfPublication a6e60d5c-b0c7-474a-b49b-284dc710c078
relation.isOrgUnitOfPublication.latestForDiscovery 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
An-XML.pdf
Size:
1.28 MB
Format:
Adobe Portable Document Format
Description:
Konferans Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: