An Xml Parser for Turkish Wikipedia

dc.contributor.author Demir, Şeniz
dc.contributor.author Vardar, Uluç Furkan
dc.contributor.author Devran, İlkay Tevfik
dc.date.accessioned 2019-09-16T09:37:04Z
dc.date.available 2019-09-16T09:37:04Z
dc.date.issued 2019-04-01
dc.description.abstract Nowadays, visual and written data that can be easily accessed over the internet has enabled the development of research in many different fields. However, the availability of data is not sufficient by itself. It is of great importance that these data can be effectively utilized and interpreted in accordance with the requirements. Access to written content in the Wikipedia encyclopedia, which is becoming increasingly common in Turkish natural language processing, can be done via XML dumps. In this study, our aim is to develop and demonstrate the applicability of an XML parser for the processing of Turkish Wikipedia dumps. The use of the open-source parser, which allows information extraction at different levels of granularity, is reported on pages containing biography infoboxes and textual contents.
dc.identifier.citation Vardar, U. F., Devran, I. T., Demir, S., & 2019 27th Signal Processing and Communications Applications Conference (SIU). (April 01, 2019). An XML Parser for Turkish Wikipedia. (Sivas; Turkey) 1-4.
dc.identifier.doi 10.1109/SIU.2019.8806399
dc.identifier.isbn 9781728119045
dc.identifier.issn 2165-0608
dc.identifier.scopus 2-s2.0-85071983888
dc.identifier.uri https://hdl.handle.net/20.500.11779/1134
dc.identifier.uri https://doi.org/10.1109/SIU.2019.8806399
dc.language.iso en
dc.publisher IEEE
dc.relation.ispartof 27th Signal Processing and Communications Applications Conference, SIU 2019
dc.relation.ispartofseries Signal Processing and Communications Applications Conference
dc.rights info:eu-repo/semantics/closedAccess
dc.subject Xml
dc.subject Encyclopedias
dc.subject Natural language processing
dc.subject Dogs
dc.subject Electronic publishing
dc.subject Internet
dc.subject XML Parser
dc.subject Information Extraction
dc.subject Wikipedia
dc.subject Biography Pages
dc.title An Xml Parser for Turkish Wikipedia
dc.title.alternative Türkçe vikipedi için bir XML ayrıştırıcı
dc.type Conference Object
dspace.entity.type Publication
gdc.author.institutional Demir, Şeniz
gdc.author.scopusid 57215362572
gdc.author.scopusid 57215341654
gdc.author.scopusid 14044928200
gdc.author.wosid Demir, Şeniz/AAB-5451-2021
gdc.coar.access metadata only access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
gdc.description.endpage 4
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
gdc.description.scopusquality N/A
gdc.description.startpage 1
gdc.description.woscitationindex Conference Proceedings Citation Index - Science
gdc.description.wosquality N/A
gdc.identifier.openalex W2969728433
gdc.identifier.wos WOS:000518994300096
gdc.index.type WoS
gdc.index.type Scopus
gdc.openalex.collaboration National
gdc.openalex.fwci 0.29
gdc.openalex.normalizedpercentile 0.67
gdc.opencitations.count 1
gdc.plumx.crossrefcites 1
gdc.plumx.mendeley 1
gdc.plumx.scopuscites 3
gdc.publishedmonth Nisan
gdc.scopus.citedcount 3
gdc.wos.citedcount 0
gdc.wos.documenttype Proceedings Paper
gdc.wos.indexdate 2019
gdc.wos.publishedmonth Nisan
gdc.yokperiod YÖK - 2018-19
relation.isAuthorOfPublication.latestForDiscovery 93fa0200-13f7-446a-bdc2-118401cab062
relation.isOrgUnitOfPublication.latestForDiscovery 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
An-XML.pdf
Size:
1.28 MB
Format:
Adobe Portable Document Format
Description:
Watermarked PDF

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: