An Xml Parser for Turkish Wikipedia

Loading...

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

Nowadays, visual and written data that can be easily accessed over the internet has enabled the development of research in many different fields. However, the availability of data is not sufficient by itself. It is of great importance that these data can be effectively utilized and interpreted in accordance with the requirements. Access to written content in the Wikipedia encyclopedia, which is becoming increasingly common in Turkish natural language processing, can be done via XML dumps. In this study, our aim is to develop and demonstrate the applicability of an XML parser for the processing of Turkish Wikipedia dumps. The use of the open-source parser, which allows information extraction at different levels of granularity, is reported on pages containing biography infoboxes and textual contents.

Description

Keywords

Xml, Encyclopedias, Natural language processing, Dogs, Electronic publishing, Internet, XML Parser, Information Extraction, Wikipedia, Biography Pages

Fields of Science

Citation

Vardar, U. F., Devran, I. T., Demir, S., & 2019 27th Signal Processing and Communications Applications Conference (SIU). (April 01, 2019). An XML Parser for Turkish Wikipedia. (Sivas; Turkey) 1-4.

WoS Q

Scopus Q

OpenCitations Logo
OpenCitations Citation Count
1

Volume

Issue

Start Page

1

End Page

4
PlumX Metrics
Citations

CrossRef : 1

Scopus : 3

Captures

Mendeley Readers : 1

SCOPUS™ Citations

3

checked on Jun 11, 2026

Page Views

49

checked on Jun 11, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.29

Sustainable Development Goals

SDG data is not available