An Xml Parser for Turkish Wikipedia

Loading...
Thumbnail Image

Date

2019

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Research Projects

Journal Issue

Abstract

Nowadays, visual and written data that can be easily accessed over the internet has enabled the development of research in many different fields. However, the availability of data is not sufficient by itself. It is of great importance that these data can be effectively utilized and interpreted in accordance with the requirements. Access to written content in the Wikipedia encyclopedia, which is becoming increasingly common in Turkish natural language processing, can be done via XML dumps. In this study, our aim is to develop and demonstrate the applicability of an XML parser for the processing of Turkish Wikipedia dumps. The use of the open-source parser, which allows information extraction at different levels of granularity, is reported on pages containing biography infoboxes and textual contents.

Description

Keywords

Xml, Encyclopedias, Natural language processing, Dogs, Electronic publishing, Internet

Turkish CoHE Thesis Center URL

Fields of Science

Citation

Vardar, U. F., Devran, I. T., Demir, S., & 2019 27th Signal Processing and Communications Applications Conference (SIU). (April 01, 2019). An XML Parser for Turkish Wikipedia. (Sivas; Turkey) 1-4.

WoS Q

N/A

Scopus Q

N/A

Source

27th Signal Processing and Communications Applications Conference, SIU 2019

Volume

Issue

Start Page

1

End Page

4
SCOPUS™ Citations

3

checked on Dec 06, 2025

Page Views

175

checked on Dec 06, 2025

Downloads

8

checked on Dec 06, 2025

Google Scholar Logo
Google Scholar™

Sustainable Development Goals

SDG data is not available