An XML Parser for Turkish Wikipedia
| dc.contributor.author | Demir, Şeniz | |
| dc.contributor.author | Vardar, Uluç Furkan | |
| dc.contributor.author | Devran, İlkay Tevfik | |
| dc.date.accessioned | 2019-09-16T09:37:04Z | |
| dc.date.available | 2019-09-16T09:37:04Z | |
| dc.date.issued | 2019 | |
| dc.description.WoSDocumentType | Proceedings Paper | |
| dc.description.WoSIndexDate | 2019 | |
| dc.description.abstract | Visual and written data that are easily accessible over the internet have enabled research in many different fields. However, the availability of data is not sufficient by itself; the data must also be utilized and interpreted effectively in accordance with the requirements at hand. The written content of Wikipedia, which is increasingly used in Turkish natural language processing, can be accessed via XML dumps. In this study, our aim is to develop an XML parser for processing Turkish Wikipedia dumps and to demonstrate its applicability. The use of the open-source parser, which allows information extraction at different levels of granularity, is reported on pages containing biography infoboxes and textual content. | |
| dc.identifier.citation | Vardar, U. F., Devran, I. T., & Demir, S. (April 1, 2019). An XML Parser for Turkish Wikipedia. 2019 27th Signal Processing and Communications Applications Conference (SIU), Sivas, Turkey, 1-4. | |
| dc.identifier.isbn | 9781728119045 | |
| dc.identifier.issn | 2165-0608 | |
| dc.identifier.scopus | 2-s2.0-85071983888 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.11779/1134 | |
| dc.language.iso | en | |
| dc.publisher | IEEE | |
| dc.relation.ispartof | 27th Signal Processing and Communications Applications Conference, SIU 2019 | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.subject | XML | |
| dc.subject | Encyclopedias | |
| dc.subject | Natural language processing | |
| dc.subject | Dogs | |
| dc.subject | Electronic publishing | |
| dc.subject | Internet | |
| dc.title | An XML Parser for Turkish Wikipedia | |
| dc.title.alternative | Türkçe vikipedi için bir XML ayrıştırıcı | |
| dc.type | Conference Object | |
| dspace.entity.type | Publication | |
| gdc.author.institutional | Demir, Şeniz | |
| gdc.coar.access | metadata only access | |
| gdc.coar.type | text::conference output | |
| gdc.description.department | Faculty of Engineering, Department of Computer Engineering | |
| gdc.description.endpage | 4 | |
| gdc.description.publicationcategory | Conference Item - International - Institutional Faculty Member | |
| gdc.description.scopusquality | N/A | |
| gdc.description.startpage | 1 | |
| gdc.description.woscitationindex | Conference Proceedings Citation Index - Science | |
| gdc.description.wosquality | N/A | |
| gdc.identifier.wos | WOS:000518994300096 | |
| gdc.publishedmonth | April | |
| gdc.scopus.citedcount | 3 | |
| gdc.wos.citedcount | 0 | |
| gdc.wos.publishedmonth | April | |
| gdc.wos.yokperiod | YÖK - 2018-19 | |
| relation.isAuthorOfPublication | 93fa0200-13f7-446a-bdc2-118401cab062 | |
| relation.isAuthorOfPublication.latestForDiscovery | 93fa0200-13f7-446a-bdc2-118401cab062 | |
| relation.isOrgUnitOfPublication | 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3 | |
| relation.isOrgUnitOfPublication | 0d54cd31-4133-46d5-b5cc-280b2c077ac3 | |
| relation.isOrgUnitOfPublication | a6e60d5c-b0c7-474a-b49b-284dc710c078 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3 |
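
The access path the abstract describes, reading page content out of a Wikipedia XML dump and pulling infobox fields from it, can be illustrated with a short sketch. This is not the authors' parser, whose design is not described in this record. The sample dump, the page content, the template name, and the helper names `local_name`, `iter_pages`, and `infobox_fields` are all assumptions made for illustration, and the infobox reader is deliberately naive.

```python
import io
import re
import xml.etree.ElementTree as ET

# Hypothetical, hand-written stand-in for a MediaWiki XML dump. Real dumps
# declare a versioned export namespace, so tags are matched by local name.
SAMPLE_DUMP = """<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Örnek Kişi</title>
    <revision>
      <text>{{Kişi bilgi kutusu
| ad = Örnek Kişi
| doğum_yeri = Sivas
}}
'''Örnek Kişi''' bir yazardır.</text>
    </revision>
  </page>
</mediawiki>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(stream):
    """Yield (title, wikitext) pairs, clearing each page element after use."""
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        title, text = None, ""
        for child in elem.iter():
            name = local_name(child.tag)
            if name == "title":
                title = child.text
            elif name == "text":
                text = child.text or ""
        yield title, text
        elem.clear()  # release the page's children to limit memory growth


def infobox_fields(wikitext):
    """Naively read '| key = value' lines from the first template.

    Assumption: no nested templates and no multi-line values.
    """
    match = re.search(r"\{\{(.*?)\}\}", wikitext, flags=re.DOTALL)
    if not match:
        return {}
    fields = {}
    for line in match.group(1).splitlines():
        if line.startswith("|") and "=" in line:
            key, value = line[1:].split("=", 1)
            fields[key.strip()] = value.strip()
    return fields


pages = list(iter_pages(io.BytesIO(SAMPLE_DUMP.encode("utf-8"))))
title, text = pages[0]
fields = infobox_fields(text)
```

Streaming with `iterparse` rather than loading the whole tree is the usual choice for dumps of this size. A production parser would also need a real wikitext grammar, since templates nest and field values can span several lines.
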
Files
Original bundle
- Name: An-XML.pdf
- Size: 1.28 MB
- Format: Adobe Portable Document Format
- Description: Conference File