An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition
Loading...
Date
2021
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Open Access Color
HYBRID
Green Open Access
Yes
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Named entity recognition (NER) is an extensively studied task that extracts and classifies named entities in a text. NER is crucial not only in downstream language processing applications such as relation extraction and question answering but also in large scale big data operations such as real-time analysis of online digital media content. Recent research efforts on Turkish, a less studied language with morphologically rich nature, have demonstrated the effectiveness of neural architectures on well-formed texts and yielded state-of-the art results by formulating the task as a sequence tagging problem. In this work, we empirically investigate the use of recent neural architectures (Bidirectional long short-term memory (BiLSTM) and Transformer-based networks) proposed for Turkish NER tagging in the same setting. Our results demonstrate that transformer-based networks which can model long-range context overcome the limitations of BiLSTM networks where different input features at the character, subword, and word levels are utilized. We also propose a transformer-based network with a conditional random field (CRF) layer that leads to the state-of-the-art result (95.95% f-measure) on a common dataset. Our study contributes to the literature that quantifies the impact of transfer learning on processing morphologically rich languages.
Description
ORCID
Keywords
Turkish, Named entity recognition, Transfer learning, Digital media industry, Crf, Named entity recognition, FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Digital media industry, Turkish, CRF, Computation and Language (cs.CL), Transfer learning, Machine Learning (cs.LG)
Turkish CoHE Thesis Center URL
Fields of Science
0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology
Citation
WoS Q
Q1
Scopus Q
Q1

OpenCitations Citation Count
25
Source
Expert Systems with Applications
Volume
182
Issue
15
Start Page
1
End Page
11
PlumX Metrics
Citations
CrossRef : 20
Scopus : 28
Captures
Mendeley Readers : 46
Google Scholar™

OpenAlex FWCI
0.28220715
Sustainable Development Goals
3
GOOD HEALTH AND WELL-BEING

4
QUALITY EDUCATION

5
GENDER EQUALITY

9
INDUSTRY, INNOVATION AND INFRASTRUCTURE

11
SUSTAINABLE CITIES AND COMMUNITIES

16
PEACE, JUSTICE AND STRONG INSTITUTIONS


