An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition

Loading...
Thumbnail Image

Date

2021

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier

Open Access Color

HYBRID

Green Open Access

Yes

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Top 10%
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

Named entity recognition (NER) is an extensively studied task that extracts and classifies named entities in a text. NER is crucial not only in downstream language processing applications such as relation extraction and question answering but also in large scale big data operations such as real-time analysis of online digital media content. Recent research efforts on Turkish, a less studied language with morphologically rich nature, have demonstrated the effectiveness of neural architectures on well-formed texts and yielded state-of-the art results by formulating the task as a sequence tagging problem. In this work, we empirically investigate the use of recent neural architectures (Bidirectional long short-term memory (BiLSTM) and Transformer-based networks) proposed for Turkish NER tagging in the same setting. Our results demonstrate that transformer-based networks which can model long-range context overcome the limitations of BiLSTM networks where different input features at the character, subword, and word levels are utilized. We also propose a transformer-based network with a conditional random field (CRF) layer that leads to the state-of-the-art result (95.95% f-measure) on a common dataset. Our study contributes to the literature that quantifies the impact of transfer learning on processing morphologically rich languages.

Description

Keywords

Turkish, Named entity recognition, Transfer learning, Digital media industry, Crf, Named entity recognition, FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Digital media industry, Turkish, CRF, Computation and Language (cs.CL), Transfer learning, Machine Learning (cs.LG)

Turkish CoHE Thesis Center URL

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q1

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
25

Source

Expert Systems with Applications

Volume

182

Issue

15

Start Page

1

End Page

11
PlumX Metrics
Citations

CrossRef : 20

Scopus : 28

Captures

Mendeley Readers : 46

SCOPUS™ Citations

28

checked on Feb 03, 2026

Web of Science™ Citations

20

checked on Feb 03, 2026

Page Views

288

checked on Feb 03, 2026

Downloads

499

checked on Feb 03, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.28220715

Sustainable Development Goals

SDG data is not available