Bilgisayar Mühendisliği Bölümü Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.11779/1940

Browse

Search Results

Now showing 1 - 6 of 6
  • Article
    Citation - WoS: 3
    Citation - Scopus: 3
    A Benchmark Dataset for Turkish Data-To Generation
    (Elsevier, 2023-01-01) Demir, Şeniz; Öktem, Seza
    In the last decades, data-to-text (D2T) systems that directly learn from data have gained a lot of attention in natural language generation. These systems need data with high quality and large volume, but unfortunately some natural languages suffer from the lack of readily available generation datasets. This article describes our efforts to create a new Turkish dataset (Tr-D2T) that consists of meaning representation and reference sentence pairs without fine-grained word alignments. We utilize Turkish web resources and existing datasets in other languages for producing meaning representations and collect reference sentences by crowdsourcing native speakers. We particularly focus on the generation of single-sentence biographies and dining venue descriptions. In order to motivate future Turkish D2T studies, we present detailed benchmarking results of different sequence-to-sequence neural models trained on this dataset. To the best of our knowledge, this work is the first of its kind that provides preliminary findings and lessons learned from the creation of a new Turkish D2T dataset. Moreover, our work is the first extensive study that presents generation performances of transformer and recurrent neural network models from meaning representations in this morphologically-rich language.
  • Article
    Citation - WoS: 1
    Citation - Scopus: 2
    Extracting, Computing, Coordination: What Does a Triphasic Erp Pattern Say About Language Processing?
    (Elsevier, 2021-11-25) Çakar, Tuna; Eken, Aykut; Cedden, Gülay
    The current study aims at contributing to the interpretation of the most prominent language-related ERP effects, N400 and P600, by investigating how neural responses to congruent and incongruent sentence endings vary, when the language processor processes the full array of the lexico-syntactic content in verbs with three affixes in canonical Turkish sentences. The ERP signals in response to three different violation conditions reveal a similar triphasic (P200/N400/P600) pattern resembling in topography and peak amplitude The P200 wave is interpreted as the extraction of meaning from written.form by generating a code which triggers the computation of neuronal ensembles in the distributed LTM (N400). The P600 potential reflects the widely distributed coordination process of activated neuronal patterns of semantic and morphosyntactic cues by connecting the generated subsets of these patterns and adapting them into the current context. It further can be deduced that these ERP components reflect cognitive rather than linguistic processes. © 2021 Informa UK Limited, trading as Taylor & Francis Group.
  • Article
    Citation - WoS: 6
    Citation - Scopus: 9
    A Data-Assisted Reliability Model for Carrier-Assisted Cold Data Storage Systems
    (Elsevier, 2020-04-01) Arslan, Şuayb Şefik; Göker, Turguy; Peng, James
    Cold data storage systems are used to allow long term digital preservation for institutions’ archive. The common functionality among cold and warm/hot data storage is that the data is stored on some physical medium for read-back at a later time. However in cold storage, write and read operations are not necessarily done in the same exact geographical location. Hence, a third party assistance is typically utilized to bring together the medium and the drive. On the other hand, the reliability modeling of such a decomposed system poses few challenges that do not necessarily exist in other warm/hot storage alternatives such as fault detection and absence of the carrier, all totaling up to the data unavailability issues. In this paper, we propose a generalized non-homogenous Markov model that encompasses the aging of the carriers in order to address the requirements of today's cold data storage systems in which the data is encoded and spread across multiple nodes for the long-term data retention. We have derived useful lower/upper bounds on the overall system availability. Furthermore, the collected field data is used to estimate parameters of a Weibull distribution to accurately predict the lifetime of the carriers in an example scale-out setting.
  • Article
    Citation - Scopus: 1
    On the Distribution of the Threshold Voltage in Multi-Level Cell Flash Memories
    (Elsevier, 2019-10-01) Pusane, Ali E; Ashrafi, Reza A; Arslan, Şuayb Şefik
    In Multi-Level Cell (MLC) memories, multiple bits of information are packed within the cell to enable higher capacity and lower cost of manufacturing compared to those of the single-level cell flash. However, because of heavy information packing, MLC memories suffer from several error sources including inter-cell interference, retention error, and random telegraph noise which make their lifetime shorter. Having so many error sources that are statistically hard to characterize makes it challenging to properly derive the underlying probability distribution of the sensed threshold voltage, which is vital for finding optimal decision rules to secure better detection performance and hence better lifetime. Although several recent works have already considered this problem, they mostly recourse to few loose assumptions that are far from being realistic. In this study, a more comprehensive/general analysis is conducted to derive the probability density function of the final sensed voltage, and through realistic simplifications, closed form expressions are presented. Extensive computer simulations corroborate the accuracy of the derived analytical expressions, and we think they shall be essential for accurately estimating the reliability and the overall lifetime of modern MLC memories.
  • Book Part
    Citation - Scopus: 3
    The Use of Neurometric and Biometric Research Methods in Understanding the User Experience of First-Time Buyers in E-Commerce - Book Chapter 94
    (Elsevier, 2018) Çakar, Tuna; Öztürk, Özgürol; Rızvanoğlu, Kerem; Çelik, Deniz Zengin
    User experience (UX) research has attracted increasing attention especially in the last decade as the demand for online shopping has increased by 30.7% from 2014 to 2015 in Turkey. The traditional methods including surveys/questionnaires, think-aloud procedures, and in-depth interviews have contributed greatly for understanding the problems during the use of shopping internet sites. On the other hand, the use of neuroscientific methods, such as biometrics and neurometrics, has also grabbed attention with the exciting idea of providing an objective means of understanding cognitive and affective processes during the user experience during online shopping. Despite significant/strong limitations, many researchers are interested in exploring actively its potential use in the field.
  • Article
    Citation - WoS: 13
    Citation - Scopus: 16
    Face Recognition With Patch-Based Local Walsh Transform
    (Elsevier, 2018-02-01) Uzun-Per, Meryem; Gökmen, Muhittin
    In this paper, we present a novel dense local image representation method called Local Walsh Transform (LWT)by applying the well-known Walsh Transform (WT) to each pixel of an image. The LWT decomposes an image into multiple components, and produces LWT complex images by using the symmetrical relationship between them. Cascaded LWT (CLWT) is also a dense local image representation obtained by applying the LWT again to real and imaginary parts of LWT complex images. Applying the LWT once more to real and imaginary parts of LWT complex images increases the success rate especially on low resolution images. In order to combine the advantages of sparse and dense local image representations, we present Patch-based LWT (PLWT) and Patch-based CLWT (PCLWT) by applying the LWT and CLWT, respectively, to patches extracted around landmarks of multi-scaled face images. The extracted high dimensional features of the patches are reduced through the application of the Whitened Principal Component Analysis (WPCA). Experimental results show that both thePLWT and PCLWT are robust to illumination and expression changes, occlusion and low resolution. The state-of-the-art performance is achieved on the FERET and SCface databases, and the second best unsupervised category result is achieved on the LFW database.