Bilgisayar Mühendisliği Bölümü Koleksiyonu
Permanent URI for this collectionhttps://hdl.handle.net/20.500.11779/1940
Browse
49 results
Search Results
Research Project İmece-depo: İşbirlikçi Hücresel Ağlarda Veri Önbellekleme için Cihazdan Cihaza Iletişim ile Dağıtık Depolama, Optimale Yakın Kodlama ve Protokol Tasarımı.(2023) Haytaoğlu, Elif; Pourmandı, Massoud; Kaya, Erdi; Arslan, Şefik Şuayb; 02.02. Department of Computer Engineering; 02. Faculty of Engineering; 01. MEF UniversityHücresel ağlarda popüler dosyaların cihazlarda önbelleklenmesi ile, cihazlar arası etkileşim baz istasyonu (Bİ) üzerine düşen iletişim yükünü oldukça azaltmaktadır. Dağıtık veri önbellekleme işlemi popüler bir dosyanın parçalarının kodlanmamış orijinal haliyle ya da herhangi bir silinti kodu kullanılarak kodlanmış halinin mobil cihazlar içerisinde dağıtık bir şekilde depolanması yardımıyla gerçekleştirilir. Dosyanın herhangi bir parçası, komşu mobil cihazlardan ya da mümkün değilse, doğrudan Bİ?lerden, yüksek bir iletişim maliyeti pahasına indirilebilir. Bir hücresel ağda, rastgele zamanlarda bazı düğümlerin hücreye katıldığı ve bazılarının ayrıldığı göz önüne alındığında, performans için Bİ ile iletişimin minimum düzeyde olmasını sağlayacak akıllı veri onarım yöntemlerine ihtiyaç duyulacaktır. Tek bir veya birden fazla Bİ?nin sisteme katılımı, önceki onarım paradigmalarına, özellikle de işbirlikçi düğüm onarım süreçlerine farklı bir boyut eklemektedir. Bunun nedeni, çalışma protokolü kurallarının yanı sıra iletişim kısıtlamalarının da değişmesidir. Literatür, bu durum için temel bant genişliği/depolama ödünleşim uzayını inceleyen bir çalışma içermemektedir. Yeni hücre mimarileri buna göre, yeni silinti kod yapılarını, verimli protokol tasarımlarını, veri erişim gecikmesi, gerçekçi kuyruk modelleri ve gerçekçi benzetim platformları dahil ancak bunlarla sınırlı olmamak üzere farklı tasarım değerlendirmelerini gerektirmektedir. Bu projede ilk olarak, daha önceki hiçbir çalışmada düşünülmemiş işbirliği yapan Bİ?lerin cihaz ayrılışlarında yaşanan kayıp verinin onarımı için bant genişliği ve depolama kapasitesinin iyileştirilmiş teorik sınırlarının veri akış diyagramları kullanılarak elde edilmesi amaçlanmıştır. Bununla beraber, bant genişliği ve depolama alanını en iyi kullanan kod yapılarından esinlenerek, veri önbellekleme işlemini optimale yakın bir maliyetle gerçekleştirecek tamamen özgün çizge tabanlı kod yapıları ve bu yeni kodlar için daha önce düğüm onarım problemine uygulanmamış genetik algoritma, optimize edilmiş artık veri dağıtımı gibi yeni yaklaşımlar kullanılarak önceden düşünülmemiş düğüm onarım algoritmaları önerilecektir. Ayrıca, düğümlerin hücreye katılma ve ayrılma süreçleri için, bant genişliği ve veri depolaması gereksinimlerini en aza indirmeye yardımcı olacak enerji tüketimi odaklı son derece özgün protokoller önerilecektir. Bu protokoller, düğümlerin bir hücreden diğerine geçiş yapabileceği ve hücre içi kaynakların etkin kullanılmasına yardımcı olmak için Bİ?lerin işbirliği yapmasını sağlayan geçiş senaryoları ile güçlendirilecektir. Bu durum, iki onarım işlemi arasındaki sürenin ayarlanması, veri erişim maliyetlerinin azaltılması, hücreye katılan düğüm içeriğinin kullanımı, artık veri kullanımı v.s. gibi yenilikleri içerecektir. Son olarak, önerilen kod yapıları ve protokol mimarisinin performansını analitik olarak türetmek için bilinen çeşitli ve daha gerçekçi kuyruklama modelleri değerlendirilecektir. Analitik sonuçlarımızı doğrulamak için daha sonra hücresel ağ tabanlı büyük ölçekli benzetimler yapılıp sayısal yöntemler ile toplam iletişim ve dosya onarım işlemlerinin maliyet hesaplamaları ve karşılaştırmaları yapılacaktır. MEF Üniversitesi öğretim üyesi Dr. Şuayb Arslan?ın yürütücüsü olduğu ve 36 ay sürecek projede, Pamukkale üniversitesi Bilgisayar Mühendisliği Bölümü öğretim üyesi Dr. Elif Haytaoğlu araştırmacı olarak görev alacaktır. Projede, iki doktora, iki yüksek lisans ve son iki senemizde iki lisans öğrencisi bursiyer olarak görev alacaktır.Patent Artificial Intelligence Augmented Iterative Product Decoding(2023) Arslan , Şuayb Şefik; Göker, TurguyA method for product decoding within a data storage system includes receiving data to be decoded within a first decoder; performing a plurality of decoding iterations to decode the data utilizing a first decoder and a second decoder; and outputting fully decoded data based on the performance of the plurality of decoding iterations. Each of the plurality of decoding iterations includes (i) decoding the data with the first decoder operating at a first decoder operational mode to generate once decoded data; (ii) sending the once decoded data from the first decoder to the second decoder; (iii) receiving error information from the first decoder with an artificial intelligence system; (iv) selecting a second decoder operational mode based at least in part on the error information that is received by the artificial intelligence system; and (v) decoding the once decoded data with the second decoder operating at the second decoder operational mode to generate twice decoded data; and outputting fully decoded data based on the performance of the plurality of decoding iterations.Patent Joint Multi-Nanopore Sequencing for Reliable Data Retrieval in Nucleic Acid Storage(2023) Arslan , Şuayb Şefik; Göker, Turguy; Doerner, DonA nucleic acid storage system (100) that uses nanopore sequencing to read data values chemically embedded in oligonucleotides includes a membrane (102), a voltage source (108), and a nucleic acid strand (110). The membrane (102) has a plurality of nanopores (104) that are stacked upon one another in a multi-nanopore arrangement. The voltage source (108) is configured to direct voltage across the plurality of nanopores (104). The nucleic acid strand (110) including the oligonucleotides is threaded through each of the plurality of nanopores (104) within the membrane (102). A separate base signal (118) is generated from the nucleic acid strand (110) being threaded through each of the plurality of nanopores (104), and Recursive Neural Networks can be used to estimate a signal shape for each oligonucleotide. Recurrent Convolutional Neural Networks and noise predictive data detection algorithms can be used based on the estimated signal shapes to sequence the oligonucleotides.Article Minimum Repair Bandwidth Ldpc Codes for Distributed Storage Systems(IEEE, 2023) Pourmandi, Massoud; Arslan, Şefik Şuayb; Arslan , Şuayb Şefik; Haytaoğlu, Elif; 02.02. Department of Computer Engineering; 02. Faculty of Engineering; 01. MEF UniversityIn distributed storage systems (DSS), an optimal code design must meet the requirements of efficient local data regeneration in addition to reliable data retention. Recently, lowdensity parity-check (LDPC) codes have been proposed as a promising candidate that can secure high data rates as well as low repair bandwidth while maintaining low complexity in data reconstruction. The main objective of this study is to optimize the repair bandwidth characteristics of LDPC code families for a DSS application while meeting the data reliability requirements. First, a data access scenario in which nodes contact other available nodes randomly to download data is examined. Later, a minimum-bandwidth protocol is considered in which nodes make their selections based on the degree numbers of check nodes. Through formulating optimization problems for both protocols, a fundamental trade-off between the decoding threshold and the repair bandwidth is established for a given code rate. Finally, conclusions are confirmed by numerical results showing that irregular constructions have a large potential for establishing optimized LDPC code families for DSS applications.Article What Is the Effective Resolution of the Retinal Image of a Distant Face?(Vision Sciences Society Annual Meeting Abstract, 2023) Arslan , Şuayb Şefik; Arslan, Şefik Şuayb; Sinha, Pawan; 02.02. Department of Computer Engineering; 02. Faculty of Engineering; 01. MEF UniversityWe consider the following question: What is the effective resolution of a face image projected on the retina, when the face is at a specified distance from the eye? Though simple to state, this is a surprisingly challenging issue to resolve. The mapping between viewing distance and effective resolution cannot be readily derived based on the contrast sensitivity, Snellen acuity, or even the packing density of photoreceptors in the fovea. With initial guidelines derived from theoretical considerations, images of varying resolution were presented across a range of viewing distances. For each distance, participants were required to perform an ‘odd one out’ task. This involved detecting the one that was different from the rest in a 2x2 grid, with image resolution being the only dimension of variation. As the experiment progressed, the viewing distance decreased monotonically, and participants were able to detect increasingly subtle resolution differences between the three standard images and the outlier. The collected data have allowed us to establish the upper/lower bounds on the effective available resolution for typical human vision as a function of viewing distance. Interestingly, we find that humans perform significantly better, particularly at short ranges, than what a theoretical model predicts based on projected image size, cone density, and foveal extent. Accordingly, we suggest that the non-uniform in-fovea density, as well as less sharp fall-off in the acuity density function outside the fovea, need to be integrated into future theoretical models to translate viewing distance to perceived image characteristics. A pragmatic benefit of the <distance : effective-resolution> mapping is that it enables a direct comparison of human face recognition performance as assessed across blur and viewing distance. Additionally, it allows us to systematically compare human performance on face recognition at varying distances with that of machine vision systems using the common axis of resolution.Article Comparing Humans and Deep Neural Networks on Face Recognition Under Various Distance and Rotation Viewing Conditions(Journal of Vision, 2023) Fux, Michal; Arslan, Şefik Şuayb; Jang, Hojin; Boix, Xavier; Cooper, Avi; Groth, Matt J; Sinha, Pawan; 02.02. Department of Computer Engineering; 02. Faculty of Engineering; 01. MEF UniversityHumans possess impressive skills for recognizing faces even when the viewing conditions are challenging, such as long ranges, non-frontal regard, variable lighting, and atmospheric turbulence. We sought to characterize the effects of such viewing conditions on the face recognition performance of humans, and compared the results to those of DNNs. In an online verification task study, we used a 100 identity face database, with images captured at five different distances (2m, 5m, 300m, 650m and 1000m) three pitch values (00 - straight ahead, +/- 30 degrees) and three levels of yaw (00, 45, and 90 degrees). Participants were presented with 175 trials (5 distances x 7 yaw and pitch combinations, with 5 repetitions). Each trial included a query image, from a certain combination of range x yaw x pitch, and five options, all frontal short range (2m) faces. One was of the same identity as the query, and the rest were the most similar identities, chosen according to a DNN-derived similarity matrix. Participants ranked the top three most similar target images to the query image. The collected data reveal the functional relationship between human performance and multiple viewing parameters. Nine state-of-the-art pre-trained DNNs were tested for their face recognition performance on precisely the same stimulus set. Strikingly, DNN performance was significantly diminished by variations in ranges and rotated viewpoints. Even the best-performing network reported below 65% accuracy at the closest distance with a profile view of faces, with results dropping to near chance for longer ranges. The confusion matrices of DNNs were generally consistent across the networks, indicating systematic errors induced by viewing parameters. Taken together, these data not only help characterize human performance as a function of key ecologically important viewing parameters, but also enable a direct comparison of humans and DNNs in this parameter regimeArticle A New Benchmark Dataset for P300 Erp-Based Bci Applications(Academic Press Inc Elsevier Science, 2023) Çakar, Tuna; Çakar, Tuna; Arslan, Şefik Şuayb; Arslan, Suayb S.; Yağan, Mehmet; Alp, Nihan; 02.02. Department of Computer Engineering; 02. Faculty of Engineering; 01. MEF UniversityBecause of its non-invasive nature, one of the most commonly used event-related potentials in brain -computer interface (BCI) system designs is the P300 electroencephalogram (EEG) signal. The fact that the P300 response can easily be stimulated and measured is particularly important for participants with severe motor disabilities. In order to train and test P300-based BCI speller systems in more realistic high-speed settings, there is a pressing need for a large and challenging benchmark dataset. Various datasets already exist in the literature but most of them are not publicly available, and they either have a limited number of participants or utilize relatively long stimulus duration (SD) and inter-stimulus intervals (ISI). They are also typically based on a 36 target (6 x 6) character matrix. The use of long ISI, in particular, not only reduces the speed and the information transfer rates (ITRs) but also oversimplifies the P300 detection. This leaves a limited challenge to state-of-the-art machine learning and signal processing algorithms. In fact, near-perfect P300 classification accuracies are reported with the existing datasets. Therefore, one certainly needs a large-scale dataset with challenging settings to fully exploit the recent advancements in algorithm design (machine learning and signal processing) and achieve high-performance speller results. To this end, in this article we introduce a new freely-and publicly-accessible P300 dataset obtained using 32-channel EEG, in the hope that it will lead to new research findings and eventually more efficient BCI designs. The introduced dataset comprises 18 participants performing a 40 -target (5 x 8) cued-spelling task, with reduced SD (66.6 ms) and ISI (33.3 ms) for fast spelling. We have also processed, analyzed, and character-classified the introduced dataset and we presented the accuracy and ITR results as a benchmark. The introduced dataset and the codes of our experiments are publicly accessible at https://data .mendeley.com /datasets /vyczny2r4w.(c) 2023 Elsevier Inc. All rights reserved.Article Cooperative Network Coding for Distributed Storage Using Base Stations With Link Constraints(arXiv, 2021) Arslan, Şuayb Şefik; Arslan, Şefik Şuayb; Haytaoğlu, Elif; 02.02. Department of Computer Engineering; 02. Faculty of Engineering; 01. MEF UniversityIn this work, we consider a novel distributed data storage/caching scenario in a cellular setting where multiple nodes may fail/depart at the same time. In order to maintain the target reliability, we allow cooperative regeneration of lost nodes with the help of base stations allocated in a set of hierarchical layers. Due to this layered structure, a symbol download from each base station has a different cost, while the link capacities connecting the nodes of the cellular system and the base stations are also limited. In this more practical and general scenario, we present the fundamental trade-off between repair bandwidth cost and the storage space per node. Particularly interesting operating points are the minimum storage as well as bandwidth cost points in this trade-off curve. We provide closed-form expressions for the corresponding bandwidth (cost) and storage space per node for these operating points. Finally, we provide an explicit optimal code construction for the minimum storage regeneration point for a given set of system parameters.Conference Object Residual Data Usage in LDPC Codes(IEEE, 2022) Arslan, Şefik Şuayb; Pourmandi, Massoud; Haytaoglu, Elif; Arslan, Şefik Şuayb; 02.02. Department of Computer Engineering; 02. Faculty of Engineering; 01. MEF UniversityIn distributed storage systems/coded caching systems, padding operations should be performed when the encoded data cannot be divided by the number of storage nodes evenly. Thus, extra zero values are stored in one of the nodes to balance each node's storage content. In this study, distribution of data to storage nodes with no padding was investigated for distributed caching context in which a base station and devices both store the coded data. In other words, no redundancy (no-padding) is included into the encoded data. This approach is named as residual data distribution. LDPC codes are selected as the erasure code due to their low complexity encode/decode operations. Moreover, performance comparisons were conducted between using traditional data distribution approach (with padding) and using residual data (use of no-padding) (standard) in terms of repair time. In our work, the effect of no-padding data usage on the repair time and the ratios of storage savings have been also demonstrated.Conference Object Citation - Scopus: 1Improved Bounds on the Moments of Guessing Cost(IEEE, 2022) Arslan, Suayb S.; Arslan, Şefik Şuayb; 02.02. Department of Computer Engineering; 02. Faculty of Engineering; 01. MEF UniversityGuessing a random variable with finite or countably infinite support in which each selection leads to a positive cost value has recently been studied within the context of "guessing cost". In those studies, similar to standard guesswork, upper and lower bounds for the rho-th moment of guessing cost are described in terms of the known measure Renyi's entropy. In this study, we non-trivially improve the known bounds using previous techniques along with new notions such as balancing cost. We have demonstrated that the novel lower bound proposed in this work, achieves 5.84%, 18.47% higher values than that of the known lower bound for rho = 1 and rho = 5, respectively. As for the upper bound, the novel expression provides 10.93%, 5.54% lower values than that of the previously presented bounds for rho = 1 and rho = 5, respectively.
