Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.11779/1325
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZeydan, Engin-
dc.contributor.authorArslan, Şefik Şuayb-
dc.date.accessioned2020-05-31T13:51:23Z
dc.date.available2020-05-31T13:51:23Z
dc.date.issued2020-
dc.identifier.citationZeydan, E. & Arslan S. S. (February 01, 2020). Cloud2HDD: large-scale HDD data analysisn cloud for cloud datacenters, 23rd Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN 2020), Paris, France, IEEE, Article number: 9059482, pp. 243-249, DOI: https://doi.org/10.1109/ICIN48450.2020.9059482en_US
dc.identifier.isbn9781728151274-
dc.identifier.isbn9781728151281-
dc.identifier.issn2472-8144-
dc.identifier.issn2162-3414-
dc.identifier.urihttps://hdl.handle.net/20.500.11779/1325-
dc.identifier.urihttps://doi.org/10.1109/ICIN48450.2020.9059482-
dc.description.abstractThe main focus of this paper is to develop a distributed large scale data analysis platform for the opensource data of Backblaze cloud datacenter which consists of operational hard disk drive (HDD) information collected over an observable period of 2272 days (over 74 months). To carefully analyze the intrinsic characteristics of the hard disk behavior, we have exploited a large bolume of data and the benefits of Hadoop ecosystem as our big data processing engine. In other words, we have utilized a special distributed scheme on cloud for cloud HDD data, which is termed as Cloud2HDD. To classify the remaining lifetime of hard disk drives based on health indicators such as in-built S.M.A.R.T (Self-Monitoring, Analysis, and Reporting Technology) features, we used some of the state-of-the-art classification algorithms and compared their accuracy, precision, and recall rates simultaneously. In addition, importance of various S.M.A.R.T. features in predicting the true remaining lifetime of HDDs are identified. For instance, our analysis results indicate that Random Forest Classifier (RFC) can yield up to 94% accuracy with the highest precision and recall at a reasonable time by classifying the remaining lifetime of drives into one of three different classes, namely critical, high and low ideal states in comparison to other classification approaches based on a specific subset of S.M.A.R.T. features.en_US
dc.description.sponsorshipTÜBİTAK, MINECOen_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.ispartof23rd Conference on Innovation in Clouds, Internet and Networks and Workshops = ICIN 2020en_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectClouden_US
dc.subjectData Centeren_US
dc.subjectHadoopen_US
dc.subjectHDDsen_US
dc.subjectLifetimeen_US
dc.subjectMachine Learningen_US
dc.titleCloud2HDD: large-scale HDD data analysis on cloud for cloud datacentersen_US
dc.typeConference Objecten_US
dc.identifier.doi10.1109/ICIN48450.2020.9059482-
dc.identifier.scopus2-s2.0-85084061181en_US
dc.authoridŞuayb Şefik Arslan / 0000-0003-3779-0731-
dc.authoridŞuayb Şefik Arslan / K-2883-2015-
dc.description.woscitationindexConference Proceedings Citation Index - Science-
dc.description.WoSDocumentTypeProceedings Paper
dc.description.WoSPublishedMonthŞubaten_US
dc.description.WoSIndexDate2020en_US
dc.description.WoSYOKperiodYÖK - 2019-20en_US
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US
dc.identifier.endpage249en_US
dc.identifier.startpage243en_US
dc.departmentMühendislik Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.identifier.wosWOS:000569984100041en_US
dc.institutionauthorArslan, Şuayb Şefik-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.grantfulltextembargo_20400522-
item.languageiso639-1en-
item.cerifentitytypePublications-
item.fulltextWith Fulltext-
item.openairetypeConference Object-
crisitem.author.dept02.02. Department of Computer Engineering-
Appears in Collections:Bilgisayar Mühendisliği Bölümü koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection
Files in This Item:
File Description SizeFormat 
Şefik Şuayb ARSLAN.pdf
  Until 2040-05-22
Full Text - Conference Proceeding1.15 MBAdobe PDFView/Open    Request a copy
Show simple item record



CORE Recommender

SCOPUSTM   
Citations

6
checked on Aug 1, 2024

WEB OF SCIENCETM
Citations

5
checked on Jun 23, 2024

Page view(s)

4
checked on Jun 26, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.