A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition

dc.contributor.author Soyhan M.E.
dc.contributor.author Ay T.B.
dc.contributor.author Memis E.C.
dc.contributor.author Fatih Capal M.
dc.contributor.author Cakar T.
dc.contributor.author Gunay S.
dc.contributor.author Coskun H.
dc.date.accessioned 2026-03-05T15:02:39Z
dc.date.available 2026-03-05T15:02:39Z
dc.date.issued 2025
dc.description.abstract This study presents a scalable multimodal Artificial Intelligence (AI) and Machine Learning (ML) framework designed to enhance decision making in the fashion industry. The proposed system integrates garment segmentation, multimodal feature extraction, and similarity recommendation into a unified pipeline. Using Segformer for segmentation, along with the convolutional neural network (CNN)-based feature extraction models ResNet152V2 and Xception, and the transformer-based vision-language model LLaVA, the framework generates visual and semantic embeddings of garments. These representations are processed through similarity detection using OpenAI embedding models and stored in the Pinecone vector database for efficient retrieval. Real-time similarity scoring is enabled through FastAPI endpoints, offering interactive search capabilities. Preliminary results demonstrate the system's strong ability to identify conceptually and visually similar items across a large catalog, providing actionable insights for designers. This framework lays the groundwork for intelligent, interpretable, and production-ready AI systems in the fashion industry. © 2025 IEEE. en_US
dc.identifier.doi 10.1109/UBMK67458.2025.11206866
dc.identifier.issn 2521-1641
dc.identifier.scopus 2-s2.0-105030879107
dc.identifier.uri https://doi.org/10.1109/UBMK67458.2025.11206866
dc.identifier.uri https://hdl.handle.net/20.500.11779/3229
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof International Conference on Computer Science and Engineering, UBMK en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Artificial Intelligence en_US
dc.subject Deep Learning en_US
dc.subject Fashion Industry en_US
dc.subject Feature Extraction en_US
dc.subject Image Segmentation en_US
dc.subject LLaVA en_US
dc.subject Machine Learning en_US
dc.subject OpenAI API en_US
dc.subject ResNet en_US
dc.subject SegFormer en_US
dc.subject Similarity Detection en_US
dc.subject Xception en_US
dc.title A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.institutional Çakar, Tuna
gdc.author.scopusid 60411389000
gdc.author.scopusid 60411327000
gdc.author.scopusid 60411168400
gdc.author.scopusid 60411224800
gdc.author.scopusid 56329345400
gdc.author.scopusid 60411293300
gdc.author.scopusid 55532291700
gdc.collaboration.industrial true
gdc.description.department Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
gdc.description.endpage 1052 en_US
gdc.description.issue 2025 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 1047 en_US
gdc.description.wosquality N/A
gdc.identifier.openalex W4415523976
gdc.index.type Scopus
gdc.openalex.collaboration International
gdc.openalex.fwci 0.0
gdc.openalex.normalizedpercentile 0.32
gdc.opencitations.count 0
gdc.plumx.mendeley 1
gdc.plumx.scopuscites 0
gdc.publishedmonth Ekim
gdc.scopus.citedcount 0
gdc.virtual.author Özlem, Şirin
gdc.virtual.author Çakar, Tuna
gdc.yokperiod YÖK - 2025-26
relation.isAuthorOfPublication 8dc3d5ee-f7f1-4e1a-963b-2315897f3832
relation.isAuthorOfPublication 10f8ce3b-94c2-40f0-9381-0725723768fe
relation.isAuthorOfPublication.latestForDiscovery 8dc3d5ee-f7f1-4e1a-963b-2315897f3832
relation.isOrgUnitOfPublication 636850bf-e58c-4b59-bcf0-fa7418bb7977
relation.isOrgUnitOfPublication 0d54cd31-4133-46d5-b5cc-280b2c077ac3
relation.isOrgUnitOfPublication a6e60d5c-b0c7-474a-b49b-284dc710c078
relation.isOrgUnitOfPublication 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3
relation.isOrgUnitOfPublication.latestForDiscovery 636850bf-e58c-4b59-bcf0-fa7418bb7977

Files