A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition

Soyhan M.E.; Ay T.B.; Memis E.C.; Fatih Capal M.; Cakar T.; Gunay S.; Coskun H.; Gunay, Savas; Memis, Emir Cetin; Fatih Capal, Mehmet; Soyhan, Mustafa Eren; Coskun, Hasan; Cakar, Tuna; Ay, Tarik Bugra

doi:10.1109/UBMK67458.2025.11206866

A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition

dc.contributor.author	Soyhan M.E.
dc.contributor.author	Ay T.B.
dc.contributor.author	Memis E.C.
dc.contributor.author	Fatih Capal M.
dc.contributor.author	Cakar T.
dc.contributor.author	Gunay S.
dc.contributor.author	Coskun H.
dc.contributor.author	Gunay, Savas
dc.contributor.author	Memis, Emir Cetin
dc.contributor.author	Fatih Capal, Mehmet
dc.contributor.author	Soyhan, Mustafa Eren
dc.contributor.author	Coskun, Hasan
dc.contributor.author	Cakar, Tuna
dc.contributor.author	Ay, Tarik Bugra
dc.date.accessioned	2026-03-05T15:02:39Z
dc.date.available	2026-03-05T15:02:39Z
dc.date.issued	2025-09-17
dc.description.abstract	This study presents a scalable multimodal Artificial Intelligence (AI) and Machine Learning (ML) framework designed to enhance decision making in the fashion industry. The proposed system integrates garment segmentation, multimodal feature extraction, and similarity recommendation into a unified pipeline. Using Segformer for segmentation, along with the convolutional neural network (CNN)-based feature extraction models ResNet152V2 and Xception, and the transformer-based vision-language model LLaVA, the framework generates visual and semantic embeddings of garments. These representations are processed through similarity detection using OpenAI embedding models and stored in the Pinecone vector database for efficient retrieval. Real-time similarity scoring is enabled through FastAPI endpoints, offering interactive search capabilities. Preliminary results demonstrate the system's strong ability to identify conceptually and visually similar items across a large catalog, providing actionable insights for designers. This framework lays the groundwork for intelligent, interpretable, and production-ready AI systems in the fashion industry. © 2025 IEEE.	en_US
dc.identifier.doi	10.1109/UBMK67458.2025.11206866
dc.identifier.issn	2521-1641
dc.identifier.scopus	2-s2.0-105030879107
dc.identifier.uri	https://doi.org/10.1109/UBMK67458.2025.11206866
dc.identifier.uri	https://hdl.handle.net/20.500.11779/3229
dc.language.iso	en	en_US
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	en_US
dc.relation.ispartof	International Conference on Computer Science and Engineering, UBMK	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	Artificial Intelligence	en_US
dc.subject	Deep Learning	en_US
dc.subject	Fashion Industry	en_US
dc.subject	Feature Extraction	en_US
dc.subject	Image Segmentation	en_US
dc.subject	LLaVA	en_US
dc.subject	Machine Learning	en_US
dc.subject	OpenAI API	en_US
dc.subject	ResNet	en_US
dc.subject	SegFormer	en_US
dc.subject	Similarity Detection	en_US
dc.subject	Xception	en_US
dc.title	A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition	en_US
dc.type	Conference Object	en_US
dspace.entity.type	Publication
gdc.author.institutional	Çakar, Tuna
gdc.author.scopusid	60411389000
gdc.author.scopusid	60411327000
gdc.author.scopusid	60411168400
gdc.author.scopusid	60411224800
gdc.author.scopusid	56329345400
gdc.author.scopusid	60411293300
gdc.author.scopusid	55532291700
gdc.author.scopusid	60411359700
gdc.coar.access	metadata only access
gdc.coar.type	text::conference output
gdc.collaboration.industrial	true
gdc.description.department	Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
gdc.description.departmenttemp	[Soyhan M.E.] MEF University, İstanbul, Turkey; [Ay T.B.] MEF University, İstanbul, Turkey; [Memis E.C.] Mindsane Ltd., Research&Development Center, İstanbul, Turkey; [Fatih Capal M.] MEF University, İstanbul, Turkey; [Cakar T.] MEF University, İstanbul, Turkey; [Gunay S.] Mindsane Ltd., Research&Development Center, İstanbul, Turkey; [Drias Y.] MEF University, İstanbul, Turkey; [Ozlem S.] Research&Development İpekyol Inc., İstanbul, Turkey; [Coskun H.] MEF University, İstanbul, Turkey
gdc.description.endpage	1052	en_US
gdc.description.issue	2025	en_US
gdc.description.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı	en_US
gdc.description.scopusquality	N/A
gdc.description.startpage	1047	en_US
gdc.description.wosquality	N/A
gdc.identifier.openalex	W4415523976
gdc.index.type	Scopus
gdc.openalex.collaboration	International
gdc.openalex.fwci	0.00
gdc.openalex.normalizedpercentile	0.29
gdc.opencitations.count	0
gdc.plumx.mendeley	1
gdc.plumx.scopuscites	0
gdc.publishedmonth	Ekim
gdc.scopus.citedcount	0
gdc.yokperiod	YÖK - 2025-26
relation.isAuthorOfPublication.latestForDiscovery	8dc3d5ee-f7f1-4e1a-963b-2315897f3832
relation.isOrgUnitOfPublication.latestForDiscovery	636850bf-e58c-4b59-bcf0-fa7418bb7977

Collections

Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
Bilgisayar Mühendisliği Bölümü Koleksiyonu

A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition

Files

Collections