A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition

This study presents a scalable multimodal Artificial Intelligence (AI) and Machine Learning (ML) framework designed to enhance decision making in the fashion industry. The proposed system integrates garment segmentation, multimodal feature extraction, and similarity recommendation into a unified pipeline. Using Segformer for segmentation, along with the convolutional neural network (CNN)-based feature extraction models ResNet152V2 and Xception, and the transformer-based vision-language model LLaVA, the framework generates visual and semantic embeddings of garments. These representations are processed through similarity detection using OpenAI embedding models and stored in the Pinecone vector database for efficient retrieval. Real-time similarity scoring is enabled through FastAPI endpoints, offering interactive search capabilities. Preliminary results demonstrate the system's strong ability to identify conceptually and visually similar items across a large catalog, providing actionable insights for designers. This framework lays the groundwork for intelligent, interpretable, and production-ready AI systems in the fashion industry. © 2025 IEEE.

Keywords

Artificial Intelligence, Deep Learning, Fashion Industry, Feature Extraction, Image Segmentation, LLaVA, Machine Learning, OpenAI API, ResNet, SegFormer, Similarity Detection, Xception

WoS Q

N/A

Scopus Q

N/A

OpenCitations Citation Count

N/A

Source

International Conference on Computer Science and Engineering, UBMK

Issue

2025

Start Page

1047

End Page

1052

URI

https://doi.org/10.1109/UBMK67458.2025.11206866
https://hdl.handle.net/20.500.11779/3229

Collections

Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
Bilgisayar Mühendisliği Bölümü Koleksiyonu

PlumX Metrics

Citations

Scopus : 0

Captures

Mendeley Readers : 1

Full item page

Google Scholar™

Check

A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

Description

Keywords

Fields of Science

Citation