Bilgisayar Mühendisliği Bölümü Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.11779/1940

Browse

Search Results

Now showing 1 - 2 of 2
  • Conference Object
    A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition
    (Institute of Electrical and Electronics Engineers Inc., 2025) Soyhan M.E.; Ay T.B.; Memis E.C.; Fatih Capal M.; Cakar T.; Gunay S.; Coskun H.; Gunay, Savas; Memis, Emir Cetin; Fatih Capal, Mehmet; Soyhan, Mustafa Eren; Coskun, Hasan; Cakar, Tuna; Ay, Tarik Bugra
    This study presents a scalable multimodal Artificial Intelligence (AI) and Machine Learning (ML) framework designed to enhance decision making in the fashion industry. The proposed system integrates garment segmentation, multimodal feature extraction, and similarity recommendation into a unified pipeline. Using Segformer for segmentation, along with the convolutional neural network (CNN)-based feature extraction models ResNet152V2 and Xception, and the transformer-based vision-language model LLaVA, the framework generates visual and semantic embeddings of garments. These representations are processed through similarity detection using OpenAI embedding models and stored in the Pinecone vector database for efficient retrieval. Real-time similarity scoring is enabled through FastAPI endpoints, offering interactive search capabilities. Preliminary results demonstrate the system's strong ability to identify conceptually and visually similar items across a large catalog, providing actionable insights for designers. This framework lays the groundwork for intelligent, interpretable, and production-ready AI systems in the fashion industry. © 2025 IEEE.
  • Conference Object
    Citation - WoS: 6
    Citation - Scopus: 3
    Facial Landmark Localization in Depth Images Using Supervised Descent Method
    (IEEE, 2015) Camgoz, Necati Cihan; Gökberk, Berk; Akarun, Lale; Struc, Vitomir; Kindiroglu, Ahmet Alp; Štruc, Vitomir
    Supervised Descent Method (SDM) has proven successful in many computer vision applications such as face alignment, tracking and camera calibration. Recent studies which used SDM, achieved state of the-art performance on facial landmark localization in depth images [4]. In this study, we propose to use ridge regression instead of least squares regression for learning the SDM, and to change feature sizes in each iteration, effectively turning the landmark search into a coarse to fine process. We apply the proposed method to facial landmark localization on the Bosphorus 3D Face Database; using frontal depth images with no occlusion. Experimental results confirm that both ridge regression and using adaptive feature sizes improve the localization accuracy considerably.