Bilgisayar Mühendisliği Bölümü Koleksiyonu
Permanent URI for this collectionhttps://hdl.handle.net/20.500.11779/1940
Browse
3 results
Search Results
Conference Object A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition(Institute of Electrical and Electronics Engineers Inc., 2025-09-17) Soyhan M.E.; Ay T.B.; Memis E.C.; Fatih Capal M.; Cakar T.; Gunay S.; Coskun H.; Gunay, Savas; Memis, Emir Cetin; Fatih Capal, Mehmet; Soyhan, Mustafa Eren; Coskun, Hasan; Cakar, Tuna; Ay, Tarik BugraThis study presents a scalable multimodal Artificial Intelligence (AI) and Machine Learning (ML) framework designed to enhance decision making in the fashion industry. The proposed system integrates garment segmentation, multimodal feature extraction, and similarity recommendation into a unified pipeline. Using Segformer for segmentation, along with the convolutional neural network (CNN)-based feature extraction models ResNet152V2 and Xception, and the transformer-based vision-language model LLaVA, the framework generates visual and semantic embeddings of garments. These representations are processed through similarity detection using OpenAI embedding models and stored in the Pinecone vector database for efficient retrieval. Real-time similarity scoring is enabled through FastAPI endpoints, offering interactive search capabilities. Preliminary results demonstrate the system's strong ability to identify conceptually and visually similar items across a large catalog, providing actionable insights for designers. This framework lays the groundwork for intelligent, interpretable, and production-ready AI systems in the fashion industry. © 2025 IEEE.Conference Object Citation - Scopus: 1Liking Prediction Using fNIRS and Machine Learning: Comparison of Feature Extraction Methods(IEEE, 2022-05-15) Koksal, Mehmet Yigit; Çakar, Tuna; Demircioğlu, Esin Tuna; Girisken, Yener; Tuna, EsinThe fMRI method, which is generally used to detect behavioral patterns, draws attention with its expensive and impractical features. On the other hand, near infrared spectroscopy (fNIRS) method is less expensive and portable, but it is as effective as fMRI in creating a good prediction model. With this method, a model has been developed that can predict whether people like a stimulus or not, using machine learning various algorithms. A comparison was made between feature extraction methods, which was the main focus while developing the model.Conference Object Citation - WoS: 6Citation - Scopus: 3Facial Landmark Localization in Depth Images Using Supervised Descent Method(IEEE, 2015-12-01) Camgoz, Necati Cihan; Gökberk, Berk; Akarun, Lale; Struc, Vitomir; Kindiroglu, Ahmet Alp; Štruc, VitomirSupervised Descent Method (SDM) has proven successful in many computer vision applications such as face alignment, tracking and camera calibration. Recent studies which used SDM, achieved state of the-art performance on facial landmark localization in depth images [4]. In this study, we propose to use ridge regression instead of least squares regression for learning the SDM, and to change feature sizes in each iteration, effectively turning the landmark search into a coarse to fine process. We apply the proposed method to facial landmark localization on the Bosphorus 3D Face Database; using frontal depth images with no occlusion. Experimental results confirm that both ridge regression and using adaptive feature sizes improve the localization accuracy considerably.
