Bilgisayar Mühendisliği Bölümü Koleksiyonu
Permanent URI for this collectionhttps://hdl.handle.net/20.500.11779/1940
Browse
2 results
Search Results
Conference Object A Multimodal AI and ML Framework for Fashion Image Segmentation, Recommendation, and Similarity Recognition(Institute of Electrical and Electronics Engineers Inc., 2025) Soyhan M.E.; Ay T.B.; Memis E.C.; Fatih Capal M.; Cakar T.; Gunay S.; Coskun H.; Gunay, Savas; Memis, Emir Cetin; Fatih Capal, Mehmet; Soyhan, Mustafa Eren; Coskun, Hasan; Cakar, Tuna; Ay, Tarik BugraThis study presents a scalable multimodal Artificial Intelligence (AI) and Machine Learning (ML) framework designed to enhance decision making in the fashion industry. The proposed system integrates garment segmentation, multimodal feature extraction, and similarity recommendation into a unified pipeline. Using Segformer for segmentation, along with the convolutional neural network (CNN)-based feature extraction models ResNet152V2 and Xception, and the transformer-based vision-language model LLaVA, the framework generates visual and semantic embeddings of garments. These representations are processed through similarity detection using OpenAI embedding models and stored in the Pinecone vector database for efficient retrieval. Real-time similarity scoring is enabled through FastAPI endpoints, offering interactive search capabilities. Preliminary results demonstrate the system's strong ability to identify conceptually and visually similar items across a large catalog, providing actionable insights for designers. This framework lays the groundwork for intelligent, interpretable, and production-ready AI systems in the fashion industry. © 2025 IEEE.Conference Object Citation - WoS: 1Citation - Scopus: 1Adaptive Boosting of Dnn Ensembles for Brain-Computer Interface Spellers(IEEE, 2021) Çatak, Yiğit; Aksoy, Can; Özkan, Hüseyin; Güney, Osman Berke; Koç, Emirhan; Arslan, Şuayb ŞefikSteady-state visual evoked potentials (SSVEP) are commonly used in brain computer interface (BCI) applications such as spelling systems, due to their advantages over other paradigms. In this study, we develop a method for SSVEP-based BCI speller systems, using a known deep neural network (DNN), which includes transfer and ensemble learning techniques. We test performance of our method on publicly available benchmark and BETA datasets with leave-one-subject-out procedure. Our method consists of two stages. In the first stage, a global DNN is trained using data from all subjects except one subject that is excluded for testing. In the second stage, the global model is fine-tuned to each subject whose data are used in the training. Combining the responses of trained DNNs with different weights for each test subject, rather than an equal weight, provide better performance as brain signals may differ significantly between individuals. To this end, weights of DNNs are learnt with SAMME algorithm with using data belonging to the test subject. Our method significantly outperforms canonical correlation analysis (CCA) and filter bank canonical correlation analysis (FBCCA) methods.
