Yüksek Lisans Tezleri
Permanent URI for this collectionhttps://hdl.handle.net/20.500.11779/1785
Browse
52 results
Search Results
Master Term Project A Study on Churn Prediction in Telecommunication and Pay Tv Area(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2020) Şayık, Murat; …...Master Term Project Duplicate Record Detection: a Rule-Based Approach(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2017) Malkaralı, Gülce; Özgür ÖzlükThe study presents a rule based algorithm to detect dublicate and near-dublicate rocords within a dataset that is extracted from a leading online reality platform.Master Term Project Forecasting With Ensemble Methods: an Application Using Fashion Retail Sales Data(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2019) Yüzbaşıoğlu, Orkun Berk; Küçükaydın, HandeIn this project, ensemble methods of machine learning are used to predict short term store sales of a fashion retailer. Sales forecasts of various products at different stores are generated for a span of three months with bagging tree regressor, random forest regressor, and gradient boosting regressor algorithm. Algorithms are trained and evaluated with real past sales data of a Turkish fashion retailer. The predictive performance of the models is compared with linear regression. The results of the study show that random forest regressor shows the best performanceMaster Term Project Alternative Credit Scoring Model for Thin File Customers(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2019) Korkmaz, İstem Akça; Taş Küten, DuyguCredit scoring is a widely used tool for banks, financial institutions or corporations. Traditional credit score models are calculated from past financial history of users, and this may lead to exclude some people who have limited financial history from the credit system. Alternative credit scoring allows sector players to access to a larger portion of these customers. The credit scoring industry has expanded with an "all data is credit data" approach that combines traditional credit scoring systems with new data points. In this study, we aim to build an alternative credit scoring model for customers who have limited financial historical data (thin file) by using alternative data points for a national bank in Turkey. Some of the alternative data points and variables have been gathered from one of the bank’s products: the authorized card for Turkish national league football tickets (Passolig). Using alternative data points combining with demographical and geographical information, we perform a comparison between the machine-learning approaches. We use logistic regression approach as a base model and perform a comparison between tree-based approaches: decision tree, random forest and XGBoost to select the most effective modelling approachMaster Term Project Predicttion of Brent Oil Spot Prices Using Country Based Inventory and Trading Data(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2019) Usta, İsmail Batur; Ağralı, SemraCrude oil price forecasting has been the focus of numerous authorities, yet the task still persists on being a challenging one. The extremely volatile nature of oil market and high number of active players in it makes establishing a solid forecasting model that is constantly relevant to time very difficult. Recent advancements on data technologies, mainly ever-increasing computing power and trending big data technologies allowed new approaches to be born. From online learners to natural language processing, advanced data analytics models were employed with the help of easily accessible and diverse data. This project is an attempt on making use of such available data in order to forecast Brent oil spot price. By using monthly country by country inventory, trading and economic data, strong drivers of crude price was explored. The data used in this project comes from various sources and in multiple formats, with the final merged data frame has over 17000 observations and contains information on 86 countries. To enhance prediction power, a specialized learner is fit on each country individually and then the predictions are accumulated and filtered before outputting a single prediction. Compared to a single predictor, this approach enhanced the predictive power of the algorithm by adapting to dynamics of each country.Master Term Project A Comprasion of Ensemble Learning Methods in Retail Sales Forecasting(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2019) Süer, Serhan; Güney, EvrenForecasting has always been an essential skill which companies try to have and implement in various areas. Sales forecasting is one of the major usage areas of forecasting which is used in almost all sectors. This study refers to forecasting sales of Walmart Stores based on several features such as store id, department id, date, and store size. Walmart sales data which was used in this study contains information of stores between 2010 and 2012. At the beginning of the study, the introduction of the dataset and exploratory data analysis were made to identify dependent/independent variables and their characteristics. To apply machine learning algorithms, data preprocessing methods such as missing value treatment, outlier treatment, and feature selection was applied. Ensemble learning methods in machine learning algorithms were applied in the modeling stage. These methods were addressed in three parts such as Bootstrap Aggregation, Boosting, and Stacked Generalization and these parts consist of six different algorithms in total. The models were compared based on four regression metrics as Root Mean Square Error, Mean Absolute Error, R-Squared, and runtime. After selecting the main metric which models were evaluated, cross-validation was applied to achieve unbiased estimates. Finally, parameters of the model which have the highest score in cross-validation were tuned in the hyperparameter optimization stage and a machine learning model which can be used in forecasting sales of Walmart stores and its success score were obtained.Master Term Project Machine Learning Applications To Increase Customer Satisfaction In Finance Sector(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2019) Yiğit, Leyla; Çakar, TunaIn this project, consumers’ complaints about financial data are analyzed. After the analysis, we aim to provide a tool for financial companies such as banks, Lenders that will help them in managing communication with the consumers. Our main aim is to answer the question “How do consumers feel?” This analysis will give a complete picture of consumers’ feedback. We start the project by clustering the customers into different groups. In order to classify customers, we use classification algorithms XGBOOST and Random Forest. XGBOOST is used to predict the probability of getting a complaint. XGBOOST is also tested as an ensemble learning technique. By Using Random Forest the comparison of Bagging and Boosting is performed. This kind of model is very useful for a customer service department that wants to classify the complaints they receive from their customers. These kinds of models can also be expanded into a system that can recommend automatic solutions to future complaints as they come. The topic is motivated by the researcher’s experience in finance where she intends to increase credit sell numbers by anticipating customer feelings. The data set that we use has many measures and dimensions that facilitate to use more than 3 machine learning algorithms. The complaints database is published by the Consumer Financial Protection Bureau (https://www.consumerfinance.gov/). It provides consumers’ feedback in a string format. We also aim to analyze consumers’ complaints dataset from the perceptive of a consumer dispute.Master Term Project Retention Period Prediction for Pension Policies(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2019) Bayır, Ömer; Güney, EvrenCustomer Retention in Pension market refers to the activities and actions companies and organizations take to reduce the number of customer defections. How long the customer will be with our company or will stay in the system is retention. There are already workings in my company and other companies in the market about customer retention. Existing works generally contains how to measure customer retention and how to define distribution channels are successful in customer retention. Also existing predictive models are working on the feature set customer fund, total collection, un-paid premium frequency in general. In pension market companies have small margin of profit from pension policies. To make a profit from pension policies the companies have to retain their customer for long years. It ‘s approximately nine year to make profit from a pension policy because of high sales costs. Therefore to gain a new customer is less profitable than retaining present customers in Pension Market. In my project, I want to look retention in the pension application phase of customer. My main purpose is when the customer applied for pension product predict its retention period. If I produce an applicable model, It will be used in my company’s sales channels.Master Term Project Boarding Pass Detection In Social Media To Prevent Flight Information Thft(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2019) Ekici, Hasan Oktay; Çakar, TunaDuring the past few years, along with social media gaining bigger share on people’s lives, everyone has started to share their moments with detailed information on multiple platforms instantly. Sharing these kinds of information on posts may cause security bugs in people’s lives, such as undesired flight changes/cancellations because of flight information theft, frequent flyers miles theft, and even car theft and burglary. This project’s aim is to develop an artificial intelligence algorithm that can help to prevent these security bugs. In this project, we use data that is collected from instagram posts that contain boarding passes. Our main purpose is to build an artificial intelligence that makes decisions and processes following procedures: a machine learning algorithm that decides if the shared instagram post contains a boarding pass shared with #boardingpass; an optical character recognition algorithm that gathers text information from the post and scripts that send the information instantly to the relevant air carrier about the shared post. With this information, air carrier will be able to inform the passenger about their concern on the flight safety only in a couple of minutes after the post is shared.Master Term Project Internet Baking Channel Usage(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2019) Aydemir, Büşra Nur; Güney, Evren…
