Yüksek Lisans, Proje Koleksiyonu
Permanent URI for this collectionhttps://hdl.handle.net/20.500.11779/215
Browse
Browsing Yüksek Lisans, Proje Koleksiyonu by Author "Arı, Esra"
Now showing 1 - 1 of 1
- Results Per Page
- Sort Options
master's-degree-project.listelement.badge Trangling Weratedogs Twitter Data To Create Interesting and Trustworthy Explosatory/Predictive Anaylses and Visulation Using Different Machine Learning Algorithms(MEF Üniversitesi, Fen Bilimleri Enstitüsü, 2018) Arı, Esra; Çakar, TunaSocial media usage has rapidly grown in recent years and knowledge in these environments increased due to this expansion. Therefore, doing exploratory and predictive analysis from intensive data of social media became so popular. However, almost all of the large datasets obtained are uncleaned / raw data. Therefore, the assessing and cleaning of the data is at least as important as the exploratory and predictive analysis. The open source WeRateDogs twitter account tweets have been gathered, assessed, cleaned, analyzed and predicted for this thesis. As a result of the study, it was understood that the most important and most time-consuming part of the predictive data analysis is the data gathering and cleaning. As a result of this project, probability of dog’s breed whether retriever or not is predicted from the tweet’s text body. 24 points increase (%34 change) in accuracy values has been achieved by doing oversampling in the data sets which contain low event observation. At the same time, the decision tree, logistic regression and random forest algorithms are compared and it is shown that the random forest's model performance is better than the others. The algorithm works 13 points better than logistic regression, 21 points better than decision tree.