A Machine Learning Framework for Student Retention Policy Development: A Case Study

dc.contributor.authorHoca, Sidika
dc.contributor.authorDimililer, Nazife
dc.date.accessioned2026-02-06T18:24:00Z
dc.date.issued2025
dc.departmentDoğu Akdeniz Üniversitesi
dc.description.abstractStudent attrition at tertiary institutions is a global challenge with significant personal and social consequences. Early identification of students at risk of dropout is crucial for proactive and preventive intervention. This study presents a machine learning framework for predicting and visualizing students at risk of dropping out. While most previous work relies on wide-ranging data from numerous sources such as surveys, enrolment, and learning management systems, making the process complex and time-consuming, the current study uses minimal data that are readily available in any registration system. The use of minimal data simplifies the process and ensures broad applicability. Unlike most similar research, the proposed framework provides a comprehensive system that not only identifies students at risk of dropout but also groups them into meaningful clusters, enabling tailored policy generation for each cluster through digital technologies. The proposed framework comprises two stages where the first stage identifies at-risk students using a machine learning classifier, and the second stage uses interpretable AI techniques to cluster and visualize similar students for policy-making purposes. For the case study, various machine learning algorithms-including Support Vector Classifier, K-Nearest Neighbors, Logistic Regression, Na & iuml;ve Bayes, Artificial Neural Network, Random Forest, Classification and Regression Trees, and Categorical Boosting-were trained for dropout prediction using data available at the end of the students' second semester. The experimental results indicated that Categorical Boosting with an F1-score of 82% is the most effective classifier for the dataset. The students identified as at risk of dropout were then clustered and a decision tree was used to visualize each cluster, enabling tailored policy-making.
dc.description.sponsorshipBAPC Project Fund [BAPC-0A-23-01 BAPC]; EMU
dc.description.sponsorshipThis work was supported partially by the EMU BAPC-0A-23-01 BAPC Project Fund.
dc.identifier.doi10.3390/app15062989
dc.identifier.issn2076-3417
dc.identifier.issue6
dc.identifier.scopus2-s2.0-105001135710
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.3390/app15062989
dc.identifier.urihttps://hdl.handle.net/11129/9993
dc.identifier.volume15
dc.identifier.wosWOS:001453559000001
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherMdpi
dc.relation.ispartofApplied Sciences-Basel
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WoS_20260204
dc.subjectstudent dropout
dc.subjectattrition
dc.subjectmachine learning
dc.subjectclassification
dc.subjectAI-assisted policy-making
dc.subjectdigitalization in education
dc.titleA Machine Learning Framework for Student Retention Policy Development: A Case Study
dc.typeArticle

Files