Leveraging Decision Trees to Forecast Academic Outcomes
Abstract
Machine learning has become an essential component of modern educational research, offering data-driven strategies for assessing and enhancing student achievement. Within this field, decision tree algorithms have emerged as one of the most effective approaches because of their interpretability, computational simplicity, and flexibility in managing both categorical and continuous data. This paper provides an extensive review of decision tree-based models applied to student performance prediction in educational settings. It consolidates recent literature to discuss the importance of machine learning in contemporary education, outlines the theoretical concepts underlying major decision tree variants, and analyzes diverse predictors influencing academic success. The review highlights the ability of decision tree methods to achieve predictive accuracies between 80% and 95% in various educational scenarios . Furthermore, it compares algorithmic approaches such as ID3, C4.5, CART, and ensemble models while drawing insights from real-world applications documented in prior studies. The paper also examines implementation issues such as bias, data reliability, and interpretability, concluding with recommendations for advancing the role of decision tree analytics in educational decision-making and early academic intervention systems.
References
Romero, C., & Ventura, S. (2010). Educational data mining: A review of the state of the art. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 40(6), 601–618. https://doi.org/10.1109/TSMCC.2010.2053532
Dutt, A., Ismail, M. A., & Herawan, T. (2017). A systematic review on educational data mining. IEEE Access, 5, 15991–16005. https://doi.org/10.1109/ACCESS.2017.2654247
Peña-Ayala, A. (2014). Educational data mining: A survey and a data mining-based analysis of recent works. Expert Systems with Applications, 41(4), 1432–1462. https://doi.org/10.1016/j.eswa.2013.08.042
Mayilvaganan, M., & Kalpanadevi, D. (2014). Comparison of classification techniques for predicting the performance of students’ academic environment. In Proceedings of the International Conference on Communication and Network Technologies (pp. 113–118).
Pandey, U. K., & Pal, S. (2011). Data mining: A prediction of performer or underperformer using classification. International Journal of Computer Science and Information Technologies, 2(2), 686–690.
Kotsiantis, S., Pierrakeas, C., & Pintelas, P. (2004). Predicting students’ performance in distance learning using machine learning techniques. Applied Artificial Intelligence, 18(5), 411–426. https://doi.org/10.1080/08839510490442058
Kumar, S. A., & Vijayalakshmi, M. N. (2011). Efficiency of decision trees in predicting student’s academic performance. In Proceedings of the First International Conference on Computer Science, Engineering and Applications (pp. 335–343).
Kovacic, Z. J. (2010). Early prediction of student success: Mining students’ enrolment data. In Proceedings of the Informing Science & IT Education Conference (pp. 647–665).
Baker, R. S., & Yacef, K. (2009). The state of educational data mining in 2009: A review and future visions. Journal of Educational Data Mining, 1(1), 3–17.
Mohamad, S. K., & Tasir, Z. (2013). Educational data mining: A review. Procedia – Social and Behavioral Sciences, 97, 320–324. https://doi.org/10.1016/j.sbspro.2013.10.240
Refbacks
- There are currently no refbacks.