Student's Performance Evaluation Using Ensemble Machine Learning Algorithms
Downloads
This study explores the critical domain of predicting students' academic performance in educational institutions. By harnessing the potential of machine learning algorithms, specifically Random Forest, KNN, and XGBoost, and leveraging data collected through technology-enhanced learning applications, the research aims to provide valuable insights into the factors influencing academic outcomes based on the dataset obtained from Kaggle. It is important to note that these models were also hybridized using the stacking ensemble approach. The performances of the algorithm were evaluated using the Root Mean Squared Error (RMSE), Mean Squared Error (MSE), Mean Absolute Error (MAE), and R-squared (R2) score. Resultively, the stacked ensemble model displayed remarkable results, with an impressively low RMSE of 0.1768, MSE of 0.0312, MAE of 0.1247, and a high R2-score of 0.9705. This finding showed that the Ensemble model, which combines the strengths of the Random Forest, KNN, and XGBoost algorithms, provides the best overall prediction accuracy, with a high degree of correlation between predicted and actual student performance.
Ababneh, M., Aljarrah, A., Karagozlu, D., & Ozdamli, F. (2021). Guiding the Students in High School by Using Machine Learning. TEM Journal, 10(1).
Abu Zohair, L. M. (2019). Prediction of student's performance by modelling small dataset size. Journal of Educational Technology in Higher Education, 16, 27. https://doi.org/10.1186/s41239-019-0160-3
Alghamdi, A. S., & Rahman, A. (2023). Data mining approach to predict success of secondary school students: A Saudi Arabian case study. Education Sciences, 13(3), 293.
Arashpour, M., Golafshani, E. M., Parthiban, R., Lamborn, J., Kashani, A., Li, H., & Farzanehfar, P. (2023). Predicting individual learning performance using machine‐learning hybridized with the teaching‐learning‐based optimization. Computer Applications in Engineering Education, 31(1), 83-99.
Ashoka S, Savitha M, Shrinidhi Hegde, Rakshitha G, & Poorvika M S (2021) Predicting Students Performance Using Machine Learning. International Research Journal of Modernization in Engineering Technology and Science. 5.354:(3)7. e-ISSN: 2582-5208
Avanijaa, J. (2021). Prediction of house price using xgboost regression algorithm. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(2), 2151-2155.
Batool, S., Rashid, J., Nisar, M. W., Kim, J., Kwon, H. Y., & Hussain, A. (2023). Educational data mining to predict students' academic performance: A survey study. Education and Information Technologies, 28(1), 905-971.
Canales, A., & Maldonado, L. (2018). Teacher quality and student achievement in Chile: Linking teachers' contribution and observable characteristics. International journal of educational development, 60, 33-50.
Cardona, T., Cudney, E. A., Hoerl, R., & Snyder, J. (2023). Data mining and machine learning retention models in higher education. Journal of College Student Retention: Research, Theory & Practice, 25(1), 51-75.
Chen, Y., & Zhai, L. (2023). A comparative study on student performance prediction using machine learning. Education and Information Technologies, 1-19.
Dewsbury, B., & Brame, C. J. (2019). Inclusive teaching. CBE—Life Sciences Education, 18(2), fe2.
Fauth, B., Decristan, J., Decker, A.T., Büttner, G., Hardy, I., Klieme, E. and Kunter, M., (2019). The effects of teacher competence on student outcomes in elementary science education: The mediating role of teaching quality. Teaching and teacher education, 86, p.102882.
Fernández-Delgado, M., Sirsat, M. S., Cernadas, E., Alawadi, S., Barro, S., & Febrero-Bande, M. (2019). An extensive experimental survey of regression methods. Neural Networks, 111, 11-34.
Hamoud, A., Humadi, A., Awadh, W. A., & Hashim, A. S. (2017). Students’ success prediction based on Bayes algorithms. International Journal of Computer Applications, 178(7), 6-12.
Keramat-Jahromi, M., Mohtasebi, S. S., Mousazadeh, H., Ghasemi-Varnamkhasti, M., & Rahimi-Movassagh, M. (2021). Real-time moisture ratio study of drying date fruit chips based on online image attributes using kNN and random forest regression methods. Measurement, 172, 108899.
Kukkar, A., Mohana, R., Sharma, A., & Nayyar, A. (2023). Prediction of student academic performance based on their emotional well-being and interaction on various e-learning platforms. Education and Information Technologies, 1-30.
Okewu, E., Adewole, P., Misra, S., Maskeliunas, R., & Damasevicius, R. (2021). Artificial neural networks for educational data mining in higher education: A systematic literature review. Applied Artificial Intelligence, 35(13), 983-1021.
Pallathadka, H., Wenda, A., Ramirez-Asís, E., Asís-López, M., Flores-Albornoz, J., & Phasinam, K. (2023). Classification and prediction of student performance data using various machine learning algorithms. Materials today: proceedings, 80, 3782-3785.
Romero, C., & Ventura, S. (2020). Educational data mining and learning analytics: An updated survey. Wiley interdisciplinary reviews: Data mining and knowledge discovery, 10(3), 1355.
Roslan, M. H. B., & Chen, C. J. (2023). Predicting students’ performance in English and Mathematics using data mining techniques. Education and Information Technologies, 28(2), 1427-1453.
Stronge, J. H. (2018). Qualities of effective teachers. Ascd.
Wang, X., Zhao, Y., Li, C., & Ren, P. (2023). ProbSAP: A comprehensive and high-performance system for student academic performance prediction. Pattern Recognition, 137, 109309.
Wood, R. (2018). The Influence of Teacher-Student Relationships and Feedback on Students' Engagement with Learning. Cambridge Scholars Publishing.
Wreford, A. I., & Abiodun, O. J. (2023). Credit Card Fraud Detection Based on Feature Selection Using Linear Discriminant Analysis and Deep Artificial Neural Network. Asian Research Journal of Current Science, 5(1), 218–228.
Yagci, M. (2022). Educational data mining: Prediction of students' academic performance using machine learning algorithms. Smart Learning Environments, 9(1), 1-9.