Prediction of Student Performance in the Results of the Online Learning Process Assessment in Informatics Subjects in High School
##plugins.themes.bootstrap3.article.main##
Abstract
In the Corona Endemic, we are not just returning to offline education patterns but are already moving towards education 5.0. Online, normal, blended learning patterns have become commonplace. Online learning assessment requires fast and precise predictions of student performance (high accuracy). The reason is first, due to limited direct interaction. Second, normal learning usually involves an assessment of the learning process and character assessment to be able to provide an accurate final assessment, which is difficult to implement in online learning accurately. Third, there is a lot of data to be processed quickly and precisely so that it can be reported to educational institutions and to students' families. Fourth, Informatics is a lesson that is 80% practical and 20% theory so that the assessment instruments used are 80% performance instruments (Bloom's taxonomy: C2, C3, C4, C5) and 20% multiple choice instruments (C1). Informatics correction and assessment requires more time because 80% cannot be assessed automatically. This research aims to predict student performance (Pass (1) or Intervention (0)) on the results of the online learning process assessment for informatics subjects in high school. If the student performance prediction results in an intervention, it will be immediately followed up by providing an intervention strategy to increase student performance. The target of the research results is to achieve > 70% accuracy on the processed dataset. This research uses the ensemble learning method random Forest Classification and XG Boosting classification. The research results of Student Performance Prediction using XG Boost Classification produce higher accuracy than RF Classification which has an average accuracy value = 93% while RF Classification has an average accuracy result = 92%. The research objectives have been achieved because the results of the 2 methods used have met the desired targets.
##plugins.themes.bootstrap3.article.details##
Ahmad, S., Umirzakova, S., Mujtaba, G., Amin, M. S., & Whangbo, T. (2023). Education 5.0: Requirements, Enabling Technologies, and Future Directions. http://arxiv.org/abs/2307.15846
Ahmed, D. M., Abdulazeez, A. M., Zeebaree, D. Q., & Ahmed, F. Y. H. (2021). Predicting University’s Students Performance Based on Machine Learning Techniques. 2021 IEEE International Conference on Automatic Control and Intelligent Systems, I2CACIS 2021 - Proceedings, 276–281. https://doi.org/10.1109/I2CACIS52118.2021.9495862
Alwarthan, S., Aslam, N., & Khan, I. U. (2022). An Explainable Model for Identifying At-Risk Student at Higher Education. IEEE Access, 10, 107649–107668. https://doi.org/10.1109/ACCESS.2022.3211070
Bujang, S. D. A., Selamat, A., Ibrahim, R., Krejcar, O., Herrera-Viedma, E., Fujita, H., & Ghani, N. A. M. (2021). Multiclass Prediction Model for Student Grade Prediction Using Machine Learning. IEEE Access, 9, 95608–95621. https://doi.org/10.1109/ACCESS.2021.3093563
Ghorbani, R., & Ghousi, R. (2020). Comparing Different Resampling Methods in Predicting Students’ Performance Using Machine Learning Techniques. IEEE Access, 8, 67899–67911. https://doi.org/10.1109/ACCESS.2020.2986809
Kumar, V. U., Krishna, A., Neelakanteswara, P., & Basha, C. Z. (2020). Advanced Prediction of Performance of a Student in an University using Machine Learning Techniques. 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), 121–126. https://doi.org/10.1109/ICESC48915.2020.9155557
Mienye, I. D., & Sun, Y. (2022). A Survey of Ensemble Learning: Concepts, Algorithms, Applications, and Prospects. In IEEE Access (Vol. 10, pp. 99129–99149). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ACCESS.2022.3207287
Nabil, A., Seyam, M., & Abou-Elfetouh, A. (2021). Prediction of Students’ Academic Performance Based on Courses’ Grades Using Deep Neural Networks. IEEE Access, 9, 140731–140746. https://doi.org/10.1109/ACCESS.2021.3119596
Ogunleye, A., & Wang, Q. G. (2020). XGBoost Model for Chronic Kidney Disease Diagnosis. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 17(6), 2131–2140. https://doi.org/10.1109/TCBB.2019.2911071
Perkasa, K. B. P. Y., & Eka Purwiantono, F. (2023). Sistem Rekomendasi Jurusan Menggunakan Algoritma Naïve Bayes Gaussian Berbasis Web. J-INTECH, 11(2), 361–370. https://doi.org/10.32664/j-intech.v11i2.1090
Sahlaoui, H., Alaoui, E. A. A., Nayyar, A., Agoujil, S., & Jaber, M. M. (2021). Predicting and Interpreting Student Performance Using Ensemble Models and Shapley Additive Explanations. IEEE Access, 9, 152688–152703. https://doi.org/10.1109/ACCESS.2021.3124270
Saidani, O., Menzli, L. J., Ksibi, A., Alturki, N., & Alluhaidan, A. S. (2022). Predicting Student Employability Through the Internship Context Using Gradient Boosting Models. IEEE Access, 10, 46472–46489. https://doi.org/10.1109/ACCESS.2022.3170421
Shekar, B. H., & Dagnew, G. (2019). Grid Search-Based Hyperparameter Tuning and Classification of Microarray Cancer Data. 2019 Second International Conference on Advanced Computational and Communication Paradigms (ICACCP), 1–8. https://doi.org/10.1109/ICACCP.2019.8882943
Weerts, H. J. P., Mueller, A. C., & Vanschoren, J. (2020). Importance of Tuning Hyperparameters of Machine Learning Algorithms. http://arxiv.org/abs/2007.07588
Yao, J., Zheng, Y., & Jiang, H. (2021). An Ensemble Model for Fake Online Review Detection Based on Data Resampling, Feature Pruning, and Parameter Optimization. In IEEE Access (Vol. 9, pp. 16914–16927). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ACCESS.2021.3051174