Abstract
Massive Open Online Courses (MOOCs) has seen a dramatic increase of participants in the last few years with an exponential growth of internet users all around the world. MOOC allows users to attend lectures of top professors from world class universities. Despite flexible accessibility, the common trend observed in each course is that the number of active participants appears to decrease exponentially as the week's progress. The structure and nature of the courses affects the number of active participants directly. A comprehensive review of the available literature shows that very little intensive work was done using the pattern of user interaction with courses in the field of MOOC data analysis. In this paper, we take an initial step to use the deep learning algorithm to construct the dropout prediction model and produce the predicted individual student dropout probability. Additional improvements are made to optimize the performance of the dropout prediction model and provide the course providers with appropriate interventions based on a temporal prediction mechanism. Our Exploratory Data Analysis demonstrates that there is a strong correlation between click stream actions and successful learner outcomes. Among other features, the deep learning algorithm takes the weekly history of student data into account and thus is able to notice changes in student behaviour over time.
References
- A. D. Ho et al., “HarvardX and MITx: The First Year of Open Online Courses, Fall 2012-Summer 2013,” SSRN Electron. JNo. 1, pp. 1–33, 2014.
- Abidi, S.M.R.; Hussain, M.; Xu, Y.; Zhang, W. “Prediction of Confusion Attempting Algebra Homework in an Intelligent Tutoring System through Machine Learning Techniques for Educational Sustainable Development.” Sustainability 2019, 11, 105.
- Al-Shabandar, R., Hussain, A, Laws, A., Keight, R., & Lunn, J. (2017). “Machine learning approaches to predict learning outcomes in massive open online courses.” Paper presented at the 2017 international joint conference on neural networks (IJCNN 2017), Anchorage, Alaska, USA.
- C. Peng, K. Lee, and G. M. Ingersoll, “An introduction to logistic regression analysis and reporting,” J. Educ. Res., vol. 96, pp. 3–14, 2002.
- D. S. Chaplot, E. Rhim, and J. Kim, “Predicting student attrition in MOOCs using sentiment analysis and neural networks,” Work. 17th Int. Conf. Artif. Intell. Educ. AIED-WS 2015, vol. 1432, pp. 7–12, 2015.
- D. S. Chaplot, E. Rhim, and J. Kim, “Predicting student attrition in MOOCs using sentiment analysis and neural networks,” in Fourth Workshop on Intelligent Support for Learning in Groups, Madrid, Spain, June 2015.
- Halawa, S.; Greene, D.; and Mitchell, J. 2014. “Dropout prediction in MOOCs using learner activity features.” In Proceedings of the European MOOC Summit.
- M. Kloft, F. Stiehler, Z. Zheng, and N. Pinkwart, “Predicting MOOC dropout over weeks using machine learning methods,” in Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, October 2014.
- Mongkhonavit, K., Kanopa, K. and Lang, D. (2019). “Deep Knowledge Tracing and Engagement with MOOCs.” Proceedings of the 9th International Conference on Learning Analytics & Knowledge. Pp. 340-342, Tempe, AZ, USA — March 04 - 08, 2019.
- P. Viszlay, M. Lojka and J. Juhár, “Class-dependent two dimensional linear discriminant analysis using two-pass recognition strategy”, in: Proceedings of the 22nd European Signal Processing Conference (EUSIPCO), IEEE, 2014, pp. 1796– 1800.
- R. F. Kizilcec, C. Piech, and E. Schneider, “Deconstructing Disengagement: Analyzing Learner Subpopulations in Massive Open Online Courses,” Lak ’13, p. 10, 2013.
- R. R. Kabra and R. S. Bichkar, “Performance prediction of engineering students using decision trees,” Int. J. Comput. Appl., vol. 36, no. 11, pp. 8–12, 2011.
- Sinha, T., Jermann, P., Li, N., Dillenbourg, P. (2014). “Your click decides your fate: Inferring Information Processing and Attrition Behavior from MOOC Video Clickstream Interactions.” Proc. of the 2014 Empirical Methods in Natural Language Processing Workshop on Modeling Large Scale Social Interaction in Massively Open Online Courses.
- Svozil D, Kvasnicka V, Pospichal J. “Introduction to multi-layer feed-forward neural networks.” Chemometrics and intelligent laboratory systems 1997;39(1):43-62.
- Tharwat, Alaa & Gaber, Tarek & Ibrahim, Abdelhameed & Hassanien, Aboul Ella. (2017). “Linear discriminant analysis: A detailed tutorial.” Ai Communications. 30. 169-190,. 10.3233/AIC-170729.
- W. Xing and D. Du, “Dropout Prediction in MOOCs: Using Deep Learning for Personalized Intervention,” J. Educ. Comput. Res., 2018.
- Scipy. Retrieved from https://docs.scipy.org/doc/scipy/reference/tutorial/general.html
