Explainable machine learning for educational data

Al Luhaybi, Mashael

Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/22478

Title:	Explainable machine learning for educational data
Authors:	Al Luhaybi, Mashael
Advisors:	Tucker, A Swift, S
Keywords:	Educational Informatics;Artificial Intelligent in Education;Time-series Analysis;Explainable AI;Probabilistic modelling and Deep Learning
Issue Date:	2021
Publisher:	Brunel University London
Abstract:	Educational repositories contain complex trajectories of students and university data. Being able to model this data would offer great value in being able to identify students’ trajectories, predicting their likely future performance, and identifying those who require appropriate intervention as early as possible. However, understanding the nature of the correlations and the dependencies among the educational attributes (which can be time-dependant non-linear relationships) is fundamental for the learning of robust predictive classifiers. When predicting academic performance, many machine learning algorithms make decisions based on data that can be imbalanced, badly sampled, or biased based on historical societal prejudices. In this thesis, I explore, implement, and evaluate temporal predictive classifiers that aim to overcome some of these issues. The approach combines time-series clustering in conjunction with probabilistic learning, resampling, feature subspace learning, and specialist deep learning methods to learn models that are simultaneously accurate and unbiased. A key technical objective in learning these classifiers is to incorporate different types of temporal performance data collected at different times (student admission to a higher education institution, and at Year 1 and 2 of a student’s studies), for the explicit modelling of cognitive styles. A resampling method is applied with bootstrap aggregating to address the issue of the imbalanced time-series educational datasets, which is related to miss-classifying the minority-class of the high-risk or failing students. The evaluation of an unsupervised subspace learning approach using an Autoassociative Neural Network (Autoencoder) is also made, to reconstruct the educational data by maximising variance for improved performance prediction. In addition, the issues of modelling bias are explored such that the types are identified and whether they are accounting for inflated predictive accuracies is established. A graphical learning approach with a BN, that is transparent in how they make decisions, is compared with three forms of Deep Multi-label Convolutional Neural Network (CNNs) to investigate whether deep learning classifiers can be learned that maximise accuracy and minimise bias. The evaluation of the experimental results reveals that identifying cognitive styles improves both explanation and accuracy; that rebalancing also improves accuracy, and that a combination of probabilistic modelling and deep 1D Multi-label CNN can successfully identify and eliminate many biases when predicting student’s academic performance.
Description:	This thesis was submitted for the award of Doctor of Philosophy and was awarded by Brunel University London
URI:	http://bura.brunel.ac.uk/handle/2438/22478
Appears in Collections:	Computer Science Dept of Computer Science Theses

Files in This Item:

File	Description	Size	Format
FulltextThesis.pdf		6.8 MB	Adobe PDF	View/Open

Show full item record