Deep Learning - Surrogate
Last updated
Last updated
Deep learning methods are based on artificial neural networks with representation learning
They can be supervised, semi-supervised or unsupervised
Architectures include deep neural networks, deep belief networks, recurrent neural networks and convolutional neural networks
Dataset: Pima Indians Diabetes; Target: Outcome
The data is trained by making a Keras Sequential model with 3 layers.
Accuracy: 72.73%
In this section we will interpret a Keras Sequential model with a Decision Tree model as a surrogate.
We found the best hyper-parameters for our DecisionTreeClassifier using GridSearch and then fit it on the training data without the outcome. The output column is replaced by the predictions made by the Keras Sequential model and then fit into the Decision Tree model.
The above tree serves as a surrogate model to the Deep Learning model. As we can see, the most important feature is the root node i.e., Glucose.
There are 399 samples in the test set that are diabetic and 215 that are non-diabetic. If the person has low Glucose, then Age is the next most important feature.
If the Glucose is high, then BMI is the next most important feature in determining if the person is diabetic or not.
The full tree can be interpreted in a similar step-by-step manner wherein every level has an important feature related to it.