🎨
Explainable-AI
  • Explainable AI
  • Preface
  • How to use this book?
  • Contents
  • What is Explainable AI?
  • Why do we need Explainablity?
  • Explainable systems and Black box systems
  • Types of Explainability Techniques
  • Explainable Models
    • Linear Regression
      • Assumptions
      • Model
      • Statistical Interpretation
    • Decision Trees
      • How Do They Work?
      • Creating the model
      • Interpretation
  • Explainability Techniques for Classical ML
    • SHAP (SHapley Additive exPlanations)
    • Surrogate model
    • LIME (Local Interpretable Model-Agnostic Explanations)
    • PDP (Partial Dependence Plot)
    • ICE (Individual Conditional Expectation Plots)
    • ALE (Accumulated Local Effects Plot)
  • Datasets
    • Medical Cost Personal Dataset
    • Telecom Churn Dataset
    • Sales Opportunity Size Dataset
    • Pima Indians Diabetes Dataset
  • Implementation of these techniques on different models
    • Logistic Regression - SHAP
    • Random Forest - LIME
    • GBM - PDP
    • GBM - ICE
    • Deep Learning - Surrogate
  • Future scope
  • Contributors
  • Citing this Book
Powered by GitBook
On this page

Was this helpful?

  1. Datasets

Pima Indians Diabetes Dataset

PreviousSales Opportunity Size DatasetNextImplementation of these techniques on different models

Last updated 4 years ago

Was this helpful?

Description

The dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The dataset used to predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset.

Data Set Characteristics: Classification - Bivariate

Number of Instances: 768

Features

  • Pregnancies

  • Glucose

  • Blood Pressure

  • Skin Thickness

  • Insulin

  • BMI

  • DiabetesPedigreeFunction

  • Age

  • Outcome

Acknowledgments

National Institute of Diabetes and Digestive and Kidney Diseases; Includes cost data (donated by Peter Turney)

The dataset is available .

here