In this talk, Bernease Herman speaks about recent explainable ML research
Presented on 06/06/2019
**These slides are from a talk given at Rsqrd AI. Learn more at rsqrdai.org**
Rsqrd AI: Recent Advances in Explainable Machine Learning Research
1. Recent advances in
explainable machine
learning research
Bernease Herman, Research Scientist
University of Washington eScience Institute and
Paul G. Allen School of Computer Science & Engineering
June 6, 2019 - Allen Institute for Artificial Intelligence
2. Recent advances in explainable machine learning research - Speed run edition
9. Defining model explainability
There is no single definition within the community:
"component of an interpretable modeling process informing how a model works in understandable form" - Me
"process of giving explanations [of ML] to humans" - Doshi-Velez & Kim 2017
A formal definition that extends beyond humans - Dhurandhar et al. 2017
10. Explanations come in many forms
Linear regression models (with a certain number of parameters)
Decision trees or similar (with a certain depth/number of parameters)
Text explanations
Visualizations (e.g., saliency maps)
and more
12. Three levels of transparency
from "The Mythos of Model Interpretability", Lipton 2016
1. Simulatability: comprehend the entire model at once; model complexity
2. Decomposability: comprehend the individual components/parameters; intelligibility
3. Algorithmic transparency: comprehend the algorithm's behavior; loss surface and randomness
14. Interpretable methods survey
heavily borrowed from the Doshi-Velez & Kim ICML 2017 tutorial
1. Fitting new models that are
intrinsically interpretable
2. Post-hoc analysis of existing model
3. Interpretable analysis of raw data
(or model architecture)
15. Inherently interpretable models
1. Fitting new models that are
intrinsically interpretable
Decision trees, rule lists, rule sets
Generalized linear models (and feature manipulation)
Case-based methods
Sparsity-based methods
Monotonicity-based methods
Conceptual and hierarchical models
16. Inherently interpretable models
1. Fitting new models that are intrinsically interpretable:
Decision trees, rule lists, rule sets
Generalized linear models (and feature manipulation)
table from Gehrke et al. 2012
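As a concrete illustration of this first category, a shallow decision tree can be rendered as human-readable rules that a person can simulate by hand. This is a minimal sketch assuming scikit-learn and the iris dataset, neither of which comes from the talk:

```python
# A minimal sketch of an intrinsically interpretable model: a shallow
# decision tree whose learned rules can be printed and read directly.
# Dataset and depth are illustrative assumptions, not from the talk.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)

# Depth is capped so the whole model stays small enough to simulate by hand,
# matching the "(with a certain depth/number of parameters)" caveat above.
tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, y)

# export_text renders the fitted rules as nested if/else conditions.
rules = export_text(tree, feature_names=load_iris().feature_names)
print(rules)
```

Capping the depth is what keeps the model simulatable; the same estimator with unlimited depth would be far harder to read as a whole.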
17. Inherently interpretable models
Monotonicity constraints
Conceptual and hierarchical models
figure from Gupta et al. 2016
18. Post-hoc analysis for existing models
2. Post-hoc analysis of an existing model:
Sensitivity analysis
Surrogate models
Gradient-based methods
Hidden layer investigations
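Of the post-hoc techniques listed, surrogate models are easy to sketch: train an interpretable model to mimic the black box's predictions, then read the surrogate instead. A minimal sketch assuming scikit-learn; the specific models and synthetic data are illustrative assumptions:

```python
# A minimal sketch of post-hoc surrogate modeling: approximate a black-box
# model with a shallow decision tree trained on the black box's own
# predictions. Model and data choices are illustrative, not from the talk.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=8, random_state=0)

black_box = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# The surrogate is trained to mimic the black box, not the true labels.
surrogate = DecisionTreeClassifier(max_depth=3, random_state=0)
surrogate.fit(X, black_box.predict(X))

# Fidelity: how often the surrogate agrees with the black box.
fidelity = (surrogate.predict(X) == black_box.predict(X)).mean()
print(f"surrogate fidelity: {fidelity:.2f}")
```

Fidelity, not accuracy on the true labels, is the right check here: the surrogate explains the black box only to the extent that it reproduces the black box's behavior.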
24. Interpretable raw data analysis
3. Interpretable analysis of raw data:
Visualization
Variable importance
Partial dependence plots
Correlation analysis
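Variable importance from this list can be sketched via permutation: shuffle one feature at a time and measure how much predictive performance drops. This uses scikit-learn's `permutation_importance`; the model and synthetic dataset are illustrative assumptions, not from the talk:

```python
# A minimal sketch of variable importance via permutation: shuffle one
# feature at a time and measure the resulting drop in model score.
# Dataset and model are illustrative assumptions.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance

X, y = make_classification(
    n_samples=400, n_features=5, n_informative=2, random_state=0
)
model = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)

# Larger mean importance = bigger score drop when that feature is shuffled.
result = permutation_importance(model, X, y, n_repeats=5, random_state=0)
for i, imp in enumerate(result.importances_mean):
    print(f"feature {i}: {imp:.3f}")
```

Because only the inputs are perturbed, the same procedure works for any fitted model, which is why importance scores are a common first look at a new dataset-model pair.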
27. Explanations can be persuasive
Lipton 2016, Herman 2017
When tailoring our model explanations to human preferences and judgments, our models may learn to prioritize persuasive explanations over introspective ones.
31. How AI detectives are cracking open the black box of
deep learning, Science Magazine, July 2017
Background reading
32. Ideas on interpreting machine learning,
O’Reilly Ideas, March 2017
Background reading
33. Thank you!
Let’s discuss this more.
Bernease Herman
bernease@uw.edu
@bernease on Twitter, GitHub, MSDSE Slack, everything
34. Splitting model form from simplicity
Herman 2017
The model is simultaneously coerced into a suitable model form (e.g., a decision tree) and reduced in complexity (e.g., model size). This makes it difficult to evaluate across complexity preferences.
35. Splitting model form from simplicity
Herman 2017
This approach keeps the model form and the reduction of complexity separate, improving evaluation and adaptability.