XAI/XAI-papers

Anh M. Nguyen eeae204ca4

Update README.md

2018-04-19 13:44:04 -05:00

1.6 KiB

Raw Blame History

Papers on Understanding and Explaining Neural Networks

This is an on-going attempt to consolidate all interesting efforts in the area of understanding / interpreting / explaining / visualizing neural networks.

1. GUI tools

Deep Visualization Toolbox code | paper

2. Feature Visualization / Activation Maximization

DGN-AM
PPGN

3. Heatmap / Attribution

Learning how to explain neural networks: PatternNet and PatternAttribution (pdf)

Layer-wise Backpropagation

Beyond saliency: understanding convolutional neural networks from saliency prediction on layer-wise relevance propagation (pdf)
Explaining NonLinear Classification Decisions With Deep Tayor Decomposition

4. Bayesian approaches

Yang, S. C. H., & Shafto, P. Explainable Artificial Intelligence via Bayesian Teaching. NIPS 2017 (pdf)

5. Distilling DNNs into more interpretable models

Interpreting CNNs via Decision Trees pdf
Distilling a Neural Network Into a Soft Decision Tree pdf

6. DNNs that learn to explain

Deep Learning for Case-Based Reasoning through Prototypes pdf

7. Others

Understanding Deep Architectures by Interpretable Visual Summaries pdf