SmoothHess: ReLU Network Feature Interactions via Stein's Lemma (NeurIPS 2023)

Max Torop*¹ Aria Masoomi*¹ Davin Hill¹ Kivanc Kose² Stratis Ioannidis¹ Jennifer Dy¹

¹Northeastern University ²Memorial Sloan Kettering Cancer Center

[Paper] • [Poster] • [Video]

📘 Abstract

Several recent methods for interpretability model feature interactions by looking at the Hessian of a neural network. This poses a challenge for ReLU networks, which are piecewise-linear and thus have a zero Hessian almost everywhere. We propose SmoothHess, a method of estimating second-order interactions through Stein's Lemma. In particular, we estimate the Hessian of the network convolved with a Gaussian through an efficient sampling algorithm, requiring only network gradient calls. SmoothHess is applied post-hoc, requires no modifications to the ReLU network architecture, and the extent of smoothing can be controlled explicitly. We provide a non-asymptotic bound on the sample complexity of our estimation procedure. We validate the superior ability of SmoothHess to capture interactions on benchmark datasets and a real-world medical spirometry dataset.

Code written in Python 3.8.3 using the following packages

torch 1.13.1+cu117
torchvision 0.11.1
matplotlib 3.7.1
numpy 1.25.5
pandas 1.5.3
cvxpy 1.2.1
tqdm 4.55.0

Overview

FourQuadrant.ipynb: Shows the Four Quadrant dataset experiment, highlighting the intuitive control offered by SmoothHess compared to the SoftPlus Hessian.
PMSEExample.ipynb: Example notebook for the Perturbation Mean-Squared Error (PMSE) experiment, demonstrating the strong ability of SmoothHess to capture the networks local behaviour.
AdvAttackExample.ipynb: Example notebook demonstrating the use of SmoothHess to perform adversarial attacks.

Citation

If you use this code, please cite our paper:

@article{torop2023smoothhess,
  title={SmoothHess: ReLU network feature interactions via stein's lemma},
  author={Torop, Max and Masoomi, Aria and Hill, Davin and Kose, Kivanc and Ioannidis, Stratis and Dy, Jennifer},
  journal={Advances in Neural Information Processing Systems},
  volume={36},
  pages={50697--50729},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
Models		Models
AdvAttackExample.ipynb		AdvAttackExample.ipynb
ComputeDerivatives.py		ComputeDerivatives.py
Datasets.py		Datasets.py
Evaluation.py		Evaluation.py
FourQuadrant.ipynb		FourQuadrant.ipynb
LICENSE		LICENSE
Models.py		Models.py
PMSEExample.ipynb		PMSEExample.ipynb
README.md		README.md
SmoothHess.py		SmoothHess.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SmoothHess: ReLU Network Feature Interactions via Stein's Lemma (NeurIPS 2023)

[Paper] • [Poster] • [Video]

📘 Abstract

Code written in Python 3.8.3 using the following packages

Overview

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SmoothHess: ReLU Network Feature Interactions via Stein's Lemma (NeurIPS 2023)

[Paper] • [Poster] • [Video]

📘 Abstract

Code written in Python 3.8.3 using the following packages

Overview

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages