In this fork of the original repository of S-LIME, we expanded the algorithm to image data. The expansion has been merged to the original repository. This project was developed in the context of the course "Interpretability and Explainability in AI" of the MSc AI program in the University of Amsterdam. Our findings can be summarized by the following poster:
This repository holds code for replicating experiments in papar S-LIME: Stabilized-LIME for Model Explanation to appear in KDD2021.
It is built on the implementation of LIME with added functionalities.
It has been shown that post hoc explanations based on perturbations (such as LIME) exhibit large instability, posing serious challenges to the effectiveness of the method itself and harming user trust. S-LIME stands for Stabilized-LIME, which utilizes a hypothesis testing framework based on central limit theorem for determining the number of perturbation points needed to guarantee stability of the resulting explanation.
clone the repository and install using pip:
git clone https://github.com/ZhengzeZhou/slime.git
cd slime
pip install .
Currently, S-LIME only support tabular data and when feature selection method is set to "lasso_path". We are woring on extending the use cases to other data types and feature selection methods.
The following screenshot shows a typical usage of LIME on breasd cancer data. We can easily observe that two runs of the explanation algorithms result in different features being selected.
S-LIME is invoked by calling explainer.slime instead of explainer.explain_instance. n_max indicates the maximum number of sythetic samples to generate and alpha denotes the significance level of hypothesis testing. S-LIME explanations are guranteed to be stable under high probability.