Interpretable deep classifier for counterfactual generation
Framework using a supervised VAE with Normalizing Flow to generate counterfactual explanations in deep learning models. (ICAIF)
Topics:
Counterfactual explanation has been the core of interpretable machine learning, which requires a trained model to be able to not only infer but also justify its inference. This problem is crucial in many fields, such as fintech and the healthcare industry, where accurate decisions and their justifications are equally important. Many studies have leveraged the power of deep generative models for counterfactual generation. However, most focus on vision data and leave the latent space unsupervised. In this paper, we propose a new and general framework that uses a supervised extension to the Variational Auto-Encoder (VAE) with Normalizing Flow (NF) for simultaneous classification and counterfactual generation. We show experiments on two tabular financial data-sets, Lending Club (LCD) and Give Me Some Credit (GMC), which show that the model can achieve a state-of-art level prediction accuracy while also producing meaningful counterfactual examples to interpret and justify the classifier’s decision.
Latest publications
Understanding counterfactual generation using maximum mean discrepancy
A quantitative approach using Maximum Mean Discrepancy (MMD) to evaluate and compare counterfactual explanation methods.
ICAIFMixed membership recurrent neural networks for modeling customer purchases
A mixed membership RNN approach for modeling customer purchase behavior with varying time intervals between purchases.
ICAIFALARM: human-in-the-loop anomaly detection
An end-to-end framework for anomaly detection, explanation and management with human-in-the-loop processes for effective action.
ICAIF