loading page

Explaining the Black-box Smoothly- A Counterfactual Approach
  • +1
  • Sumedha Singla ,
  • Brian Pollack ,
  • Stephen Wallace ,
  • Kayhan Batmanghelich
Sumedha Singla
University of Pittsburgh

Corresponding Author:[email protected]

Author Profile
Brian Pollack
Author Profile
Stephen Wallace
Author Profile
Kayhan Batmanghelich
Author Profile


We propose a BlackBox Counterfactual Explainer that is explicitly developed for medical imaging applications. Classical approaches (e.g., saliency maps) assessing feature importance do not explain how and why variations in a particular anatomical region are relevant to the outcome, which is crucial for transparent decision making in healthcare application. Our framework explains the outcome by gradually exaggerating the semantic effect of the given outcome label. Given a query input to a classifier, Generative Adversarial Networks produce a progressive set of perturbations to the query image that gradually changes the posterior probability from its original class to its negation. We design the loss function to ensure that essential and potentially relevant details, such as support devices, are preserved in the counterfactually generated images. We provide an extensive evaluation of different classification tasks on the chest X-Ray images. Our experiments show that a counterfactually generated visual explanation is consistent with the disease’s clinical relevant measurements, both quantitatively and qualitatively.
Feb 2023Published in Medical Image Analysis volume 84 on pages 102721. 10.1016/j.media.2022.102721