TechRxiv
main.pdf (27.03 MB)

Explaining the Black-box Smoothly- A Counterfactual Approach

Download (27.03 MB)
preprint
posted on 15.06.2021, 01:24 by Sumedha Singla, Brian Pollack, Stephen Wallace, Kayhan Batmanghelich

We propose a BlackBox Counterfactual Explainer that is explicitly developed for medical imaging applications. Classical approaches (e.g., saliency maps) assessing feature importance do not explain how and why variations in a particular anatomical region are relevant to the outcome, which is crucial for transparent decision making in healthcare application. Our framework explains the outcome by gradually exaggerating the semantic effect of the given outcome label. Given a query input to a classifier, Generative Adversarial Networks produce a progressive set of perturbations to the query image that gradually changes the posterior probability from its original class to its negation. We design the loss function to ensure that essential and potentially relevant details, such as support devices, are preserved in the counterfactually generated images. We provide an extensive evaluation of different classification tasks on the chest X-Ray images. Our experiments show that a counterfactually generated visual explanation is consistent with the disease's clinical relevant measurements, both quantitatively and qualitatively.

History

Email Address of Submitting Author

sumedha.singla@pitt.edu

ORCID of Submitting Author

https://orcid.org/0000-0003-3477-0524

Submitting Author's Institution

University of Pittsburgh

Submitting Author's Country

United States of America