Understanding Explanation by ExamplesUnderstanding Explanation by Examples

Abstract Explaining the inner workings of deep neural net- work models has received considerable attention in recent years. Researchers have developed various explanation techniques in an attempt to provide human-understandable explanations, jus- tifying a model’s decision. In this work, we perform an in- depth study of one such explanation technique – explanation-by- example. For a given test input, explanation-by-example provides the nearest matching data examples from the training data as representative examples of the model’s decision boundary. To understand its relative performance compared to other state-of- the-art explanation methods we perform a cross-analysis Amazon Mechanical Turk study. The participants were asked to compare explanation methods across applications spanning image, text, audio, and sensory domains. Among the surveyed methods, explanation-by-example was preferred in all domains except text sentiment classification. Furthermore, our initial investigation also indicates that the explanation by examples is a good detector of adversarial inputs generated using both white and black-box attacks.
  • Jeya Vikranth Jeyakumar (UCLA)
  • Pengrui Quan (UCLA)
  • Joseph Noor (UCLA)
  • Supriyo Chakraborty (IBM US)
  • Prudhvi Gurram (ARL)
  • Mani Srivastava (UCLA)
Date Sep-2020
Venue 4th Annual Fall Meeting of the DAIS ITA, 2020