TechRxiv
windeattTransactions-template_tnnlsv2.pdf (615.79 kB)
Download file

Adversarial Detection by Approximation of Ensemble Boundary

Download (615.79 kB)
preprint
posted on 2022-12-15, 03:46 authored by terry windeattterry windeatt

A spectral approximation of a Boolean function is proposed for approximating the decision boundary of an ensemble of Deep Neural Networks (DNNs) solving two-class pattern recognition problems. The Walsh combination of relatively weak DNN classifiers is shown experimentally to be capable of detecting adversarial attacks. By observing the difference in Walsh coefficient approximation between clean and adversarial images, it appears that transferability of attack may be used for detection. Approximating the decision boundary may also aid in understanding the learning and transferability properties of DNNs. While the experiments here use images, the proposed approach of modelling two-class ensemble decision boundaries could in principle be applied to any application area.

History

Email Address of Submitting Author

t.windeatt@surrey.ac.uk

ORCID of Submitting Author

0000-0002-5058-9701

Submitting Author's Institution

university of surrey

Submitting Author's Country

  • United Kingdom