JOURNAL ARTICLE

Bolstering Adversarial Robustness with Latent Disparity Regularization

Abstract

Recent research has revealed that neural networks and other machine learning models are vulnerable to adversarial attacks, which aim to subvert the integrity or privacy of their predictions by adding small, calculated perturbations to inputs; such perturbations can significantly degrade model performance. The number and severity of attacks continue to grow, yet few techniques defend machine learning models robustly at low computational cost. Against this background, we propose an adversarially robust training procedure and objective function for arbitrary neural network architectures. Robustness against adversarial attacks on integrity is achieved by augmenting the training objective with a novel regularization term. This regularizer penalizes the discrepancy between the representations induced in hidden layers by benign and adversarial data. We benchmark our regularization approach on the Fashion-MNIST and CIFAR-10 datasets against three state-of-the-art defense methods, namely: (i) regularization toward the largest eigenvalue of the Fisher information matrix of the terminal layer's activity, (ii) a high-level representation guided denoising autoencoder (trained with adversarial examples), and (iii) training an otherwise undefended model on data distorted by additive white Gaussian noise. Our experiments show that the proposed regularizer provides significant improvements in adversarial robustness over both an undefended baseline model and the same model defended with the other techniques. This result holds across several adversarial budgets, with only a small (but seemingly unavoidable) decline in benign test accuracy.
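The core idea of the abstract can be illustrated in a few lines. Below is a minimal NumPy sketch, not the authors' implementation: a tiny two-layer network whose training loss adds a disparity penalty on hidden-layer activations between a benign input and a perturbed copy. All names (`latent_disparity`, `total_loss`, the FGSM-style perturbation) are illustrative assumptions.

```python
import numpy as np

# Hypothetical two-layer network; weights are fixed here since only the
# loss computation is being illustrated, not the training loop.
rng = np.random.default_rng(0)
W1 = rng.normal(size=(8, 4)) * 0.5   # input -> hidden weights
W2 = rng.normal(size=(1, 8)) * 0.5   # hidden -> output weights

def hidden(x):
    # Hidden-layer representation whose benign/adversarial gap is penalized.
    return np.tanh(W1 @ x)

def forward(x):
    return W2 @ hidden(x)

def latent_disparity(x_benign, x_adv):
    """Mean squared discrepancy between hidden representations."""
    h_b, h_a = hidden(x_benign), hidden(x_adv)
    return float(np.mean((h_b - h_a) ** 2))

def total_loss(x_benign, x_adv, y, lam=1.0):
    # Ordinary task loss (squared error here for simplicity) plus the
    # latent disparity regularizer weighted by lam.
    task = float((forward(x_benign) - y) ** 2)
    return task + lam * latent_disparity(x_benign, x_adv)

x = rng.normal(size=4)
# Stand-in for an adversarial example (sign-of-noise perturbation,
# loosely in the spirit of FGSM; a real attack would use the loss gradient).
x_adv = x + 0.1 * np.sign(rng.normal(size=4))
print(total_loss(x, x_adv, y=np.array([1.0])))
```

Minimizing `total_loss` pulls the hidden representations of perturbed inputs toward those of their benign counterparts, which is the mechanism the abstract credits for the robustness gains.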

Keywords:
MNIST database, adversarial machine learning, computer science, artificial intelligence, robustness, machine learning, regularization, artificial neural networks, deep neural networks, pattern recognition, algorithms

Metrics

Cited by: 3
FWCI (Field-Weighted Citation Impact): 0.42
References: 46
Citation Normalized Percentile: 0.69


Topics

Adversarial Robustness in Machine Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Consistency Regularization for Adversarial Robustness

Jihoon Tack, Sihyun Yu, Jongheon Jeong, Minseon Kim, Sung Ju Hwang, Jinwoo Shin

Journal: Proceedings of the AAAI Conference on Artificial Intelligence, Year: 2022, Vol: 36 (8), Pages: 8414-8422
JOURNAL ARTICLE

Adversarial Robustness Via Fisher-Rao Regularization

Marine Picot, Francisco Messina, Malik Boudiaf, Fabrice Labeau, Ismail Ben Ayed, Pablo Piantanida

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence, Year: 2022, Vol: 45 (3), Pages: 2698-2710