Recently, Deep Neural Network (DNN) models have shown remarkable success on several tasks, including classification and domain translation. However, these models typically do not perform well on samples lying in relatively low-density regions of the data distribution, where the model was not well trained. In this thesis, we analyze the effect of different types of noise on the predictions of different DNN-based applications. In particular, for classification-based models, we propose a generalized framework for crafting adversarial examples in a black-box attack setting. As a defense against such adversarial examples, we propose a novel algorithm called MALADE, which, at inference time, drives a given off-manifold input towards the high-density regions of the data-generating distribution using intrinsic knowledge of the perceptual decision boundary. For domain-translation models, we propose to drive unsuccessful fringe examples towards the data manifold by cooling the input test distribution using Langevin dynamics. We demonstrate qualitatively and quantitatively that our strategy enhances the robustness of state-of-the-art methods for classification as well as for domain translation tasks. Taking medical imaging as an exemplary use case of DNN-based classification, we evaluate the robustness of pretraining and self-supervision strategies to input distortions and bias.
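To make the underlying idea concrete, the following is a minimal, illustrative sketch (not the thesis implementation) of a Langevin dynamics update that drifts an off-manifold input towards higher-density regions of a distribution. The score function here is that of a toy isotropic Gaussian and merely stands in for a learned density or score model; the step size and number of steps are arbitrary assumptions for the example.

```python
import numpy as np

def gaussian_score(x, mu, sigma):
    """Score (gradient of the log-density) of an isotropic Gaussian.
    Stands in for a learned score/density model of the data distribution."""
    return -(x - mu) / sigma**2

def langevin_purify(x, score_fn, step_size=1e-2, n_steps=200, rng=None):
    """Run Langevin dynamics to move an input towards high-density regions:
    x <- x + (eps/2) * score(x) + sqrt(eps) * noise, repeated n_steps times."""
    rng = np.random.default_rng() if rng is None else rng
    for _ in range(n_steps):
        noise = rng.standard_normal(x.shape)
        x = x + 0.5 * step_size * score_fn(x) + np.sqrt(step_size) * noise
    return x

# Toy usage: a perturbed point is pulled back towards the Gaussian's mode.
mu, sigma = np.zeros(2), 1.0
x_adv = np.array([4.0, -3.5])  # stand-in for an off-manifold / perturbed input
x_purified = langevin_purify(x_adv, lambda x: gaussian_score(x, mu, sigma))
print(x_adv, "->", x_purified)
```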