PAT: Geometry-Aware Hard-Label Black-Box Adversarial Attacks on Text

Muchao Ye; Jinghui Chen; Chenglin Miao; Han Liu; Ting Wang; Fenglong Ma

doi:10.1145/3580305.3599461

ScienceGate Book Chapters

JOURNAL ARTICLE

PAT: Geometry-Aware Hard-Label Black-Box Adversarial Attacks on Text

Muchao Ye Jinghui Chen Chenglin Miao Han Liu Ting Wang Fenglong Ma

Year: 2023 Pages: 3093-3104

DOI: 10.1145/3580305.3599461

Get Full-Text PDF Get Analytical Report

Abstract

Despite a plethora of prior explorations, conducting text adversarial attacks in practical settings is still challenging with the following constraints: black box -- the inner structure of the victim model is unknown; hard label -- the attacker only has access to the top-1 prediction results; and semantic preservation - the perturbation needs to preserve the original semantics. In this paper, we present PAT, a novel adversarial attack method employed under all these constraints. Specifically, PAT explicitly models the adversarial and non-adversarial prototypes and incorporates them to measure semantic changes for replacement selection in the hard-label black-box setting to generate high-quality samples. In each iteration, PAT finds original words that can be replaced back and selects better candidate words for perturbed positions in a geometry-aware manner guided by this estimation, which maximally improves the perturbation construction and minimally impacts the original semantics. Extensive evaluation with benchmark datasets and state-of-the-art models shows that PAT outperforms existing text adversarial attacks in terms of both attack effectiveness and semantic preservation. Moreover, we validate the efficacy of PAT against industry-leading natural language processing platforms in real-world settings.

Keywords:

Adversarial system Black box Computer science Semantics (computer science) Benchmark (surveying) Theoretical computer science Artificial intelligence Programming language

Metrics

Cited By

1.53

FWCI (Field Weighted Citation Impact)

Refs

0.82

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Adversarial Robustness in Machine Learning

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

PAT: Geometry-Aware Hard-Label Black-Box Adversarial Attacks on Text

Abstract

Metrics

Citation History

Topics

Related Documents

Improving Example Quality in Black Box Hard Label Text Adversarial Attacks

Hard-Label Black-Box Adversarial Attacks for Implicit Scene Interactions

HyGloadAttack: Hard-label black-box textual adversarial attacks via hybrid optimization

Sensitive region-aware black-box adversarial attacks

VIWHard: Text adversarial attacks based on important-word discriminator in the hard-label black-box setting