JOURNAL ARTICLE

PTMNet: Pixel-Text Matching Network for Zero-Shot Anomaly Detection

Abstract

Visual anomaly detection is crucial for automating industrial quality inspection. However, prior research has primarily involved training custom models for specific scenarios, limiting their flexibility and scalability. Zero-shot anomaly detection, based on the vision-language model, leverages robust natural language supervision to enable defect detection in new categories without the need for dedicated training. Recently CLIP exhibits impressive transferability on zero-shot classification tasks. However, CLIP's effectiveness in zero-shot anomaly detection is limited due to the lack of defect-related knowledge and the complexity of extending image-text pair matching to per-pixel prediction tasks. In this study, we propose a novel framework for independently learning text representations for normal and anomalous cases and enhance modal fusion to capture pixel-text associations within the text-image joint space. Specifically, we transform image-text matching into pixel-text matching to generate pixel-text score maps. These score maps are further refined using a transformer fusion module, effectively strengthening modal fusion and improving the utilization of pretrained knowledge. Additionally, to enhance the learning of visual representations, we introduce an extra linear layer following the image encoder. Extensive experiments demonstrate the superior performance of our methods.

Keywords:
Pixel Computer science Anomaly detection Matching (statistics) Anomaly (physics) Zero (linguistics) Artificial intelligence Shot (pellet) Pattern recognition (psychology) Computer vision Physics Mathematics Materials science Statistics

Metrics

1
Cited By
0.26
FWCI (Field Weighted Citation Impact)
17
Refs
0.61
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Digital Media Forensic Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
COVID-19 diagnosis using AI
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging

Related Documents

JOURNAL ARTICLE

A zero-shot anomaly detection method based on learnable text query

Yanan SongBaisong PanWenchao YiB. Zhang

Journal:   Machine Vision and Applications Year: 2025 Vol: 36 (5)
JOURNAL ARTICLE

Zero-shot versus Many-shot: Unsupervised Texture Anomaly Detection

Toshimichi AotaLloyd Teh Tzer TongTakayuki Okatani

Journal:   2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Year: 2023 Pages: 5553-5561
JOURNAL ARTICLE

Prioritized Local Matching Network for Cross-Category Few-Shot Anomaly Detection

Huilin DengHongchen LuoWei ZhaiYanming GuoYang CaoYu Kang

Journal:   IEEE Transactions on Artificial Intelligence Year: 2024 Vol: 5 (9)Pages: 4550-4561
© 2026 ScienceGate Book Chapters — All rights reserved.