JOURNAL ARTICLE

Zero-Shot Human–Object Interaction Detection via Similarity Propagation

Daoming ZongShiliang Sun

Year: 2023 Journal:   IEEE Transactions on Neural Networks and Learning Systems Vol: 35 (12)Pages: 17805-17816   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Human-object interaction (HOI) detection involves identifying interactions represented as , requiring the localization of human-object pairs and interaction classification within an image. This work focuses on the challenge of detecting HOIs with unseen objects using the prevalent Transformer architecture. Our empirical analysis reveals that the performance degradation of novel HOI instances primarily arises from misclassifying unseen objects as confusable seen objects. To address this issue, we propose a similarity propagation (SP) scheme that leverages cosine similarity distance to regulate the prediction margin between seen and unseen objects. In addition, we introduce pseudo-supervision for unseen objects based on class semantic similarities during training. Furthermore, we incorporate semantic-aware instance-level and interaction-level contrastive losses with Transformer to enhance intraclass compactness and interclass separability, resulting in improved visual representations. Extensive experiments on two challenging benchmarks, V-COCO and HICO-DET, demonstrate the effectiveness of our model, outperforming current state-of-the-art methods under various zero-shot settings.

Keywords:
Artificial intelligence Computer science Transformer Object (grammar) Margin (machine learning) Pattern recognition (psychology) Similarity (geometry) Semantic similarity Cosine similarity Image (mathematics) Machine learning

Metrics

10
Cited By
1.82
FWCI (Field Weighted Citation Impact)
65
Refs
0.83
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.