Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering

Jialin Wu; Raymond J. Mooney

doi:10.18653/v1/2022.emnlp-main.551

ScienceGate Book Chapters

JOURNAL ARTICLE

Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering

Jialin Wu Raymond J. Mooney

Year: 2022 Pages: 8061-8072

DOI: 10.18653/v1/2022.emnlp-main.551

Get Full-Text PDF Get Analytical Report

Abstract

Most Outside-Knowledge Visual Question Answering (OK-VQA) systems employ a two-stage framework that first retrieves external knowledge given the visual question and then predicts the answer based on the retrieved content. However, the retrieved knowledge is often inadequate. Retrievals are frequently too general and fail to cover specific knowledge needed to answer the question. Also, the naturally available supervision (whether the passage contains the correct answer) is weak and does not guarantee question relevancy. To address these issues, we propose an Entity-Focused Retrieval (EnFoRe) model that provides stronger supervision during training and recognizes question-relevant entities to help retrieve more specific knowledge. Experiments show that our EnFoRe model achieves superior retrieval performance on OK-VQA, the currently largest outside-knowledge VQA dataset. We also combine the retrieved knowledge with state-of-the-art VQA models, and achieve a new state-of-the-art performance on OK-VQA.

Keywords:

Question answering Computer science Information retrieval Knowledge extraction Artificial intelligence

Metrics

Cited By

0.87

FWCI (Field Weighted Citation Impact)

Refs

0.72

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering

Abstract

Metrics

Citation History

Topics

Related Documents

Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering

Passage Retrieval for Outside-Knowledge Visual Question Answering

Retrieval Augmented Visual Question Answering with Outside Knowledge

Hierarchical Representations in Dense Passage Retrieval for Question-Answering

Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering