JOURNAL ARTICLE

An Interactive Multi-Modal Query Answering System with Retrieval-Augmented Large Language Models

Mengzhao WangHaotian WuXiangyu KeYunjun GaoXiaoliang XuLu Chen

Year: 2024 Journal:   Proceedings of the VLDB Endowment Vol: 17 (12)Pages: 4333-4336   Publisher: Association for Computing Machinery

Abstract

Retrieval-augmented Large Language Models (LLMs) have reshaped traditional query-answering systems, offering unparalleled user experiences. However, existing retrieval techniques often struggle to handle multi-modal query contexts. In this paper, we present an interactive M ulti-modal Q uery A nswering (MQA) system, empowered by our newly developed multi-modal retrieval framework and navigation graph index, integrated with cutting-edge LLMs. It comprises five core components: Data Preprocessing, Vector Representation, Index Construction, Query Execution, and Answer Generation, all orchestrated by a dedicated coordinator to ensure smooth data flow from input to answer generation. One notable aspect of MQA is its utilization of contrastive learning to assess the significance of different modalities, facilitating precise measurement of multimodal information similarity. Furthermore, the system achieves efficient retrieval through our advanced navigation graph index, refined using computational pruning techniques. Another highlight of our system is its pluggable processing framework, allowing seamless integration of embedding models, graph indexes, and LLMs. This flexibility provides users diverse options for gaining insights from their multi-modal knowledge base. A preliminary video introduction of MQA is available at https://youtu.be/xvUuo2ZIqWk.

Keywords:
Computer science Question answering Modal Query language Information retrieval Query expansion Natural language processing RDF query language Artificial intelligence Web search query Search engine Web query classification Chemistry

Metrics

4
Cited By
2.12
FWCI (Field Weighted Citation Impact)
5
Refs
0.81
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.