JOURNAL ARTICLE

TopiOCQA: Open-domain Conversational Question Answering with Topic Switching

Vaibhav AdlakhaShehzaad DhuliawalaKaheer SulemanHarm de VriesSiva Reddy

Year: 2022 Journal:   Transactions of the Association for Computational Linguistics Vol: 10 Pages: 468-483   Publisher: Association for Computational Linguistics

Abstract

Abstract In a conversational question answering scenario, a questioner seeks to extract information about a topic through a series of interdependent questions and answers. As the conversation progresses, they may switch to related topics, a phenomenon commonly observed in information-seeking search sessions. However, current datasets for conversational question answering are limiting in two ways: 1) they do not contain topic switches; and 2) they assume the reference text for the conversation is given, that is, the setting is not open-domain. We introduce TopiOCQA (pronounced Tapioca), an open-domain conversational dataset with topic switches based on Wikipedia. TopiOCQA contains 3,920 conversations with information-seeking questions and free-form answers. On average, a conversation in our dataset spans 13 question-answer turns and involves four topics (documents). TopiOCQA poses a challenging test-bed for models, where efficient retrieval is required on multiple turns of the same conversation, in conjunction with constructing valid responses using conversational history. We evaluate several baselines, by combining state-of-the-art document retrieval methods with neural reader models. Our best model achieves F1 of 55.8, falling short of human performance by 14.2 points, indicating the difficulty of our dataset. Our dataset and code are available at https://mcgill-nlp.github.io/topiocqa.

Keywords:
Conversation Computer science Question answering Open domain Domain (mathematical analysis) Information retrieval Interdependence Natural language processing Artificial intelligence Relevance (law) Code (set theory) Linguistics Set (abstract data type)

Metrics

62
Cited By
11.94
FWCI (Field Weighted Citation Impact)
46
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Open-Domain Question Answering with Topic Clustering

Arinc GurkanLakshmi Babu Saheer

Lecture notes in networks and systems Year: 2023 Pages: 1090-1109
BOOK-CHAPTER

Knowledge Graph Enabled Open-Domain Conversational Question Answering

Joel Oduro-AfriyieHasan M. Jamil

Lecture notes in computer science Year: 2023 Pages: 63-76
JOURNAL ARTICLE

Conversational open-domain question answering for resource-constrained languages

Emrah BudurTunga Güngör

Journal:   TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES Year: 2025 Vol: 33 (2)Pages: 203-223
© 2026 ScienceGate Book Chapters — All rights reserved.