Abstract

When asking about unfamiliar topics, information seeking users often pose questions with false presuppositions. Most existing question answering (QA) datasets, in contrast, assume all questions have well defined answers. We introduce CREPE, a QA dataset containing a natural distribution of presupposition failures from online information-seeking forums. We find that 25% of questions contain false presuppositions, and provide annotations for these presuppositions and their corrections. Through extensive baseline experiments, we show that adaptations of existing open-domain QA models can find presuppositions moderately well, but struggle when predicting whether a presupposition is factually correct. This is in large part due to difficulty in retrieving relevant evidence passages from a large text corpus. CREPE provides a benchmark to study question answering in the wild, and our analyses provide avenues for future work in better modeling and further studying the task.

Keywords:
Presupposition Question answering Benchmark (surveying) Computer science Task (project management) Natural language processing Domain (mathematical analysis) Baseline (sea) Artificial intelligence Epistemology Open domain Information retrieval Linguistics Philosophy Mathematics Political science

Metrics

12
Cited By
3.07
FWCI (Field Weighted Citation Impact)
41
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Expert finding and Q&A systems
Physical Sciences →  Computer Science →  Information Systems
Advanced Graph Neural Networks
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Open-Domain Question Answering

Danqi ChenWen-tau Yih

Year: 2020 Pages: 34-37
JOURNAL ARTICLE

Open-Domain Question–Answering

John Prager

Journal:   Foundations and Trends® in Information Retrieval Year: 2007 Vol: 1 (2)Pages: 91-231
BOOK-CHAPTER

Open-Domain Question Answering

Rishiraj Saha RoyAvishek Anand

Synthesis lectures on information concepts, retrieval, and services Year: 2022 Pages: 111-120
BOOK

Open-Domain Question–Answering

John Prager

now publishers, Inc. eBooks Year: 2006
© 2026 ScienceGate Book Chapters — All rights reserved.