JOURNAL ARTICLE

Evaluating Task-oriented Dialogue Systems with Users

Abstract

Evaluation is one of the major concerns when developing information retrieval systems. In conversational AI in particular, it has been studied extensively for both non-task-oriented and task-oriented conversational agents (dialogue systems) [1]. Several automatic metrics proposed for evaluating dialogue systems, e.g., BLEU and ROUGE, have recently been shown to correlate poorly with human judgment and are thus ineffective for this purpose. As a consequence, a significant amount of research relies on human evaluation to estimate the effectiveness of dialogue systems [1, 4].

Keywords:
Computer science; Task (project management); Field (mathematics); Artificial intelligence; Human–computer interaction; Natural language processing; Task analysis

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 6
Citation Normalized Percentile: 0.08

Topics

Topic Modeling (Physical Sciences → Computer Science → Artificial Intelligence)
Speech and dialogue systems (Physical Sciences → Computer Science → Artificial Intelligence)
Natural Language Processing Techniques (Physical Sciences → Computer Science → Artificial Intelligence)

Related Documents

JOURNAL ARTICLE

Metaphorical User Simulators for Evaluating Task-oriented Dialogue Systems

Weiwei Sun, Shuyu Guo, Shuo Zhang, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren

Journal: ACM Transactions on Information Systems, Year: 2023, Vol: 42 (1), Pages: 1-29
JOURNAL ARTICLE

Adapting language generation to dialogue environments and users for task-oriented dialogue systems

Atsumoto Ohashi, Ryuichiro Higashinaka

Journal: Natural Language Processing Journal, Year: 2025, Vol: 11, Pages: 100153-100153
JOURNAL ARTICLE

Are Current Task-Oriented Dialogue Systems Able to Satisfy Impolite Users?

Zhiqiang Hu, Nancy F. Chen, Roy Ka-Wei Lee

Journal: IEEE Transactions on Computational Social Systems, Year: 2025, Vol: 12 (5), Pages: 2876-2887
JOURNAL ARTICLE

Understanding User Satisfaction with Task-oriented Dialogue Systems

Clemencia Siro, Mohammad Aliannejadi, Maarten de Rijke

Journal: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Year: 2022, Pages: 2018-2023
JOURNAL ARTICLE

Constructing Task-Oriented Dialogue Systems with Limited Resources

Qian, Kun

Journal: Columbia Academic Commons (Columbia University), Year: 2024