Evaluation is a central concern in the development of information retrieval systems. In the field of conversational AI in particular, this topic has been studied extensively for both non-task-oriented and task-oriented conversational agents (dialogue systems) [1]. Word-overlap metrics such as BLEU and ROUGE, adopted for the automatic evaluation of dialogue systems, have been shown to correlate poorly with human judgment and are therefore ineffective for this purpose. As a consequence, a significant amount of research relies on human evaluation to estimate the effectiveness of dialogue systems [1, 4].
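To make this failure mode concrete, the following is a minimal sketch of a BLEU-style clipped n-gram overlap score (simplified: unigrams and bigrams only, no smoothing); the dialogue responses are illustrative examples, not drawn from any benchmark.

```python
from collections import Counter
import math

def ngrams(tokens, n):
    """All contiguous n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def simple_bleu(reference, hypothesis, max_n=2):
    """Simplified sentence-level BLEU: geometric mean of clipped
    n-gram precisions, times a brevity penalty. No smoothing."""
    ref, hyp = reference.split(), hypothesis.split()
    precisions = []
    for n in range(1, max_n + 1):
        ref_counts = Counter(ngrams(ref, n))
        hyp_counts = Counter(ngrams(hyp, n))
        # Clip each hypothesis n-gram count by its count in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        total = max(sum(hyp_counts.values()), 1)
        precisions.append(overlap / total)
    if min(precisions) == 0:
        return 0.0
    # Brevity penalty: punish hypotheses shorter than the reference.
    bp = min(1.0, math.exp(1 - len(ref) / max(len(hyp), 1)))
    return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)

reference = "yes , the restaurant is open on sundays"
paraphrase = "it is indeed open on sunday"              # correct, low overlap
contradiction = "yes , the restaurant is closed on sundays"  # wrong, high overlap

print(simple_bleu(reference, paraphrase))     # ~0.23
print(simple_bleu(reference, contradiction))  # ~0.79
```

The overlap score ranks the contradictory echo well above the correct paraphrase, which is precisely the mismatch with human judgment that motivates human evaluation in this line of work.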