Causal Document-Grounded Dialogue Pre-training

Yingxiu Zhao; Bowen Yu; Bowen Li; Haiyang Yu; Jinyang Li; Chao Wang; Fei Huang; Yongbin Li; Nevin Zhang

doi:10.18653/v1/2023.emnlp-main.443

ScienceGate Book Chapters

JOURNAL ARTICLE

Causal Document-Grounded Dialogue Pre-training

Yingxiu Zhao Bowen Yu Bowen Li Haiyang Yu Jinyang Li Chao Wang Fei Huang Yongbin Li Nevin Zhang

Year: 2023 Pages: 7160-7174

DOI: 10.18653/v1/2023.emnlp-main.443

Get Full-Text PDF Get Analytical Report

Abstract

The goal of document-grounded dialogue (DocGD) is to generate a response by anchoring the evidence in a supporting document in accordance with the dialogue context. This entails four causally interconnected variables. While task-specific pre-training has significantly enhanced performances on numerous downstream tasks, existing DocGD methods still rely on general pre-trained language models without a specifically tailored pre-training approach that explicitly captures the causal relationships. To address this, we present the first causally-complete dataset construction strategy for developing million-scale DocGD pre-training corpora. Additionally, we propose a causally-perturbed pre-training strategy to better capture causality by introducing perturbations on the variables and optimizing the overall causal effect. Experiments conducted on three benchmark datasets demonstrate that our causal pre-training yields substantial and consistent improvements in fully-supervised, low-resource, few-shot, and zero-shot settings.

Keywords:

Computer science Causality (physics) Task (project management) Context (archaeology) Artificial intelligence Machine learning Benchmark (surveying) Training (meteorology) Resource (disambiguation) Training set Causal model Natural language processing Mathematics

Metrics

Cited By

0.26

FWCI (Field Weighted Citation Impact)

Refs

0.59

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Speech and dialogue systems

Physical Sciences → Computer Science → Artificial Intelligence

Causal Document-Grounded Dialogue Pre-training

Abstract

Metrics

Citation History

Topics

Related Documents

Enhancing Multilingual Document-Grounded Dialogue Using Cascaded Prompt-Based Post-Training Models

GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation

Building Goal-oriented Document-grounded Dialogue Systems

DG2: Data Augmentation Through Document Grounded Dialogue Generation

Exploration of multilingual prompts in document-grounded dialogue