Large-scale multimodal movie dialogue corpus

Ryu Yasuhara; Masashi Inoue; Ikuya Suga; Tetsuo Kosaka

doi:10.1145/2993148.2998523


JOURNAL ARTICLE


Year: 2016 Pages: 414-415



Abstract

We present an outline of our newly created multimodal dialogue corpus that is constructed from public domain movies. Dialogues in movies are useful sources for analyzing human communication patterns. In addition, they can be used to train machine-learning-based dialogue processing systems. However, the movie files are processing intensive and they contain large portions of non-dialogue segments. Therefore, we created a corpus that contains only dialogue segments from movies. The corpus contains 165,368 dialogue segments taken from 1,722 movies. These dialogues are automatically segmented by using deep neural network-based voice activity detection with filtering rules. Our corpus can reduce the human workload and machine-processing effort required to analyze human dialogue behavior by using movies.
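The abstract does not specify the detector or the filtering rules, so the following is only a minimal sketch of how per-frame voice activity scores could be turned into dialogue segments. It assumes that some neural voice activity detector has already produced one speech probability per audio frame. The function name `segment_dialogue`, the threshold, and the two rules (merge segments separated by short pauses, then drop segments that are too short) are hypothetical illustrations, not the authors' published method.

```python
from typing import List, Sequence, Tuple


def segment_dialogue(
    speech_prob: Sequence[float],
    threshold: float = 0.5,      # assumed decision threshold, not from the paper
    max_gap_frames: int = 3,     # assumed rule: bridge pauses up to this long
    min_len_frames: int = 5,     # assumed rule: discard shorter segments
) -> List[Tuple[int, int]]:
    """Return (start_frame, end_frame) pairs, with end_frame exclusive."""
    # Step 1: threshold the per-frame probabilities into raw speech runs.
    raw: List[Tuple[int, int]] = []
    start = None
    for i, p in enumerate(speech_prob):
        if p >= threshold and start is None:
            start = i
        elif p < threshold and start is not None:
            raw.append((start, i))
            start = None
    if start is not None:
        raw.append((start, len(speech_prob)))

    # Step 2: merge runs separated by a pause of at most max_gap_frames.
    merged: List[Tuple[int, int]] = []
    for seg_start, seg_end in raw:
        if merged and seg_start - merged[-1][1] <= max_gap_frames:
            merged[-1] = (merged[-1][0], seg_end)
        else:
            merged.append((seg_start, seg_end))

    # Step 3: drop segments too short to be a meaningful dialogue turn.
    return [(s, e) for s, e in merged if e - s >= min_len_frames]
```

Working in frame indices keeps the rules exact; multiplying each index by the frame duration gives timestamps for cutting the movie file.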

Keywords:

Computer science; Artificial intelligence; Natural language processing; Workload; Domain (mathematical analysis); Closed captioning; Scale (ratio); Speech recognition; Image (mathematics)

Metrics

FWCI (Field Weighted Citation Impact): 0.85

Citation Normalized Percentile: 0.90

Topics

Speech and dialogue systems

Physical Sciences → Computer Science → Artificial Intelligence

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing


Related Documents

Improving Voice Activity Detection for Multimodal Movie Dialogue Corpus

Multimodal Dialogue Corpus Hazumi

mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus

Multimodal Persuasive Dialogue Corpus using Teleoperated Android

KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue Corpus