JOURNAL ARTICLE

Lightly supervised alignment of subtitles on multi-genre broadcasts

Óscar SazSalil DeenaMortaza DoulatyMadina HasanBilal KhaliqRosanna MilnerRaymond W. M. NgJúlia OlcozThomas Hain

Year: 2018 Journal:   Multimedia Tools and Applications Vol: 77 (23)Pages: 30533-30550   Publisher: Springer Science+Business Media

Abstract

Abstract This paper describes a system for performing alignment of subtitles to audio on multigenre broadcasts using a lightly supervised approach. Accurate alignment of subtitles plays a substantial role in the daily work of media companies and currently still requires large human effort. Here, a comprehensive approach to performing this task in an automated way using lightly supervised alignment is proposed. The paper explores the different alternatives to speech segmentation, lightly supervised speech recognition and alignment of text streams. The proposed system uses lightly supervised decoding to improve the alignment accuracy by performing language model adaptation using the target subtitles. The system thus built achieves the third best reported result in the alignment of broadcast subtitles in the Multi–Genre Broadcast (MGB) challenge, with an F1 score of 88.8%. This system is available for research and other non–commercial purposes through webASR, the University of Sheffield’s cloud–based speech technology web service. Taking as inputs an audio file and untimed subtitles, webASR can produce timed subtitles in multiple formats, including TTML, WebVTT and SRT.

Keywords:
Computer science Task (project management) Decoding methods Speech recognition Segmentation Adaptation (eye) Artificial intelligence Natural language processing Multimedia Telecommunications

Metrics

7
Cited By
0.79
FWCI (Field Weighted Citation Impact)
40
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Subtitles and Audiovisual Media
Social Sciences →  Arts and Humanities →  Language and Linguistics
© 2026 ScienceGate Book Chapters — All rights reserved.