JOURNAL ARTICLE

Content-based language models for spoken document retrieval

Abstract

Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multimedia collections in the near future. This paper presents a novel concept of applying the content-based language models to spoken document retrieval. In an example task for retrieval of Mandarin broadcast news, the content-based language models either trained with the automatic transcriptions of the spoken documents or adapted from the baseline language models using the automatic transcriptions of the spoken documents were used to create the more accurate recognition results and indexing terms from both the spoken documents and the speech queries. We report on some interesting findings obtained in this research.

Keywords:
Computer science Spoken language Search engine indexing Natural language processing Mandarin Chinese Document retrieval Task (project management) Artificial intelligence Information retrieval Language model Speech recognition Linguistics

Metrics

4
Cited By
0.42
FWCI (Field Weighted Citation Impact)
16
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Content-based Language Models for Spoken Document Retrieval

Hsin‐Min WangBerlin Chen

Journal:   International Journal of Computer Processing Of Languages Year: 2001 Vol: 14 (02)Pages: 193-209
JOURNAL ARTICLE

Statistical language models for query-by-example spoken document retrieval

Paula Lopez‐OteroJavier ParaparÁlvaro Barreiro

Journal:   Multimedia Tools and Applications Year: 2020 Vol: 79 (11-12)Pages: 7927-7949
JOURNAL ARTICLE

Enhanced Bert-Based Ranking Models for Spoken Document Retrieval

Hsiao-Yun LinTien-Hong LoBerlin Chen

Journal:   2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) Year: 2019 Vol: 39 Pages: 601-606
© 2026 ScienceGate Book Chapters — All rights reserved.