Content-based language models for spoken document retrieval

Hsin‐Min Wang; Berlin Chen

doi:10.1145/355214.355236

ScienceGate Book Chapters

JOURNAL ARTICLE

Content-based language models for spoken document retrieval

Hsin‐Min Wang Berlin Chen

Year: 2000 Pages: 149-155

DOI: 10.1145/355214.355236

Get Full-Text PDF Get Analytical Report

Abstract

Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multimedia collections in the near future. This paper presents a novel concept of applying the content-based language models to spoken document retrieval. In an example task for retrieval of Mandarin broadcast news, the content-based language models either trained with the automatic transcriptions of the spoken documents or adapted from the baseline language models using the automatic transcriptions of the spoken documents were used to create the more accurate recognition results and indexing terms from both the spoken documents and the speech queries. We report on some interesting findings obtained in this research.

Keywords:

Computer science Spoken language Search engine indexing Natural language processing Mandarin Chinese Document retrieval Task (project management) Artificial intelligence Information retrieval Language model Speech recognition Linguistics

Metrics

Cited By

0.42

FWCI (Field Weighted Citation Impact)

Refs

0.73

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Speech and dialogue systems

Physical Sciences → Computer Science → Artificial Intelligence

Content-based language models for spoken document retrieval

Abstract

Metrics

Citation History

Topics

Related Documents

Content-based Language Models for Spoken Document Retrieval

Statistical language models for query-by-example spoken document retrieval

Spoken document retrieval using topic models

I-vector based language modeling for spoken document retrieval

Enhanced Bert-Based Ranking Models for Spoken Document Retrieval