Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition

Tetsuo Kosaka; Yuui TAKEDA; Takashi Ito; Masaharu Kato; Masaki Kohda

doi:10.1587/transinf.e93.d.2363

ScienceGate Book Chapters

JOURNAL ARTICLE

Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition

Tetsuo Kosaka Yuui TAKEDA Takashi Ito Masaharu Kato Masaki Kohda

Year: 2010 Journal: IEICE Transactions on Information and Systems Vol: E93-D (9)Pages: 2363-2369 Publisher: Institute of Electronics, Information and Communication Engineers

DOI: 10.1587/transinf.e93.d.2363

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we propose a new speaker-class modeling and its adaptation method for the LVCSR system and evaluate the method on the Corpus of Spontaneous Japanese (CSJ). In this method, closer speakers are selected from training speakers and the acoustic models are trained by using their utterances for each evaluation speaker. One of the major issues of the speaker-class model is determining the selection range of speakers. In order to solve the problem, several models which have a variety of speaker range are prepared for each evaluation speaker in advance, and the most proper model is selected on a likelihood basis in the recognition step. In addition, we improved the recognition performance using unsupervised speaker adaptation with the speaker-class models. In the recognition experiments, a significant improvement could be obtained by using the proposed speaker adaptation based on speaker-class models compared with the conventional adaptation method.

Keywords:

Computer science Speaker recognition Speaker diarisation Speech recognition Adaptation (eye) Class (philosophy) Artificial intelligence Variety (cybernetics) Selection (genetic algorithm) Natural language processing

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.09

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition

Abstract

Metrics

Topics

Related Documents

Unsupervised Speaker Adaptation in Speech Recognition

An Unsupervised Speaker Adaptation Method for Lecture-Style Spontaneous Speech Recognition Using Multiple Recognition Systems

Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition

Unsupervised speaker adaptation for speech recognition using demi-syllable HMM

Speaker adaptation for large vocabulary speech recognition systems using speaker Markov models