JOURNAL ARTICLE

Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition

Tetsuo KosakaYuui TAKEDATakashi ItoMasaharu KatoMasaki Kohda

Year: 2010 Journal:   IEICE Transactions on Information and Systems Vol: E93-D (9)Pages: 2363-2369   Publisher: Institute of Electronics, Information and Communication Engineers

Abstract

In this paper, we propose a new speaker-class modeling and its adaptation method for the LVCSR system and evaluate the method on the Corpus of Spontaneous Japanese (CSJ). In this method, closer speakers are selected from training speakers and the acoustic models are trained by using their utterances for each evaluation speaker. One of the major issues of the speaker-class model is determining the selection range of speakers. In order to solve the problem, several models which have a variety of speaker range are prepared for each evaluation speaker in advance, and the most proper model is selected on a likelihood basis in the recognition step. In addition, we improved the recognition performance using unsupervised speaker adaptation with the speaker-class models. In the recognition experiments, a significant improvement could be obtained by using the proposed speaker adaptation based on speaker-class models compared with the conventional adaptation method.

Keywords:
Computer science Speaker recognition Speaker diarisation Speech recognition Adaptation (eye) Class (philosophy) Artificial intelligence Variety (cybernetics) Selection (genetic algorithm) Natural language processing

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
13
Refs
0.09
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

An Unsupervised Speaker Adaptation Method for Lecture-Style Spontaneous Speech Recognition Using Multiple Recognition Systems

Seiichi Nakagawa

Journal:   IEICE Transactions on Information and Systems Year: 2005 Vol: E88-D (3)Pages: 463-471
JOURNAL ARTICLE

Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition

Takumi KosakaJun-ichi TakamiShigeki Sagayama

Journal:   IEEE International Conference on Acoustics Speech and Signal Processing Year: 1993 Vol: 5 6 Pages: 570-573 vol.2
JOURNAL ARTICLE

Speaker adaptation for large vocabulary speech recognition systems using speaker Markov models

Gerhard Rigoll

Journal:   International Conference on Acoustics, Speech, and Signal Processing Year: 1989 Pages: 5-8 vol.1
© 2026 ScienceGate Book Chapters — All rights reserved.