Speaker Vector-Based Speaker Recognition with Phonetic Modeling

Tetsuo Kosaka; Tatsuya Akatsu; Masaharu Kato; Masaki Koh

doi:10.5772/6387

Year: 2008 InTech eBooks

Abstract

This chapter proposes a method for text-independent, anchor model-based speaker recognition with phonetic modeling. Because the method does not require model training for the target speaker, a single utterance is enough as reference speech. To improve recognition performance, phonetic models are used as anchor models in place of the Gaussian mixture model (GMM) scheme. The proposed method was evaluated on a Japanese speaker identification task, where it clearly outperformed a GMM-based system: an identification rate of 94.21% was obtained with 3-state, 10-mixture HMMs on a 30-speaker identification task, a relative improvement of 62.9% over the GMM-based system. The average length of the reference speech in these experiments was only 5.5 s. The results show that phonetic modeling is effective for anchor model-based speaker recognition. We are now evaluating the method on a speaker verification task and on speaker identification in noisy conditions; some results in noisy conditions have been reported in (Goto et al., 2008). The merit of the method is that it can capture speaker characteristics from an utterance as short as about 5 s, so it can be used for tasks such as speaker indexing or tracking.
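
The anchor-model idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the chapter uses phonetic HMMs as anchor models, whereas this sketch substitutes single diagonal Gaussians so that it stays self-contained. The function names, the z-normalization of the speaker vector, the Euclidean distance, and the synthetic "features" are all assumptions made for illustration only. The point it shows is that a target speaker needs no trained model: one reference utterance is scored against every anchor, and the resulting score vector is compared with the vector of the test utterance.

```python
import numpy as np

def avg_log_likelihood(frames, mean, var):
    """Average per-frame log-likelihood under one diagonal Gaussian.
    (Assumption: stands in for scoring against a phonetic HMM anchor.)"""
    d = frames.shape[1]
    diff = frames - mean
    ll = -0.5 * (d * np.log(2 * np.pi)
                 + np.sum(np.log(var))
                 + np.sum(diff ** 2 / var, axis=1))
    return ll.mean()

def speaker_vector(frames, anchors):
    """Score an utterance against every anchor and z-normalize the scores.
    (The normalization is an illustrative choice, not taken from the chapter.)"""
    v = np.array([avg_log_likelihood(frames, m, s) for m, s in anchors])
    return (v - v.mean()) / (v.std() + 1e-12)

def identify(test_frames, references, anchors):
    """Return the reference speaker whose vector is closest to the test vector."""
    test_vec = speaker_vector(test_frames, anchors)
    dists = {name: np.linalg.norm(test_vec - vec)
             for name, vec in references.items()}
    return min(dists, key=dists.get)

# Hypothetical anchor set: 8 anchors in a 4-dimensional feature space.
dim = 4
anchor_means = np.vstack([3.0 * np.eye(dim), -3.0 * np.eye(dim)])
anchors = [(m, np.ones(dim)) for m in anchor_means]

# Synthetic utterances: each "speaker" is just a shifted Gaussian cloud.
rng = np.random.default_rng(0)
spk_a_mean = np.array([2.0, 0.0, 0.0, 0.0])
spk_b_mean = np.array([0.0, 0.0, -2.0, 0.0])

def utterance(mean, n_frames=300):
    return mean + rng.standard_normal((n_frames, dim))

# One reference utterance per speaker; no speaker-specific model is trained.
references = {
    "spk_a": speaker_vector(utterance(spk_a_mean), anchors),
    "spk_b": speaker_vector(utterance(spk_b_mean), anchors),
}

result_a = identify(utterance(spk_a_mean), references, anchors)
result_b = identify(utterance(spk_b_mean), references, anchors)
print(result_a, result_b)
```

In a real system the frames would be acoustic features extracted from speech, and the anchor scores would come from the chapter's phonetic HMMs rather than from these placeholder Gaussians.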

Keywords:

Speech recognition; Speaker recognition; Computer science; Speaker verification; Speaker diarisation

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing
