Iterative pseudo-labeling methods for improving speech recognition

shiyu Fan; Nurmemet Yolwas; Wen Li; Jin‐Ting Zhang

doi:10.1117/12.3011128

ScienceGate Book Chapters

JOURNAL ARTICLE

Iterative pseudo-labeling methods for improving speech recognition

shiyu Fan Nurmemet Yolwas Wen Li Jin‐Ting Zhang

Year: 2023 Pages: 100-100

DOI: 10.1117/12.3011128

Get Full-Text PDF Get Analytical Report

Abstract

In recent years, pseudo-labeling methods can reduce the difficulty of building speech recognition systems, in end-to-end automatic speech recognition (ASR). Iterative pseudo-labeling (IPL) is a classical semi-supervised algorithm that can efficiently perform multiple pseudo-labeling iterations on unlabeled data as acoustic models evolve. We incorporate the language model to generate pseudo-labeling based on IPL using the language model for decoding and data augmentation, and make new attempts on the selection of pseudo-labeling. The effectiveness of the improved approach is demonstrated by simulating low resources and standard settings and obtaining a word error rate better than IPL on the LIBRISPEECH test.

Keywords:

Computer science Speech recognition Language model Decoding methods Word error rate Selection (genetic algorithm) Artificial intelligence Acoustic model Word (group theory) Sequence labeling Pattern recognition (psychology) Speech processing Task (project management) Algorithm Mathematics

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.14

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Iterative pseudo-labeling methods for improving speech recognition

Abstract

Metrics

Topics

Related Documents

Iterative Pseudo-Labeling for Speech Recognition

Improved Noisy Iterative Pseudo-Labeling for Semi-Supervised Speech Recognition

Pseudo-Labeling for Massively Multilingual Speech Recognition

Unsupervised Speech Recognition via Utterance-wise Pseudo-labeling

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition