Combining Weak Tokenisers for Phonotactic Language Recognition in a Resource-Constrained Setting

Raymond W. M. Ng; Bhusan Chettri; Thomas Hain

doi:10.21437/interspeech.2016-630

ScienceGate Book Chapters

JOURNAL ARTICLE

Combining Weak Tokenisers for Phonotactic Language Recognition in a Resource-Constrained Setting

Raymond W. M. Ng Bhusan Chettri Thomas Hain

Year: 2016 Pages: 2939-2943

DOI: 10.21437/interspeech.2016-630

Get Full-Text PDF Get Analytical Report

Abstract

In the phonotactic approach for language recognition, a phone tokeniser is normally used to transform the audio signal into acoustic tokens. The language identity of the speech is modelled by the occurrence statistics of the decoded tokens. The performance of this approach depends heavily on the quality of the audio tokeniser. A high-quality tokeniser in matched condition is not always available for a language recognition task. This study investigated into the performance of a phonotactic language recogniser in a resource-constrained setting, following NIST LRE 2015 specification. An ensemble of phone tokenisers was constructed by applying unsupervised sequence training on different target languages followed by a score-based fusion. This method gave 5−7% relative performance improvement to baseline system on LRE 2015 eval set. This gain was retained when the ensemble phonotactic system was further fused with an acoustic iVector system

Keywords:

Phonotactics Computer science Resource (disambiguation) Natural language processing Artificial intelligence Speech recognition Phonology Linguistics Computer network

Metrics

Cited By

0.48

FWCI (Field Weighted Citation Impact)

Refs

0.66

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Combining Weak Tokenisers for Phonotactic Language Recognition in a Resource-Constrained Setting

Abstract

Metrics

Citation History

Topics

Related Documents

Interspeech 2016 - Experiment results for the paper "Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting"

Advances in phonotactic language recognition

Time-Frequency Cepstral Features and Combining Discriminative Training for Phonotactic Language Recognition

Unsupervised crosslingual adaptation of tokenisers for spoken language recognition

Selecting phonotactic features for language recognition