Few shot learning for cross-lingual isolated word recognition

Tirthankar Banerjee; Dhanya Eledath; V. Ramasubramanian

doi:10.1145/3486001.3486235

JOURNAL ARTICLE

Year: 2021 Pages: 1-7

Abstract

We address low-resource machine learning in the form of few-shot learning (FSL) applied to word recognition in both mono-lingual and cross-lingual settings. We recently proposed an adaptation of an FSL framework, matching networks (MN), to a suite of speech recognition tasks, including multi-speaker small-to-medium vocabulary word recognition and frame-wise phoneme recognition, under mel-spectrogram and single-frame feature representations. In this paper, we extend this FSL adaptation of MN to multi-speaker isolated word recognition (IWR), in a framework termed MN-IWR. The IWR task is set in a 'command-and-control' (C&C) scenario that requires only very few examples (e.g. up to 20) for a target IWR classification task whose vocabulary is defined dynamically. Moreover, the proposed MN-IWR framework addresses cross-domain and cross-lingual settings, defined as follows: a model is trained on a possibly large set of words in a source language and then used for inference on a cross-domain task (a vocabulary of words different from the training vocabulary) or a cross-lingual task (a vocabulary of words from a target language different from the source language). We present the main formulation of the MN-IWR framework and its adaptation from source to target tasks, and report results on the TIMIT vocabulary of words in a mono-lingual setting and on English, Kannada and Tamil words in cross-lingual settings. The proposed MN-IWR FSL paradigm achieves very high performance relative to conventional IWR classification without the FSL advantage of the MN formulation.
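The abstract does not spell out the MN-IWR formulation, so the following is only a minimal sketch of the generic matching-network classification rule, assuming cosine-similarity attention over a small support set. The toy embeddings, the three-word vocabulary, and the function names are hypothetical; in a real system the embeddings would come from a network trained on source-language words, which is not shown here.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)

def matching_network_predict(query_emb, support_embs, support_labels, num_classes):
    """Return class probabilities for one query.

    Each support example votes for its own label, weighted by a softmax
    over its cosine similarity to the query (the attention kernel).
    """
    sims = [cosine(query_emb, s) for s in support_embs]
    max_sim = max(sims)  # subtract the max for numerical stability
    weights = [math.exp(s - max_sim) for s in sims]
    total = sum(weights)
    probs = [0.0] * num_classes
    for weight, label in zip(weights, support_labels):
        probs[label] += weight / total
    return probs

# Hypothetical target vocabulary, defined at inference time (2 shots per word).
vocab = ["start", "stop", "left"]
support_embs = [
    [1.0, 0.0, 0.0, 0.0], [0.9, 0.1, 0.0, 0.0],   # "start"
    [0.0, 1.0, 0.0, 0.0], [0.1, 0.9, 0.0, 0.0],   # "stop"
    [0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.9, 0.1],   # "left"
]
support_labels = [0, 0, 1, 1, 2, 2]

# Toy embedding of an unseen utterance, placed close to the "stop" examples.
query_emb = [0.05, 0.95, 0.0, 0.0]

probs = matching_network_predict(query_emb, support_embs, support_labels, len(vocab))
predicted = vocab[probs.index(max(probs))]
print(predicted, [round(p, 3) for p in probs])
```

Because the classifier is defined entirely by the support set, swapping in a new vocabulary (a different domain or a different language) only requires new support examples, not retraining, which is the property the cross-domain and cross-lingual settings rely on.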

Keywords:

Computer science; Shot (pellet); Word (group theory); Artificial intelligence; Natural language processing; Speech recognition; Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 0.14

Citation Normalized Percentile: 0.56

Topics

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence
