JOURNAL ARTICLE

Few shot learning for cross-lingual isolated word recognition

Abstract

We address low-resource machine learning in the form of few-shot learning (FSL) applied to word recognition in both mono-lingual and cross-lingual settings. We recently proposed an adaptation of an FSL framework, matching networks (MN), to a suite of speech recognition tasks, including multi-speaker small-to-medium vocabulary word recognition and frame-wise phoneme recognition, under mel-spectrogram and single-frame feature representations. In this paper, we extend this FSL adaptation of MN to multi-speaker isolated word recognition (IWR), in a framework termed MN-IWR. The IWR task is set in a 'command-and-control' (C&C) scenario that requires only a few examples (e.g., up to 20) for a target IWR classification task whose vocabulary is defined dynamically. Moreover, the proposed MN-IWR framework addresses cross-domain and cross-lingual settings, defined as follows: a model is trained on a possibly large set of words in a source language and used for inference on a cross-domain task (a vocabulary of words different from the training vocabulary) or a cross-lingual task (a vocabulary of words from a target language different from the source language). We present the main formulation of the MN-IWR framework and its adaptation from source to target tasks. We report results on the TIMIT vocabulary of words in a mono-lingual setting and on English, Kannada and Tamil words in cross-lingual settings, where the proposed MN-IWR FSL paradigm achieves very high performance compared with conventional IWR classification that lacks the FSL advantage of the MN formulation.
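The abstract does not spell out the MN-IWR equations, so the following is only a minimal sketch of the generic matching-network inference rule it builds on: a query is labeled by an attention-weighted vote over a small labeled support set, with attention taken as a softmax over cosine similarities. The function names, the three-dimensional toy embeddings, and the three-word vocabulary are all hypothetical; in practice the embeddings would come from an encoder trained on source-language words, which is not shown here.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)

def matching_network_predict(query_emb, support_embs, support_labels, num_classes):
    """Class probabilities for one query: softmax attention over the
    support set, accumulated per support label."""
    sims = [cosine(query_emb, s) for s in support_embs]
    top = max(sims)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in sims]
    total = sum(exps)
    probs = [0.0] * num_classes
    for weight, label in zip(exps, support_labels):
        probs[label] += weight / total
    return probs

# Hypothetical 2-shot, 3-word support set (stand-ins for encoder outputs).
support_embs = [
    [1.0, 0.1, 0.0], [0.9, 0.0, 0.1],  # word 0
    [0.0, 1.0, 0.1], [0.1, 0.9, 0.0],  # word 1
    [0.0, 0.1, 1.0], [0.1, 0.0, 0.9],  # word 2
]
support_labels = [0, 0, 1, 1, 2, 2]

# Hypothetical query embedding, closest in direction to the word-1 examples.
query_emb = [0.05, 0.95, 0.05]

probs = matching_network_predict(query_emb, support_embs, support_labels, 3)
predicted_word = max(range(3), key=lambda c: probs[c])
```

Because the vocabulary enters only through the support set, swapping in support examples for a different set of words, or for words from another language, changes the classification task without retraining, which is the property the cross-domain and cross-lingual settings rely on.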

Keywords:
Computer science, Shot (pellet), Word (group theory), Artificial intelligence, Natural language processing, Speech recognition, Mathematics

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.14
Refs: 19
Citation Normalized Percentile: 0.56

Topics

Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Cross-lingual few-shot sign language recognition

Yunus Can Bilge, Nazlı İkizler-Cinbiş, Ramazan Gökberk Cinbiş

Journal: Pattern Recognition Year: 2024 Vol: 151 Pages: 110374

JOURNAL ARTICLE

Word Reordering for Zero-shot Cross-lingual Structured Prediction

Tao Ji, Yong Jiang, Tao Wang, Zhongqiang Huang, Fei Huang, Yuanbin Wu, Xiaoling Wang

Journal: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing Year: 2021 Pages: 4109-4120

BOOK-CHAPTER

Unsupervised Learning of Cross-Lingual Word Embeddings

Anders Søgaard, Ivan Vulić, Sebastian Ruder, Manaal Faruqui

Synthesis lectures on human language technologies Year: 2019 Pages: 67-74