JOURNAL ARTICLE

Natural Language Processing in Low-Resource Languages: Progress and Prospects

Ritul Phukan, Monalisa Daimari, Anupam Kharghoria, Biman Basumatary

Year: 2025 Journal:   Zenodo (CERN European Organization for Nuclear Research)   Publisher: European Organization for Nuclear Research

Abstract

Low-resource languageslanguages with limited annotated corpora, lexicons, and digital resourcespose major challenges for modern natural language processing (NLP). Recent progress in transfer learning, multilingual pretraining, parameter-efficient adaptation, data augmentation, and community-driven dataset creation has substantially improved capabilities for many such languages, yet large performance gaps remain compared to high-resource languages. This article surveys the technical advances that enable NLP for low-resource languages (including unsupervised and weakly supervised methods, multilingual and massively multilingual models, few-shot and in-context learning with large language models, and adapter/LoRA-style parameter-efficient fine-tuning). We examine practical pipelines for tasks such as machine translation, speech recognition, OCR, and information extraction; describe prominent dataset and community projects; summarize typical evaluation strategies and their pitfalls; and outline promising research directions (community data collection, privacy-preserving methods, on-device adaptation, and ethics-aware deployments). The review highlights approaches that balance performance, compute cost, and data-efficiency, and recommends research and deployment practices to accelerate inclusive language technology.

Keywords:
Software deployment Natural language Open research Computational linguistics Transfer of learning Natural language understanding

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.75
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

ICT in Developing Communities
Physical Sciences →  Computer Science →  Information Systems
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Big Data and Digital Economy
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Natural Language Processing in Low-Resource Languages: Progress and Prospects

Ritul Phukan, Monalisa Daimari, Anupam Kharghoria, Biman Basumatary

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2025
JOURNAL ARTICLE

Natural language processing applications for low-resource languages

Partha PakrayAlexander GelbukhSivaji Bandyopadhyay

Journal:   Natural language processing. Year: 2025 Vol: 31 (2)Pages: 183-197
JOURNAL ARTICLE

Question Answering for Low Resource Languages Using Natural Language Processing

Nirav A. Baldha

Journal:   International Journal of Scientific Research and Engineering Trends Year: 2022 Vol: 8 (2)Pages: 1122-1126
© 2026 ScienceGate Book Chapters — All rights reserved.