JOURNAL ARTICLE

New Language Models for Spelling Correction

Saida LaaroussiSi Lhoussain AouraghAbdellah YousfiMohammed NejjaHicham GeddahSaïd Ouatik El Alaoui

Year: 2022 Journal:   The International Arab Journal of Information Technology Vol: 19 (6)   Publisher: Zarqa University

Abstract

Correcting spelling errors based on the context is a fairly significant problem in Natural Language Processing (NLP) applications. The majority of the work carried out to introduce the context into the process of spelling correction uses the n-gram language models. However, these models fail in several cases to give adequate probabilities for the suggested solutions of a misspelled word in a given context. To resolve this issue, we propose two new language models inspired by stochastic language models combined with edit distance. A first phase consists in finding the words of the lexicon orthographically close to the erroneous word and a second phase consists in ranking and limiting these suggestions. We have applied the new approach to Arabic language taking into account its specificity of having strong contextual connections between distant words in a sentence. To evaluate our approach, we have developed textual data processing applications, namely the extraction of distant transition dictionaries. The correction accuracy obtained exceeds 98% for the first 10 suggestions. Our approach has the advantage of simplifying the parameters to be estimated with a higher correction accuracy compared to n-gram language models. Hence the need to use such an approach.

Keywords:
Computer science Natural language processing Spelling Artificial intelligence Language model Lexicon Context (archaeology) Sentence Word (group theory) Ranking (information retrieval) n-gram Natural language Linguistics

Metrics

2
Cited By
0.39
FWCI (Field Weighted Citation Impact)
0
Refs
0.63
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text Readability and Simplification
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.