JOURNAL ARTICLE

Information Retrieval System in Bangla Document Ranking using Latent Semantic Indexing

Md. Nesarul HoqueMd. Rabiul IslamMd. Sajidul Karim

Year: 2019 Journal:   2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT) Vol: 18 Pages: 1-5

Abstract

Nowadays, like the English and other languages, Bangla also plays a significant role to strengthen the web repository. The storing rate of Bangla information is augmented day-by-day. Because of the numerous documents in the World Wide Web, it is very difficult for a user to retrieve the desired information. Furthermore, finding the useful documents tends to be more time spending as well as an annoying job. These demands emerge to develop an Information Retrieval (IR) system to document ranking for Bangla language. In this paper, we have built such a retrieval system where users can find their needed documents which correspond to their own query strings throughout the ranking index. Although a lot of works have been done for English and other languages to rank the documents, unfortunately, we have found a very negligible amount of contributions in Bangla Language. Many methods such as - Boolean model, Maximal Marginal Relevance (MMR), Portfolio Theory (PR), Quantum Probability Ranking Principle (QPRP), Query Directed Clustering (QDC), Vector-based TFIDF and so on, have been proposed to implement the document ranking system. Here, we have applied a new approach, called Latent Semantic Indexing (LSI) to do the same task for Bangla documents. LSI uses the mathematical method called Singular Value Decomposition (SVD). After that, we have applied the cosine similarity to rank all the documents. We believe that the performance result of our proposed system has reached the trustworthy level.

Keywords:
Computer science Information retrieval tf–idf Ranking (information retrieval) Bengali Relevance (law) Vector space model Cosine similarity Rank (graph theory) Cluster analysis Document retrieval Search engine indexing Artificial intelligence Natural language processing Term (time) Mathematics

Metrics

3
Cited By
0.00
FWCI (Field Weighted Citation Impact)
19
Refs
0.19
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Information Retrieval and Search Behavior
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Framework for Document Retrieval using Latent Semantic Indexing

Neelam PhadnisJayant Gadge

Journal:   International Journal of Computer Applications Year: 2014 Vol: 94 (14)Pages: 37-41
JOURNAL ARTICLE

Using latent semantic indexing for multilanguage information retrieval

Michael W. BerryPaul G. Young

Journal:   Computers and the Humanities Year: 1995 Vol: 29 (6)Pages: 413-429
JOURNAL ARTICLE

Large-scale information retrieval with latent semantic indexing

Todd A. LetscheMichael W. Berry

Journal:   Information Sciences Year: 1997 Vol: 100 (1-4)Pages: 105-137
© 2026 ScienceGate Book Chapters — All rights reserved.