JOURNAL ARTICLE

Semantic Cross-lingual Information Retrieval

Abstract

Cross lingual Information Retrieval (CLIR) refers to the information retrieval activities in which the query and/or documents may appear in different languages. Dictionary-based query translation has been a common method in CLIR systems. In these methods we face with the problem of translation ambiguity in which a single word in one language has more than one translation in the other language. In this paper we propose a hybrid approach to retrieve English documents relevant to Persian queries. In this approach we exploit a combination of phrase reorganization, pattern based phrase translation and query expansion before and after translation to improve the dictionary-based query translation. We also propose an improved probabilistic algorithm to choose the best translation of words and phrases. Finally, the documents will be ranked according to statistical language model with some translation steps. Our experimental results show that each of the mentioned methods can bring significant improvement over simple dictionary approaches.

Keywords:
Computer science Cross-language information retrieval Natural language processing Artificial intelligence Machine translation Phrase Query expansion Information retrieval Translation (biology) Rule-based machine translation Ranking (information retrieval) Bilingual dictionary

Metrics

8
Cited By
0.80
FWCI (Field Weighted Citation Impact)
18
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Semantic Web and Ontologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Information Retrieval and Search Behavior
Physical Sciences →  Computer Science →  Information Systems
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems

Related Documents

BOOK-CHAPTER

Cross-lingual Information Retrieval

Encyclopedia of Database Systems Year: 2009 Pages: 528-528
BOOK-CHAPTER

Cross-Lingual Information Retrieval

Christopher C. YangKar W. Li

IGI Global eBooks Year: 2004 Pages: 153-170
BOOK-CHAPTER

Cross-Lingual Information Retrieval

Christopher C. YangKar W. Li

IGI Global eBooks Year: 2011
BOOK-CHAPTER

Tamil English Cross Lingual Information Retrieval

T. Pattabhi R. K. RaoSobha Lalitha Devi

Lecture notes in computer science Year: 2013 Pages: 269-279
© 2026 ScienceGate Book Chapters — All rights reserved.