JOURNAL ARTICLE

Statistical cross-language information retrieval using n-best query translations

Marcello FedericoNicola Bertoldi

Year: 2002 Journal:   Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02

Abstract

This paper presents a novel statistical model for cross-language information retrieval. Given a written query in the source language, documents in the target language are ranked by integrating probabilities computed by two statistical models: a query-translation model, which generates most probable term-by-term translations of the query, and a query-document model, which evaluates the likelihood of each document and translation. Integration of the two scores is performed over the set of N most probable translations of the query. Experimental results with values N=1, 5, 10 are presented on the Italian-English bilingual track data used in the CLEF 2000 and 2001 evaluation campaigns.

Keywords:
Clef Computer science Query expansion Query language RDF query language Cross-language information retrieval Query optimization Web query classification Information retrieval Sargable Natural language processing Term (time) Set (abstract data type) Artificial intelligence Language model Web search query Search engine Programming language

Metrics

6
Cited By
0.65
FWCI (Field Weighted Citation Impact)
0
Refs
0.81
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Information Retrieval and Search Behavior
Physical Sciences →  Computer Science →  Information Systems
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.