Abstract

Cross-language information retrieval (CLIR) today is dominated by techniques that use token-to-token mappings from bilingual dictionaries. Yet, state-of-the-art statistical translation models (e.g., using Synchronous Context-Free Grammars) are far richer, capturing multi-term phrases, term dependencies, and contextual constraints on translation choice. We present a novel CLIR framework that is able to reach inside the translation "black box" and exploit these sources of evidence. Experiments on the TREC-5/6 English-Chinese test collection show this approach to be promising.

Keywords:
Computer science Security token Exploit Cross-language information retrieval Natural language processing Artificial intelligence Context (archaeology) Rule-based machine translation Term (time) Translation (biology) Machine translation Information retrieval

Metrics

17
Cited By
4.17
FWCI (Field Weighted Citation Impact)
8
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Looking Inside

Massimo GarbuioAdelaide Wilcox KingDan Lovallo

Journal:   Journal of Management Year: 2011 Vol: 37 (5)Pages: 1444-1463
BOOK-CHAPTER

Looking Inside

Lauren Long Ferris

Year: 2025 Pages: 12-16
BOOK-CHAPTER

Looking Inside

Howard BurtonJohn Duncan

Open Agenda Publishing eBooks Year: 2020 Pages: 223-227
BOOK-CHAPTER

Looking Inside

Open Agenda Publishing eBooks Year: 2020
JOURNAL ARTICLE

Looking Inside

Karen Salyer McElmurray

Journal:   Appalachian heritage Year: 2014 Vol: 42 (2)Pages: 85-102
© 2026 ScienceGate Book Chapters — All rights reserved.