JOURNAL ARTICLE

Log-Bilinear Document Language Model for Ad-hoc Information Retrieval

Abstract

Incorporating semantic information into document representation is effective and potentially significant to improve retrieval performance. Recently, log-bilinear language model (LBL), as a form of neural language model, has been proved to be an effective way to learn semantic word representations, but its feasibility and effectiveness in information retrieval is mostly unknown. In this paper, we study how to efficiently use LBL to improve as-hoc retrieval. We propose a log-bilinear document language model (LB-DM) within the language modeling framework. The key idea is to learn semantically oriented representations for words, and estimate document language models based on these representations. Noise-constrictive estimation is employed to perform fast training on large document collections. Experiment results on standard TREC collections show that LB-DM performs better than translation language model and LDA-based retrieval model.

Keywords:
Computer science Language model Natural language processing Artificial intelligence Information retrieval Word (group theory) Question answering Machine translation Key (lock) Semantics (computer science) Programming language Linguistics

Metrics

1
Cited By
0.48
FWCI (Field Weighted Citation Impact)
18
Refs
0.75
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Information Retrieval and Search Behavior
Physical Sciences →  Computer Science →  Information Systems
Data Quality and Management
Social Sciences →  Decision Sciences →  Management Science and Operations Research

Related Documents

JOURNAL ARTICLE

Bayesian extension to the language model for ad hoc information retrieval

Hugo ZaragozaDjoerd HiemstraMichael E. Tipping

Journal:   Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval - SIGIR '03 Year: 2003
JOURNAL ARTICLE

Clusters, language models, and ad hoc information retrieval

Oren KurlandLillian Lee

Journal:   ACM Transactions on Information Systems Year: 2009 Vol: 27 (3)Pages: 1-39
JOURNAL ARTICLE

Utilizing passage-based language models for ad hoc document retrieval

Michael BenderskyOren Kurland

Journal:   Information Retrieval Year: 2009 Vol: 13 (2)Pages: 157-187
© 2026 ScienceGate Book Chapters — All rights reserved.