JOURNAL ARTICLE

Term-specific smoothing for the language modeling approach to information retrieval

Abstract

This paper follows a formal approach to information retrieval based on statistical language models. Through some simple reformulations of the basic language modeling approach, we introduce the notion of the importance of a query term. The importance of a query term is an unknown parameter that explicitly models which of the query terms are generated from the relevant documents (the important terms) and which are not (the unimportant terms). The new language modeling approach is shown to explain a number of practical facts of today's information retrieval systems that are not well explained by the current state of information retrieval theory, including stop words, mandatory terms, coordination level ranking and retrieval using phrases.
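The abstract gives no formula, so the following is only a minimal sketch of how a per-term importance parameter could work. It assumes each query term is scored by a linear interpolation of a document model and a collection model, with the term's importance as the mixing weight. The names `score` and `importance`, the default weight of 0.5 and the maximum-likelihood estimates are illustrative assumptions, not taken from the paper.

```python
import math
from collections import Counter


def score(query_terms, doc_tokens, collection_tokens, importance):
    """Log-probability of a query under a per-term mixture model (sketch).

    importance maps a term to a hypothetical weight in [0, 1]:
    0 means the term is explained by the collection model alone,
    1 means the term must occur in the document to get a non-zero score.
    """
    doc_tf = Counter(doc_tokens)
    coll_tf = Counter(collection_tokens)
    doc_len = len(doc_tokens)
    coll_len = len(collection_tokens)

    log_p = 0.0
    for term in query_terms:
        lam = importance.get(term, 0.5)  # assumed default weight
        p_doc = doc_tf[term] / doc_len if doc_len else 0.0
        p_coll = coll_tf[term] / coll_len if coll_len else 0.0
        p = lam * p_doc + (1.0 - lam) * p_coll
        if p == 0.0:
            return float("-inf")  # the query cannot be generated at all
        log_p += math.log(p)
    return log_p
```

Under these assumptions, a term with importance 0 contributes the same amount to every document and so behaves like a stop word, while a term with importance 1 excludes every document that lacks it and so behaves like a mandatory term.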

Keywords:
Computer science; Ranking (information retrieval); Query language; Term (time); Language model; Smoothing; Query expansion; Term Discrimination; Information retrieval; RDF query language; Divergence-from-randomness model; Natural language processing; Artificial intelligence; Concept search; Web query classification; Search engine; Web search query; Probabilistic logic

Metrics

Cited By: 90
FWCI (Field Weighted Citation Impact): 13.23
Refs: 42
Citation Normalized Percentile: 0.99
Is in top 1%
Is in top 10%

Topics

Information Retrieval and Search Behavior
Physical Sciences →  Computer Science →  Information Systems
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Term-specific smoothing for the language modeling approach to information retrieval

Djoerd Hiemstra

Journal: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02
Year: 2002

JOURNAL ARTICLE

A Language Modeling Approach to Information Retrieval

Jay Ponte, W. Bruce Croft

Journal: ACM SIGIR Forum
Year: 2017
Vol: 51 (2)
Pages: 202-208

JOURNAL ARTICLE

Enhancing information retrieval through concept‐based language modeling and semantic smoothing

Lynda Said Lhadj, Mohand Boughanem, Karima Amrouche

Journal: Journal of the Association for Information Science and Technology
Year: 2015
Vol: 67 (12)
Pages: 2909-2927