We propose a Bayesian extension to the ad-hoc Language Model. Many smoothed estimators used for the multinomial query model in ad-hoc Language Models (including Laplace and Bayes-smoothing) are approximations to the Bayesian predictive distribution. In this paper we derive the full predictive distribution in a form amenable to implementation by classical IR models, and then compare it to other currently used estimators. In our experiments the proposed model outperforms Bayes-smoothing, and its combination with linear interpolation smoothing outperforms all other estimators.
Hugo Zaragoza, Djoerd Hiemstra, Michael E. Tipping