JOURNAL ARTICLE

Top-k query processing in probabilistic databases with non-materialized views

Abstract

We investigate a novel approach of computing confidence bounds for top-k ranking queries in probabilistic databases with non-materialized views. Unlike related approaches, we present an exact pruning algorithm for finding the top-ranked query answers according to their marginal probabilities without the need to first materialize all answer candidates via the views. Specifically, we consider conjunctive queries over multiple levels of select-project-join views, the latter of which are cast into Datalog rules which we ground in a top-down fashion directly at query processing time. To our knowledge, this work is the first to address integrated data and confidence computations for intensional query evaluations in the context of probabilistic databases by considering confidence bounds over first-order lineage formulas. We extend our query processing techniques by a tool-suite of scheduling strategies based on selectivity estimation and the expected impact on confidence bounds. Further extensions to our query processing strategies include improved top-k bounds in the case when sorted relations are available as input, as well as the consideration of recursive rules. Experiments with large datasets demonstrate significant runtime improvements of our approach compared to both exact and sampling-based top-k methods over probabilistic data.

Keywords:
Computer science Query optimization Probabilistic logic Materialized view Probabilistic database Conjunctive query Sargable Database Ranking (information retrieval) Data mining View Query expansion Joins Web search query Theoretical computer science Relational database Information retrieval Database theory Search engine Programming language Artificial intelligence Database design

Metrics

34
Cited By
6.80
FWCI (Field Weighted Citation Impact)
46
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Bayesian Modeling and Causal Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Database Systems and Queries
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Probabilistic top-k range query processing for uncertain databases

Guoqing XiaoFan WuXu ZhouKeqin Li

Journal:   Journal of Intelligent & Fuzzy Systems Year: 2016 Vol: 31 (2)Pages: 1109-1120
JOURNAL ARTICLE

Queries and materialized views on probabilistic databases

Nilesh DalviChristopher RéDan Suciu

Journal:   Journal of Computer and System Sciences Year: 2010 Vol: 77 (3)Pages: 473-490
JOURNAL ARTICLE

Top-k query processing on probabilistic data

Fan ZhouShuquan LiChun-jing XIAOYue Wu

Journal:   JOURNAL OF ELECTRONIC MEASUREMENT AND INSTRUMENT Year: 2010 Vol: 24 (7)Pages: 650-657
JOURNAL ARTICLE

Maintenance of top-k materialized views

Eftychia BaikousiPanos Vassiliadis

Journal:   Distributed and Parallel Databases Year: 2009 Vol: 27 (2)Pages: 95-137
© 2026 ScienceGate Book Chapters — All rights reserved.