JOURNAL ARTICLE

Adaptive Processing for Distributed Skyline Queries over Uncertain Data

Xu ZhouKenli LiYantao ZhouKeqin Li

Year: 2015 Journal:   IEEE Transactions on Knowledge and Data Engineering Vol: 28 (2)Pages: 371-384   Publisher: IEEE Computer Society

Abstract

Query processing over uncertain data has gained growing attention, because it is necessary to deal with uncertain data in many real-life applications. In this paper, we investigate skyline queries over uncertain data in distributed environments (DSUD query) whose research is only in an early stage. The state-of-the-art algorithm, called e-DSUD algorithm, is designed for processing this query. It has the desirable characteristics of progressiveness and minimum bandwidth consumption. However, it still needs to be perfected in three aspects. (1) Progressiveness. Each time it only returns one query result at most. (2) Efficiency. There are a significant amount of redundant I/O cost and numerous iterations which causes a long total query time. (3) Universality. It is restricted to the case where local skyline tuples are incomparability. To address these concerns, we first present a detailed analysis of the e-DSUD algorithm and then develop an improved framework for the DSUD query, namely IDSUD. Based on the new framework, we propose an adaptive algorithm, called ADSUD, for the DSUD query. In the algorithm, we redefine the approximate global skyline probability and choose local representative tuples due to minimum probabilistic bounding rectangle adaptively. Furthermore, we design a progressive pruning method and apply the reuse mechanism to improve its efficiency. The results of extensive experiments verify the better overall performance of our algorithm than the e-DSUD algorithm.

Keywords:
Computer science Skyline Tuple Uncertain data Query optimization Data mining Probabilistic logic Distributed database Bounding overwatch Theoretical computer science Distributed computing Artificial intelligence

Metrics

83
Cited By
11.16
FWCI (Field Weighted Citation Impact)
30
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Advanced Database Systems and Queries
Physical Sciences →  Computer Science →  Computer Networks and Communications
Constraint Satisfaction and Optimization
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.