JOURNAL ARTICLE

Optimizing Skyline Query Processing in Incomplete Data

Yonis GulzarAli A. AlwanSherzod Turaev

Year: 2019 Journal:   IEEE Access Vol: 7 Pages: 178121-178138   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Given the significance of skyline queries, they are incorporated in various modern applications including personalized recommendation systems as well as decision-making and decision-support systems. Skyline queries are used to identify superior data items in the database. Most of the previously proposed skyline algorithms work on a complete database where the data are always present (non-missing). However, in many contemporary real-world databases, particularly those databases with large cardinality and high dimensionality, such assumption is not necessarily valid. Hence, missing data pose new challenges if the processing skyline queries cannot easily apply those methods that are designed for complete data. This is due to the fact that imperfect data cause the loss of the transitivity property of the skyline method and cyclic dominance. This paper presents a framework called Optimized Incomplete Skyline (OIS) which utilizes a technique that simplifies the skyline process on a database with missing data and helps prune the data items before performing the skyline process. The proposed strategy assures that the number of the domination tests is significantly reduced. A set of experiments has been accomplished using both real and synthetic datasets aimed at validating the performance of the framework. The experiment results confirm that the OIS framework is indeed superior and steadily outperforms the current approaches in terms of the number of domination tests required to retrieve the skylines.

Keywords:
Skyline Computer science Data mining Cardinality (data modeling) Missing data Process (computing) Set (abstract data type) Curse of dimensionality Database Information retrieval Machine learning

Metrics

17
Cited By
2.63
FWCI (Field Weighted Citation Impact)
54
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Geographic Information Systems Studies
Social Sciences →  Social Sciences →  Geography, Planning and Development
Advanced Database Systems and Queries
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.