JOURNAL ARTICLE

Scheduling distributed multiway spatial join queries: optimization models and algorithms

Abstract

Multiway spatial joins are a commonly occurring and fundamental type of query for spatial data processing. This article presents models and algorithms to schedule this type of query in distributed database systems while attempting to strike a balance between makespan and communication costs. We propose three algorithms based on combinatorial optimization methods: the well-known linear relaxation technique of rounding a solution generated by linear programming (LP), a more sophisticated Lagrangian Relaxation method (LR), as well as a greedy heuristic (GR) for baseline comparison. Our evaluation shows that a schedule built using GR consumes, on average, 22% more processing and communication resources than a more elaborate schedule constructed via the LR method, when scheduling a query for 64 machines. The schedule provided by LR is also, on average, an order of magnitude closer to the optimal schedule for a query compared to GR. We show that scheduling Gigabyte-size multiway queries before execution can reduce its processing time by an order of magnitude compared to state-of-the-art frameworks for spatial data processing that do not have this capability, and can significantly reduce the amount of shuffled data in the network.

Keywords:
Scheduling (production processes) Rounding Joins Schedule Job shop scheduling Linear programming relaxation Linear programming Lagrangian relaxation Greedy algorithm

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.35
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Advanced Database Systems and Queries
Physical Sciences →  Computer Science →  Computer Networks and Communications
Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Scheduling distributed multiway spatial join queries: optimization models and algorithms

Thiago Borges de OliveiraFábio M. CostaL. R. FouldsHumberto J. Longo

Journal:   International Journal of Geographical Information Systems Year: 2023 Vol: 37 (6)Pages: 1388-1419
JOURNAL ARTICLE

Optimization Models and Algorithms for Spatial Scheduling

Ghaith Rabadi

Journal:   ODU Digital Commons (Old Dominion University) Year: 2010
JOURNAL ARTICLE

Optimization Algorithms for Distributed Queries

Peter M. G. ApersAlan R. HevnerS. Bing Yao

Journal:   IEEE Transactions on Software Engineering Year: 1983 Vol: SE-9 (1)Pages: 57-68
© 2026 ScienceGate Book Chapters — All rights reserved.