Abstract

We study the problem of computing a conjunctive query q in parallel, using p of servers, on a large database. We consider algorithms with one round of communication, and study the complexity of the communication. We are especially interested in the case where the data is skewed, which is a major challenge for scalable parallel query processing. We establish a tight connection between the fractional edge packing of the query and the amount of communication in two cases. First, in the case when the only statistics on the database are the cardinalities of the input relations, and the data is skew-free, we provide matching upper and lower bounds (up to a polylogarithmic factor of p) expressed in terms of fractional edge packings of the query q. Second, in the case when the relations are skewed and the heavy hitters and their frequencies are known, we provide upper and lower bounds expressed in terms of packings of residual queries obtained by specializing the query to a heavy hitter. All our lower bounds are expressed in the strongest form, as number of bits needed to be communicated between processors with unlimited computational power. Our results generalize prior results on uniform databases (where each relation is a matching) [4], and lower bounds for the MapReduce model [1].

Keywords:
Scalability Skew Computer science Matching (statistics) Boolean conjunctive query Upper and lower bounds Conjunctive query Query optimization Enhanced Data Rates for GSM Evolution Communication complexity Parallel database Sargable Parallel computing Theoretical computer science Mathematics Database Search engine Relational database Web search query Information retrieval

Metrics

105
Cited By
12.02
FWCI (Field Weighted Citation Impact)
18
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Complexity and Algorithms in Graphs
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Graph Theory and Algorithms
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Database Systems and Queries
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

BOOK-CHAPTER

Parallel Query Processing

Esther Pacitti

Encyclopedia of Database Systems Year: 2016 Pages: 1-4
BOOK-CHAPTER

Parallel query processing

P.S. YuMan ChenJoel L. WolfJavier S. Turek

Lecture notes in computer science Year: 1993 Pages: 229-258
BOOK-CHAPTER

Parallel Query Processing

Esther Pacitti

Encyclopedia of Database Systems Year: 2009 Pages: 2038-2040
BOOK-CHAPTER

Parallel Query Processing

Esther Pacitti

Encyclopedia of Database Systems Year: 2018 Pages: 2677-2680
© 2026 ScienceGate Book Chapters — All rights reserved.