DISSERTATION

Scheduling in Distributed Stream Processing Systems

Abstract

Stream processing systems receive continuous streams of messages carrying relatively raw information and produce streams of messages carrying processed information. The utility of a stream processing system depends, in part, on the accuracy and timeliness of its output. Streams in complex event processing systems are processed on distributed systems: each incoming message passes through several steps on different processors, and messages may be enqueued between steps. This work explores the problem of distributed dynamic control of streams to optimize the total utility provided by the system. A system can be controlled centrally or in a distributed manner. Under central control, a single controller maintains the state of the entire system and controls the operation of all processors. Under distributed control, each processor controls itself based on its own state and on information from other processors. A challenge of distributed control is that the timeliness of output depends only on the total end-to-end time and is otherwise independent of the delay at each individual processor, whereas the controller for each processor acts only on the steps running on that processor and cannot directly control the entire network. In this work, we discuss a framework for the design and analysis of control-based scheduling algorithms for a distributed stream processing system and illustrate the framework with two concrete scheduling algorithms.
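The end-to-end property described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the dissertation: the function names, the deadline of 10 time units, and the piecewise-linear utility shape are all hypothetical. The sketch only shows that utility is a function of the summed delay, so two messages whose per-processor delays differ but whose totals match receive the same utility.

```python
from typing import List, Sequence


def end_to_end_latency(stage_delays: Sequence[float]) -> float:
    """Total time a message spends in the system: the sum of per-processor delays."""
    return sum(stage_delays)


def utility(latency: float, deadline: float = 10.0) -> float:
    """Hypothetical utility curve (an assumption, not the dissertation's model).

    Full value up to the deadline, then linear decay reaching zero at 2 * deadline.
    """
    if latency <= deadline:
        return 1.0
    return max(0.0, 1.0 - (latency - deadline) / deadline)


def total_utility(messages: List[Sequence[float]]) -> float:
    """Sum of utilities over messages, each given as its list of per-processor delays."""
    return sum(utility(end_to_end_latency(delays)) for delays in messages)


if __name__ == "__main__":
    fast_then_slow = [1.0, 7.0]   # short delay on processor 1, long on processor 2
    slow_then_fast = [7.0, 1.0]   # the reverse split, same total of 8
    late = [8.0, 7.0]             # total of 15, past the assumed deadline

    # Same end-to-end time, so the same utility regardless of the split.
    print(utility(end_to_end_latency(fast_then_slow)))
    print(utility(end_to_end_latency(slow_then_fast)))
    print(total_utility([fast_then_slow, slow_then_fast, late]))
```

A local controller sees only the delay on its own processor, which is why the split between processors matters for control even though it does not matter for utility.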

Keywords:
Computer science; Scheduling (production processes); Stream processing; Distributed computing; Real-time computing; Process control; Control system; Data stream mining; Controller (irrigation); Process (computing); Engineering; Operating system

Metrics

Cited By: 4
FWCI (Field Weighted Citation Impact): 0.00
Refs: 42

Topics

Distributed systems and fault tolerance
Physical Sciences →  Computer Science →  Computer Networks and Communications
Distributed and Parallel Computing Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
Petri Nets in System Modeling
Physical Sciences →  Computer Science →  Computational Theory and Mathematics

Related Documents

JOURNAL ARTICLE

Scheduling in Distributed Stream Processing Systems

Khorlin, Andrey

Journal:   CaltechTHESIS (California Institute of Technology) Year: 2006
JOURNAL ARTICLE

Model-driven scheduling for distributed stream processing systems

Anshu Shukla, Yogesh Simmhan

Journal:   Journal of Parallel and Distributed Computing Year: 2018 Vol: 117 Pages: 98-114
JOURNAL ARTICLE

I-Scheduler: Iterative scheduling for distributed stream processing systems

Leila Eskandari, Jason Mair, Zhiyi Huang, David Eyers

Journal:   Future Generation Computer Systems Year: 2020 Vol: 117 Pages: 219-233