Distributed Online Stochastic-Constrained Convex Optimization With Bandit Feedback

Cong Wang; Shengyuan Xu; Deming Yuan

doi:10.1109/tcyb.2022.3177644

JOURNAL ARTICLE

Year: 2022; Journal: IEEE Transactions on Cybernetics; Vol: 54 (1); Pages: 63-75; Publisher: Institute of Electrical and Electronics Engineers

Abstract

This article studies distributed online stochastic convex optimization with a time-varying constraint over a multiagent system. The sequences of cost functions and constraint functions, both of which depend on dynamic parameters drawn from time-varying distributions, are unknown to the agents ahead of time. Agents in the network interact with their neighbors over a sequence of strongly connected, time-varying graphs. We develop an adaptive distributed bandit primal-dual algorithm whose step-size and regularization sequences are adaptive and require no prior knowledge of the total number of iterations T. The algorithm relies on bandit feedback, using a one-point or two-point gradient estimator to approximate gradient values. We show that if the drift of the benchmark sequence is sublinear, the algorithm achieves sublinear expected dynamic regret and constraint violation with either gradient estimator. A numerical experiment illustrates the performance of the proposed method.
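The one-point and two-point gradient estimators mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the estimator formulas, so the code below assumes the standard sphere-sampling forms from the bandit convex optimization literature; the function names, the exploration radius `delta`, and the quadratic test cost are all hypothetical choices made for illustration, not the article's own definitions.

```python
import math
import random


def unit_sphere_sample(dim, rng):
    """Draw a direction uniformly from the unit sphere by normalizing a Gaussian vector."""
    while True:
        g = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        norm = math.sqrt(sum(v * v for v in g))
        if norm > 0.0:
            return [v / norm for v in g]


def one_point_estimate(f, x, delta, rng):
    """One function query per round: (d / delta) * f(x + delta * u) * u."""
    d = len(x)
    u = unit_sphere_sample(d, rng)
    value = f([xi + delta * ui for xi, ui in zip(x, u)])
    return [(d / delta) * value * ui for ui in u]


def two_point_estimate(f, x, delta, rng):
    """Two function queries per round:
    (d / (2 * delta)) * (f(x + delta * u) - f(x - delta * u)) * u."""
    d = len(x)
    u = unit_sphere_sample(d, rng)
    f_plus = f([xi + delta * ui for xi, ui in zip(x, u)])
    f_minus = f([xi - delta * ui for xi, ui in zip(x, u)])
    scale = d * (f_plus - f_minus) / (2.0 * delta)
    return [scale * ui for ui in u]


def average_estimate(estimator, f, x, delta, rng, samples):
    """Average many independent estimates to expose the estimator's mean."""
    total = [0.0] * len(x)
    for _ in range(samples):
        g = estimator(f, x, delta, rng)
        total = [t + gi for t, gi in zip(total, g)]
    return [t / samples for t in total]


def quadratic(x):
    """Hypothetical test cost f(x) = ||x||^2, whose true gradient is 2x."""
    return sum(v * v for v in x)


if __name__ == "__main__":
    rng = random.Random(0)
    point = [1.0, -2.0, 0.5]
    print(average_estimate(two_point_estimate, quadratic, point, 0.01, rng, 20000))
```

Under these assumptions, both estimators are unbiased for the gradient of a smoothed version of the cost, but the one-point form scales a raw function value by `d / delta`, so its variance grows as `delta` shrinks. The two-point form uses a difference of function values and stays much better behaved, which is the usual reason for analyzing the two cases separately.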

Keywords:

Mathematical optimization; Convex function; Sequence (biology); Sublinear function; Convex optimization; Mathematics; Computer science; Benchmark (surveying); Regularization (linguistics); Regular polygon; Algorithm; Discrete mathematics; Artificial intelligence

Metrics

FWCI (Field Weighted Citation Impact): 2.00

Citation Normalized Percentile: 0.84

Topics

Advanced Bandit Algorithms Research

Social Sciences → Decision Sciences → Management Science and Operations Research

Stochastic Gradient Optimization Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Distributed Control Multi-Agent Systems

Physical Sciences → Computer Science → Computer Networks and Communications

Related Documents

Constrained distributed online convex optimization with bandit feedback for unbalanced digraphs

Stochastic Convex Optimization with Bandit Feedback

Event-triggered distributed online convex optimization with delayed bandit feedback

Push-sum Distributed Dual Averaging Online Convex Optimization With Bandit Feedback

Online Bandit Convex Optimization with Stochastic Constraints and Delays