JOURNAL ARTICLE

Adaptive Algorithm for Multi-Armed Bandit Problem with High-Dimensional Covariates

Wei QianChing‐Kang IngJi Liu

Year: 2022 Journal:   Journal of the American Statistical Association Vol: 119 (546)Pages: 970-982

Abstract

This article studies an important sequential decision making problem known as the multi-armed stochastic bandit problem with covariates. Under a linear bandit framework with high-dimensional covariates, we propose a general multi-stage arm allocation algorithm that integrates both arm elimination and randomized assignment strategies. By employing a class of high-dimensional regression methods for coefficient estimation, the proposed algorithm is shown to have near optimal finite-time regret performance under a new study scope that requires neither a margin condition nor a reward gap condition for competitive arms. Based on the synergistically verified benefit of the margin, our algorithm exhibits adaptive performance that automatically adapts to the margin and gap conditions, and attains optimal regret rates simultaneously for both study scopes, without or with the margin, up to a logarithmic factor. Besides the desirable regret performance, the proposed algorithm simultaneously generates useful coefficient estimation output for competitive arms and is shown to achieve both estimation consistency and variable selection consistency. Promising empirical performance is demonstrated through extensive simulation and two real data evaluation examples. Supplementary materials for this article are available online.

Keywords:
Regret Margin (machine learning) Covariate Consistency (knowledge bases) Computer science Logarithm Mathematical optimization Variable (mathematics) Smoothing Algorithm Mathematics Artificial intelligence Machine learning

Metrics

5
Cited By
1.00
FWCI (Field Weighted Citation Impact)
62
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Bandit Algorithms Research
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Optimization and Search Problems
Physical Sciences →  Computer Science →  Computer Networks and Communications
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

The multi-armed bandit problem with covariates

Vianney PerchetPhilippe Rigollet

Journal:   The Annals of Statistics Year: 2013 Vol: 41 (2)
BOOK-CHAPTER

Dynamic Multi-Armed Bandit with Covariates

Pavlidis Nicos G.Tasoulis Dimitris K.Adams Niall M.Hand David J.

Frontiers in artificial intelligence and applications Year: 2008
JOURNAL ARTICLE

A non-parametric solution to the multi-armed bandit problem with covariates

Mingyao AiYimin HuangJun Yu

Journal:   Journal of Statistical Planning and Inference Year: 2020 Vol: 211 Pages: 402-413
© 2026 ScienceGate Book Chapters — All rights reserved.