JOURNAL ARTICLE

Online Second Price Auction with Semi-Bandit Feedback under the Non-Stationary Setting

Haoyu ZhaoWei Chen

Year: 2020 Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Vol: 34 (04)Pages: 6893-6900   Publisher: Association for the Advancement of Artificial Intelligence

Abstract

In this paper, we study the non-stationary online second price auction problem. We assume that the seller is selling the same type of items in T rounds by the second price auction, and she can set the reserve price in each round. In each round, the bidders draw their private values from a joint distribution unknown to the seller. Then, the seller announced the reserve price in this round. Next, bidders with private values higher than the announced reserve price in that round will report their values to the seller as their bids. The bidder with the highest bid larger than the reserved price would win the item and she will pay to the seller the price equal to the second-highest bid or the reserve price, whichever is larger. The seller wants to maximize her total revenue during the time horizon T while learning the distribution of private values over time. The problem is more challenging than the standard online learning scenario since the private value distribution is non-stationary, meaning that the distribution of bidders' private values may change over time, and we need to use the non-stationary regret to measure the performance of our algorithm. To our knowledge, this paper is the first to study the repeated auction in the non-stationary setting theoretically. Our algorithm achieves the non-stationary regret upper bound Õ(min{√S T, V¯⅓T⅔), where S is the number of switches in the distribution, and V¯ is the sum of total variation, and S and V¯ are not needed to be known by the algorithm. We also prove regret lower bounds Ω(√S T) in the switching case and Ω(V¯⅓T⅔) in the dynamic case, showing that our algorithm has nearly optimal non-stationary regret.

Keywords:
Reservation price Regret Revenue Vickrey auction Common value auction Microeconomics Distribution (mathematics) Economics Value (mathematics) Mathematical economics Auction theory Mathematics Finance Statistics

Metrics

9
Cited By
1.78
FWCI (Field Weighted Citation Impact)
40
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Bandit Algorithms Research
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Auction Theory and Applications
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Online Influence Maximization With Semi-Bandit Feedback Under Corruptions

Xiaotong ChengBehzad Nourani-KolijiSetareh Maghsudi

Journal:   IEEE Transactions on Network Science and Engineering Year: 2025 Vol: 12 (3)Pages: 2308-2321
JOURNAL ARTICLE

Non-stochastic Budgeted Online Pricing with Semi-Bandit Feedback

Xiang LiuHau ChanMinming LiWeiwei WuLong Tran-Thanh

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2025 Vol: 39 (18)Pages: 18978-18986
JOURNAL ARTICLE

Online learning for auction mechanism in bandit setting

Di HeWei ChenLiwei WangTie‐Yan Liu

Journal:   Decision Support Systems Year: 2013 Vol: 56 Pages: 379-386
JOURNAL ARTICLE

Online Nonsubmodular Optimization with Delayed Feedback in the Bandit Setting

Sifan YangYuanyu WanLijun Zhang

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2025 Vol: 39 (20)Pages: 21992-22000
© 2026 ScienceGate Book Chapters — All rights reserved.