Thompson Sampling for Bandit Learning in Matching Markets

Fang Kong; Junming Yin; Shuai Li

doi:10.24963/ijcai.2022/439

ScienceGate Book Chapters

JOURNAL ARTICLE

Thompson Sampling for Bandit Learning in Matching Markets

Fang Kong Junming Yin Shuai Li

Year: 2022 Journal: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence Pages: 3164-3170

DOI: 10.24963/ijcai.2022/439

Get Full-Text PDF Get Analytical Report

Abstract

The problem of two-sided matching markets has a wide range of real-world applications and has been extensively studied in the literature. A line of recent works have focused on the problem setting where the preferences of one-side market participants are unknown a priori and are learned by iteratively interacting with the other side of participants. All these works are based on explore-then-commit (ETC) and upper confidence bound (UCB) algorithms, two common strategies in multi-armed bandits (MAB). Thompson sampling (TS) is another popular approach, which attracts lots of attention due to its easier implementation and better empirical performances. In many problems, even when UCB and ETC-type algorithms have already been analyzed, researchers are still trying to study TS for its benefits. However, the convergence analysis of TS is much more challenging and remains open in many problem settings. In this paper, we provide the first regret analysis for TS in the new setting of iterative matching markets. Extensive experiments demonstrate the practical advantages of the TS-type algorithm over the ETC and UCB-type baselines.

Keywords:

Computer science Regret Thompson sampling A priori and a posteriori Matching (statistics) Commit Convergence (economics) Machine learning Sampling (signal processing) Range (aeronautics) Artificial intelligence Mathematical optimization Filter (signal processing) Mathematics Statistics Engineering

Metrics

Cited By

2.03

FWCI (Field Weighted Citation Impact)

Refs

0.86

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Bandit Algorithms Research

Social Sciences → Decision Sciences → Management Science and Operations Research

Optimization and Search Problems

Physical Sciences → Computer Science → Computer Networks and Communications

Auction Theory and Applications

Social Sciences → Decision Sciences → Management Science and Operations Research

Thompson Sampling for Bandit Learning in Matching Markets

Abstract

Metrics

Citation History

Topics

Related Documents

Contextual Bandit for Active Learning: Active Thompson Sampling

Thompson Sampling for Bandit Convex Optimization

Bandit Learning in Many-to-One Matching Markets

A Survey on Bandit Learning in Matching Markets

A Survey on Bandit Learning in Matching Markets