Non-Stationary Contextual Multi-Armed Bandit with Application in Online Recommendations

Muhammad Hassan

doi:10.18130/v3rt0d

ScienceGate Book Chapters

DISSERTATION

Non-Stationary Contextual Multi-Armed Bandit with Application in Online Recommendations

Muhammad Hassan

Year: 2015

DOI: 10.18130/v3rt0d

Get Full-Text PDF Get Analytical Report

Abstract

The only constant in the online social behavior is forever changing user intent and preferences.These changes could be inspired by myriad of factors but still have an overall trend e.g.todays popular news will be stale tomorrow etc.Such patterns are especially noticeable in viral trends where an immediate gain in popularity is followed by gradual lost of holistic interest.In this study we focus on design of a recommendation system which accounts for this non-stationary behavior of declining popularity.Contextual multi-armed bandit (contextual MAB) is a popular framework for learning user behavior and personalized recommendations based on the past behavior.Fundamentally MAB solves the exploitationexploration dilemma which aims at minimal guided experimentation required to gain certain level of confidence in its recommendation.Traditionally contextual MABs (e.g.LinUCB) have been used to model stationary user behavior that is not appropriate for the target environment where LinUCB can accumulate linear regret.Here we extend this LinUCB type algorithm to model a decaying environment.We present three algorithms with variable levels of specificity in the assumptions they make about the non-stationary environment.We show by simulation the e↵ectiveness of our methods which illustrates the usefulness of modeling meta-trends in user behavior.

Keywords:

Popularity Regret Computer science Dilemma Multi-armed bandit Focus (optics) Recommender system Social media Artificial intelligence Machine learning World Wide Web Mathematics Psychology

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Bandit Algorithms Research

Social Sciences → Decision Sciences → Management Science and Operations Research

Machine Learning and Algorithms

Physical Sciences → Computer Science → Artificial Intelligence

Smart Grid Energy Management

Physical Sciences → Engineering → Electrical and Electronic Engineering

Non-Stationary Contextual Multi-Armed Bandit with Application in Online Recommendations

Abstract

Metrics

Citation History

Topics

Related Documents

Contextual Multi-Armed Bandit With Costly Feature Observation in Non-Stationary Environments

Stochastic Multi-Armed-Bandit Problem with Non-stationary Rewards

Dynamic spectrum access with non-stationary Multi-Armed Bandit

The non-stationary stochastic multi-armed bandit problem

Con-CNAME: A Contextual Multi-armed Bandit Algorithm for Personalized Recommendations