Xu ChenZhenlei WangHongteng XuJingsen ZhangYongfeng ZhangWayne Xin ZhaoJi-Rong Wen
Sequential recommendation has recently attracted increasing attention from the industry and academic communities. While previous models have achieved remarkable successes, an important problem may still hinder their performances, that is, the sparsity of the real-world data. In this paper, we propose a novel counterfactual data augmentation framework to alleviate the problem of data sparsity. In specific, our framework contains a sampler model and an anchor model. The sampler model aims to generate high-quality user behavior sequences, while the anchor model is trained based on the original and new generated samples, and leveraged to provide the final recommendation list. To implement the sampler model, we first design four types of heuristic methods based on either random or frequency-based strategies. And then, to improve the quality of the generated sequences, we propose two learning-based samplers by discovering the decision boundaries or increasing the sample informativeness. At last, we build an RL based model to automatically determine where to edit the history behaviors and how many items should be replaced. Considering that the sampler model can be imperfect, we, at last, analyze the influence of the noisy information contained in the generated sequences on the anchor model in theory, and design a simple but effective method to better serve the anchor model. We conduct extensive experiments to demonstrate the effectiveness of our model.
Zhenlei WangJingsen ZhangHongteng XuXu ChenYongfeng ZhangWayne Xin ZhaoJi-Rong Wen
Haiyang WangYan ChuHui NingZhengkui WangWen Shan
Shixuan ZhuQi ShenChuan CuiYu JiYiming ZhangZhenwei DongZhihua Wei
Zhiqiang WangJiayi PanXingwang ZhaoJianqing LiangChenjiao FengKaixuan Yao
Zewei XuXuejun LiuZezhou XingJiasheng CaoTao HeXiaoyang Huang