論文 Hugging Face 発表: 2026-05-26 HF ↑72

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

著者: Hongru Hou, Tiehua Mei, Denghui Geng, Jinhui Huang, Ao Xu ほか3名

要約

Proactive Recommender Systems (PRSs) aim to guide user preference shift toward target items by generating paths of intermediate recommendations. Reinforcement learning (RL) provides a principled framework for optimizing such sequential decision tasks, as path rewards can naturally capture both short…

#rl

同じカテゴリの記事