Jinyi Liu (刘金毅)
Open Menu
Close Menu
Bio
Papers
News
Experience
Projects
PbRL
Optimizing Reward Models with Proximal Policy Exploration in Preference-Based Reinforcement Learning
Jul 1, 2024
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
Jan 1, 2024
Uni-RLHF
Universal Platform for Reinforcement Learning with Diverse Feedback Types.
Jan 1, 2024