Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
Jan 1, 2024
Yifu Yuan
HAO Jianye
Yi Ma
Zibin Dong
Hebin Liang
Jinyi Liu
Zhixin Feng
Kai Zhao
Yan Zheng
Overview
A unified platform and benchmark suite for reinforcement learning with diverse human feedback.
Venue. The Twelfth International Conference on Learning Representations (ICLR 2024)