Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

Jan 1, 2024·
Yifu Yuan
,
HAO Jianye
,
Yi Ma
,
Zibin Dong
,
Hebin Liang
Jinyi Liu
Jinyi Liu
,
Zhixin Feng
,
Kai Zhao
,
Yan Zheng
· 0 min read
Type
Publication
The Twelfth International Conference on Learning Representations