Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

Jan 1, 2024

Yifu Yuan, HAO Jianye, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng

Type

Conference paper

Publication

The Twelfth International Conference on Learning Representations

Overview

A unified platform and benchmark suite for reinforcement learning with diverse human feedback.

Venue. The Twelfth International Conference on Learning Representations

Last updated on Apr 26, 2025
