Project Jan 2024 1 min read

Uni-RLHF

Public platform page for Uni-RLHF, emphasizing the benchmark, interface, and reproducible workflow for RLHF experimentation.

Explore Platform Read Paper

Related publication: Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback · The Twelfth International Conference on Learning Representations

PbRL RLHF

Uni-RLHF is best understood as infrastructure: a common platform for setting up, comparing, and reproducing RLHF pipelines under one interface instead of scattered experiment scripts.

This page focuses on the platform and benchmark experience: how tasks, feedback sources, and evaluation workflows are organized for actual use rather than described only at the paper level.

Use the project site if you want to see the system framing and public-facing structure first. Use the paper page if you want the research motivation behind the benchmark suite.

Last updated on Jan 1, 2024

PbRL RLHF

Authors

Jinyi Liu

Ph.D. Candidate Reinforcement Learning and LLM Systems

← CellAgent Jan 1, 2025

Back to Projects Publication Live Site

More Projects

LLM Agent Tutorial Public tutorial hub for learning LLM agents through guided concepts, practical examples, … CellAgent CellAgent is presented here as a usable research system rather than only a paper artifact. …