Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning
Yiwen Zhu, Jinyi Liu, Pengjie Gu, Yifu Yuan, Zhenxing Ge, Wenya Wei, Zhou Fang, Yujing Hu, Bo An
NeurIPS 2025 · Sep 2025
Use the tag controls below to browse publications by DRL, LLM Agent, LLM Post-training (RL Tuning), LLM Post-training (TTS), or Embodied AI focus areas.
Yiwen Zhu, Jinyi Liu, Pengjie Gu, Yifu Yuan, Zhenxing Ge, Wenya Wei, Zhou Fang, Yujing Hu, Bo An
NeurIPS 2025 · Sep 2025
Jinyi Liu, Yan Zheng, Rong Cheng, Qiyu Wu, Wei Guo, Fei Ni, Hebin Liang, Yifu Yuan, Hangyu Mao, Fuzheng Zhang, others
SCALR@COLM 2025 · Aug 2025
A lightweight large language model inference framework that performs structured and fine-grained natural language reasoning without the need for complex search and external tools.
Longxin Kou, Fei Ni, Jianye HAO, Peilong Han, Jinyi Liu, Haiqin Cui, Rui Liu, YAN ZHENG
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 · Jul 2025
This paper presents RoboAnnotatorX, a comprehensive and universal framework for annotating long-horizon robot demonstrations to enable accurate understanding.
Jing Liang, Hongyao Tang, Yi Ma, Jinyi Liu, YAN ZHENG, Shuyue Hu, Lei Bai, Jianye HAO
arXiv preprint arXiv:2507.06892 · Jul 2025
This paper introduces an efficient method for finetuning Large Language Models (LLMs) using off-policy reinforcement learning, aiming to improve performance while minimizing computational resources.
Qian Zhang, Yan Zheng, Jinyi Liu, Hebin Liang, Lanjun Wang, Jianye Hao
ICML 2025 Workshop on Multi-Agent Systems in the Era of Foundation Models: Opportunities, Challenges and Futures · Jun 2025
Yibin Chen, Jinyi Liu, YAN ZHENG, Yifu Yuan, Jianye HAO
Findings of the Association for Computational Linguistics: ACL 2025 · May 2025
This paper investigates how competitive mechanisms can enhance the reasoning capabilities of Large Language Models (LLMs), leading to improved performance on complex tasks.
Rong Cheng, Jinyi Liu, YAN ZHENG, Fei Ni, Jiazhen Du, Hangyu Mao, Fuzheng Zhang, Bo Wang, Jianye HAO
Proceedings of the Association for Computational Linguistics: ACL 2025 · May 2025
Yifu Yuan, Haiqin Cui, Yibin Chen, Zibin Dong, Fei Ni, Longxin Kou, Jinyi Liu, Pengyi Li, Yan Zheng, Jianye Hao
arXiv preprint arXiv:2505.08548 · May 2025
Qian Zhang, YAN ZHENG, Jinyi Liu, Hebin Liang, Lanjun Wang
AAAI Conference on Artificial Intelligence, 2025 (Poster) · Feb 2025
Analyzes mediator roles and decisive voices within multi-agent debate frameworks, revealing how influence shifts throughout deliberation.
Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao, Hangyu Mao, Fuzheng Zhang
Proceedings of the ACM on Web Conference 2025 · Jan 2025
SheetAgent, an novel autonomous agent that utilizes the power of LLMs.
Yihang Xiao, Jinyi Liu, Yan Zheng, Xiaohan Xie, Jianye Hao, Mingzhi Li, Ruitao Wang, Fei Ni, Yuxiao Li, Jintian Luo, others
BioRxiv · Aug 2024
An LLM-driven multi-agent framework for single-cell data analysis, ensuring high-quality results with minimal effort.
Yiwen Zhu, Jinyi Liu, Yifu Yuan, Wenya Wei, Zhenxing Ge, Zhou Fang, Yujing Hu, Bo An, others
NeurIPS 2024 Workshop on Behavioral Machine Learning · Jul 2024
Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun
Proceedings of the AAAI Conference on Artificial Intelligence · May 2024
We propose Optimistic Value Distribution Explorer (OVD-Explorer) to achieve a noise-aware optimistic exploration for continuous control.
Jinyi Liu, Yi Ma, Jianye Hao, Yujing Hu, Yan Zheng, Tangjie Lv, Changjie Fan
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems · May 2024
Organizing samples in a trajective manner can improve the learning efficiency for offline RL algorithms.
Jinyi Liu, Yifu Yuan, Jianye Hao, Fei Ni, Lingzhi Fu, Yibin Chen, Yan Zheng
arXiv preprint arXiv:2402.14245 · Feb 2024
Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence · Jan 2024
Yifu Yuan, HAO Jianye, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, YAN ZHENG
The Twelfth International Conference on Learning Representations · Jan 2024
Fei Ni, Jianye Hao, Shiguang Wu, Longxin Kou, Yifu Yuan, Zibin Dong, Jinyi Liu, MingZhi Li, Yuzheng Zhuang, Yan Zheng
Advances in Neural Information Processing Systems · Jan 2024
Longxin Kou, Fei Ni, Yan Zheng, Jinyi Liu, Yifu Yuan, Zibin Dong, Jianye Hao
Forty-first International Conference on Machine Learning · Jan 2024
Kai Zhao, Jianye Hao, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence · Jan 2024
Yi Ma, Chao Wang, Chen Chen, Jinyi Liu, Zhaopeng Meng, Yan Zheng, Jianye Hao
CAAI Artificial Intelligence Research · Jan 2023
Jianye Hao, Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Zhaopeng Meng, Peng Liu, Zhen Wang
IEEE Transactions on Neural Networks and Learning Systems · Jan 2023
Yifu Yuan, HAO Jianye, Fei Ni, Yao Mu, YAN ZHENG, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan
The Eleventh International Conference on Learning Representations · Jan 2023
Shaohua Zhang, Shuang Liu, Jun Sun, Yuqi Chen, Wenzhi Huang, Jinyi Liu, Jian Liu, Jianye Hao
2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE) · Jan 2021