Publications

Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun (2024). OVD-Explorer: Optimism should not be the sole pursuit of exploration in noisy environments. Proceedings of the AAAI Conference on Artificial Intelligence.

Jinyi Liu, Yi Ma, Jianye Hao, Yujing Hu, Yan Zheng, Tangjie Lv, Changjie Fan (2024). A trajectory perspective on the role of data sampling techniques in offline reinforcement learning. Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems.

Jinyi Liu, Yifu Yuan, Jianye Hao, Fei Ni, Lingzhi Fu, Yibin Chen, Yan Zheng (2024). Enhancing robotic manipulation with AI feedback from multimodal large language models. arXiv preprint arXiv:2402.14245.

Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan (2024). vMFER: Von Mises-Fisher experience resampling based on uncertainty of gradient directions for policy improvement. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence.

Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng (2024). Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback. The Twelfth International Conference on Learning Representations.

Fei Ni, Jianye Hao, Shiguang Wu, Longxin Kou, Yifu Yuan, Zibin Dong, Jinyi Liu, Mingzhi Li, Yuzheng Zhuang, Yan Zheng (2024). PERIA: Perceive, reason, imagine, act via holistic language and vision planning for manipulation. Advances in Neural Information Processing Systems.

Longxin Kou, Fei Ni, Yan Zheng, Jinyi Liu, Yifu Yuan, Zibin Dong, Jianye Hao (2024). KISA: A unified keyframe identifier and skill annotator for long-horizon robotics demonstrations. Forty-first International Conference on Machine Learning.

Kai Zhao, Jianye Hao, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng (2024). ENOTO: improving offline-to-online reinforcement learning with Q-ensembles. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence.

Yi Ma, Chao Wang, Chen Chen, Jinyi Liu, Zhaopeng Meng, Yan Zheng, Jianye Hao (2023). OSCAR: OOD State-Conservative Offline Reinforcement Learning for Sequential Decision Making. CAAI Artificial Intelligence Research.

Jianye Hao, Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Zhaopeng Meng, Peng Liu, Zhen Wang (2023). Exploration in deep reinforcement learning: From single-agent to multiagent domain. IEEE Transactions on Neural Networks and Learning Systems.