Publications

(2024). OVD-Explorer: Optimism should not be the sole pursuit of exploration in noisy environments. Proceedings of the AAAI Conference on Artificial Intelligence.
(2024). A trajectory perspective on the role of data sampling techniques in offline reinforcement learning. Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems.
(2024). Enhancing robotic manipulation with AI feedback from multimodal large language models. arXiv preprint arXiv:2402.14245.
(2024). vMFER: Von Mises-Fisher experience resampling based on uncertainty of gradient directions for policy improvement. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence.
(2024). Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback. The Twelfth International Conference on Learning Representations.
(2024). PERIA: Perceive, reason, imagine, act via holistic language and vision planning for manipulation. Advances in Neural Information Processing Systems.
(2024). KISA: A unified keyframe identifier and skill annotator for long-horizon robotics demonstrations. Forty-first International Conference on Machine Learning.
(2024). ENOTO: Improving offline-to-online reinforcement learning with Q-ensembles. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence.
(2023). OSCAR: OOD State-Conservative Offline Reinforcement Learning for Sequential Decision Making. CAAI Artificial Intelligence Research.
(2023). Exploration in deep reinforcement learning: From single-agent to multiagent domain. IEEE Transactions on Neural Networks and Learning Systems.