Jinyi Liu 💻
Jinyi Liu

Ph.D. Candidate

About Me

Jinyi Liu (刘金毅) is currently pursuing a Ph.D. at Tianjin University under the supervision of Professor Jianye Hao, as a member of the DRL Lab. His research interests primarily focus on DRL, LLMs, and LLM Agents. His work aims to explore synergies between decision-making frameworks and language-based AI systems, advancing applications in autonomous reasoning and human-AI collaboration.

🔬 TJU DRL-LAB (by Jianye Hao, Yan Zheng and Hongyao Tang) is seeking collaborators (interns, MS/PhD)! 👋 DM me if you're interested!

我正关注26届校招,LLM后训练、RL、Agentic Intelligence等方向算法研究岗。如有合适职位,欢迎联系(jyliu_tju.edu.cn)!

Interests
  • Deep Reinforcement Learning
  • LLM Post-training (Scaling and RFT)
  • LLM and LLM Agents
  • AI for Science
Education
  • PhD (and MSc)

    Tianjin University

  • BSc

    NorthEastern University

News
  • 📑 2025-09 1 paper accepted by NeurIPS 2025
  • 📑 2025-07 1 paper accepted by SCALR@COLM (Atomic Reasoner)
  • 📑 2025-07 1 paper accepted by ICCV 2025 (RoboAnnotatorX)!
  • 📑 2025-06 1 paper accepted by ICML 2025 Workshop MAS (MADC)!
  • 📑 2025-05 3 papers accepted by ACL 2025 (long paper, Atomic Reasoner, DualRAG, WoT)!
  • 🛫 2025-04 The 114th RLCHINA Paper Seminar hosted!
  • 📢 2025-02 Nominated as Distinguished PC Member of AAMAS 2025.
  • 📑 2025-01 1 paper accepted by WWW 2025, oral presentation (SheetAgent)!
Featured Publications
Recent Publications
(2025). Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning. NeurIPS 2025.
(2025). From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models. SCALR@COLM 2025.
(2025). Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model. arXiv preprint arXiv:2507.06892.
(2025). Unlocking Multi-Agent Debate Potential: Enhancing Effective Scaling through Role Allocation Strategies. ICML 2025 Workshop on Multi-Agent Systems in the Era of Foundation Models: Opportunities, Challenges and Futures.
(2025). DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering. Proceedings of the Association for Computational Linguistics: ACL 2025.

Experience

  1. Algorithm Research Intern (Project Collaboration)

    Kuaishou (advised by Hangyu Mao)
  2. Algorithm Research Intern

    NetEase (advised by Yujing Hu)

Education

  1. PhD (and MSc)

    Tianjin University
  2. BSc

    NorthEastern University

Awards

January 2025

🏆

Distinguished PC Members in AAMAS 2025

by AAMAS 2025

September 2024

🎓

Academic First-class Scholarship (Top 10%)

by Tianjin University

December 2022

🥇

Academic Second-class Scholarship

by Tianjin University

June 2023

🏅

iFlyTek Spark “Prompt Engineer” Certification

by iFlyTek Spark

December 2022

🎖️

Silver Award, 8th China International “Internet+” Innovation and Entrepreneurship Competition (Tianjin Regional Division)

December 2021

Tianjin University Academic Second-class Scholarship ×2 (Master’s Student)

by Tianjin University

June 2019

🌟

Outstanding Bachelor’s Thesis Award (Top 1%)

by Tianjin University

Academic Service

📝 Reviewer (Journal)
IEEE TNNLS, Machine Learning, Quantum Machine Intelligence, SIVP
📝 Reviewer / PC member (Conference)
NeurIPS (2024-), IJCAI (2024-), AAAI (2025-), ICCV (2025-), ICML (2025-), AAMAS (2025-), CIKM (2022)
🙋‍♂️ Conference Committee Volunteer
DAI 2022
👥 Student Liaison
RL China (Academic Community)
Selected Projects