Jinyi Liu 💻

Ph.D. Candidate

About Me

Jinyi Liu (刘金毅) is pursuing a Ph.D. at Tianjin University under the supervision of Professor Jianye Hao, as a member of the DRL Lab. His research focuses on deep reinforcement learning (DRL), large language models (LLMs), and LLM agents. His work explores synergies between decision-making frameworks and language-based AI systems, with applications in autonomous reasoning and human-AI collaboration.

🔬 TJU DRL-LAB (by Jianye Hao, Yan Zheng and Hongyao Tang) is seeking collaborators (interns, MS/PhD)! 👋 DM me if you're interested!


Interests
  • Deep Reinforcement Learning
  • LLM Post-training (Scaling and RL)
  • LLMs and LLM Agents
  • AI for Science
Education
  • PhD (and MSc)

    Tianjin University

  • BSc

    Northeastern University

News

  • 📝 2026-01 Paper accepted by ACM TheWebConf 2026 Industry: AFE-Master.
  • 🔥 2025-12 Released our beginner-friendly LLM Agent tutorial (website & PDF).
  • 🎤 2025-12 Hosted the 137th RLCHINA Paper Seminar!
  • 📝 2025-11 1 paper accepted by AAAI 2026 (MADC)!
  • 📝 2025-09 1 paper accepted by NeurIPS 2025.
  • 📝 2025-07 1 paper accepted by SCALR@COLM (Atomic Reasoner).
  • 📝 2025-07 1 paper accepted by ICCV 2025 (RoboAnnotatorX)!
  • 📝 2025-06 1 paper accepted by ICML 2025 Workshop MAS (MADC)!
  • 📝 2025-05 3 papers accepted by ACL 2025 (long paper, Atomic Reasoner, DualRAG, WoT)!
  • 🏆 2025-02 Nominated as Distinguished PC Member of AAMAS 2025.
Recent Publications
(2026). AFE-Master: Enhancing LLM-Driven Autonomous Feature Engineering with Domain-Specific Language Parsing and Guided Local Search. ACM TheWebConf 2026 Industry.
(2025). Hands-on LLM-based Agents: A Tutorial for General Audiences. TechRxiv.
(2025). Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning. NeurIPS 2025.
(2025). From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models. SCALR@COLM 2025.
(2025). RoboAnnotatorX: A Comprehensive and Universal Annotation Framework for Accurate Understanding of Long-horizon Robot Demonstration. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025.

Experience

Algorithm Research Intern
Shanghai AI Lab (advised by Shuyue Hu)
August 2025 – Present

Algorithm Research Intern (Project Collaboration)
Kuaishou (advised by Hangyu Mao)
October 2024 – August 2025

Algorithm Research Intern
NetEase (advised by Yujing Hu)
June 2022 – March 2024

Education

PhD (and MSc)
Tianjin University
September 2019 – Present

BSc
Northeastern University
September 2015 – June 2019

Awards

January 2025
🏆

2025 CSIG Science and Technology Progress Award, First Prize

CSIG

January 2025
🎓

Distinguished PC Member, AAMAS 2025

AAMAS 2025

September 2024
🥇

Academic First-class Scholarship (Top 10%)

Tianjin University

June 2023
🏅

iFlyTek Spark “Prompt Engineer” Certification

iFlyTek Spark

December 2022
🎖️

Academic Second-class Scholarship

Tianjin University

December 2022

Silver Award, 8th China International “Internet+” Innovation and Entrepreneurship Competition (Tianjin Regional Division)

December 2021
🌟

Tianjin University Academic Second-class Scholarship ×2 (Master’s Student)

Tianjin University

June 2019
🎉

Outstanding Bachelor’s Thesis Award (Top 1%)

Tianjin University

Academic Service

📝 Reviewer (Journal)
IEEE TNNLS, Machine Learning, Quantum Machine Intelligence, SIVP
📝 Reviewer / PC member (Conference)
NeurIPS (2024-), IJCAI (2024-), AAAI (2025-), ICCV (2025-), ICML (2025-), ICLR (2025-), AAMAS (2025-), CIKM (2022)
🙋‍♂️ Conference Committee Volunteer
DAI 2022
👥 Student Liaison
RL China (Academic Community)
Selected Projects