OSCAR: OOD State-Conservative Offline Reinforcement Learning for Sequential Decision Making
Jan 1, 2023·,,,
,,,·
0 min read
Yi Ma
Chao Wang
Chen Chen
Jinyi Liu
Zhaopeng Meng
Yan Zheng
Jianye Hao
Overview
An offline reinforcement learning method that stays conservative on out-of-distribution states for sequential decision-making.
Venue. CAAI Artificial Intelligence Research