OSCAR: OOD State-Conservative Offline Reinforcement Learning for Sequential Decision Making

Jan 1, 2023·
Yi Ma
,
Chao Wang
,
Chen Chen
,
Jinyi Liu
Jinyi Liu
,
Zhaopeng Meng
,
Yan Zheng
,
Jianye Hao
· 0 min read
Type
Publication
CAAI Artificial Intelligence Research

Overview

An offline reinforcement learning method that stays conservative on out-of-distribution states for sequential decision-making.

Venue. CAAI Artificial Intelligence Research