Continuous Value Assignment: A Doubly Robust Data Augmentation for Off-Policy Learning
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS(2024)
Key words
Trajectory,Data augmentation,Optimization,Interpolation,Estimation,Training,Robots,Causal data augmentation,continuous control problem,reinforcement learning (RL),sample efficiency
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined