Chrome Extension
WeChat Mini Program
Use on ChatGLM

Leveraging Transition Exploratory Bonus for Efficient Exploration in Hard-Transiting Reinforcement Learning Problems

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE(2023)

Cited 1|Views10
Key words
Decision spatiotemporal data,Reinforcement learning,Sparse reward,Hard-Transiting
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined