Title: Belief Projection-Based Reinforcement Learning for Environments with Delayed Feedback
Conference: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS)
Author: Jangwon Kim, Hangyeol Kim, Jiwook Kang, Jongchan Baek and Soohee Han
Link: https://openreview.net/forum?id=sq0m11cUMV