Title: Reinforcement learning via conservative agent for environments with random delays
Journal: Neural Networks
Author: Jongsoo lee, Jangwon kim, Jiseok Jeong, Soohee Han
Paper: https://www.sciencedirect.com/science/article/abs/pii/S0893608026001073?via%3Dihub