Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
2016·Arxiv