Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning