A Hoeffding Inequality for Finite State Markov Chains and its Applications to Markovian Bandits
2020·Arxiv