Finite-Time Bounds for Average-Reward Fitted Q-Iteration | Read Paper on Bytez