Concentration bounds for temporal difference learning with linear function approximation: The case of batch data and uniform sampling