Scaling Law with Learning Rate Annealing | Read Paper on Bytez