A Bayesian Model Selection Criterion for Selecting Pretraining Checkpoints | Read Paper on Bytez