AutoEval Done Right: Using Synthetic Data for Model Evaluation | Read Paper on Bytez