Auditing Sabotage Bench: A Benchmark for Detecting and Fixing Research Sabotage in ML Codebases | Read Paper on Bytez