Automated Benchmark Generation for Repository-Level Coding Tasks