Establishing Best Practices in Building Rigorous Agentic Benchmarks | Read Paper on Bytez