WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation | Read Paper on Bytez