bytez
Search
Feed
Models
Agent
Devs
Plan
docs
SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks | Read Paper on Bytez