bytez
Search
Feed
Models
Agent
Devs
Plan
docs
NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security | Read Paper on Bytez