SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types
2 weeks ago · NeurIPS