b
Discover
Models
Search
About
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
1 week ago
·
NeurIPS