b
Discover
Models
Search
About
ClashEval: Quantifying the tug-of-war between an LLM’s internal prior and external evidence
1 week ago
·
NeurIPS