bytez
Search
Feed
Models
Agent
Devs
API Dashboard
docs
Position: Don't Use the CLT in LLM Evals With Fewer Than a Few Hundred Datapoints | Read Paper on Bytez
Position: Don't Use the CLT in LLM Evals With Fewer Than a Few Hundred Datapoints
3 months ago
·
arXiv