Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning
1 week ago · NeurIPS