Real-Time Reasoning Agents in Evolving Environments Paper • 2511.04898 • Published Nov 7, 2025 • 12 • 2
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions Paper • 2506.23046 • Published Jun 29, 2025 • 1
AutoLibra: Agent Metric Induction from Open-Ended Feedback Paper • 2505.02820 • Published May 5, 2025 • 3
AutoLibra: Agent Metric Induction from Open-Ended Feedback Paper • 2505.02820 • Published May 5, 2025 • 3 • 2
AutoLibra: Agent Metric Induction from Open-Ended Feedback Paper • 2505.02820 • Published May 5, 2025 • 3
AutoLibra: Agent Metric Induction from Open-Ended Feedback Paper • 2505.02820 • Published May 5, 2025 • 3 • 2
Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents Paper • 2505.02156 • Published May 4, 2025 • 18
Learning from Failures in Multi-Attempt Reinforcement Learning Paper • 2503.04808 • Published Mar 4, 2025 • 18
Mind the Gap! Static and Interactive Evaluations of Large Audio Models Paper • 2502.15919 • Published Feb 21, 2025 • 4
EgoNormia: Benchmarking Physical Social Norm Understanding Paper • 2502.20490 • Published Feb 27, 2025 • 6