Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Paper • 2604.08224 • Published 12 days ago • 50
Do Audio-Visual Large Language Models Really See and Hear? Paper • 2604.02605 • Published 18 days ago • 7
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models Paper • 2603.25750 • Published Mar 20 • 36