TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 5 days ago • 96
Synthetic Sandbox for Training Machine Learning Engineering Agents Paper • 2604.04872 • Published 5 days ago • 12
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 263