From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 279
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published Nov 27, 2025 • 84
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Paper • 2306.13649 • Published Jun 23, 2023 • 28
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published Nov 17, 2025 • 136
Seer: Online Context Learning for Fast Synchronous LLM Reinforcement Learning Paper • 2511.14617 • Published Nov 18, 2025 • 2
The Path Not Taken: RLVR Provably Learns Off the Principals Paper • 2511.08567 • Published Nov 11, 2025 • 33
Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes Paper • 0812.4360 • Published Dec 23, 2008 • 2
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published Nov 13, 2025 • 95
Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI Paper • 2511.01689 • Published Nov 3, 2025 • 4
LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics Paper • 2511.08544 • Published Nov 11, 2025 • 8
From Memorization to Reasoning in the Spectrum of Loss Curvature Paper • 2510.24256 • Published Oct 28, 2025 • 2