FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale Paper • 2601.22146 • Published 4 days ago • 6
Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models Paper • 2601.18734 • Published 7 days ago • 2
Self-Improving Pretraining: using post-trained models to pretrain better models Paper • 2601.21343 • Published 5 days ago • 13
Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs Paper • 2512.03324 • Published Dec 3, 2025 • 1 • 1
Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs Paper • 2512.03324 • Published Dec 3, 2025 • 1
Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives Paper • 2601.20833 • Published 5 days ago • 163
Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling Paper • 2601.22636 • Published 4 days ago • 7
[papers] Gameplay Optimization Collection Research papers that may contribute to a broader approach to teaching machines how to play complex strategy games beyond just Chess. • 18 items • Updated about 7 hours ago • 1
Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification Paper • 2601.22642 • Published 4 days ago • 7
DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning Paper • 2601.21716 • Published 4 days ago • 10
[mixed] Image Generation Stack Collection The stuff we actually use, pruned on an ongoing basis. • 10 items • Updated about 7 hours ago • 1
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published 3 days ago • 16 • 2