Papers that made me appreciate my major and my life a little more. Tags: obs = Observation, innov = Innovation. Most papers are on tiny models, both autoregressive (AR) and diffusion LLMs (dLLMs).
- Diffusion Language Models Know the Answer Before Decoding
  Paper • 2508.19982
- ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
  Paper • 2512.13586
- LSRIF: Logic-Structured Reinforcement Learning for Instruction Following
  Paper • 2601.06431
- Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning
  Paper • 2601.09088