SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting Paper • 2605.07243 • Published 6 days ago • 3
TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents Paper • 2604.24005 • Published 17 days ago • 8
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models Paper • 2505.17826 • Published May 23, 2025 • 10
DIDS: Domain Impact-aware Data Sampling for Large Language Model Training Paper • 2504.13227 • Published Apr 17, 2025
Measuring Hong Kong Massive Multi-Task Language Understanding Paper • 2505.02177 • Published May 4, 2025 • 1
LegalReasoner: Step-wised Verification-Correction for Legal Judgment Reasoning Paper • 2506.07443 • Published Jun 9, 2025
Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving Paper • 2506.17104 • Published Jun 20, 2025 • 2