Papers
updated
WorldVLA: Towards Autoregressive Action World Model
Paper
•
2506.21539
•
Published
•
40
Fast and Simplex: 2-Simplicial Attention in Triton
Paper
•
2507.02754
•
Published
•
25
IntFold: A Controllable Foundation Model for General and Specialized
Biomolecular Structure Prediction
Paper
•
2507.02025
•
Published
•
35
Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive
Foundations for Artificial General Intelligence and its Societal Impact
Paper
•
2507.00951
•
Published
•
23
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable
Reinforcement Learning
Paper
•
2507.01006
•
Published
•
249
Does Math Reasoning Improve General LLM Capabilities? Understanding
Transferability of LLM Reasoning
Paper
•
2507.00432
•
Published
•
79
CriticLean: Critic-Guided Reinforcement Learning for Mathematical
Formalization
Paper
•
2507.06181
•
Published
•
44
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via
Context-Aware Multi-Stage Policy Optimization
Paper
•
2507.14683
•
Published
•
134
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm
Bridging Foundation Models and Lifelong Agentic Systems
Paper
•
2508.07407
•
Published
•
98
MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs
Paper
•
2508.05257
•
Published
•
13
Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts
Paper
•
2508.07785
•
Published
•
28
rStar2-Agent: Agentic Reasoning Technical Report
Paper
•
2508.20722
•
Published
•
116
Think in Games: Learning to Reason in Games via Reinforcement Learning
with Large Language Models
Paper
•
2508.21365
•
Published
•
29
Less is More: Recursive Reasoning with Tiny Networks
Paper
•
2510.04871
•
Published
•
501
Diffusion Transformers with Representation Autoencoders
Paper
•
2510.11690
•
Published
•
165
Agent Learning via Early Experience
Paper
•
2510.08558
•
Published
•
270
Demystifying Reinforcement Learning in Agentic Reasoning
Paper
•
2510.11701
•
Published
•
31
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning
and Online Reinforcement Learning
Paper
•
2510.12693
•
Published
•
27
Information Gain-based Policy Optimization: A Simple and Effective
Approach for Multi-Turn LLM Agents
Paper
•
2510.14967
•
Published
•
33
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale
Thinking Model
Paper
•
2510.18855
•
Published
•
71
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper
•
2510.19363
•
Published
•
61
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Paper
•
2511.06805
•
Published
•
12