1's picture

1 12

1

Ava154

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

upvoted a paper 4 months ago

Agent Learning via Early Experience

upvoted a paper 4 months ago

ExGRPO: Learning to Reason from Experience

View all activity

Organizations

upvoted a paper 1 day ago

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Paper • 2602.10609 • Published 2 days ago • 15

upvoted 5 papers 4 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 273

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 80

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 118

Rethinking Reward Models for Multi-Domain Test-Time Scaling

Paper • 2510.00492 • Published Oct 1, 2025 • 28

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 146

upvoted 4 papers 5 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 229

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 662

Tree Search for LLM Agent Reinforcement Learning

Paper • 2509.21240 • Published Sep 25, 2025 • 92

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Paper • 2509.19803 • Published Sep 24, 2025 • 120

updated a collection 5 months ago

daily paper

6 items • Updated Sep 23, 2025

upvoted 2 papers 5 months ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22, 2025 • 148

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22, 2025 • 104

New activity in sergiopaniego/AlfredAgent 6 months ago

1111

#98 opened 6 months ago by

1111

#98 opened 6 months ago by