In a Training Loop 🔄

Tianle Wang

wtl666wtl

https://wtl666wtl.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Large Language Models Explore by Latent Distilling

authored a paper 5 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

commented on a paper 7 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Organizations

None yet

authored a paper 5 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published 8 days ago • 14

submitted a paper to Daily Papers 7 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published 8 days ago • 14

authored a paper 10 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27, 2025 • 15