6 2

wwwchang

chang04

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

upvoted a paper 2 days ago

Rubric-based On-policy Distillation

upvoted a paper 6 months ago

Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark

View all activity

Organizations

None yet

upvoted 2 papers 2 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 7 days ago • 95

Rubric-based On-policy Distillation

Paper • 2605.07396 • Published 6 days ago • 37

upvoted a paper 6 months ago

Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark

Paper • 2510.26802 • Published Oct 30, 2025 • 34

upvoted a paper 8 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 119

liked a model 11 months ago

xy06/MINT-CoT-7B

8B • Updated Jun 4, 2025 • 199 • 7

liked a dataset 11 months ago

xy06/MINT-CoT-Dataset

Viewer • Updated Jun 10, 2025 • 100 • 36 • 8

upvoted a paper 11 months ago

MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Paper • 2506.05331 • Published Jun 5, 2025 • 13

upvoted an article almost 2 years ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.13k

updated a model almost 2 years ago

chang04/ddi

Updated Jul 13, 2024

wwwchang

AI & ML interests

Recent Activity

Organizations

chang04's activity

Mixture of Experts Explained