6 1

Tanjf

Sober-Clever

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

upvoted a paper 2 days ago

Rubric-based On-policy Distillation

liked a model about 1 month ago

heyingzhi/onerec-ra_games_ep1

View all activity

Organizations

None yet

upvoted 2 papers 2 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 7 days ago • 95

Rubric-based On-policy Distillation

Paper • 2605.07396 • Published 6 days ago • 37

liked a model about 1 month ago

heyingzhi/onerec-ra_games_ep1

2B • Updated Apr 7 • 17 • 2

updated a model 3 months ago

Sober-Clever/SIDReasoner-Models

Updated Feb 27 • 1

published a model 3 months ago

Sober-Clever/SIDReasoner-Models

Updated Feb 27 • 1

updated a model 3 months ago

Sober-Clever/sft_reasoning-activation_7Task-End2End-GPTGen_Qwen3-1.7B-ckpt741-Industrial_EP1

Updated Feb 6

published a model 3 months ago

Sober-Clever/sft_reasoning-activation_7Task-End2End-GPTGen_Qwen3-1.7B-ckpt741-Industrial_EP1

Updated Feb 6

updated a model 3 months ago

Sober-Clever/Qwen3-1.7B_base_e2e-AmazonMix3-EP2_General_reasoning-activate-ep1_RLonOffice_ckpt1000

Updated Feb 6

published a model 3 months ago

Sober-Clever/Qwen3-1.7B_base_e2e-AmazonMix3-EP2_General_reasoning-activate-ep1_RLonOffice_ckpt1000

Updated Feb 6

updated a model 3 months ago

Sober-Clever/Qwen3-1.7B_base_e2e-AmazonMix3-EP2_General_reasoning-activate-ep1_RLonGames_ckpt900

Updated Feb 5

published a model 3 months ago

Sober-Clever/Qwen3-1.7B_base_e2e-AmazonMix3-EP2_General_reasoning-activate-ep1_RLonGames_ckpt900

Updated Feb 5

updated a dataset 3 months ago

Sober-Clever/Sampled_OOR_150k

Updated Jan 30 • 6

published a dataset 3 months ago

Sober-Clever/Sampled_OOR_150k

Updated Jan 30 • 6

upvoted a paper 7 months ago

Distilled Decoding 2: One-step Sampling of Image Auto-regressive Models with Conditional Score Distillation

Paper • 2510.21003 • Published Oct 23, 2025 • 8

upvoted a paper 8 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 119

upvoted an article about 1 year ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.13k

upvoted a paper over 1 year ago

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Paper • 2412.17153 • Published Dec 22, 2024 • 39

updated 2 models over 2 years ago

Sober-Clever/distilbert-base-uncased-finetuned-squad-d5716d28

Question Answering • Updated Nov 20, 2023 • 5

Sober-Clever/codeparrot-ds

Text Generation • 0.1B • Updated Nov 5, 2023 • 4

updated a dataset over 2 years ago

Sober-Clever/github-issues

Viewer • Updated Oct 20, 2023 • 100 • 109

Tanjf

AI & ML interests

Recent Activity

Organizations

Sober-Clever's activity

Mixture of Experts Explained