jac

jaczhao

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 1 day ago

R^3: Replay, Reflection, and Ranking Rewards for LLM Reinforcement Learning

Paper • 2601.19620 • Published Jan 27 • 2

liked a model 2 months ago

zai-org/GLM-4.7

Text Generation • Updated 30 days ago • 121k • 1.93k

updated a model 2 months ago

jaczhao/ICC-1.5B-Preview

2B • Updated Dec 18, 2025

published a model 2 months ago

jaczhao/ICC-1.5B-Preview

2B • Updated Dec 18, 2025

upvoted a paper 3 months ago

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Paper • 2511.13026 • Published Nov 17, 2025 • 26

liked 2 datasets 6 months ago

MaziyarPanahi/Llama-Nemotron-Post-Training-Dataset-v1-ShareGPT

Viewer • Updated Jun 2, 2025 • 30.2M • 162 • 41

HuggingFaceFW/finepdfs

Viewer • Updated Jan 9 • 476M • 31k • 817

liked 2 datasets 11 months ago

a-m-team/AM-DeepSeek-R1-Distilled-1.4M

Preview • Updated Mar 30, 2025 • 2.89k • 176

glaiveai/reasoning-v1-20m

Viewer • Updated Mar 19, 2025 • 22.2M • 2.66k • 232

published a model 11 months ago

jaczhao/DeepSeek-R1-Distill-Qwen-7B-GRPO

Updated Mar 24, 2025

liked a dataset 12 months ago

facebook/natural_reasoning

Viewer • Updated Feb 21, 2025 • 1.15M • 1.46k • 551

published a model about 1 year ago

jaczhao/Qwen2.5-1.5B-Base-Open-R1-GRPO

Updated Feb 17, 2025

updated a model about 1 year ago

jaczhao/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • 2B • Updated Feb 12, 2025

published a model about 1 year ago

jaczhao/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • 2B • Updated Feb 12, 2025

upvoted an article about 1 year ago

Open R1: Update #2

Article • Feb 10, 2025 • 218

liked a dataset about 1 year ago

open-thoughts/OpenThoughts-114k

Viewer • Updated Aug 31, 2025 • 228k • 83.2k • 809

liked a Space about 1 year ago

📈 Scaling test-time compute

593 • Boost LLM answers with search‑guided test‑time compute

liked a dataset about 1 year ago

BAAI/Infinity-MM

Updated Dec 13, 2024 • 1.82k • 116

liked a dataset over 1 year ago

allenai/dolmino-mix-1124

Viewer • Updated Oct 29, 2025 • 170M • 236k • 90

liked a model over 1 year ago

allenai/OLMo-2-1124-7B

7B • Updated Jan 6, 2025 • 45k • 64
