1 9 2

马逸川

YichuanMa

Entarochuan

AI & ML interests

(M)LLM

Recent Activity

authored a paper 4 days ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

authored a paper 4 days ago

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

authored a paper 4 days ago

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

View all activity

Organizations

None yet

authored 4 papers 4 days ago

upvoted a paper 5 days ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 10 days ago • 125

liked a dataset 12 days ago

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 49 • 3

upvoted a paper 24 days ago

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

Paper • 2601.16486 • Published Jan 23 • 1

upvoted a paper about 1 month ago

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

Paper • 2601.16447 • Published Jan 23 • 1

updated 2 datasets about 1 month ago

YichuanMa/LoGos-Rollout-1K

Viewer • Updated Mar 2 • 1k • 18

YichuanMa/Go-GRPO-1K

Viewer • Updated Mar 2 • 1k • 12

updated a model about 1 month ago

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 7 • 3

updated a dataset about 1 month ago

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 49 • 3

New activity in YichuanMa/Expert-Go-SFT-100K about 1 month ago

Clarification on the two distinct data formats

#2 opened 2 months ago by

peiyao-sentient

published 3 datasets 2 months ago

YichuanMa/LoGos-Rollout-1K

Viewer • Updated Mar 2 • 1k • 18

YichuanMa/Go-GRPO-1K

Viewer • Updated Mar 2 • 1k • 12

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 49 • 3

upvoted a paper 2 months ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published Jan 23 • 51

liked a model 2 months ago

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 7 • 3

upvoted an article 5 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28, 2025

•

887

published a model 6 months ago

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 7 • 3

马逸川

AI & ML interests

Recent Activity

Organizations

YichuanMa's activity

Clarification on the two distinct data formats

Open-R1: a fully open reproduction of DeepSeek-R1