马逸川's picture

马逸川

YichuanMa

·

Entarochuan

AI & ML interests

(M)LLM

Recent Activity

authored a paper 4 days ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

authored a paper 4 days ago

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

authored a paper 4 days ago

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

View all activity

Organizations

None yet

Papers 11

arxiv:2603.25040

arxiv:2601.16486

arxiv:2601.16480

arxiv:2601.16447

models 1

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 7 • 3

datasets 3

YichuanMa/LoGos-Rollout-1K

Viewer • Updated Mar 2 • 1k • 16

YichuanMa/Go-GRPO-1K

Viewer • Updated Mar 2 • 1k • 12

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 44 • 3