Zepeng Zhai
zzp-seeker
AI & ML interests
None yet
Recent Activity
upvoted a paper about 8 hours ago
Rewards as Labels: Revisiting RLVR from a Classification Perspective upvoted a paper 4 months ago
Video-Thinker: Sparking "Thinking with Videos" via Reinforcement
Learning liked
a Space over 1 year ago
lmarena-ai/arena-leaderboard