3 8 3

Yi Ding

Tuwhy

https://dripnowhy.github.io/

DripNowhy

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Learning Self-Correction in Vision-Language Models via Rollout Augmentation

submitted a paper 1 day ago

Learning Self-Correction in Vision-Language Models via Rollout Augmentation

updated a model 1 day ago

Tuwhy/Octopus-8B

View all activity

Organizations

upvoted a paper 1 day ago

Learning Self-Correction in Vision-Language Models via Rollout Augmentation

Paper • 2602.08503 • Published 3 days ago • 2

submitted a paper to Daily Papers 1 day ago

Learning Self-Correction in Vision-Language Models via Rollout Augmentation

Paper • 2602.08503 • Published 3 days ago • 2

updated a model 1 day ago

Tuwhy/Octopus-8B

Image-Text-to-Text • 9B • Updated 1 day ago • 42

updated a collection 3 days ago

Octopus

Collection

RL checkpoints of Octopus-8B and baselines of paper: Learning Self-Correction in Vision–Language Models via Rollout Augmentation • 6 items • Updated 3 days ago

updated a model 15 days ago

Tuwhy/Qwen3-VL-8B-GRPO-n16

9B • Updated 15 days ago • 7

published a model 15 days ago

Tuwhy/Qwen3-VL-8B-GRPO-n16

9B • Updated 15 days ago • 7

updated a model 17 days ago

Tuwhy/Qwen3-VL-8B-DAPO-n16

9B • Updated 17 days ago • 11

published a model 17 days ago

Tuwhy/Qwen3-VL-8B-DAPO-n16

9B • Updated 17 days ago • 11

updated a model 20 days ago

Tuwhy/Qwen3-VL-8B-SRPO-n16

9B • Updated 20 days ago • 11

published a model 20 days ago

Tuwhy/Qwen3-VL-8B-SRPO-n16

9B • Updated 20 days ago • 11

updated a model 20 days ago

Tuwhy/Qwen3-VL-8B-SCPO-random

9B • Updated 20 days ago • 16

published a model 20 days ago

Tuwhy/Qwen3-VL-8B-SCPO-random

9B • Updated 20 days ago • 16

updated a model 20 days ago

Tuwhy/Qwen3-VL-8B-SRPO-n8

9B • Updated 20 days ago • 10

Yi Ding

AI & ML interests

Recent Activity

Organizations

Tuwhy's activity