Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context Paper • 2605.13831 • Published 3 days ago • 81
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis Paper • 2602.03139 • Published Feb 3 • 44
Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars Paper • 2602.01538 • Published Feb 2 • 15
Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models Paper • 2601.19834 • Published Jan 27 • 25
X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests Paper • 2601.06953 • Published Jan 11 • 46
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning Paper • 2512.22120 • Published Dec 26, 2025 • 15
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published Dec 9, 2025 • 134
From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model Paper • 2510.19871 • Published Oct 22, 2025 • 30
Generative Universal Verifier as Multimodal Meta-Reasoner Paper • 2510.13804 • Published Oct 15, 2025 • 28