Wei Cheng

wchengad

https://wchengad.github.io/

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 1 day ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 2 days ago • 65

upvoted a paper 2 days ago

VINO: A Unified Visual Generator with Interleaved OmniModal Context

Paper • 2601.02358 • Published 3 days ago • 26

liked a Space 28 days ago

ImageCritic

🖼

Official Demo of ImageCritic

upvoted 2 papers about 1 month ago

Relational Visual Similarity

Paper • 2512.07833 • Published Dec 8, 2025 • 24

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published Dec 5, 2025 • 38

liked a dataset about 1 month ago

OmniSVG/MMSVGBench

Viewer • Updated Dec 3, 2025 • 600 • 296 • 6

authored a paper about 1 month ago

REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Paper • 2511.22625 • Published Nov 27, 2025 • 46

liked a model about 1 month ago

stepfun-ai/Step1X-Edit-v1p2

Image-to-Image • Updated 11 days ago • 510 • 54

upvoted 3 papers about 1 month ago

Captain Safari: A World Engine

Paper • 2511.22815 • Published Nov 28, 2025 • 10

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 225

REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Paper • 2511.22625 • Published Nov 27, 2025 • 46

authored a paper about 1 month ago

iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation

Paper • 2511.20635 • Published Nov 25, 2025 • 32

upvoted a paper about 1 month ago

iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation

Paper • 2511.20635 • Published Nov 25, 2025 • 32

upvoted a paper about 2 months ago

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 53

upvoted a paper 2 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 101

authored a paper 2 months ago

RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

Paper • 2510.25590 • Published Oct 29, 2025 • 27

upvoted 2 papers 2 months ago

RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

Paper • 2510.25590 • Published Oct 29, 2025 • 27

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Paper • 2510.22706 • Published Oct 26, 2025 • 40

liked a model 3 months ago

yexiguafu/VFMTok

Updated Oct 13, 2025 • 1

liked a Space 3 months ago

WithAnyone Demo

🏃

WithAnyone is capable of generating high-quality, controllab