Hongyu Li's picture

1 6

Hongyu Li

appletea2333

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago

OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation

authored a paper 27 days ago

OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation

authored a paper 27 days ago

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

View all activity

Organizations

None yet

authored 4 papers 27 days ago

OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation

Paper • 2511.20211 • Published Nov 25, 2025 • 12

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published Nov 27, 2025 • 29

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published Dec 2, 2025 • 32

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published 29 days ago • 38

authored 3 papers about 1 month ago

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

Paper • 2501.08282 • Published Jan 14, 2025

Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency

Paper • 2506.01908 • Published Jun 2, 2025

Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation

Paper • 2511.16671 • Published Nov 20, 2025 • 15