Zhenyang Cai

Eric3200 · Eric3200C

AI & ML interests

None yet

Recent Activity

authored 5 papers about 13 hours ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published Jun 22, 2025 • 66

MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos

Paper • 2507.05675 • Published Jul 8, 2025 • 26

ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine

Paper • 2508.14706 • Published Aug 20, 2025

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

Paper • 2512.11558 • Published 28 days ago • 42

WaveMind: Towards a Conversational EEG Foundation Model Aligned to Textual and Visual Modalities

Paper • 2510.00032 • Published Sep 26, 2025

submitted a paper to Daily Papers 26 days ago

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

Paper • 2512.11558 • Published 28 days ago • 42

authored a paper 4 months ago

Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization

Paper • 2509.09307 • Published Sep 11, 2025 • 6

authored 2 papers about 1 year ago

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published Dec 28, 2024 • 42

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 106

authored a paper over 1 year ago

HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale

Paper • 2406.19280 • Published Jun 27, 2024 • 63