ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation Paper • 2506.18095 • Published Jun 22, 2025 • 66
MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos Paper • 2507.05675 • Published Jul 8, 2025 • 26
ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine Paper • 2508.14706 • Published Aug 20, 2025
DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry Paper • 2512.11558 • Published 28 days ago • 42
WaveMind: Towards a Conversational EEG Foundation Model Aligned to Textual and Visual Modalities Paper • 2510.00032 • Published Sep 26, 2025
Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization Paper • 2509.09307 • Published Sep 11, 2025 • 6
On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper • 2412.20070 • Published Dec 28, 2024 • 42
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published Dec 25, 2024 • 106
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale Paper • 2406.19280 • Published Jun 27, 2024 • 63