A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.
Ray Yang
rayruiyang
AI & ML interests
None yet
Recent Activity
upvoted a paper about 10 hours ago
AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward liked a dataset 22 days ago
jdopensource/JoyAI-Image-OpenSpatial upvoted a paper about 1 month ago
TriAttention: Efficient Long Reasoning with Trigonometric KV CompressionOrganizations
None yet