VideoSSR: Video Self-Supervised Reinforcement Learning Paper • 2511.06281 • Published Nov 9, 2025 • 25
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published Dec 30, 2025 • 52
LatentMem: Customizing Latent Memory for Multi-Agent Systems Paper • 2602.03036 • Published Feb 3 • 14
GEMS: Agent-Native Multimodal Generation with Memory and Skills Paper • 2603.28088 • Published 4 days ago • 78
Spotlight on Token Perception for Multimodal Reinforcement Learning Paper • 2510.09285 • Published Oct 10, 2025 • 37
FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting Paper • 2509.24304 • Published Sep 29, 2025 • 5
Sampling-Efficient Test-Time Scaling: Self-Estimating the Best-of-N Sampling in Early Decoding Paper • 2503.01422 • Published Mar 3, 2025