Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning Paper • 2601.21037 • Published 10 days ago • 13
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper • 2512.07802 • Published Dec 8, 2025 • 45
PokemonChat: Auditing ChatGPT for Pokémon Universe Knowledge Paper • 2306.03024 • Published Jun 5, 2023 • 2
Structural Similarities Between Language Models and Neural Response Measurements Paper • 2306.01930 • Published Jun 2, 2023 • 2
RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding Paper • 2505.14462 • Published May 20, 2025 • 4
RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding Paper • 2505.14462 • Published May 20, 2025 • 4
RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding Paper • 2505.14462 • Published May 20, 2025 • 4 • 2