The Curious Case of Analogies: Investigating Analogical Reasoning in Large Language Models Paper • 2511.20344 • Published Nov 25, 2025 • 14
Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training Paper • 2509.25758 • Published Sep 30, 2025 • 24
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published 5 days ago • 68
InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation? Paper • 2604.27419 • Published 14 days ago • 13
ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack Paper • 2509.25843 • Published 30 days ago • 19
System Message Generation for User Preferences using Open-Source Models Paper • 2502.11330 • Published Feb 17, 2025 • 15
Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information Paper • 2502.14258 • Published Feb 20, 2025 • 26