An Axiomatic Benchmark for Evaluation of Scientific Novelty Metrics • Paper • 2604.15145 • Published 7 days ago
LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling • Paper • 2604.11748 • Published 8 days ago • 14
Narrative-Driven Paper-to-Slide Generation via ArcDeck • Paper • 2604.11969 • Published 10 days ago • 7
HandX: Scaling Bimanual Motion and Interaction Generation • Paper • 2603.28766 • Published 24 days ago • 12
Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration • Paper • 2603.12226 • Published Mar 12 • 4
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data • Paper • 2602.21320 • Published Feb 24 • 12
Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs • Paper • 2602.07276 • Published Feb 7 • 11
CodeCircuit: Toward Inferring LLM-Generated Code Correctness via Attribution Graphs • Paper • 2602.07080 • Published Feb 6 • 6
HandsOff: Labeled Dataset Generation With No Additional Human Annotations • Paper • 2212.12645 • Published Dec 24, 2022 • 1
Why do These Match? Explaining the Behavior of Image Similarity Models • Paper • 1905.10797 • Published May 26, 2019 • 1
OutfitTransformer: Learning Outfit Representations for Fashion Recommendation • Paper • 2204.04812 • Published Apr 11, 2022 • 1
Learning Type-Aware Embeddings for Fashion Compatibility • Paper • 1803.09196 • Published Mar 25, 2018 • 2
Learning Similarity Conditions Without Explicit Supervision • Paper • 1908.08589 • Published Aug 22, 2019 • 1
Drift No More? Context Equilibria in Multi-Turn LLM Interactions • Paper • 2510.07777 • Published Oct 9, 2025
MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations • Paper • 2506.20100 • Published Jun 25, 2025 • 1
Plan Verification for LLM-Based Embodied Task Completion Agents • Paper • 2509.02761 • Published Sep 2, 2025
GenoMAS: A Multi-Agent Framework for Scientific Discovery via Code-Driven Gene Expression Analysis • Paper • 2507.21035 • Published Jul 28, 2025 • 3
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents • Paper • 2505.01592 • Published May 2, 2025