ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published 9 days ago • 35
era-temporary/openvla-7b-era_dataset-b16-lr-0.0005-lora-r32-dropout-0.0 8B • Updated Nov 21, 2025 • 1
EmbodiedReasoningAgent/EB-ALFRED_trajectory_augmented_prior_dataset Preview • Updated Oct 18, 2025 • 15
EmbodiedReasoningAgent/EB-ALFRED_environment_anchored_prior_dataset Viewer • Updated Oct 18, 2025 • 48.6k • 12
EmbodiedReasoningAgent/EB-ALFRED_external_knowledge_prior_dataset Viewer • Updated Oct 18, 2025 • 10k • 14
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning Paper • 2510.12693 • Published Oct 14, 2025 • 28