hud

Team
company
https://www.hud.so
hud_evals
hud-evals

AI & ML interests

AI, Evaluations, RL

Team members: Parth Patel, Jaideep Chawla, Lorenss Martinsons, Jay Ram, shin

hud-evals's datasets (17)

hud-evals/SheetBench-50

Viewer • Updated Dec 3, 2025 • 50 • 52

hud-evals/SpreadSheetBench-200

Viewer • Updated Nov 23, 2025 • 200 • 18

hud-evals/SpreadSheetBench

Viewer • Updated Nov 23, 2025 • 912 • 39

hud-evals/OSWorld-Gold-Mini

Viewer • Updated Nov 18, 2025 • 20 • 9

hud-evals/2048-basic

Viewer • Updated Nov 18, 2025 • 1 • 8

hud-evals/OSWorld-Verified

Viewer • Updated Nov 18, 2025 • 369 • 44

hud-evals/Online-Mind2Web-Tiny

Viewer • Updated Nov 18, 2025 • 10 • 6

hud-evals/Online-Mind2Web

Viewer • Updated Nov 18, 2025 • 300 • 151

hud-evals/OSWorld-Gold

Viewer • Updated Nov 18, 2025 • 294 • 8

hud-evals/SheetBench-50_db_test

Viewer • Updated Oct 1, 2025 • 50 • 8

hud-evals/OSWorld-Gold_db_test

Viewer • Updated Sep 30, 2025 • 294 • 7

hud-evals/OSWorld-Verified-XLang_db_test

Viewer • Updated Sep 30, 2025 • 369 • 7

hud-evals/test-diverse

Viewer • Updated Sep 21, 2025 • 3 • 10

hud-evals/tasks-json-test-3

Viewer • Updated Aug 30, 2025 • 4 • 8

hud-evals/tasks-json-test-2

Viewer • Updated Aug 30, 2025 • 4 • 8

hud-evals/test-json-tasks

Viewer • Updated Aug 29, 2025 • 1 • 7

hud-evals/2048-taskset

Viewer • Updated Aug 27, 2025 • 6 • 7