Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
pipizhao's picture
4 4 3

pipizhao

pipizhao
SilviuMatei's profile picture jawadmohmmad's profile picture jilangdi's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks
upvoted a paper 6 days ago
ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents
updated a model 7 days ago
pipizhao/SkillRouter-Embedding-0.6B
View all activity

Organizations

ICCV2023's profile picture

pipizhao 's collections 1

paper
  • Iterative Reasoning Preference Optimization

    Paper • 2404.19733 • Published Apr 30, 2024 • 50
  • SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale

    Paper • 2603.22455 • Published 17 days ago • 2
paper
  • Iterative Reasoning Preference Optimization

    Paper • 2404.19733 • Published Apr 30, 2024 • 50
  • SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale

    Paper • 2603.22455 • Published 17 days ago • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs