Mashiro

AlexMashiro

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
upvoted a paper 8 days ago
RM-R1: Reward Modeling as Reasoning
upvoted a paper 13 days ago
Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling

Organizations

None yet

AlexMashiro's models

None public yet