Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated a model 1 day ago: mehuldamani/bug_fixing_new-arl-add_multiply
published a model 1 day ago: mehuldamani/bug_fixing_new-arl-add_multiply
updated a model 1 day ago: mehuldamani/bug_fixing_rlvr-7b-nokl-v2
Organizations
None yet