jkrs
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted a paper 5 months ago
Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers
liked a dataset over 1 year ago
Anthropic/hh-rlhf
Organizations
None yet