Andreas Stöffelbauer

andreasskyscanner

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass

upvoted a paper 1 day ago

Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation

upvoted a paper 1 day ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Organizations

None yet

andreasskyscanner's datasets

None public yet