Ariel Kwiatkowski's picture

1 4 18

Ariel Kwiatkowski

RedTachyon

·

https://redtachyon.me

RedTachyon

AI & ML interests

RL, MARL, Crowd Simulation

Recent Activity

upvoted a paper 1 day ago

Likelihood-Based Reward Designs for General LLM Reasoning

upvoted a paper 10 days ago

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

upvoted a paper 4 months ago

Soft Tokens, Hard Truths

View all activity

Organizations

upvoted a paper 1 day ago

Likelihood-Based Reward Designs for General LLM Reasoning

Paper • 2602.03979 • Published 3 days ago • 8

upvoted a paper 10 days ago

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Paper • 2601.18778 • Published 11 days ago • 40

upvoted a paper 4 months ago

Soft Tokens, Hard Truths

Paper • 2509.19170 • Published Sep 23, 2025 • 16

upvoted a paper 12 months ago

PILAF: Optimal Human Preference Sampling for Reward Modeling

Paper • 2502.04270 • Published Feb 6, 2025 • 12