Wanqiao Xu
wanqiaox
ยท
AI & ML interests
Reinforcement learning
Recent Activity
upvoted a paper about 14 hours ago
Understanding the Challenges in Iterative Generative Optimization with LLMs upvoted a paper 9 months ago
Provably Learning from Language Feedback updated a collection about 2 years ago
Lightweight models