Lewis Tunstall PRO
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
liked a model 2 days ago
Hcompany/Holotron-12B liked a dataset 3 days ago
stepfun-ai/Step-3.5-Flash-SFT updated a Space 3 days ago
lewtun/climbing-dashboardOrganizations
Awesome RLHF
A curated collection of datasets, models, Spaces, and papers on Reinforcement Learning from Human Feedback (RLHF).
Hub tools
— Awesome RL datasets 📈 —
— Long-context post-training 🧶 —
Resources for post-training LLMs with long-context samples
Awesome RLHF
A curated collection of datasets, models, Spaces, and papers on Reinforcement Learning from Human Feedback (RLHF).
Mistral 7B + UltraChat + Arithmo checkpoints
A collection of Mistral 7B fine-tunes on UltraChat and Arithmo to boost the math capabilities of chat models. See https://x.com/_lewtun/status/1715652
Hub tools
Gemma RLAIF