Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters • Paper • 2408.03314 • Published Aug 6, 2024
Embarrassingly Simple Self-Distillation Improves Code Generation • Paper • 2604.01193 • Published 24 days ago
How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs • Article • 18 days ago
How I contributed a new model to the Transformers library using Codex • Article • 26 days ago
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • Article • Mar 10
Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning • Paper • 2602.11149 • Published Feb 11
GGML and llama.cpp join HF to ensure the long-term progress of Local AI • Article • Feb 20
I Let a Lobster Run My Jetson: What OpenClaw Taught Me About the Future of Computing • Article • Feb 19
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL • Paper • 2602.03773 • Published Feb 3