Doug Holton

edtechdev

https://edtechdev.wordpress.com/

AI & ML interests

Educational Technology, Intelligent Tutoring Systems, Learning Sciences

Organizations

None yet

Recent Activity

upvoted a paper 2 days ago

MathTutorBench: A Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors

Paper • 2502.18940 • Published Feb 26, 2025 • 3

upvoted a collection 2 days ago

tutoringLM

7 items • Updated Aug 6, 2025 • 1

upvoted a paper 2 days ago

From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning

Paper • 2505.15607 • Published May 21, 2025 • 4

upvoted 3 collections 2 days ago

PERSONA

Collection of various datasets related to the PERSONA paper. • 5 items • Updated Apr 16, 2025 • 5

Big-Math

This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers • 4 items • Updated Apr 16, 2025 • 7

Pedagogical LLMs

4 items • Updated 26 days ago • 2

upvoted a paper 5 days ago

Hyperagents

Paper • 2603.19461 • Published 8 days ago • 35

upvoted an article 6 days ago

Can Your LLM Think Like a Professional? Introducing ProfBench

Article • Oct 28, 2025 • 21

upvoted a collection 6 days ago

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 3 days ago • 40

upvoted a collection 15 days ago

Qwen3.5

21 items • Updated 19 days ago • 1.33k

upvoted a collection 17 days ago

Distil Efficiency Benchmarks

Collection of models used in the blog post www.distillabs.ai/blog/the-10x-inference-tax-you-dont-have-to-pay • 9 items • Updated 26 days ago • 3

upvoted a collection about 2 months ago

Molmo2

Artifacts for the Molmo2 release • 5 items • Updated 26 days ago • 36

upvoted 2 papers about 2 months ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published Jan 15 • 32

OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

Paper • 2601.21639 • Published Jan 29 • 51

upvoted a paper 2 months ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 202

upvoted 2 collections 2 months ago

LightOnOCR-2 🦉

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 25 days ago • 23

Qwen3Guard

7 items • Updated Dec 31, 2025 • 64

upvoted 2 collections 3 months ago

GLM-4.6V

3 items • Updated Dec 8, 2025 • 48

Parakeet

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated 3 days ago • 58

upvoted a collection 4 months ago

Step-Audio-R1

Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling. • 4 items • Updated Jan 14 • 18