-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 627 -
Hierarchical Reasoning Model
Paper • 2506.21734 • Published • 47 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 506 -
Training language models to follow instructions with human feedback
Paper • 2203.02155 • Published • 24
Gabriel
stoksweet
·
AI & ML interests
None yet
Recent Activity
liked
a model
16 days ago
HuggingFaceTB/SmolVLM2-500M-Video-Instruct
updated
a collection
about 1 month ago
Papers
updated
a collection
about 2 months ago
Papers
Organizations
None yet