Innovator-VL: A Multimodal Large Language Model for Scientific Discovery Paper β’ 2601.19325 β’ Published Jan 27 β’ 81
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents Paper β’ 2601.16973 β’ Published Jan 23 β’ 40
view article Article Weβre open-sourcing our text-to-image model and the process behind it Nov 12, 2025 β’ 97
The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) β’ 6 items β’ Updated Nov 16, 2025 β’ 112
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day β’ 638 items β’ Updated 3 days ago β’ 96
VertexRegen: Mesh Generation with Continuous Level of Detail Paper β’ 2508.09062 β’ Published Aug 12, 2025 β’ 39
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. β’ 37 items β’ Updated 7 days ago β’ 267
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data Paper β’ 2402.08093 β’ Published Feb 12, 2024 β’ 61