view post Post 4074 I am very sad to say that the budget in creating of SnowflakeCore-G1 1b and 7b MoE models ran out and I can't pre-train them anymore. See translation
view post Post 520 the training for SnowflakeCore-G1-1B and 7B would be retaken because now I implemented DeepSpeed and management to use two gpus. See translation
CoT datasets TeichAI/claude-4.5-opus-high-reasoning-250x Viewer • Updated Nov 28, 2025 • 250 • 8.22k • 158 TeichAI/gemini-3-pro-preview-high-reasoning-1000x Viewer • Updated about 1 month ago • 1.02k • 1.5k • 56
TeichAI/gemini-3-pro-preview-high-reasoning-1000x Viewer • Updated about 1 month ago • 1.02k • 1.5k • 56
i3-Series Note: The models are listed in the default order set by Hugging Face, so the latest model appears at the botSeries i3-lab/i3-tiny Text Generation • 711k • Updated Oct 17, 2025 • 34 • 1 i3-lab/i3-12m Text Generation • 12.7M • Updated Oct 23, 2025 • 69 • 3 i3-lab/i3-22m Text Generation • 22.6M • Updated Oct 31, 2025 • 21 • 2 i3-lab/i3-80m Text Generation • 82.8M • Updated Nov 30, 2025 • 73 • 7
CoT datasets TeichAI/claude-4.5-opus-high-reasoning-250x Viewer • Updated Nov 28, 2025 • 250 • 8.22k • 158 TeichAI/gemini-3-pro-preview-high-reasoning-1000x Viewer • Updated about 1 month ago • 1.02k • 1.5k • 56
TeichAI/gemini-3-pro-preview-high-reasoning-1000x Viewer • Updated about 1 month ago • 1.02k • 1.5k • 56
i3-Series Note: The models are listed in the default order set by Hugging Face, so the latest model appears at the botSeries i3-lab/i3-tiny Text Generation • 711k • Updated Oct 17, 2025 • 34 • 1 i3-lab/i3-12m Text Generation • 12.7M • Updated Oct 23, 2025 • 69 • 3 i3-lab/i3-22m Text Generation • 22.6M • Updated Oct 31, 2025 • 21 • 2 i3-lab/i3-80m Text Generation • 82.8M • Updated Nov 30, 2025 • 73 • 7