open-sci/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-longsft_16k Feature Extraction • 2B • Updated 6 days ago • 29 • 1
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-4096-longsft_16k Text Generation • 2B • Updated 2 days ago • 390 • 1
geodesic-research/nemotron_nano_sft_warm_start_100k Text Generation • 32B • Updated 6 days ago • 272 • 1
geodesic-research/nemotron_nano_sft_warm_start_1150 Text Generation • 32B • Updated 6 days ago • 377 • 1
geodesic-research/nemotron_nano_sft_warm_start_16k Text Generation • 32B • Updated 6 days ago • 302 • 1
Lanni-ni/dynamic_forgetting_4_6_384_babylm_10m_epoch10 Text Generation • 45.7M • Updated 3 days ago • 52 • 1
ali-elganzory/1.7b-MixtureVitae-100BT-longsft_16k Feature Extraction • 2B • Updated 2 days ago • 29 • 1