view changelog Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 10 days ago โข 177
view post Post 2721 Surya-1.1T: Scaling Beyond Human-Level Reasoning via 146 Trillion Token Pre-trainingAuthor: Shrijan Kumar TiwariAffiliation: SKT AI Labs / Project SuryaModel Architecture: Optimized Dense TransformerParameters: 1.1 TrillionTraining Tokens: 146 TrillionWanna collaborate us Friends let's Start Journey we have Collected 146 trillon tokens and done pre training but we need to made more powerfull See translation 29 replies ยท ๐ฅ 7 7 ๐ 5 5 ๐ 4 4 ๐ค 3 3 ๐ 2 2 โค๏ธ 2 2 ๐ 2 2 โ 2 2 ๐ง 2 2 ๐ค 2 2 ๐คฏ 1 1 + Reply
view reply We know any one who want it downgraded version of 500 gb in 4bit combress they can contact us Url -- https://forms.gle/Wk2XXtzJX1uMQsu58
Shrijanagain/SKT_OMNI_SUPREME Text Generation โข 481B โข Updated about 16 hours ago โข 1.04k โข 6