Ali NT
AliNT99
AI & ML interests
None yet
Recent Activity
commented on
a paper
2 days ago
Progressive Residual Warmup for Language Model Pretraining published
a model 26 days ago
AliNT99/Flash_attn2_2.8.3_cu128_sm120_cp312_cu128_torch210_wheel upvoted an article 4 months ago
ZeRO Optimization Strategies for Large-Scale Model Training - A brief Performance Analysis Organizations
None yet