A collection of efficient language models for edge deployment. Features MoE architecture with only 25% parameter activation.
Faria Sultana
fariasultana
AI & ML interests
None yet
Recent Activity
liked a dataset 29 days ago
fariasultana/TrickGPT updated a dataset 29 days ago
fariasultana/TrickGPT published a dataset 29 days ago
fariasultana/TrickGPTOrganizations
None yet