APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique ( https://github.com/mudler/apex-quant ) • 22 items • Updated 1 day ago • 36
REAM Collection Compressed MoE models with a reduced number of experts. See additional models at https://huggingface.co/bknyaz. • 9 items • Updated 7 days ago • 5
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 134