Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
928.0
TFLOPS
20
8
Luke Alonso
PRO
lukealonso
Follow
ktsaou's profile picture
Raizek's profile picture
kil0rk's profile picture
50 followers
Β·
3 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 22 hours ago
lukealonso/GLM-5.1-NVFP4
new
activity
about 22 hours ago
lukealonso/GLM-5.1-NVFP4:
RuntimeError: The size of tensor a (3072) must match the size of tensor b (6144) at non-singleton dimension 1
updated
a model
about 22 hours ago
lukealonso/MiniMax-M2.7-NVFP4
View all activity
Organizations
None yet
lukealonso
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
lukealonso/GLM-5.1-NVFP4
about 22 hours ago
RuntimeError: The size of tensor a (3072) must match the size of tensor b (6144) at non-singleton dimension 1
1
#5 opened about 22 hours ago by
lianyouzao
New activity in
lukealonso/MiniMax-M2.7-NVFP4
1 day ago
w1 not matching w3 weight scales
9
#1 opened 1 day ago by
dareposte
New activity in
lukealonso/GLM-5.1-NVFP4
1 day ago
From "Doesn't Work" to 641 tok/s: GLM-5.1 NVFP4 on 6Γ RTX PRO 6000 Blackwell
π₯
1
#4 opened 2 days ago by
sakamakismile
New activity in
lukealonso/GLM-5.1-NVFP4
3 days ago
Hopper GPU?
1
#2 opened 3 days ago by
AndrewMatienko
New activity in
lukealonso/MiniMax-M2.5-NVFP4
about 2 months ago
Request: NVFP4 version of MiniMax-M2.5-REAP-139B (to fit on a single RTX 6000 Pro)
14
#7 opened about 2 months ago by
mondovero
New activity in
lukealonso/GLM-5-NVFP4
about 2 months ago
Crash on first request on RTX Pro 6000 x8
π
1
6
#3 opened about 2 months ago by
koushd
New activity in
cerebras/MiniMax-M2.5-REAP-139B-A10B
about 2 months ago
nvfp4
β
π
2
1
#1 opened about 2 months ago by
ktsaou
New activity in
lukealonso/MiniMax-M2.5-NVFP4
about 2 months ago
VLLM error for kv weight scaling - workaround
7
#6 opened about 2 months ago by
ShaunEvansMD
fp8 kv cache
15
#4 opened about 2 months ago by
festr2
Thanks for your effort
5
#5 opened about 2 months ago by
darkstar3537
KeyError: '110.w1.input_scale' with TRT
2
#3 opened about 2 months ago by
guanwenyu1995
"w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected."
π
1
21
#2 opened about 2 months ago by
zenmagnets
Here's the vLLM recipe I'm using with 2x RTX Pro 6000
π
3
17
#1 opened about 2 months ago by
zenmagnets
New activity in
lukealonso/MiniMax-M2.1-NVFP4
about 2 months ago
one more time..? Minimax-M2.5 π₯
4
#4 opened about 2 months ago by
reneho
New activity in
mratsim/MiniMax-M2.1-FP8-INT4-AWQ
about 2 months ago
nvfp4
12
#9 opened 2 months ago by
festr2
New activity in
lukealonso/MiniMax-M2.1-NVFP4
3 months ago
This is perfect! Thank you!
π₯
1
14
#1 opened 3 months ago by
ktsaou
New activity in
lukealonso/MiniMax-M2-NVFP4
4 months ago
MinimaxM2.1
3
#5 opened 4 months ago by
reneho
New activity in
MiniMaxAI/MiniMax-M2.1
4 months ago
NVFP4?
6
#2 opened 4 months ago by
ktsaou
New activity in
lukealonso/MiniMax-M2-NVFP4
4 months ago
Devstral-2 NVFP4?
3
#3 opened 4 months ago by
reneho
New activity in
lukealonso/MiniMax-M2-NVFP4
5 months ago
you know which nightly it worked with? because it does not with current one
31
#1 opened 5 months ago by
willfalco