ElliotGao
tclf90
AI & ML interests
None yet
Recent Activity
liked a model about 1 hour ago: Soul-AILab/SoulX-Duplug-0.6B
new activity 2 days ago in QuantTrio/Qwen3.5-27B-AWQ: "AWQ 4-bit version of this Opus-Distilled-v2 model?"
updated a collection 2 days ago: Qwen3.5-AWQ
AWQ 4-bit version of this Opus-Distilled-v2 model?
8 · #5 opened 3 days ago by 0xburakcelik
--max-model-len 32768 seems a bit too small for agent use cases?
3 · #3 opened 21 days ago by edwarddukewu
Install & run QuantTrio/MiniMax-M2-AWQ easily using llmpm
👍 1 · 1 · #8 opened 21 days ago by sarthak-saxena
My personal vLLM launch cmd on my old personal 2x3090 workstation
4 · #1 opened about 1 month ago by tclf90
Can't get vLLM running on 1x RTX 4090
3 · #1 opened about 1 month ago by slyfox1186
Easy to fall into infinite loop
👍 1 · 7 · #2 opened about 1 month ago by dwaynedu
GLM-5-AWQ vLLM Deployment Guide
👍 1 · 2 · #2 opened about 1 month ago by CharlesChen2023
Great work
5 · #1 opened about 1 month ago by JoeyHwong
How to run this model on SGLang?
1 · #2 opened about 1 month ago by Salvadori
Anyone else getting only exclamation marks?
15 · #3 opened about 1 month ago by Halbin
QuantTrio/Qwen3.5-397B-A17B-AWQ response is !
8 · #5 opened about 1 month ago by duyuting
GPTQ int4-int8 mixed
2 · #4 opened about 1 month ago by darkstar3537
--max-model-len 32768?
1 · #1 opened 2 months ago by pathosethoslogos
The model startup using vLLM failed.
10 · #5 opened 3 months ago by beausoft
Accessing LLM, response without <think> start tag
5 · #2 opened 3 months ago by sudage
MiniMax-M2.1 AWQ please
2 · #6 opened 3 months ago by mtcl
How do you run this?
3 · #1 opened 3 months ago by mtcl
Once again thanks, here is my review of an 8x RTX 5090 setup
17 · #2 opened 3 months ago by crystech
Can this model be quantized?
3 · #4 opened 4 months ago by tinging
Endless response
3 · #5 opened 4 months ago by ramidahbash