wenhua cheng

wenhuach · wenhuach21

AI & ML interests

Model Compression, CV

Recent Activity

new activity 5 days ago

Intel/Qwen3.5-35B-A3B-int4-AutoRound: Thanks! And MTP key question

new activity 5 days ago

Intel/GLM-5-int4-mixed-AutoRound: vLLM fails to serve Intel/GLM-5-int4-mixed-AutoRound on NVIDIA DGX Spark (GB10, sm121) due to no valid MLA attention backend (qk_nope_head_dim 192)

liked a model 6 days ago

kaitchup/Qwen3.5-27B-autoround-W4A16

authored a paper 3 months ago

SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs

Paper • 2512.04746 • Published Dec 4, 2025 • 14

authored 2 papers over 2 years ago

TEQ: Trainable Equivalent Transformation for Quantization of LLMs

Paper • 2310.10944 • Published Oct 17, 2023 • 10

Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs

Paper • 2309.05516 • Published Sep 11, 2023 • 11