Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
wenhua cheng's picture
32 10 31

wenhua cheng

wenhuach
sbrandeis's profile picture sameuldavid's profile picture kramp's profile picture
Β·
  • wenhuach21

AI & ML interests

Model Compression, CV

Recent Activity

new activity 5 days ago
Intel/Qwen3.5-35B-A3B-int4-AutoRound:Thanks! And MTP key question
new activity 5 days ago
Intel/GLM-5-int4-mixed-AutoRound:vLLM fails to serve Intel/GLM-5-int4-mixed-AutoRound on NVIDIA DGX Spark (GB10, sm121) due to no valid MLA attention backend (qk_nope_head_dim 192)
liked a model 6 days ago
kaitchup/Qwen3.5-27B-autoround-W4A16
View all activity

Organizations

Intel's profile picture Need4Speed's profile picture Qwen's profile picture

authored a paper 3 months ago

SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs

Paper β€’ 2512.04746 β€’ Published Dec 4, 2025 β€’ 14
authored 2 papers over 2 years ago

TEQ: Trainable Equivalent Transformation for Quantization of LLMs

Paper β€’ 2310.10944 β€’ Published Oct 17, 2023 β€’ 10

Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs

Paper β€’ 2309.05516 β€’ Published Sep 11, 2023 β€’ 11
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs