12 25 17

Huiqiang Jiang

iofu728

https://hqjiang.com/

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper about 1 month ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

authored a paper 8 months ago

Chain-of-Model Learning for Language Model

View all activity

Organizations

authored a paper about 1 month ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 96

upvoted a paper about 1 month ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 96

authored a paper 8 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

upvoted a paper 8 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

authored a paper 8 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5, 2025 • 28

upvoted a paper 8 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5, 2025 • 28

commented a paper 8 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5, 2025 • 28 •

liked a model 8 months ago

Qwen/Qwen3-235B-A22B

Text Generation • 235B • Updated Jul 26, 2025 • 245k • • 1.07k

upvoted a paper 8 months ago

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper • 2504.16083 • Published Apr 22, 2025 • 8

commented a paper 8 months ago

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper • 2504.16083 • Published Apr 22, 2025 • 8 •

liked a model 11 months ago

moonshotai/Moonlight-16B-A3B

Text Generation • 16B • Updated Feb 26, 2025 • 11k • 101

upvoted a paper 11 months ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published Jan 28, 2025 • 36

liked a model 12 months ago

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • 15B • Updated Jan 29, 2025 • 4.65k • • 331

upvoted a paper 12 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23, 2025 • 48

liked 2 models 12 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • 33B • Updated Feb 24, 2025 • 2.62M • • 1.49k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 439k • • 12.9k

upvoted a paper 12 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 287

updated a dataset about 1 year ago

microsoft/SCBench

Viewer • Updated Dec 24, 2024 • 922 • 1.04k • 8

upvoted a paper about 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

authored a paper about 1 year ago

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Paper • 2412.10319 • Published Dec 13, 2024 • 11

Huiqiang Jiang

AI & ML interests

Recent Activity

Organizations

iofu728's activity