Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs • arXiv:2505.04519 • Published May 7, 2025
Rethinking Optimization and Architecture for Tiny Language Models • arXiv:2402.02791 • Published Feb 5, 2024
MeKi: Memory-based Expert Knowledge Injection for Efficient LLM Scaling • arXiv:2602.03359 • Published 4 days ago
Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity • arXiv:2505.21411 • Published May 27, 2025
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting • arXiv:2404.18911 • Published Apr 29, 2024
PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation • arXiv:2312.17276 • Published Dec 27, 2023