arxiv:2412.01800
hangyu guo
Rosiness
AI & ML interests
Natural Language Processing
Recent Activity
upvoted
a
paper
about 21 hours ago
mHC: Manifold-Constrained Hyper-Connections
upvoted
a
paper
3 days ago
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
upvoted
a
paper
9 days ago
Scaling Laws for Code: Every Programming Language Matters