Understanding Differential Transformer Unchains Pretrained Self-Attentions Paper • 2505.16333 • Published May 22, 2025