yixuan's picture

In a Training Loop 🔄

yixuan PRO

yixuantt

·

yixuantt

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs

submitted a paper about 7 hours ago

KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs

authored a paper about 1 month ago

SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment

View all activity

Organizations

upvoted a paper about 6 hours ago

KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs

Paper • 2601.01046 • Published 3 days ago • 4

upvoted a paper about 1 month ago

SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment

Paper • 2512.02807 • Published Dec 2, 2025 • 8

upvoted 2 papers 3 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 67

Fine-tuning Done Right in Model Editing

Paper • 2509.22072 • Published Sep 26, 2025 • 28

upvoted a paper 4 months ago

GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings

Paper • 2509.10844 • Published Sep 13, 2025 • 2

upvoted a paper 11 months ago

FinMTEB: Finance Massive Text Embedding Benchmark

Paper • 2502.10990 • Published Feb 16, 2025 • 6

upvoted a collection over 1 year ago

Embedding Model Datasets

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated 27 days ago • 158

upvoted 2 papers almost 2 years ago

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment

Paper • 2401.12474 • Published Jan 23, 2024 • 36

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 116

upvoted a paper over 2 years ago

Extending Context Window of Large Language Models via Positional Interpolation

Paper • 2306.15595 • Published Jun 27, 2023 • 53