KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs Paper ⢠2601.01046 ⢠Published 3 days ago ⢠4
SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment Paper ⢠2512.02807 ⢠Published Dec 2, 2025 ⢠8
Glyph: Scaling Context Windows via Visual-Text Compression Paper ⢠2510.17800 ⢠Published Oct 20, 2025 ⢠67
GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings Paper ⢠2509.10844 ⢠Published Sep 13, 2025 ⢠2
FinMTEB: Finance Massive Text Embedding Benchmark Paper ⢠2502.10990 ⢠Published Feb 16, 2025 ⢠6
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers ⢠70 items ⢠Updated 27 days ago ⢠158
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment Paper ⢠2401.12474 ⢠Published Jan 23, 2024 ⢠36
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper ⢠2402.13753 ⢠Published Feb 21, 2024 ⢠116
Extending Context Window of Large Language Models via Positional Interpolation Paper ⢠2306.15595 ⢠Published Jun 27, 2023 ⢠53