MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 43
Iterative Layer Pruning for Efficient Translation Inference Paper • 2510.22763 • Published Oct 26, 2025
AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models Paper • 2511.14295 • Published Nov 18, 2025 • 71
Wasm: A Pipeline for Constructing Structured Arabic Interleaved Multimodal Corpora Paper • 2511.07080 • Published Nov 10, 2025 • 31
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR Paper • 2509.18174 • Published Sep 17, 2025 • 128
Saudi-Dialect-ALLaM: LoRA Fine-Tuning for Dialectal Arabic Generation Paper • 2508.13525 • Published Aug 19, 2025 • 1
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale Paper • 2509.14008 • Published Sep 17, 2025 • 88
Exploring the Landscape for Generative Sequence Models for Specialized Data Synthesis Paper • 2411.01929 • Published Nov 4, 2024 • 2
Optimizing Deep Neural Networks using Safety-Guided Self Compression Paper • 2505.00350 • Published May 1, 2025 • 1
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic Paper • 2509.01363 • Published Sep 1, 2025 • 58
SpeechT: Findings of the First Mentorship in Speech Translation Paper • 2502.12050 • Published Feb 17, 2025 • 1
Bemba Speech Translation: Exploring a Low-Resource African Language Paper • 2505.02518 • Published May 5, 2025
Efficient Speech Translation through Model Compression and Knowledge Distillation Paper • 2505.20237 • Published May 26, 2025 • 1
Language Modelling Approaches to Adaptive Machine Translation Paper • 2401.14559 • Published Jan 25, 2024
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model Paper • 2505.17894 • Published May 23, 2025 • 220