view article Article Building Blocks for Foundation Model Training and Inference on AWS amazon • 5 days ago • 20
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis Paper • 2603.20278 • Published Mar 17 • 98
📝 Research & Long-Form Blog Posts Collection In-depth technical articles and research pieces published by Hugging Face • 14 items • Updated 11 days ago • 21
view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 624
view article Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases QuentinJG • Nov 5, 2025 • 64
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers tomaarsen, arthurbresnu • Jul 1, 2025 • 138
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 412
view article Article Training and Finetuning Reranker Models with Sentence Transformers tomaarsen • Mar 26, 2025 • 194
view article Article Assisted Generation: a new direction toward low-latency text generation joaogante • May 11, 2023 • 78
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face dvgodoy • Feb 11, 2025 • 123
view article Article Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers davidberenstein1957 • Feb 5, 2025 • 10
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB davidberenstein1957 • Jan 27, 2025 • 22
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK davidberenstein1957 • Nov 21, 2024 • 35
Training with Prompts Collection See the Training with Prompts documentation for more details: https://sbert.net/examples/training/prompts/README.html • 5 items • Updated Apr 10 • 3
view article Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets dvilasuero • Jun 4, 2024 • 79
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer Pringled • Oct 14, 2024 • 104