In a Training Loop 🔄

Jisoo Kim PRO

kuotient

AI & ML interests

NLP

Recent Activity

liked a dataset 18 days ago

nvidia/Nemotron-Personas-Korea

liked a model 19 days ago

deepseek-ai/DeepSeek-V4-Flash

liked a model 20 days ago

Qwen/Qwen3.6-27B

Organizations

upvoted a changelog about 1 month ago

Hugging Face Changelog

Agent Traces on the Hub

Apr 7 • 128

upvoted a paper 2 months ago

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Paper • 2603.01571 • Published Mar 2 • 33

upvoted an article 3 months ago

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 159

upvoted 2 articles 5 months ago

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

sionic-ai • Dec 8, 2025 • 57

Article

Transformers v5: Simple model definitions powering the AI ecosystem

lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 310

upvoted a paper 6 months ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 53

upvoted an article 6 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

codelion • Nov 3, 2025 • 65

upvoted a paper 7 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10, 2025 • 87

upvoted 2 articles 8 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 187

Article

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

anakin87 • Sep 4, 2025 • 30

upvoted a paper 9 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129

upvoted a collection 9 months ago

Tool Use Reasoning

Collection

A collection of tool use reasoning datasets in Hermes format • 5 items • Updated Jul 23, 2025 • 9

upvoted 2 articles 9 months ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

kuotient • Aug 9, 2025 • 57

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 98

upvoted an article about 1 year ago

Article

Training Large Language Models with Interpreter Feedback using WebAssembly

axolotl-ai-co • Apr 3, 2025 • 14

upvoted 2 papers about 1 year ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published Mar 7, 2025 • 27

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26, 2025 • 65

upvoted a paper over 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 379

upvoted an article over 1 year ago

Article

The Beginners Guide to Cleaning a Dataset

cfahlgren1 • Nov 18, 2024 • 24

upvoted a paper over 1 year ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140
