Byung-Kwan Lee's picture

Byung-Kwan Lee

BK-Lee

·

https://sites.google.com/view/byungkwanlee

AI & ML interests

Vision Language Models

Recent Activity

authored a paper 4 days ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

upvoted a paper 5 days ago

SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling

upvoted a paper 5 days ago

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

View all activity

Organizations

upvoted 3 papers 5 days ago

SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling

Paper • 2512.23162 • Published 7 days ago • 9

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

Paper • 2512.20927 • Published 11 days ago • 6

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published 12 days ago • 17

upvoted 2 papers 10 days ago

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published 12 days ago • 27

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published 12 days ago • 28

upvoted a paper 14 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 17 days ago • 23

upvoted a paper 18 days ago

Zoom-Zero: Reinforced Coarse-to-Fine Video Understanding via Temporal Zoom-in

Paper • 2512.14273 • Published 19 days ago • 7

upvoted 2 papers 27 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 244

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 148

upvoted 4 papers about 1 month ago

RefineBench: Evaluating Refinement Capability of Language Models via Checklists

Paper • 2511.22173 • Published Nov 27, 2025 • 14

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 92

Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs

Paper • 2511.16664 • Published Nov 20, 2025 • 26

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 108

upvoted a paper about 2 months ago

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published Nov 6, 2025 • 27

upvoted 4 papers 2 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 55

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 83

Unified Reinforcement and Imitation Learning for Vision-Language Models

Paper • 2510.19307 • Published Oct 22, 2025 • 30

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21, 2025 • 86

upvoted a paper 3 months ago

MultiVerse: A Multi-Turn Conversation Benchmark for Evaluating Large Vision and Language Models

Paper • 2510.16641 • Published Oct 18, 2025 • 4

upvoted a paper 4 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 84