HKUST NLP Group

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

yuzhen17 authored a paper 5 days ago

SWE-RM: Execution-free Feedback For Software Engineering Agents

lockon updated a dataset about 1 month ago

hkust-nlp/Toolathlon-Trajectories

Junteng new activity about 1 month ago

hkust-nlp/WebExplorer-QA:Add metadata (license, task categories, tags) and link to HF paper

View all activity

yuzhen17

authored a paper 5 days ago

SWE-RM: Execution-free Feedback For Software Engineering Agents

Paper • 2512.21919 • Published 11 days ago • 9

lockon

updated a dataset about 1 month ago

hkust-nlp/Toolathlon-Trajectories

Preview • Updated Dec 5, 2025 • 2.47k • 18

Junteng

in hkust-nlp/WebExplorer-QA about 1 month ago

Add metadata (license, task categories, tags) and link to HF paper

#3 opened 4 months ago by

nielsr

SivilTaram

authored a paper about 2 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 128

AndrewZeng

authored 6 papers 2 months ago

MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning

Paper • 2412.08946 • Published Dec 12, 2024

AgentRefine: Enhancing Agent Generalization through Refinement Tuning

Paper • 2501.01702 • Published Jan 3, 2025

On the Perception Bottleneck of VLMs for Chart Understanding

Paper • 2503.18435 • Published Mar 24, 2025 • 1

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Paper • 2505.22203 • Published May 28, 2025 • 6

CareBot: A Pioneering Full-Process Open-Source Medical Language Model

Paper • 2412.15236 • Published Dec 12, 2024 • 1

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 45

PeterV09

authored a paper 2 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 45

yuzhen17

authored a paper 2 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 45

lockon

updated a collection 2 months ago

Toolathlon

Collection

2 items • Updated Oct 30, 2025

lockon

published a dataset 2 months ago

hkust-nlp/Toolathlon-Trajectories

Preview • Updated Dec 5, 2025 • 2.47k • 18

SivilTaram

authored a paper 4 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 83

SivilTaram

authored 3 papers 6 months ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16, 2025 • 42

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9, 2025 • 23

ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention

Paper • 2507.01004 • Published Jul 1, 2025 • 10

yuzhen17

authored a paper 7 months ago

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Paper • 2505.22203 • Published May 28, 2025 • 6

ShiqiChen

authored a paper 7 months ago

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Paper • 2505.19641 • Published May 26, 2025 • 68

AI & ML interests

Recent Activity

Team members 15

hkust-nlp's activity

Add metadata (license, task categories, tags) and link to HF paper