Xin-Qiang Cai's picture

2

Xin-Qiang Cai

caixq

https://caixq1996.github.io/

caixq1996

AI & ML interests

RL, RLHF, Learning under Weak Supervision, Diffusion Model

Recent Activity

upvoted a paper 2 days ago

VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction

authored a paper 5 months ago

PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models

authored a paper 5 months ago

Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers

View all activity

Organizations

None yet

upvoted a paper 2 days ago

VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction

Paper • 2602.12579 • Published 26 days ago • 2

authored 2 papers 5 months ago

PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models

Paper • 2507.17220 • Published Jul 23, 2025 • 1

Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers

Paper • 2510.00915 • Published Oct 1, 2025 • 2

upvoted a paper 5 months ago

Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers

Paper • 2510.00915 • Published Oct 1, 2025 • 2