Bolian Li
lblaoke
AI & ML interests
None yet
Recent Activity
authored a paper about 6 hours ago
More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in
DPO Safety Alignment authored a paper about 6 hours ago
DRIFT: Learning from Abundant User Dissatisfaction in Real-World
Preference Learning authored a paper about 6 hours ago
Learning Self-Correction in Vision-Language Models via Rollout Augmentation