DCAgent/g1_min_episodes_e1_gpt_long_sampled_swesmith_psu_thinking_tacc-Qwen3-32B Text Generation • Updated 1 day ago • 436
DCAgent/g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc-Qwen3-32B Text Generation • Updated 1 day ago • 847
DCAgent/g1_min_episodes_e1_gpt_long_thinking_tacc-Qwen3-32B Text Generation • Updated 1 day ago • 745
DCAgent/d1_constrain_then_harden_top4_seq_glm47 Text Generation • 308k • Updated 6 days ago • 118 • 1