agentic-moral-alignment/qwen35-9b__gtharm_pd_str_tft__gtharm_ut__native_tool__r1__gtharm_pd Updated 6 days ago
agentic-moral-alignment/qwen35-9b__gtharm_pd_str_tft__gtharm_game__native_tool__r1__gtharm_pd Updated 6 days ago
agentic-moral-alignment/qwen35-9b__ipd_str_rnd_tft__deont__native_tool__r1__bastard Updated 8 days ago • 207
agentic-moral-alignment/qwen35-9b__ipd_str_rnd_tft__deont__native_tool__r1__core Updated 9 days ago • 342
agentic-moral-alignment/qwen35-9b__ipd_str_tft__deont__native_tool__r1 Text Generation • Updated 24 days ago • 47
agentic-moral-alignment/qwen35-9b__ipd_str_tft__deont__native_notool__r20 Text Generation • Updated 24 days ago • 24
agentic-moral-alignment/qwen35-9b-grpo-unsloth-ut-tft-1000ep Viewer • Updated about 1 month ago • 7.44k • 10
agentic-moral-alignment/qwen35-9b-grpo-unsloth-game-tft-1000ep Viewer • Updated about 1 month ago • 6.48k • 13