ConicCat/AntiRepV0.2
Viewer • Updated • 471 • 10 • 1
TODO: improve model card
Trained from Apertus-8B Base using liger kernel with a two step Anchored SFT > LD-DPO recipe.
Alpaca template, no system.
.7 temp, top_p .95, no rep pen or dry