koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_think Text Generation • 4B • Updated 1 day ago • 26
koutch/paper_llama_llama3.1-8b_train_sft_train_no_think Text Generation • 8B • Updated 1 day ago • 41