Qwen3 models (123M/300M/600M) trained from scratch on 2.47B kk+ru tokens. Includes tokenizer, datasets, and checkpoints.
Saken Tukenov PRO
stukenov
AI & ML interests
None yet
Recent Activity
updated a model about 5 hours ago
stukenov/sozkz-core-gptoss-2b-mul-dense-merge-v1 published a model about 5 hours ago
stukenov/sozkz-core-gptoss-2b-mul-dense-merge-v1 liked a model 1 day ago
AIDC-AI/Marco-Mini-BaseOrganizations
models 75
stukenov/sozkz-core-gptoss-2b-mul-dense-merge-v1
Text Generation • 2B • Updated
stukenov/sozkz-morphbpe-256k-kk-v1
Token Classification • Updated
stukenov/sozkz-fix-mt5-50m-kk-gec-v1
Text Generation • 50.6M • Updated • 17
stukenov/sozkz-nllb-1b-kk-pretrain-v1
Translation • 1B • Updated • 51
stukenov/sozkz-nllb-1b-kk-gec-v1
1B • Updated • 64
stukenov/sozkz-fix-mt5b-kk-gec-run13-v1
Text Generation • 0.6B • Updated • 7
stukenov/sozkz-fix-qwen-500m-kk-gec-v1
Text Generation • 0.4B • Updated • 246
stukenov/sozkz-fix-qwen-500m-kk-gec-v2
Text Generation • 0.4B • Updated • 96
stukenov/sozkz-fix-qwen-500m-kk-gec-v3
Text Generation • 0.4B • Updated • 665
stukenov/sozkz-fix-qwen-500m-kk-gec-v4
Text Generation • 0.4B • Updated • 662
datasets 54
stukenov/sozkz-corpus-tokenized-kk-morphbpe256k-v1
Viewer • Updated • 1.51M • 37
stukenov/sozkz-corpus-segmented-kk-v1
Viewer • Updated • 55.5M • 381
stukenov/sozkz-corpus-gec-benchmark-kk-v1
Viewer • Updated • 1.44k • 163
stukenov/sozkz-corpus-pretrain-gec-mix-v1
Viewer • Updated • 1.77M • 80
stukenov/sozkz-corpus-synthetic-kk-gec-rulebased-v1
Viewer • Updated • 1.06M • 33
stukenov/sozkz-corpus-synthetic-kk-gec-v1
Viewer • Updated • 19.3k • 61
stukenov/sozkz-gec-synthetic-gpt4o-v1
Viewer • Updated • 9.6k • 87
stukenov/sozkz-corpus-clean-v3
Viewer • Updated • 13.5M • 36
stukenov/sozkz-corpus-instruct-kk-alpaca-qwen35-v1
Viewer • Updated • 4.88k • 21 • 1
stukenov/kaznet-crawl-raw
Viewer • Updated • 1.55M • 6 • 1