Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.001_ep5_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 30, 2025 β’ 6
Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.001_ep10_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 30, 2025 β’ 7
Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.0001_ep10_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 30, 2025 β’ 6
Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.0001_ep5_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 30, 2025 β’ 7
Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.0001_ep1_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 8
Tristan/llama3.2_piqa_custom_splits_sft_broad_lr1e-6_wd0.0001_ep10_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 8
Tristan/sft_test_llama3.2_5e-6_5e-5_lr5e-5_wd0.001_ep5_arc_easy Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 7
Tristan/sft_test_llama3.2_5e-6_5e-5_lr5e-6_wd0.001_ep5_arc_easy Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 6
Tristan/sft_test_llama3.2_5e-6_5e-5_lr5e-6_wd0.0001_ep1_arc_easy Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 5
Tristan/sft_test_llama3.2_broad_coverage_lr1e-5_wd0.0001_ep10_arc_easy Text Generation β’ 1B β’ Updated Sep 25, 2025 β’ 4
Tristan/RedPajama-Data-V2-sample-100B-filtered-shuffled-tokenized-with-token-counts Viewer β’ Updated May 31, 2024 β’ 4.16M β’ 170
Tristan/RedPajama-Data-V2-sample-100B-filtered-for-regression-domains-with-domains Viewer β’ Updated May 24, 2024 β’ 4.16M β’ 110