One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling Paper • 2601.03111 • Published Jan 6 • 10
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning Paper • 2507.16812 • Published Jul 22, 2025 • 64
OctoThinker/OctoThinker-3B-Hybrid-Base Text Generation • 3B • Updated Jul 12, 2025 • 1.42k • 1
OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_general_ins_89_10_1_bs4M_seq8k_20B Text Generation • Updated Jul 7, 2025
OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_91_bs4M_seq8k_20B Text Generation • Updated Jul 7, 2025
OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_general_ins_89_10_1_bs4M_seq8k_20B Text Generation • Updated Jul 7, 2025
OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_91_bs4M_seq8k_20B Text Generation • Updated Jul 7, 2025