OctoThinker

community

https://github.com/GAIR-NLP/OctoThinker

AI & ML interests

None defined yet.

Recent Activity

Pengfei authored a paper about 2 months ago

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

SinclairWang authored a paper 8 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

SinclairWang updated a model 8 months ago

OctoThinker/OctoThinker-3B-Hybrid-Zero

View all activity

authored a paper about 2 months ago

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

Paper • 2601.03111 • Published Jan 6 • 10

authored a paper 8 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22, 2025 • 64

updated 18 models 8 months ago

OctoThinker/OctoThinker-3B-Hybrid-Zero

Text Generation • 4B • Updated Jul 12, 2025 • 6 • 1

OctoThinker/OctoThinker-3B-Hybrid-Base

Text Generation • 3B • Updated Jul 12, 2025 • 1.42k • 1

OctoThinker/OctoThinker-3B-Short-Zero

Text Generation • 4B • Updated Jul 12, 2025 • 8 • 1

OctoThinker/OctoThinker-3B-Short-Base

Text Generation • 3B • Updated Jul 12, 2025 • 15

OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_100B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_general_ins_89_10_1_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_91_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_general_ins_89_10_1_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_91_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_finemath_4p_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/OctoThinker-3B-Long-Zero

Text Generation • 4B • Updated Jul 6, 2025 • 5

OctoThinker/OctoThinker-1B-Short-Zero

Text Generation • 1B • Updated Jul 6, 2025 • 4

OctoThinker/OctoThinker-1B-Hybrid-Zero

Text Generation • 1B • Updated Jul 6, 2025

OctoThinker/OctoThinker-1B-Long-Zero

Text Generation • 1B • Updated Jul 6, 2025 • 5

OctoThinker/OctoThinker-3B-Long-Base

Text Generation • 3B • Updated Jul 6, 2025 • 6

OctoThinker/OctoThinker-1B-Short-Base

Text Generation • 1B • Updated Jul 6, 2025 • 1