DavidAU/Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning Text Generation • 8B • Updated 2 days ago • 214 • 50
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 13 days ago • 60
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF Text Generation • 18B • Updated Dec 1, 2025 • 53.4k • 454
nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct Text Generation • 8B • Updated Apr 17, 2025 • 401 • 121