inference-optimization/Qwen3-Next-80B-A3B-Thinking-FP8-block • Text Generation • 80B • Updated about 8 hours ago • 40
inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8-block • Text Generation • 80B • Updated about 9 hours ago • 76
inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8-dynamic • Text Generation • 80B • Updated about 9 hours ago • 80
inference-optimization/Qwen3-Next-80B-A3B-Thinking-NVFP4 • Text Generation • Updated about 9 hours ago • 22
inference-optimization/Qwen3-Next-80B-A3B-Instruct-NVFP4 • Text Generation • Updated about 9 hours ago • 31
inference-optimization/Qwen3-Next-80B-A3B-Thinking-FP8-dynamic • Text Generation • 80B • Updated about 10 hours ago • 72
Qwen3-Next-80B-A3B Quantized Models • Collection • FP8-dynamic, FP8-block, NVFP4, INT4, and INT8 versions of the Qwen3-Next-80B-A3B-Instruct and Qwen3-Next-80B-A3B-Thinking models • 8 items • Updated about 10 hours ago