deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • 33B • Updated Feb 24, 2025 • 2.62M • • 1.49k
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct Text Generation • 8B • Updated Oct 31, 2024 • 8.14k • • 13
Running on CPU Upgrade Featured 994 Model Memory Utility 🚀 994 Calculate vRAM needed for model training and inference