Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model about 5 hours ago: inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.5bits
published a model about 5 hours ago: inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.5bits
updated a model about 5 hours ago: inference-optimization/Qwen3-30B-A3B-Instruct-2507-5bits