Sometimes I finetune models specifically to take on expert roles in a MoE configuration, sometimes I find interesting models others have fine tuned.
Rasmus Rasmussen
theprint
AI & ML interests
Small model experiments and homespun datasets.
Recent Activity
updated a dataset about 2 hours ago
theprint/TextAnalysis published a dataset about 2 hours ago
theprint/TextAnalysis updated a model 2 days ago
theprint/Llama3.2-3B-Math-gsm8k-AutoSFT