jacobmorrison/dpo-yolo1-200k-gpt4.1-judge-2weak2strong-maxdelta_rejected-DECON-remove-gemma3 Viewer • Updated Oct 14, 2025 • 182k • 15
jacobmorrison/Nemotron-Post-Training-Dataset-v2-reasoning-chat Viewer • Updated Aug 27, 2025 • 546k • 38
jacobmorrison/olmo-2-1124-7b-preference-mix-filtered-overlapping Viewer • Updated Aug 12, 2025 • 258k • 14
jacobmorrison/qwen3-30b-3a-coder-no-reasoning-combined-outputs Viewer • Updated Aug 12, 2025 • 2M • 93