Models (10)
varsunk/unsloth_training_checkpoints
varsunk/Qwen3-4B-LORA-GRPO-Experiment • Text Generation • 6
varsunk/Qwen3-8B-GRPO-test
varsunk/Qwen3-8B-Base-GRPO-test
varsunk/Qwen2-1.5B-Instruct-GRPO-test
varsunk/Qwen2-0.5B-Instruct-GRPO-test-GRPO-test
varsunk/Qwen2-0.5B-GRPO-diagnose
varsunk/Qwen2-0.5B-GRPO-test
varsunk/Qwen2-0.5B-Instruct-GRPO-test
varsunk/Qwen2.5-7B-Instruct-GRPO-test