Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
7
30
30
Sangwoo Park
Sangsang
Follow
invincible-jha's profile picture
21world's profile picture
jiongdao's profile picture
15 followers
·
30 following
swgger
AI & ML interests
I do LLM post-training research (KAIST AI)
Recent Activity
updated
a model
4 days ago
Sangsang/feedback_asymmetric_kl_fixed_ema_Qwen2.5-7B-Instruct_bw0p75_fw0p25_ema0p999_ep30
published
a model
4 days ago
Sangsang/feedback_asymmetric_kl_fixed_ema_Qwen2.5-7B-Instruct_bw0p75_fw0p25_ema0p999_ep30
updated
a model
4 days ago
Sangsang/feedback_asymmetric_kl_fixed_ema_Qwen2.5-7B-Instruct_bw0p25_fw0p75_ema0p999_ep30
View all activity
Organizations
None yet
Sangsang
's models
218
Sort: Recently updated
Sangsang/CI-7B-Feedback-merged
Text Generation
•
8B
•
Updated
Mar 4
•
14
Sangsang/CI-7B-SFT-merged
Text Generation
•
8B
•
Updated
Mar 4
•
152
Sangsang/Qwen2.5-7B-Instruct-ci-rl
Text Generation
•
Updated
Mar 4
Sangsang/Qwen2.5-7B-Instruct-feedback
Text Generation
•
Updated
Mar 4
Sangsang/Llama-3.1-8B-Instruct_CI-RL-ep30
Text Generation
•
Updated
Feb 23
•
2
Sangsang/Qwen2.5-14B-Instruct_pm_think_ep5
15B
•
Updated
Feb 18
•
1
Sangsang/DeepSeek-R1-Distill-Qwen-14B_pm_ep5
15B
•
Updated
Feb 18
•
1
Sangsang/Qwen2.5-7B-Instruct_pm_think_ep5
8B
•
Updated
Feb 18
•
1
Sangsang/DeepSeek-R1-Distill-Qwen-7B_pm_ep5
8B
•
Updated
Feb 18
•
1
Sangsang/thinksafe-r1-1.5B-ablation_R32_BZ64_Gen8
Text Generation
•
Updated
Jan 26
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-32-pm-e3
Text Generation
•
Updated
Jan 20
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-32-pm-e2
Text Generation
•
Updated
Jan 20
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-16-pm-e3
Text Generation
•
Updated
Jan 20
Sangsang/R1-8B-thinksafe-r1-8B-ablation-32-pm-e3
Text Generation
•
Updated
Jan 20
Sangsang/R1-8B-thinksafe-r1-8B-ablation-32-pm-e2
Text Generation
•
Updated
Jan 20
•
2
Sangsang/R1-8B-thinksafe-r1-8B-ablation-16-pm-e3
Text Generation
•
Updated
Jan 19
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-16-pm-e2
Text Generation
•
Updated
Jan 19
Sangsang/R1-8B-thinksafe-r1-8B-ablation-16-pm-e2
Text Generation
•
Updated
Jan 19
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-16-pm-e1
Text Generation
•
Updated
Jan 19
Sangsang/R1-8B-thinksafe-r1-8B-ablation-16-pm-e1
Text Generation
•
Updated
Jan 19
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-64-pm-e3
Text Generation
•
Updated
Jan 19
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-64-pm-e2
Text Generation
•
Updated
Jan 19
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-64-pm-e1
Text Generation
•
Updated
Jan 19
Sangsang/qwen3-4B-thinksafe-4B-n1-ablation-64-pm-e3
Text Generation
•
Updated
Jan 19
Sangsang/R1-8B-thinksafe-r1-8B-ablation-64-pm-e3
Text Generation
•
Updated
Jan 19
•
3
Sangsang/R1-8B-thinksafe-r1-8B-ablation-64-pm-e2
Text Generation
•
Updated
Jan 19
Sangsang/R1-8B-thinksafe-r1-8B-ablation-64-pm-e1
Text Generation
•
Updated
Jan 19
•
1
Sangsang/R1-7B-thinksafe-r1-7B-ablation-64-pm-e3
Text Generation
•
Updated
Jan 19
Sangsang/R1-7B-thinksafe-r1-7B-ablation-64-pm-e2
Text Generation
•
Updated
Jan 18
Sangsang/R1-7B-thinksafe-r1-7B-ablation-64-pm-e1
Text Generation
•
Updated
Jan 18
Previous
1
2
3
4
5
...
8
Next