-
Hao0oWang/CurioSFT-Qwen2.5-Math-7B-RL
8B • Updated • 2 -
Hao0oWang/CurioSFT-Qwen2.5-Math-7B-SFT
8B • Updated • 1 -
Hao0oWang/CurioSFT_Data
Viewer • Updated • 63k • 20 -
Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
Paper • 2602.02244 • Published • 1
Hao
Hao0oWang
·
AI & ML interests
None yet
Organizations
None yet
CurioSFT
-
Hao0oWang/CurioSFT-Qwen2.5-Math-7B-RL
8B • Updated • 2 -
Hao0oWang/CurioSFT-Qwen2.5-Math-7B-SFT
8B • Updated • 1 -
Hao0oWang/CurioSFT_Data
Viewer • Updated • 63k • 20 -
Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
Paper • 2602.02244 • Published • 1