arxiv:2605.07865
Minjae Oh
Riasok
·
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding authored a paper 1 day ago
KL for a KL: On-Policy Distillation with Control Variate Baseline