arxiv:2603.05369
Sky
dandingsky
AI & ML interests
None yet
Recent Activity
commented on a paper 1 day ago
Progressive Residual Warmup for Language Model Pretraining
submitted a paper 2 days ago
Progressive Residual Warmup for Language Model Pretraining
authored a paper 5 days ago
Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners