arxiv:2507.01949
Xiao Hu
huxiao09
AI & ML interests
Reinforcement Learning, LLM Reasoning
Recent Activity
upvoted an article about 2 months ago
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries
upvoted a paper 5 months ago
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
liked a model 7 months ago
Kwai-Keye/Keye-VL-671B-A37B
Organizations
None yet