huxiao09 (Xiao Hu)

AI & ML interests

Reinforcement Learning, LLM Reasoning

Recent Activity

upvoted an article about 2 months ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

upvoted a paper 5 months ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

liked a model 7 months ago

Kwai-Keye/Keye-VL-671B-A37B

Organizations

None yet

Papers 5

arxiv:2507.01949

arxiv:2505.21067

arxiv:2505.02835

arxiv:2402.03046

Models 0

None public yet

Datasets 0

None public yet