Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
pb09204048 (Hejian Sang)

Hejian Sang's picture

Hejian Sang

pb09204048

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training

authored a paper 2 months ago

TIP: Token Importance in On-Policy Distillation

upvoted a paper 2 months ago

TIP: Token Importance in On-Policy Distillation

View all activity

Organizations

Articles 1

Article

79

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Papers 3

arxiv:2604.14084

arxiv:2602.21420

arxiv:2510.00237

models 0

None public yet

datasets 0

None public yet