Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
Keven16 (Wenkai Yang)

Wenkai Yang's picture

Wenkai Yang

Keven16

·

https://keven980716.github.io/

keven980716

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation

authored a paper 12 days ago

Rethinking Continual Experience Internalization for Self-Evolving LLM Agents

upvoted a paper 13 days ago

Rethinking Continual Experience Internalization for Self-Evolving LLM Agents

View all activity

Organizations

None yet

Collections 1

Papers 16

arxiv:2606.04703

arxiv:2604.13016

arxiv:2603.14465

arxiv:2602.12125

models 16

Keven16/Qwen3-4B-Non-Thinking-RL-Code-Step300

4B • Updated Mar 16 • 304 • 1

Keven16/Qwen3-4B-Non-Thinking-RL-Math-Step500

4B • Updated Mar 16 • 1.62k

Keven16/Qwen2.5-7B-LaSeR

8B • Updated Oct 15, 2025 • 4

Keven16/OctoThinker-3B-Short-LaSeR

4B • Updated Oct 15, 2025 • 5

Keven16/ORZ-7B-LaSeR

8B • Updated Oct 15, 2025 • 2

Keven16/DeepCritic-7B-RL1.5-PRM800K

8B • Updated Jun 25, 2025 • 1

Keven16/DeepCritic-7B-RL1.5-Numina

8B • Updated Jun 23, 2025 • 2

Keven16/Qwen2.5-32B-TOPS-Iter-DPO-Preview

33B • Updated May 15, 2025 • 2

Keven16/Qwen2.5-32B-TOPS

33B • Updated May 15, 2025 • 2

Keven16/Qwen2.5-32B-TOPS-Iter-DPO

33B • Updated May 15, 2025 • 1

datasets 6

Keven16/OPSD-Example-Data

Viewer • Updated Mar 18 • 49.1k • 32

Keven16/G-OPD-Training-Data

Viewer • Updated Feb 17 • 134k • 1.47k • 3

Keven16/LaSeR_training_data

Viewer • Updated Oct 16, 2025 • 104k • 14 • 2

Keven16/TOPS-Data

Preview • Updated Oct 7, 2025 • 22

Keven16/DeepCritic-RL-Data

Viewer • Updated May 13, 2025 • 55k • 11

Keven16/DeepCritic-4.5K

Preview • Updated May 13, 2025 • 19