Deprecated : The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
zen-E (Zhenyi Shen)
7
followers ·
16 following AI & ML interests LLM Reasoning
Recent Activity Organizations None yet
zen-E/qwen3-4b-instruct-grpo-dapo-2epoch-8k Updated Apr 29
zen-E/qwen3-4b-instruct-grpo-dapo-1epoch-16k Updated Apr 29
zen-E/qwen3-8b-think-math-step100-opsd Updated Apr 19
zen-E/qwen3-8b-think-math-step500-grpo Updated Apr 19
zen-E/qwen3-8b-base-math-step700-grpo Updated Apr 12
zen-E/opsd_qwen3_1b_hybrid_factor0p01_lennorm_adv_ckpt1160 Updated Jan 21
zen-E/opsd_qwen3-1b_factor0p0001_gtcot Updated Jan 14
zen-E/qwen3_1b_base_opsd_hybrid_lennormalize_step300 Updated Dec 30, 2025
zen-E/qwen3_1b_base_opsd_hybrid_gencot_factor01 Updated Dec 29, 2025
zen-E/off-policy_student-qwen3-1b-base_teacher-qwen25-math-1b_math_e1 Updated Dec 22, 2025
zen-E/grpo_nokl_qwen3_1b_e20_last_ckpt Updated Dec 19, 2025
zen-E/grpo_nokl_qwen3_1b_e20 Updated Dec 19, 2025
1B • Updated Sep 11, 2025 zen-E/CODI-llama3.2-1b-Instruct Updated Jun 4, 2025
zen-E/bert-mini-sentence-distil-unsupervised-pca Updated Oct 3, 2023
zen-E/bert-mini-sentence-distil-supervised Feature Extraction
• Updated Oct 3, 2023 • 3
zen-E/bert-mini-sentence-distil-unsupervised Feature Extraction
• Updated Oct 3, 2023 • 3
Reinforcement Learning
• Updated Jul 15, 2023 zen-E/q-FrozenLake-v1-4x4-noSlippery Reinforcement Learning
• Updated Jul 14, 2023 zen-E/deepspeed-chat-step2-model-opt350m Text Generation
• Updated Apr 27, 2023 • 3
• 1
zen-E/deepspeed-chat-step3-rlhf-actor-model-opt1.3b Text Generation
• Updated Apr 27, 2023 • 2
• 1
zen-E/deepspeed-chat-step1-model-opt1.3b Text Generation
• Updated Apr 24, 2023 • 2
• 2