Deprecated : The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
Anna4242 (D)
3
followers ·
5 following AI & ML interests None yet
Organizations models 25 Anna4242/qwen25-7b-multihop-grpo-checkpoint-200 8B • Updated Dec 2, 2025 • 1
Anna4242/qwen25-7b-singlehop-grpo-checkpoint-200 8B • Updated Dec 2, 2025 Anna4242/qwen25-3b-instruct-grpo-merged 3B • Updated Nov 29, 2025 Anna4242/qwen25-3b-base-grpo Text Generation
• Updated Nov 29, 2025 • 2
Anna4242/qwen25-7b-full-sft-multihop 8B • Updated Nov 28, 2025 • 1
Anna4242/qwen25-3b-full-sft-multihop 3B • Updated Nov 28, 2025 Anna4242/qwen25-7b-sft-grpo-checkpoint-200 Reinforcement Learning
• Updated Nov 28, 2025 Anna4242/qwen25-3b-original-sft-ep1-grpo-checkpoint-200 Text Generation
• Updated Nov 27, 2025 • 1
Anna4242/Qwen2.5-7B-Instruct-onlyrl-step-1000 8B • Updated Nov 26, 2025 Anna4242/Qwen2.5-7B-Instruct-Singlehop-SFT 8B • Updated Nov 25, 2025 • 1
datasets 23 Anna4242/grpo-training-plots Viewer
• Updated Nov 29, 2025 • 1.41k • 6
Anna4242/tool-n1-combined-3-6-9-hop-corrected-split Viewer
• Updated Nov 13, 2025 • 8.12k • 8
Anna4242/triton-bench-verifiers Viewer
• Updated Nov 13, 2025 • 184 • 5
Anna4242/tool-n1-combined-3-6-9-hop-corrected Viewer
• Updated Nov 10, 2025 • 8.12k • 6
Anna4242/TritonBench_G_v1 Viewer
• Updated Nov 8, 2025 • 184 • 6
Anna4242/TritonBench_T_v1 Viewer
• Updated Nov 8, 2025 • 166 • 4
Anna4242/toucan-multiturn-output Viewer
• Updated Nov 4, 2025 • 20 • 4
Anna4242/bfcl-v4-memory-verifiers-new Preview
• Updated Oct 29, 2025 • 5
• 1
Anna4242/tool-n1-sft-combined-standardized Viewer
• Updated Sep 18, 2025 • 321k • 6
Anna4242/tool-n1-sft-dataset-original-backup Viewer
• Updated Sep 18, 2025 • 5.5k • 5