Deprecated : The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
weqweasdas (Wei Xiong)
20
followers
·
21 following
AI & ML interests
Machine learning, RLHF
Organizations
models
23
weqweasdas/zephyr-7b-dpo-full
Text Generation
•
7B
•
Updated
May 3, 2024
•
85
weqweasdas/zephyr-7b-gemma-dpo
Updated
May 1, 2024
weqweasdas/zephyr-7b-sft-full
Updated
Apr 30, 2024
weqweasdas/zephyr-7b-dpo-qlora
Updated
Apr 30, 2024
weqweasdas/gpt2-cpt-dutch
Text Generation
•
0.1B
•
Updated
Apr 29, 2024
•
3
weqweasdas/zephyr-7b-gemma-sft
Updated
Apr 29, 2024
weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6_weight085
Text Generation
•
7B
•
Updated
Apr 16, 2024
weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6
Text Generation
•
7B
•
Updated
Apr 16, 2024
weqweasdas/raft_baseline_zephyr_packing_model6
Text Generation
•
7B
•
Updated
Apr 15, 2024
weqweasdas/raft_baseline_openchat_llama13b_model1
Text Generation
•
7B
•
Updated
Apr 14, 2024
weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition
Viewer
•
Updated
Oct 26, 2025
•
5k
•
8
weqweasdas/ultrafeedback_binarized_processed
Viewer
•
Updated
Oct 4, 2025
•
61.1k
•
2
weqweasdas/qwen7b_prompt_difficult
Viewer
•
Updated
Sep 29, 2025
•
15.7k
•
7
weqweasdas/qwen7b_openr1_with_scores_sub
Viewer
•
Updated
Sep 28, 2025
•
57.7k
•
3
weqweasdas/qwen7b_openr1_with_scores_filtered_0375
Viewer
•
Updated
Sep 25, 2025
•
24.3k
•
6
weqweasdas/qwen7b_openr1_with_scores
Viewer
•
Updated
Sep 23, 2025
•
75k
•
3
weqweasdas/from_default_filtered_openr1_with_scores_filtered_05_and_filtered_allwrong
Viewer
•
Updated
Sep 18, 2025
•
25k
•
12
Viewer
•
Updated
Sep 16, 2025
•
1.68k
•
43
weqweasdas/dapo_with_scores
Viewer
•
Updated
Sep 16, 2025
•
13k
•
5
weqweasdas/dapo_and_openr1_can_be_evaluated_by_daporm_deduplicate_with_scores
Viewer
•
Updated
Sep 16, 2025
•
34.1k
•
2