Deprecated : The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
ReasoningMila (Mila Reasoning)
models 10 ReasoningMila/math_train_gold_qs_all_64_synthetic_soln_480k Updated Mar 17, 2025
ReasoningMila/hendricks_math_7500_train_synthetic_corr_soln Updated Mar 16, 2025
ReasoningMila/polIter_qwen2.5_math_1.5B_inst_ppo_MATH_ckpt__iter_0047__epoch_2.00_step_1504 Updated Feb 10, 2025
ReasoningMila/math_synthetic_raw Updated Feb 7, 2025
ReasoningMila/polIter_qwen2.5_math_inst_1.5B_genppo_MATH_ckpt_iter_0008_epoch_2.00_step_0448 Updated Feb 6, 2025
ReasoningMila/polIter_qwen2.5_math_inst_1.5B_genppo_MATH_ckpt_iter_0008_epoch_2.00_step_0512 Updated Jan 31, 2025
ReasoningMila/ver_gen_partial_ft_model_meta-llama_Llama-32-1B_checkpoint-5634 Text Generation
• 1B • Updated Jan 12, 2025 • 3
• ReasoningMila/ver_partial_ft_model_meta-llama_Llama-32-3B_checkpoint-4224 Text Generation
• 3B • Updated Jan 8, 2025 • 3
ReasoningMila/ver_partial_ft_model_meta-llama_Llama-32-1B_checkpoint-4224 Text Generation
• 1B • Updated Jan 6, 2025 • 4
• ReasoningMila/math_partial_ft_model_meta-llama_Llama-32-3B_checkpoint-681 Text Generation
• 3B • Updated Dec 22, 2024 • 1
datasets 15 ReasoningMila/Training_gen_dataset Viewer
• Updated Jun 8, 2025 • 7.5k • 47
ReasoningMila/ServiceNowAI_R1_Distill_SFT_with_problems_and_responses Viewer
• Updated May 22, 2025 • 1.68M • 1.31k
ReasoningMila/math7500_1_wrong_soln_wrt_human_gold Viewer
• Updated Apr 4, 2025 • 6.08k • 4
ReasoningMila/wrong_solutions_dataset_of_30k_verified_qs Viewer
• Updated Mar 25, 2025 • 22.4k • 8
ReasoningMila/syn_qs_and_soln_cleaned_0_and_less20_multiple_soln_per_qs_1937545 Viewer
• Updated Mar 23, 2025 • 1.94M • 67
ReasoningMila/syn_qs_and_soln_cleaned_0_and_less20_1_soln_per_qs_131845 Viewer
• Updated Mar 23, 2025 • 132k • 5
ReasoningMila/syn_qs_soln_dat_cleaned_1_soln_per_qs_41k Viewer
• Updated Mar 22, 2025 • 41.3k • 17
ReasoningMila/Verifier_filtered_datasets Preview
• Updated Mar 21, 2025 • 5
ReasoningMila/merged_output_qs_only_exact_dedup_90_cleaned_dataset_morethan10resp_clip16_sorted Viewer
• Updated Mar 2, 2025 • 215k • 10
ReasoningMila/merged_output_qs_only_exact_dedup_90_dedup_350k Viewer
• Updated Mar 1, 2025 • 217k • 46