Rosie Zhao
rosieyzh
AI & ML interests
theory of machine learning, deep learning
Qwen2.5-1.5B SFT - Unstructured Code
rosieyzh/sft_qwen15_swallow_lr_1e-5_cosine_bsz_{64,128}_ckpt_{i}_of_5
rosieyzh/sft_qwen15_swallow_lr_1e-5_cosine_bsz_128_ckpt_1_of_5
2B • Updated • 4
rosieyzh/sft_qwen15_swallow_lr_1e-5_cosine_bsz_128_ckpt_2_of_5
2B • Updated • 5
rosieyzh/sft_qwen15_swallow_lr_1e-5_cosine_bsz_128_ckpt_3_of_5
2B • Updated • 14
rosieyzh/sft_qwen15_swallow_lr_1e-5_cosine_bsz_128_ckpt_4_of_5
2B • Updated • 3
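The collection name above uses a brace pattern (`bsz_{64,128}`, `ckpt_{i}_of_5`). As a rough sketch, the pattern can be expanded into concrete repository IDs as shown below. This assumes `i` runs from 1 to 5, which the `_of_5` suffix suggests but the listing does not confirm: only checkpoints 1 to 4 at batch size 128 are shown here. The names `PATTERN` and `expand_checkpoint_ids` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Template taken from the collection name; {bsz} and {i} replace the
# brace placeholders {64,128} and {i} in the original pattern.
PATTERN = "rosieyzh/sft_qwen15_swallow_lr_1e-5_cosine_bsz_{bsz}_ckpt_{i}_of_5"


def expand_checkpoint_ids(batch_sizes=(64, 128), num_ckpts=5):
    """Expand the pattern into candidate repo IDs.

    Assumption: checkpoints are numbered 1..num_ckpts. Whether every
    candidate actually exists on the Hub is not verified here.
    """
    return [
        PATTERN.format(bsz=bsz, i=i)
        for bsz, i in product(batch_sizes, range(1, num_ckpts + 1))
    ]


repo_ids = expand_checkpoint_ids()
for repo_id in repo_ids:
    print(repo_id)
```

Each resulting string is only a candidate ID; it would still need to be checked against the Hub before being used to load a checkpoint.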
Synthetic Multimodal Datasets
Datasets used in "Understanding the Design Space and Cross-Modality Transfer for Vision-Language Models"
models 563
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_10_of_10
2B • Updated • 1
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_9_of_10
2B • Updated • 1
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_8_of_10
2B • Updated • 1
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_7_of_10
2B • Updated • 1
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_6_of_10
2B • Updated • 2
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_5_of_10
2B • Updated • 1
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_4_of_10
2B • Updated • 1
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_3_of_10
2B • Updated • 1
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_2_of_10
2B • Updated • 1
rosieyzh/rlvr_qwen15_gsm8k_rbz_642_epochs_ckpt_1_of_10
2B • Updated • 1
datasets 22
rosieyzh/Visual-TableQA-formatted
Viewer • Updated • 8.23k • 13
rosieyzh/tinygsm_fobinary_workspace_depth1to9_traindepth5_tokenized
Viewer • Updated • 7.87M • 80
rosieyzh/tinygsm_fobinary_workspace_depth1to9_traindepth5
Viewer • Updated • 7.87M • 34
rosieyzh/tinygsm_fobinary_obs_depth1to9_traindepth5_tokenized
Viewer • Updated • 7.87M • 102
rosieyzh/tinygsm_fobinary_no_obs_depth1to9_traindepth5_tokenized
Viewer • Updated • 7.87M • 209
rosieyzh/tinygsm_fobinary_obs_depth1to9_traindepth5
Viewer • Updated • 7.87M • 796
rosieyzh/tinygsm_fobinary_no_obs_depth1to9_traindepth5
Viewer • Updated • 7.87M • 193
rosieyzh/tinygsm_fopython_workspace_depth1to9_traindepth5
Viewer • Updated • 7.87M • 507
rosieyzh/tinygsm_fodirect_depth1to9_traindepth5
Viewer • Updated • 7.87M • 22
rosieyzh/tinygsm_fopython_obs_depth1to9_traindepth5
Viewer • Updated • 7.87M • 359