arxiv:2502.07780
Tang
Shengkun
AI & ML interests
None yet
Recent Activity
upvoted a paper 12 days ago
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training
published a dataset 22 days ago
Shengkun/chemistry_dataset
submitted a paper about 1 month ago
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training