view post Post 4245 I am very sad to say that the budget in creating of SnowflakeCore-G1 1b and 7b MoE models ran out and I can't pre-train them anymore. See translation
view post Post 692 the training for SnowflakeCore-G1-1B and 7B would be retaken because now I implemented DeepSpeed and management to use two gpus. See translation
Liquid Claude FlameF0X/LFM2.5-1.2B-Distilled-Claude Text Generation • 1B • Updated 1 day ago • 2.31k • 2 FlameF0X/LFM2.5-1.2B-Distilled-Claude-GGUF Text Generation • 1B • Updated 1 day ago • 621 • 1 FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6-GGUF 1B • Updated 1 day ago • 258 FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6 Text Generation • 1B • Updated 1 day ago • 289
Chess A collection dedicated to my pre-train LMs for chess. Sleeping 1 ChessSLM 🦀 1 Play chess against an AI opponent FlameF0X/ChessSLM Text Generation • 30.3M • Updated 11 days ago • 642 • 1 FlameF0X/ChessSLM-RL Text Generation • 30.3M • Updated 11 days ago • 455 • 2 FlameF0X/ChessSLM-Neo Text Generation • 41M • Updated 9 days ago • 443 • 1
Liquid Claude FlameF0X/LFM2.5-1.2B-Distilled-Claude Text Generation • 1B • Updated 1 day ago • 2.31k • 2 FlameF0X/LFM2.5-1.2B-Distilled-Claude-GGUF Text Generation • 1B • Updated 1 day ago • 621 • 1 FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6-GGUF 1B • Updated 1 day ago • 258 FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6 Text Generation • 1B • Updated 1 day ago • 289
Chess A collection dedicated to my pre-train LMs for chess. Sleeping 1 ChessSLM 🦀 1 Play chess against an AI opponent FlameF0X/ChessSLM Text Generation • 30.3M • Updated 11 days ago • 642 • 1 FlameF0X/ChessSLM-RL Text Generation • 30.3M • Updated 11 days ago • 455 • 2 FlameF0X/ChessSLM-Neo Text Generation • 41M • Updated 9 days ago • 443 • 1