Deprecated : The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
MNC-LLM (Jihun Kim)
2
followers ·
1 following AI & ML interests None yet
Organizations models 33 MNC-LLM/batch1_epochs4_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu32 Text Generation
• 7B • Updated Jan 3, 2024 • 3
MNC-LLM/batch1_epochs1_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 Text Generation
• Updated Dec 12, 2023 • 3
MNC-LLM/Mistral-7B-NWS-u2k-Marcoroni-prompt-found-LaAdMoAl-ep4lr5 Text Generation
• Updated Dec 12, 2023 • 4
MNC-LLM/Mistral-7B-NWS-u2k-merge-Marcoroni-LaAdMoAl-ep4-lr5 Text Generation
• Updated Dec 11, 2023 • 4
MNC-LLM/batch1_epochs4_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 Text Generation
• Updated Dec 11, 2023 • 3
MNC-LLM/batch1_epochs2_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu32 Updated Dec 11, 2023
MNC-LLM/Mistral-7B-NWS-u2k-merge-Marcoroni Text Generation
• Updated Dec 11, 2023 • 4
MNC-LLM/Mistral-7B-LaAdMoAl-merge-Marcoroni Text Generation
• Updated Dec 11, 2023 • 3
MNC-LLM/tulu-2-dpo-7B-NWSCot-600-ep4lr5 Text Generation
• Updated Dec 7, 2023 • 2
MNC-LLM/Tulu-2-DPO-7B-NWSO-5k-4ep-lr5 Text Generation
• Updated Dec 5, 2023 • 3