Deprecated : The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
OpenRLHF (OpenRLHF)
models 10 OpenRLHF/Llama-3-8b-rm-700k Text Ranking
• 8B • Updated
Jul 28, 2025 • 57
• 3
OpenRLHF/Llama-3-8b-rm-mixture 8B • Updated
Nov 30, 2024 • 20
• 1
OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt 7B • Updated
Nov 30, 2024 • 5
• 1
OpenRLHF/Mistral-7b-PRM-Math-Shepherd 7B • Updated
Oct 30, 2024 • 2
• 1
OpenRLHF/Llama-3-8b-iter-dpo-179k Text Generation
• 8B • Updated
Jul 28, 2024 • 2
OpenRLHF/Llama-3-8b-rlhf-100k Text Generation
• 8B • Updated
Jun 24, 2024 • 6
• 4
OpenRLHF/Llama-3-8b-sft-mixture Text Generation
• 8B • Updated
Jun 14, 2024 • 1.32k
• • 1
OpenRLHF/Llama-2-7b-sft-model-ocra-500k Text Generation
• 7B • Updated
Jun 9, 2024 • 6
OpenRLHF/Llama-2-13b-rm-anthropic_hh-lmsys-oasst-webgpt 13B • Updated
Jan 24, 2024 • 2
OpenRLHF/Llama-2-13b-sft-model-ocra-500k Text Generation
• 13B • Updated
Jan 5, 2024 • 3
• 1