ThinkPRM Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 launch/ThinkPRM-1.5B Text Generation β’ 2B β’ Updated Jun 25, 2025 β’ 382 β’ 3 launch/ThinkPRM-7B Text Generation β’ 8B β’ Updated May 17, 2025 β’ 38 β’ 1 launch/ThinkPRM-14B Text Generation β’ 15B β’ Updated Jul 1, 2025 β’ 133 β’ 6 mradermacher/ThinkPRM-7B-i1-GGUF 8B β’ Updated Jul 11, 2025 β’ 477
ThinkPRM Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 launch/ThinkPRM-1.5B Text Generation β’ 2B β’ Updated Jun 25, 2025 β’ 382 β’ 3 launch/ThinkPRM-7B Text Generation β’ 8B β’ Updated May 17, 2025 β’ 38 β’ 1 launch/ThinkPRM-14B Text Generation β’ 15B β’ Updated Jul 1, 2025 β’ 133 β’ 6 mradermacher/ThinkPRM-7B-i1-GGUF 8B β’ Updated Jul 11, 2025 β’ 477