judge/SPC-Critic-2 · Hugging Face

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning [arXiv] [Project]

Jiaqi Chen, Bang Zhang, Ruotian Ma, Peisong Wang, Xiaodan Liang, Zhaopeng Tu, Xiaolong Li, Kwan-Yee K. Wong.

Downloads last month: 6
Safetensors · Model size: 8B params · Tensor type: BF16

Model tree for judge/SPC-Critic-2

Base model: Qwen/Qwen2.5-7B
Finetuned (3206): this model

Paper for judge/SPC-Critic-2