OctoThinker (OctoThinker)

Organization Card

šŸ™ OctoThinker is led by GAIR

🎯 Our Goal: To reshape the pre-training trajectory so models scale better under RL.

Check our technical report for more details!
