arxiv:2401.05566
Ansh Radhakrishnan
anshr
AI & ML interests
None yet
Organizations
None yet
models 14
anshr/distilgpt2_trained_policy_model_final
Text Generation • Updated • 27
anshr/distilgpt2_supervised_model_final
Text Generation • Updated • 26
anshr/distilgpt2_reward_model_final
Text Classification • Updated • 29
anshr/distilgpt2_trained_policy_model_02
Text Generation • Updated • 25
anshr/distilgpt2_reward_model_05
Text Classification • Updated • 24
anshr/distilgpt2_reward_model_04
Text Classification • Updated • 24
anshr/distilgpt2_reward_model_03
Text Classification • Updated • 26
anshr/distilgpt2_trained_policy_model_01
Text Generation • Updated • 26
anshr/distilgpt2_reward_model_02
Text Classification • Updated • 22
anshr/distilgpt2_supervised_model_01
Text Generation • Updated • 25
datasets 0
None public yet