Yu-Shiang Huang
hyusterr
AI & ML interests
NLP, IR, RecSys, FinTech
Recent Activity
upvoted a paper about 1 month ago
Less is More: Recursive Reasoning with Tiny Networks
upvoted a collection about 1 month ago
Deepseek Papers
upvoted a paper about 1 month ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Organizations
None yet