Tianyu Pang
P2333
AI & ML interests
Machine Learning
Recent Activity
upvoted a paper 14 minutes ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
submitted a paper 15 minutes ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
upvoted a paper 31 minutes ago
Rethinking the Divergence Regularization in LLM RL