YUYUNYAO
Yyy195
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 15 hours ago
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces upvoted a paper 3 months ago
When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning upvoted a paper about 1 year ago
MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in
VideosOrganizations
None yet