arxiv:2606.09426
wen
Wen36666
AI & ML interests
None yet
Recent Activity
authored a paper about 10 hours ago
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces liked a dataset about 15 hours ago
wanlilll/WeaveBench upvoted a paper about 15 hours ago
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid InterfacesOrganizations
None yet