CurveBench
A vision benchmark for testing whether models can infer hierarchical containment trees from images of nested, non-intersecting curves.
CurveBench: A Benchmark for Exact Topological Reasoning over Nested Jordan Curves • Paper • 2605.14068 • Published 27 days ago • 8
AmirMohseni/CurveBench-Easy • Updated 24 days ago • 600 • 199 • 1
AmirMohseni/CurveBench • Updated 24 days ago • 912 • 196 • 1
AmirMohseni/curvebench-qwen3-vl-8b • Updated 24 days ago • 147
Legal QA
Legal Conversation Explorer ⚖ • Sleeping • 3 • Explore legal conversations on an interactive topic map
Agents Legal Classifier ⚖ • Sleeping • Route WildChat conversations with ModernBERT encoders
AmirMohseni/WildChat-Legal-Classification-V2-Balanced • Updated 18 days ago • 4.24k • 302
AmirMohseni/WildChat-Legal-Classification-V2-LegalOnly • Updated 18 days ago • 2.12k • 41