DynaMath (DynaMath Team)

Organization Card

Welcome to the official repository for DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision-Language Models. This repository contains the code, resources, and documentation supporting our paper. DynaMath is a benchmark designed to rigorously evaluate the mathematical reasoning of vision-language models (VLMs), with a focus on how robust that reasoning is.

For further details, including the benchmark leaderboard, please see our project website and our preprint.

