Pinned
Our MathVision benchmark is accepted by NeurIPS DB Track, 2024! We show a notable performance gap between current LMMs and human performance on simple math problems with visual context.
Dataset: huggingface.co/datasets/MathL…
Paper: arxiv.org/pdf/2402.14804




















