SpatialBot: Precise Spatial Understanding with Vision Language Models
Paper • 2406.13642 • Published •
2
RussRobin/SpatialBench · Datasets at Hugging Face
This repository is publicly accessible, but you have to accept the conditions to access its files and content.
Log in or Sign Up to review the conditions and access this dataset content.
SpatialBench evaluates model performance on spatial understanding. We design positional, existence, counting, reaching and size comparasion tasks.
In this HF dataset, SpatialBench RGB & Depth images, questions, answers and meta data are provided.
https://arxiv.org/abs/2406.13642
https://github.com/BAAI-DCAI/SpatialBot