A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge

SOK-Bench

Our work aims to delve deeper into reasoning evaluations, specifically within dynamic, open-world, and structured context knowledge. SOK-Bench consists of 44K questions and 10K situations with instance-level annotations depicted in the videos. The reasoning process is required to understand and apply situated knowledge and general knowledge for problem-solving.

Preview

SOK-Bench consists of 44K questions and 10K situations with instance-level annotations depicted in the videos. The reasoning process is required to understand and apply situated knowledge and general knowledge for problem-solving.

Data Example


Data Download


Data Overview

44K Situated Questions

10K Situation Video Clips

Situation Commonsense Knowledge Graphs

Annotation Statistics


Paper

Link to Paper

@inproceedings{SOK-Bench, author = {Wang*, Andong and Wu*, Bo and Chen, Sunli and Chen, Zhenfang and Guan, Haotian and Lee, Wei-Ning and Li, Erran Li and Tenenbaum, Joshua B and Gan, Chuang}, title = {SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge}, booktitle = {The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2024} }

Author


Bo Wu*
MIT-IBM Watson AI Lab

Sunli Chen
Tsinghua University

Zhenfang Chen
MIT-IBM Watson AI Lab

Haotian Guan
The University of Hong Kong

Wei-Ning Lee
The University of Hong Kong

Li Erran Li
AWS AI

Chuang Gan
UMass Amherst, MIT-IBM Watson AI Lab