feat: spatial reasoning tasks #493

MagdalenaKotynia · 2025-03-28T20:11:12Z

Purpose

Tasks to test the spatial reasoning capabilities of VLMs in tool-calling agents.

Proposed Changes

Added spatial reasoning tasks - true/false questions to the given images.

Issues

Need for testing the spatial reasoning capabilities of VLMs.

Testing

python src/rai_bench/rai_bench/examples/spatial_tool_calling_agent_bench.py

Note

The results of tasks depend not only on the spatial capabilities of the models but also on the tool-calling capabilities of the model. I suggest refactoring it in the future to make the spatial reasoning benchmark independent from the tool calling capabilities of the models.

MagdalenaKotynia added 2 commits April 1, 2025 10:30

feat: spatial reasoning tasks

fa889c9

chore: changed rack to cabinet to be more precise

4cde7bb

MagdalenaKotynia force-pushed the mk/feat/spatial-reasoning-tasks branch from dea1177 to 4cde7bb Compare April 1, 2025 08:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: spatial reasoning tasks #493

feat: spatial reasoning tasks #493

MagdalenaKotynia commented Mar 28, 2025 •

edited

Loading

feat: spatial reasoning tasks #493

Are you sure you want to change the base?

feat: spatial reasoning tasks #493

Conversation

MagdalenaKotynia commented Mar 28, 2025 • edited Loading

Purpose

Proposed Changes

Issues

Testing

MagdalenaKotynia commented Mar 28, 2025 •

edited

Loading