Creating 2D Spatial Reasoning Data Without LLMs / VLMs: A Deterministic Curriculum Trial

Quick Links: GitHub Repository | Dataset Sample Introduction Spatial reasoning remains a significant challenge for language models, particularly in tasks requiring 2D navigation and visual-spatial understanding. Current approaches typically rely on either training large vision-language models (VLMs) on visual data or using language models to generate training examples through expensive API calls. This post explores what might be considered an unconventional (and possibly naive) approach: deterministic generation of spatial reasoning data without requiring any LLMs or VLMs in the data creation process. Rather than using models to generate training examples, this experiment algorithmically creates what we hope are realistic learning trajectories that simulate how spatial reasoning competency might develop over time. ...

January 16, 2025 | 8 min | Aman Priyanshu