Can Large Language Models Understand Symbolic Graphics Programs?
Authors: Z Qiu, W Liu, H Feng, Z Liu, T Xiao
Published: 2024
Publication: ArXiv
Introduces SGP-Bench, a benchmark testing whether LLMs can answer semantic and spatial questions about images purely from graphics programs (SVG/CAD), effectively probing “visual imagination without vision.” The authors show current LLMs struggle - sometimes near chance - even when images are trivial for humans, but demonstrate that Symbolic Instruction Tuning (SIT) can meaningfully improve this ability and even boost general instruction-following performance.
Institution: Massachusetts Institute of Technology,Max Planck Institute, University of Cambridge
Research Area: Computational cognition, LLM evaluation, Program synthesis, Multimodal reasoning
Discipline: Artificial Intelligence