CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding
IntermediateYuling Shi, Chaoxiang Xie et al.Feb 2arXiv
The paper tests a simple but bold idea: show code to AI as pictures instead of plain text, then shrink those pictures to save tokens and time.
#multimodal language models#code as images#visual code understanding