OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models
IntermediateYufeng Zhong, Lei Chen et al.Jan 29arXiv
OCRVerse is a new AI model that can read both plain text in documents and the visual structures in charts, webpages, and science plots, all in one system.
#Holistic OCR#Vision-Language Model#Supervised Fine-Tuning