General OCR Theory – Towards OCR-2.0 via a Unified End-to-end Model – HF Transformers implementation

https://huggingface.co/stepfun-ai/GOT-OCR-2.0-hf

GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.

pIXELsHAM