Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual)

(github.com)

167 points | by ses425500000 3 days ago

38 comments