mirror of
https://github.com/CCExtractor/ccextractor.git
synced 2026-02-03 21:23:48 +00:00
OCR doesn't work well in DVB when there's two lines at the same time #163
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @cfsmp3 on GitHub (Jun 21, 2016).
Originally assigned to: @cfsmp3, @anshul1912 on GitHub.
In DVB some times a bitmap contains two lines of text. In this case CCExtractor just writes the second one when using the OCR.
However in spupng the output is good.
I enabled the DEBUG_OCR stuff and what is passed to tesseract is indeed one line of text, so the problem is not with tesseract. I wonder, why are passing a different image to tessaract than the one we use for spupng?