[QUESTION] Instructions for using ccextractor with Tesseract 4 #499

Closed
opened 2026-01-29 16:45:24 +00:00 by claunia · 5 comments
Owner

Originally created by @rboy1 on GitHub (Jun 24, 2019).

Please prefix your issue with one of the following: [QUESTION].

CCExtractor version (using the --version parameter preferably) : 0.88

My familiarity with the project is as follows (check one, eg [X] - and delete unchecked ones):

  • I have never used CCExtractor.
  • I have used CCExtractor just a couple of times.
  • I absolutely love CCExtractor, but have not contributed previously.
  • I am an active contributor to CCExtractor.

Necessary information

  • Is this a regression (did it work before)? [X] NO | [ ] YES - please specify the last known working version
  • What platform did you use? [X] Windows - [ ] Linux - [ ] Mac

What are the instructions for using CCExtractor 0.88 with Tesseract 4? Does it mean Tesseract 3.04 won't work anymore?

The docs page doesn't seem to talk about this, can it updated to include that information please.

https://github.com/CCExtractor/ccextractor/blob/master/docs/OCR.md

Originally created by @rboy1 on GitHub (Jun 24, 2019). Please prefix your issue with one of the following: [QUESTION]. CCExtractor version (using the --version parameter preferably) : **0.88** **My familiarity with the project is as follows (check one, eg [X] - and delete unchecked ones):** - [ ] I have never used CCExtractor. - [ ] I have used CCExtractor just a couple of times. - [X] I absolutely love CCExtractor, but have not contributed previously. - [ ] I am an active contributor to CCExtractor. **Necessary information** - Is this a regression (did it work before)? [X] NO | [ ] YES - *please specify the last known working version* - What platform did you use? [X] Windows - [ ] Linux - [ ] Mac What are the instructions for using CCExtractor 0.88 with Tesseract 4? Does it mean Tesseract 3.04 won't work anymore? The docs page doesn't seem to talk about this, can it updated to include that information please. https://github.com/CCExtractor/ccextractor/blob/master/docs/OCR.md
Author
Owner

@soulspark666 commented on GitHub (Jun 25, 2019):

According to https://github.com/UB-Mannheim/tesseract/wiki Tesseract 5.0 exists. I think updating the ccextractor to 5.0 build and then updating the OCR.md will be a better option.

@soulspark666 commented on GitHub (Jun 25, 2019): According to https://github.com/UB-Mannheim/tesseract/wiki Tesseract 5.0 exists. I think updating the ccextractor to 5.0 build and then updating the OCR.md will be a better option.
Author
Owner

@rboy1 commented on GitHub (Jun 26, 2019):

In the meanwhile can someone confirm if 0.88 works with Tesseract v3 or only v4 or both and/or if there's any difference in the stability/quality between the two.

@rboy1 commented on GitHub (Jun 26, 2019): In the meanwhile can someone confirm if 0.88 works with Tesseract v3 or only v4 or both and/or if there's any difference in the stability/quality between the two.
Author
Owner

@rboy1 commented on GitHub (Sep 26, 2019):

Anyone thoughts?

@rboy1 commented on GitHub (Sep 26, 2019): Anyone thoughts?
Author
Owner

@NilsIrl commented on GitHub (Jan 2, 2020):

I can confirm it works with both v3 and v4.

However I've had terrible results with v4

@NilsIrl commented on GitHub (Jan 2, 2020): I can confirm it works with both v3 and v4. However I've had terrible results with v4
Author
Owner

@NilsIrl commented on GitHub (Jan 2, 2020):

What I did is I just installed tesseract4 instead of tesseract3 and then built it. Everything worked without a problem

There is already some code in ccextractor related to tesseract4:

3a1815163f/src/lib_ccx/ocr.c (L202)

@NilsIrl commented on GitHub (Jan 2, 2020): What I did is I just installed tesseract4 instead of tesseract3 and then built it. Everything worked without a problem There is already some code in ccextractor related to tesseract4: https://github.com/CCExtractor/ccextractor/blob/3a1815163fed76683d7c92b0e7aaf32772e9e074/src/lib_ccx/ocr.c#L202
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: starred/ccextractor#499