mirror of
https://github.com/CCExtractor/ccextractor.git
synced 2026-02-17 05:25:33 +00:00
[BUG] Using -sc option crashes ccextractor #513
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @rboy1 on GitHub (Oct 17, 2019).
Please prefix your issue with one of the following: [BUG], [PROPOSAL], [QUESTION].
CCExtractor version (using the --version parameter preferably) : 0.88
In raising this issue, I confirm the following (please check boxes, eg [X] - and delete unchecked ones):
My familiarity with the project is as follows (check one, eg [X] - and delete unchecked ones):
Necessary information
-scAdditional information
Link to file which is causing the crash:
https://www.dropbox.com/s/4jiooj787e02kd3/CCExtractor%20crash.ts?dl=0
Output of cmd line:
@sp2703 commented on GitHub (Nov 3, 2019):
I'm wanting to work on this. Any leads where to start off?
@cfsmp3 commented on GitHub (Nov 3, 2019):
Start by reproducing the issue locally.
On Sun, Nov 3, 2019 at 5:24 AM sp2703 notifications@github.com wrote:
@rboy1 commented on GitHub (Nov 11, 2019):
Patched here:
https://github.com/CCExtractor/ccextractor/pull/1122
@rboy1 commented on GitHub (Jul 17, 2020):
@cfsmp3 Carlos any chance of getting a new build? It's been a while and many fixes have gone in since the last release
@cfsmp3 commented on GitHub (Jul 20, 2020):
I'll bundle a new version after GSoC (in one month or so). These days I really don't have a lot of time on my hands I'm afraid.
@rboy1 commented on GitHub (Dec 8, 2020):
@cfsmp3 do you think it’s ready for a release?
@cfsmp3 commented on GitHub (Dec 8, 2020):
Well, there's lots of bugs, but there's no one doing active work these days, so they're not going to go away magically.
I've set a bit of time this Sunday to do a release with what we currently have.
@rboy1 commented on GitHub (Dec 15, 2020):
@cfsmp3 were you able to get the new build released Carlos?
@rboy1 commented on GitHub (Jan 26, 2021):
@cfsmp3 bump on release
@rboy1 commented on GitHub (May 22, 2021):
@cfsmp3 can we expect a new release anytime soon?
@cfsmp3 commented on GitHub (May 22, 2021):
Around mid June
On Sat, May 22, 2021, 08:05 rboy1 @.***> wrote:
@rboy1 commented on GitHub (Jun 17, 2021):
@cfsmp3 Mid June is here, looking forward to it :)
@canihavesomecoffee commented on GitHub (Jun 17, 2021):
https://github.com/CCExtractor/ccextractor/releases/tag/v0.89 :)
Windows build is still WIP*. You can download the binaries here though (let me know if you can't): https://github.com/CCExtractor/ccextractor/suites/2983776538/artifacts/67339947
* We're working on a new installer and code signing, the latter is what's holding us back right now
@rboy1 commented on GitHub (Jun 18, 2021):
Awesome thanks. I noticed that it says it's compiled against tessdata 4.0alpha. Does it mean it won't work with tessdata 3?
@canihavesomecoffee commented on GitHub (Jun 18, 2021):
For Windows the libraries are embedded, so you're indeed stuck to that specific version.
I noticed you had another comment a couple of minutes ago, but it seems to have vanished.
Was that when trying the standalone binary too, or caused by the GUI exe itself?
Anyway, looks like we should add that dll to the generated artifacts too.
@rboy1 commented on GitHub (Jun 18, 2021):
Hmm, I tried using tessdata 3.04 and it seemed to work fine converting dvbsub to srt
@canihavesomecoffee commented on GitHub (Jun 18, 2021):
IIRC tessdata is not bound to the tesseract version, so that's indeed no problem :)
@rboy1 commented on GitHub (Jun 18, 2021):
On a side note I have some files with multiple dvbsub tracks but when I run ccextractor it only extracts the first track. Is there a way to get it to extract all tracks or maybe specify the track number?
@cfsmp3 commented on GitHub (Jun 18, 2021):
Maybe we should also figure out a way to build those libraries again from source :-) @Izaron did that work a few years ago and we haven't touched that ever since I think?
@cfsmp3 commented on GitHub (Jun 18, 2021):
Since tesseract 3 is no longer maintained at all, I think we should stick to 4 (which as can be see, supports the pattern recognition mode from v3, so no need to actually use v3).
@rboy1 commented on GitHub (Jun 18, 2021):
@cfsmp3 are you saying that we need to explicitly add --oem 0 to get it to work with Tesseract 3 because 0.89 is working (or am I missing something here).
For future ref, wouldn't be better if ccextractor automatically detects if it's using Tesseract 3 or 4 with an option to override using the --oem?
@cfsmp3 commented on GitHub (Jun 18, 2021):
What I pasted comes from tesseract's website. v4 supports v3's legacy engine, so there's not reason to actually have v3 around at all. If you want to use the old system, just use --oem (If I remember correctly we do expose that argument in CCExtractor).
I don't want to support legacy versions of libraries. If the tesseract maintainers have decided to stop development of v3, what's the reason for us to bother supporting both? Just use v4 and use the legacy mode if it works better.