CCExtractor fails to extract srt,webvtt but spupng is ok from the same track #533

Closed
opened 2026-01-29 16:46:41 +00:00 by claunia · 3 comments

Originally created by @Murmur on GitHub (Dec 30, 2019).

I made a small TS test file and a text file with detailed debug information.
http://akini.mbnet.fi/videos/test1_dvbsub.ts
http://akini.mbnet.fi/videos/test1_readme.txt

The file has two dvbsub streams (0xc37 works for srt, vtt and png extraction; 0xc4f works only for png extraction):

Stream #0:2[0xc37](fin): Subtitle: dvb_subtitle ([6][0][0][0] / 0x0006)
Stream #0:4[0xc4f](swe): Subtitle: dvb_subtitle ([6][0][0][0] / 0x0006) (hearing impaired)

CCExtractor test commands:

ccextractor -codec dvbsub -utf8 -nobom -nofc -noteletext -nocodec teletext -nomyth -nots \
  -oem 2 -quant 2 \
  -in=ts "test1_dvbsub.ts" -datapid 0xc37 \
  -out=srt -o test1_c37.srt -delay 0

ccextractor -codec dvbsub -utf8 -nobom -nofc -noteletext -nocodec teletext -nomyth -nots \
  -oem 2 -quant 2 \
  -in=ts "test1_dvbsub.ts" -datapid 0xc4f \
  -out=srt -o test1_c4f.srt -delay 0

ccextractor -codec dvbsub -utf8 -nobom -nofc -noteletext -nocodec teletext -nomyth -nots \
  -oem 2 -quant 2 \
  -in=ts "test1_dvbsub.ts" -datapid 0xc4f \
  -out=spupng -o test1_c4f.xml -delay 0

Results for text extraction
out=srt and out=webvtt work fine for the 0xc37 track but produce empty output for the 0xc4f dvbsub track.
out=spupng works fine for both dvbsub tracks.

This means I have to use a standalone tesseract command-line app to convert the PNG files to text, and then use a Python script or something similar to write the final srt and webvtt text files.
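
The workaround described above could be sketched roughly as follows. This is only an illustration, not part of CCExtractor: all function names are hypothetical, and it assumes the spupng XML contains `<spu>` elements with `start`, `end` and `image` attributes, with times given either as plain seconds or as `HH:MM:SS.mmm`. It also assumes the `tesseract` command-line tool is installed with Swedish (`swe`) language data, and that the image paths in the XML resolve from the current directory.

```python
import subprocess
import xml.etree.ElementTree as ET


def parse_time(value):
    """Return seconds from either '12.345' or 'HH:MM:SS.mmm' (both forms assumed)."""
    seconds = 0.0
    for part in value.split(":"):
        seconds = seconds * 60 + float(part)
    return seconds


def srt_timestamp(seconds):
    """Format seconds as the HH:MM:SS,mmm form used by SRT."""
    ms = round(seconds * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def read_cues(xml_path):
    """Yield (start, end, image) per <spu> element; attribute names are an assumption."""
    for spu in ET.parse(xml_path).getroot().iter("spu"):
        yield parse_time(spu.get("start")), parse_time(spu.get("end")), spu.get("image")


def ocr_png(image_path, lang="swe"):
    """Run the tesseract CLI on one image and return the recognized text."""
    result = subprocess.run(
        ["tesseract", image_path, "stdout", "-l", lang],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


def build_srt(cues):
    """Build SRT text from (start_seconds, end_seconds, text) tuples, skipping empty text."""
    blocks = []
    for start, end, text in cues:
        if not text:
            continue
        index = len(blocks) + 1
        blocks.append(f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n")
    return "\n".join(blocks)


def convert(xml_path, srt_path, lang="swe"):
    """OCR every subtitle image listed in the XML and write one SRT file."""
    cues = [(start, end, ocr_png(image, lang)) for start, end, image in read_cues(xml_path)]
    with open(srt_path, "w", encoding="utf-8") as out:
        out.write(build_srt(cues))
```

With the spupng output from the third command, a call such as `convert("test1_c4f.xml", "test1_c4f.srt")` would be the intended usage. If the real XML layout differs from the assumptions above, only `read_cues` should need adjusting.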

claunia added the difficulty: easy and Hacktoberfest labels 2026-01-29 16:46:41 +00:00

@mridullpandey commented on GitHub (May 26, 2020):

Hi, can I take this up?


@cfsmp3 commented on GitHub (May 26, 2020):

No need to ask - you can work on anything that is open :-)

On Tue, May 26, 2020 at 3:49 AM mridulpandey notifications@github.com wrote:

hi can i take this up


@cfsmp3 commented on GitHub (Mar 22, 2023):

@Murmur that link no longer works. Please reopen the issue if you still have problems and can provide a sample.

Reference: starred/ccextractor#533