[BUG] dvblang option doesn't work #532

Open
opened 2026-01-29 16:46:41 +00:00 by claunia · 0 comments
Owner

Originally created by @hamelg on GitHub (Dec 29, 2019).

CCExtractor detailed version info
Version: 0.88
Git commit: bc3d729e30a751feb9b854a54c085f0e81a99134
Compilation date: 2019-12-25
File SHA256: Could not open file
Libraries used by CCExtractor
Tesseract Version: 4.1.1
Leptonica Version: leptonica-1.78.0
libGPAC Version: 0.7.2-DEV
zlib: 1.2.11
utf8proc Version: 2.2.0
protobuf-c Version: 1.3.1
libpng Version: 1.6.35
FreeType
libhash
nuklear
libzvbi

In raising this issue, I confirm the following (please check boxes, eg [X] - and delete unchecked ones):

  • [ X] I have read and understood the contributors guide.
  • [ X] I have checked that the bug-fix I am reporting can be replicated, or that the feature I am suggesting isn't already present.
  • [ X] I have checked that the issue I'm posting isn't already reported.
  • [ X] I have checked that the issue I'm porting isn't already solved and no duplicates exist in closed issues and in opened issues
  • [ X] I have checked the pull requests tab for existing solutions/implementations to my issue/suggestion.
  • [ X] I have used the latest available version of CCExtractor to verify this issue exists.

My familiarity with the project is as follows (check one, eg [X] - and delete unchecked ones):

  • [ X] I have used CCExtractor just a couple of times.

Necessary information

  • Is this a regression (did it work before)? [X ] NO | [ ] YES - please specify the last known working version
  • What platform did you use? [ ] Windows - [X ] Linux - [ ] Mac

Issue description

Some french dvb channels doesn't use ISO 639-2 to specify the language for the subtitles stream. Here is an example :

Input #0, mpegts, from '1008_20191216001500.ts':
  Duration: 00:04:59.97, start: 1.400000, bitrate: 4007 kb/s
  Program 1 
    Metadata:
      service_name    : Service01
      service_provider: FFmpeg
    Stream #0:0[0x100]: Video: h264 (High) ([27][0][0][0] / 0x001B), yuv420p(tv, bt709, top first), 1920x1080 [SAR 1:1 DAR 16:9], 25 fps, 25 tbr, 90k tbn, 50 tbc
    Stream #0:1[0x101](fre): Audio: eac3 ([135][0][0][0] / 0x0087), 48000 Hz, stereo, fltp, 128 kb/s
    Stream #0:2[0x102](qaa): Audio: eac3 ([135][0][0][0] / 0x0087), 48000 Hz, stereo, fltp, 128 kb/s
    Stream #0:3[0x103](fra): Audio: eac3 ([135][0][0][0] / 0x0087), 48000 Hz, stereo, fltp, 128 kb/s (visual impaired) (descriptions)
    Stream #0:4[0x104](fre): Subtitle: dvb_subtitle ([6][0][0][0] / 0x0006) (hearing impaired)
    Stream #0:5[0x105](fre): Subtitle: dvb_subtitle ([6][0][0][0] / 0x0006)
    Stream #0:6[0x106]: Data: bin_data ([6][0][0][0] / 0x0006)

On the subtitle streams, the language code should be "fra", and not "fre".

The following command fails to find the subtitle stream :

$ ccextractor -o sbt -out=spupng -dvblang fre -ocrlang fra mythtv/1008_20191216001500/1008_20191216001500.ts
...
Analyzing data in general mode
Ignoring stream language 'und' not equal to dvblang 'fre'
Ignoring stream language 'und' not equal to dvblang 'fre'
...
No captions were found in input.

It fails because the code "fre" doesn't exist in the language array (see lib_ccx/ccx_common_constants.c).

Originally created by @hamelg on GitHub (Dec 29, 2019). CCExtractor detailed version info Version: 0.88 Git commit: bc3d729e30a751feb9b854a54c085f0e81a99134 Compilation date: 2019-12-25 File SHA256: Could not open file Libraries used by CCExtractor Tesseract Version: 4.1.1 Leptonica Version: leptonica-1.78.0 libGPAC Version: 0.7.2-DEV zlib: 1.2.11 utf8proc Version: 2.2.0 protobuf-c Version: 1.3.1 libpng Version: 1.6.35 FreeType libhash nuklear libzvbi **In raising this issue, I confirm the following (please check boxes, eg [X] - and delete unchecked ones):** - [ X] I have read and understood the [contributors guide](https://github.com/CCExtractor/ccextractor/blob/master/.github/CONTRIBUTING.md). - [ X] I have checked that the bug-fix I am reporting can be replicated, or that the feature I am suggesting isn't already present. - [ X] I have checked that the issue I'm posting isn't already reported. - [ X] I have checked that the issue I'm porting isn't already solved and no duplicates exist in [closed issues](https://github.com/CCExtractor/ccextractor/issues?q=is%3Aissue+is%3Aclosed) and in [opened issues](https://github.com/CCExtractor/ccextractor/issues) - [ X] I have checked the pull requests tab for existing solutions/implementations to my issue/suggestion. - [ X] I have used the latest available version of CCExtractor to verify this issue exists. **My familiarity with the project is as follows (check one, eg [X] - and delete unchecked ones):** - [ X] I have used CCExtractor just a couple of times. **Necessary information** - Is this a regression (did it work before)? [X ] NO | [ ] YES - *please specify the last known working version* - What platform did you use? [ ] Windows - [X ] Linux - [ ] Mac **Issue description** Some french dvb channels doesn't use ISO 639-2 to specify the language for the subtitles stream. Here is an example : ```` Input #0, mpegts, from '1008_20191216001500.ts': Duration: 00:04:59.97, start: 1.400000, bitrate: 4007 kb/s Program 1 Metadata: service_name : Service01 service_provider: FFmpeg Stream #0:0[0x100]: Video: h264 (High) ([27][0][0][0] / 0x001B), yuv420p(tv, bt709, top first), 1920x1080 [SAR 1:1 DAR 16:9], 25 fps, 25 tbr, 90k tbn, 50 tbc Stream #0:1[0x101](fre): Audio: eac3 ([135][0][0][0] / 0x0087), 48000 Hz, stereo, fltp, 128 kb/s Stream #0:2[0x102](qaa): Audio: eac3 ([135][0][0][0] / 0x0087), 48000 Hz, stereo, fltp, 128 kb/s Stream #0:3[0x103](fra): Audio: eac3 ([135][0][0][0] / 0x0087), 48000 Hz, stereo, fltp, 128 kb/s (visual impaired) (descriptions) Stream #0:4[0x104](fre): Subtitle: dvb_subtitle ([6][0][0][0] / 0x0006) (hearing impaired) Stream #0:5[0x105](fre): Subtitle: dvb_subtitle ([6][0][0][0] / 0x0006) Stream #0:6[0x106]: Data: bin_data ([6][0][0][0] / 0x0006) ```` On the subtitle streams, the language code should be "fra", and not "fre". The following command fails to find the subtitle stream : ``` $ ccextractor -o sbt -out=spupng -dvblang fre -ocrlang fra mythtv/1008_20191216001500/1008_20191216001500.ts ... Analyzing data in general mode Ignoring stream language 'und' not equal to dvblang 'fre' Ignoring stream language 'und' not equal to dvblang 'fre' ... No captions were found in input. ```` It fails because the code "fre" doesn't exist in the language array (see lib_ccx/ccx_common_constants.c).
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: starred/ccextractor#532