Teletext page number dropped in -UCLA switch #143

Closed
opened 2026-01-29 16:36:18 +00:00 by claunia · 5 comments

Originally created by @Liontooth on GitHub (May 19, 2016).

In the latest GitHub version of CCExtractor, the teletext page number is no longer listed in the third field:

20160219101902.280|20160219101905.360|TLT|-Может быть, вы забыли, я занимаюсь полковником Лыковым.

The github version from mid-March shows the teletext page number and the string TLT:

20160219101902.280|20160219101905.360|888TLT|-Может быть, вы забыли, я занимаюсь полковником Лыковым.

What we actually want is the teletext number with no TLT string:

20160219101902.280|20160219101905.360|888|-Может быть, вы забыли, я занимаюсь полковником Лыковым.

So a recent change removed the wrong part of the output: it dropped the page number and kept the TLT string. The teletext page number is useful information and we'd like to keep it.
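
For anyone checking their own output against the three variants above, the expected shape can be sketched as a small parser. This is only an illustration in Python, assuming the pipe-separated layout (start, end, third field, text) seen in the samples; `parse_line` and `has_bare_page_number` are hypothetical helper names and not part of CCExtractor.

```python
def parse_line(line):
    """Split one output line into (start, end, third_field, text)."""
    # Split at most three times so any '|' inside the caption text is preserved.
    start, end, third, text = line.rstrip("\n").split("|", 3)
    return start, end, third, text


def has_bare_page_number(line, page="888"):
    """True if the third field is exactly the teletext page number."""
    return parse_line(line)[2] == page


# The three variants quoted in this report.
text = "-Может быть, вы забыли, я занимаюсь полковником Лыковым."
current = "20160219101902.280|20160219101905.360|TLT|" + text
mid_march = "20160219101902.280|20160219101905.360|888TLT|" + text
wanted = "20160219101902.280|20160219101905.360|888|" + text

for label, line in (("current", current), ("mid-March", mid_march), ("wanted", wanted)):
    print(label, has_bare_page_number(line))
```

Only the `wanted` line passes the check; the other two carry either the TLT string alone or the page number fused with it.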

Command:

$ ccextractor -debug -608 -datets -ttxt -autoprogram -tpage 888 -UCLA -noru -utf8 --nofontcolor -parsepat -parsepmt -unixts $BTIM -o $FIL.ccx.out $FIL.mpg

Sample file:

- http://vrnewsscape.ucla.edu/dropbox/2016-02-19_1019_RU_TVC_News.mpg

On the other hand, the mid-March github version would typically fail to use Cyrillic for the very last line; this bug (not reported) has disappeared in the current version.

Cheers,
David

@Abhinav95 commented on GitHub (May 19, 2016):

Hi David.

I introduced this change inadvertently while fixing another issue with the -ucla switch, and will try to fix it for you.

Could you confirm that you need only the teletext page number in the third field, while for other types of subtitles it should contain their mode (for instance ...|...|ISDB|... for ISDB subtitles)?

I will set it accordingly.
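
The rule being asked about can be sketched roughly as follows. This is a Python illustration only, not the actual CCExtractor code (which is written in C); the function name `third_field` and its parameters are hypothetical, and falling back to the mode string when no teletext page is known is an assumption.

```python
def third_field(mode, teletext_page=None):
    """Return the third field of an output line.

    Teletext subtitles get the bare page number; every other subtitle
    type gets its mode string (for example "ISDB").
    """
    # Assumption: if no page number is known, fall back to the mode string.
    if mode == "TLT" and teletext_page is not None:
        return str(teletext_page)
    return mode


print(third_field("TLT", 888))
print(third_field("ISDB"))
```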


@bigharshrag commented on GitHub (May 19, 2016):

Hi David,
I would also like to add that I had noticed the bug in which Cyrillic would not be used for the last line and it was subsequently fixed (via #335).


@Abhinav95 commented on GitHub (May 19, 2016):

#360 should fix this.

Sample output is now:

20160519190355.280|20160519190358.360|888|-Может быть, вы забыли, я занимаюсь полковником Лыковым.


@Liontooth commented on GitHub (May 19, 2016):

@Abhinav95 -- great, that's exactly right. I'll test it.
@bigharshrag -- thank you for noticing!


@Liontooth commented on GitHub (May 20, 2016):

I can confirm #360 fixes this issue; please merge!
