Case fixing in teletext #45

Closed
opened 2026-01-29 16:33:53 +00:00 by claunia · 1 comment
Owner

Originally created by @cfsmp3 on GitHub (Mar 3, 2015).

Originally assigned to: @Abhinav95 on GitHub.

For American closed captions CCExtractor is able to apply some case conversion rules so instead of all caps we get reasonably correct case.

We need to apply the same logic to Teletext (and well, DVB).

Originally created by @cfsmp3 on GitHub (Mar 3, 2015). Originally assigned to: @Abhinav95 on GitHub. For American closed captions CCExtractor is able to apply some case conversion rules so instead of all caps we get reasonably correct case. We need to apply the same logic to Teletext (and well, DVB).
Author
Owner

@Abhinav95 commented on GitHub (May 24, 2016):

#368 fixes this for teletext.

Old sample output:-

8
00:00:20,020 --> 00:00:24,500
WELL, FIVE IS PARTWAY
BETWEEN THREE... NEVER MIND.
9
00:00:24,540 --> 00:00:25,580
I'LL TELL YOU WHAT.
10
00:00:25,620 --> 00:00:27,180
HOW ABOUT WE GO
ROCK-PAPER-SCISSORS?
11
00:00:27,220 --> 00:00:28,380
OOH, I DON'T THINK SO.
12
00:00:28,420 --> 00:00:30,140
ANECDOTAL EVIDENCE SUGGESTS

New sample output with -sc:-

8
00:00:20,020 --> 00:00:24,500
Well, five is partway
between three... Never mind.
9
00:00:24,540 --> 00:00:25,580
I'll tell you what.
10
00:00:25,620 --> 00:00:27,180
How about we go
rock-paper-scissors?
11
00:00:27,220 --> 00:00:28,380
Ooh, I don't think so.
12
00:00:28,420 --> 00:00:30,140
Anecdotal evidence suggests

All existing tests pass.

@Abhinav95 commented on GitHub (May 24, 2016): #368 fixes this for teletext. Old sample output:- ``` 8 00:00:20,020 --> 00:00:24,500 WELL, FIVE IS PARTWAY BETWEEN THREE... NEVER MIND. 9 00:00:24,540 --> 00:00:25,580 I'LL TELL YOU WHAT. 10 00:00:25,620 --> 00:00:27,180 HOW ABOUT WE GO ROCK-PAPER-SCISSORS? 11 00:00:27,220 --> 00:00:28,380 OOH, I DON'T THINK SO. 12 00:00:28,420 --> 00:00:30,140 ANECDOTAL EVIDENCE SUGGESTS ``` New sample output with -sc:- ``` 8 00:00:20,020 --> 00:00:24,500 Well, five is partway between three... Never mind. 9 00:00:24,540 --> 00:00:25,580 I'll tell you what. 10 00:00:25,620 --> 00:00:27,180 How about we go rock-paper-scissors? 11 00:00:27,220 --> 00:00:28,380 Ooh, I don't think so. 12 00:00:28,420 --> 00:00:30,140 Anecdotal evidence suggests ``` All existing tests pass.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: starred/ccextractor#45