> saved as > in teletext-subs saved with color-information #86

Closed
opened 2026-01-29 16:34:47 +00:00 by claunia · 4 comments
Owner

Originally created by @hurda on GitHub (Nov 18, 2015).

CCExtractor 0.77 and git-677fee4
File: http://www.mediafire.com/download/uryjrncc88747mr/great-than-sign.7z (30MB)

Without color-information

1
00:00:01,080 --> 00:00:02,720
>Überhaupt nicht.

2
00:00:02,880 --> 00:00:04,200
Warum?

With color information:

1
00:00:01,080 --> 00:00:02,720
>Überhaupt nicht.

2
00:00:02,880 --> 00:00:04,200
<font color="#ffff00">Warum?</font>

Probably due to 763972ca4b/src/lib_ccx/telxcc.c (L173)

I'm reporting this as e.g. MPC-HC and WDTV Media Player set-topbox are showing &gt; as-is and not as >.

Originally created by @hurda on GitHub (Nov 18, 2015). CCExtractor 0.77 and git-677fee4 File: http://www.mediafire.com/download/uryjrncc88747mr/great-than-sign.7z (30MB) Without color-information ``` 1 00:00:01,080 --> 00:00:02,720 >Überhaupt nicht. 2 00:00:02,880 --> 00:00:04,200 Warum? ``` With color information: ``` 1 00:00:01,080 --> 00:00:02,720 &gt;Überhaupt nicht. 2 00:00:02,880 --> 00:00:04,200 <font color="#ffff00">Warum?</font> ``` Probably due to https://github.com/CCExtractor/ccextractor/blob/763972ca4b541a4890304130f03e1616c0ea661a/src/lib_ccx/telxcc.c#L173 I'm reporting this as e.g. MPC-HC and WDTV Media Player set-topbox are showing `&gt;` as-is and not as `>`.
Author
Owner

@canihavesomecoffee commented on GitHub (Nov 19, 2015):

Yeah, you're right that it's most likely caused by that conversion table. The question is however, in how far this is posing an issue... Looks like it's been in there since the initial git, so maybe @cfsmp3 can give some insight why it's in there :)

@canihavesomecoffee commented on GitHub (Nov 19, 2015): Yeah, you're right that it's most likely caused by that conversion table. The question is however, in how far this is posing an issue... Looks like it's been in there since the initial git, so maybe @cfsmp3 can give some insight why it's in there :)
Author
Owner

@hurda commented on GitHub (Nov 20, 2015):

Unfortunately petrkutalek removed the original repo, and thus the commit-history.

But I hunted down a fork of the original repo, and I found the reason for this conversion: searchengine-mode.
d694940537

Is that relevant when the subtitle-files aren't being indexed and only used with mediaplayers?

@hurda commented on GitHub (Nov 20, 2015): Unfortunately petrkutalek removed the original repo, and thus the commit-history. But I hunted down a fork of the original repo, and I found the reason for this conversion: searchengine-mode. https://github.com/pude/telxcc/commit/d69494053759b93e20979d4b59d74a63b023884e Is that relevant when the subtitle-files aren't being indexed and only used with mediaplayers?
Author
Owner

@cfsmp3 commented on GitHub (Nov 20, 2015):

Probably we should make the escaping optional, in general. But not for
0.78, maybe as a 0.79 goal :-)

On Fri, Nov 20, 2015 at 2:28 AM, hurda notifications@github.com wrote:

Unfortunately petrkutalek removed the original repo, and thus the
commit-history.

But I hunted down a fork of the original repo, and I found the reason for
this conversion: searchengine-mode.
pude/telxcc@d694940
d694940537

Is that relevant when the subtitle-files aren't being indexed and only
used with mediaplayers?


Reply to this email directly or view it on GitHub
https://github.com/CCExtractor/ccextractor/issues/248#issuecomment-158350669
.

@cfsmp3 commented on GitHub (Nov 20, 2015): Probably we should make the escaping optional, in general. But not for 0.78, maybe as a 0.79 goal :-) On Fri, Nov 20, 2015 at 2:28 AM, hurda notifications@github.com wrote: > Unfortunately petrkutalek removed the original repo, and thus the > commit-history. > > But I hunted down a fork of the original repo, and I found the reason for > this conversion: searchengine-mode. > pude/telxcc@d694940 > https://github.com/pude/telxcc/commit/d69494053759b93e20979d4b59d74a63b023884e > > Is that relevant when the subtitle-files aren't being indexed and only > used with mediaplayers? > > — > Reply to this email directly or view it on GitHub > https://github.com/CCExtractor/ccextractor/issues/248#issuecomment-158350669 > .
Author
Owner

@anshul1912 commented on GitHub (Jan 8, 2016):

Added --nohtmlescape option which dont allow to convert > to &gt and other characters like &lt &amp

@anshul1912 commented on GitHub (Jan 8, 2016): Added --nohtmlescape option which dont allow to convert > to &gt and other characters like &lt &amp
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: starred/ccextractor#86