[BUG] "Unicode" output encoding mixes newlines #328

Closed
opened 2026-01-29 16:41:05 +00:00 by claunia · 7 comments
Owner

Originally created by @aaaxx on GitHub (Aug 14, 2017).

CCExtractor version: 0.85
Platform: Windows

Using the combination of Unicode encoding and "DOS/Windows (CRLF)" line terminators creates files that use both CR and CRLF newlines. (The "UNIX (LF)" option works fine, though.)

screenshot of an affected file opened in a text editor, with whitespace characters visible

Originally created by @aaaxx on GitHub (Aug 14, 2017). CCExtractor version: **0.85** Platform: Windows Using the combination of Unicode encoding and "DOS/Windows (CRLF)" line terminators creates files that use both CR and CRLF newlines. (The "UNIX (LF)" option works fine, though.) ![screenshot of an affected file opened in a text editor, with whitespace characters visible](https://user-images.githubusercontent.com/6186480/29266377-d5d12a94-80e4-11e7-9ca7-99ae5f9c21f4.png)
Author
Owner

@saurabhshri commented on GitHub (Oct 2, 2017):

@aaaxx What exact command did you use? Also, did the video by any change had bitmap subtitles (e.g. was it a DVB sample?).

I have made a PR which should hopefully fix this (as the context based newlines were missing).

@saurabhshri commented on GitHub (Oct 2, 2017): @aaaxx What exact command did you use? Also, did the video by any change had bitmap subtitles (e.g. was it a DVB sample?). I have made a PR which should hopefully fix this (as the context based newlines were missing).
Author
Owner

@aaaxx commented on GitHub (Oct 3, 2017):

ccextractorwin.exe  --gui_mode_reports  -autoprogram  -out=srt -bom -unicode [+input files]

The source is a retail DVD (multiple VOB files).

Note that it happens only with -unicode, not with -utf8 and -latin1.

@aaaxx commented on GitHub (Oct 3, 2017): ``` ccextractorwin.exe --gui_mode_reports -autoprogram -out=srt -bom -unicode [+input files] ``` The source is a retail DVD (multiple VOB files). Note that it happens only with `-unicode`, not with `-utf8` and `-latin1`.
Author
Owner

@aaaxx commented on GitHub (Oct 3, 2017):

And yes, it also had bitmap subtitles.

@aaaxx commented on GitHub (Oct 3, 2017): And yes, it also had bitmap subtitles.
Author
Owner

@saurabhshri commented on GitHub (Oct 3, 2017):

@aaaxx Please try again with the latest code from the repo, and if you still see the mixed line endings, feel free to re-open the issue.

@saurabhshri commented on GitHub (Oct 3, 2017): @aaaxx Please try again with the latest code from the repo, and if you still see the mixed line endings, feel free to re-open the issue.
Author
Owner

@aaaxx commented on GitHub (Oct 3, 2017):

Sorry, I'm not geeky enough to know how to compile things, but if you can be bothered to make an exe, I'll test it.

@aaaxx commented on GitHub (Oct 3, 2017): Sorry, I'm not geeky enough to know how to compile things, but if you can be bothered to make an exe, I'll test it.
Author
Owner

@hax0kartik commented on GitHub (Nov 29, 2017):

task_2_3
This issue was fixed with pr #783. The image on the right is produced by ccextractor built from the latest commit while the image on the left is produced from ccextractor v0.80

@hax0kartik commented on GitHub (Nov 29, 2017): ![task_2_3](https://user-images.githubusercontent.com/16360444/33383558-aa329c88-d549-11e7-92fc-982dbbb8447e.png) This issue was fixed with pr #783. The image on the right is produced by ccextractor built from the latest commit while the image on the left is produced from ccextractor v0.80
Author
Owner

@saurabhshri commented on GitHub (Nov 29, 2017):

@Pirater12 Thank you for validating the fix! :)

@saurabhshri commented on GitHub (Nov 29, 2017): @Pirater12 Thank you for validating the fix! :)
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: starred/ccextractor#328