[QUESTION] warning with text extraction #658

Open
opened 2026-01-29 16:50:19 +00:00 by claunia · 0 comments
Owner

Originally created by @delevinson on GitHub (Aug 23, 2021).

CCExtractor version: CCExtractor 0.92

Necessary information

  • Is this a regression (i.e. did it work before)? NO
  • What platform did you use? Mac
  • What were the used arguments? ccextractor Symposium_\ Many\ Forms\ of\ Gender\ Discrimination\ I_default_c75bcc31-2.mp4 -txt

Video links

https://mxlakeforest-my.sharepoint.com/:v:/g/personal/levinson_mx_lakeforest_edu/ET1bqw1EEa9PttN9FASvr_QBZWZrb7A35Xtt34XilRjxGg?e=pNnbBK

Additional information

When I try to extract subtitles as text, I get this error:
WARNING: Could not encode in specified format

and the output file contains 0 bytes. Here are the messages I get:

levinson@DL209-Levinson Downloads % ccextractor Symposium_\ Many\ Forms\ of\ Gender\ Discrimination\ I_default_c75bcc31-2.mp4 -txt       
CCExtractor 0.92, Carlos Fernandez Sanz, Volker Quetschke.
Teletext portions taken from Petr Kutalek's telxcc
--------------------------------------------------------------------------
Input: Symposium_ Many Forms of Gender Discrimination I_default_c75bcc31-2.mp4
[Extract: 1] [Stream mode: Autodetect]
[Program : Auto ] [Hauppage mode: No] [Use MythTV code: Auto]
[Timing mode: Auto] [Debug: No] [Buffer input: No]
[Use pic_order_cnt_lsb for H.264: No] [Print CC decoder traces: No]
[Target format: .txt] [Encoding: UTF-8] [Delay: 0] [Trim lines: No]
[Add font color data: Yes] [Add font typesetting: Yes]
[Convert case: No][Filter profanity: No] [Video-edit join: No]
[Extraction start time: not set (from start)]
[Extraction end time: not set (to end)]
[Live stream: No] [Clock frequency: 90000]
[Teletext page: Autodetect]
[Start credits text: None]
[Quantisation-mode: CCExtractor's internal function]

-----------------------------------------------------------------
Opening file: Symposium_ Many Forms of Gender Discrimination I_default_c75bcc31-2.mp4
Detected MP4 box with name: ftyp
Detected MP4 box with name: mdat
File seems to be a MP4
Analyzing data with GPAC (MP4 library)
Opening 'Symposium_ Many Forms of Gender Discrimination I_default_c75bcc31-2.mp4': ok
Track 1, type=soun subtype=MPEG
Track 2, type=vide subtype=avc1
Track 3, type=sbtl subtype=tx3g
MP4: found 3 tracks: 1 avc and 1 cc
Processing track 1, type=soun subtype=MPEG
Processing track 2, type=vide subtype=avc1
Processing track 3, type=sbtl subtype=tx3g
WARNING: Could not encode in specified format
... (same warning over and over)
WARNING: Could not encode in specified format
100%  |  59:18
Closing media: ok
Found 1 AVC track(s). Found 1 CC track(s).


Total frames time:	  00:00:00:000  (0 frames at 29.97fps)

Min PTS:				00:00:00:000
Max PTS:				00:59:18:920
Length:				 00:59:18:920
Done, processing time = 0 seconds
Issues? Open a ticket here
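
For anyone reproducing this, the "Detected MP4 box with name" lines in the log correspond to the top-level boxes of the MP4 container. The following is a minimal sketch of how such boxes can be listed, assuming the standard ISO base media layout (a 4-byte big-endian size followed by a 4-byte type). It is not CCExtractor's own code; the function name `iter_top_level_boxes` and the sample bytes are made up for illustration.

```python
import io
import struct
from typing import BinaryIO, Iterator, Tuple


def iter_top_level_boxes(stream: BinaryIO) -> Iterator[Tuple[str, int]]:
    """Yield (box_type, box_size) for each top-level box in an MP4 stream.

    Hypothetical helper for illustration only; not part of CCExtractor.
    """
    while True:
        header = stream.read(8)
        if len(header) < 8:
            return  # end of data (or a truncated header)
        size, box_type = struct.unpack(">I4s", header)
        header_len = 8
        if size == 1:
            # A size of 1 means a 64-bit "largesize" follows the type field.
            ext = stream.read(8)
            if len(ext) < 8:
                return
            (size,) = struct.unpack(">Q", ext)
            header_len = 16
        elif size == 0:
            # A size of 0 means the box runs to the end of the file.
            yield box_type.decode("latin-1"), 0
            return
        if size < header_len:
            return  # malformed box, stop rather than loop forever
        yield box_type.decode("latin-1"), size
        # Skip the payload to land on the next box header.
        stream.seek(size - header_len, 1)


# Synthetic two-box sample (not taken from the file in this report).
sample = struct.pack(">I4s8s", 16, b"ftyp", b"isom\x00\x00\x02\x00")
sample += struct.pack(">I4s", 8, b"mdat")
print(list(iter_top_level_boxes(io.BytesIO(sample))))
# prints [('ftyp', 16), ('mdat', 8)]
```

To run it against a real file, pass `open(path, "rb")` instead of the in-memory sample. This only walks the top level; finding the `tx3g` sample entry reported for track 3 would require descending into the `moov` box, which this sketch does not do.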

Reference: starred/ccextractor#658