mirror of
https://github.com/CCExtractor/ccextractor.git
synced 2026-02-03 21:23:48 +00:00
Here's a quandary: 608 captions in Korean #171
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @SWalkerTTU on GitHub (Aug 14, 2016).
Some time back I downloaded a TS of the Korean music show "Music Bank" on KBS2 (a clip from "인기가요" of Crayon Pop performing "Dancing Queen") and at the time of recording (late 2012) KBS2 was using 608 captions, as ccextractor 0.81 could pull that data out into BIN format. However, 0.81 could not convert it to SRT. hexdump -C shows readable ASCII text (the English words from the song) so sensible captions are able to be extracted.
Here's the extracted BIN, zipped:
DancingQueen.bin.zip
I've consulted the binary file format documentation, so I now have some idea of what's going on, but perhaps mentally parsing the binary file is not the most efficient way to approach this. The documentation also mentions the cc_type and cc_valid fields in the first byte of three-byte caption groups, but doesn't say much past that.