CEA-708 Korean #68

Closed
opened 2026-01-29 16:34:18 +00:00 by claunia · 11 comments
Owner

Originally created by @cfsmp3 on GitHub (Aug 4, 2015).

Originally assigned to: @bigharshrag on GitHub.

We have a couple of Korean samples with CEA-708 captions. They are in the Korean708D directory in videos-repository.

Some info here:

http://www.tta.or.kr/English/new/standardization/eng_ttastddesc.jsp?stdno=TTAK.KO-08.0022/R1

From the link above you can download a PDF file that describes the CEA-708-D specification in Korean and partially in English.

Assigned to Oleg since he's in charge of completing 708 during GSoC.


@unicode45 commented on GitHub (Aug 4, 2015):

I've uploaded several TS files with embedded Korean CC. Please check the URL below.

https://drive.google.com/open?id=0B8AjqtxcLQDxaTR6STVVUUFHUlk


@unicode45 commented on GitHub (Aug 5, 2015):

I just added MP4 files with a '_CC' suffix. You can use them to check how CEA-708-D works with Korean CC.


@HaneolLee commented on GitHub (Nov 18, 2015):

Hello, this is HaneolLee from Korea.

I tested CCExtractor with MBC broadcasts in Korea.

Test environment:

  1. Program: Brazil Capture Station
    http://www.redhenlab.org/home/the-cognitive-core-research-topics-in-red-hen/the-barnyard/brazilian-capture-station
  2. Setup:
    http://ccextractor.canihavesome.coffee/reports/119/Report__DTVCC_2015-10-12_220904.html

Test results for the attached TS files:

  • mbc_1.ts - OK (mbc_1.p1.svc01.txt)
  • mbc_2.ts - nothing extracted.
    In mbc_2.ts, VANC No. 7 ~ 14 is used.

I would like to know how to extract the captions from the mbc_2.ts file.

Best wishes,

Please check the URLs below.
(mbc_1.p1.svc01.txt)
https://drive.google.com/file/d/0BxFzM3fSXVOiWHpNT0dJR1RDU1U/view?usp=sharing

(mbc_1.ts)
https://drive.google.com/file/d/0BxFzM3fSXVOiV2FfelJGaTZkOFU/view?usp=sharing

(mbc_1.txt)
https://drive.google.com/file/d/0BxFzM3fSXVOiQk9CYnlYSi00SjA/view?usp=sharing

(mbc_2.ts)
https://drive.google.com/file/d/0BxFzM3fSXVOiRGRZSlcyUkxzUkE/view?usp=sharing

(mbc_2.txt)
https://drive.google.com/file/d/0BxFzM3fSXVOiRU44WU4xTE5IQ1E/view?usp=sharing


@anshul1912 commented on GitHub (Jan 7, 2016):

The file does not exist any more.


@HaneolLee commented on GitHub (Jan 8, 2016):

I'm sorry. I have uploaded the files again.

(mbc_1.p1.svc01.txt)
https://drive.google.com/file/d/0BxFzM3fSXVOiWHpNT0dJR1RDU1U/view?usp=sharing

(mbc_1.ts)
https://drive.google.com/file/d/0BxFzM3fSXVOiV2FfelJGaTZkOFU/view?usp=sharing

(mbc_1.txt)
https://drive.google.com/file/d/0BxFzM3fSXVOiQk9CYnlYSi00SjA/view?usp=sharing

(mbc_2.ts)
https://drive.google.com/file/d/0BxFzM3fSXVOiRGRZSlcyUkxzUkE/view?usp=sharing

(mbc_2.txt)
https://drive.google.com/file/d/0BxFzM3fSXVOiRU44WU4xTE5IQ1E/view?usp=sharing


@bigharshrag commented on GitHub (May 20, 2016):

Please check the files as they do not exist anymore.
Can you please also elaborate on what problem you are facing during extraction?


@cfsmp3 commented on GitHub (Nov 7, 2016):

@HaneolLee Can you reply to @bigharshrag's comment? We have a window to look into this now, but if we don't get some info (and files) in the next couple of weeks we'll close the ticket.


@unicode45 commented on GitHub (Apr 11, 2017):

It seems the Korean character encoding is wrong in the latest version, 0.85b.
Korean CEA-708 captions are encoded in EUC-KR, so the original EUC-KR text has to be converted to the target encoding (except Latin1) before saving.
There are no problems opening EUC-KR subtitles extracted by versions 0.78 and 0.79, but something is wrong in version 0.85b.
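
To illustrate the conversion described above, here is a minimal Python sketch. It is not CCExtractor's actual code: the function name `convert_caption_bytes` and the sample string are hypothetical, and it only assumes that the caption payload is plain EUC-KR as stated in this comment. It decodes the raw bytes as EUC-KR and re-encodes them in the target encoding; Latin1 is rejected because it cannot represent Hangul.

```python
def convert_caption_bytes(raw: bytes, target_encoding: str = "utf-8") -> bytes:
    """Decode EUC-KR caption bytes and re-encode them in target_encoding.

    Hypothetical helper for illustration only. Raises UnicodeDecodeError if
    raw is not valid EUC-KR, and UnicodeEncodeError if the target encoding
    (for example Latin1) cannot represent the Korean characters.
    """
    text = raw.decode("euc_kr")           # original caption bytes -> Unicode text
    return text.encode(target_encoding)   # Unicode text -> requested output encoding


if __name__ == "__main__":
    # Sample EUC-KR payload built from a known string, standing in for
    # bytes pulled out of a CEA-708 service block.
    sample = "한글 자막".encode("euc_kr")
    utf8_bytes = convert_caption_bytes(sample, "utf-8")
    print(utf8_bytes.decode("utf-8"))

    try:
        convert_caption_bytes(sample, "latin-1")
    except UnicodeEncodeError:
        print("Latin1 cannot represent Korean text")
```

Writing the raw EUC-KR bytes into a file that is labelled with a different encoding, without this decode and re-encode step, would produce the kind of broken output reported for 0.85b.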


@cfsmp3 commented on GitHub (Jan 3, 2018):

Closing this ticket because some files are missing - please create a new ticket with samples.


@MatejMecka commented on GitHub (Jan 4, 2018):

Hi there!
I couldn't reproduce the issue.
Can you please tell me what command you used?
Thanks
@unicode45 @bigharshrag @HaneolLee


@unicode45 commented on GitHub (Jan 5, 2018):

@MatejMecka
Hi,
It was solved by #690.
Thanks!


Reference: starred/ccextractor#68