mirror of
https://github.com/CCExtractor/ccextractor.git
synced 2026-02-16 05:25:09 +00:00
Extraction from bin file does not honor -unixts and -UCLA #259
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @Liontooth on GitHub (Jan 25, 2017).
Create a bin file from a DVB transport stream:
ccextractor -ts -pn $PN -out=bin -o $FIL.bin $DIR/$FIL.$EXTExtracting the text from this bin file:
ccextractor -in=bin -pn 53007 -tpage 891 -datets -ttxt -UCLA -noru -utf8 -parsepat -parsepmt -unixts 1485198721 -o 2017-01-23_1912_FR_TV5_Géopolitis.ccx.out 2017-01-23_1912_FR_TV5_Géopolitis.binresults in wrong timestamps, a messed up third field, and an extra |:
19700101000109.360|19700101000112.520|CC?||Bonjour, bienvenue dans cette edition de Geopolitis.while extraction from the transport stream produces the correct output:
20170123191310.360|20170123191313.520|891|Bonjour, bienvenue dans cette edition de Geopolitis.Let me know if you need samples; this likely holds for any file.
@cfsmp3 commented on GitHub (Jan 25, 2017):
Should be easy to fix.
GSoC qualification: Solving this issue gives 2 points.
@barun511 commented on GitHub (Jan 28, 2017):
Could I have samples please? I'll give this a shot.
@cfsmp3 commented on GitHub (Jan 28, 2017):
You can probably use any of the teletext ones from here:
http://ccextractor.org/doku.php?id=public:general:tvsamples
On Fri, Jan 27, 2017 at 6:55 PM, Barun Parruck notifications@github.com
wrote:
@barun511 commented on GitHub (Jan 30, 2017):
I can't seem to reproduce this. Is there a particular sample that you noticed this on?
@Liontooth commented on GitHub (Jan 30, 2017):
http://vrnewsscape.ucla.edu/dropbox/2017-01-23_1912_FR_TV5_G%c3%a9opolitis.bin
@cfsmp3 commented on GitHub (Jan 31, 2017):
Confirmed. I'll let GSoC applicants give it a go though since it's not too hard.
@saurabhshri commented on GitHub (Feb 21, 2017):
Also,in this case (teletext) when extracting from bin it says
No captions were found in input.and yield return code 10 even when they are extracted properly.@cfsmp3 commented on GitHub (Feb 22, 2017):
Please send fix for that :-)
On Tue, Feb 21, 2017 at 10:36 AM, Saurabh Shrivastava <
notifications@github.com> wrote:
@alexandrumc commented on GitHub (Feb 23, 2017):
@Liontooth, can you post here the DVB transport stream?
@saurabhshri commented on GitHub (Feb 26, 2017):
@cfsmp3 @Liontooth While fixing, I am facing timing issues - I mean this :
From TS :
From .bin
But then I found out that while using .bin few lines are missing too (See https://github.com/CCExtractor/ccextractor/issues/699 ).
Since timings are correct when extracted without
-unixts, it must be something wrong at my part. I am trying to fix it. :)@saurabhshri commented on GitHub (Feb 26, 2017):
I was unnecessarily calculating deltas and all which had mistake somewhere. The solution was staring right in the face :P Timing is correct now (in the PR #700 ).