2016-08-10 03:39:01 -07:00
|
|
|
|
|
|
|
|
Overview
|
|
|
|
|
========
|
2016-08-10 09:33:08 -07:00
|
|
|
Subtitles which are burned into the video (or hard subbed) can be extracted using the -hardsubx flag.
|
|
|
|
|
The system works by processing video frames and extracting only the subtitles from them, followed
|
|
|
|
|
by an OCR recognition using Tesseract.
|
2016-08-10 03:39:01 -07:00
|
|
|
|
|
|
|
|
Dependencies
|
|
|
|
|
============
|
|
|
|
|
Tesseract (OCR library by Google)
|
|
|
|
|
Leptonica (C Image processing library)
|
|
|
|
|
FFMpeg (Video Processing Library)
|
|
|
|
|
|
|
|
|
|
Compilation
|
|
|
|
|
===========
|
|
|
|
|
|
|
|
|
|
Linux
|
|
|
|
|
-----
|
|
|
|
|
|
|
|
|
|
Make sure Tesseract, Leptonica and FFMPeg are installed, and that their libraries can be found using pkg-config.
|
|
|
|
|
Refer to OCR.txt for installation details.
|
|
|
|
|
|
2021-06-08 17:57:22 +00:00
|
|
|
FFmpeg from packages (on Debian) plus a couple of other dependencies you will need:
|
|
|
|
|
sudo apt-get install libavcodec-dev libavformat-dev libavutil-dev libswscale-dev libxcb-shm0-dev liblzma-dev
|
|
|
|
|
|
|
|
|
|
FFmpeg from source:
|
2016-08-10 09:33:08 -07:00
|
|
|
To install FFmpeg (libav), follow the steps at:-
|
|
|
|
|
https://trac.ffmpeg.org/wiki/CompilationGuide/Ubuntu - For Ubuntu, Debian and Linux Mint
|
|
|
|
|
https://trac.ffmpeg.org/wiki/CompilationGuide/Generic - For generic Linux compilation
|
|
|
|
|
|
|
|
|
|
To validate your FFMpeg installation, make sure you can run the following commands on your terminal:-
|
|
|
|
|
pkg-config --cflags libavcodec
|
|
|
|
|
pkg-config --cflags libavformat
|
|
|
|
|
pkg-config --cflags libavutil
|
|
|
|
|
pkg-config --cflags libswscale
|
|
|
|
|
pkg-config --libs libavcodec
|
|
|
|
|
pkg-config --libs libavformat
|
|
|
|
|
pkg-config --libs libavutil
|
|
|
|
|
pkg-config --libs libswscale
|
|
|
|
|
|
|
|
|
|
On success, you should see the correct include directory path and the linker flags.
|
|
|
|
|
|
2018-05-15 12:39:23 +05:30
|
|
|
To build the program with hardsubx support,
|
|
|
|
|
|
|
|
|
|
== from the Linux directory run:-
|
|
|
|
|
./configure --enable-hardsubx
|
|
|
|
|
make ENABLE_HARDSUBX=yes
|
|
|
|
|
|
|
|
|
|
== using cmake from root directory
|
|
|
|
|
mkdir build
|
|
|
|
|
cd build
|
|
|
|
|
cmake -DWITH_OCR=on -DWITH_HARDSUBX=on ../src/
|
|
|
|
|
make
|
2016-08-10 03:39:01 -07:00
|
|
|
|
2016-08-25 15:45:33 -07:00
|
|
|
NOTE: The build has been tested with FFMpeg version 3.1.0, and Tesseract 3.04.
|
|
|
|
|
|
2025-12-20 18:41:37 +01:00
|
|
|
macOS
|
|
|
|
|
-----
|
|
|
|
|
|
|
|
|
|
Install the required dependencies using Homebrew:
|
|
|
|
|
brew install tesseract leptonica ffmpeg
|
|
|
|
|
|
|
|
|
|
To build the program with hardsubx support, use one of these methods:
|
|
|
|
|
|
|
|
|
|
== Using build.command (Recommended):
|
|
|
|
|
cd ccextractor/mac
|
|
|
|
|
./build.command -hardsubx
|
|
|
|
|
|
|
|
|
|
== Using autoconf:
|
|
|
|
|
cd ccextractor/mac
|
|
|
|
|
./autogen.sh
|
|
|
|
|
./configure --enable-hardsubx --enable-ocr
|
|
|
|
|
make
|
|
|
|
|
|
|
|
|
|
== Using cmake:
|
|
|
|
|
cd ccextractor
|
|
|
|
|
mkdir build && cd build
|
|
|
|
|
cmake -DWITH_OCR=ON -DWITH_HARDSUBX=ON ../src/
|
|
|
|
|
make
|
|
|
|
|
|
|
|
|
|
NOTE: The -hardsubx parameter uses a single dash (not --hardsubx).
|
|
|
|
|
|
2016-08-10 03:39:01 -07:00
|
|
|
Windows
|
|
|
|
|
-------
|
|
|
|
|
|
2016-12-02 13:46:57 +05:30
|
|
|
Coming Soon
|