[FEATURE]: Add Snap packaging support with Github workflow

Merge pull request #2040 from THE-Amrit-mahto-05/fix/avc-sei-payload-size
Fix SEI payload type handling: changes payload_type and payload_size from i32 to u32 for type safety, keeping as usize casts only where needed for indexing.
2026-02-04 13:54:47 +00:00 · 2026-01-31 17:52:06 -08:00 · 2026-01-31 17:35:40 -08:00 · 2026-01-31 17:18:31 -08:00 · 2026-01-31 13:58:48 -08:00 · 2026-01-31 00:49:50 +05:30
1272 changed files with 343196 additions and 175263 deletions
--- a/.DS_Store
+++ b/.DS_Store
--- a/.clang-format
+++ b/.clang-format
@@ -0,0 +1,7 @@
+BreakBeforeBraces: Allman
+ColumnLimit: 0
+IndentCaseLabels: true
+IndentWidth: 8
+TabWidth: 8
+UseTab: Always
+SortIncludes: false
--- a/.dockerignore
+++ b/.dockerignore
@@ -0,0 +1,37 @@
+# Build artifacts
+linux/ccextractor
+linux/rust/
+linux/*.o
+linux/*.a
+mac/ccextractor
+mac/rust/
+build/
+build_*/
+
+# Git
+.git/
+.github/
+
+# IDE
+.vscode/
+.idea/
+*.swp
+*.swo
+
+# Docker
+docker/
+
+# Documentation (not needed for build)
+docs/
+*.md
+!README.md
+
+# Test files
+*.ts
+*.mp4
+*.mkv
+*.srt
+*.vtt
+
+# Plans
+plans/
--- a/.github/CONTRIBUTING.md
+++ b/.github/CONTRIBUTING.md
@@ -0,0 +1,36 @@
+# Contributors Guide
+
+Please read and understand the contribution guide before creating an issue or pull request. We would like to thank [Nishad TR](https://github.com/nishad) for their contributor's guide, upon which we based ours.
+
+## Etiquette
+
+This project is open source, and as such, we (the maintainers) give our **free time** to build, maintain and **provide user support** for the CCExtractor program. We make the code freely available in the hope that it will be of use to other developers and users. It would be extremely unfair for us to suffer abuse or anger for our hard work.
+
+Please be considerate towards the developers and other users when raising issues or presenting pull requests.
+
+It's the duty of the maintainer to ensure that all submissions to the project are of sufficient quality to benefit the project. Many developers have different skillsets, strengths, and weaknesses. Respect the decision of the maintainers, and do not be upset or abusive if your submission is not used.
+
+## Viability
+
+When requesting or submitting new features, first consider whether it might be useful to others. Open source projects are used by many developers, who may have entirely different needs to your own. Think about whether or not your feature is likely to be used by other users of the project.
+
+## Procedure
+
+**Before filing an issue**:
+
+- Attempt to replicate the problem, to ensure that it wasn't a coincidental incident.
+- Check to make sure your feature suggestion isn't already present within the project.
+- Check the pull requests tab to ensure that the bug doesn't have a fix in progress.
+- Check the pull requests tab to ensure that the feature isn't already in progress.
+
+**Before submitting a pull request**:
+
+- Ensure that your submission is [viable](#viability) for the project.
+- Check the codebase to ensure that your feature doesn't already exist.
+- Check the pull requests to ensure that another person hasn't already submitted the feature or fix.
+
+## Technical requirements
+
+- Before Submitting your Pull Request, merge `master` with your new branch and fix any conflicts. (Make sure you don't break anything in development!)
+- Commit Unix line endings.
+- Make sure to reasonably test your code. We have a sample platform that runs a test-suite for you, but it only covers a general set of tests.
--- a/.github/ISSUE_TEMPLATE.md
+++ b/.github/ISSUE_TEMPLATE.md
@@ -0,0 +1,47 @@
+Please prefix your issue with one of the following: [BUG], [PROPOSAL], [QUESTION].
+
+To get the version of CCExtractor, you can use `--version`.
+
+If this issue is related to the flutter GUI, please make the issue on the GUI repo [here](https://github.com/CCExtractor/ccextractorfluttergui/issues/new)
+
+Please check all that apply and **remove the ones that do not**.
+
+In the necessary information section, if this is a regression (something that used to work does not work anymore), make sure to specify the last known working version.
+
+Only specify the minimum number of arguments needed to reproduce the issue.
+
+In the additional information section, describe your problem.
+
+Please make the affected input file available for us (no screenshots, those don't help!). Public links to Dropbox, Google Drive, etc, are all fine. If it is not possible to make it available publicly, send us a private invitation (both Dropbox and Google Drive allow that). In this case we will download the file and upload it to the private developer repository. Methods to send the private invitation to us can be found [here](https://ccextractor.org/public:general:support#email).
+
+Do **not** upload your file to any location that will require us to sign up or endure a wait list, slow downloads, etc. If your upload expires make sure you keep it active somehow (replace links if needed). Keep in mind that while we go over all tickets some may take a few days, and it's important we have the file available when we actually need it.
+
+Make sure to enable notifications in GitHub so you get notifications about your ticket. We may need to ask questions and we do everything inside GitHub's system.
+
+Once you have read all of the instructions **delete all the text from here to the top**.
+
+CCExtractor version: {replace with the version}
+
+# In raising this issue, I confirm the following:
+
+- [ ] I have read and understood the [contributors guide](https://github.com/CCExtractor/ccextractor/blob/master/.github/CONTRIBUTING.md).
+- [ ] I have checked that the bug-fix I am reporting can be replicated, or that the feature I am suggesting isn't already present.
+- [ ] I have checked that the issue I'm posting isn't already reported.
+- [ ] I have checked that the issue I'm porting isn't already solved and no duplicates exist in [closed issues](https://github.com/CCExtractor/ccextractor/issues?q=is%3Aissue+is%3Aclosed) and in [opened issues](https://github.com/CCExtractor/ccextractor/issues)
+- [ ] I have checked the pull requests tab for existing solutions/implementations to my issue/suggestion.
+- [ ] I have used the latest available version of CCExtractor to verify this issue exists.
+- [ ] I have ticked all the boxes in this section and to prove it I'm deleting the section completely to remove boilerplate text.
+
+# Necessary information
+
+- Is this a regression (i.e. did it work before)? {YES/NO}
+- What platform did you use? {Window/Linux/Mac}
+- What were the used arguments? `{replace with the arguments}`
+
+# Video links
+
+* {Replace with a link to a video file}
+
+# Additional information
+
+{issue content here, replace this line with your issue content}
--- a/.github/PULL_REQUEST_TEMPLATE.md
+++ b/.github/PULL_REQUEST_TEMPLATE.md
@@ -0,0 +1,21 @@
+<!-- Please prefix your pull request with one of the following: **[FEATURE]** **[FIX]** **[IMPROVEMENT]**. -->
+
+**In raising this pull request, I confirm the following (please check boxes):**
+
+- [ ] I have read and understood the [contributors guide](https://github.com/CCExtractor/ccextractor/blob/master/.github/CONTRIBUTING.md).
+- [ ] I have checked that another pull request for this purpose does not exist.
+- [ ] I have considered, and confirmed that this submission will be valuable to others.
+- [ ] I accept that this submission may not be used, and the pull request closed at the will of the maintainer.
+- [ ] I give this submission freely, and claim no ownership to its content.
+- [ ] **I have mentioned this change in the [changelog](https://github.com/CCExtractor/ccextractor/blob/master/docs/CHANGES.TXT).**
+
+**My familiarity with the project is as follows (check one):**
+
+- [ ] I have never used CCExtractor.
+- [ ] I have used CCExtractor just a couple of times.
+- [ ] I absolutely love CCExtractor, but have not contributed previously.
+- [ ] I am an active contributor to CCExtractor.
+
+---
+
+{pull request content here}
--- a/.github/dependabot.yml
+++ b/.github/dependabot.yml
@@ -0,0 +1,9 @@
+version: 2
+updates:
+- package-ecosystem: github-actions
+  directory: "/"
+  schedule:
+    interval: daily
+    time: "10:00"
+    timezone: America/Los_Angeles
+  open-pull-requests-limit: 10
--- a/.github/workflows/build_appimage.yml
+++ b/.github/workflows/build_appimage.yml
@@ -0,0 +1,157 @@
+name: Build Linux AppImage
+
+on:
+  # Build on releases
+  release:
+    types: [published]
+  # Allow manual trigger
+  workflow_dispatch:
+    inputs:
+      build_type:
+        description: 'Build type (all, minimal, ocr, hardsubx)'
+        required: false
+        default: 'all'
+  # Build on pushes to workflow file for testing
+  push:
+    paths:
+      - '.github/workflows/build_appimage.yml'
+      - 'linux/build_appimage.sh'
+
+jobs:
+  build-appimage:
+    runs-on: ubuntu-22.04
+    strategy:
+      fail-fast: false
+      matrix:
+        build_type: [minimal, ocr, hardsubx]
+
+    steps:
+      - name: Check if should build this variant
+        id: should_build
+        run: |
+          if [ "${{ github.event_name }}" = "workflow_dispatch" ]; then
+            INPUT_TYPE="${{ github.event.inputs.build_type }}"
+            if [ "$INPUT_TYPE" = "all" ] || [ "$INPUT_TYPE" = "${{ matrix.build_type }}" ]; then
+              echo "should_build=true" >> $GITHUB_OUTPUT
+            else
+              echo "should_build=false" >> $GITHUB_OUTPUT
+            fi
+          else
+            echo "should_build=true" >> $GITHUB_OUTPUT
+          fi
+
+      - name: Checkout repository
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: actions/checkout@v6
+
+      - name: Install base dependencies
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          sudo apt-get update
+          sudo apt-get install -y --no-install-recommends \
+            build-essential \
+            cmake \
+            pkg-config \
+            wget \
+            file \
+            libfuse2 \
+            zlib1g-dev \
+            libpng-dev \
+            libjpeg-dev \
+            libfreetype-dev \
+            libxml2-dev \
+            libcurl4-gnutls-dev \
+            libssl-dev \
+            clang \
+            libclang-dev
+
+      - name: Install OCR dependencies
+        if: steps.should_build.outputs.should_build == 'true' && (matrix.build_type == 'ocr' || matrix.build_type == 'hardsubx')
+        run: |
+          sudo apt-get install -y --no-install-recommends \
+            tesseract-ocr \
+            libtesseract-dev \
+            libleptonica-dev \
+            tesseract-ocr-eng
+
+      - name: Install FFmpeg dependencies (HardSubX)
+        if: steps.should_build.outputs.should_build == 'true' && matrix.build_type == 'hardsubx'
+        run: |
+          sudo apt-get install -y --no-install-recommends \
+            libavcodec-dev \
+            libavformat-dev \
+            libavutil-dev \
+            libswscale-dev \
+            libswresample-dev \
+            libavfilter-dev \
+            libavdevice-dev
+
+      - name: Install Rust toolchain
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: dtolnay/rust-toolchain@stable
+
+      - name: Cache GPAC build
+        if: steps.should_build.outputs.should_build == 'true'
+        id: cache-gpac
+        uses: actions/cache@v5
+        with:
+          path: /usr/local/lib/libgpac*
+          key: gpac-v2.4.0-ubuntu22
+
+      - name: Build and install GPAC
+        if: steps.should_build.outputs.should_build == 'true' && steps.cache-gpac.outputs.cache-hit != 'true'
+        run: |
+          git clone -b v2.4.0 --depth 1 https://github.com/gpac/gpac
+          cd gpac
+          ./configure
+          make -j$(nproc) lib
+          sudo make install-lib
+          sudo ldconfig
+
+      - name: Update library cache
+        if: steps.should_build.outputs.should_build == 'true'
+        run: sudo ldconfig
+
+      - name: Build AppImage
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          cd linux
+          chmod +x build_appimage.sh
+          BUILD_TYPE=${{ matrix.build_type }} ./build_appimage.sh
+
+      - name: Get AppImage name
+        if: steps.should_build.outputs.should_build == 'true'
+        id: appimage_name
+        run: |
+          case "${{ matrix.build_type }}" in
+            minimal)
+              echo "name=ccextractor-minimal-x86_64.AppImage" >> $GITHUB_OUTPUT
+              ;;
+            ocr)
+              echo "name=ccextractor-x86_64.AppImage" >> $GITHUB_OUTPUT
+              ;;
+            hardsubx)
+              echo "name=ccextractor-hardsubx-x86_64.AppImage" >> $GITHUB_OUTPUT
+              ;;
+          esac
+
+      - name: Test AppImage
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          chmod +x linux/${{ steps.appimage_name.outputs.name }}
+          linux/${{ steps.appimage_name.outputs.name }} --version
+
+      - name: Upload AppImage artifact
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: actions/upload-artifact@v6
+        with:
+          name: ${{ steps.appimage_name.outputs.name }}
+          path: linux/${{ steps.appimage_name.outputs.name }}
+
+      - name: Upload to Release
+        if: steps.should_build.outputs.should_build == 'true' && github.event_name == 'release'
+        uses: softprops/action-gh-release@v2
+        with:
+          files: linux/${{ steps.appimage_name.outputs.name }}
+        env:
+          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
--- a/.github/workflows/build_deb.yml
+++ b/.github/workflows/build_deb.yml
@@ -0,0 +1,283 @@
+name: Build Linux .deb Package
+
+on:
+  # Build on releases
+  release:
+    types: [published]
+  # Allow manual trigger
+  workflow_dispatch:
+    inputs:
+      build_type:
+        description: 'Build type (all, basic, hardsubx)'
+        required: false
+        default: 'all'
+  # Build on pushes to workflow file for testing
+  push:
+    paths:
+      - '.github/workflows/build_deb.yml'
+
+jobs:
+  build-deb:
+    runs-on: ubuntu-24.04
+    strategy:
+      fail-fast: false
+      matrix:
+        build_type: [basic, hardsubx]
+
+    steps:
+      - name: Check if should build this variant
+        id: should_build
+        run: |
+          if [ "${{ github.event_name }}" = "workflow_dispatch" ]; then
+            INPUT_TYPE="${{ github.event.inputs.build_type }}"
+            if [ "$INPUT_TYPE" = "all" ] || [ "$INPUT_TYPE" = "${{ matrix.build_type }}" ]; then
+              echo "should_build=true" >> $GITHUB_OUTPUT
+            else
+              echo "should_build=false" >> $GITHUB_OUTPUT
+            fi
+          else
+            echo "should_build=true" >> $GITHUB_OUTPUT
+          fi
+
+      - name: Checkout repository
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: actions/checkout@v6
+
+      - name: Get version
+        if: steps.should_build.outputs.should_build == 'true'
+        id: version
+        run: |
+          # Extract version from source or use tag
+          if [ "${{ github.event_name }}" = "release" ]; then
+            VERSION="${{ github.event.release.tag_name }}"
+            VERSION="${VERSION#v}"  # Remove 'v' prefix if present
+          else
+            # Extract version from lib_ccx.h (e.g., #define VERSION "0.96.5")
+            VERSION=$(grep -oP '#define VERSION "\K[^"]+' src/lib_ccx/lib_ccx.h || echo "0.96")
+          fi
+          echo "version=$VERSION" >> $GITHUB_OUTPUT
+          echo "Building version: $VERSION"
+
+      - name: Install base dependencies
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          sudo apt-get update
+          sudo apt-get install -y --no-install-recommends \
+            build-essential \
+            cmake \
+            pkg-config \
+            zlib1g-dev \
+            libpng-dev \
+            libjpeg-dev \
+            libfreetype-dev \
+            libxml2-dev \
+            libcurl4-gnutls-dev \
+            libssl-dev \
+            clang \
+            libclang-dev \
+            tesseract-ocr \
+            libtesseract-dev \
+            libleptonica-dev \
+            patchelf
+
+      - name: Install FFmpeg dependencies (HardSubX)
+        if: steps.should_build.outputs.should_build == 'true' && matrix.build_type == 'hardsubx'
+        run: |
+          sudo apt-get install -y --no-install-recommends \
+            libavcodec-dev \
+            libavformat-dev \
+            libavutil-dev \
+            libswscale-dev \
+            libswresample-dev \
+            libavfilter-dev \
+            libavdevice-dev
+
+      - name: Install Rust toolchain
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: dtolnay/rust-toolchain@stable
+
+      - name: Cache GPAC build
+        if: steps.should_build.outputs.should_build == 'true'
+        id: cache-gpac
+        uses: actions/cache@v5
+        with:
+          path: ~/gpac-install
+          key: gpac-abi-16.4-ubuntu24-deb
+
+      - name: Build GPAC
+        if: steps.should_build.outputs.should_build == 'true' && steps.cache-gpac.outputs.cache-hit != 'true'
+        run: |
+          git clone -b abi-16.4 --depth 1 https://github.com/gpac/gpac
+          cd gpac
+          ./configure --prefix=/usr
+          make -j$(nproc)
+          make DESTDIR=$HOME/gpac-install install-lib
+
+      - name: Install GPAC to system
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          sudo cp -r $HOME/gpac-install/usr/lib/* /usr/lib/
+          sudo cp -r $HOME/gpac-install/usr/include/* /usr/include/
+          sudo ldconfig
+
+      - name: Build CCExtractor
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          mkdir build && cd build
+          if [ "${{ matrix.build_type }}" = "hardsubx" ]; then
+            cmake ../src -DCMAKE_BUILD_TYPE=Release -DWITH_OCR=ON -DWITH_HARDSUBX=ON
+          else
+            cmake ../src -DCMAKE_BUILD_TYPE=Release -DWITH_OCR=ON
+          fi
+          make -j$(nproc)
+
+      - name: Test build
+        if: steps.should_build.outputs.should_build == 'true'
+        run: ./build/ccextractor --version
+
+      - name: Create .deb package structure
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          VERSION="${{ steps.version.outputs.version }}"
+          VARIANT="${{ matrix.build_type }}"
+
+          if [ "$VARIANT" = "basic" ]; then
+            PKG_NAME="ccextractor_${VERSION}_amd64"
+          else
+            PKG_NAME="ccextractor-${VARIANT}_${VERSION}_amd64"
+          fi
+
+          mkdir -p ${PKG_NAME}/DEBIAN
+          mkdir -p ${PKG_NAME}/usr/bin
+          mkdir -p ${PKG_NAME}/usr/lib/ccextractor
+          mkdir -p ${PKG_NAME}/usr/share/doc/ccextractor
+          mkdir -p ${PKG_NAME}/usr/share/man/man1
+
+          # Copy binary
+          cp build/ccextractor ${PKG_NAME}/usr/bin/
+
+          # Copy GPAC library
+          cp $HOME/gpac-install/usr/lib/libgpac.so* ${PKG_NAME}/usr/lib/ccextractor/
+
+          # Set rpath so ccextractor finds bundled libgpac
+          patchelf --set-rpath '/usr/lib/ccextractor:$ORIGIN/../lib/ccextractor' ${PKG_NAME}/usr/bin/ccextractor
+
+          # Copy documentation
+          cp docs/CHANGES.TXT ${PKG_NAME}/usr/share/doc/ccextractor/changelog
+          cp LICENSE.txt ${PKG_NAME}/usr/share/doc/ccextractor/copyright
+          gzip -9 -n ${PKG_NAME}/usr/share/doc/ccextractor/changelog
+
+          # Generate man page
+          help2man --no-info --name="closed captions and teletext subtitle extractor" \
+            ./build/ccextractor > ${PKG_NAME}/usr/share/man/man1/ccextractor.1 2>/dev/null || true
+          if [ -f ${PKG_NAME}/usr/share/man/man1/ccextractor.1 ]; then
+            gzip -9 -n ${PKG_NAME}/usr/share/man/man1/ccextractor.1
+          fi
+
+          # Create control file
+          if [ "$VARIANT" = "basic" ]; then
+            PKG_DESCRIPTION="CCExtractor - closed captions and teletext subtitle extractor"
+          else
+            PKG_DESCRIPTION="CCExtractor (with HardSubX) - closed captions and teletext subtitle extractor"
+          fi
+
+          INSTALLED_SIZE=$(du -sk ${PKG_NAME}/usr | cut -f1)
+
+          # Determine dependencies based on build variant (Ubuntu 24.04)
+          if [ "$VARIANT" = "hardsubx" ]; then
+            DEPENDS="libc6, libtesseract5, liblept5, libcurl3t64-gnutls, libavcodec60, libavformat60, libavutil58, libswscale7, libavdevice60, libswresample4, libavfilter9"
+          else
+            DEPENDS="libc6, libtesseract5, liblept5, libcurl3t64-gnutls"
+          fi
+
+          cat > ${PKG_NAME}/DEBIAN/control << CTRL
+          Package: ccextractor
+          Version: ${VERSION}
+          Section: utils
+          Priority: optional
+          Architecture: amd64
+          Installed-Size: ${INSTALLED_SIZE}
+          Depends: ${DEPENDS}
+          Maintainer: CCExtractor Development Team <carlos@ccextractor.org>
+          Homepage: https://www.ccextractor.org
+          Description: ${PKG_DESCRIPTION}
+           CCExtractor is a tool that extracts closed captions and teletext subtitles
+           from video files and streams. It supports a wide variety of input formats
+           including MPEG, H.264/AVC, H.265/HEVC, MP4, MKV, WTV, and transport streams.
+           .
+           This package includes a bundled GPAC library for MP4 support.
+          CTRL
+
+          # Remove leading spaces from control file
+          sed -i 's/^          //' ${PKG_NAME}/DEBIAN/control
+
+          # Create postinst to update library cache
+          cat > ${PKG_NAME}/DEBIAN/postinst << 'POSTINST'
+          #!/bin/sh
+          set -e
+          ldconfig
+          POSTINST
+          chmod 755 ${PKG_NAME}/DEBIAN/postinst
+
+          # Create postrm to update library cache
+          cat > ${PKG_NAME}/DEBIAN/postrm << 'POSTRM'
+          #!/bin/sh
+          set -e
+          ldconfig
+          POSTRM
+          chmod 755 ${PKG_NAME}/DEBIAN/postrm
+
+          # Set permissions
+          chmod 755 ${PKG_NAME}/usr/bin/ccextractor
+          chmod 755 ${PKG_NAME}/usr/lib/ccextractor
+          find ${PKG_NAME}/usr/lib/ccextractor -name "*.so*" -exec chmod 644 {} \;
+
+          # Build the .deb
+          dpkg-deb --build --root-owner-group ${PKG_NAME}
+
+          echo "deb_name=${PKG_NAME}.deb" >> $GITHUB_OUTPUT
+
+      - name: Test .deb package
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          VERSION="${{ steps.version.outputs.version }}"
+          VARIANT="${{ matrix.build_type }}"
+
+          if [ "$VARIANT" = "basic" ]; then
+            PKG_NAME="ccextractor_${VERSION}_amd64"
+          else
+            PKG_NAME="ccextractor-${VARIANT}_${VERSION}_amd64"
+          fi
+
+          # Install and test (apt handles dependencies automatically)
+          sudo apt-get update
+          sudo apt-get install -y ./${PKG_NAME}.deb
+          ccextractor --version
+
+      - name: Get .deb filename
+        if: steps.should_build.outputs.should_build == 'true'
+        id: deb_name
+        run: |
+          VERSION="${{ steps.version.outputs.version }}"
+          VARIANT="${{ matrix.build_type }}"
+
+          if [ "$VARIANT" = "basic" ]; then
+            echo "name=ccextractor_${VERSION}_amd64.deb" >> $GITHUB_OUTPUT
+          else
+            echo "name=ccextractor-${VARIANT}_${VERSION}_amd64.deb" >> $GITHUB_OUTPUT
+          fi
+
+      - name: Upload .deb artifact
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: actions/upload-artifact@v6
+        with:
+          name: ${{ steps.deb_name.outputs.name }}
+          path: ${{ steps.deb_name.outputs.name }}
+
+      - name: Upload to Release
+        if: steps.should_build.outputs.should_build == 'true' && github.event_name == 'release'
+        uses: softprops/action-gh-release@v2
+        with:
+          files: ${{ steps.deb_name.outputs.name }}
+        env:
+          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
--- a/.github/workflows/build_deb_debian13.yml
+++ b/.github/workflows/build_deb_debian13.yml
@@ -0,0 +1,275 @@
+name: Build Debian 13 .deb Package
+
+on:
+  # Build on releases
+  release:
+    types: [published]
+  # Allow manual trigger
+  workflow_dispatch:
+    inputs:
+      build_type:
+        description: 'Build type (all, basic, hardsubx)'
+        required: false
+        default: 'all'
+  # Build on pushes to workflow file for testing
+  push:
+    paths:
+      - '.github/workflows/build_deb_debian13.yml'
+
+jobs:
+  build-deb:
+    runs-on: ubuntu-latest
+    container:
+      image: debian:trixie
+    strategy:
+      fail-fast: false
+      matrix:
+        build_type: [basic, hardsubx]
+
+    steps:
+      - name: Check if should build this variant
+        id: should_build
+        run: |
+          if [ "${{ github.event_name }}" = "workflow_dispatch" ]; then
+            INPUT_TYPE="${{ github.event.inputs.build_type }}"
+            if [ "$INPUT_TYPE" = "all" ] || [ "$INPUT_TYPE" = "${{ matrix.build_type }}" ]; then
+              echo "should_build=true" >> $GITHUB_OUTPUT
+            else
+              echo "should_build=false" >> $GITHUB_OUTPUT
+            fi
+          else
+            echo "should_build=true" >> $GITHUB_OUTPUT
+          fi
+
+      - name: Install git and dependencies for checkout
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          apt-get update
+          apt-get install -y git ca-certificates
+
+      - name: Checkout repository
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: actions/checkout@v6
+
+      - name: Get version
+        if: steps.should_build.outputs.should_build == 'true'
+        id: version
+        run: |
+          # Extract version from source or use tag
+          if [ "${{ github.event_name }}" = "release" ]; then
+            VERSION="${{ github.event.release.tag_name }}"
+            VERSION="${VERSION#v}"  # Remove 'v' prefix if present
+          else
+            # Extract version from lib_ccx.h (e.g., #define VERSION "0.96.5")
+            VERSION=$(grep -oP '#define VERSION "\K[^"]+' src/lib_ccx/lib_ccx.h || echo "0.96")
+          fi
+          echo "version=$VERSION" >> $GITHUB_OUTPUT
+          echo "Building version: $VERSION"
+
+      - name: Install base dependencies
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          apt-get install -y --no-install-recommends \
+            build-essential \
+            cmake \
+            pkg-config \
+            zlib1g-dev \
+            libpng-dev \
+            libjpeg-dev \
+            libfreetype-dev \
+            libxml2-dev \
+            libcurl4-gnutls-dev \
+            libssl-dev \
+            clang \
+            libclang-dev \
+            tesseract-ocr \
+            libtesseract-dev \
+            libleptonica-dev \
+            patchelf \
+            curl
+
+      - name: Install FFmpeg dependencies (HardSubX)
+        if: steps.should_build.outputs.should_build == 'true' && matrix.build_type == 'hardsubx'
+        run: |
+          apt-get install -y --no-install-recommends \
+            libavcodec-dev \
+            libavformat-dev \
+            libavutil-dev \
+            libswscale-dev \
+            libswresample-dev \
+            libavfilter-dev \
+            libavdevice-dev
+
+      - name: Install Rust toolchain
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh -s -- -y
+          echo "$HOME/.cargo/bin" >> $GITHUB_PATH
+
+      - name: Build GPAC
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          git clone -b abi-16.4 --depth 1 https://github.com/gpac/gpac
+          cd gpac
+          ./configure --prefix=/usr
+          make -j$(nproc)
+          make install-lib
+          ldconfig
+
+      - name: Build CCExtractor
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          export PATH="$HOME/.cargo/bin:$PATH"
+          mkdir build && cd build
+          if [ "${{ matrix.build_type }}" = "hardsubx" ]; then
+            cmake ../src -DCMAKE_BUILD_TYPE=Release -DWITH_OCR=ON -DWITH_HARDSUBX=ON
+          else
+            cmake ../src -DCMAKE_BUILD_TYPE=Release -DWITH_OCR=ON
+          fi
+          make -j$(nproc)
+
+      - name: Test build
+        if: steps.should_build.outputs.should_build == 'true'
+        run: ./build/ccextractor --version
+
+      - name: Create .deb package structure
+        if: steps.should_build.outputs.should_build == 'true'
+        id: create_deb
+        run: |
+          VERSION="${{ steps.version.outputs.version }}"
+          VARIANT="${{ matrix.build_type }}"
+
+          if [ "$VARIANT" = "basic" ]; then
+            PKG_NAME="ccextractor_${VERSION}_debian13_amd64"
+          else
+            PKG_NAME="ccextractor-${VARIANT}_${VERSION}_debian13_amd64"
+          fi
+
+          mkdir -p ${PKG_NAME}/DEBIAN
+          mkdir -p ${PKG_NAME}/usr/bin
+          mkdir -p ${PKG_NAME}/usr/lib/ccextractor
+          mkdir -p ${PKG_NAME}/usr/share/doc/ccextractor
+          mkdir -p ${PKG_NAME}/usr/share/man/man1
+
+          # Copy binary
+          cp build/ccextractor ${PKG_NAME}/usr/bin/
+
+          # Copy GPAC library
+          cp /usr/lib/libgpac.so* ${PKG_NAME}/usr/lib/ccextractor/
+
+          # Set rpath so ccextractor finds bundled libgpac
+          patchelf --set-rpath '/usr/lib/ccextractor:$ORIGIN/../lib/ccextractor' ${PKG_NAME}/usr/bin/ccextractor
+
+          # Copy documentation
+          cp docs/CHANGES.TXT ${PKG_NAME}/usr/share/doc/ccextractor/changelog
+          cp LICENSE.txt ${PKG_NAME}/usr/share/doc/ccextractor/copyright
+          gzip -9 -n ${PKG_NAME}/usr/share/doc/ccextractor/changelog
+
+          # Create control file
+          if [ "$VARIANT" = "basic" ]; then
+            PKG_DESCRIPTION="CCExtractor - closed captions and teletext subtitle extractor"
+          else
+            PKG_DESCRIPTION="CCExtractor (with HardSubX) - closed captions and teletext subtitle extractor"
+          fi
+
+          INSTALLED_SIZE=$(du -sk ${PKG_NAME}/usr | cut -f1)
+
+          # Determine dependencies based on build variant (Debian 13 Trixie)
+          if [ "$VARIANT" = "hardsubx" ]; then
+            DEPENDS="libc6, libtesseract5, libleptonica6, libcurl3t64-gnutls, libavcodec61, libavformat61, libavutil59, libswscale8, libavdevice61, libswresample5, libavfilter10"
+          else
+            DEPENDS="libc6, libtesseract5, libleptonica6, libcurl3t64-gnutls"
+          fi
+
+          cat > ${PKG_NAME}/DEBIAN/control << CTRL
+          Package: ccextractor
+          Version: ${VERSION}
+          Section: utils
+          Priority: optional
+          Architecture: amd64
+          Installed-Size: ${INSTALLED_SIZE}
+          Depends: ${DEPENDS}
+          Maintainer: CCExtractor Development Team <carlos@ccextractor.org>
+          Homepage: https://www.ccextractor.org
+          Description: ${PKG_DESCRIPTION}
+           CCExtractor is a tool that extracts closed captions and teletext subtitles
+           from video files and streams. It supports a wide variety of input formats
+           including MPEG, H.264/AVC, H.265/HEVC, MP4, MKV, WTV, and transport streams.
+           .
+           This package includes a bundled GPAC library for MP4 support.
+           Built for Debian 13 (Trixie).
+          CTRL
+
+          # Remove leading spaces from control file
+          sed -i 's/^          //' ${PKG_NAME}/DEBIAN/control
+
+          # Create postinst to update library cache
+          cat > ${PKG_NAME}/DEBIAN/postinst << 'POSTINST'
+          #!/bin/sh
+          set -e
+          ldconfig
+          POSTINST
+          chmod 755 ${PKG_NAME}/DEBIAN/postinst
+
+          # Create postrm to update library cache
+          cat > ${PKG_NAME}/DEBIAN/postrm << 'POSTRM'
+          #!/bin/sh
+          set -e
+          ldconfig
+          POSTRM
+          chmod 755 ${PKG_NAME}/DEBIAN/postrm
+
+          # Set permissions
+          chmod 755 ${PKG_NAME}/usr/bin/ccextractor
+          chmod 755 ${PKG_NAME}/usr/lib/ccextractor
+          find ${PKG_NAME}/usr/lib/ccextractor -name "*.so*" -exec chmod 644 {} \;
+
+          # Build the .deb
+          dpkg-deb --build --root-owner-group ${PKG_NAME}
+
+          echo "deb_name=${PKG_NAME}.deb" >> $GITHUB_OUTPUT
+
+      - name: Test .deb package
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          VERSION="${{ steps.version.outputs.version }}"
+          VARIANT="${{ matrix.build_type }}"
+
+          if [ "$VARIANT" = "basic" ]; then
+            PKG_NAME="ccextractor_${VERSION}_debian13_amd64"
+          else
+            PKG_NAME="ccextractor-${VARIANT}_${VERSION}_debian13_amd64"
+          fi
+
+          # Install and test (apt handles dependencies automatically)
+          apt-get update
+          apt-get install -y ./${PKG_NAME}.deb
+          ccextractor --version
+
+      - name: Get .deb filename
+        if: steps.should_build.outputs.should_build == 'true'
+        id: deb_name
+        run: |
+          VERSION="${{ steps.version.outputs.version }}"
+          VARIANT="${{ matrix.build_type }}"
+
+          if [ "$VARIANT" = "basic" ]; then
+            echo "name=ccextractor_${VERSION}_debian13_amd64.deb" >> $GITHUB_OUTPUT
+          else
+            echo "name=ccextractor-${VARIANT}_${VERSION}_debian13_amd64.deb" >> $GITHUB_OUTPUT
+          fi
+
+      - name: Upload .deb artifact
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: actions/upload-artifact@v6
+        with:
+          name: ${{ steps.deb_name.outputs.name }}
+          path: ${{ steps.deb_name.outputs.name }}
+
+      - name: Upload to Release
+        if: steps.should_build.outputs.should_build == 'true' && github.event_name == 'release'
+        uses: softprops/action-gh-release@v2
+        with:
+          files: ${{ steps.deb_name.outputs.name }}
+        env:
+          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
--- a/.github/workflows/build_docker.yml
+++ b/.github/workflows/build_docker.yml
@@ -0,0 +1,96 @@
+name: Build CCExtractor Docker Images
+
+on:
+  workflow_dispatch:
+  push:
+    paths:
+    - '.github/workflows/build_docker.yml'
+    - 'docker/**'
+    - '**.c'
+    - '**.h'
+    - '**CMakeLists.txt'
+    - '**.cmake'
+    - 'src/rust/**'
+  pull_request:
+    types: [opened, synchronize, reopened]
+    paths:
+    - '.github/workflows/build_docker.yml'
+    - 'docker/**'
+    - '**.c'
+    - '**.h'
+    - '**CMakeLists.txt'
+    - '**.cmake'
+    - 'src/rust/**'
+
+jobs:
+  build_minimal:
+    name: Docker build (minimal)
+    runs-on: ubuntu-latest
+    steps:
+    - uses: actions/checkout@v6
+    - name: Set up Docker Buildx
+      uses: docker/setup-buildx-action@v3
+    - name: Build minimal image
+      uses: docker/build-push-action@v6
+      with:
+        context: .
+        file: docker/Dockerfile
+        build-args: |
+          BUILD_TYPE=minimal
+          USE_LOCAL_SOURCE=1
+        tags: ccextractor:minimal
+        load: true
+        cache-from: type=gha,scope=docker-minimal
+        cache-to: type=gha,mode=max,scope=docker-minimal
+    - name: Test minimal image
+      run: |
+        docker run --rm ccextractor:minimal --version
+        echo "Minimal build successful"
+
+  build_ocr:
+    name: Docker build (ocr)
+    runs-on: ubuntu-latest
+    steps:
+    - uses: actions/checkout@v6
+    - name: Set up Docker Buildx
+      uses: docker/setup-buildx-action@v3
+    - name: Build OCR image
+      uses: docker/build-push-action@v6
+      with:
+        context: .
+        file: docker/Dockerfile
+        build-args: |
+          BUILD_TYPE=ocr
+          USE_LOCAL_SOURCE=1
+        tags: ccextractor:ocr
+        load: true
+        cache-from: type=gha,scope=docker-ocr
+        cache-to: type=gha,mode=max,scope=docker-ocr
+    - name: Test OCR image
+      run: |
+        docker run --rm ccextractor:ocr --version
+        echo "OCR build successful"
+
+  build_hardsubx:
+    name: Docker build (hardsubx)
+    runs-on: ubuntu-latest
+    steps:
+    - uses: actions/checkout@v6
+    - name: Set up Docker Buildx
+      uses: docker/setup-buildx-action@v3
+    - name: Build HardSubX image
+      uses: docker/build-push-action@v6
+      with:
+        context: .
+        file: docker/Dockerfile
+        build-args: |
+          BUILD_TYPE=hardsubx
+          USE_LOCAL_SOURCE=1
+        tags: ccextractor:hardsubx
+        load: true
+        cache-from: type=gha,scope=docker-hardsubx
+        cache-to: type=gha,mode=max,scope=docker-hardsubx
+    - name: Test HardSubX image
+      run: |
+        docker run --rm ccextractor:hardsubx --version
+        echo "HardSubX build successful"
--- a/.github/workflows/build_linux.yml
+++ b/.github/workflows/build_linux.yml
@@ -0,0 +1,117 @@
+name: Build CCExtractor on Linux
+
+on:
+  workflow_dispatch:
+  push:
+    paths:
+    - '.github/workflows/build_linux.yml'
+    - '**.c'
+    - '**.h'
+    - '**CMakeLists.txt'
+    - '**.cmake'
+    - '**Makefile**'
+    - 'linux/**'
+    - 'package_creators/**'
+    - 'src/rust/**'
+  pull_request:
+    types: [opened, synchronize, reopened]
+    paths:
+    - '.github/workflows/build_linux.yml'
+    - '**.c'
+    - '**.h'
+    - '**CMakeLists.txt'
+    - '**.cmake'
+    - '**Makefile**'
+    - 'linux/**'
+    - 'package_creators/**'
+    - 'src/rust/**'
+jobs:
+  build_shell:
+    runs-on: ubuntu-latest
+    steps:
+    - name: Install dependencies
+      run: sudo apt update && sudo apt-get install libgpac-dev libtesseract-dev libavcodec-dev libavdevice-dev libx11-dev libxcb1-dev libxcb-shm0-dev
+    - uses: actions/checkout@v6
+    - name: build
+      run: ./build -hardsubx
+      working-directory: ./linux
+    - name: Display version information
+      run: ./ccextractor --version
+      working-directory: ./linux
+    - name: Prepare artifacts
+      run: mkdir ./linux/artifacts
+    - name: Copy release artifact
+      run: cp ./linux/ccextractor ./linux/artifacts/
+    - uses: actions/upload-artifact@v6
+      with:
+        name: CCExtractor Linux build
+        path: ./linux/artifacts
+  build_autoconf:
+    runs-on: ubuntu-latest
+    steps:
+    - name: Install dependencies
+      run: sudo apt update && sudo apt-get install libgpac-dev
+    - uses: actions/checkout@v6
+    - name: run autogen
+      run: ./autogen.sh
+      working-directory: ./linux
+    - name: configure
+      run: ./configure --enable-debug
+      working-directory: ./linux
+    - name: make
+      run: make
+      working-directory: ./linux
+    - name: Display version information
+      run: ./ccextractor --version
+      working-directory: ./linux
+  cmake:
+    runs-on: ubuntu-latest
+    steps:
+    - name: Install dependencies
+      run: sudo apt update && sudo apt-get install libgpac-dev
+    - uses: actions/checkout@v6
+    - name: cmake
+      run: mkdir build && cd build && cmake ../src
+    - name: build
+      run: make -j$(nproc)
+      working-directory: build
+    - name: Display version information
+      run: ./build/ccextractor --version
+  cmake_ocr_hardsubx:
+    runs-on: ubuntu-latest
+    steps:
+    - uses: actions/checkout@v6
+    - name: Install dependencies
+      run: sudo apt update && sudo apt install libgpac-dev libtesseract-dev libavformat-dev libavdevice-dev libswscale-dev yasm
+    - name: cmake
+      run: |
+        mkdir build && cd build
+        cmake -DWITH_OCR=ON -DWITH_HARDSUBX=ON ../src
+    - name: build
+      run: |
+        make -j$(nproc)
+      working-directory: build
+    - name: Display version information
+      run: ./build/ccextractor --version
+  build_rust:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Install dependencies
+        run: sudo apt update && sudo apt-get install libgpac-dev
+      - uses: actions/checkout@v6
+      - name: cache 
+        uses: actions/cache@v5
+        with:
+          path: |
+            src/rust/.cargo/registry
+            src/rust/.cargo/git
+            src/rust/target
+          key: ${{ runner.os }}-cargo-${{ hashFiles('**/Cargo.lock') }}
+          restore-keys: ${{ runner.os }}-cargo-
+      - uses: actions-rs/toolchain@v1
+        with:
+          toolchain: stable
+          override: true
+      - name: build
+        run: cargo build 
+        working-directory: ./src/rust
--- a/.github/workflows/build_linux_systemlibs.yml
+++ b/.github/workflows/build_linux_systemlibs.yml
@@ -0,0 +1,154 @@
+name: Build Linux (System Libs)
+
+on:
+  # Build on releases
+  release:
+    types: [published]
+  # Allow manual trigger
+  workflow_dispatch:
+    inputs:
+      build_type:
+        description: 'Build type (all, basic, hardsubx)'
+        required: false
+        default: 'all'
+  # Build on pushes to workflow file for testing
+  push:
+    paths:
+      - '.github/workflows/build_linux_systemlibs.yml'
+      - 'linux/build'
+
+permissions:
+  contents: write
+
+jobs:
+  build-systemlibs:
+    runs-on: ubuntu-22.04
+    strategy:
+      fail-fast: false
+      matrix:
+        build_type: [basic, hardsubx]
+
+    steps:
+      - name: Check if should build this variant
+        id: should_build
+        run: |
+          if [ "${{ github.event_name }}" = "workflow_dispatch" ]; then
+            INPUT_TYPE="${{ github.event.inputs.build_type }}"
+            if [ "$INPUT_TYPE" = "all" ] || [ "$INPUT_TYPE" = "${{ matrix.build_type }}" ]; then
+              echo "should_build=true" >> $GITHUB_OUTPUT
+            else
+              echo "should_build=false" >> $GITHUB_OUTPUT
+            fi
+          else
+            echo "should_build=true" >> $GITHUB_OUTPUT
+          fi
+
+      - name: Checkout repository
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: actions/checkout@v6
+
+      - name: Install base dependencies
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          sudo apt-get update
+          sudo apt-get install -y --no-install-recommends \
+            build-essential \
+            pkg-config \
+            zlib1g-dev \
+            libpng-dev \
+            libfreetype-dev \
+            libutf8proc-dev \
+            libgpac-dev \
+            libtesseract-dev \
+            libleptonica-dev \
+            tesseract-ocr-eng \
+            clang \
+            libclang-dev
+
+      - name: Install FFmpeg dependencies (HardSubX)
+        if: steps.should_build.outputs.should_build == 'true' && matrix.build_type == 'hardsubx'
+        run: |
+          sudo apt-get install -y --no-install-recommends \
+            libavcodec-dev \
+            libavformat-dev \
+            libavutil-dev \
+            libswscale-dev \
+            libswresample-dev \
+            libavfilter-dev \
+            libavdevice-dev \
+            libxcb1-dev \
+            libxcb-shm0-dev \
+            libx11-dev \
+            liblzma-dev
+
+      - name: Install Rust toolchain
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: dtolnay/rust-toolchain@stable
+
+      - name: Build with system libraries
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          cd linux
+          if [ "${{ matrix.build_type }}" = "hardsubx" ]; then
+            ./build -system-libs -hardsubx
+          else
+            ./build -system-libs
+          fi
+
+      - name: Verify build
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          ./linux/ccextractor --version
+          echo "=== Library dependencies ==="
+          ldd ./linux/ccextractor | grep -E 'freetype|png|utf8proc|tesseract|leptonica' || true
+
+      - name: Get output name
+        if: steps.should_build.outputs.should_build == 'true'
+        id: output_name
+        run: |
+          case "${{ matrix.build_type }}" in
+            basic)
+              echo "name=ccextractor-linux-systemlibs-x86_64" >> $GITHUB_OUTPUT
+              ;;
+            hardsubx)
+              echo "name=ccextractor-linux-systemlibs-hardsubx-x86_64" >> $GITHUB_OUTPUT
+              ;;
+          esac
+
+      - name: Package binary
+        if: steps.should_build.outputs.should_build == 'true'
+        run: |
+          mkdir -p package
+          cp linux/ccextractor package/
+          # Create a simple README for the package
+          cat > package/README.txt << 'EOF'
+          CCExtractor - System Libraries Build
+          =====================================
+
+          This build uses system libraries (dynamic linking).
+
+          Required system packages (Debian/Ubuntu):
+            sudo apt install libgpac12 libtesseract5 libleptonica6 \
+                             libpng16-16 libfreetype6 libutf8proc3
+
+          For HardSubX builds, also install:
+            sudo apt install libavcodec60 libavformat60 libswscale7 libavfilter9
+
+          Run with: ./ccextractor --help
+          EOF
+          tar -czvf ${{ steps.output_name.outputs.name }}.tar.gz -C package .
+
+      - name: Upload artifact
+        if: steps.should_build.outputs.should_build == 'true'
+        uses: actions/upload-artifact@v6
+        with:
+          name: ${{ steps.output_name.outputs.name }}
+          path: ${{ steps.output_name.outputs.name }}.tar.gz
+
+      - name: Upload to Release
+        if: steps.should_build.outputs.should_build == 'true' && github.event_name == 'release'
+        uses: softprops/action-gh-release@v2
+        with:
+          files: ${{ steps.output_name.outputs.name }}.tar.gz
+        env:
+          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
--- a/.github/workflows/build_mac.yml
+++ b/.github/workflows/build_mac.yml
@@ -0,0 +1,177 @@
+name: Build CCExtractor on Mac
+
+on:
+  workflow_dispatch:
+  push:
+    paths:
+    - '.github/workflows/build_mac.yml'
+    - '**.c'
+    - '**.h'
+    - '**CMakeLists.txt'
+    - '**.cmake'
+    - '**Makefile**'
+    - 'mac/**'
+    - 'package_creators/**'
+    - 'src/rust/**'
+  pull_request:
+    types: [opened, synchronize, reopened]
+    paths:
+    - '.github/workflows/build_mac.yml'
+    - '**.c'
+    - '**.h'
+    - '**CMakeLists.txt'
+    - '**.cmake'
+    - '**Makefile**'
+    - 'mac/**'
+    - 'package_creators/**'
+    - 'src/rust/**'
+jobs:
+  build_shell:
+    runs-on: macos-latest
+    steps:
+    - name: Install dependencies
+      run: brew install pkg-config autoconf automake libtool tesseract leptonica gpac
+    - uses: actions/checkout@v6
+    - name: build
+      run: ./build.command
+      working-directory: ./mac
+    - name: Display version information
+      run: ./ccextractor --version
+      working-directory: ./mac
+    - name: Prepare artifacts
+      run: mkdir ./mac/artifacts
+    - name: Copy release artifact
+      run: cp ./mac/ccextractor ./mac/artifacts/
+    - uses: actions/upload-artifact@v6
+      with:
+        name: CCExtractor mac build
+        path: ./mac/artifacts
+  build_shell_system_libs:
+    # Test building with system libraries via pkg-config (for Homebrew formula compatibility)
+    runs-on: macos-latest
+    steps:
+    - name: Install dependencies
+      run: brew install pkg-config autoconf automake libtool tesseract leptonica gpac freetype libpng protobuf-c utf8proc zlib
+    - uses: actions/checkout@v6
+    - name: build with system libs
+      run: ./build.command -system-libs
+      working-directory: ./mac
+    - name: Display version information
+      run: ./ccextractor --version
+      working-directory: ./mac
+  build_autoconf:
+    runs-on: macos-latest
+    steps:
+    - uses: actions/checkout@v6
+    - name: Install dependencies
+      run: brew install pkg-config autoconf automake libtool gpac
+    - name: run autogen
+      run: ./autogen.sh
+      working-directory: ./mac
+    - name: configure
+      run: ./configure --enable-debug
+      working-directory: ./mac
+    - name: make
+      run: make
+      working-directory: ./mac
+    - name: Display version information
+      run: ./ccextractor --version
+      working-directory: ./mac
+  cmake:
+    runs-on: macos-latest
+    steps:
+    - uses: actions/checkout@v6
+    - name: dependencies
+      run: brew install gpac
+    - uses: actions/checkout@v6
+    - name: cmake
+      run: mkdir build && cd build && cmake ../src
+    - name: build
+      run: make -j$(nproc)
+      working-directory: build
+    - name: Display version information
+      run: ./build/ccextractor --version
+  cmake_ocr_hardsubx:
+    runs-on: macos-latest
+    steps:
+    - uses: actions/checkout@v6
+    - name: Install dependencies
+      run: brew install pkg-config autoconf automake libtool tesseract leptonica gpac ffmpeg
+    - name: cmake
+      run: |
+        mkdir build && cd build
+        cmake -DWITH_OCR=ON -DWITH_HARDSUBX=ON ../src
+    - name: build
+      run: |
+        make -j$(nproc)
+      working-directory: build
+    - name: Display version information
+      run: ./build/ccextractor --version
+  build_shell_hardsubx:
+    # Test build.command with -hardsubx flag (burned-in subtitle extraction)
+    runs-on: macos-latest
+    steps:
+    - name: Install dependencies
+      run: brew install pkg-config autoconf automake libtool tesseract leptonica gpac ffmpeg
+    - uses: actions/checkout@v6
+    - name: build with hardsubx
+      run: ./build.command -hardsubx
+      working-directory: ./mac
+    - name: Display version information
+      run: ./ccextractor --version
+      working-directory: ./mac
+    - name: Verify hardsubx support
+      run: |
+        # Check that -hardsubx is recognized (will fail if not compiled in)
+        ./ccextractor -hardsubx --help 2>&1 | head -20 || true
+      working-directory: ./mac
+  build_autoconf_hardsubx:
+    # Test autoconf build with HARDSUBX enabled (fixes issue #1173)
+    runs-on: macos-latest
+    steps:
+    - uses: actions/checkout@v6
+    - name: Install dependencies
+      run: brew install pkg-config autoconf automake libtool tesseract leptonica gpac ffmpeg
+    - name: run autogen
+      run: ./autogen.sh
+      working-directory: ./mac
+    - name: configure with hardsubx
+      run: |
+        # Set Homebrew paths for configure to find libraries
+        export HOMEBREW_PREFIX="$(brew --prefix)"
+        export LDFLAGS="-L${HOMEBREW_PREFIX}/lib"
+        export CPPFLAGS="-I${HOMEBREW_PREFIX}/include"
+        export PKG_CONFIG_PATH="${HOMEBREW_PREFIX}/lib/pkgconfig"
+        ./configure --enable-hardsubx --enable-ocr
+      working-directory: ./mac
+    - name: make
+      run: make
+      working-directory: ./mac
+    - name: Display version information
+      run: ./ccextractor --version
+      working-directory: ./mac
+    - name: Verify hardsubx support
+      run: |
+        # Check that -hardsubx is recognized
+        ./ccextractor -hardsubx --help 2>&1 | head -20 || true
+      working-directory: ./mac
+  build_rust:
+    runs-on: macos-latest
+    steps:
+      - uses: actions/checkout@v6
+      - name: cache
+        uses: actions/cache@v5
+        with:
+          path: |
+            src/rust/.cargo/registry
+            src/rust/.cargo/git
+            src/rust/target
+          key: ${{ runner.os }}-cargo-${{ hashFiles('**/Cargo.lock') }}
+          restore-keys: ${{ runner.os }}-cargo-
+      - uses: actions-rs/toolchain@v1
+        with:
+          toolchain: stable
+          override: true
+      - name: build
+        run: cargo build
+        working-directory: ./src/rust
--- a/.github/workflows/build_snap.yml
+++ b/.github/workflows/build_snap.yml
@@ -0,0 +1,51 @@
+name: Build CCExtractor Snap
+
+on:
+  workflow_dispatch:
+  release:
+    types: [published]
+
+jobs:
+  build_snap:
+    name: Build Snap package
+    runs-on: ubuntu-22.04
+
+    steps:
+      - name: Checkout repository
+        uses: actions/checkout@v6
+
+      - name: Install snapd
+        run: |
+          sudo apt update
+          sudo apt install -y snapd
+
+      - name: Start snapd
+        run: |
+          sudo systemctl start snapd.socket
+          sudo systemctl start snapd
+
+      - name: Install Snapcraft
+        run: |
+          sudo snap install core22
+          sudo snap install snapcraft --classic
+
+      - name: Show Snapcraft version
+        run: snapcraft --version
+
+      - name: Build snap
+        run: sudo snapcraft --destructive-mode
+
+      - name: List generated snap
+        run: ls -lh *.snap
+
+      - name: Upload snap as workflow artifact
+        uses: actions/upload-artifact@v6
+        with:
+          name: CCExtractor Snap
+          path: "*.snap"
+
+      - name: Upload snap to GitHub Release
+        if: github.event_name == 'release'
+        uses: softprops/action-gh-release@v2
+        with:
+          files: "*.snap"
--- a/.github/workflows/build_windows.yml
+++ b/.github/workflows/build_windows.yml
@@ -0,0 +1,142 @@
+name: Build CCExtractor on Windows
+
+env:
+  RUSTFLAGS: -Ctarget-feature=+crt-static
+  VCPKG_DEFAULT_TRIPLET: x64-windows-static
+  VCPKG_COMMIT: ab2977be50c702126336e5088f4836060733c899
+
+on:
+  workflow_dispatch:
+  push:
+    paths:
+      - ".github/workflows/build_windows.yml"
+      - "**.c"
+      - "**.h"
+      - "**CMakeLists.txt"
+      - "**.cmake"
+      - "windows/**"
+      - "src/rust/**"
+  pull_request:
+    types: [opened, synchronize, reopened]
+    paths:
+      - ".github/workflows/build_windows.yml"
+      - "**.c"
+      - "**.h"
+      - "**CMakeLists.txt"
+      - "**.cmake"
+      - "windows/**"
+      - "src/rust/**"
+
+jobs:
+  build:
+    runs-on: windows-2022
+    steps:
+      - name: Check out repository
+        uses: actions/checkout@v6
+
+      - name: Setup MSBuild.exe
+        uses: microsoft/setup-msbuild@v2.0.0
+        with:
+          msbuild-architecture: x64
+
+      # Install GPAC (fast, ~30s, not worth caching complexity)
+      - name: Install gpac
+        run: choco install gpac --version 2.4.0 --no-progress
+
+      # Use lukka/run-vcpkg for better caching
+      - name: Setup vcpkg
+        uses: lukka/run-vcpkg@v11
+        id: runvcpkg
+        with:
+          vcpkgGitCommitId: ${{ env.VCPKG_COMMIT }}
+          vcpkgDirectory: ${{ github.workspace }}/vcpkg
+          vcpkgJsonGlob: 'windows/vcpkg.json'
+
+      # Cache vcpkg installed packages separately for faster restores
+      - name: Cache vcpkg installed packages
+        id: vcpkg-installed-cache
+        uses: actions/cache@v5
+        with:
+          path: ${{ github.workspace }}/vcpkg/installed
+          key: vcpkg-installed-${{ runner.os }}-${{ env.VCPKG_COMMIT }}-${{ hashFiles('windows/vcpkg.json') }}
+          restore-keys: |
+            vcpkg-installed-${{ runner.os }}-${{ env.VCPKG_COMMIT }}-
+
+      - name: Install vcpkg dependencies
+        if: steps.vcpkg-installed-cache.outputs.cache-hit != 'true'
+        run: ${{ github.workspace }}/vcpkg/vcpkg.exe install --x-install-root ${{ github.workspace }}/vcpkg/installed/
+        working-directory: windows
+
+      # Cache Rust/Cargo artifacts
+      - name: Cache Cargo registry
+        uses: actions/cache@v5
+        with:
+          path: |
+            ~/.cargo/registry
+            ~/.cargo/git
+          key: ${{ runner.os }}-cargo-registry-${{ hashFiles('**/Cargo.lock') }}
+          restore-keys: |
+            ${{ runner.os }}-cargo-registry-
+
+      # Cache Cargo build artifacts - rust.bat sets CARGO_TARGET_DIR to windows/
+      # which results in artifacts at windows/x86_64-pc-windows-msvc/
+      - name: Cache Cargo build artifacts
+        uses: actions/cache@v5
+        with:
+          path: ${{ github.workspace }}/windows/x86_64-pc-windows-msvc
+          key: ${{ runner.os }}-cargo-build-${{ hashFiles('**/Cargo.lock') }}-${{ hashFiles('src/rust/**/*.rs') }}
+          restore-keys: |
+            ${{ runner.os }}-cargo-build-${{ hashFiles('**/Cargo.lock') }}-
+            ${{ runner.os }}-cargo-build-
+
+      - name: Setup Rust toolchain
+        uses: dtolnay/rust-toolchain@stable
+
+      - name: Install Win 10 SDK
+        uses: ilammy/msvc-dev-cmd@v1
+
+      # Build Release-Full
+      - name: Build Release-Full
+        env:
+          LIBCLANG_PATH: "C:\\Program Files\\LLVM\\lib"
+          LLVM_CONFIG_PATH: "C:\\Program Files\\LLVM\\bin\\llvm-config"
+          BINDGEN_EXTRA_CLANG_ARGS: -fmsc-version=0
+          VCPKG_ROOT: ${{ github.workspace }}/vcpkg
+        run: msbuild ccextractor.sln /p:Configuration=Release-Full /p:Platform=x64
+        working-directory: ./windows
+
+      - name: Display Release version information
+        run: ./ccextractorwinfull.exe --version
+        working-directory: ./windows/x64/Release-Full
+
+      - name: Upload Release artifact
+        uses: actions/upload-artifact@v6
+        with:
+          name: CCExtractor Windows Release build
+          path: |
+            ./windows/x64/Release-Full/ccextractorwinfull.exe
+            ./windows/x64/Release-Full/*.dll
+
+      # Build Debug-Full (reuses cached Cargo artifacts)
+      - name: Build Debug-Full
+        env:
+          LIBCLANG_PATH: "C:\\Program Files\\LLVM\\lib"
+          LLVM_CONFIG_PATH: "C:\\Program Files\\LLVM\\bin\\llvm-config"
+          BINDGEN_EXTRA_CLANG_ARGS: -fmsc-version=0
+          VCPKG_ROOT: ${{ github.workspace }}/vcpkg
+        run: msbuild ccextractor.sln /p:Configuration=Debug-Full /p:Platform=x64
+        working-directory: ./windows
+
+      - name: Display Debug version information
+        continue-on-error: true
+        run: ./ccextractorwinfull.exe --version
+        working-directory: ./windows/x64/Debug-Full
+
+      - name: Upload Debug artifact
+        uses: actions/upload-artifact@v6
+        with:
+          name: CCExtractor Windows Debug build
+          path: |
+            ./windows/x64/Debug-Full/ccextractorwinfull.exe
+            ./windows/x64/Debug-Full/ccextractorwinfull.pdb
+            ./windows/x64/Debug-Full/*.dll
--- a/.github/workflows/format.yml
+++ b/.github/workflows/format.yml
@@ -0,0 +1,57 @@
+name: Format sourcecode
+on:
+  push:
+    paths:
+    - '.github/workflows/format.yml'
+    - 'src/**.c'
+    - 'src/**.h'
+    - 'src/rust/**'
+    tags-ignore: # ignore push via new tag
+    - '*.*'
+  pull_request:
+    types: [opened, synchronize, reopened]
+    paths:
+    - '.github/workflows/format.yml'
+    - 'src/**.c'
+    - 'src/**.h'
+    - 'src/rust/**'
+jobs:
+  format:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v6
+      - name: Format code
+        run: |
+          find src/ -type f -not -path "src/thirdparty/*" -not -path "src/lib_ccx/zvbi/*" -name '*.c' -not -path "src/GUI/icon_data.c" | xargs clang-format -i
+          git diff-index --quiet HEAD -- || (git diff && exit 1)
+  format_rust:
+    runs-on: ubuntu-latest
+    strategy:
+      matrix:
+        workdir: ['./src/rust', './src/rust/lib_ccxr']
+    defaults:
+      run:
+        working-directory: ${{ matrix.workdir }}
+    steps:
+      - uses: actions/checkout@v6
+      - name: cache 
+        uses: actions/cache@v5
+        with:
+          path: |
+            ${{ matrix.workdir }}/.cargo/registry
+            ${{ matrix.workdir }}/.cargo/git
+            ${{ matrix.workdir }}/target
+          key: ${{ runner.os }}-cargo-${{ hashFiles('${{ matrix.workdir }}/Cargo.lock') }}
+          restore-keys: ${{ runner.os }}-cargo-
+      - uses: actions-rs/toolchain@v1
+        with:
+          toolchain: stable
+          override: true
+          components: rustfmt, clippy
+      - name: dependencies
+        run: sudo apt update && sudo apt install libtesseract-dev  libavformat-dev libavdevice-dev libswscale-dev yasm
+      - name: rustfmt
+        run: cargo fmt --all -- --check
+      - name: clippy
+        run: |
+          cargo clippy -- -D warnings
--- a/.github/workflows/homebrew.yml
+++ b/.github/workflows/homebrew.yml
@@ -0,0 +1,15 @@
+name: Bump Homebrew Formula
+
+on:
+  release:
+    types: [published]
+
+jobs:
+  homebrew:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Update Homebrew formula
+        uses: dawidd6/action-homebrew-bump-formula@v7
+        with:
+          token: ${{ secrets.HOMEBREW_GITHUB_API_TOKEN }}
+          formula: ccextractor
--- a/.github/workflows/publish_chocolatey.yml
+++ b/.github/workflows/publish_chocolatey.yml
@@ -0,0 +1,136 @@
+# Publish to Chocolatey Community Repository
+#
+# PREREQUISITES:
+# 1. Create a Chocolatey account at https://community.chocolatey.org/account/Register
+# 2. Get your API key from https://community.chocolatey.org/account
+# 3. Add the API key as repository secret: CHOCOLATEY_API_KEY
+#
+# Reference: https://docs.chocolatey.org/en-us/create/create-packages-quick-start
+
+name: Publish to Chocolatey
+
+on:
+  release:
+    types: [released]
+  workflow_dispatch:
+    inputs:
+      release_tag:
+        description: 'Release tag to publish (e.g., v0.96.1)'
+        required: true
+        type: string
+
+jobs:
+  publish:
+    runs-on: windows-latest
+    steps:
+      - name: Checkout repository
+        uses: actions/checkout@v6
+
+      - name: Get version from tag
+        id: version
+        shell: bash
+        run: |
+          TAG="${{ github.event.inputs.release_tag || github.event.release.tag_name }}"
+          # Strip 'v' prefix if present
+          VERSION="${TAG#v}"
+          echo "version=$VERSION" >> $GITHUB_OUTPUT
+          echo "tag=$TAG" >> $GITHUB_OUTPUT
+
+      - name: Download MSI from release
+        shell: pwsh
+        run: |
+          $version = "${{ steps.version.outputs.version }}"
+          $tag = "${{ steps.version.outputs.tag }}"
+          $msiUrl = "https://github.com/CCExtractor/ccextractor/releases/download/$tag/CCExtractor.$version.msi"
+
+          Write-Host "Downloading MSI from: $msiUrl"
+          Invoke-WebRequest -Uri $msiUrl -OutFile "CCExtractor.msi"
+
+          # Calculate SHA256 checksum
+          $hash = (Get-FileHash -Path "CCExtractor.msi" -Algorithm SHA256).Hash
+          Write-Host "SHA256: $hash"
+          echo "MSI_CHECKSUM=$hash" >> $env:GITHUB_ENV
+
+      - name: Update nuspec version
+        shell: pwsh
+        run: |
+          $version = "${{ steps.version.outputs.version }}"
+          $nuspecPath = "packaging/chocolatey/ccextractor.nuspec"
+
+          $content = Get-Content $nuspecPath -Raw
+          $content = $content -replace '<version>.*</version>', "<version>$version</version>"
+          Set-Content -Path $nuspecPath -Value $content
+
+          Write-Host "Updated nuspec to version $version"
+
+      - name: Update install script
+        shell: pwsh
+        run: |
+          $version = "${{ steps.version.outputs.version }}"
+          $tag = "${{ steps.version.outputs.tag }}"
+          $checksum = $env:MSI_CHECKSUM
+          $installScript = "packaging/chocolatey/tools/chocolateyInstall.ps1"
+
+          $content = Get-Content $installScript -Raw
+
+          # Update URL
+          $newUrl = "https://github.com/CCExtractor/ccextractor/releases/download/$tag/CCExtractor.$version.msi"
+          $content = $content -replace "url64bit\s*=\s*'[^']*'", "url64bit       = '$newUrl'"
+
+          # Update checksum
+          $content = $content -replace "checksum64\s*=\s*'[^']*'", "checksum64     = '$checksum'"
+
+          Set-Content -Path $installScript -Value $content
+
+          Write-Host "Updated install script with URL and checksum"
+
+      - name: Build Chocolatey package
+        shell: pwsh
+        run: |
+          cd packaging/chocolatey
+          choco pack ccextractor.nuspec
+
+          # List the generated package
+          Get-ChildItem *.nupkg
+
+      - name: Test package locally
+        shell: pwsh
+        run: |
+          cd packaging/chocolatey
+          $nupkg = Get-ChildItem *.nupkg | Select-Object -First 1
+          Write-Host "Testing package: $($nupkg.Name)"
+
+          # Install from local package
+          choco install ccextractor --source="'.;https://community.chocolatey.org/api/v2/'" --yes --force
+
+          # Verify installation
+          $ccx = Get-Command ccextractor -ErrorAction SilentlyContinue
+          if ($ccx) {
+            Write-Host "CCExtractor found at: $($ccx.Source)"
+            & ccextractor --version
+          } else {
+            Write-Host "CCExtractor not found in PATH, checking Program Files..."
+            $exePath = Join-Path $env:ProgramFiles "CCExtractor\ccextractor.exe"
+            if (Test-Path $exePath) {
+              & $exePath --version
+            }
+          }
+
+      - name: Push to Chocolatey
+        shell: pwsh
+        env:
+          CHOCOLATEY_API_KEY: ${{ secrets.CHOCOLATEY_API_KEY }}
+        run: |
+          cd packaging/chocolatey
+          $nupkg = Get-ChildItem *.nupkg | Select-Object -First 1
+
+          Write-Host "Pushing $($nupkg.Name) to Chocolatey..."
+          choco push $nupkg.Name --source="https://push.chocolatey.org/" --api-key="$env:CHOCOLATEY_API_KEY"
+
+          Write-Host "Package submitted to Chocolatey! It may take some time to be moderated and published."
+
+      - name: Upload package artifact
+        uses: actions/upload-artifact@v6
+        with:
+          name: chocolatey-package
+          path: packaging/chocolatey/*.nupkg
--- a/.github/workflows/publish_winget.yml
+++ b/.github/workflows/publish_winget.yml
@@ -0,0 +1,38 @@
+# Publish to Windows Package Manager (winget)
+#
+# PREREQUISITES:
+# 1. CCExtractor must already have ONE version in winget-pkgs before this works
+#    - Submit the initial manifest manually from packaging/winget/
+#    - PR to: https://github.com/microsoft/winget-pkgs
+#
+# 2. Create a fork of microsoft/winget-pkgs under the CCExtractor organization
+#    - https://github.com/CCExtractor/winget-pkgs (needs to be created)
+#
+# 3. Create a GitHub Personal Access Token (classic) with 'public_repo' scope
+#    - Add as repository secret: WINGET_TOKEN
+#
+# Reference: https://github.com/vedantmgoyal9/winget-releaser
+
+name: Publish to WinGet
+
+on:
+  release:
+    types: [released]
+  workflow_dispatch:
+    inputs:
+      release_tag:
+        description: 'Release tag to publish (e.g., v0.96.1)'
+        required: true
+        type: string
+
+jobs:
+  publish:
+    runs-on: windows-latest
+    steps:
+      - name: Publish to WinGet
+        uses: vedantmgoyal9/winget-releaser@v2
+        with:
+          identifier: CCExtractor.CCExtractor
+          installers-regex: '\.msi$'  # Only use the MSI installer
+          token: ${{ secrets.WINGET_TOKEN }}
+          release-tag: ${{ github.event.inputs.release_tag || github.event.release.tag_name }}
--- a/.github/workflows/release.yml
+++ b/.github/workflows/release.yml
@@ -0,0 +1,137 @@
+name: Upload releases
+
+on:
+  release:
+    types:
+      - created
+
+permissions:
+  contents: write
+
+env:
+  RUSTFLAGS: -Ctarget-feature=+crt-static
+  VCPKG_DEFAULT_TRIPLET: x64-windows-static
+  VCPKG_DEFAULT_BINARY_CACHE: C:\vcpkg\.cache
+  VCPKG_COMMIT: ab2977be50c702126336e5088f4836060733c899
+
+jobs:
+  build_windows:
+    runs-on: windows-2022
+    steps:
+    - name: Check out repository
+      uses: actions/checkout@v6
+    - name: Get the version
+      id: get_version
+      run: |
+        # Extract version from tag, strip 'v' prefix and everything after first dash
+        VERSION=${GITHUB_REF/refs\/tags\/v/}
+        VERSION=${VERSION%%-*}
+        # Save display version for filenames (e.g., 0.96.1)
+        echo ::set-output name=DISPLAY_VERSION::$VERSION
+        # Count dots to determine version format
+        DOTS="${VERSION//[^.]}"
+        PART_COUNT=$((${#DOTS} + 1))
+        # MSI requires 4-part version (major.minor.build.revision)
+        if [ "$PART_COUNT" -eq 2 ]; then
+          MSI_VERSION="${VERSION}.0.0"
+        elif [ "$PART_COUNT" -eq 3 ]; then
+          MSI_VERSION="${VERSION}.0"
+        else
+          MSI_VERSION="${VERSION}"
+        fi
+        echo ::set-output name=VERSION::$MSI_VERSION
+      shell: bash
+    - name: Setup MSBuild.exe
+      uses: microsoft/setup-msbuild@v2.0.0
+      with:
+        msbuild-architecture: x64
+    - name: Install gpac
+      run: choco install gpac --version 2.4.0
+    - name: Setup vcpkg
+      run: mkdir C:\vcpkg\.cache
+    - name: Cache vcpkg
+      id: cache
+      uses: actions/cache@v5
+      with:
+        path: |
+          C:\vcpkg\.cache
+        key: vcpkg-${{ runner.os }}-${{ env.VCPKG_COMMIT }}
+    - name: Build vcpkg
+      run: |
+        git clone https://github.com/microsoft/vcpkg
+        ./vcpkg/bootstrap-vcpkg.bat
+    - name: Install dependencies
+      run: ${{ github.workspace }}/vcpkg/vcpkg.exe install --x-install-root ${{ github.workspace }}/vcpkg/installed/
+      working-directory: windows
+    - uses: actions-rs/toolchain@v1
+      with:
+        toolchain: stable
+        override: true
+    - name: Install Win 10 SDK
+      uses: ilammy/msvc-dev-cmd@v1
+    - name: build Release-Full
+      env:
+        LIBCLANG_PATH: "C:\\Program Files\\LLVM\\lib"
+        LLVM_CONFIG_PATH: "C:\\Program Files\\LLVM\\bin\\llvm-config"
+        CARGO_TARGET_DIR: "..\\..\\windows"
+        BINDGEN_EXTRA_CLANG_ARGS: -fmsc-version=0
+        VCPKG_ROOT: ${{ github.workspace }}/vcpkg
+      run: msbuild ccextractor.sln /p:Configuration=Release-Full /p:Platform=x64
+      working-directory: ./windows
+    - name: Copy files to directory for installer
+      run: mkdir installer; cp ./x64/Release-Full/ccextractorwinfull.exe ./installer; cp ./x64/Release-Full/*.dll ./installer
+      working-directory: ./windows
+    - name: Download tessdata for OCR support
+      run: |
+        mkdir -p ./installer/tessdata
+        # Download English traineddata from tessdata_fast (smaller, faster, good for most use cases)
+        Invoke-WebRequest -Uri "https://github.com/tesseract-ocr/tessdata_fast/raw/main/eng.traineddata" -OutFile "./installer/tessdata/eng.traineddata"
+        # Download OSD (Orientation and Script Detection) for automatic script detection
+        Invoke-WebRequest -Uri "https://github.com/tesseract-ocr/tessdata_fast/raw/main/osd.traineddata" -OutFile "./installer/tessdata/osd.traineddata"
+      working-directory: ./windows
+    - name: install WiX
+      run: dotnet tool uninstall --global wix; dotnet tool install --global wix --version 6.0.2 && wix extension add -g WixToolset.UI.wixext/6.0.2
+    - name: Make sure WiX works
+      run: wix --version && wix extension list -g
+    - name: Download Flutter GUI
+      run: ((Invoke-WebRequest -UseBasicParsing https://api.github.com/repos/CCExtractor/ccextractorfluttergui/releases/latest).Content | ConvertFrom-Json).assets | ForEach-Object {if ($_.name -eq "windows.zip") { Invoke-WebRequest -UseBasicParsing -Uri $_.browser_download_url -OutFile windows.zip}}
+      working-directory: ./windows
+    - name: Display contents of dir
+      run: ls
+      working-directory: ./windows
+    - name: Unzip Flutter GUI
+      run: Expand-Archive -Path ./windows.zip -DestinationPath ./installer -Force
+      working-directory: ./windows
+    - name: Display installer folder contents
+      run: Get-ChildItem -Recurse ./installer
+      working-directory: ./windows
+    - name: Create portable zip
+      run: Compress-Archive -Path ./installer/* -DestinationPath ./CCExtractor.${{ steps.get_version.outputs.DISPLAY_VERSION }}_win_portable.zip
+      working-directory: ./windows
+    - name: Build installer
+      run:  wix build -arch x64 -ext WixToolset.UI.wixext -d "AppVersion=${{ steps.get_version.outputs.VERSION }}" -o CCExtractor.${{ steps.get_version.outputs.DISPLAY_VERSION }}.msi installer.wxs CustomUI.wxs
+      working-directory: ./windows
+    - name: Upload as asset
+      uses: AButler/upload-release-assets@v3.0
+      with:
+        files: './windows/CCExtractor.${{ steps.get_version.outputs.DISPLAY_VERSION }}.msi;./windows/CCExtractor.${{ steps.get_version.outputs.DISPLAY_VERSION }}_win_portable.zip'
+        repo-token: ${{ secrets.GITHUB_TOKEN }}
+  create_linux_package:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v6
+        with:
+          path: ./ccextractor
+      - name: Get the version
+        id: get_version
+        run: |
+          VERSION=${GITHUB_REF/refs\/tags\/v/}
+          VERSION=${VERSION%%-*}
+          echo ::set-output name=DISPLAY_VERSION::$VERSION
+      - name: Create .tar.gz without git and windows folders
+        run: tar -pczf ./ccextractor.${{ steps.get_version.outputs.DISPLAY_VERSION }}.tar.gz --exclude "ccextractor/windows" --exclude "ccextractor/.git" ccextractor
+      - name: Upload as asset
+        uses: AButler/upload-release-assets@v3.0
+        with:
+          files: './ccextractor.${{ steps.get_version.outputs.DISPLAY_VERSION }}.tar.gz'
+          repo-token: ${{ secrets.GITHUB_TOKEN }}
--- a/.github/workflows/test_rust.yml
+++ b/.github/workflows/test_rust.yml
@@ -0,0 +1,41 @@
+name: Unit Test Rust
+on:
+  push:
+    paths:
+      - ".github/workflows/test.yml"
+      - "src/rust/**"
+    tags-ignore:
+      - "*.*"
+  pull_request:
+    types: [opened, synchronize, reopened]
+    paths:
+      - ".github/workflows/test.yml"
+      - "src/rust/**"
+jobs:
+  test_rust:
+    runs-on: ubuntu-latest
+    defaults:
+      run:
+        working-directory: ./src/rust
+    steps:
+      - uses: actions/checkout@v6
+      - name: cache
+        uses: actions/cache@v5
+        with:
+          path: |
+            src/rust/.cargo/registry
+            src/rust/.cargo/git
+            src/rust/target
+            src/rust/lib_ccxr/target
+          key: ${{ runner.os }}-cargo-${{ hashFiles('**/Cargo.lock') }}
+          restore-keys: ${{ runner.os }}-cargo-
+      - uses: actions-rs/toolchain@v1
+        with:
+          toolchain: stable
+          override: true
+      - name: Test main module
+        run: cargo test
+        working-directory: ./src/rust
+      - name: Test lib_ccxr module
+        run: cargo test
+        working-directory: ./src/rust/lib_ccxr
--- a/.gitignore
+++ b/.gitignore
@@ -13,10 +13,15 @@ CVS
 ####
 # Linux  Ignored binary and build folder
 *.o
+*.so
+mac/ccextractor
 linux/ccextractor
 linux/depend
+linux/build_scan/
+windows/x86_64-pc-windows-msvc/**
 windows/Debug/**
 windows/Debug-OCR/**
+windows/release-with-debug/**
 windows/Release/**
 windows/Release-Full/**
 windows/Release-OCR/**
@@ -24,15 +29,25 @@ windows/Debug-Full/**
 windows/x64/**
 windows/ccextractor.VC.db
 build/
+build_*/
+
+####
+# Python
+*.pyc

 ####
 # Visual Studio project Ignored files

+.vs/**
+windows/.vs/**
+!windows/.vs/config/applicationhost.config
 *.suo
 *.sdf
 *.opensdf
 *.user
 *.opendb
+*.db
+*.vscode

 ####
 # Ignore the header file that is updated upon build
@@ -44,6 +59,7 @@ windows/libs/tesseract/**

 # Ctags
 *.tags*
+tags

 # Vagrant
 .vagrant/
@@ -52,3 +68,101 @@ windows/libs/tesseract/**
 .cproject
 .project
 .settings/
+
+# Mac
+.DS_Store
+windows/enc_temp_folder/*
+
+#CMake
+src/cmake-build-debug/
+src/.idea/
+
+
+#Autotools
+linux/config.h
+linux/config.log
+linux/config.status
+linux/Makefile
+linux/autom4te.cache
+linux/aclocal.m4
+linux/*.in
+linux/configure
+linux/build-conf/
+mac/rust/
+mac/config.h
+mac/config.log
+mac/config.status
+mac/Makefile
+mac/autom4te.cache
+mac/aclocal.m4
+mac/*.in
+mac/configure
+mac/build-conf/
+package_creators/*tar.gz
+package_creators/build/*.deb
+src/.deps/
+src/.dirstamp
+src/lib_ccx/.deps/
+src/lib_ccx/.dirstamp
+src/lib_hash/.deps/
+src/lib_hash/.dirstamp
+src/libpng/.deps/
+src/libpng/.dirstamp
+src/utf8proc/.deps/
+src/utf8proc/.dirstamp
+src/zlib/.deps/
+src/zlib/.dirstamp
+src/zvbi/.deps/
+src/zvbi/.dirstamp
+
+# Arch
+package_creators/*.pkg.tar.xz
+
+#RPMs
+package_creators/*.rpm
+src/lib_ccx/ccx.pc
+windows/combase.pdb/
+src/**/.deps
+src/**/.dirstamp
+mac/ccextractorGUI
+linux/ccextractorGUI
+linux/ccxGUI.ini
+linux/CMakeCache.txt
+linux/CMakeFiles/
+linux/cmake_install.cmake
+linux/install_manifest.txt
+linux/lib_ccx/
+mac/lib_ccx/
+mac/install_manifest.txt
+mac/cmake_install.cmake
+mac/CMakeFiles/
+mac/CMakeCache.txt
+*.py.bak
+
+# Bazel
+bazel*
+
+#Intellij IDEs
+.idea/
+
+# Plans (local only)
+plans/
+
+# Rust build and MakeFiles (and CMake files)
+src/rust/CMakeFiles/
+src/rust/CMakeCache.txt
+src/rust/Makefile
+src/rust/cmake_install.cmake
+src/rust/target/
+src/rust/lib_ccxr/target/
+windows/ccx_rust.lib
+windows/*/debug/*
+windows/*/CACHEDIR.TAG
+windows/.rustc_info.json
+linux/configure~
+
+# Plans and temporary files
+plans/
+tess.log
+**/tess.log
+ut=srt*
--- a/.travis.yml
+++ b/.travis.yml
@@ -0,0 +1,101 @@
+language: c
+
+matrix:
+  include:
+    - os: osx
+      osx_image: xcode10.1
+      compiler: gcc
+      addons:
+        homebrew:
+          packages:
+            autoconf
+            libtool
+            tesseract
+            leptonica
+      script:
+        - cd mac
+        - ./build.command
+        - ./ccextractor --version
+
+    - os: osx
+      osx_image: xcode10.1
+      compiler: clang
+      addons:
+        homebrew:
+          packages:
+            autoconf
+            libtool
+            tesseract
+            leptonica
+      script:
+        - cd mac
+        - ./build.command
+        - ./ccextractor --version
+
+    - os: osx
+      osx_image: xcode10.1
+      compiler: gcc
+      addons:
+        homebrew:
+          packages:
+            autoconf
+            libtool
+            tesseract
+            leptonica
+      script:
+        - cd mac
+        - ./autogen.sh
+        - ./configure
+        - make
+        - ./ccextractor --version
+
+    - os: osx
+      osx_image: xcode10.1
+      compiler: clang
+      addons:
+        homebrew:
+          packages:
+            autoconf
+            libtool
+            tesseract
+            leptonica
+      script:
+        - cd mac
+        - ./autogen.sh
+        - ./configure
+        - make
+        - ./ccextractor --version
+
+    - os: osx
+      osx_image: xcode10.1
+      compiler: gcc
+      addons:
+        homebrew:
+          packages:
+            autoconf
+            libtool
+            tesseract
+            leptonica
+      script:
+        - mkdir build
+        - cd build
+        - cmake ../src/
+        - make
+        - ./ccextractor --version
+
+    - os: osx
+      osx_image: xcode10.1
+      compiler: clang
+      addons:
+        homebrew:
+          packages:
+            autoconf
+            libtool
+            tesseract
+            leptonica
+      script:
+        - mkdir build
+        - cd build
+        - cmake ../src/
+        - make
+        - ./ccextractor --version
--- a/Dictionary/dict_Arrow.txt
+++ b/Dictionary/dict_Arrow.txt
@@ -1,637 +0,0 @@
-A.R.G.U.S Tech
-A.R.G.U.S. Agent
-A.R.G.U.S. Guard
-A.R.G.U.S. Tech
-Adam Castwidth
-Adam Donner
-Adam Hoffman
-Adam Hunt
-Additional
-Adrian Chase
-Agent
-Aglin
-Aide
-Akio Yamashiro
-Al Ow-Al
-Alan Chang
-Alan Durand
-Albie
-Alderman Richard Ford
-Alex Davis
-Alex Salese
-Alexi Leonov
-Allison Lee
-Alvarez
-Amanda Waller
-Ambushed Soldier
-Anastasia
-Anatoly Knyazev
-Andy Diggle
-Andy Jr.
-Angry Crowd Riot
-Angry Woman
-Ankov
-Anthony Venza
-Anthony Walker
-Armed Citizen
-Armed Citizen #2
-Armed Guard
-Armored Truck Driver
-Asian Driver
-Assassin
-Attendant
-Attorney
-Attractive Woman
-Badass Inmate
-Baker
-Bank Guard
-Bank Guard #1
-Bank Guard #2
-Bank Manager
-Bar Guy
-Barman
-Baron Reiter
-Barry Allen
-Bartender
-Barton Mathis
-Bean Pole
-Becky
-Ben Turner
-Benefit Patron
-Benefit Security
-Bertinelli's Thug
-Bethany Snow
-Big Donor
-Biker
-Biker Leader
-Billy Malone
-Black Hawk Guard
-Blaine
-Blake
-Bo Travis
-Bodyguard
-Bodyguard #1
-Bomb Squad Officer
-Boss
-Bouncer
-Bouncer #1
-Boy
-Bratva Thug
-Brick Thug #1
-Brick Thug #2
-Brie Larvan
-Briefcase Man
-Burly Guard
-Business Suit
-Businesswoman
-Bystander
-CSI Tech
-CSU Tech Kelton
-Caitlin Snow
-Camille Declan
-Captain
-Captain Stein
-Carl
-Carl Roberto
-Carly Diggle
-Carrie Cutter
-Carter Bowen
-Carter Hall
-Cass Derenick
-Certo
-Chase
-Chauffeur
-China White
-Chinese Man
-Chinese Pilot
-Christopher Chance
-Cisco Ramon
-Claire Abbott
-Clinton Hogue
-Clock King
-Cocktail Waitress
-College Kid
-Colonel Walker
-Colton
-Commissioner Brian Nudocerdo
-Comptroller
-Conklin
-Constantine Drakon
-Controller
-Cooper Seldon
-Cop
-Cop #1
-Cop's Wife
-Coroner
-Corrupt Cop #1
-Costa
-Councillor
-Councilman Kullons
-Count Vertigo
-Counterfeiter
-Courier
-Court Clerk
-Crewman
-Cronan
-Curtis Holt
-Customs Agent
-Cyrus Gold
-Cyrus Vanch
-D.B. Gavin Carnahan
-D.O.C. Guard
-DJ
-Damian Darhk's Assistant
-Damien Darhk
-Dance Partner
-Danny Brickwell
-Danny De La Vega
-Dark Archer
-Dark Archer Stick
-Daughter
-Dead Girl
-Dead Girl in Photograph
-Deadshot
-Dealer
-Deathbolt
-Deathstroke
-Declan Lin
-Delivery Clerk
-Dennis
-Dennis Fisk
-Derek Reston
-Derek Sampson
-Desk Guard
-Detective
-Detective Lucas Hilton
-Detective McKenna Hall
-Deveau
-Digger Harkness
-Dignitary's Wife
-Dina Salvati
-Dinah Lance
-District Attorney
-Doctor
-Dominic Alonzo
-Donna Smoak
-Double Down
-Dr. Aldus Boardman
-Dr. Anthony Ivo
-Dr. Avery Pressnall
-Dr. Douglas Miller
-Dr. Lockhart
-Dr. Neil Lamb
-Dr. Schwartz
-Dr. Vaca
-Dr. Webb
-Driver
-Drug Dealer
-Drug Dealer's Girlfriend
-Drunk Guy
-Drunk Partier
-Dying Culebra Member
-ESU Officer
-ESU Sergeant
-Eddie Walczak
-Edward Fyers
-Edward Rasmus
-Elite Man
-Emergency Services
-Emily Nocenti
-Engineer Antonov
-Eric Dunn
-Eric Moore
-Erica Vendel
-Erlich Kelso
-Esrin Fortuna
-Evan Wender
-Evelyn Sharp
-Ezra Barnes
-FBI Agent
-Federal Marshal
-Felicity Smoak
-Female Clubber
-Female Hostage
-Female Scientist
-Fence
-Field Reporter
-Fire Chief Raynes
-Firefly
-Firestorm
-First Mate
-Fitzmartin
-Frank Bertinelli
-Frank Chen
-Fred
-Friendly Bratva
-Galina
-Gambler
-Gang Leader
-Gang Member
-Gangbanger
-Gangbanger #1
-Gangbanger #2
-Gardener
-Gary
-General Matthew Shrieve
-General Vadimov
-George Wolfman
-German
-Gerry Conway
-Gholem Qadir
-Gholem Security
-Ghost
-Ghost #2
-Gideon
-Girl
-Glass Banger
-Gora
-Grandmother
-Gravano
-Greg Osborne
-Grizzled Man
-Guard
-Guard #1
-Guard #2
-Guillermo Barrera
-Gun Dealer
-Gunman #2
-Gus Sabatoni
-Halcones Gang Member
-Hardhat
-Harley Quinn
-Harold Backman
-Hawkgirl
-Heat Wave
-Helena Bertinelli
-Hendrick Von Arnim
-Hive Scientist
-Homeless Man
-Hooded Man
-Hoodie
-Hoodlum
-Hot Chick
-Hot Girl
-Housewife
-Husband
-Ian
-Infected Man
-Injured Chemist
-Inmate
-Intern
-Isaac Stanzler
-Isabel Rochev
-Ishmael Gregor
-Izzy Declan
-Jackhammer
-James Holder
-Jana Washington
-Janet Carroll
-Janice Bowen
-Jason Brodeur
-Jean Loring
-Jenn
-Jenny Russo
-Jermaine Fisher
-Jessica Danforth
-Jim Huffman
-Joanna De La Vega
-John Constantine
-John Diggle
-John Jr.
-John Le
-John Nickel
-Jordan Kern
-Jose Anton
-Joseph Falk
-Josiah Hudson
-Judge
-Judge Brackett
-Judge Mandelbaum
-Judge Moss
-Judge Sakow
-Junior Gangbanger
-Junkie
-Kandy Kane
-Kara Danvers
-Karla Groves
-Kate Spencer
-Katherine
-Katsu Cheng
-Keating
-Kendrick Weller
-Kirby Bates
-Klaus Markos
-Konstantin Kovar
-Kyle Reston
-LOA Soldier
-Lady in Red
-Landmine Soldier
-Laura - Skull A
-Laura Hoffman
-Laura Washington
-Laurel Lance
-Lead Gangbanger
-Lead Gunman
-Lead Soldier
-League Assassin
-Leo Mueller
-Li Khuan Hui
-Lieutenant
-Lieutenant Joyner
-Liling
-Linda Park
-Little Boy
-Liza Warner
-Lonnie Machin
-Lowlife
-Lt. Conahan
-Lt. Dave Ellet
-Lt. Frank Pike
-Lyla Michaels
-M.I.T Student
-MP #1
-MP #2
-MP #3
-MP #4
-Maddie
-Madison Danforth
-Maitre'd
-Malcolm Merlyn
-Malcolm's Friend
-Male Scientist
-Man
-Man #2
-Man in Suit
-Man on Bus
-Manny
-Marcus Redmond
-Margo
-Mari McCabe
-Mark Francis
-Mark Scheffer
-Mark Shaw
-Markov
-Martin Somers
-Maseo Yamashiro
-Matt Istook
-Max Fuller
-Maya Resik
-Mayor Altman
-Mayor Celia Castle
-Mayor Queen's Assistant
-Mercenary
-Mercenary #3
-Merlyn Security #1
-Merlyn Security #2
-Michael Amar
-Michael Ancona
-Middle Aged Woman
-Milo Armitage
-Mina Fayad
-Minister
-Mirakuru Soldier
-Model
-Moderator
-Moira Queen
-Morgan
-Mother at Mall
-Motorcycle Cop
-Mouthpiece
-Mr. Blank
-Mr. Gardner
-Mr. Russo
-Mrs. Gardner
-Mrs. Merlyn
-Mrs. Reston
-Mrs. Volodarsky
-Myron Forest
-Nancy Moore
-Nate Heywood
-Nathan Sierra
-Ned Foster
-Nelson Ravich
-News Caster
-News Reporter
-Nick Salvati
-Nico
-Noah Kuttler
-Nora Darhk
-Nurse
-Nyssa al Ghul
-Obnoxious Clubber
-Officer
-Officer Benton
-Officer Daily
-Officer Jones
-Officer Lopez
-Officer Thompson
-Officiant
-Oliver Queen
-Oliver's Security Detail
-Ops Leader
-Orderly
-Orphan
-Overlapping Personnel #1
-Overlapping Personnel #2
-Pablo Estevez
-Palmer Tech Guard
-Paparazzi
-Parole Officer
-Partner
-Paul
-Paul Copani
-Paul Knox
-Pedestrian
-Peter
-Peter Declan
-Peter Kang
-Petrov
-Phaedra Nixon
-Pharmacist
-Phil
-Pilot
-Pino Bertinelli
-Pirate
-Pit Boss
-Pretty Girl
-Prison Guard
-Prison Guard #1
-Prison Guard #2
-Prisoner
-Prisoner #1
-Prisoner #2
-Private Collins
-Protester
-Pudgy Emcee
-Pyotr Friedkin
-QC Security Guard
-Queen Family Lawyer
-Quentin Lance
-Ra's al Ghul
-Ragman
-Raisa
-Rameses II
-Raven
-Ray Palmer
-Real Estate Woman
-Rebecca Merlyn
-Recruit #1
-Recruit #2
-Redhead Girl
-Refugee Woman
-Rene Ramirez
-Reporter
-Reporter #1
-Reporter #2
-Reporter #3
-Reporter #5
-Richard
-Rickie - Skull Man
-Ripped
-Rob Scott
-Robert Joyce
-Robert Queen
-Rookie
-Rosie
-Roy Harper
-Russian Cop #1
-Russian Drugs Buyer
-Russian Fight Arranger
-Russian Policeman
-Ruvé Adams
-SCPD Clerk
-SCPD Detective
-SCPD Officer
-SCPD Officer #1
-SWAT #2
-SWAT #3
-SWAT #4
-Samantha Clayton
-Sara Lance
-Scared Girl
-Scientist
-Screaming Woman
-Sebastian Blood
-Secretary
-Security Guard
-Security Guard #1
-Security Guard #2
-Senator Joseph Cray
-Sentry
-Sergei
-Sergio
-Servant
-Sexy Hostess
-Shado
-Shadowspire Soldier
-Shane Colvin
-Shannon Groff
-Sharp Gangster
-Shimosawa
-Shooter
-Silhouetted Man
-Simon
-Simon Lacroix
-Sin
-Sin's Father
-Skel
-Skull B
-Slade Wilson
-Sleazy Businessman
-Sleazy Clerk
-Slim
-Sobbing Man
-Social Worker
-Soldier
-Soldier #2
-Staffer #1
-Staffer #2
-Station Attendant
-Steve Aoki
-Street Kid
-Street Tough
-Striking Woman
-Suited Bratva
-Supporter #1
-Supporter #2
-Survivor
-Susan Williams
-Susie Lawton
-Swat Team Leader
-TV Host
-Taiana
-Talibah
-Talking Head
-Task Force Agent
-Task Force Leader
-Tatsu Yamashiro
-Tattooed Inmate
-Taylor Moore
-Technician
-Technician #2
-Ted Gaynor
-Ted Grant
-Teddy Reston
-Terrified Prisoner
-The Butcher
-The Captain
-The Count
-The Dodger
-The Mayor
-The Mechanic
-The Priestess
-Thea Queen
-Thomas Flynn
-Thomas Kemp
-Thug
-Thug #1
-Tim Kaufman
-Tim Sullivan
-Tobias Church
-Tom Weston
-Tomas
-Tommy Merlyn
-Tony Daniel
-Torque
-Triad Accountant
-Triad Thug
-Turk
-US Ambassador
-Uniform
-Uniform Cop
-Untouchable #2
-Untouchable #3
-Vandal Savage
-Vendor
-Veronica Sparks
-Victor Nocenti
-Victor Swanstrom
-Vigilante
-Viktor
-Viktor's Henchman
-Vivian
-Vixen
-Vlad
-Volkov
-Volunteer
-Waiter
-Waitress
-Walter Steele
-Warehouse Worker
-Wealthy Patron
-Wheelman
-Wife
-William
-Woman
-Woman on Bus
-Yao Fei
-Young Boy
-Young Girl with Glasses
-Young Guard
-Young Mother
-Young Nyssa
-Young Oliver
-Young Thug
-Young Tommy
-Zhishan
-Zoe Lawton
--- a/Dictionary/dict_Breaking_Bad.txt
+++ b/Dictionary/dict_Breaking_Bad.txt
@@ -1,370 +0,0 @@
-ABQ Detective #1
-ABQ Detective #2
-APD Detective Tim Roberts
-APD Officer
-ASAC George Merkert
-AUSA
-Addict
-Agent Buddy
-Airport Traveler
-Albuquerque Police Officer
-Amber
-Anchor #1
-Anchor #2
-Anchor #3
-Anchor #4
-Andrea Cantillo
-Arms Dealer
-Asst. US Attorney
-Backhoe Operator
-Bad Girl
-Badger
-Banger #1
-Bank Customer
-Bank Teller
-Bar Fighter
-Barfly
-Barry
-Bartender
-Ben
-Beneke Employee #1
-Beneke Employee #2
-Benicio Fuentes
-Beto
-Big Biker
-Biker
-Bingo Caller
-Bingo Lady
-Bob
-Bogdan Wolynetz
-Boy in Museum
-Brock Cantillo
-Burnout
-Business Community Leader #1
-Business Community Leader #2
-Businessman
-CID Special Agent
-Cab Driver
-Cancer Patient
-Car Wash Attendant
-Car Wash Customer
-Car Wash Patron
-CarWash Patron
-Cara
-Caregiver
-Carmen Molina
-Carol
-Carpet Cleaner
-Carpet Cleaner #2
-Cartel Gunman #1
-Cartel Gunman #2
-Cartel Gunman #3
-Cartel Gunman #4
-Cartel Henchman
-Carwash Customer
-Chad
-Chad's Girlfriend
-Charlie Rose
-Chemical Plant Guard
-Chemistry Student
-Chief Food Technician
-Chris Mara
-Chuck
-Clovis
-Colleen
-Combo
-Commercial Narrator
-Commercial Voice Over
-Concerned Parent
-Conductor
-Cop
-Cop #1
-Cop #2
-Corpse
-Customer
-DEA
-DEA Agent
-DEA Agent #1
-DEA Agent Artie
-DEA Agent Scott
-DEA Agent Tom
-DEA Agent Vanco
-DEA Point Man
-Dan Wachsberger
-Darla
-Daughter
-Dave
-Declan
-Declan's Crew Member
-Declan's Driver
-Delivery Man
-Delores
-Dennis Markowski
-Deputy #1
-Deputy Kee
-Detective
-Detective #1
-Detective #2
-Detective Kalanchoe
-Detective Munn
-Doctor
-Dog Handler
-Don Eladio
-Donald Margolis
-Dorothy Yobs
-Dr. Barry Goodman
-Dr. Belknap
-Dr. Chavez
-Dr. Delcavoli
-Dr. Soper
-Dr. Victor Bravenec
-Drew Sharp
-Duane Chow
-Duty Officer
-EMT
-ER Doctor #1
-ER Doctor #2
-Ed
-El Paso DEA Agent
-Elliott Schwartz
-Emilio Koyama
-Emotional Woman
-Engineer
-Farley
-Father
-Federale
-Female Employee
-Female Homeowner
-Fernando
-Fireman
-First Realtor
-Fran
-Francesca
-Frankie
-Friendly Agent
-Friendly Guy
-GYN
-Gaff
-Gale Boetticher
-Garduño's Diner
-Getz
-Gonzo
-Good Samaritan
-Government Lawyer
-Grandma
-Gretchen Schwartz
-Group Leader
-Gunman #1
-Gunman #2
-Gus' Operative
-Gustavo 'Gus' Fring
-Hank Schrader
-Henry Tyree
-Herr Herzog
-High School Student
-Homeless Man
-Homeless Man's Wife
-Homeowner
-Hospital Administrator
-Hospital Patient
-Hot Chick Cop
-Huell
-Hugo Archuleta
-Ira
-Irving
-Jack's Henchman
-Jake Pinkman
-James Edward Kilkelly
-Jane Margolis
-Janice
-Jeffrey
-Jesse Pinkman
-Jewelry Store Owner
-Jock
-Jock's Friend #1
-Jock's Friend #2
-Juan Bolsa
-Kaylee Ehrmantraut
-Ken Wins
-Kenny
-Kid
-Kiira
-Krazy-8
-Kuby
-Lady in Car
-Laundry Woman 1
-Laundry Woman 2
-Laundry Woman 3
-Laundry Worker
-Lawson
-Lawyer
-Lead Doctor
-Leonel Salamanca
-Lester
-Little Old Lady
-Local Correspondant
-Locksmith
-Look out
-Lookout
-Los Pollos Hermanos Cook
-Los Pollos Hermanos Patron
-Louis
-Lt. Adam Estiguez
-Lucy
-Lydia Rodarte-Quayle
-Ma Kettle
-Madrigal Suit
-Magnet Guy #2
-Mail Lady
-Male Homeowner
-Manager
-Marco Salamanca
-Mariano
-Marie Schrader
-Matt
-Max Arsiniega
-Medical Technician
-Meth Cook
-Meth Drug Dealer
-Miguel
-Mike Ehrmantraut
-Mike's Security Team 1
-Mike's Security Team 2
-Min-Ye
-Morning After Girl
-Mortgage Broker
-Mother
-Mr. Gardiner
-Mr. Pinkman
-Mr. Wilson
-Mrs. Ortega
-Mrs. Pinkman
-Mrs. Pope
-Ms. Tromel
-Music Producer
-NA Sponsor
-Narcocorridos Band #1
-Narcocorridos Band #2
-Native American Man
-Neighbor
-Neighborhood Boy
-Neighborhood Kid
-News Reporter
-No-Doze
-Nurse
-O.M.I. Attendant
-O.M.I. Officer
-OPR Official #1
-Ob-Gyn
-Off Duty Cop
-Office Manager
-Office Worker
-Old Crawler
-Old Joe
-Old Man
-Orderly
-Pa Kettle
-Pamela
-Parent
-Party Girl
-Partygoer
-Paul Tyree
-Pedestrian
-Pediatric Nurse
-Peng
-Peter Schuler
-Physical Therapist
-Police Officer
-Policeman
-Pollos Manager
-Preppy Shopper
-Prison Guard
-Prospective Buyer
-Public Defender
-Radio DJ #1
-Radio DJ #2
-Realtor
-Receptionist
-Rehab Group Girl
-Rehab Patient
-Restaurant Employee
-Restaurant Patron
-Rival Dealer #1
-Rival Dealer #2
-Ron Forenall
-Rookie Officer
-Rowdy Prisoner
-SAC Ramey
-Sad Faced Girl
-Sales Girl
-Salesman
-Sara Tyree
-Saul Goodman
-Saul's Client
-Scary Skell
-Schlubby Guy #1
-Schlubby Guy #2
-School Office Worker
-Scientist
-Screaming Shopper
-Secretary
-Senior DEA Agent
-Server
-Sexy Cartel Girl
-Sexy Neighbor
-Skater
-Skell
-Sketchy
-Skinny Pete
-Skycap
-Skyler White
-Soren
-Spooge
-Spooge's Woman
-Stephanie Doswell
-Steven Gomez
-Stew
-Stripper #1
-Stripper #2
-Student
-Supermarket Clerk
-Support Group Leader
-Support Group Member
-TV Reporter
-Tattooed Biker
-Tattooed Woman
-Teacher
-Technician
-Ted Beneke
-The Assassin
-Thug Buddy
-Tio Salamanca
-Tio's Nurse
-Todd
-Tomas
-Tortuga
-Trent
-Truck Guard 1
-Truck Guard 2
-Tucker
-Tuco Salamanca
-Tweaker Thief
-Tweaky Dude
-Tyrus Kitt
-Uncle Jack
-Union Rep
-Urinal Guy
-Victor
-Waiter
-Waitress
-Walter White
-Walter White, Jr.
-Warehouse Worker
-Wendy
-Wide Eyed Boy
-Wino
-Woman in Denny's
-Young Boy
-Young Leonel
-Young Marco
-Yuppie Woman
--- a/Dictionary/dict_Game_of_Thrones.txt
+++ b/Dictionary/dict_Game_of_Thrones.txt
@@ -1,614 +0,0 @@
-
-'That's Right' Man
-Addam Marbrand
-Adrack Humble
-Aeron Greyjoy
-Aerys Targaryen
-Aggo
-Alliser Thorne
-Alton Lannister
-Amory Lorch
-Anara
-Anguy
-Anya Waynwood
-Areo Hotah
-Armeca
-Arthur
-Arthur Dayne
-Arya Stark
-Ash
-Axell Florent
-Aya
-Baby Sam
-Balon Greyjoy
-Baratheon Archer
-Baratheon General
-Baratheon Guard
-Baratheon Officer
-Baratheon Soldier
-Baratheon Soldier #1
-Baratheon Soldier #2
-Barristan Selmy
-Bathhouse Boy
-Bathhouse Prostitute
-Bear Island Maester
-Beggar Woman
-Belicho Paenymion
-Benjen Stark
-Beric Dondarrion
-Bianca
-Biter
-Black Jack Bulwer
-Black Lorren
-Black Walder Rivers
-Boat Commander
-Bobono
-Bolton General
-Bolton Guard
-Bolton Officer
-Bolton Soldier
-Bowen Marsh
-Braavosi Captain
-Braavosi Madam
-Braavosi Theatre Server
-Braavosi Theatre Sound Artist
-Braavosi Woman #1
-Braavosi Woman #2
-Bran Stark
-Brant
-Brea
-Brienne of Tarth
-Bronn
-Brothel Child #1
-Brothel Child #2
-Brothel Customer
-Brothel Guard
-Brother Ray
-Brotherhood Member
-Brusco
-Brynden 'Blackfish' Tully
-Camello
-Captain of the Archers
-Captain of the Bolton Archers
-Captain's Daughter
-Catelyn Stark
-Catspaw Assassin
-Cersei Body Double
-Cersei Lannister
-Chella
-Child of the Forest
-Citadel Maester
-Clarenzo
-Clea
-Cley Cerwyn
-Colen of Greenpools
-Cooper
-Craster
-Craster's Wife
-Craster's Wife #2
-Craster's Wife #3
-Craster's Younger Wife
-Daario Naharis
-Daenerys Targaryen
-Dagmer Cleftjaw
-Daisy
-Davos Seaworth
-Denys Mallister
-Derek
-Desmond Crakehall
-Despondent Man
-Dickon Tarly
-Dim Dalba
-Donnel Hill
-Donnel Waynwood
-Dontos Hollard
-Doran Martell
-Doreah
-Dothraki
-Dothraki Bloodrider #1
-Dothraki Bloodrider #2
-Dothraki Crone
-Dothraki Man Having Sex
-Dothraki Widow #1
-Dothraki Widow #2
-Dothraki Woman Having Sex
-Dragonstone waiter
-Drennan
-Drowned Priest
-Drummer
-Drunk Patron
-Dwarf Hunter #1
-Dwarf Hunter #2
-Dying Man
-Eddard 'Ned' Stark
-Eddison Tollett
-Edmure Tully
-Elder Meereen Slave
-Ellaria Sand
-Eon Hunter
-Euron Greyjoy
-Eyrie Guard
-Faceless Man
-Faith Militant
-Faith Militant #1
-Faith Militant #2
-Farlen
-Farmer Hamlet
-Farmer's Daughter
-Fennesz
-Fighter
-First Mate
-Foreign Merchant
-Frances
-Frey Guard
-Frey Guardsman
-Frey Soldier #1
-Frey Soldier #2
-Frey Wedding Guest
-Fruit Vendor
-Gared
-Gatins
-Gendry
-Genna
-Gerald
-Gerold Hightower
-Ghita
-Gilly
-Glover General
-Goatherd
-Goatherd's Son
-Goldcloak
-Goldcloak #1
-Goldcloak #2
-Gordy
-Grand Maester Pycelle
-Great Master #1
-Great Master #2
-Great Master #3
-Great Master #4
-Great Master #5
-Great Master #6
-Great Master #7
-Greatjon Umber
-Gregor Clegane
-Greizhen mo Ullhor
-Grenn
-Grey Worm
-Guymon
-Hallyne
-Handmaid
-Harald Karstark
-Healtor Troop
-High Priestess
-High Septon
-High Sparrow
-Hizdahr zo Loraq
-Hodor
-Hog Farmer
-Hoster Tully
-Hot Pie
-Howland Reed
-Hugh of the Vale
-Iggo
-Illyrio Mopatis
-Ilyn Payne
-Imry Florent
-Inn Waitress
-Innkeeper
-Innkeeper's Daughter
-Ironborn #1
-Ironborn #2
-Ironborn #3
-Ironborn Abusing a Volantene Whore
-Ironborn at Brothel
-Ironborn in Skiff
-Irri
-Izembaro
-Jacks
-Jaime Lannister
-Janos Slynt
-Jaqen H'ghar
-Jaqen's Disguise
-Jaremy Rykker
-Jeor Mormont
-Jhiqui
-Joffrey Baratheon
-Johnna
-Jojen Reed
-Jon Arryn
-Jon Snow
-Jon Snow Soldier
-Jonos Bracken
-Jorah Mormont
-Jory Cassel
-Joss
-Joyeuse Erenford
-Karl Tanner
-Karsi
-Karstark Lead Archer
-Karstark Lookout
-Karstark Soldier
-Karstark Soldier #1
-Karstark Soldier #2
-Kayla
-Kegs
-Kesh
-Kevan Lannister
-Khal Brozho
-Khal Drogo
-Khal Forzho
-Khal Moro
-Khal Qorro
-Khal Rhalko
-Khaleesi Handmaiden #1
-King Balon Greyjoy Dwarf
-King Joffrey Baratheon Dwarf
-King Renly Baratheon Dwarf
-King Robb Stark Dwarf
-King Stannis Baratheon Dwarf
-King's Guard
-King's Landing Baker
-King's Landing Boaster
-King's Landing Drunkard
-King's Landing Flasher #1
-King's Landing Flasher #2
-King's Landing Handmaiden
-King's Landing Rioter #1
-King's Landing Rioter #2
-King's Landing Rioter #3
-King's Landing Tailor
-King's Landing Urchin
-King's Landing Whore
-Kinvara
-Knight of House Frey
-Knight of House Lynderly
-Kovarro
-Kraznys mo Nakloz
-Kurleket
-Lady Crane
-Lady Kitty Frey
-Lancel Lannister
-Lannister Archer
-Lannister Army Member
-Lannister Captain
-Lannister Guard
-Lannister Guard #1
-Lannister Guard #2
-Lannister Guard #3
-Lannister Guardsman
-Lannister Lord
-Lannister Messenger
-Lannister Scout
-Lannister Soldier
-Lannister Torturer
-Lead Dornish Guard
-Lead Kingsguard
-Leaf
-Lem Lemoncloak
-Leo Lefford
-Lhara
-Little Bird
-Little Bird #3
-Little Bird #4
-Little Bird #5
-Little Bird #6
-Little Bird #7
-Loboda
-Locke
-Lollys Stokeworth
-Lommy Greenhands
-Loras Tyrell
-Lord Blackmont
-Lord Galbart Glover
-Lord Portan
-Lord Varys
-Lord of Bones
-Lordsport Dockhand
-Lothar Frey
-Loyal Night's Watchman #1
-Loyal Night's Watchman #2
-Lyanna Mormont
-Lyanna Stark
-Lysa Arryn
-Mace Tyrell
-Maester Aemon
-Maester Caleotte
-Maester Cressen
-Maester Helliweg
-Maester Luwin
-Maester Wolkan
-Mag the Mighty
-Maggy
-Mago
-Male Prostitute
-Malko
-Mallister Supporter
-Mance Rayder
-Mandon Moore
-Manservant
-Marei
-Margaery Tyrell
-Margaery Tyrell Mummer
-Margaery's Handmaiden
-Marianne Frey
-Marillion
-Masha Heddle
-Master Torturer
-Matthos Seaworth
-Meera Reed
-Meereen Guard
-Meereen Slave
-Meereenese Homeless Mother
-Melara Hetherspoon
-Melessa Tarly
-Melisandre
-Merchant Captain
-Merchant in Tavern
-Mero
-Merry Frey
-Meryn Trant
-Mhaegen
-Mikken
-Mirelle
-Mirri Maz Duur
-Missandei
-Mole's Town Madam
-Mole's Town Whore
-Morag
-Mord
-Morgan
-Morgan's Friend
-Moro's Wife #1
-Moro's Wife #2
-Mossador
-Mully
-Mummer #2
-Mummer #3
-Mummer #4
-Musician #1
-Musician #2
-Musician #3
-Musician #4
-Musician #5
-Mycah
-Myranda
-Myrcella Baratheon
-Night's Watch
-Night's Watch Archer
-Night's Watch Deserter
-Night's Watch Messenger
-Night's Watch Officer
-Night's Watchman
-Night's Watchman #1
-Night's Watchman #2
-Nobel Man
-Noble Lady
-Noble Man
-Northern Lord
-Northman Archer
-Northman Rider
-Northman Rioter
-Nymeria Sand
-Obara Sand
-Oberyn Martell
-Old Man
-Old Nan
-Old Woman
-Old Woman Prisoner
-Olenna Tyrell
-Olly
-Olly's Mother
-Olyvar
-Orell
-Ornela
-Orphan Kid
-Osha
-Othell Yarwyck
-Oznak zo Pahl
-Peasant
-Pentoshi Servant
-Petyr 'Littlefinger' Baelish
-Pit Announcer
-Pit Fighter
-Podrick Payne
-Polliver
-Prendahl na Ghezn
-Prisoner
-Protestor
-Pyat Pree
-Pypar
-Qartheen Woman
-Qhono
-Qhorin Halfhand
-Qotho
-Quaithe
-Quent
-Qyburn
-Rakharo
-Ralf Kenning
-Ramsay Bolton
-Randyll Tarly
-Rast
-Rattleshirt
-Razdal mo Eraz
-Red Keep Stableboy
-Red Priestess
-Reginald Lannister
-Renly Baratheon
-Rennick
-Rhaego
-Rickard Karstark
-Rickard Stark
-Rickon Stark
-Riddell
-Riverlands Traveller
-Riverrun Nobleman
-Robb Stark
-Robert Baratheon
-Robett Glover
-Robin Arryn
-Rodrik Cassel
-Roose Bolton
-Rorge
-Ros
-Roslin Frey
-Royal Steward
-Ryger Rivers
-Sailor
-Salladhor Saan
-Samwell Tarly
-Sandor 'The Hound' Clegane
-Sansa Stark
-Second Son
-Sellsword #1
-Sellsword #2
-Selyse Baratheon
-Septa Moelle
-Septa Mordane
-Septa Scolera
-Septa Unella
-Septon
-Ser Endrew Tarth
-Shae
-Shagga
-Shireen Baratheon
-Silk King
-Simpson
-Singing Lannister Soldier
-Sissy
-Slaver
-Smalljon Umber
-Soldier
-Son of the Harpy
-Sorcerer
-Sparring Boy
-Spice King
-Stannis Baratheon
-Stark Guard
-Stark Messenger
-Stark Soldier
-Steelshanks Walton
-Steve
-Stevron Frey
-Stiv
-Stone Man
-Street Tough #1
-Street Tough #2
-Strong Fighter
-Strong Sam Stone
-Styr
-Syrio Forel
-Talisa Stark
-Talla Tarly
-Tansy
-Ternesio Terys
-The Bear
-The Crone
-The Maiden
-The Mother
-The Night King
-The Tickler
-The Waif
-The Waif's Disguise
-Thenn Warg
-Theon Greyjoy
-Theon's Master of Hounds
-Thin Man
-Thoros of Myr
-Three-Eyed Raven
-Timett
-Tobho Mott
-Todder
-Tomard
-Tommen Baratheon
-Tommen's Attendant
-Tommen's Manservant
-Tommy
-Tormund Giantsbane
-Torrhen Karstark
-Tortured Prisoner
-Tortured Slave
-Tourney Herald
-Trystane Martell
-Tully Bannerman
-Tully Soldier
-Tycho Nestoris
-Tyene Sand
-Tyrell Bannerman
-Tyrell Guard
-Tyrell Lady
-Tyrell Servant
-Tyrell Soldier
-Tyrion Lannister
-Tywin Lannister
-Unsullied
-Vala
-Valyrian Slave
-Vance Corbray
-Vardis Egen
-Varly
-Vayon Poole
-Violet
-Viserys Targaryen
-Volantene Whore
-Volantene Whore #1
-Volantene Whore #2
-Volantene Whore #3
-Volantene Whore #4
-Volantene Whore #5
-Waitress
-Walda Bolton
-Walder Frey
-Warlock
-Waymar Royce
-Wedding Band
-Wedding Guest
-Wendel Manderly
-Westerosi Trader
-White Rat
-White Walker
-White Walker #2
-Whore
-Whore #1
-Whore #2
-Wight
-Wight Wildling Girl
-Wilding Gladiator
-Wildling
-Wildling Rioter
-Will
-Willa
-Willem Lannister
-Willis Wode
-Wine Merchant
-Winter Town Man
-Winterfell Beekeeper
-Winterfell Shepherd
-Woodcutter
-Wounded Lannister
-Wun Wun
-Wyllis
-Wyman Manderly
-Xaro Xhoan Daxos
-Yara Greyjoy
-Yezzan zo Qaggaz
-Ygritte
-Yohn Royce
-Yoren
-Young Benjen Stark
-Young Braavosi
-Young Cersei Lannister
-Young Lyanna Stark
-Young Nan
-Young Ned
-Young Ned Stark
-Young Nobleman
-Young Rodrik Cassel
-Yunkai Citizen
-Yunkai'i Slave #1
-Yunkai'i Slave #2
-Yunkai'i Slave #3
-Yunkai'i Slave #4
-Yunkai'i Slave #5
-Yunkai'i Whore
-Zanrush
--- a/Dictionary/dict_House.txt
+++ b/Dictionary/dict_House.txt
--- a/Dictionary/dict_adventure_time.txt
+++ b/Dictionary/dict_adventure_time.txt
@@ -1,54 +0,0 @@
-Ancient Psychic Tandem War Elephant
-Banana Guard
-Candy Kingdom
-Candy People
-Choose Goose
-Cinnamon Bun
-City of Thieves
-Colonel Candycorn
-Cosmic Owl
-Crab Princess
-Dr. Donut
-Dr. Ice Cream
-Duchess of Nuts
-Earl of Lemongrab
-Everything Burrito
-Finn the Human
-Fire Kingdom
-Flame Princess
-Flying Lettuce Bros.
-Ghost Princess
-Hotdog Knight
-Ice King
-Ice Kingdom
-Jake the Dog
-Lady Rainicorn
-Lake Butterscotch
-Land of Ooo
-Lumpy Space Princess
-Marauder Village
-Marshmallow Kid
-Mr. Cream Puff
-Muscle Princess
-Nice King
-Nice Knights
-Nightosphere
-Nurse Poundcake
-Old Lady Princess
-Party Pat
-Peppermint Butler
-Pillow World
-Princess Bubblegum
-Raggedy Princess
-Root Beer Guy
-Sir Slicer
-Skeleton Princess
-Slime Princess
-Snow Golem
-The Enchiridion
-The Lich
-Toast Princess
-Tree Fort
-Tree Trunks
-Wildberry Princess
-Wizard Battle
--- a/Dictionary/dict_doctor_who.txt
+++ b/Dictionary/dict_doctor_who.txt
@@ -1,25 +0,0 @@
-Adipose
-Amy Pond
-Clara Oswin Oswald
-Cybermen
-Dalek
-Davros
-Donna Noble
-Jack Harkness
-Judoon
-K-9
-Martha Jones
-Master
-Mickey Smith
-Missy
-Ood
-River Song
-Rory Williams
-Rose Tyler
-Sontarans
-Tardis
-Time Lord
-The Doctor
-The Silence
-Weeping Angel
-Zygon
--- a/Dictionary/dict_glee.txt
+++ b/Dictionary/dict_glee.txt
@@ -1,97 +0,0 @@
-Glee Club
-New Directions
-Will Schuester
-Sue Sylvester
-Emma Pillsbury
-Terri Schuester
-Arthur Abrams
-Artie Abrams
-Tina Cohen-Chang
-Brittany Pierce
-Glease
-Finn Hudson
-Film School
-Lima
-Unique Adams
-Vocal Adrenaline
-Glee Club Regionals
-Jesse St. James
-Nationals
-William McKinley High School
-Rizzo
-Grease
-Ryder Lynn
-Mr. Shue
-Coach Shannon Beiste
-Blaine Devon Anderson
-Dalton Academy
-The Warblers
-Sectionals
-NYADA
-June Dolloway
-Dave Karofsky
-McKinley High Titans
-Principal Figgins
-Cooter Menkins
-Roz Washington
-Rachel Barbra Berry
-Jesse St. James
-Shelby Corcoran
-Cassandra July
-Brody Weston
-Funny Girl
-Michel Robert "Mike" Chang, Jr.
-Joffrey Ballet
-Asian Camp
-Sam Evans
-Lucy Quinn Fabray
-Cheerios
-Beth
-Skanks
-Finn Hudson
-Marley Rose
-Burt Hummel
-Mercedes Jones
-Santana Lopez
-Ryder Lynn
-Jake Puckerman
-Noah Puckerman
-Kitty Wilde
-Alistair
-Azimio
-Jacob Ben Israel
-Rory Flanagan
-Joe Hart
-Jane Hayward
-Becky Jackson
-Madison McCarthy
-Sugar Motta
-Myron Muskovitz
-Bob Harris
-Spencer Porter
-Roderick
-Matt Rutherford
-Lauren Zizes
-Jazz Ensemble
-Hank Saunders
-Suzy Pepper
-Shane Tinsley
-Rick "The Stick" Nelson
-Lillian Adler
-Holly Holliday
-Acafellas
-Sandy Ryerson
-Ken Tanaka
-Sunshine Corazón
-Dustin Goolsby
-Harmony
-Grace Hitchens
-Sebastian Smythe
-Kendra Giardi
-Carl Howell
-Carole Hudson
-Millie Rose
-April Rhodes
-Bryan Ryan
-
-
--- a/Dictionary/dict_greys.anatomy.txt
+++ b/Dictionary/dict_greys.anatomy.txt
@@ -1,59 +0,0 @@
-Grey’s Anatomy
-Meredith Grey
-Lexie Grey
-Ellis Grey
-Thatcher Grey
-Derek Shepherd
-Amelia Shepherd
-Owen Hunt
-Dr. Margaret Pierce
-Dr. Teddy Altman
-Alex Karev
-Callie Torres
-Izzie Stevens
-Christina Yang
-Mark Sloan
-Jackson Avery
-Leah Murphy
-April Kepner
-Arizona Robbins
-George O'Malley
-Preston Bruke
-Miranda Bailey
-Denny Duquette
-Dr. Addison Montgomery
-Richard Webber
-Adele Webber
-Jo Wilson
-Andrew Deluca
-Nathan Riggs
-Erica Hahn
-Sadie Harris
-Stephanie Edwards
-Jason Myers
-Dr. Nicole Herman
-Hannah Davies
-Shane Ross
-Seattle Grace Hospital
-Mercy West Medical Center
-Seattle Grace Mercy West Hospital
-Denny Duquette Memorial Clinic
-Grey Sloan Memorial Hospital
-Mayo Clinic
-Cleveland Clinic
-Portland General Hospital
-Seattle Presbyterian Hospital
-Klausman Institute for Medical Research
-Roseridge Home for Extended Care
-Veterans Rehabilitation Center
-Trauma Center
-Emergency Room
-Intensive Care Unit
-Neonatal Intensive Care Unit
-Operating Room
-On-Call Room
-Chasing Cars
-Snow Patrol
-
-
-
--- a/Dictionary/dict_how_to_get_away_with_murder.txt
+++ b/Dictionary/dict_how_to_get_away_with_murder.txt
@@ -1,42 +0,0 @@
-Annalise Keating
-Anna Mae Harkness
-Nate Lahey
-Wes Gibbins
-Connor Walsh
-Michaela Pratt
-Asher Millstone
-Laurel Castillo
-Frank Delfino
-Bonnie Winterbottom
-Oliver Hampton
-Rebecca Sutter
-Sam Keating
-Caleb Hapstall
-Catherine Hapstall
-Emily Sinclair
-Meggy Travers
-Simon Drake
-Soraya Hargrove
-Phillip Jessup
-Eve Rothlow
-Lila Stangard
-Bill Millstone
-A.D.A. Rene Atwood
-Kan
-Griffin O'Reilly
-Rose
-Detective Mumford
-A.D.A. Todd Denver
-D.A. Wendy Parks
-Levi Sutter
-Wallace Mahoney
-Charles Mahoney
-Vince Levin
-Christophe
-Hannah Keating
-Middleton University
-Keating 5
-Who's Under the Sheet
-Keating House
-HTGAWM
-pro-bono law clinic 
--- a/Dictionary/dict_master_of_none.txt
+++ b/Dictionary/dict_master_of_none.txt
@@ -1,11 +0,0 @@
-Dev
-Rachel
-Go-Gurt
-Arnold
-Brian
-Denise
-The Sickening
-Nina
-Nashville
-Paro
-Benjamin
--- a/Dictionary/dict_mr_robot.txt
+++ b/Dictionary/dict_mr_robot.txt
@@ -1,36 +0,0 @@
-Elliot Alderson
-Mr. Robot
-Darlene
-Angela Moss
-Tyrell Wellick
-Joanna Wellick
-Phillip Price
-Federal Bureau of Investigation
-Fun Society
-Gideon Goddard
-Lloyd Chung
-Ollie Parker
-E Corp
-Evil Corp
-Terry Colby
-Scott Knowles
-Sharon Knowles
-Mr. Sutherland
-Antara Nayar
-Krista Gordon
-Shayla Nico
-Fernando Vera
-Elliot's Mother
-The Hackers
-fsociety
-Romero
-Trenton
-Mobley 
-The Dark Army
-Whiterose
-Cisco
-New York
-Evil Corp Headquarters
-Allsafe Cybersecurity
-Ron’s Coffee
-Python
--- a/Dictionary/dict_new_girl.txt
+++ b/Dictionary/dict_new_girl.txt
@@ -1,11 +0,0 @@
-Jess
-Jessica Day
-Nick Miller
-Winston Bishop
-Schmidt 
-Cece Parekh
-Coach
-Latvian Basketball League
-Ferguson
-True American
-Los Angeles middle school
--- a/Dictionary/dict_sense8.txt
+++ b/Dictionary/dict_sense8.txt
@@ -1,59 +0,0 @@
-Sense8
-Abraham
-Amanita
-Amondi Kabaka
-Angelica Turing
-Anton Bogdanow
-Bug
-Capheus "Van Damme"
-Daniela Velasquez
-Daya Dandekar
-Diego Morales
-Dr. Metzger
-Felix Berner
-Githu
-Grace
-Gunnar
-Hassan Bogdanow
-Hernando
-Jacks
-Janet Marks
-Jela
-Joaquin Flores
-Jonas Maliki
-Joong-Ki Bak
-Kala Dandekar
-Kang-Dae Bak
-Wolfgang Bogdanow
-Lina
-Lito Rodriguez
-Lúna Magnúsdóttir
-Magnús Þórsson
-Manendra Rasal
-Mi-Cha
-Michael Gorski
-Min-Jung
-Mr. Whispers
-Niles Bolger
-Nomi Marks
-Nyx
-Prisoner 818
-Priya Dandekar
-Purab Kohli
-Rajan Rasal
-Riley Blue
-Sahana Rasal
-Sanyam Dandekar
-Sanyam Dendekar
-Sara Patrell
-Sergei Bogdanow
-Shiro
-Silas Kabaka
-Soo-Jin
-Steiner Bogdanow
-Sun Bak
-Sven
-Teagan Marks
-Will Gorski
-Yrsa
-Sensates
--- a/Dictionary/dict_sherlock.txt
+++ b/Dictionary/dict_sherlock.txt
@@ -1,37 +0,0 @@
-Ajay
-Alex
-Anderson
-Baker Street
-Bill Wiggins
-Charles Augustus Magnussen
-Charlie Welsborough
-Detective Inspector Lestrade
-Gabriel
-Greg Lestrade
-Irene Adler
-James Moriarty
-Jim Moriarty
-John Watson
-John Hamish Watson
-Karim
-Lady Smallwood
-Lestrade
-Magnussen
-Margaret Thatcher
-Mary Morstan
-Mary Watson
-Mike Stamford
-Molly Hooper
-Moriarty
-Mrs. Hudson
-Mycroft Holmes
-Norbury
-Philip Anderson
-Rosamund Mary
-Rosie
-Sally Donovan
-Samarra
-Sergeant Donovan
-Sherlock Holmes
-Tbilisi
-Vivian Norbury
--- a/Dictionary/dict_smash.txt
+++ b/Dictionary/dict_smash.txt
@@ -1,42 +0,0 @@
-Smash
-Julia Houston
-Derek Wills
-Karen Cartwright
-Tom Levitt
-Ivy Lynn
-Eileen Rand
-Jimmy Collins
-Sam Strickland
-Kyle Bishop
-Ana Vargas
-Ellis Boyd
-Dev Sundaram
-Frank Houston
-Lyle West
-Leigh Conroy
-Rebecca Duvall
-Veronica Moore
-Terrence Falls
-Linda
-Lisa McMann
-Roger Cartwright
-Mrs. Cartwright
-Jerry Rand
-Leo Houston
-Bobby
-Dennis
-Jessica
-Sue
-Nick Felder
-Michael Swift
-John Goodwin
-Daisy Parker
-R.J.
-Monica Swift
-Scott Nichols
-Margot
-Agnes
-Bombshell
-Hit List
-Heaven on Earth
-Liaisons
--- a/Dictionary/dict_soul_eater.txt
+++ b/Dictionary/dict_soul_eater.txt
@@ -1,41 +0,0 @@
-Arachne
-Arachnophobia
-Asura
-Azusa
-Black Star
-Blair
-Crona
-Death City
-Death the Kid
-Death Weapon Meister Academy
-DWMA
-Eibon
-Eruka
-Excalibur
-Franken Stein
-Free
-Giriko
-Joe Buttataki
-Justin
-Kilik
-Kishin
-Liz
-Lord Death
-Maka
-Maka Albarn
-Marie
-Masamune
-Medusa
-Meister
-Mifune
-Mizune
-Mosquito
-Ox Ford
-Patty
-Professor Stein
-Sid
-Soul
-Soul Eater
-Spirit
-Tsubaki
-Weapon
--- a/Dictionary/dict_steven_universe.txt
+++ b/Dictionary/dict_steven_universe.txt
@@ -1,16 +0,0 @@
-Amethyst
-Beach City
-Cookie Cat
-Crying Breakfast Friends
-Crystal Gems
-Crystal Temple
-Earthlings
-Fryman
-Garnet
-Lion
-Pearl
-Peridot
-Rose Quartz
-Ruby
-Sapphire
-Steven Universe
--- a/Dictionary/dict_stranger_things.txt
+++ b/Dictionary/dict_stranger_things.txt
@@ -1,84 +0,0 @@
-Stranger Things
-Murray Bauman
-Becky Ives
-Benny Hammond
-Bill
-Billy
-Martin Brenner
-Jonathan Byers
-Joyce Byers
-Lonnie Byers
-Will Byers
-Callahan
-Carol
-Russell Coleman
-Connie Frazier
-Cynthia
-Dark dimension creature
-David O'Bannon
-Diane
-Donald Melvald
-Earl
-Eel-like creature
-Elevator Scientist
-Eleven
-El
-011
-The Weirdo
-Eleanor
-Florence
-Tommy H.
-Steve Harrington
-Dustin Henderson
-Barbara Holland
-Jim Hopper
-Sarah Hopper
-James
-Jeffrey
-Jen
-Jennifer Hayes
-Lead Agent
-Marissa
-Max
-The Monster
-Mrs. Holland
-Bob Newby
-Nicole
-Dr. Owens
-Pastor Charles
-Patrick
-Powell
-Roman
-Russian Agent
-Sandra
-Scott Clarke
-Shepard
-Lucas Sinclair
-Slug-like creature
-Terry Ives
-Troy
-Troy's mother
-Holly Wheeler
-Karen Wheeler
-Mike Wheeler
-Nancy Wheeler
-Ted Wheeler
-Demogorgon
-Hawkins National Laboratory
-Upside Down
-Hawkins Middle School
-Project MKUltra
-Dungeons & Dragons
-Operation Mirkwood
-Heathkit ham shack
-Castle Byers
-Hawkins General Hospital
-Hawkins Police Station
-Hawkins High School
-Quarry
-Etowah
-Benny's Burgers
-Bradley's Big Buy
-Byers house
-Downtown Hawkins
-Roane County Coroner
--- a/Dictionary/dict_the.big.bang.theory.txt
+++ b/Dictionary/dict_the.big.bang.theory.txt
@@ -1,30 +0,0 @@
-The Big Bang Theory
-Penny
-Leonard Hofstadter
-Sheldon Cooper
-Raj Koothrappali
-Bernadette Rostenkowski
-Howard Wolowitz
-Amy Farrah Fowler
-Leslie Winkle
-Stuart Bloom
-Arthur Jeffries
-Mrs. Wolowitz
-Barry Kripke
-Priya Koothrappali
-Mrs. Koothrappali
-Mr. Koothrappali
-Lucy
-Sheldon’s Spot
-The Apartment Building
-Apartment 4A/B
-The Laundry Room
-The Roof
-Wolowitzs' House
-Capitol Comics
-The Cheesecake Factory
-The Comic Center of Pasadena
-California Institute of Technology
-Massachusetts Institute of Technology
-Jet Propulsion Laboratory
-Pasadena
--- a/Dictionary/dict_the_it_crowd.txt
+++ b/Dictionary/dict_the_it_crowd.txt
@@ -1,23 +0,0 @@
-Arsenal Football Club
-Aunt Irma
-Big Ben
-Countdown
-Dragon's Den
-Emergency Services
-Employee of the Month
-Friendface
-Gay: A Gay Musical
-Information Technology
-Jen Barber 
-Lonely Hearts
-Maurice Moss
-Random Access Memory
-Sea Parks
-Spaceology
-The Banner
-The Evening Informer 
-The Internet
-The London Echo
-Tnetennba
-Windows Vista
-Word
--- a/Dictionary/dict_white_collar.txt
+++ b/Dictionary/dict_white_collar.txt
@@ -1,37 +0,0 @@
-Neal Caffrey
-Mozzie
-Peter Burke
-Sara Ellis
-Elizabeth Burke
-Diana Berrigan
-Lauren Cruz
-Clinton Jones
-Kate Moreau
-Garrett Fowler
-Alex Hunter
-Vincent Adler
-Special Agent
-White Collar Division
-June Ellington
-Reese Hughes
-Matthew Keller
-Rebecca Lowe
-Rachel Turner
-Cindy
-Christie
-Senator Terrence Pratt
-Amanda Callaway
-David Siegel
-Operation Mentor
-Office of Professional Responsibility
-Samantha
-Sterling-Bosch
-Kali
-Teddy Winters
-Detroit Mob
-Burke's Seven
-Mrs. Suit
-Special Agent in Charge
-Satchmo
-Dutchman
-
--- a/LICENSE.txt
+++ b/LICENSE.txt
@@ -0,0 +1,339 @@
+                    GNU GENERAL PUBLIC LICENSE
+                       Version 2, June 1991
+
+ Copyright (C) 1989, 1991 Free Software Foundation, Inc.,
+ 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
+ Everyone is permitted to copy and distribute verbatim copies
+ of this license document, but changing it is not allowed.
+
+                            Preamble
+
+  The licenses for most software are designed to take away your
+freedom to share and change it.  By contrast, the GNU General Public
+License is intended to guarantee your freedom to share and change free
+software--to make sure the software is free for all its users.  This
+General Public License applies to most of the Free Software
+Foundation's software and to any other program whose authors commit to
+using it.  (Some other Free Software Foundation software is covered by
+the GNU Lesser General Public License instead.)  You can apply it to
+your programs, too.
+
+  When we speak of free software, we are referring to freedom, not
+price.  Our General Public Licenses are designed to make sure that you
+have the freedom to distribute copies of free software (and charge for
+this service if you wish), that you receive source code or can get it
+if you want it, that you can change the software or use pieces of it
+in new free programs; and that you know you can do these things.
+
+  To protect your rights, we need to make restrictions that forbid
+anyone to deny you these rights or to ask you to surrender the rights.
+These restrictions translate to certain responsibilities for you if you
+distribute copies of the software, or if you modify it.
+
+  For example, if you distribute copies of such a program, whether
+gratis or for a fee, you must give the recipients all the rights that
+you have.  You must make sure that they, too, receive or can get the
+source code.  And you must show them these terms so they know their
+rights.
+
+  We protect your rights with two steps: (1) copyright the software, and
+(2) offer you this license which gives you legal permission to copy,
+distribute and/or modify the software.
+
+  Also, for each author's protection and ours, we want to make certain
+that everyone understands that there is no warranty for this free
+software.  If the software is modified by someone else and passed on, we
+want its recipients to know that what they have is not the original, so
+that any problems introduced by others will not reflect on the original
+authors' reputations.
+
+  Finally, any free program is threatened constantly by software
+patents.  We wish to avoid the danger that redistributors of a free
+program will individually obtain patent licenses, in effect making the
+program proprietary.  To prevent this, we have made it clear that any
+patent must be licensed for everyone's free use or not licensed at all.
+
+  The precise terms and conditions for copying, distribution and
+modification follow.
+
+                    GNU GENERAL PUBLIC LICENSE
+   TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION
+
+  0. This License applies to any program or other work which contains
+a notice placed by the copyright holder saying it may be distributed
+under the terms of this General Public License.  The "Program", below,
+refers to any such program or work, and a "work based on the Program"
+means either the Program or any derivative work under copyright law:
+that is to say, a work containing the Program or a portion of it,
+either verbatim or with modifications and/or translated into another
+language.  (Hereinafter, translation is included without limitation in
+the term "modification".)  Each licensee is addressed as "you".
+
+Activities other than copying, distribution and modification are not
+covered by this License; they are outside its scope.  The act of
+running the Program is not restricted, and the output from the Program
+is covered only if its contents constitute a work based on the
+Program (independent of having been made by running the Program).
+Whether that is true depends on what the Program does.
+
+  1. You may copy and distribute verbatim copies of the Program's
+source code as you receive it, in any medium, provided that you
+conspicuously and appropriately publish on each copy an appropriate
+copyright notice and disclaimer of warranty; keep intact all the
+notices that refer to this License and to the absence of any warranty;
+and give any other recipients of the Program a copy of this License
+along with the Program.
+
+You may charge a fee for the physical act of transferring a copy, and
+you may at your option offer warranty protection in exchange for a fee.
+
+  2. You may modify your copy or copies of the Program or any portion
+of it, thus forming a work based on the Program, and copy and
+distribute such modifications or work under the terms of Section 1
+above, provided that you also meet all of these conditions:
+
+    a) You must cause the modified files to carry prominent notices
+    stating that you changed the files and the date of any change.
+
+    b) You must cause any work that you distribute or publish, that in
+    whole or in part contains or is derived from the Program or any
+    part thereof, to be licensed as a whole at no charge to all third
+    parties under the terms of this License.
+
+    c) If the modified program normally reads commands interactively
+    when run, you must cause it, when started running for such
+    interactive use in the most ordinary way, to print or display an
+    announcement including an appropriate copyright notice and a
+    notice that there is no warranty (or else, saying that you provide
+    a warranty) and that users may redistribute the program under
+    these conditions, and telling the user how to view a copy of this
+    License.  (Exception: if the Program itself is interactive but
+    does not normally print such an announcement, your work based on
+    the Program is not required to print an announcement.)
+
+These requirements apply to the modified work as a whole.  If
+identifiable sections of that work are not derived from the Program,
+and can be reasonably considered independent and separate works in
+themselves, then this License, and its terms, do not apply to those
+sections when you distribute them as separate works.  But when you
+distribute the same sections as part of a whole which is a work based
+on the Program, the distribution of the whole must be on the terms of
+this License, whose permissions for other licensees extend to the
+entire whole, and thus to each and every part regardless of who wrote it.
+
+Thus, it is not the intent of this section to claim rights or contest
+your rights to work written entirely by you; rather, the intent is to
+exercise the right to control the distribution of derivative or
+collective works based on the Program.
+
+In addition, mere aggregation of another work not based on the Program
+with the Program (or with a work based on the Program) on a volume of
+a storage or distribution medium does not bring the other work under
+the scope of this License.
+
+  3. You may copy and distribute the Program (or a work based on it,
+under Section 2) in object code or executable form under the terms of
+Sections 1 and 2 above provided that you also do one of the following:
+
+    a) Accompany it with the complete corresponding machine-readable
+    source code, which must be distributed under the terms of Sections
+    1 and 2 above on a medium customarily used for software interchange; or,
+
+    b) Accompany it with a written offer, valid for at least three
+    years, to give any third party, for a charge no more than your
+    cost of physically performing source distribution, a complete
+    machine-readable copy of the corresponding source code, to be
+    distributed under the terms of Sections 1 and 2 above on a medium
+    customarily used for software interchange; or,
+
+    c) Accompany it with the information you received as to the offer
+    to distribute corresponding source code.  (This alternative is
+    allowed only for noncommercial distribution and only if you
+    received the program in object code or executable form with such
+    an offer, in accord with Subsection b above.)
+
+The source code for a work means the preferred form of the work for
+making modifications to it.  For an executable work, complete source
+code means all the source code for all modules it contains, plus any
+associated interface definition files, plus the scripts used to
+control compilation and installation of the executable.  However, as a
+special exception, the source code distributed need not include
+anything that is normally distributed (in either source or binary
+form) with the major components (compiler, kernel, and so on) of the
+operating system on which the executable runs, unless that component
+itself accompanies the executable.
+
+If distribution of executable or object code is made by offering
+access to copy from a designated place, then offering equivalent
+access to copy the source code from the same place counts as
+distribution of the source code, even though third parties are not
+compelled to copy the source along with the object code.
+
+  4. You may not copy, modify, sublicense, or distribute the Program
+except as expressly provided under this License.  Any attempt
+otherwise to copy, modify, sublicense or distribute the Program is
+void, and will automatically terminate your rights under this License.
+However, parties who have received copies, or rights, from you under
+this License will not have their licenses terminated so long as such
+parties remain in full compliance.
+
+  5. You are not required to accept this License, since you have not
+signed it.  However, nothing else grants you permission to modify or
+distribute the Program or its derivative works.  These actions are
+prohibited by law if you do not accept this License.  Therefore, by
+modifying or distributing the Program (or any work based on the
+Program), you indicate your acceptance of this License to do so, and
+all its terms and conditions for copying, distributing or modifying
+the Program or works based on it.
+
+  6. Each time you redistribute the Program (or any work based on the
+Program), the recipient automatically receives a license from the
+original licensor to copy, distribute or modify the Program subject to
+these terms and conditions.  You may not impose any further
+restrictions on the recipients' exercise of the rights granted herein.
+You are not responsible for enforcing compliance by third parties to
+this License.
+
+  7. If, as a consequence of a court judgment or allegation of patent
+infringement or for any other reason (not limited to patent issues),
+conditions are imposed on you (whether by court order, agreement or
+otherwise) that contradict the conditions of this License, they do not
+excuse you from the conditions of this License.  If you cannot
+distribute so as to satisfy simultaneously your obligations under this
+License and any other pertinent obligations, then as a consequence you
+may not distribute the Program at all.  For example, if a patent
+license would not permit royalty-free redistribution of the Program by
+all those who receive copies directly or indirectly through you, then
+the only way you could satisfy both it and this License would be to
+refrain entirely from distribution of the Program.
+
+If any portion of this section is held invalid or unenforceable under
+any particular circumstance, the balance of the section is intended to
+apply and the section as a whole is intended to apply in other
+circumstances.
+
+It is not the purpose of this section to induce you to infringe any
+patents or other property right claims or to contest validity of any
+such claims; this section has the sole purpose of protecting the
+integrity of the free software distribution system, which is
+implemented by public license practices.  Many people have made
+generous contributions to the wide range of software distributed
+through that system in reliance on consistent application of that
+system; it is up to the author/donor to decide if he or she is willing
+to distribute software through any other system and a licensee cannot
+impose that choice.
+
+This section is intended to make thoroughly clear what is believed to
+be a consequence of the rest of this License.
+
+  8. If the distribution and/or use of the Program is restricted in
+certain countries either by patents or by copyrighted interfaces, the
+original copyright holder who places the Program under this License
+may add an explicit geographical distribution limitation excluding
+those countries, so that distribution is permitted only in or among
+countries not thus excluded.  In such case, this License incorporates
+the limitation as if written in the body of this License.
+
+  9. The Free Software Foundation may publish revised and/or new versions
+of the General Public License from time to time.  Such new versions will
+be similar in spirit to the present version, but may differ in detail to
+address new problems or concerns.
+
+Each version is given a distinguishing version number.  If the Program
+specifies a version number of this License which applies to it and "any
+later version", you have the option of following the terms and conditions
+either of that version or of any later version published by the Free
+Software Foundation.  If the Program does not specify a version number of
+this License, you may choose any version ever published by the Free Software
+Foundation.
+
+  10. If you wish to incorporate parts of the Program into other free
+programs whose distribution conditions are different, write to the author
+to ask for permission.  For software which is copyrighted by the Free
+Software Foundation, write to the Free Software Foundation; we sometimes
+make exceptions for this.  Our decision will be guided by the two goals
+of preserving the free status of all derivatives of our free software and
+of promoting the sharing and reuse of software generally.
+
+                            NO WARRANTY
+
+  11. BECAUSE THE PROGRAM IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY
+FOR THE PROGRAM, TO THE EXTENT PERMITTED BY APPLICABLE LAW.  EXCEPT WHEN
+OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES
+PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED
+OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
+MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.  THE ENTIRE RISK AS
+TO THE QUALITY AND PERFORMANCE OF THE PROGRAM IS WITH YOU.  SHOULD THE
+PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING,
+REPAIR OR CORRECTION.
+
+  12. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING
+WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR
+REDISTRIBUTE THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES,
+INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING
+OUT OF THE USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED
+TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY
+YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER
+PROGRAMS), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE
+POSSIBILITY OF SUCH DAMAGES.
+
+                     END OF TERMS AND CONDITIONS
+
+            How to Apply These Terms to Your New Programs
+
+  If you develop a new program, and you want it to be of the greatest
+possible use to the public, the best way to achieve this is to make it
+free software which everyone can redistribute and change under these terms.
+
+  To do so, attach the following notices to the program.  It is safest
+to attach them to the start of each source file to most effectively
+convey the exclusion of warranty; and each file should have at least
+the "copyright" line and a pointer to where the full notice is found.
+
+    <one line to give the program's name and a brief idea of what it does.>
+    Copyright (C) <year>  <name of author>
+
+    This program is free software; you can redistribute it and/or modify
+    it under the terms of the GNU General Public License as published by
+    the Free Software Foundation; either version 2 of the License, or
+    (at your option) any later version.
+
+    This program is distributed in the hope that it will be useful,
+    but WITHOUT ANY WARRANTY; without even the implied warranty of
+    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+    GNU General Public License for more details.
+
+    You should have received a copy of the GNU General Public License along
+    with this program; if not, write to the Free Software Foundation, Inc.,
+    51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+
+Also add information on how to contact you by electronic and paper mail.
+
+If the program is interactive, make it output a short notice like this
+when it starts in an interactive mode:
+
+    Gnomovision version 69, Copyright (C) year name of author
+    Gnomovision comes with ABSOLUTELY NO WARRANTY; for details type `show w'.
+    This is free software, and you are welcome to redistribute it
+    under certain conditions; type `show c' for details.
+
+The hypothetical commands `show w' and `show c' should show the appropriate
+parts of the General Public License.  Of course, the commands you use may
+be called something other than `show w' and `show c'; they could even be
+mouse-clicks or menu items--whatever suits your program.
+
+You should also get your employer (if you work as a programmer) or your
+school, if any, to sign a "copyright disclaimer" for the program, if
+necessary.  Here is a sample; alter the names:
+
+  Yoyodyne, Inc., hereby disclaims all copyright interest in the program
+  `Gnomovision' (which makes passes at compilers) written by James Hacker.
+
+  <signature of Ty Coon>, 1 April 1989
+  Ty Coon, President of Vice
+
+This General Public License does not permit incorporating your program into
+proprietary programs.  If your program is a subroutine library, you may
+consider it more useful to permit linking proprietary applications with the
+library.  If this is what you want to do, use the GNU Lesser General
+Public License instead of this License.
--- a/OpenBSD/Makefile
+++ b/OpenBSD/Makefile
@@ -3,8 +3,8 @@
 MAINTAINER = 	Marc Espie <espie@openbsd.org>
 CATEGORIES =	multimedia
 COMMENT =	closed caption subtitles extractor
-HOMEPAGE = 	http://ccextractor.sourceforge.net/
-V =		0.77
+HOMEPAGE = 	https://ccextractor.org
+V =		0.96.5
 DISTFILES =	ccextractor.${V:S/.//}-src.zip
 MASTER_SITES =	${MASTER_SITE_SOURCEFORGE:=ccextractor/}
 DISTNAME =	ccextractor-$V
--- a/README.md
+++ b/README.md
@@ -1,68 +1,119 @@
-![logo](https://avatars3.githubusercontent.com/u/7253637?v=3&s=100)
- 
+<img src ="https://github.com/CCExtractor/ccextractor-org-media/blob/master/static/ccx_logo_transparent_800x600.png" width="200px" alt="logo" />
+
 # CCExtractor

-CCExtractor is a tool that produces subtitles from TV use. Global accessibility (all users, all content, all countries) is the goal. With so many different formats, this is a constantly moving target, but we intend to keep up with all sources and formats.
+[![Sample-Platform Build Status Windows](https://sampleplatform.ccextractor.org/static/img/status/build-windows.svg?maxAge=1800)](https://sampleplatform.ccextractor.org/test/master/windows)
+[![Sample-Platform Build Status Linux](https://sampleplatform.ccextractor.org/static/img/status/build-linux.svg?maxAge=1800)](https://sampleplatform.ccextractor.org/test/master/linux)
+[![SourceForge](https://img.shields.io/badge/SourceForge%20downloads-213k%2Ftotal-brightgreen.svg)](https://sourceforge.net/projects/ccextractor/)
+[![GitHub All Releases](https://img.shields.io/github/downloads/CCExtractor/CCExtractor/total.svg)](https://github.com/CCExtractor/ccextractor/releases/latest)

-Carlos' version (mainstream) is the most stable branch.
+CCExtractor is a tool used to produce subtitles for TV recordings from almost anywhere in the world. We intend to keep up with all sources and formats.
+
+Subtitles are important for many people. If you're learning a new language, subtitles are a great way to learn it from movies or TV shows. If you are hard of hearing, subtitles can help you better understand what's happening on the screen. We aim to make it easy to generate subtitles by using the command line tool or Windows GUI.
+
+The official repository is ([CCExtractor/ccextractor](https://github.com/CCExtractor/ccextractor)) and master being the most stable branch.
+
+### **Features**
+
+- Extract subtitles in real-time
+- Translate subtitles
+- Extract closed captions from DVDs
+- Convert closed captions to subtitles
+
+### Programming Languages & Technologies
+
+The core functionality is written in C. Other languages used include C++ and Python.

 ## Installation and Usage

-Downloads for precompiled binaries and source code can be found [on our website](http://www.ccextractor.org/doku.php?id=public:general:downloads).
+Downloads for precompiled binaries and source code can be found [on our website](https://ccextractor.org/public/general/downloads/).
+
+
+### Windows Package Managers
+
+**WinGet:**
+```powershell
+winget install CCExtractor.CCExtractor
+```
+
+**Chocolatey:**
+```powershell
+choco install ccextractor
+```
+
+**Scoop:**
+```powershell
+scoop bucket add extras
+scoop install ccextractor
+```

 Extracting subtitles is relatively simple. Just run the following command:

-```ccextractor <input>```
+`ccextractor <input>`

-This will extract the subtitles. 
+This will extract the subtitles.

 More usage information can be found on our website:

- [Using the command line tool](http://www.ccextractor.org/doku.php?id=public:general:command_line_usage)
- [Using the Windows GUI](http://www.ccextractor.org/doku.php?id=public:general:win_gui_usage) 
+- [Using the command line tool](https://ccextractor.org/public/general/command_line_usage/)
+- [Using the Flutter GUI](https://ccextractor.org/public/general/flutter_gui/)

+You can also find the list of parameters and their brief description by running `ccextractor` without any arguments.

-## Compiling
+You can find sample files on [our website](https://ccextractor.org/public/general/tvsamples/) to test the software.

-### Debian/Ubuntu
+### Building from Source

-Install these packages in the terminal
+- [Building on Windows using WSL](docs/build-wsl.md)

-    sudo apt-get install -y gcc
-    sudo apt-get install -y libcurl4-gnutls-dev
-    sudo apt-get install -y tesseract-ocr
-    sudo apt-get install -y tesseract-ocr-dev
-    sudo apt-get install -y libleptonica-dev
-Then run script linux/build or linux/builddebug.
+#### Linux (Autotools) build notes

-### Windows
+CCExtractor also supports an autotools-based build system under the `linux/`
+directory.

-Open the windows/ccextractor.sln file with Visual Studio (2015 at least), and build it. Configurations "(Debug|Release)-Full" includes dependent libraries which are used for OCR.
+Important notes:
+- The autotools workflow lives inside `linux/`. The `configure` script is
+  generated there and should be run from that directory.
+- Typical build steps are:
+```
+cd linux
+./autogen.sh
+./configure
+make
+```
+- Rust support is enabled automatically if `cargo` and `rustc` are available
+  on the system. In that case, Rust components are built and linked during
+  `make`.
+- If you encounter unexpected build or linking issues, a clean rebuild
+  (`make clean` or a fresh clone) is recommended, especially when Rust is
+  involved.
+
+This build flow has been tested on Linux and WSL.
+
+## Compiling CCExtractor
+
+To learn more about how to compile and build CCExtractor for your platform check the [compilation guide](https://github.com/CCExtractor/ccextractor/blob/master/docs/COMPILATION.MD).

 ## Support

-By far the best way to get support is by opening a support ticket at our [issue tracker](https://github.com/CCExtractor/ccextractor/issues). 
+By far the best way to get support is by opening an issue at our [issue tracker](https://github.com/CCExtractor/ccextractor/issues).

-When creating a ticket:
+When you create a new issue, please fill in the needed details in the provided template. That makes it easier for us to help you more efficiently.

- Make sure you are using the latest CCExtractor version.
- If it's a new issue (for example a video file that a previous CCExtractor version processed fine but now causes a crash), mention the last version you know was working.
- If the issue is about a specific file, make that file available for us. Don't just send us the output from CCExtractor, as we can't do anything about a screenshot that shows a crash. We need the input that actually causes it. You can upload the file to Dropbox, Google Drive, etc, and make it public so you get a download link to add to your ticket.
- If you cannot make the file public for any (reasonable) reason you can send us a private invitation (both Dropbox and Google Drive allow that). In this case we will download the file and upload it to the private developer repository.
- Do not upload your file to any location that will require us to sign up or endure a wait list, slow downloads, etc.
- If your upload expires make sure you keep it active somehow (replace links if needed). Keep in mind that while we go over all tickets some may take a few days, and it's important we have the file available when we actually need it.
- Make sure you set an alert in GitHub so you get notifications about your ticket. We may need to ask questions and we do everything inside GitHub's system.
- Please use English. 
- It goes without saying, we like polite people.
+If you have a question or a problem you can also [contact us by email or chat with the team in Slack](https://ccextractor.org/public/general/support/).
+
+If you want to contribute to CCExtractor but can't submit some code patches or issues or video samples, you can also [donate to us](https://sourceforge.net/donate/index.php?group_id=190832)

-You can also [contact us by email or chat with the team in Slack](http://www.ccextractor.org/doku.php?id=public:general:support). 
-    
 ## Contributing

-You can contribute to the project by forking it, modifying the code, and making a pull request to the repository. 
+You can contribute to the project by reporting issues, forking it, modifying the code and making a pull request to the repository. We have some rules, outlined in the [contributor's guide](.github/CONTRIBUTING.md).

 ## News & Other Information

-News about releases and modifications to the code can be found in the `CHANGES.TXT` file. 
+News about releases and modifications to the code can be found in the [CHANGES.TXT](docs/CHANGES.TXT) file.

-For more information visit the CCExtractor website: [http://www.ccextractor.org](http://www.ccextractor.org)
+For more information visit the CCExtractor website: [https://www.ccextractor.org](https://www.ccextractor.org)
+
+## License
+
+GNU General Public License version 2.0 (GPL-2.0)
--- a/0
+++ b/0
--- a/docker/Dockerfile
+++ b/docker/Dockerfile
@@ -0,0 +1,239 @@
+# CCExtractor Docker Build
+#
+# Build variants via BUILD_TYPE argument:
+#   - minimal: Basic CCExtractor without OCR
+#   - ocr: CCExtractor with OCR support (default)
+#   - hardsubx: CCExtractor with burned-in subtitle extraction (requires FFmpeg)
+#
+# Source options via USE_LOCAL_SOURCE argument:
+#   - 0 (default): Clone from GitHub (standalone Dockerfile usage)
+#   - 1: Use local source (when building from cloned repo)
+#
+# Build examples:
+#
+#   # Standalone (just the Dockerfile, clones from GitHub):
+#   docker build -t ccextractor docker/
+#   docker build --build-arg BUILD_TYPE=hardsubx -t ccextractor docker/
+#
+#   # From cloned repository (faster, uses local source):
+#   docker build --build-arg USE_LOCAL_SOURCE=1 -f docker/Dockerfile -t ccextractor .
+#   docker build --build-arg USE_LOCAL_SOURCE=1 --build-arg BUILD_TYPE=minimal -f docker/Dockerfile -t ccextractor .
+
+ARG DEBIAN_VERSION=bookworm-slim
+
+FROM debian:${DEBIAN_VERSION} AS base
+
+FROM base AS builder
+
+# Build arguments
+ARG BUILD_TYPE=ocr
+ARG USE_LOCAL_SOURCE=0
+# BUILD_TYPE: minimal, ocr, hardsubx
+# USE_LOCAL_SOURCE: 0 = git clone, 1 = copy local source
+
+# Avoid interactive prompts during package installation
+ENV DEBIAN_FRONTEND=noninteractive
+
+# Install base build dependencies
+RUN apt-get update && apt-get install -y --no-install-recommends \
+    git \
+    curl \
+    ca-certificates \
+    gcc \
+    g++ \
+    cmake \
+    make \
+    pkg-config \
+    bash \
+    zlib1g-dev \
+    libpng-dev \
+    libjpeg-dev \
+    libssl-dev \
+    libfreetype-dev \
+    libxml2-dev \
+    libcurl4-gnutls-dev \
+    clang \
+    libclang-dev \
+    && rm -rf /var/lib/apt/lists/*
+
+# Install Rust toolchain
+RUN curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh -s -- -y --default-toolchain stable
+ENV PATH="/root/.cargo/bin:${PATH}"
+
+# Install OCR dependencies (for ocr and hardsubx builds)
+RUN if [ "$BUILD_TYPE" = "ocr" ] || [ "$BUILD_TYPE" = "hardsubx" ]; then \
+        apt-get update && apt-get install -y --no-install-recommends \
+            tesseract-ocr \
+            libtesseract-dev \
+            libleptonica-dev \
+        && rm -rf /var/lib/apt/lists/*; \
+    fi
+
+# Install FFmpeg dependencies (for hardsubx build)
+RUN if [ "$BUILD_TYPE" = "hardsubx" ]; then \
+        apt-get update && apt-get install -y --no-install-recommends \
+            libavcodec-dev \
+            libavformat-dev \
+            libavutil-dev \
+            libswscale-dev \
+            libswresample-dev \
+            libavfilter-dev \
+            libavdevice-dev \
+        && rm -rf /var/lib/apt/lists/*; \
+    fi
+
+# Build and install GPAC library
+WORKDIR /root
+RUN git clone -b v2.4.0 --depth 1 https://github.com/gpac/gpac
+WORKDIR /root/gpac
+RUN ./configure && make -j$(nproc) lib && make install-lib && ldconfig
+WORKDIR /root
+RUN rm -rf /root/gpac
+
+# Get CCExtractor source (either clone or copy based on USE_LOCAL_SOURCE)
+WORKDIR /root
+# First, copy local source if provided (will be empty dir if building standalone)
+COPY . /root/ccextractor-local/
+
+# Then get source: use local copy if USE_LOCAL_SOURCE=1 and source exists,
+# otherwise clone from GitHub
+RUN if [ "$USE_LOCAL_SOURCE" = "1" ] && [ -f /root/ccextractor-local/src/ccextractor.c ]; then \
+        echo "Using local source"; \
+        mv /root/ccextractor-local /root/ccextractor; \
+    else \
+        echo "Cloning from GitHub"; \
+        rm -rf /root/ccextractor-local; \
+        git clone --depth 1 https://github.com/CCExtractor/ccextractor.git /root/ccextractor; \
+    fi
+
+WORKDIR /root/ccextractor/linux
+
+# Generate build info
+RUN ./pre-build.sh
+
+# Build Rust library with appropriate features
+RUN if [ "$BUILD_TYPE" = "hardsubx" ]; then \
+        cd ../src/rust && \
+        CARGO_TARGET_DIR=../../linux/rust cargo build --release --features hardsubx_ocr; \
+    else \
+        cd ../src/rust && \
+        CARGO_TARGET_DIR=../../linux/rust cargo build --release; \
+    fi
+
+RUN cp rust/release/libccx_rust.a ./libccx_rust.a
+
+# Compile CCExtractor
+RUN if [ "$BUILD_TYPE" = "minimal" ]; then \
+        BLD_FLAGS="-std=gnu99 -Wno-write-strings -Wno-pointer-sign -D_FILE_OFFSET_BITS=64 -DVERSION_FILE_PRESENT -DFT2_BUILD_LIBRARY -DGPAC_DISABLE_VTT -DGPAC_DISABLE_OD_DUMP -DGPAC_DISABLE_REMOTERY -DNO_GZIP -DGPAC_64_BITS"; \
+        BLD_INCLUDE="-I../src -I../src/lib_ccx/ -I /usr/include/gpac/ -I../src/thirdparty/libpng -I../src/thirdparty/zlib -I../src/lib_ccx/zvbi -I../src/thirdparty/lib_hash -I../src/thirdparty -I../src/thirdparty/freetype/include"; \
+        BLD_LINKER="-lm -Wl,--allow-multiple-definition -lpthread -ldl -lgpac ./libccx_rust.a"; \
+    elif [ "$BUILD_TYPE" = "hardsubx" ]; then \
+        BLD_FLAGS="-std=gnu99 -Wno-write-strings -Wno-pointer-sign -D_FILE_OFFSET_BITS=64 -DVERSION_FILE_PRESENT -DENABLE_OCR -DENABLE_HARDSUBX -DFT2_BUILD_LIBRARY -DGPAC_DISABLE_VTT -DGPAC_DISABLE_OD_DUMP -DGPAC_DISABLE_REMOTERY -DNO_GZIP -DGPAC_64_BITS"; \
+        BLD_INCLUDE="-I../src -I /usr/include/leptonica/ -I /usr/include/tesseract/ -I../src/lib_ccx/ -I /usr/include/gpac/ -I../src/thirdparty/libpng -I../src/thirdparty/zlib -I../src/lib_ccx/zvbi -I../src/thirdparty/lib_hash -I../src/thirdparty -I../src/thirdparty/freetype/include"; \
+        BLD_LINKER="-lm -Wl,--allow-multiple-definition -ltesseract -lleptonica -lpthread -ldl -lgpac -lswscale -lavutil -lavformat -lavcodec -lavfilter -lswresample ./libccx_rust.a"; \
+    else \
+        BLD_FLAGS="-std=gnu99 -Wno-write-strings -Wno-pointer-sign -D_FILE_OFFSET_BITS=64 -DVERSION_FILE_PRESENT -DENABLE_OCR -DFT2_BUILD_LIBRARY -DGPAC_DISABLE_VTT -DGPAC_DISABLE_OD_DUMP -DGPAC_DISABLE_REMOTERY -DNO_GZIP -DGPAC_64_BITS"; \
+        BLD_INCLUDE="-I../src -I /usr/include/leptonica/ -I /usr/include/tesseract/ -I../src/lib_ccx/ -I /usr/include/gpac/ -I../src/thirdparty/libpng -I../src/thirdparty/zlib -I../src/lib_ccx/zvbi -I../src/thirdparty/lib_hash -I../src/thirdparty -I../src/thirdparty/freetype/include"; \
+        BLD_LINKER="-lm -Wl,--allow-multiple-definition -ltesseract -lleptonica -lpthread -ldl -lgpac ./libccx_rust.a"; \
+    fi && \
+    SRC_LIBPNG="$(find ../src/thirdparty/libpng/ -name '*.c')" && \
+    SRC_ZLIB="$(find ../src/thirdparty/zlib/ -name '*.c')" && \
+    SRC_CCX="$(find ../src/lib_ccx/ -name '*.c')" && \
+    SRC_GPAC="$(find /usr/include/gpac/ -name '*.c' 2>/dev/null || true)" && \
+    SRC_HASH="$(find ../src/thirdparty/lib_hash/ -name '*.c')" && \
+    SRC_UTF8PROC="../src/thirdparty/utf8proc/utf8proc.c" && \
+    SRC_FREETYPE="../src/thirdparty/freetype/autofit/autofit.c \
+        ../src/thirdparty/freetype/base/ftbase.c \
+        ../src/thirdparty/freetype/base/ftbbox.c \
+        ../src/thirdparty/freetype/base/ftbdf.c \
+        ../src/thirdparty/freetype/base/ftbitmap.c \
+        ../src/thirdparty/freetype/base/ftcid.c \
+        ../src/thirdparty/freetype/base/ftfntfmt.c \
+        ../src/thirdparty/freetype/base/ftfstype.c \
+        ../src/thirdparty/freetype/base/ftgasp.c \
+        ../src/thirdparty/freetype/base/ftglyph.c \
+        ../src/thirdparty/freetype/base/ftgxval.c \
+        ../src/thirdparty/freetype/base/ftinit.c \
+        ../src/thirdparty/freetype/base/ftlcdfil.c \
+        ../src/thirdparty/freetype/base/ftmm.c \
+        ../src/thirdparty/freetype/base/ftotval.c \
+        ../src/thirdparty/freetype/base/ftpatent.c \
+        ../src/thirdparty/freetype/base/ftpfr.c \
+        ../src/thirdparty/freetype/base/ftstroke.c \
+        ../src/thirdparty/freetype/base/ftsynth.c \
+        ../src/thirdparty/freetype/base/ftsystem.c \
+        ../src/thirdparty/freetype/base/fttype1.c \
+        ../src/thirdparty/freetype/base/ftwinfnt.c \
+        ../src/thirdparty/freetype/bdf/bdf.c \
+        ../src/thirdparty/freetype/bzip2/ftbzip2.c \
+        ../src/thirdparty/freetype/cache/ftcache.c \
+        ../src/thirdparty/freetype/cff/cff.c \
+        ../src/thirdparty/freetype/cid/type1cid.c \
+        ../src/thirdparty/freetype/gzip/ftgzip.c \
+        ../src/thirdparty/freetype/lzw/ftlzw.c \
+        ../src/thirdparty/freetype/pcf/pcf.c \
+        ../src/thirdparty/freetype/pfr/pfr.c \
+        ../src/thirdparty/freetype/psaux/psaux.c \
+        ../src/thirdparty/freetype/pshinter/pshinter.c \
+        ../src/thirdparty/freetype/psnames/psnames.c \
+        ../src/thirdparty/freetype/raster/raster.c \
+        ../src/thirdparty/freetype/sfnt/sfnt.c \
+        ../src/thirdparty/freetype/smooth/smooth.c \
+        ../src/thirdparty/freetype/truetype/truetype.c \
+        ../src/thirdparty/freetype/type1/type1.c \
+        ../src/thirdparty/freetype/type42/type42.c \
+        ../src/thirdparty/freetype/winfonts/winfnt.c" && \
+    BLD_SOURCES="../src/ccextractor.c $SRC_CCX $SRC_GPAC $SRC_ZLIB $SRC_LIBPNG $SRC_HASH $SRC_UTF8PROC $SRC_FREETYPE" && \
+    gcc $BLD_FLAGS $BLD_INCLUDE -o ccextractor $BLD_SOURCES $BLD_LINKER
+
+# Copy binary to known location
+RUN cp /root/ccextractor/linux/ccextractor /ccextractor
+
+# Final minimal image
+FROM base AS final
+
+ARG BUILD_TYPE=ocr
+
+# Avoid interactive prompts
+ENV DEBIAN_FRONTEND=noninteractive
+
+# Install runtime dependencies based on build type
+RUN apt-get update && apt-get install -y --no-install-recommends \
+    libpng16-16 \
+    libjpeg62-turbo \
+    zlib1g \
+    libssl3 \
+    libcurl4 \
+    && rm -rf /var/lib/apt/lists/*
+
+# OCR runtime dependencies
+RUN if [ "$BUILD_TYPE" = "ocr" ] || [ "$BUILD_TYPE" = "hardsubx" ]; then \
+        apt-get update && apt-get install -y --no-install-recommends \
+            tesseract-ocr \
+            liblept5 \
+        && rm -rf /var/lib/apt/lists/*; \
+    fi
+
+# HardSubX runtime dependencies
+RUN if [ "$BUILD_TYPE" = "hardsubx" ]; then \
+        apt-get update && apt-get install -y --no-install-recommends \
+            libavcodec59 \
+            libavformat59 \
+            libavutil57 \
+            libswscale6 \
+            libswresample4 \
+            libavfilter8 \
+            libavdevice59 \
+        && rm -rf /var/lib/apt/lists/*; \
+    fi
+
+# Copy GPAC library from builder
+COPY --from=builder /usr/local/lib/libgpac.so* /usr/local/lib/
+
+# Update library cache
+RUN ldconfig
+
+# Copy CCExtractor binary
+COPY --from=builder /ccextractor /ccextractor
+
+ENTRYPOINT ["/ccextractor"]
--- a/docker/README.md
+++ b/docker/README.md
@@ -0,0 +1,91 @@
+# CCExtractor Docker Image
+
+This Dockerfile builds CCExtractor with support for multiple build variants.
+
+## Build Variants
+
+| Variant | Description | Features |
+|---------|-------------|----------|
+| `minimal` | Basic CCExtractor | No OCR support |
+| `ocr` | With OCR support (default) | Tesseract OCR for bitmap subtitles |
+| `hardsubx` | With burned-in subtitle extraction | OCR + FFmpeg for hardcoded subtitles |
+
+## Building
+
+### Standalone Build (from Dockerfile only)
+
+You can build CCExtractor using just the Dockerfile - it will clone the source from GitHub:
+
+```bash
+# Default build (OCR enabled)
+docker build -t ccextractor docker/
+
+# Minimal build (no OCR)
+docker build --build-arg BUILD_TYPE=minimal -t ccextractor docker/
+
+# HardSubX build (OCR + FFmpeg for burned-in subtitles)
+docker build --build-arg BUILD_TYPE=hardsubx -t ccextractor docker/
+```
+
+### Build from Cloned Repository (faster)
+
+If you have already cloned the repository, you can use local source for faster builds:
+
+```bash
+git clone https://github.com/CCExtractor/ccextractor.git
+cd ccextractor
+
+# Default build (OCR enabled)
+docker build --build-arg USE_LOCAL_SOURCE=1 -f docker/Dockerfile -t ccextractor .
+
+# Minimal build
+docker build --build-arg USE_LOCAL_SOURCE=1 --build-arg BUILD_TYPE=minimal -f docker/Dockerfile -t ccextractor .
+
+# HardSubX build
+docker build --build-arg USE_LOCAL_SOURCE=1 --build-arg BUILD_TYPE=hardsubx -f docker/Dockerfile -t ccextractor .
+```
+
+## Build Arguments
+
+| Argument | Default | Description |
+|----------|---------|-------------|
+| `BUILD_TYPE` | `ocr` | Build variant: `minimal`, `ocr`, or `hardsubx` |
+| `USE_LOCAL_SOURCE` | `0` | Set to `1` to use local source instead of cloning |
+| `DEBIAN_VERSION` | `bookworm-slim` | Debian version to use as base |
+
+## Usage
+
+### Basic Usage
+
+```bash
+# Show version
+docker run --rm ccextractor --version
+
+# Show help
+docker run --rm ccextractor --help
+```
+
+### Processing Local Files
+
+Mount your local directory to process files:
+
+```bash
+# Process a video file with output file
+docker run --rm -v $(pwd):$(pwd) -w $(pwd) ccextractor input.mp4 -o output.srt
+
+# Process using stdout
+docker run --rm -v $(pwd):$(pwd) -w $(pwd) ccextractor input.mp4 --stdout > output.srt
+```
+
+### Interactive Mode
+
+```bash
+docker run --rm -it --entrypoint=/bin/bash ccextractor
+```
+
+## Image Size
+
+The multi-stage build produces runtime images:
+- `minimal`: ~130MB
+- `ocr`: ~215MB (includes Tesseract)
+- `hardsubx`: ~610MB (includes Tesseract + FFmpeg)
--- a/docs/708_STATUS.TXT
+++ b/docs/708_STATUS.TXT
@@ -29,7 +29,7 @@ To do:
  though. No samples, no support.
 - A few commands are not yet supported, specifically those related
  to delay. 
- Detect and extract captions from MP4 (MOV) files, handled by gpacmp4
+- Detect and extract captions from MP4 (MOV) files, handled by gpac

 Done (18.08.2015):

--- a/docs/AUTHORS.TXT
+++ b/docs/AUTHORS.TXT
@@ -0,0 +1,92 @@
+ccextractor was originally a mildly optimized C port of McPoodle's excellent
+but painfully slow Perl script SCC_RIP. That port (ccextractor 0.01) was
+written by Carlos Fernández (cfsmp3).
+
+After a number of versions that did something semiuseful Volker Quetschke
+joined the effort and together Carlos and Volker to CCExtractor a point in
+which it was actually really usable, at least for the cases that interested
+them.
+
+Unfortunately Volker moved on once CCExtractor did what he needed to do for
+him.
+
+At some point David Liontooth from UCLA started to use CCExtractor as a 
+replacement for libzvbi because libzvbi wasn't working for some specific
+streams. UCLA became the primary key user as they were using CCExtractor
+24x7 to process a huge amount of stream from several countries, and was 
+therefore able to provide samples, proper bug reports, etc.
+
+At that time CCEXtractor was still US-centric, because it was originally
+written so Carlos could get subtitles for US TV shows. But UCLA wanted
+European subtitles too, and they already had recording nodes in Denmark
+(which use teletext) and Spain (which uses DVB).
+
+For teletext a good solution existed already: Petr Kutalek's telxcc. 
+We contacted Petr and asked for permission to integrate his code into
+CCExtractor. Petr's absolutely brilliantly clean code was easy to 
+integrate and build upon - and with it, we added support for the first
+kind of European subtitles.
+
+Around that time, we decided to apply for Google Summer of Code. That
+was also a game changer, with Willem, Ruslan and Anshul being the first
+3 students. They are still around, now as mentors and year round 
+contributors.
+
+Since them, many more people have been involved: More than 10 as 
+Google Summer of Code students, Code-In students, companies that
+sponsored development by hiring team members to do custom development
+(Comcast was the first one, and we'll always be grateful for the 
+opportunity). 
+
+List of students is below (if they added themselves). For a complete
+list, just check the pull requests at GitHub.
+
+Home: https://www.ccextractor.org
+
+Google Summer of Code 2014 students
+- Willem Van Iseghem
+- Ruslan Kuchumov
+- Anshul Maheshwari
+
+Google Summer of Code 2015 students
+- Willem Van Iseghem
+- Ruslan Kuchumov
+- Anshul Maheshwari
+- Nurendra Choudhary
+- Oleg Kiselev
+- Vasanth Kalingeri
+
+Google Summer of Code 2016 students
+- Willem Van Iseghem
+- Ruslan Kuchumov
+- Abhishek Vinjamoori
+- Abhinav Shukla
+- Rishabh Garg
+
+Google Code-in 2016 students
+- Evgeny Shulgin
+- Manveer Basra
+- Alexandru Bratosin
+- Matej Plavevski
+- Danila Fedorin
+
+Google Code-in 2017 students
+- Matej Plavevski
+- Harry Yu
+- Theodore Fabian
+- Nikunj Taneja
+- John Chew
+- Aadi Bajpai
+- Wiliam(Hori75)
+
+Google Summer of Code 2017 students
+- Diptanshu Jamgade
+- Mayank Gupta
+
+Google Code-in 2018 students
+- Matej Plavevski
+- Ivan Makarov
+- Albert (alufers)
+- Brian M 
+- John Chew
+- T1duS 
--- a/docs/Building_macos_system_libs.md
+++ b/docs/Building_macos_system_libs.md
@@ -0,0 +1,157 @@
+# Building CCExtractor on macOS using System Libraries (-system-libs)
+
+## Overview
+
+This document explains how to build CCExtractor on macOS using system-installed libraries instead of bundled third-party libraries.
+
+This build mode is required for Homebrew compatibility and is enabled via the `-system-libs` flag introduced in PR #1862.
+
+## Why is -system-libs needed?
+
+### Background
+
+CCExtractor was removed from Homebrew (homebrew-core) because:
+
+- Homebrew does not allow bundling third-party libraries
+- The default CCExtractor build compiles libraries from `src/thirdparty/`
+- This violates Homebrew packaging policies
+
+### What -system-libs fixes
+
+The `-system-libs` flag allows CCExtractor to:
+
+- Use system-installed libraries via Homebrew
+- Resolve headers and linker flags using `pkg-config`
+- Skip compiling bundled copies of common libraries
+
+This makes CCExtractor acceptable for Homebrew packaging.
+
+## Build Modes Explained
+
+### 1️⃣ Default Build (Bundled Libraries)
+
+**Command:**
+
+```bash
+./mac/build.command
+```
+
+**Behavior:**
+
+- Compiles bundled libraries:
+  - `freetype`
+  - `libpng`
+  - `zlib`
+  - `utf8proc`
+- Self-contained binary
+- Larger size
+- Suitable for standalone builds
+
+### 2️⃣ System Libraries Build (Homebrew-compatible)
+
+**Command:**
+
+```bash
+./mac/build.command -system-libs
+```
+
+**Behavior:**
+
+- Uses system libraries via `pkg-config`
+- Does not compile bundled libraries
+- Smaller binary
+- Faster build
+- Required for Homebrew
+
+## Required Homebrew Dependencies
+
+Install required dependencies:
+
+```bash
+brew install pkg-config autoconf automake libtool \
+  gpac freetype libpng protobuf-c utf8proc zlib
+```
+
+**Optional** (OCR / HARDSUBX support):
+
+```bash
+brew install tesseract leptonica ffmpeg
+```
+
+## How to Build
+
+```bash
+cd mac
+./build.command -system-libs
+```
+
+**Verify:**
+
+```bash
+./ccextractor --version
+```
+
+## What Changes Internally with -system-libs
+
+### Libraries NOT compiled (system-provided)
+
+- **FreeType**
+- **libpng**
+- **zlib**
+- **utf8proc**
+
+### Libraries STILL bundled
+
+- **lib_hash** (Custom SHA-256 implementation, no system equivalent)
+
+## CI Coverage
+
+A new CI job was added:
+
+- `build_shell_system_libs`
+
+**What it does:**
+
+- Installs Homebrew dependencies
+- Runs `./build.command -system-libs`
+- Verifies the binary runs correctly
+
+This ensures Homebrew-compatible builds stay working.
+
+## Verification (Local)
+
+You can confirm system libraries are used:
+
+```bash
+otool -L mac/ccextractor
+```
+
+**Expected output includes paths like:**
+
+```
+/opt/homebrew/opt/gpac/lib/libgpac.dylib
+```
+
+## Homebrew Formula Usage (Future)
+
+Example formula snippet:
+
+```ruby
+def install
+  system "./mac/build.command", "-system-libs"
+  bin.install "mac/ccextractor"
+end
+```
+
+## Summary
+
+- `-system-libs` is opt-in
+- Default build remains unchanged
+- Enables CCExtractor to return to Homebrew
+- Fully tested in CI and locally
+
+## Related
+
+- **PR #1862** — Add `-system-libs` flag
+- **Issue #1580** — Homebrew compatibility
+- **Issue #1534** — System library support
--- a/docs/CHANGES.TXT
+++ b/docs/CHANGES.TXT
@@ -1,5 +1,316 @@
-0.85b (2017-1-26)
+0.96.6 (unreleased)
+-------------------
+- New: Add Snap packaging support with Snapcraft configuration and GitHub Actions CI workflow. 
+- Fix: Clear status line output on Linux/WSL to prevent text artifacts (#2017)
+- Fix: Prevent infinite loop on truncated MKV files
+- Fix: Various memory safety and stability fixes in demuxers (MP4, PS, MKV, DVB)
+- Fix: Delete empty output files instead of leaving 0-byte files (#1282)
+- Fix: --mkvlang now supports BCP 47 language tags (e.g., en-US, zh-Hans-CN) and multiple codes
+
+0.96.5 (2026-01-05)
+-------------------
+- New: CCExtractor is available again via Homebrew on macOS and Linux.
+- New: Add support for raw CDP (Caption Distribution Packet) files (#1406)
+- New: Add --scc-accurate-timing option for bandwidth-aware SCC output (#1120)
+- Fix: MXF files containing CEA-708 captions not being detected/extracted (#1647)
+- Docs: Add Windows WSL build instructions
+- Fix: Security fixes (out-of-bounds read/write) in a few places in the legacy C code.
+
+0.96.4 (2026-01-01)
+-------------------
+- New: Persistent CEA-708 decoder context - maintains state across multiple calls for proper subtitle continuity
+- New: OCR character blacklist options (--ocr-blacklist, --ocr-blacklist-file) for improved accuracy
+- New: OCR line-split option (--ocr-splitontimechange) for better subtitle segmentation
+- Fix: 32-bit build failures on i686 and armv7l architectures
+- Fix: Legacy command-line argument compatibility (-1, -2, -12, --sc, --svc)
+- Fix: Prevent heap buffer overflow in Teletext processing (security fix)
+- Fix: Prevent integer overflow leading to heap buffer overflow in Transport Stream handling (security fix)
+- Fix: Lazy OCR initialization - only initialize when first DVB subtitle is encountered
+- Build: Optimized Windows CI workflow for faster builds
+- Fix: Updated GUI with version 0.7.1. A blind attempt to fix a hang on start on some Windows.
+
+0.96.3 (2025-12-29)
+-------------------
+- New: VOBSUB subtitle extraction with OCR support for MP4 files
+- New: VOBSUB subtitle extraction support for MKV/Matroska files
+- New: Native SCC (Scenarist Closed Caption) input file support - CCExtractor can now read SCC files
+- New: Configurable frame rate (--scc-framerate) and styled PAC codes for SCC output
+- Fix: Apply --delay option to DVB/bitmap subtitles (previously only worked with text-based subtitles)
+- Fix: 200ms timing offset in MOV/MP4 caption extraction
+- Fix: utf8proc include path for system library builds
+- Fix: Use fixed-width integer types in MP4 bswap functions for better portability
+- Fix: Guard ocr_text access with ENABLE_OCR preprocessor check
+- Fix: Preserve FFmpeg libs when building with -system-libs -hardsubx
+- Build: Add vobsub_decoder to Windows and autoconf build systems
+- Build: Add winget and Chocolatey packaging workflows for Windows distribution
+- Docs: Add VOBSUB extraction documentation and subtile-ocr Dockerfile
+
+0.96.2 (2025-12-26)
+-------------------
+- Fix: Resolve utf8proc header include path when building against system libraries on Linux.
+- Rebundle Windows version to include required runtime files to process hardcoded subtitles
+  (hardcodex mode).
+- New: Add optional -system-libs flag to Linux build script for package manager compatibility
+
+0.96.1 (2025-12-25)
+-------------------
+- Rebundle Windows version to include an updated GUI. No changes in CCExtractor itself.
+
+0.96 (2025-12-23)
 -----------------
+- New: Multi-page teletext extraction support (#665)
+  - Extract multiple teletext pages simultaneously with separate output files
+  - Use --tpage multiple times (e.g., --tpage 100 --tpage 200)
+  - Output files are named with page suffix (e.g., output_p100.srt, output_p200.srt)
+- Fix: SPUPNG subtitle offset calculation to center based on actual image dimensions
+
+- New: Added --list-tracks (-L) option to list all tracks in media files without processing
+  New: Chinese, Korean, Japanese support - proper encoding and OCR.
+  New: Correct McPoodle DVD raw format support
+  Fix: Timing is now frame perfect (using FFMpeg timing dump as reference) in all formats.
+  Fix: Solved garbling in all the pending issues we had on GitHub.
+  Fix: All causes of "premature end of file" messages due to bugs and not actual file cuts.
+  Fix: All memory leaks, double frees and usual C nastyness that valgrind could find.
+- Fix Include ATSC VCT virtual channel numbers and call signs in XMLTV output
+- Fix: Restore ATSC XMLTV generation with ETT parsing for extended descriptions, multi-segment handling, extended table ID's (EIT/VCT), corrected <programme> XMLTV formatting, buffer bounds fixes
+- Fix: Add HEVC/H.265 stream type recognition to prevent crashes on ATSC 3.0 streams.
+  Fix: Tolerance to damaged streams - recover where possible instead of terminating.
+  Issues closed: Over 40! Too many to list here, but each of them was either a bug squashed or a feature implemented.
+
+0.95 (2025-09-15 - never formally packaged)
+-----------------
+- New: Create a Docker image to simplify the CCExtractor usage without any environmental hustle (#1611)
+- New: Add SCC support for CEA-708 decoder (#1595)
+  Refactor: Lots of code ported to Rust.
+- Fix: Improved handling of IETF language tags in Matroska files (#1665)
+- Breaking: Major argument flags revamp for CCExtractor (#1564 & #1619)
+- Fix: segmentation fault in using hardsubx
+- Fix: WebVTT X-TIMESTAMP-MAP placement (#1463)
+- Fix: ffmpeg 5.0, tesseract 5.0 compatibility and remove deprecated methods
+- Fix: tesseract 5.x traineddata location in ocr
+- Improvement: Ignore MXF Caption Essence Container version byte to enhance SRT subtitle extraction compatibility
+- New: Add tesseract page segmentation modes control with `--psm` flag
+- Fix: Support for MINGW-w64 cross compiling
+
+0.94 (2021-12-14)
+-----------------
+- BOM is no longer enabled by default on windows platforms
+- CEA-708: Rust decoder is now default instead of C decoder
+- CEA-708 subs are now extracted by default
+- New: Add check for Minimum supported rust version (MSRV) (#1387) 
+- Fix: Fix CEA-708 Carriage Return command implementation
+- Fix: Fix bug with startat/endat parameter (#1396)
+- Fix: Mac Build processes (#1390)
+- Fix: Fix bug with negative delay parameter (#1365)
+
+0.93 (2021-08-16)
+-----------------
+- Minor Rust updates (format, typos, docs)
+- Updated GUI
+
+0.92 (2021-08-10)
+-----------------
+- Rust updates: Added srt writer
+- Rust updates:-Added writers for transcripts and SAMI
+- Added missing DLL to Windows installer
+- Updated Windows GUI
+
+0.91 (2021-07-26)
+-----------------
+- More Rust in the 708 decoder (Add Pen Presets and timing functions)
+- Updated GUI
+
+0.90 (2021-07-14)
+-----------------
+- New installer (WiX based)
+- New GUI (flutter based)
+- More Rust (the 708 decoder is being rewritten)
+
+0.89 (2021-06-13)
+-----------------
+- Fix: Fix broken links in README
+- Fix: Timing in DVB, sub duration check for timeout.
+- New: Added support for SCC and CCD encoder formats
+- New: Added support to output captions to MCC file (#733).
+- New: Add support for censoring words ("Kid Friendly") (#1139)
+- New: Extend support of capitalization for all BITMAP and 608 subtitles (#1214)
+- New: Added an option to disable timestamps for WebVTT (In response to issue #1127)
+- Fix: Change inet_ntop to inet_ntoa for Windows XP compatibility
+- Fix: Added italics, underline, and color rendering support for -out=spupng with EIA608/teletext
+- Fix: ccx_demuxer_mxf.c: Parse framerate from MXF captions to fix caption timings.
+- Fix: hardsubx_decoder.c: Fix memory leaks using Leptonica API.
+- Fix: linux/Makefile.am: added some sources to enable rpms to be created.
+- Fix: Crash when using -sc (sentence case) option (#1115)
+- Fix: Segmentation fault on VOB #1128
+- Fix: Hang while processing video #1121
+- Fix: lib_ccx.c: Initialize fatal error logging function before first usage in init_libraries
+- Fix: A few (minor) memory leaks around the code.
+- Fix: General code clean up / reformatting
+- Fix: Fix multiple definitions with new -fno-common default in GCC 10
+- Fix: Mac now builds reproducibly again without errors on the date command (#1230)
+- Fix: Allow all oem modes with tesseract v4 (#1264)
+- Doc: Updated ccextractor.cnf.sample.
+- Update: Updated LibPNG to 1.6.37
+- Remove: Python API (since no one cares about it and it's unmaintained)
+- Remove: -cf , just use FFmpeg if you want a ES from a TS or PS, CCExtractor is a bad tool
+  for this.
+- Fix: Segmentation fault on Windows
+- Update: Updated libGPAC to 1.0.1
+- Fix: Segmentation fault with unsupported and multitrack file reports
+- Fix: Write subtitle header to multitrack outputs
+- Fix: Write multitrack files to the output file directory
+- Fix: Correct frame number calculation in SCC (#1340)
+- Fix: Regression on Teletext that caused dates to be wrong (RT 78 on the sample platform)
+- Fix: CEA-708: Better timing, fixes for missing subtitles
+- Fix: timing for direct rollup
+- Fix: timing for VOB files with multiple chapters
+
+0.88 (2019-05-21)
+-----------------
+- New: More tapping points for debug image in ccextractor.
+- New: Add support for tesseract 4.0
+- Optimize: Remove multiple RGB to grey conversion in OCR.
+- Fix: Update UTF8Proc to 2.2.0
+- Fix: Update LibPNG to 1.6.35
+- Fix: Update Protobuf-c to 1.3.1
+- Fix: Warn instead of fatal when a 0xFF marker is missing
+- Fix: Segfault in general_loop.c due to null pointer dereference (case of no encoder)
+- Fix: Enable printing hdtv stats to console.
+- Fix: Many typos in comments and output messages
+- Fix: Ignore Visual Studio temporary project files
+- New: Add support for non-Latin characters in stdout
+- Fix: Check whether stream is empty
+- New: Add support for EIA-608 inside .mkv
+- New: Add support for DVB inside .mkv
+- Fix: Added -latrusmap Map Latin symbols to Cyrillic ones in special cases
+       of Russian Teletext files (issue #1086)
+- Fix: Several OCR crashes
+
+0.87 (2018-10-23)
+-----------------
+- New: Upgrade libGPAC to 0.7.1.
+- New: mp4 tx3g & multitrack subtitles.
+- New: Guide to update dependencies (docs/Updating_Dependencies.txt).
+- New: Add LICENSE File (#959).
+- New: Display quantisation mode in info box (#954).
+- New: Add instruction required to build ccextractor with HARDSUBX support (#946).
+- New: Added version no. of libraries to --version.
+- New: Added -quant (OCR quantization function).
+- New: Python API now compatible with Python 3.
+- Fix: linux/builddebug: Added non-local directories to the incluye search path so we don't
+       require a locally compiled tesseract or leptonica.
+- Fix: Correct -HARDSUBX Bug In CMake, allow build with hardsubx using cmake (#966).
+- Fix: possible segfaults in hardsubx_classifier.c due to strdup (#963).
+- Fix: Improve the start and end timestamps of extracted burned in captions (#962).
+- Fix: Update COMPILATION.md (#960).
+- Fix: Fixed crash with "-out=report" and "-out=null".
+- Fix: -nocf not working with OCR'ing (#958).
+- Fix: segfault in add_cc_sub_text and initialize to NULL in init_encoder (#950).
+- Fix: ccx_decoders_common.c: Copy data type when creating a copy of the subtitle structure.
+- Fix: Implicit declaration of these functions throws warning during build (#948).
+- Fix: ccx_decoders_common.c: Properly release allocated resources on free_subtitle().
+- Fix: Added a datatype member to struct cc_subtitle - needed so we can properly free all
+       memory when void *data points to a structure that has its own pointers.
+- Fix: dvb_subtitle_decoder.c: When combining image regions verify that the offset is
+       never negative.
+- Fix: Updated traivis.yml to fix osx build (#947).
+- Fix: Add utf8proc src file to cmake, updated header file (#944).
+- Fix: Added required pointers on freep() calls.
+- Fix: Removed dvb_debug_traces_to_stdout and used the usual dbg_print instead.
+- Fix: Additional debug traces for DVB.
+- Fix: Fix minor memory leak in ocr.c.
+- Fix: Fix issue with displaying utf8proc version.
+- Fix: Fix failing cmake due to liblept/tesseract header files.
+- Fix: Added missing \n in params.c.
+- Fix: builddebug: Use -fsanitize=address -fno-omit-frame-pointer.
+- Fix: ccx_decoders_common.c: Removed trivial memory leak.
+- Fix: ccx_encoders_srt.c: Made sure a pointer is non-NULL before dereferencing.
+- Fix: dvb_subtitle_decoder.c: Initialize pointer members to NULL when creating a structure.
+- Fix: lib_ccx.c: Initialize (memset 0) structure cc_subtitle after memory allocation.
+- Fix: Added verboseness to error/warnings in dvb_subtitle_decoder.c.
+- Fix: dvb_subtitle_decoder.c: Work on passing invalid streams errors upstream (plus some
+       warning messages) so we can eventually recover from this situation instead of crashing.
+- Fix: telxcc.c: Currently setting a colour doesn't necessarily add a space even though the
+       specifications mandate it. (#930).
+- Fix: dvb_subtitle_decoder.c: Fix null pointer derefence when region==NULL in write_dvb_sub.
+- Fix: DVB Teletext subtitle incomplete.
+- Fix: replace all 0xA characters within startbox with 0x20.
+- Fix: DVB Teletext subtitle incomplete (#922).
+- Fix: Add missing return value to one of the returns in process_tx3g().
+- Fix: Typos and other minor bugs.
+- Fix: Tidy CMakeLists & vcxproj (#920).
+- Fix: Added m2ts and -mxf to help screen.
+- Fix: Added MKV to demuxer_print_cfg.
+- Fix: Added MXF to demuxer_print_cfg.
+- Fix: "Out of order packets" error had wrong print() parameters.
+- Fix: Updated Python documentation.
+- Fix: Fix incorrect path in XML (#904).
+- Fix: linux build script (non-debug): Don't hide warnings from compiler.
+- Fix: linux build script (debug): Display what's step of the build script we're in.
+- Fix: Make the build reproducible (#976).
+- Fix: Remove instance of o1 and o2 from help.
+- Fix: Colors of DVB subtitles with depth 2 broken due to a missing break.
+- Fix: CEA-708: Caption loss due to CW command (#991).
+- Fix: CEA-708: Update patch for windows priority with functions (#990).
+
+0.86 (2018-01-09)
+-----------------
+- New: Preliminary MXF support
+- New: Added a histogram in one-minute increments of the number of lines in a subtitle.
+- New: Added Autoconf build scripts for CCExtractor to generate makefiles (mac).
+- New: Added Autoconf build scripts for CCExtractor to generate makefiles (linux).
+- New: Added .rpm package generation script.
+- New: Added build/installation script for .pkg.tar.xz (Arch Linux).
+- New: Added tarball generation script.
+- New: Added --analyzevideo. If present the video stream will be processed even if the
+  subtitles are in a different stream. This is useful when we want video information
+  (resolution, frame type, etc). -vides now implies this option too.
+  [Note: Tentative - some possibly breaking changed were made for this, so if you
+  use it validate results]
+- New: Added a GUI in the main CCExtractor binary (separate from the external GUIs
+  such as CCExtractorGUI).
+- New: A Python binding extension so it's possible to use CCExtractor's tools from
+  Python.
+- New: Added -nospupngocr (don't OCR bitmaps when generating spupng, faster)
+- New: Add support for file split on keyframe (-segmentonkeyonly)
+- New: Added WebVTT output from Matroska.
+- New: Support for source-specific multicast.
+- New: FreeType-based text renderer (-out=spupng with teletext/EIA608).
+- New: Upgrade library UTF8proc
+- New: Upgrade library win_iconv
+- New: Upgrade library zlib
+- New: Upgrade library LibPNG
+- New: Support for Source-Specific Multicast
+- New: Added Travis CI support
+- New: Made error messages clearer, less ambiguous
+- Fix: Prevent the OCR being initialized more than once (happened on multiprogram and
+  PAT changes)
+- Fix: Makefiles, build scripts, etc... everything updated and corrected for all
+  platforms.
+ -Fix: Proper line ending for .srt files from bitmaps.
+- Fix: OCR corrections using grayscale before extracting texts.
+- Fix: End timestamps in transcripts from DVB.
+- Fix: Forcing -noru to cause deduplication in ISDB
+- Fix: TS: Skip NULL packets
+- Fix:  When NAL decoding fails, don't dump the whole decoded thing, limit to 160 bytes.
+- Fix: Modify Autoconf scripts to generate tarball for mac from `/package_creators/tarball.sh`
+  and include GUI files in tarball
+- Fix: Started work on libGPAC upgrade.
+- Fix: DVB subtitle not extracted if there's no display segment
+- Fix: Heap corruption in add_ocrtext2str
+- Fix: bug that caused -out=spupng sometimes crashes
+- Fix: Checks for text before newlines on DVB subtitles
+- Fix: OCR issue caused by separated dvb subtitle regions
+- Fix: DVB crash on specific condition (!rect->ocr_text)
+- Fix: DVB bug (Multiple-line subtitle; Missing last line)
+- Fix: --sentencecap for teletext samples
+- Fix: Crash when image passed into OCR is empty
+- Fix: Temporarily wrapped the Python API, not production ready yet
+- Fix: -delay option in DVB
+
+
+0.85b (2017-01-26)
+------------------
 - Fix: Base Windows binary (without OCR) compiled without DLL dependencies.

 0.85 (2017-01-23)
@@ -36,7 +347,7 @@

 0.84 (2016-12-16)
 -----------------
- New: In Windows, both with and without-OCR binaries are bundled, since the OCR one causes problems due to 
+- New: In Windows, both with and without-OCR binaries are bundled, since the OCR one causes problems due to
  dependencies in some system. So unless you need the OCR just use the non-OCR version.
 - New: Added -sbs (sentence by sentence) for DVB output. Each frame in the output file contains a complete
  sentence (experimental).
@@ -59,7 +370,7 @@
 - Fix: Added detail in many error messages.
 - Fix: Memory leaks in videos with XDS.
 - Fix: Makefile compatibility issues with Raspberry pi.
- Fix: missing separation between WebVTT header and body. 
+- Fix: missing separation between WebVTT header and body.
 - Fix: Stupid bug in M2TS that preventing it from working.
 - Fix: OCR libraries dependencies for the release version in Windows.
 - Fix: non-buffered reading from pipes.
@@ -106,7 +417,7 @@
 - Fix: Timing in -ucla
 - Fix: Timing in ISDB (some instances)
 - Fix: "mfra" mp4 box weight changed to 1 (this helps with correct file format detection)
- Fix: Fix for TARGET File is null. 
+- Fix: Fix for TARGET File is null.
 - Fix: Fixed SegFaults while parsing parameters (if mandatory parameter is not present in -outinterval, -codec or -nocodec)
 - Fix: Crash when input small is too small
 - Fix: Update some URLs in code (references to docs)
@@ -164,7 +475,7 @@
 - CCExtractor can be used as library if compiled using cmake
 - By default the Windows version adds BOM to generated UTF files (this is
  because it's needed to open the files correctly) while all other
-  builds don't add it (because it messes with text processing tools). 
+  builds don't add it (because it messes with text processing tools).
  You can use -bom and -nobom to change the behaviour.

 0.74 (2014-09-24)
@@ -203,7 +514,7 @@
 ------------------------
 This is the first release that is part of Google's Summer of Code.
 Anshul, Ruslan and Willem joined CCExtractor to work on a number of things
-over the summer, and their work is already reaching the mainstream 
+over the summer, and their work is already reaching the mainstream
 version of CCExtractor.

 - Added a huge dictionary submitted by Matt Stockard.
@@ -236,7 +547,7 @@ version of CCExtractor.
 		0000101 is the default setting for transcripts
 		1110101 is the default for timed transcripts
 		1111001 is the default setting for -ucla
-	Make sure you use this parameter after others that might affect these 
+	Make sure you use this parameter after others that might affect these
 	settings (-out, -ucla, -xds, -txt, -ttxt, ...)
 - Fixed Negative timing Bug

@@ -254,7 +565,7 @@ version of CCExtractor.
 - Started refactoring and clean-up.
 - Fix: MPEG clock rollover (happens each 26 hours) caused a time
  discontinuity.
- Windows GUI: Started work on HDHomeRun support. For now it just looks 
+- Windows GUI: Started work on HDHomeRun support. For now it just looks
  for HDHomeRun devices. Lots of other things will arrive in the next
  versions.
 - Windows GUI: Some code refactoring, since the HDHomeRun support makes
@@ -271,7 +582,7 @@ version of CCExtractor.
  a good test sample file...
 - Color and fonts in PAC commands were ignored, fixed (Helen Buus).
 - Added a new output format, spupng. It consists on one .png file
-  for each subtitle frame and one .xml with all the timing 
+  for each subtitle frame and one .xml with all the timing
  (Heleen Buus).
 - Some fixes (Chris Small).

@@ -293,12 +604,12 @@ version of CCExtractor.
 - Added -latin1 to select Latin 1 as encoding. Default is now
  UTF-8 (-utf8 still exists but it's not needed).
 - Added -ru1, which emulates a (non-existing in real life) 1 line
-  roll-up mode. 
+  roll-up mode.


 0.66 (2013-07-01)
 -----------------
- Fixed bug in auto detection code that triggered a message 
+- Fixed bug in auto detection code that triggered a message
  about file being auto of sync.
 - Added -investigate_packets
  The PMT is used to select the most promising elementary stream
@@ -307,39 +618,39 @@ version of CCExtractor.
  manually, in case the CC location is not obvious from the PMT
  contents. To assist looking for the right stream, the parameter
  "-investigate_packets" will have CCExtractor look inside each
-  stream, looking for CC markers, and report the streams that 
+  stream, looking for CC markers, and report the streams that
  are likely to contain CC data even if it can't be determined from
  their PMT entry.
 - Added -datastreamtype to manually selecting a stream based on
  its type instead of its PID. Useful if your recording program
-  always hides the caption under the stream type. 
+  always hides the caption under the stream type.
 - Added -streamtype so if an elementary stream is selected manually
-  for processing, the streamtype can be selected too. This can be 
-  needed if you process, for example a stream that is declared as 
+  for processing, the streamtype can be selected too. This can be
+  needed if you process, for example a stream that is declared as
  "private MPEG" in the PMT, so CCExtractor can't tell what it is.
  Usually you'll want -streamtype 2 (MPEG video) or -streamtype 6
  (MPEG private data).
 - PMT content listing improved, it now shows the stream type for
  more types.
- Fixes in roll-up, cursor was being moved to column 1 if a 
+- Fixes in roll-up, cursor was being moved to column 1 if a
  RU2, RU3 or RU4 was received even if already in roll-up mode.
- Added -autoprogram. If a multiprogram TS is processed and 
+- Added -autoprogram. If a multiprogram TS is processed and
  -autoprogram is used, CCExtractor will analyze all PMTs and use
  the first program that has a suitable data stream.
- Timed transcript (ttxt) now also exports the caption mode 
-  (roll-up, paint-on, etc.) next to each line, as it's useful to 
+- Timed transcript (ttxt) now also exports the caption mode
+  (roll-up, paint-on, etc.) next to each line, as it's useful to
  detect things like commercials.
 - Content Advisory information from XDS is now decoded if it's
-  transmitted in "US TV parental guidelines" or "MPA". 
-  Other encoding such as Canada's are not supported yet due 
+  transmitted in "US TV parental guidelines" or "MPA".
+  Other encoding such as Canada's are not supported yet due
  to lack of samples.
 - Copy Management information from XDS is now decoded.
 - Added -xds. If present and export format is timed transcript
  (only), XDS information will be saved to file (same file as the
  transcript, with XDS being clearly marked). Note that for now
-  all XDS data is exported even if it doesn't change, so the 
+  all XDS data is exported even if it doesn't change, so the
  transcript file will be significantly larger.
- Added some PaintOn support, at least enough to prevent it 
+- Added some PaintOn support, at least enough to prevent it
  from breaking things when the other modes are used.
 - Removed afd_data() warning. AFD doesn't carry any caption related
  data. AFD still detected in code in case we want to do something
@@ -357,21 +668,21 @@ version of CCExtractor.
 	   calculated distance, the maximum allowed distance, and whether
 	   the strings are ultimately considered equivalent or not, i.e.
 	   the calculated distance is less or equal than the max allowed.
-	  -levdistmincnt value: Minimum distance we always allow 
-	   regardless of the length of the strings. Default 2. This means 
-	   that if the calculated distance is 0, 1 or 2, we consider the 
+	  -levdistmincnt value: Minimum distance we always allow
+	   regardless of the length of the strings. Default 2. This means
+	   that if the calculated distance is 0, 1 or 2, we consider the
 	   strings to be equivalent.
-	  -levdistmaxpct value: Maximum distance we allow, as a 
-	   percentage of the shortest string length. Default 10%. For 
-	   example, consider a comparison of one string of 30 characters 
-	   and one of 60 characters. We want to determine whether the 
-	   first 30 characters of the longer string are more or less the 
-	   same as the shortest string, i.e. whether the longest string 
-	   is the shortest one plus new characters and maybe some 
-	   corrections. Since the shortest string is 30 characters and 
-	   the default percentage is 10%, we would allow a distance of 
+	  -levdistmaxpct value: Maximum distance we allow, as a
+	   percentage of the shortest string length. Default 10%. For
+	   example, consider a comparison of one string of 30 characters
+	   and one of 60 characters. We want to determine whether the
+	   first 30 characters of the longer string are more or less the
+	   same as the shortest string, i.e. whether the longest string
+	   is the shortest one plus new characters and maybe some
+	   corrections. Since the shortest string is 30 characters and
+	   the default percentage is 10%, we would allow a distance of
 	   up to 3 between the first 30 characters.
- Added -lf : Use UNIX line terminator (LF) instead of Windows (CRLF).	   
+- Added -lf : Use UNIX line terminator (LF) instead of Windows (CRLF).
 - Added -noautotimeref: Prevent UTC reference from being auto set from
  the stream data.

@@ -381,7 +692,7 @@ version of CCExtractor.
 - Added end timestamps in timed transcripts
 - Added support for SMPTE (patch by John Kemp)
 - Initial support for MPEG2 video tracks inside MP4 files (thanks a
-  lot to GPAC's Jean who assisted in analyzing the sample and 
+  lot to GPAC's Jean who assisted in analyzing the sample and
  doing the required changes in GPAC).
 - Improved MP4 auto detection
 - Support for PCR if PTS is not available (needed for some teletext
@@ -407,7 +718,7 @@ version of CCExtractor.
  data (bypassing detections).
 - Added -ru2 and -ru3 to limit the number of visible lines in roll-up
  captions (bypassing whatever the broadcast says).
- Added support for a .hex (hexadecimal) dump of data. 
+- Added support for a .hex (hexadecimal) dump of data.
 - Added support for wtv in Windows. This is done by using a new program
  (wtvccdump.exe) and a new DirectShow filter (CCExtractorDump.dll) that
  process the .wtv using DirecShow's filters and export the line 21 data
@@ -418,9 +729,9 @@ version of CCExtractor.
 0.63 (2012-08-17)
 -----------------
 - Telext support added, by integrating Petr Kutalek's telxcc. Integration is
-  still quite basic (there's equivalent code from both CCExtractor and 
-  telxcc) and some clean up is needed, but it works. Petr has announced that 
-  he's abandoning telxcc so further development will happen directly in 
+  still quite basic (there's equivalent code from both CCExtractor and
+  telxcc) and some clean up is needed, but it works. Petr has announced that
+  he's abandoning telxcc so further development will happen directly in
  CCExtractor.
 - Some bug fixes, as usual.

@@ -430,14 +741,14 @@ version of CCExtractor.
  Mac users that sent this.
 - Hauppauge mode now uses PES timing, needed for files that don't have
  caption data during all the video (such as in commercial breaks).
- Added -mp4 and -in:mp4 to force the input to be processed as MP4. 
+- Added -mp4 and -in:mp4 to force the input to be processed as MP4.
 - CC608 data embedded in a separate stream (as opposed as in the video
-  stream itself) in MP4 files is now supported (not heavily tested). 
+  stream itself) in MP4 files is now supported (not heavily tested).
  This should be rather useful since closed captioned files from iTunes
  use this format.
 - More CEA-708 work. The debugger is now able to dump the "TV" contents for
-  the first time. Also, a .srt can be generated, however timing is not quite 
-  good yet (still need to figure out why). 
+  the first time. Also, a .srt can be generated, however timing is not quite
+  good yet (still need to figure out why).
 - Added -svc (or --service) to select the CEA-708 services to be processed.
  For example, -svc 1,2 will process the primary and secondary language
  services. Valid values are 1-63, where 1 is the primary language, 2 is
@@ -452,9 +763,9 @@ version of CCExtractor.
 - Fix: GCC 3.4.4 can now build CCExtractor.
 - Fix: Damaged TS packets (those that come with 'error in transport' bit
  on) are now skipped.
- Fix: Part of the changes for MP4 support (CC packets buffering in 
-  particular) broke some stuff for other files, causing at least very 
-  annoying character duplication. We hope we've fixed it without breaking 
+- Fix: Part of the changes for MP4 support (CC packets buffering in
+  particular) broke some stuff for other files, causing at least very
+  annoying character duplication. We hope we've fixed it without breaking
  anything but please report).
 - Some non-interesting cleanup.

@@ -465,13 +776,13 @@ version of CCExtractor.
  code, the stream must be a file (no streaming), etc.
 - Fix: The Windows version was writing text files with double \r.
 - Fix: Closed captions blocks with no data could cause a crash.
- Fix: -noru (to generate files without duplicate lines in 
+- Fix: -noru (to generate files without duplicate lines in
  roll-up) was broken, with complete lines being missing.
- Fix: bin format not working as input. 
+- Fix: bin format not working as input.

 0.59 (2011-10-07)
 -----------------
- More AVC/H.264 work. pic_order_cnt_type != 0 will be processed now. 
+- More AVC/H.264 work. pic_order_cnt_type != 0 will be processed now.
 - Fix: Roll-up captions with interruptions for Text (with ResumeTextDisplay
  in the middle of the caption data) were missing complete lines.
 - Added a timed text transcript output format, probably only useful for
@@ -494,7 +805,7 @@ version of CCExtractor.
 - Added -stdout => If used, the captions will be sent to stdout (console)
  instead of file. Combined with -, CCExtractor can work as a filter in
  a larger process, receiving the stream from stdin and sending the
-  captions to stdout. 
+  captions to stdout.
 - Some code clean up, minor refactoring.
 - Teletext detection (not yet processing).

@@ -503,20 +814,20 @@ version of CCExtractor.
 - Implemented new PTS based mode to order the caption information
  of AVC/H.264 data streams.  The old pic_order_cnt_lsb based method
  is still available via the -poc or --usepicorder command switches.
- Removed a couple of those annoying "Impossible!" error messages 
+- Removed a couple of those annoying "Impossible!" error messages
  that appears when processing some (possibly broken, unsure) files.
- Added -nots --notypesettings to prevent italics and underline 
+- Added -nots --notypesettings to prevent italics and underline
  codes from being displayed.
- Note to those not liking the paragraph symbol being used for the 
+- Note to those not liking the paragraph symbol being used for the
  music note: Submit a VALID replacement in latin-1.
- Added preliminary support for multiple program TS files. The 
+- Added preliminary support for multiple program TS files. The
  parameter --program-number (or -pn) will let you choose which
-  program number to process. If no number is passed and the TS 
+  program number to process. If no number is passed and the TS
  file contains more than one, CCExtractor will display a list of
  found programs and terminate.
 - Added support (basic, because I only received one sample) for some
  Hauppauge cards that save CC data in their own format. Use the
-  parameter -haup to enable it (CCExtractor will display a notice 
+  parameter -haup to enable it (CCExtractor will display a notice
  if it thinks that it's processing a Hauppauge capture anyway).
 - Fixed bug in roll-up.
 - More AVC work, now TS files from echostar that provided garbled
@@ -526,7 +837,7 @@ version of CCExtractor.
 0.57 (2010-12-16)
 -----------------
 - Bug fixes in the Windows version. Some debug code was unintentionally
-  left in the released version. 
+  left in the released version.

 0.56 (2010-12-09)
 -----------------
@@ -543,10 +854,10 @@ version of CCExtractor.
 - Start implementation of EIA-708 decoding (not active yet).
 - Add -gt / --goptime switch to use GOP timing instead of PTS timing.
 - Start implementation of AVC/H.264 decoding (not active yet).
- Fixed: The basic problem is that when 24fps movie film gets converted to 30fps NTSC 
-  they repeat every 4th frame. Some pics have 3 fields of CC data with field 3 CC data 
-  belongs to the same channel as field 1. The following pics have the fields reversed 
-  because of the odd number of fields. I used top_field_first to tell when the channels 
+- Fixed: The basic problem is that when 24fps movie film gets converted to 30fps NTSC
+  they repeat every 4th frame. Some pics have 3 fields of CC data with field 3 CC data
+  belongs to the same channel as field 1. The following pics have the fields reversed
+  because of the odd number of fields. I used top_field_first to tell when the channels
  are reversed. See Table 6-1 of the SCTE 20 [Paul Fernquist]

 0.54 (2009-04-16)
@@ -556,9 +867,9 @@ version of CCExtractor.
 - Improve synchronization of captions for source files with
  jumps in their time information or gaps in the caption
  information.
- [R. Abarca] Changed Mac script, it now compiles/link 
-  everything from the /src directory. 
- It's now possible to have CCExtractor add credits 
+- [R. Abarca] Changed Mac script, it now compiles/link
+  everything from the /src directory.
+- It's now possible to have CCExtractor add credits
  automatically.
 - Added a feature to add start and end messages (for credits).
  See help screen for details.
@@ -579,13 +890,13 @@ version of CCExtractor.
  for Raw Captions With Time). This new format
  allows one file to contain all the available
  closed caption data instead of just one stream.
- Added --no_progress_bar to disable status 
+- Added --no_progress_bar to disable status
  information (mostly used when debugging, as the
  progress information is annoying in the middle
  of debug logs).
- The Windows GUI was reported to freeze in some 
+- The Windows GUI was reported to freeze in some
  conditions. Fixed.
- The Windows GUI is now targeted for .NET 2.0 
+- The Windows GUI is now targeted for .NET 2.0
  instead of 3.5. This allows Windows 2000 to run
  it (there's not .NET 3.5 for Windows 2000), as
  requested by a couple of key users.
@@ -593,17 +904,17 @@ version of CCExtractor.
 0.51 (unreleased)
 -----------------
 - Removed -autopad and -goppad, no longer needed.
- In preparation to a new binary format we have 
-  renamed the current .bin to .raw. Raw files 
+- In preparation to a new binary format we have
+  renamed the current .bin to .raw. Raw files
  have only CC data (with no header, timing, etc.).
 - The input file format (when forced) is now
-  specified with 
+  specified with
    	-in=format
  such as -in=ts, -in=raw, -in=ps ...
  The old switches (-ts, -ps, etc.) still work.
  The only exception is -bin which has been removed
  (reserved for the new binary format). Use
-  -in=raw to process a raw file. 
+  -in=raw to process a raw file.
 - Removed -d, which when produced a raw file used
  a DVD format. This has been merged into a new
  output type "dvdraw". So now instead of using
@@ -612,7 +923,7 @@ version of CCExtractor.
 - Removed --noff
 - Added gui_mode_reports for frontend communications,
  see related file.
- Windows GUI rewritten. Source code now included, 
+- Windows GUI rewritten. Source code now included,
  too.
 - [Volker] Dish Network clean-up

@@ -625,12 +936,12 @@ version of CCExtractor.
 0.49 (2008-12-10)
 -----------------
 - [Volker] Major MPEG parser rework. Code much
-  cleaner now. 
+  cleaner now.
 - Some stations transmit broken roll-up captions,
  and for some reason don't send CRs but RUs...
  Added work-around code to make captions readable.
 - Started work on EIA-708 (DTV). Right now you can
-  add -debug-708 to get a dump of the 708 data. 
+  add -debug-708 to get a dump of the 708 data.
  An actually useful decoder will come soon.
 - Some of the changes MIGHT HAVE BROKEN MythTV's
  code. I don't use MythTV myself so I rely on
@@ -646,9 +957,9 @@ version of CCExtractor.
  can now process files that are being recorded
  at the same time.
  
- [Volker] Added a new DVR-MS loop - this is 
+- [Volker] Added a new DVR-MS loop - this is
  completely new, DVR-MS specific code, so we no
-  longer use the generic MPEG code for DVR-MS. 
+  longer use the generic MPEG code for DVR-MS.
  DVR-MS should (or will be eventually at least)
  be as reliable as TS.
  Note: For now, it's only ATSC recordings, not
@@ -667,11 +978,11 @@ version of CCExtractor.
  new options.
 - Added      -lg --largegops
  From the help screen:
-  Each Group-of-Picture comes with timing 
-  information. When this info is too separate 
-  (for example because there are a lot of 
-  frames in a GOP) ccextractor may prefer not 
-  to use GOP timing. Use this option is you 
+  Each Group-of-Picture comes with timing
+  information. When this info is too separate
+  (for example because there are a lot of
+  frames in a GOP) ccextractor may prefer not
+  to use GOP timing. Use this option is you
  need ccextractor to use GOP timing in large
  GOPs.

@@ -690,8 +1001,8 @@ version of CCExtractor.
 0.43 (2008-06-20)
 -----------------
 - Fixed a bug in the read loop (no less)
-  that caused some files to fail when 
-  reading without buffering (which is 
+  that caused some files to fail when
+  reading without buffering (which is
  the default in the Linux build).
 - Several improvements in the GUI, such as
  saving current options as default.
@@ -708,8 +1019,8 @@ version of CCExtractor.
 -----------------
 - Default output is now .srt instead of .bin,
  use -raw if you need the data dump instead of
-  .srt. 
- Added -trim, which removes blank spaces at 
+  .srt.
+- Added -trim, which removes blank spaces at
  the left and rights of each line in .srt.
  Note that those spaces are there to help
  deaf people know if the person talking is
@@ -719,8 +1030,8 @@ version of CCExtractor.

 0.40 (2008-05-20)
 -----------------
- Fixed a bug in the sanity check function 
-  that caused the Myth branch to abort. 
+- Fixed a bug in the sanity check function
+  that caused the Myth branch to abort.
 - Fixed the OSX build script, it needed a
  new #define to work.

@@ -730,30 +1041,30 @@ version of CCExtractor.
  have no time information. Also, if in roll-up
  mode there will be no repeated lines.
 - Lots of changes in the MPEG parser, most of
-  them submitted by Volker Quetschke. 
+  them submitted by Volker Quetschke.
 - Fixed a bug in the CC decoder that could cause
  the first line not to be cleared in roll-up
-  mode. 
+  mode.
 - CCExtractor can now follow number sequences in
  file names, by suffixing the name with +.
  For example,
  
-  DVD0001.VOB+ 
+  DVD0001.VOB+

  means DVD0001.VOB, DVD0002.VOB, etc. This works
  for all files, so part001.ts+ does what you
  could expect.
 - Added -90090 which changes the clock frequency
-  from the MPEG standard 90000 to 90090. It 
+  from the MPEG standard 90000 to 90090. It
  *could* (remains to be seen) help if there are
-  timing issues. 
+  timing issues.
 - Better support for Tivo files.
 - By default ccextractor now considers the whole
  input file list a one large file, instead of
  several, independent, video files. This has
  been changed because most programs (for example
-  DVDDecrypt) just cut the files by size. 
-  If you need the old behaviour (because you 
+  DVDDecrypt) just cut the files by size.
+  If you need the old behaviour (because you
  actually edited the video files and want to
  join the subs), use -ve.

@@ -771,7 +1082,7 @@ version of CCExtractor.
  that have been added because old behaviour was
  annoying to most people: _1 and _2 at the end
  of the output file names is now added ONLY if
-  -12 is used (i.e. when there are two output 
+  -12 is used (i.e. when there are two output
  files to produce). So

  ccextractor -srt sopranos.mpg
@@ -832,7 +1143,7 @@ version of CCExtractor.
  Alan
  Tony

-  So you get 
+  So you get

             You better respect
             this robe, Alan.
@@ -841,7 +1152,7 @@ version of CCExtractor.
  have a different spelling file per TV
  show, or a large file with a lot of
  words, etc.
- ccextractor has been reported to 
+- ccextractor has been reported to
  compile and run on Mac with a minor
  change in the build script, so I've
  created a mac directory with the
@@ -855,17 +1166,17 @@ version of CCExtractor.
 -----------------
 - Added -scr or --screenfuls, to select the
  number of screenfuls ccextractor should
-  write before exiting. A screenful is 
+  write before exiting. A screenful is
  a change of screen contents caused by
  a CC command (not new characters). In
  practice, this means that for .srt each
  group of lines is a screenful, except when
-  using -dru (which produces a lot of 
+  using -dru (which produces a lot of
  groups of lines because each new character
  produces a new group).
 - Completed tables for all encodings.
 - Fixed bug in .srt related to milliseconds
-  in time lines. 
+  in time lines.
 - Font colors are back for .srt (apparently
  some programs do support them after all).
  Use -nofc or --nofontcolor if you don't
@@ -874,7 +1185,7 @@ version of CCExtractor.
 0.32 (unreleased)
 -----------------
 - Added -delay ms, which adds (or subtracts)
-  a number of milliseconds to all times in 
+  a number of milliseconds to all times in
  .srt/.sami files. For example,
  
         -delay 400
@@ -905,8 +1216,8 @@ version of CCExtractor.
 - Fix in extended char decoding, I wasn't
  replacing the previous char.
 - When a sequence code was found before
-  having a PTS, reported time was 
-  undefined. 
+  having a PTS, reported time was
+  undefined.

 0.29 (unreleased)
 -----------------
@@ -931,7 +1242,7 @@ version of CCExtractor.
 0.26 (unreleased)
 -----------------
 - Added -gp (or -goppad) to make ccextractor use
-  GOP timing. Try it for non TS files where 
+  GOP timing. Try it for non TS files where
  subs start OK but desync as the video advances.

 0.25 (unreleased)
@@ -940,7 +1251,7 @@ version of CCExtractor.
  -nomyth to prevent the MytvTV code path to be
  called. I've seen apparently correct files that
  make MythTV's MPEG decoder to choke. So, if it
-  doesn't work correctly automatically: Try 
+  doesn't work correctly automatically: Try
  -nomyth and -myth. Hopefully one of the two
  options will work.

@@ -953,7 +1264,7 @@ version of CCExtractor.
 - Reworked input buffer code, faster now.
 - Completed MythTV's MPEG decoder for Program Streams,
  which results in better processing of some specific
-  files. 
+  files.
 - Automatic file format detection for all kind of
  files and closed caption storage method. No need to
  tell ccextractor anything about your file (but you
@@ -962,10 +1273,10 @@ version of CCExtractor.

 0.22 (2007-05-15)
 -----------------
- Added text mode handling into decoder, which gets rids 
+- Added text mode handling into decoder, which gets rids
  of junk when text mode data is present.
 - Added support for certain (possibly non standard
-  compliant) DVDs that add more captions block in a 
+  compliant) DVDs that add more captions block in a
  user data block than they should (such as Red October).
 - Fix in roll-up init code that caused the previous popup
  captions not to be written to disk.
@@ -976,13 +1287,13 @@ version of CCExtractor.
 -----------------
 - Unicode should be decent now.
 - Added support for Hauppauge PVR 250 cards, and (possibly)
-  many others (bttv) with the same closed caption recording 
+  many others (bttv) with the same closed caption recording
  format.
  This is the result of hacking MythTV's MPEG parser into
  CCExtractor. Integration is not very good (to put it
  midly) but it seems to work. Depending on the feedback I
  may continue working on this or just leave it 'as it'
-  (good enough). 
+  (good enough).
  If you want to process a file generated by one of these
  analog cards, use -myth. This is essential as it will
  make the program take a totally different code path.
@@ -992,10 +1303,10 @@ version of CCExtractor.

 0.19 (2007-05-03)
 -----------------
- Work on Dish Network streams, timing was completely broken. 
+- Work on Dish Network streams, timing was completely broken.
  It's fixed now at least for the samples I have, if it's not
  completely fixed let me know. Credit for this goes to
-  Jack Ha who sent me a couple of samples and a first 
+  Jack Ha who sent me a couple of samples and a first
  implementation of a semi working-fix.
 - Added support for several input files (see help screen for
  details).
@@ -1032,4 +1343,3 @@ version of CCExtractor.
 - Added video information (as extracted from sequence header).
 - Some code clean-up.
 - FF sanity check enabled by default.
-
--- a/docs/COMPILATION.MD
+++ b/docs/COMPILATION.MD
@@ -0,0 +1,341 @@
+# Installation
+
+## Homebrew 
+The easiest way to install CCExtractor for Mac and Linux is through Homebrew:
+
+```bash
+brew install ccextractor
+```
+Note: If you don't have Homebrew installed, see [brew.sh](https://brew.sh/)
+ for installation instructions.
+
+---
+
+# Compiling CCExtractor
+
+You may compile CCExtractor across all major platforms using `CMakeLists.txt` stored under `ccextractor/src/` directory. Autoconf and custom build scripts are also available. See platform specific instructions in the below sections.
+
+Downloads for precompiled binaries and source code can be found [on our website](https://www.ccextractor.org?id=public:general:downloads).
+
+Clone the latest repository from Github
+
+```bash
+git clone https://github.com/CCExtractor/ccextractor.git
+```
+
+### Hardsubx (Burned-in Subtitles) and FFmpeg Versions
+
+CCExtractor's hardsubx feature extracts burned-in subtitles from videos using OCR. It requires FFmpeg libraries. The build system automatically selects appropriate FFmpeg versions for each platform:
+
+- **Linux**: FFmpeg 6.x (default)
+- **Windows**: FFmpeg 6.x (default)
+- **macOS**: FFmpeg 8.x (default)
+
+You can override the default by setting the `FFMPEG_VERSION` environment variable to `ffmpeg6`, `ffmpeg7`, or `ffmpeg8` before building. This flexibility ensures compatibility with different FFmpeg installations across platforms.
+
+## Docker
+You can now use docker image to build latest source of CCExtractor without any environmental hustle. Follow these [instructions](https://github.com/CCExtractor/ccextractor/tree/master/docker/README.md) for building docker image & usage of it.
+
+## Linux
+
+1. Make sure all the dependencies are met.
+
+Debian:
+
+```bash
+sudo apt-get install -y libgpac-dev libglew-dev libglfw3-dev cmake gcc libcurl4-gnutls-dev tesseract-ocr libtesseract-dev libleptonica-dev clang libclang-dev
+```
+
+RHEL/Fedora:
+
+```bash
+yum install -y glew-devel glfw-devel cmake gcc libcurl-devel tesseract-devel leptonica-devel clang gpac-devel
+```
+
+Arch:
+```bash
+sudo paru -S glew glfw curl tesseract leptonica cmake gcc clang gpac
+```
+or
+```bash
+sudo pacman -S glew glfw curl tesseract leptonica cmake gcc clang gpac
+```
+
+Rust 1.54 or above is also required. [Install Rust](https://www.rust-lang.org/tools/install). Check specific compilation methods below, on how to compile without rust.
+
+**Note:** On Ubuntu Version 23.10 (Mantic) and later, `libgpac-dev` isn't available, you should build gpac from source by following the easy build instructions [here](https://github.com/gpac/gpac/wiki/GPAC-Build-Guide-for-Linux)
+
+**Note:** On Ubuntu Version 18.04 (Bionic) and later, `libtesseract-dev` is installed rather than `tesseract-ocr-dev`, which does not exist anymore.
+
+**Note:** On Ubuntu Version 14.04 (Trusty) and earlier, you should build leptonica and tesseract from source
+
+2. Compiling
+
+### Using the build script
+
+By default build script does not include debugging information hence, you cannot debug the executable produced (i.e. `./ccextractor`) on a debugger. To include debugging information, use the `builddebug` script.
+
+```bash
+# navigate to linux directory and call the build script
+
+cd ccextractor/linux
+
+# compile without debug flags
+./build
+
+# compile with debug info
+./build -debug            # same as ./builddebug
+
+# compile with hardsubx (burned-in subtitle extraction)
+# Hardsubx requires FFmpeg libraries. Different FFmpeg versions are used by default:
+#   - Linux: FFmpeg 6.x (automatic)
+#   - Windows: FFmpeg 6.x (automatic)
+#   - macOS: FFmpeg 8.x (automatic)
+
+./build -hardsubx         # uses platform-specific FFmpeg version
+
+# To override the default FFmpeg version, set FFMPEG_VERSION:
+FFMPEG_VERSION=ffmpeg8 ./build -hardsubx  # force FFmpeg 8 on any platform
+FFMPEG_VERSION=ffmpeg6 ./build -hardsubx  # force FFmpeg 6 on any platform
+FFMPEG_VERSION=ffmpeg7 ./build -hardsubx  # force FFmpeg 7 on any platform
+
+# [Optional] For custom FFmpeg installations, set these environment variables:
+FFMPEG_INCLUDE_DIR=/usr/include 
+FFMPEG_PKG_CONFIG_PATH=/usr/lib/pkgconfig
+
+
+# test your build
+./ccextractor
+```
+
+### Standard linux compilation through Autoconf scripts
+
+```bash
+sudo apt-get install autoconf  # dependency to generate configuration script
+cd ccextractor/linux
+./autogen.sh
+./configure
+make
+
+# test your build
+./ccextractor
+
+# make build systemwide
+sudo make install
+```
+
+### Using CMake
+
+```bash
+# create and navigate to directory where you want to store built files
+cd ccextractor/
+mkdir build
+cd build
+
+# generate makefile using cmake and then compile
+cmake ../src/  # options here
+make
+
+# test your build
+./ccextractor
+
+# make build systemwide
+sudo make install
+```
+
+`cmake` also accepts the options:
+     `-DWITH_OCR=ON` to enable OCR
+     `-DWITH_HARDSUBX=ON` to enable burned-in subtitles (requires FFmpeg)
+     
+For hardsubx with specific FFmpeg versions:
+     Set `FFMPEG_VERSION=ffmpeg6` for FFmpeg 6.x (default on Linux and Windows)
+     Set `FFMPEG_VERSION=ffmpeg7` for FFmpeg 7.x  
+     Set `FFMPEG_VERSION=ffmpeg8` for FFmpeg 8.x
+     (Defaults: Linux=FFmpeg 6, Windows=FFmpeg 6, macOS=FFmpeg 8)
+
+([OPTIONAL] For custom FFmpeg installations, set these environment variables)
+
+     FFMPEG_INCLUDE_DIR=/usr/include 
+     FFMPEG_PKG_CONFIG_PATH=/usr/lib/pkgconfig
+
+### Compiling with GUI
+
+The GUI for CCExtractor has been moved to a separate repository ([https://github.com/CCExtractor/ccextractorfluttergui](https://github.com/CCExtractor/ccextractorfluttergui)).
+
+## macOS
+
+1. Make sure all the dependencies are met. Decide if you want OCR; if so, you'll need to install tesseract and leptonica.
+Dependencies can be installed via Homebrew as:
+
+```bash
+brew install pkg-config
+brew install autoconf automake libtool
+brew install cmake gpac
+# optional if you want OCR:
+brew install tesseract
+brew install leptonica
+# optional if you want hardsubx (burned-in subtitle extraction):
+brew install ffmpeg
+```
+
+If configuring OCR, use pkg-config to verify tesseract and leptonica dependencies, e.g.
+
+```bash
+pkg-config --exists --print-errors tesseract
+pkg-config --exists --print-errors lept
+```
+
+### Compiling
+
+#### Using build.command script:
+
+```bash
+cd ccextractor/mac
+./build.command              # basic build
+./build.command -ocr         # build with OCR support
+./build.command -hardsubx    # build with hardsubx (uses FFmpeg 8 by default on macOS)
+
+# Override FFmpeg version if needed:
+FFMPEG_VERSION=ffmpeg7 ./build.command -hardsubx
+
+# test your build
+./ccextractor
+```
+
+#### Using CMake
+
+```bash
+# create and navigate to directory where you want to store built files
+cd ccextractor/
+mkdir build
+cd build
+
+# generate makefile using cmake and then compile
+cmake ../src/  # options here
+make
+
+# test your build
+./ccextractor
+```
+
+`cmake` also accepts the options:
+     `-DWITH_OCR=ON` to enable OCR
+     `-DWITH_HARDSUBX=ON` to enable burned-in subtitles
+
+#### Standard compilation through Autoconf scripts:
+
+```bash
+cd ccextractor/mac
+./autogen.sh
+./configure         
+make
+
+# test your build
+./ccextractor
+```
+
+#### Compiling with GUI:
+
+The GUI for CCExtractor has been moved to a separate repository ([https://github.com/CCExtractor/ccextractorfluttergui](https://github.com/CCExtractor/ccextractorfluttergui)).
+
+## Windows
+Dependencies are clang and rust. To enable OCR, rust x86_64-pc-windows-msvc or i686-pc-windows-msvc target should be installed
+
+GPAC is also required, you can install it through chocolatey:
+```
+choco install gpac
+```
+
+Other dependencies are required through vcpkg, so you can follow below steps:
+1. Download vcpkg (prefer version `2023.02.24` as it is supported)
+2. Integrate vcpkg into your system, run the below command in the downloaded vcpkg folder:
+     ```
+     vcpkg integrate install
+     ```
+3. Set Environment Variable for Vcpkg triplet, you can choose between x86 or x64 based on your system.
+     ```
+     setx VCPKG_DEFAULT_TRIPLET "x64-windows-static"
+     setx RUSTFLAGS "-Ctarget-feature=+crt-static"
+     ```
+4. Install dependencies from vcpkg
+
+     In this step we are using `x64-windows-static` triplet, but you will have to use the triplet you set in Step 3
+
+     if building Debug-Full, Release-Full (HardSubx)
+     ```
+     vcpkg install ffmpeg leptonica tesseract --triplet x64-windows-static
+     ```
+     Note: Windows builds use FFmpeg 6 by default. To override:
+     ```
+     set FFMPEG_VERSION=ffmpeg8
+     msbuild ccextractor.sln /p:Configuration=Debug-Full /p:Platform=x64
+     ```
+     
+     otherwise if you have Debug, Release
+     ```
+     vcpkg install libpng --triplet x64-windows-static
+     ```
+
+Note: Following screenshots and steps are based on Visual Studio 2017, but they should be more or less same for other versions.
+
+1.Open `windows/` directory to locate `ccextractor.vcxproj` and `ccextractor.sln` (red arrow).
+
+![Project Files](img/projectFiles.png)
+
+2.Accept the security prompt (if any), to proceed with compilation.
+![A warning you can receive](img/Warning.png)
+
+3.Using Visual Studio (2015 or above), open ccextractor.sln. This will build both CCExtractor and its GUI. To build them separately, open the respective .vcxproj file.
+
+4.In Solution Explorer, you'll see two projects with the VS version and Windows release version in parenthesis.  Change them to parameters which are true for you by clicking right mouse button on project and selecting properties.
+
+![Project Section](img/ProjectSection.png)
+
+![Properties, that you have to change](img/Properties.png)
+
+5.Right click and select `build` to compile the project and generate executable file.
+
+![Building button](img/Building.png)
+
+6.Find the executable file in `Debug` or `Release` folder, based on selected configuration.
+
+![Path to Binaries](img/Binaries.png)
+
+Configurations options are: `(Debug|Release)-Full`
+
+Configurations options include dependent libraries which are used for OCR.
+
+### Using CMake
+
+You may also generate `.sln` files for Visual Studio and build using build tools, or open `.sln` files using Visual Studio.
+
+```bash
+cmake ../src/ -G "Visual Studio 14 2015"
+cmake --build . --config Release --ccextractor
+```
+
+### Using MSBuild
+
+Run the following command in `windows/` directory
+
+```bash
+msbuild ccextractor.sln /p:Configuration=Release /p:Platform=x64
+```
+Different configuration options are,
+
+| Configuration | Platform | Rust target required |
+| ------------- |:-------------:| -----:|
+| Release | x64 | default |
+| Debug | x64 | default |
+| Release-Full(OCR) | Win32 | i686-pc-windows-msvc |
+| Debug-Full(OCR) | Win32 | i686-pc-windows-msvc |
+
+## Building Installation Packages
+
+### Arch Linux
+
+Go to the package_creators folder using `cd` and run the `./arch.sh`
+
+### Redhat Package Manager (rpm) based Linux Distributions
+
+Go to the package_creators folder using `cd` and run the `./rpm.sh`
--- a/docs/FFMPEG.TXT
+++ b/docs/FFMPEG.TXT
@@ -1,58 +0,0 @@
-Overview
-========
-FFmpeg Integration was done to support multiple encapsulations.
-
-Dependency
-=========
-FFmpeg library's
-
-Download and Install FFmpeg on your Linux pc.
---------------------------------------------
-
-Download latest source code from following link
-https://ffmpeg.org/download.html
-
-then following command to install ffmpeg
-./configure && make && make install
-
-Note:If you installed ffmpeg on non-standard location, please change/update your
-	 environment variable $PATH and $LD_LIBRARY_PATH
-
-Download and Install FFmpeg on your Windows pc.
----------------------------------------------
-Download prebuild library from following link
-http://ffmpeg.zeranoe.com/builds/
-
-You need to download Shared Versions to run the program and Dev Versions to compile.
-
-How to compile ccextractor
-==========================
-
-In Linux
--------
-make ENABLE_FFMPEG=yes
-
-On Windows
----------
-put the path of libs/include of ffmpeg library in library paths.
-Step 1) In visual studio 2013 right click <Project> and select property.
-Step 2) Select Configuration properties in left panel(column) of property.
-Step 3) Select VC++ Directory.
-Step 4) In the right pane, in the right-hand column of the VC++ Directory property,
-        open the drop-down menu and choose Edit.
-Step 5) Add path of Directory where you have kept uncompressed library of FFmpeg.
-
-
-Set preprocessor flag ENABLE_FFMPEG=1
-Step 1) In visual studio 2013 right click <Project> and select property.
-Step 2) In the left panel, select Configuration Properties, C/C++, Preprocessor.
-Step 3) In the right panel, in the right-hand column of the Preprocessor Definitions property, open the drop-down menu and choose Edit.
-Step 4) In the Preprocessor Definitions dialog box, add ENABLE_FFMPEG=1. Choose OK to save your changes.
-
-Add library in linker
-Step 1) Open property of project
-Step 2) Select Configuration properties
-Step 3) Select Linker in left panel(column)
-Step 4) Select Input
-Step 5) Select Additional dependencies in right panel
-Step 6) Add all FFmpeg's lib in new line
--- a/docs/FFMPEG.md
+++ b/docs/FFMPEG.md
@@ -0,0 +1,48 @@
+# Overview
+
+FFmpeg Integration was done to support multiple encapsulations.
+
+## Dependencies
+FFmpeg libraries
+
+### Download and Install FFmpeg on your Linux pc:
+Download latest source code from following link
+https://ffmpeg.org/download.html
+
+Then following command to install ffmpeg:
+`./configure && make && make install`
+
+Note:If you installed ffmpeg on non-standard location, please change/update your
+	 environment variable `$PATH` and `$LD_LIBRARY_PATH`
+
+### Download and Install FFmpeg on your Windows pc:
+1. Download vcpkg (prefer version `2023.02.24` as it is supported)
+2. Integrate vcpkg into your system, run the below command in the downloaded vcpkg folder:
+	```
+	vcpkg integrate install
+	```
+3. Set Environment Variable for Vcpkg triplet, you can choose between x86 or x64 based on your system.
+	```
+	setx VCPKG_DEFAULT_TRIPLET "x64-windows-static"
+	setx RUSTFLAGS "-Ctarget-feature=+crt-static"
+	```
+4. Install ffmpeg from vcpkg
+
+
+	In this step we are using `x64-windows-static` triplet, but you will have to use the triplet you set in Step 3
+
+	```
+	vcpkg install ffmpeg --triplet x64-windows-static
+	```
+
+## How to compile ccextractor
+
+### On Linux:
+`make ENABLE_FFMPEG=yes`
+
+### On Windows:
+#### Set preprocessor flag `ENABLE_FFMPEG=1`
+1. In visual studio 2022 right click <Project> and select property.
+2. In the left panel, select Configuration Properties, C/C++, Preprocessor.
+3. In the right panel, in the right-hand column of the Preprocessor Definitions property, open the drop-down menu and choose Edit.
+4. In the Preprocessor Definitions dialog box, add `ENABLE_FFMPEG=1`. Choose OK to save your changes.
--- a/docs/HARDSUBX.txt
+++ b/docs/HARDSUBX.txt
@@ -20,6 +20,10 @@ Linux
 Make sure Tesseract, Leptonica and FFMPeg are installed, and that their libraries can be found using pkg-config.
 Refer to OCR.txt for installation details.

+FFmpeg from packages (on Debian) plus a couple of other dependencies you will need:
+sudo apt-get install libavcodec-dev libavformat-dev libavutil-dev libswscale-dev libxcb-shm0-dev liblzma-dev 
+
+FFmpeg from source:
 To install FFmpeg (libav), follow the steps at:-
 https://trac.ffmpeg.org/wiki/CompilationGuide/Ubuntu - For Ubuntu, Debian and Linux Mint
 https://trac.ffmpeg.org/wiki/CompilationGuide/Generic - For generic Linux compilation
@@ -36,11 +40,46 @@ pkg-config --libs libswscale

 On success, you should see the correct include directory path and the linker flags.

-To build the program with hardsubx support, from the Linux directory run:-
-make ENABLE_HARDSUBX=yes
+To build the program with hardsubx support, 
+
+== from the Linux directory run:-
+    ./configure --enable-hardsubx
+    make ENABLE_HARDSUBX=yes
+
+== using cmake from root directory
+    mkdir build
+    cd build
+    cmake -DWITH_OCR=on -DWITH_HARDSUBX=on ../src/
+    make

 NOTE: The build has been tested with FFMpeg version 3.1.0, and Tesseract 3.04.

+macOS
+-----
+
+Install the required dependencies using Homebrew:
+    brew install tesseract leptonica ffmpeg
+
+To build the program with hardsubx support, use one of these methods:
+
+== Using build.command (Recommended):
+    cd ccextractor/mac
+    ./build.command -hardsubx
+
+== Using autoconf:
+    cd ccextractor/mac
+    ./autogen.sh
+    ./configure --enable-hardsubx --enable-ocr
+    make
+
+== Using cmake:
+    cd ccextractor
+    mkdir build && cd build
+    cmake -DWITH_OCR=ON -DWITH_HARDSUBX=ON ../src/
+    make
+
+NOTE: The -hardsubx parameter uses a single dash (not --hardsubx).
+
 Windows
 -------

--- a/docs/MAILINGLIST.TXT
+++ b/docs/MAILINGLIST.TXT
@@ -1,17 +1,9 @@
-A mailing list is now available from sourceforge:
+A mailing list is now available from google groups:
+
+https://groups.google.com/forum/#!forum/ccextractor-dev
+
+The old one, hosted in sourceforge, is discontinued, but here is the link just in case:

 https://lists.sourceforge.net/lists/listinfo/ccextractor-users

-I expect it to be very low traffic (right now there's around 10
-people actively helping with CCExtractor in one way or
-another), so almost everything goes here:
-
- Bug reports
- Feature requests
- Announcements
-
-NOT here:
-
- Samples 
-

--- a/docs/OCR.md
+++ b/docs/OCR.md
@@ -0,0 +1,123 @@
+# Overview
+OCR (Optical Character Recognition) is a technique used to 
+extract text from images. In the World of Subtitle, subtitle stored 
+in bitmap format are common and even necessary. For converting subtitle 
+in bitmap format to subtitle in text format OCR is used.
+
+# Dependency
+1. Tesseract (OCR library by Google)
+2. Leptonica (Image processing library)
+
+# How to compile CCExtractor on Linux with OCR
+
+## Install Dependency
+
+### Using package manager 
+#### Ubuntu, Debian
+```
+sudo apt-get install libleptonica-dev libtesseract-dev tesseract-ocr-eng
+```
+#### Suse
+```
+zypper install leptonica-devel
+```
+
+### Downloading source code and compiling it.
+
+#### Leptonnica.
+This package is available in your distro, you need liblept-devel library.
+
+If Leptonica isn't available for your distribution, or you want to use a newer version
+ than they offer, you can compile your own.
+
+you can download lib leptonica source code from  http://www.leptonica.com/download.html
+
+#### Tesseract.
+Tesseract is available directly from many Linux distributions. The package is generally
+ called 'tesseract' or 'tesseract-ocr' - search your distribution's repositories to
+ find it. Packages are also generally available for language training data (search the
+ repositories,) but if not you will need to download the appropriate training data,
+ unpack it, and copy the .traineddata file into the 'tessdata' directory, probably
+ /usr/share/tesseract-ocr/tessdata or /usr/share/tessdata.
+
+If Tesseract isn't available for your distribution, or you want to use a newer version
+ than they offer, you can compile your own.
+
+If you compile Tesseract then following command in its source code are enough
+```
+./autogen.sh
+./configure
+make
+sudo make install
+sudo ldconfig
+```
+
+Note: 
+1. CCExtractor is tested with Tesseract 3.04 version but it works with older versions. 
+2. Useful Download links:
+    1. *Tesseract*  https://github.com/tesseract-ocr/tesseract/archive/3.04.00.tar.gz
+    2. *Tesseract training data* https://github.com/tesseract-ocr/tessdata/archive/3.04.00.tar.gz
+
+
+##Compilation
+
+###using Build script
+```
+cd ccextractor/linux
+./build
+```
+
+### Passing flags to configure
+```
+cd ccextractor/linux
+./autogen.sh
+./configure --with-gui --enable-ocr
+make
+```
+
+### Passing flags to cmake
+```
+cd <CCExrtactor cloned code>
+mkdir build
+cd build
+cmake -DWITH_OCR=ON ../src
+make
+```
+
+
+
+How to compile CCExtractor on Windows with OCR
+===============================================
+
+Download prebuild library of leptonica and tesseract from following link  
+https://drive.google.com/file/d/0B2ou7ZfB-2nZOTRtc3hJMHBtUFk/view?usp=sharing  
+
+put the path of libs/include of leptonica and tesseract in library paths.  
+1. In visual studio 2022 right click <Project> and select property.
+2. Select Configuration properties in left panel(column) of property.
+3. Select VC++ Directory.
+4. In the right pane, in the right-hand column of the VC++ Directory property, open the drop-down menu and choose Edit.
+5. Add path of Directory where you have kept uncompressed library of leptonica and tesseract.
+
+
+Set preprocessor flag ENABLE_OCR=1  
+1. In visual studio 2022 right click <Project> and select property.
+2. In the left panel, select Configuration Properties, C/C++, Preprocessor.
+3. In the right panel, in the right-hand column of the Preprocessor Definitions property, open the drop-down menu and choose Edit.
+4. In the Preprocessor Definitions dialog box, add ENABLE_OCR=1. Choose OK to save your changes.
+
+Add library in linker
+1. Open property of project
+2. Select Configuration properties
+3. Select Linker in left panel(column)
+4. Select Input
+5. Select Additional dependencies in right panel
+6. Add libtesseract304d.lib in new line
+7. Add liblept172.lib in new line
+
+Download language data from following link  
+https://code.google.com/p/tesseract-ocr/downloads/list  
+after downloading the tesseract-ocr-3.02.eng.tar.gz extract the tar file and put  
+tessdata folder where you have kept CCExtractor executable  
+
+Copy the tesseract and leptonica dll from lib folder downloaded from above link to folder of executable or in system32.
--- a/docs/OCR.txt
+++ b/docs/OCR.txt
@@ -1,94 +0,0 @@
-
-Overview
-========
-OCR (Optical Character Recognition) is a technique used to 
-extract text from images. In the World of Subtitle, subtitle stored 
-in bitmap format are common and even necessary for converting subtitle 
-in bitmap format to subtitle in text format OCR is used.
-
-Dependency
-==========
-Tesseract (OCR library by Google)
-Leptonica (Image processing library)
-
-How to compile CCExtractor on Linux with OCR
-=============================================
-
-Download and Install Leptonnica.
-------------------------------
-This package is available, you need liblept-devel library.
-
-If Leptonica isn't available for your distribution, or you want to use a newer version
- than they offer, you can compile your own.
-
-you can download lib leptonica from  http://www.leptonica.com/download.html
-
-Download and Install Tesseract.
-------------------------------
-Tesseract is available directly from many Linux distributions. The package is generally
- called 'tesseract' or 'tesseract-ocr' - search your distribution's repositories to
- find it. Packages are also generally available for language training data (search the
- repositories,) but if not you will need to download the appropriate training data,
- unpack it, and copy the .traineddata file into the 'tessdata' directory, probably
- /usr/share/tesseract-ocr/tessdata or /usr/share/tessdata.
-
-If Tesseract isn't available for your distribution, or you want to use a newer version
- than they offer, you can compile your own.
-
-If you compile Tesseract then following command in its source code are enough
-./autogen.sh
-./configure
-make
-sudo make install
-sudo ldconfig
-
- Note: 
-1) CCExtractor is tested with Tesseract 3.04 version but it works with older versions. 
-
-you can download tesseract from https://github.com/tesseract-ocr/tesseract/archive/3.04.00.tar.gz
-you can download tesseract training data from https://github.com/tesseract-ocr/tessdata/archive/3.04.00.tar.gz
-
-
-
-Compile CCExtractor passing flags like following
-------------------------------------------------
-make ENABLE_OCR=yes
-
-
-How to compile CCExtractor on Windows with OCR
-===============================================
-
-Download prebuild library of leptonica and tesseract from following link
-https://drive.google.com/file/d/0B2ou7ZfB-2nZOTRtc3hJMHBtUFk/view?usp=sharing
-
-put the path of libs/include of leptonica and tesseract in library paths.
-step 1) In visual studio 2013 right click <Project> and select property.
-step 2) Select Configuration properties in left panel(column) of property.
-step 3) Select VC++ Directory.
-step 4) In the right pane, in the right-hand column of the VC++ Directory property,
-	open the drop-down menu and choose Edit.
-Step 5) Add path of Directory where you have kept uncompressed library of leptonica
-	and tesseract.
-
-
-Set preprocessor flag ENABLE_OCR=1
-Step 1) In visual studio 2013 right click <Project> and select property.
-Step 2) In the left panel, select Configuration Properties, C/C++, Preprocessor.
-Step 3) In the right panel, in the right-hand column of the Preprocessor Definitions property, open the drop-down menu and choose Edit.
-Step 4) In the Preprocessor Definitions dialog box, add ENABLE_OCR=1. Choose OK to save your changes.
-
-Add library in linker
-step 1) Open property of project
-Step 2) Select Configuration properties
-Step 3) Select Linker in left panel(column)
-Step 4) Select Input
-Step 5) Select Additional dependencies in right panel
-Step 6) Add libtesseract304d.lib in new line
-Step 7) Add liblept172.lib in new line
-
-Download language data from following link
-https://code.google.com/p/tesseract-ocr/downloads/list
-after downloading the tesseract-ocr-3.02.eng.tar.gz extract the tar file and put
-tessdata folder where you have kept CCExtractor executable
-
-Copy the tesseract and leptonica dll from lib folder downloaded from above link to folder of executable or in system32.
--- a/docs/README.TXT
+++ b/docs/README.TXT
@@ -1,59 +1,16 @@
-ccextractor, 0.85
-----------------
-Authors: Carlos Fernández (cfsmp3), Volker Quetschke.
-Maintainer: cfsmp3
+## CCExtractor
+check AUTHORS.TXT for history and developers

-Lots of credit goes to other people, though:
-McPoodle (author of the original SCC_RIP), Neuron2, and others (see source
-code).
-
-Home: http://www.ccextractor.org
-
-Google Summer of Code 2014 students
- Willem Van Iseghem
- Ruslan KuchumoV
- Anshul Maheshwari
-
-Google Summer of Code 2015 students
- Willem Van Iseghem
- Ruslan Kuchumov
- Anshul Maheshwari
- Nurendra Choudhary
- Oleg Kiselev
- Vasanth Kalingeri
-
-Google Summer of Code 2016 students
- Willem Van Iseghem
- Ruslan Kuchumov
- Abhishek Vinjamoori
- Abhinav Shukla
- Rishabh Garg
-
-Google Code-in 2016 students
- Evgeny Shulgin
- Manveer Basra
- Alexandru Bratosin
-(more, but they forgot to add themselves...)
-
-
-
-License
-------
+## License
 GPL 2.0. 

-Description
-----------
-ccextractor was originally a mildly optimized C port of McPoodle's excellent
-but painfully slow Perl script SCC_RIP. It lets you rip the raw closed
-captions (read: subtitles) data from a number of sources, such as DVD or
-ATSC (digital TV) streams.

-Since the original port, lots of changes have been made, such as HDTV
-support, analog captures support (via bttv cards), direct .srt/.smi
-generation, time adjusting, and more.
+## Description
+Since the original port, the whole code has been rewritten (more than once,
+one might add) and support for most subtitle formats around the world has
+been added (teletext, DVB, CEA-708, ISDB...)

-Basic Usage 
-----------
+## Basic Usage 
 (please run ccextractor with no parameters for the complete manual -
 this is for your convenience, really).

@@ -69,9 +26,16 @@ Running ccextractor without parameters shows the help screen. Usage is
 trivial - you just need to pass the input file and (optionally) some
 details about the input and output files.

+Example:

-Languages
---------
+ccextractor input_video.ts
+
+This command extracts subtitles from the input video file and generates a subtitle output file
+(such as .srt) in the same directory.
+
+
+
+## Languages
 Usually English captions are transmitted in line 21 field 1 data,
 using channel 1, so the default values are correct so you don't
 need to do anything and you don't need to understand what it all
@@ -89,20 +53,17 @@ So try adding these parameter combinations to your other parameters.

 If there are Spanish subtitles, one of them should work. 

-McPoodle's page
---------------
+## McPoodle's page
 http://www.theneitherworld.com/mcpoodle/SCC_TOOLS/DOCS/SCC_TOOLS.HTML

 Essential CC related information and free (with source) tools.

-Encoding
--------
+## Encoding
 This version, in both its Linux and Windows builds generates by
 default Unicode files. You can use -latin1 and -utf8 if you prefer 
 these encodings (usually it just depends on what your specific
 player likes).

-Future work
-----------
+## Future work
 - Please check www.ccextractor.org for news and future work.

--- a/docs/Rust_migration_guide.md
+++ b/docs/Rust_migration_guide.md
@@ -0,0 +1,71 @@
+# C to Rust Migration Guide
+
+## Porting C Functions to Rust
+
+This guide outlines the process of migrating C functions to Rust while maintaining compatibility with existing C code.
+
+### Step 1: Identify the C Function
+
+First, identify the C function you want to port. For example, let's consider a function named `net_send_cc()` in a file called `networking.c`:
+
+```c
+void net_send_cc() {
+    // Some C code
+}
+```
+
+### Step 2: Create a Pure Rust Equivalent
+
+Write an equivalent function in pure Rust within the `lib_ccxr` module:
+
+```rust
+fn net_send_cc() {
+    // Rust equivalent code to `net_send_cc` function in `networking.c`
+}
+```
+
+### Step 3: Create a C-Compatible Rust Function
+
+In the `libccxr_exports` module, create a new function that will be callable from C:
+
+```rust
+#[no_mangle]
+pub extern "C" fn ccxr_net_send_cc() {
+    net_send_cc() // Call the pure Rust function
+}
+```
+
+### Step 4: Declare the Rust Function in C
+
+In the original C file (`networking.c`), declare the Rust function as an external function:
+
+```rust
+extern void ccxr_net_send_cc();
+```
+
+### Step 5: Modify the Original C Function
+
+Update the original C function to use the Rust implementation when available:
+
+```c
+void net_send_cc() {
+    #ifndef DISABLE_RUST
+        return ccxr_net_send_cc(); // Use the Rust implementation
+    #else
+        // Original C code
+    #endif
+}
+```
+
+## Rust module system
+
+- `lib_ccxr` crate -> **The Idiomatic Rust layer**
+
+  - Path: `src/rust/lib_ccxr`
+  - This layer will contain the migrated idiomatic Rust. It will have complete documentation and tests.
+
+- `libccxr_exports` module -> **The C-like Rust layer**
+
+  - Path: `src/rust/src/libccxr_exports`
+  - This layer will have function names the same as defined in C but with the prefix `ccxr_`. These are the functions defined in the `lib_ccx` crate under appropriate modules. And these functions will be provided to the C library.
+  - Ex: `extern "C" fn ccxr_<function_name>(<args>) {}`
--- a/docs/Updating_Dependencies.txt
+++ b/docs/Updating_Dependencies.txt
@@ -0,0 +1,27 @@
+A guide to how dependencies should be updated in CCExtractor.
+
+Author: thealphadollar
+======================
+
+CCExtractor depends on multiple dependencies and they are updated from time to time. On every major revision of the dependencies, the changes need to be incorporated into our repository.
+
+It is not straightforward since we make minor (or sometimes major) changes into the library to use it and these changes are lost in case of direct file replacement. To overcome this issue, we should follow the below pathway.
+
+*) Create a duplicate copy of the CCExtractor's folder of the library, to be updated (we will be calling this folder lib(copy) in steps and original one as lib).
+*) Download the latest files of the library from official source (the folder is called as lib(orig) in further steps).
+*) Look for files with the same name in lib and lib(orig). It can be done manually in case of small libraries (libpng), otherwise a script can be written utilising the grep command to find out files from the library which we use.
+*) In lib, replace all the files (found in previous step) with their updated versions from lib(orig). A copy command can be used in the script written for the previous step to accomplish this step.
+
+Now, the files in our repository have been updated. In steps to follow, we will try to grab lost changes using lib(copy).
+
+*) Run diff command between lib(copy) and lib for all files and store the output in a text document. Here files from lib(copy) should be given as first argument to notice deletions clearly.
+*) Look for deletions in an updated file and manually inspect (or ask mentor) whether that part is to be restored or not. In most cases, it is to be restored but it's better to ask than to break.
+
+Once the changes have been restored, try to compile CCExtractor. It is very much likely that the compilation will fail. The most probably reason for this could be inclusion of unnecessary lines of code and their accompanying dependencies.
+e.g "X is not defined" can be an error when we don't include the file in which X is defined nor remove the unnecessary line using X. 
+CCExtractor doesn't use a library fully, we use only the code and files necessary. This requires manual removal of extra lines and dependencies.
+
+*) Output the compilation erros in a text document while compiling.
+*) Use inspection and comparison with lib(copy) to decide whether the line causing error is to be removed.
+
+Compile again, debug and push the change for the Continuous Integration tests on samples.
--- a/docs/VOBSUB.md
+++ b/docs/VOBSUB.md
@@ -0,0 +1,129 @@
+# VOBSUB Subtitle Extraction from MKV Files
+
+CCExtractor supports extracting VOBSUB (S_VOBSUB) subtitles from Matroska (MKV) containers. VOBSUB is an image-based subtitle format originally from DVD video.
+
+## Overview
+
+VOBSUB subtitles consist of two files:
+- `.idx` - Index file containing metadata, palette, and timestamp/position entries
+- `.sub` - Binary file containing the actual subtitle bitmap data in MPEG Program Stream format
+
+## Basic Usage
+
+```bash
+ccextractor movie.mkv
+```
+
+This will extract all VOBSUB tracks and create paired `.idx` and `.sub` files:
+- `movie_eng.idx` + `movie_eng.sub` (first English track)
+- `movie_eng_1.idx` + `movie_eng_1.sub` (second English track, if present)
+- etc.
+
+## Converting VOBSUB to SRT (Text)
+
+Since VOBSUB subtitles are images, you need OCR (Optical Character Recognition) to convert them to text-based formats like SRT.
+
+### Using subtile-ocr (Recommended)
+
+[subtile-ocr](https://github.com/gwen-lg/subtile-ocr) is an actively maintained Rust tool that provides accurate OCR conversion.
+
+#### Option 1: Docker (Easiest)
+
+We provide a Dockerfile that builds subtile-ocr with all dependencies:
+
+```bash
+# Build the Docker image (one-time)
+cd tools/vobsubocr
+docker build -t subtile-ocr .
+
+# Extract VOBSUB from MKV
+ccextractor movie.mkv
+
+# Convert to SRT using OCR
+docker run --rm -v $(pwd):/data subtile-ocr -l eng -o /data/movie_eng.srt /data/movie_eng.idx
+```
+
+#### Option 2: Install subtile-ocr Natively
+
+If you have Rust and Tesseract development libraries installed:
+
+```bash
+# Install dependencies (Ubuntu/Debian)
+sudo apt-get install libleptonica-dev libtesseract-dev tesseract-ocr tesseract-ocr-eng
+
+# Install subtile-ocr
+cargo install --git https://github.com/gwen-lg/subtile-ocr
+
+# Convert
+subtile-ocr -l eng -o movie_eng.srt movie_eng.idx
+```
+
+### subtile-ocr Options
+
+| Option | Description |
+|--------|-------------|
+| `-l, --lang <LANG>` | Tesseract language code (required). Examples: `eng`, `fra`, `deu`, `chi_sim` |
+| `-o, --output <FILE>` | Output SRT file (stdout if not specified) |
+| `-t, --threshold <0.0-1.0>` | Binarization threshold (default: 0.6) |
+| `-d, --dpi <DPI>` | Image DPI for OCR (default: 150) |
+| `--dump` | Save processed subtitle images as PNG files |
+
+### Language Codes
+
+Install additional Tesseract language packs as needed:
+
+```bash
+# Examples
+sudo apt-get install tesseract-ocr-fra  # French
+sudo apt-get install tesseract-ocr-deu  # German
+sudo apt-get install tesseract-ocr-spa  # Spanish
+sudo apt-get install tesseract-ocr-chi-sim  # Simplified Chinese
+```
+
+## Technical Details
+
+### .idx File Format
+
+The index file contains:
+1. Header with metadata (size, palette, alignment settings)
+2. Language identifier line
+3. Timestamp entries with file positions
+
+Example:
+```
+# VobSub index file, v7 (do not modify this line!)
+size: 720x576
+palette: 000000, 828282, ...
+
+id: eng, index: 0
+timestamp: 00:01:12:920, filepos: 000000000
+timestamp: 00:01:18:640, filepos: 000000800
+...
+```
+
+### .sub File Format
+
+The binary file contains MPEG Program Stream packets:
+- Each subtitle is wrapped in a PS Pack header (14 bytes) + PES header (15 bytes)
+- Subtitles are aligned to 2048-byte boundaries
+- Contains raw SPU (SubPicture Unit) bitmap data
+
+## Troubleshooting
+
+### Empty output files
+- Ensure the MKV file actually contains VOBSUB tracks (check with `mediainfo` or `ffprobe`)
+- CCExtractor will report "No VOBSUB subtitles to write" if the track is empty
+
+### OCR quality issues
+- Try adjusting the `-t` threshold parameter
+- Ensure the correct language pack is installed
+- Use `--dump` to inspect the processed images
+
+### Docker permission issues
+- The output files may be owned by root; use `sudo chown` to fix ownership
+- Or run Docker with `--user $(id -u):$(id -g)`
+
+## See Also
+
+- [OCR.md](OCR.md) - General OCR support in CCExtractor
+- [subtile-ocr GitHub](https://github.com/gwen-lg/subtile-ocr) - OCR tool documentation
--- a/docs/build-wsl.md
+++ b/docs/build-wsl.md
@@ -0,0 +1,137 @@
+# Building CCExtractor on Windows using WSL
+
+This guide explains how to build CCExtractor on Windows using WSL (Ubuntu).
+It is based on a fresh setup and includes all required dependencies and
+common build issues encountered during compilation.
+
+---
+
+## Prerequisites
+
+- Windows 10 or Windows 11
+- WSL enabled
+- Ubuntu installed via Microsoft Store
+
+---
+
+## Install WSL and Ubuntu
+
+From PowerShell (run as Administrator):
+
+```powershell
+wsl --install -d Ubuntu
+```
+
+Restart the system if prompted, then launch Ubuntu from the Start menu.
+
+---
+
+## Update system packages
+
+```bash
+sudo apt update
+```
+
+---
+
+## Install basic build tools
+
+```bash
+sudo apt install -y build-essential git pkg-config
+```
+
+---
+
+## Install Rust (required)
+
+CCExtractor includes Rust components, so Rust and Cargo are required.
+
+```bash
+curl https://sh.rustup.rs -sSf | sh
+source ~/.cargo/env
+```
+
+Verify installation:
+
+```bash
+cargo --version
+rustc --version
+```
+
+---
+
+## Install required libraries
+
+```bash
+sudo apt install -y \
+  libclang-dev clang \
+  libtesseract-dev tesseract-ocr \
+  libgpac-dev
+```
+
+---
+
+## Clone the repository
+
+```bash
+git clone https://github.com/CCExtractor/ccextractor.git
+cd ccextractor
+```
+
+---
+
+## Build CCExtractor
+
+```bash
+cd linux
+./build
+```
+
+After a successful build, verify by running:
+
+```bash
+./ccextractor
+```
+
+You should see the help/usage output.
+
+---
+
+## Common build issues
+
+### cargo: command not found
+
+```bash
+source ~/.cargo/env
+```
+
+---
+
+### Unable to find libclang
+
+```bash
+sudo apt install libclang-dev clang
+```
+
+---
+
+### gpac/isomedia.h: No such file or directory
+
+```bash
+sudo apt install libgpac-dev
+```
+
+---
+
+### please install tesseract development library
+
+```bash
+sudo apt install libtesseract-dev tesseract-ocr
+```
+
+---
+
+## Notes
+
+- Compiler warnings during the build process are expected and do not indicate failure.
+- This guide was tested on Ubuntu (WSL) running on Windows 11.
--- a/docs/ccextractor.cnf.sample
+++ b/docs/ccextractor.cnf.sample
@@ -1,7 +1,7 @@
 #######################################################
-# Version 0.01
+# Version 0.02
 #
-# To enable required option please uncommnent option
+# To enable required option please uncomment option
 #


@@ -12,12 +12,15 @@
 # 0 = file
 # 1 = stdin
 # 2 = network
+# 3 = tcp

 INPUT_SOURCE=0

 # The Buffer Input tag
 # This tag takes number in its input.

+# Is it ccx_bufferdata_type ?
+
 #BUFFER_INPUT=0

 # The Direct Rollup tag
@@ -45,22 +48,28 @@ INPUT_SOURCE=0
 #NOTYPE_SETTING=

 # The Codec Tag takes the preference of codec
-# tag CCX_CODEC_ANY is by default
+# tag CCX_CODEC_ANY by default
+# This tag takes number in its input and their meanings
+# are following
+# 0 = CCX_CODEC_ANY (default)
+# 1 = CCX_CODEC_TELETEXT
+# 2 = CCX_CODEC_DVB
+# 3 = CCX_CODEC_ISDB_CC
+# 4 = CCX_CODEC_ATSC_CC
+# 5 = CCX_CODEC_NONE
+
+#CODEC=
+
+# The NO Codec Tag uses codec specified
+# tag CCX_CODEC_NONE by default
 # This tag takes number in its input and their meanings
 # are following
 # 0 = CCX_CODEC_ANY
 # 1 = CCX_CODEC_TELETEXT
 # 2 = CCX_CODEC_DVB
-
-#CODEC=
-
-# The NO Codec Tag do not use codec specified
-# tag CCX_CODEC_NONE is by default
-# This tag takes number in its input and their meanings
-# are following
-# 1 = CCX_CODEC_TELETEXT
-# 2 = CCX_CODEC_DVB
-# 3 = CCX_CODEC_NONE
+# 3 = CCX_CODEC_ISDB_CC
+# 4 = CCX_CODEC_ATSC_CC
+# 5 = CCX_CODEC_NONE (default)

 #NOCODEC=

@@ -68,15 +77,21 @@ INPUT_SOURCE=0
 # by default output format is srt
 # This tag takes number in its input and their meanings
 # are following
-# 0 = CCX_OF_RAW
-# 1 = CCX_OF_SRT (default)
-# 2 = CCX_OF_SAMI
-# 3 = CCX_OF_TRANSCRIPT
-# 4 = CCX_OF_RCWT
-# 5 = CCX_OF_NULL
-# 6 = CCX_OF_SMPTETT
-# 7 = CCX_OF_SPUPNG
-# 8 = CCX_OF_DVDRAW
+# 0  = CCX_OF_RAW
+# 1  = CCX_OF_SRT (default)
+# 2  = CCX_OF_SAMI
+# 3  = CCX_OF_TRANSCRIPT
+# 4  = CCX_OF_RCWT
+# 5  = CCX_OF_NULL
+# 6  = CCX_OF_SMPTETT
+# 7  = CCX_OF_SPUPNG
+# 8  = CCX_OF_DVDRAW
+# 9	 = CCX_OF_WEBVTT
+# 10 = CCX_OF_SIMPLE_XML
+# 11 = CCX_OF_G608
+# 12 = CCX_OF_CURL
+# 13 = CCX_OF_SSA
+# 14 = CCX_OF_MCC

 #OUTPUT_FORMAT=

--- a/docs/freetype.TXT
+++ b/docs/freetype.TXT
@@ -0,0 +1,340 @@
+		    GNU GENERAL PUBLIC LICENSE
+		       Version 2, June 1991
+
+ Copyright (C) 1989, 1991 Free Software Foundation, Inc.
+     51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
+ Everyone is permitted to copy and distribute verbatim copies
+ of this license document, but changing it is not allowed.
+
+			    Preamble
+
+  The licenses for most software are designed to take away your
+freedom to share and change it.  By contrast, the GNU General Public
+License is intended to guarantee your freedom to share and change free
+software--to make sure the software is free for all its users.  This
+General Public License applies to most of the Free Software
+Foundation's software and to any other program whose authors commit to
+using it.  (Some other Free Software Foundation software is covered by
+the GNU Library General Public License instead.)  You can apply it to
+your programs, too.
+
+  When we speak of free software, we are referring to freedom, not
+price.  Our General Public Licenses are designed to make sure that you
+have the freedom to distribute copies of free software (and charge for
+this service if you wish), that you receive source code or can get it
+if you want it, that you can change the software or use pieces of it
+in new free programs; and that you know you can do these things.
+
+  To protect your rights, we need to make restrictions that forbid
+anyone to deny you these rights or to ask you to surrender the rights.
+These restrictions translate to certain responsibilities for you if you
+distribute copies of the software, or if you modify it.
+
+  For example, if you distribute copies of such a program, whether
+gratis or for a fee, you must give the recipients all the rights that
+you have.  You must make sure that they, too, receive or can get the
+source code.  And you must show them these terms so they know their
+rights.
+
+  We protect your rights with two steps: (1) copyright the software, and
+(2) offer you this license which gives you legal permission to copy,
+distribute and/or modify the software.
+
+  Also, for each author's protection and ours, we want to make certain
+that everyone understands that there is no warranty for this free
+software.  If the software is modified by someone else and passed on, we
+want its recipients to know that what they have is not the original, so
+that any problems introduced by others will not reflect on the original
+authors' reputations.
+
+  Finally, any free program is threatened constantly by software
+patents.  We wish to avoid the danger that redistributors of a free
+program will individually obtain patent licenses, in effect making the
+program proprietary.  To prevent this, we have made it clear that any
+patent must be licensed for everyone's free use or not licensed at all.
+
+  The precise terms and conditions for copying, distribution and
+modification follow.
+
+		    GNU GENERAL PUBLIC LICENSE
+   TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION
+
+  0. This License applies to any program or other work which contains
+a notice placed by the copyright holder saying it may be distributed
+under the terms of this General Public License.  The "Program", below,
+refers to any such program or work, and a "work based on the Program"
+means either the Program or any derivative work under copyright law:
+that is to say, a work containing the Program or a portion of it,
+either verbatim or with modifications and/or translated into another
+language.  (Hereinafter, translation is included without limitation in
+the term "modification".)  Each licensee is addressed as "you".
+
+Activities other than copying, distribution and modification are not
+covered by this License; they are outside its scope.  The act of
+running the Program is not restricted, and the output from the Program
+is covered only if its contents constitute a work based on the
+Program (independent of having been made by running the Program).
+Whether that is true depends on what the Program does.
+
+  1. You may copy and distribute verbatim copies of the Program's
+source code as you receive it, in any medium, provided that you
+conspicuously and appropriately publish on each copy an appropriate
+copyright notice and disclaimer of warranty; keep intact all the
+notices that refer to this License and to the absence of any warranty;
+and give any other recipients of the Program a copy of this License
+along with the Program.
+
+You may charge a fee for the physical act of transferring a copy, and
+you may at your option offer warranty protection in exchange for a fee.
+
+  2. You may modify your copy or copies of the Program or any portion
+of it, thus forming a work based on the Program, and copy and
+distribute such modifications or work under the terms of Section 1
+above, provided that you also meet all of these conditions:
+
+    a) You must cause the modified files to carry prominent notices
+    stating that you changed the files and the date of any change.
+
+    b) You must cause any work that you distribute or publish, that in
+    whole or in part contains or is derived from the Program or any
+    part thereof, to be licensed as a whole at no charge to all third
+    parties under the terms of this License.
+
+    c) If the modified program normally reads commands interactively
+    when run, you must cause it, when started running for such
+    interactive use in the most ordinary way, to print or display an
+    announcement including an appropriate copyright notice and a
+    notice that there is no warranty (or else, saying that you provide
+    a warranty) and that users may redistribute the program under
+    these conditions, and telling the user how to view a copy of this
+    License.  (Exception: if the Program itself is interactive but
+    does not normally print such an announcement, your work based on
+    the Program is not required to print an announcement.)
+
+These requirements apply to the modified work as a whole.  If
+identifiable sections of that work are not derived from the Program,
+and can be reasonably considered independent and separate works in
+themselves, then this License, and its terms, do not apply to those
+sections when you distribute them as separate works.  But when you
+distribute the same sections as part of a whole which is a work based
+on the Program, the distribution of the whole must be on the terms of
+this License, whose permissions for other licensees extend to the
+entire whole, and thus to each and every part regardless of who wrote it.
+
+Thus, it is not the intent of this section to claim rights or contest
+your rights to work written entirely by you; rather, the intent is to
+exercise the right to control the distribution of derivative or
+collective works based on the Program.
+
+In addition, mere aggregation of another work not based on the Program
+with the Program (or with a work based on the Program) on a volume of
+a storage or distribution medium does not bring the other work under
+the scope of this License.
+
+  3. You may copy and distribute the Program (or a work based on it,
+under Section 2) in object code or executable form under the terms of
+Sections 1 and 2 above provided that you also do one of the following:
+
+    a) Accompany it with the complete corresponding machine-readable
+    source code, which must be distributed under the terms of Sections
+    1 and 2 above on a medium customarily used for software interchange; or,
+
+    b) Accompany it with a written offer, valid for at least three
+    years, to give any third party, for a charge no more than your
+    cost of physically performing source distribution, a complete
+    machine-readable copy of the corresponding source code, to be
+    distributed under the terms of Sections 1 and 2 above on a medium
+    customarily used for software interchange; or,
+
+    c) Accompany it with the information you received as to the offer
+    to distribute corresponding source code.  (This alternative is
+    allowed only for noncommercial distribution and only if you
+    received the program in object code or executable form with such
+    an offer, in accord with Subsection b above.)
+
+The source code for a work means the preferred form of the work for
+making modifications to it.  For an executable work, complete source
+code means all the source code for all modules it contains, plus any
+associated interface definition files, plus the scripts used to
+control compilation and installation of the executable.  However, as a
+special exception, the source code distributed need not include
+anything that is normally distributed (in either source or binary
+form) with the major components (compiler, kernel, and so on) of the
+operating system on which the executable runs, unless that component
+itself accompanies the executable.
+
+If distribution of executable or object code is made by offering
+access to copy from a designated place, then offering equivalent
+access to copy the source code from the same place counts as
+distribution of the source code, even though third parties are not
+compelled to copy the source along with the object code.
+
+  4. You may not copy, modify, sublicense, or distribute the Program
+except as expressly provided under this License.  Any attempt
+otherwise to copy, modify, sublicense or distribute the Program is
+void, and will automatically terminate your rights under this License.
+However, parties who have received copies, or rights, from you under
+this License will not have their licenses terminated so long as such
+parties remain in full compliance.
+
+  5. You are not required to accept this License, since you have not
+signed it.  However, nothing else grants you permission to modify or
+distribute the Program or its derivative works.  These actions are
+prohibited by law if you do not accept this License.  Therefore, by
+modifying or distributing the Program (or any work based on the
+Program), you indicate your acceptance of this License to do so, and
+all its terms and conditions for copying, distributing or modifying
+the Program or works based on it.
+
+  6. Each time you redistribute the Program (or any work based on the
+Program), the recipient automatically receives a license from the
+original licensor to copy, distribute or modify the Program subject to
+these terms and conditions.  You may not impose any further
+restrictions on the recipients' exercise of the rights granted herein.
+You are not responsible for enforcing compliance by third parties to
+this License.
+
+  7. If, as a consequence of a court judgment or allegation of patent
+infringement or for any other reason (not limited to patent issues),
+conditions are imposed on you (whether by court order, agreement or
+otherwise) that contradict the conditions of this License, they do not
+excuse you from the conditions of this License.  If you cannot
+distribute so as to satisfy simultaneously your obligations under this
+License and any other pertinent obligations, then as a consequence you
+may not distribute the Program at all.  For example, if a patent
+license would not permit royalty-free redistribution of the Program by
+all those who receive copies directly or indirectly through you, then
+the only way you could satisfy both it and this License would be to
+refrain entirely from distribution of the Program.
+
+If any portion of this section is held invalid or unenforceable under
+any particular circumstance, the balance of the section is intended to
+apply and the section as a whole is intended to apply in other
+circumstances.
+
+It is not the purpose of this section to induce you to infringe any
+patents or other property right claims or to contest validity of any
+such claims; this section has the sole purpose of protecting the
+integrity of the free software distribution system, which is
+implemented by public license practices.  Many people have made
+generous contributions to the wide range of software distributed
+through that system in reliance on consistent application of that
+system; it is up to the author/donor to decide if he or she is willing
+to distribute software through any other system and a licensee cannot
+impose that choice.
+
+This section is intended to make thoroughly clear what is believed to
+be a consequence of the rest of this License.
+
+  8. If the distribution and/or use of the Program is restricted in
+certain countries either by patents or by copyrighted interfaces, the
+original copyright holder who places the Program under this License
+may add an explicit geographical distribution limitation excluding
+those countries, so that distribution is permitted only in or among
+countries not thus excluded.  In such case, this License incorporates
+the limitation as if written in the body of this License.
+
+  9. The Free Software Foundation may publish revised and/or new versions
+of the General Public License from time to time.  Such new versions will
+be similar in spirit to the present version, but may differ in detail to
+address new problems or concerns.
+
+Each version is given a distinguishing version number.  If the Program
+specifies a version number of this License which applies to it and "any
+later version", you have the option of following the terms and conditions
+either of that version or of any later version published by the Free
+Software Foundation.  If the Program does not specify a version number of
+this License, you may choose any version ever published by the Free Software
+Foundation.
+
+  10. If you wish to incorporate parts of the Program into other free
+programs whose distribution conditions are different, write to the author
+to ask for permission.  For software which is copyrighted by the Free
+Software Foundation, write to the Free Software Foundation; we sometimes
+make exceptions for this.  Our decision will be guided by the two goals
+of preserving the free status of all derivatives of our free software and
+of promoting the sharing and reuse of software generally.
+
+			    NO WARRANTY
+
+  11. BECAUSE THE PROGRAM IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY
+FOR THE PROGRAM, TO THE EXTENT PERMITTED BY APPLICABLE LAW.  EXCEPT WHEN
+OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES
+PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED
+OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
+MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.  THE ENTIRE RISK AS
+TO THE QUALITY AND PERFORMANCE OF THE PROGRAM IS WITH YOU.  SHOULD THE
+PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING,
+REPAIR OR CORRECTION.
+
+  12. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING
+WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR
+REDISTRIBUTE THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES,
+INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING
+OUT OF THE USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED
+TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY
+YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER
+PROGRAMS), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE
+POSSIBILITY OF SUCH DAMAGES.
+
+		     END OF TERMS AND CONDITIONS
+
+	    How to Apply These Terms to Your New Programs
+
+  If you develop a new program, and you want it to be of the greatest
+possible use to the public, the best way to achieve this is to make it
+free software which everyone can redistribute and change under these terms.
+
+  To do so, attach the following notices to the program.  It is safest
+to attach them to the start of each source file to most effectively
+convey the exclusion of warranty; and each file should have at least
+the "copyright" line and a pointer to where the full notice is found.
+
+    <one line to give the program's name and a brief idea of what it does.>
+    Copyright (C) <year>  <name of author>
+
+    This program is free software; you can redistribute it and/or modify
+    it under the terms of the GNU General Public License as published by
+    the Free Software Foundation; either version 2 of the License, or
+    (at your option) any later version.
+
+    This program is distributed in the hope that it will be useful,
+    but WITHOUT ANY WARRANTY; without even the implied warranty of
+    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+    GNU General Public License for more details.
+
+    You should have received a copy of the GNU General Public License
+    along with this program; if not, write to the Free Software
+    Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
+
+
+Also add information on how to contact you by electronic and paper mail.
+
+If the program is interactive, make it output a short notice like this
+when it starts in an interactive mode:
+
+    Gnomovision version 69, Copyright (C) year  name of author
+    Gnomovision comes with ABSOLUTELY NO WARRANTY; for details type `show w'.
+    This is free software, and you are welcome to redistribute it
+    under certain conditions; type `show c' for details.
+
+The hypothetical commands `show w' and `show c' should show the appropriate
+parts of the General Public License.  Of course, the commands you use may
+be called something other than `show w' and `show c'; they could even be
+mouse-clicks or menu items--whatever suits your program.
+
+You should also get your employer (if you work as a programmer) or your
+school, if any, to sign a "copyright disclaimer" for the program, if
+necessary.  Here is a sample; alter the names:
+
+  Yoyodyne, Inc., hereby disclaims all copyright interest in the program
+  `Gnomovision' (which makes passes at compilers) written by James Hacker.
+
+  <signature of Ty Coon>, 1 April 1989
+  Ty Coon, President of Vice
+
+This General Public License does not permit incorporating your program into
+proprietary programs.  If your program is a subroutine library, you may
+consider it more useful to permit linking proprietary applications with the
+library.  If this is what you want to do, use the GNU Library General
+Public License instead of this License.
--- a/docs/img/Binaries.png
+++ b/docs/img/Binaries.png
--- a/docs/img/Building.png
+++ b/docs/img/Building.png
--- a/docs/img/ProjectSection.png
+++ b/docs/img/ProjectSection.png
--- a/docs/img/Properties.png
+++ b/docs/img/Properties.png
--- a/docs/img/Warning.png
+++ b/docs/img/Warning.png
--- a/docs/img/projectFiles.png
+++ b/docs/img/projectFiles.png
--- a/docs/raspberrypi.md
+++ b/docs/raspberrypi.md
--- a/docs/using_cmake_build.txt
+++ b/docs/using_cmake_build.txt
@@ -11,8 +11,10 @@ Step 2) create a separate directory where you want to build the target.
 	~> cd ccextractor
 	~> mkdir build

-Step 3) make the build system using cmake
-	~> cmake ../src/
+Step 3) make the build system using cmake. Params in [] are optional and have
+been explained later in the document.
+	~> cmake [-DWITH_FFMPEG=ON] [-DWITH_OCR=ON]
+    [-DWITH_HARDSUBX=ON] ../src/

 Step 4) Compile the code.
 	~> make
@@ -27,8 +29,8 @@ cmake -DWITH_FFMPEG=ON ../src/
 If you want to build CCExtractor with OCR you need to pass
 cmake -DWITH_OCR=ON ../src/

-If you want to build CCExtractor with Sharing and Translating service:
-cmake -DWITH_SHARING ../src/
+If you want to build CCExtractor with HARDSUBX support
+cmake -DWITH_HARDSUBX=ON ../src/

 Hint for looking all the things you want to set from outside
 cmake -LAH ../src/
--- a/fonts/Cousine-Regular.ttf
+++ b/fonts/Cousine-Regular.ttf
--- a/fonts/DroidSans.ttf
+++ b/fonts/DroidSans.ttf
--- a/fonts/Karla-Regular.ttf
+++ b/fonts/Karla-Regular.ttf
--- a/fonts/ProggyClean.ttf
+++ b/fonts/ProggyClean.ttf
--- a/fonts/ProggyTiny.ttf
+++ b/fonts/ProggyTiny.ttf
--- a/fonts/Raleway-Bold.ttf
+++ b/fonts/Raleway-Bold.ttf
--- a/fonts/Roboto-Bold.ttf
+++ b/fonts/Roboto-Bold.ttf
--- a/fonts/Roboto-Light.ttf
+++ b/fonts/Roboto-Light.ttf
--- a/fonts/Roboto-Regular.ttf
+++ b/fonts/Roboto-Regular.ttf
--- a/fonts/kenvector_future.ttf
+++ b/fonts/kenvector_future.ttf
--- a/fonts/kenvector_future_thin.ttf
+++ b/fonts/kenvector_future_thin.ttf
--- a/icon/computer.png
+++ b/icon/computer.png
--- a/icon/default.png
+++ b/icon/default.png
--- a/icon/desktop.png
+++ b/icon/desktop.png
--- a/icon/directory.png
+++ b/icon/directory.png
--- a/icon/drive.png
+++ b/icon/drive.png
--- a/icon/font.png
+++ b/icon/font.png
--- a/icon/home.png
+++ b/icon/home.png
--- a/icon/img.png
+++ b/icon/img.png
--- a/icon/movie.png
+++ b/icon/movie.png
--- a/icon/music.png
+++ b/icon/music.png
--- a/icon/text.png
+++ b/icon/text.png
--- a/linux/.gitignore
+++ b/linux/.gitignore
@@ -0,0 +1,2 @@
+libccx_rust.a
+rust
--- a/Show More
+++ b/Show More