[PR #651] Improve SBS #1501

Closed
opened 2026-01-29 17:16:47 +00:00 by claunia · 0 comments
Owner

Original Pull Request: https://github.com/CCExtractor/ccextractor/pull/651

State: closed
Merged: Yes


In this PR:

  • use Levenshtein distance for joining subs into sentences (allow to fix errors)
  • SBS tests are moved to separated files
  • SBS buffer removed from encoder_context, SBS uses its own buffers and structures

The code is still dirty, but it already works good enough

Important notice

SBS is color-blind, so <font> tags cause errors in resulting output (</font> will be placed in next sub). Please, run ccextractor -sbs only with -nodvbcolor

**Original Pull Request:** https://github.com/CCExtractor/ccextractor/pull/651 **State:** closed **Merged:** Yes --- In this PR: * use Levenshtein distance for joining subs into sentences (allow to fix errors) * SBS tests are moved to separated files * SBS buffer removed from `encoder_context`, SBS uses its own buffers and structures The code is still dirty, but it already works good enough **Important notice** SBS is color-blind, so `<font>` tags cause errors in resulting output (`</font>` will be placed in **next** sub). Please, run `ccextractor -sbs` only with `-nodvbcolor`
claunia added the pull-request label 2026-01-29 17:16:47 +00:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: starred/ccextractor#1501