Improve built-in dictionary code #25

Open
opened 2026-01-29 16:33:12 +00:00 by claunia · 0 comments
Owner

Originally created by @cfsmp3 on GitHub (May 28, 2014).

Originally assigned to: @anshul1912 on GitHub.

CCExtractor has a small (maybe 20 words) dictionary that is used to correct capitalization.

Because the dictionary is so small the implementation that uses is to do the correction is extremely trivial, and also time consuming (basically it checks against each word in the dictionary rather than do a binary search).

We didn't care about this until someone sent a 11,000 word dictionary for us to use :-)

The function is in 608_helpers.
void correct_case (int line_num, struct eia608_screen *data)

Job: Implement sort and binary search so we can use that dictionary efficiently.

Originally created by @cfsmp3 on GitHub (May 28, 2014). Originally assigned to: @anshul1912 on GitHub. CCExtractor has a small (maybe 20 words) dictionary that is used to correct capitalization. Because the dictionary is so small the implementation that uses is to do the correction is extremely trivial, and also time consuming (basically it checks against each word in the dictionary rather than do a binary search). We didn't care about this until someone sent a 11,000 word dictionary for us to use :-) The function is in 608_helpers. void correct_case (int line_num, struct eia608_screen *data) Job: Implement sort and binary search so we can use that dictionary efficiently.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: starred/ccextractor#25