Use UAX#29 word boundary rules for selections by default #18899

Open
opened 2026-01-31 06:27:39 +00:00 by claunia · 0 comments

Originally created by @lhecker on GitHub (Nov 15, 2022).

Description of the new feature/enhancement

Source: https://github.com/microsoft/terminal/issues/14374#issuecomment-1315297524

[...] I think I just now realized how "crazy" those ASCII word delimiters are in an international setting. For instance this:

ねこはかわいい。

It consists of 3 words and a delimiter (ねこ, は, かわいい, 。) and your browser probably handles this correctly, whereas terminals are perpetually stuck in the "What do you mean there are people who don't speak English?" mindset. conhost's whitespace-only splitting doesn't feel any better in that regard. I feel like terminals are in dire need of some UAX #29, Section 4.

Proposed technical implementation details (optional)

* Use this by default: https://unicode.org/reports/tr29/#Word_Boundaries
* Use character-set splitting, if the user has configured `wordDelimiters`
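
The proposal above could be sketched roughly as follows. This is only an illustration in Python, not the terminal's actual code: `word_at`, `delimiter_boundaries` and the `uax29` parameter are hypothetical names, the delimiter logic is simplified to two character classes, and the UAX #29 segmenter is passed in as a callable because the Python standard library does not ship one (a real implementation would presumably get the boundaries from a library such as ICU).

```python
from typing import Callable, List, Optional, Sequence, Tuple

# A segmenter maps a line of text to sorted boundary offsets,
# including 0 and len(text). Hypothetical interface for this sketch.
Segmenter = Callable[[str], Sequence[int]]


def delimiter_boundaries(text: str, delimiters: str) -> List[int]:
    """Character-set splitting: place a boundary wherever adjacent
    characters switch between delimiter and non-delimiter."""
    bounds = [0]
    for i in range(1, len(text)):
        if (text[i - 1] in delimiters) != (text[i] in delimiters):
            bounds.append(i)
    if text:
        bounds.append(len(text))
    return bounds


def word_at(
    text: str,
    index: int,
    word_delimiters: Optional[str],
    uax29: Segmenter,
) -> Tuple[int, int]:
    """Return the (start, end) span to select for a click at `index`.

    If the user configured `word_delimiters`, use character-set
    splitting; otherwise fall back to the UAX #29 segmenter.
    """
    if word_delimiters:
        bounds = delimiter_boundaries(text, word_delimiters)
    else:
        bounds = uax29(text)
    for start, end in zip(bounds, bounds[1:]):
        if start <= index < end:
            return start, end
    raise IndexError("index is outside the text")


# Stand-in for a real UAX #29 segmenter: the boundaries are hard-coded
# from the segmentation given in the issue (ねこ, は, かわいい, 。).
def stub_uax29(text: str) -> Sequence[int]:
    return [0, 2, 3, 7, 8]


start, end = word_at("ねこはかわいい。", 4, None, stub_uax29)
print("ねこはかわいい。"[start:end])  # prints かわいい with the stub above
```

With `word_delimiters=" /"`, the same function selects `usr` for a click inside `cd /usr/bin`, so users who have configured `wordDelimiters` would keep their existing behavior.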

Reference: starred/terminal#18899