mirror of
https://github.com/xoofx/markdig.git
synced 2026-02-11 21:37:17 +00:00
Converting emphasis with angled quotation marks #541
Originally created by @alexeyfv on GitHub (Jun 19, 2022).
Originally assigned to: @MihaZupan on GitHub.
Hi,
I'm trying to convert a document which contains the string `"«_word_»"`. The parser does not recognize it as emphasis.

But `"_«word»_"` is converted correctly.

I'm using Markdig 0.30.2. Is it a bug? If so, is there a workaround for the issue? Thanks.
@xoofx commented on GitHub (Jun 19, 2022):
Oh, interesting... you might have hit a specific case of the spec, as there is a split between the results of the different CommonMark parsers here.

So the spec about emphasis is here, and I would think that it is not a bug as per the rule:

I haven't checked, but it is highly likely that the characters `«` and `»` are Unicode punctuation characters.

cc: @MihaZupan thoughts?
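The category guess can be checked quickly. Below is a minimal sketch in Python (not Markdig code) using the standard `unicodedata` module. The helper name `is_unicode_punctuation` is hypothetical, and it assumes the CommonMark 0.30 definition of a Unicode punctuation character: an ASCII punctuation character, or anything in the general categories Pc, Pd, Pe, Pf, Pi, Po, or Ps.

```python
import string
import unicodedata

# General categories that CommonMark 0.30 counts as Unicode punctuation
# (an assumption of this sketch; other spec versions may differ).
PUNCTUATION_CATEGORIES = {"Pc", "Pd", "Pe", "Pf", "Pi", "Po", "Ps"}


def is_unicode_punctuation(ch: str) -> bool:
    """Hypothetical helper: ASCII punctuation or a character in a P* category."""
    return ch in string.punctuation or unicodedata.category(ch) in PUNCTUATION_CATEGORIES


for ch in "«»":
    # Print the code point, its general category, and the classification.
    print(f"U+{ord(ch):04X} {unicodedata.category(ch)} {is_unicode_punctuation(ch)}")
```

Both characters are reported as punctuation (`Pi` for `«`, `Pf` for `»`), so a spec-conforming parser should treat them that way when applying the emphasis rules.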
@MihaZupan commented on GitHub (Jul 17, 2022):
This is a bug: our `CheckUnicodeCategory` helper does not match what CommonMark defines as Unicode whitespace and Unicode punctuation.

Specifically, we are off in the 128-255 range (where « and » are) and with our Unicode space categories.

`IsWhitespace` also doesn't match the spec right now.
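To see why a misclassification in the 128-255 range breaks only `«_word_»`, here is a rough Python sketch of the CommonMark flanking rules for a single `_` delimiter. It is not Markdig's implementation. All function names are hypothetical, and `ascii_only_punct` is merely an assumed stand-in for a classifier that misses non-ASCII punctuation.

```python
import string
import unicodedata
from typing import Callable, Tuple

PUNCTUATION_CATEGORIES = {"Pc", "Pd", "Pe", "Pf", "Pi", "Po", "Ps"}


def is_ws(ch: str) -> bool:
    # "" marks start/end of line, which the spec treats as whitespace.
    return ch == "" or ch in "\t\n\f\r" or unicodedata.category(ch) == "Zs"


def spec_punct(ch: str) -> bool:
    # CommonMark 0.30 definition (assumed): ASCII punctuation or a P* category.
    return ch != "" and (
        ch in string.punctuation
        or unicodedata.category(ch) in PUNCTUATION_CATEGORIES
    )


def ascii_only_punct(ch: str) -> bool:
    # Assumed stand-in for a classifier that misses non-ASCII punctuation.
    return ch != "" and ch in string.punctuation


def underscore_flags(
    text: str, i: int, is_punct: Callable[[str], bool]
) -> Tuple[bool, bool]:
    """Return (can_open, can_close) for a lone '_' at text[i]."""
    before = text[i - 1] if i > 0 else ""
    after = text[i + 1] if i + 1 < len(text) else ""
    left = not is_ws(after) and (
        not is_punct(after) or is_ws(before) or is_punct(before)
    )
    right = not is_ws(before) and (
        not is_punct(before) or is_ws(after) or is_punct(after)
    )
    can_open = left and (not right or is_punct(before))
    can_close = right and (not left or is_punct(after))
    return can_open, can_close


for sample in ("«_word_»", "_«word»_"):
    first, last = sample.index("_"), sample.rindex("_")
    for name, classify in (("spec", spec_punct), ("ascii-only", ascii_only_punct)):
        opener = underscore_flags(sample, first, classify)[0]
        closer = underscore_flags(sample, last, classify)[1]
        print(f"{ascii(sample)} {name}: opens={opener} closes={closer}")
```

Under these assumptions, the spec classifier lets both underscores in `«_word_»` open and close emphasis. The ASCII-only classifier sees each underscore as both left- and right-flanking, so neither can open or close. `_«word»_` works either way because the start and end of the line count as whitespace, which is consistent with the behavior reported in this issue.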