Unicode double quotation marks get replaced

Started by MegaBrutal, Yesterday at 07:45:00 PM

Previous topic - Next topic

MegaBrutal

Not sure if this is a bug per se, or a feature I could disable, but I recently upgraded my forum to SMF 2.1.4, and noticed that the Unicode double quotation marks I post get replaced with other, similarly looking characters.

Specifically, U+201E DOUBLE LOW-9 QUOTATION MARK gets replaced with the character sequence: ,, (2 ASCII comma characters, U+002C). And U+201D RIGHT DOUBLE QUOTATION MARK gets replaced with " (ASCII quotation mark, U+0022).

I'd prefer to preserve the original characters, not make arbitrary character replacements, as these replacements look ugly, I also don't know what other characters might get replaced by surprise.

shawnb61

,,I'm not seeing this..."

If I look at the hex values in the DB, they're correct, e2809d & e2809e.

However...  If I copy & paste from elsewhere, you are correct, it happens...  It seems to be dependent where & how you're copying & pasting from...
A question worth asking is born in experience & driven by necessity. - Fripp

MegaBrutal

Yes, I also noticed that the source text remains intact in the DB, because when I quote or modify the post, I get the original characters back. The conversion happens when the post is rendered for viewing.

Here I try to post the original characters: ,,".

Advertisement: