Advertisement:

Author Topic: Enhance name duplication detection to account for homoglyphs  (Read 3086 times)

Online shawnb61

  • Developer
  • Sophist Member
  • *
  • Posts: 1,341
    • sbulen on GitHub
Enhance name duplication detection to account for homoglyphs
« on: December 09, 2018, 06:05:29 PM »
SMF's existing name duplication checks can be thwarted using homoglyphs.

For example, these two names are distinct & would both be allowed in SMF: "Mіau!" and "Miau!".

More discussion here:
https://www.simplemachines.org/community/index.php?topic=563837.0

Definition & examples of homoglyphs here:
https://en.wikipedia.org/wiki/Homoglyph
Address the process rather than the outcome.  Then, the outcome becomes more likely.   - Fripp

Offline Arantor

  • Resident Overthinker
  • SMF Friend
  • SMF Legend
  • *
  • Posts: 71,403
    • StoryBB/StoryBB on GitHub
Re: Enhance name duplication detection to account for homoglyphs
« Reply #1 on: December 09, 2018, 06:21:14 PM »
The problem is how large the list is. How far down that list do you go?
Don’t try to tell me that some power can corrupt a person. You haven’t had enough to know what it’s like.

No good deed goes unpunished / No act of charity goes unresented.

Online shawnb61

  • Developer
  • Sophist Member
  • *
  • Posts: 1,341
    • sbulen on GitHub
Re: Enhance name duplication detection to account for homoglyphs
« Reply #2 on: December 09, 2018, 06:44:27 PM »
Yep.  Still, worthy to have that discussion & consider the enhancement. 

I think with more sites using utf8 (esp. 2.1) this warrants consideration.
Address the process rather than the outcome.  Then, the outcome becomes more likely.   - Fripp

Offline Kindred

  • The Mean One
  • Support Specialist
  • SMF Legend
  • *
  • Posts: 58,433
  • Gender: Male
    • Kindred-999 on GitHub
Re: Enhance name duplication detection to account for homoglyphs
« Reply #3 on: December 09, 2018, 09:08:47 PM »
Personally, I don’t think that this is a large enough issue to waste development time doing it...  seriously, I have seen this sort of issue reported exactly twice in over a decade.
Please do not PM, IM or Email me with support questions.  You will get better and faster responses in the support boards.  Thank you.

Offline Sesquipedalian

  • The Mad Doctor
  • Lead Developer
  • Sr. Member
  • *
  • Posts: 947
  • Gender: Male
  • It works! ... in theory.
    • Sesquipedalian on GitHub
Re: Enhance name duplication detection to account for homoglyphs
« Reply #4 on: December 18, 2018, 12:31:42 PM »
It might be possible to create a mod to do this relatively reliably and without killing the server, but it won't become a standard feature of SMF any time soon.

Such a mod could use an approach similar to the one I used in 2.1's set_tld_regex() function, by periodically downloading and processing this official file to build an array of substitutions to normalize confusable characters and strings. This array would need to be stored in the database somewhere, and one would want to add a column to the members table that recorded each member's normalized username to use for comparisons.
I promise you nothing.

Sesqu... Sesqui... what?
Sesquipedalian, the best word in the English language.