Translate unicode back into standard readable ascii
207f is a utility file you can import into your projects to translate unicode characters back into standard ascii. A common technique used to bypass filters and moderation tools is replacing characters with unicode that looks simial to the original ascii character. This library reverses that effect so it can be checked normally.
Normal ASCII
A bad word
Unicode Characters
А bаԁ ԝоrԁ
Looks identical right?
Inspect some text and see if the characters are made up from any unicode
Code | Character | Unicode? | Name |
---|---|---|---|
410 | А | Yes | CYRILLIC CAPITAL LETTER A |
20 | No | SPACE | |
62 | b | No | LATIN SMALL LETTER B |
430 | а | Yes | CYRILLIC SMALL LETTER A |
501 | ԁ | Yes | CYRILLIC SMALL LETTER KOMI DE |
51d | ԝ | Yes | CYRILLIC SMALL LETTER WE |
43e | о | Yes | CYRILLIC SMALL LETTER O |
72 | r | No | LATIN SMALL LETTER R |