What’s up with these Unicode combining characters and how can we filter them?
What’s up with these Unicode characters? Each one is a base character followed by a series of combining characters. Because the combining characters in question are placed above the base character, they stack up (literally). For instance, ก้้้้้้้้้้้้้้้้้้้้ is a ก (Thai character ko kai, U+0E01) followed by 20 copies of the Thai combining character mai tho (U+0E49). How …
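One possible way to filter such stacks is to cap how many combining marks may follow a single base character. The sketch below is only an illustration, not necessarily the approach the full post takes. The function name `limit_combining_marks` and the default cap of 2 are assumptions made for this example, and it treats any character in Unicode general category Mn (nonspacing mark) or Me (enclosing mark) as a combining mark.

```python
import unicodedata


def limit_combining_marks(text: str, max_marks: int = 2) -> str:
    """Keep at most `max_marks` consecutive combining marks.

    Hypothetical helper for illustration: "combining mark" here means
    general category Mn or Me, and the cap of 2 is an arbitrary choice.
    """
    kept = []
    run = 0  # combining marks seen since the last non-mark character
    for ch in text:
        if unicodedata.category(ch) in ("Mn", "Me"):
            run += 1
            if run > max_marks:
                continue  # drop marks beyond the cap
        else:
            run = 0  # any non-mark character starts a new run
        kept.append(ch)
    return "".join(kept)


# The example from the text: ko kai (U+0E01) plus 20 copies of mai tho (U+0E49).
stacked = "\u0e01" + "\u0e49" * 20
cleaned = limit_combining_marks(stacked)
print(len(stacked), len(cleaned))  # 21 3
```

A cap rather than outright removal is used here because legitimate text, Thai included, relies on a small number of combining marks per base character; stripping all of them would damage ordinary words.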