when comparing text with rare Unicode characters.
While running some tests of my own with a few of them,
I found that TC seems to fail when comparing them across different encodings:
Code:
𤽜𤹜𤵜𤱜
Take one or all four characters and write them to a file in, e.g., UTF-16
(just these characters, no additional ASCII or other characters).
Now take the same character sequence and encode it to a UTF-8 file.
Do a Compare by Content on both files in Text mode and:
TC highlights the line(s) as different, although they aren't!
It even fails when comparing BOM-less UTF-16 with UTF-16 that includes a BOM,
i.e. the files differ only in the missing BOM (bytes FF FE) at the beginning.
(Of course, I have to set the encoding of the BOM-less file manually; otherwise it is treated as binary.)
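
For anyone who wants to reproduce this without an editor that can save all three encodings, here is a rough Python sketch that builds the test files. It is not part of my original test, just one possible way to generate them. Assumptions: UTF-16 means little-endian here (matching the FF FE BOM), and the file names and the helper names build_encodings, decode_all and write_test_files are made up for illustration.

```python
import codecs
import os

# Sample characters from the post (all four; one alone works as well).
TEXT = "𤽜𤹜𤵜𤱜"


def build_encodings(text):
    """Return the three byte sequences to compare (file names are hypothetical)."""
    utf16_le = text.encode("utf-16-le")  # BOM-less; little-endian is an assumption
    return {
        "utf8.txt": text.encode("utf-8"),
        "utf16_bom.txt": codecs.BOM_UTF16_LE + utf16_le,  # FF FE + payload
        "utf16_nobom.txt": utf16_le,
    }


def decode_all(files):
    """Decode each byte sequence back to text with the matching codec."""
    return {
        "utf8.txt": files["utf8.txt"].decode("utf-8"),
        # The generic "utf-16" codec reads the BOM and strips it.
        "utf16_bom.txt": files["utf16_bom.txt"].decode("utf-16"),
        # Without a BOM the byte order has to be given explicitly.
        "utf16_nobom.txt": files["utf16_nobom.txt"].decode("utf-16-le"),
    }


def write_test_files(directory):
    """Write the three files into an existing directory for use in TC."""
    for name, data in build_encodings(TEXT).items():
        with open(os.path.join(directory, name), "wb") as fh:
            fh.write(data)
```

Calling write_test_files(".") drops the three files into the current directory. decode_all(build_encodings(TEXT)) gives the same string for all three, so a text-mode compare that decodes each file correctly should not report any difference between them.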
(Tested on TC 8.01 32-bit, Win 7 64-bit;
older versions and TC64 not tested so far.)