This bug can be reproduced with a simple example. Take two Cyrillic words: [nova] (a suffix and the short form of the adjective "new") and [noga] ("leg"). They differ only in the 3rd letter. Because this forum isn't in UTF-8, I'm posting a plain-text equivalent of the example here (copy it and save it in ANSI mode, e.g. as uni_test.txt):
Code:
РЅРѕРІР° // nova
РЅРѕРіР° // noga
РЅРѕРІР°
РЅРѕРіР°
Open this file in Lister in UTF-8 mode, select the first word and copy it to the search dialog, then press Shift+F7. Select the second word, copy it to the search dialog and press Shift+F7 again. In both cases Lister reports all four words as matches, which is wrong: each word should match only its own occurrences.
Cause: by default, Lister and TC use case-insensitive search. For UTF-8 text, however, case folding should be applied only to the decoded Unicode search string and text, never to their UTF-8 byte representations. In my example the two words differ in a single UTF-8 byte (0xB2 vs. 0xB3), which the ANSI view above shows as "І" and "і". An ANSI case-insensitive comparison treats those two bytes as the same letter, but in UTF-8 they must never be considered equal.
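
To illustrate the suspected cause, here is a minimal Python sketch. It is not Lister's actual code. It assumes the ANSI codepage is Windows-1251 (as in the listing above), and the function names `ansi_casefold_equal` and `unicode_casefold_equal` are made up for this example. The first function folds case on the UTF-8 bytes viewed as ANSI text, the second folds case on the real Unicode strings.

```python
# Hypothetical sketch of the suspected bug; not taken from Lister or TC.
# Assumption: the system ANSI codepage is Windows-1251.

def ansi_casefold_equal(a: str, b: str, codepage: str = "cp1251") -> bool:
    """Wrong way: view the UTF-8 bytes as ANSI text, then ignore case."""
    a_view = a.encode("utf-8").decode(codepage)
    b_view = b.encode("utf-8").decode(codepage)
    return a_view.lower() == b_view.lower()


def unicode_casefold_equal(a: str, b: str) -> bool:
    """Right way: ignore case on the original Unicode strings."""
    return a.casefold() == b.casefold()


# The two test words, written with escapes so this snippet survives
# a non-UTF-8 forum page.
nova = "\u043d\u043e\u0432\u0430"  # [nova]
noga = "\u043d\u043e\u0433\u0430"  # [noga]

if __name__ == "__main__":
    # The third letters encode to D0 B2 and D0 B3; in Windows-1251 the
    # bytes B2 and B3 are the upper/lower case pair of one letter.
    print(nova.encode("utf-8").hex(" "))
    print(noga.encode("utf-8").hex(" "))
    print("ANSI byte-level compare:", ansi_casefold_equal(nova, noga))
    print("Unicode compare:        ", unicode_casefold_equal(nova, noga))
```

If the search really works like the first function, the two words compare as equal, which would match the behaviour described above. The second function keeps them distinct while still matching upper and lower case of the same word.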