Page 1 of 1
compare files in coding “Utf-8”
Posted: 2007-01-09, 14:03 UTC
by Emmanuil
If possible add in “Compare by Content” compare files in coding “Utf-8”, now it is my compare only files in coding “Unicode”. The coding files in Utf-8 it is the most popular in format XML. In this version it isn’t absent.
Posted: 2007-01-09, 14:19 UTC
by Flint
Emmanuil
Use search. It has been discussed many times.
Posted: 2007-01-09, 15:53 UTC
by Emmanuil
2Emmanuil
Dear Flint, Your answer is not correct, all links which you give not about that,
The problem is - when I compare two file in coding Utf-8 - I see symbol which consist from one byte is correct , if see symbol which have coding how two byte it is not correct
see sample:
document.oncontextmenu = test;
//Еапрет на выведение контекÑтного меню
Because I see it ANSI coding,
If I switch on Unicode I can see nothing
Sample:
;⠯⥃∠
Font for view is MS Arial Unicode
Posted: 2007-01-09, 16:09 UTC
by Flint
Emmanuil
I did not give any links, I just told that the question has been discussed on this forum. Simple search request immediately gave me the following threads that talk exactly about the thing you're talking about:
http://ghisler.ch/board/viewtopic.php?t=12520
http://ghisler.ch/board/viewtopic.php?t=12422
http://ghisler.ch/board/viewtopic.php?t=10387
http://ghisler.ch/board/viewtopic.php?t=4928
Posted: 2007-01-09, 21:26 UTC
by Emmanuil
Dear Konstantin,
Thanks for link!
I am looking it and understud that author.
"Unfortunately I haven't found a way to add it, sorry. The problem is that UTF-8 characters can be between 1 and 5 bytes long, so we cannot simply make a bytewise comparison."
I send to you leter on Russian how I thinck may is decide it poblem.
If you can't read Russian, character I try explain it in English.
Thanks!
Posted: 2007-01-10, 08:45 UTC
by Sosna
smth. tells me that flint CAN read russian (i'm not sure about speak, bud read - its sure)

(sorry for offtop)
Posted: 2007-01-10, 08:56 UTC
by Flint
[OT]
Sosna
I think,
Emmanuil meant the problems with encoding. He sent me a message via E-mail from the forum (using the "email" button), but all Russian characters in this letter were replaced with the corresponding HTML entities like
〹. Of course, there is no problem to decode them, but the problem itself takes place.

[/OT]
Posted: 2007-01-10, 14:16 UTC
by Sosna
Flint - I see, didn't know that.