compare files in coding “Utf-8”

Here you can propose new features, make suggestions etc.

Moderators: Hacker, petermad, Stefan2, white

Post Reply
Emmanuil
Junior Member
Junior Member
Posts: 3
Joined: 2007-01-09, 13:52 UTC

compare files in coding “Utf-8”

Post by *Emmanuil »

If possible add in “Compare by Content” compare files in coding “Utf-8”, now it is my compare only files in coding “Unicode”. The coding files in Utf-8 it is the most popular in format XML. In this version it isn’t absent.
User avatar
Flint
Power Member
Power Member
Posts: 3501
Joined: 2003-10-27, 09:25 UTC
Location: Belgrade, Serbia
Contact:

Post by *Flint »

Emmanuil
Use search. It has been discussed many times.
Flint's Homepage: Full TC Russification Package, VirtualDisk, NTFS Links, NoClose Replacer, and other stuff!
 
Using TC 11.03 / Win10 x64
Emmanuil
Junior Member
Junior Member
Posts: 3
Joined: 2007-01-09, 13:52 UTC

Post by *Emmanuil »

2Emmanuil
Dear Flint, Your answer is not correct, all links which you give not about that,
The problem is - when I compare two file in coding Utf-8 - I see symbol which consist from one byte is correct , if see symbol which have coding how two byte it is not correct
see sample:
document.oncontextmenu = test;
//Еапрет на выведение контекÑ￾тного меню
Because I see it ANSI coding,
If I switch on Unicode I can see nothing
Sample:
;⠯⥃∠

Font for view is MS Arial Unicode
User avatar
Flint
Power Member
Power Member
Posts: 3501
Joined: 2003-10-27, 09:25 UTC
Location: Belgrade, Serbia
Contact:

Post by *Flint »

Emmanuil
I did not give any links, I just told that the question has been discussed on this forum. Simple search request immediately gave me the following threads that talk exactly about the thing you're talking about:
http://ghisler.ch/board/viewtopic.php?t=12520
http://ghisler.ch/board/viewtopic.php?t=12422
http://ghisler.ch/board/viewtopic.php?t=10387
http://ghisler.ch/board/viewtopic.php?t=4928
Flint's Homepage: Full TC Russification Package, VirtualDisk, NTFS Links, NoClose Replacer, and other stuff!
 
Using TC 11.03 / Win10 x64
Emmanuil
Junior Member
Junior Member
Posts: 3
Joined: 2007-01-09, 13:52 UTC

Post by *Emmanuil »

Dear Konstantin,
Thanks for link!

I am looking it and understud that author.

"Unfortunately I haven't found a way to add it, sorry. The problem is that UTF-8 characters can be between 1 and 5 bytes long, so we cannot simply make a bytewise comparison."

I send to you leter on Russian how I thinck may is decide it poblem.
If you can't read Russian, character I try explain it in English.
Thanks!
User avatar
Sosna
Member
Member
Posts: 143
Joined: 2006-10-24, 10:52 UTC

Post by *Sosna »

smth. tells me that flint CAN read russian (i'm not sure about speak, bud read - its sure) ;) (sorry for offtop)
Ave Caesar Imperator,
moritari te salutant!
User avatar
Flint
Power Member
Power Member
Posts: 3501
Joined: 2003-10-27, 09:25 UTC
Location: Belgrade, Serbia
Contact:

Post by *Flint »

[OT]
Sosna
I think, Emmanuil meant the problems with encoding. He sent me a message via E-mail from the forum (using the "email" button), but all Russian characters in this letter were replaced with the corresponding HTML entities like 〹. Of course, there is no problem to decode them, but the problem itself takes place. :)
[/OT]
Flint's Homepage: Full TC Russification Package, VirtualDisk, NTFS Links, NoClose Replacer, and other stuff!
 
Using TC 11.03 / Win10 x64
User avatar
Sosna
Member
Member
Posts: 143
Joined: 2006-10-24, 10:52 UTC

Post by *Sosna »

Flint - I see, didn't know that.
Ave Caesar Imperator,
moritari te salutant!
Post Reply