Default value of "choose the text encoding type"

Posted: 2010-04-11, 12:19 UTC
by BeckYang
There is an "ANSI <-> ANSI" button in the "Compare by Content" dialog.
Currently, when I compare lots of UTF-8 text files, I have to click it and choose UTF-8/UTF-8 every time so that the text on the left and right sides is displayed the way I want.

It would be great if the text encoding type could be loaded from a default value (including for the display in the left/right panels).
To save the default value, I suggest adding a "Save as default" checkbox to the "Choose encoding" dialog. If we want to save the setting, we can check it and click the OK button.

Thanks for your development.

Posted: 2010-04-11, 15:33 UTC
by ghisler(Author)
The compare tool currently detects UTF-8 and UTF-16 only by the BOM (byte order mark) at the beginning of the file. Files without a BOM are detected as ANSI text. Changing the default for all files would be easy to add, but probably not very useful. Any better ideas? Maybe some kind of improved detection?
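
For readers who want to see what BOM-only detection amounts to, here is a minimal Python sketch. It is not the compare tool's actual code; the function name `detect_by_bom` and the returned labels are made up for illustration.

```python
import codecs

def detect_by_bom(data: bytes) -> str:
    """Return an encoding label based only on a leading BOM.

    Hypothetical helper for illustration; the labels are assumptions.
    """
    if data.startswith(codecs.BOM_UTF8):       # bytes EF BB BF
        return "utf-8"
    if data.startswith(codecs.BOM_UTF16_LE):   # bytes FF FE
        return "utf-16-le"
    if data.startswith(codecs.BOM_UTF16_BE):   # bytes FE FF
        return "utf-16-be"
    # No BOM found: the file is treated as ANSI text.
    return "ansi"
```

Note that a UTF-8 file saved without a BOM falls through to "ansi" here, which is exactly the situation described in the first post.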

Posted: 2010-04-11, 21:07 UTC
by Hacker
Christian,
Changing the default for all files would be easy to add, but probably not very useful. Any better ideas? Maybe some kind of improved detection?
There has been a request to be able to set the default in the Slovak/Czech forum, too (if possible in addition to any improved detection routine).

Roman

Posted: 2010-04-12, 14:36 UTC
by BeckYang
For me, most text files are UTF-8 or ANSI (the Windows system encoding). TC already supports them well enough.
But I prefer to remove the BOM from UTF-8 files. I have had too much trouble handling text files with a BOM. :(

For your reference, there is a C/C++/Java library named ICU which can detect the charset.
http://userguide.icu-project.org/conversion/detection
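
A much simpler idea than a full detection library, sketched in Python: if there is no BOM, try to decode the bytes as strict UTF-8 and fall back to ANSI only when that fails. This is neither ICU's algorithm nor anything TC does; the name `guess_encoding` and the `cp1252` fallback are assumptions for the example.

```python
def guess_encoding(data: bytes, fallback: str = "cp1252") -> str:
    """Guess an encoding for BOM-less text by validating it as UTF-8.

    Hypothetical helper; 'fallback' stands in for the system ANSI code page.
    """
    if data.startswith(b"\xef\xbb\xbf"):
        return "utf-8-sig"          # UTF-8 with BOM
    try:
        data.decode("utf-8")        # strict decoding raises on invalid sequences
    except UnicodeDecodeError:
        return fallback             # not valid UTF-8: assume ANSI
    if all(b < 0x80 for b in data):
        return "ascii"              # pure 7-bit text, same bytes in both encodings
    return "utf-8"                  # valid multi-byte UTF-8 sequences found
```

This works reasonably well because ANSI text with accented characters is rarely valid UTF-8 by accident. One caveat: if only the first part of a file is checked, the sample may end in the middle of a multi-byte character and be wrongly rejected.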