[9.2x] TC display garbled characters Unicode filename in compressed package
Moderators: Hacker, petermad, Stefan2, white
- thomasmolover
- Member
- Posts: 177
- Joined: 2016-12-12, 01:32 UTC
[9.2x] TC display garbled characters Unicode filename in compressed package
When using TC to browse the files in the compressed package, the filename in the compressed package are compressed by Unicode - such as the linux compressed package, which will display garbled characters. but in any other compress app, they are displayed correct.
here is the package
https://drive.google.com/open?id=1Q--TliqT8H1QcgqkGd7FjFOlnYwEBDCu
LEFT is the correct in 7z, Right is wrong in TC
https://imgur.com/a/XY0MaMf
here is the package
https://drive.google.com/open?id=1Q--TliqT8H1QcgqkGd7FjFOlnYwEBDCu
LEFT is the correct in 7z, Right is wrong in TC
https://imgur.com/a/XY0MaMf
- ghisler(Author)
- Site Admin
- Posts: 50471
- Joined: 2003-02-04, 09:46 UTC
- Location: Switzerland
- Contact:
Re: [9.2x] TC display garbled characters Unicode filename in compressed package
Strange, I get the same names as 7zip here. Which language do you use for non-Unicode programs on your system?
Control panel -> Regional and language ioptions - last tab - language for non-Unicode programs
Control panel -> Regional and language ioptions - last tab - language for non-Unicode programs
Author of Total Commander
https://www.ghisler.com
https://www.ghisler.com
Re: [9.2x] TC display garbled characters Unicode filename in compressed package
Since it is a ZIP archive, there is a problem of missing the only standard for Unicode names...
However in my TC and 7Z names look like on the left screenshot. I have Russian as non-Unicode language. TC 9.21a 32 bit, 7-Zip 16.04.
However in my TC and 7Z names look like on the left screenshot. I have Russian as non-Unicode language. TC 9.21a 32 bit, 7-Zip 16.04.
- thomasmolover
- Member
- Posts: 177
- Joined: 2016-12-12, 01:32 UTC
Re: [9.2x] TC display garbled characters Unicode filename in compressed package
I set to language with Simple Chinese, all my friend use Chinese have the problem.ghisler(Author) wrote: 2018-10-25, 09:33 UTC Strange, I get the same names as 7zip here. Which language do you use for non-Unicode programs on your system?
Control panel -> Regional and language ioptions - last tab - language for non-Unicode programs
One of my friends tould me that he guess Unix compress filename with UTF8noBOM, and TC can display ANSI with local or Unicode16LE,
it display the utf8noBOM in ANSI.
- ghisler(Author)
- Site Admin
- Posts: 50471
- Joined: 2003-02-04, 09:46 UTC
- Location: Switzerland
- Contact:
Re: [9.2x] TC display garbled characters Unicode filename in compressed package
ZIP has a special flag for Unicode names in its standard. This shouldn't happen if the ZIP file follows the ZIP standard. I will have to analyze the file in detail to find out what's wrong.
Author of Total Commander
https://www.ghisler.com
https://www.ghisler.com
Re: [9.2x] TC display garbled characters Unicode filename in compressed package
2thomasmolover
It seems that you use different fonts in 7zip (what version?) and TC. Change fonts in TC, restart Windows and stop digging in fonts when testing software, please.
You can also read https://winaero.com/blog/rebuild-font-cache-windows-10/ or find similar hints for older Windows versions.
It seems that you use different fonts in 7zip (what version?) and TC. Change fonts in TC, restart Windows and stop digging in fonts when testing software, please.
You can also read https://winaero.com/blog/rebuild-font-cache-windows-10/ or find similar hints for older Windows versions.
Andrzej P. Wozniak
Polish subforum moderator
Polish subforum moderator
- ghisler(Author)
- Site Admin
- Posts: 50471
- Joined: 2003-02-04, 09:46 UTC
- Location: Switzerland
- Contact:
Re: [9.2x] TC display garbled characters Unicode filename in compressed package
I have tested this archive: The UTF-8 flag is NOT set in the headers. The problem is that in Chinese, both the UTF-8 encoding and the Chinese ANSI encoding share valid characters. Does anyone know how to reliably determine that it's UTF-8 and not local encoding? I know how to detect valid UTF-8, but this will also detect many non-UTF-8 names falsely as UTF-8.
Author of Total Commander
https://www.ghisler.com
https://www.ghisler.com