Horst.Epp wrote: 2023-12-24, 21:46 UTC
fcorbelli wrote: 2023-12-24, 18:51 UTC
Windows Defender is... evil... aehm...
Code: Select all
C:\zpaqfranz>sha256deep64 zpaqfranz.exe
ce8ab930d3778ad4bb677ba2077a263f21f241347163a7f0fef1379a4d0c2f22 C:\zpaqfranz\zpaqfranz.exe
I like Defender; it gives fewer problems than other antivirus tools.
For many years I was responsible for the antivirus solutions of some large companies.
The checksum of the zpaqfranz.exe is OK.
I don't trust antivirus software very much, preferring to focus on other anti-virus and anti-ransomware mechanisms.
The various antiviruses are the very first thing I disable after a Windows installation, along with UAC, non-admin execution, etc.
Yep, I am VERY old school
I am trying to understand: what is the benefit of storing the list in ADS streams?
Speed
zpaq stores the file list inside i-blocks, with added and removed files (removed files have date==0)
You can see this (on an unencrypted, single-part zpaq) with
Code: Select all
zpaqfranz dump thefile.zpaq
zpaqfranz dump thefile.zpaq -verbose
zpaq's journaled archives are stored in blocks (aka: chunks), one after the other.
Here is an example:
https://encode.su/threads/456-zpaq-updates?p=81361&viewfull=1#post81361
During an l (list), every block is read, decoded and (sometimes) skipped over (aka: a seek past the data blocks holding the "real" compressed data), for every version.
Therefore, if you have a lot of blocks and a lot of versions, and worse still on a spinning drive, listing the files requires a lot of work.
Sometimes it even takes minutes to list a .zpaq
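To make the cost concrete, here is a toy model of the access pattern just described. It is NOT zpaq's real on-disk format (the block names and counts are my own illustration): each version appends one i-block (file-list metadata) plus many d-blocks (compressed data), and a standard list must touch every block of every version.

```python
# Toy model of why a standard list is slow (illustrative only,
# not zpaq's real format).
def make_archive(versions, dblocks_per_version):
    archive = []
    for v in range(versions):
        archive.append(("i", v))                               # metadata block
        archive.extend(("d", v) for _ in range(dblocks_per_version))
    return archive

def standard_list(archive):
    decoded = seeks = 0
    for kind, _ in archive:
        if kind == "i":
            decoded += 1   # i-blocks must be decoded to rebuild the file list
        else:
            seeks += 1     # d-blocks still cost a header read + a seek
    return decoded, seeks

decoded, seeks = standard_list(make_archive(100, 500))
# 100 versions => 100 i-blocks decoded, 50,000 d-blocks seeked past
```

The work grows with blocks times versions, which is exactly why a many-version archive on a spinning drive can take minutes to list.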
https://encode.su/threads/4168-Virus-like-data-storage-(!)
The -ads switch will
(1) fill up the dt map (aka: the already-present files) during an a (add)
(2) do the add as always
(3) seek to the newest i-blocks inside the archive
(4) decode the new dt map (up to the last version) [side note: why? to test here the future, next-to-be-released hash-based DTMap]
(5) compress one line at a time with LZ4 (yes, LZ4)
(6) store the result inside a zpaqlist ADS
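Steps (5)-(6) can be sketched roughly like this. The record layout (a 4-byte little-endian length prefix followed by one compressed line) is my assumption, not zpaqfranz's actual format; zpaqfranz uses LZ4 and writes into an NTFS ADS, while here stdlib zlib and an in-memory buffer stand in for both.

```python
import io
import struct
import zlib

# Hypothetical per-line record: 4-byte length + compressed line.
# zlib stands in for LZ4; BytesIO stands in for the NTFS ADS.
def write_line_stream(lines, out):
    for line in lines:
        blob = zlib.compress(line.encode("utf-8"))
        out.write(struct.pack("<I", len(blob)))
        out.write(blob)

buf = io.BytesIO()
write_line_stream(["z:/biz/a.txt", "z:/biz/b.txt"], buf)
```

Compressing one line at a time (instead of the whole list at once) is what later allows the reader to stream the list back without holding it all in RAM.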
When you ask to list the content of the zpaq, if an ADS is present, the file will be LZ4-decompressed (one line at a time), then printed.
LZ4 means "very, very fast, even on a single-threaded, older CPU", and "line by line" means "do not create a giant in-RAM vector of the file list" => you can list a ridiculously big zpaq even on an older laptop.
=> You get about the same data as a regular list, but way, way faster.
Sometimes even 100x faster (for huge archives).
Decompressing an LZ4 stream takes about 1-2 seconds (of course depending on file size and computer speed), so the list is almost immediate for (about) any file-list size, because only the very last version is stored (on multipart archives, one per part).
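The "line by line, no giant in-RAM vector" reader can be sketched as a generator. Same hypothetical record layout as a stand-in for the real LZ4 stream (4-byte length prefix + one compressed line, zlib replacing LZ4): memory stays flat no matter how many files the archive holds.

```python
import io
import struct
import zlib

# Build a stand-in compressed stream (zlib replaces LZ4,
# record layout is my own illustration).
def compress_lines(lines):
    buf = io.BytesIO()
    for line in lines:
        blob = zlib.compress(line.encode("utf-8"))
        buf.write(struct.pack("<I", len(blob)))
        buf.write(blob)
    return buf.getvalue()

def iter_lines(stream):
    """Yield one decompressed line at a time: constant memory,
    regardless of how many entries the file list holds."""
    while True:
        header = stream.read(4)
        if len(header) < 4:
            return
        (n,) = struct.unpack("<I", header)
        yield zlib.decompress(stream.read(n)).decode("utf-8")

data = compress_lines("file%06d.txt" % i for i in range(10000))
count = sum(1 for _ in iter_lines(io.BytesIO(data)))   # 10000 lines streamed
```

Each line can be printed as soon as it is decoded, so even an old laptop never needs to materialize the whole list.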
You can do a quick test downloading this
http://www.francocorbelli.it/zpaqfranz/zbiz.zpaq
Suppose it is saved as h:\zbiz.zpaq (on an NTFS drive); then run (of course, with a tail somewhere in the path)
Code: Select all
zpaqfranz l h:\zbiz.zpaq -out normal.txt |tail
This will create a normal.txt with the file list, using the "standard" zpaq read_archive (aka: recomputing everything from scratch).
Now we create the ADS.
Then we list again, this time with LZ4.
OK, now suppose you add something to the archive (as always), just one file (or whatever you like):
Code: Select all
zpaqfranz a h:\zbiz.zpaq normal.txt -ads
Now, if you list "the standard way", you get a full decoding every time => very slow.
With the newer ADS support, the list comes back in no time.
It is not yet the final version, but I hope it makes the usefulness clear (-all does not work, etc.; it is just a nightly build).
Obviously, if you work with tiny archives the difference is negligible, but I need to handle large amounts of data, and zpaqfranz is just my 'toolbox'.
Final note: size matters. Even just creating a file list of a few million files can require multiple gigabytes of RAM. Few things are worse than not even being able to see the contents of an archive because you are using an emergency laptop in the field, very different from the super-powerful office PC with 128GB, or even a 768GB Xeon server. The behaviour of zpaq (and zpaqfranz) is to read each block, create a map in memory, and then display it.
As you can see, this behaviour (which still persists, and is why I am rewriting read_archive as read_archive2) allocates large amounts of memory.
In the future, at least in my intentions, I will arrive at a list function that works per row, and not per map, because "this" is very "bad" (=> a RAM eater).
Basically, it overwrites the data of each individual filename across the various versions added.
This ensures that, when finished, the data is the most up to date.
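The overwrite-by-replay behaviour can be sketched like this (the entry fields and names are illustrative, not zpaq's actual structures): each version's record for a filename overwrites the previous one, and date == 0 marks a removal.

```python
# Hedged sketch of the map-based replay: later versions overwrite
# earlier ones; date == 0 means the file was removed.
def replay(versions):
    dt = {}
    for version in versions:
        for fn, date in version:
            dt[fn] = date
    return {fn: d for fn, d in dt.items() if d != 0}   # still-present files

history = [
    [("a.txt", 100), ("b.txt", 100)],   # version 1: both files added
    [("a.txt", 200)],                   # version 2: a.txt updated
    [("b.txt", 0)],                     # version 3: b.txt removed
]
final = replay(history)   # only a.txt survives, with the newest date
```

This is exactly why the final map holds the most up-to-date state, and also why it must hold one entry per filename in RAM.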
However, fn is really the complete filename with path, which can easily be 100 or 200 bytes long (e.g. fn=z:/biz/biz01/01/mingw32/include/c++/13.2.0/ext/pb_ds/detail/rc_binomial_heap_/rc_binomial_heap_.hpp), whereas I was thinking of keeping just 8 bytes (the 64-bit XXHASH code). Faster and frugal (but prone to hash collisions).
=>It still needs a lot of work.
ADS is therefore the first step: fast AND frugal in memory once the stream exists (i.e. when the server has created it during the backup update), but NOT very frugal in the CREATION of the file list.
One last note: these are OT considerations with respect to the zpaq plugin; if they are not interesting, the moderators can always delete them.
On the other hand, they also apply to zpaq 707 and 715
zpaqfranz's author here. Yes, my name is Franco Corbelli => hence "zpaqfranz" :)