Add Ignore names option to Synchronize Dirs
Moderators: Hacker, petermad, Stefan2, white
Add Ignore names option to Synchronize Dirs
TC currently does not support search for copies of the same data scattered all over the disk space if filenames differ.
Proposal:
Please consider implementing Ignore name option in Synchronize Dirs.
Rationale
Hunting for copies created by CTRL+C and CTRL+V is quite a common task whenever you want to get rid of redundant data or simply need to free up disk space.
The list of typical filename discrepancies include:
Copy and paste "myfile" vs "myfile – copy (2)" // by far the most common use case
space "myfile" vs "my file" // edited manually
underscore "myfile" vs "my_file" // edited manually
automated padding "myfile" vs "myfile0002" // made by automation machines
[Christian Ghisler, author] 2022-03-04
Unfortunately you cannot do this job with my sync tool.
Why? There wouldn't be a 1:1 relationship between files on the left and those one on the right if names were ignored.
[mmm]
Yes, I know. Having said that, I am convinced that a large number of TC users would greatly benefit from the option that is currently missing.
I would like to invite comments and thoughts from broader audience.
Proposal:
Please consider implementing Ignore name option in Synchronize Dirs.
Rationale
Hunting for copies created by CTRL+C and CTRL+V is quite a common task whenever you want to get rid of redundant data or simply need to free up disk space.
The list of typical filename discrepancies include:
Copy and paste "myfile" vs "myfile – copy (2)" // by far the most common use case
space "myfile" vs "my file" // edited manually
underscore "myfile" vs "my_file" // edited manually
automated padding "myfile" vs "myfile0002" // made by automation machines
[Christian Ghisler, author] 2022-03-04
Unfortunately you cannot do this job with my sync tool.
Why? There wouldn't be a 1:1 relationship between files on the left and those one on the right if names were ignored.
[mmm]
Yes, I know. Having said that, I am convinced that a large number of TC users would greatly benefit from the option that is currently missing.
I would like to invite comments and thoughts from broader audience.
Re: Add Ignore names option to Synchronize Dirs
mmm,
Roman
Would Duplicate search fit your needs? It searches for copies of the same data scattered all over the disk space and filenames do not matter.TC currently does not support search for copies of the same data scattered all over the disk space if filenames differ.
How would you suggest to implement it? Currently it looks like the author says it's not possible, and you say "I know, but please implement it anyway". Without some idea of how to make it work it will be difficult to implement.[Christian Ghisler, author] 2022-03-04
Unfortunately you cannot do this job with my sync tool.
Why? There wouldn't be a 1:1 relationship between files on the left and those one on the right if names were ignored.
[mmm]
Yes, I know. Having said that, I am convinced that a large number of TC users would greatly benefit from the option that is currently missing.
Roman
Mal angenommen, du drückst Strg+F, wählst die FTP-Verbindung (mit gespeichertem Passwort), klickst aber nicht auf Verbinden, sondern fällst tot um.
Re: Add Ignore names option to Synchronize Dirs
Roman,
I will be more than happy to discuss with you more details of "what" I would like to accomplish with the new tool; whether it be a search tool for duplications or a true sync tool.
With all due respect I am going to refrain from commenting on "how" to implement the algorithm as I would consider it totally inappropriate, not to mention that I do not find myself qualified enough for mentoring TC developers.
Hope you understand,
mmm
I will be more than happy to discuss with you more details of "what" I would like to accomplish with the new tool; whether it be a search tool for duplications or a true sync tool.
With all due respect I am going to refrain from commenting on "how" to implement the algorithm as I would consider it totally inappropriate, not to mention that I do not find myself qualified enough for mentoring TC developers.
Hope you understand,
mmm
Last edited by mmm on 2022-06-29, 10:50 UTC, edited 1 time in total.
Re: Add Ignore names option to Synchronize Dirs
I do not think your statement is 100% correct. If am not mistaken, current implementation of "search for duplicates" does not help when you are looking for duplicates of files located in a selected folder (left pane) and you want to investigate potential copies located in a target data structure that is not within the "source" folder (right pane).Would Duplicate search fit your needs? It searches for copies of the same data scattered all over the disk space and filenames do not matter.
Put another, 'search for duplicates' does the job within a single data structure only; i.e. from the selected folder down the hierarchy.
Yet another, you cannot find copies of local files located on a remote server.
Please let me know if I am wrong.
Re: Add Ignore names option to Synchronize Dirs
mmm,
Roman
That is not correct, you can add as many folders as you like under "Search in".'search for duplicates' does the job within a single data structure only; i.e. from the selected folder down the hierarchy.
Actually, it's the complete opposite - it's the best way to get a suggestion implemented - to present a workable concept / idea.With all due respect I am going to refrain from commenting on "how" to implement the algorithm as I would consider it totally inappropriate, not to mention that I do not find myself qualified enough for mentoring TC developers.
Roman
Mal angenommen, du drückst Strg+F, wählst die FTP-Verbindung (mit gespeichertem Passwort), klickst aber nicht auf Verbinden, sondern fällst tot um.
Re: Add Ignore names option to Synchronize Dirs
Oh my goodness. I did not know I could search in multiple folders.
Many thanks for your educating me!
When I fully test the entire search process I will let you know about the outcome.
Thanks and regards,
mmm
Many thanks for your educating me!
When I fully test the entire search process I will let you know about the outcome.
Thanks and regards,
mmm
Re: Add Ignore names option to Synchronize Dirs
Just my 2¢ :
I usually first search for folders (for a whole list or projects to cleanup) -> feed to listbox (select ALL)-> search for duplicates in selected file/folder ->feed to listbox -> use specific numpad[+]
If you have to search many times in same list of folder, you can save/edit the list into a file and use loadsearch to fead the listbox and then select all then do your searchs....
Re: Add Ignore names option to Synchronize Dirs
Very nice. Thanks.
This kind of feedback is highly appreciated. Always.
Just out of curiosity - did you ever consider submitting a request for automating this process in TC in order to eliminate/minimize manual work ? Do you think it is worth implementing, considering number of potential users and the number of steps to follow?
This kind of feedback is highly appreciated. Always.
Just out of curiosity - did you ever consider submitting a request for automating this process in TC in order to eliminate/minimize manual work ? Do you think it is worth implementing, considering number of potential users and the number of steps to follow?
Re: Add Ignore names option to Synchronize Dirs
You can do it yourself in suggestion forum..mmm wrote: 2022-06-29, 13:01 UTC Just out of curiosity - did you ever consider submitting a request for automating this process in TC in order to eliminate/minimize manual work ? Do you think it is worth implementing, considering number of potential users and the number of steps to follow?
//Now it is a bit late for 10.5 as during R.C version Christian usually do not add features (he just do BugFix).
I use PowerPro to automate my work and in many case i find a way to complement TC with scripts.
Re: Add Ignore names option to Synchronize Dirs
Well, this conversation has just circled.mmm
Just out of curiosity - did you ever consider submitting a request for automating this process in TC in order to eliminate/minimize manual work ? Do you think it is worth implementing, considering number of potential users and the number of steps to follow?
nsp
You can do it yourself in suggestion forum..
---------------------------------------
Let's go second round then:
---------------------------------------
Much as I try I cannot figure out how to hunt for copies scattered all over disk space and select redundant files for removal.
Synchronize Dirs is absolutely fascinating tool designed for this type of job; however, there is one major constraint - it is restricted to files with the same names only. As far as I know you cannot find copies created by means of Copy and Paste on Windows; e.g. "myfile - copy (53).xlsx".
Does TC file manager support disk cleanup?
If not, I would like to encourage TC team to give it a thought.
I cannot stress more that I am looking for a robust built-in solution with graphical interface and minimum user interaction.
Re: Add Ignore names option to Synchronize Dirs
mmm,
So the main question is what does not suit you about duplicate search?
Roman
So the main question is what does not suit you about duplicate search?
Roman
Mal angenommen, du drückst Strg+F, wählst die FTP-Verbindung (mit gespeichertem Passwort), klickst aber nicht auf Verbinden, sondern fällst tot um.
Re: Add Ignore names option to Synchronize Dirs
Search part - GUI for selecting data to search for aka source files aka golden files (left pane)
GUI for selecting data structure to search in (right pane)
Selection and deletion part - GUI for picking duplications/copies to be deleted
Put simply, I am missing the incredibly smooth and intuitive workflow implemented in Synchronize Dirs.
GUI for selecting data structure to search in (right pane)
Selection and deletion part - GUI for picking duplications/copies to be deleted
Put simply, I am missing the incredibly smooth and intuitive workflow implemented in Synchronize Dirs.
Re: Add Ignore names option to Synchronize Dirs
mmm,
Personally I am using specialized tools for duplicate search that offer an option to keep a database of file hashes to speed up consecutive searches.
Roman
True, if you treat a group of files as "originals" and another group as "possible duplicates", I don't think there is really a good way in TC to do that.Search part - GUI for selecting data to search for aka source files aka golden files (left pane)
GUI for selecting data structure to search in (right pane)
Well, after the search is complete, if you click Feed to listbox and press Num+, there is a dialog which allows you to select duplicates based on various criteria (it is different from the usual Num+ selection dialog).Selection and deletion part - GUI for picking duplications/copies to be deleted
Personally I am using specialized tools for duplicate search that offer an option to keep a database of file hashes to speed up consecutive searches.
Roman
Mal angenommen, du drückst Strg+F, wählst die FTP-Verbindung (mit gespeichertem Passwort), klickst aber nicht auf Verbinden, sondern fällst tot um.
Re: Add Ignore names option to Synchronize Dirs
How sad.Personally I am using specialized tools for duplicate search that offer an option to keep a database of file hashes to speed up consecutive searches.
I hope my suggestion is still valid regardless your personal preference.
Re: Add Ignore names option to Synchronize Dirs
mmm,

Roman
I have learned to live with this deep sadnessHow sad.

Of course. My preference is irrelevant. Just please remember, the more detailed your suggestion / concept is, the better chance of it being implemented.I hope my suggestion is still valid regardless your personal preference.
Roman
Mal angenommen, du drückst Strg+F, wählst die FTP-Verbindung (mit gespeichertem Passwort), klickst aber nicht auf Verbinden, sondern fällst tot um.