How to search effective in 100GB of files for'text' content?
Moderators: Hacker, petermad, Stefan2, white
-
- Junior Member
- Posts: 3
- Joined: 2017-10-13, 20:33 UTC
- Location: Hungary
How to search effective in 100GB of files for'text' content?
Hello,
I got 100+GB of company work files. PDF,doc,excel,txt,ppt....
I made a simple file search in total cmd , left every setting on default, i just set the folder to search in and checked "find text".
It found some files like after 5 min search, but they are not nearly the amount what should have been.
For example i know lot of files which contain the "text" i searched for, but these files weren't listed on the search's results.
So my question:
How can i optimize totalcmd search to find all and every file containing the "text" i'm searching for, no matter how long time the search will take?
Is there any trick/tool/script/plugin, or a totally different program which can do the search properly? I tried win10 search multiple ways but it stuck/doesnt work
Thanks very much!!
I got 100+GB of company work files. PDF,doc,excel,txt,ppt....
I made a simple file search in total cmd , left every setting on default, i just set the folder to search in and checked "find text".
It found some files like after 5 min search, but they are not nearly the amount what should have been.
For example i know lot of files which contain the "text" i searched for, but these files weren't listed on the search's results.
So my question:
How can i optimize totalcmd search to find all and every file containing the "text" i'm searching for, no matter how long time the search will take?
Is there any trick/tool/script/plugin, or a totally different program which can do the search properly? I tried win10 search multiple ways but it stuck/doesnt work
Thanks very much!!
chrysamere,
Also, e.g. PDF files contain their text in an encoded form, so a simple text search in TC will not find the text if it is contained in a PDF. Same with DOC, XLS, PPT, etc. files. Some of those can be searched for text by using plugins.
Roman
I'd say let TC finish the search.How can i optimize totalcmd search to find all and every file containing the "text" i'm searching for, no matter how long time the search will take?
Also, e.g. PDF files contain their text in an encoded form, so a simple text search in TC will not find the text if it is contained in a PDF. Same with DOC, XLS, PPT, etc. files. Some of those can be searched for text by using plugins.
Roman
Mal angenommen, du drückst Strg+F, wählst die FTP-Verbindung (mit gespeichertem Passwort), klickst aber nicht auf Verbinden, sondern fällst tot um.
-
- Junior Member
- Posts: 3
- Joined: 2017-10-13, 20:33 UTC
- Location: Hungary
Hello, thanks for the reply.Hacker wrote:...
I meant the search finished after like 5min, i didnt stop it.
By plugins you meant ones like these ?
totalcmd net/plugring/xpdfsearch.html
totalcmd net/plugring/TextSearch.html
I found these after first google search. Maybe there's a better collecting forum post for these kind of plugins?
chrysamere,
Roman
Yes, like these, but I don't think I have ever personally tried them, so I can't really comment on them.By plugins you meant ones like these ?
Roman
Mal angenommen, du drückst Strg+F, wählst die FTP-Verbindung (mit gespeichertem Passwort), klickst aber nicht auf Verbinden, sondern fällst tot um.
2chrysamere
Try PCREsearch 2.5 from https://www.ghisler.ch/board/viewtopic.php?t=38098
Try PCREsearch 2.5 from https://www.ghisler.ch/board/viewtopic.php?t=38098
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
#259941, TC 11.01 x64, Windows 7 SP1 x64
-
- Junior Member
- Posts: 3
- Joined: 2017-10-13, 20:33 UTC
- Location: Hungary
I'd search with tab Plugins and set it like this:chrysamere wrote:what settings should i use to make it find the word "humidity" inside it?
Code: Select all
xPDFSearch | Text | contains | humidity
Dalai
#101164 Personal licence
Ryzen 5 2600, 16 GiB RAM, ASUS Prime X370-A, Win7 x64
Plugins: Services2, Startups, CertificateInfo, SignatureInfo, LineBreakInfo - Download-Mirror
Ryzen 5 2600, 16 GiB RAM, ASUS Prime X370-A, Win7 x64
Plugins: Services2, Startups, CertificateInfo, SignatureInfo, LineBreakInfo - Download-Mirror
2chrysamere
1. Download http://download.oracle.com/otn/outsidein/ca-8-5-3-win-x86-32.zip or http://download.oracle.com/otn/outsidein/ca-8-5-3-win-x86-64.zip or both
2. Extract content of folder redist to somewhere, eg c:\outsidein\redist32 or c:\outsidein\redist64
3. Install PCResearch content plugin
4. Open PCREsearch.ini, find keys:
OitDllPath32=
OitDllPath64=
Write to those keys paths where your files from step 2 resides. eg
OitDllPath32=c:\outsidein\redist32
OitDllPath64=c:\outsidein\redist64
Save and close ini file
5. Restart TC
6. Press Alt+F7
7. Mark find text
8. Click ± at bottom
9. Click Add
10. Type Name of your search and click Ok
11. Type *.* and click Ok
12. Click +
13. in Left pane choose PCResearch, in right -pcresearch.Oracle Outside In fulltext search, press Ok
14. Mark Plugins near ±
15. Enter text to search for
16. Enter file name(s)/mask in search for field at top
17. Click Search
1. Download http://download.oracle.com/otn/outsidein/ca-8-5-3-win-x86-32.zip or http://download.oracle.com/otn/outsidein/ca-8-5-3-win-x86-64.zip or both
2. Extract content of folder redist to somewhere, eg c:\outsidein\redist32 or c:\outsidein\redist64
3. Install PCResearch content plugin
4. Open PCREsearch.ini, find keys:
OitDllPath32=
OitDllPath64=
Write to those keys paths where your files from step 2 resides. eg
OitDllPath32=c:\outsidein\redist32
OitDllPath64=c:\outsidein\redist64
Save and close ini file
5. Restart TC
6. Press Alt+F7
7. Mark find text
8. Click ± at bottom
9. Click Add
10. Type Name of your search and click Ok
11. Type *.* and click Ok
12. Click +
13. in Left pane choose PCResearch, in right -pcresearch.Oracle Outside In fulltext search, press Ok
14. Mark Plugins near ±
15. Enter text to search for
16. Enter file name(s)/mask in search for field at top
17. Click Search
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
#259941, TC 11.01 x64, Windows 7 SP1 x64
Re: How to search effective in 100GB of files for'text' cont
Total commander with PCRESearch + outsidein do a pretty good job, for one shot search.chrysamere wrote:Hello,
I got 100+GB of company work files. PDF,doc,excel,txt,ppt....
I made a simple file search in total cmd , left every setting on default, i just set the folder to search in and checked "find text".
It found some files like after 5 min search, but they are not nearly the amount what should have been.
For example i know lot of files which contain the "text" i searched for, but these files weren't listed on the search's results.
So my question:
How can i optimize totalcmd search to find all and every file containing the "text" i'm searching for, no matter how long time the search will take?
Is there any trick/tool/script/plugin, or a totally different program which can do the search properly? I tried win10 search multiple ways but it stuck/doesnt work
Thanks very much!!
If you need to use this kind of search regularly, you can use a tools like "Doc Fetcher" or enable content search for Windows Search.
For text only i also use "Depeche View Pro".
Some official PDF have font encryption/password protection that prevent search, cut and paste... It will not be easy to find text inside those files anyway !
For such large content searches I use the Windows indexed search.
Using IFilters it delivers instant results for most file types
and I can go to TC with a search result using context menu.
For small trees the PCREserach together with the Oracle files works fine.
Using IFilters it delivers instant results for most file types
and I can go to TC with a search result using context menu.
For small trees the PCREserach together with the Oracle files works fine.
Windows 11 Home, Version 24H2 (OS Build 26100.4061)
TC 11.55 RC2 x64 / x86
Everything 1.5.0.1391a (x64), Everything Toolbar 1.5.2.0, Listary Pro 6.3.2.88
QAP 11.6.4.4 x64
TC 11.55 RC2 x64 / x86
Everything 1.5.0.1391a (x64), Everything Toolbar 1.5.2.0, Listary Pro 6.3.2.88
QAP 11.6.4.4 x64
Different program to search
I used http://www.inforapid.com/html/searchreplace.htm with great satisfaction. This is somewhat dated, but still works better than any other program I know of.Is there any trick/tool/script/plugin, or a totally different program which can do the search properly?