How to search effective in 100GB of files for'text' content?

English support forum

Moderators: white, Hacker, petermad, Stefan2

Post Reply
chrysamere
Junior Member
Junior Member
Posts: 3
Joined: 2017-10-13, 20:33 UTC
Location: Hungary

How to search effective in 100GB of files for'text' content?

Post by *chrysamere »

Hello,

I got 100+GB of company work files. PDF,doc,excel,txt,ppt....

I made a simple file search in total cmd , left every setting on default, i just set the folder to search in and checked "find text".

It found some files like after 5 min search, but they are not nearly the amount what should have been.
For example i know lot of files which contain the "text" i searched for, but these files weren't listed on the search's results.

So my question:
How can i optimize totalcmd search to find all and every file containing the "text" i'm searching for, no matter how long time the search will take?

Is there any trick/tool/script/plugin, or a totally different program which can do the search properly? I tried win10 search multiple ways but it stuck/doesnt work

Thanks very much!!
User avatar
Hacker
Moderator
Moderator
Posts: 13052
Joined: 2003-02-06, 14:56 UTC
Location: Bratislava, Slovakia

Post by *Hacker »

chrysamere,
How can i optimize totalcmd search to find all and every file containing the "text" i'm searching for, no matter how long time the search will take?
I'd say let TC finish the search.
Also, e.g. PDF files contain their text in an encoded form, so a simple text search in TC will not find the text if it is contained in a PDF. Same with DOC, XLS, PPT, etc. files. Some of those can be searched for text by using plugins.

Roman
Mal angenommen, du drückst Strg+F, wählst die FTP-Verbindung (mit gespeichertem Passwort), klickst aber nicht auf Verbinden, sondern fällst tot um.
chrysamere
Junior Member
Junior Member
Posts: 3
Joined: 2017-10-13, 20:33 UTC
Location: Hungary

Post by *chrysamere »

Hacker wrote:...
Hello, thanks for the reply.
I meant the search finished after like 5min, i didnt stop it.

By plugins you meant ones like these ?
totalcmd net/plugring/xpdfsearch.html
totalcmd net/plugring/TextSearch.html

I found these after first google search. Maybe there's a better collecting forum post for these kind of plugins?
User avatar
Hacker
Moderator
Moderator
Posts: 13052
Joined: 2003-02-06, 14:56 UTC
Location: Bratislava, Slovakia

Post by *Hacker »

chrysamere,
By plugins you meant ones like these ?
Yes, like these, but I don't think I have ever personally tried them, so I can't really comment on them.

Roman
Mal angenommen, du drückst Strg+F, wählst die FTP-Verbindung (mit gespeichertem Passwort), klickst aber nicht auf Verbinden, sondern fällst tot um.
User avatar
Ovg
Power Member
Power Member
Posts: 756
Joined: 2014-01-06, 16:26 UTC

Post by *Ovg »

2chrysamere
Try PCREsearch 2.5 from https://www.ghisler.ch/board/viewtopic.php?t=38098
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
chrysamere
Junior Member
Junior Member
Posts: 3
Joined: 2017-10-13, 20:33 UTC
Location: Hungary

Post by *chrysamere »

Thanks for the suggestion, i tried, but couldn't get it work. I can't understand its setting its too complicated:P

By the way, here is an example file (1page pdf)
filedropper DOT com/1_399

what settings should i use to make it find the word "humidity" inside it?
User avatar
Dalai
Power Member
Power Member
Posts: 9364
Joined: 2005-01-28, 22:17 UTC
Location: Meiningen (Südthüringen)

Post by *Dalai »

chrysamere wrote:what settings should i use to make it find the word "humidity" inside it?
I'd search with tab Plugins and set it like this:

Code: Select all

xPDFSearch | Text | contains | humidity
Regards
Dalai
#101164 Personal licence
Ryzen 5 2600, 16 GiB RAM, ASUS Prime X370-A, Win7 x64

Plugins: Services2, Startups, CertificateInfo, SignatureInfo, LineBreakInfo - Download-Mirror
User avatar
Ovg
Power Member
Power Member
Posts: 756
Joined: 2014-01-06, 16:26 UTC

Post by *Ovg »

2chrysamere

1.
Download http://download.oracle.com/otn/outsidein/ca-8-5-3-win-x86-32.zip or http://download.oracle.com/otn/outsidein/ca-8-5-3-win-x86-64.zip or both

2. Extract content of folder redist to somewhere, eg c:\outsidein\redist32 or c:\outsidein\redist64

3. Install PCResearch content plugin

4. Open PCREsearch.ini, find keys:
OitDllPath32=
OitDllPath64=
Write to those keys paths where your files from step 2 resides. eg
OitDllPath32=c:\outsidein\redist32
OitDllPath64=c:\outsidein\redist64
Save and close ini file

5. Restart TC

6. Press Alt+F7

7. Mark find text

8. Click ± at bottom

9. Click Add

10. Type Name of your search and click Ok

11. Type *.* and click Ok

12. Click +

13. in Left pane choose PCResearch, in right -pcresearch.Oracle Outside In fulltext search, press Ok

14. Mark Plugins near ±

15. Enter text to search for

16. Enter file name(s)/mask in search for field at top

17. Click Search
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
User avatar
nsp
Power Member
Power Member
Posts: 1803
Joined: 2005-12-04, 08:39 UTC
Location: Lyon (FRANCE)
Contact:

Re: How to search effective in 100GB of files for'text' cont

Post by *nsp »

chrysamere wrote:Hello,

I got 100+GB of company work files. PDF,doc,excel,txt,ppt....

I made a simple file search in total cmd , left every setting on default, i just set the folder to search in and checked "find text".

It found some files like after 5 min search, but they are not nearly the amount what should have been.
For example i know lot of files which contain the "text" i searched for, but these files weren't listed on the search's results.

So my question:
How can i optimize totalcmd search to find all and every file containing the "text" i'm searching for, no matter how long time the search will take?

Is there any trick/tool/script/plugin, or a totally different program which can do the search properly? I tried win10 search multiple ways but it stuck/doesnt work

Thanks very much!!
Total commander with PCRESearch + outsidein do a pretty good job, for one shot search.

If you need to use this kind of search regularly, you can use a tools like "Doc Fetcher" or enable content search for Windows Search.
For text only i also use "Depeche View Pro".

Some official PDF have font encryption/password protection that prevent search, cut and paste... It will not be easy to find text inside those files anyway !
User avatar
Horst.Epp
Power Member
Power Member
Posts: 6450
Joined: 2003-02-06, 17:36 UTC
Location: Germany

Post by *Horst.Epp »

For such large content searches I use the Windows indexed search.
Using IFilters it delivers instant results for most file types
and I can go to TC with a search result using context menu.
For small trees the PCREserach together with the Oracle files works fine.
Windows 11 Home x64 Version 23H2 (OS Build 22631.3374)
TC 11.03 x64 / x86
Everything 1.5.0.1371a (x64), Everything Toolbar 1.3.2, Listary Pro 6.3.0.69
QAP 11.6.3.2 x64
sorcar
Member
Member
Posts: 100
Joined: 2005-04-12, 17:45 UTC
Location: U.S.

Different program to search

Post by *sorcar »

Is there any trick/tool/script/plugin, or a totally different program which can do the search properly?
I used http://www.inforapid.com/html/searchreplace.htm with great satisfaction. This is somewhat dated, but still works better than any other program I know of.
Post Reply