WCX_RedTess - OCR (text recognition in TC)

Discuss and announce Total Commander plugins, addons and other useful tools here, both their usage and their development.

Moderators: white, Hacker, petermad, Stefan2

User avatar
alexanderwdark
Senior Member
Senior Member
Posts: 270
Joined: 2008-04-14, 07:20 UTC
Location: Russia
Contact:

WCX_RedTess - OCR (text recognition in TC)

Post by *alexanderwdark »

WCX plugin for text recognition. Make your PDF and images visible as archives with text files (with language code as name) inside.

Just try to press Ctrl+PgDown on PDF, JPG, etc.

Code: Select all

WCX_TESS - C++ image to text & PDF to text converter in the form of TC packer plugin.

Based on code from Tesseract, Poppler, Leptonica and/or OpenCV libraries.

Text recognition here working using "trained models" from Tesseract.

Russian and English models are included in basic archive (*.traineddata files).

If you need any other models, download it and write language codes into "redtess.json" config.

You need "langs" key for this. Mixed records such as "eng, rus" are allowed.

You will see all these values in TC panel as virtual archive's files with txt extension.

There is "Fast" version of "trained models" by default.

It works fast, though can have some problems (but no so bad!).

But you can get "Best" version of models using this link:
https://github.com/tesseract-ocr/tessdata_best
And replace tessdata folder content.

Or use normal models:
https://github.com/tesseract-ocr/tessdata

Also you can enable support of many other image formats (see "formats" key in config).

You can use any of Leptonica or OpenCV supported pictures with this plugin.

Multi-page at this moment enabled for TIFF format.

PDF get rasterized in memory before recognition, so try to tune DPI in configuration file.

Leptonica is default library for plugin, but you can switch to OpenCV.
Latest version is here
Last edited by alexanderwdark on 2019-10-22, 09:05 UTC, edited 4 times in total.
mikedepetris
Junior Member
Junior Member
Posts: 84
Joined: 2006-11-07, 16:36 UTC
Location: Trieste, Italy
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *mikedepetris »

Installing 64 bit version I get "This is not a valid plugin! You can find valid plugin files on www.ghisler.com under section 'Addons'."
User avatar
Dalai
Power Member
Power Member
Posts: 9385
Joined: 2005-01-28, 22:17 UTC
Location: Meiningen (Südthüringen)

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *Dalai »

2mikedepetris
Did you install MS Visual C++ runtime (x64), as the author said?

Regards
Dalai
#101164 Personal licence
Ryzen 5 2600, 16 GiB RAM, ASUS Prime X370-A, Win7 x64

Plugins: Services2, Startups, CertificateInfo, SignatureInfo, LineBreakInfo - Download-Mirror
User avatar
Gral
Power Member
Power Member
Posts: 1467
Joined: 2005-01-26, 15:12 UTC

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *Gral »

mikedepetris wrote: 2019-10-15, 20:34 UTC Installing 64 bit version I get "This is not a valid plugin! You can find valid plugin files on www.ghisler.com under section 'Addons'."
Same here with 32 bit version.
User avatar
alexanderwdark
Senior Member
Senior Member
Posts: 270
Joined: 2008-04-14, 07:20 UTC
Location: Russia
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *alexanderwdark »

I just tested on Windows 7. What is your system? Are current updates installed there?

Installed runtime from https://support.microsoft.com/help/2977003 and it works.
(You can get VC_redist.x64.exe or VC_redist.x86.exe from plugin's archive, install and restart)

Also you can read some info here: https://support.microsoft.com/help/2999226

FileInfo lister plugin for TC can show you info about missing libraries (red color).
mikedepetris
Junior Member
Junior Member
Posts: 84
Joined: 2006-11-07, 16:36 UTC
Location: Trieste, Italy
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *mikedepetris »

Installed successfully on a Windows 10 but had the problem on apparently similar different PC
tried (re) installing VC++ libs
tried running TC as admin
User avatar
alexanderwdark
Senior Member
Senior Member
Posts: 270
Joined: 2008-04-14, 07:20 UTC
Location: Russia
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *alexanderwdark »

mikedepetris wrote: 2019-10-16, 07:31 UTC tried (re) installing VC++ libs
Do you have api-ms-win*.dll files in %systemroot%\System32 folder ?

Microsoft says, they must install themselves along with updates (on Windows 10).
User avatar
alexanderwdark
Senior Member
Senior Member
Posts: 270
Joined: 2008-04-14, 07:20 UTC
Location: Russia
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *alexanderwdark »

You can try also ucrt libraries from this archive https://yadi.sk/d/jKWElEwD9tb_Fg (from Windows 10 SDK redist package)
Put x86 or x64 content into plugin archive before install.
User avatar
Usher
Power Member
Power Member
Posts: 1675
Joined: 2011-03-11, 10:11 UTC

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *Usher »

2alexanderwdark
It doesn't work. In Windows XP it displays error message "File RedTess.wcx not found" when trying to register supported file extensions in TC. What is more, all dlls are compiled for Windows Vista, so they may NOT work in Windows XP at all.
Andrzej P. Wozniak
Polish subforum moderator
mikedepetris
Junior Member
Junior Member
Posts: 84
Joined: 2006-11-07, 16:36 UTC
Location: Trieste, Italy
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *mikedepetris »

alexanderwdark wrote: 2019-10-16, 08:12 UTC
mikedepetris wrote: 2019-10-16, 07:31 UTC tried (re) installing VC++ libs
Do you have api-ms-win*.dll files in %systemroot%\System32 folder ?

Microsoft says, they must install themselves along with updates (on Windows 10).
nothing in %systemroot%\System32 thats maps to c:\WINDOWS\System32\
found a lot of them under c:\WINDOWS\System32\downlevel
what could I try?
User avatar
alexanderwdark
Senior Member
Senior Member
Posts: 270
Joined: 2008-04-14, 07:20 UTC
Location: Russia
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *alexanderwdark »

Usher wrote: 2019-10-16, 17:42 UTC 2alexanderwdark
It doesn't work. In Windows XP it displays error message "File RedTess.wcx not found" when trying to register supported file extensions in TC. What is more, all dlls are compiled for Windows Vista, so they may NOT work in Windows XP at all.
Yes, this compiler from MS no longer supports XP
User avatar
alexanderwdark
Senior Member
Senior Member
Posts: 270
Joined: 2008-04-14, 07:20 UTC
Location: Russia
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *alexanderwdark »

mikedepetris wrote: 2019-10-16, 21:15 UTC
alexanderwdark wrote: 2019-10-16, 08:12 UTC
mikedepetris wrote: 2019-10-16, 07:31 UTC tried (re) installing VC++ libs
Do you have api-ms-win*.dll files in %systemroot%\System32 folder ?

Microsoft says, they must install themselves along with updates (on Windows 10).
nothing in %systemroot%\System32 thats maps to c:\WINDOWS\System32\
found a lot of them under c:\WINDOWS\System32\downlevel
what could I try?
I have only one system with such problem. It was windows 8.1, then updated to 10. Clear install works ok after VC_redist.x64.exe
I have not found a solution yet. Maybe, Microsoft experts know something.
User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48072
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *ghisler(Author) »

Please compile the plugin with the option:
"Multithreaded (/MT)"
and not
"Multithreaded-DLL (/MD)".

This is misleading: The option does NOT mean that you want to create a DLL. It means that it uses the Microsoft redistribution package. When you use /MT option, then all necessary functions will be compiled into the plugin. Normally this doesn't make it much larger.
Author of Total Commander
https://www.ghisler.com
User avatar
Usher
Power Member
Power Member
Posts: 1675
Joined: 2011-03-11, 10:11 UTC

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *Usher »

alexanderwdark wrote: 2019-10-17, 05:06 UTCYes, this compiler from MS no longer supports XP
Of course the latest MSVC compiler supports XP (32-bit at least). It's been already explained that you should install xp toolset and properly configure it.
Andrzej P. Wozniak
Polish subforum moderator
User avatar
alexanderwdark
Senior Member
Senior Member
Posts: 270
Joined: 2008-04-14, 07:20 UTC
Location: Russia
Contact:

Re: WCX_RedTess - OCR (text recognition in TC)

Post by *alexanderwdark »

Usher wrote: 2019-10-17, 16:28 UTC
alexanderwdark wrote: 2019-10-17, 05:06 UTCYes, this compiler from MS no longer supports XP
Of course the latest MSVC compiler supports XP (32-bit at least). It's been already explained that you should install xp toolset and properly configure it.
How can I compile/install libs in vcpkg using XP toolset?
Post Reply