WCX_TESS - a C++ image-to-text and PDF-to-text converter in the form of a TC packer plugin.
It is based on code from the Tesseract, Poppler, Leptonica and/or OpenCV libraries.
Text recognition uses "trained models" from Tesseract.
Russian and English models (*.traineddata files) are included in the basic archive.
If you need any other models, download them and add their language codes to the "redtess.json" config.
Use the "langs" key for this. Mixed records such as "eng, rus" are allowed.
You will see all these values in the TC panel as files with the txt extension inside the virtual archive.
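As an illustration of what a mixed record means for Tesseract, here is a minimal sketch. How the plugin itself parses "langs" is not stated here, so the helper names split_record, to_tesseract_lang and model_file_name are hypothetical. The only facts relied on are that Tesseract accepts combined languages joined with "+" (for example "eng+rus") and that model files are named <code>.traineddata.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper: splits a record such as "eng, rus" into language
// codes, trimming whitespace and skipping empty entries.
std::vector<std::string> split_record(const std::string& record) {
    std::vector<std::string> codes;
    const char* ws = " \t\r\n";
    std::string::size_type start = 0;
    while (start <= record.size()) {
        std::string::size_type comma = record.find(',', start);
        if (comma == std::string::npos) comma = record.size();
        const std::string item = record.substr(start, comma - start);
        const std::string::size_type first = item.find_first_not_of(ws);
        if (first != std::string::npos) {
            const std::string::size_type last = item.find_last_not_of(ws);
            codes.push_back(item.substr(first, last - first + 1));
        }
        start = comma + 1;
    }
    return codes;
}

// Hypothetical helper: builds the "eng+rus" form that Tesseract accepts
// as a combined language string.
std::string to_tesseract_lang(const std::string& record) {
    std::string joined;
    for (const std::string& code : split_record(record)) {
        if (!joined.empty()) joined += '+';
        joined += code;
    }
    return joined;
}

// Each language code needs a matching model file in the tessdata folder.
std::string model_file_name(const std::string& code) {
    return code + ".traineddata";
}
```

With this sketch, the record "eng, rus" maps to "eng+rus" and requires eng.traineddata and rus.traineddata to be present.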
The "Fast" version of the trained models is used by default.
It works fast, though it can have some recognition problems (but not so bad!).
You can get the "Best" version of the models from this link:
https://github.com/tesseract-ocr/tessdata_best
Then replace the contents of the tessdata folder.
Or use the normal models:
https://github.com/tesseract-ocr/tessdata
You can also enable support for many other image formats (see the "formats" key in the config).
You can use any picture format supported by Leptonica or OpenCV with this plugin.
Multi-page support is currently enabled for the TIFF format.
PDFs are rasterized in memory before recognition, so try tuning the DPI in the configuration file.
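To see why the DPI value matters for in-memory rasterization, here is a rough sketch of the arithmetic. PDF page sizes are measured in points (1/72 inch), so the pixel size grows linearly with DPI and the memory grows with its square. The function names and the 4 bytes per pixel figure are assumptions for illustration; the plugin's actual pixel format is not stated here.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Converts a length in PDF points (1/72 inch) to pixels at the given DPI.
std::int64_t points_to_pixels(double points, double dpi) {
    return static_cast<std::int64_t>(std::llround(points * dpi / 72.0));
}

// Estimates the size of one rasterized page in bytes.
// bytes_per_pixel is an assumption (4 would correspond to 32-bit color).
std::int64_t raster_bytes(double width_pt, double height_pt,
                          double dpi, int bytes_per_pixel) {
    return points_to_pixels(width_pt, dpi) *
           points_to_pixels(height_pt, dpi) *
           bytes_per_pixel;
}
```

For example, an A4 page (595 x 842 points) at 300 DPI becomes 2479 x 3508 pixels, which is about 35 MB at 4 bytes per pixel and roughly four times the memory needed at 150 DPI.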
Leptonica is the default library for the plugin, but you can switch to OpenCV.
mikedepetris wrote: 2019-10-15, 20:34 UTC
Installing the 64-bit version, I get "This is not a valid plugin! You can find valid plugin files on www.ghisler.com under section 'Addons'."
I just tested on Windows 7. What is your system? Are current updates installed there?
Installed runtime from https://support.microsoft.com/help/2977003 and it works.
(You can get VC_redist.x64.exe or VC_redist.x86.exe from the plugin's archive; install it and restart)
Installed successfully on one Windows 10 PC but had the problem on another, apparently similar PC
tried (re) installing VC++ libs
tried running TC as admin
You can also try the ucrt libraries from this archive https://yadi.sk/d/jKWElEwD9tb_Fg (from the Windows 10 SDK redist package)
Put the x86 or x64 content into the plugin archive before installing.
Usher wrote: 2019-10-16, 17:42 UTC
2alexanderwdark
It doesn't work. In Windows XP it displays error message "File RedTess.wcx not found" when trying to register supported file extensions in TC. What is more, all dlls are compiled for Windows Vista, so they may NOT work in Windows XP at all.
mikedepetris wrote: 2019-10-16, 07:31 UTC
tried (re) installing VC++ libs
Do you have api-ms-win*.dll files in the %systemroot%\System32 folder?
Microsoft says they should be installed automatically along with updates (on Windows 10).
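For anyone who wants to script this check, here is a small sketch of the name test only. The function name matches_api_ms_win is hypothetical. The file names to test would come from a directory listing of System32, for example via std::filesystem::directory_iterator in C++17.

```cpp
#include <cassert>
#include <cctype>
#include <string>

// Lower-cases an ASCII file name so the comparison is case-insensitive,
// as file names are on Windows.
std::string to_lower(std::string s) {
    for (char& c : s) {
        c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
    }
    return s;
}

// Returns true if the file name matches the pattern api-ms-win*.dll.
bool matches_api_ms_win(const std::string& file_name) {
    const std::string prefix = "api-ms-win";
    const std::string suffix = ".dll";
    const std::string name = to_lower(file_name);
    if (name.size() < prefix.size() + suffix.size()) return false;
    return name.compare(0, prefix.size(), prefix) == 0 &&
           name.compare(name.size() - suffix.size(), suffix.size(), suffix) == 0;
}
```

If no file in the folder passes this test, the runtime DLLs in question are missing from that location.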
nothing in %systemroot%\System32, which maps to c:\WINDOWS\System32\
found a lot of them under c:\WINDOWS\System32\downlevel
what could I try?
I have only one system with such a problem. It was Windows 8.1, then updated to 10. A clean install works OK after VC_redist.x64.exe
I have not found a solution yet. Maybe, Microsoft experts know something.
Please compile the plugin with the option:
"Multithreaded (/MT)"
and not
"Multithreaded-DLL (/MD)".
The name is misleading: the /MD option does NOT mean that you want to create a DLL. It means that the plugin uses the Microsoft redistributable package. When you use the /MT option, all necessary functions are compiled into the plugin. Normally this doesn't make it much larger.
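One way to verify which option a build actually used is to check MSVC's predefined macros. This is a minimal sketch with a hypothetical function name. It relies on the documented behavior that MSVC defines _DLL when compiling with /MD or /MDd and does not define it with /MT or /MTd.

```cpp
#include <cassert>
#include <string>

// Reports how the C runtime is linked into this binary.
// "dynamic" : built with /MD or /MDd, needs the VC++ redistributable.
// "static"  : built with /MT or /MTd, runtime is compiled into the binary.
// "not-msvc": another compiler, where /MT and /MD do not apply.
std::string crt_linkage() {
#if defined(_MSC_VER) && defined(_DLL)
    return "dynamic";
#elif defined(_MSC_VER)
    return "static";
#else
    return "not-msvc";
#endif
}
```

A plugin built with /MT would report "static" and should load without VC_redist installed, as far as the C runtime itself is concerned. Third-party DLLs shipped with the plugin may still have their own runtime dependencies.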
alexanderwdark wrote: 2019-10-17, 05:06 UTC
Yes, this compiler from MS no longer supports XP
Of course the latest MSVC compiler supports XP (32-bit at least). It has already been explained that you should install the XP toolset and configure it properly.
How can I compile/install libs in vcpkg using the XP toolset?