Limit "Find text" in Office and other docs to displayed text

Posted: 2024-04-05, 06:58 UTC
by nemadeka
Good day,
I often search within Word documents for short strings, for example special delimiters such as {0>, and TC finds them everywhere, producing many false hits. It would be nice to have a check box that limits the search to the data representing the text actually displayed in the Office apps.
Thank you.

Re: Limit "Find text" in Office and other docs to displayed text

Posted: 2024-04-07, 14:01 UTC
by ghisler(Author)
Unfortunately Office xml documents are very complex, therefore I prefer to search all the included xml files to not miss any text.

Re: Limit "Find text" in Office and other docs to displayed text

Posted: 2024-04-07, 15:36 UTC
by Horst.Epp
I use 2 methods for this purpose.
1. Everything content indexing
2. The Oracle Content Access library with the TC PCREsearch content plug-in.
Both ways the real content is searched, but the Everything search is of course faster.

Re: Limit "Find text" in Office and other docs to displayed text

Posted: 2024-04-07, 19:00 UTC
by nemadeka
ghisler(Author) wrote: 2024-04-07, 14:01 UTC Unfortunately Office xml documents are very complex, therefore I prefer to search all the included xml files to not miss any text.
This is not a winning attitude.
I have just searched for a term beginning with "wrap" in 13 MS Word documents. TC listed all of them, although the term actually appears in only 2.
If you ignore constructive feedback from a dedicated user, expressed clearly and concisely, you should retire.

UPD:
Translation software packages are all able to retrieve text from Office documents.
An Office document is a ZIP archive of a folder structure, where all the text is in XML files placed in one folder.
I clearly have the impression that you have lost interest in TC.
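
For reference, the ZIP-plus-XML structure described above can be sketched in Python. This is only a minimal sketch, not how TC or any translation package does it. It assumes a .docx file whose main body is stored in word/document.xml, and it deliberately ignores headers, footers, footnotes and comments, which live in separate parts of the archive. The function names docx_paragraphs and docx_contains are made up for illustration.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the elements in word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
PARAGRAPH = f"{{{W_NS}}}p"
TEXT = f"{{{W_NS}}}t"


def docx_paragraphs(source):
    """Return the body paragraphs of a .docx as plain strings.

    `source` is a file path or a binary file-like object.
    Assumption: the displayed body text is in word/document.xml.
    """
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("word/document.xml")
    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(PARAGRAPH):
        # Word may split one phrase across several runs, so the text
        # pieces of a paragraph are joined before searching.
        paragraphs.append("".join(t.text or "" for t in para.iter(TEXT)))
    return paragraphs


def docx_contains(source, needle):
    """Case-insensitive search limited to the body text."""
    needle = needle.lower()
    return any(needle in p.lower() for p in docx_paragraphs(source))
```

Calling docx_contains("report.docx", "wrap") (with a hypothetical file name) would then report a hit only when "wrap" occurs in the body text, not when it merely appears as an attribute or element name in styles or settings.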

Re: Limit "Find text" in Office and other docs to displayed text

Posted: 2024-04-07, 19:01 UTC
by nemadeka
Horst.Epp wrote: 2024-04-07, 15:36 UTC I use 2 methods for this purpose.
1. Everything content indexing
2. The Oracle Content Access library with the TC PCREsearch content plug-in.
Both ways the real content is searched, but the Everything search is of course faster.
Thank you, but I did not understand anything. Please don't explain in greater detail, I need a simple duct tape solution, not anything clever.