TC6: RegEx / Unicode bug?

gurba
Junior Member
Posts: 14
Joined: 2003-02-13, 09:36 UTC

Post by *gurba »

Hi

I have a bunch of Unicode files in a directory tree.

When I search for alternatives (e.g. "(IsMsShipped|IsUserTable)") and check both the Unicode and RegEx boxes, I get 0 files, although I know there are plenty.
A plain search for one of the alternatives returns several files.

So my guess is that the combination of RegEx and Unicode doesn't work.

Another thing:
If I search for "table" with the Unicode box checked, I get more result files than if I search for "table" with both the Unicode and RegEx boxes checked.
Shouldn't the results be the same as long as the search string doesn't include any regex "special characters"?

regards,
gurba
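One plausible reason for the 0 results, sketched in Python rather than in Total Commander's own code: if "Unicode" files are stored as UTF-16, every ASCII character is followed by a zero byte, so a byte-oriented (ANSI) regex can never match a multi-character word. The function names and the sample string below are hypothetical, and whether TC 6 fails for exactly this reason is an assumption.

```python
import re

def search_raw_bytes(pattern: str, data: bytes):
    """Apply the pattern byte-for-byte, as an ANSI-only engine would."""
    return re.search(pattern.encode("latin-1"), data)

def search_decoded(pattern: str, data: bytes, encoding: str = "utf-16-le"):
    """Decode the file content to text first, then apply the same pattern."""
    return re.search(pattern, data.decode(encoding))

# Hypothetical file content, encoded as UTF-16LE (assumed "Unicode" format).
sample = "WHERE OBJECTPROPERTY(id, 'IsUserTable') = 1".encode("utf-16-le")
pattern = "(IsMsShipped|IsUserTable)"

raw_hit = search_raw_bytes(pattern, sample)    # no match: bytes are I\0s\0U\0...
decoded_hit = search_decoded(pattern, sample)  # matches 'IsUserTable'
```

The same pattern finds the text only once the bytes are interpreted in the file's encoding, which would be consistent with the plain Unicode search succeeding while the RegEx search does not.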
ghisler(Author)
Site Admin
Posts: 50475
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland

Post by *ghisler(Author) »

gurba wrote:So my guess is that the combination of RegEx and Unicode doesn't work.
Your guess is right! I should disable the other checkbox somehow.

gurba wrote:If I search for "table" with the Unicode box checked, I get more result files than if I search for "table" with both the Unicode and RegEx boxes checked.
Shouldn't the results be the same as long as the search string doesn't include any regex "special characters"?
Hmm, can you give an example of a string which isn't found this way?
Author of Total Commander
https://www.ghisler.com
Thany
Senior Member
Posts: 293
Joined: 2003-09-30, 09:20 UTC
Location: Netherlands

Post by *Thany »

ghisler(Author) wrote:Your guess is right! I should disable the other checkbox somehow.
You should fix the problem at its root, not work around it :)

I heard you are using some kind of free regex library for Delphi. I use one too in my own apps, and AFAIK it works perfectly well with Unicode strings. Or maybe you could adapt the library (if its license allows).
pdavit
Power Member
Posts: 1529
Joined: 2003-02-05, 21:41 UTC
Location: Kavala -> Greece -> Europe -> Earth -> Solar System -> Milky Way -> Space

Post by *pdavit »

Thany wrote:I use one too in my own apps, and AFAIK, it works perfectly well with unicode strings.
So... are you going to help Christian or what! ;)

I don't see any link with the source code! :P
"My only reason for still using M$ Window$ as an OS is the existence of Total Commander!"
Christian Ghisler Rules!!!
Thany
Senior Member
Posts: 293
Joined: 2003-09-30, 09:20 UTC
Location: Netherlands

Post by *Thany »

Alright, here ya go...

This is the one I use ;)
ghisler(Author)
Site Admin
Posts: 50475
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland

Post by *ghisler(Author) »

Currently I have no plans to add RegEx support for Unicode; this feature would again eat a lot of memory. Btw, I use the same RegEx library, but it doesn't seem to support ANSI and Unicode at the same time: it has to be recompiled for Unicode, or included twice, which means twice the memory.
Author of Total Commander
https://www.ghisler.com
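For comparison, here is a minimal Python sketch of a different trade-off: a single text-based regex engine, with each file's bytes decoded according to the chosen mode before matching. This is not how Total Commander or the Delphi library works. The function name `file_matches`, the choice of cp1252 as the ANSI code page, and the BOM handling are all assumptions made for illustration, and decoding has its own memory and time cost.

```python
import codecs
import re

def file_matches(pattern: str, data: bytes, is_unicode: bool) -> bool:
    """Return True if the pattern matches the file content.

    is_unicode mirrors a "Unicode" checkbox: the bytes are treated as UTF-16.
    Otherwise they are treated as ANSI (cp1252 is assumed here).
    """
    if is_unicode:
        # The "utf-16" codec consumes a byte-order mark if one is present;
        # without a BOM, little-endian is assumed.
        has_bom = data[:2] in (codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)
        encoding = "utf-16" if has_bom else "utf-16-le"
    else:
        encoding = "cp1252"
    text = data.decode(encoding, errors="replace")
    return re.search(pattern, text) is not None
```

With this approach the pattern itself is never compiled twice; only the input is converted, so one engine serves both modes.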