Bug in std.regexp with capturing subpatterns?
Markus Dangl
danglm at in.tum.de
Sun Feb 25 14:51:37 PST 2007
Hi,
i don't know if this is really a bug, so i'm posting here in the learn
group.
Consider the following regular expression: "^((\\w)#)?(\\w+)$"
If there is no sharp in the string to search, the whole first subpattern
won't match, so it captures nothing (an empty string). So the subpattern
(\\w) shouldn't capture anything, either, right?
I tested it with the following code:
------------------------------------
import std.stdio;
import std.regexp;
int main(char[][] args)
{
auto m = std.regexp.search("test", "^((\\w)#)?(\\w+)$");
if (m is null) { writefln("Doesn't match"); return 0; }
for (int i=0;i<6;i++)
writefln("%d: %s", i, m.match(i));
return 0;
}
------------------------------------
Which displays:
0: test
1:
2: t
3: test
4:
5:
As you can see, subpattern #2 captures the first letter of the string.
I don't really know the regexp implementation, but i assume it doesn't
throw away previously captured subpatterns after it encounters the
mismatch with '#'.
More information about the Digitalmars-d-learn
mailing list