
Anonymous
Not applicable
2008-09-04
01:00 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
non-alphanumeric characters that aren't special RegExp characters
Can someone please verify for me that I have these characters grouped appropriately, and let me know about the "Unknown" characters (if they are ordinary/allowable or metacharacters/not ordinary/special), please?
I'm trying to figure out what characters we should NOT use (in strings that Talend will be using/calling/whatever) - just to avoid headaches.
Thanks!
Ordinary/Allowable Characters
*Characters that "match themselves" in Regular Expressions
**"Characters other than . $ ^ { } * + ? \ match themselves." ( http://regexlib.com/CheatSheet.aspx)
Metacharacters/Not Ordinary/Special Characters
*These will NOT match themselves
**Unless prefaced by a \
**Except for the backslash (\) character, which must be prefaced by \\\ to come out as a single backslash. ( http://www.amk.ca/python/howto/regex/regex.html#SECTION000420000000000000000)
Unknown
*I don't know which of the two categories, above, the following characters fall into
Sources of the above information:
http://www.regular-expressions.info/reference.html
http://regexlib.com/CheatSheet.aspx
http://www.amk.ca/python/howto/regex/regex.html
http://www.fileformat.info/tool/regex.htm
I'm trying to figure out what characters we should NOT use (in strings that Talend will be using/calling/whatever) - just to avoid headaches.
Thanks!
Ordinary/Allowable Characters
*Characters that "match themselves" in Regular Expressions
**"Characters other than . $ ^ { } * + ? \ match themselves." ( http://regexlib.com/CheatSheet.aspx)
Character (Name)
@ (Ampersand)
# (Number sign)
% (Percent)
- (Hyphen, Minus sign)
_ (Underscore)
= (Equals)
~ (Tilde)
` (Grave accent)
: (Colon)
; (Semicolon)
< (Less than, Left Angle Bracket)
> (Greater than, Right Angle Bracket)
, (Comma)
Metacharacters/Not Ordinary/Special Characters
*These will NOT match themselves
**Unless prefaced by a \
**Except for the backslash (\) character, which must be prefaced by \\\ to come out as a single backslash. ( http://www.amk.ca/python/howto/regex/regex.html#SECTION000420000000000000000)
Character (Name)
$ (Dollar Sign)
( (Left Parentheses, Left Round Bracket)
) (Right Parentheses, Right Round Bracket)
* (Asterisk)
? (Question Mark)
\ (Backslash)
^ (Caret)
(Right Square Bracket/Brace)
{ (Left Curly Bracket/Brace)
} (Right Curly Bracket/Brace)
| (Pipe)
+ (Plus)
Unknown
*I don't know which of the two categories, above, the following characters fall into
Character (Name)
. (Period, Full stop)
? (Single quote, Apostrophe)
" (Double quote)
Sources of the above information:
http://www.regular-expressions.info/reference.html
http://regexlib.com/CheatSheet.aspx
http://www.amk.ca/python/howto/regex/regex.html
http://www.fileformat.info/tool/regex.htm
258 Views
3 Replies

Anonymous
Not applicable
2008-09-22
04:11 PM
Author
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Can someone please verify for me that I have these characters grouped appropriately, and let me know about the "Unknown" characters (if they are ordinary/allowable or metacharacters/not ordinary/special), please?
I'm trying to figure out what characters we should NOT use (in strings that Talend will be using/calling/whatever) - just to avoid headaches.
Thanks!
pleaseeeee?
258 Views

Anonymous
Not applicable
2008-09-22
04:33 PM
Author
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
your groups are correct.
For the last three signs: "." (Period, Full stop) => special (matches any character but also will match itself)
The other two do not have any special function in regex.
Bye
Volker
your groups are correct.
For the last three signs: "." (Period, Full stop) => special (matches any character but also will match itself)
The other two do not have any special function in regex.
Bye
Volker
258 Views

Anonymous
Not applicable
2008-09-22
04:42 PM
Author
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
ok, cool - just wanted to make sure cause I saw conflicting information about those three characters (between "regular" regular expressions and then java regular expressions)
258 Views
