Skip to main content
Announcements
A fresh, new look for the Data Integration & Quality forums and navigation! Read more about what's changed.
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

non-alphanumeric characters that aren't special RegExp characters

Can someone please verify for me that I have these characters grouped appropriately, and let me know about the "Unknown" characters (if they are ordinary/allowable or metacharacters/not ordinary/special), please?
I'm trying to figure out what characters we should NOT use (in strings that Talend will be using/calling/whatever) - just to avoid headaches.
Thanks!

Ordinary/Allowable Characters
*Characters that "match themselves" in Regular Expressions
**"Characters other than . $ ^ { } * + ? \ match themselves." ( http://regexlib.com/CheatSheet.aspx)
Character (Name)
@ (Ampersand)
# (Number sign)
% (Percent)
- (Hyphen, Minus sign)
_ (Underscore)
= (Equals)
~ (Tilde)
` (Grave accent)
: (Colon)
; (Semicolon)
< (Less than, Left Angle Bracket)
> (Greater than, Right Angle Bracket)
, (Comma)

Metacharacters/Not Ordinary/Special Characters
*These will NOT match themselves
**Unless prefaced by a \
**Except for the backslash (\) character, which must be prefaced by \\\ to come out as a single backslash. ( http://www.amk.ca/python/howto/regex/regex.html#SECTION000420000000000000000)
Character (Name)
$ (Dollar Sign)
( (Left Parentheses, Left Round Bracket)
) (Right Parentheses, Right Round Bracket)
* (Asterisk)
? (Question Mark)
\ (Backslash)
^ (Caret)
(Right Square Bracket/Brace)
{ (Left Curly Bracket/Brace)
} (Right Curly Bracket/Brace)
| (Pipe)
+ (Plus)

Unknown
*I don't know which of the two categories, above, the following characters fall into
Character (Name)
. (Period, Full stop)
? (Single quote, Apostrophe)
" (Double quote)

Sources of the above information:
http://www.regular-expressions.info/reference.html
http://regexlib.com/CheatSheet.aspx
http://www.amk.ca/python/howto/regex/regex.html
http://www.fileformat.info/tool/regex.htm
Labels (2)
3 Replies
Anonymous
Not applicable
Author

Can someone please verify for me that I have these characters grouped appropriately, and let me know about the "Unknown" characters (if they are ordinary/allowable or metacharacters/not ordinary/special), please?
I'm trying to figure out what characters we should NOT use (in strings that Talend will be using/calling/whatever) - just to avoid headaches.
Thanks!

pleaseeeee?
Anonymous
Not applicable
Author

Hi,
your groups are correct.
For the last three signs: "." (Period, Full stop) => special (matches any character but also will match itself)
The other two do not have any special function in regex.
Bye
Volker
Anonymous
Not applicable
Author

ok, cool - just wanted to make sure cause I saw conflicting information about those three characters (between "regular" regular expressions and then java regular expressions)