...one of the most highly
regarded and expertly designed C++ library projects in the
world.
— Herb Sutter and Andrei
Alexandrescu, C++
Coding Standards
The following character classes are only supported by Unicode Regular Expressions:
that is those that use the u32regex
type. The names used are the same as those from Chapter 4 of the Unicode
standard.
Short Name |
Long Name |
---|---|
ASCII |
|
Any |
|
Assigned |
|
C* |
Other |
Cc |
Control |
Cf |
Format |
Cn |
Not Assigned |
Co |
Private Use |
Cs |
Surrogate |
L* |
Letter |
Ll |
Lowercase Letter |
Lm |
Modifier Letter |
Lo |
Other Letter |
Lt |
Titlecase |
Lu |
Uppercase Letter |
M* |
Mark |
Mc |
Spacing Combining Mark |
Me |
Enclosing Mark |
Mn |
Non-Spacing Mark |
N* |
Number |
Nd |
Decimal Digit Number |
Nl |
Letter Number |
No |
Other Number |
P* |
Punctuation |
Pc |
Connector Punctuation |
Pd |
Dash Punctuation |
Pe |
Close Punctuation |
Pf |
Final Punctuation |
Pi |
Initial Punctuation |
Po |
Other Punctuation |
Ps |
Open Punctuation |
S* |
Symbol |
Sc |
Currency Symbol |
Sk |
Modifier Symbol |
Sm |
Math Symbol |
So |
Other Symbol |
Z* |
Separator |
Zl |
Line Separator |
Zp |
Paragraph Separator |
Zs |
Space Separator |