General category is defined by Unicode.org and defines following
groups for Unicode.org characters
| L |
letter |
| Lu |
uppercase letter |
| Ll |
lowercase letter |
| Lt |
titlecase letter |
| Lm |
modifier letter |
| Ln |
other letter |
| M |
mark |
| Mn |
nonspacing mark |
| Mc |
spacing combining mark |
| Me |
enclosing mark |
| N |
number |
| Nd |
decimal digit number |
| Nl |
letter number |
| No |
other number |
| Z |
separator |
| Zs |
space separator |
| Zl |
line separator |
| Zp |
paragraph separator |
| C |
other |
| Cc |
control other |
| Cf |
format other |
| Cs |
surrogate other |
| Co |
private use other |
| Cn |
not assigned other |
| P |
puntuation |
| Pc |
connector punctuation |
| Pd |
dash punctuation |
| Ps |
open punctuation |
| Pe |
close punctuation |
| Pi |
initial quote punctuation |
| Pf |
final quote punctuation |
| Po |
other punctuation |
| S |
symbol |
| Sm |
math symbol |
| Sc |
currency symbol |
| Sk |
modifier symbol |
| So |
other symbol |