General category is defined by Unicode.org and defines following
      groups for Unicode.org characters
    
| L | 
          letter | 
        
| Lu | 
          uppercase letter | 
        
| Ll | 
          lowercase letter | 
        
| Lt | 
          titlecase letter | 
        
| Lm | 
          modifier letter | 
        
| Ln | 
          other letter | 
        
| M | 
          mark | 
        
| Mn | 
          nonspacing mark | 
        
| Mc | 
          spacing combining mark | 
| Me | 
          enclosing mark | 
| N | 
          number | 
        
| Nd | 
          decimal digit number | 
        
| Nl | 
          letter number | 
        
| No | 
          other number | 
        
| Z | 
          separator | 
        
| Zs | 
          space separator | 
        
| Zl | 
          line separator | 
        
| Zp | 
          paragraph separator | 
        
| C | 
          other | 
        
| Cc | 
          control other | 
        
| Cf | 
          format other | 
        
| Cs | 
          surrogate other | 
        
| Co | 
          private use other | 
        
| Cn | 
          not assigned other | 
        
| P | 
          puntuation | 
        
| Pc | 
          connector punctuation | 
        
| Pd | 
          dash punctuation | 
        
| Ps | 
          open punctuation | 
| Pe | 
          close punctuation | 
| Pi | 
          initial quote punctuation | 
| Pf | 
          final quote punctuation | 
| Po | 
          other punctuation | 
| S | 
          symbol | 
        
| Sm | 
          math symbol | 
| Sc | 
          currency symbol | 
| Sk | 
          modifier symbol | 
| So | 
          other symbol |