| java.lang.Object com.ibm.icu.impl.PatternTokenizer
PatternTokenizer | public class PatternTokenizer (Code) | | A simple parsing class for patterns and rules. Handles '...' quotations, \\uxxxx and \\Uxxxxxxxx, and symple syntax.
The '' (two quotes) is treated as a single quote, inside or outside a quote
- Any ignorable characters are ignored in parsing.
- Any syntax characters are broken into separate tokens
- Quote characters can be specified: '...', "...", and \x
- Other characters are treated as literals
|
BACK_SLASH | final public static char BACK_SLASH(Code) | | |
DONESYNTAXLITERALBROKEN_QUOTEBROKEN_ESCAPEUNKNOWN | final public static int DONESYNTAXLITERALBROKEN_QUOTEBROKEN_ESCAPEUNKNOWN(Code) | | |
SINGLE_QUOTE | final public static char SINGLE_QUOTE(Code) | | |
getLimit | public int getLimit()(Code) | | |
getStart | public int getStart()(Code) | | |
isUsingQuote | public boolean isUsingQuote()(Code) | | |
isUsingSlash | public boolean isUsingSlash()(Code) | | |
quoteLiteral | public String quoteLiteral(CharSequence string)(Code) | | Quote a literal string, using the available settings. Thus syntax characters, quote characters, and ignorable characters will be put into quotes.
Parameters: string - |
setEscapeCharacters | public PatternTokenizer setEscapeCharacters(UnicodeSet escapeCharacters)(Code) | | Set characters to be escaped in literals, in quoteLiteral and normalize, eg new UnicodeSet("[^\\u0020-\\u007E]");
Parameters: escapeCharacters - |
setIgnorableCharacters | public PatternTokenizer setIgnorableCharacters(UnicodeSet ignorableCharacters)(Code) | | Sets the characters to be ignored in parsing, eg new UnicodeSet("[:pattern_whitespace:]");
Parameters: ignorableCharacters - |
setSyntaxCharacters | public PatternTokenizer setSyntaxCharacters(UnicodeSet syntaxCharacters)(Code) | | Sets the characters to be interpreted as syntax characters in parsing, eg new UnicodeSet("[:pattern_syntax:]")
Parameters: syntaxCharacters - |
|
|