weka.core.tokenizers |
|
Java Source File Name | Type | Comment |
AlphabeticTokenizer.java | Class |
Alphabetic string tokenizer, tokens are to be formed only from contiguous alphabetic sequences.
author: Asrhaf M. |
CharacterDelimitedTokenizer.java | Class | Abstract superclass for tokenizers that take characters as delimiters. |
NGramTokenizer.java | Class |
Splits a string into an n-gram with min and max grams. |
Tokenizer.java | Class | A superclass for all tokenizer algorithms. |
WordTokenizer.java | Class |
A simple tokenizer that is using the java.util.StringTokenizer class to tokenize the strings. |