org.apache.lucene.analysis.Tokenizer
   org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
EdgeNGramTokenizer | public class EdgeNGramTokenizer extends Tokenizer | Tokenizes the input from one edge into n-grams of the given size(s).
Authors: Otis Gospodnetic, Adam Hiatt
Inner Class: public static class Side
Method Summary
public final Token | next() | Returns the next token in the stream, or null at end of stream (EOS).
DEFAULT_MAX_GRAM_SIZE | public static final int DEFAULT_MAX_GRAM_SIZE
DEFAULT_MIN_GRAM_SIZE | public static final int DEFAULT_MIN_GRAM_SIZE
DEFAULT_SIDE | public static final Side DEFAULT_SIDE
EdgeNGramTokenizer | public EdgeNGramTokenizer(Reader input, Side side, int minGram, int maxGram) | Creates an EdgeNGramTokenizer that generates n-grams with sizes in the range minGram to maxGram.
Parameters:
   input - Reader holding the input to be tokenized
   side - the Side from which to chop off an n-gram
   minGram - the smallest n-gram to generate
   maxGram - the largest n-gram to generate
EdgeNGramTokenizer | public EdgeNGramTokenizer(Reader input, String sideLabel, int minGram, int maxGram) | Creates an EdgeNGramTokenizer that generates n-grams with sizes in the range minGram to maxGram.
Parameters:
   input - Reader holding the input to be tokenized
   sideLabel - the name of the Side from which to chop off an n-gram
   minGram - the smallest n-gram to generate
   maxGram - the largest n-gram to generate
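To make the roles of side, minGram and maxGram concrete, here is a minimal standalone sketch of edge n-gram generation in plain Java. It does not use or reproduce the Lucene class: the names EdgeNGramSketch, Edge and edgeNGrams are hypothetical, and the behaviour shown (one gram per size from minGram up to maxGram, cut from the chosen edge and limited by the input length) is an assumption based on the description above rather than the library's actual implementation.

```java
import java.util.ArrayList;
import java.util.List;

// Standalone illustration only; this is NOT the Lucene implementation.
class EdgeNGramSketch {

    // Hypothetical stand-in for the tokenizer's Side argument.
    enum Edge { FRONT, BACK }

    // Returns the n-grams of sizes minGram..maxGram cut from one edge of the input.
    // Sizes longer than the input are skipped (an assumption of this sketch).
    static List<String> edgeNGrams(String input, Edge edge, int minGram, int maxGram) {
        if (minGram < 1) {
            throw new IllegalArgumentException("minGram must be at least 1");
        }
        if (minGram > maxGram) {
            throw new IllegalArgumentException("minGram must not exceed maxGram");
        }
        List<String> grams = new ArrayList<>();
        int longest = Math.min(maxGram, input.length());
        for (int size = minGram; size <= longest; size++) {
            if (edge == Edge.FRONT) {
                grams.add(input.substring(0, size));                // prefix of length size
            } else {
                grams.add(input.substring(input.length() - size));  // suffix of length size
            }
        }
        return grams;
    }

    public static void main(String[] args) {
        System.out.println(edgeNGrams("lucene", Edge.FRONT, 1, 3)); // [l, lu, luc]
        System.out.println(edgeNGrams("lucene", Edge.BACK, 2, 4));  // [ne, ene, cene]
    }
}
```

Grams cut from the front edge are prefixes, which is why this kind of tokenization is often used for prefix matching such as search-as-you-type; grams cut from the back edge are suffixes.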