| org.apache.lucene.analysis.fr.FrenchAnalyzer
FrenchAnalyzer | final public class FrenchAnalyzer extends Analyzer (Code) | | Analyzer for the French language. Supports an external list of stop words (words that
will not be indexed at all) and an external list of exclusions (words that will
not be stemmed, but are still indexed).
A default set of stop words is used unless another list is specified; the
exclusion list is empty by default.
author: Patrick Talbot (based on Gerhard Schwarz's work for German) version: $Id: FrenchAnalyzer.java,v 1.1 2005/06/02 01:36:00 jfendler Exp $ |
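The difference between a stop word and a stem exclusion can be sketched in plain Java. This is a toy illustration, not the Lucene implementation: the class `StopAndExclusionSketch`, the method `analyze`, and the one-rule `toyStem` (which only strips a trailing "s") are hypothetical names introduced here, and real French stemming is far more involved.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Toy illustration only; all names here are hypothetical, not Lucene API. */
final class StopAndExclusionSketch {

    /** Hypothetical stand-in for a real French stemmer: strips one trailing "s". */
    static String toyStem(String word) {
        return (word.length() > 3 && word.endsWith("s"))
                ? word.substring(0, word.length() - 1)
                : word;
    }

    /**
     * Returns the terms that would be indexed:
     * stop words are dropped, excluded words are kept unstemmed,
     * and every other word is passed through the toy stemmer.
     */
    static List<String> analyze(List<String> tokens,
                                Set<String> stopWords,
                                Set<String> exclusions) {
        List<String> indexed = new ArrayList<>();
        for (String token : tokens) {
            String term = token.toLowerCase(Locale.ROOT);
            if (stopWords.contains(term)) {
                continue; // stop word: never indexed
            }
            // exclusion: indexed as-is; otherwise indexed in stemmed form
            indexed.add(exclusions.contains(term) ? term : toyStem(term));
        }
        return indexed;
    }
}
```

With the tokens "Les", "maisons", "de", "Paris", the stop words "les" and "de", and the exclusion "paris", the sketch keeps "maison" (stemmed) and "paris" (protected from stemming). Without the exclusion, the toy stemmer would reduce "paris" to "pari".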
FRENCH_STOP_WORDS | final public static String[] FRENCH_STOP_WORDS(Code) | | Extended list of typical French stop words.
|
FrenchAnalyzer | public FrenchAnalyzer(String[] stopwords)(Code) | | Builds an analyzer with the given stop words.
|
FrenchAnalyzer | public FrenchAnalyzer(Hashtable stopwords)(Code) | | Builds an analyzer with the given stop words.
|
setStemExclusionTable | public void setStemExclusionTable(String[] exclusionlist)(Code) | | Builds an exclusion list from an array of Strings.
|
setStemExclusionTable | public void setStemExclusionTable(Hashtable exclusionlist)(Code) | | Builds an exclusion list from a Hashtable.
|
setStemExclusionTable | public void setStemExclusionTable(File exclusionlist) throws IOException(Code) | | Builds an exclusion list from the words contained in the given file.
throws: IOException - |
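The page does not describe the layout of the exclusion file. As a minimal sketch, assuming a UTF-8 text file with one word per line (an assumption, not something stated above), a word list could be read as follows. `WordListSketch` and `load` are hypothetical names, not part of the Lucene API.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical helper; assumes one word per line, UTF-8 encoded. */
final class WordListSketch {

    /** Reads the file line by line, trimming whitespace and skipping blank lines. */
    static Set<String> load(Path file) throws IOException {
        Set<String> words = new HashSet<>();
        try (BufferedReader reader = Files.newBufferedReader(file, StandardCharsets.UTF_8)) {
            String line;
            while ((line = reader.readLine()) != null) {
                String word = line.trim();
                if (!word.isEmpty()) {
                    words.add(word);
                }
            }
        }
        return words;
    }
}
```

As with the method documented above, an unreadable or missing file surfaces as an `IOException` that the caller has to handle.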
tokenStream | final public TokenStream tokenStream(String fieldName, Reader reader)(Code) | | Creates a TokenStream which tokenizes all the text in the provided Reader.
A TokenStream built from a StandardTokenizer filtered with StandardFilter, StopFilter, FrenchStemFilter and LowerCaseFilter |
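The tokenizer-plus-filters arrangement is a chain of wrappers, each pulling tokens from the one before it. The plain-Java sketch below shows only that chaining mechanism. The `Stream` interface, `whitespaceTokenizer`, `lowerCase` and `stop` are hypothetical stand-ins, not the Lucene classes named above, and the stage order is chosen for the sketch (lowercasing first so that a lowercase stop set matches) rather than copied from the analyzer.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.Iterator;
import java.util.Locale;
import java.util.Set;

/** Toy pull-style filter chain; all names are hypothetical, not Lucene API. */
final class TokenChainSketch {

    /** Minimal token source: next() returns null once the input is exhausted. */
    interface Stream {
        String next();
    }

    /** Stand-in for a real tokenizer: splits the text on whitespace. */
    static Stream whitespaceTokenizer(String text) {
        String trimmed = text.trim();
        Iterator<String> it = trimmed.isEmpty()
                ? Collections.<String>emptyIterator()
                : Arrays.asList(trimmed.split("\\s+")).iterator();
        return () -> it.hasNext() ? it.next() : null;
    }

    /** Filter stage: lowercases every token it pulls from the wrapped stream. */
    static Stream lowerCase(Stream in) {
        return () -> {
            String token = in.next();
            return token == null ? null : token.toLowerCase(Locale.ROOT);
        };
    }

    /** Filter stage: keeps pulling until it finds a token that is not a stop word. */
    static Stream stop(Stream in, Set<String> stopWords) {
        return () -> {
            String token;
            while ((token = in.next()) != null) {
                if (!stopWords.contains(token)) {
                    return token;
                }
            }
            return null;
        };
    }
}
```

Composing the stages as `stop(lowerCase(whitespaceTokenizer(text)), stopWords)` mirrors the way each filter in the description wraps the stream produced by the previous stage; adding a stemming stage would be one more wrapper of the same shape.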
|
|