Extension of Lucene Analyzer for use with Harmonise.
This is a simple extension of the Lucene Analyzer class to meet the
requirements for more sophisticated text indexing, in particular for use with the Becta
VocabManager application.
It provides an implementation that uses Lucene's built in implementation of the Porter
stemming algorithm based on lower case tokenization of the input text.
Note: this algorithm is designed specifically for English language text. |