| java.lang.Object > org.apache.lucene.search.Filter > org.apache.lucene.search.DuplicateFilter
DuplicateFilter | public class DuplicateFilter extends Filter (Code) | | |
KM_USE_FIRST_OCCURRENCE | final public static int KM_USE_FIRST_OCCURRENCE(Code) | | |
KM_USE_LAST_OCCURRENCE | final public static int KM_USE_LAST_OCCURRENCE(Code) | | |
PM_FAST_INVALIDATION | final public static int PM_FAST_INVALIDATION(Code) | | |
PM_FULL_VALIDATION | final public static int PM_FULL_VALIDATION(Code) | | |
keepMode | int keepMode(Code) | | keepMode determines which document ID is treated as the master; all other documents
with the same field value are identified as duplicates. Selecting the "first occurrence" can potentially save on I/O.
|
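The effect of the two keep modes can be illustrated with a minimal stand-alone sketch. It does not use any Lucene classes: the class `KeepModeSketch`, the method `masters`, the array-of-field-values input, and the constant values are all assumptions made for illustration, not the filter's actual implementation.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not part of Lucene.
final class KeepModeSketch {
    // Placeholder values, assumed for this sketch.
    static final int KM_USE_FIRST_OCCURRENCE = 1;
    static final int KM_USE_LAST_OCCURRENCE = 2;

    /**
     * fieldValues[docId] holds the field value of that document, or null if
     * the document does not contain the field. Returns the sorted ids of the
     * documents chosen as masters, one per distinct field value.
     */
    static List<Integer> masters(String[] fieldValues, int keepMode) {
        Map<String, Integer> masterByValue = new LinkedHashMap<>();
        for (int docId = 0; docId < fieldValues.length; docId++) {
            String value = fieldValues[docId];
            if (value == null) {
                continue; // document lacks the field
            }
            if (keepMode == KM_USE_LAST_OCCURRENCE) {
                masterByValue.put(value, docId);         // later ids overwrite earlier ones
            } else {
                masterByValue.putIfAbsent(value, docId); // the first id seen wins
            }
        }
        List<Integer> result = new ArrayList<>(masterByValue.values());
        Collections.sort(result);
        return result;
    }

    public static void main(String[] args) {
        String[] values = {"a", "b", "a", "c", "b"};
        System.out.println(masters(values, KM_USE_FIRST_OCCURRENCE)); // [0, 1, 3]
        System.out.println(masters(values, KM_USE_LAST_OCCURRENCE));  // [2, 3, 4]
    }
}
```

In the "first occurrence" branch the scan of a value's documents could stop after the first hit, which is one plausible reading of the I/O saving mentioned above.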
processingMode | int processingMode(Code) | | "Full" processing mode starts by setting all bits to false and sets bits only
for documents that contain the given field and are identified as non-duplicates.
"Fast" processing mode sets all bits to true, then unsets the bits of all duplicate docs found for the
given field. This approach avoids the need to read TermDocs for terms that are seen
to have a document frequency of exactly "1" (i.e. no duplicates). While potentially
faster, this approach has a downside: the bitsets produced will include set bits for
documents that do not actually contain the given field.
|
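The difference between the two processing modes can be sketched with `java.util.BitSet`. This is a simplified stand-alone model, not the filter's code: the class `ProcessingModeSketch`, its methods, and the in-memory `postings` map (standing in for the index's term-to-documents listing) are assumptions, and the first occurrence is always kept as the master.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; not part of Lucene.
final class ProcessingModeSketch {
    /** Groups document ids by field value; null means the field is absent. */
    static Map<String, List<Integer>> postings(String[] fieldValues) {
        Map<String, List<Integer>> byValue = new TreeMap<>();
        for (int docId = 0; docId < fieldValues.length; docId++) {
            if (fieldValues[docId] != null) {
                byValue.computeIfAbsent(fieldValues[docId], k -> new ArrayList<>()).add(docId);
            }
        }
        return byValue;
    }

    /** "Full": start with all bits false, set only the master of each value. */
    static BitSet fullValidation(String[] fieldValues) {
        BitSet bits = new BitSet(fieldValues.length);
        for (List<Integer> docs : postings(fieldValues).values()) {
            bits.set(docs.get(0));
        }
        return bits;
    }

    /** "Fast": start with all bits true, clear every duplicate. */
    static BitSet fastInvalidation(String[] fieldValues) {
        BitSet bits = new BitSet(fieldValues.length);
        bits.set(0, fieldValues.length);
        for (List<Integer> docs : postings(fieldValues).values()) {
            if (docs.size() == 1) {
                continue; // document frequency of 1: nothing to clear
            }
            for (int i = 1; i < docs.size(); i++) {
                bits.clear(docs.get(i));
            }
        }
        return bits;
    }

    public static void main(String[] args) {
        String[] values = {"a", null, "a", "b"}; // doc 1 lacks the field
        System.out.println(fullValidation(values));   // {0, 3}
        System.out.println(fastInvalidation(values)); // {0, 1, 3}
    }
}
```

Document 1 has no value for the field, yet its bit stays set in the "fast" result, which is the downside described above.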
DuplicateFilter | public DuplicateFilter(String fieldName)(Code) | | |
DuplicateFilter | public DuplicateFilter(String fieldName, int keepMode, int processingMode)(Code) | | |
getKeepMode | public int getKeepMode()(Code) | | |
getProcessingMode | public int getProcessingMode()(Code) | | |
hashCode | public int hashCode()(Code) | | |
setFieldName | public void setFieldName(String fieldName)(Code) | | |
setKeepMode | public void setKeepMode(int keepMode)(Code) | | |
setProcessingMode | public void setProcessingMode(int processingMode)(Code) | | |