Java Doc for BitStreamIndex.java (MG4J, package it.unimi.dsi.mg4j.index)

java.lang.Object
   it.unimi.dsi.mg4j.index.Index
      it.unimi.dsi.mg4j.index.BitStreamIndex

All known Subclasses: it.unimi.dsi.mg4j.index.MemoryMappedIndex, it.unimi.dsi.mg4j.index.remote.RemoteBitStreamIndex, it.unimi.dsi.mg4j.index.FileIndex, it.unimi.dsi.mg4j.index.InMemoryIndex, it.unimi.dsi.mg4j.index.BitStreamHPIndex
BitStreamIndex
abstract public class BitStreamIndex extends Index
A bitstream-based index. Instances of this class contain additional index data related to compression, such as the codes used for each part of the index.

Implementing subclasses must provide access to the index bitstream at both bit and byte level (see getInputBitStream(int) and getInputStream()). A bitstream-based index usually exposes term or prefix maps (termMap, prefixMap), but this is not compulsory. Additionally, the index could also expose further data, such as the term offsets; some of it is compulsory with certain codings.

Wired implementations

The standard readers associated with an instance of this class are of type BitStreamIndexReader. Nonetheless, it is possible to automatically generate sources for wired classes that work only for a particular set of codings and flags. The wired classes will be fetched automagically by reflection, if available. Please read the section about performance in the MG4J manual.
author:
   Sebastiano Vigna
since:
   1.1
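
The reflection-based lookup mentioned under "Wired implementations" can be pictured with a minimal, self-contained sketch. Every name in it (Reader, GenericReader, WiredReader, findConstructor, newReader) is hypothetical and not part of MG4J, and the actual naming scheme for wired classes is not described on this page. The sketch only shows the general pattern: try to load a specialised class by name, and fall back to a generic reader when it is not available.

```java
import java.lang.reflect.Constructor;

/** Hypothetical sketch of "use a specialised class if present, else a generic one". */
final class WiredReaderLookupSketch {
    /** Stand-in for an index reader type (not MG4J's IndexReader). */
    public interface Reader {
        String describe();
    }

    /** Generic reader that works for every configuration. */
    public static final class GenericReader implements Reader {
        public GenericReader() {}
        @Override public String describe() { return "generic"; }
    }

    /** Specialised reader for one particular configuration. */
    public static final class WiredReader implements Reader {
        public WiredReader() {}
        @Override public String describe() { return "wired"; }
    }

    /** Returns the constructor of the named class if it can be loaded, else the generic one. */
    static Constructor<? extends Reader> findConstructor(String wiredClassName) {
        try {
            return Class.forName(wiredClassName).asSubclass(Reader.class).getConstructor();
        } catch (ClassNotFoundException | NoSuchMethodException | ClassCastException e) {
            try {
                return GenericReader.class.getConstructor();
            } catch (NoSuchMethodException impossible) {
                throw new IllegalStateException(impossible);
            }
        }
    }

    /** Instantiates a reader through the constructor chosen above. */
    static Reader newReader(String wiredClassName) {
        try {
            return findConstructor(wiredClassName).newInstance();
        } catch (ReflectiveOperationException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(newReader("no.such.WiredReader").describe());        // prints "generic"
        System.out.println(newReader(WiredReader.class.getName()).describe());  // prints "wired"
    }
}
```

Caching the chosen constructor once, as the readerConstructor field suggests, avoids repeating the class lookup for every new reader.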


Inner Class: public static enum PropertyKeys

Field Summary
final public static int DEFAULT_BUFFER_SIZE
     The default buffer size.
final public static int DEFAULT_HEIGHT
     The default height (fairly low, due to memory consumption).
final public static int DEFAULT_QUANTUM
     The default quantum.
final public static int FIXED_POINT_BITS
     Fixed number of fractional binary digits used in fixed-point computation of Golomb moduli.
final public static long FIXED_POINT_MULTIPLIER
     1L << BitStreamIndex.FIXED_POINT_BITS.
final public int bufferSize
     The size of the buffer used to read the bit stream.
final public Coding countCoding
     The coding for counts.
final public Coding frequencyCoding
     The coding for frequencies.
final public int height
     The parameter h (the maximum height of a skip tower), or -1 if this index has no skips.
final public LongList offsets
     The offset of each term, if offsets were loaded or specified at creation time, or null.
final public Coding pointerCoding
     The coding for pointers.
final public Coding positionCoding
     The coding for positions.
final public PrefixMap<? extends CharSequence> prefixMap
     The prefix map for this index, or null if the prefix map was not loaded.
final public int quantum
     The quantum, or -1 if this index has no skips.
final public Constructor<? extends IndexReader> readerConstructor
     The constructor that will be used to create new index readers.
final public StringMap<? extends CharSequence> termMap
     The term map for this index, or null if the term map was not loaded.

Constructor Summary
public BitStreamIndex(int numberOfDocuments, int numberOfTerms, long numberOfPostings, long numberOfOccurrences, int maxCount, Payload payload, Coding frequencyCoding, Coding pointerCoding, Coding countCoding, Coding positionCoding, int quantum, int height, int bufferSize, TermProcessor termProcessor, String field, Properties properties, StringMap<? extends CharSequence> termMap, PrefixMap<? extends CharSequence> prefixMap, IntList sizes, LongList offsets)

Method Summary
public IndexIterator documents(CharSequence prefix, int limit)
     Returns a MultiTermIndexIterator over all terms starting with the given prefix, provided their number does not exceed the given limit and that this index has a BitStreamIndex.prefixMap.
protected static String featureName(Coding coding)
public static int gaussianGolombModulus(long quantumSigma, int shift)
     Computes the Gaussian Golomb modulus for a given standard deviation and shift using fixed-point arithmetic.

     The Golomb modulus for (positive and negative) integers normally distributed with standard deviation σ can be computed using the formula ⌈ 2 sqrt( 2 / π ) ln(2) σ ⌉.

     The resulting Golomb modulus is near-optimal for coding such integers after they have been passed through Fast.int2nat(int).
protected Constructor<? extends IndexReader> getConstructor()
abstract public InputBitStream getInputBitStream(int bufferSize)
     Returns an input bit stream over the index.
     Parameters:
       bufferSize - a suggested buffer size.
abstract public InputStream getInputStream()
     Returns an input stream over the index.
public IndexReader getReader(int bufferSize)
public static int golombModulus(int p, int q)
     Computes the Golomb modulus for a given fraction using fixed-point arithmetic and a precomputed table for small values.
public static long quantumSigma(int frequency, int numberOfDocuments, int quantum)
     Computes the standard deviation associated with a given quantum and document frequency.
     Parameters:
       frequency - the document frequency.
       numberOfDocuments - the overall number of documents.
       quantum - the quantum.

Field Detail
DEFAULT_BUFFER_SIZE
final public static int DEFAULT_BUFFER_SIZE
The default buffer size.

DEFAULT_HEIGHT
final public static int DEFAULT_HEIGHT
The default height (fairly low, due to memory consumption).

DEFAULT_QUANTUM
final public static int DEFAULT_QUANTUM
The default quantum.

FIXED_POINT_BITS
final public static int FIXED_POINT_BITS
Fixed number of fractional binary digits used in fixed-point computation of Golomb moduli.

FIXED_POINT_MULTIPLIER
final public static long FIXED_POINT_MULTIPLIER
1L << BitStreamIndex.FIXED_POINT_BITS.

bufferSize
final public int bufferSize
The size of the buffer used to read the bit stream.

countCoding
final public Coding countCoding
The coding for counts. See CompressionFlags.

frequencyCoding
final public Coding frequencyCoding
The coding for frequencies. See CompressionFlags.

height
final public int height
The parameter h (the maximum height of a skip tower), or -1 if this index has no skips.

offsets
final public LongList offsets
The offset of each term, if offsets were loaded or specified at creation time, or null.

pointerCoding
final public Coding pointerCoding
The coding for pointers. See CompressionFlags.

positionCoding
final public Coding positionCoding
The coding for positions. See CompressionFlags.

prefixMap
final public PrefixMap<? extends CharSequence> prefixMap
The prefix map for this index, or null if the prefix map was not loaded.

quantum
final public int quantum
The quantum, or -1 if this index has no skips.

readerConstructor
final public Constructor<? extends IndexReader> readerConstructor
The constructor that will be used to create new index readers.

termMap
final public StringMap<? extends CharSequence> termMap
The term map for this index, or null if the term map was not loaded.

Constructor Detail
BitStreamIndex
public BitStreamIndex(int numberOfDocuments, int numberOfTerms, long numberOfPostings, long numberOfOccurrences, int maxCount, Payload payload, Coding frequencyCoding, Coding pointerCoding, Coding countCoding, Coding positionCoding, int quantum, int height, int bufferSize, TermProcessor termProcessor, String field, Properties properties, StringMap<? extends CharSequence> termMap, PrefixMap<? extends CharSequence> prefixMap, IntList sizes, LongList offsets)




Method Detail
documents
public IndexIterator documents(CharSequence prefix, int limit) throws IOException, TooManyTermsException
Returns a MultiTermIndexIterator over all terms starting with the given prefix, provided their number does not exceed the given limit and that this index has a BitStreamIndex.prefixMap.
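
A minimal sketch of the prefix-with-limit behaviour, under stated assumptions: the term dictionary is modelled as a sorted java.util.TreeSet rather than an MG4J PrefixMap, the result is a plain list instead of a MultiTermIndexIterator, and IllegalStateException stands in for TooManyTermsException. The class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.TreeSet;

/** Hypothetical sketch: collect the terms sharing a prefix, refusing to exceed a limit. */
final class PrefixLimitSketch {
    /**
     * Returns all terms starting with the given prefix, in sorted order.
     * Throws IllegalStateException (a stand-in for TooManyTermsException)
     * if more than limit terms match.
     */
    static List<String> termsWithPrefix(TreeSet<String> terms, String prefix, int limit) {
        List<String> result = new ArrayList<>();
        // In sorted order, the terms sharing a prefix form one contiguous run
        // that starts at the first term not smaller than the prefix itself.
        for (String term : terms.tailSet(prefix, true)) {
            if (!term.startsWith(prefix)) break;
            if (result.size() == limit) {
                throw new IllegalStateException("more than " + limit + " terms start with \"" + prefix + "\"");
            }
            result.add(term);
        }
        return result;
    }

    public static void main(String[] args) {
        TreeSet<String> terms = new TreeSet<>(Arrays.asList("car", "card", "care", "cat", "dog"));
        System.out.println(termsWithPrefix(terms, "car", 3)); // prints [car, card, care]
    }
}
```

With a limit of 2 the same call throws, because a third matching term exists.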



featureName
protected static String featureName(Coding coding)



gaussianGolombModulus
public static int gaussianGolombModulus(long quantumSigma, int shift)
Computes the Gaussian Golomb modulus for a given standard deviation and shift using fixed-point arithmetic.

The Golomb modulus for (positive and negative) integers normally distributed with standard deviation σ can be computed using the formula ⌈ 2 sqrt( 2 / π ) ln(2) σ ⌉.

The resulting Golomb modulus is near-optimal for coding such integers after they have been passed through Fast.int2nat(int). Note, however, that Golomb coding is not optimal for a normal distribution.

This function is used to compute the correct Golomb modulus for skip towers.
Parameters:
  quantumSigma - the standard deviation of a quantum as returned by BitStreamIndex.quantumSigma(int,int,int).
  shift - a shift parameter.
Returns:
  the Golomb modulus for the standard deviation obtained by multiplying quantumSigma by the square root of 2shift-1.
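
For reference, the formula ⌈ 2 sqrt( 2 / π ) ln(2) σ ⌉ can be evaluated directly in floating point. The sketch below is not the library's implementation: it ignores the shift parameter and the fixed-point representation, and its class and method names are hypothetical. The int2nat helper is assumed to mirror Fast.int2nat(int), mapping a nonnegative x to 2x and a negative x to -2x - 1.

```java
/** Hypothetical floating-point sketch of the Gaussian Golomb modulus formula. */
final class GaussianGolombSketch {
    /** The constant 2 * sqrt(2 / pi) * ln(2), roughly 1.106. */
    static final double FACTOR = 2 * Math.sqrt(2 / Math.PI) * Math.log(2);

    /** Evaluates ceil(2 sqrt(2 / pi) ln(2) sigma) for a standard deviation sigma. */
    static int modulus(double sigma) {
        return (int) Math.ceil(FACTOR * sigma);
    }

    /** Maps integers to naturals: x >= 0 goes to 2x, x < 0 goes to -2x - 1 (assumed to mirror Fast.int2nat). */
    static int int2nat(int x) {
        return x >= 0 ? x << 1 : -((x << 1) + 1);
    }

    public static void main(String[] args) {
        System.out.println(modulus(10.0)); // prints 12
        System.out.println(int2nat(-2));   // prints 3
    }
}
```

The factor 2 in the constant reflects the doubling introduced by the integer-to-natural mapping: sqrt( 2 / π ) σ is the mean absolute value of a normal variable, and ln(2) times the mean is the usual approximation of a Golomb modulus.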




getConstructor
protected Constructor<? extends IndexReader> getConstructor()



getInputBitStream
abstract public InputBitStream getInputBitStream(int bufferSize) throws IOException
Returns an input bit stream over the index.
Parameters:
  bufferSize - a suggested buffer size.
Returns:
  an input bit stream over the index.



getInputStream
abstract public InputStream getInputStream() throws IOException
Returns an input stream over the index.
Returns:
  an input stream over the index.



getReader
public IndexReader getReader(int bufferSize) throws IOException



golombModulus
public static int golombModulus(int p, int q)
Computes the Golomb modulus for a given fraction using fixed-point arithmetic and a precomputed table for small values. This gives results that are extremely close to ⌈ −log( 2 − p/q ) / log( 1 − p/q ) ⌉, but the computation is orders of magnitude quicker.
Parameters:
  p - the numerator.
  q - the denominator (larger than or equal to p).
Returns:
  the Golomb modulus for p/q.
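
The reference value that golombModulus approximates can be computed directly as ⌈ −log( 2 − p/q ) / log( 1 − p/q ) ⌉; the minus sign makes the quotient positive, since log( 1 − p/q ) is negative. The following is a slow floating-point sketch of that reference formula only, not the fixed-point, table-based method of the class. The class name is hypothetical, and rejecting p = q is a choice of the sketch, because the logarithm in the denominator is undefined there.

```java
/** Hypothetical floating-point sketch of the reference Golomb modulus formula. */
final class GolombModulusSketch {
    /** Evaluates ceil(-log(2 - p/q) / log(1 - p/q)) for a fraction with 0 < p < q. */
    static int modulus(int p, int q) {
        if (p <= 0 || p >= q) throw new IllegalArgumentException("expected 0 < p < q");
        double x = (double) p / q;
        return (int) Math.ceil(-Math.log(2 - x) / Math.log(1 - x));
    }

    public static void main(String[] args) {
        System.out.println(modulus(1, 2));   // prints 1
        System.out.println(modulus(1, 10));  // prints 7
        System.out.println(modulus(1, 100)); // prints 69
    }
}
```

For small p/q the result is close to ln(2) · q/p, for example about 69.3 for 1/100.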



quantumSigma
public static long quantumSigma(int frequency, int numberOfDocuments, int quantum)
Computes the standard deviation associated with a given quantum and document frequency.
Parameters:
  frequency - the document frequency.
  numberOfDocuments - the overall number of documents.
  quantum - the quantum.
Returns:
  a long representing in fixed-point arithmetic the value Math.sqrt( quantum * ( 1 - p ) ) / p, where p is the relative frequency.
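
A floating-point sketch of the documented value Math.sqrt( quantum * ( 1 - p ) ) / p, together with a generic fixed-point encoding. The class and method names are hypothetical. The value of FIXED_POINT_BITS is not given on this page, so the number of fractional bits is a parameter here and the 16 used in main is purely illustrative. One way to read the expression (an interpretation, not something the documentation states) is as the standard deviation of a sum of quantum independent geometric gaps with success probability p, each of which has variance (1 − p)/p².

```java
/** Hypothetical sketch of the quantum standard deviation and a fixed-point encoding. */
final class QuantumSigmaSketch {
    /** Evaluates sqrt(quantum * (1 - p)) / p with p = frequency / numberOfDocuments. */
    static double sigma(int frequency, int numberOfDocuments, int quantum) {
        double p = (double) frequency / numberOfDocuments;
        return Math.sqrt(quantum * (1 - p)) / p;
    }

    /** Encodes a nonnegative value as a long with the given number of fractional bits (truncating). */
    static long toFixedPoint(double value, int fractionalBits) {
        return (long) (value * (1L << fractionalBits));
    }

    /** Decodes a fixed-point long back to a double. */
    static double fromFixedPoint(long fixed, int fractionalBits) {
        return (double) fixed / (1L << fractionalBits);
    }

    public static void main(String[] args) {
        double s = sigma(1000, 100000, 64);     // p = 0.01, so s is about 795.99
        long fixed = toFixedPoint(s, 16);       // 16 fractional bits: illustrative only
        System.out.println(s);
        System.out.println(fromFixedPoint(fixed, 16));
    }
}
```

Truncation loses less than 2^-fractionalBits of the value, which is why the decoded number in main agrees with s to about four decimal places.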



Fields inherited from it.unimi.dsi.mg4j.index.Index
final public EmptyIndexIterator emptyIndexIterator
final public String field
final public boolean hasCounts
final public boolean hasPayloads
final public boolean hasPositions
public Index keyIndex
final public int maxCount
final public int numberOfDocuments
final public long numberOfOccurrences
final public long numberOfPostings
final public int numberOfTerms
final public Payload payload
final public Properties properties
public ReferenceSet<Index> singletonSet
final public IntList sizes
final public TermProcessor termProcessor

Methods inherited from it.unimi.dsi.mg4j.index.Index
public IndexIterator documents(int term) throws IOException
public IndexIterator documents(CharSequence term) throws IOException
abstract public IndexIterator documents(CharSequence prefix, int limit) throws IOException, TooManyTermsException
public static Index getInstance(CharSequence uri, boolean randomAccess, boolean documentSizes, boolean maps) throws IOException, ConfigurationException, URISyntaxException, ClassNotFoundException, SecurityException, InstantiationException, IllegalAccessException, InvocationTargetException, NoSuchMethodException
public static Index getInstance(CharSequence uri, boolean randomAccess, boolean documentSizes) throws IOException, ConfigurationException, URISyntaxException, ClassNotFoundException, SecurityException, InstantiationException, IllegalAccessException, InvocationTargetException, NoSuchMethodException
public static Index getInstance(CharSequence uri, boolean randomAccess) throws ConfigurationException, IOException, URISyntaxException, ClassNotFoundException, SecurityException, InstantiationException, IllegalAccessException, InvocationTargetException, NoSuchMethodException
public static Index getInstance(CharSequence uri) throws ConfigurationException, IOException, URISyntaxException, ClassNotFoundException, SecurityException, InstantiationException, IllegalAccessException, InvocationTargetException, NoSuchMethodException
public IndexReader getReader() throws IOException
abstract public IndexReader getReader(int bufferSize) throws IOException
protected static TermProcessor getTermProcessor(Properties properties)
public void keyIndex(Index newKeyIndex)

Methods inherited from java.lang.Object
native protected Object clone() throws CloneNotSupportedException
public boolean equals(Object obj)
protected void finalize() throws Throwable
final native public Class getClass()
native public int hashCode()
final native public void notify()
final native public void notifyAll()
public String toString()
final native public void wait(long timeout) throws InterruptedException
final public void wait(long timeout, int nanos) throws InterruptedException
final public void wait() throws InterruptedException
