public abstract class BasicDocMaker implements DocMaker
Create documents for the test.
Maintains counters (of chars, etc.) so that subclasses only need to
provide textual content; creating documents by size is handled here.
Config Params (default is in caps):
doc.stored=true|FALSE
doc.tokenized=TRUE|false
doc.term.vector=true|FALSE
doc.term.vector.positions=true|FALSE
doc.term.vector.offsets=true|FALSE
doc.store.body.bytes=true|FALSE // Store the body contents as raw UTF-8 bytes in a field
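As an illustration of how the defaults above behave, the following minimal sketch reads the six flags from a java.util.Properties object. DocFieldFlags and its flag helper are hypothetical names introduced for this example and are not part of the benchmark package; the real class may obtain its configuration differently.

```java
import java.util.Properties;

/**
 * Hypothetical helper (an assumption for illustration only): reads the
 * documented field flags, falling back to the documented defaults.
 */
class DocFieldFlags {
    final boolean stored;
    final boolean tokenized;
    final boolean termVector;
    final boolean termVectorPositions;
    final boolean termVectorOffsets;
    final boolean storeBodyBytes;

    DocFieldFlags(Properties props) {
        // Defaults mirror the capitalized values in the parameter list.
        stored = flag(props, "doc.stored", false);
        tokenized = flag(props, "doc.tokenized", true);
        termVector = flag(props, "doc.term.vector", false);
        termVectorPositions = flag(props, "doc.term.vector.positions", false);
        termVectorOffsets = flag(props, "doc.term.vector.offsets", false);
        storeBodyBytes = flag(props, "doc.store.body.bytes", false);
    }

    /** Returns the boolean value of key, or defaultValue when the key is absent. */
    static boolean flag(Properties props, String key, boolean defaultValue) {
        return Boolean.parseBoolean(props.getProperty(key, String.valueOf(defaultValue)));
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        props.setProperty("doc.stored", "true"); // override one default
        DocFieldFlags flags = new DocFieldFlags(props);
        System.out.println("stored=" + flags.stored + " tokenized=" + flags.tokenized);
    }
}
```

Only doc.stored is overridden here, so every other flag keeps its documented default.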
Return the data of the next document.
All current implementations can create docs forever.
When the input data is exhausted, the input files are iterated over again.
This re-iteration can be avoided by setting doc.maker.forever to false (default is true).
Returns: data of the next document.
Throws: an exception if the next doc data cannot be created.
Throws: NoMoreDataException - if data is exhausted (and 'forever' is set to false).
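The exhaustion behavior described above can be sketched as follows. This is a simplified model, not the benchmark's implementation: CyclingSource and nextDocData are hypothetical names, the nested NoMoreDataException is a local stand-in for the real exception class, and an in-memory list stands in for the input files.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical source that restarts or fails on exhaustion, depending on a 'forever' flag. */
class CyclingSource {
    /** Local stand-in for the NoMoreDataException named in the documentation. */
    static class NoMoreDataException extends Exception {
        NoMoreDataException() {
            super("input data exhausted");
        }
    }

    private final List<String> inputs;
    private final boolean forever; // plays the role of doc.maker.forever
    private int next = 0;

    CyclingSource(List<String> inputs, boolean forever) {
        this.inputs = inputs;
        this.forever = forever;
    }

    /** Returns the next item, wrapping around to the first one when 'forever' is true. */
    String nextDocData() throws NoMoreDataException {
        if (next >= inputs.size()) {
            // An empty input can never produce data, even in 'forever' mode.
            if (!forever || inputs.isEmpty()) {
                throw new NoMoreDataException();
            }
            next = 0; // re-iterate the inputs from the beginning
        }
        return inputs.get(next++);
    }

    public static void main(String[] args) throws NoMoreDataException {
        CyclingSource source = new CyclingSource(Arrays.asList("a", "b"), true);
        for (int i = 0; i < 3; i++) {
            System.out.println(source.nextDocData());
        }
    }
}
```

With the flag set to true, the third call wraps around and returns the first item again; with it set to false, the third call would throw instead.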