Package org.apache.lucene.codecs
Class KnnVectorsWriter.MergedVectorValues
java.lang.Object
org.apache.lucene.search.DocIdSetIterator
org.apache.lucene.index.VectorValues
org.apache.lucene.codecs.KnnVectorsWriter.MergedVectorValues
- Enclosing class:
- KnnVectorsWriter
View over multiple VectorValues supporting iterator-style access via DocIdMerger.
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate final intprivate KnnVectorsWriter.VectorValuesSubprivate intprivate final DocIDMerger<KnnVectorsWriter.VectorValuesSub>private final intprivate final List<KnnVectorsWriter.VectorValuesSub>Fields inherited from class org.apache.lucene.index.VectorValues
EMPTY, MAX_DIMENSIONSFields inherited from class org.apache.lucene.search.DocIdSetIterator
NO_MORE_DOCS -
Constructor Summary
ConstructorsModifierConstructorDescriptionprivateMergedVectorValues(List<KnnVectorsWriter.VectorValuesSub> subs, MergeState mergeState) -
Method Summary
Modifier and TypeMethodDescriptionintadvance(int target) Advances to the first beyond the current whose document number is greater than or equal to target, and returns the document number itself.Return the binary encoded vector value for the current document ID.longcost()Returns the estimated cost of thisDocIdSetIterator.intReturn the dimension of the vectorsintdocID()Returns the following:-1ifDocIdSetIterator.nextDoc()orDocIdSetIterator.advance(int)were not called yet.(package private) static KnnVectorsWriter.MergedVectorValuesmergeVectorValues(FieldInfo fieldInfo, MergeState mergeState) Returns a merged view over all the segment'sVectorValues.intnextDoc()Advances to the next document in the set and returns the doc it is currently on, orDocIdSetIterator.NO_MORE_DOCSif there are no more docs in the set.
NOTE: after the iterator has exhausted you should not call this method, as it may result in unpredicted behavior.intsize()TODO: should we use cost() for this? We rely on its always being exactly the number of documents having a value for this field, which is not guaranteed by the cost() contract, but in all the implementations so far they are the same.float[]Return the vector value for the current document ID.Methods inherited from class org.apache.lucene.search.DocIdSetIterator
all, empty, range, slowAdvance
-
Field Details
-
subs
-
docIdMerger
-
cost
private final int cost -
size
private final int size -
docId
private int docId -
current
-
-
Constructor Details
-
MergedVectorValues
private MergedVectorValues(List<KnnVectorsWriter.VectorValuesSub> subs, MergeState mergeState) throws IOException - Throws:
IOException
-
-
Method Details
-
mergeVectorValues
static KnnVectorsWriter.MergedVectorValues mergeVectorValues(FieldInfo fieldInfo, MergeState mergeState) throws IOException Returns a merged view over all the segment'sVectorValues.- Throws:
IOException
-
docID
public int docID()Description copied from class:DocIdSetIteratorReturns the following:-1ifDocIdSetIterator.nextDoc()orDocIdSetIterator.advance(int)were not called yet.DocIdSetIterator.NO_MORE_DOCSif the iterator has exhausted.- Otherwise it should return the doc ID it is currently on.
- Specified by:
docIDin classDocIdSetIterator
-
nextDoc
Description copied from class:DocIdSetIteratorAdvances to the next document in the set and returns the doc it is currently on, orDocIdSetIterator.NO_MORE_DOCSif there are no more docs in the set.
NOTE: after the iterator has exhausted you should not call this method, as it may result in unpredicted behavior.- Specified by:
nextDocin classDocIdSetIterator- Throws:
IOException
-
vectorValue
Description copied from class:VectorValuesReturn the vector value for the current document ID. It is illegal to call this method when the iterator is not positioned: before advancing, or after failing to advance. The returned array may be shared across calls, re-used, and modified as the iterator advances.- Specified by:
vectorValuein classVectorValues- Returns:
- the vector value
- Throws:
IOException
-
binaryValue
Description copied from class:VectorValuesReturn the binary encoded vector value for the current document ID. These are the bytes corresponding to the float array return byVectorValues.vectorValue(). It is illegal to call this method when the iterator is not positioned: before advancing, or after failing to advance. The returned storage may be shared across calls, re-used and modified as the iterator advances.- Overrides:
binaryValuein classVectorValues- Returns:
- the binary value
- Throws:
IOException
-
advance
public int advance(int target) Description copied from class:DocIdSetIteratorAdvances to the first beyond the current whose document number is greater than or equal to target, and returns the document number itself. Exhausts the iterator and returnsDocIdSetIterator.NO_MORE_DOCSif target is greater than the highest document number in the set.The behavior of this method is undefined when called with
target ≤ current, or after the iterator has exhausted. Both cases may result in unpredicted behavior.When
target > currentit behaves as if written:int advance(int target) { int doc; while ((doc = nextDoc()) < target) { } return doc; }Some implementations are considerably more efficient than that.NOTE: this method may be called with
DocIdSetIterator.NO_MORE_DOCSfor efficiency by some Scorers. If your implementation cannot efficiently determine that it should exhaust, it is recommended that you check for that value in each call to this method.- Specified by:
advancein classDocIdSetIterator
-
size
public int size()Description copied from class:VectorValuesTODO: should we use cost() for this? We rely on its always being exactly the number of documents having a value for this field, which is not guaranteed by the cost() contract, but in all the implementations so far they are the same.- Specified by:
sizein classVectorValues- Returns:
- the number of vectors returned by this iterator
-
cost
public long cost()Description copied from class:DocIdSetIteratorReturns the estimated cost of thisDocIdSetIterator.This is generally an upper bound of the number of documents this iterator might match, but may be a rough heuristic, hardcoded value, or otherwise completely inaccurate.
- Specified by:
costin classDocIdSetIterator
-
dimension
public int dimension()Description copied from class:VectorValuesReturn the dimension of the vectors- Specified by:
dimensionin classVectorValues
-