Add support for determining off-heap memory requirements for KnnVectorsReader #14426

ChrisHegarty · 2025-04-01T10:59:05Z

This PR adds support to KnnVectorsReader for determining off-heap memory requirements.

The motivation is to give better insight into how much off-heap memory will be needed, so that deployments can be scaled such that vector search workloads fit in memory and deliver the best execution performance.

tteofili

LGTM, thanks Chris, this sounds like a nice building block for assessing memory requirements.

...rd-codecs/src/java/org/apache/lucene/backward_codecs/lucene90/Lucene90HnswVectorsReader.java

benwtrent

I like the direct API for gathering "codec off-heap file size". However, I am not sure we are doing that accounting correctly in this PR.

I do not like calling it "off-heap requirements". Really, it's just the potential size if all of the file must be loaded off-heap.

lucene/core/src/java/org/apache/lucene/util/OffHeapAccountable.java

.../core/src/java/org/apache/lucene/codecs/lucene102/Lucene102BinaryQuantizedVectorsReader.java

ChrisHegarty · 2025-04-02T15:45:11Z

Given the feedback so far, I've pivoted this quite a bit to now include per-field metrics. To support this, I removed the previously proposed OffHeapAccountable interface and put the accessor on KnnVectorsReader, where the other per-field accessors are. I used FieldInfo as the lookup since it is needed to determine whether the field is Byte or Float32, which the "richer" codec implementations need to know because they quantize floats and pass bytes through.
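To make the per-field shape concrete, here is a minimal, self-contained sketch. It does not use Lucene classes: `OffHeapSketch`, `VectorEncoding`, `FieldStats`, `offHeapByteSize` and `totalByCategory` are hypothetical stand-ins, and the category names and byte counts are made-up illustrative values. It assumes the accessor returns a map from category (the file extension, as settled later in this review) to a byte size, and shows how a caller might sum those maps across fields.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class OffHeapSketch {
  /** Hypothetical stand-in for the field's vector encoding (byte vs. float32). */
  enum VectorEncoding { BYTE, FLOAT32 }

  /** Hypothetical per-field description: encoding plus raw, quantized and graph sizes in bytes. */
  record FieldStats(
      String name, VectorEncoding encoding, long rawBytes, long quantizedBytes, long graphBytes) {}

  /**
   * Hypothetical per-field accessor returning category -> bytes.
   * Byte fields pass through unquantized, so only float32 fields report a quantized file.
   */
  static Map<String, Long> offHeapByteSize(FieldStats field) {
    Map<String, Long> sizes = new LinkedHashMap<>();
    sizes.put("vec", field.rawBytes());   // illustrative category for raw vectors
    sizes.put("vex", field.graphBytes()); // illustrative category for the graph index
    if (field.encoding() == VectorEncoding.FLOAT32) {
      sizes.put("veq", field.quantizedBytes()); // illustrative category for quantized vectors
    }
    return sizes;
  }

  /** Sums the per-field maps into one total per category. */
  static Map<String, Long> totalByCategory(List<FieldStats> fields) {
    Map<String, Long> totals = new TreeMap<>();
    for (FieldStats f : fields) {
      offHeapByteSize(f).forEach((category, bytes) -> totals.merge(category, bytes, Long::sum));
    }
    return totals;
  }

  public static void main(String[] args) {
    List<FieldStats> fields = List.of(
        new FieldStats("title_vec", VectorEncoding.FLOAT32, 4096L, 1024L, 512L),
        new FieldStats("hash_vec", VectorEncoding.BYTE, 2048L, 0L, 256L));
    System.out.println(totalByCategory(fields));
  }
}
```

Keeping the result as a map per field, rather than a single number, lets a caller decide which categories matter for a given deployment before adding them up.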

lucene/core/src/java/org/apache/lucene/codecs/KnnVectorsReader.java

jimczi

LGTM on the new API, I left some nit comments on naming and assertions.

.../core/src/java/org/apache/lucene/codecs/lucene102/Lucene102BinaryQuantizedVectorsReader.java

lucene/core/src/java/org/apache/lucene/codecs/KnnVectorsReader.java

mayya-sharipova

Thanks Chris, organization by file extensions is very good.

benwtrent

I like this much better. I do think this requires a CHANGES.txt entry, as it modifies the KnnVectorsReader API. And possibly the API shouldn't be required for all vector readers due to BWC concerns?
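The BWC concern can be illustrated with a small sketch: if the new accessor has a default body, readers written before it existed keep compiling and simply report nothing. This is only one possible approach, not necessarily what this PR ended up doing. `DefaultImplSketch`, `BaseReader`, `LegacyReader`, `NewReader` and `offHeapByteSize` are hypothetical names, and returning an empty map for "unknown" is an assumption.

```java
import java.util.Map;

public class DefaultImplSketch {
  /** Hypothetical stand-in for an abstract reader base class. */
  abstract static class BaseReader {
    /** Existing abstract method that every reader already implements. */
    abstract String name();

    /** Newly added accessor with a default body, so existing subclasses keep compiling. */
    Map<String, Long> offHeapByteSize(String fieldName) {
      return Map.of(); // assumption: "unknown" is reported as an empty map
    }
  }

  /** A reader written before the accessor existed; it does not override it. */
  static class LegacyReader extends BaseReader {
    @Override
    String name() {
      return "legacy";
    }
  }

  /** A reader that opts in and reports an illustrative, made-up size. */
  static class NewReader extends BaseReader {
    @Override
    String name() {
      return "new";
    }

    @Override
    Map<String, Long> offHeapByteSize(String fieldName) {
      return Map.of("vec", 4096L);
    }
  }

  public static void main(String[] args) {
    for (BaseReader reader : new BaseReader[] {new LegacyReader(), new NewReader()}) {
      System.out.println(reader.name() + " -> " + reader.offHeapByteSize("my_field"));
    }
  }
}
```

The trade-off is that a default body makes it easy for a new reader to forget the override, whereas an abstract method forces every implementation to answer.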

lucene/core/src/java/org/apache/lucene/codecs/KnnVectorsReader.java

navneet1v · 2025-04-14T17:22:00Z

@ChrisHegarty this PR provides info on how much off-heap space is needed, but it doesn't provide info on how much is actually loaded into memory, correct? And do we have any plans to expose the current usage info?

ChrisHegarty · 2025-04-15T08:27:45Z

Hi @navneet1v. You are correct, this is informational only. I do plan to follow up with the actual usage, which may evolve this particular API.

navneet1v · 2025-04-15T10:35:00Z

Thanks, that will be super useful.

Add support for determining off-heap memory requirements for vector r…

1f170c1

…eaders

github-project-automation bot added this to OpenSearch Lucene & Core Performance Tracking Apr 1, 2025

github-project-automation bot moved this to Open in OpenSearch Lucene & Core Performance Tracking Apr 1, 2025

github-actions bot added module:core/index module:core/codecs module:test-framework labels Apr 1, 2025

tteofili approved these changes Apr 1, 2025

...rd-codecs/src/java/org/apache/lucene/backward_codecs/lucene90/Lucene90HnswVectorsReader.java (outdated)

benwtrent reviewed Apr 1, 2025

ChrisHegarty added 4 commits April 1, 2025 15:04

experimental

9e02566

itr

f5d8dd9

itr

c144d46

major refactor - move to per-field metrics

072cb8a

mayya-sharipova reviewed Apr 3, 2025

lucene/core/src/java/org/apache/lucene/codecs/KnnVectorsReader.java (outdated)

jimczi reviewed Apr 3, 2025

.../core/src/java/org/apache/lucene/codecs/lucene102/Lucene102BinaryQuantizedVectorsReader.java

lucene/core/src/java/org/apache/lucene/codecs/KnnVectorsReader.java (outdated)

use file extension as category

e9d7959

mayya-sharipova approved these changes Apr 7, 2025

ChrisHegarty added 3 commits April 14, 2025 14:42

Merge branch 'main' into offHeapAccountable

2bd7b94

implement me

e47deb2

tests

a7d8642

ChrisHegarty mentioned this pull request Apr 14, 2025

Add dense vector off-heap stats to Node stats and Index stats APIs elastic/elasticsearch#126704

Merged

ChrisHegarty added 2 commits April 14, 2025 17:49

unused import

7e15568

tidy

a5dd7ee

benwtrent reviewed Apr 14, 2025

lucene/core/src/java/org/apache/lucene/codecs/KnnVectorsReader.java (outdated)

ChrisHegarty added 2 commits April 15, 2025 09:23

default impl

4507034

Merge branch 'main' into offHeapAccountable

c670b3a

ChrisHegarty merged commit 686a5b8 into apache:main Apr 15, 2025
7 checks passed

github-project-automation bot moved this from Open to Merged in OpenSearch Lucene & Core Performance Tracking Apr 15, 2025

ChrisHegarty added a commit that referenced this pull request Apr 15, 2025

Add changelog for #14426

cf661ba

ChrisHegarty mentioned this pull request Apr 15, 2025

[10.x] Add support for determining off-heap memory requirements for KnnVectorsReader #14497

Merged

ChrisHegarty added a commit that referenced this pull request Apr 15, 2025

Add changelog for #14426

2d026b7

ChrisHegarty mentioned this pull request Apr 21, 2025

Add getOffHeapByteSize to ES vector readers elastic/elasticsearch#127104

Merged

jpountz pushed a commit to jpountz/lucene that referenced this pull request Apr 24, 2025

Add changelog for apache#14426

378e20f
