Skip to content

Commit

Permalink
Update _posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md
Browse files Browse the repository at this point in the history
Signed-off-by: kolchfa-aws <[email protected]>
  • Loading branch information
kolchfa-aws authored Feb 6, 2025
1 parent 1436072 commit 24b8ff1
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -257,7 +257,7 @@ Interestingly, for this dataset, the on-disk approach with rescoring produces si

## Learnings

Our testing shows that the two-phase ANN approach performs effectively in low-memory environments, though results vary significantly by dataset. When running your own experiments, we recommend testing with the `index.knn.disk.vector.shard_level_rescoring_disabled` setting both enabled and disabled to measure the performance benefit for your use case. Additionally, with disk-based search, ensure that your secondary storage is optimized for high read traffic---we found that SSDs generally provide the best results.
Our testing shows that the two-phase approximate nearest neighbor approach performs effectively in low-memory environments, though results vary significantly by dataset. When running your own experiments, we recommend testing with the `index.knn.disk.vector.shard_level_rescoring_disabled` setting both enabled and disabled to measure the performance benefit for your use case. Additionally, with disk-based search, ensure that your secondary storage is optimized for high read traffic---we found that SSDs generally provide the best results.

## What's next?

Expand Down

0 comments on commit 24b8ff1

Please sign in to comment.