https://duckdb.org/2024/05/03/vector-similarity-search-vss
+1 on this one, I've been pleasantly surprised by this for a small (<3GB) local project
Does DuckDB scale well to large datasets for vector search?
What order of magnitude would you define as "large" in this case?
Like over 1 TB.
Some people are using DuckDB for large datasets, https://duckdb.org/docs/stable/guides/performance/working_wi... , but you'd probably want to do some testing under the specific conditions of your rig to figure out whether it is a good match.
It's clear that many DuckDB SQL queries can handle terabytes of data, but the question here was about vector search.