https://duckdb.org/2024/05/03/vector-similarity-search-vss

+1 on this one, I've been pleasantly surprised by this for a small (<3GB) local project

Does DuckDB scale well to large datasets for vector search?

What order of magnitude would you define as "large" in this case?

Like over 1 TB.

Some people are using DuckDB for large datasets, https://duckdb.org/docs/stable/guides/performance/working_wi... , but you'd probably want to do some testing under the specific conditions of your rig to figure out whether it is a good match.

It's clear many DuckDB SQL queries can handle terabytes of data, but the question here was about vector search.