Hacker News

fumeux_fume 4 days ago [ - ]

In the same talk, Mark acknowledges that "for data science workflows, database systems are frustrating and slow." Granted DuckDB is an attempt to fix that, most data scientists don't get to choose what database the data is stored in.

willvarfar 4 days ago [ - ]

(I use duckdb to query data stored in parquet files)

mrtimo 4 days ago [ - ]

Same. But, I use Malloy which uses duckdb to query data stored in hundreds of parquet files (as if they were one big file).

willvarfar 4 days ago [ - ]

I haven't looked at Mallory, but I do regularly scan lots of parquet files using wildcards etc from duckdb. Its a neat builtin duckdb feature.