Running Dask in Databricks

from blog Posts on Hi, I'm Ben 🛸, | ↗ original
I should probably admit that there’s a bit of a contradiction between two thoughts that I have: I really love spark I really hate spark Spark is one of the most powerful dataframe libraries on the planet. It can process multiple petabytes of data. But it’s also overkill and unwieldy for most jobs. For smaller datasets, tools like Polars or Duckdb...