> No, polars or spark is not a good answer, those are optimized for data enginee...

fifilura · 2025-12-02T17:08:21 1764695301

I have not worked with Polars, but I would imagine any incompatibility with existing libraries (e.g. plotting libraries like plotnine, bokeh) would quickly put me off.

It is a curse, I know. I would also choose a better interface. Performance is meh to me; I use SQL if I want to do something at scale that involves row/column data.

rbartelme · 2025-12-02T17:14:30 1764695670

This is a non-issue thanks to the Polars DataFrame to_pandas() method. You get all the performance of Polars for cleaning large datasets, and to_pandas() gives you backwards compatibility with other libraries. For plotnine you don't even need the conversion, since it is completely compatible with Polars DataFrame objects.

maleldil · 2025-12-02T17:14:39 1764695679

You can always convert from Polars to Pandas. Plotnine will do it automatically for you, even.