Discussion about this post

User's avatar
Rushi Jariwala's avatar

These results are quite interesting. Daft seems to have the power to replace pyspark even for distributed computing. Atleast thats whats the authors say. And they have made it easy to integrate with ray with Daft Launcher.

Minor Error: Dask is mentioned instead of Daft in multiple places. Both are different libraries.

Expand full comment
Gerhard Brueckl's avatar

You could make the access even more generic by reading the URL and AccessToken from the current Databricks-session via dbutils

Expand full comment

No posts