Data Engineering Central Podcast - 09
Cluster Fatigue and the Death of Open Source
Nov 13
•
Daniel Beach
6:51
650GB of Data (Delta Lake on S3). Polars vs DuckDB vs Daft vs Spark.
cluster fatigue
Nov 12
•
Daniel Beach
Databricks Compute.
Examining the Inscrutable
Nov 10
•
Daniel Beach
_internal.DeltaProtocolError:
The Databricks and Delta Lake lie (the mighty have fallen)
Nov 6
•
Daniel Beach
ClickHouse: A Super-Fast Columnar Database
Guest Post Series
Nov 5
•
Ahmed Shaaban
Gzip. CSV. Python. S3. (Polars vs DuckDB)
headaches ya' know?
Nov 3
•
Daniel Beach
October 2025
Lance file format. Parquet killer?
time will tell
Oct 27
•
Daniel Beach
AWS Outage. Pipeline Problems? What next?
do nothing?
Oct 21
•
Daniel Beach
Simplifying CI/CD with Databricks Asset Bundles (DABs)
From Chaos to Control
Oct 20
•
Daniel Beach
Drainage: The Missing Piece in Your Lake House Health Strategy
How a Rust-powered Python library is revolutionizing data lake monitoring and optimization
Oct 16
•
Daniel Beach
Fivetran + dbt Labs merger. What does it mean?
anything at all?
Oct 13
•
Daniel Beach
Run Llama 3.1 8B Locally with LangChain and SQLite Memory
Learn AI by doing
Oct 13
•
Daniel Beach