Data Engineering Central

Data Engineering Central

Home
Podcast
Archive
Leaderboard
About

Sitemap - 2025 - Data Engineering Central

Rust, Python, Perl, COBOL, PHP ... oh my.

Golang with DuckDB, and more.

Lakebase from Databricks.

Nushell in a Nutshell

DuckDB enters the Lake House race.

Databricks SQL Scripting

Apache Iceberg Rant.

What?! An Iceberg Catalog that works?

How do eat a Data Platform?

DuckDB + PyIceberg + Lambda

Postgres to Delta Lake ... and back again.

AI is NEVER going to take your job.

DataFrame / SQL Column Manipulation.

Ray on Databricks. Distributed Python.

Becoming a "better" Data Engineer

Clustering vs Partitions - Pick your poison.

Cloudflare R2 + Apache Iceberg + R2 Data Catalog + Daft

Complicated != Good

Review of Data Orchestration Landscape

Data Engineering Central Podcast - 07

Databricks Compute. Thoughts and more.

Apache Polaris (Iceberg Catalog) ... with Daft

Partitions in Distributed Compute.

Test, test, and then test again.

What is a "healthy" Lake House (Delta Lake style)?

smallpond ... distributed DuckDB?

dbt on Databricks

Data Analysis is Hell

AI Code Revolution. Embrace or Deny?

Replace Python's pip with uv?

Lord have mercy. Apache XTable.

Data Engineering Central Podcast - 06

Delta Lake vs Iceberg. UniForm and Unity Catalog.

Your Limiting Factor (in Software)

Bespoke vs Managed Data Platforms

Polars and DuckDB release Unity Catalog (Delta Lake) integrations. Who lied? Who didn't?

Data Quality with Databricks Labs new DQX tool.

Terraform Sucks. Long Live Terraform.

DuckDB processing remote (s3) JSON files.

How to write better PySpark code.

© 2025 dataengineeringdude
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share