Sitemap - 2024 - Data Engineering Central
You're Doing Data Engineering Wrong.
Date and Time Manipulation with DuckDB
Kubernetes Sucks. Long Live K8s.
Ain't no room for AI (in my workflow)
Replace Databricks Spark Jobs (using Delta) with Polars
Snowflake is Dying on the Vine?
Data Validation for Data Engineers
DuckDB 1.0.0 - Let's Kick The Tires
When to Rust for Data Engineering ... and when NOT to.
Introduction to Daft ( ... vs Polars)
Real Life Example of the QuickSort Algo (Rust)
Premature Optimization is NOT the root of all evil?
I See Window Functions Everywhere
How Tech Debt, Databricks, and Spark UDFs ruined my weekend.
Cost Savings for Databricks Users
Why Analytics is a Lose Lose Game
Redshift vs Snowflake vs BigQuery vs Databricks vs ...
Transitioning to Senior Engineer
Delta Lake - Map and Array data types
Spark Connect - What is this madness?
How to Build an Open Source Python Package
Why Aren’t You Filtering More?
Default Values - Thoughts and More
Error Handling for Data Engineers
Microservices for Data Engineering
UDTFs (User-defined Table Functions) in PySpark.
Apple Pie. Angry People. Other News.
DuckDB vs Polars - Thunderdome.
New SQL Practice Problems - Free For Paid Subscribers
Unit Testing for Data Engineers
Batch vs Near-Realtime vs Streaming
Why DuckDB is losing to Polars
LLMs Part 2 - Fine Tuning OpenLLaMA
Introduction to Write-Audit-Publish Pattern
Data Warehouse Analytics - Latency