Data Engineering Central

Data Engineering Central

Share this post

Data Engineering Central
Data Engineering Central
Bytes for Data Engineers

Bytes for Data Engineers

and Buffers ... and Streams.

Daniel Beach's avatar
Daniel Beach
Aug 21, 2023
∙ Paid
3

Share this post

Data Engineering Central
Data Engineering Central
Bytes for Data Engineers
Share

One of the most underutilized pieces of code I’ve seen in all my many years of Data Engineering is Bytes and Streams. I’m not sure why. It just never appears.

I see Strings, Ints, Floats, I see everything, but never a plain old Byte. Poor little bugger. I don’t know if people think it’s too complicated, in fact, it is less, less to go wrong, less complexity.

What’s more computationally expensive than Serialization and Deserialization? Especially in a Data Engineering context. Lots of data moving around, coming from this place and going to that place. Does it really need to be a String all the time? No.

Let’s take a look at Bytes, Streams, and Buffers in Python and Rust.

Thanks to Delta for sponsoring this newsletter! I personally use Delta Lake on a daily basis, and I believe this technology represents the future of Data Engineering. Check out their website below.

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 dataengineeringdude
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share