Data Engineering Central

Data Engineering Central

Lance file format. Parquet killer?

time will tell

Daniel Beach's avatar
Daniel Beach
Oct 27, 2025
∙ Paid

Who doesn’t love to play with something new? At least new to us. The Lance file format has been around since … well, I’m not sure, but the PRs in the GitHub repo start around 2022. Young in digital years. Just a little ‘fella.

There isn’t really anything to do but jump in the deep end when it comes to file formats. It’s hard to imagine anything unseating Parquet file, the proverbial Altas carrying Iceberg and Delta Lake on its shoulders. But, parquet files started from nothing too.

I’m not going to waste my time with nonsensical deeply technical questions about how this Lance file format works. No one cares.

What do we care about?

  • What big name Data Engineering frameworks support Lance?

  • How well does it work with S3?

  • Is the performance at least on par with Parquet?

So today I hope we can leave with a little bit of knowledge about the Lance file format, its use cases, and if it’s worth your trouble to play around with it.

User's avatar

Continue reading this post for free, courtesy of Daniel Beach.

Or purchase a paid subscription.
© 2026 dataengineeringdude · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture