Discussion about this post

User's avatar
Daniel Warfield's avatar

Guilty as charged. When creating new models/exploring new modeling paradigms/doing iterative and in-depth experimentation/explaining complex topics I simply have not found a better alternative. They're rich, portable, and are implicitly designed to support iterative workflows while keeping data in memory which saves me, probably, a billion hours a day.

new problem -> a series of notebooks as I explore the problem -> the final notebook where I solve the problem -> save artifacts -> test artifacts -> build production code around artifacts.

Modern version control systems and coding environments are not designed for the complex and experimentative work that usually represents the impetus of a data science project. That's what notebooks are for. But of course the notebooks themselves never run in a production setting, or at least I hope to god they don't.

(This is all for data science work. For data engineering I'm not really sure why they're so popular)

Robert Ward's avatar

Guilty but only because I’m using MS Fabric and have no other choice (mostly). Fabric users have been clamoring for the ability to run normal Python scripts for a while now. Hopefully they announce it eventually.

2 more comments...

No posts

Ready for more?