Discussion about this post

User's avatar
Daniel Warfield's avatar

Guilty as charged. When creating new models/exploring new modeling paradigms/doing iterative and in-depth experimentation/explaining complex topics I simply have not found a better alternative. They're rich, portable, and are implicitly designed to support iterative workflows while keeping data in memory which saves me, probably, a billion hours a day.

new problem -> a series of notebooks as I explore the problem -> the final notebook where I solve the problem -> save artifacts -> test artifacts -> build production code around artifacts.

Modern version control systems and coding environments are not designed for the complex and experimentative work that usually represents the impetus of a data science project. That's what notebooks are for. But of course the notebooks themselves never run in a production setting, or at least I hope to god they don't.

(This is all for data science work. For data engineering I'm not really sure why they're so popular)

Robert Long's avatar

Personally I much prefer a laptop to a notebook ;)

Or if you mean a "Jupyter Notebook" type of thing, well I have heard of those but I didn't like them so much - apart for some quick data exploration - and even then I prefer using RStudio (for both R and Python) or even just vscode with a few useful extensions.

3 more comments...

No posts

Ready for more?