Effective Data Science Infrastructure

By: Ville Tuulos

Paperback | 19 September 2022

$95.75

Simplify data science infrastructure to give data scientists an efficient path from prototype to production.

In Effective Data Science Infrastructure you will learn how to:

    Design data science infrastructure that boosts productivity
    Handle compute and orchestration in the cloud
    Deploy machine learning to production
    Monitor and manage performance and results
    Combine cloud-based tools into a cohesive data science environment
    Develop reproducible data science projects using Metaflow, Conda, and Docker
    Architect complex applications for multiple teams and large datasets
    Customize and grow data science infrastructure

Effective Data Science Infrastructure: How to make data scientists more productive is a hands-on guide to assembling infrastructure for data science and machine learning applications. It reveals the processes used at Netflix and other data-driven companies to manage their cutting-edge data infrastructure. In it, you'll master scalable techniques for data storage, computation, experiment tracking, and orchestration that are relevant to companies of all shapes and sizes. You'll learn how you can make data scientists more productive with your existing cloud infrastructure, a stack of open source software, and idiomatic Python.

The author is donating proceeds from this book to charities that support women and underrepresented groups in data science.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the technology
Growing data science projects from prototype to production requires reliable infrastructure. Using the powerful new techniques and tooling in this book, you can stand up an infrastructure stack that will scale with any organization, from startups to the largest enterprises.

About the book
Effective Data Science Infrastructure teaches you to build data pipelines and project workflows that will supercharge data scientists and their projects. Based on the state-of-the-art tools and concepts that power data operations at Netflix, this book introduces a customizable cloud-based approach to model development and MLOps that you can easily adapt to your company's specific needs. As you roll out these practical processes, your teams will produce better and faster results when applying data science and machine learning to a wide array of business problems.

What's inside

    Handle compute and orchestration in the cloud
    Combine cloud-based tools into a cohesive data science environment
    Develop reproducible data science projects using Metaflow, AWS, and the Python data ecosystem
    Architect complex applications that require large datasets and models, and a team of data scientists

About the reader
For infrastructure engineers and engineering-minded data scientists who are familiar with Python.

About the author
At Netflix, Ville Tuulos designed and built Metaflow, a full-stack framework for data science. Currently, he is the CEO of a startup focusing on data science infrastructure.

Table of Contents
1 Introducing data science infrastructure
2 The toolchain of data science
3 Introducing Metaflow
4 Scaling with the compute layer
5 Practicing scalability and performance
6 Going to production
7 Processing data
8 Using and operating models
9 Machine learning with the full stack

Industry Reviews

"Do not miss the opportunity to cover all key aspects of data science infrastructure on your next project." Jesus A. Juarez Guerrero

"Useful book that provides tactical guidance on how to use Metaflow to streamline data science workflows but also includes great frameworks and abstractions to consider when defining your data science infrastructure stack." Sarah Catanzaro

"This is the ultimate book to learn how to handle infrastructure in data science!" Ninoslav Cerkez

"If you need a workflow management tool to glue your data code, look at metaflow. It's simple yet efficient." Mikael Dautrey
