Pretrain Vision and Large Language Models in Python

End-to-end techniques for building and deploying foundation models on AWS

By: Andrea Olgiati, Emily Webber

eText | 31 May 2023 | Edition Number 1

Conceptual fundamentals and practical guidance from industry experts to pretrain the large vision and language models of the future.

Key Features

  • Learn how and where to develop, train, tune, and apply your own pretrained models
  • Master distributed training concepts for models & datasets, with code examples for AWS and SageMaker
  • Evaluate, deploy, and operationalize your custom models with bias detection and pipeline monitoring

Book Description

Large models have forever changed machine learning. From BERT to GPT-3, Vision Transformers to DALL-E, when billions of parameters are combined with large datasets and hundreds to thousands of GPUs, the result is nothing short of record-breaking. The recommendations, advice, and code samples in this book will help you pretrain your large models from scratch on AWS and Amazon SageMaker and apply them to hundreds of use cases across your organization.

With advice from seasoned AWS ML expert Emily Webber, this book provides everything you need to go from project ideation through dataset preparation, training, and evaluation to deployment for large language, vision, and multimodal models. With step-by-step explanations of essential concepts and practical examples, you'll go all the way from mastering the concept of pretraining itself to preparing your dataset and model, configuring your environment, and training, evaluating, and deploying your models.

From applying the scaling laws to distributing your model and dataset over multiple GPUs, you'll learn how to successfully train, evaluate, and deploy your model on Amazon SageMaker. By the end of this book, you will have everything you need to embark on your own project to pretrain the large language models of the future, purpose-built for your organization.

What you will learn

  • Prepare to train large models, from choosing the right dataset to sizing your GPU needs
  • Configure environments on AWS and SageMaker for optimal performance
  • Select the right hyperparameters for your model, given your constraints
  • Distribute your model and dataset with different types of parallelism
  • Avoid pitfalls with job restarts, intermittent health checks, and more
  • Evaluate your model with quantitative and qualitative insights
  • Deploy your models with runtime improvements and monitoring
  • Detect and mitigate bias in your deployment and retraining pipelines

Who This Book Is For

If you're a machine learning enthusiast or researcher who wants to get started on your very own large-model project, this book is for you. Applied scientists, data scientists, machine learning engineers, solution architects, product managers, and students will all enjoy the material. Basic Python is a must, and familiarity with introductory cloud computing concepts will be very helpful. We'll assume some knowledge of deep learning fundamentals but will explain advanced topics.

Table of Contents

  1. An introduction to pretraining
  2. Dataset preparation: part one
  3. Model preparation
  4. Into the GPU
  5. Parallelization basics
  6. Dataset preparation: part two
  7. Find the right hyperparameters
  8. Make sure your loss goes down
  9. Troubleshoot ongoing performance
  10. Determine the right length of training time
  11. Finetune and compare with open source models
  12. Detect and mitigate bias
  13. How small can you go?
  14. Use cases: scale across organizations
  15. Ongoing operations, monitoring and maintenance