Get Free Shipping on orders over $79
Hands-On LLM Serving and Optimization : Hosting LLMs at Scale - Chi Wang

Hands-On LLM Serving and Optimization

Hosting LLMs at Scale

By: Chi Wang, Peiheng Hu

Paperback | 29 May 2026

At a Glance

Paperback


$73.75

or 4 interest-free payments of $18.44 with

 or 

Available: 29th May 2026

Preorder. Will ship when available.

As the demand for real-time AI applications grows, along comes this comprehensive guide to the complexities of deploying and optimizing LLMs at scale. The authors take a real-world approach backed by practical examples and code, and assemble essential strategies for designing infrastructures that are equal to the demands of modern AI applications.

More in Natural Language & Machine Translation

AI Engineering : Building Applications with Foundation Models - Chip Huyen
Scaling Responsible AI : From Enthusiasm to Execution - Noelle Russell
ChatGPT For Dummies : For Dummies (Computer/Tech) - Pam Baker

RRP $56.99

$39.99

30%
OFF
AI ChatBots For Dummies : For Dummies (Computer/Tech) - Kelly Noble Mirabella
The Governance of Artificial Intelligence - Tshilidzi, Ph.D.  Marwala

RRP $327.95

$291.75

11%
OFF
Visualizing Generative AI : How AI Paints, Writes, and Assists - Priyanka Vergadia
Acting : Keywords and Concepts - John  Matthews

RRP $130.00

$118.75

Acting : Keywords and Concepts - John Matthews

RRP $56.99

$56.75

Federated Learning for Healthcare : Applications with Case Studies - R. Anandan