Hands-On LLM Serving and Optimization : Hosting LLMs at Scale - Chi Wang

Hands-On LLM Serving and Optimization

Hosting LLMs at Scale

By: Chi Wang, Peiheng Hu

Paperback | 2 June 2026

At a Glance

Paperback


$142.75

or 4 interest-free payments of $35.69

Available: 2nd June 2026

Preorder. Will ship when available.

As demand for real-time AI applications grows, this comprehensive guide addresses the complexities of deploying and optimizing LLMs at scale. The authors take a real-world approach backed by practical examples and code, and present essential strategies for designing infrastructure that can meet the demands of modern AI applications.

More in Natural Language & Machine Translation

Scaling Responsible AI : From Enthusiasm to Execution - Noelle Russell
Think Python : How To Think Like a Computer Scientist - Allen B. Downey
ChatGPT For Dummies : For Dummies (Computer/Tech) - Pam Baker

RRP $41.95

$33.75

20% OFF
The Scaling Era : An Oral History of AI, 2019-2025 - Dwarkesh Patel
Natural Language Processing with Transformers, Revised Edition - Leandro Von Werra
Federated Learning for Healthcare : Applications with Case Studies - D. Balaganesh
Acting : Keywords and Concepts - John  Matthews

RRP $130.00

$118.75