Jump to ratings and reviews
Rate this book

AI Systems Performance Engineering: Optimizing Model Training and Inference Workloads with GPUs, CUDA, and PyTorch

Not yet published
Expected 30 Dec 25
Rate this book

700 pages, Paperback

Expected publication December 30, 2025

1 person is currently reading
8 people want to read

About the author

Chris Fregly

6 books2 followers

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
1 (33%)
4 stars
2 (66%)
3 stars
0 (0%)
2 stars
0 (0%)
1 star
0 (0%)
Displaying 1 of 1 review
Profile Image for Xianshun Chen.
88 reviews2 followers
May 11, 2025
very enlightening book on llm distributed training and inference at scale, concise and well versed, packed with lots of details. I particularly like the discussion on the software hardware co-design, and discussion on the compute, storage, and network io
Displaying 1 of 1 review

Can't find what you're looking for?

Get help and learn more about the design.