Optimizing LLM Performance: Framework-Agnostic Techniques for Speed, Scalability, and Cost-Efficient Inference Across PyTorch, ONNX, vLLM, and More

