Jump to ratings and reviews
Rate this book

AWS Glue for Data Engineering: Serverless ETL, Analytics, and Workflow Automation for Modern Cloud Pipelines

Rate this book
AWS Glue for Data Serverless ETL, Analytics, and Workflow Automation for Modern Cloud Pipelines

Building and scaling data pipelines shouldn’t require endless cluster management or overnight troubleshooting. What if you could design, deploy, and automate end-to-end data workflows entirely serverless—freeing your time to focus on insight rather than infrastructure?


This book is your complete guide to mastering AWS Glue, the cornerstone of serverless data engineering on AWS. Written for data engineers, cloud architects, and analytics professionals, it walks you through the full lifecycle of building modern data pipelines—from raw ingestion to production analytics. You’ll learn how to use Glue to design efficient ETL workflows, manage metadata through the Data Catalog, process both batch and streaming data, and orchestrate jobs that scale automatically with your business needs. Each chapter blends real-world techniques with best practices to help you transform your organization’s data landscape into a governed, automated, and cost-effective architecture.

What sets this book apart?
Unlike high-level overviews, this guide provides a practical, hands-on framework for mastering AWS Glue within the broader AWS data ecosystem. Through detailed explanations and working examples, you’ll

How to build your first ETL pipeline with Glue Studio and PySpark.

Techniques for metadata management, schema evolution, and integration with Athena, Redshift, and Lake Formation.

Workflow orchestration with Glue Workflows, Triggers, and Dependencies.

Real-time and batch data processing strategies using Kinesis and Glue Streaming.

Serverless data lake architecture design, performance optimization, and cost control.

Advanced enterprise use cases, including multi-account deployments and metadata-driven pipelines.

Each section distills years of experience into actionable insights, ensuring that you can confidently design pipelines that are both scalable and production-ready. Whether you are building your first Glue job or leading a team deploying hundreds of workflows, this book gives you the clarity and technical depth to succeed.

If you want to move beyond manual ETL scripts and embrace the future of serverless data engineering, this book will show you how to make AWS Glue your most powerful ally. Turn the complexity of modern data pipelines into a seamless, automated process—and start building smarter, faster, and more resilient data systems today.

200 pages, Kindle Edition

Published October 6, 2025

About the author

Tony Bozeman

42 books

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
0 (0%)
4 stars
0 (0%)
3 stars
0 (0%)
2 stars
0 (0%)
1 star
0 (0%)
No one has reviewed this book yet.

Can't find what you're looking for?

Get help and learn more about the design.