Improve your analytics and data platform to solve major challenges, including operationalizing big data and advanced analytics workloads on Azure. You will learn how to monitor complex pipelines, set alerts, and extend monitoring to meet your organization's custom requirements.
This book starts with an overview of Azure Data Factory as a hybrid ETL/ELT orchestration service on Azure. It then dives into the data movement and connectivity capabilities of Azure Data Factory. You will learn about its support for hybrid data integration from disparate sources, whether on-premises, in the cloud, or in SaaS applications. Detailed guidance is provided on how to transform data and on control flow. A demonstration of operationalizing pipelines and ETL with SSIS is included, and you will learn how to leverage Azure Data Factory to run existing SSIS packages. The book wraps up by showing how to create a single pane for end-to-end monitoring, which is a key skill in building advanced analytics and big data pipelines.
Who This Book Is For
Data engineers and big data developers. ETL (extract, transform, load) developers will also find the book useful for its demonstrations of various operations.
The book sits halfway between a technical book and what is known as a "cookbook". As a result, it cannot go into depth on topics of interest, nor does it convey how the platform should properly be operated. It has nothing on analytics solutions in a Big Data context. In fact, most of the guides are about relational databases.
I started out looking for ideas on operationalizing tasks through JSON, on how to create clusters efficiently, and/or on how to orchestrate solutions that work mainly with semi-structured data as their sources.
It may be useful for SSIS developers who are migrating their systems to cloud environments.