Presenting the much-anticipated Second Edition, which boasts nearly 40 percent new and revised coverage, reflecting the latest best practices This unparalleled tutorial approach covers everything from planning the data warehouse project to implementing the design and managing the data warehouse environment The Kimball Group has streamlined the lifecycle methodology to be more efficient and user-friendly based on their thousands of hours of experience in both consulting and training They have also revised various lifecycle topics, including dimensional modeling, data warehouse architecture, ETL, and Business Intelligence New sections at the end of every process and techniques chapter feature coverage of managing the effort and reducing risk, assuring quality, estimating considerations, Web site resources, and more
I believe I bought this book a week or two after it came out (January 2008) with the intention of reading it immediately. Unfortunately, life happens and I have no idea what came up but I’m sure it was good. And, to be honest, I’m somewhat happy that whatever it was did come up because this was a good time for me to read through this.
I started reading this book in 20 August 2011 and finished it 11 September 2011. Oddly enough, I started this just prior to being told that the data warehousing project that we are planning at my current employer may be coming along a bit more quickly than originally thought. As such, I’m ramping up on some refresher material and this was perfect for that need. (I’m also thinking that I’ll be refreshing my SSRS / SSAS and that I might take a stab at the MCITP 70-452 Business Intelligence Developer exam)
As some background information, I have read several other Kimball Group books, including:
The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling (Second Edition) which is thoroughly dog-eared and highlighted; a rather rare occurrence for me. The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming and Delivering Data The Microsoft Data Warehouse Toolkit: With SQL Server 2005 and the Microsoft Business Intelligence Toolset.
If you have read more than one of these books you will realize that there is a lot of material that overlaps between them but that each has its’ own focus. The focus of the Lifecycle Toolkit seemed to me to be on the project manager, or at least on giving an overview of the process as a whole. It seemed to me to be more of a “whet the appetite” with the ability to dive in more expansively through some of the other books referenced above. What The Data Warehouse Lifecycle Toolkit provided more than any of the others listed above was to give a good list of all of the tasks that will be required (or at least recommended) to complete a Data Warehouse project. I believe that a good portion of the text was set aside to drive home how much work is really involved and that this is not a trivial task. It continuously mentioned brining in the business, and having the project manager read through what was required in each of the major steps so that this could be conveyed back up the chain.
With that said, this book was a somewhat generic introduction to the topic at hand and no one topic was pursued in depth to the point that you would have all of the knowledge necessary to be an expert. However, it provided lots of great information relating to the relevant portions of the project, from preparing the project definition, to gathering the key players, to what is really necessary for the business requirements and how you can begin to put them together, etc. There is also lots of information relating to the dimensional modeling, the ETL subsystems, data profiling, data quality and the B.I. presentation layers.
As such, I’d recommend The Data Warehouse Lifecycle Toolkit to anyone looking to get involved in a Data Warehouse Project, particularly if you are going to be expected to be in a leadership role on the project. If you are to be involved as a data modeler or an ETL Developer and are a bit strapped for time it might be more advantageous to skip this volume and go straight to The Data Warehouse Toolkit followed by The Data Warehouse ETL Toolkit. (Alternately, you can start with the Microsoft Toolkit if that is the technology stack you will be using).
Kimball is considered one of the fathers of the data warehousing movement and among many consulting firms is considered the de-facto representation of reporting database infrastructure. This specific book details on a higher-level the step-by-step processes required for a successful implementation focusing on the business users and requirements, while laying common pitfalls in the ETL, data warehouse infrastructure and application-level. For IS professionals, I would consider this a must read and among BI consultants crucial in understanding the field.
I would consider this a more critical read than even Tufte and Few, who are considered the guru's of data visualization movement. The reason being that the infrastructure of any reporting effort underlies these later front-end developments. A most noted aspect of this book is it's heavy emphasis on the business need, the requirements for reporting and making sure you have strong executive support. This is too be expected when dealing with an abstract concept like data analysis and reporting, which is often strongly correlated with interpretation and the ability to act. As one can state, the ability to utilize this infrastructure is based in as much as the culture as the ability of those end-users to make the most of the tool.
I would recommend this to any aspiring IS professional as a theoretical foundation to their career and would even state that this is more important, from a management perspective, than knowing the actual data warehousing elements (which are important in and of themselves).
This book was the textbook of my Data Warehousing class in the Cal State Fullerton MS in Information Technology program.
I can't say I cared much for some of the books we had in the program but this book was a notable exception. The book was great at explaining the steps and processes associated with data warehouse implementation using the Kimball lifecycle methodology. It also provided great insights into the program management elements necessary to successfully complete medium to large size data warehouse projects.
Read chapter 6 and 7 (partly). These two were the only ones relevant for my current job. I have to say that if the reader hasn't got any hands-on experience with building datawarehouse databases this book (chapter 6) has way too few examples. You are required to deduct too much by yourself. I would prefer the author going slower and explaining in more detail.
An exhaustive and exhausting read. This book covers all of the bases. I am not a beginner in this subject, and I still learned a lot. That being said a different editor could have shortened this book by 100 pages without losing any information.
Excellent Introduction to Data Warehousing. I have read this book and reference it. I would recommend using these principles with some of the methods Bill Inmon teaches.
Recommended by client. For those informally trained in data structure, this is the book you need to put theory with what you're likely practicing professionally already.