This is the complete course for anybody who wants to get to grips with Talend Open Studio for Data Integration. From the basics of transferring data to complex integration processes, it will give you a head start. In Detail Talend Open Studio for Data Integration (TOS) is an open source graphical development environment for creating custom integrations between systems. It comes with over 600 pre-built connectors that make it quick and easy to connect databases, transform files, load data, move, copy and rename files and connect individual components in order to define complex integration processes. "Getting Started with Talend Open Studio for Data Integration" illustrates common uses and scenarios in a simple, practical manner and, building on knowledge as the book progresses, works towards more complex integration solutions. TOS is a code generator and so does a lot of the "heavy lifting" for you. As such, it is a suitable tool for experienced developers and non-developers alike. You'll start by learning how to construct some common integrations tasks – transforming files and extracting data from a database, for example. These building blocks form a "toolkit" of techniques that you will learn how to apply in many different situations. By the end of the book, once complex integrations will appear easy and you will be your organization's integration expert! Best of all, TOS makes integrating systems fun! What you will learn from this book Approach "Getting Started with Talend Open Studio for Data Integration" takes a step-by-step, hands-on approach to learning with lots of examples and clear instructions. Who this book is written for Are you a developer, business analyst, project manager, business intelligence specialist, system architect or a consultant who needs to undertake integration projects, then this book is for you. The book assumes a certain level of familiarity with Relational database management systems with SQL and experience and Java.
A well-structured and easy to read introduction into Talend Open Studio. The book does not go super deep into the technical details but nevertheless it explains the most important concepts like contexts, job management and packaging for deployment. Surely you can find all this information in tutorials online but the book presents it to you in a curated and well-written form so you save the time searching the web.
Good as introduction to Talend concepts. A bit shallow but the samples are ok and as the code is provided its quite good. Even if it is an older version of Talend it is still very relevant to the current version 8. Using XML files is a bit outdated, I would say as other formats have overtaken. A REST API interface is also not covered as in most business areas this becomes the standard - so a newer version of the book should cover that.
Hi, I'd like to recommend "Getting Started with Talend Open Studio for Data Integration" by Jonathan Bowen over at www.packtpub.com for people wanting to do data integration with Talend 5.x!
Jonathan did a very good job in his approach to teaching you how to use Talend, which is, by the way, free data integration software. Instead of creating a reference manual, which is sometimes handy, he created a walk-through of most of the processes you would normally use to get data from point A to point B in many well thought-out examples. As I was reading and working through the chapters, since I had already become familiar with many of the processes in Talend, I found myself thinking "Jonathan should have include information about ....", but soon I was reading where he had indeed included such information a little further into the book. I learned several things I did not already know and learned from better examples than I could have created. The book includes downloadable data that you will need to complete the exercises and the instructions to load that data is in Appendix A.
One thing to be forewarned about: if you just want to learn by reading, you may be disappointed in this book. There are not enough screen captures for all the jobs presented to let you just read through as if you were watching a video, but if you actually work through the examples, you will like the concise nature of the steps given. The author is generous with warnings and explanations, so you should not get lost anywhere as you work through each chapter. Some examples do build on prior examples and, so, you should take things in order. There are a couple of things I wished the author had included that I have found useful in my Talend jobs, but had to learn them on my own. One is job logging and the other is error handling; Talend has very good features built in for both, but maybe he considered those topics more advanced than a beginner would want. Overall, I was very please with this excellent tutorial. It will get you up an running with Talend quickly, especially if you are starting with no experience with Talend at all.
As an ETL practitioner I can also see the rise in OSS use, ETL use in general, but no significant changes in how ETL is done or its tooling.
In short, getting this book for free for review from Packt was an awesome deal, big thank you to their awesome team, I enjoyed reading this book from cover to cover.
As a side note, I am a big believer in cookbooks (or in other words quick recipes). These are the books that you can read and re-read, and then keep at your desk at work. I am sure I will revisit it.
Another note is this book is good to read even to those developers who do not use Talend, even SSIS folks will benefit from thoughtful advices given by Rick as the principals described in the book apply to about any ETL process.
OK, now more about the book: my $0.02 whether you use Talend or Informatica – you have to know Java. It is simply because you will not be efficient enough or will not complete an assignment. The book uses quite a bit of Java code.
But do not be intimidated, the book has a relatively low entry barrier.
Even better, the book in my view has a complete coverage letting an ETL developer to properly tackle any given task. Specifically, it walks you thru from designing a simple job to troubleshooting common issues and also covers best approaches.
The book has a short task scenario overview with a fully outlined solution steps as it should be in any cookbook!
It even has common pitfalls resolutions covered for you so the author made sure you are not going to stuck with the IDE designing ETL with Talend.
Also, all the new and web technologies are covered as working with REST and SOAP, XML and so forth.
To sum up, this book receives 5+ out of 5 from me (a very demanding person).
Thank you Rick for putting so much effort in wiring a comprehensive book with so much accompanying code.
Getting Started with Talend Open Studio for Data Integration is a thorough introduction to TOS and can be considered a replacement to Talend's own introductory documentation. The book will take you from installation through the basics to some real-world examples, and covers enough to get you up to speed quickly.
Perhaps because the book is under Packt's open source label, there is not a lot of material on the paid-for software that Talend offers. Being built on top of the open source studio, it addresses several topics that TOS doesn't, including version control and scheduling. This book does cover these subjects but only on a very high level. I think a mention of this in an appendix would be enough. Additionally there isn't even a brief mention of customization through routines, custom components, or the tJava component. Because this is an introduction, again a brief mention of these things in an appendix (perhaps named Where to go From Here) would whet the appetite of those users who would be so inclined.
This book is not a thorough coverage of everything that TOS can do, and that is fine. Even someone who's very experienced with Talend products will learn a thing or two.
Everything considered, I am recommending you purchase this book if you are new to TOS or even if you are curious about a modern ETL tool.