This second edition of Data Management Using Stata focuses on tasks that bridge the gap between raw data and statistical analysis. It has been updated throughout to reflect new data management features that have been added over the last 10 years. Such features include the ability to read and write a wide variety of file formats, the ability to write highly customized Excel files, the ability to have multiple Stata datasets open at once, and the ability to store and manipulate string variables stored as Unicode. Further, this new edition includes a new chapter illustrating how to write Stata programs for solving data management tasks. As in the original edition, the chapters are organized by data management reading and writing datasets, cleaning data, labeling datasets, creating variables, combining datasets, processing observations across subgroups, changing the shape of datasets, and programming for data management. Within each chapter, each section is a self-contained lesson illustrating a particular data management task (for instance, creating date variables or automating error checking) via examples. This modular design allows you to quickly identify and implement the most common data management tasks without having to read background information first. In addition to the “nuts and bolts” examples, author Michael Mitchell alerts users to common pitfalls (and how to avoid them) and provides strategic data management advice. This book can be used as a quick reference for solving problems as they arise or can be read as a means for learning comprehensive data management skills. New users will appreciate this book as a valuable way to learn data management, while experienced users will find this information to be handy and time saving―there is a good chance that even the experienced user will learn some new tricks.
Gostei bastante desse livro. Ele têm tudo que uma pessoa precisa para começar para trabalhar com stata: labelling, variable management, datasets operations, grouping operations e o básico de programação em stata, como loops e ado files.
O livro não é, porém, um guia compreensivo (no sentido anglicano do termo). Tudo é apresentado de forma bastante simples e sem aprofundamento. Mas é suficiente para iniciar o usuário no programa e o resto se resolve com statalist e stackoverflow. Para interessados em uma abordagem mais aprofundada e mais focada em econometria, outros manuais, como o do Cameron (Microeconometrics Using Stata), podem ser uma pedida mais interessante.
This book was absolutely wonderful for data management, especially dealing with missing values and observation, miss-classification tables and recorded variables. Furthermore, it has very clear language and well-organised. It is useful for those who are interesting to use STATA and statistical programming.