This book teaches the fundamental concepts and tools behind reporting modern data analyses in a reproducible manner. As data analyses become increasingly complex, the need for clear and reproducible report writing is greater than ever. The material for this book was developed as part of the industry-leading Johns Hopkins Data Science Specialization.
This ebook is a particularly useful guide for researchers of all kinds, with strong emphasis on how to systematically create reproducible reports. Reproducibility is a major key to effectively communicating our analyses and research findings to others so that anyone else can re-run or examine specific steps in our investigations to validate our results. And since nowadays most people have access to substantial computing power, free open-source statistical softwares, and cloud-based data infrastructures (e.g. Github), creating reproducible reports has never been easier. The author also provides a general introduction to R markdown using RStudio, which has become one of the most popular tools for creating reproducible reports.