This book is written as a companion book to the Developing Data Products¹ Coursera class as part of the Data Science Specialization². However, if you do not take the class, the book mostly stands on its own. A useful component of the book is a series of YouTube videos³ that comprise the Coursera class.
The book is intended to be a low cost introduction to the important field of data products. The intended audience are students who are numerically and computationally literate, who would like to put those skills to use in Data Science. The book is offered for free as a series of markdown documents on github and in more convenient forms (epub, mobi) on LeanPub.
Brian Caffo is a professor in the Department of Biostatistics at the Johns Hopkins Bloomberg School of Public Health. He graduated from the Department of Statistics at the University of Florida in 2001, and from the Department of Mathematics at UF in 1995. His doctoral advisor was James G. Booth. He works in the fields of computational statistics and neuroinformatics and co-created the SMART working group. He has been the recipient of the Presidential Early Career Award for Scientists and Engineers, Johns Hopkins Bloomberg School of Public Health Golden Apple and AMTRA teaching awards.