Jump to ratings and reviews
Rate this book

Statistical Foundations of Data Science

Rate this book
Statistical Foundations of Data Science gives a thorough introduction to commonly used statistical models, contemporary statistical machine learning techniques and algorithms, along with their mathematical insights and statistical theories. It aims to serve as a graduate-level textbook and a research monograph on high-dimensional statistics, sparsity and covariance learning, machine learning, and statistical inference. It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis. It then introduces multiple linear regression and expands the techniques of model building via nonparametric regression and kernel tricks. It provides a comprehensive account on sparsity explorations and model selections for multiple regression, generalized linear models, quantile regression, robust regression, hazards regression, among others. High-dimensional inference is also thoroughly addressed and so is feature screening. The book also provides a comprehensive account on high-dimensional covariance estimation, learning latent factors and hidden structures, as well as their applications to statistical estimation, inference, prediction and machine learning problems. It also introduces thoroughly statistical machine learning theory and methods for classification, clustering, and prediction. These include CART, random forests, boosting, support vector machines, clustering algorithms, sparse PCA, and deep learning.

774 pages, Hardcover

Published August 17, 2020

2 people are currently reading
18 people want to read

About the author

Jianqing Fan

23 books2 followers

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
0 (0%)
4 stars
0 (0%)
3 stars
1 (50%)
2 stars
0 (0%)
1 star
1 (50%)
Displaying 1 - 2 of 2 reviews
4 reviews
May 5, 2025
Terrible intuition given. Just equation after equation. Did not like. Did not finish.
Profile Image for smoh  cat.
45 reviews1 follower
April 28, 2022
“Things I wish are true but are rarely true in practice. “ This book has various typos and is extremely difficult to read. Nevertheless this is a much needed book so when I do machine learning I’m aware of every assumption I’m making.
Displaying 1 - 2 of 2 reviews

Can't find what you're looking for?

Get help and learn more about the design.