ElvinOuyang’s Kindle Notes & Highlights

Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking, by Foster Provost

The key is to realize that there was nothing special about the first training/test split we made. Let’s say we are saving the test set for a final assessment. We can take the training set and split it again into a training subset and a testing subset. Then we can build models on this training subset and pick the best model based on this testing subset. Let’s call the former the sub-training set and the latter the validation set for clarity. The validation set is separate from the final test set, on which we are never going to make any modeling decisions. This procedure is often called nested ...more

nested holdout testing to achieve best complexity before running on test set to build model

See ElvinOuyang’s 35 notes & 181 highlights

Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking

by Foster Provost

Rate this book

Clear rating

1 of 5 stars 2 of 5 stars 3 of 5 stars 4 of 5 stars 5 of 5 stars

Open Preview

ElvinOuyang’s Kindle Notes & Highlights

Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking, by Foster Provost

See a Problem?

Preview — Data Science for Business by Foster Provost

ElvinOuyang’s Kindle Notes & Highlights Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking, by Foster Provost

See a Problem?

Preview — Data Science for Business by Foster Provost

ElvinOuyang’s Kindle Notes & Highlights

Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking, by Foster Provost