Doug Lautzenheiser

In the simplest form of random sampling, you give all samples in the population equal probabilities of being selected. For example, you randomly select 10% of the population, giving all members of this population an equal 10% chance of being selected. The advantage of this method is that it’s easy to implement. The drawback is that rare categories of data might not appear in your selection. Consider the case where a class appears only in 0.01% of your data population. If you randomly select 1% of your data, samples of this rare class are unlikely to be selected. Models trained on this selection ...
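
The rare-class drawback described in the passage can be illustrated with a minimal sketch. It is not taken from the book: the population size (10,000 labels with a single "rare" item, i.e. 0.01%), the helper names `simple_random_sample` and `rare_hit_rate`, and the seeds are all assumptions chosen for illustration.

```python
import random


def simple_random_sample(population, fraction, rng):
    """Simple random sampling without replacement: every item is equally likely."""
    k = round(len(population) * fraction)
    return rng.sample(population, k)


def rare_hit_rate(population, fraction, trials, seed=0):
    """Fraction of repeated draws that contain at least one 'rare' item."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        if "rare" in simple_random_sample(population, fraction, rng):
            hits += 1
    return hits / trials


# Hypothetical population: 9,999 common labels and 1 rare label (0.01%).
population = ["common"] * 9_999 + ["rare"]

# One 1% draw: 100 items, which may well contain no rare item at all.
sample = simple_random_sample(population, fraction=0.01, rng=random.Random(42))
rare_count = sample.count("rare")
print(f"sample size: {len(sample)}, rare items in sample: {rare_count}")

# Repeat the draw to estimate how often the rare class shows up.
hit_rate = rare_hit_rate(population, fraction=0.01, trials=1_000)
print(f"draws containing the rare class: {hit_rate:.1%}")
```

Under these assumptions, each 1% draw picks 100 of the 10,000 items, so the single rare item has a 100/10,000 = 1% chance of being included in any one draw. The estimated rate printed by the sketch should land near that value, though the exact figure depends on the seed.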
Designing Machine Learning Systems: An Iterative Process for Production-Ready Applications