What you want is to organize the data set. But in principle there are a huge number of ways to do so. Should you organize it by type, color, surface quality, or a combination of one, two, or n variables? Which is the best (or “right”) way to organize it? The “right” answer is the one that offers the deepest understanding, and it turns out that in this example the answer is by size.