So what's the goal? We're given m examples xi drawn independently from the distribution D, and for each xi, we're given f(xi); that is, we're told whether each of our examples is or isn't grammatical. Using this, we want to output a hypothesis language h such that