The second approach to imitation learning in robotics is called “inverse reinforcement learning.” Instead of directly copying the driving decisions a human makes in response to a picture of the road, what if an AI system first tried to identify the intent behind those decisions?
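To make the idea of inferring intent concrete, here is a minimal sketch in Python. It is only one possible formulation (a maximum-entropy-style fit of a linear reward over a small fixed set of behaviors), not a description of any particular system. The behavior names, the two features (`progress`, `near_collision`), and the demonstration data are all hypothetical and chosen purely for illustration.

```python
import math

# Hypothetical candidate driving behaviors, each summarized by two
# illustrative features: (progress, near_collision).
CANDIDATES = {
    "steady": (1.0, 0.0),    # makes progress, keeps a safe distance
    "tailgate": (1.0, 1.0),  # same progress, but nearly collides
    "crawl": (0.3, 0.0),     # safe, but makes little progress
}

# Hypothetical human demonstrations: which behavior the driver chose each time.
DEMOS = ["steady", "steady", "steady", "crawl"]


def score(weights, features):
    """Linear reward: weighted sum of a behavior's features."""
    return sum(w * f for w, f in zip(weights, features))


def behavior_probs(weights):
    """Softmax over candidates: higher reward means more likely to be chosen."""
    scores = {name: score(weights, f) for name, f in CANDIDATES.items()}
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {name: math.exp(s - top) for name, s in scores.items()}
    total = sum(exps.values())
    return {name: e / total for name, e in exps.items()}


def infer_reward_weights(demos, steps=2000, lr=0.1):
    """Gradient ascent on the demonstrations' log-likelihood.

    The gradient is the average feature vector of the demonstrations minus
    the average feature vector the current reward would produce.
    """
    dim = 2
    expert_mean = [
        sum(CANDIDATES[d][i] for d in demos) / len(demos) for i in range(dim)
    ]
    weights = [0.0] * dim
    for _ in range(steps):
        probs = behavior_probs(weights)
        model_mean = [
            sum(p * CANDIDATES[name][i] for name, p in probs.items())
            for i in range(dim)
        ]
        weights = [
            w + lr * (e - m) for w, e, m in zip(weights, expert_mean, model_mean)
        ]
    return weights


weights = infer_reward_weights(DEMOS)
preferred = max(CANDIDATES, key=lambda name: score(weights, CANDIDATES[name]))
print("inferred reward weights (progress, near_collision):", weights)
print("behavior the inferred reward prefers:", preferred)
```

Note that the sketch never stores a rule of the form "in this situation, take this action." It recovers a reward that values progress and penalizes near-collisions, and the preferred behavior then follows from that reward.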