Even if only a small percentage of backend calls are slow, the chance of hitting at least one slow call grows when an end-user request requires multiple backend calls, so a higher proportion of end-user requests end up being slow.
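If your service makes 10 calls to a backend, your tp99 is roughly their tp99.9: each call has a 0.1% chance of exceeding the backend's tp99.9, so about 1% of your requests will include at least one such call.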
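The arithmetic behind that rule of thumb can be sketched in a few lines. This is a minimal illustration that assumes the backend calls are independent and equally likely to be slow, which real systems only approximate; the function name `prob_any_slow` is made up for this example.

```python
def prob_any_slow(p_slow: float, n_calls: int) -> float:
    """Probability that at least one of n_calls is slow.

    Assumes calls are independent and each is slow with probability p_slow.
    """
    return 1.0 - (1.0 - p_slow) ** n_calls


# 10 calls, each with a 0.1% chance of exceeding the backend's tp99.9:
# close to 1% of end-user requests see at least one such call.
print(f"{prob_any_slow(0.001, 10):.4%}")

# The effect grows quickly with fan-out.
for n in (1, 10, 50, 100):
    print(n, f"{prob_any_slow(0.001, n):.2%}")
```

If slow calls are correlated (for example, the backend is slow for everyone during a GC pause), the real figure will differ, so treat this as a first estimate rather than a prediction.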
Time that work spends queued in your thread pools, waiting for a free thread, can also be a big problem, and it is worth measuring.
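One way to measure thread pool wait time is to record a timestamp when a task is submitted and compare it with the time the task actually starts running. The sketch below does this with Python's standard `concurrent.futures.ThreadPoolExecutor`; the helper names `submit_timed` and `measure_demo`, and the pool and task sizes, are assumptions chosen for illustration, not part of any particular service.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def submit_timed(pool, fn, *args, **kwargs):
    """Submit fn to pool; the future resolves to (queue_wait_seconds, result)."""
    enqueued = time.perf_counter()

    def wrapper():
        # Time between submission and a worker thread picking the task up.
        wait = time.perf_counter() - enqueued
        return wait, fn(*args, **kwargs)

    return pool.submit(wrapper)


def measure_demo(workers=2, tasks=6, work_seconds=0.05):
    """Oversubscribe a small pool and return each task's queue wait."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [
            submit_timed(pool, time.sleep, work_seconds) for _ in range(tasks)
        ]
        return [f.result()[0] for f in futures]


waits = measure_demo()
# Later tasks wait longer because every worker is busy when they are submitted.
print([round(w, 3) for w in waits])
```

In a real service you would more likely feed these wait times into your metrics system as a histogram, so that queueing delay shows up next to backend latency in the same percentile views.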