In “Eradicating Non-Determinism in Tests”, Martin Fowler argues that if you have flaky tests, you should track them down, and if you can’t fix them immediately, move them out of the main suite so you can work on them separately.
I agree with removing these from the test suite. Fowler's post suggests keeping these tests in "quarantine" but limiting the quarantine to a small number of tests, and forcing yourself to fix one before adding another beyond that limit. One hopes to get to zero eventually, but if a new flaky test pops up, fix it immediately or quarantine it. Keep the suite such that a failure means a regression, not just bad luck.
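
A capped quarantine could be sketched roughly like this. It is only an illustration of the idea, not anything taken from Fowler's post: the `Quarantine` class, `QuarantineFullError`, and the limit of 3 are all hypothetical names and values chosen for the example.

```python
# Hypothetical sketch of a size-capped quarantine for flaky tests.
# None of these names come from Fowler's post or from a real library.

QUARANTINE_LIMIT = 3  # assumed cap; "small" is whatever your team agrees on


class QuarantineFullError(Exception):
    """Raised when adding a test would push the quarantine past its cap."""


class Quarantine:
    def __init__(self, limit=QUARANTINE_LIMIT):
        self.limit = limit
        self.tests = set()

    def add(self, test_name):
        """Quarantine a flaky test, refusing if the cap is already reached."""
        if test_name in self.tests:
            return
        if len(self.tests) >= self.limit:
            raise QuarantineFullError(
                f"quarantine already holds {self.limit} tests; "
                f"fix one before adding {test_name!r}"
            )
        self.tests.add(test_name)

    def release(self, test_name):
        """Return a fixed test to the main suite."""
        self.tests.discard(test_name)

    def split(self, all_tests):
        """Return (gating, quarantined) lists, preserving input order."""
        gating = [t for t in all_tests if t not in self.tests]
        quarantined = [t for t in all_tests if t in self.tests]
        return gating, quarantined
```

Only the gating list would decide whether a build passes, so a red build keeps meaning a regression. The quarantined list can still run separately so the flaky tests stay visible while they are being fixed.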

