Understanding and Optimizing Parallel Performance in Multi-tenant Cloud