Calibrating ensembles for scalable uncertainty quantification in deep learning-based medical image segmentation