Improving Deep Ensembles by Estimating Confusion Matrices

Bibliographic Details
Title: Improving Deep Ensembles by Estimating Confusion Matrices
Authors: Kuzin, Danil; Isupova, Olga; Reece, Steven; Simmons, Brooke D
Publication Year: 2025
Collection: Computer Science; Statistics
Subject Terms: Computer Science - Machine Learning, Statistics - Machine Learning
More Details: Ensembling in deep learning improves accuracy and calibration over single networks. The traditional aggregation approach, ensemble averaging, treats all individual networks equally by averaging their outputs. Inspired by crowdsourcing, we propose an aggregation method for deep ensembles, called soft Dawid Skene, that estimates the confusion matrices of ensemble members and weights the members according to their inferred performance. Soft Dawid Skene aggregates soft labels, in contrast to the hard labels often used in crowdsourcing. In extensive experiments, we show empirically that soft Dawid Skene outperforms ensemble averaging in accuracy, calibration, and out-of-distribution detection.
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2503.07119
Accession Number: edsarx.2503.07119
Database: arXiv
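
Illustrative sketch: the abstract contrasts plain ensemble averaging with an aggregation that estimates a confusion matrix per ensemble member. The Python sketch below shows one way that contrast might look. It is a simplified, point-estimate EM loop in the spirit of Dawid-Skene applied to soft labels, and it is not the inference procedure from the paper. The function names (`ensemble_average`, `soft_confusion_aggregate`), the `smoothing` and `n_iter` parameters, and the use of an expected log-likelihood for soft labels are all assumptions made for illustration.

```python
import numpy as np


def ensemble_average(probs):
    """Baseline: average the members' predicted probabilities.

    probs has shape (K, N, C): K members, N items, C classes.
    """
    return probs.mean(axis=0)


def soft_confusion_aggregate(probs, n_iter=20, smoothing=1e-2):
    """Hypothetical EM-style aggregation with per-member confusion matrices.

    Simplified assumption: each member's soft label contributes its
    expected log-likelihood under that member's confusion matrix.
    Returns (posterior over true classes, confusion matrices).
    """
    # Start from the plain ensemble average.
    q = probs.mean(axis=0)  # (N, C)

    for _ in range(n_iter):
        # M-step: class prior and row-normalised confusion matrices.
        prior = q.mean(axis=0)  # (C,)
        # conf[k, j, l]: weight of member k outputting class l
        # when the (inferred) true class is j.
        conf = np.einsum("ij,kil->kjl", q, probs) + smoothing
        conf /= conf.sum(axis=2, keepdims=True)

        # E-step: update the posterior over the true class of each item.
        log_q = np.log(prior + 1e-12) + np.einsum(
            "kil,kjl->ij", probs, np.log(conf)
        )
        log_q -= log_q.max(axis=1, keepdims=True)  # numerical stability
        q = np.exp(log_q)
        q /= q.sum(axis=1, keepdims=True)

    return q, conf


if __name__ == "__main__":
    # Synthetic member outputs, for demonstration only.
    rng = np.random.default_rng(0)
    logits = rng.normal(size=(3, 10, 4))
    probs = np.exp(logits) / np.exp(logits).sum(axis=2, keepdims=True)

    averaged = ensemble_average(probs)
    posterior, confusion = soft_confusion_aggregate(probs)
    print(averaged.shape, posterior.shape, confusion.shape)
```

In this sketch, a member whose confusion matrix is close to diagonal has more influence on the posterior than one whose rows are nearly uniform, which is the intuition behind weighting members by inferred performance rather than treating them equally.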