We propose a general framework for topic-specific summarization of large text corpora, and illustrate how it can be used for analysis in two quite different contexts: an Occupational Safety and Health Administration (OSHA) database of fatality and catastrophe reports (to facilitate surveillance for patterns in circumstances leading to injury or death), and legal decisions on workers' compensation claims (to explore relevant case law). Our summarization framework, built on sparse classification methods, is a compromise between simple word frequency-based methods currently in wide use, and more heavyweight, model-intensive methods such as latent Dirichlet allocation (LDA). For a particular topic of interest (e.g., mental health disability, or carbon monoxide exposure), we regress a labeling of documents onto the high-dimensional counts of all the other words and phrases in the documents. The resulting small set of predictive phrases is then harvested as the summary. Using a branch-and-bound approach, this method can incorporate phrases of arbitrary length, which allows for potentially rich summarization. We discuss how focus on the purpose of the summaries can inform choices of tuning parameters and model constraints. We evaluate this tool by comparing the computational time and summary statistics of the resulting word lists to three other methods in the literature. We also present a new R package, **textreg**. Overall, we argue that sparse methods have much to offer in text analysis and are a branch of research that deserves further attention in this context. © 2016 Wiley Periodicals, Inc. Statistical Analysis and Data Mining: The ASA Data Science Journal, 2016
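The core idea can be illustrated with a minimal numpy sketch (synthetic counts and phrases; this is not the authors' branch-and-bound **textreg** implementation): run an L1-penalized regression of document labels on phrase-count columns, then harvest the phrases with positive coefficients as the summary.

```python
import numpy as np

def lasso_ista(X, y, lam, steps=2000):
    """Proximal-gradient (ISTA) solver for 0.5/n * ||y - Xb||^2 + lam * ||b||_1."""
    n, p = X.shape
    L = np.linalg.norm(X, 2) ** 2 / n          # Lipschitz constant of the gradient
    b = np.zeros(p)
    for _ in range(steps):
        grad = X.T @ (X @ b - y) / n
        z = b - grad / L
        b = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)   # soft-threshold
    return b

# Toy document-by-phrase count matrix: 6 documents, 4 hypothetical phrases.
phrases = ["carbon monoxide", "confined space", "ladder", "forklift"]
X = np.array([[3, 2, 0, 0],
              [2, 1, 0, 0],
              [4, 3, 0, 1],
              [0, 0, 2, 3],
              [0, 1, 3, 2],
              [0, 0, 1, 4]], dtype=float)
y = np.array([1., 1., 1., 0., 0., 0.])         # 1 = document labeled on-topic

b = lasso_ista(X, y, lam=0.05)
summary = [ph for ph, coef in zip(phrases, b) if coef > 0]
print(summary)
```

The L1 penalty drives most coefficients to exactly zero, so only the few phrases that genuinely predict the topic label survive into `summary`.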

Eigen-functions are of key importance in graph mining since they can be used to approximate many graph parameters, such as node centrality, epidemic threshold, and graph robustness, with high accuracy. As real-world graphs change over time, those parameters may change sharply as well. Taking a virus propagation network as an example, new connections between infected and susceptible people appear all the time, and some of the crucial infections may lead to a large decrease in the epidemic threshold of the network. As a consequence, the virus would spread through the network quickly. However, if we can keep track of the epidemic threshold as the graph structure changes, those crucial infections can be identified in a timely manner so that countermeasures can be taken proactively to contain the spread. In this paper, we propose two online eigen-function tracking algorithms which can effectively monitor those key parameters with linear complexity. Furthermore, we propose a general attribution analysis framework which can be used to identify important structural changes in the evolving process. In addition, we introduce an error estimation method for the proposed eigen-function tracking algorithms to estimate the tracking error at each time stamp. Finally, extensive evaluations are conducted to validate the effectiveness and efficiency of the proposed algorithms.
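A generic version of the update such trackers exploit (a sketch of the standard first-order perturbation argument, not the authors' exact algorithms): when an edge (i, j) is added to a symmetric adjacency matrix, the leading eigenvalue, which governs the epidemic threshold, shifts by approximately u^T ΔA u = 2 u_i u_j, where u is the leading eigenvector. This allows an incremental update instead of a full recomputation.

```python
import numpy as np

# Symmetric adjacency matrix of a small connected graph.
A = np.array([[0, 1, 1, 0, 0],
              [1, 0, 1, 0, 0],
              [1, 1, 0, 1, 0],
              [0, 0, 1, 0, 1],
              [0, 0, 0, 1, 0]], dtype=float)

vals, vecs = np.linalg.eigh(A)
lam0, u = vals[-1], vecs[:, -1]               # leading eigenpair

# Add edge (0, 4): ΔA has ones at (0, 4) and (4, 0).
i, j = 0, 4
lam_est = lam0 + 2 * u[i] * u[j]              # first-order estimate u^T ΔA u

A2 = A.copy()
A2[i, j] = A2[j, i] = 1.0
lam_true = np.linalg.eigh(A2)[0][-1]          # exact recomputation for comparison

print(round(lam_est, 3), round(lam_true, 3))
```

The estimate costs O(1) per edge given u, which is the kind of saving that makes linear-complexity online tracking feasible.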

We explore whether free agents in Major League Baseball meet the expectations set forth by newly signed contracts. The value and duration of these contracts are negotiated between the player (and his agent) and the signing team and are based primarily on the player's performance to date, projected future performance, and potential marketing value to the team. We develop two classes of models to explore this problem using a variety of regression- and tree-based machine learning algorithms. The market model uses player and team data to predict the market value of a player's performance (i.e., average contract salary). The performance model uses the same data to predict wins above replacement as a surrogate for overall player performance. We translate this measure into dollars using position-based conversion factors. Analysis of these models demonstrates that the performance model more consistently predicts and assesses player value with respect to their free agent contracts. Together, these models can be used to target or avoid free agents (or other players) whose performance-based value differs significantly from their market value.

Most work on predicting the outcome of basketball matches so far has focused on National Collegiate Athletic Association Basketball (NCAAB) games. Since NCAAB and professional (National Basketball Association, NBA) basketball have a number of differences, it is not clear to what degree these results can be transferred. We explore a number of different representations, training settings, and classifiers, and contrast their results on NCAAB and NBA data. We find that adjusted efficiencies work well for the NBA, the NCAAB regular season is not ideal for training to predict its post-season, the two leagues require classifiers with different bias, and Naïve Bayes predicts the outcome of NBA playoff series well.
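To make the classifier concrete, here is a minimal one-feature Gaussian Naïve Bayes sketch on synthetic data (the efficiency-margin feature and its distributions are hypothetical illustrations, not the paper's dataset):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical feature: home team's adjusted-efficiency margin over the away
# team, observed in past games (class 1 = home win, class 0 = home loss).
margin_win  = rng.normal(4.0, 5.0, 200)   # games the home team won
margin_loss = rng.normal(-3.0, 5.0, 200)  # games the home team lost

def gaussian_nb_predict(x, pos, neg):
    """One-feature Gaussian Naive Bayes with equal class priors."""
    def loglik(v, sample):
        return -0.5 * ((v - sample.mean()) / sample.std()) ** 2 - np.log(sample.std())
    return int(loglik(x, pos) > loglik(x, neg))

print(gaussian_nb_predict(6.0, margin_win, margin_loss))   # strong home edge
print(gaussian_nb_predict(-5.0, margin_win, margin_loss))  # strong away edge
```

Despite its independence assumption, this simple generative classifier is the one the authors found to predict NBA playoff series well.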

Analysis of dual sports performance typically involves observational techniques to gather data samples during actual competition. These techniques are limited by the amount of data that can be collected and the need to define the observable variables in advance. Today's advanced technologies have considerably overcome these limitations, enabling high-volume data collection for post-recording analysis. The present study was based on the three-dimensional kinematic data recorded by the automated ball-tracking Hawk-Eye system between the 2003 and 2008 seasons in elite tennis tournaments, which provided a database of 262 596 points. The analysis consisted of an examination of the relationships between the various characteristics of the serve summed up by the resulting ball trajectory and winning-point probabilities. The influence of factors such as serve speed, serve location, court surface, gender differences, and spin intensity on the winning-point rate was assessed to gain insight into efficient serve tendencies in world-class tennis. The implications for practitioners are highlighted and directions for future research in tennis performance analysis based on automatic ball tracking are proposed.

In this paper, we present two approaches to analyzing pass event data to uncover sometimes-nonobvious insights into the game of soccer. We illustrate the utility of our methods by applying them to data from the 2012–2013 La Liga season. We first show that teams are characterized by where on the pitch they attempt passes, and can be identified by their passing styles. Using heatmaps of pass locations as features, we achieved a mean accuracy of 87% in a 20-team classification task. We also investigated using pass locations over the course of a possession to predict shots. For this task, we achieved an area under the receiver operating characteristic (AUROC) of 0.785. Finally, we used the weights of the predictive model to rank players by the value of their passes. Unsurprisingly, Cristiano Ronaldo and Lionel Messi topped the rankings.
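A minimal sketch of the heatmap-as-feature idea on synthetic pass locations (the two teams, the unit-square pitch coordinates, and the nearest-centroid classifier are illustrative assumptions, not the paper's exact pipeline):

```python
import numpy as np

rng = np.random.default_rng(1)

def heatmap(xy, bins=4):
    """Normalized 2D histogram of pass locations on a unit pitch."""
    h, _, _ = np.histogram2d(xy[:, 0], xy[:, 1], bins=bins, range=[[0, 1], [0, 1]])
    return (h / h.sum()).ravel()

# Two hypothetical teams with distinct passing styles: team A passes high up
# the pitch, team B mostly in its own half. Five training matches each.
def matches(center, n_matches=5):
    return [rng.normal(center, 0.12, size=(300, 2)).clip(0, 1) for _ in range(n_matches)]

train_a, train_b = matches([0.7, 0.5]), matches([0.3, 0.5])
centroid_a = np.mean([heatmap(m) for m in train_a], axis=0)
centroid_b = np.mean([heatmap(m) for m in train_b], axis=0)

# Classify an unseen match by its nearest style centroid.
test_match = rng.normal([0.7, 0.5], 0.12, size=(300, 2)).clip(0, 1)
h = heatmap(test_match)
pred = "A" if np.linalg.norm(h - centroid_a) < np.linalg.norm(h - centroid_b) else "B"
print(pred)
```

Binning pass locations into a coarse grid turns each match into a fixed-length feature vector, which is what makes a 20-team classification task tractable.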

The seemingly unrelated regression (SUR) model is a generalization of a linear regression model consisting of more than one equation, where the error terms of these equations are contemporaneously correlated. The standard feasible generalized least squares (FGLS) estimator is efficient as it takes into account the covariance structure of the errors, but it is also very sensitive to outliers. The robust SUR estimator of Bilodeau and Duchesne (Canadian Journal of Statistics, 28:277–288, 2000) can accommodate outliers, but it is hard to compute. First, we propose a fast algorithm, FastSUR, for its computation and show its good performance in a simulation study. We then provide diagnostics for outlier detection and illustrate them on a real dataset from economics. Next, we apply our FastSUR algorithm in the framework of stochastic loss reserving for general insurance. We focus on the general multivariate chain ladder (GMCL) model that employs SUR to estimate its parameters. Consequently, this multivariate stochastic reserving method takes into account the contemporaneous correlations among run-off triangles and allows structural connections between these triangles. We plug our FastSUR algorithm into the GMCL model to obtain a robust version.

Voting on legislative bills to form new laws is a key function of most legislatures. Predicting the votes of such deliberative bodies leads to a better understanding of government policies and generates actionable strategies for social good. However, it is very difficult to predict legislative votes due to the myriad factors that affect the political decision-making process. In this paper, we present a novel prediction model that maximizes the use of publicly accessible heterogeneous data, i.e., bill text and lawmakers' profile data, to carry out effective legislative prediction. In particular, we design a probabilistic prediction model that achieves high consistency with past vote records while minimizing the uncertainty of the vote prediction, reflecting the firm legal ground often held by lawmakers. In addition, the proposed legislative prediction model enjoys the following properties: an inductive and analytical solution, the ability to predict votes on new bills and by new legislators, and robustness to missing votes. We conduct an extensive empirical study using real legislative data from the joint sessions of the United States Congress and compare with other representative methods from both the quantitative political science and data mining communities. The experimental results corroborate that the proposed method provides superior prediction accuracy with a visible performance gain.

Probabilistic forecasts are becoming more and more available. How should they be used and communicated? What are the obstacles to their use in practice? We review experience with five problems where probabilistic forecasting played an important role. This leads us to identify five types of potential users: low stakes users, who do not need probabilistic forecasts; general assessors, who need an overall idea of the uncertainty in the forecast; change assessors, who need to know if a change is out of line with expectations; risk avoiders, who wish to limit the risk of an adverse outcome; and decision theorists, who quantify their loss function and perform the decision-theoretic calculations. This suggests that it is important to interact with users and consider their goals. Cognitive research tells us that calibration is important for trust in probability forecasts and that it is important to match the verbal expression with the task. The cognitive load should be minimized, reducing the probabilistic forecast to a single percentile if appropriate. Probabilities of adverse events and percentiles of the predictive distribution of quantities of interest often seem to be the best way to summarize probabilistic forecasts. Formal decision theory has an important role but in a limited range of applications.

In real life, many important datasets are not publicly accessible due to various reasons, including privacy protection and maintenance of business competitiveness. However, knowledge discovery and pattern mining on these datasets can bring enormous benefits to both the data owner and external entities. In this paper, we propose a novel solution for this task, based on Markov chain Monte Carlo (MCMC) sampling of frequent patterns. Instead of returning all the frequent patterns, the proposed paradigm sends back a small set of randomly selected patterns so that the confidentiality of the dataset can be maintained. Our solution also allows interactive sampling, so that the sampled patterns can fulfill the user's requirements effectively. We show experimental results from several real-life datasets to validate the capability and usefulness of our solution. In particular, we show examples in which, by using our proposed solution, an eCommerce marketplace can allow pattern mining on user session data without disclosing the data to the public; such a mining paradigm can help the sellers in the marketplace, which eventually can boost the market's own revenue.
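A minimal sketch of MCMC sampling over frequent patterns (a toy transaction database and a plain Metropolis-Hastings walk targeting the uniform distribution over frequent itemsets; the paper's sampler and its interactive features may differ):

```python
import random

transactions = [{"a", "b", "c"}, {"a", "b"}, {"a", "c", "d"},
                {"b", "c"}, {"a", "b", "c", "d"}]
items = sorted({i for t in transactions for i in t})
MINSUP = 2

def support(itemset):
    """Number of transactions containing the itemset."""
    return sum(itemset <= t for t in transactions)

def neighbors(state):
    """Frequent itemsets reachable by adding or removing one item."""
    out = []
    for i in items:
        nxt = state | {i} if i not in state else state - {i}
        if nxt and support(nxt) >= MINSUP:
            out.append(frozenset(nxt))
    return out

random.seed(3)
state = frozenset({"a"})
samples = []
for _ in range(3000):
    cand = random.choice(neighbors(state))
    # Metropolis-Hastings correction for unequal neighborhood sizes makes the
    # stationary distribution uniform over all frequent itemsets.
    if random.random() < len(neighbors(state)) / len(neighbors(cand)):
        state = cand
    samples.append(state)

print(sorted({tuple(sorted(s)) for s in samples[500:]}))
```

The data owner only ever releases the sampled patterns, never the transactions themselves, and biasing the target distribution instead of using a uniform one is one way such a sampler could be made interactive.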

This paper brings explicit considerations of distributed computing architectures and data structures into the rigorous design of Sequential Monte Carlo (SMC) methods. A theoretical result established recently by the authors shows that adapting interaction between particles to suitably control the effective sample size (ESS) is sufficient to guarantee stability of SMC algorithms. Our objective is to leverage this result and devise algorithms which are thus guaranteed to work well in a distributed setting. We make three main contributions to achieve this. First, we study mathematical properties of the ESS as a function of matrices and graphs that parameterize the interaction among particles. Secondly, we show how these graphs can be induced by tree data structures which model the logical network topology of an abstract distributed computing environment. Finally, we present efficient distributed algorithms that achieve the desired ESS control, perform resampling and operate on forests associated with these trees. © 2015 Wiley Periodicals, Inc. Statistical Analysis and Data Mining: The ASA Data Science Journal, 2015
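The ESS that drives the adaptive interaction is a standard quantity for weighted particle systems; a minimal, numerically stable sketch of its computation (generic, not the paper's distributed implementation):

```python
import numpy as np

def ess(log_weights):
    """Effective sample size 1 / sum(w_i^2) of normalized importance weights,
    computed from log-weights with the max subtracted for stability."""
    w = np.exp(log_weights - log_weights.max())
    w /= w.sum()
    return 1.0 / np.sum(w ** 2)

rng = np.random.default_rng(4)
uniform = np.zeros(100)            # equal weights -> ESS equals N
skewed = rng.normal(0, 3, 100)     # highly uneven weights -> ESS collapses
print(ess(uniform), ess(skewed))
```

ESS ranges from 1 (one particle carries all the weight) to N (perfectly even weights); SMC schemes typically trigger resampling or particle interaction once it falls below a threshold such as N/2.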

Linear regression models depend directly on the design matrix and its properties. Techniques that efficiently estimate model coefficients by partitioning rows of the design matrix are increasingly becoming popular for large-scale problems because they fit well with modern parallel computing architectures. We propose a simple measure of *concordance* between a design matrix and a subset of its rows that estimates how well a subset captures the variance-covariance structure of a larger data set. We illustrate the use of this measure in a heuristic method for selecting row partition sizes that balance statistical and computational efficiency goals in real-world problems.
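The paper's concordance measure is its own construction; purely as an illustration of the underlying question, one simple proxy is the relative Frobenius distance between the covariance matrix of a row subset and that of the full design matrix, which shrinks as the subset grows:

```python
import numpy as np

rng = np.random.default_rng(5)

# Full design matrix with correlated columns.
n, p = 20000, 5
L = rng.normal(size=(p, p))
X = rng.normal(size=(n, p)) @ L

def cov_discrepancy(X, rows):
    """Relative Frobenius distance between subset and full-sample covariance."""
    C_full = np.cov(X, rowvar=False)
    C_sub = np.cov(X[rows], rowvar=False)
    return np.linalg.norm(C_sub - C_full) / np.linalg.norm(C_full)

small = cov_discrepancy(X, rng.choice(n, 100, replace=False))
large = cov_discrepancy(X, rng.choice(n, 5000, replace=False))
print(round(small, 3), round(large, 3))
```

A partition-size heuristic in this spirit would grow the subset until the discrepancy falls below a tolerance, trading statistical fidelity against per-partition compute.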

Clinical trials often lack power to identify rare adverse drug events (ADEs) and therefore cannot address the threat rare ADEs pose, thus motivating the need for new ADE detection techniques. Emerging national patient claims and electronic health record databases have inspired post-approval early detection methods like the Bayesian self-controlled case series (BSCCS) regression model. Existing BSCCS models do not account for multiple outcomes, where pathology may be shared across different ADEs. We integrate a pathology hierarchy into the BSCCS model by developing a novel informative hierarchical prior linking outcome-specific effects. Considering shared pathology drastically increases the dimensionality of the already massive models in this field. We develop an efficient method for coping with the dimensionality expansion by reducing the hierarchical model to a form amenable to existing tools. Through a synthetic study we demonstrate decreased bias in risk estimates for drugs when using conditions with different true risk and unequal prevalence. We also examine observational data from the MarketScan Lab Results dataset, exposing the bias that results from aggregating outcomes, as previously employed to estimate risk trends of warfarin and dabigatran for intracranial hemorrhage and gastrointestinal bleeding. We further investigate the limits of our approach by using extremely rare conditions. This research demonstrates that analyzing multiple outcomes simultaneously is feasible at scale and beneficial.

How can we correlate the neural activity in the human brain as it responds to typed words with properties of these terms (like ‘edible’, ‘fits in hand’)? In short, we want to find latent variables that jointly explain both the brain activity and the behavioral responses. This is one of many settings of the *Coupled Matrix-Tensor Factorization* (CMTF) problem.

Can we enhance *any* CMTF solver, so that it can operate on potentially very large datasets that may not fit in main memory? We introduce Turbo-SMT, a meta-method capable of doing exactly that: it boosts the performance of *any* CMTF algorithm, parallelizing it with speedups of up to *65-fold*, while producing sparse and interpretable solutions. Additionally, we improve upon ALS, the work-horse algorithm for CMTF, with respect to efficiency and robustness to missing values.

We apply Turbo-SMT to BrainQ, a dataset consisting of a (nouns, brain voxels, human subjects) tensor and a (nouns, properties) matrix, with coupling along the nouns dimension. Turbo-SMT is able to find meaningful latent variables, as well as to predict brain activity with competitive accuracy. Finally, we demonstrate the generality of Turbo-SMT by applying it to a FACEBOOK dataset (users, ‘friends’, wall-postings); there, Turbo-SMT spots spammer-like anomalies.