Hierarchical biomarker thresholding: a model-agnostic framework for stability
By: O. Debeaupuis
Potential Business Impact:
Fixes how computers judge health tests.
Many biomarker pipelines require patient-level decisions aggregated from instance-level (cell/patch) scores. Thresholds tuned on pooled instances often fail across sites due to hierarchical dependence, prevalence shift, and score-scale mismatch. We present a selection-honest framework for hierarchical thresholding that makes patient-level decisions reproducible and more defensible. At its core is a risk decomposition theorem for selection-honest thresholds. The theorem separates contributions from (i) internal fit and patient-level generalization, (ii) operating-point shift reflecting prevalence and shape changes, and (iii) a stability term that penalizes sensitivity to threshold perturbations. The stability component is computable via patient-block bootstraps mapped through a monotone modulus of risk. This framework is model-agnostic, reconciles heterogeneous decision rules on a quantile scale, and yields monotone-invariant ensembles and reportable diagnostics (e.g. flip-rate, operating-point shift).
Similar Papers
Higher-Order Network Structure Inference: A Topological Approach to Network Selection
Social and Information Networks
Finds the best way to connect ideas in data.
Joint Score-Threshold Optimization for Interpretable Risk Assessment Under Partial Supervision
Machine Learning (CS)
Improves doctor's risk scores for patients.
Sequential Testing for Assessing the Incremental Value of Biomarkers Under Biorepository Specimen Constraints with Robustness to Model Misspecification
Methodology
Finds better cancer tests using fewer samples.