The following table provides a brief overview of the measures used in this benchmark. Unfortunately, the identifiers used under the hood do not directly correspond to measure names and abbreviations as used in the paper.
ID refers to the shorthand used in the result files listed above.
mlr3 ID refers to the measure as it is implemented in mlr3proba
Label refers to the measure as it is named consistently throughout the paper and resulting plots.
ID
mlr3 ID
Label
harrell_c
surv.cindex
Harrell’s C
uno_c
surv.cindex
Uno’s C
isll
surv.isll
Integrated Survival Log-Likelihood (ISLL)
isll_erv
surv.isll
Integrated Survival Log-Likelihood (ISLL) [ERV]
isbs
surv.brier
Integrated Survival Brier Score (ISBS)
isbs_erv
surv.brier
Integrated Survival Brier Score (ISBS) [ERV]
dcalib
surv.dcalib
D-Calibration
alpha_calib
surv.calib_alpha
Van Houwelingen’s Alpha
Tasks
The following table gives a summary of the included datasets (tasks) in the benchmark.
This table shows the models (learners) used in the benchmark with their mlr3 IDs and additional metadata.
“Parameters” is 0 for learners such as KM, NA, CPH, which do not have any hyperparameters. It is also 0 for CoxBoost, which uses its own tuning method.
“Survival Prediction” indicates whether the learner provides a survival probability prediction (a distr object), which allows evaluation with measures like the ISBS.