Plackett-Luce (PL) models are used as a sensitivity analysis complementary to the critical difference plots. Each dataset acts as a “rater” providing a complete ranking over all models (“items”).
KM and NEL baselines are excluded as they are uninformative for model comparison.
We refer to the PlackettLuce package for more details.
Full Model
Subgroup Analysis
Separate PL models are fit for subgroups based on dataset characteristics. Likelihood ratio tests compare the combined log-likelihood of subgroup-specific models against the full model.
PH Assumption Violation
Censoring Proportion
Sample Size to Feature Ratio (n/p)
PL Trees
PL trees use model-based recursive partitioning to automatically detect dataset subgroups where model rankings differ significantly.










