Skip to Main content Skip to Navigation
New interface
Conference papers

Statistical Comparison of Algorithm Performance Through Instance Selection

Abstract : Empirical performance evaluations, in competitions and scientific publications, play a major role in improving the state of the art in solving many automated reasoning problems, including SAT, CSP and Bayesian network structure learning (BNSL). To empirically demonstrate the merit of a new solver usually requires extensive experiments, with computational costs of CPU years. This not only makes it difficult for researchers with limited access to computational resources to test their ideas and publish their work, but also consumes large amounts of energy. We propose an approach for comparing the performance of two algorithms: by performing runs on carefully chosen instances, we obtain a probabilistic statement on which algorithm performs best, trading off between the computational cost of running algorithms and the confidence in the result. We describe a set of methods for this purpose and evaluate their efficacy on diverse datasets from SAT, CSP and BNSL. On all these datasets, most of our approaches were able to choose the correct algorithm with about 95% accuracy, while using less than a third of the CPU time required for a full comparison; the best methods reach this level of accuracy within less than 15% of the CPU time for a full comparison.
Document type :
Conference papers
Complete list of metadata
Contributor : Nathanaël Fijalkow Connect in order to contact the contributor
Submitted on : Thursday, October 6, 2022 - 2:13:25 PM
Last modification on : Monday, October 10, 2022 - 1:44:04 PM


Publisher files allowed on an open archive




Théo Matricon, Mario Anastacio, Nathanaël Fijalkow, Laurent Simon, Holger H Hoos. Statistical Comparison of Algorithm Performance Through Instance Selection. International Conference on Principles and Practice of Constraint Programming (CP 2021), Oct 2021, Montpellier, France. ⟨10.4230/LIPIcs.CP.2021.43⟩. ⟨hal-03800492⟩



Record views


Files downloads