Interpretable hierarchical symbolic regression for safety-critical systems with an application to highway crash prediction - CNRS - Centre national de la recherche scientifique Accéder directement au contenu
Article Dans Une Revue Engineering Applications of Artificial Intelligence Année : 2022

Interpretable hierarchical symbolic regression for safety-critical systems with an application to highway crash prediction

François Fouquet
  • Fonction : Auteur
  • PersonId : 1176036

Résumé

We introduce a framework to discover interpretable regression models for high-stakes decision making in the context of safety-critical systems. The core of our proposal is a multi-objective hierarchical symbolic regression algorithm able to compute cluster-specific rankings of regression models ordered by increasing complexity. We discover the hierarchical structure by clustering the features' importances of a post-hoc explainability framework (viz., SHAP) applied to a highly flexible predictive model (viz., XGBoost). We rely on a symbolic regression algorithm based on the simulated annealing meta-heuristic to infer sparse linear models which may include non-linear effects (e.g., log-transforms, multiplicative interactions...). This search is guided by two objectives: maximizing predictive performance and minimizing complexity. It ends on a list of Paretooptimal models that fosters a dynamic interpretative process: the user navigates from the least to the most complex model and decides the ones he can trust depending on whether he understands them, and whether he is satisfied by the quantified uncertainty of their parameters and predictions. Our approach achieves promising results when compared to more than ten other interpretable or black-box predictive models on eleven public regression datasets of various volumes, dimensionalities or domains, and on a proprietary dataset for highway crash prediction. On this last dataset, we demonstrate the usefulness of our new ranking-by-complexity of inherently interpretable models.
Fichier principal
Vignette du fichier
VERAN_2022_Interpretable_hierarchical_symbolic_regression_HAL.pdf (2.55 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03819953 , version 1 (18-10-2022)

Identifiants

Citer

Thomas Veran, Pierre-Edouard Portier, François Fouquet. Interpretable hierarchical symbolic regression for safety-critical systems with an application to highway crash prediction. Engineering Applications of Artificial Intelligence, 2022, ⟨10.1016/j.engappai.2022.105534⟩. ⟨hal-03819953⟩
118 Consultations
36 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More