Towards Rigorous Interpretations: a Formalisation of Feature Attribution - Institut des Systèmes Intelligents et de Robotique Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Towards Rigorous Interpretations: a Formalisation of Feature Attribution

Darius Afchar
  • Fonction : Auteur
Romain Hennequin
  • Fonction : Auteur

Résumé

Feature attribution is often loosely presented as the process of selecting a subset of relevant features as a rationale of a prediction. Taskdependent by nature, precise definitions of "relevance" encountered in the literature are however not always consistent. This lack of clarity stems from the fact that we usually do not have access to any notion of ground-truth attribution and from a more general debate on what good interpretations are. In this paper we propose to formalise feature selection/attribution based on the concept of relaxed functional dependence. In particular, we extend our notions to the instance-wise setting and derive necessary properties for candidate selection solutions, while leaving room for task-dependence. By computing ground-truth attributions on synthetic datasets, we evaluate many state-of-the-art attribution methods and show that, even when optimised, some fail to verify the proposed properties and provide wrong solutions.
Fichier principal
Vignette du fichier
afchar21a.pdf (970.95 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03909852 , version 1 (21-12-2022)

Identifiants

  • HAL Id : hal-03909852 , version 1

Citer

Darius Afchar, Romain Hennequin, Vincent Guigue. Towards Rigorous Interpretations: a Formalisation of Feature Attribution. Thirty-eighth International Conference on Machine Learning (ICML 2021), Jul 2021, Virtual (on line), France. ⟨hal-03909852⟩
5 Consultations
16 Téléchargements

Partager

Gmail Facebook X LinkedIn More