Reinforcement-based Display Selection for Frugal Learning

Sébastien Deschamps; Hichem Sahbi

Communication Dans Un Congrès Année : 2022

Reinforcement-based Display Selection for Frugal Learning

(1) , (1)

Sébastien Deschamps

Fonction : Auteur

Systèmes Electroniques

Hichem Sahbi

Fonction : Auteur
PersonId : 742003
IdHAL : hichem-sahbi
IdRef : 077125223

Systèmes Electroniques

Résumé

Most of the existing learning models, particularly deep neural networks, are reliant on large datasets whose handlabeling is expensive and time demanding. A current trend is to make the learning of these models frugal and less dependent on large collections of labeled data. Among the existing solutions, deep active learning is currently witnessing a major interest and its purpose is to train deep networks using as few labeled samples as possible. However, the success of active learning is highly dependent on how critical are these samples when training models. In this paper, we devise a novel active learning approach for label-efficient training. The proposed method is iterative and aims at minimizing a constrained objective function that mixes diversity, representativity and uncertainty criteria. The proposed approach is probabilistic and unifies all these criteria in a single objective function whose solution models the probability of relevance of samples (i.e., how critical) when learning a decision function. We also introduce a novel weighting mechanism based on reinforcement learning, which adaptively balances these criteria at each training iteration, using a particular stateless Q-learning model. Extensive experiments conducted on staple image classification data, including Object-DOTA, show the effectiveness of our proposed model w.r.t. several baselines including random, uncertainty and flat as well as other work.

Domaines

Intelligence artificielle [cs.AI] Vision par ordinateur et reconnaissance de formes [cs.CV] Traitement des images [eess.IV]

Fichier principal

paper.pdf (557.04 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Hichem Sahbi : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03848481

Soumis le : jeudi 10 novembre 2022-18:15:49

Dernière modification le : samedi 7 octobre 2023-21:36:22

Dates et versions

hal-03848481 , version 1 (10-11-2022)

Identifiants

HAL Id : hal-03848481 , version 1

Citer

Sébastien Deschamps, Hichem Sahbi. Reinforcement-based Display Selection for Frugal Learning. International Conference on Pattern Recognition (ICPR), Aug 2022, Montréal, Canada. ⟨hal-03848481⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES

43 Consultations

53 Téléchargements

Reinforcement-based Display Selection for Frugal Learning

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager