Convergence of Langevin-Simulated Annealing algorithms with multiplicative noise

Pierre Bras; Gilles Pagès

Pré-Publication, Document De Travail Année : 2021

Convergence of Langevin-Simulated Annealing algorithms with multiplicative noise

(1) , (2)

1
2

Pierre Bras

Fonction : Auteur
PersonId : 1094627
ORCID : 0000-0001-9577-687X
IdRef : 272874248

Laboratoire de Probabilités, Statistique et Modélisation

Gilles Pagès

Fonction : Auteur
PersonId : 856726
IdHAL : gilpag

Laboratoire de Probabilités, Statistique et Modélisation

Résumé

We study the convergence of Langevin-Simulated Annealing type algorithms with multiplicative noise, i.e. for $V : \mathbb{R}^d \to \mathbb{R}$ a potential function to minimize, we consider the stochastic equation $dY_t = - \sigma \sigma^\top \nabla V(Y_t) dt + a(t)\sigma(Y_t)dW_t + a(t)^2\Upsilon(Y_t)dt$, where $(W_t)$ is a Brownian motion, where $\sigma : \mathbb{R}^d \to \mathcal{M}_d(\mathbb{R})$ is an adaptive (multiplicative) noise, where $a : \mathbb{R}^+ \to \mathbb{R}^+$ is a function decreasing to $0$ and where $\Upsilon$ is a correction term. This setting can be applied to optimization problems arising in Machine Learning. The case where $\sigma$ is a constant matrix has been extensively studied however little attention has been paid to the general case. We prove the convergence for the $L^1$-Wasserstein distance of $Y_t$ and of the associated Euler-scheme $\bar{Y}_t$ to some measure $\nu^\star$ which is supported by $\text{argmin}(V)$ and give rates of convergence to the instantaneous Gibbs measure $\nu_{a(t)}$ of density $\propto \exp(-2V(x)/a(t)^2)$. To do so, we first consider the case where $a$ is a piecewise constant function. We find again the classical schedule $a(t) = A\log^{-1/2}(t)$. We then prove the convergence for the general case by giving bounds for the Wasserstein distance to the stepwise constant case using ergodicity properties.

Domaines

Probabilités [math.PR]

Gilles Pagès : Connectez-vous pour contacter le contributeur

https://cnrs.hal.science/hal-03891234

Soumis le : vendredi 9 décembre 2022-09:38:11

Dernière modification le : samedi 27 avril 2024-03:10:24

Dates et versions

hal-03891234 , version 1 (09-12-2022)

Identifiants

HAL Id : hal-03891234 , version 1
ARXIV : 2109.11669

Citer

Pierre Bras, Gilles Pagès. Convergence of Langevin-Simulated Annealing algorithms with multiplicative noise. 2021. ⟨hal-03891234⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INSMI LPSM SORBONNE-UNIVERSITE SU-SCIENCES UP-SCIENCES

19 Consultations

0 Téléchargements

Convergence of Langevin-Simulated Annealing algorithms with multiplicative noise

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager