Arnaud Guyader’s research while affiliated with Sorbonne University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (55)


Robust non-parametric regression via median-of-means
  • Preprint

January 2023

·

25 Reads

Anna Ben-Hamou

·

Arnaud Guyader

This paper is devoted to the problem of determining the concentration bounds that are achievable in non-parametric regression. We consider the setting where features are supported on a bounded subset of Rd\mathbb{R}^d, the regression function is Lipschitz, and the noise is only assumed to have a finite second moment. We first specify the fundamental limits of the problem by establishing a general lower bound on deviation probabilities, and then construct explicit estimators that achieve this bound. These estimators are obtained by applying the median-of-means principle to classical local averaging rules in non-parametric regression, including nearest neighbors and kernel procedures.


On the Hill relation and the mean reaction time for metastable processes

November 2022

·

20 Reads

·

19 Citations

Stochastic Processes and their Applications

We illustrate how the Hill relation and the notion of quasi-stationary distribution can be used to analyse the biasing error introduced by many numerical procedures that have been proposed in the literature, in particular in molecular dynamics, to compute mean reaction times between metastable states for Markov processes. The theoretical findings are illustrated on various examples demonstrating the sharpness of the biasing error analysis as well as the applicability of our study to elliptic diffusions.



Recursive Estimation of a Failure Probability for a Lipschitz Function

July 2021

·

8 Reads

Let g : Ω\Omega = [0, 1] d \rightarrow R denote a Lipschitz function that can be evaluated at each point, but at the price of a heavy computational time. Let X stand for a random variable with values in Ω\Omega such that one is able to simulate, at least approximately, according to the restriction of the law of X to any subset of Ω\Omega. For example, thanks to Markov chain Monte Carlo techniques, this is always possible when X admits a density that is known up to a normalizing constant. In this context, given a deterministic threshold T such that the failure probability p := P(g(X) > T) may be very low, our goal is to estimate the latter with a minimal number of calls to g. In this aim, building on Cohen et al. [9], we propose a recursive and optimal algorithm that selects on the fly areas of interest and estimate their respective probabilities.



a General λQB(k)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _Q^B(k)$$\end{document}. b Imposing λQB(-2)=2λQB(-1)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _Q^B(-2) = 2\lambda _Q^B(-1)$$\end{document} implies, by convexity of λQB(k)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _Q^B(k)$$\end{document}, that this function passes through the origin P0\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P_0$$\end{document} and is linear for k∈[-2,0]\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k\in [-2,0]$$\end{document}
a Efficiency condition for convex and left-differentiable IQB(w)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_Q^B(w)$$\end{document}. b Asymptotically efficient IQB(w)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_Q^B(w)$$\end{document} diverging at the left of w∗\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w^*$$\end{document}. c Nonconvex IQB(w)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_Q^B(w)$$\end{document} that is also asymptotically efficient
Line w=f(m)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w=f(m)$$\end{document} in the (m, w) plane on which JQ(m,w)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J_Q(m,w)$$\end{document} is defined for the exponential change of measure. a Asymptotically efficient Qn\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q_n$$\end{document} for which the typical value m∗\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m^*$$\end{document} of Mn\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$M_n$$\end{document} is chosen on the boundary of B. b Non-efficient Qn\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q_n$$\end{document} associated with m∗\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m^*$$\end{document} in the interior of B
IQB(w)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_Q^B(w)$$\end{document} for the Gaussian sample mean for aμ=-1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu =-1$$\end{document} and σ=1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma =1$$\end{document} and bμ=2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu =2$$\end{document} and σ=1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma =1$$\end{document}. Note that only the finite part of IQB(w)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_Q^B(w)$$\end{document} is shown
Rate function IQB(w)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_Q^B(w)$$\end{document} for the Gaussian sample mean and nonconvex set B

+2

Efficient Large Deviation Estimation Based on Importance Sampling
  • Article
  • Publisher preview available

October 2020

·

80 Reads

·

18 Citations

Journal of Statistical Physics

We present a complete framework for determining the asymptotic (or logarithmic) efficiency of estimators of large deviation probabilities and rate functions based on importance sampling. The framework relies on the idea that importance sampling in that context is fully characterized by the joint large deviations of two random variables: the observable defining the large deviation probability of interest and the likelihood factor (or Radon–Nikodym derivative) connecting the original process and the modified process used in importance sampling. We recover with this framework known results about the asymptotic efficiency of the exponential tilting and obtain new necessary and sufficient conditions for a general change of process to be asymptotically efficient. This allows us to construct new examples of efficient estimators for sample means of random variables that do not have the exponential tilting form. Other examples involving Markov chains and diffusions are presented to illustrate our results.

View access options

On the Hill relation and the mean reaction time for metastable processes

August 2020

·

31 Reads

We illustrate how the Hill relation and the notion of quasi-stationary distribution can be used to analyse the biasing error introduced by many numerical procedures that have been proposed in the literature, in particular in molecular dynamics, to compute mean reaction times between metastable states for Markov processes. The theoretical findings are illustrated on various examples demonstrating the sharpness of the biasing error analysis as well as the applicability of our study to elliptic diffusions.


Efficient large deviation estimation based on importance sampling

March 2020

·

26 Reads

We present a complete framework for determining the asymptotic (or logarithmic) efficiency of estimators of large deviation probabilities and rate functions based on importance sampling. The framework relies on the idea that importance sampling in that context is fully characterized by the joint large deviations of two random variables: the observable defining the large deviation probability of interest and the likelihood factor (or Radon-Nikodym derivative) connecting the original and modified process used in importance sampling. We recover with this framework known results about the asymptotic efficiency of the exponential tilting and obtain new necessary and sufficient conditions for a general change of process to be asymptotically efficient. This allows us to construct new examples of efficient estimators for sample means of random variables that do not have the exponential tilting form. Other examples involving Markov chains and diffusions are presented to illustrate our results.



On Synchronized Fleming-Viot Particle Systems

November 2019

·

12 Reads

This article presents a variant of Fleming-Viot particle systems, which are a standard way to approximate the law of a Markov process with killing as well as related quantities. Classical Fleming-Viot particle systems proceed by simulating N trajectories, or particles, according to the dynamics of the underlying process, until one of them is killed. At this killing time, the particle is instantaneously branched on one of the (N1)(N-1) other ones, and so on until a fixed and finite final time T. In our variant, we propose to wait until K particles are killed and then rebranch them independently on the (NK)(N-K) alive ones. Specifically, we focus our attention on the large population limit and the regime where K/N has a given limit when N goes to infinity. In this context, we establish consistency and asymptotic normality results. The variant we propose is motivated by applications in rare event estimation problems.


Citations (40)


... As we mentioned above, in the non-convex case, most of this literature concerns the criticality and saddle-point avoidance guarantees of the method, under different structural and regularity assumptions. For our purposes, the most relevant threads in the literature revolve around (i) treating x n as a discrete-time Markov chain and examining its tails [24,25,58]; (ii ) considering it as a discrete-time approximation of a stochastic differential equation (SDE) and employing tools like dynamic mean-field theory (DMFT) to study the resulting "diffusion approximation" limit [48,49,64]; and/or (iii) focusing on the time it takes (SGD) to escape a spurious local minimum [6,23,26,29,50,73]. Our analysis shares the same high-level goal as these general threads -that is, understanding the global convergence properties of (SGD) in non-convex landscapes -but we are not otherwise aware of any comparable results. ...

Reference:

The global convergence time of stochastic gradient descent in non-convex landscapes: Sharp estimates via large deviations
On the Hill relation and the mean reaction time for metastable processes
  • Citing Article
  • November 2022

Stochastic Processes and their Applications

... These algorithms draw inspiration from methodologies outlined in [4] originally devised to find the minimum of a Lipschitz function. In this case, the rate of convergence is contingent, as for our case, to the dimension with exponential convergence when d = 1 and polynomial convergence when d > 1.Notice that this algorithm was further adapted in [2] in the case of the computation of a failure probability that is the computation of P(f (X) > c) for a given c when the Lipschitz constant is known. In this case, similar regimes of convergences are observed according to the value of d. ...

Recursive Estimation of a Failure Probability for a Lipschitz Function
  • Citing Article
  • April 2022

SMAI Journal of Computational Mathematics

... In [7], a similar formula is given for the large sample size variance of all estimators, see Corollary 2.8 and Theorem 2.13. The extension to the case k > 1 under the same assumptions, where k is fixed and N → +∞ can be obtained using the results of [10]. ...

On synchronized Fleming–Viot particle systems
  • Citing Article
  • March 2021

Theory of Probability and Mathematical Statistics

... Offline learning saves the processing time of the tracking algorithm compared to online learning, but also ignores the changes in the target feature model during the tracking process, sacrificing the tracking accuracy. Therefore, building more robust and efficient target feature models and statistical learning models is a research focus and research trend in the field of visual target tracking [6,7]. The face is a type of natural structural object with quite complex detailed changes. ...

Variance Estimation in Adaptive Sequential Monte Carlo
  • Citing Article
  • January 2020

The Annals of Applied Probability

... This dynamics describes the path realizing fluctuations in the low temperature limit. There are various ways to introduce the instanton dynamics (21). Let us provide a maybe unusual, PDEoriented motivation for this object, by introducing the function ...

Efficient Large Deviation Estimation Based on Importance Sampling

Journal of Statistical Physics

... The interaction between genetic type and sex was incorporated into the ANOVA model. The significance of each effect was determined using the Fisher F-test with the AovSum function in the Facto-MineR library (Cornillon et al., 2018). Adjusted averages are presented in Tables 1 and 2. In the event of a significant genetic type effect, a Student's t-test was employed to facilitate a comparison between two breeds. ...

R pour la statistique et la science des données
  • Citing Book
  • October 2018

·

·

Arnaud Guyader

·

[...]

·

Erwann Le Pennec

... Estimating the probabilities of rare but impactful events, an important problem throughout science and engineering, is usually done through -generally expensive -Monte Carlo methods such as importance sampling [1] or importance splitting methods [2,3]. A simple alternative approach, which is principled and sampling-free but only asymptotically exact under certain assumptions, consists of using a Laplace approximation, see e.g. ...

Adaptive Multilevel Splitting: Historical Perspective and Recent Results
  • Citing Article
  • April 2019

... = τ {ξ l} (x), the estimator of the rare event probability associated with level l, where I N l is the random number of iterations required so that all clones have reached the target set {ξ l}. The estimator p N l,ams (as well as other nonnormalized estimators) is unbiased E p N l,ams = p ε l (see [7,2]). The empirical distribution of clones at iteration I = Law(X ε | τ l (X ε ) < τ A (X ε )). ...

On the Asymptotic Normality of Adaptive Multilevel Splitting
  • Citing Article
  • April 2018

SIAM/ASA Journal on Uncertainty Quantification

... Moreover, we only consider soft killing at some continuous rate, and no hard killing which would correspond to the case where T is the escape time from some sub-domain (see e.g. [4,21]). Finally, as will be seen below, as far as the long-time behaviour of the process is concerned we will work in a perturbative regime, namely we will assume that the variations of λ are small with respect to the mixing time of the diffusion (1.1) (while λ ∞ itself is not required to be small). ...

A Central Limit Theorem for Fleming-Viot Particle Systems with Hard Killing
  • Citing Article
  • September 2017

... For both standard and reduced estimates (ARMS and ART's IS estimate), we observe approximately a unitary negative slope (on a logarithmic scale) of the variance as a function of cost, implying that the product of variance and cost is constant. Such a behavior suggests asymptotic normality of the proposed IS estimate, as it has been demonstrated for AMS [9]. ...

Fluctuation analysis of adaptive multilevel splitting
  • Citing Article
  • December 2016