Institut Camille Jordan
This paper addresses the statistical estimation of Gaussian Mixture Models (GMMs) with unknown diagonal covariances from independent and identically distributed samples. We employ the Beurling-LASSO (BLASSO), a convex optimization framework that promotes sparsity in the space of measures, to simultaneously estimate the number of components and their parameters. Our main contribution extends the BLASSO methodology to multivariate GMMs with component-specific unknown diagonal covariance matrices-a significantly more flexible setting than previous approaches requiring known and identical covariances. We establish non-asymptotic recovery guarantees with nearly parametric convergence rates for component means, diagonal covariances, and weights, as well as for density prediction. A key theoretical contribution is the identification of an explicit separation condition on mixture components that enables the construction of non-degenerate dual certificates-essential tools for establishing statistical guarantees for the BLASSO. Our analysis leverages the Fisher-Rao geometry of the statistical model and introduces a novel semi-distance adapted to our framework, providing new insights into the interplay between component separation, parameter space geometry, and achievable statistical recovery.
We introduce homotopical methods based on rewriting on higher-dimensional categories to prove coherence results in categories with an algebraic structure. We express the coherence problem for (symmetric) monoidal categories as an asphericity problem for a track category and we use rewriting methods on polygraphs to solve it. The setting is extended to more general coherence problems, seen as 3-dimensional word problems in a track category, including the case of braided monoidal categories.
We study Voronoi percolation on a large class of dd-dimensional Riemannian manifolds, which includes hyperbolic space Hd\mathbb{H}^d for d2d\geq 2. We prove that as the intensity λ\lambda of the underlying Poisson point process tends to infinity, both critical parameters pc(M,λ)p_c(M,\lambda) and pu(M,λ)p_u(M,\lambda) converge to the Euclidean critical parameter pc(Rd)p_c(\mathbb{R}^d). This extends a recent result of Hansen & M\"uller in the special case M=H2M=\mathbb{H}^2 to a general class of manifolds of arbitrary dimension. A crucial step in our proof, which may be of independent interest, is to show that if MM is simply connected and one-ended, then embedded graphs induced by a general class of tessellations on MM have connected minimal cutsets. In particular, this result applies to ε\varepsilon-nets, allowing us to implement a "fine-graining" argument. We also develop an annealed way of exploring the Voronoi cells that we use to characterize the uniqueness phase.
We obtain the optimal proxy variance for the sub-Gaussianity of Beta distribution, thus proving upper bounds recently conjectured by Elder (2016). We provide different proof techniques for the symmetrical (around its mean) case and the non-symmetrical case. The technique in the latter case relies on studying the ordinary differential equation satisfied by the Beta moment-generating function known as the confluent hypergeometric function. As a consequence, we derive the optimal proxy variance for the Dirichlet distribution, which is apparently a novel result. We also provide a new proof of the optimal proxy variance for the Bernoulli distribution, and discuss in this context the proxy variance relation to log-Sobolev inequalities and transport inequalities.
The use of Machine Learning (ML) has rapidly spread across several fields, having encountered many applications in Structural Dynamics and Vibroacoustic (SD\&V). The increasing capabilities of ML to unveil insights from data, driven by unprecedented data availability, algorithms advances and computational power, enhance decision making, uncertainty handling, patterns recognition and real-time assessments. Three main applications in SD\&V have taken advantage of these benefits. In Structural Health Monitoring, ML detection and prognosis lead to safe operation and optimized maintenance schedules. System identification and control design are leveraged by ML techniques in Active Noise Control and Active Vibration Control. Finally, the so-called ML-based surrogate models provide fast alternatives to costly simulations, enabling robust and optimized product design. Despite the many works in the area, they have not been reviewed and analyzed. Therefore, to keep track and understand this ongoing integration of fields, this paper presents a survey of ML applications in SD\&V analyses, shedding light on the current state of implementation and emerging opportunities. The main methodologies, advantages, limitations, and recommendations based on scientific knowledge were identified for each of the three applications. Moreover, the paper considers the role of Digital Twins and Physics Guided ML to overcome current challenges and power future research progress. As a result, the survey provides a broad overview of the present landscape of ML applied in SD\&V and guides the reader to an advanced understanding of progress and prospects in the field.
01 Nov 2017
We consider minimax signal detection in the sequence model. Working with certain ellipsoids in the space of square-summable sequences of real numbers, with a ball of positive radius removed, we obtain upper and lower bounds for the minimax separation radius in the non-asymptotic framework, i.e., for a fixed value of the involved noise level. We use very weak assumptions on the noise (i.e., fourth moments are assumed to be uniformly bounded). In particular, we do not use any kind of Gaussianity or independence assumption on the noise. It is shown that the established minimax separation rates are not faster than the ones obtained in the classical sequence model (i.e., independent standard Gaussian noise) but, surprisingly, are of the same order as the minimax estimation rates in the classical setting. Under an additional condition on the noise, the classical minimax separation rates are also retrieved in benchmark well-posed and ill-posed inverse problems.
In this paper we consider the possibility to use numerical simulations for a computer assisted analysis of integrability of dynamical systems. We formulate a rather general method of recovering the obstruction to integrability for the systems with a small number of degrees of freedom. We generalize this method using the results of KAM theory and stochastic approaches to the families of parameter depending systems. This permits the localization of possible integrability regions in the parameter space. We give some examples of application of this approach to dynamical systems having mechanical origin.
This article proposes a new way of deriving mean-field exponents for the weakly self-avoiding walk model in dimensions d>4d>4. Among other results, we obtain up-to-constant estimates for the full-space and half-space two-point functions in the critical and near-critical regimes. A companion paper proposes a similar analysis for spread-out Bernoulli percolation in dimensions d>6d>6.
We consider chiral, generally nonlinear density waves in one dimension, modelling the bosonized edge modes of a two-dimensional fermionic topological insulator. Using the coincidence between bosonization and Lie-Poisson dynamics on an affine U(1) group, we show that wave profiles which are periodic in time produce Berry phases accumulated by the underlying fermionic field. These phases can be evaluated in closed form for any Hamiltonian, and they serve as a diagnostic of nonlinearity. As an explicit example, we discuss the Korteweg-de Vries equation, viewed as a model of nonlinear quantum Hall edge modes.
27 May 2015
The main purpose of this paper is to investigate the strong approximation of the integrated empirical process. More precisely, we obtain the exact rate of the approximations by a sequence of weighted Brownian bridges and a weighted Kiefer process. Our arguments are based in part on the Koml\'os, Major and Tusn\'ady's results. Applications include the two-sample testing procedures together with the change-point problems. We also consider the strong approximation of the integrated empirical process when the parameters are estimated. Finally, we study the behavior of the self-intersection local time of the partial sum process representation of the integrated empirical process. Reference: Koml\'os, J., Major, P. and Tusn\'ady, G. (1975). An approximation of partial sums of independent RV's and the sample DF. I. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete, 32, 111-131.
We propose a new approach to the existence of constant transversal scalar curvature Sasaki structures drawing on ideas and tools from the CR Yamabe problem, establishing a link between the CR Yamabe invariant, the existence of Sasaki structures of constant transversal scalar curvature, and the K-stability of Sasaki manifolds. Assuming that the Sasaki-Reeb cone contains a regular vector field, we show that if the CR Yamabe invariant of a compact Sasaki manifold attains a specific value determined by the geometry of the Reeb cone, then the Sasaki manifold is K-semistable. Under the additional assumption of non-positive average scalar curvature, the CR Yamabe invariant attains this topological value if the manifold admits approximately constant scalar curvature Sasaki structures, and we also show a partial converse. As an application, we provide a new numerical criterion for the K-semistability of polarised compact complex manifolds.
We investigate the statistical properties, based on numerical simulations and analytical calculations, of a recently proposed stochastic model for the velocity field of an incompressible, homogeneous, isotropic and fully developed turbulent flow. A key step in the construction of this model is the introduction of some aspects of the vorticity stretching mechanism that governs the dynamics of fluid particles along their trajectory. An additional further phenomenological step aimed at including the long range correlated nature of turbulence makes this model depending on a single free parameter γ\gamma that can be estimated from experimental measurements. We confirm the realism of the model regarding the geometry of the velocity gradient tensor, the power-law behaviour of the moments of velocity increments (i.e. the structure functions), including the intermittent corrections, and the existence of energy transfers across scales. We quantify the dependence of these basic properties of turbulent flows on the free parameter γ\gamma and derive analytically the spectrum of exponents of the structure functions in a simplified non dissipative case. A perturbative expansion in power of γ\gamma shows that energy transfers, at leading order, indeed take place, justifying the dissipative nature of this random field.
This work is concerned with the recovery of piecewise constant images from noisy linear measurements. We study the noise robustness of a variational reconstruction method, which is based on total (gradient) variation regularization. We show that, if the unknown image is the superposition of a few simple shapes, and if a non-degenerate source condition holds, then, in the low noise regime, the reconstructed images have the same structure: they are the superposition of the same number of shapes, each a smooth deformation of one of the unknown shapes. Moreover, the reconstructed shapes and the associated intensities converge to the unknown ones as the noise goes to zero.
In this paper we continue the description of the possibilities to use numerical simulations for mathematically rigorous computer assisted analysis of integrability of dynamical systems. We sketch some of the algebraic methods of studying the integrability and present a constructive algorithm issued from the Ziglin's approach. We provide some examples of successful applications of the constructed algorithm to physical systems.
In this paper we give new bounds on the bisection width of random 3-regular graphs on nn vertices. The main contribution is a new lower bound of 0.103295n0.103295n based on a first moment method together with a structural analysis of the graph, thereby improving a 27-year-old result of Kostochka and Melnikov. We also give a complementary upper bound of 0.139822n0.139822n by combining a result of Lyons with original combinatorial insights. Developping this approach further, we obtain a non-rigorous improved upper bound with the help of Monte Carlo simulations.
This article proposes a new way of deriving mean-field exponents for sufficiently spread-out Bernoulli percolation in dimensions d>6d>6. We obtain an upper bound for the full-space and half-space two-point functions in the critical and near-critical regimes. In a companion paper, we apply a similar analysis to the study of the weakly self-avoiding walk model in dimensions d>4d>4.
In this note we classify sequences according to whether they are morphic, pure morphic, uniform morphic, pure uniform morphic, primitive morphic, or pure primitive morphic, and for each possibility we either give an example or prove that no example is possible.
This paper advances the general theory of continuous sparse regularisation on measures with the Beurling-LASSO (BLASSO). This TV-regularized convex program on the space of measures allows to recover a sparse measure using a noisy observation from an appropriate measurement operator. While previous works have uncovered the central role played by this operator and its associated kernel in order to get estimation error bounds, the latter requires a technical local positive curvature (LPC) assumption to be verified on a case-by-case basis. In practice, this yields only few LPC-kernels for which this condition is proved. At the heart of our contribution lies the kernel switch, which uncouples the model kernel from the LPC assumption: it enables to leverage any known LPC-kernel as a pivot kernel to prove error bounds, provided embedding conditions are verified between the model and pivot RKHS. We increment the list of LPC-kernels, proving that the "sinc-4" kernel, used for signal recovery and mixture problems, does satisfy the LPC assumption. Furthermore, we also show that the BLASSO localisation error around the true support decreases with the noise level, leading to effective near regions. This improves on known results where this error is fixed with some parameters depending on the model kernel. We illustrate the interest of our results in the case of translation-invariant mixture model estimation, using bandlimiting smoothing and sketching techniques to reduce the computational burden of BLASSO.
We describe conditions on non-gradient drift diffusion Fokker-Planck equations for its solutions to converge to equilibrium with a uniform exponential rate in Wasserstein distance. This asymptotic behaviour is related to a functional inequality, which links the distance with its dissipation and ensures a spectral gap in Wasserstein distance. We give practical criteria for this inequality and compare it to classical ones. The key point is to quantify the contribution of the diffusion term to the rate of convergence, which to our knowledge is a novelty.
In this paper, we build the Hamiltonian system and the corresponding Lax pairs associated to a twisted connection in gl2(C)\mathfrak{gl}_2(\mathbb{C}) admitting an irregular and ramified pole at infinity of arbitrary degree, hence corresponding to the Painlevé 11 hierarchy. We provide explicit formulas for these Lax pairs and Hamiltonians in terms of the irregular times and standard 2g2g Darboux coordinates associated to the twisted connection. Furthermore, we obtain a map that reduces the space of irregular times to only gg non-trivial isomonodromic deformations. In addition, we perform a symplectic change of Darboux coordinates to obtain a set of symmetric Darboux coordinates in which Hamiltonians and Lax pairs are polynomial. Finally, we apply our general theory to the first cases of the hierarchy: the Airy case (g=0)(g=0), the Painlevé 11 case (g=1)(g=1) and the next two elements of the Painlevé 11 hierarchy.
There are no more papers matching your filters at the moment.