A Connection Between Score Matching and Local Intrinsic Dimension
Yeats, Jacobson, Hannan et al.
The local intrinsic dimension (LID) of data is a fundamental quantity in signal processing and learning theory, but quantifying the LID of high-dimensional, complex data has been a historically challenging task. Recent works have discovered that diffusion models capture the LID of data through the spectra of their score estimates and through the rate of change of their density estimates under various noise perturbations. While these methods can accurately quantify LID, they require either many forward passes of the diffusion model or use of gradient computation, limiting their applicability in compute- and memory-constrained scenarios.
We show that the LID is a lower bound on the denoising score matching loss, motivating use of the denoising score matching loss as a LID estimator. Moreover, we show that the equivalent implicit score matching loss also approximates LID via the normal dimension and is closely related to a recent LID estimator, FLIPD. Our experiments on a manifold benchmark and with Stable Diffusion 3.5 indicate that the denoising score matching loss is a highly competitive and scalable LID estimator, achieving superior accuracy and memory footprint under increasing problem size and quantization level.
Local Intrinsic Dimension (LID) is a fundamental quantity in signal processing and learning theory; however, quantifying the LID of high-dimensional complex data has historically been a challenging task. Recent research has found that diffusion models capture data LID through the spectral properties of their score estimates and the rate of change in density estimation under various noise perturbations. While these methods can accurately quantify LID, they require multiple forward passes through diffusion models or gradient computations, which limits their applicability in computationally and memory-constrained scenarios.
This paper demonstrates that LID serves as a lower bound for denoising score matching loss, thereby providing theoretical justification for using denoising score matching loss as an LID estimator. Furthermore, the authors prove that the equivalent implicit score matching loss also approximates LID through normal dimension and is closely related to the recent LID estimator FLIPD. Experiments on manifold benchmarks and Stable Diffusion 3.5 demonstrate that denoising score matching loss is a highly competitive and scalable LID estimator, achieving superior accuracy and memory efficiency as problem scale and quantization levels increase.
High-dimensional data typically exhibits low-dimensional structure. This assumption, known as the manifold hypothesis, is central to machine learning. Local Intrinsic Dimension (LID) is a fundamental quantity that captures this low-dimensional structure: for a point x, the LID is the number of dimensions required to losslessly encode the data in a neighborhood of x.
Non-parametric Methods: Require a large number of data samples, are sensitive to hyperparameter selection, and fail to generalize in low-data settings
Parametric Methods: Leverage deep generative models for scalability, but LIDL requires multiple generative models, while FLIPD and the normal bundle method require gradient computation or many forward passes
Existing parametric LID estimation methods have limitations in computational and memory efficiency, particularly in large-scale applications. This paper aims to discover a more efficient and scalable LID estimation method.
Theoretical Contribution: Proves that denoising score matching loss has LID as a lower bound, providing theoretical foundation for its use as a scalable LID estimator
Method Connection: Establishes close relationships between score matching loss and current leading estimators (FLIPD and normal bundle methods)
Experimental Validation: Experiments on manifold benchmarks and Stable Diffusion 3.5/2.0 demonstrate that denoising score matching loss is a highly competitive LID estimator
Practical Advantages: Demonstrates superior scalability in memory consumption and quantization consistency
Given a point x sampled from a d-dimensional data manifold M⊂Rⁿ, estimate its local intrinsic dimension d. The input is a high-dimensional data point and the output is the corresponding LID estimate.
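The estimation task can be illustrated with a minimal sketch under a toy assumption that is not taken from the paper: the data are standard Gaussian on a d-dimensional linear subspace of Rⁿ, so the score of the noised density is known in closed form and stands in for a trained diffusion model. The function name `dsm_lid_estimate` and all of its parameters are hypothetical. In this toy case the noise-prediction form of the denoising score matching loss works out to d/(1+σ²), which approaches d as σ → 0; the loss normalization used in the paper may differ.

```python
import numpy as np

def dsm_lid_estimate(n=32, d=5, sigma=0.05, num_samples=20000, seed=0):
    """Toy LID estimate from the denoising loss (hypothetical helper).

    Assumption: data ~ N(0, I_d) on the first d axes of R^n, zero elsewhere.
    The closed-form score below replaces a learned score network.
    """
    rng = np.random.default_rng(seed)

    # Clean samples living on a d-dimensional linear subspace of R^n.
    x = np.zeros((num_samples, n))
    x[:, :d] = rng.standard_normal((num_samples, d))

    # Perturb with isotropic Gaussian noise of scale sigma.
    eps = rng.standard_normal((num_samples, n))
    x_t = x + sigma * eps

    # Noised density is N(0, diag(var)): variance 1 + sigma^2 along the
    # subspace (tangent directions), sigma^2 off it (normal directions).
    var = np.full(n, sigma**2)
    var[:d] += 1.0
    score = -x_t / var

    # Noise prediction implied by the score: eps_hat = -sigma * score.
    eps_hat = -sigma * score

    # Mean squared noise-prediction error, summed over coordinates.
    return float(np.mean(np.sum((eps_hat - eps) ** 2, axis=1)))

if __name__ == "__main__":
    print(dsm_lid_estimate())  # close to d = 5 for small sigma
```

In this sketch each sample needs a single score evaluation and no differentiation through the score function. The normal directions contribute no error because the noise there is fully recoverable from the noised point, so only the d tangent directions add to the loss.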
The normal bundle method computes the singular values of an m×n matrix of score vectors (m sampled score vectors in Rⁿ), while the proposed error bundle method computes the eigenvalues of the matrix of error vectors. The denoising loss equals the trace of the Gram matrix (the sum of its eigenvalues) and remains accurate with small sample sizes.
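The two computations described above can be sketched as follows. This is an illustration in the spirit of those methods, not the authors' implementation. It assumes the same kind of toy model (standard Gaussian data on the first d axes of Rⁿ) so that a closed-form score can replace a trained network. The helper names `subspace_score`, `normal_bundle_lid`, and `error_gram_trace`, and the largest-gap rule for picking the normal dimension, are assumptions made for this example.

```python
import numpy as np

def subspace_score(x_t, d, sigma):
    """Closed-form score of N(0, I_d) on the first d axes of R^n after
    adding N(0, sigma^2 I) noise. Stands in for a learned score model."""
    var = np.full(x_t.shape[-1], sigma**2)
    var[:d] += 1.0
    return -x_t / var

def normal_bundle_lid(x0, d, sigma=0.01, m=200, seed=0):
    """Estimate LID from the singular values of an m x n score matrix."""
    rng = np.random.default_rng(seed)
    n = x0.shape[0]
    # Evaluate the score at m noisy copies of x0 and stack them row-wise.
    x_t = x0 + sigma * rng.standard_normal((m, n))
    scores = subspace_score(x_t, d, sigma)
    # Singular values come back sorted in descending order.
    svals = np.linalg.svd(scores, compute_uv=False)
    # Assumed heuristic: the largest drop separates normal from tangent
    # directions, since scores are dominated by their normal component.
    gaps = svals[:-1] - svals[1:]
    normal_dim = int(np.argmax(gaps)) + 1
    return n - normal_dim

def error_gram_trace(x0, d, sigma=0.01, m=200, seed=0):
    """Return the denoising loss and the eigenvalue sum of the error Gram matrix."""
    rng = np.random.default_rng(seed)
    n = x0.shape[0]
    eps = rng.standard_normal((m, n))
    x_t = x0 + sigma * eps
    # Noise-prediction errors, one row per sample.
    errors = -sigma * subspace_score(x_t, d, sigma) - eps
    # n x n Gram matrix of the error vectors, averaged over samples.
    gram = errors.T @ errors / m
    eigvals = np.linalg.eigvalsh(gram)
    loss = float(np.mean(np.sum(errors**2, axis=1)))
    return loss, float(np.sum(eigvals))

if __name__ == "__main__":
    x0 = np.zeros(20)
    x0[:4] = 1.0  # a point on the 4-dimensional toy manifold
    print(normal_bundle_lid(x0, d=4))
    print(error_gram_trace(x0, d=4))
```

The second function makes the trace identity explicit: the mean squared error norm and the sum of the Gram matrix eigenvalues are the same number, so the loss can be computed without any eigendecomposition or spectral thresholding.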
The paper cites multiple important related works, including:
Vincent (2011): Connection between denoising and generative modeling
Hyvärinen & Dayan (2005): Foundational theory of score matching
Kamkari et al. (2024): FLIPD method
Stanczuk et al. (2024): Normal bundle method
Related literature on diffusion models and flow matching
Overall Assessment: This is an excellent paper combining theory and practice, providing new theoretical perspectives and practical methods for LID estimation. While certain technical details could be improved, its core contributions hold significant value for understanding the geometric properties of diffusion models and advancing LID estimation methods.