Sylvester's criterion characterizes positive definite (PD) and positive semidefinite (PSD) matrices without the need of eigendecomposition. It states that a symmetric matrix is PD if and only if all of its leading principal minors are positive, and a symmetric matrix is PSD if and only if all of its principal minors are nonnegative. For an $m\times m$ symmetric matrix, Sylvester's criterion requires computing $m$ and $2^m-1$ determinants to verify it is PD and PSD, respectively. Therefore, it is less useful for PSD matrices due to the exponential growth in the number of principal submatrices as the matrix dimension increases. We provide a stronger Sylvester's criterion for PSD matrices which only requires to verify the nonnegativity of $m(m+1)/2$ determinants. Based on the new criterion, we provide a method to derive elementwise criteria for PD and PSD matrices. We illustrate the applications of our results in PD or PSD matrix completion and highlight their statistics applications via nonlinear semidefinite program.
- Paper ID: 2501.00894
- Title: A stronger Sylvester's criterion for positive semidefinite matrices
- Authors: Mingrui Zhang (UC Berkeley), Peng Ding (UC Berkeley)
- Classification: math.RA (Ring and Algebra), math.ST (Statistics Theory), stat.TH (Statistics Theory)
- Publication Date: January 1, 2025 (arXiv preprint)
- Paper Link: https://arxiv.org/abs/2501.00894
Sylvester's criterion is a classical method for determining positive definite (PD) and positive semidefinite (PSD) matrices without eigenvalue decomposition. The classical criterion requires that a symmetric matrix is positive definite if and only if all leading principal minors are positive; a symmetric matrix is positive semidefinite if and only if all principal minors are non-negative. For an m×m symmetric matrix, Sylvester's criterion requires computing m and 2m−1 determinants to verify positive definiteness and positive semidefiniteness, respectively. Due to the exponential growth in the number of principal submatrices, this criterion has limited practical utility for positive semidefinite matrices. This paper proposes a stronger Sylvester's criterion for positive semidefinite matrices, requiring verification of only m(m+1)/2 determinant non-negativity conditions. Based on the new criterion, the authors provide a method for deriving element-wise criteria for positive definite and positive semidefinite matrices, and demonstrate applications in matrix completion and nonlinear semidefinite programming.
This research addresses the problem of excessive computational complexity in the classical Sylvester's criterion for determining positive semidefiniteness. Specifically:
- Computational Complexity Issue: For an m×m matrix, verifying positive semidefiniteness requires checking 2m−1 principal minors, exhibiting exponential growth with matrix dimension
- Limited Practical Utility: The exponential computational burden renders the classical criterion impractical for determining positive semidefiniteness of high-dimensional matrices
- Need for Theoretical Refinement: Existing literature contains misapplications and improper extensions of Sylvester's criterion
Positive semidefinite matrices hold important positions in mathematics, statistics, optimization, and related fields:
- Covariance matrices must be positive semidefinite
- Core constraints in semidefinite programming problems
- Key properties in matrix completion problems
- Fundamental tools in statistical inference
- Classical Sylvester's Criterion: Requires O(2m) determinant computations for positive semidefinite matrices
- Eigenvalue Decomposition Method: High computational complexity and lacks intuitive appeal in certain applications
- Graph-Theoretic Methods: Applicable only to specific structures (e.g., chordal graphs) with limited generality
- Proposes a Stronger Positive Semidefinite Sylvester's Criterion: Reduces required determinant computations from 2m−1 to m(m+1)/2
- Introduces the Concept of Inner-Saturated Submatrices: Provides theoretical foundation for the new criterion
- Establishes Element-Wise Determination Methods: Provides systematic approaches for determining element ranges
- Demonstrates Practical Applications: Validates method effectiveness in matrix completion and nonlinear semidefinite programming
- Provides Complete Theoretical Proofs: Includes rigorous mathematical proofs and supporting lemmas
Definition 2: For an m×m matrix X and integers a≤b, Xa:b,a:b is called a consecutive principal submatrix of X.
Definition 3: For a symmetric m×m matrix X, define XI,I as an inner-saturated submatrix, where I={1,m}∪J, and the index set J satisfies:
- When m≤2, J=∅
- When m≥3, {X2:(m−1),j:j∈J} is a maximal linearly independent set of column vectors of X2:(m−1),2:(m−1)
Theorem 2 (New Sylvester's Criterion): For a symmetric m×m matrix X, the following conditions are equivalent:
- X is a positive semidefinite matrix
- For any consecutive principal submatrix of X, some inner-saturated submatrix has non-negative determinant
- For any consecutive principal submatrix of X, every inner-saturated submatrix has non-negative determinant
- Complexity Optimization: Reduction from O(2m) to O(m2)
- Equivalence Proof: The equivalence of conditions (ii) and (iii) is the key innovation
- Constructive Method: Provides concrete algorithms for determining matrix element ranges
Define a partial order relation ⪯ on upper triangular elements: Xi′,j′⪯Xi,j if and only if i≤i′≤j′≤j.
- Diagonal Elements: Must be non-negative
- k-Diagonal Elements: Ranges determined sequentially according to partial order relations
- Recursive Determination: Utilizes determinant constraints from inner-saturated submatrices of consecutive principal submatrices
The paper primarily validates theoretical correctness through mathematical proofs, including:
- Proofs of three key lemmas
- Inductive proof of the main theorem
- Constructive proofs of Propositions 1 and 2
Example 3: Consider a partially observed 5×5 symmetric matrix containing three missing elements x1,x2,x3. The new criterion determines the feasible region of missing elements and verifies whether a positive definite completion exists.
Example 4: Optimization problem
maxX112+X222+X332+X442−X12X23X34−X13X24+X14
Subject to: X is positive semidefinite, 0≤Xii≤1
- Classical Method: 2m−1 determinant computations
- New Method: m(m+1)/2 determinant computations
- Improvement: Reduction from exponential to polynomial complexity
- Matrix Completion: Successfully determines completion feasibility for non-chordal graph cases
- Semidefinite Programming: Provides reparameterization methods for element-wise constraints
- Computational Efficiency: Significantly reduces required determinant computations
- Sylvester's Criterion: Criterion for determining positive definite matrices proposed by James Joseph Sylvester (1814-1897)
- Positive Semidefinite Extension: Prussing (1986) first provided the correct Sylvester's criterion for positive semidefinite matrices
- Grone et al. (1984): Theory of positive definite/semidefinite matrix completion on chordal graphs
- Barrett et al. (1989): Determinant formulas for matrix completion related to chordal graphs
- Johnson (1990): Survey on matrix completion problems
- Yamashita and Yabe (2015): Survey of numerical methods for nonlinear semidefinite programming
- Theoretical Breakthrough: Reduces complexity of positive semidefinite matrix determination from exponential to polynomial level
- Practical Value: Provides feasible tools for determining positive semidefiniteness of high-dimensional matrices
- Broad Applicability: Demonstrates practical utility in matrix completion and semidefinite programming
- Special Case Handling: Additional boundary case analysis required when certain submatrices are singular
- Computational Implementation: Although theoretical complexity is reduced, numerical stability must be considered in practical implementation
- High-Dimensional Extension: For extremely high-dimensional matrices, O(m2) complexity may still become a bottleneck
- Numerical Algorithms: Develop efficient and stable numerical implementation algorithms
- Parallel Computing: Leverage parallel computation for further efficiency improvements
- Application Extension: Explore applications in machine learning, signal processing, and related fields
- Strong Theoretical Innovation: Fundamentally improves efficiency of classical Sylvester's criterion
- High Mathematical Rigor: Provides complete theoretical proof system
- Significant Practical Value: Solves practical problems in determining positive semidefiniteness of high-dimensional matrices
- Rich Application Examples: Demonstrates method operability through concrete examples
- Insufficient Implementation Details: Lacks concrete numerical implementation algorithms and complexity analysis
- Missing Large-Scale Validation: Lacks large-scale numerical experiments validating theoretical advantages
- Complex Boundary Cases: Special case handling increases implementation complexity
- Significant Theoretical Contribution: Provides important theoretical tools for matrix theory
- Broad Application Prospects: Holds application potential in optimization, statistics, machine learning, and related fields
- Good Reproducibility: Theoretical results are fully reproducible, providing foundation for subsequent research
- High-Dimensional Covariance Matrix Analysis: Verification of covariance matrix positive semidefiniteness in statistics
- Semidefinite Programming Solution: Provides new constraint handling methods for semidefinite programming
- Matrix Completion Problems: Particularly suitable for matrix completion with non-chordal graph structures
- Machine Learning: Verification of positive semidefiniteness for kernel matrices and similarity matrices
The paper cites 18 related references covering classical and frontier work in matrix theory, semidefinite programming, matrix completion, and related fields, providing solid theoretical foundation for the research.
Overall Evaluation: This is a high-quality theoretical mathematics paper achieving important breakthroughs based on the classical Sylvester's criterion. Although lacking large-scale numerical experiments, its theoretical contributions and practical value make it an important advance in matrix theory.