← Math|22 of 100
Linear Algebra

Advanced Topics in Linear Algebra

Explore tensor products, matrix exponentials, Jordan normal form, quadratic forms, and advanced decompositions with applications in quantum computing, deep learning, and dynamical systems.

📂 Advanced📖 Lesson 22 of 100🎓 Free Course

Advertisement


Tensor Products

DfTensor Product (Kronecker Product)

For matrices ARm×nA \in \mathbb{R}^{m \times n} and BRp×qB \in \mathbb{R}^{p \times q}, the Kronecker product is:

Kronecker Product

AB=[a11Ba12Ba1nBa21Ba22Ba2nBam1Bam2BamnB]A \otimes B = \begin{bmatrix} a_{11}B & a_{12}B & \cdots & a_{1n}B \\ a_{21}B & a_{22}B & \cdots & a_{2n}B \\ \vdots & \vdots & \ddots & \vdots \\ a_{m1}B & a_{m2}B & \cdots & a_{mn}B \end{bmatrix}

Here,

  • AA=Left matrix of size m × n
  • BB=Right matrix of size p × q
  • AotimesBA otimes B=Resulting matrix of size mp × nq

The result is a block matrix where each block aijBa_{ij}B is a p×qp \times q submatrix scaled by aija_{ij}.

ThProperties of Kronecker Product

  1. Mixed-product property: (AB)(CD)=(AC)(BD)(A \otimes B)(C \otimes D) = (AC) \otimes (BD)
  2. Distributivity: A(B+C)=AB+ACA \otimes (B + C) = A \otimes B + A \otimes C
  3. Transposition: (AB)T=ATBT(A \otimes B)^T = A^T \otimes B^T
  4. Inverse: (AB)1=A1B1(A \otimes B)^{-1} = A^{-1} \otimes B^{-1} (when both exist)
  5. Eigenvalues: If AA has eigenvalues λi\lambda_i and BB has eigenvalues μj\mu_j, then ABA \otimes B has eigenvalues λiμj\lambda_i \mu_j

ℹ️ Outer Product vs Kronecker Product

The outer product of two vectors uRm\vec{u} \in \mathbb{R}^m and vRn\vec{v} \in \mathbb{R}^n is uvTRm×n\vec{u}\vec{v}^T \in \mathbb{R}^{m \times n}, a rank-1 matrix. The Kronecker product generalizes this to arbitrary matrices and produces a larger block matrix.

📝Kronecker Product Example

Let A=[1234]A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix} and B=[5678]B = \begin{bmatrix} 5 & 6 \\ 7 & 8 \end{bmatrix}.

AB=[1B2B3B4B]=[5610127814161518202421242832]A \otimes B = \begin{bmatrix} 1 \cdot B & 2 \cdot B \\ 3 \cdot B & 4 \cdot B \end{bmatrix} = \begin{bmatrix} 5 & 6 & 10 & 12 \\ 7 & 8 & 14 & 16 \\ 15 & 18 & 20 & 24 \\ 21 & 24 & 28 & 32 \end{bmatrix}

💡 Applications in Quantum Computing

Multi-qubit quantum gates are constructed via Kronecker products. A 2-qubit gate is G=G1G2G = G_1 \otimes G_2 where G1,G2G_1, G_2 are single-qubit gates. The CNOT gate cannot be decomposed as a Kronecker product of single-qubit gates — it is an entangling gate.


Matrix Exponential

DfMatrix Exponential

For any square matrix ARn×nA \in \mathbb{R}^{n \times n}:

Matrix Exponential

eA=k=0Akk!=I+A+A22!+A33!+e^A = \sum_{k=0}^{\infty} \frac{A^k}{k!} = I + A + \frac{A^2}{2!} + \frac{A^3}{3!} + \cdots

Here,

  • AA=Square matrix of size n × n
  • II=Identity matrix of same size
  • AkA^k=Matrix power (A multiplied by itself k times)
  • k!k!=Factorial of k

This series converges absolutely for all matrices AA.

ThKey Properties of the Matrix Exponential

  1. Identity: e0=Ie^0 = I
  2. Inverse: eA=(eA)1e^{-A} = (e^A)^{-1}
  3. Determinant: det(eA)=etr(A)\det(e^A) = e^{\text{tr}(A)}
  4. Commutativity: If AB=BAAB = BA, then eA+B=eAeBe^{A+B} = e^A e^B (fails when ABBAAB \neq BA)
  5. Similarity: ePAP1=PeAP1e^{PAP^{-1}} = Pe^AP^{-1}
  6. Derivative: ddteAt=AeAt=eAtA\frac{d}{dt}e^{At} = Ae^{At} = e^{At}A

ℹ️ Applications of the Matrix Exponential

  • Solving linear ODEs: For x˙=Ax\dot{x} = Ax, the solution is x(t)=eAtx(0)x(t) = e^{At}x(0)
  • Quantum mechanics: Time evolution U(t)=eiHt/U(t) = e^{-iHt/\hbar} where HH is the Hamiltonian
  • Control theory: State transition matrix Φ(t)=eAt\Phi(t) = e^{At} for linear time-invariant systems
  • Merton model: Portfolio value evolution in continuous-time finance

📝Computing $e^A$ via Diagonalization

For A=[4103]A = \begin{bmatrix} 4 & 1 \\ 0 & 3 \end{bmatrix}:

Step 1: Eigenvalues are λ1=4,λ2=3\lambda_1 = 4, \lambda_2 = 3 with eigenvectors v1=[10],v2=[11]\vec{v}_1 = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \vec{v}_2 = \begin{bmatrix} 1 \\ -1 \end{bmatrix}.

Step 2: A=PDP1A = PDP^{-1} where P=[1101]P = \begin{bmatrix} 1 & 1 \\ 0 & -1 \end{bmatrix}, D=[4003]D = \begin{bmatrix} 4 & 0 \\ 0 & 3 \end{bmatrix}.

Step 3: eA=PeDP1=P[e400e3]P1e^A = Pe^DP^{-1} = P\begin{bmatrix} e^4 & 0 \\ 0 & e^3 \end{bmatrix}P^{-1}

eA=[e4e4e30e3]e^A = \begin{bmatrix} e^4 & e^4 - e^3 \\ 0 & e^3 \end{bmatrix}

Matrix Logarithm

DfMatrix Logarithm

The matrix logarithm is the inverse of the matrix exponential: if eB=Ae^B = A, then B=ln(A)B = \ln(A). It exists if and only if AA is invertible and has no eigenvalues on the negative real axis (principal logarithm).

Matrix Logarithm via Diagonalization

ln(A)=Pln(D)P1=P[ln(λ1)ln(λn)]P1\ln(A) = P \ln(D) P^{-1} = P \begin{bmatrix} \ln(\lambda_1) & & \\ & \ddots & \\ & & \ln(\lambda_n) \end{bmatrix} P^{-1}

Here,

  • AA=Invertible matrix with positive eigenvalues
  • PP=Matrix of eigenvectors
  • DD=Diagonal matrix of eigenvalues
  • λi\lambda_i=Eigenvalues of A (must be positive for principal log)

ℹ️ Applications

  • Solving matrix equations: Find BB such that eB=Ae^B = A
  • Multivariate statistics: Log-normal distributions involve matrix logarithms of covariance matrices
  • Information geometry: The Kullback-Leibler divergence on Gaussian distributions uses matrix log

Matrix Powers

Matrix Powers via Diagonalization

Ak=PDkP1=P[λ1kλnk]P1A^k = PD^kP^{-1} = P \begin{bmatrix} \lambda_1^k & & \\ & \ddots & \\ & & \lambda_n^k \end{bmatrix} P^{-1}

Here,

  • AA=Diagonalizable matrix
  • PP=Matrix of eigenvectors
  • DD=Diagonal matrix of eigenvalues
  • kk=Non-negative integer power

💡 Why This Matters

For large kk, computing AkA^k directly by multiplication is O(n3k)O(n^3 k). Via diagonalization, it is O(n3)O(n^3) regardless of kk — the eigenvalues are simply raised to the power kk. This is critical for Markov chains, dynamical systems, and iterative algorithms.

📝Fibonacci via Matrix Powers

The recurrence Fn+1=Fn+Fn1F_{n+1} = F_n + F_{n-1} can be written as:

[Fn+1Fn]=[1110]n[F1F0]\begin{bmatrix} F_{n+1} \\ F_n \end{bmatrix} = \begin{bmatrix} 1 & 1 \\ 1 & 0 \end{bmatrix}^n \begin{bmatrix} F_1 \\ F_0 \end{bmatrix}

The matrix M=[1110]M = \begin{bmatrix} 1 & 1 \\ 1 & 0 \end{bmatrix} has eigenvalues ϕ=1+52\phi = \frac{1+\sqrt{5}}{2} (golden ratio) and ψ=152\psi = \frac{1-\sqrt{5}}{2}, giving the closed-form Binet formula:

Fn=ϕnψn5F_n = \frac{\phi^n - \psi^n}{\sqrt{5}}

Generalized Eigenvalue Problem

DfGeneralized Eigenvalue Problem

Given two square matrices A,BRn×nA, B \in \mathbb{R}^{n \times n}, find scalars λ\lambda and non-zero vectors v\vec{v} satisfying Av=λBvA\vec{v} = \lambda B\vec{v}. This reduces to the standard eigenvalue problem when B=IB = I.

Generalized Eigenvalue Equation

det(AλB)=0\det(A - \lambda B) = 0

Here,

  • AA=Symmetric matrix (often the stiffness or data matrix)
  • BB=Symmetric positive definite matrix (often a mass or weight matrix)
  • λ\lambda=Generalized eigenvalue
  • v\vec{v}=Generalized eigenvector

ThProperties of Generalized Eigenvalues

  1. If AA and BB are symmetric and BB is positive definite, all eigenvalues λi\lambda_i are real
  2. Eigenvectors are BB-orthogonal: viTBvj=0\vec{v}_i^T B \vec{v}_j = 0 for iji \neq j
  3. The generalized eigenvalues equal the eigenvalues of B1AB^{-1}A
  4. Can be computed via the QZ algorithm (generalized Schur decomposition)

ℹ️ Applications

  • Structural engineering: Vibration analysis Ku=ω2MuK\vec{u} = \omega^2 M\vec{u} (stiffness vs mass matrices)
  • Machine learning: Linear discriminant analysis (LDA) maximizes wTSBw/wTSWw\vec{w}^T S_B \vec{w} / \vec{w}^T S_W \vec{w}
  • Canonical correlation analysis: Finding correlated pairs between two data sets

Matrix Functions

DfMatrix Functions

For a function ff and a diagonalizable matrix A=PDP1A = PDP^{-1}:

Matrix Function via Diagonalization

f(A)=Pf(D)P1=P[f(λ1)f(λn)]P1f(A) = P f(D) P^{-1} = P \begin{bmatrix} f(\lambda_1) & & \\ & \ddots & \\ & & f(\lambda_n) \end{bmatrix} P^{-1}

Here,

  • AA=Diagonalizable matrix
  • ff=Analytic function (e.g., sin, cosh, log, sqrt)
  • PP=Matrix of eigenvectors
  • DD=Diagonal matrix of eigenvalues

This extends scalar functions to matrices: eAe^A, sin(A)\sin(A), cos(A)\cos(A), A\sqrt{A}, ln(A)\ln(A), sgn(A)\text{sgn}(A), and fractional powers AαA^\alpha.

💡 When Diagonalization Fails

If AA is not diagonalizable, use the Jordan canonical form A=PJP1A = PJP^{-1} and apply ff to each Jordan block:

f(Ji)=[f(λ)f(λ)f(λ)2!f(λ)f(λ)f(λ)]f(J_i) = \begin{bmatrix} f(\lambda) & f'(\lambda) & \frac{f''(\lambda)}{2!} \\ & f(\lambda) & f'(\lambda) \\ & & f(\lambda) \end{bmatrix}

The off-diagonal entries involve derivatives of ff evaluated at the eigenvalue.


Quadratic Forms

DfQuadratic Form

For a symmetric matrix ARn×nA \in \mathbb{R}^{n \times n}, the quadratic form maps xRn\vec{x} \in \mathbb{R}^n to a scalar:

Quadratic Form

Q(x)=xTAx=i=1nj=1naijxixjQ(\vec{x}) = \vec{x}^T A \vec{x} = \sum_{i=1}^{n}\sum_{j=1}^{n} a_{ij} x_i x_j

Here,

  • AA=Symmetric n × n matrix
  • x\vec{x}=Vector in R^n
  • Q(x)Q(\vec{x})=Scalar output (real number)

ThClassification of Quadratic Forms

The quadratic form xTAx\vec{x}^T A \vec{x} is classified by the signs of the eigenvalues of AA:

ConditionClassificationAll Q(x)0Q(\vec{x}) \neq 0
All λi>0\lambda_i > 0Positive definiteQ(x)>0Q(\vec{x}) > 0
All λi0\lambda_i \geq 0Positive semi-definiteQ(x)0Q(\vec{x}) \geq 0
All λi<0\lambda_i < 0Negative definiteQ(x)<0Q(\vec{x}) < 0
All λi0\lambda_i \leq 0Negative semi-definiteQ(x)0Q(\vec{x}) \leq 0
Mixed signsIndefiniteTakes both signs

Equivalent conditions for positive definiteness: all leading principal minors >0> 0 (Sylvester's criterion).

ℹ️ Applications

  • Optimization: The Hessian matrix H=2fH = \nabla^2 f defines a quadratic form near critical points; positive definite \Rightarrow local minimum
  • Statistics: The Mahalanobis distance d2=(xμ)TΣ1(xμ)d^2 = (\vec{x} - \vec{\mu})^T \Sigma^{-1} (\vec{x} - \vec{\mu}) is a quadratic form
  • Physics: Kinetic energy T=12q˙TMq˙T = \frac{1}{2}\dot{\vec{q}}^T M \dot{\vec{q}} is a quadratic form in the mass matrix MM

Rayleigh Quotient

Rayleigh Quotient

R(A,x)=xTAxxTxR(A, \vec{x}) = \frac{\vec{x}^T A \vec{x}}{\vec{x}^T \vec{x}}

Here,

  • AA=Symmetric matrix
  • x\vec{x}=Non-zero vector
  • R(A,x)R(A, \vec{x})=Scalar ratio (Rayleigh quotient)

ThProperties of the Rayleigh Quotient

  1. Bounds: λminR(A,x)λmax\lambda_{\min} \leq R(A, \vec{x}) \leq \lambda_{\max} for all x0\vec{x} \neq \vec{0}
  2. Stationary points: R(A,x)R(A, \vec{x}) is minimized at eigenvectors corresponding to λmin\lambda_{\min} and maximized at eigenvectors corresponding to λmax\lambda_{\max}
  3. Scale invariance: R(A,αx)=R(A,x)R(A, \alpha\vec{x}) = R(A, \vec{x}) for all α0\alpha \neq 0
  4. Rayleigh quotient iteration: xk+1=(AσkI)1xk\vec{x}_{k+1} = (A - \sigma_k I)^{-1}\vec{x}_k with σk=R(A,xk)\sigma_k = R(A, \vec{x}_k) converges cubically to an eigenvector

💡 Applications

  • Iterative eigensolvers: The Rayleigh quotient iteration is one of the fastest algorithms for finding individual eigenvectors
  • Vibration analysis: Natural frequency ω2=uTKuuTMu\omega^2 = \frac{\vec{u}^T K \vec{u}}{\vec{u}^T M \vec{u}} is a generalized Rayleigh quotient
  • PCA: The first principal component maximizes the Rayleigh quotient of the covariance matrix

Polar Decomposition

DfPolar Decomposition

Any square matrix ARn×nA \in \mathbb{R}^{n \times n} can be decomposed as:

Polar Decomposition

A=UPA = UP

Here,

  • UU=Orthogonal matrix (rotation/reflection), U^T U = I
  • PP=Positive semi-definite symmetric matrix, P = (A^T A)^{1/2}

When AA is invertible, UU is unique and PP is positive definite. The left polar decomposition is A=PUA = PU' where P=(AAT)1/2P = (AA^T)^{1/2}.

ThExistence and Uniqueness

  1. Every square matrix has a polar decomposition
  2. If AA is invertible, both UU and PP are unique
  3. If AA is singular, UU is unique only on the range of AA
  4. For an invertible matrix: U=A(ATA)1/2U = A(A^TA)^{-1/2}, P=(ATA)1/2P = (A^TA)^{1/2}

ℹ️ Geometric Interpretation

Polar decomposition separates a linear transformation into a rotation/reflection (UU) followed by a scaling along principal axes (PP). This is analogous to writing a complex number as z=reiθz = re^{i\theta} (magnitude and phase).


Jordan Normal Form

DfJordan Normal Form

When a matrix AA is defective (lacks enough eigenvectors for diagonalization), the Jordan normal form provides the closest possible simplification:

Jordan Canonical Form

A=PJP1,J=[J1Jk]A = PJP^{-1}, \quad J = \begin{bmatrix} J_1 & & \\ & \ddots & \\ & & J_k \end{bmatrix}

Here,

  • AA=Any n × n matrix over an algebraically closed field
  • PP=Invertible matrix of generalized eigenvectors
  • JJ=Jordan form — block diagonal with Jordan blocks
  • JiJ_i=Jordan block for eigenvalue λ_i

Each Jordan block has the form:

Ji=[λi1λi11λi]J_i = \begin{bmatrix} \lambda_i & 1 & & \\ & \lambda_i & 1 & \\ & & \ddots & 1 \\ & & & \lambda_i \end{bmatrix}

ThProperties of Jordan Normal Form

  1. Every square matrix has a Jordan form (over an algebraically closed field)
  2. The number of Jordan blocks equals the number of linearly independent eigenvectors
  3. The algebraic multiplicity of λ\lambda equals the total size of all Jordan blocks for λ\lambda
  4. The geometric multiplicity equals the number of Jordan blocks for λ\lambda
  5. A matrix is diagonalizable iff all Jordan blocks are 1×11 \times 1

ℹ️ When Is Jordan Form Needed?

Jordan form is theoretically essential but numerically unstable — small perturbations can destroy the block structure. In practice, Schur decomposition is preferred for numerical computation. Jordan form is most useful for:

  • Analyzing nilpotent matrices (Jk=0J^k = 0)
  • Computing matrix functions analytically
  • Studying stability of defective systems in control theory

Python Implementation

💡 Numerical Stability

scipy.linalg.expm uses Pade approximation with scaling and squaring — more numerically stable than computing the Taylor series directly. For large matrices, scipy.sparse.linalg.expm handles sparse systems efficiently.


Applications in AI/ML


Common Mistakes

MistakeCorrection
Assuming eA+B=eAeBe^{A+B} = e^A e^B alwaysThis only holds when AB=BAAB = BA; for non-commuting matrices, use the Baker-Campbell-Hausdorff formula
Using Jordan form for numerical computationJordan form is numerically unstable; use Schur decomposition for floating-point arithmetic
Treating tensor product as element-wise multiplicationABA \otimes B produces a block matrix of size mp×nqmp \times nq, not ABA \cdot B which requires compatible dimensions
Assuming ln(eA)=A\ln(e^A) = A alwaysThe matrix logarithm is multi-valued; ln(eA)=A+2πikI\ln(e^A) = A + 2\pi i k I for integer kk in the complex case
Ignoring positive definiteness in quadratic formsAlways verify eigenvalues of HH before concluding a critical point is a minimum; positive semi-definite is not sufficient
Confusing algebraic and geometric multiplicityGeometric multiplicity \leq algebraic multiplicity; they differ for defective matrices
Forgetting that polar decomposition requires PP to be PSDP=(ATA)1/2P = (A^TA)^{1/2} must be the positive semi-definite square root, not just any square root

Interview Questions

📝Question 1: Matrix Exponential Intuition

Q: Why does eA+BeAeBe^{A+B} \neq e^A e^B when AA and BB don't commute? Give a simple example.

💡Solution

A: When ABBAAB \neq BA, the order of multiplication matters. For A=[0100]A = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix} and B=[0010]B = \begin{bmatrix} 0 & 0 \\ 1 & 0 \end{bmatrix}:

eA=I+A=[1101]e^A = I + A = \begin{bmatrix} 1 & 1 \\ 0 & 1 \end{bmatrix}, eB=I+B=[1011]e^B = I + B = \begin{bmatrix} 1 & 0 \\ 1 & 1 \end{bmatrix}

eAeB=[2111]e^A e^B = \begin{bmatrix} 2 & 1 \\ 1 & 1 \end{bmatrix} but eA+B=e[0110]=[cosh1sinh1sinh1cosh1][1.5431.1751.1751.543]e^{A+B} = e^{\begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}} = \begin{bmatrix} \cosh 1 & \sinh 1 \\ \sinh 1 & \cosh 1 \end{bmatrix} \approx \begin{bmatrix} 1.543 & 1.175 \\ 1.175 & 1.543 \end{bmatrix}

These differ, confirming the non-commutativity effect.

📝Question 2: Quadratic Form Classification

Q: Classify the quadratic form Q(x1,x2)=3x12+4x1x2+3x22Q(x_1, x_2) = 3x_1^2 + 4x_1 x_2 + 3x_2^2.

💡Solution

A: The associated symmetric matrix is A=[3223]A = \begin{bmatrix} 3 & 2 \\ 2 & 3 \end{bmatrix}. Eigenvalues: det(AλI)=(3λ)24=0λ=1,5\det(A - \lambda I) = (3-\lambda)^2 - 4 = 0 \Rightarrow \lambda = 1, 5. Both eigenvalues positive, so QQ is positive definite.

📝Question 3: Jordan Form vs Diagonalization

Q: When does a matrix fail to be diagonalizable? How does Jordan form handle this?

💡Solution

A: A matrix fails to diagonalize when it is defective — the geometric multiplicity of some eigenvalue is less than its algebraic multiplicity. For example, A=[2102]A = \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix} has eigenvalue λ=2\lambda = 2 with algebraic multiplicity 2 but only one eigenvector. Jordan form gives J=[2102]J = \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix} — a single Jordan block. The 11 in the super-diagonal captures the "near-diagonal" structure.

📝Question 4: Polar Decomposition Geometry

Q: What does polar decomposition tell us about a matrix geometrically?

💡Solution

A: Polar decomposition A=UPA = UP separates any linear map into a rotation/reflection UU (orthogonal) followed by a scaling along principal axes PP (positive semi-definite). This reveals the "pure stretching" part (PP) and the "pure rotation" part (UU) of a transformation. For example, if AA represents a stretch along one axis plus a rotation, polar decomposition isolates these two effects.

📝Question 5: Rayleigh Quotient Optimization

Q: How can the Rayleigh quotient be used to find eigenvalues?

💡Solution

A: The Rayleigh quotient R(A,x)=xTAx/xTxR(A, \vec{x}) = \vec{x}^T A \vec{x} / \vec{x}^T \vec{x} has stationary points exactly at eigenvectors of AA, where it equals the corresponding eigenvalue. The Rayleigh quotient iteration xk+1=(AσkI)1xk\vec{x}_{k+1} = (A - \sigma_k I)^{-1}\vec{x}_k with σk=R(A,xk)\sigma_k = R(A, \vec{x}_k) converges cubically to an eigenvector — one of the fastest known iterative methods.


Practice Problems

📝Problem 1: Matrix Exponential of a Nilpotent Matrix

Compute eAe^A for A=[010001000]A = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{bmatrix}.

💡Solution

Since A3=0A^3 = 0 (nilpotent), the series truncates exactly:

eA=I+A+A22=[1112011001]e^A = I + A + \frac{A^2}{2} = \begin{bmatrix} 1 & 1 & \frac{1}{2} \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{bmatrix}

📝Problem 2: Kronecker Product Eigenvalues

If AA has eigenvalues 2,32, 3 and BB has eigenvalues 4,54, 5, what are the eigenvalues of ABA \otimes B?

💡Solution

The eigenvalues of ABA \otimes B are all pairwise products: 2×4=82 \times 4 = 8, 2×5=102 \times 5 = 10, 3×4=123 \times 4 = 12, 3×5=153 \times 5 = 15.

📝Problem 3: Positive Definiteness

Is A=[2112]A = \begin{bmatrix} 2 & 1 \\ 1 & 2 \end{bmatrix} positive definite? Verify using Sylvester's criterion.

💡Solution

Sylvester's criterion: all leading principal minors must be positive.

  • M1=2>0M_1 = 2 > 0
  • M2=det[2112]=41=3>0M_2 = \det\begin{bmatrix} 2 & 1 \\ 1 & 2 \end{bmatrix} = 4 - 1 = 3 > 0

Yes, AA is positive definite.

📝Problem 4: Generalized Eigenvalue Problem

Solve Av=λBvA\vec{v} = \lambda B\vec{v} for A=[4003]A = \begin{bmatrix} 4 & 0 \\ 0 & 3 \end{bmatrix} and B=[1002]B = \begin{bmatrix} 1 & 0 \\ 0 & 2 \end{bmatrix}.

💡Solution

Since both matrices are diagonal, the generalized eigenvalues are λi=aii/bii\lambda_i = a_{ii}/b_{ii}:

λ1=4/1=4\lambda_1 = 4/1 = 4, λ2=3/2=1.5\lambda_2 = 3/2 = 1.5

Eigenvectors are the standard basis: v1=[10]\vec{v}_1 = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, v2=[01]\vec{v}_2 = \begin{bmatrix} 0 \\ 1 \end{bmatrix}.


Quick Reference

ConceptFormulaKey Property
Kronecker ProductABA \otimes BSize mp×nqmp \times nq, eigenvalues λiμj\lambda_i \mu_j
Matrix ExponentialeA=Akk!e^A = \sum \frac{A^k}{k!}det(eA)=etr(A)\det(e^A) = e^{\text{tr}(A)}
Matrix Logarithmln(A)=Pln(D)P1\ln(A) = P \ln(D) P^{-1}Inverse of eAe^A
Matrix PowerAk=PDkP1A^k = PD^kP^{-1}O(n3)O(n^3) via diagonalization
Generalized EigenAv=λBvA\vec{v} = \lambda B\vec{v}det(AλB)=0\det(A - \lambda B) = 0
Matrix Functionf(A)=Pf(D)P1f(A) = Pf(D)P^{-1}Applies ff to each eigenvalue
Quadratic FormxTAx\vec{x}^T A \vec{x}Positive definite iff all λi>0\lambda_i > 0
Rayleigh QuotientR=xTAx/xTxR = \vec{x}^T A \vec{x} / \vec{x}^T \vec{x}λminRλmax\lambda_{\min} \leq R \leq \lambda_{\max}
Polar DecompositionA=UPA = UPRotation UU + Scaling PP
Jordan Normal FormA=PJP1A = PJP^{-1}Handles defective matrices

Cross-References

  • Eigenvalues and Eigenvectors: Foundation for all matrix functions — A=QΛQTA = Q\Lambda Q^T enables f(A)=Qf(Λ)QTf(A) = Q f(\Lambda) Q^T
  • Singular Value Decomposition: A=UΣVTA = U\Sigma V^T is the polar decomposition of AA when written as (UVT)(VΣVT)(U V^T)(V \Sigma V^T)
  • Positive Definite Matrices: Quadratic forms and Rayleigh quotients rely on positive definiteness
  • Matrix Calculus: Derivatives of matrix exponentials and quadratic forms appear in optimization
  • Optimization: The Hessian determines whether critical points are minima/maxima via quadratic form classification
  • Probability and Statistics: Covariance matrices are positive semi-definite; Mahalanobis distance uses quadratic forms
  • Numerical Methods: Pade approximation computes eAe^A; Schur decomposition avoids Jordan form instability
  • Discrete Mathematics: Graph Laplacians and their eigenvalues power spectral graph theory and GNNs
Lesson Progress22 / 100