← Math|97 of 100
Advanced Topics

Differential Geometry

Understand manifolds, geodesics, curvature, and their applications in optimization and relativity.

📂 Geometry📖 Lesson 97 of 100🎓 Free Course

Advertisement

Differential Geometry

ℹ️ Why It Matters

Differential geometry provides the mathematical framework for describing curved spaces. It is essential for general relativity (gravity as spacetime curvature), manifold learning (discovering low-dimensional structure in high-dimensional data), and geometric deep learning (extending neural networks to non-Euclidean domains like graphs and meshes). Understanding curvature, geodesics, and connections enables optimization on manifolds—rotational matrices, positive-definite matrices, and probability simplices—which appear throughout modern machine learning. The language of differential geometry unifies disparate geometric phenomena under a coherent mathematical framework.


Core Definitions

DfManifold

A topological space MM is a (topological) manifold of dimension nn if it is Hausdorff, second-countable, and locally homeomorphic to Rn\mathbb{R}^n. That is, for every point pMp \in M, there exists an open neighborhood UpU \ni p and a homeomorphism ϕ:UVRn\phi: U \to V \subseteq \mathbb{R}^n called a coordinate chart. A smooth manifold additionally requires that transition maps between overlapping charts are smooth (CC^\infty). Manifolds generalize curves, surfaces, and higher-dimensional spaces to abstract settings where calculus can be performed.

DfTangent Space

The tangent space TpMT_pM at a point pp on a manifold MM is the vector space of all tangent vectors at pp. It can be defined as the space of derivations on smooth functions at pp, or equivalently as equivalence classes of curves through pp. The tangent bundle TM=pMTpMTM = \bigcup_{p \in M} T_pM is the disjoint union of all tangent spaces and carries a natural smooth structure.

DfRiemannian Metric

A Riemannian metric on a smooth manifold MM is a smoothly varying inner product gpg_p on each tangent space TpMT_pM. In local coordinates, the metric is represented by a symmetric positive-definite matrix gij(x)g_{ij}(x) such that the infinitesimal distance is ds2=gijdxidxjds^2 = g_{ij}dx^i dx^j. The metric encodes all intrinsic geometric information: distances, angles, areas, and volumes.

DfGeodesic

A geodesic on a Riemannian manifold is a curve γ(t)\gamma(t) that is locally distance-minimizing. Formally, γ\gamma is a geodesic if its covariant derivative vanishes: γ˙γ˙=0\nabla_{\dot{\gamma}} \dot{\gamma} = 0. Geodesics generalize straight lines to curved spaces. On a sphere, geodesics are great circles; on a hyperbolic plane, geodesics are arcs of circles perpendicular to the boundary.

DfCurvature Tensor

The Riemann curvature tensor R jkliR^i_{\ jkl} measures how parallel transport around an infinitesimal loop fails to return a vector to its original orientation. For a vector vjv^j transported around a loop:

δvi=R jklivjdxkdxl\delta v^i = R^i_{\ jkl} v^j \, dx^k \wedge dx^l

The Riemann tensor has symmetries: Rijkl=Rjikl=Rijlk=RklijR_{ijkl} = -R_{jikl} = -R_{ijlk} = R_{klij}, and satisfies the first Bianchi identity Ri[jkl]=0R_{i[jkl]} = 0. In dimension nn, there are n2(n21)12\frac{n^2(n^2-1)}{12} independent components.

DfChristoffel Symbols

The Christoffel symbols of the second kind are defined in terms of the metric:

Γijk=12gkl(glixj+gljxigijxl)\Gamma^k_{ij} = \frac{1}{2}g^{kl}\left(\frac{\partial g_{li}}{\partial x^j} + \frac{\partial g_{lj}}{\partial x^i} - \frac{\partial g_{ij}}{\partial x^l}\right)

These are not tensors themselves but describe how basis vectors change from point to point. They appear in the geodesic equation and the covariant derivative. The Christoffel symbols encode the "connection" on the manifold—how to differentiate vector fields.

DfCovariant Derivative

The covariant derivative \nabla extends differentiation to tensor fields on a manifold. For a vector field VjV^j:

iVj=Vjxi+ΓikjVk\nabla_i V^j = \frac{\partial V^j}{\partial x^i} + \Gamma^j_{ik}V^k

The covariant derivative of a scalar is just the ordinary partial derivative. For tensors of higher rank, each index gets a Christoffel symbol correction. The covariant derivative is tensorial (produces a tensor from a tensor), unlike partial derivatives which are not coordinate-invariant.

DfLie Bracket

The Lie bracket of two vector fields XX and YY is defined as [X,Y]=XYYX[X, Y] = XY - YX (as differential operators). In coordinates: [X,Y]i=XjjYiYjjXi[X, Y]^i = X^j \partial_j Y^i - Y^j \partial_j X^i. The Lie bracket measures the failure of partial derivatives to commute and is related to the Lie algebra of a Lie group.


Key Formulas

Geodesic Equation

d2xidt2+Γjkidxjdtdxkdt=0\frac{d^2 x^i}{dt^2} + \Gamma^i_{jk}\frac{dx^j}{dt}\frac{dx^k}{dt} = 0

Here,

  • xi(t)x^i(t)=Coordinates of the geodesic curve as a function of parameter t
  • GammajkiGamma^i_{jk}=Christoffel symbols encoding the connection (how coordinates curve)
  • tt=Affine parameter along the geodesic

Arc Length on a Riemannian Manifold

L(γ)=abgijdxidtdxjdtdtL(\gamma) = \int_a^b \sqrt{g_{ij}\frac{dx^i}{dt}\frac{dx^j}{dt}} \, dt

Here,

  • gijg_{ij}=Riemannian metric tensor components
  • dxi/dtdx^i/dt=Tangent vector components along the curve
  • [a,b][a,b]=Parameter interval

Riemann Curvature Tensor (Christoffel Form)

R jkli=ΓjlixkΓjkixl+ΓkmiΓjlmΓlmiΓjkmR^i_{\ jkl} = \frac{\partial \Gamma^i_{jl}}{\partial x^k} - \frac{\partial \Gamma^i_{jk}}{\partial x^l} + \Gamma^i_{km}\Gamma^m_{jl} - \Gamma^i_{lm}\Gamma^m_{jk}

Here,

  • GammajkiGamma^i_{jk}=Christoffel symbols
  • RjkliR^i_{ jkl}=Riemann curvature tensor component

Ricci Curvature Tensor

Rjl=R jiliR_{jl} = R^i_{\ jil}

Here,

  • RjlR_{jl}=Ricci tensor (contraction of Riemann tensor)
  • RjiliR^i_{ jil}=Riemann tensor with first and third indices contracted

Scalar Curvature

R=gjlRjl\mathcal{R} = g^{jl}R_{jl}

Here,

  • mathcalRmathcal{R}=Scalar curvature (single number at each point)
  • gjlg^{jl}=Inverse metric tensor
  • RjlR_{jl}=Ricci tensor

Sectional Curvature

K(u,v)=Rijkluivjukvl(gikgjlgilgjk)uivjukvlK(u,v) = \frac{R_{ijkl}u^i v^j u^k v^l}{(g_{ik}g_{jl} - g_{il}g_{jk})u^i v^j u^k v^l}

Here,

  • u,vu, v=Linearly independent tangent vectors spanning a 2-plane
  • K(u,v)K(u,v)=Curvature of the 2-dimensional section spanned by u and v

Gaussian Curvature (Surface)

K=R1212g11g22g122K = \frac{R_{1212}}{g_{11}g_{22} - g_{12}^2}

Here,

  • R1212R_{1212}=Single independent component of Riemann tensor for a 2D surface
  • gijg_{ij}=Metric components on the surface

Geodesic Distance on a Sphere

d=Rarccos(p1p2R2)d = R \arccos\left(\frac{\mathbf{p}_1 \cdot \mathbf{p}_2}{R^2}\right)

Here,

  • RR=Radius of the sphere
  • mathbfp1,mathbfp2mathbf{p}_1, mathbf{p}_2=Position vectors of two points on the sphere

Einstein Field Equations

Rμν12Rgμν+Λgμν=8πGc4TμνR_{\mu\nu} - \frac{1}{2}Rg_{\mu\nu} + \Lambda g_{\mu\nu} = \frac{8\pi G}{c^4}T_{\mu\nu}

Here,

  • RmuuR_{mu u}=Ricci curvature tensor
  • RR=Scalar curvature
  • gmuug_{mu u}=Metric tensor
  • TmuuT_{mu u}=Stress-energy tensor (matter/energy content)
  • LambdaLambda=Cosmological constant

Important Theorems

ThTheorema Egregium (Gauss)

The Gaussian curvature of a surface is invariant under local isometries. That is, if two surfaces are locally isometric (distance-preserving map between them), they have the same Gaussian curvature at corresponding points. This means curvature is an intrinsic property of the surface, not dependent on how it is embedded in R3\mathbb{R}^3. A cylinder has zero Gaussian curvature (it can be unrolled flat), while a sphere has positive curvature.

ThGauss-Bonnet Theorem

For a compact 2-dimensional Riemannian manifold MM without boundary:

MKdA=2πχ(M)\int_M K \, dA = 2\pi \chi(M)

where KK is the Gaussian curvature, dAdA is the area element, and χ(M)\chi(M) = \beta_0 - \beta_1 + \beta_2 is the Euler characteristic. This relates the total curvature (geometry) to a topological invariant. For surfaces with boundary, the geodesic curvature of the boundary also contributes.

ThParallel Transport on a Sphere

Parallel transport of a vector along a closed geodesic triangle on a sphere of radius RR rotates the vector by an angle Δθ=AR2\Delta\theta = \frac{A}{R^2}, where AA is the area enclosed by the triangle. This demonstrates that curvature causes holonomy—parallel transport around a loop does not generally return the vector to its original orientation.

ThBonnet-Myers Theorem

If a complete Riemannian manifold MM of dimension nn has Ricci curvature bounded below by (n1)k>0(n-1)k > 0, then MM is compact and has diameter at most π/k\pi/\sqrt{k}. This provides a geometric constraint: positive Ricci curvature implies the manifold is "small" and cannot extend indefinitely.

ThCartan-Hadamard Theorem

If MM is a complete, simply connected Riemannian manifold with non-positive sectional curvature everywhere (K0K \leq 0), then MM is diffeomorphic to Rn\mathbb{R}^n via the exponential map at any point. Moreover, the exponential map is a diffeomorphism. This implies that simply connected spaces of non-positive curvature have no "holes" and behave like flat Euclidean space globally.


Worked Examples

📝Geodesic on a Sphere

Problem: Find the geodesic distance between New York City (40.7°N, 74°W) and London (51.5°N, 0.1°W) on Earth (radius R=6371R = 6371 km).

Solution:

Step 1: Convert to spherical coordinates in radians.

  • NYC: θ1=40.7°×π180=0.710\theta_1 = 40.7° \times \frac{\pi}{180} = 0.710 rad, ϕ1=74°×π180=1.291\phi_1 = -74° \times \frac{\pi}{180} = -1.291 rad
  • London: θ2=51.5°×π180=0.899\theta_2 = 51.5° \times \frac{\pi}{180} = 0.899 rad, ϕ2=0.1°×π180=0.002\phi_2 = -0.1° \times \frac{\pi}{180} = -0.002 rad

Step 2: Convert to Cartesian coordinates.

p1=R(sinθ1cosϕ1,sinθ1sinϕ1,cosθ1)\mathbf{p}_1 = R(\sin\theta_1\cos\phi_1, \sin\theta_1\sin\phi_1, \cos\theta_1)
p16371×(0.497,0.575,0.657)\mathbf{p}_1 \approx 6371 \times (0.497, -0.575, 0.657)
p2=R(sinθ2cosϕ2,sinθ2sinϕ2,cosθ2)\mathbf{p}_2 = R(\sin\theta_2\cos\phi_2, \sin\theta_2\sin\phi_2, \cos\theta_2)
p26371×(0.623,0.001,0.782)\mathbf{p}_2 \approx 6371 \times (0.623, -0.001, 0.782)

Step 3: Compute the dot product.

p1p2=R2[sinθ1sinθ2cos(ϕ1ϕ2)+cosθ1cosθ2]\mathbf{p}_1 \cdot \mathbf{p}_2 = R^2[\sin\theta_1\sin\theta_2\cos(\phi_1-\phi_2) + \cos\theta_1\cos\theta_2]
=R2[0.650×0.783×0.278+0.760×0.625]= R^2[0.650 \times 0.783 \times 0.278 + 0.760 \times 0.625]
=R2[0.141+0.475]=R2×0.616= R^2[0.141 + 0.475] = R^2 \times 0.616

Step 4: Compute geodesic distance.

d=Rarccos(0.616)=6371×0.906=5772 kmd = R \arccos(0.616) = 6371 \times 0.906 = 5772 \text{ km}

The actual great circle distance is approximately 5570 km (the slight difference comes from Earth's oblateness, which a sphere doesn't capture).

📝Parallel Transport on a Sphere

Problem: A vector v=(1,0)\mathbf{v} = (1, 0) pointing east is parallel transported along the equator from longitude 0° to longitude 90°E. What is the resulting vector?

Solution:

Step 1: Along the equator, the Christoffel symbols for spherical coordinates (θ,ϕ)(\theta, \phi) on a unit sphere are:

Γϕϕθ=sinθcosθ,Γθϕϕ=cotθ\Gamma^\theta_{\phi\phi} = -\sin\theta\cos\theta, \quad \Gamma^\phi_{\theta\phi} = \cot\theta

Step 2: On the equator, θ=π/2\theta = \pi/2, so Γθϕϕ=cot(π/2)=0\Gamma^\phi_{\theta\phi} = \cot(\pi/2) = 0.

Step 3: The parallel transport equation along the equator (parameterized by ϕ\phi) is:

dvϕdϕ+Γθϕϕvθ=0    dvϕdϕ=0\frac{dv^\phi}{d\phi} + \Gamma^\phi_{\theta\phi}v^\theta = 0 \implies \frac{dv^\phi}{d\phi} = 0

Step 4: Since the connection coefficient vanishes on the equator, the vector components remain constant in the coordinate basis. The vector remains tangent to the equator throughout transport.

Result: After transport to 90°E, the vector still points along the equator. In the local tangent space at the new point, it points in the direction of increasing ϕ\phi (east), which is consistent with parallel transport along the equator.

Key insight: Transport along a geodesic (the equator is a geodesic) preserves the angle between the vector and the geodesic tangent. The vector maintains its orientation relative to the local coordinate system.

📝Computing Gaussian Curvature

Problem: Compute the Gaussian curvature of a sphere of radius RR.

Solution:

Step 1: The metric on a sphere in coordinates (θ,ϕ)(\theta, \phi) is:

ds2=R2dθ2+R2sin2θdϕ2ds^2 = R^2 d\theta^2 + R^2\sin^2\theta \, d\phi^2

So gθθ=R2g_{\theta\theta} = R^2, gϕϕ=R2sin2θg_{\phi\phi} = R^2\sin^2\theta, gθϕ=0g_{\theta\phi} = 0.

Step 2: Compute Christoffel symbols. The only non-zero ones are:

Γϕϕθ=sinθcosθ,Γθϕϕ=Γϕθϕ=cotθ\Gamma^\theta_{\phi\phi} = -\sin\theta\cos\theta, \quad \Gamma^\phi_{\theta\phi} = \Gamma^\phi_{\phi\theta} = \cot\theta

Step 3: Compute the Riemann tensor component R ϕθϕθR^\theta_{\ \phi\theta\phi}:

R ϕθϕθ=ΓϕϕθθΓϕθθϕ+ΓθmθΓϕϕmΓϕmθΓϕθmR^\theta_{\ \phi\theta\phi} = \frac{\partial\Gamma^\theta_{\phi\phi}}{\partial\theta} - \frac{\partial\Gamma^\theta_{\phi\theta}}{\partial\phi} + \Gamma^\theta_{\theta m}\Gamma^m_{\phi\phi} - \Gamma^\theta_{\phi m}\Gamma^m_{\phi\theta}

Since Γϕθθ=0\Gamma^\theta_{\phi\theta} = 0 and Γθmθ=0\Gamma^\theta_{\theta m} = 0:

R ϕθϕθ=θ(sinθcosθ)0+0ΓϕϕθΓϕθϕR^\theta_{\ \phi\theta\phi} = \frac{\partial}{\partial\theta}(-\sin\theta\cos\theta) - 0 + 0 - \Gamma^\theta_{\phi\phi}\Gamma^\phi_{\phi\theta}
=(cos2θ+sin2θ)(sinθcosθ)(cotθ)= (-\cos^2\theta + \sin^2\theta) - (-\sin\theta\cos\theta)(\cot\theta)
=cos2θ+sin2θ+cos2θ=sin2θ= -\cos^2\theta + \sin^2\theta + \cos^2\theta = \sin^2\theta

Step 4: Lower the first index: Rθϕθϕ=gθθR ϕθϕθ=R2sin2θR_{\theta\phi\theta\phi} = g_{\theta\theta}R^\theta_{\ \phi\theta\phi} = R^2\sin^2\theta

Step 5: Gaussian curvature:

K=Rθϕθϕgθθgϕϕgθϕ2=R2sin2θR2R2sin2θ=1R2K = \frac{R_{\theta\phi\theta\phi}}{g_{\theta\theta}g_{\phi\phi} - g_{\theta\phi}^2} = \frac{R^2\sin^2\theta}{R^2 \cdot R^2\sin^2\theta} = \frac{1}{R^2}

Result: The Gaussian curvature of a sphere of radius RR is K=1/R2K = 1/R^2, which is constant everywhere on the sphere. This confirms that spheres have constant positive curvature.


Practice Problems

📝Problem 1: Geodesic on a Cylinder

Problem: A cylinder has the metric ds2=dz2+R2dϕ2ds^2 = dz^2 + R^2 d\phi^2. Find the geodesics.

Solution:

The Christoffel symbols for this metric are all zero because gzz=1g_{zz} = 1 and gϕϕ=R2g_{\phi\phi} = R^2 are constants.

The geodesic equation becomes:

d2zdt2=0,d2ϕdt2=0\frac{d^2 z}{dt^2} = 0, \quad \frac{d^2\phi}{dt^2} = 0

Solution: z(t)=at+bz(t) = at + b, ϕ(t)=ct+d\phi(t) = ct + d

In Cartesian coordinates on the unrolled cylinder (x=Rϕx = R\phi, y=zy = z):

x=Rct+Rd,y=at+bx = Rct + Rd, \quad y = at + b

Eliminating tt: y=aRc(xRd)+by = \frac{a}{Rc}(x - Rd) + b

Result: Geodesics on a cylinder are helices (straight lines when the cylinder is unrolled). This makes intuitive sense: the cylinder has zero Gaussian curvature and is locally isometric to the plane.

📝Problem 2: Curvature of a Torus

Problem: Compute the Gaussian curvature at the inner and outer points of a torus with major radius RR and minor radius rr.

Solution:

Step 1: Parametrize the torus:

x(θ,ϕ)=((R+rcosθ)cosϕ,(R+rcosθ)sinϕ,rsinθ)\mathbf{x}(\theta, \phi) = ((R + r\cos\theta)\cos\phi, (R + r\cos\theta)\sin\phi, r\sin\theta)

Step 2: Compute the metric:

gθθ=r2,gϕϕ=(R+rcosθ)2,gθϕ=0g_{\theta\theta} = r^2, \quad g_{\phi\phi} = (R + r\cos\theta)^2, \quad g_{\theta\phi} = 0

Step 3: Gaussian curvature (derived from the second fundamental form):

K=cosθr(R+rcosθ)K = \frac{\cos\theta}{r(R + r\cos\theta)}

Step 4: At the outermost point (θ=0\theta = 0):

Kouter=1r(R+r)=1rR+r2>0K_{\text{outer}} = \frac{1}{r(R + r)} = \frac{1}{rR + r^2} > 0

Step 5: At the innermost point (θ=π\theta = \pi):

Kinner=1r(Rr)=1rRr2<0K_{\text{inner}} = \frac{-1}{r(R - r)} = \frac{-1}{rR - r^2} < 0

Result: The torus has positive curvature on the outside and negative curvature on the inside, with zero curvature at the top and bottom (θ=π/2,3π/2\theta = \pi/2, 3\pi/2). The total curvature is KdA=0\int K \, dA = 0, consistent with the Euler characteristic of the torus being χ=0\chi = 0.

📝Problem 3: Christoffel Symbols for the Hyperbolic Plane

Problem: The hyperbolic plane in the Poincaré half-plane model has metric ds2=dx2+dy2y2ds^2 = \frac{dx^2 + dy^2}{y^2}. Compute the Christoffel symbols.

Solution:

Step 1: Identify the metric components:

gxx=1y2,gyy=1y2,gxy=0g_{xx} = \frac{1}{y^2}, \quad g_{yy} = \frac{1}{y^2}, \quad g_{xy} = 0

Step 2: Compute partial derivatives of the metric:

gxxy=2y3,gyyy=2y3\frac{\partial g_{xx}}{\partial y} = -\frac{2}{y^3}, \quad \frac{\partial g_{yy}}{\partial y} = -\frac{2}{y^3}

Step 3: Compute Christoffel symbols using Γijk=12gkl(jgli+igljlgij)\Gamma^k_{ij} = \frac{1}{2}g^{kl}(\partial_j g_{li} + \partial_i g_{lj} - \partial_l g_{ij}):

Since gxx=y2g^{xx} = y^2, gyy=y2g^{yy} = y^2, gxy=0g^{xy} = 0:

Γxxx=0,Γxyx=Γyxx=1y,Γyyx=0\Gamma^x_{xx} = 0, \quad \Gamma^x_{xy} = \Gamma^x_{yx} = -\frac{1}{y}, \quad \Gamma^x_{yy} = 0
Γxxy=1y,Γxyy=Γyxy=0,Γyyy=1y\Gamma^y_{xx} = \frac{1}{y}, \quad \Gamma^y_{xy} = \Gamma^y_{yx} = 0, \quad \Gamma^y_{yy} = -\frac{1}{y}

Step 4: Verify: The Gaussian curvature is K=1K = -1 everywhere, confirming constant negative curvature.

Result: The non-zero Christoffel symbols are Γxyx=1/y\Gamma^x_{xy} = -1/y, Γxxy=1/y\Gamma^y_{xx} = 1/y, and Γyyy=1/y\Gamma^y_{yy} = -1/y. These encode the "push" of geodesics toward the boundary y=0y = 0.

📝Problem 4: Geodesic Distance in Hyperbolic Space

Problem: Find the geodesic distance between two points in the Poincaré disk model of hyperbolic space.

Solution:

Step 1: In the Poincaré disk model D={zC:z<1}\mathbb{D} = \{z \in \mathbb{C} : |z| < 1\}, the metric is:

ds2=4dz2(1z2)2ds^2 = \frac{4|dz|^2}{(1-|z|^2)^2}

Step 2: The geodesic distance between points z1,z2Dz_1, z_2 \in \mathbb{D} is:

d(z1,z2)=arccosh(1+2z1z22(1z12)(1z22))d(z_1, z_2) = \text{arccosh}\left(1 + \frac{2|z_1 - z_2|^2}{(1-|z_1|^2)(1-|z_2|^2)}\right)

Step 3: For points at the origin z1=0z_1 = 0 and z2=rz_2 = r (real):

d(0,r)=arccosh(1+2r21(1r2))=arccosh(1+r21r2)d(0, r) = \text{arccosh}\left(1 + \frac{2r^2}{1 \cdot (1-r^2)}\right) = \text{arccosh}\left(\frac{1+r^2}{1-r^2}\right)

Step 4: Using arccosh(x)=ln(x+x21)\text{arccosh}(x) = \ln(x + \sqrt{x^2-1}):

d(0,r)=ln(1+r21r2+2r1r2)=ln((1+r)21r2)=ln(1+r1r)=2arctanh(r)d(0, r) = \ln\left(\frac{1+r^2}{1-r^2} + \frac{2r}{1-r^2}\right) = \ln\left(\frac{(1+r)^2}{1-r^2}\right) = \ln\left(\frac{1+r}{1-r}\right) = 2\text{arctanh}(r)

Result: The distance grows logarithmically as r1r \to 1, reflecting the infinite extent of hyperbolic space within the unit disk. This exponential growth of volume with radius is what makes hyperbolic space suitable for embedding hierarchical data.

📝Problem 5: Ricci Curvature of a 2-Sphere

Problem: Compute the Ricci tensor and scalar curvature of a 2-sphere of radius RR.

Solution:

Step 1: From the previous calculation, the Riemann tensor components are:

Rθϕθϕ=R2sin2θR_{\theta\phi\theta\phi} = R^2\sin^2\theta

Step 2: The Ricci tensor is Rjl=R jiliR_{jl} = R^i_{\ jil}:

Rθθ=R θϕθϕ=gϕϕRϕθϕθ=1R2sin2θR2sin2θ=1R_{\theta\theta} = R^\phi_{\ \theta\phi\theta} = g^{\phi\phi}R_{\phi\theta\phi\theta} = \frac{1}{R^2\sin^2\theta} \cdot R^2\sin^2\theta = 1
Rϕϕ=R ϕθϕθ=gθθRθϕθϕ=1R2R2sin2θ=sin2θR_{\phi\phi} = R^\theta_{\ \phi\theta\phi} = g^{\theta\theta}R_{\theta\phi\theta\phi} = \frac{1}{R^2} \cdot R^2\sin^2\theta = \sin^2\theta
Rθϕ=Rϕθ=0R_{\theta\phi} = R_{\phi\theta} = 0

Step 3: The Ricci tensor is Rjl=gjlR_{jl} = g_{jl} (proportional to the metric).

Step 4: Scalar curvature:

R=gjlRjl=gθθRθθ+gϕϕRϕϕ=1R21+1R2sin2θsin2θ=2R2\mathcal{R} = g^{jl}R_{jl} = g^{\theta\theta}R_{\theta\theta} + g^{\phi\phi}R_{\phi\phi} = \frac{1}{R^2} \cdot 1 + \frac{1}{R^2\sin^2\theta} \cdot \sin^2\theta = \frac{2}{R^2}

Result: The scalar curvature is R=2/R2\mathcal{R} = 2/R^2, and the Ricci tensor satisfies Rjl=1R2gjlR_{jl} = \frac{1}{R^2}g_{jl}, confirming that the 2-sphere is an Einstein manifold (Ricci tensor proportional to metric).

📝Problem 6: Exponential Map on a Sphere

Problem: Compute the exponential map expp:TpS2S2\text{exp}_p: T_pS^2 \to S^2 for a point pp on the north pole of a unit sphere.

Solution:

Step 1: At the north pole p=(0,0,1)p = (0, 0, 1), the tangent space TpS2T_pS^2 consists of vectors (v1,v2,0)(v_1, v_2, 0).

Step 2: The exponential map sends a tangent vector v=(v1,v2,0)\mathbf{v} = (v_1, v_2, 0) to the point on S2S^2 reached by following the geodesic starting at pp with initial velocity v\mathbf{v} for unit time.

Step 3: The geodesic from the north pole in direction v\mathbf{v} is a great circle. Parameterize:

γ(t)=cos(vt)p+sin(vt)vv\gamma(t) = \cos(\|\mathbf{v}\|t) \cdot p + \sin(\|\mathbf{v}\|t) \cdot \frac{\mathbf{v}}{\|\mathbf{v}\|}

Step 4: At t=1t = 1:

expp(v)=cos(v)(0,0,1)+sin(v)(v1,v2,0)v\text{exp}_p(\mathbf{v}) = \cos(\|\mathbf{v}\|)(0, 0, 1) + \sin(\|\mathbf{v}\|)\frac{(v_1, v_2, 0)}{\|\mathbf{v}\|}
=(v1sinvv,v2sinvv,cosv)= \left(\frac{v_1 \sin\|\mathbf{v}\|}{\|\mathbf{v}\|}, \frac{v_2 \sin\|\mathbf{v}\|}{\|\mathbf{v}\|}, \cos\|\mathbf{v}\|\right)

Step 5: For v=(π,0,0)\mathbf{v} = (\pi, 0, 0):

expp(v)=(0,0,cosπ)=(0,0,1)\text{exp}_p(\mathbf{v}) = (0, 0, \cos\pi) = (0, 0, -1)

This is the south pole, reached by traveling a geodesic of length π\pi from the north pole.

Result: The exponential map is a diffeomorphism from the open ball of radius π\pi in TpS2T_pS^2 to S2{p}S^2 \setminus \{-p\}. It fails to be injective at v=π\|\mathbf{v}\| = \pi (antipodal point), which is the cut locus.

📝Problem 7: Length of a Curve on a Surface

Problem: Compute the length of the helix γ(t)=(cost,sint,t)\gamma(t) = (\cos t, \sin t, t) for t[0,2π]t \in [0, 2\pi] on the cylinder x2+y2=1x^2 + y^2 = 1.

Solution:

Step 1: The cylinder metric in coordinates (θ,z)(\theta, z) is ds2=dθ2+dz2ds^2 = d\theta^2 + dz^2 (since R=1R = 1).

Step 2: The helix in these coordinates is θ(t)=t\theta(t) = t, z(t)=tz(t) = t.

Step 3: The tangent vector is (θ˙,z˙)=(1,1)(\dot\theta, \dot z) = (1, 1).

Step 4: The speed is:

γ˙=gθθθ˙2+gzzz˙2=11+11=2\|\dot\gamma\| = \sqrt{g_{\theta\theta}\dot\theta^2 + g_{zz}\dot z^2} = \sqrt{1 \cdot 1 + 1 \cdot 1} = \sqrt{2}

Step 5: The length is:

L=02πγ˙dt=02π2dt=2π2L = \int_0^{2\pi} \|\dot\gamma\| \, dt = \int_0^{2\pi} \sqrt{2} \, dt = 2\pi\sqrt{2}

Result: The helix has length 2π22\pi\sqrt{2}. This is longer than the straight line along the cylinder (2π2\pi for one revolution), reflecting the extra distance traveled vertically.

📝Problem 8: Inverse Metric Tensor Computation

Problem: Compute the inverse metric tensor for the metric ds2=2dx2+4dxdy+3dy2ds^2 = 2dx^2 + 4dxdy + 3dy^2.

Solution:

Step 1: Identify the metric matrix:

gij=(2223)g_{ij} = \begin{pmatrix} 2 & 2 \\ 2 & 3 \end{pmatrix}

Note: gxy=gyx=2g_{xy} = g_{yx} = 2 (symmetric).

Step 2: Compute the determinant:

det(g)=2322=64=2\det(g) = 2 \cdot 3 - 2 \cdot 2 = 6 - 4 = 2

Step 3: Compute the inverse:

gij=12(3222)=(1.5111)g^{ij} = \frac{1}{2}\begin{pmatrix} 3 & -2 \\ -2 & 2 \end{pmatrix} = \begin{pmatrix} 1.5 & -1 \\ -1 & 1 \end{pmatrix}

Step 4: Verify: gikgkj=δjig^{ik}g_{kj} = \delta^i_j

g11g11+g12g21=1.5×2+(1)×2=32=1=δ11g^{11}g_{11} + g^{12}g_{21} = 1.5 \times 2 + (-1) \times 2 = 3 - 2 = 1 = \delta^1_1

g11g12+g12g22=1.5×2+(1)×3=33=0=δ21g^{11}g_{12} + g^{12}g_{22} = 1.5 \times 2 + (-1) \times 3 = 3 - 3 = 0 = \delta^1_2

Result: The inverse metric is gij=(1.5111)g^{ij} = \begin{pmatrix} 1.5 & -1 \\ -1 & 1 \end{pmatrix}. This is used to raise indices: vi=gijvjv^i = g^{ij}v_j.

📝Problem 9: Christoffel Symbols for Polar Coordinates

Problem: Compute the Christoffel symbols for the Euclidean plane in polar coordinates (r,θ)(r, \theta).

Solution:

Step 1: The metric in polar coordinates is ds2=dr2+r2dθ2ds^2 = dr^2 + r^2 d\theta^2.

So grr=1g_{rr} = 1, gθθ=r2g_{\theta\theta} = r^2, grθ=0g_{r\theta} = 0.

Step 2: Inverse metric: grr=1g^{rr} = 1, gθθ=1/r2g^{\theta\theta} = 1/r^2, grθ=0g^{r\theta} = 0.

Step 3: Compute Christoffel symbols using Γijk=12gkl(jgli+igljlgij)\Gamma^k_{ij} = \frac{1}{2}g^{kl}(\partial_j g_{li} + \partial_i g_{lj} - \partial_l g_{ij}):

Γθθr=12grr(rgθθ)=12(1)(2r)=r\Gamma^r_{\theta\theta} = \frac{1}{2}g^{rr}(-\partial_r g_{\theta\theta}) = \frac{1}{2}(1)(-2r) = -rΓrθθ=Γθrθ=12gθθ(rgθθ)=121r2(2r)=1r\Gamma^\theta_{r\theta} = \Gamma^\theta_{\theta r} = \frac{1}{2}g^{\theta\theta}(\partial_r g_{\theta\theta}) = \frac{1}{2}\frac{1}{r^2}(2r) = \frac{1}{r}

All other Christoffel symbols are zero.

Step 4: Verify with the geodesic equation. For a straight line in polar coordinates, the geodesic equation gives:

r¨rθ˙2=0,θ¨+2rr˙θ˙=0\ddot{r} - r\dot{\theta}^2 = 0, \quad \ddot{\theta} + \frac{2}{r}\dot{r}\dot{\theta} = 0

These are the correct equations for straight-line motion in polar coordinates.

Result: The non-zero Christoffel symbols are Γθθr=r\Gamma^r_{\theta\theta} = -r and Γrθθ=1/r\Gamma^\theta_{r\theta} = 1/r. These encode how the polar coordinate system "curves" relative to Cartesian coordinates.

📝Problem 10: Sectional Curvature of a Surface of Revolution

Problem: Compute the Gaussian curvature of the surface of revolution z=f(r)z = f(r) where r=x2+y2r = \sqrt{x^2+y^2}.

Solution:

Step 1: Parametrize the surface: x(r,θ)=(rcosθ,rsinθ,f(r))\mathbf{x}(r, \theta) = (r\cos\theta, r\sin\theta, f(r)).

Step 2: Compute the metric:

E=xrxr=1+f(r)2,F=xrxθ=0,G=xθxθ=r2E = \mathbf{x}_r \cdot \mathbf{x}_r = 1 + f'(r)^2, \quad F = \mathbf{x}_r \cdot \mathbf{x}_\theta = 0, \quad G = \mathbf{x}_\theta \cdot \mathbf{x}_\theta = r^2

Step 3: The Gaussian curvature for a surface of revolution is:

K=f(r)r(1+f(r)2)2K = -\frac{f''(r)}{r(1 + f'(r)^2)^2}

Step 4: For a sphere of radius RR: f(r)=R2r2f(r) = \sqrt{R^2 - r^2}, f(r)=rR2r2f'(r) = \frac{-r}{\sqrt{R^2-r^2}}, f(r)=R2(R2r2)3/2f''(r) = \frac{-R^2}{(R^2-r^2)^{3/2}}.

K=R2/(R2r2)3/2r(1+r2/(R2r2))2=R2r(R2r2)3/2(R2r2)2R4=1R2K = -\frac{-R^2/(R^2-r^2)^{3/2}}{r(1 + r^2/(R^2-r^2))^2} = \frac{R^2}{r(R^2-r^2)^{3/2}} \cdot \frac{(R^2-r^2)^2}{R^4} = \frac{1}{R^2}

This confirms the constant curvature K=1/R2K = 1/R^2 for the sphere.

Result: The Gaussian curvature of a surface of revolution depends only on ff and its derivatives. For a paraboloid z=r2z = r^2: K=4/(1+4r2)2K = -4/(1+4r^2)^2, which is negative everywhere.

📝Problem 11: Geodesics on the Hyperbolic Plane

Problem: Find the geodesics in the Poincaré half-plane model H2={(x,y):y>0}\mathbb{H}^2 = \{(x,y) : y > 0\} with metric ds2=(dx2+dy2)/y2ds^2 = (dx^2 + dy^2)/y^2.

Solution:

Step 1: The Christoffel symbols were computed earlier:

Γxyx=1/y,Γxxy=1/y,Γyyy=1/y\Gamma^x_{xy} = -1/y, \quad \Gamma^y_{xx} = 1/y, \quad \Gamma^y_{yy} = -1/y

Step 2: The geodesic equations are:

x¨2yx˙y˙=0,y¨+1yx˙21yy˙2=0\ddot{x} - \frac{2}{y}\dot{x}\dot{y} = 0, \quad \ddot{y} + \frac{1}{y}\dot{x}^2 - \frac{1}{y}\dot{y}^2 = 0

Step 3: The geodesics are:

  1. Vertical lines: x=cx = c (half-lines perpendicular to the boundary y=0y = 0)
  2. Semicircles: (xa)2+y2=R2(x-a)^2 + y^2 = R^2 (arcs of circles centered on the boundary)

Step 4: For a vertical geodesic x=cx = c, parameterize as (c,y(t))(c, y(t)). The geodesic equation gives y¨y˙2/y=0\ddot{y} - \dot{y}^2/y = 0, with solution y(t)=et+c0y(t) = e^{t+c_0}.

Step 5: For a semicircular geodesic (xa)2+y2=R2(x-a)^2 + y^2 = R^2, parameterize as (a+Rcosθ,Rsinθ)(a + R\cos\theta, R\sin\theta). The arc length element is ds=dθ/sinθds = d\theta/\sin\theta, so the geodesic parameter is θ=2arctan(et)\theta = 2\arctan(e^t).

Result: Geodesics in H2\mathbb{H}^2 are vertical lines and semicircles perpendicular to the boundary. This is the foundation of hyperbolic geometry and is used in hyperbolic embeddings for hierarchical data.


Common Mistakes

MistakeCorrect Approach
Assuming geodesics are always the shortest pathsGeodesics are locally distance-minimizing; global minima require additional analysis
Confusing the Christoffel symbols with tensor componentsChristoffel symbols are NOT tensors; they transform inhomogeneously under coordinate changes
Using Euclidean geometry on curved surfacesAlways account for the metric: ds2=gijdxidxjdxidxids^2 = g_{ij}dx^idx^j \neq dx^idx^i in general
Assuming parallel transport preserves vector orientationOn curved manifolds, parallel transport around a loop causes rotation (holonomy)
Forgetting that curvature is intrinsicGaussian curvature can be computed from the metric alone, without reference to the embedding space
Confusing extrinsic and intrinsic curvatureExtrinsic curvature depends on embedding; intrinsic curvature (Gaussian) does not
Assuming Rijkl=RikjlR_{ijkl} = R_{ikjl}The Riemann tensor has specific symmetries: Rijkl=Rjikl=Rijlk=RklijR_{ijkl} = -R_{jikl} = -R_{ijlk} = R_{klij}

Connections to Machine Learning

ℹ️ Connections to Machine Learning

Differential geometry underlies several ML paradigms: (1) Hyperbolic embeddings represent hierarchical data (taxonomies, knowledge graphs) in Hn\mathbb{H}^n, where distance grows exponentially with depth, matching tree-like structure naturally. (2) Riemannian optimization generalizes gradient descent to manifolds of constrained structure (rotation matrices, positive-definite covariance matrices), using retractions and vector transport instead of Euclidean updates. (3) Geometric deep learning extends CNNs to non-Euclidean domains (graphs, meshes) by defining convolutions via the Laplace-Beltrami operator, which depends on the manifold's curvature. (4) Information geometry treats the space of probability distributions as a Riemannian manifold with the Fisher information metric, leading to natural gradient descent. (5) General relativity uses the Einstein field equations Gμν=8πTμνG_{\mu\nu} = 8\pi T_{\mu\nu}, relating spacetime curvature to matter-energy, which has inspired energy-based models in ML.


Exam/Interview Questions

Q1: What is the difference between intrinsic and extrinsic curvature?

Answer: Intrinsic curvature (Gaussian curvature) is determined entirely by the metric tensor and measurements made within the manifold. It is invariant under isometries (Theorema Egregium). Extrinsic curvature depends on how the manifold is embedded in a higher-dimensional space. For example, a cylinder has zero intrinsic curvature (it can be unrolled flat) but non-zero extrinsic curvature in R3\mathbb{R}^3.


Q2: Write the geodesic equation and explain each term.

Answer: d2xidt2+Γjkidxjdtdxkdt=0\frac{d^2x^i}{dt^2} + \Gamma^i_{jk}\frac{dx^j}{dt}\frac{dx^k}{dt} = 0. The first term is the acceleration. The second term involves Christoffel symbols that encode how the coordinate system curves; they act as "fictitious forces" in curved coordinates. On a flat manifold in Cartesian coordinates, all Γjki=0\Gamma^i_{jk} = 0, and the equation reduces to x¨i=0\ddot{x}^i = 0 (straight lines).


Q3: Why can't we do optimization directly on a manifold using standard gradient descent?

Answer: Standard gradient descent moves in the direction of steepest descent in the ambient Euclidean space, which may leave the manifold. For example, optimizing a rotation matrix by adding a Euclidean gradient step produces a matrix that is no longer orthogonal. Riemannian optimization uses retractions (maps from the tangent space back to the manifold) and vector transport (moving tangent vectors between tangent spaces) to stay on the manifold while descending.


Q4: What is the significance of the Gauss-Bonnet theorem?

Answer: The Gauss-Bonnet theorem MKdA=2πχ(M)\int_M K \, dA = 2\pi\chi(M) connects geometry (curvature KK) to topology (Euler characteristic χ\chi). It implies that the total curvature of a closed surface is a topological invariant—you cannot change it by smoothly deforming the surface. For example, a sphere always has total curvature 4π4\pi regardless of its shape, and a torus always has total curvature 00.


Q5: How does hyperbolic geometry benefit representation learning?

Answer: Hyperbolic space has exponential volume growth: the number of points at distance rr grows as e(n1)re^{(n-1)r}, matching the exponential growth of nodes in a tree. Hierarchical data (taxonomies, organizational charts, phylogenetic trees) can be embedded in hyperbolic space with arbitrarily low distortion using far fewer dimensions than Euclidean space requires. The Poincaré disk model allows gradient-based optimization of these embeddings.


Q6: What is the Laplace-Beltrami operator, and why is it important for graph neural networks?

Answer: The Laplace-Beltrami operator ΔMf=div(gradf)\Delta_M f = \text{div}(\text{grad} \, f) generalizes the Laplacian to manifolds. On a graph, the combinatorial Laplacian L=DAL = D - A (degree matrix minus adjacency matrix) is the discrete analog. Its eigenvectors provide a Fourier basis on the graph, enabling spectral graph convolutions that are the foundation of spectral graph neural networks (ChebNet, GCN).


Quick Reference

ConceptFormulaKey Insight
Metric Tensords2=gijdxidxjds^2 = g_{ij}dx^idx^jDefines distances and angles
Christoffel SymbolsΓijk=12gkl(jgli+igljlgij)\Gamma^k_{ij} = \frac{1}{2}g^{kl}(\partial_j g_{li} + \partial_i g_{lj} - \partial_l g_{ij})Encode connection
Geodesic Equationx¨i+Γjkix˙jx˙k=0\ddot{x}^i + \Gamma^i_{jk}\dot{x}^j\dot{x}^k = 0Generalizes straight lines
Riemann TensorR jkli=kΓjlilΓjki+ΓkmiΓjlmΓlmiΓjkmR^i_{\ jkl} = \partial_k\Gamma^i_{jl} - \partial_l\Gamma^i_{jk} + \Gamma^i_{km}\Gamma^m_{jl} - \Gamma^i_{lm}\Gamma^m_{jk}Measures curvature
Ricci TensorRjl=R jiliR_{jl} = R^i_{\ jil}Contraction of Riemann tensor
Scalar CurvatureR=gjlRjl\mathcal{R} = g^{jl}R_{jl}Single number summarizing curvature
Gaussian CurvatureK=R1212/(g11g22g122)K = R_{1212}/(g_{11}g_{22} - g_{12}^2)Intrinsic curvature of surfaces
Sphere Distanced=Rarccos(p1p2/R2)d = R\arccos(\mathbf{p}_1\cdot\mathbf{p}_2/R^2)Great circle distance
Hyperbolic Distanced=arccosh(1+2z1z22/((1z12)(1z22)))d = \text{arccosh}(1 + 2|z_1-z_2|^2/((1-|z_1|^2)(1-|z_2|^2)))Poincaré disk distance
Gauss-BonnetMKdA=2πχ(M)\int_M K \, dA = 2\pi\chi(M)Geometry equals topology
Einstein Field EquationsRμν12Rgμν=8πTμνR_{\mu\nu} - \frac{1}{2}Rg_{\mu\nu} = 8\pi T_{\mu\nu}Curvature = matter-energy

Cross-References

Lesson Progress97 / 100