📋Key Takeaways
- Covariance measures the joint variability of two variables. Positive means they co-move; negative means they move in opposite directions; zero means no linear relationship.
- Correlation normalizes covariance to a unitless scale, making it comparable across different variable pairs. indicates a perfect linear relationship.
- Correlation ≠ Causation: Correlation measures association, not causation. Confounding variables, reverse causation, and coincidence can all produce spurious correlations.
- Uncorrelated ≠ Independent: Zero correlation only rules out linear dependence. Non-linear dependencies (e.g., ) can exist even when . Only for bivariate normal distributions does imply independence.
- Covariance Matrix is symmetric and positive semi-definite. Its diagonal contains variances, off-diagonals contain covariances. Eigenvalue decomposition of is the foundation of PCA.
- Applications: Feature selection (multicollinearity), PCA (dimensionality reduction), portfolio optimization (Markowitz), Gaussian distributions, attention mechanisms, and natural gradient methods all rely on covariance and correlation.