Basis and Dimension

ℹ️ Why It Matters

Basis and dimension are foundational concepts in linear algebra that determine how we represent, compress, and transform data. Every data point in machine learning exists in a high-dimensional vector space, and choosing the right basis — the right coordinate system — can reveal hidden structure, reduce computational cost, and improve model performance. PCA finds the basis of maximum variance; Fourier transforms use frequency basis to decompose signals; wavelet basis enables multi-resolution analysis. Understanding basis and dimension is essential for feature engineering, dimensionality reduction, and efficient computation in AI/ML pipelines.

Vector Space Axioms

DfVector Space

A vector space over a field $\mathbb{F}$ (typically $\mathbb{R}$ or $\mathbb{C}$ ) is a set $V$ together with two operations — vector addition ( $+$ ) and scalar multiplication ( $\cdot$ ) — satisfying the following eight axioms for all $\vec{u}, \vec{v}, \vec{w} \in V$ and all $a, b \in \mathbb{F}$ :

Closure under addition: $\vec{u} + \vec{v} \in V$
Commutativity: $\vec{u} + \vec{v} = \vec{v} + \vec{u}$
Associativity of addition: $(\vec{u} + \vec{v}) + \vec{w} = \vec{u} + (\vec{v} + \vec{w})$
Additive identity: There exists $\vec{0} \in V$ such that $\vec{v} + \vec{0} = \vec{v}$
Additive inverse: For each $\vec{v} \in V$ , there exists $-\vec{v} \in V$ such that $\vec{v} + (-\vec{v}) = \vec{0}$
Closure under scalar multiplication: $a \cdot \vec{v} \in V$
Distributivity: $a \cdot (\vec{u} + \vec{v}) = a \cdot \vec{u} + a \cdot \vec{v}$ and $(a + b) \cdot \vec{v} = a \cdot \vec{v} + b \cdot \vec{v}$
Compatibility of scalar multiplication: $a \cdot (b \cdot \vec{v}) = (ab) \cdot \vec{v}$ , and $1 \cdot \vec{v} = \vec{v}$

📝Examples of Vector Spaces

$\mathbb{R}^n$ : The set of all $n$ -tuples of real numbers with componentwise addition and scalar multiplication. This is the canonical example.

Polynomials of degree $\leq n$ : The set $P_n = \{a_0 + a_1 x + \cdots + a_n x^n : a_i \in \mathbb{R}\}$ with polynomial addition and scalar multiplication.

Matrices: The set $M_{m \times n}(\mathbb{R})$ of all $m \times n$ real matrices with matrix addition and scalar multiplication.

Function spaces: The set of all continuous functions $C[0,1]$ with pointwise addition and scalar multiplication: $(f+g)(x) = f(x) + g(x)$ , $(cf)(x) = c \cdot f(x)$ .

📝Non-Example

The set $\{(x, y) \in \mathbb{R}^2 : x \geq 0, y \geq 0\}$ (first quadrant) is not a vector space because it is not closed under additive inverses — e.g., $(1,1) \in V$ but $-(1,1) = (-1,-1) \notin V$ .

Subspace

DfSubspace

A subspace of a vector space $V$ is a non-empty subset $W \subseteq V$ that is itself a vector space under the same operations. Equivalently, $W$ is a subspace if and only if:

$\vec{0} \in W$ (contains the zero vector)
If $\vec{u}, \vec{v} \in W$ , then $\vec{u} + \vec{v} \in W$ (closed under addition)
If $\vec{v} \in W$ and $c \in \mathbb{R}$ , then $c\vec{v} \in W$ (closed under scalar multiplication)

📝Examples of Subspaces

Line through the origin: $W = \{t\vec{v} : t \in \mathbb{R}\}$ for any fixed $\vec{v} \in \mathbb{R}^n$ .

The set of symmetric matrices: $W = \{A \in M_{n \times n}(\mathbb{R}) : A = A^T\}$ is a subspace of $M_{n \times n}(\mathbb{R})$ .

The set of solutions to $A\vec{x} = \vec{0}$ : The null space $\ker(A)$ is always a subspace of $\mathbb{R}^n$ .

📝Non-Subspace

The set $\{(x, y) \in \mathbb{R}^2 : x + y = 1\}$ is not a subspace because $\vec{0} \notin W$ (since $0 + 0 \neq 1$ ).

Linear Combinations

DfLinear Combination

A linear combination of vectors $\vec{v_1}, \vec{v_2}, \ldots, \vec{v_k}$ is any expression of the form:

c_1 \vec{v_1} + c_2 \vec{v_2} + \cdots + c_k \vec{v_k}

where $c_1, c_2, \ldots, c_k$ are scalars (called weights or coefficients).

📝Example

Let $\vec{v_1} = \begin{bmatrix}1\\0\end{bmatrix}$ , $\vec{v_2} = \begin{bmatrix}0\\1\end{bmatrix}$ .

Then $\begin{bmatrix}3\\-2\end{bmatrix} = 3\vec{v_1} + (-2)\vec{v_2}$ is a linear combination of $\vec{v_1}$ and $\vec{v_2}$ .

The vector $\begin{bmatrix}1\\1\\1\end{bmatrix}$ cannot be written as a linear combination of $\vec{v_1}$ and $\vec{v_2}$ because they live in $\mathbb{R}^2$ , not $\mathbb{R}^3$ .

ℹ️ Geometric Intuition

In $\mathbb{R}^2$ , the set of all linear combinations of two non-parallel vectors is the entire plane. If the vectors are parallel (scalar multiples of each other), their linear combinations form only a line through the origin.

Span

DfSpan

The span of a set of vectors $\{\vec{v_1}, \vec{v_2}, \ldots, \vec{v_k}\}$ is the set of all linear combinations:

\text{span}\{\vec{v_1}, \vec{v_2}, \ldots, \vec{v_k}\} = \{c_1 \vec{v_1} + c_2 \vec{v_2} + \cdots + c_k \vec{v_k} : c_i \in \mathbb{R}\}

The span is always a subspace — it is the smallest subspace containing all the given vectors.

📝Finding the Span

Let $\vec{v_1} = \begin{bmatrix}1\\2\\3\end{bmatrix}$ , $\vec{v_2} = \begin{bmatrix}4\\5\\6\end{bmatrix}$ .

$\text{span}\{\vec{v_1}, \vec{v_2}\}$ is the plane in $\mathbb{R}^3$ passing through the origin and containing both vectors. Since $\vec{v_1}$ and $\vec{v_2}$ are not parallel, this is a 2-dimensional subspace of $\mathbb{R}^3$ .

📝Span of Standard Basis

$\text{span}\{\vec{e_1}, \vec{e_2}, \vec{e_3}\}$ in $\mathbb{R}^3$ is all of $\mathbb{R}^3$ , since any vector $\begin{bmatrix}a\\b\\c\end{bmatrix} = a\vec{e_1} + b\vec{e_2} + c\vec{e_3}$ .

Linear Independence

DfLinear Independence

A set of vectors $\{\vec{v_1}, \vec{v_2}, \ldots, \vec{v_k}\}$ is linearly independent if the only solution to:

c_1 \vec{v_1} + c_2 \vec{v_2} + \cdots + c_k \vec{v_k} = \vec{0}

is $c_1 = c_2 = \cdots = c_k = 0$ .

If a non-trivial solution exists (some $c_i \neq 0$ ), the set is linearly dependent.

ThTest for Linear Independence

A set of vectors $\{\vec{v_1}, \vec{v_2}, \ldots, \vec{v_k}\}$ in $\mathbb{R}^n$ is linearly independent if and only if the matrix $A = [\vec{v_1} \mid \vec{v_2} \mid \cdots \mid \vec{v_k}]$ has a pivot in every column (i.e., $\text{rank}(A) = k$ ).

📝Linearly Independent

$\left\{\begin{bmatrix}1\\0\\0\end{bmatrix}, \begin{bmatrix}0\\1\\0\end{bmatrix}, \begin{bmatrix}0\\0\\1\end{bmatrix}\right\}$ is linearly independent because:

c_1 \begin{bmatrix}1\\0\\0\end{bmatrix} + c_2 \begin{bmatrix}0\\1\\0\end{bmatrix} + c_3 \begin{bmatrix}0\\0\\1\end{bmatrix} = \begin{bmatrix}c_1\\c_2\\c_3\end{bmatrix} = \begin{bmatrix}0\\0\\0\end{bmatrix}

implies $c_1 = c_2 = c_3 = 0$ .

📝Linearly Dependent

$\left\{\begin{bmatrix}1\\2\end{bmatrix}, \begin{bmatrix}2\\4\end{bmatrix}\right\}$ is linearly dependent because $\begin{bmatrix}2\\4\end{bmatrix} = 2 \cdot \begin{bmatrix}1\\2\end{bmatrix}$ , so $2\vec{v_1} - \vec{v_2} = \vec{0}$ with non-zero coefficients.

ThKey Properties

Any set containing the zero vector is linearly dependent.
A set of $k$ vectors in $\mathbb{R}^n$ with $k > n$ is always linearly dependent.
A set with one vector is linearly independent if and only if that vector is non-zero.

Basis

DfBasis

A basis for a vector space $V$ is a set of vectors $\{\vec{v_1}, \vec{v_2}, \ldots, \vec{v_n}\}$ that is:

Linearly independent, and
Spans $V$ (i.e., $\text{span}\{\vec{v_1}, \ldots, \vec{v_n}\} = V$ )

In other words, a basis is a minimal spanning set and a maximal linearly independent set.

📝Standard Basis

For $\mathbb{R}^n$ , the standard basis is:

\vec{e_1} = \begin{bmatrix}1\\0\\\vdots\\0\end{bmatrix}, \vec{e_2} = \begin{bmatrix}0\\1\\\vdots\\0\end{bmatrix}, \ldots, \vec{e_n} = \begin{bmatrix}0\\0\\\vdots\\1\end{bmatrix}

For $\mathbb{R}^2$ : $\vec{e_1} = \begin{bmatrix}1\\0\end{bmatrix}$ , $\vec{e_2} = \begin{bmatrix}0\\1\end{bmatrix}$ .

Every vector $\vec{x} = \begin{bmatrix}x_1\\x_2\\\vdots\\x_n\end{bmatrix}$ can be uniquely written as $\vec{x} = x_1\vec{e_1} + x_2\vec{e_2} + \cdots + x_n\vec{e_n}$ .

📝Non-Standard Basis for R2

The vectors $\vec{v_1} = \begin{bmatrix}1\\1\end{bmatrix}$ and $\vec{v_2} = \begin{bmatrix}1\\-1\end{bmatrix}$ also form a basis for $\mathbb{R}^2$ because:

They are linearly independent (not scalar multiples).
They span $\mathbb{R}^2$ : $\begin{bmatrix}a\\b\end{bmatrix} = \frac{a+b}{2}\vec{v_1} + \frac{a-b}{2}\vec{v_2}$ .

⚠️ Uniqueness

A basis is not unique — any vector space has infinitely many bases. However, all bases for a given vector space have the same number of vectors (this is the dimension).

Dimension

DfDimension

The dimension of a vector space $V$ , denoted $\dim(V)$ , is the number of vectors in any basis for $V$ .

ThDimension Theorems

$\dim(\mathbb{R}^n) = n$ .
If $V$ has a basis with $n$ vectors, then every basis for $V$ has exactly $n$ vectors.
If $\dim(V) = n$ , then any set of more than $n$ vectors in $V$ is linearly dependent.
If $\dim(V) = n$ , then any set of fewer than $n$ vectors cannot span $V$ .
If $\dim(V) = n$ , then any linearly independent set of $n$ vectors is automatically a basis.
If $\dim(V) = n$ , then any spanning set of $n$ vectors is automatically a basis.

📝Dimension Examples

$\dim(\mathbb{R}^3) = 3$ . Standard basis: $\{\vec{e_1}, \vec{e_2}, \vec{e_3}\}$ .
$\dim(P_2) = 3$ (polynomials of degree $\leq 2$ ). Basis: $\{1, x, x^2\}$ .
$\dim(M_{2 \times 2}(\mathbb{R})) = 4$ . Basis: $\left\{\begin{bmatrix}1&0\\0&0\end{bmatrix}, \begin{bmatrix}0&1\\0&0\end{bmatrix}, \begin{bmatrix}0&0\\1&0\end{bmatrix}, \begin{bmatrix}0&0\\0&1\end{bmatrix}\right\}$ .
The subspace $\text{span}\left\{\begin{bmatrix}1\\2\\3\end{bmatrix}, \begin{bmatrix}4\\5\\6\end{bmatrix}\right\}$ in $\mathbb{R}^3$ has dimension 2 (a plane through the origin).

Change of Basis

DfChange of Basis

Let $B = \{\vec{v_1}, \ldots, \vec{v_n}\}$ and $B' = \{\vec{w_1}, \ldots, \vec{w_n}\}$ be two bases for $\mathbb{R}^n$ . The change-of-basis matrix from $B$ to $B'$ is:

P_{B' \leftarrow B} = \begin{bmatrix} [\vec{v_1}]_{B'} & [\vec{v_2}]_{B'} & \cdots & [\vec{v_n}]_{B'} \end{bmatrix}

Then for any vector $\vec{x}$ :

[\vec{x}]_{B'} = P_{B' \leftarrow B} \cdot [\vec{x}]_B

📝Change of Basis Example

Let $B = \left\{\begin{bmatrix}1\\1\end{bmatrix}, \begin{bmatrix}1\\-1\end{bmatrix}\right\}$ and $B' = \left\{\begin{bmatrix}1\\0\end{bmatrix}, \begin{bmatrix}0\\1\end{bmatrix}\right\}$ (standard basis).

To find $P_{B' \leftarrow B}$ , express each vector of $B$ in the standard basis:

P_{B' \leftarrow B} = \begin{bmatrix}1&1\\1&-1\end{bmatrix}

For $\vec{x} = \begin{bmatrix}3\\1\end{bmatrix}$ in the standard basis:

[\vec{x}]_B = P_{B' \leftarrow B}^{-1} \vec{x} = \frac{1}{-2}\begin{bmatrix}-1&-1\\-1&1\end{bmatrix}\begin{bmatrix}3\\1\end{bmatrix} = \begin{bmatrix}2\\1\end{bmatrix}

Verification: $2\begin{bmatrix}1\\1\end{bmatrix} + 1\begin{bmatrix}1\\-1\end{bmatrix} = \begin{bmatrix}3\\1\end{bmatrix}$ ✓

import numpy as np

# Change of basis
B = np.array([[1, 1], [1, -1]])  # New basis vectors as columns
B_inv = np.linalg.inv(B)

# Vector in standard basis
x_standard = np.array([3, 1])

# Coordinates in basis B
x_in_B = B_inv @ x_standard
print(f"Coordinates in new basis: {x_in_B}")  # [2. 1.]

# Verify: reconstruct from new coordinates
x_reconstructed = B @ x_in_B
print(f"Reconstructed: {x_reconstructed}")  # [3. 1.]

Rank

DfRank

The rank of a matrix $A$ , denoted $\text{rank}(A)$ , is the dimension of the column space (equivalently, the row space):

\text{rank}(A) = \dim(\text{col}(A)) = \dim(\text{row}(A))

It equals the number of pivot positions in the row echelon form of $A$ .

ThRank-Nullity Theorem

For an $m \times n$ matrix $A$ :

\text{rank}(A) + \text{nullity}(A) = n

where $\text{nullity}(A) = \dim(\ker(A))$ is the dimension of the null space.

📝Computing Rank

Let $A = \begin{bmatrix}1&2&3\\4&5&6\\7&8&9\end{bmatrix}$ .

Row reduce: $\begin{bmatrix}1&2&3\\4&5&6\\7&8&9\end{bmatrix} \xrightarrow{R_2 - 4R_1, R_3 - 7R_1} \begin{bmatrix}1&2&3\\0&-3&-6\\0&-6&-12\end{bmatrix} \xrightarrow{R_3 - 2R_2} \begin{bmatrix}1&2&3\\0&-3&-6\\0&0&0\end{bmatrix}$

Two pivots, so $\text{rank}(A) = 2$ . By rank-nullity: $\text{nullity}(A) = 3 - 2 = 1$ .

Null Space

DfNull Space

The null space (or kernel) of an $m \times n$ matrix $A$ is the set of all solutions to $A\vec{x} = \vec{0}$ :

\ker(A) = \{\vec{x} \in \mathbb{R}^n : A\vec{x} = \vec{0}\}

The null space is always a subspace of $\mathbb{R}^n$ .

📝Finding the Null Space

Let $A = \begin{bmatrix}1&2&0\\0&0&1\\0&0&0\end{bmatrix}$ .

Solve $A\vec{x} = \vec{0}$ :

Row 1: $x_1 + 2x_2 = 0 \Rightarrow x_1 = -2x_2$
Row 2: $x_3 = 0$
$x_2$ is free.

\ker(A) = \text{span}\left\{\begin{bmatrix}-2\\1\\0\end{bmatrix}\right\}

The null space is a line through the origin in $\mathbb{R}^3$ with dimension 1 (nullity = 1).

import numpy as np
from scipy.linalg import null_space

A = np.array([[1, 2, 0],
              [0, 0, 1],
              [0, 0, 0]])

ns = null_space(A)
print("Null space basis:")
print(ns)
# [[-0.89442719]
#  [ 0.4472136 ]
#  [ 0.        ]]

Column Space

DfColumn Space

The column space (or range) of an $m \times n$ matrix $A$ is the span of its columns:

\text{col}(A) = \{A\vec{x} : \vec{x} \in \mathbb{R}^n\} = \text{span}\{\vec{a_1}, \vec{a_2}, \ldots, \vec{a_n}\}

The column space is a subspace of $\mathbb{R}^m$ .

ThConsistency of Linear Systems

The system $A\vec{x} = \vec{b}$ has a solution if and only if $\vec{b} \in \text{col}(A)$ .

📝Column Space and Ax = b

Let $A = \begin{bmatrix}1&0\\0&1\\1&1\end{bmatrix}$ . The column space is the plane $\left\{\begin{bmatrix}a\\b\\a+b\end{bmatrix} : a, b \in \mathbb{R}\right\}$ in $\mathbb{R}^3$ .

$A\vec{x} = \begin{bmatrix}1\\2\\3\end{bmatrix}$ : Is $\begin{bmatrix}1\\2\\3\end{bmatrix}$ in $\text{col}(A)$ ? Yes, since $3 = 1 + 2$ . Solution: $\vec{x} = \begin{bmatrix}1\\2\end{bmatrix}$ .
$A\vec{x} = \begin{bmatrix}1\\2\\4\end{bmatrix}$ : Is $\begin{bmatrix}1\\2\\4\end{bmatrix}$ in $\text{col}(A)$ ? No, since $4 \neq 1 + 2$ . No solution exists.

ℹ️ Row Space vs Column Space

The row space of $A$ is the column space of $A^T$ : $\text{row}(A) = \text{col}(A^T)$ .
$\text{rank}(A) = \text{rank}(A^T)$ — row rank always equals column rank.
The null space is orthogonal to the row space: $\ker(A) \perp \text{row}(A)$ .

Python Implementation

import numpy as np
from scipy.linalg import null_space, orth

def analyze_vector_space(A):
    """Comprehensive analysis of a matrix's vector space properties."""
    m, n = A.shape
    
    print(f"Matrix A ({m}x{n}):")
    print(A)
    print()
    
    # Rank
    rank = np.linalg.matrix_rank(A)
    print(f"Rank: {rank}")
    
    # Nullity
    nullity = n - rank
    print(f"Nullity: {nullity}")
    print(f"Rank + Nullity = {rank} + {nullity} = {rank + nullity} = n ✓")
    
    # Null space
    ns = null_space(A)
    print(f"\nNull space basis ({nullity} vectors):")
    print(ns) if ns.size > 0 else print("  {0}")
    
    # Column space
    cs = orth(A)
    print(f"\nColumn space basis ({rank} vectors):")
    print(cs)
    
    # Check if Ax = b has solutions
    b = np.array([1, 2, 3])
    try:
        x, residuals, rank_check, sv = np.linalg.lstsq(A, b, rcond=None)
        if np.allclose(A @ x, b):
            print(f"\nAx = b has solution: x = {x}")
        else:
            print(f"\nAx = b has no solution (b not in column space)")
    except:
        print(f"\nAx = b has no solution")
    
    return rank, nullity

# Example
A = np.array([[1, 2, 3],
              [4, 5, 6],
              [7, 8, 9]])

rank, nullity = analyze_vector_space(A)

print("\n" + "="*50)

# Finding a basis for the column space
def column_space_basis(A):
    """Find a basis for the column space using row reduction."""
    # Row reduce and find pivot columns
    U = np.array(A, dtype=float)
    pivot_cols = []
    row, col = 0, 0
    
    while row < U.shape[0] and col < U.shape[1]:
        # Find pivot
        max_row = np.argmax(np.abs(U[row:, col])) + row
        if U[max_row, col] == 0:
            col += 1
            continue
        
        # Swap rows
        U[[row, max_row]] = U[[max_row, row]]
        pivot_cols.append(col)
        
        # Eliminate below
        for i in range(row + 1, U.shape[0]):
            if U[i, col] != 0:
                U[i] -= U[i, col] / U[row, col] * U[row]
        
        row += 1
        col += 1
    
    return A[:, pivot_cols], pivot_cols

basis, pivots = column_space_basis(A)
print(f"Pivot columns: {pivots}")
print(f"Column space basis:")
print(basis)

# Linear independence check
def check_linear_independence(vectors):
    """Check if a set of vectors is linearly independent."""
    if vectors.shape[1] == 0:
        return True
    rank = np.linalg.matrix_rank(vectors)
    return rank == vectors.shape[1]

# Examples
v1 = np.array([[1], [0], [0]])
v2 = np.array([[0], [1], [0]])
v3 = np.array([[1], [1], [0]])
v4 = np.array([[1], [1], [1]])

print("\n{[1,0,0], [0,1,0]}:", check_linear_independence(np.hstack([v1, v2])))
print("{[1,0,0], [0,1,0], [1,1,0]}:", check_linear_independence(np.hstack([v1, v2, v3])))
print("{[1,0,0], [0,1,0], [1,1,1]}:", check_linear_independence(np.hstack([v1, v2, v4])))

# Change of basis
def change_of_basis(x_standard, P):
    """Change coordinates from standard basis to basis P."""
    P_inv = np.linalg.inv(P)
    return P_inv @ x_standard

# Vector in standard basis
x = np.array([3, 1])

# New basis: rotated 45 degrees
theta = np.pi / 4
P = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])

x_new = change_of_basis(x, P)
print(f"\nVector {x} in standard basis")
print(f"Coordinates in rotated basis: {x_new}")
print(f"Verification: {P @ x_new}")

Applications in AI/ML

Dimensionality Reduction

PCA (Principal Component Analysis): Finds the orthonormal basis of maximum variance. The first $k$ principal components form a basis for the $k$ -dimensional subspace that best approximates the data in the least-squares sense.
SVD (Singular Value Decomposition): Provides optimal low-rank approximation. Truncating to $k$ singular values gives the best rank- $k$ approximation.

Feature Spaces

Kernel methods: Map data to high-dimensional feature spaces where linear separation is possible. The basis of this feature space determines the kernel function.
Word embeddings: Words are represented as vectors in a learned basis. The dimension of the embedding space (e.g., 300 for Word2Vec) is the number of basis vectors.

Signal Processing

Fourier transform: Decomposes signals into frequency basis vectors.
Wavelet transform: Multi-resolution basis for time-frequency analysis.
Compressed sensing: Exploits sparsity in a given basis for signal recovery from few measurements.

Neural Networks

Weight matrices: Each layer's weight matrix transforms from one basis (input features) to another (hidden features). Learning is finding optimal bases.
Batch normalization: Normalizes activations to have zero mean and unit variance, effectively choosing a convenient basis for each layer.

# PCA as change of basis
from sklearn.decomposition import PCA
import numpy as np

# Generate sample data
np.random.seed(42)
n_samples = 200
X = np.random.randn(n_samples, 2) @ np.array([[3, 1], [1, 2]])

# PCA finds the optimal basis
pca = PCA(n_components=2)
X_transformed = pca.fit_transform(X)

print("Original data shape:", X.shape)
print("Transformed data shape:", X_transformed.shape)
print("Explained variance ratio:", pca.explained_variance_ratio_)
print("Principal components (new basis):")
print(pca.components_)  # These are the basis vectors

# The principal components form an orthonormal basis
dot_product = np.dot(pca.components_[0], pca.components_[1])
print(f"Dot product of components (should be 0): {dot_product:.10f}")

# Reconstruction: X ≈ X_transformed @ pca.components_
X_reconstructed = X_transformed @ pca.components_ + pca.mean_
reconstruction_error = np.mean((X - X_reconstructed) ** 2)
print(f"Reconstruction MSE: {reconstruction_error:.6f}")

Common Mistakes

Mistake	Correction
Confusing basis with spanning set	A basis must be both spanning AND linearly independent
Assuming the standard basis is the only basis	Any linearly independent set of $n$ vectors in $\mathbb{R}^n$ is a basis
Forgetting the zero vector is always dependent	Any set containing $\vec{0}$ is linearly dependent
Using $P^{-1}\vec{x}$ when $P$ has new basis as rows	Use columns: $P = [\vec{v_1} \mid \cdots \mid \vec{v_n}]$
Confusing column space with row space	$\text{col}(A) \subseteq \mathbb{R}^m$ , $\text{row}(A) \subseteq \mathbb{R}^n$ (different ambient spaces!)
Assuming rank = number of rows	Rank = number of pivots, not rows; a tall matrix can have rank < rows
Forgetting rank-nullity theorem	Always: $\text{rank}(A) + \text{nullity}(A) = n$ (number of columns)
Confusing linear independence with orthogonality	Orthogonal non-zero vectors are independent, but independent vectors need not be orthogonal
Assuming span of $k$ vectors in $\mathbb{R}^n$ has dimension $k$	If vectors are dependent, $\dim(\text{span}) < k$
Not checking if $b \in \text{col}(A)$ before solving $Ax = b$	First verify $b$ is in the column space; otherwise no solution exists

Interview Questions

Q1: What is the difference between a basis and a spanning set?

A spanning set generates the entire space but may contain redundant vectors. A basis is a spanning set with no redundancy — it is linearly independent. Every spanning set can be reduced to a basis by removing dependent vectors.

Q2: How do you find a basis for the null space of a matrix?

Solve $A\vec{x} = \vec{0}$ via row reduction. Express the solution in parametric vector form. The vectors multiplying each free variable form a basis for the null space.

Q3: Why is the dimension of $\mathbb{R}^n$ equal to $n$ ?

Because any basis for $\mathbb{R}^n$ must have exactly $n$ vectors. The standard basis $\{\vec{e_1}, \ldots, \vec{e_n}\}$ has $n$ vectors, and by the dimension theorem, all bases have the same size.

Q4: What is the relationship between rank, column space, and null space?

$\text{rank}(A) = \dim(\text{col}(A))$ = number of pivots.
$\text{nullity}(A) = \dim(\ker(A))$ = number of free variables.
By rank-nullity: $\text{rank}(A) + \text{nullity}(A) = n$ (columns).
The column space tells you which $\vec{b}$ allow solutions to $A\vec{x} = \vec{b}$ .
The null space tells you the solutions when they exist (particular + null space).

Q5: Can a matrix have the same row space and column space?

Yes. Example: $A = \begin{bmatrix}1&0\\0&1\end{bmatrix}$ (identity matrix). Row space = column space = $\mathbb{R}^2$ . But generally, row space $\subseteq \mathbb{R}^n$ and column space $\subseteq \mathbb{R}^m$ are in different ambient spaces unless $m = n$ .

Q6: How does change of basis relate to diagonalization?

If $A$ is diagonalizable, $A = PDP^{-1}$ where $D$ is diagonal and $P$ 's columns are eigenvectors. This means: in the eigenbasis, $A$ acts as scaling by eigenvalues. Change of basis to the eigenbasis simplifies matrix operations from $O(n^2)$ to $O(n)$ .

Q7: What is the rank of the product $AB$ ?

$\text{rank}(AB) \leq \min(\text{rank}(A), \text{rank}(B))$ . Equality holds when the column space of $B$ intersects the null space of $A$ only at $\vec{0}$ .

Practice Problems

📝Problem 1

Find a basis and the dimension of the subspace $W = \{(x, y, z) \in \mathbb{R}^3 : x + y + z = 0\}$ .

💡Solution

The equation $x + y + z = 0$ gives $x = -y - z$ . So:

W = \{(-y-z, y, z) : y, z \in \mathbb{R}\} = \text{span}\left\{\begin{bmatrix}-1\\1\\0\end{bmatrix}, \begin{bmatrix}-1\\0\\1\end{bmatrix}\right\}

These two vectors are linearly independent (not scalar multiples), so they form a basis.

$\dim(W) = 2$ .

📝Problem 2

Determine if $\left\{\begin{bmatrix}1\\2\\3\end{bmatrix}, \begin{bmatrix}4\\5\\6\end{bmatrix}, \begin{bmatrix}7\\8\\9\end{bmatrix}\right\}$ is a basis for $\mathbb{R}^3$ .

💡Solution

Form the matrix and row reduce:

\begin{bmatrix}1&4&7\\2&5&8\\3&6&9\end{bmatrix} \to \begin{bmatrix}1&4&7\\0&-3&-6\\0&-6&-12\end{bmatrix} \to \begin{bmatrix}1&4&7\\0&-3&-6\\0&0&0\end{bmatrix}

Only 2 pivots, so $\text{rank} = 2 < 3$ . The vectors are not a basis for $\mathbb{R}^3$ .

They are linearly dependent and span only a 2-dimensional subspace (a plane).

📝Problem 3

Find a basis for the column space of $A = \begin{bmatrix}1&2&3\\4&5&6\\7&8&9\end{bmatrix}$ and determine the dimension.

💡Solution

Row reduce $A$ :

\begin{bmatrix}1&2&3\\4&5&6\\7&8&9\end{bmatrix} \to \begin{bmatrix}1&2&3\\0&-3&-6\\0&0&0\end{bmatrix}

Pivot columns are columns 1 and 2. The corresponding columns of the original matrix form a basis:

\text{Basis} = \left\{\begin{bmatrix}1\\4\\7\end{bmatrix}, \begin{bmatrix}2\\5\\8\end{bmatrix}\right\}

$\dim(\text{col}(A)) = \text{rank}(A) = 2$ .

📝Problem 4

If $A$ is a $5 \times 3$ matrix with rank 2, what is the dimension of the null space?

💡Solution

By rank-nullity: $\text{rank}(A) + \text{nullity}(A) = n = 3$ .

$\text{nullity}(A) = 3 - 2 = 1$ .

The null space is a 1-dimensional subspace of $\mathbb{R}^3$ (a line through the origin).

📝Problem 5

Find a change-of-basis matrix from $B = \left\{\begin{bmatrix}1\\1\end{bmatrix}, \begin{bmatrix}0\\1\end{bmatrix}\right\}$ to the standard basis, and use it to find the coordinates of $\vec{x} = \begin{bmatrix}3\\5\end{bmatrix}$ in basis $B$ .

💡Solution

The change-of-basis matrix from $B$ to standard is:

P = \begin{bmatrix}1&0\\1&1\end{bmatrix}

To find $[\vec{x}]_B$ : solve $P[\vec{x}]_B = \vec{x}$ :

[\vec{x}]_B = P^{-1}\vec{x} = \begin{bmatrix}1&0\\-1&1\end{bmatrix}\begin{bmatrix}3\\5\end{bmatrix} = \begin{bmatrix}3\\2\end{bmatrix}

Verification: $3\begin{bmatrix}1\\1\end{bmatrix} + 2\begin{bmatrix}0\\1\end{bmatrix} = \begin{bmatrix}3\\5\end{bmatrix}$ ✓

Quick Reference

Concept	Definition	Key Formula
Linear combination	$c_1\vec{v_1} + \cdots + c_k\vec{v_k}$	Scalars multiply vectors, then add
Span	Set of all linear combinations	$\text{span}\{\vec{v_1}, \ldots, \vec{v_k}\}$
Linear independence	Only trivial solution to $\sum c_i\vec{v_i} = \vec{0}$	Row reduce; pivot in every column
Basis	Linearly independent spanning set	Minimal spanning set = maximal independent set
Dimension	Number of vectors in any basis	$\dim(\mathbb{R}^n) = n$
Rank	Dimension of column space	$\text{rank}(A) = \#$ pivots
Null space	Solutions to $A\vec{x} = \vec{0}$	$\ker(A)$ ; subspace of $\mathbb{R}^n$
Column space	Span of columns of $A$	$\text{col}(A)$ ; subspace of $\mathbb{R}^m$
Rank-nullity	Fundamental theorem	$\text{rank}(A) + \text{nullity}(A) = n$
Change of basis	$[\vec{x}]_{B'} = P^{-1}[\vec{x}]_B$	$P = [\vec{v_1} \mid \cdots \mid \vec{v_n}]$

Cross-References

Previous topic: 019 - Linear Algebra Span
Next topic: 021 - Linear Algebra Eigenvalues
Related concepts:
- Linear Transformations — basis determines matrix representation
- Matrix Multiplication — change of basis is matrix multiplication
- Determinant — $A$ is invertible iff $\det(A) \neq 0$ iff columns form a basis
- Orthogonal Basis — orthonormal basis simplifies computations
- SVD — provides optimal basis for low-rank approximation