12 Maps and Structure

A matrix is more than a grid of numbers — it is a linear map, a function that transforms vectors while preserving the rules of addition and scaling. This chapter explores what matrices do geometrically and introduces the structural concepts — subspaces, rank, and the four fundamental subspaces — that reveal the full picture of a linear map.

(ns lalinea-book.maps-and-structure
  (:require
   ;; La Linea (https://github.com/scicloj/lalinea):
   [scicloj.lalinea.linalg :as la]
   [scicloj.lalinea.tensor :as t]
   [scicloj.lalinea.elementwise :as el]
   ;; Dataset manipulation (https://scicloj.github.io/tablecloth/):
   [tablecloth.api :as tc]
   ;; Interactive Plotly charts (https://scicloj.github.io/tableplot/):
   [scicloj.tableplot.v1.plotly :as plotly]
   ;; Visualization annotations (https://scicloj.github.io/kindly-noted/):
   [scicloj.kindly.v4.kind :as kind]
   ;; Arrow diagrams for 2D vectors:
   [scicloj.lalinea.vis :as vis]
   [clojure.math :as math]))

We use the same example vectors as in the previous chapter:

(def u (t/column [3 1]))

(def v (t/column [1 2]))

Linear maps

Matrices as transformations

A linear map (or linear transformation) is a function between vector spaces that respects the structure:

For all vectors \(\mathbf{u}, \mathbf{v}\): \(T(\mathbf{u} + \mathbf{v}) = T(\mathbf{u}) + T(\mathbf{v})\)
For every scalar \(\alpha\) and vector \(\mathbf{v}\): \(T(\alpha \mathbf{v}) = \alpha\, T(\mathbf{v})\)

Informally: the map commutes with addition and scaling. This definition applies to any vector spaces, not just \(\mathbb{R}^n\). Familiar examples from calculus are linear maps:

Differentiation: \(D(\alpha f + \beta g) = \alpha f' + \beta g'\)
Integration: \(\int_0^x (\alpha f + \beta g) = \alpha \int_0^x f + \beta \int_0^x g\)

These operate on infinite-dimensional function spaces and cannot be represented as finite matrices.

For finite-dimensional spaces, there is a concrete representation:

Every linear map from \(\mathbb{R}^n\) to \(\mathbb{R}^m\) can be written as multiplication by an \(m \times n\) matrix.

This is why matrices are central to computational linear algebra — they are the universal representation of linear maps between finite-dimensional spaces.

Matrix–vector multiplication

Concretely, applying a linear map means matrix–vector multiplication: each entry of the output is the dot product of a row of \(A\) with the input vector \(\mathbf{x}\):

\[(A\mathbf{x})_i = \sum_j A_{ij}\, x_j\]

Equivalently, \(A\mathbf{x}\) is a linear combination of the columns of \(A\), weighted by the entries of \(\mathbf{x}\). This column view is often the more illuminating one — it says “the output lives in the span of \(A\)’s columns.” In La Linea, la/mmul is the concrete realisation of applying a linear map (represented as a matrix) to a vector.

Example: rotation

Rotation by 90° counter-clockwise:

(def R90
  (t/matrix [[0 -1]
             [1  0]]))

It maps the x-axis to the y-axis:

(la/mmul R90 (t/column [1 0]))

#la/R [:float64 [2 1]
       [[0.000]
        [1.000]]]

And the y-axis to the negative x-axis:

(la/mmul R90 (t/column [0 1]))

#la/R [:float64 [2 1]
       [[-1.000]
        [ 0.000]]]

Visualising the effect on our vectors — each arrow rotates 90° counter-clockwise:

(vis/arrow-plot [{:label "u" :xy [3 1] :color "#2266cc"}
                 {:label "Ru" :xy [-1 3] :color "#2266cc" :dashed? true}
                 {:label "v" :xy [1 2] :color "#cc4422"}
                 {:label "Rv" :xy [-2 1] :color "#cc4422" :dashed? true}]
                {})

Let us verify the linearity properties.

Additivity: \(R(\mathbf{u} + \mathbf{v}) = R(\mathbf{u}) + R(\mathbf{v})\):

(la/close? (la/mmul R90 (el/+ u v))
           (el/+ (la/mmul R90 u) (la/mmul R90 v)))

true

Homogeneity: \(R(3\mathbf{u}) = 3 R(\mathbf{u})\):

(la/close? (la/mmul R90 (el/scale u 3.0))
           (el/scale (la/mmul R90 u) 3.0))

true

Example: stretching

(def stretch-mat
  (t/matrix [[3 0]
             [0 1]]))

Stretches the x-direction by 3, leaves y unchanged. Let us see what it does to a set of points on the unit circle:

(let [angles (el/* (/ (* 2.0 math/PI) 40.0) (t/make-reader :float64 41 idx))
      circle-x (el/cos angles)
      circle-y (el/sin angles)
      data-mat (t/hstack [(t/column circle-x) (t/column circle-y)])
      stretched-mat (la/transpose (la/mmul stretch-mat (la/transpose data-mat)))]
  (-> (tc/dataset {:x (t/select stretched-mat :all 0)
                   :y (t/select stretched-mat :all 1)
                   :shape (repeat 41 "stretched")})
      (tc/concat (tc/dataset {:x circle-x
                              :y circle-y
                              :shape (repeat 41 "original")}))
      (plotly/base {:=x :x :=y :y :=color :shape})
      (plotly/layer-line)
      plotly/plot))

The unit circle becomes an ellipse — the matrix stretches one axis more than the other.

Example: projection

Not all linear maps are invertible. A projection collapses one dimension, losing information:

(def proj-xy
  (t/matrix [[1 0 0]
             [0 1 0]
             [0 0 0]]))

This projects 3D space onto the \(xy\)-plane by zeroing out \(z\):

(la/mmul proj-xy (t/column [5 3 7]))

#la/R [:float64 [3 1]
       [[5.000]
        [3.000]
        [0.000]]]

The \(z\)-component is gone and cannot be recovered. The matrix is not invertible — its determinant is zero:

(la/det proj-xy)

0.0

Example: shear

A shear slides one direction proportionally to another. It changes shape but preserves area (determinant = 1):

(def shear-mat
  (t/matrix [[1 2]
             [0 1]]))

(la/det shear-mat)

1.0

The shear fixes \(\mathbf{e}_1\) but slides \(\mathbf{e}_2\) sideways — it tilts the vertical axis:

(vis/arrow-plot [{:label "e₁" :xy [1 0] :color "#2266cc"}
                 {:label "e₂" :xy [0 1] :color "#cc4422"}
                 {:label "Se₁" :xy [1 0] :color "#2266cc" :dashed? true}
                 {:label "Se₂" :xy [2 1] :color "#cc4422" :dashed? true}]
                {})

Composition of maps

Applying one linear map and then another is the same as multiplying their matrices. The order matters — matrix multiplication is not commutative in general.

Rotate then stretch \(\neq\) stretch then rotate:

(def AB (la/mmul stretch-mat R90))

(def BA (la/mmul R90 stretch-mat))

(la/norm (el/- AB BA))

2.8284271247461903

Applying both orderings to \(\mathbf{e}_1 = [1,0]^T\) shows different results:

(vis/arrow-plot [{:label "e₁" :xy [1 0] :color "#999999"}
                 {:label "R then S" :xy [0 1] :color "#2266cc"}
                 {:label "S then R" :xy [0 3] :color "#cc4422"}]
                {:width 600})

The result depends on the order — just like “put on socks, then shoes” is different from “put on shoes, then socks.”

Subspaces

What is a subspace?

A subspace is a subset of a vector space that is itself a vector space — it is closed under addition and scaling. Every subspace must contain the zero vector.

Examples in \(\mathbb{R}^3\):

The zero vector alone (dimension 0)
Any line through the origin (dimension 1)
Any plane through the origin (dimension 2)
All of \(\mathbb{R}^3\) (dimension 3)

These are the only subspaces of \(\mathbb{R}^3\). Notably, a line or plane that does not pass through the origin is not a subspace.

Column space and null space

Every matrix \(A\) defines two important subspaces:

The column space (or range) \(\text{Col}(A)\): the set of all vectors \(\mathbf{b}\) such that \(A\mathbf{x} = \mathbf{b}\) has a solution. It is the span of \(A\)’s columns — the set of outputs the map can produce.
The null space (or kernel) \(\text{Null}(A)\): the set of all vectors \(\mathbf{x}\) with \(A\mathbf{x} = \mathbf{0}\). These are the inputs that the map annihilates.

Consider a matrix whose third column equals the sum of the first two:

(def M
  (t/matrix [[1 2 3]
             [4 5 9]
             [7 8 15]]))

Since column 3 = column 1 + column 2, the vector \([1, 1, -1]^T\) is in the null space:

(la/mmul M (t/column [1 1 -1]))

#la/R [:float64 [3 1]
       [[0.000]
        [0.000]
        [0.000]]]

Think of it this way: if we walk 1 unit along column 1, 1 unit along column 2, and \(-1\) unit along column 3, we get back to the origin — because column 3 is redundant.

The null space is not just \(\{[1,1,-1]^T\}\) but all its scalar multiples. Any scalar times a null space vector is again a null space vector (that is the subspace property):

(la/mmul M (el/scale (t/column [1 1 -1]) 7.0))

#la/R [:float64 [3 1]
       [[0.000]
        [0.000]
        [0.000]]]

Rank and the rank-nullity theorem

Rank

The rank of a matrix is the dimension of its column space — the number of genuinely independent columns.

The SVD (covered in the Eigenvalues and Decompositions chapter) reveals the rank: it equals the number of non-zero singular values.

(def sv-M (:S (la/svd M)))

sv-M

#la/R [:float64 [3] [21.76 0.5848 4.345E-17]]

(def rank-M (la/rank M))

rank-M

Although \(M\) is \(3 \times 3\), it has rank 2 — only two of its three columns carry independent information.

Nullity

The nullity is the dimension of the null space — the number of independent directions collapsed to zero.

(def nullity-M (- (second (t/shape M)) rank-M))

nullity-M

The rank-nullity theorem

\[\text{rank}(A) + \text{nullity}(A) = n\]

where \(n\) is the number of columns. The rank counts the dimensions “used” by the map, and the nullity counts the dimensions “collapsed” to zero. Together they account for all \(n\) input dimensions.

(= (+ rank-M nullity-M)
   (second (t/shape M)))

true

We can extract the null space from the SVD. The columns of \(V\) corresponding to zero singular values span it:

(def null-basis (la/null-space M))

null-basis

#la/R [:float64 [3 1]
       [[ 0.5774]
        [ 0.5774]
        [-0.5774]]]

Verify it lies in the null space:

(la/norm (la/mmul M null-basis))

8.881784197001252E-16

What rank tells you about \(A\mathbf{x} = \mathbf{b}\)

Full column rank (\(r = n\)): the system has at most one solution for any \(\mathbf{b}\) (the null space is trivial)
Full row rank (\(r = m\)): the system has at least one solution for every \(\mathbf{b}\) (the column space is all of \(\mathbb{R}^m\))
Full rank (\(r = m = n\)): exactly one solution — the matrix is invertible

Full-rank square matrix:

(def A-full (t/matrix [[2 1] [1 3]]))

(la/rank A-full)

Invertible — unique solution for any right-hand side:

(la/solve A-full (t/column [5 7]))

#la/R [:float64 [2 1]
       [[1.600]
        [1.800]]]

Rank-deficient matrix — la/solve returns nil:

(la/solve M (t/column [1 2 3]))

nil

The four fundamental subspaces

Every \(m \times n\) matrix \(A\) defines not two but four subspaces:

Subspace	Dimension	Lives in
Column space \(\text{Col}(A)\)	\(r\)	\(\mathbb{R}^m\)
Left null space \(\text{Null}(A^T)\)	\(m - r\)	\(\mathbb{R}^m\)
Row space \(\text{Row}(A) = \text{Col}(A^T)\)	\(r\)	\(\mathbb{R}^n\)
Null space \(\text{Null}(A)\)	\(n - r\)	\(\mathbb{R}^n\)

In each ambient space, the two subspaces are complementary — every vector can be written uniquely as a sum of one vector from each subspace, so their dimensions add up to the dimension of the ambient space.

Let us compute all four for our rank-2 matrix \(M\):

Column space — the columns of \(U\) corresponding to non-zero singular values:

(def col-space-basis (la/col-space M))

col-space-basis

#la/R [:float64 [3 2]
       [[0.1702 -0.8969]
        [0.5075 -0.2754]
        [0.8447  0.3462]]]

Left null space — the remaining columns of \(U\):

(def svd-M (la/svd M))

(def left-null-basis
  (let [r (la/rank M)
        U (:U svd-M)]
    (t/submatrix U :all (range r (first (t/shape M))))))

left-null-basis

#la/R [:float64 [3 1]
       [[ 0.4082]
        [-0.8165]
        [ 0.4082]]]

Row space — the columns of \(V\) corresponding to non-zero singular values:

(def row-space-basis
  (let [r (la/rank M)
        Vt (:Vt svd-M)]
    (la/transpose (t/submatrix Vt (range r) :all))))

row-space-basis

#la/R [:float64 [3 2]
       [[0.3728  0.7264]
        [0.4427 -0.6860]
        [0.8155 0.04039]]]

Null space — already computed above as null-basis.

Let us verify the dimensions match the table:

{:col-space (second (t/shape col-space-basis))
 :left-null (second (t/shape left-null-basis))
 :row-space (second (t/shape row-space-basis))
 :null-space (second (t/shape null-basis))}

{:col-space 2, :left-null 1, :row-space 2, :null-space 1}

Verify each subspace: the left null space satisfies \(M^T v = 0\):

(< (la/norm (la/mmul (la/transpose M) left-null-basis)) 1e-10)

true

Column space and left null space are orthogonal complements:

(< (la/norm (la/mmul (la/transpose col-space-basis) left-null-basis)) 1e-10)

true

Row space and null space are orthogonal complements:

(< (la/norm (la/mmul (la/transpose row-space-basis) null-basis)) 1e-10)

true

rank = 2, nullity = 1, and the dimensions add up:

In \(\mathbb{R}^3\) (output): col space (2) + left null (1) = 3
In \(\mathbb{R}^3\) (input): row space (2) + null space (1) = 3

Every vector in \(\mathbb{R}^3\) splits uniquely into a row-space part and a null-space part (input side), or a column-space part and a left-null-space part (output side). The matrix \(M\) maps the row space onto the column space and annihilates the null space.

When we have a notion of angles (as we do in \(\mathbb{R}^n\) spaces), these complementary pairs turn out to be perpendicular. The next chapter introduces inner products, which make this precise.

source: notebooks/lalinea_book/maps_and_structure.clj