2025-09-17 Projections, rotations, and reflections#

Last time#

  • Algebra of linear transformations

  • Polynomial evaluation and fitting

Today#

  • Orthogonality

  • Projections

  • Rotations

  • Reflections

  • Oh my!

using LinearAlgebra
using Plots
using Polynomials
default(linewidth=4, legendfontsize=12)

function vander(x, k=nothing)
    if isnothing(k)
        k = length(x)
    end
    m = length(x)
    V = ones(m, k)
    for j in 2:k
        V[:, j] = V[:, j-1] .* x
    end
    V
end
vander (generic function with 2 methods)

Matrices as linear transformations#

Linear algebra is the study of linear transformations on vectors, which represent points in a finite-dimensional space. The matrix-vector product \(y = A x\) is a linear combination of the columns of \(A\). The familiar definition,

\[ y_i = \sum_j A_{i,j} x_j \]

can also be viewed as

\[\begin{split} y = \Bigg[ A_{:,1} \Bigg| A_{:,2} \Bigg| \dotsm \Bigg] \begin{bmatrix} x_1 \\ x_2 \\ \vdots \end{bmatrix} = \Bigg[ A_{:,1} \Bigg] x_1 + \Bigg[ A_{:,2} \Bigg] x_2 + \dotsb . \end{split}\]
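
As a quick sanity check of the column view, the two expressions below agree (A1 and x1 form a small example chosen here just for illustration):

A1 = [1.0 2; 3 4; 5 6]
x1 = [10.0, 1.0]
@show A1 * x1 ≈ A1[:, 1] * x1[1] + A1[:, 2] * x1[2];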

Some common terminology#

  • The range of \(A\) is the space spanned by its columns. This definition coincides with the range of a function \(f(x)\) when \(f(x) = A x\).

  • The (right) nullspace of \(A\) is the space of vectors \(x\) such that \(A x = 0\).

  • The rank of \(A\) is the dimension of its range.

  • A matrix has full rank if the nullspace of either \(A\) or \(A^T\) is trivial (contains only the zero vector). Equivalently, all the columns of \(A\) (or \(A^T\)) are linearly independent; see the quick check after this list.

  • A nonsingular (or invertible) matrix is a square matrix of full rank. We call the inverse \(A^{-1}\) and it satisfies \(A^{-1} A = A A^{-1} = I\).
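
A short sketch of these definitions using rank, nullspace, and det from LinearAlgebra, on a deliberately singular matrix (C below repeats a column and is chosen only for illustration):

C = [1.0 2 2; 3 4 4; 5 6 6]   # third column repeats the second
@show rank(C)                 # 2: the range is a plane, not all of R^3
@show nullspace(C)            # basis for the nullspace, spanned by [0, 1, -1] up to scaling
@show det(C);                 # 0 (up to roundoff), so C is singular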

\(\DeclareMathOperator{\rank}{rank} \DeclareMathOperator{\null}{null} \) If \(A \in \mathbb{R}^{m\times m}\), which of these doesn’t belong?

  1. \(A\) has an inverse \(A^{-1}\)

  2. \(\rank (A) = m\)

  3. \(\null(A) = \{0\}\)

  4. \(A A^T = A^T A\)

  5. \(\det(A) \ne 0\)

  6. \(A x = 0\) implies that \(x = 0\)

A = rand(4, 4)
B = A' * A - A * A'  # zero only when A commutes with its transpose
@show B
det(A)               # nonzero, so A is invertible
B = [0.502631546599452 0.5349191509666251 -0.10930764777479052 -0.33507167452041764; 0.5349191509666251 0.8664029595304591 0.10050328466893244 -0.09910835851819455; -0.10930764777479052 0.10050328466893244 -0.7916183577788991 -0.396206667834871; -0.33507167452041764 -0.09910835851819455 -0.396206667834871 -0.5774161483510118]
0.014613472451362998

What is an inverse?#

When we write \(x = A^{-1} y\), we mean that \(x\) is the unique vector such that \(A x = y\). (It is rare that we explicitly compute a matrix \(A^{-1}\), though it’s not as “bad” as people may have told you.) A vector \(y\) is equivalent to \(\sum_i e_i y_i\) where \(e_i\) are columns of the identity. Meanwhile, \(x = A^{-1} y\) means that we are expressing that same vector \(y\) in the basis of the columns of \(A\), i.e., \(\sum_i A_{:,i} x_i\).

using LinearAlgebra
A = rand(4, 4)
4×4 Matrix{Float64}:
 0.679461  0.85377    0.724781   0.76951
 0.750986  0.0441536  0.102605   0.717493
 0.460786  0.460758   0.0456096  0.284741
 0.332401  0.660751   0.398993   0.163007
A \ A
4×4 Matrix{Float64}:
  1.0           0.0   1.04587e-17  -1.24438e-16
  1.68969e-16   1.0   1.36421e-16   1.48495e-16
 -2.44552e-16  -0.0   1.0          -2.35153e-16
  1.41529e-16  -0.0  -0.0           1.0
inv(A) * A
4×4 Matrix{Float64}:
  1.0          1.98738e-16   5.58249e-16  -6.49513e-16
  2.50367e-16  1.0          -9.27867e-18   3.51728e-16
 -4.52051e-17  6.77929e-17   1.0          -1.77827e-16
 -3.1629e-16   3.79973e-16  -5.01748e-16   1.0
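
To connect back to the change-of-basis interpretation, here is a small sketch reusing the A above (y4 and x4 are illustrative names): solving with the backslash operator gives the coefficients of \(y\) in the column basis of \(A\).

y4 = rand(4)
x4 = A \ y4          # coefficients of y4 in the basis of the columns of A
@show A * x4 ≈ y4;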

Inner products and orthogonality#

The inner product

\[ x^T y = \sum_i x_i y_i \]
of vectors (or columns of a matrix) tells us about their magnitudes and the angle between them. The norm is induced by the inner product,
\[ \lVert x \rVert = \sqrt{x^T x} \]
and the angle \(\theta\) is defined by
\[ \cos \theta = \frac{x^T y}{\lVert x \rVert \, \lVert y \rVert} . \]
Inner products are bilinear, which means that they satisfy some convenient algebraic properties
\[\begin{split} \begin{split} (x + y)^T z &= x^T z + y^T z \\ x^T (y + z) &= x^T y + x^T z \\ (\alpha x)^T (\beta y) &= \alpha \beta x^T y \\ \end{split} . \end{split}\]
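
A minimal numeric check of these identities (u, w, z, α, and β below are arbitrary values chosen for illustration):

u = rand(3); w = rand(3); z = rand(3)
α, β = 2.0, -0.5
@show (u + w)' * z ≈ u' * z + w' * z          # additivity
@show (α * u)' * (β * w) ≈ α * β * (u' * w);  # homogeneity in each argument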

Examples with inner products#

x = [0, 1]
y = [1, 2]
@show x' * y
@show y' * x;
x' * y = 2
y' * x = 2
ϕ = pi/6
y = [cos(ϕ), sin(ϕ)]
@show x' * y
cos_θ = x'*y / (norm(x) * norm(y))
@show cos_θ
@show cos(ϕ-pi/2);
acosd(cos_θ)
x' * y = 0.49999999999999994
cos_θ = 0.49999999999999994
cos(ϕ - pi / 2) = 0.4999999999999999
60.00000000000001

Polynomials can be orthogonal too!#

x = LinRange(-1, 1, 50)
A = vander(x, 4)
M = A * [.5 0 0 0; # 0.5
         0  1 0 0;  # x
         0  0 1 0]' # x^2
scatter(x, M, legend=:none)
plot!(x, 0*x, label=:none, color=:black)
  • Which inner product will be zero?

    • Which functions are even and odd?

Polynomial inner products#

M[:,1]' * M[:,2]
-2.220446049250313e-16
acosd(M[:,1]' * M[:,3] / (norm(M[:,1]) * norm(M[:,3])))
41.79321606483336
M[:,2]' * M[:,3]
-4.440892098500626e-16

Geometry of Linear Algebra#

We’d like to develop intuition about transformations on vectors. We’ll start with a peanut-shaped cluster of points and see how they transform.

default(aspect_ratio=:equal)

function peanut()
    theta = LinRange(0, 2*pi, 50)
    r = 1 .+ .4*sin.(3*theta) + .6*sin.(2*theta)
    x = r .* cos.(theta)
    y = r .* sin.(theta)
    x, y
end
x, y = peanut()
scatter(x, y)

We’ll group all these points into a \(2\times n\) matrix \(X\). Note that multiplication by any matrix \(A\) is applied to each column separately, i.e.,

\[ A \underbrace{\Bigg[ \mathbf x_1 \Bigg| \mathbf x_2 \Bigg| \dotsb \Bigg]}_X = \underbrace{\Bigg[ A \mathbf x_1 \Bigg| A \mathbf x_2 \Bigg| \dotsb \Bigg]}_{AX} \]
X = [x y]'
size(X)
(2, 50)
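
We can spot-check the columnwise action with an arbitrary \(2\times 2\) matrix (the name T is used here just for illustration):

T = rand(2, 2)
@show T * X ≈ hcat([T * X[:, j] for j in 1:size(X, 2)]...);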

Inner products and projections#

Consider the operation \(f(x) = v (v^T x)\) for some vector \(v\). We’ll apply this same operation to every point in the original cluster \(X\).

function tplot(X, Y)
    "Plot a transformation from X to Y"
    scatter(X[1,:], X[2,:], label="X")
    scatter!(Y[1,:], Y[2,:], label="Y")
end

v = [1, 0]
tplot(X, v * (v' * X))

Are the parentheses important? Why or why not?

\[v (v^T X) \overset?= v v^T X\]
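
The grouping does not change the result, since matrix multiplication is associative, but it does change the work: \(v^T X\) is a single row of inner products, while \(v v^T\) first forms a \(2\times 2\) outer-product matrix. A quick check with the current \(v\):

@show v * (v' * X) ≈ (v * v') * X;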

Questions#

v = [1, 1] / sqrt(2)

# What happens if v is not normalized?
# Discuss with your group and make a prediction before uncommenting and running the line below.
tplot(X,  v * v' * X)
# Vector transformations are "nice" if norm(v) = 1

theta = pi * .7
v = [cos(theta), sin(theta)]
@show norm(v)
tplot(X, v * v' * X)
norm(v) = 0.9999999999999999

Discuss#

Discuss with your neighbor what the vector \(v\) looks like in these diagrams.

  • What is the null space of each transformation?

  • What is the range of each transformation?

  • For each figure, what happens if you apply the transformation \(v v^T\) to the red points?

  • What about \(I - v v^T\)?
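
One way to check the last two bullets numerically, as a sketch using the most recent \(v\) and the projected points \(Y = v v^T X\):

P = v * v'
Y = P * X
@show P * Y ≈ Y            # the projected points are already in the range, so P leaves them fixed
@show norm((I - P) * Y);   # ≈ 0: I - v*v' annihilates anything in the range of v*v'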

Givens Rotation#

We can rotate the input using a \(2\times 2\) matrix, parametrized by \(\theta\).

function givens(theta)
    s = sin(theta)
    c = cos(theta)
    A = [c -s; s c]
    A
end

givens(0)
2×2 Matrix{Float64}:
 1.0  -0.0
 0.0   1.0
givens(1*π)
2×2 Matrix{Float64}:
 -1.0          -1.22465e-16
  1.22465e-16  -1.0
tplot(X, givens(.8) * X)
  • What is the null space and range of the Givens matrix?

  • Is the Givens matrix orthogonal? What does that mean?
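
For the second question, recall that a matrix \(Q\) is orthogonal when \(Q^T Q = I\), which means it preserves lengths and inner products. A quick numerical check:

Q = givens(0.8)
@show Q' * Q ≈ [1 0; 0 1]               # columns are orthonormal
@show norm(Q * [3, 4]) ≈ norm([3, 4]);  # rotation preserves length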

Reflection#

Using a construct of the form \(I - \alpha v v^T\), create a reflection across the plane defined by the unit normal vector \(v = [\cos\theta, \sin\theta]\). Try different values of \(\alpha\) and convince yourself of why it works, or sketch a diagram of the projections \(v v^T\) and \(I - v v^T\) and decide how to modify one of them to make a reflection.

function reflect(theta)
    v = [cos(theta), sin(theta)]
    I - 2 * v * v'  # α = 2: subtracting the projection twice reflects through the plane
end

tplot(X, reflect(pi/2) * X)
using Test
@test mapslices(norm, X; dims=1) ≈ mapslices(norm, reflect(0.3) * X; dims=1)
@test X ≈ reflect(0.5) * reflect(0.5) * X
Test Passed
  Expression: X ≈ reflect(0.5) * reflect(0.5) * X
   Evaluated: [1.0 1.291607690975099 … 0.6919723366713914 0.9999999999999993; 0.0 0.16653437041080127 … -0.08921995295821092 -2.449293598294705e-16] ≈ [1.0 1.291607690975099 … 0.6919723366713914 0.9999999999999993; 0.0 0.16653437041080127 … -0.08921995295821092 -2.449293598294705e-16]
  • Where is the vector \(v\) on the figure above?

  • Does the reflection matrix change if you replace \( v(\theta)\) with \(v(\theta + \pi)\)? Why or why not?

  • What happens if you reflect twice?

  • What is the null space and range of the reflection matrix?

  • Is the reflection matrix orthogonal?
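
If you want to check your answers numerically, one possible sketch using the reflection defined above:

F = reflect(0.3)
@show F' * F ≈ [1 0; 0 1]   # orthogonal, like the rotation
@show F ≈ F'                # also symmetric, so together with F * F ≈ I it is its own inverse
@show det(F);               # ≈ -1 for a reflection (a rotation has determinant +1)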