
Linear Algebra Ch1: Vectors and Vector Spaces

Author: 走歪的工程師James

Click here for the Chinese version

This series is the culmination of the last 4 months I spent studying linear algebra. I intend to make a video series on linear algebra, and this text version will serve as a stepping stone: it lets me organize the logic and content to include in the videos, so that I can later focus entirely on crafting the animations without splitting my attention.

Most of the content is extracted from Linear Algebra Done Right and rephrased in my own words. The numbering of all the definitions and theorems matches the original text so that the reader can refer back to it.

The purpose of this series is to provide an easier starting point for beginning learners, not to replace the original text or serve as a standalone resource for learning. I encourage motivated readers to read the original text for deeper understanding and insights.

The original text provides many exercises for readers to complete. Maybe too many! And I think doing exercises is only effective up to a certain point. The main goal is to understand the key concepts. You will forget a lot of the details after you are done with the book anyway. Hence, I will pick a selection of the problems for readers to complete for optimal efficiency.

Math cannot be learnt without actually writing it down on paper and thinking it through step by step. So I highly recommend doing these exercises. Refer to the original text or use help from AI to facilitate your learning. They will definitely be of great help!

1.A: Vectors

Before taking linear algebra, most of us have seen a lot of vectors. In high school math, what we regarded as vectors were arrows or points on the 2D plane, such as \left[\begin{array}{r}2 \\ 1 \end{array}\right].

We've also seen arrows and points in 3D space, like \left[\begin{array}{r}2 \\ 1 \\ -2 \end{array}\right]. But vectors are not limited to 2- or 3-dimensional spaces. We can extend this concept to n-dimensional spaces.

\left[\begin{array}{r}x_1 \\ x_2 \\ \vdots \\ x_n \end{array}\right]

Up to this point, we have denoted vectors by vertical n \times 1 matrices. Conventionally, we call vertical n \times 1 matrices vectors, not horizontal 1 \times n matrices.

Horizontal matrices such as \left[\begin{array}{rr}2 & 1\end{array}\right] are called row vectors.

Another way to denote (column) vectors is with parentheses. For example, both of these notations denote the same vector:

\left[\begin{array}{r}-1.1 \\ 0.0 \\ 3.6 \\ -7.2\end{array}\right] = (-1.1, 0.0, 3.6, -7.2)

More generally,

\left[\begin{array}{r}x_1 \\ x_2 \\ \vdots \\ x_n\end{array}\right] = (x_1, x_2, \ldots, x_n)
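Throughout this series I'll occasionally sketch ideas in code. Here's a minimal example (in Python with NumPy, purely my choice for illustration, not something the book uses) showing the two views of the same vector:

```python
import numpy as np

# The 4-vector (-1.1, 0.0, 3.6, -7.2) as a 1-D array
v = np.array([-1.1, 0.0, 3.6, -7.2])

# The same data viewed as a 4 x 1 column matrix
col = v.reshape(-1, 1)

print(v.shape)    # (4,)   -> the tuple notation (x_1, ..., x_n)
print(col.shape)  # (4, 1) -> the vertical n x 1 matrix notation
```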

Besides arrows or points in spaces, vectors can take on many physical meanings:

Color Codes

Colors can be expressed by RGB vectors, with one entry each for the red, green, and blue components. (figure: example colors and their RGB values)

Asset Allocation

An asset allocation across four assets can be represented by (100, 50, 25, 10).

Stock Returns

A vector can represent the daily returns of a stock. (-0.022, +0.014, +0.004), for example, can represent a stock that went down 2.2% on the first day, up 1.4% the next day, and up again 0.4% on the third day.

Cash Flow

Cash flow can be represented by a vector as well. Suppose each entry gives the cash flow in a quarter. Then (1000, -10, -10, -1010) represents a 1-year loan of $1000 with 1% quarterly interest payments, and the principal plus the last interest payment due at the end.

Images

(figure: a grayscale image)

Above is a 256 \times 256 image. Flattening it gives us a 65536-vector that represents the image:

(0.792, 0.788, 0.819, 0.792, 0.795, \dots, 0.803, 0.803, 0.807, 0.780, 0.835)

Videos

A grayscale video composed of K frames, each M \times N, can be represented by a KMN-vector.
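As a quick sketch of the flattening idea (a minimal NumPy illustration with made-up sizes, not taken from the book):

```python
import numpy as np

# A stand-in 256 x 256 grayscale image with random pixel intensities
image = np.random.rand(256, 256)
x = image.flatten()
print(x.shape)  # (65536,) -> the image as a 65536-vector

# A stand-in video: K frames, each M x N, flattens to a KMN-vector
K, M, N = 30, 64, 48
video = np.random.rand(K, M, N)
print(video.flatten().shape)  # (92160,) == K * M * N
```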

Other Examples

There are many other examples I can't fit into this article. Here are some of them. Try to think of a way to represent these data with vectors:

  • quantities of products held
  • values across a population (e.g. blood pressure)
  • probabilities (e.g. coin flip)
  • time series (temperature measurements)
  • customer purchases
  • features or attributes of an entity
  • frequencies (e.g. word count)

Notations

One of the biggest mistakes I've made with math is the mindset of "as long as it's understandable" and not appreciating the importance of writing math in a neat, organized fashion. I used to write down just a pile of numbers to show the process of my calculation. It was through learning how to write proofs that I understood how to use language and words to explain and illustrate what you are doing with the numbers and notation. Knowing how to do this will make your math much more readable and understandable.

Knowing the notations well is necessary in order to translate your thoughts into clear and rigorous expressions.

Think of code: using your notation well is equivalent to naming your variables well, and explaining each step in your reasoning is equivalent to writing readable code with meaningful commit messages and comments.

Now we will look at notations that you will often see and use in linear algebra. It is crucial to familiarize yourself with them.

zero vectors

For convenience, we have cleaner notations for zero vectors. For example:

0_3 = (0, 0, 0)

We often leave out the subscript as well. In this example, the dimensionality of 0 is implied by the context. So we can write:

(3,4,1) + 0 = (3,4,1) + (0, 0, 0)

unit vectors

The i-th unit vector refers to the vector that has 1 in the i-th position and 0 everywhere else.

e_1=\left[\begin{array}{l}1 \\ 0 \\ 0\end{array}\right], \quad e_2=\left[\begin{array}{l}0 \\ 1 \\ 0\end{array}\right], \quad e_3=\left[\begin{array}{l}0 \\ 0 \\ 1\end{array}\right]

In other words:

\left(e_i\right)_j= \begin{cases}1 & j=i \\ 0 & j \neq i\end{cases}

Here, e_i is a vector, and (e_i)_j is a number (the j-th element).

ones vectors

We use the notation \mathbf{1}_n for the n-vector with all its elements equal to 1. As with zero vectors, the size is usually determined from the context, in which case we just write \mathbf{1}.
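Here's how these special vectors look in code (a NumPy sketch of my own, for illustration):

```python
import numpy as np

n = 3

zero = np.zeros(n)  # the zero vector 0_3 = (0, 0, 0)
ones = np.ones(n)   # the ones vector 1_3 = (1, 1, 1)

# The unit vectors e_1, ..., e_n are the columns of the identity matrix
e = [np.eye(n)[:, i] for i in range(n)]
print(e[0])  # [1. 0. 0.] -> e_1

# (e_i)_j is 1 when j == i and 0 otherwise
# (note: Python indices start at 0, unlike the 1-indexed math notation)
print(e[0][0], e[0][2])  # 1.0 0.0
```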

Definition of Vector and Notations

Lists

Having looked at various examples, we've familiarized ourselves with vectors. However, we have not defined what a vector truly is. Before we do that, let us define lists.

Definition 1.8: list

  • Suppose n is a non-negative integer. A list of length n is an ordered collection of n elements. (An element could be a number, a list, or another object.)
  • Two lists are equal if they are equal in length and have the same elements in the same order.

Some examples of valid lists:

  • (1, 2)
  • ()
  • ((1, 2), (3), 3, i)

You may be thinking:

Lists look similar to vectors; are vectors just lists?

The answer is: it depends. Lists are indeed one of the most common forms of vectors. However, vectors are a more general concept, so besides lists, there are other instantiations of vectors. As we formally define vector spaces and see examples that are not lists, you will gradually develop an intuition for this.

\mathbf{R}, \mathbf{C}, and \mathbf{F}

You are probably already familiar with the notations \mathbf{R} and \mathbf{C}. \mathbf{R} represents the set of all the real numbers, and \mathbf{C} the set of all the complex numbers.

\mathbf{R}^n, naturally, represents the set of all n-lists of real numbers.

For example:

  • (1, 5) \in \mathbf{R}^2
  • (-1+i, 2+3i, 0) \in \mathbf{C}^3

So, are vectors just lists of length n of elements of \mathbf{R} or \mathbf{C}?

From an application perspective, this definition indeed works for most cases. But mathematicians are not satisfied with this: they want to go as abstract as possible and find a structure more general than \mathbf{R} or \mathbf{C}. You will see what this means should you stay till the end. For now, you can think of vectors as elements of \mathbf{R}^n or \mathbf{C}^n to help conceptualize them.

Many times, we don't specify whether we are working with \mathbf{R} or \mathbf{C}, so we will use \mathbf{F} to denote \mathbf{R} or \mathbf{C}.

Definition 1.6: notation: F

\mathbf{F} = \mathbf{R} or \mathbf{C}.
More specifically, if \mathbf{F}=\mathbf{R} and n=2, then \mathbf{F}^n is the real plane (\mathbf{R}^2). If n=3, then \mathbf{F}^n represents \mathbf{R}^3.

In fact, many of the theorems that work for \mathbf{R} and \mathbf{C} also work for arbitrary fields. A field is essentially a set over which addition, subtraction, multiplication, and division are defined. So \mathbf{R} and \mathbf{C} are both fields.

Even though many theorems can be applied to any field, most applications are concerned with \mathbf{R} and \mathbf{C}. In this series, we will not deal with fields other than \mathbf{R} and \mathbf{C}.

Addition and Scalar Multiplication on F^n

What linear algebra does, to put it in very simple terms, is extend addition and multiplication from one dimension to multi-dimensional spaces. So let's start with the most intuitive example: extending addition to \mathbf{F}^n.

Addition on \mathbf{F}^n

Addition on \mathbf{F}^n is identical to the familiar addition on \mathbf{R}^2 and \mathbf{R}^3: you just perform number addition on each entry of the list.

Definition 1.13: addition in F^n

(x_1, x_2, \ldots, x_n) + (y_1, y_2, \ldots, y_n) = (x_1+y_1, x_2+y_2, \ldots, x_n+y_n)

A few examples:

  • If \mathbf{F}=\mathbf{R}, then (1,-1) + (-3, 5) = (-2, 4)
  • If \mathbf{F}=\mathbf{C}, then (1+i, 2-i) + (i, 0) = (1+2i, 2-i)

This definition of addition satisfies commutativity, as you may verify.

Theorem 1.14: commutativity in F^n

If x, y \in \mathbf{F}^n, then x+y=y+x.

Notice that we use the word "define" here. This implies we could define a non-commutative addition. For example:

(x_1, x_2, \ldots, x_n) + (y_1, y_2, \ldots, y_n) = (x_1+2y_1, x_2+2y_2, \ldots, x_n+2y_n)

This definition of addition is not commutative.
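You can see the difference numerically. Below is a small check of my own (NumPy, for illustration only): the usual entrywise addition commutes, while the modified rule does not:

```python
import numpy as np

x = np.array([1.0, -1.0])
y = np.array([-3.0, 5.0])

# The entrywise addition of Definition 1.13 commutes
print(np.array_equal(x + y, y + x))  # True

# The modified rule (x_k + 2*y_k) is a valid operation, but not commutative
def odd_add(a, b):
    return a + 2 * b

print(odd_add(x, y))  # [-5.  9.]
print(odd_add(y, x))  # [-1.  3.] -> a different result
```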

We are not typically interested in these types of addition though, as they don't have the properties we desire.

Scalar Multiplication

Now we turn to defining scalar multiplication on \mathbf{F}^n.

As its name suggests, we multiply a vector by a scalar. We don't multiply two vectors with scalar multiplication.

Suppose we have a vector (x_1, x_2, \ldots, x_n) \in \mathbf{F}^n and a scalar \lambda \in \mathbf{F}. Then:

Definition 1.18: scalar multiplication on F^n

\lambda (x_1, x_2, \ldots, x_n) = (\lambda x_1, \lambda x_2, \ldots, \lambda x_n)

A few examples:

  • If \mathbf{F}=\mathbf{R}, then 3(1,-1) = (3, -3)
  • If \mathbf{F}=\mathbf{R}, then 0(1,-1) = 0, the 0 on the right being a vector, not a number
  • If \mathbf{F}=\mathbf{C}, then i(1+i, 2-i) = (-1+i, 1+2i)
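The examples above translate directly into code (again a NumPy sketch of my own):

```python
import numpy as np

print(3 * np.array([1, -1]))            # [ 3 -3]

# 0 times a vector gives the zero vector, not the number 0
print(0 * np.array([1, -1]))            # [0 0]

# Complex scalars work the same way: i(1+i, 2-i) = (-1+i, 1+2i)
print(1j * np.array([1 + 1j, 2 - 1j]))  # [-1.+1.j  1.+2.j]
```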

Selected Exercises from Linear Algebra Done Right, Ch1.A

  1. Find x \in \mathbf{R}^4 such that (4,-3,1,7)+2x=(5,9,-6,8).
  2. Explain why there does not exist \lambda \in \mathbf{C} such that \lambda(2-3i, 5+4i, -6+7i)=(12-5i, 7+22i, -32-9i).
  3. Show that \lambda(x+y)=\lambda x+\lambda y for all \lambda \in \mathbf{F} and all x, y \in \mathbf{F}^n.

1.B: Vector Spaces

Defining Vector Spaces

We previously defined addition and scalar multiplication on \mathbf{F}^n to show how these operations extend from one dimension to multiple dimensions.

Now we can proceed to define a vector space.

In simple terms, a vector space is a set where we can perform addition and scalar multiplication meaningfully.

Let's break this down:

  1. For any two elements u, v \in V of a vector space V, their sum u+v is also in V.
  2. For any scalar \lambda \in \mathbf{F} and vector v \in V, their product \lambda v is also in V.

To make this intuitive concept more formal, let us look at the actual definition of a vector space.

Definition 1.20: definition of vector space

A vector space V is a set along with an addition and a scalar multiplication on V such that the following properties hold.

commutativity

u+v=v+u for all u, v \in V

associativity

(u+v)+w=u+(v+w) and (ab)v=a(bv) for all u, v, w \in V and for all a, b \in \mathbf{F}

additive identity

There exists an element 0 \in V such that v+0=v for all v \in V

additive inverse

For every v \in V, there exists w \in V such that v+w=0

multiplicative identity

1v=v for all v \in V

distributive properties

a(u+v)=au+av and (a+b)v=av+bv for all a, b \in \mathbf{F} and all u, v \in V

Wow, that's a lot of conditions all at once - looks complicated!

Actually, this concept is quite simple. Remember what we said earlier? The most intuitive way to understand a vector space is as a "multi-dimensional set with defined addition and multiplication."

We want all the familiar properties of addition and multiplication that we know from \mathbf{R} and \mathbf{C} to apply to V as well. That's really all there is to it - you'll notice that all the rules above are just properties we're familiar with from addition and multiplication in \mathbf{R} or \mathbf{C}.

We want these same properties to work in vector spaces, which is why we define vector spaces this way.

At this point, you might want to go back and verify that the addition and scalar multiplication we defined earlier for \mathbf{F}^n satisfy all these conditions - check why \mathbf{F}^n is a vector space, whether \mathbf{F}=\mathbf{R} or \mathbf{F}=\mathbf{C}. Let me demonstrate how to verify that \mathbf{F}^n satisfies the third rule.

F^n has an additive identity

Suppose V = \mathbf{F}^n and u \in V. Then there exists 0 \in V such that u+0=u.

Proof:

  • The zero list (0, \dots, 0) is an element of \mathbf{F}^n
  • By the definition of addition on \mathbf{F}^n, u + (0, \dots, 0) = (u_1 + 0, \dots, u_n + 0) = u

"Isn't this obvious just by looking at it? Why do we need to prove it?"

It might seem tedious, but this is the nature of mathematical proof - we need to ensure each step is rigorous and can't skip any steps. This example is simple, so you might feel an urge to skip the proof since the result seems obvious. However, as the things we need to prove become more complex, you'll start to appreciate the importance of becoming familiar with this step-by-step reasoning process.

You can verify in the same way that \mathbf{F}^n satisfies all the conditions for a vector space. It's precisely because \mathbf{F}^n meets these conditions that we can say \mathbf{F}^n is indeed a vector space.
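If you want a quick sanity check alongside the pencil-and-paper verification, here is a numeric spot-check of the axioms on random vectors in \mathbf{R}^5 (my own NumPy sketch; passing checks on random samples is evidence, not a proof):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5
u, v, w = rng.standard_normal((3, n))   # three random vectors in R^5
a, b = rng.standard_normal(2)           # two random scalars

assert np.allclose(u + v, v + u)                 # commutativity
assert np.allclose((u + v) + w, u + (v + w))     # associativity
assert np.allclose((a * b) * v, a * (b * v))     # associativity (scalars)
assert np.allclose(v + np.zeros(n), v)           # additive identity
assert np.allclose(v + (-v), np.zeros(n))        # additive inverse
assert np.allclose(1 * v, v)                     # multiplicative identity
assert np.allclose(a * (u + v), a * u + a * v)   # distributivity
assert np.allclose((a + b) * v, a * v + b * v)   # distributivity
print("all axioms spot-checked on random samples")
```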

The vector spaces \mathbf{R}^n and \mathbf{C}^n are so common they get their own names:

Definition 1.22: Real Vector Space, Complex Vector Space

  • A vector space over \mathbf{R} is called a real vector space
  • A vector space over \mathbf{C} is called a complex vector space

Now we look at an example to illustrate that vector spaces aren't limited to \mathbf{F}^n, but are actually a more general structure.

Definition 1.24: Vector Space of Functions

  • If S is a set, then \mathbf{F}^S denotes the set of all functions from the set S to \mathbf{F}
  • For f, g \in \mathbf{F}^S, their sum f+g \in \mathbf{F}^S is a function. For all x \in S, we define f+g by
(f+g)(x)=f(x)+g(x)
  • For \lambda \in \mathbf{F} and f \in \mathbf{F}^S, their product \lambda f \in \mathbf{F}^S is a function. For any x \in S, we define \lambda f by (\lambda f)(x)=\lambda f(x)

The set defined above is a set of functions; we defined addition and scalar multiplication on this set to make it a vector space. You may verify that these operations satisfy the definition of a vector space, making \mathbf{F}^S a valid vector space.

As an example, let us verify that \mathbf{F}^S has an additive identity.

In the set \mathbf{F}^S, there exists the function 0: S \rightarrow \mathbf{F} defined by

0(x)=0

for all x \in S.

This 0 function is the additive identity in \mathbf{F}^S. The other conditions can be verified in a similar fashion.

As such, \mathbf{F}^S is a valid vector space, and we can refer to the functions in this set as vectors, as with any other vector space.
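To make \mathbf{F}^S concrete, here is a small sketch (my own Python illustration with S = \mathbf{R}) where functions are plain callables and addition and scalar multiplication are defined pointwise, as in Definition 1.24:

```python
# Vectors here are functions; addition and scalar multiplication are pointwise.

def add(f, g):
    return lambda x: f(x) + g(x)   # (f+g)(x) = f(x) + g(x)

def scale(lam, f):
    return lambda x: lam * f(x)    # (lam f)(x) = lam * f(x)

zero = lambda x: 0                 # the additive identity: 0(x) = 0

f = lambda x: x ** 2
g = lambda x: 3 * x

print(add(f, g)(2.0))      # 10.0 = f(2) + g(2)
print(add(f, zero)(2.0))   # 4.0 -> adding the zero function changes nothing
print(scale(5, g)(2.0))    # 30.0
```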

At this point, you should be able to appreciate that vector spaces are not limited to the form \mathbf{F}^n, and a vector doesn't have to be a list.

Further Thoughts: Matrices

We can use \mathbf{F}^{m,n} to represent the set of all m \times n matrices.

For example:

  • \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} \in \mathbf{F}^{2,2}
  • \begin{bmatrix} 1 & 2 \\ 3 & 4 \\ 4 & 4 \end{bmatrix} \in \mathbf{F}^{3,2}

So, is \mathbf{F}^{m,n} a vector space? What definitions of matrix addition and scalar multiplication would make \mathbf{F}^{m,n} satisfy the conditions of a vector space?
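As a hint, here is the natural choice sketched in NumPy (entrywise addition and scalar multiplication; verifying the axioms is the actual exercise):

```python
import numpy as np

A = np.array([[1, 0], [0, 1]])
B = np.array([[1, 2], [3, 4]])

print(A + B)             # entrywise sum: still a 2 x 2 matrix
print(2 * A)             # entrywise scalar multiple: still 2 x 2
print(np.zeros((2, 2)))  # a candidate additive identity for F^{2,2}
```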

Selected Exercises From Linear Algebra Done Right, Ch1.B

  1. The empty set is not a vector space. The empty set fails to satisfy only one of the requirements listed in the definition of a vector space (1.20). Which one?

  2. Show that in the definition of a vector space (1.20), the additive inverse condition can be replaced with the condition that

    0v=0 \text{ for all } v \in V.

    Here the 0 on the left side is the number 0, and the 0 on the right side is the additive identity of V.

    The phrase "a condition can be replaced" in a definition means that the collection of objects satisfying the definition is unchanged if the original condition is replaced with the new condition.

  3. Suppose V is a real vector space.

     • The complexification of V, denoted by V_{\mathrm{C}}, equals V \times V. An element of V_{\mathrm{C}} is an ordered pair (u, v), where u, v \in V, but we write this as u+iv.

     • Addition on V_{\mathrm{C}} is defined by

       (u_1+iv_1)+(u_2+iv_2)=(u_1+u_2)+i(v_1+v_2)

       for all u_1, v_1, u_2, v_2 \in V.

     • Complex scalar multiplication on V_{\mathrm{C}} is defined by

       (a+bi)(u+iv)=(au-bv)+i(av+bu)

       for all a, b \in \mathbf{R} and all u, v \in V.

     Prove that with the definitions of addition and scalar multiplication as above, V_{\mathrm{C}} is a complex vector space.

1.C: Subspaces

Now that we have a clear idea of what vector spaces are, we are ready to learn what a subspace is.

Definition 1.33: Subspace

Let U be a subset of V. If U satisfies the definition of a vector space under the same additive identity, vector addition, and scalar multiplication as V, then U is a subspace of V.

In simple terms, if a subset of V is itself a vector space, then it's a subspace. For example, \mathbf{R}^2 is a vector space, and \{(1, 1)\} is a subset of it, but clearly not a vector space: when you multiply (1, 1) by any scalar other than 1, the resulting vector isn't in this subset. However, \{\lambda(1, 1) : \lambda \in \mathbf{R}\} is a subspace.

So, how can we quickly check whether a subset is a valid subspace? Do we really need to verify all six rules every time?

Actually, it's simpler than you might think. We only need to check:

Theorem 1.34: quick check for subspaces

A subset U of a vector space V is a subspace of V if and only if it satisfies these three conditions:

Additive identity

0 \in U

Closed under addition

If u, w \in U, then u+w \in U

Closed under scalar multiplication

If a \in \mathbf{F} and u \in U, then au \in U

Straightforward, isn't it? A vector space is, by definition, closed under addition and multiplication after all.

Straightforward as it may seem, it's not that simple. To prove these 3 conditions are both necessary and sufficient for a subspace, we need to prove both directions: left to right and right to left.

From left to right, that is:

U is a subspace \implies the three conditions hold

This direction is straightforward: if we assume U is a subspace, then U satisfies the full definition of a vector space, so naturally it satisfies these three conditions as well.

The trickier part is proving from right to left, that is:

The three conditions hold \implies U is a subspace

To complete this proof, we need to assume those 3 conditions hold and derive from there that U satisfies the definition of a vector space.

Since this series is meant to be just an introduction, providing an entry point, we won't delve into the details of the proof. If you want to understand more deeply, you can refer to Linear Algebra Done Right.

Back to the main topic. Now that we know how to verify a subspace, let's look at a few examples to familiarize ourselves with the concept:

  1. If b \in \mathbf{F}, then

     U=\left\{\left(x_1, x_2, x_3, x_4\right) \in \mathbf{F}^4: x_3=5x_4+b\right\}

     is a subspace of \mathbf{F}^4 if and only if b=0.

  2. The set of all differentiable real-valued functions on \mathbf{R} is a subspace of \mathbf{R}^{\mathbf{R}}.

Example 1

Let's first look at the first example:

U=\left\{\left(x_1, x_2, x_3, x_4\right) \in \mathbf{F}^4: x_3=5x_4+b\right\}

If b \neq 0, then x_4=0 forces x_3 = b \neq 0. In particular, the additive identity (0, 0, 0, 0) is not in the set, so U is not a subspace.

The above characterization provides an intuition as an entry point. To complete a rigorous proof, we, again, need to prove both directions.

  1. (\implies) Suppose U is a subspace, then prove b=0.
  2. (\impliedby) Suppose b=0, then prove U is a subspace.

I will leave out the details here. Try to complete the proof following the sketch provided above.
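While only a proof settles the claim, you can gather numeric evidence with a quick script (my own NumPy sketch; random samples can't prove closure, but they can expose a failure):

```python
import numpy as np

rng = np.random.default_rng(1)

def in_U(x, b):
    """Membership test for U = {x in R^4 : x_3 = 5 x_4 + b}."""
    return np.isclose(x[2], 5 * x[3] + b)

def sample_U(b):
    x = rng.standard_normal(4)
    x[2] = 5 * x[3] + b   # force membership in U
    return x

# b = 0: the three subspace conditions hold on random samples
b = 0.0
u, w = sample_U(b), sample_U(b)
lam = rng.standard_normal()
assert in_U(np.zeros(4), b)   # contains the additive identity
assert in_U(u + w, b)         # closed under addition
assert in_U(lam * u, b)       # closed under scalar multiplication

# b = 1: the zero vector is not in U, so U cannot be a subspace
assert not in_U(np.zeros(4), 1.0)
print("numeric checks agree with the claim")
```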

Example 2

The set of all differentiable functions on \mathbf{R} can be denoted by:

U=\{f: \mathbf{R} \rightarrow \mathbf{R} \mid f \text{ is differentiable}\}

Since every f \in U is a function that maps \mathbf{R} to \mathbf{R}, we have f \in \mathbf{R}^{\mathbf{R}}, so U is a subset of \mathbf{R}^{\mathbf{R}}.

So to show U is a subspace, we only need to verify that U is (a) closed under addition, (b) closed under scalar multiplication, and (c) has an additive identity.

(a): The sum of two differentiable functions is clearly differentiable. Hence (a) holds.

(b): A scalar multiple of a differentiable function is still differentiable. Hence (b) holds.

(c): Lastly, 0(x) = 0 is trivially a differentiable function, so the additive identity exists in U.

With that, we have successfully proved that U is indeed a subspace of \mathbf{R}^{\mathbf{R}}.

Sum and Direct Sum of Vector Spaces

We have learned via some examples what a valid subspace is.

An idea that naturally follows is: how do we combine subspaces?

The union operation on sets may come to mind, as vector spaces are, in fact, sets. However, a quick check reveals that the union of two subspaces is not always a valid vector space.

For example, suppose A = \{(x, 0) : x \in \mathbf{R}\} and B = \{(0, y) : y \in \mathbf{R}\}. Even though both A and B are vector spaces, their union U = A \cup B is not, because (1, 0) \in U and (0, 1) \in U, but their sum (1, 1) \notin U.

So, union is not the way of combining subspaces that we want.

You may pause and think: what definition of a sum would keep the result a valid subspace? Below is the natural result you will arrive at:

Definition 1.36: sum of subspaces

Suppose V_1, \ldots, V_m are subspaces of V. The sum of V_1, \ldots, V_m, denoted by V_1+\cdots+V_m, is the set of all possible sums of elements of V_1, \ldots, V_m. Specifically,

V_1+\cdots+V_m=\left\{v_1+\cdots+v_m: v_1 \in V_1, \ldots, v_m \in V_m\right\}.

With this definition of addition, the sum of vector spaces will still be a vector space.

Going back to the example we just looked at, A = \{(x, 0) : x \in \mathbf{R}\} and B = \{(0, y) : y \in \mathbf{R}\}: with this definition of addition, A+B=\mathbf{R}^2.
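A quick numeric illustration of both points (my own sketch): the union misses sums like (1, 1), while the sum A + B reaches every point of \mathbf{R}^2:

```python
import numpy as np

# Membership tests for A = {(x, 0)} and B = {(0, y)}
in_A = lambda v: np.isclose(v[1], 0.0)
in_B = lambda v: np.isclose(v[0], 0.0)

p = np.array([1.0, 0.0])   # in A
q = np.array([0.0, 1.0])   # in B

s = p + q                  # (1, 1)
print(in_A(s) or in_B(s))  # False -> the union A ∪ B is not closed

# But any (x, y) in R^2 splits as (x, 0) + (0, y), so A + B = R^2
x, y = 3.0, -2.0
print(np.array([x, 0.0]) + np.array([0.0, y]))  # [ 3. -2.]
```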

Direct Sums

The sum of vector spaces involves adding vector spaces together, and a common application of this is when we want to decompose a vector space into multiple subspaces. In this case, we are particularly interested in situations where each vector can be uniquely decomposed.

This situation is called a direct sum. Let's look at the definition of a direct sum.

Definition 1.41: Direct Sum

Suppose V_1, \ldots, V_m are subspaces of V.

  • If every element in V_1+\cdots+V_m can be represented uniquely as a sum v_1+\cdots+v_m (where each v_k \in V_k), then the sum V_1+\cdots+V_m is called a direct sum.
  • If V_1+\cdots+V_m is a direct sum, we denote it as V_1 \oplus \cdots \oplus V_m, where the symbol \oplus is used to indicate that this is a direct sum.

Let's use a few examples to deepen our understanding of direct sums:

Example of a Direct Sum

Suppose U is the subspace of \mathbf{F}^3 where the last coordinate is equal to 0, and W is the subspace of \mathbf{F}^3 where the first two coordinates are equal to 0, that is:

U=\left\{(x, y, 0) \in \mathbf{F}^3: x, y \in \mathbf{F}\right\} \quad \text{and} \quad W=\left\{(0,0,z) \in \mathbf{F}^3: z \in \mathbf{F}\right\}.

Then \mathbf{F}^3=U \oplus W, which you can verify for yourselves.


Example of a Non-Direct Sum

Suppose

\begin{aligned} & V_1=\left\{(x, y, 0) \in \mathbf{F}^3: x, y \in \mathbf{F}\right\}, \\ & V_2=\left\{(0,0,z) \in \mathbf{F}^3: z \in \mathbf{F}\right\}, \\ & V_3=\left\{(0, y, y) \in \mathbf{F}^3: y \in \mathbf{F}\right\} \end{aligned}

Then \mathbf{F}^3=V_1+V_2+V_3, because every vector (x, y, z) \in \mathbf{F}^3 can be written as

(x, y, z)=(x, y, 0)+(0,0,z)+(0,0,0)

where the first vector on the right side belongs to V_1, the second to V_2, and the third to V_3.

However, \mathbf{F}^3 is not a direct sum of V_1, V_2, V_3, because the vector (0,0,0) can be written as a sum v_1+v_2+v_3 in multiple ways. Specifically, here's one way to decompose it:

(0,0,0)=(0,1,0)+(0,0,1)+(0,-1,-1)

And here's another way:

(0,0,0)=(0,0,0)+(0,0,0)+(0,0,0),

where in each of the above equations, the first vector on the right side belongs to V_1, the second to V_2, and the third to V_3. Therefore, the sum V_1+V_2+V_3 is not a direct sum.

Next, let's look at some properties of direct sums.

Theorem 1.45: Condition for Direct Sum

Suppose V_1, \ldots, V_m are subspaces of V. Then V_1+\cdots+V_m is a direct sum \Longleftrightarrow the only way to represent 0 as a sum v_1+\cdots+v_m (where each v_k \in V_k) is by having each v_k equal to 0.

First, how do we prove this theorem? Since this is an if and only if relationship, we need to prove both directions.

(\implies)

To prove the forward direction, we first assume that the sum V_1+\cdots+V_m is a direct sum. Then by the definition of a direct sum, every vector has only one way to be decomposed. So naturally, the 0 vector also has only one way to be decomposed.

(\impliedby)

To prove the reverse direction, we first assume that 0 has only one way to be decomposed. Since V_1, \ldots, V_m are each subspaces, they each contain the 0 vector (the additive identity). So, 0 can be decomposed in the following way:

0 = 0+\cdots+0

Since we've already assumed that 0 has only one way to be decomposed, this means the above decomposition is the only way.

Next, take any vector v \in V_1+\cdots+V_m, and suppose we decompose v in two ways:

v=v_1+\cdots+v_m \qquad v=u_1+\cdots+u_m

Subtracting these two equations, we get

0=\left(v_1-u_1\right)+\cdots+\left(v_m-u_m\right).

Since each V_k is a subspace, each v_k-u_k \in V_k, so this is a decomposition of 0. Because we've already shown that the only way to decompose 0 is

0 = 0 + \cdots + 0

We get

u_1=v_1, \quad u_2=v_2, \quad \ldots, \quad u_m=v_m

Therefore, any vector v has only one way to be decomposed. With this, we have proven both directions.

How should we interpret this theorem? What does it mean?

It means that we only need to confirm that the only way to decompose the 0 vector into a combination of V_1, \dots, V_m is 0+0+\dots+0 to be certain that V_1+\cdots+V_m is a direct sum. In other words, we don't need to confirm that every vector in V has a unique decomposition; we just need to confirm that 0 does.

You might have also got a vague sense that the intersection of two subspaces U and W must be \{0\} for their sum to be direct. Since any subspace must contain 0, the intersection must contain 0 as well.

Moreover, if the intersection contains a vector other than 0, the sum won't be direct, because some vector will have more than one decomposition. Specifically: if U \cap W contains a non-zero vector v, then v has two decompositions v=0+v=v+0, so U+W is not a direct sum.

In fact, U \cap W=\{0\} is not only a necessary condition; it's also sufficient. That is, as long as U \cap W=\{0\}, the sum U+W is a direct sum, i.e., U \oplus W. Let's see why below.

Theorem 1.46: Direct Sum \Longleftrightarrow Zero Intersection

Suppose U and W are subspaces of V. Then:

U+W is a direct sum \Longleftrightarrow U \cap W=\{0\}.

Proof:

This proof also requires proving both directions.

(\implies)

Assume U+W is a direct sum. Now we want to prove that the intersection contains only the 0 vector. So we assume v \in U \cap W. If we can prove that v = 0, we will have proven that U \cap W = \{0\}.

  • Since v \in U \cap W, we have v \in U and v \in W
  • Since W is a subspace, -v \in W
  • We can decompose 0 as 0 = v + (-v), where v \in U and -v \in W
  • Since U and W are both subspaces, they both contain the 0 vector, so 0 = 0 + 0 is another way to decompose 0
  • Since U+W is a direct sum, 0 has only one decomposition, so v = 0 (and -v = 0), which proves the statement

(\impliedby)

Assume U \cap W = \{0\}. According to 1.45, we only need to prove that the unique way to decompose 0 is 0 = 0+0 in order to prove that U+W is a direct sum.

  • Assume 0 = u + w, where u \in U and w \in W
  • 0 = u + w means u = -w
  • This means u \in W (because u is the additive inverse of w, and the subspace W is closed under additive inverses)
  • So u \in U \cap W = \{0\}
  • Therefore u = 0 and w = 0, which proves the statement

Selected Exercises From Linear Algebra Done Right, Ch1.C

  1. For each of the following subsets of \mathbf{F}^3, determine whether it is a subspace of \mathbf{F}^3.

    • (a) \left\{\left(x_1, x_2, x_3\right) \in \mathbf{F}^3: x_1+2x_2+3x_3=0\right\}
    • (b) \left\{\left(x_1, x_2, x_3\right) \in \mathbf{F}^3: x_1+2x_2+3x_3=4\right\}
    • (c) \left\{\left(x_1, x_2, x_3\right) \in \mathbf{F}^3: x_1 x_2 x_3=0\right\}
    • (d) \left\{\left(x_1, x_2, x_3\right) \in \mathbf{F}^3: x_1=5x_3\right\}
  2. Prove or give a counterexample: If U is a nonempty subset of \mathbf{R}^2 such that U is closed under addition and under taking additive inverses (meaning -u \in U whenever u \in U), then U is a subspace of \mathbf{R}^2.

  3. Prove or give a counterexample: If V_1, V_2, U are subspaces of V such that

     V=V_1 \oplus U \quad \text{and} \quad V=V_2 \oplus U,

     then V_1=V_2.

     Hint: When trying to discover whether a conjecture in linear algebra is true or false, it is often useful to start by experimenting in \mathbf{F}^2.
