Introduction to Methods of Applied Mathematics
or
Advanced Mathematical Methods for Scientists and Engineers
Sean Mauch
http://www.its.caltech.edu/~sean
January 24, 2004
Anti-Copyright
Anti-Copyright @ 1995-2001 by Mauch Publishing Company, un-Incorporated.
No rights reserved. Any part of this publication may be reproduced, stored in a retrieval system, transmitted or
desecrated without permission.
Preface
During the summer before my final undergraduate year at Caltech I set out to write a math text unlike any other,
namely, one written by me. In that respect I have succeeded beautifully. Unfortunately, the text is neither complete nor
polished. I have a “Warnings and Disclaimers” section below that is a little amusing, and an appendix on probability
that I feel concisely captures the essence of the subject. However, all the material in between is in some stage of
development. I am currently working to improve and expand this text.
This text is freely available from my web site. Currently I’m at http://www.its.caltech.edu/~sean. I post new
versions a couple of times a year.
0.1 Advice to Teachers
If you have something worth saying, write it down.
0.2 Acknowledgments
I would like to thank Professor Saffman for advising me on this project and the Caltech SURF program for providing
the funding for me to write the first edition of this book.
0.3 Warnings and Disclaimers
• This book is a work in progress. It contains quite a few mistakes and typos. I would greatly appreciate your
constructive criticism. You can reach me at ‘sean@caltech.edu’.
• Reading this book impairs your ability to drive a car or operate machinery.
• This book has been found to cause drowsiness in laboratory animals.
• This book contains twenty-three times the US RDA of fiber.
• Caution: FLAMMABLE - Do not read while smoking or near a fire.
• If infection, rash, or irritation develops, discontinue use and consult a physician.
• Warning: For external use only. Use only as directed. Intentional misuse by deliberately concentrating contents
can be harmful or fatal. KEEP OUT OF REACH OF CHILDREN.
• In the unlikely event of a water landing do not use this book as a flotation device.
• The material in this text is fiction; any resemblance to real theorems, living or dead, is purely coincidental.
• This is by far the most amusing section of this book.
• Finding the typos and mistakes in this book is left as an exercise for the reader. (Eye ewes a spelling chequer
from thyme too thyme, sew their should knot bee two many misspellings. Though I ain’t so sure the grammar’s
too good.)
• The theorems and methods in this text are subject to change without notice.
• This is a chain book. If you do not make seven copies and distribute them to your friends within ten days of
obtaining this text you will suffer great misfortune and other nastiness.
• The surgeon general has determined that excessive studying is detrimental to your social life.
• This text has been buffered for your protection and ribbed for your pleasure.
• Stop reading this rubbish and get back to work!
0.4 Suggested Use
This text is well suited to the student, professional or lay-person. It makes a superb gift. This text has a bouquet that
is light and fruity, with some earthy undertones. It is ideal with dinner or as an aperitif. Bon appétit!
0.5 About the Title
The title is only making light of naming conventions in the sciences and is not an insult to engineers. If you want to
learn about some mathematical subject, look for books with “Introduction” or “Elementary” in the title. If it is an
“Intermediate” text it will be incomprehensible. If it is “Advanced” then not only will it be incomprehensible, it will
have low production qualities, i.e. a crappy typewriter font, no graphics and no examples. There is an exception to this
rule: When the title also contains the word “Scientists” or “Engineers” the advanced book may be quite suitable for
actually learning the material.
Chapter 1
Sets and Functions
1.1 Sets
Definition. A set is a collection of objects. We call the objects, elements. A set is denoted by listing the elements
between braces. For example: {e, ı, π, 1} is the set of the integer 1, the pure imaginary number ı = √−1 and the
transcendental numbers e = 2.7182818 . . . and π = 3.1415926 . . .. For elements of a set, we do not count multiplicities.
We regard the set {1, 2, 2, 3, 3, 3} as identical to the set {1, 2, 3}. Order is not significant in sets. The set {1, 2, 3} is
equivalent to {3, 2, 1}.
In enumerating the elements of a set, we use ellipses to indicate patterns. We denote the set of positive integers as
{1, 2, 3, . . .}. We also denote sets with the notation {x|conditions on x} for sets that are more easily described than
enumerated. This is read as “the set of elements x such that . . . ”. x ∈ S is the notation for “x is an element of the
set S.” To express the opposite we have x ∉ S for “x is not an element of the set S.”
Examples. We have notations for denoting some of the commonly encountered sets.
• ∅ = {} is the empty set, the set containing no elements.
• Z = {. . . , −3, −2, −1, 0, 1, 2, 3 . . .} is the set of integers. (Z is for “Zahlen”, the German word for “number”.)
• Q = {p/q | p, q ∈ Z, q ≠ 0} is the set of rational numbers. (Q is for quotient.)¹
• R = {x | x = a₁a₂ · · · aₙ.b₁b₂ · · · } is the set of real numbers, i.e. the set of numbers with decimal expansions.²
• C = {a + ıb | a, b ∈ R, ı² = −1} is the set of complex numbers. ı is the square root of −1. (If you haven’t seen
complex numbers before, don’t dismay. We’ll cover them later.)
• Z⁺, Q⁺ and R⁺ are the sets of positive integers, rationals and reals, respectively. For example, Z⁺ = {1, 2, 3, . . .}.
We use a − superscript to denote the sets of negative numbers.
• Z⁰⁺, Q⁰⁺ and R⁰⁺ are the sets of non-negative integers, rationals and reals, respectively. For example, Z⁰⁺ = {0, 1, 2, . . .}.
• (a . . . b) denotes an open interval on the real axis. (a . . . b) ≡ {x|x ∈ R, a < x < b}
• We use brackets to denote the closed interval. [a..b] ≡ {x|x ∈ R, a ≤ x ≤ b}
The cardinality or order of a set S is denoted |S|. For finite sets, the cardinality is the number of elements in the
set. The Cartesian product of two sets is the set of ordered pairs:
X × Y ≡ {(x, y)|x ∈ X, y ∈ Y }.
The Cartesian product of n sets is the set of ordered n-tuples:
X1 × X2 × · · · × Xn ≡ {(x1, x2, . . . , xn)|x1 ∈ X1, x2 ∈ X2, . . . , xn ∈ Xn}.
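The definitions so far can be tried out on small finite sets. The following is a minimal sketch using Python's built-in `set` type and `itertools.product`; the particular sets `X` and `Y` are arbitrary illustrations, not taken from the text.

```python
from itertools import product

# Two small finite sets chosen only for illustration.
X = {1, 2, 3}
Y = {"a", "b"}

# Multiplicities and order are ignored when a set is built.
same_set = {1, 2, 2, 3, 3, 3} == {3, 2, 1}

# The Cartesian product X × Y as a set of ordered pairs (x, y).
XY = set(product(X, Y))

# For finite sets the cardinality is the number of elements.
cardinality = len(XY)

print(same_set, cardinality, (1, "a") in XY)
```

For finite sets the number of ordered pairs is |X| |Y|, which is why `cardinality` equals `len(X) * len(Y)` here.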
Equality. Two sets S and T are equal if each element of S is an element of T and vice versa. This is denoted,
S = T. Inequality is S ≠ T, of course. S is a subset of T, S ⊆ T, if every element of S is an element of T. S is a
proper subset of T, S ⊂ T, if S ⊆ T and S ≠ T. For example: The empty set is a subset of every set, ∅ ⊆ S. The
rational numbers are a proper subset of the real numbers, Q ⊂ R.
¹ Note that with this description, we enumerate each rational number an infinite number of times. For example: 1/2 = 2/4 =
3/6 = (−1)/(−2) = · · · . This does not pose a problem as we do not count multiplicities.
² Guess what R is for.
Operations. The union of two sets, S ∪ T, is the set whose elements are in either of the two sets. The union of n
sets,
\[ \bigcup_{j=1}^{n} S_j \equiv S_1 \cup S_2 \cup \cdots \cup S_n \]
is the set whose elements are in any of the sets Sⱼ. The intersection of two sets, S ∩ T, is the set whose elements are
in both of the two sets. In other words, the intersection of two sets is the set of elements that the two sets have in
common. The intersection of n sets,
\[ \bigcap_{j=1}^{n} S_j \equiv S_1 \cap S_2 \cap \cdots \cap S_n \]
is the set whose elements are in all of the sets Sⱼ. If two sets have no elements in common, S ∩ T = ∅, then the sets
are disjoint. If T ⊆ S, then the difference between S and T, S \ T, is the set of elements in S which are not in T.
S \ T ≡ {x | x ∈ S, x ∉ T}
The difference of sets is also denoted S − T.
Properties. The following properties are easily verified from the above definitions.
• S ∪ ∅ = S, S ∩ ∅ = ∅, S \ ∅ = S, S \ S = ∅.
• Commutative. S ∪ T = T ∪ S, S ∩ T = T ∩ S.
• Associative. (S ∪ T) ∪ U = S ∪ (T ∪ U) = S ∪ T ∪ U, (S ∩ T) ∩ U = S ∩ (T ∩ U) = S ∩ T ∩ U.
• Distributive. S ∪ (T ∩ U) = (S ∪ T) ∩ (S ∪ U), S ∩ (T ∪ U) = (S ∩ T) ∪ (S ∩ U).
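The properties listed above can be spot-checked on concrete sets. This is only a sketch: the sets `S`, `T` and `U` are arbitrary examples, and checking a few instances illustrates the identities without proving them. Python's `|`, `&` and `-` operators play the roles of union, intersection and difference.

```python
# Arbitrary example sets, chosen only for illustration.
S = {1, 2, 3, 4}
T = {3, 4, 5}
U = {4, 5, 6}
empty = set()

identities = [
    S | empty == S,                      # S ∪ ∅ = S
    S & empty == empty,                  # S ∩ ∅ = ∅
    S - empty == S,                      # S \ ∅ = S
    S - S == empty,                      # S \ S = ∅
    S | T == T | S,                      # commutative (union)
    S & T == T & S,                      # commutative (intersection)
    (S | T) | U == S | (T | U),          # associative (union)
    (S & T) & U == S & (T & U),          # associative (intersection)
    S | (T & U) == (S | T) & (S | U),    # distributive
    S & (T | U) == (S & T) | (S & U),    # distributive
]

all_hold = all(identities)
print(all_hold)
```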
1.2 Single Valued Functions
Single-Valued Functions. A single-valued function or single-valued mapping is a mapping of the elements x ∈ X
into elements y ∈ Y . This is expressed as f : X → Y or $X \xrightarrow{f} Y$. If such a function is well-defined, then for each
x ∈ X there exists a unique element y ∈ Y such that f(x) = y. The set X is the domain of the function, Y is the
codomain, (not to be confused with the range, which we introduce shortly). To denote the value of a function on a
particular element we can use any of the notations: f(x) = y, f : x ↦ y or simply x ↦ y. f is the identity map on
X if f(x) = x for all x ∈ X.
Let f : X → Y . The range or image of f is
f(X) = {y|y = f(x) for some x ∈ X}.
The range is a subset of the codomain. For each Z ⊆ Y , the inverse image of Z is defined:
f⁻¹(Z) ≡ {x ∈ X | f(x) = z for some z ∈ Z}.
Examples.
• Finite polynomials, $f(x) = \sum_{k=0}^{n} a_k x^k$, $a_k \in R$, and the exponential function, $f(x) = e^x$, are examples of single
valued functions which map real numbers to real numbers.
• The greatest integer function, f(x) = ⌊x⌋, is a mapping from R to Z. ⌊x⌋ is defined as the greatest integer less
than or equal to x. Likewise, the least integer function, f(x) = ⌈x⌉, is the least integer greater than or equal to
x.
The -jectives. A function is injective if for each x₁ ≠ x₂, f(x₁) ≠ f(x₂). In other words, distinct elements are
mapped to distinct elements. f is surjective if for each y in the codomain, there is an x such that y = f(x). If a
function is both injective and surjective, then it is bijective. A bijective function is also called a one-to-one mapping.
Examples.
• The exponential function $f(x) = e^x$, considered as a mapping from R to R⁺, is bijective, (a one-to-one mapping).
• f(x) = x² is a bijection from R⁺ to R⁺. f is not injective from R to R⁺. For each positive y in the range, there
are two values of x such that y = x².
• f(x) = sin x is not injective from R to [−1..1]. For each y ∈ [−1..1] there exists an infinite number of values of
x such that y = sin x.
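On finite sets, the definitions of injective, surjective, bijective and inverse image can be checked by brute force. The sketch below is an illustration only; the helper names (`is_injective`, `is_surjective`, `is_bijective`, `inverse_image`) and the integer domains are assumptions made for this example, standing in for the real-valued cases discussed above.

```python
def is_injective(f, domain):
    """Distinct elements of the domain are mapped to distinct elements."""
    images = [f(x) for x in domain]
    return len(set(images)) == len(images)

def is_surjective(f, domain, codomain):
    """Every y in the codomain equals f(x) for some x in the domain."""
    return set(codomain) <= {f(x) for x in domain}

def is_bijective(f, domain, codomain):
    return is_injective(f, domain) and is_surjective(f, domain, codomain)

def inverse_image(f, domain, Z):
    """The set of x in the domain with f(x) in Z."""
    return {x for x in domain if f(x) in Z}

def square(x):
    return x * x

whole = range(-3, 4)      # {-3, ..., 3}, a finite stand-in for R
positive = range(1, 4)    # {1, 2, 3}, a finite stand-in for R+

print(is_injective(square, whole))               # two x values share each positive y
print(is_bijective(square, positive, {1, 4, 9}))
print(inverse_image(square, whole, {4}))
```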
Figure 1.1: Depictions of Injective, Surjective and Bijective Functions
1.3 Inverses and Multi-Valued Functions
If y = f(x), then we can write x = f⁻¹(y) where f⁻¹ is the inverse of f. If y = f(x) is a one-to-one function, then
f⁻¹(y) is also a one-to-one function. In this case, x = f⁻¹(f(x)) = f(f⁻¹(x)) for values of x where both f(x) and
f⁻¹(x) are defined. For example ln x, which maps R⁺ to R, is the inverse of $e^x$. $x = e^{\ln x} = \ln(e^x)$ for all x ∈ R⁺.
(Note that x ∈ R⁺ ensures that ln x is defined.)
If y = f(x) is a many-to-one function, then x = f⁻¹(y) is a one-to-many function. f⁻¹(y) is a multi-valued function.
We have x = f(f⁻¹(x)) for values of x where f⁻¹(x) is defined, however x ≠ f⁻¹(f(x)). There are diagrams showing
one-to-one, many-to-one and one-to-many functions in Figure 1.2.
Example 1.3.1 y = x², a many-to-one function, has the inverse $x = y^{1/2}$. For each positive y, there are two values of
x such that $x = y^{1/2}$. y = x² and $y = x^{1/2}$ are graphed in Figure 1.3.
We say that there are two branches of $y = x^{1/2}$: the positive and the negative branch. We denote the positive
branch as y = √x; the negative branch is y = −√x. We call √x the principal branch of $x^{1/2}$. Note that √x is a
one-to-one function. Finally, $x = (x^{1/2})^2$ since $(\pm\sqrt{x})^2 = x$, but $x \neq (x^2)^{1/2}$ since $(x^2)^{1/2} = \pm x$. y = √x is graphed
in Figure 1.4.
Figure 1.2: Diagrams of One-To-One, Many-To-One and One-To-Many Functions
Figure 1.3: y = x² and $y = x^{1/2}$
Figure 1.4: y = √x
Now consider the many-to-one function y = sin x. The inverse is x = arcsin y. For each y ∈ [−1..1] there are an
infinite number of values x such that x = arcsin y. In Figure 1.5 is a graph of y = sin x and a graph of a few branches
of y = arcsin x.
Figure 1.5: y = sin x and y = arcsin x
Example 1.3.2 arcsin x has an infinite number of branches. We will denote the principal branch by Arcsin x which
maps [−1..1] to [−π/2..π/2]. Note that x = sin(arcsin x), but x ≠ arcsin(sin x). y = Arcsin x is graphed in Figure 1.6.
Figure 1.6: y = Arcsin x
Example 1.3.3 Consider $1^{1/3}$. Since x³ is a one-to-one function, $x^{1/3}$ is a single-valued function. (See Figure 1.7.)
$1^{1/3} = 1$.
Figure 1.7: y = x³ and $y = x^{1/3}$
Example 1.3.4 Consider arccos(1/2). cos x and a portion of arccos x are graphed in Figure 1.8. The equation
cos x = 1/2 has the two solutions x = ±π/3 in the range x ∈ (−π..π]. We use the periodicity of the cosine,
cos(x + 2π) = cos x, to find the remaining solutions.
arccos(1/2) = {±π/3 + 2nπ}, n ∈ Z.
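A numerical sketch of Example 1.3.4: the library function `math.acos` returns only the principal value, so the other values of the multi-valued arccos are generated from it using the evenness and the periodicity of the cosine. The helper name `arccos_branches` and the finite range of n are assumptions made for illustration.

```python
import math

def arccos_branches(y, n_values):
    """Values x = ±Arccos(y) + 2nπ for the given integers n."""
    principal = math.acos(y)  # principal value, in [0, π]
    return [sign * principal + 2 * n * math.pi
            for n in n_values
            for sign in (1, -1)]

solutions = arccos_branches(0.5, range(-2, 3))

# Every generated value should satisfy cos x = 1/2 up to rounding error.
residuals = [abs(math.cos(x) - 0.5) for x in solutions]
print(max(residuals) < 1e-12)
```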
Figure 1.8: y = cos x and y = arccos x
1.4 Transforming Equations
Consider the equation g(x) = h(x) and the single-valued function f(x). A particular value of x is a solution of the
equation if substituting that value into the equation results in an identity. In determining the solutions of an equation,
we often apply functions to each side of the equation in order to simplify its form. We apply the function to obtain
a second equation, f(g(x)) = f(h(x)). If x = ξ is a solution of the former equation, (let ψ = g(ξ) = h(ξ)), then it
is necessarily a solution of the latter. This is because f(g(ξ)) = f(h(ξ)) reduces to the identity f(ψ) = f(ψ). If f(x) is
bijective, then the converse is true: any solution of the latter equation is a solution of the former equation. Suppose
that x = ξ is a solution of the latter, f(g(ξ)) = f(h(ξ)). That f(x) is a one-to-one mapping implies that g(ξ) = h(ξ).
Thus x = ξ is a solution of the former equation.
It is always safe to apply a one-to-one, (bijective), function to an equation, (provided it is defined for that domain).
For example, we can apply f(x) = x³ or $f(x) = e^x$, considered as mappings on R, to the equation x = 1. The
equations x³ = 1 and $e^x = e$ each have the unique solution x = 1 for x ∈ R.
In general, we must take care in applying functions to equations. If we apply a many-to-one function, we may
introduce spurious solutions. Applying f(x) = x² to the equation x = π/2 results in x² = π²/4, which has the two solutions,
x = {±π/2}. Applying f(x) = sin x results in sin x = 1, which has an infinite number of solutions, x = {π/2 + 2nπ | n ∈ Z}.
We do not generally apply a one-to-many, (multi-valued), function to both sides of an equation as this rarely is useful.
Rather, we typically use the definition of the inverse function. Consider the equation
sin² x = 1.
Applying the function $f(x) = x^{1/2}$ to the equation would not get us anywhere.
\[ \left(\sin^2 x\right)^{1/2} = 1^{1/2} \]
Since $(\sin^2 x)^{1/2} \neq \sin x$, we cannot simplify the left side of the equation. Instead we could use the definition of
$f(x) = x^{1/2}$ as the inverse of the x² function to obtain
\[ \sin x = 1^{1/2} = \pm 1. \]
Now note that we should not just apply arcsin to both sides of the equation as arcsin(sin x) ≠ x. Instead we use the
definition of arcsin as the inverse of sin.
x = arcsin(±1)
x = arcsin(1) has the solutions x = π/2 + 2nπ and x = arcsin(−1) has the solutions x = −π/2 + 2nπ. We enumerate
the solutions.
\[ x = \left\{ \frac{\pi}{2} + n\pi \;\middle|\; n \in Z \right\} \]
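The claims in this section can be checked numerically. The sketch below, which uses only the standard `math` module, verifies a few members of the solution set of sin² x = 1 and shows how squaring the equation x = π/2 admits the spurious solution −π/2. The finite range of n and the tolerance are arbitrary choices for illustration.

```python
import math

# A few members of the solution set {π/2 + nπ | n ∈ Z} of sin² x = 1.
candidates = [math.pi / 2 + n * math.pi for n in range(-3, 4)]
all_solve = all(abs(math.sin(x) ** 2 - 1) < 1e-12 for x in candidates)

# Applying the many-to-one function f(x) = x² to x = π/2 gives x² = π²/4.
x0 = math.pi / 2
squared_solutions = [x for x in (x0, -x0)
                     if math.isclose(x ** 2, math.pi ** 2 / 4)]

# Solutions of the squared equation that do not solve the original one.
spurious = [x for x in squared_solutions if x != x0]

print(all_solve, spurious)
```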
1.5 Exercises
Exercise 1.1
The area of a circle is directly proportional to the square of its diameter. What is the constant of proportionality?
Hint, Solution
Exercise 1.2
Consider the equation
\[ \frac{x + 1}{y - 2} = \frac{x^2 - 1}{y^2 - 4}. \]
1. Why might one think that this is the equation of a line?
2. Graph the solutions of the equation to demonstrate that it is not the equation of a line.
Hint, Solution
Exercise 1.3
Consider the function of a real variable,
\[ f(x) = \frac{1}{x^2 + 2}. \]
What is the domain and range of the function?
Hint, Solution
Exercise 1.4
The temperature measured in degrees Celsius³ is linearly related to the temperature measured in degrees Fahrenheit⁴.
Water freezes at 0°C = 32°F and boils at 100°C = 212°F. Write the temperature in degrees Celsius as a function
of degrees Fahrenheit.
Hint, Solution
³ Originally, it was called degrees Centigrade, centi because there are 100 degrees between the two calibration points. It is now
called degrees Celsius in honor of the inventor.
⁴ The Fahrenheit scale, named for Daniel Fahrenheit, was originally calibrated with the freezing point of salt-saturated water to
be 0°. Later, the calibration points became the freezing point of water, 32°, and body temperature, 96°. With this method, there are
64 divisions between the calibration points. Finally, the upper calibration point was changed to the boiling point of water at 212°.
This gave 180 divisions, (the number of degrees in a half circle), between the two calibration points.
Exercise 1.5
Consider the function graphed in Figure 1.9. Sketch graphs of f(−x), f(x + 3), f(3 − x) + 2, and f⁻¹(x). You may
use the blank grids in Figure 1.10.
Figure 1.9: Graph of the function.
Hint, Solution
Exercise 1.6
A culture of bacteria grows at the rate of 10% per minute. At 6:00 pm there are 1 billion bacteria. How many bacteria
are there at 7:00 pm? How many were there at 3:00 pm?
Hint, Solution
Exercise 1.7
The graph in Figure 1.11 shows an even function f(x) = p(x)/q(x) where p(x) and q(x) are rational quadratic
polynomials. Give possible formulas for p(x) and q(x).
Hint, Solution
Figure 1.10: Blank grids.
Exercise 1.8
Find a polynomial of degree 100 which is zero only at x = −2, 1, π and is non-negative.
Hint, Solution
Figure 1.11: Plots of f(x) = p(x)/q(x).
1.6 Hints
Hint 1.1
area = constant × diameter².
Hint 1.2
A pair (x, y) is a solution of the equation if it makes the equation an identity.
Hint 1.3
The domain is the subset of R on which the function is defined.
Hint 1.4
Find the slope and x-intercept of the line.
Hint 1.5
The inverse of the function is the reflection of the function across the line y = x.
Hint 1.6
The formula for geometric growth/decay is $x(t) = x_0 r^t$, where r is the rate.
Hint 1.7
Since p(x) and q(x) appear as a ratio, they are determined only up to a multiplicative constant. We may take the
leading coefficient of q(x) to be unity.
\[ f(x) = \frac{p(x)}{q(x)} = \frac{a x^2 + b x + c}{x^2 + \beta x + \chi} \]
Use the properties of the function to solve for the unknown parameters.
Hint 1.8
Write the polynomial in factored form.
1.7 Solutions
Solution 1.1
area = π × radius²
area = (π/4) × diameter²
The constant of proportionality is π/4.
Solution 1.2
1. If we multiply the equation by y² − 4 and divide by x + 1, we obtain the equation of a line.
y + 2 = x − 1
2. We factor the quadratics on the right side of the equation.
\[ \frac{x + 1}{y - 2} = \frac{(x + 1)(x - 1)}{(y - 2)(y + 2)}. \]
We note that one or both sides of the equation are undefined at y = ±2 because of division by zero. There are
no solutions for these two values of y and we assume from this point that y ≠ ±2. We multiply by (y − 2)(y + 2).
(x + 1)(y + 2) = (x + 1)(x − 1)
For x = −1, the equation becomes the identity 0 = 0. Now we consider x ≠ −1. We divide by x + 1 to obtain
the equation of a line.
y + 2 = x − 1
y = x − 3
Now we collect the solutions we have found.
{(−1, y) : y ≠ ±2} ∪ {(x, x − 3) : x ≠ 1, 5}
The solutions are depicted in Figure 1.12.
Figure 1.12: The solutions of $\frac{x+1}{y-2} = \frac{x^2-1}{y^2-4}$.
Solution 1.3
The denominator is nonzero for all x ∈ R. Since we don’t have any division by zero problems, the domain of the
function is R. For x ∈ R,
\[ 0 < \frac{1}{x^2 + 2} \le \frac{1}{2}. \]
Consider
\[ y = \frac{1}{x^2 + 2}. \tag{1.1} \]
For any y ∈ (0 . . . 1/2], there is at least one value of x that satisfies Equation 1.1.
\[ x^2 + 2 = \frac{1}{y} \]
\[ x = \pm\sqrt{\frac{1}{y} - 2} \]
Thus the range of the function is (0 . . . 1/2].
Solution 1.4
Let c denote degrees Celsius and f denote degrees Fahrenheit. The line passes through the points (f, c) = (32, 0) and
(f, c) = (212, 100). The x-intercept is f = 32. We calculate the slope of the line.
\[ \text{slope} = \frac{100 - 0}{212 - 32} = \frac{100}{180} = \frac{5}{9} \]
The relationship between Fahrenheit and Celsius is
\[ c = \frac{5}{9}(f - 32). \]
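The linear relation can be checked against the two calibration points. This is a small sketch; the function name `celsius` is an assumption for illustration, and `fractions.Fraction` is used so that the slope 5/9 is represented exactly.

```python
from fractions import Fraction

def celsius(f):
    """Degrees Celsius as a function of degrees Fahrenheit: c = (5/9)(f - 32)."""
    return Fraction(5, 9) * (f - 32)

print(celsius(32), celsius(212))   # the freezing and boiling points of water
```

As a further sanity check, the two scales agree at −40 degrees, since (5/9)(−40 − 32) = −40.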
Solution 1.5
We plot the various transformations of f(x).
Solution 1.6
The formula for geometric growth/decay is $x(t) = x_0 r^t$, where r is the rate. Let t = 0 coincide with 6:00 pm. We
determine $x_0$.
\[ x(0) = 10^9 = x_0 \left(\frac{11}{10}\right)^0 = x_0 \]
\[ x_0 = 10^9 \]
At 7:00 pm the number of bacteria is
\[ 10^9 \left(\frac{11}{10}\right)^{60} = \frac{11^{60}}{10^{51}} \approx 3.04 \times 10^{11} \]
At 3:00 pm the number of bacteria was
\[ 10^9 \left(\frac{11}{10}\right)^{-180} = \frac{10^{189}}{11^{180}} \approx 35.4 \]
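The two values can be reproduced with exact rational arithmetic. The sketch below assumes, as in the solution, that t is measured in minutes from 6:00 pm and that r = 11/10; the function name `population` is chosen for illustration.

```python
from fractions import Fraction

r = Fraction(11, 10)   # growth factor per minute (10% growth)
x0 = 10 ** 9           # population at t = 0, i.e. 6:00 pm

def population(t):
    """Exact population x(t) = x0 * r**t, with t in minutes after 6:00 pm."""
    return x0 * r ** t

at_7pm = population(60)     # 60 minutes later
at_3pm = population(-180)   # 180 minutes earlier

print(float(at_7pm), float(at_3pm))
```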
Figure 1.13: Graphs of f(−x), f(x + 3), f(3 − x) + 2, and f⁻¹(x).
Solution 1.7
We write p(x) and q(x) as general quadratic polynomials.
\[ f(x) = \frac{p(x)}{q(x)} = \frac{a x^2 + b x + c}{\alpha x^2 + \beta x + \chi} \]
We will use the properties of the function to solve for the unknown parameters.
Since p(x) and q(x) appear as a ratio, they are determined only up to a multiplicative constant. We may take
the leading coefficient of q(x) to be unity.
\[ f(x) = \frac{p(x)}{q(x)} = \frac{a x^2 + b x + c}{x^2 + \beta x + \chi} \]
f(x) has a second order zero at x = 0. This means that p(x) has a second order zero there and that χ ≠ 0.
\[ f(x) = \frac{a x^2}{x^2 + \beta x + \chi} \]
We note that f(x) → 2 as x → ∞. This determines the parameter a.
\[ \lim_{x \to \infty} f(x) = \lim_{x \to \infty} \frac{a x^2}{x^2 + \beta x + \chi} = \lim_{x \to \infty} \frac{2 a x}{2 x + \beta} = \lim_{x \to \infty} \frac{2a}{2} = a \]
\[ f(x) = \frac{2 x^2}{x^2 + \beta x + \chi} \]
Now we use the fact that f(x) is even to conclude that q(x) is even and thus β = 0.
\[ f(x) = \frac{2 x^2}{x^2 + \chi} \]
Finally, we use that f(1) = 1 to determine χ.
\[ f(x) = \frac{2 x^2}{x^2 + 1} \]
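The resulting formula can be checked against the properties used in the derivation. This is only a numerical sketch; the sample points and the tolerance are arbitrary.

```python
def f(x):
    """The rational function found above: f(x) = 2x² / (x² + 1)."""
    return 2 * x ** 2 / (x ** 2 + 1)

checks = {
    "zero at origin": f(0) == 0,
    "even": f(-3) == f(3),
    "f(1) = 1": f(1) == 1,
    "tends to 2": abs(f(1e6) - 2) < 1e-9,
}
print(checks)
```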
Solution 1.8
Consider the polynomial
\[ p(x) = (x + 2)^{40} (x - 1)^{30} (x - \pi)^{30}. \]
It is of degree 100. Since the factors only vanish at x = −2, 1, π, p(x) only vanishes there. Since the factors are
non-negative, the polynomial is non-negative.
Chapter 2
Vectors
2.1 Vectors
2.1.1 Scalars and Vectors
A vector is a quantity having both a magnitude and a direction. Examples of vector quantities are velocity, force
and position. One can represent a vector in n-dimensional space with an arrow whose initial point is at the origin,
(Figure 2.1). The magnitude is the length of the vector. Typographically, variables representing vectors are often
written in capital letters, bold face or with a vector over-line, $A$, $\mathbf{a}$, $\vec{a}$. The magnitude of a vector is denoted |a|.
A scalar has only a magnitude. Examples of scalar quantities are mass, time and speed.
Vector Algebra. Two vectors are equal if they have the same magnitude and direction. The negative of a vector,
denoted −a, is a vector of the same magnitude as a but in the opposite direction. We add two vectors a and b by
placing the tail of b at the head of a and defining a + b to be the vector with tail at the origin and head at the head
of b. (See Figure 2.2.)
The difference, a − b, is defined as the sum of a and the negative of b, a + (−b). The result of multiplying a by
a scalar α is a vector of magnitude |α| |a| with the same/opposite direction if α is positive/negative. (See Figure 2.2.)
Figure 2.1: Graphical representation of a vector in three dimensions.
Figure 2.2: Vector arithmetic.
Here are the properties of adding vectors and multiplying them by a scalar. They are evident from geometric
considerations.
a + b = b + a                  αa = aα                  commutative laws
(a + b) + c = a + (b + c)      α(βa) = (αβ)a            associative laws
α(a + b) = αa + αb             (α + β)a = αa + βa       distributive laws
Zero and Unit Vectors. The additive identity element for vectors is the zero vector or null vector. This is a vector
of magnitude zero which is denoted as 0. A unit vector is a vector of magnitude one. If a is nonzero then a/|a| is a
unit vector in the direction of a. Unit vectors are often denoted with a caret over-line, $\hat{n}$.
Rectangular Unit Vectors. In n dimensional Cartesian space, Rⁿ, the unit vectors in the directions of the
coordinate axes are $e_1, \ldots, e_n$. These are called the rectangular unit vectors. To cut down on subscripts, the unit
vectors in three dimensional space are often denoted with i, j and k. (Figure 2.3).
Figure 2.3: Rectangular unit vectors.
Components of a Vector. Consider a vector a with tail at the origin and head having the Cartesian coordinates
$(a_1, \ldots, a_n)$. We can represent this vector as the sum of n rectangular component vectors, $a = a_1 e_1 + \cdots + a_n e_n$.
(See Figure 2.4.) Another notation for the vector a is $\langle a_1, \ldots, a_n \rangle$. By the Pythagorean theorem, the magnitude of
the vector a is $|a| = \sqrt{a_1^2 + \cdots + a_n^2}$.
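As a minimal sketch of the component formulas, a vector can be stored as a tuple of its rectangular components. The function names `magnitude` and `unit` are assumptions for this illustration.

```python
import math

def magnitude(a):
    """|a| = sqrt(a_1² + ... + a_n²) for a vector given by its components."""
    return math.sqrt(sum(c * c for c in a))

def unit(a):
    """The unit vector a/|a| in the direction of a nonzero vector a."""
    m = magnitude(a)
    return tuple(c / m for c in a)

print(magnitude((3, 4)))    # a 3-4-5 right triangle
print(unit((1, 2, 2)))      # the vector (1, 2, 2) has magnitude 3
```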
Figure 2.4: Components of a vector.
2.1.2 The Kronecker Delta and Einstein Summation Convention
The Kronecker Delta tensor is defined
\[ \delta_{ij} = \begin{cases} 1 & \text{if } i = j, \\ 0 & \text{if } i \neq j. \end{cases} \]
This notation will be useful in our work with vectors.
Consider writing a vector in terms of its rectangular components. Instead of using ellipses: $a = a_1 e_1 + \cdots + a_n e_n$, we
could write the expression as a sum: $a = \sum_{i=1}^{n} a_i e_i$. We can shorten this notation by leaving out the sum: $a = a_i e_i$,
where it is understood that whenever an index is repeated in a term we sum over that index from 1 to n. This is the
Einstein summation convention. A repeated index is called a summation index or a dummy index. Other indices can
take any value from 1 to n and are called free indices.
Example 2.1.1 Consider the matrix equation: A · x = b. We can write out the matrix and vectors explicitly.
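\[ \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & \ddots & \vdots \\ a_{n1} & \cdots & a_{nn} \end{pmatrix} \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} = \begin{pmatrix} b_1 \\ \vdots \\ b_n \end{pmatrix} \]
This takes much less space when we use the summation convention.
\[ a_{ij} x_j = b_i \]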
Here j is a summation index and i is a free index.
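In code, a summation index becomes an explicit sum and a free index becomes a loop over output positions. The sketch below writes out $a_{ij} x_j = b_i$ in plain Python; the names `matvec` and `kronecker_delta` and the 2 × 2 example are assumptions for illustration.

```python
def kronecker_delta(i, j):
    """δ_ij: 1 if i = j, 0 otherwise."""
    return 1 if i == j else 0

def matvec(A, x):
    """b_i = a_ij x_j: sum over the repeated index j, one entry per free index i."""
    n = len(x)
    return [sum(A[i][j] * x[j] for j in range(n)) for i in range(len(A))]

A = [[2, 0],
     [1, 3]]
x = [1, 2]
b = matvec(A, x)

# δ_ij x_j = x_i: the Kronecker delta acts as the identity matrix.
identity = [[kronecker_delta(i, j) for j in range(2)] for i in range(2)]
print(b, matvec(identity, x))
```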
2.1.3 The Dot and Cross Product
Dot Product. The dot product or scalar product of two vectors is defined,
a · b ≡ |a||b| cos θ,
where θ is the angle from a to b. From this definition one can derive the following properties:
• a · b = b · a, commutative.
• α(a · b) = (αa) · b = a · (αb), associativity of scalar multiplication.
• a · (b + c) = a · b + a · c, distributive. (See Exercise 2.1.)
• $e_i \cdot e_j = \delta_{ij}$. In three dimensions, this is
i · i = j · j = k · k = 1, i · j = j · k = k · i = 0.
• $a \cdot b = a_i b_i \equiv a_1 b_1 + \cdots + a_n b_n$, dot product in terms of rectangular components.
• If a · b = 0 then either a and b are orthogonal, (perpendicular), or one of a and b is zero.
The Angle Between Two Vectors. We can use the dot product to find the angle between two vectors, a and
b. From the definition of the dot product,
a · b = |a||b| cos θ.
If the vectors are nonzero, then
\[ \theta = \arccos\left( \frac{a \cdot b}{|a||b|} \right). \]
Example 2.1.2 What is the angle between i and i + j?
\[ \theta = \arccos\left( \frac{i \cdot (i + j)}{|i||i + j|} \right) = \arccos\left( \frac{1}{\sqrt{2}} \right) = \frac{\pi}{4}. \]
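Example 2.1.2 can be reproduced numerically using the component form of the dot product. The helper names `dot`, `norm` and `angle` are assumptions for this sketch, and the clamping step is a guard against rounding error, not part of the formula.

```python
import math

def dot(a, b):
    """a · b = a_1 b_1 + ... + a_n b_n."""
    return sum(x * y for x, y in zip(a, b))

def norm(a):
    return math.sqrt(dot(a, a))

def angle(a, b):
    """θ = arccos(a · b / (|a||b|)) for nonzero vectors a and b."""
    cosine = dot(a, b) / (norm(a) * norm(b))
    # Clamp to [-1, 1] in case rounding pushes the ratio slightly outside.
    return math.acos(max(-1.0, min(1.0, cosine)))

i = (1, 0)
i_plus_j = (1, 1)
theta = angle(i, i_plus_j)
print(theta, math.pi / 4)
```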
Parametric Equation of a Line. Consider a line in Rⁿ that passes through the point a and is parallel to the
vector t, (tangent). A parametric equation of the line is
x = a + ut, u ∈ R.
Implicit Equation of a Line In 2D. Consider a line in R² that passes through the point a and is normal,
(orthogonal, perpendicular), to the vector n. All the lines that are normal to n have the property that x · n is a
constant, where x is any point on the line. (See Figure 2.5.) x · n = 0 is the line that is normal to n and passes
through the origin. The line that is normal to n and passes through the point a is
x · n = a · n.
The normal to a line determines an orientation of the line. The normal points in the direction that is above the
line. A point b is (above/on/below) the line if (b − a) · n is (positive/zero/negative). The signed distance of a point
b from the line x · n = a · n is
\[ (b - a) \cdot \frac{n}{|n|}. \]
Figure 2.5: Equation for a line.
Implicit Equation of a Hyperplane. A hyperplane in Rⁿ is an n − 1 dimensional “sheet” which passes through
a given point and is normal to a given direction. In R³ we call this a plane. Consider a hyperplane that passes through
the point a and is normal to the vector n. All the hyperplanes that are normal to n have the property that x · n is a
constant, where x is any point in the hyperplane. x · n = 0 is the hyperplane that is normal to n and passes through
the origin. The hyperplane that is normal to n and passes through the point a is
x · n = a · n.
The normal determines an orientation of the hyperplane. The normal points in the direction that is above the
hyperplane. A point b is (above/on/below) the hyperplane if (b − a) · n is (positive/zero/negative). The signed
distance of a point b from the hyperplane x · n = a · n is
\[ (b - a) \cdot \frac{n}{|n|}. \]
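The signed distance formula translates directly into code. In this sketch the function name `signed_distance` and the example plane z = 1 in R³ (through a = (0, 0, 1) with normal n = (0, 0, 2)) are assumptions chosen for illustration.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def signed_distance(b, a, n):
    """(b - a) · n/|n|: positive above, zero on, negative below the hyperplane."""
    difference = tuple(bi - ai for bi, ai in zip(b, a))
    return dot(difference, n) / math.sqrt(dot(n, n))

a = (0, 0, 1)   # a point in the plane z = 1
n = (0, 0, 2)   # a normal; it need not have unit length

print(signed_distance((5, 5, 4), a, n))    # above the plane
print(signed_distance((7, -2, 1), a, n))   # on the plane
print(signed_distance((0, 0, -1), a, n))   # below the plane
```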
Right and Left-Handed Coordinate Systems. Consider a rectangular coordinate system in two dimensions.
Angles are measured from the positive x axis in the direction of the positive y axis. There are two ways of labeling the
axes. (See Figure 2.6.) In one the angle increases in the counterclockwise direction and in the other the angle increases
in the clockwise direction. The former is the familiar Cartesian coordinate system.
Figure 2.6: There are two ways of labeling the axes in two dimensions.
There are also two ways of labeling the axes in a three-dimensional rectangular coordinate system. These are called
right-handed and left-handed coordinate systems. See Figure 2.7. Any other labelling of the axes could be rotated into
one of these configurations. The right-handed system is the one that is used by default. If you put your right thumb in
the direction of the z axis in a right-handed coordinate system, then your fingers curl in the direction from the x axis
to the y axis.
Cross Product. The cross product or vector product is defined,
a × b = |a||b| sin θ n,
where θ is the angle from a to b and n is a unit vector that is orthogonal to a and b and in the direction such that
the ordered triple of vectors a, b and n form a right-handed system.
Figure 2.7: Right and left handed coordinate systems.
You can visualize the direction of a × b by applying the right hand rule. Curl the fingers of your right hand in the
direction from a to b. Your thumb points in the direction of a × b. Warning: Unless you are a lefty, get in the habit
of putting down your pencil before applying the right hand rule.
The dot and cross products behave a little differently. First note that unlike the dot product, the cross product is not
commutative. The magnitudes of a × b and b × a are the same, but their directions are opposite. (See Figure 2.8.)
Let
a × b = |a||b| sin θ n and b × a = |b||a| sin φ m.
The angle from a to b is the same as the angle from b to a. Since {a, b, n} and {b, a, m} are right-handed systems,
m points in the opposite direction as n. Since a × b = −b × a we say that the cross product is anti-commutative.
Next we note that since
|a × b| = |a||b| sin θ,
the magnitude of a × b is the area of the parallelogram defined by the two vectors. (See Figure 2.9.) The area of the
triangle defined by two vectors is then $\frac{1}{2}|a \times b|$.
Figure 2.8: The cross product is anti-commutative.
Figure 2.9: The parallelogram and the triangle defined by two vectors.
From the definition of the cross product, one can derive the following properties:
• a × b = −b × a, anti-commutative.
• α(a × b) = (αa) × b = a × (αb), associativity of scalar multiplication.
• a × (b + c) = a × b + a × c, distributive.
• (a × b) × c ≠ a × (b × c). The cross product is not associative.
• i × i = j × j = k × k = 0.
• i × j = k, j × k = i, k × i = j.
• $a \times b = (a_2 b_3 - a_3 b_2) i + (a_3 b_1 - a_1 b_3) j + (a_1 b_2 - a_2 b_1) k = \begin{vmatrix} i & j & k \\ a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \end{vmatrix}$,
cross product in terms of rectangular components.
• If a × b = 0 then either a and b are parallel or one of a or b is zero.
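Several of the properties above can be checked with the component formula for the cross product. This is a sketch for three-dimensional vectors stored as tuples; the function name `cross` and the sample vectors are assumptions for illustration.

```python
def cross(a, b):
    """a × b in rectangular components, for three-dimensional vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

i, j, k = (1, 0, 0), (0, 1, 0), (0, 0, 1)
a, b = (1, 2, 3), (4, 5, 6)

# Anti-commutative: a × b = −(b × a).
anti_commutative = cross(a, b) == tuple(-c for c in cross(b, a))

# Not associative: (i × i) × j is zero, but i × (i × j) = i × k = −j.
left = cross(cross(i, i), j)
right = cross(i, cross(i, j))

print(cross(i, j), anti_commutative, left, right)
```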
Scalar Triple Product. Consider the volume of the parallelopiped defined by three vectors. (See Figure 2.10.)
The area of the base is ||b||c| sin θ|, where θ is the angle between b and c. The height is |a| cos φ, where φ is the angle
between b × c and a. Thus the volume of the parallelopiped is |a||b||c| sin θ cos φ.
Figure 2.10: The parallelopiped defined by three vectors.
Note that
\[ |a \cdot (b \times c)| = \left| a \cdot (|b||c| \sin\theta \, n) \right| = \big| |a||b||c| \sin\theta \cos\phi \big| . \]
Thus |a · (b × c)| is the volume of the parallelopiped. a · (b × c) is the volume or the negative of the volume depending
on whether {a, b, c} is a right or left-handed system.
Note that parentheses are unnecessary in a · b × c. There is only one way to interpret the expression. If you did the
dot product first then you would be left with the cross product of a scalar and a vector which is meaningless. a · b × c
is called the scalar triple product.
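The sign behavior of the scalar triple product can be seen in a small example. The following Python sketch uses hypothetical helpers `dot`, `cross` and `triple`, with three mutually orthogonal vectors picked for illustration: swapping two of the vectors changes the handedness and flips the sign, while the magnitude (the volume) stays the same.

```python
def dot(a, b):
    """Dot product of two vectors of equal length."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def triple(a, b, c):
    """Scalar triple product a . (b x c)."""
    return dot(a, cross(b, c))

# A box with edge lengths 1, 2 and 3 along the coordinate axes.
a, b, c = (1, 0, 0), (0, 2, 0), (0, 0, 3)

print(triple(a, b, c))  # 6, since {a, b, c} is right-handed
print(triple(b, a, c))  # -6, since {b, a, c} is left-handed
```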
Plane Defined by Three Points. Three points which are not collinear define a plane. Consider a plane that
passes through the three points a, b and c. One way of expressing that the point x lies in the plane is that the vectors
x − a, b − a and c − a are coplanar. (See Figure 2.11.) If the vectors are coplanar, then the parallelopiped defined by
these three vectors will have zero volume. We can express this in an equation using the scalar triple product,
(x − a) · (b − a) × (c − a) = 0.
Figure 2.11: Three points define a plane.
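The coplanarity condition (x − a) · (b − a) × (c − a) = 0 translates directly into a membership test. The sketch below is one possible way to write it in Python. The function `in_plane`, its tolerance `tol`, and the three sample points are assumptions made for illustration.

```python
def sub(u, v):
    """Componentwise difference u - v."""
    return tuple(x - y for x, y in zip(u, v))

def dot(u, v):
    """Dot product of two vectors of equal length."""
    return sum(x * y for x, y in zip(u, v))

def cross(u, v):
    """Cross product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def in_plane(x, a, b, c, tol=1e-12):
    """True if x lies in the plane through the non-collinear points a, b, c."""
    volume = dot(sub(x, a), cross(sub(b, a), sub(c, a)))
    return abs(volume) <= tol

# Sample points on the coordinate axes; they span the plane x + y + z = 1.
a, b, c = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)

print(in_plane((0.25, 0.25, 0.5), a, b, c))  # True
print(in_plane((1.0, 1.0, 1.0), a, b, c))    # False
```

A tolerance is used instead of an exact comparison with zero because floating-point round-off can leave a tiny nonzero volume for points that lie in the plane.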
2.2 Sets of Vectors in n Dimensions
Orthogonality. Consider two n-dimensional vectors
x = (x1, x2, . . . , xn), y = (y1, y2, . . . , yn).
The inner product of these vectors can be defined
\[
\langle \mathbf{x} | \mathbf{y} \rangle \equiv \mathbf{x} \cdot \mathbf{y} = \sum_{i=1}^{n} x_i y_i.
\]
The vectors are orthogonal if x · y = 0. The norm of a vector is the length of the vector generalized to n dimensions.
\[
\|\mathbf{x}\| = \sqrt{\mathbf{x} \cdot \mathbf{x}}
\]
Consider a set of vectors
{x1, x2, . . . , xm}.
If each pair of vectors in the set is orthogonal, then the set is orthogonal.
xi · xj = 0 if i ≠ j
If in addition each vector in the set has norm 1, then the set is orthonormal.
\[
\mathbf{x}_i \cdot \mathbf{x}_j = \delta_{ij} =
\begin{cases}
1 & \text{if } i = j \\
0 & \text{if } i \neq j
\end{cases}
\]
Here δij is known as the Kronecker delta function.
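The orthonormality condition can be tested by comparing every pairwise inner product against the Kronecker delta. The Python sketch below shows one way to do this. The function `is_orthonormal`, its tolerance `tol`, and the sample sets are hypothetical and chosen only for illustration.

```python
import math

def dot(x, y):
    """Inner product of two n-dimensional vectors."""
    return sum(xi * yi for xi, yi in zip(x, y))

def is_orthonormal(vectors, tol=1e-12):
    """Check that x_i . x_j equals the Kronecker delta for every pair."""
    for i, xi in enumerate(vectors):
        for j, xj in enumerate(vectors):
            delta = 1.0 if i == j else 0.0
            if abs(dot(xi, xj) - delta) > tol:
                return False
    return True

s = 1.0 / math.sqrt(2.0)
rotated = [(s, s, 0.0), (s, -s, 0.0), (0.0, 0.0, 1.0)]
unnormalized = [(1.0, 1.0, 0.0), (1.0, -1.0, 0.0)]

print(is_orthonormal(rotated))       # True
print(is_orthonormal(unnormalized))  # False: orthogonal, but the norms are not 1
```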
Completeness. A set of n, n-dimensional vectors
{x1, x2, . . . , xn}
is complete if any n-dimensional vector can be written as a linear combination of the vectors in the set. That is, any
vector y can be written
\[
\mathbf{y} = \sum_{i=1}^{n} c_i \mathbf{x}_i.
\]
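Taking the inner product of each side of this equation with xm, and assuming the set is orthogonal so that only the
i = m term survives,
\begin{align*}
\mathbf{y} \cdot \mathbf{x}_m &= \left( \sum_{i=1}^{n} c_i \mathbf{x}_i \right) \cdot \mathbf{x}_m \\
&= \sum_{i=1}^{n} c_i \, \mathbf{x}_i \cdot \mathbf{x}_m \\
&= c_m \, \mathbf{x}_m \cdot \mathbf{x}_m \\
c_m &= \frac{\mathbf{y} \cdot \mathbf{x}_m}{\|\mathbf{x}_m\|^2}
\end{align*}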
Thus y has the expansion
\[
\mathbf{y} = \sum_{i=1}^{n} \frac{\mathbf{y} \cdot \mathbf{x}_i}{\|\mathbf{x}_i\|^2} \, \mathbf{x}_i.
\]
If in addition the set is orthonormal, then
\[
\mathbf{y} = \sum_{i=1}^{n} (\mathbf{y} \cdot \mathbf{x}_i) \, \mathbf{x}_i.
\]
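To make the expansion concrete, the following Python sketch computes the coefficients c_i = (y · x_i)/‖x_i‖² for an orthogonal (but not normalized) set and rebuilds y from them. The function name `expand` and the sample basis are assumptions for illustration, and the code relies on the set being orthogonal, as in the derivation.

```python
def dot(x, y):
    """Inner product of two n-dimensional vectors."""
    return sum(xi * yi for xi, yi in zip(x, y))

def expand(y, basis):
    """Coefficients c_i = (y . x_i) / (x_i . x_i) for an orthogonal basis."""
    return [dot(y, x) / dot(x, x) for x in basis]

# An orthogonal, non-normalized basis of three-dimensional space.
basis = [(1.0, 1.0, 0.0), (1.0, -1.0, 0.0), (0.0, 0.0, 2.0)]
y = (3.0, 1.0, 4.0)

coeffs = expand(y, basis)
print(coeffs)  # [2.0, 1.0, 2.0]

# Rebuild y as the linear combination sum_i c_i x_i.
rebuilt = tuple(sum(c * x[k] for c, x in zip(coeffs, basis))
                for k in range(len(y)))
print(rebuilt)  # (3.0, 1.0, 4.0)
```

For an orthonormal set the division by x_i · x_i is a division by 1, which recovers the simpler formula c_i = y · x_i.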
2.3 Exercises
The Dot and Cross Product
Exercise 2.1
Prove the distributive law for the dot product,
a · (b + c) = a · b + a · c.
Hint, Solution
Exercise 2.2
Prove that
\[
\mathbf{a} \cdot \mathbf{b} = a_i b_i \equiv a_1 b_1 + \cdots + a_n b_n.
\]
Hint, Solution
Exercise 2.3
What is the angle between the vectors i + j and i + 3j?
Hint, Solution
Exercise 2.4
Prove the distributive law for the cross product,
a × (b + c) = a × b + a × c.
Hint, Solution
Exercise 2.5
Show that
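\[
\mathbf{a} \times \mathbf{b} =
\begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \end{vmatrix}
\]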
Hint, Solution
Exercise 2.6
What is the area of the quadrilateral with vertices at (1, 1), (4, 2), (3, 7) and (2, 3)?
Hint, Solution
Exercise 2.7
What is the volume of the tetrahedron with vertices at (1, 1, 0), (3, 2, 1), (2, 4, 1) and (1, 2, 5)?
Hint, Solution
Exercise 2.8
What is the equation of the plane that passes through the points (1, 2, 3), (2, 3, 1) and (3, 1, 2)? What is the distance
from the point (2, 3, 5) to the plane?
Hint, Solution
2.4 Hints
The Dot and Cross Product
Hint 2.1
First prove the distributive law when the first vector is of unit length,
n · (b + c) = n · b + n · c.
Then all the quantities in the equation are projections onto the unit vector n and you can use geometry.
Hint 2.2
First prove that the dot product of a rectangular unit vector with itself is one and the dot product of two distinct
rectangular unit vectors is zero. Then write a and b in rectangular components and use the distributive law.
Hint 2.3
Use a · b = |a||b| cos θ.
Hint 2.4
First consider the case that both b and c are orthogonal to a. Prove the distributive law in this case from geometric
considerations.
Next consider two arbitrary vectors a and b. We can write b = b⊥ + b∥ where b⊥ is orthogonal to a and b∥ is
parallel to a. Show that
a × b = a × b⊥.
Finally prove the distributive law for arbitrary b and c.
Hint 2.5
Write the vectors in their rectangular components and use,
i × j = k, j × k = i, k × i = j,
and,
i × i = j × j = k × k = 0.
Hint 2.6
The quadrilateral is composed of two triangles. The area of a triangle defined by the two vectors a and b is
(1/2)|a × b|.
Hint 2.7
Justify that the volume of a tetrahedron determined by three vectors is one sixth the volume of the parallelopiped
determined by those three vectors. The volume of a parallelopiped determined by three vectors is the magnitude of the
scalar triple product of the vectors: a · b × c.
Hint 2.8
The equation of a plane that is orthogonal to a and passes through the point b is a · x = a · b. The distance of a point
c from the plane is
\[
\left| (\mathbf{c} - \mathbf{b}) \cdot \frac{\mathbf{a}}{|\mathbf{a}|} \right|
\]
2.5 Solutions
The Dot and Cross Product
Solution 2.1
First we prove the distributive law when the first vector is of unit length, i.e.,
n · (b + c) = n · b + n · c. (2.1)
From Figure 2.12 we see that the projection of the vector b + c onto n is equal to the sum of the projections b · n and
c · n.
Now we extend the result to the case when the first vector has arbitrary length. We define a = |a|n and multiply
Equation 2.1 by the scalar, |a|.
|a|n · (b + c) = |a|n · b + |a|n · c
a · (b + c) = a · b + a · c.
Solution 2.2
First note that
ei · ei = |ei||ei| cos(0) = 1.
Then note that the dot product of any two distinct rectangular unit vectors is zero because they are orthogonal. Now
we write a and b in terms of their rectangular components and use the distributive law.
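\begin{align*}
\mathbf{a} \cdot \mathbf{b} &= a_i \mathbf{e}_i \cdot b_j \mathbf{e}_j \\
&= a_i b_j \, \mathbf{e}_i \cdot \mathbf{e}_j \\
&= a_i b_j \delta_{ij} \\
&= a_i b_i
\end{align*}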
Solution 2.3
Since a · b = |a||b| cos θ, we have
θ = arccos( (a · b) / (|a||b|) )