Sets
A set is a collection of distinct elements, written by listing its members inside curly braces or by
specifying a property that its members satisfy.
Notation
∅ \emptyset ∅ — the empty set (contains no elements)
x ∈ X x \in X x ∈ X — the element x x x belongs to the set X X X
x ∉ X x \notin X x ∈ / X — the element x x x does not belong to X X X
A ⊆ B A \subseteq B A ⊆ B — A A A is a subset of B B B : every element of A A A is also an element of B B B
∣ A ∣ |A| ∣ A ∣ — the cardinality (number of elements) of a finite set A A A
′ { ′ P ′ } ′ ( A ) \mathcal{'\{'}P{'\}'}(A) ′ { ′ P ′ } ′ ( A ) — the power set of A A A : the set of all subsets of A A A
Set-Builder Notation
A set can be defined by a property:
{ x ∈ ′ { ′ R ′ } ′ ∣ x 2 − 4 = 0 } = { − 2 , 2 } \{x \in \mathbb{'\{'}R{'\}'} \mid x^2 - 4 = 0\} = \{-2, 2\} { x ∈ ′ { ′ R ′ } ′ ∣ x 2 − 4 = 0 } = { − 2 , 2 }
The vertical bar is read "such that."
Set Operations
Let A A A and B B B be subsets of a universal set U U U .
Union: A ∪ B = { x ∈ U ∣ x ∈ A o r x ∈ B } A \cup B = \{x \in U \mid x \in A \mathrm{ or } x \in B\} A ∪ B = { x ∈ U ∣ x ∈ A or x ∈ B }
Intersection: A ∩ B = { x ∈ U ∣ x ∈ A a n d x ∈ B } A \cap B = \{x \in U \mid x \in A \mathrm{ and } x \in B\} A ∩ B = { x ∈ U ∣ x ∈ A and x ∈ B }
Complement: A ′ = { x ∈ U ∣ x ∉ A } A' = \{x \in U \mid x \notin A\} A ′ = { x ∈ U ∣ x ∈ / A }
Set difference: A ∖ B = { x ∈ A ∣ x ∉ B } A \setminus B = \{x \in A \mid x \notin B\} A ∖ B = { x ∈ A ∣ x ∈ / B }
These operations are conveniently visualised with Venn diagrams. In a Venn diagram, the universal
set U U U is drawn as a rectangle, and subsets are drawn as overlapping circles. The union A ∪ B A \cup B A ∪ B
is the entire region covered by either circle; the intersection A ∩ B A \cap B A ∩ B is the overlapping
region; the complement A ′ A' A ′ is everything in the rectangle outside the circle for A A A .
De Morgan's Laws
Theorem. For any subsets A A A and B B B of a universal set U U U :
( A ∪ B ) ′ = A ′ ∩ B ′ (A \cup B)' = A' \cap B' ( A ∪ B ) ′ = A ′ ∩ B ′
( A ∩ B ) ′ = A ′ ∪ B ′ (A \cap B)' = A' \cup B' ( A ∩ B ) ′ = A ′ ∪ B ′
Proof of the first law. We show mutual inclusion.
(⊆ \subseteq ⊆ ) Suppose x ∈ ( A ∪ B ) ′ x \in (A \cup B)' x ∈ ( A ∪ B ) ′ . Then x ∉ A ∪ B x \notin A \cup B x ∈ / A ∪ B , so x ∉ A x \notin A x ∈ / A and
x ∉ B x \notin B x ∈ / B . Hence x ∈ A ′ x \in A' x ∈ A ′ and x ∈ B ′ x \in B' x ∈ B ′ , which means x ∈ A ′ ∩ B ′ x \in A' \cap B' x ∈ A ′ ∩ B ′ .
(⊇ \supseteq ⊇ ) Suppose x ∈ A ′ ∩ B ′ x \in A' \cap B' x ∈ A ′ ∩ B ′ . Then x ∈ A ′ x \in A' x ∈ A ′ and x ∈ B ′ x \in B' x ∈ B ′ , so x ∉ A x \notin A x ∈ / A and
x ∉ B x \notin B x ∈ / B . Therefore x ∉ A ∪ B x \notin A \cup B x ∈ / A ∪ B , giving x ∈ ( A ∪ B ) ′ x \in (A \cup B)' x ∈ ( A ∪ B ) ′ .
The second law follows by symmetry or by applying the first law to A ′ A' A ′ and B ′ B' B ′ . ■ \blacksquare ■
Power Sets
The power set ′ { ′ P ′ } ′ ( A ) \mathcal{'\{'}P{'\}'}(A) ′ { ′ P ′ } ′ ( A ) of a set A A A is the set of all subsets of A A A , including
∅ \emptyset ∅ and A A A itself.
If ∣ A ∣ = n |A| = n ∣ A ∣ = n , then ∣ ′ { ′ P ′ } ′ ( A ) ∣ = 2 n |\mathcal{'\{'}P{'\}'}(A)| = 2^n ∣ ′ { ′ P ′ } ′ ( A ) ∣ = 2 n .
Worked example: Power set Let A = { 1 , 2 , 3 } A = \{1, 2, 3\} A = { 1 , 2 , 3 } . Then
′ { ′ P ′ } ′ ( A ) = { ∅ , { 1 } , { 2 } , { 3 } , { 1 , 2 } , { 1 , 3 } , { 2 , 3 } , { 1 , 2 , 3 } } \mathcal{'\{'}P{'\}'}(A) = \big\{\emptyset,\, \{1\},\, \{2\},\, \{3\},\, \{1,2\},\, \{1,3\},\, \{2,3\},\, \{1,2,3\}\big\} ′ { ′ P ′ } ′ ( A ) = { ∅ , { 1 } , { 2 } , { 3 } , { 1 , 2 } , { 1 , 3 } , { 2 , 3 } , { 1 , 2 , 3 } } .
There are 2 3 = 8 2^3 = 8 2 3 = 8 subsets, as expected.
Cardinality of Finite Sets
For finite sets A A A and B B B :
∣ A ∪ B ∣ = ∣ A ∣ + ∣ B ∣ − ∣ A ∩ B ∣ |A \cup B| = |A| + |B| - |A \cap B| ∣ A ∪ B ∣ = ∣ A ∣ + ∣ B ∣ − ∣ A ∩ B ∣
This is the inclusion-exclusion principle for two sets. It subtracts the overlap that would
otherwise be double-counted.
For three sets A A A , B B B , C C C :
∣ A ∪ B ∪ C ∣ = ∣ A ∣ + ∣ B ∣ + ∣ C ∣ − ∣ A ∩ B ∣ − ∣ A ∩ C ∣ − ∣ B ∩ C ∣ + ∣ A ∩ B ∩ C ∣ |A \cup B \cup C| = |A| + |B| + |C| - |A \cap B| - |A \cap C| - |B \cap C| + |A \cap B \cap C| ∣ A ∪ B ∪ C ∣ = ∣ A ∣ + ∣ B ∣ + ∣ C ∣ − ∣ A ∩ B ∣ − ∣ A ∩ C ∣ − ∣ B ∩ C ∣ + ∣ A ∩ B ∩ C ∣
Worked example: Cardinality In a class of 40 students, 25 study Physics, 20 study Chemistry, and 10 study both. How many study
neither subject?
∣ P ∪ C ∣ = ∣ P ∣ + ∣ C ∣ − ∣ P ∩ C ∣ = 25 + 20 − 10 = 35 |P \cup C| = |P| + |C| - |P \cap C| = 25 + 20 - 10 = 35 ∣ P ∪ C ∣ = ∣ P ∣ + ∣ C ∣ − ∣ P ∩ C ∣ = 25 + 20 − 10 = 35 .
The number studying neither is 40 − 35 = 5 40 - 35 = 5 40 − 35 = 5 .
Functions
A function f f f is an assignment from a domain (X X X , the set of acceptable inputs) to a
codomain (Y Y Y , the set into which all outputs must fall), such that:
Every element in X X X is mapped to an element in Y Y Y : ∀ x ∈ X , f ( x ) ∈ Y \forall x \in X,\; f(x) \in Y ∀ x ∈ X , f ( x ) ∈ Y
No element in X X X is mapped to more than one element in Y Y Y : f ( x 1 ) = f ( x 2 ) ⟹ x 1 = x 2 f(x_1) = f(x_2) \implies x_1 = x_2 f ( x 1 ) = f ( x 2 ) ⟹ x 1 = x 2
Notation
A function f f f with domain X X X and codomain Y Y Y :
f : X → Y f: X \to Y f : X → Y
Non-examples of functions
f 1 : ′ { ′ R ′ } ′ + → ′ { ′ R ′ } ′ , f ( x ) = ± x f_1: \mathbb{'\{'}R{'\}'}^+ \to \mathbb{'\{'}R{'\}'},\; f(x) = \pm\sqrt{x} f 1 : ′ { ′ R ′ } ′ + → ′ { ′ R ′ } ′ , f ( x ) = ± x — Since x x x maps to two values, f 1 f_1 f 1 is
not a function.
f 2 : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ , f 2 ( x ) = 1 x f_2: \mathbb{'\{'}R{'\}'} \to \mathbb{'\{'}R{'\}'},\; f_2(x) = \frac{1}{x} f 2 : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ , f 2 ( x ) = x 1 — At x = 0 x = 0 x = 0 , f 2 ( 0 ) f_2(0) f 2 ( 0 ) is undefined, so
not every element of the domain is mapped. Redefine as
f 2 : ′ { ′ R ′ } ′ ∖ { 0 } → ′ { ′ R ′ } ′ f_2: \mathbb{'\{'}R{'\}'} \setminus \{0\} \to \mathbb{'\{'}R{'\}'} f 2 : ′ { ′ R ′ } ′ ∖ { 0 } → ′ { ′ R ′ } ′ .
f 3 : ∅ → Y f_3: \emptyset \to Y f 3 : ∅ → Y — Since no elements are in the domain, uniqueness is vacuously satisfied.
This is a valid (empty) function.
Range
The range (A A A ) of a function f : X → Y f: X \to Y f : X → Y is the set of all values actually produced:
A = { f ( x ) ∣ x ∈ X } , A ⊆ Y A = \{f(x) \mid x \in X\}, \quad A \subseteq Y A = { f ( x ) ∣ x ∈ X } , A ⊆ Y
Classes of Functions
Surjective (onto): Every element of the codomain is hit:
∀ y ∈ Y , ∃ x ∈ X , y = f ( x ) \forall y \in Y,\; \exists x \in X,\; y = f(x) ∀ y ∈ Y , ∃ x ∈ X , y = f ( x )
Injective (one-to-one): Distinct inputs give distinct outputs:
f ( x 1 ) = f ( x 2 ) ⟹ x 1 = x 2 f(x_1) = f(x_2) \implies x_1 = x_2 f ( x 1 ) = f ( x 2 ) ⟹ x 1 = x 2
Bijective: Both surjective and injective
Odd: f ( − x ) = − f ( x ) f(-x) = -f(x) f ( − x ) = − f ( x ) for all x ∈ ′ { ′ R ′ } ′ x \in \mathbb{'\{'}R{'\}'} x ∈ ′ { ′ R ′ } ′
Even: f ( − x ) = f ( x ) f(-x) = f(x) f ( − x ) = f ( x ) for all x ∈ ′ { ′ R ′ } ′ x \in \mathbb{'\{'}R{'\}'} x ∈ ′ { ′ R ′ } ′
Injectivity, Surjectivity, and Bijectivity — Worked Examples
Example: Determining injectivity and surjectivity Let f : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ , f ( x ) = x 2 f: \mathbb{'\{'}R{'\}'} \to \mathbb{'\{'}R{'\}'},\; f(x) = x^2 f : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ , f ( x ) = x 2 .
Injective? No. f ( 1 ) = f ( − 1 ) = 1 f(1) = f(-1) = 1 f ( 1 ) = f ( − 1 ) = 1 , so distinct inputs map to the same output. (This is also an
even function.)
Surjective? No. There is no x ∈ ′ { ′ R ′ } ′ x \in \mathbb{'\{'}R{'\}'} x ∈ ′ { ′ R ′ } ′ such that f ( x ) = − 1 f(x) = -1 f ( x ) = − 1 , so
− 1 ∉ r a n g e ( f ) -1 \notin \mathrm{range}(f) − 1 ∈ / range ( f ) .
The range is [ 0 , ∞ ) [0, \infty) [ 0 , ∞ ) , a proper subset of ′ { ′ R ′ } ′ \mathbb{'\{'}R{'\}'} ′ { ′ R ′ } ′ .
Example: Proving a function is bijective Let f : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ , f ( x ) = 2 x + 3 f: \mathbb{'\{'}R{'\}'} \to \mathbb{'\{'}R{'\}'},\; f(x) = 2x + 3 f : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ , f ( x ) = 2 x + 3 .
Injective: Suppose f ( x 1 ) = f ( x 2 ) f(x_1) = f(x_2) f ( x 1 ) = f ( x 2 ) . Then 2 x 1 + 3 = 2 x 2 + 3 2x_1 + 3 = 2x_2 + 3 2 x 1 + 3 = 2 x 2 + 3 , so x 1 = x 2 x_1 = x_2 x 1 = x 2 .
Surjective: Let y ∈ ′ { ′ R ′ } ′ y \in \mathbb{'\{'}R{'\}'} y ∈ ′ { ′ R ′ } ′ . We need 2 x + 3 = y 2x + 3 = y 2 x + 3 = y , i.e. x = y − 3 2 x = \frac{y - 3}{2} x = 2 y − 3 . Since
y − 3 2 ∈ ′ { ′ R ′ } ′ \frac{y-3}{2} \in \mathbb{'\{'}R{'\}'} 2 y − 3 ∈ ′ { ′ R ′ } ′ , every y y y is in the range.
Therefore f f f is bijective.
Inverse Functions
If f : X → Y f: X \to Y f : X → Y is bijective, the inverse function f − 1 : Y → X f^{-1}: Y \to X f − 1 : Y → X exists and satisfies:
f − 1 ( f ( x ) ) = x f o r a l l x ∈ X , f ( f − 1 ( y ) ) = y f o r a l l y ∈ Y f^{-1}(f(x)) = x \quad \mathrm{for all } x \in X, \qquad f(f^{-1}(y)) = y \quad \mathrm{for all } y \in Y f − 1 ( f ( x )) = x forall x ∈ X , f ( f − 1 ( y )) = y forall y ∈ Y
To find f − 1 f^{-1} f − 1 : write y = f ( x ) y = f(x) y = f ( x ) , solve for x x x in terms of y y y , then interchange x x x and y y y .
Existence condition: A function has an inverse (on its given domain and codomain) if and only if
it is bijective.
Example: Finding an inverse function Let f : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ , f ( x ) = 2 x + 1 x − 3 f: \mathbb{'\{'}R{'\}'} \to \mathbb{'\{'}R{'\}'},\; f(x) = \frac{2x + 1}{x - 3} f : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ , f ( x ) = x − 3 2 x + 1 , with domain
′ { ′ R ′ } ′ ∖ { 3 } \mathbb{'\{'}R{'\}'} \setminus \{3\} ′ { ′ R ′ } ′ ∖ { 3 } .
Set y = 2 x + 1 x − 3 y = \frac{2x+1}{x-3} y = x − 3 2 x + 1 . Then y ( x − 3 ) = 2 x + 1 y(x-3) = 2x+1 y ( x − 3 ) = 2 x + 1 , so y x − 3 y = 2 x + 1 yx - 3y = 2x + 1 y x − 3 y = 2 x + 1 , hence x ( y − 2 ) = 3 y + 1 x(y-2) = 3y+1 x ( y − 2 ) = 3 y + 1 ,
giving x = 3 y + 1 y − 2 x = \frac{3y+1}{y-2} x = y − 2 3 y + 1 .
Therefore f − 1 ( x ) = 3 x + 1 x − 2 f^{-1}(x) = \frac{3x+1}{x-2} f − 1 ( x ) = x − 2 3 x + 1 with domain ′ { ′ R ′ } ′ ∖ { 2 } \mathbb{'\{'}R{'\}'} \setminus \{2\} ′ { ′ R ′ } ′ ∖ { 2 } .
Self-Inverse Functions
A function f f f is self-inverse if f ( f ( x ) ) = x f(f(x)) = x f ( f ( x )) = x for all x x x in the domain, i.e. f − 1 = f f^{-1} = f f − 1 = f .
Common examples: f ( x ) = 1 x f(x) = \frac{1}{x} f ( x ) = x 1 on ′ { ′ R ′ } ′ ∖ { 0 } \mathbb{'\{'}R{'\}'} \setminus \{0\} ′ { ′ R ′ } ′ ∖ { 0 } ; f ( x ) = a − x f(x) = a - x f ( x ) = a − x on
′ { ′ R ′ } ′ \mathbb{'\{'}R{'\}'} ′ { ′ R ′ } ′ .
Domain Restriction to Achieve Injectivity
A non-injective function can be made injective by restricting its domain.
Example: Restricting domain f ( x ) = x 2 f(x) = x^2 f ( x ) = x 2 is not injective on ′ { ′ R ′ } ′ \mathbb{'\{'}R{'\}'} ′ { ′ R ′ } ′ since f ( 1 ) = f ( − 1 ) f(1) = f(-1) f ( 1 ) = f ( − 1 ) . But by restricting to
[ 0 , ∞ ) [0, \infty) [ 0 , ∞ ) , every non-negative real has exactly one non-negative square root, so
f : [ 0 , ∞ ) → [ 0 , ∞ ) f: [0, \infty) \to [0, \infty) f : [ 0 , ∞ ) → [ 0 , ∞ ) is bijective with inverse f − 1 ( x ) = x f^{-1}(x) = \sqrt{x} f − 1 ( x ) = x .
Similarly, f : ( − ∞ , 0 ] → [ 0 , ∞ ) , f ( x ) = x 2 f: (-\infty, 0] \to [0, \infty),\; f(x) = x^2 f : ( − ∞ , 0 ] → [ 0 , ∞ ) , f ( x ) = x 2 is bijective with inverse
f − 1 ( x ) = − x f^{-1}(x) = -\sqrt{x} f − 1 ( x ) = − x .
Scientific Notation and Approximation
Scientific Notation
A number in scientific notation has the form:
a = b × 10 k , b ∈ ′ { ′ R ′ } ′ , 1 ≤ ∣ b ∣ < 10 , k ∈ ′ { ′ Z ′ } ′ a = b \times 10^k, \quad b \in \mathbb{'\{'}R{'\}'},\; 1 \le |b| \lt 10,\; k \in \mathbb{'\{'}Z{'\}'} a = b × 1 0 k , b ∈ ′ { ′ R ′ } ′ , 1 ≤ ∣ b ∣ < 10 , k ∈ ′ { ′ Z ′ } ′
Multiplication of Numbers in Scientific Notation
Let a 1 = b 1 × 10 k 1 a_1 = b_1 \times 10^{k_1} a 1 = b 1 × 1 0 k 1 and a 2 = b 2 × 10 k 2 a_2 = b_2 \times 10^{k_2} a 2 = b 2 × 1 0 k 2 . Then:
a 1 ⋅ a 2 = b 1 ⋅ b 2 × 10 k 1 + k 2 a_1 \cdot a_2 = b_1 \cdot b_2 \times 10^{k_1 + k_2} a 1 ⋅ a 2 = b 1 ⋅ b 2 × 1 0 k 1 + k 2
If ∣ b 1 ⋅ b 2 ∣ ≥ 10 |b_1 \cdot b_2| \ge 10 ∣ b 1 ⋅ b 2 ∣ ≥ 10 or ∣ b 1 ⋅ b 2 ∣ < 1 |b_1 \cdot b_2| \lt 1 ∣ b 1 ⋅ b 2 ∣ < 1 , adjust the mantissa back into [ 1 , 10 ) [1, 10) [ 1 , 10 ) by
absorbing a factor of 10 10 10 into the exponent.
The rules for significant figures:
All non-zero digits are significant.
Zeros between non-zero digits are significant.
Leading zeros are not significant.
Trailing zeros after a decimal point are significant.
Trailing zeros in a whole number without a decimal point are ambiguous.
Examples: 0.00450 0.00450 0.00450 has 3 s.f.; 1020 1020 1020 has at least 3 s.f.; 1020. 1020. 1020. has 4 s.f.
Absolute and Relative Error
If a quantity's true value is x 0 x_0 x 0 and the approximate value is x a x_a x a :
Absolute error: ∣ x a − x 0 ∣ |x_a - x_0| ∣ x a − x 0 ∣
Relative error: ∣ x a − x 0 ∣ ∣ x 0 ∣ \dfrac{|x_a - x_0|}{|x_0|} ∣ x 0 ∣ ∣ x a − x 0 ∣
Upper and Lower Bounds
If a value is given as x = 12.5 x = 12.5 x = 12.5 (correct to 1 decimal place), then:
Upper bound: 12.55 12.55 12.55
Lower bound: 12.45 12.45 12.45
The interval is [ 12.45 , 12.55 ) [12.45,\; 12.55) [ 12.45 , 12.55 ) . The maximum possible error (absolute) is
1 2 × 10 − d \frac{1}{2} \times 10^{-d} 2 1 × 1 0 − d where d d d is the number of decimal places.
Worked example: Bounds The sides of a rectangle are measured as 5.2 c m 5.2\,\mathrm{cm} 5.2 cm and 3.8 c m 3.8\,\mathrm{cm} 3.8 cm (each to 1 d.p.).
Upper bounds: 5.25 5.25 5.25 and 3.85 3.85 3.85 . Lower bounds: 5.15 5.15 5.15 and 3.75 3.75 3.75 .
Maximum area: 5.25 × 3.85 = 20.2125 c m 2 5.25 \times 3.85 = 20.2125\,\mathrm{cm}^2 5.25 × 3.85 = 20.2125 cm 2 .
Minimum area: 5.15 × 3.75 = 19.3125 c m 2 5.15 \times 3.75 = 19.3125\,\mathrm{cm}^2 5.15 × 3.75 = 19.3125 cm 2 .
The area is 19.8 c m 2 ± 0.45 c m 2 19.8\,\mathrm{cm}^2 \pm 0.45\,\mathrm{cm}^2 19.8 cm 2 ± 0.45 cm 2 (to 1 d.p.), but in bounds form we write
19.3125 ≤ A < 20.2125 19.3125 \le A \lt 20.2125 19.3125 ≤ A < 20.2125 .
Sequences and Series
A sequence is a function f f f with domain ′ { ′ N ′ } ′ \mathbb{'\{'}N{'\}'} ′ { ′ N ′ } ′ (or ′ { ′ N ′ } ′ 0 \mathbb{'\{'}N{'\}'}_0 ′ { ′ N ′ } ′ 0 ) and codomain X X X . Writing
u n = f ( n ) u_n = f(n) u n = f ( n ) , every sequence is ordered by its index.
f : ′ { ′ N ′ } ′ → X , u n = f ( n ) , n ∈ ′ { ′ N ′ } ′ f: \mathbb{'\{'}N{'\}'} \to X, \quad u_n = f(n), \; n \in \mathbb{'\{'}N{'\}'} f : ′ { ′ N ′ } ′ → X , u n = f ( n ) , n ∈ ′ { ′ N ′ } ′
Series and Partial Sums
A series is the sum of the terms of a sequence. A partial sum S k S_k S k is the sum of the first
k k k terms:
S k = ∑ n = 1 k u n S_k = \sum_{n=1}^{k} u_n S k = ∑ n = 1 k u n
Sigma Notation Properties
Sigma notation obeys the following rules (where c c c is a constant independent of the index):
Linearity:
∑ n = 1 k ( a u n + b v n ) = a ∑ n = 1 k u n + b ∑ n = 1 k v n \sum_{n=1}^{k} (au_n + bv_n) = a\sum_{n=1}^{k} u_n + b\sum_{n=1}^{k} v_n ∑ n = 1 k ( a u n + b v n ) = a ∑ n = 1 k u n + b ∑ n = 1 k v n
Proof. Distribute the sum over each term and factor constants out. Each term a u n au_n a u n appears
exactly once in the expansion, so grouping gives a a a times the sum of u n u_n u n , and similarly for
b v n bv_n b v n . ■ \blacksquare ■
Index shifting: Replacing n n n with n + m n + m n + m shifts the bounds:
∑ n = p q u n = ∑ n = p + m q + m u n − m \sum_{n=p}^{q} u_n = \sum_{n=p+m}^{q+m} u_{n-m} ∑ n = p q u n = ∑ n = p + m q + m u n − m
Telescoping: If u n = v n − v n − 1 u_n = v_n - v_{n-1} u n = v n − v n − 1 , then ∑ n = 1 k u n = v k − v 0 \sum_{n=1}^{k} u_n = v_k - v_0 ∑ n = 1 k u n = v k − v 0 .
Worked example: Sigma manipulation Simplify ∑ n = 1 50 ( 3 n + 2 ) \sum_{n=1}^{50} (3n + 2) ∑ n = 1 50 ( 3 n + 2 ) .
By linearity:
3 ∑ n = 1 50 n + ∑ n = 1 50 2 = 3 ⋅ 50 ⋅ 51 2 + 2 ⋅ 50 = 3825 + 100 = 3925 3\sum_{n=1}^{50} n + \sum_{n=1}^{50} 2 = 3 \cdot \frac{50 \cdot 51}{2} + 2 \cdot 50 = 3825 + 100 = 3925 3 ∑ n = 1 50 n + ∑ n = 1 50 2 = 3 ⋅ 2 50 ⋅ 51 + 2 ⋅ 50 = 3825 + 100 = 3925 .
Arithmetic Sequences
An arithmetic sequence has a constant common difference d d d between consecutive terms:
a n = a 1 + ( n − 1 ) d a_n = a_1 + (n-1)d a n = a 1 + ( n − 1 ) d
Arithmetic Series — Proof by Pairing
Theorem. S k = k 2 ( a 1 + a k ) S_k = \frac{k}{2}(a_1 + a_k) S k = 2 k ( a 1 + a k )
Proof. Write the sum forward and backward:
S k = a 1 + ( a 1 + d ) + ( a 1 + 2 d ) + ⋯ + ( a 1 + ( k − 1 ) d ) S_k = a_1 + (a_1 + d) + (a_1 + 2d) + \cdots + (a_1 + (k-1)d) S k = a 1 + ( a 1 + d ) + ( a 1 + 2 d ) + ⋯ + ( a 1 + ( k − 1 ) d )
S k = a k + ( a k − d ) + ( a k − 2 d ) + ⋯ + ( a k − ( k − 1 ) d ) S_k = a_k + (a_k - d) + (a_k - 2d) + \cdots + (a_k - (k-1)d) S k = a k + ( a k − d ) + ( a k − 2 d ) + ⋯ + ( a k − ( k − 1 ) d )
Adding term-by-term, each pair sums to a 1 + a k a_1 + a_k a 1 + a k , and there are k k k such pairs:
2 S k = k ( a 1 + a k ) ⟹ S k = k 2 ( a 1 + a k ) 2S_k = k(a_1 + a_k) \implies S_k = \frac{k}{2}(a_1 + a_k) 2 S k = k ( a 1 + a k ) ⟹ S k = 2 k ( a 1 + a k )
Substituting a k = a 1 + ( k − 1 ) d a_k = a_1 + (k-1)d a k = a 1 + ( k − 1 ) d gives the alternative form:
S k = k 2 ( 2 a 1 + ( k − 1 ) d ) S_k = \frac{k}{2}\big(2a_1 + (k-1)d\big) S k = 2 k ( 2 a 1 + ( k − 1 ) d )
■ \blacksquare ■
In the IB formula booklet this is written as:
S n = n 2 ( 2 u 1 + ( n − 1 ) d ) = n 2 ( u 1 + u n ) S_n = \frac{n}{2}(2u_1 + (n-1)d) = \frac{n}{2}(u_1 + u_n) S n = 2 n ( 2 u 1 + ( n − 1 ) d ) = 2 n ( u 1 + u n )
Geometric Sequences
A geometric sequence has a constant common ratio r r r between consecutive terms:
u n = u 1 ⋅ r n − 1 u_n = u_1 \cdot r^{n-1} u n = u 1 ⋅ r n − 1
Geometric Series — Proof by Subtraction
Theorem. S k = u 1 ( 1 − r k ) 1 − r S_k = \dfrac{u_1(1 - r^k)}{1 - r} S k = 1 − r u 1 ( 1 − r k ) for r ≠ 1 r \ne 1 r = 1 .
Proof. Write S k S_k S k and r S k rS_k r S k :
S k = u 1 + u 1 r + u 1 r 2 + ⋯ + u 1 r k − 1 S_k = u_1 + u_1 r + u_1 r^2 + \cdots + u_1 r^{k-1} S k = u 1 + u 1 r + u 1 r 2 + ⋯ + u 1 r k − 1
r S k = u 1 r + u 1 r 2 + u 1 r 3 + ⋯ + u 1 r k rS_k = u_1 r + u_1 r^2 + u_1 r^3 + \cdots + u_1 r^k r S k = u 1 r + u 1 r 2 + u 1 r 3 + ⋯ + u 1 r k
Subtracting:
S k − r S k = u 1 − u 1 r k S_k - rS_k = u_1 - u_1 r^k S k − r S k = u 1 − u 1 r k
S k ( 1 − r ) = u 1 ( 1 − r k ) S_k(1 - r) = u_1(1 - r^k) S k ( 1 − r ) = u 1 ( 1 − r k )
S k = u 1 ( 1 − r k ) 1 − r ■ S_k = \frac{u_1(1 - r^k)}{1 - r} \qquad \blacksquare S k = 1 − r u 1 ( 1 − r k ) ■
Convergence of Geometric Series
If ∣ r ∣ < 1 |r| \lt 1 ∣ r ∣ < 1 , then lim k → ∞ r k = 0 \lim_{k \to \infty} r^k = 0 lim k → ∞ r k = 0 , so:
S ∞ = u 1 1 − r S_{\infty} = \frac{u_1}{1 - r} S ∞ = 1 − r u 1
If ∣ r ∣ ≥ 1 |r| \ge 1 ∣ r ∣ ≥ 1 , the series diverges.
Applications: Compound Interest
If a principal P P P is invested at rate r r r per period, compounded each period, the value after n n n
periods is:
A = P ( 1 + r ) n A = P(1 + r)^n A = P ( 1 + r ) n
This is a geometric sequence with u 1 = P u_1 = P u 1 = P and common ratio 1 + r 1 + r 1 + r .
Applications: Annuities
An annuity pays d d d per period for n n n periods, with interest rate r r r per period. The present value
is:
P V = d r ( 1 − 1 ( 1 + r ) n ) PV = \frac{d}{r}\left(1 - \frac{1}{(1+r)^n}\right) P V = r d ( 1 − ( 1 + r ) n 1 )
This follows directly from the geometric series sum with first term d 1 + r \frac{d}{1+r} 1 + r d and ratio
1 1 + r \frac{1}{1+r} 1 + r 1 .
Worked example: Compound interest A deposit of $5,000 earns 4% per year, compounded annually. Find the value after 10 years.
A = 5000 ( 1.04 ) 10 ≈ 5000 × 1.48024 = 7401.22 A = 5000(1.04)^{10} \approx 5000 \times 1.48024 = 7401.22 A = 5000 ( 1.04 ) 10 ≈ 5000 × 1.48024 = 7401.22
The value is approximately $7,401.22.
Worked example: Summation Find the sum of the first 20 terms of the arithmetic sequence 3 , 7 , 11 , … 3, 7, 11, \ldots 3 , 7 , 11 , …
Here a 1 = 3 a_1 = 3 a 1 = 3 , d = 4 d = 4 d = 4 , n = 20 n = 20 n = 20 .
S 20 = 20 2 ( 2 ( 3 ) + 19 ( 4 ) ) = 10 ( 6 + 76 ) = 10 × 82 = 820 S_{20} = \frac{20}{2}\big(2(3) + 19(4)\big) = 10(6 + 76) = 10 \times 82 = 820 S 20 = 2 20 ( 2 ( 3 ) + 19 ( 4 ) ) = 10 ( 6 + 76 ) = 10 × 82 = 820
Worked example: Infinite geometric series Find the sum of 1 + 1 2 + 1 4 + 1 8 + ⋯ 1 + \frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \cdots 1 + 2 1 + 4 1 + 8 1 + ⋯
Here u 1 = 1 u_1 = 1 u 1 = 1 , r = 1 2 r = \frac{1}{2} r = 2 1 . Since ∣ r ∣ < 1 |r| \lt 1 ∣ r ∣ < 1 :
S ∞ = 1 1 − 1 / 2 = 1 1 / 2 = 2 S_{\infty} = \frac{1}{1 - 1/2} = \frac{1}{1/2} = 2 S ∞ = 1 − 1/2 1 = 1/2 1 = 2
Worked example: Geometric series — finding n The sum of the first n n n terms of 2 , 6 , 18 , … 2, 6, 18, \ldots 2 , 6 , 18 , … is 6560 6560 6560 . Find n n n .
Here u 1 = 2 u_1 = 2 u 1 = 2 , r = 3 r = 3 r = 3 . Using S n = 2 ( 3 n − 1 ) 3 − 1 = 3 n − 1 S_n = \frac{2(3^n - 1)}{3 - 1} = 3^n - 1 S n = 3 − 1 2 ( 3 n − 1 ) = 3 n − 1 .
3 n − 1 = 6560 ⟹ 3 n = 6561 = 3 8 ⟹ n = 8 3^n - 1 = 6560 \implies 3^n = 6561 = 3^8 \implies n = 8 3 n − 1 = 6560 ⟹ 3 n = 6561 = 3 8 ⟹ n = 8
Logarithms
A logarithm is the inverse function of the exponential f ( x ) = b x f(x) = b^x f ( x ) = b x :
log b ( b x ) = x , b ∈ { r ∈ ′ { ′ R ′ } ′ ∣ r > 0 , r ≠ 1 } \log_b(b^x) = x, \quad b \in \{r \in \mathbb{'\{'}R{'\}'} \mid r \gt 0,\; r \ne 1\} log b ( b x ) = x , b ∈ { r ∈ ′ { ′ R ′ } ′ ∣ r > 0 , r = 1 }
The logarithm log b x \log_b x log b x answers the question: "to what power must b b b be raised to obtain x x x ?"
Logarithm Laws — Proofs
Law 1 (Product rule): log a ( x y ) = log a x + log a y \log_a(xy) = \log_a x + \log_a y log a ( x y ) = log a x + log a y
Proof. Let m = log a x m = \log_a x m = log a x and n = log a y n = \log_a y n = log a y . Then a m = x a^m = x a m = x and a n = y a^n = y a n = y . So
x y = a m ⋅ a n = a m + n xy = a^m \cdot a^n = a^{m+n} x y = a m ⋅ a n = a m + n . Taking log a \log_a log a of both sides:
log a ( x y ) = m + n = log a x + log a y \log_a(xy) = m + n = \log_a x + \log_a y log a ( x y ) = m + n = log a x + log a y . ■ \blacksquare ■
Law 2 (Quotient rule): log a ( x y ) = log a x − log a y \log_a\!\left(\dfrac{x}{y}\right) = \log_a x - \log_a y log a ( y x ) = log a x − log a y
Proof. Let m = log a x m = \log_a x m = log a x and n = log a y n = \log_a y n = log a y . Then a m = x a^m = x a m = x and a n = y a^n = y a n = y .
x y = a m a n = a m − n \frac{x}{y} = \frac{a^m}{a^n} = a^{m-n} y x = a n a m = a m − n . Taking log a \log_a log a :
log a ( x / y ) = m − n = log a x − log a y \log_a(x/y) = m - n = \log_a x - \log_a y log a ( x / y ) = m − n = log a x − log a y . ■ \blacksquare ■
Law 3 (Power rule): log a ( x m ) = m log a x \log_a(x^m) = m\log_a x log a ( x m ) = m log a x
Proof. Let n = log a x n = \log_a x n = log a x , so a n = x a^n = x a n = x . Then x m = ( a n ) m = a m n x^m = (a^n)^m = a^{mn} x m = ( a n ) m = a mn . Taking log a \log_a log a :
log a ( x m ) = m n = m log a x \log_a(x^m) = mn = m\log_a x log a ( x m ) = mn = m log a x . ■ \blacksquare ■
Theorem. log a x = log b x log b a \log_a x = \dfrac{\log_b x}{\log_b a} log a x = log b a log b x for any valid bases a , b > 0 a, b \gt 0 a , b > 0 , a , b ≠ 1 a, b \ne 1 a , b = 1 .
Proof. Let y = log a x y = \log_a x y = log a x . Then a y = x a^y = x a y = x . Taking log b \log_b log b of both sides:
log b ( a y ) = log b x \log_b(a^y) = \log_b x log b ( a y ) = log b x . By the power rule, y log b a = log b x y\log_b a = \log_b x y log b a = log b x , so
y = log b x log b a y = \frac{\log_b x}{\log_b a} y = l o g b a l o g b x . ■ \blacksquare ■
This is particularly useful for computing logarithms in bases other than 10 10 10 or e e e using a
calculator.
Solving Exponential Equations
When the variable is in the exponent, take logarithms of both sides to bring it down.
Worked example: Exponential equation Solve 3 2 x + 1 = 7 x − 2 3^{2x+1} = 7^{x-2} 3 2 x + 1 = 7 x − 2 for x x x .
Taking ln \ln ln of both sides: ( 2 x + 1 ) ln 3 = ( x − 2 ) ln 7 (2x+1)\ln 3 = (x-2)\ln 7 ( 2 x + 1 ) ln 3 = ( x − 2 ) ln 7 .
2 x ln 3 + ln 3 = x ln 7 − 2 ln 7 2x\ln 3 + \ln 3 = x\ln 7 - 2\ln 7 2 x ln 3 + ln 3 = x ln 7 − 2 ln 7
x ( 2 ln 3 − ln 7 ) = − 2 ln 7 − ln 3 x(2\ln 3 - \ln 7) = -2\ln 7 - \ln 3 x ( 2 ln 3 − ln 7 ) = − 2 ln 7 − ln 3
x = − 2 ln 7 − ln 3 2 ln 3 − ln 7 = 2 ln 7 + ln 3 ln 7 − 2 ln 3 ≈ 2 ( 1.946 ) + 1.099 1.946 − 2 ( 1.099 ) ≈ 4.990 − 0.252 ≈ − 19.8 x = \frac{-2\ln 7 - \ln 3}{2\ln 3 - \ln 7} = \frac{2\ln 7 + \ln 3}{\ln 7 - 2\ln 3} \approx \frac{2(1.946) + 1.099}{1.946 - 2(1.099)} \approx \frac{4.990}{-0.252} \approx -19.8 x = 2 l n 3 − l n 7 − 2 l n 7 − l n 3 = l n 7 − 2 l n 3 2 l n 7 + l n 3 ≈ 1.946 − 2 ( 1.099 ) 2 ( 1.946 ) + 1.099 ≈ − 0.252 4.990 ≈ − 19.8
Solving Logarithmic Equations
Use the logarithm laws to combine terms, then exponentiate both sides.
Worked example: Logarithmic equation Solve log 2 ( x + 3 ) + log 2 ( x − 3 ) = 4 \log_2(x + 3) + \log_2(x - 3) = 4 log 2 ( x + 3 ) + log 2 ( x − 3 ) = 4 .
By the product rule: log 2 ( ( x + 3 ) ( x − 3 ) ) = 4 \log_2\!\big((x+3)(x-3)\big) = 4 log 2 ( ( x + 3 ) ( x − 3 ) ) = 4 , i.e. log 2 ( x 2 − 9 ) = 4 \log_2(x^2 - 9) = 4 log 2 ( x 2 − 9 ) = 4 .
x 2 − 9 = 2 4 = 16 x^2 - 9 = 2^4 = 16 x 2 − 9 = 2 4 = 16 , so x 2 = 25 x^2 = 25 x 2 = 25 , giving x = 5 x = 5 x = 5 or x = − 5 x = -5 x = − 5 .
Check domain: x + 3 > 0 x + 3 \gt 0 x + 3 > 0 and x − 3 > 0 x - 3 \gt 0 x − 3 > 0 requires x > 3 x \gt 3 x > 3 . So x = − 5 x = -5 x = − 5 is rejected.
Solution: x = 5 x = 5 x = 5 .
Worked example: Change of base Evaluate log 3 20 \log_3 20 log 3 20 to 3 significant figures.
log 3 20 = ln 20 ln 3 = 2.9957 … 1.0986 … ≈ 2.73 \log_3 20 = \frac{\ln 20}{\ln 3} = \frac{2.9957\ldots}{1.0986\ldots} \approx 2.73 log 3 20 = l n 3 l n 20 = 1.0986 … 2.9957 … ≈ 2.73
Worked example: Exponential growth A bacteria culture doubles every 3 hours. If the initial population is 500 500 500 , when will it reach
32,000?
P ( t ) = 500 ⋅ 2 t / 3 P(t) = 500 \cdot 2^{t/3} P ( t ) = 500 ⋅ 2 t /3 . Set 500 ⋅ 2 t / 3 = 32000 500 \cdot 2^{t/3} = 32000 500 ⋅ 2 t /3 = 32000 :
2 t / 3 = 64 = 2 6 2^{t/3} = 64 = 2^6 2 t /3 = 64 = 2 6 , so t / 3 = 6 t/3 = 6 t /3 = 6 , giving t = 18 t = 18 t = 18 hours.
Alternatively, using logarithms: t 3 ln 2 = ln 64 = 6 ln 2 \frac{t}{3}\ln 2 = \ln 64 = 6\ln 2 3 t ln 2 = ln 64 = 6 ln 2 , so t = 18 t = 18 t = 18 .
Proof by Mathematical Induction
Mathematical induction is a technique for proving statements that are true for all natural numbers
(or all integers greater than or equal to some starting value).
Structure of an Inductive Proof
To prove P ( n ) P(n) P ( n ) for all n ≥ n 0 n \ge n_0 n ≥ n 0 :
Base case: Verify P ( n 0 ) P(n_0) P ( n 0 ) is true directly.
Inductive hypothesis: Assume P ( k ) P(k) P ( k ) is true for some arbitrary k ≥ n 0 k \ge n_0 k ≥ n 0 .
Inductive step: Using the hypothesis, prove that P ( k + 1 ) P(k+1) P ( k + 1 ) is true.
Conclusion: By the principle of mathematical induction, P ( n ) P(n) P ( n ) is true for all n ≥ n 0 n \ge n_0 n ≥ n 0 .
The logic is analogous to an infinite chain of dominoes: the base case knocks over the first domino,
and the inductive step ensures each domino knocks over the next.
Example: Prove the sum of squares formula Prove ∑ i = 1 n i 2 = n ( n + 1 ) ( 2 n + 1 ) 6 \sum_{i=1}^{n} i^2 = \frac{n(n+1)(2n+1)}{6} ∑ i = 1 n i 2 = 6 n ( n + 1 ) ( 2 n + 1 ) .
Base case (n = 1 n = 1 n = 1 ): LHS = 1 = 1 = 1 . RHS = 1 ⋅ 2 ⋅ 3 6 = 1 = \frac{1 \cdot 2 \cdot 3}{6} = 1 = 6 1 ⋅ 2 ⋅ 3 = 1 . True.
Inductive hypothesis: Assume ∑ i = 1 k i 2 = k ( k + 1 ) ( 2 k + 1 ) 6 \sum_{i=1}^{k} i^2 = \frac{k(k+1)(2k+1)}{6} ∑ i = 1 k i 2 = 6 k ( k + 1 ) ( 2 k + 1 ) for some k ≥ 1 k \ge 1 k ≥ 1 .
Inductive step:
∑ i = 1 k + 1 i 2 = k ( k + 1 ) ( 2 k + 1 ) 6 + ( k + 1 ) 2 \sum_{i=1}^{k+1} i^2 = \frac{k(k+1)(2k+1)}{6} + (k+1)^2 ∑ i = 1 k + 1 i 2 = 6 k ( k + 1 ) ( 2 k + 1 ) + ( k + 1 ) 2
= k ( k + 1 ) ( 2 k + 1 ) + 6 ( k + 1 ) 2 6 = \frac{k(k+1)(2k+1) + 6(k+1)^2}{6} = 6 k ( k + 1 ) ( 2 k + 1 ) + 6 ( k + 1 ) 2
= ( k + 1 ) [ k ( 2 k + 1 ) + 6 ( k + 1 ) ] 6 = \frac{(k+1)\big[k(2k+1) + 6(k+1)\big]}{6} = 6 ( k + 1 ) [ k ( 2 k + 1 ) + 6 ( k + 1 ) ]
= ( k + 1 ) ( 2 k 2 + k + 6 k + 6 ) 6 = \frac{(k+1)(2k^2 + k + 6k + 6)}{6} = 6 ( k + 1 ) ( 2 k 2 + k + 6 k + 6 )
= ( k + 1 ) ( 2 k 2 + 7 k + 6 ) 6 = ( k + 1 ) ( k + 2 ) ( 2 k + 3 ) 6 = \frac{(k+1)(2k^2 + 7k + 6)}{6} = \frac{(k+1)(k+2)(2k+3)}{6} = 6 ( k + 1 ) ( 2 k 2 + 7 k + 6 ) = 6 ( k + 1 ) ( k + 2 ) ( 2 k + 3 )
This equals ( k + 1 ) ( ( k + 1 ) + 1 ) ( 2 ( k + 1 ) + 1 ) 6 \frac{(k+1)((k+1)+1)(2(k+1)+1)}{6} 6 ( k + 1 ) (( k + 1 ) + 1 ) ( 2 ( k + 1 ) + 1 ) , which is the formula for n = k + 1 n = k+1 n = k + 1 . ■ \blacksquare ■
Divisibility Proofs
Example: Prove a divisibility result Prove 3 2 n − 1 3^{2n} - 1 3 2 n − 1 is divisible by 8 for all n ∈ ′ { ′ N ′ } ′ n \in \mathbb{'\{'}N{'\}'} n ∈ ′ { ′ N ′ } ′ .
Base case (n = 1 n = 1 n = 1 ): 3 2 − 1 = 9 − 1 = 8 3^2 - 1 = 9 - 1 = 8 3 2 − 1 = 9 − 1 = 8 , which is divisible by 8. True.
Inductive hypothesis: Assume 3 2 k − 1 = 8 m 3^{2k} - 1 = 8m 3 2 k − 1 = 8 m for some integer m m m .
Inductive step: Consider 3 2 ( k + 1 ) − 1 = 3 2 k + 2 − 1 = 9 ⋅ 3 2 k − 1 3^{2(k+1)} - 1 = 3^{2k+2} - 1 = 9 \cdot 3^{2k} - 1 3 2 ( k + 1 ) − 1 = 3 2 k + 2 − 1 = 9 ⋅ 3 2 k − 1 .
We rewrite: 9 ⋅ 3 2 k − 1 = 9 ( 3 2 k − 1 ) + 9 − 1 = 9 ⋅ 8 m + 8 = 8 ( 9 m + 1 ) 9 \cdot 3^{2k} - 1 = 9(3^{2k} - 1) + 9 - 1 = 9 \cdot 8m + 8 = 8(9m + 1) 9 ⋅ 3 2 k − 1 = 9 ( 3 2 k − 1 ) + 9 − 1 = 9 ⋅ 8 m + 8 = 8 ( 9 m + 1 ) .
Since 9 m + 1 9m + 1 9 m + 1 is an integer, 3 2 ( k + 1 ) − 1 3^{2(k+1)} - 1 3 2 ( k + 1 ) − 1 is divisible by 8. ■ \blacksquare ■
Inequality Proofs
Example: Prove 2 n > n 2^n \gt n 2 n > n for all n ≥ 1 n \ge 1 n ≥ 1 Base case (n = 1 n = 1 n = 1 ): 2 1 = 2 > 1 2^1 = 2 \gt 1 2 1 = 2 > 1 . True.
Inductive hypothesis: Assume 2 k > k 2^k \gt k 2 k > k for some k ≥ 1 k \ge 1 k ≥ 1 .
Inductive step: 2 k + 1 = 2 ⋅ 2 k > 2 k 2^{k+1} = 2 \cdot 2^k \gt 2k 2 k + 1 = 2 ⋅ 2 k > 2 k (by the hypothesis).
Since k ≥ 1 k \ge 1 k ≥ 1 , we have 2 k = k + k ≥ k + 1 2k = k + k \ge k + 1 2 k = k + k ≥ k + 1 . Therefore 2 k + 1 > k + 1 2^{k+1} \gt k + 1 2 k + 1 > k + 1 . ■ \blacksquare ■
Common Mistakes in Induction
Forgetting the base case: The inductive step alone proves only an implication
P ( k ) ⟹ P ( k + 1 ) P(k) \implies P(k+1) P ( k ) ⟹ P ( k + 1 ) . Without the base case, the chain never starts.
Using what you need to prove: The inductive hypothesis is P ( k ) P(k) P ( k ) , not P ( k + 1 ) P(k+1) P ( k + 1 ) . You must
derive P ( k + 1 ) P(k+1) P ( k + 1 ) , not assume it.
Incorrect algebra: Errors in the inductive step (especially with fractions or factorisation)
are the most common source of failed induction proofs.
Wrong starting value: If the statement is only claimed for n ≥ 3 n \ge 3 n ≥ 3 , verify the base case at
n = 3 n = 3 n = 3 , not n = 1 n = 1 n = 1 .
The Binomial Theorem
Statement
For any non-negative integer n n n and any a , b ∈ ′ { ′ R ′ } ′ a, b \in \mathbb{'\{'}R{'\}'} a , b ∈ ′ { ′ R ′ } ′ :
( a + b ) n = ∑ k = 0 n ( n k ) a n − k b k (a + b)^n = \sum_{k=0}^{n} \binom{n}{k} a^{n-k} b^k ( a + b ) n = ∑ k = 0 n ( k n ) a n − k b k
where the binomial coefficient is:
( n k ) = n ! k ! ( n − k ) ! \binom{n}{k} = \frac{n!}{k!(n-k)!} ( k n ) = k ! ( n − k )! n !
Pascal's Triangle
The binomial coefficients can be generated recursively via Pascal's triangle . Each entry is the
sum of the two entries above it:
( n k ) = ( n − 1 k − 1 ) + ( n − 1 k ) \binom{n}{k} = \binom{n-1}{k-1} + \binom{n-1}{k} ( k n ) = ( k − 1 n − 1 ) + ( k n − 1 )
Proof of Pascal's identity. Using the factorial definition:
( n − 1 k − 1 ) + ( n − 1 k ) = ( n − 1 ) ! ( k − 1 ) ! ( n − k ) ! + ( n − 1 ) ! k ! ( n − 1 − k ) ! \binom{n-1}{k-1} + \binom{n-1}{k} = \frac{(n-1)!}{(k-1)!(n-k)!} + \frac{(n-1)!}{k!(n-1-k)!} ( k − 1 n − 1 ) + ( k n − 1 ) = ( k − 1 )! ( n − k )! ( n − 1 )! + k ! ( n − 1 − k )! ( n − 1 )!
= ( n − 1 ) ! ⋅ k k ! ( n − k ) ! + ( n − 1 ) ! ⋅ ( n − k ) k ! ( n − k ) ! = \frac{(n-1)! \cdot k}{k!(n-k)!} + \frac{(n-1)! \cdot (n-k)}{k!(n-k)!} = k ! ( n − k )! ( n − 1 )! ⋅ k + k ! ( n − k )! ( n − 1 )! ⋅ ( n − k )
= ( n − 1 ) ! ( k + n − k ) k ! ( n − k ) ! = n ⋅ ( n − 1 ) ! k ! ( n − k ) ! = n ! k ! ( n − k ) ! = ( n k ) ■ = \frac{(n-1)!\big(k + n - k\big)}{k!(n-k)!} = \frac{n \cdot (n-1)!}{k!(n-k)!} = \frac{n!}{k!(n-k)!} = \binom{n}{k} \qquad \blacksquare = k ! ( n − k )! ( n − 1 )! ( k + n − k ) = k ! ( n − k )! n ⋅ ( n − 1 )! = k ! ( n − k )! n ! = ( k n ) ■
Properties of Binomial Coefficients
Symmetry: ( n k ) = ( n n − k ) \dbinom{n}{k} = \dbinom{n}{n-k} ( k n ) = ( n − k n )
Sum of row: ∑ k = 0 n ( n k ) = 2 n \displaystyle\sum_{k=0}^{n} \binom{n}{k} = 2^n k = 0 ∑ n ( k n ) = 2 n (set a = b = 1 a = b = 1 a = b = 1 in the theorem)
Alternating sum: ∑ k = 0 n ( − 1 ) k ( n k ) = 0 \displaystyle\sum_{k=0}^{n} (-1)^k \binom{n}{k} = 0 k = 0 ∑ n ( − 1 ) k ( k n ) = 0 (set a = 1 , b = − 1 a = 1,\; b = -1 a = 1 , b = − 1 )
Finding Specific Terms
The general term (the ( k + 1 ) (k+1) ( k + 1 ) -th term, since the sum starts at k = 0 k=0 k = 0 ) is:
T k + 1 = ( n k ) a n − k b k T_{k+1} = \binom{n}{k} a^{n-k} b^k T k + 1 = ( k n ) a n − k b k
Worked example: Expansion Expand ( 2 x − 3 ) 4 (2x - 3)^4 ( 2 x − 3 ) 4 using the binomial theorem.
( 2 x − 3 ) 4 = ( 4 0 ) ( 2 x ) 4 + ( 4 1 ) ( 2 x ) 3 ( − 3 ) + ( 4 2 ) ( 2 x ) 2 ( − 3 ) 2 + ( 4 3 ) ( 2 x ) ( − 3 ) 3 + ( 4 4 ) ( − 3 ) 4 (2x - 3)^4 = \binom{4}{0}(2x)^4 + \binom{4}{1}(2x)^3(-3) + \binom{4}{2}(2x)^2(-3)^2 + \binom{4}{3}(2x)(-3)^3 + \binom{4}{4}(-3)^4 ( 2 x − 3 ) 4 = ( 0 4 ) ( 2 x ) 4 + ( 1 4 ) ( 2 x ) 3 ( − 3 ) + ( 2 4 ) ( 2 x ) 2 ( − 3 ) 2 + ( 3 4 ) ( 2 x ) ( − 3 ) 3 + ( 4 4 ) ( − 3 ) 4
= 16 x 4 − 4 ⋅ 8 x 3 ⋅ 3 + 6 ⋅ 4 x 2 ⋅ 9 − 4 ⋅ 2 x ⋅ 27 + 81 = 16x^4 - 4 \cdot 8x^3 \cdot 3 + 6 \cdot 4x^2 \cdot 9 - 4 \cdot 2x \cdot 27 + 81 = 16 x 4 − 4 ⋅ 8 x 3 ⋅ 3 + 6 ⋅ 4 x 2 ⋅ 9 − 4 ⋅ 2 x ⋅ 27 + 81
= 16 x 4 − 96 x 3 + 216 x 2 − 216 x + 81 = 16x^4 - 96x^3 + 216x^2 - 216x + 81 = 16 x 4 − 96 x 3 + 216 x 2 − 216 x + 81
Worked example: Finding a specific coefficient Find the coefficient of x 3 x^3 x 3 in the expansion of ( 1 + 2 x ) 7 (1 + 2x)^7 ( 1 + 2 x ) 7 .
The term containing x 3 x^3 x 3 occurs when k = 3 k = 3 k = 3 :
T 4 = ( 7 3 ) ( 1 ) 7 − 3 ( 2 x ) 3 = 35 ⋅ 8 x 3 = 280 x 3 T_4 = \binom{7}{3}(1)^{7-3}(2x)^3 = 35 \cdot 8x^3 = 280x^3 T 4 = ( 3 7 ) ( 1 ) 7 − 3 ( 2 x ) 3 = 35 ⋅ 8 x 3 = 280 x 3
The coefficient is 280 280 280 .
Worked example: Binomial coefficient properties Evaluate ( 10 0 ) + ( 10 1 ) + ( 10 2 ) + ⋯ + ( 10 10 ) \binom{10}{0} + \binom{10}{1} + \binom{10}{2} + \cdots + \binom{10}{10} ( 0 10 ) + ( 1 10 ) + ( 2 10 ) + ⋯ + ( 10 10 ) .
By the row-sum property: ∑ k = 0 10 ( 10 k ) = 2 10 = 1024 \sum_{k=0}^{10} \binom{10}{k} = 2^{10} = 1024 ∑ k = 0 10 ( k 10 ) = 2 10 = 1024 .
Common Pitfalls
Confusing Subset and Element
1 ∈ { 1 , 2 , 3 } 1 \in \{1, 2, 3\} 1 ∈ { 1 , 2 , 3 } but 1 ∉ { { 1 } , 2 , 3 } 1 \notin \{\{1\}, 2, 3\} 1 ∈ / {{ 1 } , 2 , 3 } . In the second set, { 1 } \{1\} { 1 } is an element, not
1 1 1 . Meanwhile, { 1 } ⊆ { 1 , 2 , 3 } \{1\} \subseteq \{1, 2, 3\} { 1 } ⊆ { 1 , 2 , 3 } is true, but { 1 } ∈ { 1 , 2 , 3 } \{1\} \in \{1, 2, 3\} { 1 } ∈ { 1 , 2 , 3 } is false.
Domain and Range Errors
When finding inverse functions, always check that the original function is bijective on the given
domain. A common error is to write f − 1 ( x ) = x f^{-1}(x) = \sqrt{x} f − 1 ( x ) = x for f ( x ) = x 2 f(x) = x^2 f ( x ) = x 2 without restricting the
domain to [ 0 , ∞ ) [0, \infty) [ 0 , ∞ ) first.
Off-by-One in Sigma Notation
∑ n = 1 k \sum_{n=1}^{k} ∑ n = 1 k has k k k terms, not k − 1 k-1 k − 1 . ∑ n = 0 k \sum_{n=0}^{k} ∑ n = 0 k has k + 1 k+1 k + 1 terms. Be careful when
shifting indices.
Induction Base Case Errors
If the statement is claimed for n ≥ 2 n \ge 2 n ≥ 2 , verify the base case at n = 2 n = 2 n = 2 , not n = 0 n = 0 n = 0 or n = 1 n = 1 n = 1 .
A base case at the wrong value invalidates the entire proof.
Logarithm Domain Restrictions
The arguments of logarithms must be strictly positive. When solving log a f ( x ) = c \log_a f(x) = c log a f ( x ) = c , always check
that f ( x ) > 0 f(x) \gt 0 f ( x ) > 0 in your solution. A solution that violates this is extraneous.
Problem Set
Problem 1: Sets Let U = { 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 } U = \{1, 2, 3, 4, 5, 6, 7, 8, 9, 10\} U = { 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 } , A = { 1 , 2 , 3 , 4 , 5 } A = \{1, 2, 3, 4, 5\} A = { 1 , 2 , 3 , 4 , 5 } , B = { 3 , 4 , 5 , 6 , 7 } B = \{3, 4, 5, 6, 7\} B = { 3 , 4 , 5 , 6 , 7 } .
Find: (a) A ∩ B A \cap B A ∩ B , (b) A ∪ B A \cup B A ∪ B , (c) A ′ A' A ′ , (d) ( A ∩ B ) ′ (A \cap B)' ( A ∩ B ) ′ , (e) ∣ A ∪ B ∣ |A \cup B| ∣ A ∪ B ∣ .
Solution.
(a) A ∩ B = { 3 , 4 , 5 } A \cap B = \{3, 4, 5\} A ∩ B = { 3 , 4 , 5 }
(b) A ∪ B = { 1 , 2 , 3 , 4 , 5 , 6 , 7 } A \cup B = \{1, 2, 3, 4, 5, 6, 7\} A ∪ B = { 1 , 2 , 3 , 4 , 5 , 6 , 7 }
(c) A ′ = { 6 , 7 , 8 , 9 , 10 } A' = \{6, 7, 8, 9, 10\} A ′ = { 6 , 7 , 8 , 9 , 10 }
(d) ( A ∩ B ) ′ = { 1 , 2 , 6 , 7 , 8 , 9 , 10 } (A \cap B)' = \{1, 2, 6, 7, 8, 9, 10\} ( A ∩ B ) ′ = { 1 , 2 , 6 , 7 , 8 , 9 , 10 } . (Verify: this equals A ′ ∪ B ′ A' \cup B' A ′ ∪ B ′ by De Morgan's law.)
(e) ∣ A ∪ B ∣ = 7 |A \cup B| = 7 ∣ A ∪ B ∣ = 7
Problem 2: Functions — bijectivity Let f : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ f: \mathbb{'\{'}R{'\}'} \to \mathbb{'\{'}R{'\}'} f : ′ { ′ R ′ } ′ → ′ { ′ R ′ } ′ be defined by f ( x ) = x 3 − x f(x) = x^3 - x f ( x ) = x 3 − x . Determine whether f f f is
injective, surjective, or neither.
Solution.
Injective? No. f ( 0 ) = 0 f(0) = 0 f ( 0 ) = 0 and f ( 1 ) = 0 f(1) = 0 f ( 1 ) = 0 , so distinct inputs map to the same output.
Surjective? Yes. f ( x ) = x 3 − x = x ( x − 1 ) ( x + 1 ) f(x) = x^3 - x = x(x-1)(x+1) f ( x ) = x 3 − x = x ( x − 1 ) ( x + 1 ) . As x → ∞ x \to \infty x → ∞ , f ( x ) → ∞ f(x) \to \infty f ( x ) → ∞ ; as
x → − ∞ x \to -\infty x → − ∞ , f ( x ) → − ∞ f(x) \to -\infty f ( x ) → − ∞ . By continuity (a polynomial is continuous everywhere), f f f
attains every real value by the Intermediate Value Theorem.
Therefore f f f is surjective but not injective.
Problem 3: Scientific notation and error The speed of light is measured as 3.00 × 10 8 m / s 3.00 \times 10^8\,\mathrm{m/s} 3.00 × 1 0 8 m/s to 3 s.f. Find the absolute and
relative error.
Solution.
Upper bound: 3.005 × 10 8 3.005 \times 10^8 3.005 × 1 0 8 . Lower bound: 2.995 × 10 8 2.995 \times 10^8 2.995 × 1 0 8 .
Maximum absolute error:
1 2 × 10 8 − 2 = 0.005 × 10 8 = 5 × 10 5 m / s \frac{1}{2} \times 10^{8-2} = 0.005 \times 10^8 = 5 \times 10^5\,\mathrm{m/s} 2 1 × 1 0 8 − 2 = 0.005 × 1 0 8 = 5 × 1 0 5 m/s .
Relative error:
5 × 10 5 3.00 × 10 8 = 5 300 = 1 60 ≈ 0.0167 \frac{5 \times 10^5}{3.00 \times 10^8} = \frac{5}{300} = \frac{1}{60} \approx 0.0167 3.00 × 1 0 8 5 × 1 0 5 = 300 5 = 60 1 ≈ 0.0167 (about
1.67%).
Problem 4: Arithmetic series The sum of the first 40 terms of an arithmetic sequence is 4200 4200 4200 . The sum of the first 20 terms is
1600 1600 1600 . Find the first term and the common difference.
Solution.
S 40 = 20 ( 2 a 1 + 39 d ) = 4200 ⟹ 2 a 1 + 39 d = 210 ⋯ ( 1 ) S_{40} = 20(2a_1 + 39d) = 4200 \implies 2a_1 + 39d = 210 \quad \cdots (1) S 40 = 20 ( 2 a 1 + 39 d ) = 4200 ⟹ 2 a 1 + 39 d = 210 ⋯ ( 1 )
S 20 = 10 ( 2 a 1 + 19 d ) = 1600 ⟹ 2 a 1 + 19 d = 160 ⋯ ( 2 ) S_{20} = 10(2a_1 + 19d) = 1600 \implies 2a_1 + 19d = 160 \quad \cdots (2) S 20 = 10 ( 2 a 1 + 19 d ) = 1600 ⟹ 2 a 1 + 19 d = 160 ⋯ ( 2 )
( 1 ) − ( 2 ) (1) - (2) ( 1 ) − ( 2 ) : 20 d = 50 ⟹ d = 2.5 20d = 50 \implies d = 2.5 20 d = 50 ⟹ d = 2.5
From (2): 2 a 1 + 19 ( 2.5 ) = 160 ⟹ 2 a 1 = 160 − 47.5 = 112.5 ⟹ a 1 = 56.25 2a_1 + 19(2.5) = 160 \implies 2a_1 = 160 - 47.5 = 112.5 \implies a_1 = 56.25 2 a 1 + 19 ( 2.5 ) = 160 ⟹ 2 a 1 = 160 − 47.5 = 112.5 ⟹ a 1 = 56.25
Problem 5: Geometric series A geometric series has first term 5 5 5 and common ratio 1 3 \frac{1}{3} 3 1 . Find the sum to infinity and
determine how many terms are needed for the partial sum to exceed 99% of S ∞ S_{\infty} S ∞ .
Solution.
S ∞ = 5 1 − 1 / 3 = 5 2 / 3 = 7.5 S_{\infty} = \frac{5}{1 - 1/3} = \frac{5}{2/3} = 7.5 S ∞ = 1 − 1/3 5 = 2/3 5 = 7.5
We need S n > 0.99 × 7.5 = 7.425 S_n \gt 0.99 \times 7.5 = 7.425 S n > 0.99 × 7.5 = 7.425 .
S n = 5 ( 1 − ( 1 / 3 ) n ) 2 / 3 = 7.5 ( 1 − ( 1 / 3 ) n ) S_n = \frac{5(1 - (1/3)^n)}{2/3} = 7.5\big(1 - (1/3)^n\big) S n = 2/3 5 ( 1 − ( 1/3 ) n ) = 7.5 ( 1 − ( 1/3 ) n )
7.5 ( 1 − ( 1 / 3 ) n ) > 7.425 ⟹ 1 − ( 1 / 3 ) n > 0.99 ⟹ ( 1 / 3 ) n < 0.01 7.5\big(1 - (1/3)^n\big) \gt 7.425 \implies 1 - (1/3)^n \gt 0.99 \implies (1/3)^n \lt 0.01 7.5 ( 1 − ( 1/3 ) n ) > 7.425 ⟹ 1 − ( 1/3 ) n > 0.99 ⟹ ( 1/3 ) n < 0.01
n ln ( 1 / 3 ) < ln ( 0.01 ) n\ln(1/3) \lt \ln(0.01) n ln ( 1/3 ) < ln ( 0.01 ) . Since ln ( 1 / 3 ) < 0 \ln(1/3) \lt 0 ln ( 1/3 ) < 0 , dividing reverses the inequality:
n > ln ( 0.01 ) ln ( 1 / 3 ) = − 4.605 − 1.099 ≈ 4.19 n \gt \frac{\ln(0.01)}{\ln(1/3)} = \frac{-4.605}{-1.099} \approx 4.19 n > l n ( 1/3 ) l n ( 0.01 ) = − 1.099 − 4.605 ≈ 4.19
So n ≥ 5 n \ge 5 n ≥ 5 terms are needed.
Problem 6: Logarithms Solve for x x x : log 3 ( x − 1 ) + log 3 ( x + 1 ) = log 3 8 \log_3(x - 1) + \log_3(x + 1) = \log_3 8 log 3 ( x − 1 ) + log 3 ( x + 1 ) = log 3 8 .
Solution.
By the product rule: log 3 ( ( x − 1 ) ( x + 1 ) ) = log 3 8 \log_3\!\big((x-1)(x+1)\big) = \log_3 8 log 3 ( ( x − 1 ) ( x + 1 ) ) = log 3 8
( x − 1 ) ( x + 1 ) = 8 ⟹ x 2 − 1 = 8 ⟹ x 2 = 9 ⟹ x = 3 o r x = − 3 (x-1)(x+1) = 8 \implies x^2 - 1 = 8 \implies x^2 = 9 \implies x = 3 \mathrm{ or } x = -3 ( x − 1 ) ( x + 1 ) = 8 ⟹ x 2 − 1 = 8 ⟹ x 2 = 9 ⟹ x = 3 or x = − 3
Domain check: x − 1 > 0 ⟹ x > 1 x - 1 \gt 0 \implies x \gt 1 x − 1 > 0 ⟹ x > 1 . So x = − 3 x = -3 x = − 3 is rejected.
Solution: x = 3 x = 3 x = 3 .
Problem 7: Mathematical induction Prove by induction that n 3 + 2 n n^3 + 2n n 3 + 2 n is divisible by 3 3 3 for all n ∈ ′ { ′ N ′ } ′ n \in \mathbb{'\{'}N{'\}'} n ∈ ′ { ′ N ′ } ′ .
Solution.
Base case (n = 1 n = 1 n = 1 ): 1 3 + 2 ( 1 ) = 3 1^3 + 2(1) = 3 1 3 + 2 ( 1 ) = 3 , which is divisible by 3 3 3 . True.
Inductive hypothesis: Assume k 3 + 2 k = 3 m k^3 + 2k = 3m k 3 + 2 k = 3 m for some integer m m m .
Inductive step:
( k + 1 ) 3 + 2 ( k + 1 ) = k 3 + 3 k 2 + 3 k + 1 + 2 k + 2 (k+1)^3 + 2(k+1) = k^3 + 3k^2 + 3k + 1 + 2k + 2 ( k + 1 ) 3 + 2 ( k + 1 ) = k 3 + 3 k 2 + 3 k + 1 + 2 k + 2
= ( k 3 + 2 k ) + 3 k 2 + 3 k + 3 = (k^3 + 2k) + 3k^2 + 3k + 3 = ( k 3 + 2 k ) + 3 k 2 + 3 k + 3
= 3 m + 3 ( k 2 + k + 1 ) = 3 ( m + k 2 + k + 1 ) = 3m + 3(k^2 + k + 1) = 3\big(m + k^2 + k + 1\big) = 3 m + 3 ( k 2 + k + 1 ) = 3 ( m + k 2 + k + 1 )
Since m + k 2 + k + 1 m + k^2 + k + 1 m + k 2 + k + 1 is an integer, ( k + 1 ) 3 + 2 ( k + 1 ) (k+1)^3 + 2(k+1) ( k + 1 ) 3 + 2 ( k + 1 ) is divisible by 3 3 3 . ■ \blacksquare ■
Problem 8: Binomial theorem Find the term independent of x x x in the expansion of ( x 2 + 1 x ) 9 \left(x^2 + \dfrac{1}{x}\right)^9 ( x 2 + x 1 ) 9 .
Solution.
The general term is
T k + 1 = ( 9 k ) ( x 2 ) 9 − k ( 1 x ) k = ( 9 k ) x 18 − 2 k − k = ( 9 k ) x 18 − 3 k T_{k+1} = \binom{9}{k}(x^2)^{9-k}\!\left(\dfrac{1}{x}\right)^k = \binom{9}{k} x^{18-2k-k} = \binom{9}{k} x^{18-3k} T k + 1 = ( k 9 ) ( x 2 ) 9 − k ( x 1 ) k = ( k 9 ) x 18 − 2 k − k = ( k 9 ) x 18 − 3 k .
For the term to be independent of x x x , we need 18 − 3 k = 0 18 - 3k = 0 18 − 3 k = 0 , so k = 6 k = 6 k = 6 .
The term is ( 9 6 ) x 0 = ( 9 3 ) = 84 \binom{9}{6} x^0 = \binom{9}{3} = 84 ( 6 9 ) x 0 = ( 3 9 ) = 84 .
tip
Ready to test your understanding of Number and Algebra ? The diagnostic test contains the hardest questions within the IB specification for this topic, each with a full worked solution.
Unit tests probe edge cases and common misconceptions. Integration tests combine Number and Algebra with other IB mathematics topics to test synthesis under exam conditions.
See Diagnostic Guide for instructions on self-marking and building a personal test matrix.