Group
← Back
Part 1 of 2 · The exponential thread

The exponential thread - The puzzle

September 24, 2023Math

This is a story about how a single line of 3rd-grade arithmetic, taken seriously for long enough, leads to one of the most extraordinary unifications in modern mathematics.

The line is this:

Multiplication is repeated addition. Most people meet this idea once, accept it, and never look at it again. But every interesting question in this series is hiding inside it, waiting for someone to keep asking "what if".

By the end of the climb, the same chain of questions will have walked us to

and through it to a structural picture in which , , , , complex numbers, matrices, and rotations are not five separate ideas that happen to be related. They are projections of one object: the exponential map of a one-dimensional Lie group (more about both later). Right now that sentence is a mouthful with no meaning attached to it. By Part 9, every word in it will have a concrete picture behind it.

The cast

Before we climb, here is the gallery of objects this series is about. You have probably met all of them in school.

  • , a special constant that turns up everywhere
  • , the natural logarithm (where "natural" gets a real definition in Part 2)
  • and , the functions of the unit circle
  • , Euler's identity, often called the most beautiful equation in mathematics

These look like residents of different chapters of a textbook. One from algebra, one from calculus, two from geometry, the last from a chapter of complex analysis nobody assigned. But, they are not. The whole point of this series is to see, slowly and with the right pictures, that they are the same idea presented in different languages.

The first crack

Look again at the opening line.

Multiplication is the shorthand for "add the same number to itself N times". Once you have a notation for "do an operation N times", it becomes natural to ask what happens at the edges of the notation. What if N is zero? Negative? A fraction? An irrational number? Imaginary?

For multiplication those edge cases push the number system to widen. "Multiply by zero" forces zero to exist as a thing you multiply with, with its own rule (). "Multiply by negative two" forces the negative integers to make sense as multipliers, with their own sign rules. Each shorthand we adopt opens a new question, and each new question demands the number system grow to accommodate the answer.

Now apply the trick one rung up the ladder. If multiplication is repeated addition, then exponentiation is repeated multiplication.

Same story, one level higher. We have a shorthand for "multiply N copies of a number together". The same family of edge cases shows up at the door, now louder.

ExpressionWhat it meansWhat it forces into existence
zero copies multipliedthe convention that
a negative exponentthe rationals (with )
a fractional exponentthe irrationals (and a Pythagorean crisis)
a real-number powercontinuity and limits
an imaginary powerthe complex numbers

The simple game of shorthand has already opened the door to most of analysis. We have not even touched the last row, which is where the rest of this series lives.

Two inverses, not one

Subtraction is the inverse of addition. Division is the inverse of multiplication. Most school math leaves it at that, one inverse per operation.

Exponentiation breaks the pattern, because it has two inverses, not one. Both deserve a name.

Take the equation

with three letters in it. Fix any two and try to recover the third, and you get a different inverse for each choice.

GivenSolve forOperationNotation
and exponentiation
and the -th root
and the logarithm in base

Roots and logarithms are siblings, two distinct inverses of the same operation. Most people's intuition for the logarithm stays fuzzy through their entire mathematical life because they never noticed it lived in a separate family from the root.

Once you see it cleanly, the logarithm becomes one of the central characters of this series. The "natural log" in particular will turn out to be the inverse of one specific exponential, not just written with a different notation.

Where this is going

A nine-part climb. Each part hangs on one intuitive picture, and each step revisits a fact you "already knew" from the previous part and reveals it to have been a special case.

  1. The puzzle (this post). The cast and the climb.
  2. Why is the natural base. A unique number with the property that the exponential function equals its own derivative. Compounding interest, learning-rate decay, and the softmax function in machine learning all reach for for the same structural reason.
  3. Sin and cos as a moving point. Geometry first. All the trig identities fall out of a single point moving on the unit circle. A pure tone in audio is the projection of that point.
  4. Taylor series. A function reconstructed from its derivatives at one point. This is also how libm actually computes and on your CPU.
  5. The reveal. Plug into the Taylor series for . Out comes . Multiplication by is a 90° rotation, and that fact makes Euler's identity feel inevitable. The Fourier transform sits one paragraph downstream.
  6. Same equation, different costume. describes growth and decay. describes oscillation. They are the same equation in disguise, and numerical ODE solvers exploit this.
  7. Matrix exponentials. Once accepts a matrix as input, and appear as entries of a rotation matrix. RoPE (Rotary Position Embedding), the position encoding inside every modern transformer, is exactly this idea applied to attention vectors.
  8. Groups and homomorphisms. Identities like and the formula are the same fact stated in two presentations. RSA's modular exponentiation and the Diffie-Hellman key exchange live in this house too.
  9. One map to rule them all. The 1-dimensional Lie groups classified, and the exponential map identified. Looking back at the climb, we will see we were studying one function the entire time.

What's next

Part 2 picks up the thread at the number itself. Where does it come from, why does this one number out of all real numbers get the title "the natural" base, and why does every probabilistic and machine-learning context that needs an exponential reach for this one and not or ?