1. Introduction

To my generous Patreon editors: please note that this Chapter is not quite ready to be edited. As it often happens, the introductory chapter will be the last one to be completed before the book is done!

This is a continuation of An Introduction to Tensor Calculus which was devoted to the construction of the analytical framework of Tensor Calculus. In this book, we will use the Tensor Calculus framework to study shape -- in other words, curvature.

Historically, the development of understanding of curvature unfolded in two distinct phases. The first phase, which lasted until about the middle of the nineteenth century, was preoccupied with gaining insight into curvature by developing measures for quantifying it and discovering fundamental laws relating those measures. This phase is associated primarily with the groundbreaking works of Leonhard Euler and Carl Friedrich Gauss. The second phase, originated by Gauss and advanced by Bernhard Riemann and Albert Einstein, is aimed at understanding the geometric properties of the underlying space itself.

Our narrative will speak to both of these elements, but it will have a different focal point. Our focus will be on the development of the analytical framework for studying curvature.

1.1Curvature and Tensor Calculus

Although spectacular in terms of achievement, development of humanity's understanding of curvature has been laborious and slow. Meanwhile, the tensor framework will make most of the concepts and equations appear deceptively simple as many of the most profound and hard-fought facts will fall before us like dominos. For example, Gauss's famous Theorema Egregium -- the Remarkable Theorem -- will come across as almost entirely self-evident. His celebrated equations of the surface

B_{\alpha\gamma}B_{\beta\delta}-B_{\beta\gamma}B_{\alpha\delta}=R_{\alpha \beta\gamma\delta} \tag{7.38}

will be derived in an exercise in the next Chapter. Meanwhile, Euler's remarkable discovery that the surface swept out by tangents to a three-dimensional curve is developable will, too, be found in an exercise. In fact, most of the facts that might occur to us naturally, some found in exercises, would have constituted results worthy of an admirable nineteenth-century doctoral thesis.

Such retrospective ease is not unique to Tensor Calculus but is a general phenomenon of human forward movement. In music, it is said, the dissonance of today is the consonance of tomorrow. This law of evolution of human thought should in no way diminish our admiration for the remarkable achievements of the great minds of the past. As we celebrate our superior tools of the trade, we should contemplate what has been lost because of them.

1.2The three degrees of curvature

Let us begin by having an informal discussion of curvature and, in particular, its interplay with coordinates systems. The crucial observations made in this Section, although stated as facts, are meant to appeal to your geometric intuition and to guide our future analytical investigations.

1.2.1Curvilinear coordinates in a Euclidean space

In Introduction to Tensor Calculus, we devoted much of our attention to Euclidean spaces characterized by straightness. A Euclidean space can accommodate a straight line in any direction and can therefore be referred to an affine -- and, in particular, Cartesian -- coordinate system. Nevertheless, in almost all of our theoretical explorations we assumed a more general curvilinear coordinate system.

(1.1)

Our primary motivation for this choice was to eliminate the possible artifacts of a special coordinate system. Thus, we did not assume that the chosen coordinate system had any special features, such as straight regularly-spaced coordinate lines. When we did, on rare occasions, resort to affine coordinates, it was to specifically call attention to the Euclidean nature of our space. Our diligent self-limitation paid off as it led us to develop a more general framework of great robustness.

However, in order to arrive at such a framework, we had to first contend the curvature effects associated with curvilinear coordinates. The most obvious and striking complication lies in the fact that the covariant basis

\mathbf{Z}_{i}

, which is regular (i.e. constant) in affine coordinates, varies from one point to the next in curvilinear coordinates.

(1.2)

As a result, vectors at different points cannot be compared, added, or otherwise related in any way by analyzing their components in the familiar Cartesian fashion. Furthermore, the partial derivative

\frac{\partial}{\partial Z^{i}}\tag{1.3}

applied to the components of a vector field does not fully capture its rate of change since it ignores the variability in the accompanying basis. This loss of geometric information manifested itself in the lack of the tensor property of the partial derivative of a tensor.

Outside of our geometric study of curves, this was our first encounter with the effects of curvature. However, this curvature is of the spurious nature: since a Euclidean space is fundamentally straight, these effects are artifacts of our analysis and, as a result, can be completely mitigated. This is accomplished by introducing the covariant derivative

\nabla_{i}

. The covariant derivative restores the tensor property, possesses the metrinilic property with respect to the basis, i.e.

\nabla_{i}\mathbf{Z}_{j}=\mathbf{0}\tag{1.4}

which is akin to the affine property

\frac{\partial\mathbf{i}}{\partial Z^{i}}=\frac{\partial\mathbf{j}}{\partial Z^{i}}=\frac{\partial\mathbf{k}}{\partial Z^{i}}=\mathbf{0},\tag{1.5}

and preserves the product rule.

In summary, the tensor framework handles this lowest "degree" of curvature rather easily while offering a great deal of insight.

1.2.2Developable surfaces

The next "degree" of curvature is represented by surfaces that can be formed by smoothly curving a flat sheet of paper, i.e. a region of a plane, without any distortion. For a transformation to not introduce any distortion it must preserve the distance between any two points as measured within the surface. We must note, however, that the concept of distance as measured within the surface is arguably unclear and needs to be precisely defined. Nevertheless, such transformations are known as isometric and surfaces that can be isometrically transformed into a region of a plane are known as developable. Thus, we can continue to think of a developable surface as a Euclidean plane from the viewpoint of their internal geometry but not from the overall three-dimensional point of view.

(1.6)

Developable surfaces were first investigated by Euler in his 1772 work De solidis quorum superficiem in planum explicare licet or On shapes that can be unwrapped onto a plane. The classical examples of developable surfaces are the cylinder and the cone.

(1.7)

Note that these two surfaces need to be cut in order to be transformed into a region of a plane.

Another general type of a developable surface is known as the tangent developable of a curve. It is the surface swept out by the tangent lines to a three-dimensional curve.

(1.8)

In De solidis quorum superficiem in planum explicare licet, Euler not only demonstrated that the tangent developable is, indeed, a developable surface, but that every developable surface that is not a cylinder or a cone is a tangent developable for some curve.

On the one hand, developable surfaces are undeniably curved. On the other hand, they allow regular coordinate systems which can be accomplished easily by referring the flat sheet of paper to a regular coordinate system prior to curving it into the final shape, as illustrated in the following figure.

(1.9)

Of course, we must clarify the sense in which a coordinate system on the curve surface is regular. Indeed, in the above figure, all of the coordinate lines are curved and the coordinate nodes are not arranged in any kind of regular pattern in the three-dimensional space. Yet, it is intuitively clear that, with respect to the surface, the coordinate lines are indeed regular. Thus, there is a clear dichotomy between how the image appears in the overall three-dimensional space and how it is perceived within the surface. We will use the term intrinsic to describe the geometric action that takes place strictly in the surface without a reference to ambient objects. For example, we will describe developable surfaces as curved in the three-dimensional sense and as flat in the intrinsic sense.

A surface swept by straight lines (which are not necessarily the tangents of a given curve) is known as a ruled surface. We will show that every developable surface is ruled. This becomes intuitively clear when inspecting a curved sheet of paper and noting that, at every point, the direction in which the sheet is straight continues to infinity or, at least, to the edge of the sheet.

(1.10)

The shape of a fluttering flag is often used as an example of a developable surface since a flag, it would seem, can be flattened out without distortion. However, a fluttering flag is clearly not a ruled surface since it is not developable, as seen in the following figure.

(1.11)

Therefore, modeling it as a developable surface will inevitably fail to capture some of its properties.

Finally, note that not all ruled surfaces are developable as evidenced by the parabolic hyperboloid in the following figure.

(1.12)

In our pursuit of understanding curvature, one of our overarching goals will be to derive analytical criteria for developable surfaces.

1.2.3Intrinsically curved surfaces

Non-developable surfaces represent the third, highest "degree" of curvature. Such surfaces can be considered to be curved in a more profound way since they cannot be flattened without distortion. For example, a sphere -- or any part, no matter how small, thereof -- cannot be unwrapped onto a region of a plane without distortion. This is a well-known phenomenon in cartography when one attempts to represent the round Earth on a two dimensional map. When one attempts to do so in a smooth fashion, significant distortion results.

(1.13)

The amount of distortion can be reduced with the help of tears, but it cannot be eliminated.

(1.14)

It follows from these observations that a non-developable surface does not admit an intrinsically regular coordinate system. This has important implications since a number of arguments used in Introduction to Tensor Calculus relied on the availability of affine coordinates. In particular, the fact the Riemann-Christoffel tensor vanishes fundamentally relied on affine coordinates. We can therefore expect that the corresponding surface object continues to vanish for developable surfaces but not for non-developable surfaces. In fact, this holds the promise of yielding a criterion for developable surfaces which will be explored in Chapter 7.

1.3A brief review of Euclidean spaces and the tensor framework

Let us now summarize the key concepts and notation introduced in Introduction to Tensor Calculus.

The position vector

\mathbf{R}

at a point

P

in a Euclidean space is the vector that emanates from an arbitrary fixed point

O

, known as the origin, and terminates

P

Refer the Euclidean space to arbitrary coordinates

Z^{i}

. Then the position vector

\mathbf{R}

becomes a function of the coordinates which, in a three-dimensional space reads

\mathbf{R}=\mathbf{R}\left( Z^{1},Z^{2},Z^{3}\right) .\tag{1.15}

We typically abbreviate such expressions by representing the enumerated arguments of functions by a single letter, i.e.

\mathbf{R}=\mathbf{R}\left( Z\right) .\tag{1.16}

The covariant basis

\mathbf{Z}_{i}

is given by the partial derivative

\mathbf{Z}_{i}=\frac{\partial\mathbf{R}\left( Z\right) }{\partial Z^{i}}.\tag{1.17}

The covariant metric tensor

Z_{ij}

is obtained by pairwise dot products of the covariant basis vectors, i.e.

Z_{ij}=\mathbf{Z}_{i}\cdot\mathbf{Z}_{j}.\tag{1.18}

The contravariant metric tensor

Z^{ij}

is the matrix inverse of

Z_{ij}

, i.e.

Z^{ij}Z_{jk}=\delta_{k}^{i},\tag{1.19}

where

\delta_{k}^{i}

is the Kronecker delta system. The contravariant basis

\mathbf{Z}^{i}

is given by the contraction

\mathbf{Z}^{i}=Z^{ij}\mathbf{Z}_{j}.\tag{1.20}

The covariant and the contravariant bases are mutually orthogonal, i.e.

\mathbf{Z}^{i}\cdot\mathbf{Z}_{j}=\delta_{j}^{i}.\tag{1.21}

Also, the last property can serve as an alternative definition of the contravariant basis

\mathbf{Z}^{i}

. The determinant of the covariant metric tensor

Z_{ij}

is denoted by

Z

and its square root,

\sqrt{Z}

, is referred to as the volume element.

The permutation systems

e_{ijk}

and

e^{ijk}

have the following values

e_{ijk},e^{ijk}=\left\{ \begin{array} {ll} \phantom{+} 1\text{,} & \text{if }ijk\text{ is an even permutation of the numbers }1,2,3\\ -1\text{,} & \text{if }ijk\text{ is an odd permutation of the numbers }1,2,3\\ \phantom{+} 0\text{,} & \text{if }ijk\text{ is not a permutation of the numbers }1,2,3. \end{array} \right.\tag{1.22}

The Levi-Civita symbols

\varepsilon_{ijk}

and

\varepsilon^{ijk}

are defined by the identities

\begin{aligned}\varepsilon_{ijk} & =\sqrt{Z}e_{ijk}\text{ and}\ \ \ \ \ \ \ \ \ \ \left(1.23\right)\\\varepsilon^{ijk} & =\frac{e^{ijk}}{\sqrt{Z}}.\ \ \ \ \ \ \ \ \ \ \left(1.24\right)\end{aligned}

The complete delta system

\delta_{rst}^{ijk}

is given by

\delta_{rst}^{ijk}=e^{ijk}e_{rst}\tag{1.25}

or, equivalently,

\delta_{rst}^{ijk}=\varepsilon^{ijk}\varepsilon_{rst}\tag{1.26}

and can be expressed in terms of the Kronecker delta by the identity

\delta_{rst}^{ijk}=\delta_{r}^{i}\delta_{s}^{j}\delta_{t}^{k}+\delta_{s} ^{i}\delta_{t}^{j}\delta_{r}^{k}+\delta_{t}^{i}\delta_{r}^{j}\delta_{s} ^{k}-\delta_{r}^{i}\delta_{t}^{j}\delta_{s}^{k}-\delta_{t}^{i}\delta_{s} ^{j}\delta_{r}^{k}-\delta_{s}^{i}\delta_{r}^{j}\delta_{t}^{k}.\tag{1.27}

The partial delta system

\delta_{rs}^{ij}

has the values

\tag{1.28}

delta_{rs}^{ij}=left{ begin{tabular} {ll}

\begin{array} {c} \\ \phantom{+} 1\text{, }\\ \\ \phantom{+} \end{array}

\begin{array} {l} \\ \text{if the superscripts and the subscripts are identical sets}\\ \text{of distinct numbers related by an \textit{even} permutation}\\ \phantom{+} \end{array}

\begin{array} {c} -1\text{,}\\ \\ \phantom{+} \end{array}

\begin{array} {l} \text{if the superscripts and the subscripts are identical sets}\\ \text{of distinct numbers related by an \textit{odd} permutation}\\ \phantom{+} \end{array}

\begin{array} {c} \phantom{+} 0\text{,}\\ \end{array}

\begin{array} {c} \text{for all other combinations of indices.}\\ \end{array}

end{tabular} right. end{equation} It satisfies the identity

\delta_{rs}^{ij}=\delta_{r}^{i}\delta_{s}^{j}-\delta_{r}^{j}\delta_{s}^{i}\tag{1.29}

and can be obtained from the complete delta system

\delta_{rst}^{ijk}

by a single contraction, i.e.

\delta_{rs}^{ij}=\delta_{rsk}^{ijk}.\tag{1.30}

Thus, we also have the frequently used identity

\delta_{rsk}^{ijk}=\delta_{r}^{i}\delta_{s}^{j}-\delta_{r}^{j}\delta_{s}^{i}.\tag{1.31}

1.3.1Index juggling

The metric tensors are used to implement the operations of index juggling. Contraction with the covariant metric tensor

Z_{ij}

is captured by index lowering, e.g.

T_{ik}=Z_{ij}T_{\cdot k}^{j}.\tag{1.32}

Contraction with the contravariant metric tensor

Z^{ij}

is captured by index raising, e.g.

T_{\cdot k}^{i}=Z^{ij}T_{jk}.\tag{1.33}

The two operations are the inverses of each other.

1.3.2The Christoffel symbols

The Christoffel symbol

\Gamma_{jk}^{i}

is given by

\Gamma_{jk}^{i}=\mathbf{Z}^{i}\cdot\frac{\partial\mathbf{Z}_{j}}{\partial Z^{k}}\tag{1.34}

or, equivalently, by

\Gamma_{jk}^{i}=\frac{1}{2}Z^{im}\left( \frac{\partial Z_{mj}}{\partial Z^{k}}+\frac{\partial Z_{mk}}{\partial Z^{j}}-\frac{\partial Z_{jk}}{\partial Z^{m}}\right) .\tag{1.35}

The symbol

\Gamma_{jk}^{i}

is sometimes referred to as the Christoffel symbol of the second kind. The Christoffel symbol of the first kind

\Gamma_{i,jk}

is obtained by lowering the index

i

\Gamma_{jk}^{i}

, i.e.

\Gamma_{i,jk}=Z_{im}\Gamma_{jk}^{m}.\tag{1.36}

Alternatively,

\Gamma_{i,jk}

is given by

\Gamma_{i,jk}=\frac{1}{2}\left( \frac{\partial Z_{ij}}{\partial Z^{k}} +\frac{\partial Z_{ik}}{\partial Z^{j}}-\frac{\partial Z_{jk}}{\partial Z^{i} }\right) .\tag{1.37}

The partial derivative of the covariant metric tensor

Z_{ij}

with respect to

Z^{k}

is given by the identity

\frac{\partial Z_{ij}}{\partial Z^{k}}=\Gamma_{i,jk}+\Gamma_{j,ik}.\tag{1.38}

The derivative of the volume element

\sqrt{Z}

is given by

\frac{\partial\sqrt{Z}}{\partial Z^{i}}=\sqrt{Z}\Gamma_{ji}^{j}.\tag{1.39}

The divergence

\nabla_{i}U^{i}

of a variant field

U^{i}

is given by the Voss-Weyl formula

\nabla_{i}U^{i}=\frac{1}{\sqrt{Z}}\frac{\partial}{\partial Z^{i}}\left( \sqrt{Z}U^{i}\right) .\tag{1.40}

Correspondingly, the Laplacian

\nabla_{i}\nabla^{i}U

of a scalar field

U

is given by

\nabla_{i}\nabla^{i}U=\frac{1}{\sqrt{Z}}\frac{\partial}{\partial Z^{i}}\left( \sqrt{Z}Z^{ij}\frac{\partial U}{\partial Z^{j}}\right) .\tag{1.41}

The Christoffel symbol vanishes in affine coordinates, i.e.

\Gamma_{jk}^{i}=0.\tag{1.42}

Its nonzero elements in cylindrical coordinates

r,\theta,z

are

\begin{aligned}\Gamma_{22}^{1} & =-r\ \ \ \ \ \ \ \ \ \ \left(1.43\right)\\\Gamma_{12}^{2} & =\Gamma_{21}^{2}=\frac{1}{r}.\ \ \ \ \ \ \ \ \ \ \left(1.44\right)\end{aligned}

while in spherical coordinates

r.\theta,\varphi

they are

\begin{aligned}\Gamma_{22}^{1} & =-r\ \ \ \ \ \ \ \ \ \ \left(1.45\right)\\\Gamma_{33}^{1} & =-r\sin^{2}\theta\ \ \ \ \ \ \ \ \ \ \left(1.46\right)\\\Gamma_{12}^{2} & =\Gamma_{21}^{2}=\frac{1}{r}\ \ \ \ \ \ \ \ \ \ \left(1.47\right)\\\Gamma_{33}^{2} & =-\sin\theta\cos\theta\ \ \ \ \ \ \ \ \ \ \left(1.48\right)\\\Gamma_{13}^{3} & =\Gamma_{31}^{3}=\frac{1}{r}\ \ \ \ \ \ \ \ \ \ \left(1.49\right)\\\Gamma_{23}^{3} & =\Gamma_{32}^{3}=\frac{1}{\tan\theta}.\ \ \ \ \ \ \ \ \ \ \left(1.50\right)\end{aligned}

1.3.3The covariant derivative

For a variant

T_{j}^{i}

with a representative collection of indices, the covariant derivative

\nabla_{k}

is given by the identity

\nabla_{k}T_{j}^{i}=\frac{\partial T_{j}^{i}}{\partial Z^{k}}-\Gamma_{kj} ^{m}T_{m}^{i}+\Gamma_{km}^{i}T_{j}^{m}.\tag{1.51}

The cornerstone feature of the covariant derivative is its tensor property -- namely, that it produces tensor outputs for tensor inputs. Furthermore, the covariant derivative satisfies the sum rule, the product rule, commutes with contraction. It coincides with the partial derivative in affine coordinates. Finally, it is metrinilic with respect to all of the fundamental tensors in a Euclidean space, i.e.

\nabla_{l}\mathbf{Z}_{i},\ \text{\ }\nabla_{l}\mathbf{Z}^{j}=\mathbf{0}\tag{1.52}

and

\nabla_{l}Z_{ij},\ \nabla_{l}\delta_{j}^{i},\ \nabla_{l}Z^{ij},\ \nabla _{l}\varepsilon_{ijk},\ \nabla_{l}\varepsilon^{ijk}=0.\tag{1.53}

Note that the metrinilic property follows easily by a combination of the tensor property and the fact that the covariant derivative coincides with the partial derivative in affine coordinates.

1.3.4The commutator $\nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}$ and the Riemann-Christoffel tensor

Covariant derivatives commute. In other words, the commutator

\nabla _{i}\nabla_{j}-\nabla_{j}\nabla_{i}

vanishes for all inputs, i.e.

\left( \nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\right) T^{k}=0.\tag{1.54}

Since it can be generally shown that

\nabla_{i}\nabla_{j}T^{k}-\nabla_{j}\nabla_{i}T^{k}=\left( \frac {\partial\Gamma_{jm}^{k}}{\partial Z^{i}}-\frac{\partial\Gamma_{im}^{k} }{\partial Z^{j}}+\Gamma_{in}^{k}\Gamma_{jm}^{n}-\Gamma_{jn}^{k}\Gamma _{im}^{n}\right) T^{m}\tag{1.55}

for any

T^{m}

, we conclude that

\frac{\partial\Gamma_{jm}^{k}}{\partial Z^{i}}-\frac{\partial\Gamma_{im}^{k} }{\partial Z^{j}}+\Gamma_{in}^{k}\Gamma_{jm}^{n}-\Gamma_{jn}^{k}\Gamma _{im}^{n}=0.\tag{1.56}

The combination on the left is a tensor known as the Riemann-Christoffel tensor

R_{\cdot mij}^{k}

, i.e.

R_{\cdot mij}^{k}=\frac{\partial\Gamma_{jm}^{k}}{\partial Z^{i}} -\frac{\partial\Gamma_{im}^{k}}{\partial Z^{j}}+\Gamma_{in}^{k}\Gamma_{jm} ^{n}-\Gamma_{jn}^{k}\Gamma_{im}^{n}.\tag{1.57}

Thus the fact that covariant derivatives commute is equivalent to the statement that the Riemann-Christoffel tensor vanishes, i.e.

R_{\cdot mij}^{k}=0.\tag{1.58}

Note that the logic that leads to this conclusion that covariant derivatives commute (and therefore

R_{\cdot mij}^{k}=0

) fundamentally relies on the Euclidean nature of the space. The argument goes like this. A Euclidean space admits an affine coordinate system where the covariant derivative coincides with the partial derivative, thus

\left( \nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\right) T^{k}=\left( \frac{\partial^{2}}{\partial Z^{i}\partial Z^{j}}-\frac{\partial^{2}}{\partial Z^{i}\partial Z^{j}}\right) T^{k}.\tag{1.59}

Meanwhile, as we know from ordinary Calculus, partial derivatives commute, i.e.

\frac{\partial^{2}T^{k}}{\partial Z^{i}\partial Z^{j}}=\frac{\partial^{2} T^{k}}{\partial Z^{i}\partial Z^{j}}.\tag{1.60}

Therefore, in affine coordinates, the commutator

\left( \nabla_{i}\nabla _{j}-\nabla_{j}\nabla_{i}\right) T^{k}

vanishes, i.e.

\left( \nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\right) T^{k}=0.\tag{1.61}

However, since

\left( \nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\right) T^{k}

is a tensor, it must vanish in all coordinate systems which completes the argument.

Note, that we can expect the same argument to work for developable surfaces since they admit regular affine-like coordinate systems.

Conclusion to follow.

Introduction

1.1Curvature and Tensor Calculus

1.2The three degrees of curvature

1.2.1Curvilinear coordinates in a Euclidean space

1.2.2Developable surfaces

1.2.3Intrinsically curved surfaces

1.3A brief review of Euclidean spaces and the tensor framework

1.3.1Index juggling

1.3.2The Christoffel symbols

1.3.3The covariant derivative

1.3.4The commutator ∇i∇j−∇j∇i\nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}∇i​∇j​−∇j​∇i​ and the Riemann-Christoffel tensor

1.3.4The commutator $\nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}$ and the Riemann-Christoffel tensor