To my generous Patreon editors: please note that this Chapter is not quite ready to be edited. As
it often happens, the introductory chapter will be the last one to be completed before the book is
done!
This is a continuation of An Introduction to Tensor Calculus which was devoted to the
construction of the analytical framework of Tensor Calculus. In this book, we will use the Tensor
Calculus framework to study shape -- in other words, curvature.
Historically, the development of understanding of curvature unfolded in two distinct phases. The
first phase, which lasted until about the middle of the nineteenth century, was preoccupied with
gaining insight into curvature by developing measures for quantifying it and discovering
fundamental laws relating those measures. This phase is associated primarily with the
groundbreaking works of Leonhard Euler and Carl Friedrich Gauss. The second phase, originated by
Gauss and advanced by Bernhard Riemann and Albert Einstein, is aimed at understanding the geometric
properties of the underlying space itself.
Our narrative will speak to both of these elements, but it will have a different focal point. Our
focus will be on the development of the analytical framework for studying curvature.
1.1Curvature and Tensor Calculus
Although spectacular in terms of achievement, development of humanity's understanding of curvature
has been laborious and slow. Meanwhile, the tensor framework will make most of the concepts and
equations appear deceptively simple as many of the most profound and hard-fought facts will fall
before us like dominos. For example, Gauss's famous Theorema Egregium -- the Remarkable
Theorem -- will come across as almost entirely self-evident. His celebrated equations of the
surface
\[
B_{\alpha\gamma}B_{\beta\delta}-B_{\beta\gamma}B_{\alpha\delta}=R_{\alpha \beta\gamma\delta}
\tag{7.38}
\]
will be derived in an exercise in the next Chapter. Meanwhile, Euler's remarkable
discovery that the surface swept out by tangents to a three-dimensional curve is developable will,
too, be found in an exercise. In fact, most of the facts that might occur to us naturally, some
found in exercises, would have constituted results worthy of an admirable nineteenth-century
doctoral thesis.
Such retrospective ease is not unique to Tensor Calculus but is a general phenomenon of human
forward movement. In music, it is said, the dissonance of today is the consonance of
tomorrow. This law of evolution of human thought should in no way diminish our admiration for
the remarkable achievements of the great minds of the past. As we celebrate our superior tools of
the trade, we should contemplate what has been lost because of them.
1.2The three degrees of curvature
Let us begin by having an informal discussion of curvature and, in particular, its interplay with
coordinates systems. The crucial observations made in this Section, although stated as facts, are
meant to appeal to your geometric intuition and to guide our future analytical
investigations.
1.2.1Curvilinear coordinates in a Euclidean space
In Introduction to Tensor Calculus, we devoted much of our attention to Euclidean spaces
characterized by straightness. A Euclidean space can accommodate a straight line in any direction
and can therefore be referred to an affine -- and, in particular, Cartesian -- coordinate system.
Nevertheless, in almost all of our theoretical explorations we assumed a more general curvilinear
coordinate system.
(1.1)
Our
primary motivation for this choice was to eliminate the possible artifacts of a special coordinate
system. Thus, we did not assume that the chosen coordinate system had any special features, such as
straight regularly-spaced coordinate lines. When we did, on rare occasions, resort to affine
coordinates, it was to specifically call attention to the Euclidean nature of our space. Our
diligent self-limitation paid off as it led us to develop a more general framework of great
robustness.
However, in order to arrive at such a framework, we had to first contend the curvature effects
associated with curvilinear coordinates. The most obvious and striking complication lies in the
fact that the covariant basis \(\mathbf{Z}_{i}\), which is regular (i.e. constant) in affine
coordinates, varies from one point to the next in curvilinear coordinates.
(1.2)
As a result, vectors at different points cannot be compared,
added, or otherwise related in any way by analyzing their components in the familiar Cartesian
fashion. Furthermore, the partial derivative
\[
\frac{\partial}{\partial Z^{i}}\tag{1.3}
\]
applied to the components of a vector field does not fully capture its rate of change since
it ignores the variability in the accompanying basis. This loss of geometric information manifested
itself in the lack of the tensor property of the partial derivative of a tensor.
Outside of our geometric study of curves, this was our first encounter with the effects of
curvature. However, this curvature is of the spurious nature: since a Euclidean space is
fundamentally straight, these effects are artifacts of our analysis and, as a result, can be
completely mitigated. This is accomplished by introducing the covariant derivative \(\nabla_{i}\).
The covariant derivative restores the tensor property, possesses the metrinilic property with
respect to the basis, i.e.
\[
\nabla_{i}\mathbf{Z}_{j}=\mathbf{0}\tag{1.4}
\]
which is akin to the affine property
\[
\frac{\partial\mathbf{i}}{\partial Z^{i}}=\frac{\partial\mathbf{j}}{\partial
Z^{i}}=\frac{\partial\mathbf{k}}{\partial Z^{i}}=\mathbf{0},\tag{1.5}
\]
and preserves the product rule.
In summary, the tensor framework handles this lowest "degree" of curvature rather easily while
offering a great deal of insight.
1.2.2Developable surfaces
The next "degree" of curvature is represented by surfaces that can be formed by smoothly curving a
flat sheet of paper, i.e. a region of a plane, without any distortion. For a transformation to not
introduce any distortion it must preserve the distance between any two points as measured within
the surface. We must note, however, that the concept of distance as measured within the
surface is arguably unclear and needs to be precisely defined. Nevertheless, such
transformations are known as isometric and surfaces that can be isometrically transformed
into a region of a plane are known as developable. Thus, we can continue to think of a
developable surface as a Euclidean plane from the viewpoint of their internal geometry but not from
the overall three-dimensional point of view.
(1.6)
Developable surfaces were first investigated by Euler in his 1772 work De solidis quorum
superficiem in planum explicare licet or On shapes that can be unwrapped onto a plane.
The classical examples of developable surfaces are the cylinder and the cone.
(1.7)
Note that these two surfaces need to be cut in order to be
transformed into a region of a plane.
Another general type of a developable surface is known as the tangent developable of a
curve. It is the surface swept out by the tangent lines to a three-dimensional curve.
(1.8)
In De solidis quorum superficiem in planum explicare
licet, Euler not only demonstrated that the tangent developable is, indeed, a developable
surface, but that every developable surface that is not a cylinder or a cone is a tangent
developable for some curve.
On the one hand, developable surfaces are undeniably curved. On the other hand, they allow regular
coordinate systems which can be accomplished easily by referring the flat sheet of paper to a
regular coordinate system prior to curving it into the final shape, as illustrated in the following
figure.
(1.9)
Of course, we must clarify the
sense in which a coordinate system on the curve surface is regular. Indeed, in the above
figure, all of the coordinate lines are curved and the coordinate nodes are not arranged in any
kind of regular pattern in the three-dimensional space. Yet, it is intuitively clear that, with
respect to the surface, the coordinate lines are indeed regular. Thus, there is a clear
dichotomy between how the image appears in the overall three-dimensional space and how it is
perceived within the surface. We will use the term intrinsic to describe the geometric
action that takes place strictly in the surface without a reference to ambient objects. For
example, we will describe developable surfaces as curved in the three-dimensional sense and as flat
in the intrinsic sense.
A surface swept by straight lines (which are not necessarily the tangents of a given curve) is
known as a ruled surface. We will show that every developable surface is ruled. This
becomes intuitively clear when inspecting a curved sheet of paper and noting that, at every point,
the direction in which the sheet is straight continues to infinity or, at least, to the edge of the
sheet.
(1.10)
The shape of a fluttering flag is often used as an example of
a developable surface since a flag, it would seem, can be flattened out without distortion.
However, a fluttering flag is clearly not a ruled surface since it is not developable, as seen in
the following figure. (1.11)
Therefore, modeling it as a developable surface will
inevitably fail to capture some of its properties.
Finally, note that not all ruled surfaces are developable as evidenced by the parabolic hyperboloid
in the following figure.
(1.12)
In our pursuit of understanding curvature, one of our overarching goals will be to derive
analytical criteria for developable surfaces.
1.2.3Intrinsically curved surfaces
Non-developable surfaces represent the third, highest "degree" of curvature. Such surfaces can be
considered to be curved in a more profound way since they cannot be flattened without distortion.
For example, a sphere -- or any part, no matter how small, thereof -- cannot be unwrapped onto a
region of a plane without distortion. This is a well-known phenomenon in cartography when one
attempts to represent the round Earth on a two dimensional map. When one attempts to do so in a
smooth fashion, significant distortion results.
(1.13)
The amount of distortion can be reduced with the help of
tears, but it cannot be eliminated. (1.14)
It follows from these observations that a non-developable surface does not admit an intrinsically
regular coordinate system. This has important implications since a number of arguments used in
Introduction to Tensor Calculus relied on the availability of affine coordinates. In
particular, the fact the Riemann-Christoffel tensor vanishes fundamentally relied on affine
coordinates. We can therefore expect that the corresponding surface object continues to vanish for
developable surfaces but not for non-developable surfaces. In fact, this holds the promise of
yielding a criterion for developable surfaces which will be explored in Chapter 7.
1.3A brief review of Euclidean spaces and the tensor framework
Let us now summarize the key concepts and notation introduced in Introduction to Tensor
Calculus.
The position vector \(\mathbf{R}\) at a point \(P\) in a Euclidean space is the vector that
emanates from an arbitrary fixed point \(O\), known as the origin, and terminates \(P\).
Refer the Euclidean space to arbitrary coordinates \(Z^{i}\). Then the position vector
\(\mathbf{R}\) becomes a function of the coordinates which, in a three-dimensional space reads
\[
\mathbf{R}=\mathbf{R}\left( Z^{1},Z^{2},Z^{3}\right) .\tag{1.15}
\]
We typically abbreviate such expressions by representing the enumerated arguments of
functions by a single letter, i.e.
\[
\mathbf{R}=\mathbf{R}\left( Z\right) .\tag{1.16}
\]
The covariant basis \(\mathbf{Z}_{i}\) is given by the partial derivative
\[
\mathbf{Z}_{i}=\frac{\partial\mathbf{R}\left( Z\right) }{\partial Z^{i}}.\tag{1.17}
\]
The covariant metric tensor \(Z_{ij}\) is obtained by pairwise dot products of the
covariant basis vectors, i.e.
\[
Z_{ij}=\mathbf{Z}_{i}\cdot\mathbf{Z}_{j}.\tag{1.18}
\]
The contravariant metric tensor \(Z^{ij}\) is the matrix inverse of \(Z_{ij}\), i.e.
\[
Z^{ij}Z_{jk}=\delta_{k}^{i},\tag{1.19}
\]
where \(\delta_{k}^{i}\) is the Kronecker delta system. The contravariant basis
\(\mathbf{Z}^{i}\) is given by the contraction
\[
\mathbf{Z}^{i}=Z^{ij}\mathbf{Z}_{j}.\tag{1.20}
\]
The covariant and the contravariant bases are mutually orthogonal, i.e.
\[
\mathbf{Z}^{i}\cdot\mathbf{Z}_{j}=\delta_{j}^{i}.\tag{1.21}
\]
Also, the last property can serve as an alternative definition of the contravariant basis
\(\mathbf{Z}^{i}\). The determinant of the covariant metric tensor \(Z_{ij}\) is denoted by \(Z\)
and its square root, \(\sqrt{Z}\), is referred to as the volume element.
The permutation systems \(e_{ijk}\) and \(e^{ijk}\) have the following values
\[
e_{ijk},e^{ijk}=\left\{ \begin{array} {ll} \phantom{+} 1\text{,} & \text{if }ijk\text{ is an even
permutation of the numbers }1,2,3\\ -1\text{,} & \text{if }ijk\text{ is an odd permutation of the
numbers }1,2,3\\ \phantom{+} 0\text{,} & \text{if }ijk\text{ is not a permutation of the numbers
}1,2,3. \end{array} \right.\tag{1.22}
\]
The Levi-Civita symbols \(\varepsilon_{ijk}\) and \(\varepsilon^{ijk}\) are defined by the
identities
\[
\begin{aligned}
\varepsilon_{ijk} & =\sqrt{Z}e_{ijk}\text{ and}\ \ \ \ \ \ \ \ \ \ \left(1.23\right)\\
\varepsilon^{ijk} & =\frac{e^{ijk}}{\sqrt{Z}}.\ \ \ \ \ \ \ \ \ \ \left(1.24\right)
\end{aligned}
\]
The complete delta system \(\delta_{rst}^{ijk}\) is given by
\[
\delta_{rst}^{ijk}=e^{ijk}e_{rst}\tag{1.25}
\]
or, equivalently,
\[
\delta_{rst}^{ijk}=\varepsilon^{ijk}\varepsilon_{rst}\tag{1.26}
\]
and can be expressed in terms of the Kronecker delta by the identity
\[
\delta_{rst}^{ijk}=\delta_{r}^{i}\delta_{s}^{j}\delta_{t}^{k}+\delta_{s}
^{i}\delta_{t}^{j}\delta_{r}^{k}+\delta_{t}^{i}\delta_{r}^{j}\delta_{s}
^{k}-\delta_{r}^{i}\delta_{t}^{j}\delta_{s}^{k}-\delta_{t}^{i}\delta_{s}
^{j}\delta_{r}^{k}-\delta_{s}^{i}\delta_{r}^{j}\delta_{t}^{k}.\tag{1.27}
\]
The partial delta system \(\delta_{rs}^{ij}\) has the values
\[
\tag{1.28}
\]
delta_{rs}^{ij}=left{ begin{tabular} {ll} \( \begin{array} {c} \\ \phantom{+} 1\text{, }\\
\\ \phantom{+} \end{array} \) & \( \begin{array} {l} \\ \text{if the superscripts and the
subscripts are identical sets}\\ \text{of distinct numbers related by an \textit{even}
permutation}\\ \phantom{+} \end{array} \) \( \begin{array} {c} -1\text{,}\\ \\ \phantom{+}
\end{array} \) & \( \begin{array} {l} \text{if the superscripts and the subscripts are identical
sets}\\ \text{of distinct numbers related by an \textit{odd} permutation}\\ \phantom{+} \end{array}
\) \( \begin{array} {c} \phantom{+} 0\text{,}\\ \end{array} \) & \( \begin{array} {c} \text{for all
other combinations of indices.}\\ \end{array} \) end{tabular} right. end{equation} It
satisfies the identity
\[
\delta_{rs}^{ij}=\delta_{r}^{i}\delta_{s}^{j}-\delta_{r}^{j}\delta_{s}^{i}\tag{1.29}
\]
and can be obtained from the complete delta system \(\delta_{rst}^{ijk}\) by a single
contraction, i.e.
\[
\delta_{rs}^{ij}=\delta_{rsk}^{ijk}.\tag{1.30}
\]
Thus, we also have the frequently used identity
\[
\delta_{rsk}^{ijk}=\delta_{r}^{i}\delta_{s}^{j}-\delta_{r}^{j}\delta_{s}^{i}.\tag{1.31}
\]
1.3.1Index juggling
The metric tensors are used to implement the operations of index juggling. Contraction with
the covariant metric tensor \(Z_{ij}\) is captured by index lowering, e.g.
\[
T_{ik}=Z_{ij}T_{\cdot k}^{j}.\tag{1.32}
\]
Contraction with the contravariant metric tensor \(Z^{ij}\) is captured by index
raising, e.g.
\[
T_{\cdot k}^{i}=Z^{ij}T_{jk}.\tag{1.33}
\]
The two operations are the inverses of each other.
1.3.2The Christoffel symbols
The Christoffel symbol \(\Gamma_{jk}^{i}\) is given by
\[
\Gamma_{jk}^{i}=\mathbf{Z}^{i}\cdot\frac{\partial\mathbf{Z}_{j}}{\partial Z^{k}}\tag{1.34}
\]
or, equivalently, by
\[
\Gamma_{jk}^{i}=\frac{1}{2}Z^{im}\left( \frac{\partial Z_{mj}}{\partial Z^{k}}+\frac{\partial
Z_{mk}}{\partial Z^{j}}-\frac{\partial Z_{jk}}{\partial Z^{m}}\right) .\tag{1.35}
\]
The symbol \(\Gamma_{jk}^{i}\) is sometimes referred to as the Christoffel symbol of the
second kind. The Christoffel symbol of the first kind \(\Gamma_{i,jk}\) is obtained by lowering the
index \(i\) on \(\Gamma_{jk}^{i}\), i.e.
\[
\Gamma_{i,jk}=Z_{im}\Gamma_{jk}^{m}.\tag{1.36}
\]
Alternatively, \(\Gamma_{i,jk}\) is given by
\[
\Gamma_{i,jk}=\frac{1}{2}\left( \frac{\partial Z_{ij}}{\partial Z^{k}} +\frac{\partial
Z_{ik}}{\partial Z^{j}}-\frac{\partial Z_{jk}}{\partial Z^{i} }\right) .\tag{1.37}
\]
The partial derivative of the covariant metric tensor \(Z_{ij}\) with respect to \(Z^{k}\)
is given by the identity
\[
\frac{\partial Z_{ij}}{\partial Z^{k}}=\Gamma_{i,jk}+\Gamma_{j,ik}.\tag{1.38}
\]
The derivative of the volume element \(\sqrt{Z}\) is given by
\[
\frac{\partial\sqrt{Z}}{\partial Z^{i}}=\sqrt{Z}\Gamma_{ji}^{j}.\tag{1.39}
\]
The divergence \(\nabla_{i}U^{i}\) of a variant field \(U^{i}\) is given by the Voss-Weyl
formula
\[
\nabla_{i}U^{i}=\frac{1}{\sqrt{Z}}\frac{\partial}{\partial Z^{i}}\left( \sqrt{Z}U^{i}\right)
.\tag{1.40}
\]
Correspondingly, the Laplacian \(\nabla_{i}\nabla^{i}U\) of a scalar field \(U\) is given by
\[
\nabla_{i}\nabla^{i}U=\frac{1}{\sqrt{Z}}\frac{\partial}{\partial Z^{i}}\left(
\sqrt{Z}Z^{ij}\frac{\partial U}{\partial Z^{j}}\right) .\tag{1.41}
\]
The Christoffel symbol vanishes in affine coordinates, i.e.
\[
\Gamma_{jk}^{i}=0.\tag{1.42}
\]
Its nonzero elements in cylindrical coordinates \(r,\theta,z\) are
\[
\begin{aligned}
\Gamma_{22}^{1} & =-r\ \ \ \ \ \ \ \ \ \ \left(1.43\right)\\
\Gamma_{12}^{2} & =\Gamma_{21}^{2}=\frac{1}{r}.\ \ \ \ \ \ \ \ \ \ \left(1.44\right)
\end{aligned}
\]
while in spherical coordinates \(r.\theta,\varphi\) they are
\[
\begin{aligned}
\Gamma_{22}^{1} & =-r\ \ \ \ \ \ \ \ \ \ \left(1.45\right)\\
\Gamma_{33}^{1} & =-r\sin^{2}\theta\ \ \ \ \ \ \ \ \ \ \left(1.46\right)\\
\Gamma_{12}^{2} & =\Gamma_{21}^{2}=\frac{1}{r}\ \ \ \ \ \ \ \ \ \ \left(1.47\right)\\
\Gamma_{33}^{2} & =-\sin\theta\cos\theta\ \ \ \ \ \ \ \ \ \ \left(1.48\right)\\
\Gamma_{13}^{3} & =\Gamma_{31}^{3}=\frac{1}{r}\ \ \ \ \ \ \ \ \ \ \left(1.49\right)\\
\Gamma_{23}^{3} & =\Gamma_{32}^{3}=\frac{1}{\tan\theta}.\ \ \ \ \ \ \ \ \ \ \left(1.50\right)
\end{aligned}
\]
1.3.3The covariant derivative
For a variant \(T_{j}^{i}\) with a representative collection of indices, the covariant derivative
\(\nabla_{k}\) is given by the identity
\[
\nabla_{k}T_{j}^{i}=\frac{\partial T_{j}^{i}}{\partial Z^{k}}-\Gamma_{kj}
^{m}T_{m}^{i}+\Gamma_{km}^{i}T_{j}^{m}.\tag{1.51}
\]
The cornerstone feature of the covariant derivative is its tensor property -- namely, that it
produces tensor outputs for tensor inputs. Furthermore, the covariant derivative satisfies the sum
rule, the product rule, commutes with contraction. It coincides with the partial derivative in
affine coordinates. Finally, it is metrinilic with respect to all of the fundamental tensors in a
Euclidean space, i.e.
\[
\nabla_{l}\mathbf{Z}_{i},\ \text{\ }\nabla_{l}\mathbf{Z}^{j}=\mathbf{0}\tag{1.52}
\]
and
\[
\nabla_{l}Z_{ij},\ \nabla_{l}\delta_{j}^{i},\ \nabla_{l}Z^{ij},\ \nabla _{l}\varepsilon_{ijk},\
\nabla_{l}\varepsilon^{ijk}=0.\tag{1.53}
\]
Note that the metrinilic property follows easily by a combination of the tensor property and
the fact that the covariant derivative coincides with the partial derivative in affine coordinates.
1.3.4The commutator \(\nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\) and the Riemann-Christoffel tensor
Covariant derivatives commute. In other words, the commutator \(\nabla
_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\) vanishes for all inputs, i.e.
\[
\left( \nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\right) T^{k}=0.\tag{1.54}
\]
Since it can be generally shown that
\[
\nabla_{i}\nabla_{j}T^{k}-\nabla_{j}\nabla_{i}T^{k}=\left( \frac
{\partial\Gamma_{jm}^{k}}{\partial Z^{i}}-\frac{\partial\Gamma_{im}^{k} }{\partial
Z^{j}}+\Gamma_{in}^{k}\Gamma_{jm}^{n}-\Gamma_{jn}^{k}\Gamma _{im}^{n}\right) T^{m}\tag{1.55}
\]
for any \(T^{m}\), we conclude that
\[
\frac{\partial\Gamma_{jm}^{k}}{\partial Z^{i}}-\frac{\partial\Gamma_{im}^{k} }{\partial
Z^{j}}+\Gamma_{in}^{k}\Gamma_{jm}^{n}-\Gamma_{jn}^{k}\Gamma _{im}^{n}=0.\tag{1.56}
\]
The combination on the left is a tensor known as the Riemann-Christoffel tensor \(R_{\cdot
mij}^{k}\), i.e.
\[
R_{\cdot mij}^{k}=\frac{\partial\Gamma_{jm}^{k}}{\partial Z^{i}}
-\frac{\partial\Gamma_{im}^{k}}{\partial Z^{j}}+\Gamma_{in}^{k}\Gamma_{jm}
^{n}-\Gamma_{jn}^{k}\Gamma_{im}^{n}.\tag{1.57}
\]
Thus the fact that covariant derivatives commute is equivalent to the statement that the
Riemann-Christoffel tensor vanishes, i.e.
\[
R_{\cdot mij}^{k}=0.\tag{1.58}
\]
Note that the logic that leads to this conclusion that covariant derivatives commute (and therefore
\(R_{\cdot mij}^{k}=0\)) fundamentally relies on the Euclidean nature of the space. The argument
goes like this. A Euclidean space admits an affine coordinate system where the covariant derivative
coincides with the partial derivative, thus
\[
\left( \nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\right) T^{k}=\left( \frac{\partial^{2}}{\partial
Z^{i}\partial Z^{j}}-\frac{\partial^{2}}{\partial Z^{i}\partial Z^{j}}\right) T^{k}.\tag{1.59}
\]
Meanwhile, as we know from ordinary Calculus, partial derivatives commute, i.e.
\[
\frac{\partial^{2}T^{k}}{\partial Z^{i}\partial Z^{j}}=\frac{\partial^{2} T^{k}}{\partial
Z^{i}\partial Z^{j}}.\tag{1.60}
\]
Therefore, in affine coordinates, the commutator \(\left( \nabla_{i}\nabla
_{j}-\nabla_{j}\nabla_{i}\right) T^{k}\) vanishes, i.e.
\[
\left( \nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\right) T^{k}=0.\tag{1.61}
\]
However, since \(\left( \nabla_{i}\nabla_{j}-\nabla_{j}\nabla_{i}\right) T^{k}\) is a
tensor, it must vanish in all coordinate systems which completes the argument.
Note, that we can expect the same argument to work for developable surfaces since they admit
regular affine-like coordinate systems.
Conclusion to follow.