1. Introduction

Mathematics is a tool for finding order within the chaotic reality. Oftentimes, however, mathematical methods create their own chaos, which impedes their progress. This is the case for the method of coordinates, which is deservedly considered one of the greatest breakthroughs in the history of mathematical thought. Tensor Calculus represents an analytical framework that harnesses the power of coordinates while mitigating the unwanted artifacts that otherwise accompany them. Thus, Tensor Calculus can be described as the art of using coordinate systems to gain deep insights into the nature of space and time and therefore a wide range of physical phenomena.

But this is not the main reason why you should read this book. You should read this book because Tensor Calculus is a breathtakingly beautiful subject that will help you organize your knowledge, and thus improve your understanding of fundamental Applied Mathematics -- in particular, multivariable Calculus and its interplay with Linear Algebra. Tensor Calculus will improve your ability to think geometrically, algebraically, and algorithmically at the same time. And, perhaps most importantly, I hope that you will take inspiration from this subject on how to choose a starting point in your investigations and proceed confidently from it.

1.1Tensor Calculus as a junction of Algebra and Geometry

Tensor Calculus is many things at once, but the original impetus behind its invention was the development of an analytical framework for preserving the geometric meaning in calculations involving coordinate systems. The resulting framework succeeds in achieving this goal beyond expectations. Tensor Calculus has proven to be a powerful tool for scientific investigations and has provided a path to many important discoveries in Mathematics and Physics.

The invention of coordinate systems in the seventeenth century ushered in the era of close interplay between Algebra and Geometry. The often quoted pronouncement by Lagrange perfectly captures the importance of that interaction: As long as algebra and geometry proceeded along separate paths, their advance was slow and their applications limited. But when these sciences joined forces, they drew from each other fresh vitality and thenceforward marched on at a rapid pace toward perfection.

Over the next three centuries, the cross-pollination of geometric and algebraic ideas led to the development of a number of analytical subjects built upon ideas from geometry. A central place among those subjects is occupied by Linear Algebra. While the modern applications of Linear Algebra go far beyond geometric problems, its origins are fundamentally geometric. Many of its applications that seemingly have little or nothing to do with geometry, such as deep learning, rely on essential geometric notions of transformation and distance.

In its relationship to Geometry, Tensor Calculus may be thought of as an extension of Linear Algebra. While Linear Algebra studies straight objects in straight spaces, Tensor Calculus studies curved objects in curved spaces. Straight objects can be studied with Algebra. The presence of curvature necessitates the use of Calculus. That is why Linear Algebra is an algebra while Tensor Calculus is a calculus.

1.2Tensor Calculus and coordinate systems

Coordinate systems unlock the power of analytical methods in geometric applications. However, the immense robustness of the coordinate approach also creates a tendency to use it with unnuanced bluntness. In some circumstances, a certain lack of finesse does not necessarily get us into trouble. Relatively simple calculations allow us to get away with it, as evidenced by many examples in Calculus and Physics textbooks that involve solving a special problem with the help of a special coordinate system. This creates the profoundly wrong impression that the mathematical description of the natural world can be constructed with such calculations. In actuality, while special coordinate systems are indeed well-suited for a wide range of tasks, some of the most imaginative, profound, and intellectually satisfying mathematical ideas lie well beyond their reach.

An even more fundamental problem lies not in which coordinate systems are used but how coordinate systems are used. Generally speaking, a problem solved by geometric means requires an individual approach and at least some degree of ingenuity. Analytical methods, on the other hand, are more universal and robust. As a result, there is a tendency to completely replace the geometric problem with an analytical one. However, in doing so, we risk throwing out the baby with the bathwater. When analytical calculations are disengaged from the geometry, it usually becomes impossible to reconstruct the geometric meaning of the eventual analytical answer. This point can be illustrated by Euler's analysis of minimal surfaces, i.e. surfaces of least area, found in his 1744 treatise Methodus inveniendi lineas curvas maximi minimive proprietate gaudentes, sive solutio problematis isoperimetrici lattissimo sensu accepti, which can be translated as A method for finding curves with properties of maximum or minimum, or solution of isoperimetric problems in the broadest possible sense. In this work, considered by many to be one of the most beautiful mathematical works ever written, Euler laid a foundation for a new branch of Mathematics that later became known as the Calculus of Variations.

As an illustration of his new method, Euler found a minimal surface of revolution which he showed to be catenoid, i.e. the surface of revolution generated by a catenary.

(1.1)

Operating in cylindrical coordinates, Euler described the unknown minimal surface by the function

r\left( z\right)

. He showed that

r\left( z\right)

satisfies the differential equation

r^{\prime\prime}\left( z\right) r\left( z\right) -r^{\prime}\left( z\right) ^{2}-1=0\tag{1.2}

which he then solved to reveal that

r\left( z\right) =a\cosh\frac{z}{a}\tag{1.3}

and thus describes a catenary. This was the first instance of an explicit equation for a minimal surface. Despite this singular breakthrough, Euler's approach did not reveal to him the geometric meaning of the differential equation -- namely, that a minimal surface is characterized by zero mean curvature

B_{\alpha}^{\alpha}

. This was realized only half a century later by the French mathematician Jean Baptiste Meusnier. This example teaches us that special care must be taken to preserve the connection between the analysis and the geometry.

Another unavoidable consequence of the loss of geometric meaning is the rapid growth in the complexity of the analytical expressions. Consider the expression for the mean curvature

B_{\alpha}^{\alpha}=\frac{\frac{\partial^{2}F}{\partial x^{2}}+\frac {\partial^{2}F}{\partial y^{2}}+\frac{\partial^{2}F}{\partial x^{2}}\left( \frac{\partial F}{\partial y}\right) ^{2}+\frac{\partial^{2}F}{\partial y^{2}}\left( \frac{\partial F}{\partial x}\right) ^{2}-2\frac{\partial F}{\partial x}\frac{\partial F}{\partial y}\frac{\partial^{2}F}{\partial x\partial y}}{\left( 1+\left( \frac{\partial F}{\partial x}\right) ^{2}+\left( \frac{\partial F}{\partial y}\right) ^{2}\right) ^{3/2}}\tag{1.4}

of a surface given by the equation

z=F\left( x,y\right)

in Cartesian coordinates. An expression of this complexity would be an appropriate destination of an analytical investigation, but certainly not an acceptable point of departure. Countless analyses never see the light of day as a result of the mathematician retreating in the face of overwhelming complexity.

The underlying reason for the loss of the geometric insight lies not in choosing the wrong coordinate system for the task, but in committing to a specific coordinate system too early in one's analysis. The use of any specific coordinate system invariably obscures the true meaning of the expressions by introducing the artifacts of that system's particular characteristics.

1.3Tensor Calculus in a nutshell

Tensor Calculus solves this problem by restricting its attention to algorithms that work simultaneously in all coordinate systems. This restriction is severe, but it ensures no specific coordinate system has the opportunity to let its artifacts obscure the geometric meaning. As a result, Tensor Calculus offers the best of both worlds: it produces universal analytical expressions with a clear geometric meaning that can be evaluated in any coordinate system according to a prescribed algorithm. Thus, the framework keeps Geometry and Algebra joined at the hip by embracing the use of coordinate systems while organizing the calculations in such a way that the geometric meaning (or something close to it) can be readily restored for any intermediate object. In order to achieve this result, Tensor Calculus relies on several elements.

First is the way coordinate systems are utilized. All coordinate systems used in the course of constructing the framework are completely general, i.e. never specified in a concrete way and never presumed to have any special features beyond some essential differentiability characteristics. Coordinates will be denoted by symbols with superscripts, e.g.

Z^{1}

Z^{2}

, and

Z^{3}

or, collectively,

Z^{i}

. The use of superscripts will be carefully explained starting with Chapter 7 on the tensor notation and finally culminating in Chapter 14 on the tensor property.

Second is the fundamental concepts of a variant and a tensor . A variant is an object that is constructed by a specific algorithm that references the coordinates in the same way in all coordinate systems. For example, for a temperature field

T

in the domain

\Omega

, the collection of partial derivatives with respect to the coordinates, i.e.

\frac{\partial T}{\partial Z^{1}}\text{, }\frac{\partial T}{\partial Z^{2} }\text{, and }\frac{\partial T}{\partial Z^{3}}\tag{1.5}

or, collectively,

\frac{\partial T}{\partial Z^{i}},\tag{1.6}

is a variant. It is a variant because the algorithm that reads differentiate

T

with respect to each of the coordinates is the same in all coordinate systems. By the same token, the collection of the second-order partial derivatives

\frac{\partial^{2}T}{\partial Z^{i}\partial Z^{j}}\tag{1.7}

is also a variant.

A variant is characterized by its order, i.e. the number of indices required to enumerate its elements. Since, at any given physical point, the temperature field

T

is a single number, it does not require any enumeration and is therefore considered to be a variant of order zero. The collection of the partial derivatives

\partial T/\partial Z^{i}

is a variant of order one, while the collection of second-order derivatives

\partial^{2}T/\partial Z^{i}\partial Z^{j}

is a variant of order two.

Variants are subject to a series of operations that include addition, multiplication, covariant differentiation, and contraction. Addition applies to variants of the same order and produces another variant of the same order. Multiplication applies to variants of arbitrary orders and produces a variant of the combined order. Covariant differentiation increases the order by one. Contraction decreases the order by two.

Crucially, the values of a variant change from one coordinate system to another -- thus the name variant. In the example above, the temperature field

T

itself does not change from one coordinate system to another. That makes it a special kind of variant known as an invariant. The values of

\partial T/\partial Z^{i}

and

\partial^{2}T/\partial Z^{i}\partial Z^{j}

, on the other hand, certainly vary from one coordinate system to another. The relationship between the values of a variant in different coordinate systems is known as the transformation rule.

This brings us to tensors. A tensor is a variant of a special kind that transforms from one coordinate system to another by a particular rule known as the tensor transformation rule. Crucially, there are two opposite flavors of tensor transformations: covariant and contravariant. The two transformation rules oppose each other in the sense of the matrix inverse. This inverse relationship is the key to the eventual elimination of the nongeometric artifacts of coordinate systems. When the covariant mode encounters a contravariant mode and the accompanying transformation matrix meets its inverse, the operation of contraction forces the two matrices to multiply and therefore cancel each other's influences, thus bringing the combination a step closer to invariance.

Tensors possess three properties that enable them to faithfully preserve the geometry of the initial problem. First, the components of a geometric vector form a tensor. Second, tensors are closed under addition, multiplication, covariant differentiation, and contraction. Third, when a tensor is reduced to order zero by contraction or a series of contractions, the resulting value is the same in all coordinate systems. Checkmate.

A variant of order zero, i.e. a single quantity, that has the same value in all coordinate systems is known as an invariant. Since the value of an invariant does not depend on the choice of coordinates, it must have an independent geometric significance. Experience shows that for simpler invariants, we are able to find a way to calculate the same value by a geometric construction.

The overall logic of the tensor framework is, therefore, not surprising since it follows that of component spaces in Linear Algebra. Component space analysis in Linear Algebra starts with decomposition, i.e. the translation of "real life" objects into their components with respect to a basis. The bulk of the analysis is subsequently conducted in terms of the components, thanks to the analytical robustness of the component space which has recently been boosted further by the relatively newly-acquired ability to delegate complex computational tasks to electronic devices. In the final step, the results of the component analysis are reinterpreted in the real world by recombining their components with the original basis.

In Tensor Calculus, the first step is the same: decomposition with respect to a basis. The fact that the components of a vector form a tensor ensures that the analysis starts in the tensor space. The bulk of the analysis then proceeds in the component space. We must therefore develop a framework that ensures that all intermediate objects are tensors. Much of our energies will be directed towards this task. The main difficulty lies with differentiation, as it does not preserve the tensor property, and the main breakthrough is achieved with the introduction of the covariant derivative. The covariant derivative is an extension of the ordinary derivative that preserves the tensor property. It therefore replaces the ordinary derivative in all aspects of the analysis. Having thus assured that the tensor property of objects is preserved throughout the analysis, contraction produces an invariant in the final step, thus bringing us back to "real life".

In summary, Tensor Calculus combines the best of the geometric and algebraic worlds. It enables us to leverage the unmatched utility of coordinate systems in a way that faithfully preserves the geometric meaning of the objects under investigation.