To my Patrons who generously edit my book: this Chapter is not quite ready for editing. I have left
in a number of paragraphs that I will eventually modify or remove.
In a Euclidean space, the shortest path between two points is a straight line. The concept of a
geodesic is a generalization of this idea to curved surfaces and, more broadly, to Riemannian
spaces.
Given two points on a surface, a geodesic is an embedded curve of shortest length connecting the
two points. Before we go any further, we must note that the Euclidean experience teaches us that
the concept of a geodesic is closely connect to the question of curvature or, more precisely, that
of zero curvature. For a curve in a Euclidean space, straightness corresponds to the condition of
vanishing absolute curvature, i.e.
\[
\sigma=0.\tag{10.1}
\]
For a geodesic on a surface, the analogous criterion is the condition of vanishing geodesic
curvature, i.e.
\[
b_{\Phi}^{\Phi}=0.\tag{10.2}
\]
Establishing this equation and exploring its implications are the central goals of this
Chapter.
The condition of zero geodesic curvature is necessary but not sufficient for a curve to be the
shortest path between two points. However, we will eventually broaden the definition of a geodesic
to include any embedded curve of zero geodesic curvature even if it does not represent the shortest
path between two given points.
10.1Geometric insights
Before we turn to our analytical investigations let us begin my making a few insightful geometric
observations.
For a developable surface, constructing a geodesic is a straightforward task. Since a developable
surface can be transformed into a region of a plane without distortion, every geodesic maps to a
straight line in corresponding planar region -- provided that the surface does not require a cut
that passes right through the geodesic in order to be transformed into a section of a plane.
(10.3)
This observation suggests a practical approach for constructing the geodesic between two points on
a developable surface and that is to 1. isometrically transform the surface onto a region of a
plane, 2. connect the images of the two points by a straight line, provided that line is wholly
contained within the region of the plane, and 3. isometrically transform the region of a plane back
into the original surface.
(10.4)
(10.5)
Of course, this method would yield an incorrect curve if the developable surface required a cut
that passed through the actual geodesic. This possibility is illustrated on the surface of a
cylinder in the following figure.
(10.6)
Let us now turn our attention to the sphere. On the surface of a sphere, the shortest path
connecting two points is a segment of a great circle.
(10.7)
Two
distinct points on the surface of a sphere uniquely determine a great circle. Naturally, the
shorter of the two circular segments represents the shortest distance between two points. Recall
from Chapter 9 that the geodesic curvature of a
great circle is zero. This is our first, albeit narrow, confirmation that geodesics are
characterized by zero curvature.
The sphere also clearly shows us that geodesics are not unique: all "meridians" simultaneously
represent the shortest distance between two diametrically opposed points.
(10.8)
10.2In search of the geodesic equation
Consider the practical task of finding the geodesic connecting two points \(A\) and \(B\) on a
surface specified by analytical means. This task can be accomplished by formulating and solving a
set of ordinary differential equations, known collectively as the geodesic equation, for the
functions \(S^{\alpha}\left( \gamma\right) \) representing the equations of the curve
\[
S^{\alpha}=S^{\alpha}\left( \gamma\right) .\tag{10.9}
\]
If our intuition regarding the relevance of curvature is correct (and it is!), then
we should expect a system of second-order differential equations.
As our discussion will feature numerous instances of the derivatives of \(S^{\alpha}\left(
\gamma\right) \), let us borrow the "dot" notation from the Calculus of Variations and use the
symbols \(\dot{S}^{\alpha}\) and \(\ddot {S}^{\alpha}\) to denote those derivatives, i.e.
\[
\begin{aligned}
\dot{S}^{\alpha} & =\frac{dS^{\alpha}\left( \gamma\right) }{d\gamma}\ \ \ \ \ \ \ \ \ \
\left(10.10\right)\\
\ddot{S}^{\alpha} & =\frac{d^{2}S^{\alpha}\left( \gamma\right) } {d\gamma^{2}}.\ \ \ \ \ \ \ \
\ \ \left(10.11\right)
\end{aligned}
\]
You will also find it very helpful to think of \(\dot{S}^{\alpha}\) as the speed with which
an image particle moves along the curve and and \(\ddot{S}^{\alpha}\) as its acceleration.
We will derive the geodesic equation in two ways. In the first approach, we will expand the zero
geodesic curvature condition
\[
b_{\Phi}^{\Phi}=0 \tag{10.2}
\]
in terms of the underlying functions \(S^{\alpha}\left( \gamma\right) \). In the second
approach, we will apply the techniques of the Calculus of Variations to the length functional
\[
L=\int_{a}^{b}\sqrt{S_{\alpha\beta}\dot{S}^{\alpha}\dot{S}^{\beta}}d\gamma.\tag{10.12}
\]
Regardless of the approach, however, we must recognize that such a system cannot be of the usual
"Newton's second law" form
\[
\ddot{S}^{\alpha}=F^{\alpha}\left( \gamma,S,\dot{S}\right) ,\tag{10.13}
\]
i.e. one where the leading-order derivatives are expressed as explicit functions of the
lower-order ones. The fundamental reason for that is this: while the solutions of such systems are
-- generally speaking -- unique, the functions \(S^{\alpha}\left( \gamma\right) \) cannot be
unique.
The unavoidable lack of uniqueness is, of course, due to the possibility of reparameterization.
After all, we have not restricted the parameterization of the geodesic in any way. As a result,
there are infinitely many functions that represent the geodesic connecting the points \(A\) and
\(B\). For example, suppose the geodesic is represented by the equations
\[
S^{\alpha}=S^{\alpha}\left( \gamma\right)\tag{10.14}
\]
where, say, \(A\) corresponds to \(\gamma=0\) and \(B\) corresponds to \(\gamma=1\). Then it
is likewise represented by the equations
\[
S^{\alpha}=S^{\alpha}\left( \sin\frac{\pi\gamma}{2}\right) ,\tag{10.15}
\]
or, more generally, by the equations
\[
S^{\alpha}=S^{\alpha}\left( f\left( \gamma\right) \right) ,\tag{10.16}
\]
where \(f\left( \gamma\right) \) is any strictly monotonically increasing function from
\(0\) to \(1\). To use a physical analogy, such a reparameterization changes the speed of a
particle moving along a trajectory but not the trajectory itself.
This a priori lack of uniqueness dictates that we must expect two features from the eventual
system of differential equations. First, the system will be of the form
\[
F^{\alpha}\left( \gamma,S,\dot{S},\ddot{S}\right) =0\tag{10.17}
\]
where the leading derivatives are engaged in a set of implicit functions that cannot be
solved uniquely for the leading derivatives \(\ddot{S}^{\alpha }\left( \gamma\right) \). And,
second, in order to achieve uniqueness, the system must require to be supplemented by a
specification of the precise manner in which the parameterization evolves along the curve. For
example, we may wish, as is often done, to specify that the geodesic is parameterized by arc
length. Then the supplementary condition would be
\[
S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}=1.\tag{10.18}
\]
10.3Intuitive considerations
In this Section, we will describe several intuitive geometric reasons why geodesics are
characterized by zero geodesic curvature, i.e.
\[
b_{\Phi}^{\Phi}=0. \tag{10.2}
\]
Recall that the geodesic curvature is a measure of how much the curve bends within the
surface. The quintessential example of this perspective is the difference between of a "straight"
road and a curved one.
(10.19)
Surely, the shortest path between two points on
the surface of the Earth is represented by a "straight" road, i.e. a curve characterized by zero
geodesic curvature. Of course, a "straight" road is not actually straight but follows a great
circle which, as we reminded ourselves earlier, is characterized by zero geodesic curvature.
We may also notice that a great circle has the property that its principal curvature \(\mathbf{P}\)
is aligned with the surface normal \(\mathbf{N}\). A little bit of mental experimentation should
convince us that this is a general characteristic of a geodesic. As a visual aid, take a
non-spherical curved surface, such as an egg, and draw the shortest path connecting two points.
(10.20)
If you visualize the curvature normal \(\mathbf{P}\) of the
resulting curve and the surface normal \(\mathbf{N}\) you will observe that the two vectors are
aligned. According to the equation of the three curvatures
\[
\sigma\mathbf{P}=B_{\beta}^{\alpha}T_{\alpha}T^{\beta}\mathbf{N}+b_{\Phi }^{\Phi}\mathbf{n,}
\tag{9.73}
\]
this relationship between \(\mathbf{P}\) and \(\mathbf{N}\) implies that the component of
the geodesic normal \(\mathbf{n}\) vanishes, i.e.
\[
b_{\Phi}^{\Phi}=0. \tag{10.2}
\]
This argument can be augmented by a physical thought experiment. A free particle moving in a
Euclidean space without the influence of any forces moves in a straight line which represents the
shortest path between any two points that it visits. Suppose now that the particle is constrained
to move within surface but without any additional influences, such as friction. Then two
things are intuitively clear about the motion of this particle. First, it travels along the
shortest path connecting any two points that it visits. Second, the force that constrains it to the
motion within the surface acts in the normal direction \(\mathbf{N}\). As a result, its
acceleration also points along \(\mathbf{N}\). This implies that the particle moves with constant
velocity and therefore, as we first established in Chapter TBD of Introduction to Tensor
Calculus, its acceleration, being \(\mathbf{B}=\sigma\mathbf{P}\), also points along the
principal normal \(\mathbf{P}\). Thus, \(\mathbf{N}\) and \(\mathbf{P}\) are collinear and
therefore and we have another confirmation of the fact that the geodesic curvature vanishes.
Note that this argument also establishes that the constraining force \(\mathbf{F}\) acting on the
particle is given by
\[
\mathbf{F}=B_{\beta}^{\alpha}T_{\alpha}T^{\beta}\mathbf{N,}\tag{10.21}
\]
which gives additional meaning to the normal acceleration \(B_{\beta}^{\alpha
}T_{\alpha}T^{\beta}\) and therefore also to the curvature tensor \(B_{\beta }^{\alpha}\).
10.4The expanded definition of a geodesic
Let us broaden the definition of a geodesic to include any surface curve characterized by zero
geodesic curvature, i.e.
\[
b_{\Phi}^{\Phi}=0. \tag{10.2}
\]
By definition, for a geodesic, the equation of three curvatures
\[
\sigma\mathbf{P}=B_{\beta}^{\alpha}T_{\alpha}T^{\beta}\mathbf{N}+b_{\Phi }^{\Phi}\mathbf{n}
\tag{9.73}
\]
reduces to
\[
\sigma\mathbf{P}=B_{\beta}^{\alpha}T_{\alpha}T^{\beta}\mathbf{N.}\tag{10.22}
\]
Thus, the absolute curvature \(\sigma\) of a geodesic equals the absolute value of the
normal curvature \(B_{\beta}^{\alpha}T_{\alpha}T^{\beta}\), i.e.
\[
\sigma=\left\vert B_{\beta}^{\alpha}T_{\alpha}T^{\beta}\right\vert .\tag{10.23}
\]
In this sense, among all possible embedded curves tangential to a given direction
\(T^{\alpha}\), a geodesic is the curve that is least curved or, to put it another way, is the one
that is as straight as possible.
10.5The geodesic equation from the zero geodesic curvature condition
Suppose that a geodesic is described by the equations of the curve
\[
S^{\alpha}=S^{\alpha}\left( \gamma\right) .\tag{10.24}
\]
The goal of this Section is to construct a set of ordinary differential equations that
determine \(S^{\alpha}\left( \gamma\right) \). As we discussed earlier in Section 10.2, we expect the resulting system to be of the implicit
form
\[
F^{\alpha}\left( \gamma,S,\dot{S},\ddot{S}\right) =0,\tag{10.25}
\]
that cannot be uniquely solved for the leading derivatives \(\ddot{S}\left( \gamma\right)
\). Furthermore, we should expect to find a system that requires to be augmented by the rule for
parameterizing the curve.
The task at hand can be accomplished by interpreting the analytical definition
\[
b_{\Phi}^{\Phi}=0 \tag{10.2}
\]
of a geodesic in terms the underlying functions \(S^{\alpha}\left( \gamma\right) \). For
the time being, let us switch from the symbol \(\gamma\) to \(S^{\Phi}\).
Recall that the geodesic curvature tensor \(b_{\Phi\Psi}\) is given by
\[
b_{\Phi\Psi}=n_{\alpha}\nabla_{\Phi}S_{\Psi}^{\alpha},\tag{9.67}
\]
and therefore
\[
b_{\Phi}^{\Phi}=n_{\alpha}\nabla^{\Phi}S_{\Phi}^{\alpha}.\tag{9.62}
\]
However, for our purposes, we need the more general identity
\[
\nabla^{\Phi}S_{\Phi}^{\alpha}=b_{\Phi}^{\Phi}n^{\alpha}.\tag{9.63}
\]
From this identity we see that the condition of zero geodesic curvature translates into
equations
\[
\nabla^{\Phi}S_{\Phi}^{\alpha}=0.\tag{10.26}
\]
Suppose that the equations of the curve are
\[
S^{\alpha}=S^{\alpha}\left( S\right) .\tag{10.27}
\]
Recall that the shift tensor \(S_{\Phi}^{\alpha}\) represents the derivative of
\(S^{\alpha}\left( S\right) \) with respect to the curve coordinate \(S^{\Phi} \), i.e.
\[
S_{\Phi}^{\alpha}=\frac{\partial S^{\alpha}\left( S\right) }{\partial S^{\Phi}}. \tag{9.12}
\]
By directly appealing to the definition of the covariant derivative, we have
\[
\nabla_{\Phi}S_{\Psi}^{\alpha}=\frac{\partial^{2}S^{\alpha}}{\partial S^{\Phi }\partial
S^{\Psi}}+S_{\Phi}^{\beta}\Gamma_{\beta\gamma}^{\alpha}S_{\Psi
}^{\gamma}-\Gamma_{\Phi\Psi}^{\Omega}S_{\Omega}^{\alpha}.\tag{10.28}
\]
Thus, \(\nabla^{\Phi}S_{\Phi}^{\alpha}\) is given by
\[
\nabla^{\Phi}S_{\Phi}^{\alpha}=S^{\Phi\Psi}\left( \frac{\partial^{2} S^{\alpha}}{\partial
S^{\Phi}\partial S^{\Psi}}+\Gamma_{\beta\gamma}^{\alpha
}S_{\Phi}^{\beta}S_{\Psi}^{\gamma}-\Gamma_{\Phi\Psi}^{\Omega}S_{\Omega }^{\alpha}\right)
.\tag{10.29}
\]
Equating it to zero yields
\[
S^{\Phi\Psi}\left( \frac{\partial^{2}S^{\alpha}}{\partial S^{\Phi}\partial
S^{\Psi}}+\Gamma_{\beta\gamma}^{\alpha}S_{\Psi}^{\beta}S_{\Phi}^{\gamma
}-\Gamma_{\Phi\Psi}^{\Omega}S_{\Omega}^{\alpha}\right) =0.\tag{10.30}
\]
It is left as an exercise to show that the term \(\Gamma_{\Phi\Psi}^{\Omega
}S_{\Omega}^{\alpha}\) is given by
\[
\Gamma_{\Phi\Psi}^{\Omega}S_{\Omega}^{\alpha}=S^{\Delta\Omega}\left(
\Gamma_{\omega,\beta\gamma}S_{\Omega}^{\alpha}S_{\Phi}^{\beta}S_{\Psi}
^{\gamma}S_{\Delta}^{\omega}+\frac{\partial^{2}S^{\beta}}{\partial S^{\Phi }\partial
S^{\Psi}}S_{\beta\gamma}S_{\Omega}^{\alpha}S_{\Delta}^{\gamma }\right) .\tag{10.31}
\]
Let us now switch back to the parameter \(\gamma\) and note that the term
\(\partial^{2}S^{\alpha}/\partial S^{\Phi}\partial S^{\Psi}\) corresponds to \(\ddot{S}^{\alpha}\)
while the shift tensor \(S_{\Phi}^{\alpha}\) corresponds to \(\dot{S}^{\alpha}\). With this in
mined, note that the factor of \(S^{\Phi\Psi}\) can be cancelled, and we are left with
\[
\ddot{S}^{\alpha}+\Gamma_{\beta\gamma}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma
}-\frac{\Gamma_{\omega,\beta\gamma}\dot{S}^{\alpha}\dot{S}^{\beta}\dot
{S}^{\gamma}\dot{S}^{\omega}+\ddot{S}^{\beta}S_{\beta\gamma}\dot{S}^{\alpha
}\dot{S}^{\gamma}}{S_{\rho\tau}\dot{S}^{\rho}\dot{S}^{\tau}}=0\tag{10.32}
\]
Combining the leading terms, we have
\[
\ddot{S}^{\alpha}+\frac{S_{\beta\gamma}\dot{S}^{\alpha}\dot{S}^{\gamma}
}{S_{\rho\tau}\dot{S}^{\rho}\dot{S}^{\tau}}\ddot{S}^{\beta}+\Gamma
_{\beta\gamma}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}-\frac{\Gamma
_{\omega,\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\omega}}
{S_{\rho\tau}\dot{S}^{\rho}\dot{S}^{\tau}}\dot{S}^{\alpha}=0\tag{10.33}
\]
which can be written in the form
\[
\left( \delta_{\beta}^{\alpha}-\frac{S_{\beta\gamma}\dot{S}^{\alpha}\dot
{S}^{\gamma}}{S_{\rho\tau}\dot{S}^{\rho}\dot{S}^{\tau}}\right) \ddot
{S}^{\beta}+\Gamma_{\beta\gamma}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}
-\frac{\Gamma_{\omega,\beta\gamma}\dot{S}^{\alpha}\dot{S}^{\beta}\dot
{S}^{\gamma}\dot{S}^{\omega}}{S_{\rho\tau}\dot{S}^{\rho}\dot{S}^{\tau}}=0\tag{10.34}
\]
With the help of the symbol
\[
\dot{S}_{\alpha}=S_{\alpha\beta}\dot{S}^{\beta},\tag{10.35}
\]
it can be rewritten more compactly as
\[
\left( \delta_{\beta}^{\alpha}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}\right) \ddot{S}^{\beta}+\Gamma_{\beta\gamma
}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}-\frac{\Gamma_{\beta,\gamma\delta
}\dot{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}=0.\tag{10.36}
\]
We will return to the analysis of this equation after we derive it again with the help of
the Calculus of Variations.
10.6The geodesic equation by the Calculus of Variations
Perhaps the most straightforward analytical approach to deriving the geodesic equation is the
Calculus of Variations. The classical problem of the Calculus of Variations is to minimize or
maximize the functional
\[
E=\int_{a}^{b}f\left( \gamma,y\left( \gamma\right) ,\dot{y}\left( \gamma\right) \right)
d\gamma\tag{10.37}
\]
with respect to all possible smooth functions \(y\left( \gamma\right) \) for
\(a\leq\gamma\leq b\). The solution of the problem is represented by the celebrated Euler-Lagrange
equation which reads
\[
\frac{\partial f}{\partial y}-\frac{d}{d\gamma}\frac{\partial f}{\partial \dot{y}}=0.\tag{10.38}
\]
In the case of multidimensional functions \(y^{\alpha}\left( \gamma\right) \), the
Euler-Lagrange equation becomes a system which reads
\[
\frac{\partial f}{\partial y^{\alpha}}-\frac{d}{d\gamma}\frac{\partial
f}{\partial\dot{y}^{\alpha}}=0.\tag{10.39}
\]
Thus, the problem of describing a geodesic, is precisely an example of the classical problem of the
Calculus of Variations where the functions \(S^{\alpha}\left( \gamma\right) \) play the role of
the independent variations. The length \(L\) of the segment of the embedded curve corresponding to
the values of \(\gamma\) between \(a\) and \(b\) is given by the integral
\[
L=\int_{a}^{b}\sqrt{S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}}d\gamma,\tag{10.40}
\]
where the metric tensor \(S_{\alpha\beta}\) is treated as a function of \(S^{\alpha}\left(
\gamma\right) \).
For this functional, the Euler-Lagrange equation reads
\[
\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}}}{\partial
S^{\alpha}}-\frac{d}{dt}\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta}
\dot{S}^{\gamma}}}{\partial\dot{S}^{\alpha}}=0.\tag{10.41}
\]
Let us now treat each of these terms one by one.
10.6.1The term \(\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta} \dot{S}^{\gamma}}}{\partial S^{\gamma}}\)
By the chain rule
\[
\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}}}{\partial
S^{\alpha}}=\frac{\frac{\partial}{\partial S^{\alpha}}\left( S_{\beta\gamma
}\dot{S}^{\beta}\dot{S}^{\gamma}\right) }{2\sqrt{S_{\sigma\tau}\dot
{S}^{\sigma}\dot{S}^{\tau}}}=\frac{\frac{\partial S_{\beta\gamma}}{\partial
S^{\alpha}}\dot{S}^{\beta}\dot{S}^{\gamma}}{2\sqrt{S_{\sigma\tau}\dot
{S}^{\sigma}\dot{S}^{\tau}}}.\tag{10.42}
\]
Recall that
\[
\frac{\partial S_{\beta\gamma}}{\partial S^{\alpha}}=\Gamma_{\beta
,\gamma\alpha}+\Gamma_{\gamma,\beta\alpha} \tag{2.63}
\]
Substituting this identity into the previous equation, we find
\[
\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}}}{\partial
S^{\gamma}}=\frac{\left( \Gamma_{\beta,\gamma\alpha}+\Gamma_{\gamma ,\beta\alpha}\right)
\dot{S}^{\beta}\dot{S}^{\gamma}}{2\sqrt{S_{\sigma\tau }\dot{S}^{\sigma}\dot{S}^{\tau}}}\tag{10.43}
\]
Notice, that owing to the symmetry of the combination \(\dot{S}^{\beta}\dot {S}^{\gamma}\),
we have
\[
\Gamma_{\beta,\gamma\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}=\Gamma
_{\gamma,\beta\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}.\tag{10.44}
\]
Therefore,
\[
\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}}}{\partial
S^{\alpha}}=\frac{\Gamma_{\beta,\gamma\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}
}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}}}\tag{10.45}
\]
10.6.2The term \(\frac{d}{dt}\frac{\partial\sqrt{S_{\beta\gamma}\dot {S}^{\beta}\dot{S}^{\gamma}}}{\partial\dot{S}\alpha}\)
The analysis of this term is certainly more involved. For the partial derivative, we have by the
chain rule
\[
\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}}}
{\partial\dot{S}^{\alpha}}=\frac{S_{\beta\gamma}\frac{\partial}{\partial \dot{S}^{\gamma}}\left(
\dot{S}^{\beta}\dot{S}^{\gamma}\right) }
{2\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}}}.\tag{10.46}
\]
By the product rule,
\[
\frac{\partial}{\partial\dot{S}^{\gamma}}\left( \dot{S}^{\beta}\dot {S}^{\gamma}\right)
=\frac{\partial\dot{S}^{\beta}}{\partial\dot{S}^{\alpha}
}\dot{S}^{\gamma}+\dot{S}^{\beta}\frac{\partial\dot{S}^{\gamma}}{\partial
\dot{S}^{\alpha}}=\delta_{\alpha}^{\beta}\dot{S}^{\gamma}+\dot{S}^{\beta
}\delta_{\alpha}^{\gamma}.\tag{10.47}
\]
Substituting this into the preceding equation, we get
\[
\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}}}
{\partial\dot{S}^{\alpha}}=\frac{S_{\alpha\gamma}\dot{S}^{\gamma}
+S_{\beta\alpha}\dot{S}^{\beta}}{2\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot {S}^{\tau}}}.\tag{10.48}
\]
Owing to the symmetry of the metric tensor, the two terms in the numerator are equal and
therefore
\[
\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}}}
{\partial\dot{S}^{\alpha}}=\frac{S_{\alpha\beta}\dot{S}^{\beta}}
{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}}}.\tag{10.49}
\]
Next comes the application of the time derivative \(d/dt\). By the product rule,
\[
\begin{array} {l} \frac{d}{dt}\frac{S_{\alpha\beta}\dot{S}^{\beta}}{\sqrt{S_{\sigma\tau}\dot
{S}^{\sigma}\dot{S}^{\tau}}}=\left. \frac{\frac{dS_{\alpha\beta}}{dt}\dot
{S}^{\beta}}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}}}\right\vert _{I}+\left.
\frac{S_{\alpha\beta}\frac{d\dot{S}^{\beta}}{dt}}{\sqrt
{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}}}\right\vert _{II}\\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \
\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ +\left. \frac{d}{dt}\left(
\frac{1}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S} ^{\tau}}}\right)
S_{\alpha\beta}\dot{S}^{\beta}\right\vert _{III}. \end{array}\tag{10.50}
\]
The first two terms are rather straightforward. Since
\[
\frac{dS_{\alpha\beta}}{dt}=\frac{\partial S_{\alpha\beta}}{\partial
S^{\gamma}}\frac{dS^{\gamma}}{dt}=\left( \Gamma_{\alpha,\beta\gamma}
+\Gamma_{\beta,\alpha\gamma}\right) \dot{S}^{\gamma},\tag{10.51}
\]
we have
\[
I=\frac{\left( \Gamma_{\alpha,\beta\gamma}+\Gamma_{\beta,\alpha\gamma }\right)
\dot{S}^{\beta}\dot{S}\gamma}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma }\dot{S}^{\tau}}}.\tag{10.52}
\]
For the second term, since \(d\dot{S}^{\beta}/dt=\ddot{S}^{\beta}\), we have
\[
II=\frac{S_{\alpha\beta}\ddot{S}^{\beta}}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma
}\dot{S}^{\tau}}}.\tag{10.53}
\]
Turning our attention to the most labor-intensive third term, we have by the chain rule
\[
\frac{d}{dt}\left( \frac{1}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S} ^{\tau}}}\right)
=-\frac{\frac{d}{dt}\left( S_{\gamma\delta}\dot{S}^{\gamma }\dot{S}^{\delta}\right) }{2\left(
S_{\sigma\tau}\dot{S}^{\sigma}\dot {S}^{\tau}\right) ^{3/2}}.\tag{10.54}
\]
Then, we have by the product rule,
\[
\frac{d}{dt}\left( \frac{1}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S} ^{\tau}}}\right)
=-\frac{\frac{S_{\gamma\delta}}{dt}\dot{S}^{\gamma}\dot
{S}^{\delta}+S_{\gamma\delta}\frac{d\dot{S}^{\gamma}}{dt}\dot{S}^{\delta
}+S_{\gamma\delta}\dot{S}^{\gamma}\frac{d\dot{S}^{\delta}}{dt}}{2\left(
S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}\right) ^{3/2}}.\tag{10.55}
\]
We have already discussed how to deal with each of the terms and so we jump to the form
\[
\frac{d}{dt}\left( \frac{1}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S} ^{\tau}}}\right)
=-\frac{\left( \Gamma_{\gamma,\delta\varepsilon} +\Gamma_{\delta,\gamma\varepsilon}\right)
\dot{S}^{\gamma}\dot{S}^{\delta
}\dot{S}^{\varepsilon}+S_{\gamma\delta}\ddot{S}^{\gamma}\dot{S}^{\delta
}+S_{\gamma\delta}\dot{S}^{\gamma}\ddot{S}^{\delta}}{2\left( S_{\sigma\tau
}\dot{S}^{\sigma}\dot{S}^{\tau}\right) ^{3/2}}.\tag{10.56}
\]
Owing to the available symmetries, we have
\[
\frac{d}{dt}\left( \frac{1}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S} ^{\tau}}}\right)
=-\frac{\Gamma_{\gamma,\delta\varepsilon}\dot{S}^{\gamma
}\dot{S}^{\delta}\dot{S}^{\varepsilon}+S_{\gamma\delta}\ddot{S}^{\gamma} \dot{S}^{\delta}}{\left(
S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}\right) ^{3/2}}.\tag{10.57}
\]
Thus,
\[
III=\frac{d}{dt}\left( \frac{1}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot {S}^{\tau}}}\right)
=-\frac{S_{\alpha\beta}\Gamma_{\gamma,\delta\varepsilon
}\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}\dot{S}^{\varepsilon
}+S_{\alpha\beta}S_{\gamma\delta}\dot{S}^{\beta}\ddot{S}^{\gamma}\dot {S}^{\delta}}{\left(
S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}\right) ^{3/2}}\tag{10.58}
\]
Finally, combining the three terms, we find
\[
\begin{array} {l} \frac{d}{dt}\frac{\partial\sqrt{S_{\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma
}}}{\partial\dot{S}^{\alpha}}=\frac{\left( \Gamma_{\alpha,\beta\gamma}
+\Gamma_{\beta,\alpha\gamma}\right) \dot{S}^{\beta}\dot{S}^{\gamma}
+S_{\alpha\beta}\ddot{S}^{\beta}}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot {S}^{\tau}}}\\ \ \ \ \ \
\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ -\frac
{S_{\alpha\beta}\Gamma_{\gamma,\delta\varepsilon}\dot{S}^{\beta}\dot
{S}^{\gamma}\dot{S}^{\delta}\dot{S}^{\varepsilon}+S_{\alpha\beta}
S_{\gamma\delta}\dot{S}^{\beta}\ddot{S}^{\gamma}\dot{S}^{\delta}}{\left(
S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}\right) ^{3/2}}. \end{array}\tag{10.59}
\]
10.6.3The combined equation
Putting the two terms together, the combined Euler-Lagrange equation reads
\[
\begin{array} {l} \frac{\Gamma_{\beta,\gamma\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}}
{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}}}-\frac{\left(
\Gamma_{\alpha,\beta\gamma}+\Gamma_{\beta,\alpha\gamma}\right) \dot{S}
^{\beta}\dot{S}^{\gamma}+S_{\alpha\beta}\ddot{S}^{\beta}}{\sqrt{S_{\sigma\tau
}\dot{S}^{\sigma}\dot{S}^{\tau}}}\\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \
+\frac {S_{\alpha\beta}\Gamma_{\gamma,\delta\varepsilon}\dot{S}^{\beta}\dot
{S}^{\gamma}\dot{S}^{\delta}\dot{S}^{\varepsilon}+S_{\alpha\beta}
S_{\gamma\delta}\dot{S}^{\beta}\ddot{S}^{\gamma}\dot{S}^{\delta}}{\left(
S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}\right) ^{3/2}}=0. \end{array}\tag{10.60}
\]
Observe that the terms \(\Gamma_{\beta,\gamma\alpha}\dot{S}^{\beta}\dot {S}^{\gamma}\) and
\(\Gamma_{\beta,\alpha\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}\) are equivalent and therefore cancel,
and therefore we are left with
\[
-\frac{\Gamma_{\alpha,\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}
+S_{\alpha\beta}\ddot{S}^{\beta}}{\sqrt{S_{\sigma\tau}\dot{S}^{\sigma}\dot
{S}^{\tau}}}+\frac{S_{\alpha\beta}\Gamma_{\gamma,\delta\varepsilon}\dot
{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}\dot{S}^{\varepsilon}
+S_{\alpha\beta}S_{\gamma\delta}\dot{S}^{\beta}\ddot{S}^{\gamma}\dot {S}^{\delta}}{\left(
S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}\right) ^{3/2}}=0.\tag{10.61}
\]
Next, multiply the equation by the nonzero quantity \(-\sqrt{S_{\sigma\tau}
\dot{S}^{\sigma}\dot{S}^{\tau}}\), which yields
\[
\Gamma_{\alpha,\beta\gamma}\dot{S}^{\beta}\dot{S}^{\gamma}+S_{\alpha\beta
}\ddot{S}^{\beta}-\frac{S_{\alpha\beta}\Gamma_{\gamma,\delta\varepsilon}
\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}\dot{S}^{\varepsilon}
+S_{\alpha\beta}S_{\gamma\delta}\dot{S}^{\beta}\ddot{S}^{\gamma}\dot
{S}^{\delta}}{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}}=0.\tag{10.62}
\]
Next, raise the index \(\alpha\), which yields
\[
\Gamma_{\beta\gamma}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}+\ddot{S}^{\alpha
}-\frac{\Gamma_{\gamma,\delta\varepsilon}\dot{S}^{\alpha}\dot{S}^{\gamma}
\dot{S}^{\delta}\dot{S}^{\varepsilon}+S_{\gamma\delta}\dot{S}^{\alpha}\ddot
{S}^{\gamma}\dot{S}^{\delta}}{S_{\sigma\tau}\dot{S}^{\sigma}\dot{S}^{\tau}}=0.\tag{10.63}
\]
Then, recall the notation
\[
\dot{S}_{\alpha}=S_{\alpha\beta}\dot{S}^{\beta}.\tag{10.64}
\]
We have
\[
\Gamma_{\beta\gamma}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}+\ddot{S}^{\alpha
}-\frac{\Gamma_{\gamma,\delta\varepsilon}\dot{S}^{\alpha}\dot{S}^{\gamma}
\dot{S}^{\delta}\dot{S}^{\varepsilon}+\dot{S}^{\alpha}\ddot{S}^{\gamma}\dot
{S}_{\gamma}}{\dot{S}_{\tau}\dot{S}^{\tau}}=0.\tag{10.65}
\]
Finally, after rearranging the terms and renaming several indices, we arrive at the exact
same equation
\[
\left( \delta_{\beta}^{\alpha}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}\right) \ddot{S}^{\beta}+\Gamma_{\beta\gamma
}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}-\frac{\Gamma_{\beta,\gamma\delta
}\dot{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}}{\dot {S}_{\tau}\dot{S}^{\tau}}=0.
\tag{10.36}
\]
we obtained by our earlier analysis based on the zero geodesic curvature condition. Note,
however, that had this been our only approach, we would have had no way of knowing the geometric
interpretation of this equation.
10.7Analysis of the geodesic equation
The equation
\[
\left( \delta_{\beta}^{\alpha}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}\right) \ddot{S}^{\beta}+\Gamma_{\beta\gamma
}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}-\frac{\Gamma_{\beta,\gamma\delta
}\dot{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}}{\dot {S}_{\tau}\dot{S}^{\tau}}=0.
\tag{10.36}
\]
is precisely of the implicit form
\[
F^{\alpha}\left( \gamma,S,\dot{S},\ddot{S}\right) =0\tag{10.66}
\]
that we hypothesized in Section 10.2. We further
hypothesized that these equations cannot be solved uniquely for the acceleration \(\ddot
{S}^{\beta}\). This is indeed the case for this system and to understand why this is so, let us
consider the second-order system
\[
\delta_{\beta}^{\alpha}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}}{\dot{S}_{\tau
}\dot{S}^{\tau}}.\tag{10.67}
\]
Let us ignore the placement of indices for a moment. Note that if \(\dot {S}^{\alpha}\)
corresponds to a vector \(v\), then the object above corresponds to the symmetric matrix
\[
A=I-\frac{vv^{T}}{v^{T}v}.\tag{10.68}
\]
This matrix is necessarily nonsingular. Indeed, its null space is represented by the vector
\(v\) since
\[
Av=\left( I-\frac{vv^{T}}{v^{T}v}\right) v=v-\frac{vv^{T}v}{v^{T}v}=v-v=0.\tag{10.69}
\]
It is left as an exercise to show that the null space of \(A\) is strictly one-dimensional.
For a specific illustration, let
\[
v=\left[ \begin{array} {c} 1\\ 2\\ 3 \end{array} \right] .\tag{10.70}
\]
Then
\[
A=\left[ \begin{array} {ccc} 1 & 0 & 0\\ 0 & 1 & 0\\ 0 & 0 & 1 \end{array} \right]
-\frac{1}{14}\left[ \begin{array} {ccc} 1 & 2 & 3\\ 2 & 4 & 6\\ 3 & 6 & 9 \end{array} \right]
=\left[ \begin{array} {rrr} \frac{13}{14} & -\frac{1}{7} & -\frac{3}{14}\\ -\frac{1}{7} &
\frac{5}{7} & -\frac{3}{7}\\ -\frac{3}{14} & -\frac{3}{7} & \frac{5}{14} \end{array}
\right]\tag{10.71}
\]
and you can easily check that the null space \(A\) in fact corresponds to \(v\).
Taking the placement of indices into account does not change anything since it merely changes
\[
A=I-\frac{vv^{T}}{v^{T}v}\tag{10.72}
\]
into
\[
A=I-\frac{v\left( Mv\right) ^{T}}{v^{T}Mv}\tag{10.73}
\]
where \(M\) is the symmetric matrix corresponding to \(S_{\alpha\beta}\). Then
\[
Av=\left( I-\frac{v\left( Mv\right) ^{T}}{v^{T}Mv}\right) v=v-\frac
{vv^{T}Mv}{v^{T}Mv}=v-v=0.\tag{10.74}
\]
To repeat this logic in the tensor notation, observe the following chain of identities:
\[
\left( \delta_{\beta}^{\alpha}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}\right) \dot{S}^{\beta}=\delta_{\beta}^{\alpha}
\dot{S}^{\beta}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}\dot{S}^{\beta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}=\dot{S}^{\alpha}-\dot{S}^{\alpha}=0.\tag{10.75}
\]
Recall from Linear Algebra that the general solution of a linear system
\[
Ax=b,\tag{10.76}
\]
where \(A\) is a singular matrix with the null space spanned by a vector \(v\), is given by
\[
x=x_{0}+cv,\tag{10.77}
\]
where \(x_{0}\) is any solution to the system, known as a particular solution,
and \(\alpha\) is an arbitrary number. Rewrite the geodesic equation
\[
\left( \delta_{\beta}^{\alpha}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}\right) \ddot{S}^{\beta}+\Gamma_{\beta\gamma
}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}-\frac{\Gamma_{\beta,\gamma\delta
}\dot{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}}{\dot {S}_{\tau}\dot{S}^{\tau}}=0
\tag{10.36}
\]
in the shape of a linear system, i.e.
\[
\left( \delta_{\beta}^{\alpha}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}\right) \ddot{S}^{\beta}=-\Gamma_{\beta\gamma
}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}+\frac{\Gamma_{\beta,\gamma\delta
}\dot{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}.\tag{10.78}
\]
Then the solution \(\ddot{S}^{\alpha}\) of this linear system at a given moment of "time"
\(\gamma\) would have the form
\[
\ddot{S}^{\alpha}=\ddot{S}_{0}^{\alpha}+c\dot{S}^{\alpha},\tag{10.79}
\]
where, once again, \(\ddot{S}_{0}^{\beta}\) is any solution and \(c\) is an arbitrary number
which we have no way of determining from the system alone.
Note, however, that the undetermined part \(c\dot{S}^{\alpha}\) of the acceleration
\(\ddot{S}^{\alpha}\) is proportional to the velocity \(\dot {S}^{\alpha}\). In other words, the
particular value of \(c\) determines the speed with which the imagined particle moves along
the trajectory and not the trajectory itself. In other words, what is missing from the geodesic
equation is the choice of parameterization and supplying one makes the system complete.
For example, let us specify that the geodesic is parameterized by arc length, i.e.
\[
S_{\alpha\beta}\dot{S}^{\alpha}\dot{S}^{\beta}=1.\tag{10.80}
\]
It is left as an exercise to show that differentiating this identity with with respect to
\(\gamma\) (i.e. arc length) yields
\[
\Gamma_{\alpha,\beta\gamma}\dot{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma
}+\dot{S}_{\beta}\ddot{S}^{\beta}=0.\tag{10.81}
\]
Thus,
\[
\Gamma_{\alpha,\beta\gamma}\dot{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma }+\dot{S}_{\alpha}\left(
\ddot{S}_{0}^{\alpha}+c\dot{S}^{\alpha}\right) =0\tag{10.82}
\]
or
\[
\Gamma_{\alpha,\beta\gamma}\dot{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma
}+\dot{S}_{\alpha}\ddot{S}_{0}^{\alpha}+c=0.\tag{10.83}
\]
Solving for \(c\), we find
\[
c=-\dot{S}_{\alpha}\ddot{S}_{0}^{\alpha}-\Gamma_{\alpha,\beta\gamma}\dot
{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}.\tag{10.84}
\]
Thus,
\[
\ddot{S}^{\alpha}=\left( \delta_{\beta}^{\alpha}-\dot{S}^{\alpha}\dot {S}_{\beta}\right)
\ddot{S}_{0}^{\beta}-\Gamma_{\beta,\gamma\delta}\dot
{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}.\tag{10.85}
\]
10.7.1Example: A Euclidean plane referred to affine coordinates
In the Euclidean plane referred to affine coordinates, the surface Christoffel symbols vanish and
therefore the geodesic equations
\[
\left( \delta_{\beta}^{\alpha}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}\right) \ddot{S}^{\beta}+\Gamma_{\beta\gamma
}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}-\frac{\Gamma_{\beta,\gamma\delta
}\dot{S}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma}\dot{S}^{\delta}}{\dot {S}_{\tau}\dot{S}^{\tau}}=0
\tag{10.36}
\]
become
\[
\left( \delta_{\beta}^{\alpha}-\frac{\dot{S}^{\alpha}\dot{S}_{\beta}}{\dot
{S}_{\tau}\dot{S}^{\tau}}\right) \ddot{S}^{\beta}=0.\tag{10.86}
\]
This is undoubtedly the simplest form of the geodesic equation that still exhibits the
feature associated with the arbitrariness of parameterization. Solving for \(\ddot{S}^{\beta}\), we
find
\[
\ddot{S}^{\alpha}=c\dot{S}^{\alpha}\tag{10.87}
\]
In other words
\[
\begin{aligned}
\ddot{x} & =c\dot{x}\ \ \ \ \ \ \ \ \ \ \left(10.88\right)\\
\ddot{y} & =c\dot{y}\ \ \ \ \ \ \ \ \ \ \left(10.89\right)
\end{aligned}
\]
Let us stipulate that
\[
\gamma=\frac{1}{2}x^{2}\tag{10.90}
\]
Thus
\[
\begin{aligned}
x\dot{x} & =1\ \ \ \ \ \ \ \ \ \ \left(10.91\right)\\
\dot{x}^{2}+x\ddot{x} & =0\ \ \ \ \ \ \ \ \ \ \left(10.92\right)\\
\dot{x}^{2}+xc\dot{x} & =0\ \ \ \ \ \ \ \ \ \ \left(10.93\right)\\
\dot{x}+xc & =0\ \ \ \ \ \ \ \ \ \ \left(10.94\right)
\end{aligned}
\]
So
\[
\begin{aligned}
\ddot{x} & =-\frac{\dot{x}^{2}}{x}\ \ \ \ \ \ \ \ \ \ \left(10.95\right)\\
\ddot{y} & =-\frac{\dot{x}\dot{y}}{x}\ \ \ \ \ \ \ \ \ \ \left(10.96\right)
\end{aligned}
\]
Thus
\[
x=a+b\sqrt{2\gamma}\tag{10.97}
\]
and
\[
\ddot{y}=\frac{1}{2\gamma}\dot{y}\tag{10.98}
\]
\[
y=c+d\sqrt{2\gamma}\tag{10.99}
\]
10.8Arc length parameterization
A tremendous simplification occurs for arc-length parameterization:
\[
\ddot{S}^{\alpha}+\Gamma_{\beta\gamma}^{\alpha}\dot{S}^{\beta}\dot{S}^{\gamma }=0.\tag{10.100}
\]
or
\[
\frac{d^{2}S^{\alpha}}{ds^{2}}+\Gamma_{\beta\gamma}^{\alpha}\frac{dS^{\beta}
}{ds}\frac{dS^{\gamma}}{ds}=0.\tag{10.101}
\]
This is the most familiar form of the geodesic equation
10.9The Calculus of Moving Surfaces
Let us recount, in broad terms, how the Euler-Lagrange equation is derived for the functional
\[
E=\int_{a}^{b}f\left( \gamma,y\left( \gamma\right) ,\dot{y}\left( \gamma\right) \right)
d\gamma.\tag{10.102}
\]
The starting point for the derivation is to consider a family of functions \(y\left(
t,\gamma\right) \) enumerated by a smooth parameter \(t\) in which the sought-after optimal
function occurs at \(t=0\). Then the quantity \(E\) also becomes a function of \(t\), i.e.
\[
E\left( t\right) =\int_{a}^{b}f\left( \gamma,y\left( t,\gamma\right) ,\dot{y}\left(
t,\gamma\right) \right) d\gamma,\tag{10.103}
\]
whose minimum occurs at \(t=0\). Thus, the analysis proceeds by equating to \(0\) the
derivative of the integral with respect to \(t\).
To apply this tactic to the geodesic problem, consider a family of curves given by the equations
\[
S^{\alpha}=S^{\alpha}\left( t,\gamma\right) .\tag{10.104}
\]
A specific family of such curves may appear as in the following figure.
\[
\tag{10.105}
\]
Correspondingly, the curve length \(L\) becomes a function of \(t\), i.e.
\[
L\left( t\right) =\int_{a}^{b}\sqrt{S_{\alpha\beta}\left( S\left( t,\gamma\right) \right)
\dot{S}^{\alpha}\left( t,\gamma\right) \dot {S}^{\beta}\left( t,\gamma\right)
}d\gamma,\tag{10.106}
\]
where \(\dot{S}^{\alpha}\) still denotes the derivative with respect to \(\gamma\).
Instead of describing this collection of curves as a family, we may think of it as a single
moving curve that passes through the optimal confutations, i.e. the geodesic, at \(t=0\).
This is where the Calculus of Moving Surfaces comes in. The Caclulus of Moving Surfaces is an
extension of Tensor Calculus on surfaces to families of surfaces parameterized by a time-like
variable \(t\), i.e. moving surfaces. The Calculus of Moving Surfaces has the following
formula for a time-dependent line integral
\[
\frac{d}{dt}\int_{L}UdL=\int_{L}\left( \dot{\nabla}U-cb_{\Phi}^{\Phi}\right) dL.\tag{10.107}
\]
Note that
\[
L=\int_{a}^{b}\sqrt{S_{\alpha\beta}\dot{S}^{\alpha}\dot{S}^{\beta}} d\gamma=\int_{L}dL,\tag{10.108}
\]
thus
\[
U=1\tag{10.109}
\]
Noting that \(\dot{\nabla}F=0\), we find
\[
L^{\prime}\left( t\right) =-\int_{L}cb_{\Phi}^{\Phi}dL.\tag{10.110}
\]
Since \(c\) can be arbitrary, we arrive at the equation
\[
b_{\Phi}^{\Phi}=0. \tag{10.2}
\]
10.10Exercises
Exercise 10.1Derive the identity
\[
\Gamma_{\Phi\Psi}^{\Omega}S_{\Omega}^{\alpha}=S^{\Delta\Omega}\left( \Gamma_{\omega,\beta\gamma}S_{\Omega}^{\alpha}S_{\Phi}^{\beta}S_{\Psi} ^{\gamma}S_{\Delta}^{\omega}+\frac{\partial^{2}S^{\beta}}{\partial S^{\Phi }\partial S^{\Psi}}S_{\beta\gamma}S_{\Omega}^{\alpha}S_{\Delta}^{\gamma }\right) .\tag{10.111}
\]
As a starting point, use the identity
\[
\Gamma_{\beta\gamma}^{\alpha}=\Gamma_{jk}^{i}Z_{i}^{\alpha}Z_{\beta} ^{j}Z_{\gamma}^{k}+Z_{i}^{\alpha}\frac{\partial Z_{\beta}^{i}}{\partial S^{\gamma}}\tag{3.125}
\]
that relates the surface and the ambient Christoffel symbols and then translate it to curves embedded in surfaces, i.e.
\[
\Gamma_{\Phi\Psi}^{\Omega}=\Gamma_{\beta\gamma}^{\alpha}S_{\alpha}^{\Omega }S_{\Phi}^{\beta}S_{\Psi}^{\gamma}+S_{\beta}^{\Omega}\frac{\partial S_{\Phi }^{\beta}}{\partial S^{\Psi}}.\tag{10.112}
\]