Fundamentals of General Relativity
This page is print-friendly. Simply press Ctrl + P (or Command + P if you use a Mac) to print the page and download it as a PDF. Report an issue or error here.
Table of contents
Note: it is highly recommended to navigate by clicking links in the table of contents! It means you can use the back button in your browser to go back to any section you were reading, so you can jump back and forth between sections!
This is a guide to the fundamentals of General Relativity and its applications in astrophysics and cosmology. Topics covered (or will be covered) include the equivalence principle, the concept of a geodesic, the metric tensor, the Einstein Field Equations, and a study of black holes as well as relativistic cosmology.
I thank Professor Giedt at Rensselaer Polytechnic for teaching the GR course that made this guide possible.
Mathematical prerequisites
This guide presumes strong knowledge of vector calculus, basic linear algebra, differential equations, electromagnetism, and classical mechanics (including Lagrangian and Hamiltonian mechanics). If any of them are unfamiliar to you, consult the below guides to learn/review these topics:
- For a review of calculus (in particular multivariable and vector calculus), see the calculus series
- For a review of basic differential equations, see the introductory differential equations guide
- For a review of electromagnetic theory, see the fundamentals of electromagnetism guide as well as the in-depth electromagnetism guide
- For a review of partial differential equations, boundary-value problems, and Fourier series, and see the PDEs guide
- For a review of special relativity and tensors, see the advanced classical mechanics guide
- For an optional review of quantum mechanics, see the introductory quantum mechanics guide as well as the in-depth quantum mechanics guide. There might be a brief discussion on quantum gravity at the very end of this guide.
An introduction to General Relativity
General Relativity (GR) is fundamentally a theory about gravity. More precisely it is a geometric theory of gravity. Even more precisely it is a geometric theory of gravity based on the equivalence principle. Ah, we are getting too far of ourselves here; but the point is that GR is a very complicated theory that doesn't make sense.
Which is a shame, because GR predicts fascinating phenomena: black holes, wormholes, the slowing of time by gravity, the beautiful bending of light by galaxies and stars, and so, so much more. And GR describes how our Universe evolves at the largest scales, telling us both about the Universe's past and its present, and even its future. It is mystical, beautiful, and perplexing at the same time. I hope that after reading to the end of this guide, this will be the impression that GR leaves on you.
Gravitational lensing by the galaxy LRG 3-757, taken by the Hubble Space Telescope. Source: Wikipedia
The Newtonian and GR regimes
General relativity is the most precise theory of gravity ever created - and indeed, it still is to this day, over a hundred years since it was first proposed. However, the precision of the theory is often not needed. Indeed, we are frequently able to describe gravitational phenomena using the Newtonian theory, which is sufficiently accurate for almost all applications in everyday life, from calculating ballistic trajectories to spacecraft orbits to even simulating the dynamics of galaxies.
It is only in cases of extremely strong gravitational fields and extreme astronomical events, such as around black holes, neutron stars, or binary black-hole/neutron-star mergers that general relativity plays a major role in gravitational interactions. We therefore can distinguish between two physical regimes: the Newtonian regime, comprising of all physical systems that can be adequately described by Newtonian gravity, and the GR regime, where Newtonian gravity becomes insufficient and general-relativistic effects dominate. Indeed, we may quantify when exactly general-relativistic effects matter by considering the dimensionless quantity $\phi_{GR}$, defined as:
$$ \phi_{GR} = \frac{2GM}{c^2 R} $$
When $\phi_{GR} > 1$, the effects of GR become evident and we are in the GR regime, whereas for $\phi_{GR} \ll 1$ the effects of GR are negligible and we are in the Newtonian regime. But note that since $\phi_{GR} \sim M$, $M$ must be very, very large for $\phi_{GR}$ to be larger than 1. Indeed, even for the Sun, which has a mass on the order of $\pu{10^{30} kg}$, we find that $\phi_{GR} \sim 10^{-8}$, meaning that the GR corrections to the predictions of Newtonian gravity are very, very small. It is only for supermassive black holes and similar ultra-massive (or ultra-dense) astronomical bodies that $\phi_{GR} \gg 1$, and general relativity becomes essential.
Note: Physically, $\phi_{GR}$ is equal to the Newtonian gravitational potential $\Phi$ divided by the speed of light. How this quantity is constructed will be something we'll discuss later.
Review of relativistic physics and Newtonian gravity
Most theories of physics grew out of a desire to explain phenomena that could not be described (or contradicted) prior theories; general relativity is no different. Indeed, to understand how GR arose, we must first take a look at the two theories that GR supersedes: special relativity and Newtonian gravity.
To start, special relativity is fundamentally a theory about how our conventional understanding of "space" and "time" is incorrect. This is a result of the fundamental postulate of special relativity: physical interactions cannot propagate faster than the speed of light, which we denote as $c$. Even more strangely, no matter how fast you are moving, you will always see light traveling at the exact same speed of $c$. For the laws of physics to be consistent (that is, to avoid any paradoxes), the result is that we find that time and space become relative concepts. Instead, a new concept of spacetime must be devised that supersedes our old notions of space and time. We find that spacetime can mathematically be described using the tools of differential geometry, and thus special relativity is, in some sense, a geometric theory. However, most special relativistic calculations end up using only algebra (with a sprinkling of some calculus), turning away from a full differential geometry based approach.
Now, let us take a look at Newtonian gravity. Newtonian gravity is the classical (as in non-relativistic) theory of gravity, which is formulated in the laws of Newtonian mechanics. The heart of Newtonian gravity is the principle of universal gravitation. The principle can be stated as follows:
- All masses in the Universe attract each other by the force of gravity
- The gravitational force is instantaneous, regardless of the distance separating any two masses, and has infinite range
To state the first principle more precisely, the gravitational force exerted by mass $m_1$ at position $\mathbf{r}_1$ on some other mass $m_2$ at another position $\mathbf{r}_2$ is given by:
$$ \mathbf{F}_{12} = -Gm_{1}m_{2} \frac{\mathbf{r}_{2} - \mathbf{r}_{1}}{|\mathbf{r}_{2} - \mathbf{r}_{1}|^3} $$
In principle, we can use this equation, along with Newton's second law $m \ddot{\mathbf{r}}_i = \sum \mathbf{F}_{ij}$, to precisely calculate the future positions and velocities of $n$ particles interacting gravitationally with each other. (In practice this is actually quite a difficult problem and requires powerful computers to solve adequately). However, the general idea holds: in Newton's theory gravity is an instantaneous force, meaning that Newtonian gravity (wrongly) predicts that gravitational interactions can travel faster than the speed of light, breaking special relativity!
For instance, Newtonian gravity predicts that a star 100 light years away would gravitationally act on Earth just as fast as our Sun, which would act just as fast as the Moon, and so forth. This idea does not hold up to experimental evidence, which can be demonstrated with a very similar thought experiment: in telescope images we have of distant galaxies, the vast majority of stars are long dead, but their light, emitted hundreds or even thousands of years ago, can still be seen. But if their gravitational influence was instant (as Newtonian gravity claims) while their light was not (as special relativity claims), then we would observe gravitational effects from the dead stars while their light had not yet reached us - "ghosts" that invisibly act across astronomical distances without any visible indication they actually exist. Clearly, this does not happen!
Note: Here, I am not discussing dark matter; I am just discussing regular matter as an illustrative example of where special relativity and Newtonian gravity contradict each other.
In discussing special relativity and Newtonian gravity, the key takeaway is this: these two theories of physics are fundamentally incompatible with each other. Special relativity says that physical interactions can propagate at a maximum speed of $c$, but Newtonian relativity says that gravity is an instantaneous force. To resolve this conflict, we need general relativity.
The analogue with electromagnetism
To try to have a basic intuitive understanding of how Newtonian gravity must be modified to incorporate the postulates of special relativity, we can consider the theory of electromagnetism. Electromagnetism and gravity are remarkably similar in many ways; both are long-range interactions (having theoretically infinite distance) and both follow inverse-square laws (Coulomb's law for electromagnetism and Newton's law for gravity). They are also highly evident because unlike the other fundamental interactions of nature (the strong and weak nuclear interactions), we see the effects of gravity and electromagnetism every day, in the forms of falling objects, bright flashes of light, and falling objects that cause bright flashes of light and tend to kill dinosaurs. Okay, maybe not the last one. But the point still stands: both are interactions we can experience, and this similarity can be used as an analogy to give us clues about the nature of gravity.
Historically, the development of electromagnetism originated in the study of electrostatics. Electrostatics governs the behavior of electric fields and electric charges that are slow-moving, and is the origin for Coulomb's law. In its original form, Coulomb's law describes the force between two point charges with charges $q_{1}, q_{2}$ and at locations $\mathbf{r}_{1}, \mathbf{r}_{2}$ as follows:
$$ \mathbf{F}_{12} = \frac{q_{1} q_{2}}{4 \pi \varepsilon_{0}} \frac{\mathbf{r}_{2} - \mathbf{r}_{1}}{|\mathbf{r}_{2} - \mathbf{r}_{1}|^3} $$
Note how similar Coulomb's law is to Newton's law of universal gravitation. In modern physics, it is conventional to write out Coulomb's law not in its force form, but rather in the potential form using the electric scalar potential $V$, that is:
$$ \ddot{\mathbf{r}} = -\frac{q}{m}\nabla V $$
Where the electric scalar potential takes the following form:
$$ V(\mathbf{r}) = \sum_{i} \frac{q_i}{4 \pi \varepsilon_{0}} \frac{1}{|\mathbf{r}_i - \mathbf{r}|} $$
And the electric field takes the form:
$$ \mathbf{E} = -\nabla V $$
Coulomb's law is a good approximation for time-independent electric fields and charges moving slowly, but fails to apply once charges are moving close to the speed of light, or if the electric field is time-dependent. One finds that it is actually necessary to define another potential - the magnetic vector potential $\mathbf{A}$ - to describe the particle adequately. But wait, that's not all! We also find that for Lorentz invariance - that is, for the speed of light to be constant for all observers - we need to "package" the scalar potential and vector potential together into a new 4-dimensional potential, denoted as $A^\mu$ (I'll explain what this notation means soon). This 4-dimensional vector, also called a 4-vector, looks like this:
$$ A^{\mu} = \left( \frac{V}{c}, \mathbf{A} \right) = \begin{pmatrix} V / c \\ A_{x} \\ A_{y} \\ A_{z} \end{pmatrix} $$
Indeed, we find that this 4-vector is what is necessary to accurately describe the electromagnetic field in the most general (relativistic and time-dependent) case. The evolution of this 4-vector is then given by the Maxwell equations, which, when written using Einstein's notation, are given by $\partial^\nu \partial_\nu A^\mu = \mu_0 J^\nu$. This generalization of the laws of non-relativistic electromagnetism is called relativistic electrodynamics, and permits classical electromagnetism to be fully consistent with relativity.
In the same idea, general relativity is a theory that takes Newtonian gravity and changes it - radically so - to be consistent with special relativity. Just like in electromagnetism, where the electric scalar potential $V$ had to be replaced by the four-dimensional potential $A^\mu$, in general relativity, the Newtonian gravitational potential $\Phi$ must be replaced by a tensor potential in GR, and is written as $g_{\mu \nu}$ (for now, think of a tensor as a "matrix"; more on that later). Since $g_{\mu \nu}$ has 10 independent components, this tensor potential can be thought of as a collection of 10 different potentials that are coupled with each other in a very complicated way. Indeed, if we write out $g_{\mu \nu}$ in matrix form, we have:
$$ g_{\mu \nu}(\mathbf{r}, t) = \begin{pmatrix} g_{00} & g_{01} & g_{02} & g_{03} \\ g_{10} & g_{11} & g_{12} & g_{13} \\ g_{20} & g_{21} & g_{22} & g_{23} \\ g_{30} & g_{31} & g_{32} & g_{33} \end{pmatrix} $$
In this form, we can see that unlike $A^\mu$ for electromagnetism, which has "only" 4 components, $g_{\mu \nu}$ has 16 components(!) (although only 10 components are independent since $g_{\mu \nu}$ is symmetric). Each component is a sort of "potential" that depends on space and time, and interacts with the other components, which are also "potentials" (roughly-speaking). Crucially, just like $A^\mu$ for electromagnetism, $g_{\mu \nu}$ is Lorentz invariant, meaning that it ensures the speed of light is constant. Thus, we have now generalized gravity in a manner that is consistent with the
We'll see that in the Newtonian limit (weak gravity and slow-moving objects), $g_{00} = -\left( 1 + \dfrac{2\Phi}{c^2} \right)$ becomes the dominant component of the field and most of the other components can be more or less ignored. However, in the ultra-relativistic case, and whenever there is strong gravity, we need to consider all the components of $g_{\mu \nu}$ to describe gravity - effectively, we need to consider 10 different potentials!
Indeed, this is where GR and electromagnetism become very different, because unlike Maxwell's equations for EM, which are a system of linear PDEs, the fundamental equations of GR (the Einstein Field Equations) are a system of highly-nonlinear PDEs. Solving them is so difficult that finding an exact solution for non-trivial cases can give you everlasting fame and glory in the astrophysics community.
Einstein's dilemma with rotating reference frames
General relativity also fixes another, less-evident issue with Newtonian gravity. Since Newtonian gravity comes from Newtonian mechanics, it comes along with all of its problems as well. In particular, accelerating reference frames are a famously-annoying issue in Newtonian gravity. As a demonstration, imagine you were standing aboard a bus that was moving forwards with some constant speed $v$. That is, until the bus rapidly brakes and comes to a stop. At that moment, you feel a "force" pushing on you (and might embarrassingly fall and drop all your things onto the floor). However, there is no such force - you are perfectly stationary with respect to the bus (unless you fall, but that's another thing altogether). So what was pushing on you, then? It was a fictitious force caused by the bus's acceleration. While you were stationary with respect to the bus, the bus was accelerating (more precisely, decelerating), and thus you feel the effect of that acceleration. To be more precise, you were present within a accelerating reference frame, and this led to your unfortunate experience of (possibly) falling.
Indeed, we could go with a much more complicated scenario. Try solving, for instance, the Newtonian equations of motion for a sliding block on rotating platform (DON'T!!! there are more productive uses of your time!!) and you will find terms corresponding to the fictitious Coriolis force and equally-fictitious centrifugal force, among others. We'll soon see that General relativity, by its construction, removes fictitious forces from the picture. Indeed, it tells us that gravity itself is a fictitious force, and that we can describe gravitational phenomena without needing to use the concept of forces at all. To summarize, while we are not able to rigorously-define either of these statements at the moment, GR posits that:
- There are no preferred reference frames: mathematically, this is governed by the fact that GR is formulated using mathematical objects called tensors
- Gravity is not a force, but rather a consequence of spacetime geometry which is described by the Einstein field equations using the mathematics of differential geometry
A geometric theory of physics
When we say general relativity is a geometric theory, we want to be careful about what we mean by "geometry". In one sense, most of classical physics can be described as geometric in origin, but the geometry used was Euclidean geometry - a system formalized by Euclid over 2,000 years ago. General relativity, however, uses non-Euclidean geometry, which was developed by Gauss, Riemann, and other 19th- and 20th-century mathematicians. It is only with this unconventional mathematical paradigm that Einstein was able to model gravity geometrically.
General relativity, specifically, uses a form of non-Euclidean geometry that has the following features:
- The Universe is described by four-dimensional spacetime, which is a geometric object (technically, a manifold) that possesses curvature
- There may be curvature in time as well as space; that is, spacetime is dynamical (time-dependent)
The presence of curvature in the geometry of spacetime is especially relevant, because it challenges one of the foundational axioms of Euclidean geometry. In Euclidean geometry, the shortest path between two points is a straight line. But in general relativity, even a particle attempting to travel along a "straight line" in curved spacetime ends up following a curved path due to the spacetime curvature. Thus, the shortest path between two points in general relativity is no longer (always) a straight line. Instead, it is a special curve called a geodesic curve, which is usually curved.
The effects of spacetime curvature on particle trajectories
Consider a ball thrown up and then allowed to fall back down to Earth under the influence of gravity. In relativity, the trajectory of a particle through space and time is known as its worldline. For our ball, the initial position of the ball is at $h = 0, t = 0$, where $h$ is the height of the ball above the ground. We may draw a spacetime diagram (a glorified graph) plotting the trajectory of the thrown ball, with the height on the $x$ axis and the time (multiplied by the speed of light, $c$) on the $y$ axis:
In weak gravity, there is approximately no spacetime curvature, and thus the trajectory of the ball is almost exactly a straight line, with nearly infinite slope (since $c \sim 10^8$ is such a large number). However, in strong gravity, there is non-negligible spacetime curvature, and thus the trajectory of the ball becomes a curve. From this basic example, we can already see that adding curvature dramatically changes the dynamics of particles travelling through spacetime.
Note: For another (more professional) visualization of the curvature of gravity, please see this interactive website.
Manifolds and differential geometry
Since we use non-Euclidean geometry in general relativity, we must abandon the idea of straight lines and flat spaces and consider curved spaces. (Here, we use the word "space" as opposed to "plane" or "cube" or "line" to be more general, since an infinite line, infinite plane, and infinite cube are just special cases of Cartesian space).
To describe any sort of curved space, we need to introduce the idea of a manifold. A manifold is any space that is locally flat - that is to say, if we zoom in close enough, we can locally approximate it with Euclidean geometry. For instance, the Earth is roughly a manifold on human scales; indeed, while we obviously know that the Earth is (to a good approximation) a sphere, we do not perceive its curvature significantly on everyday distances, meaning that it seems flat to us. Generalizing this concept, we can say that an infinitesimal portion of a sphere can be regarded as flat, but a sphere is ultimately curved.
A triangle placed on a sphere can have a sum of angles greater than 180 degrees!
Since a sphere is curved, it does not follow the typical rules of Euclidean geometry. Indeed, if we try to apply Euclidean geometry for shapes on the surface of a sphere of radius $a$, we find that we get completely unexpected results:
| Sphere | Flat 2D space | |
|---|---|---|
| Sum of angles of an inscribed triangle | $\pi + A/a^2$ where $A$ is the area of the triangle | $\pi$ |
| Circumference of an inscribed circle | $C = 2\pi a \sin(r/a)$ where $r = a\theta$ | $C = 2\pi r$ |
Note: It is also common to use the term spherical triangle to refer to an inscribed triangle on a sphere, and likewise the term spherical circle to refer to an inscribed circle on a sphere. Note that a spherical circle is defined as a curve of constant $\theta$ (colatitude); in geography, they are called latitudinal lines or parallels, and are used to describe circles on the (nearly) spherical Earth.
These results stem from the fact that the surface of a sphere is a curved space. Since the surface is curved, we must use non-Euclidean geometry. However, note that in the limit that $a$ is large (that is, the sphere has a very large radius), $\sin(r/a) \approx r/a$ and therefore $C \approx 2\pi a\left( \frac{r}{a} \right) = 2\pi r$. This tells us that for very large spheres (like the Earth), the curvature becomes hardly noticeable and we can essentially ignore it and use the results of Euclidean geometry.
Note: We must be careful to note that we speak of the surface of a sphere as a curved space, but we do not say the sphere itself is a curved space. There is a major difference; the surface of a sphere is essentially two-dimensional (since a particle confined to move along the sphere can only travel in two directions - along the lines of longitude or latitude) while looking at a sphere is to consider a three-dimensional shape embedded within 3D Euclidean space. This is why the surface of a sphere is often called a 2-sphere, since it is a 2D space.
By considering non-Euclidean geometry, we can describe curved spaces as well as flat spaces. Indeed, we can categorize curved spaces into several categories, depending on their properties:
- Hyperbolic space: negative curvature (e.g. surface of a hyperboloid)
- Flat space: zero curvature; the currently-accepted model of the universe suggests close to zero (spatial) curvature, though with non-zero curvature in the time dimension
- Elliptic space: positive curvature (e.g. surface of a sphere)
The following diagram illustrates the properties of each of these categories of spaces:
Source: Wikipedia
Introduction to the metric
To understand how it could be possible that a sphere may have such drastically-different geometry as compared to what Euclidean geometry predicts, we must introduce the tools of differential geometry, which generalizes the concepts of Euclidean geometry to curved spaces. Differential geometry can be applied to any manifold, and uses the fact that if you zoom up close, any manifold is locally flat to describe highly-complex curved spaces as a combination of infinitesimally-flat patches.
As a basic introduction, recall Pythagoras's theorem in 2D space reads $s^2 = x^2 + y^2$. This tells us the distance $s$ between the origin and a point $(x, y)$ on the Cartesian 2D plane. Now, for two points $(x_1, y_1)$ and $(x_2, y_2)$, Pythagoras's theorem reads:
$$ \Delta s^2 = (x_{2} - x_{1})^2 + (y_{2} - y_{1})^2 = \Delta x^2 + \Delta y^2 $$
Note: Here we use the notation that $\Delta s^2 = (\Delta s)^2$. Thus, $\Delta x^2 = (\Delta x)^2$ and $\Delta y^2 = (\Delta y)^2$.
Where $\Delta s$ is the distance between the two points. Now, we can generalize this result for infinitesimal displacements $dx, dy$ with:
$$ ds^2 = dx^2 + dy^2 $$
Unlike the prior two formulae, which was only valid for flat 2D space (Euclidean space), this expression for Pythagoras's theorem holds true for curved 2D spaces as well. For instance, consider a sphere of unit radius. As we discussed prior, the surface of a sphere is a 2D space, since a particle constrained to the surface of a sphere can only move in one of two directions (corresponding to the $\theta$ and $\phi$ coordinates in spherical coordinates). Unlike a plane, however, the surface of a sphere is a curved space, as we show in the below diagram:
Infinitesimal displacements $dx, dy, ds$ on a unit sphere. Note: size of the displacements are exaggerated.
Indeed, we see that for infinitesimally-small displacements, Pythagoras's theorem holds true on the surface of the sphere, even though it doesn't hold true for finite displacements $\Delta x, \Delta y,\Delta s$. We have a special name for $ds$: it is called the line element. For the surface of the unit sphere, we can express the line element in the following form:
$$ ds^2 = d \theta^2 + \sin^2 \theta d \phi^2 $$
In the most general case, the line element in $n$ dimensions is given by the following expression:
$$ ds^2 = \sum_{a}^n \sum_{b}^n g_{ab} dx^a dx^b $$
Where $g_{ab}$ is called the metric tensor (or just metric for short), and is a $(n \times n)$ matrix that describes the geometry of the space; it thus packages the information about the curvature of the space. Here, we use what is known as index notation, meaning that when we have a superscript or subscript index (like $dx^a$ or $g_{ab}$) we use the index to denote a coordinate. For instance, if we were using Cartesian coordinates $(x, y, z)$, then $dx^a = (dx, dy, dz)$ such that $dx^1 = dx$, $dx^2 = dy$ and $dx^3 = dz$. Meanwhile, if we were using cylindrical coordinates $(r, \phi, z)$ then $dx^a = (dr, d\phi, dz)$ so $dx^1 = dr$, $dx^2 = d\phi$, and $dx^3 = dz$. It is important to remember that in index notation, indices are NOT exponents! That is to say, $dx^3$ should be interpreted as "the infinitesimal displacement along the third coordinate ($z$)" rather than "$dx$ raised to the power of three".
With that clarified, we can now consider some examples of metrics. For instance, a metric tensor for a two-dimensional space takes the general form:
$$ g_{ab} = \begin{pmatrix} g_{11} & g_{12} \\ g_{21} & g_{22} \end{pmatrix} $$
For 2D Euclidean space expressed in Cartesian coordinates, the metric is simply the 2D identity matrix:
$$ g_{ab} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} $$
From which we can see that the only two nonzero components of the metric are $g_{11} = 1$ and $g_{22} = 1$. Thus, the line element is given by:
$$ ds^2 = g_{11} dx^1 dx^1 + g_{22} dx^2 dx^2 = dx^2 + dy^2 $$
Note that in 3D Euclidean space, the metric is the 3D identity matrix:
$$ g_{ab} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} $$
For which the nonzero elements are $g_{11} = 1$, $g_{22} = 1$, $g_{33} = 1$, and thus:
$$ ds^2 = g_{11} dx^1 dx^1 + g_{22} dx^2 dx^2 + g_{33} dx^3 dx^3 = dx^2 + dy^2 + dz^2 $$
Finally, let us take the example of 3D Euclidean space in spherical coordinates. The line element reads:
$$ ds^2 = dr^2 + r^2 d \theta^2 + r^2 \sin^2 \theta d \phi^2 $$
And the metric reads:
$$ g_{ab} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & r^2 & 0 \\ 0 & 0 & r^2 \sin^2 \theta \end{pmatrix} $$
Using the metric, we can generalize quantities in conventional flat space to curved spaces. For instance, an infinitesimal patch of area $dA$ can be described in an arbitrary curved space with the area element:
$$ d A = \sqrt{ g }\, dx^1 dx^2 = \sqrt{ g }\, d^2x $$
Where here, $g = \sqrt{ \det g_{ab} }$ is the determinant of the metric tensor $g_{ab}$, and $d^2 x = dx^1 dx^2$ is the product of the infinitesimal displacements in the two coordinate directions (for instance, $dx^1 = dx$ and $dx^2 = dy$ in 2D Euclidean space in cartesian coordinates, so $d^2 x = dx dy$). Meanwhile, an infinitesimal patch of volume $dV$ can be described in an arbitrary curved space with the volume element:
$$ dV = \sqrt{ g }\, dx^1 dx^2 dx^3 = \sqrt{\det g }\, d^3 x $$
Note: It is very common to use the notation $\det g$ to represent the determinant of the metric, but the determinant of the metric is sometimes (confusingly!) also denoted as $g$. We will use the former notation and stick with it for consistency.
Collectively, this allows us to find the areas and volumes of shapes in different spaces; one may use it, for instance, to calculate the area of a triangle lying on the surface of a sphere or hyperboloid, or that of a sphere in curved 3D space. One simply needs to integrate over the regions the shapes occupy to find their total volume $V_{shape}$ and total surface area $S_{shape}$, that is:
$$ \begin{align*} S_{shape} &= \int \limits_\text{shape} dA = \int \limits_\text{shape} \sqrt{ g }\, dx^1 dx^2 \\ V_{shape} &= \int \limits_\text{shape} dV = \int \limits_\text{shape} \sqrt{ g }\, dx^1 dx^2 dx^3 \\ \end{align*} $$
Similarly, one may find the length $s$ of some curve in an arbitrary space (for instance, the circumference of a circle on the surface of a sphere) by integrating over the line element:
$$ s = \int \limits_\text{curve} \sqrt{ds^2} $$
We'll later see that the famous Einstein-Hilbert action in General Relativity can also be expressed in terms of the metric:
$$ S = \frac{c^4}{16 \pi G} \int R \sqrt{ -g }\, d^4x $$
Where here, $S$ is the GR action and $R$ is a scalar-valued function that depends on the metric and its derivatives (it's called the Ricci scalar, but we'll get to that later). The essential feature of the metric is that it is the complete description of a space - whether flat or curved - and therefore, the metric is the essential quantity of interest in general relativity, since the curvature of spacetime, which is described by the metric, is what we perceive as gravity.
The inverse metric
When we have the matrix form of the metric, the inverse metric can be obtained by just taking the matrix inverse. The simplest case is when the metric is purely diagonal; then, the inverse metric can be obtained by just taking the reciprocal of the metric
$$ g_{ab} = \begin{pmatrix} g_{11} & 0 & 0 & 0 & 0 \\ 0 & g_{22} & 0 & 0 & 0 \\ 0 & 0 & g_{33} & 0 & 0 \\ 0 & 0 & 0 & \ddots & 0 \\ 0 & 0 & 0 & 0 & g_{nn} \end{pmatrix} \quad \Rightarrow \quad g^{ab} = \begin{pmatrix} \frac{1}{g_{11}} & 0 & 0 & 0 & 0 \\ 0 & \frac{1}{g_{22}} & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{g_{33}} & 0 & 0 \\ 0 & 0 & 0 & \ddots & 0 \\ 0 & 0 & 0 & 0 & \frac{1}{g_{nn}} \end{pmatrix} $$
Indeed, we find that a diagonal metric corresponds to an orthogonal coordinate system, which makes things much more convenient. For instance, it means that the the line element is given by a single sum over one coordinate $i$ rather than over two coordinates $i, j$:
$$ \begin{align*} ds^2 &= g_{ii} dx^i dx^i \\ &= g_{00} dx^0 dx^0 + g_{11} dx^1 dx^1 + g_{22} dx^2 dx^2 + \dots \end{align*} $$
For instance, if we used coordinates $v_1, v_2, v_3, \dots, v_n$, then:
$$ ds^2 = g_{11} dv_{1}^2 + g_{22} dv_{2}^2 + g_{33} dv_{3}^2 + \dots + g_{nn} dx_{n}^2 $$
In the case of Cartesian coordinates, where $v_1 = x, v_2 = y, v_3 = z$, we have $g_{11} = g_{22} = g_{33} = 1$ and therefore we have:
$$ ds^2 = dx^2 + dy^2 + dz^2 $$
Note: We find that sometimes, the inverse metric can become singular (this occurs for polar coordinates and spherical coordinates, for instance). From a mathematical point of view, this means that the metric is defined everywhere except at these singularities, where we have a coordinate singularity; from a physical point of view, this doesn’t really matter, since we know that the singularities come purely from a choice of coordinates. The distinction between coordinate and physical singularities will become very important once we discuss black holes.
Paths in two-dimensional spaces
One of the oldest math problems of all times is to find distance travelled along a curve — something that is very useful for anyone who wonders if they have the time to grab a coffee before heading to school (or work)! In mathematical terms, this is the classical problem of finding the arc length (or path length) of a curve — something that we can now tackle with our tools of differential geometry.
In flat (Euclidean) 2D space, we know that the metric is simply the 2D identity matrix, and so the line element is given by:
$$ ds^2 = dx^2 + dy^2 $$
To find the total length of some curve with endpoints $A = (x_1, y_1)$ and $B = (x_2, y_2)$, we must integrate over the line element:
$$ s = \int_{A}^B ds = \int_{A}^B \sqrt{ dx^2 + dy^2 } $$
This integral cannot be solved directly, but we have some options. First, if we let $y = y(x)$, then $dy = y’(x) dx$ and thus we have:
$$ s = \int_{A}^B \sqrt{ dx^2 + y'(x)^2 dx^2 } = \int_{x_{A}}^{x_{B}} \sqrt{ 1 + y'(x) }\, dx $$
Another option, which is often more useful is to parametrize $x, y$ in terms of some parameter $\tau$ (this can, although does not have to, represent time). Thus we have $dx = x'(t) dt$ and $dy = y'(t) dt$ and we have:
$$ \begin{align*} s &= \int_{A}^B \sqrt{ dx^2 + dy^2 } \\ &= \int_{A}^B \sqrt{ x'(t)^2 dt^2 + y'(t)^2 dt^2 } \\ &= \int_{t_{A}}^{t_{B}} \sqrt{ x'(t)^2 + y'(t)^2 } dt \end{align*} $$
These methods are simply the standard formulas for the arc length from single-variable calculus, and only apply in flat spaces. Let us now generalize the same methods to an arbitrary curved 2D space. It turns out that the general formula for the length of a curve to an arbitrary curved space is given by:
$$ \begin{align*} s &= \int_{A}^B \sqrt{ ds^2 } \\ &= \int_{A}^B \sqrt{ g_{ab} dx^a dx^b } \\ &= \int_{A}^B \sqrt{ g_{ab} \left( \frac{dx^a}{d\tau}d\tau \right) \left( \frac{dx^b}{d\tau} d\tau \right) } \\ &= \int_{\tau_{A}}^{\tau_{B}} d\tau \sqrt{ g_{ab} \frac{dx^a}{d\tau} \frac{dx^b}{d\tau} } \end{align*} $$
For instance, let’s try to calculate the length of a path on a sphere. We know from before that:
$$ ds^2 = R^2 d\theta^2 + R^2 \sin^2 \theta d\phi^2 $$
In our case, the metric is given by:
$$ g_{ab} = \begin{pmatrix} R^2 & 0 \\ 0 & R^2 \sin^2 \theta \end{pmatrix} $$
For a sphere, the coordinates we use are $(\theta, \phi)$, and since $a, b$ sum over the coordinates, we have:
$$ \begin{align*} g_{ab} \frac{dx^a}{d\tau} \frac{dx^b}{d\tau} &= g_{aa} \frac{dx^a}{d\tau} \frac{dx^a}{d\tau} \\ &= g_{11} \left(\frac{d\theta}{d\tau}\right)^2 + g_{22} \left(\frac{d\phi}{d\tau}\right)^2 \\ &= R^2 \left(\frac{d\theta}{d\tau}\right)^2 + R^2 \sin^2 (\theta) \left(\frac{d\phi}{d\tau}\right)^2 \end{align*} $$
Therefore, the arc length for $\tau \in [a, b]$ (where $\tau$ is some parameter; it can be time in a physical setting) is given by:
$$ \begin{align*} s &= \oint ds \\ &= \int_a^b d\tau \sqrt{g_{ab} \frac{dx^a}{d\tau} \frac{dx^b}{d\tau}} \\ &= \int_a^b d\tau \sqrt{R^2 \left(\frac{d\theta}{d\tau}\right)^2 + R^2 \sin^2 (\theta) \left(\frac{d\phi}{d\tau}\right)^2} \\ &= R\int_a^b d\tau \sqrt{\left(\frac{d\theta}{d\tau}\right)^2 + \sin^2 (\theta) \left(\frac{d\phi}{d\tau}\right)^2} \end{align*} $$
If we supply the explicit forms of $\theta(\tau)$ and $\phi(\tau)$, we can therefore compute the arc length of any path on a sphere! For instance, we may want to find the circumference of the sphere at some fixed polar angle $\theta$ (on a globe this would be lines of constant latitude, known to navigators as parallels). Then, we treat $\theta$ as a constant (meaning that $\frac{d\theta}{d\tau} = 0$) while we let $\phi(\tau) = \tau$, meaning that $\phi(0) = 0$ and $\phi(2\pi) = 2\pi$ - allowing us to cover all the way around the sphere's circumference. Performing the integral gives us:
$$ s = R\int_0^{2\pi} d\tau \sqrt{\sin^2 (\theta)\left(\frac{d\phi}{d\tau}\right)^2} = R\int_0^{2\pi} d\tau \sin \theta = 2\pi R \sin \theta $$
Notice that this is not the formula $s = 2\pi R$, as it would be on a circle of radius $R$, except at the equator ($\theta = \pi/2$). Of course it's not, because a sphere is curved! Thus we see that - as we expected - curvature causes distances to change.
Note: Here, $\theta$ is more accurately referred to as the colatitude, which is different from the latitude typically used in cartography and geography. In the case of the latitude $\alpha$, the formula becomes $s = 2\pi R \cos \alpha$.
Now, we still haven’t answered a very important question: what if we only know the metric, but we don’t know the parametric forms of the curves? Can we use the formula for $s$ to work backwards to determine the trajectory $x^a(\tau)$ of a particle moving in a curved space? The answer, in fact, is yes. Answering this question is in fact one of the most important problems in general relativity, and will be the critical link between the abstract mathematics of differential geometry and the physics of gravity.
Transformation of coordinates and the origin of tensors
The coordinates we have been working with have primarily been the familiar coordinate systems of flat space: Cartesian, spherical, polar, and so forth. However, recall that the line element is a very general quantity that is also defined in curved spaces. Moreover, it can be expressed in arbitrary coordinate systems, and in fact it will often be easier for us to work with the line element in one coordinate system rather than another, especially in problems with symmetry.
To be able transform the line element into another coordinate system, recall that the general form of the line element is written as:
$$ ds^2 = g_{ab} dx^a dx^b $$
Here, we need to use two transformations. First, we must transform the differentials as follows (where $j$ is summed over across all the coordinates):
$$ dx^j = \frac{\partial x^i}{\partial x^j} dx^j $$
We must also transform the components of the metric, as follows (where $a, b$ are both summed over all the coordinates):
$$ g_{ij} = \frac{\partial x^a}{\partial x^i} \frac{\partial x^b}{\partial x^j} g_{ab} $$
Then, the line element in our new coordinates takes the form:
$$ ds^2 = g_{ij} dx^i dx^j $$
Notice that the general formula for $ds^2$ stays the same, with only a switch of indices. We therefore recognize that $ds^2$ is an invariant quantity since it stays the same, regardless of which space we are working in. Another invariant is the the Gaussian curvature $K$, which is given by:
$$ \begin{align*} K &= \frac{1}{2g_{11}g_{22}} \bigg\{ - \frac{\partial^2 g_{11}}{\partial (x^2)^2} - \frac{\partial^2 g_{22}}{\partial(x^1)^2} \\ &\qquad + \frac{1}{2g_{11}} \left[\frac{\partial g_{11}}{\partial (x^1)} \frac{\partial g_{22}}{\partial (x^1)} + \left( \frac{\partial g_{11}}{\partial (x^2)} \right)^2\right] + \frac{1}{2g_{22}} \left[ \frac{\partial g_{11}}{\partial (x^2)} \frac{\partial g_{22}}{\partial (x^2)} + \left( \frac{\partial g_{22}}{\partial (x^1)} \right)^2 \right] \bigg\} \end{align*} $$
Where $x^1, x^2$ are the coordinates used to describe the 2D space (they are not exponents!) and where $K$ is a combination of derivatives and components of the metric. Note that the Gaussian curvature is not defined for spaces that aren’t two-dimensional. We will explore a more general invariant quantity for spaces of arbitrary dimension, known as the Ricci scalar, once we get to curved 4D spacetime.
Another invariant quantity that we’ll study extensively is — you guessed it — the metric! While the component form of the metric depends on our choice of our coordinates, the metric itself is a universal mathematical object defined in any space (flat or curved), and the basic mathematical expressions involving the metric — including physical laws in GR — remain the same.
These invariant objects are known as tensors, and whether they are scalars (like $ds^2$ as well as the Gaussian curvature that we’ll soon see), vectors, matrices (like the metric), or something more exotic, they obey the special property that they do not depend on the choice of coordinates.
Intrinsic and extrinsic coordinates
On the subject of coordinates, it is important to note that we actually have a choice of different coordinate systems to describe a given space:
- We can describe a space in terms of its intrinsic coordinates; that is, the coordinates that a particle “living” in the space (constrained to move within the space) would use
- We can also describe a space in terms of extrinsic coordinates; that is, the coordinates of the space that it is embedded in
Note: Intrinsic coordinates are also known as Gaussian coordinates since Gauss was one of the first mathematicians who studied curved spaces.
For example, we know that a sphere of radius $R$ is embedded in 3D space, where we have the extrinsic coordinates $(x, y, z)$. The metric of the embedding space (the space where the sphere “curves outwards” into) is simply that of 3D Euclidean space:
$$ ds^2 = dx^2 + dy^2 + dz^2 $$
Where the equation of a sphere is given by:
$$ x^2 + y^2 + z^2 = R^2 $$
But assume that you were an ant living on the surface of a sphere. You could never leave the sphere (let’s ignore any advanced ant-flying technology), and would be forever stuck on the sphere. Therefore, it would be more natural to use the intrinsic coordinates $(\theta, \phi)$ to represent your position on the surface of a sphere. Assuming that you were (somehow) an ant with unimaginable intelligence, you could describe the 2D space on the surface of a sphere with the following metric:
$$ ds^2 = R^2 d\theta^2 + R^2 \sin^2 \theta d\phi^2 $$
To relate the intrinsic coordinates on the sphere’s 2D surface with the extrinsic coordinates of the embedding space (Euclidean 3D space), we can use the following coordinate transformations:
$$ \begin{align*} x &= R \sin \theta \cos \phi \\ y &= R \sin \theta \sin \phi \\ z &= R \cos \theta \end{align*} $$
Note that we can then compute the differentials $dx, dy, dz$ as follows:
$$ \begin{align*} dx &= \frac{\partial x}{\partial \theta} d\theta + \frac{\partial x}{\partial \phi} d\phi \\ &= R \cos \theta \cos \phi d\theta -R \sin \phi \sin \theta d\phi \\ dy &= \frac{\partial y}{\partial \theta} d\theta + \frac{\partial y}{\partial \phi} d\phi \\ &= R \sin \phi \cos \theta d\theta + R \sin \theta \cos \phi d\phi \\ dz &= \frac{\partial z}{\partial \theta} d\theta + \cancel{ \frac{\partial z}{\partial \phi} d\phi }^0 \\ &= -R \sin \theta d\theta \end{align*} $$
Substituting everything into the Euclidean 3D metric (the metric of the embedding space), we have:
$$ \begin{align*} ds^2 &= dx^2 + dy^2 + dz^2 \\ &= (R \cos \theta \cos \phi d\theta -R \sin \phi \sin \theta d\phi)^2 \\ &\qquad + (R \sin \phi \cos \theta d\theta + R \sin \theta \cos \phi d\phi)^2 \\ &\qquad + (-R \sin \theta d\theta)^2 \\ &= R^2 d\theta^2 + R^2 \sin^2 d\phi^2 \end{align*} $$
Which is simply the metric of the surface of a sphere! Furthermore, we note that if we substitute our coordinate conversions of $x, y, z$ in terms of $\theta$ and $\phi$, we find that:
$$ \begin{align*} x^2 + y^2 + z^2 &= (R \sin \theta \cos \phi)^2 + (R \sin \theta \sin \phi)^2 \\ &\qquad+ (R \cos \theta)^2 \\ &= R^2 \sin^2 \theta \cos^2 \phi + R^2 \sin^2 \theta \sin^2 \phi + R^2 \cos^2 \theta \\ &= R^2 \sin^2 \theta \underbrace{ (\cos^2 \phi + \sin^2 \phi) }_{ 1 } + R^2 \cos^2 \theta \\ &= R^2 \underbrace{ (\sin^2 \theta + \cos^2 \theta) }_{ 1 } \\ &= R^2 \end{align*} $$
Which gives us $x^2 + y^2 + z^2 = R^2$, precisely the equation of a sphere! Thus we see that the geometric description of a space embedded in some higher-dimensional space is equivalent. This is also true for the hyperbolic space, the 2D that describes the surface of a 3D hyperboloid. In the coordinates $(\xi, \phi)$, the metric is given by:
$$ g_{ab} = \begin{pmatrix} R^2 & 0 \\ 0 & R^2 \sinh^2 \chi \end{pmatrix} $$
For which the line element takes the form:
$$ ds^2 = R^2 d\chi^2 + R^2 \sinh^2 \chi d\phi^2 $$
When embedded in Euclidean 3D space, the equation of a hyperboloid reads:
$$ x^2 + y^2 - z^2 = -1 $$
We can find the coordinate conversions in the same manner as for the 2D sphere, although in this case we will be using the hyperbolic functions instead of the trigonometric functions.
A philosophical interlude: the nature of gravity
Gravity is a bit of an oddball when it comes to the four fundamental interactions of nature (the gravitational, electromagnetic, strong, and weak interaction). On one hand, gravity seems to be like electromagnetism in that it is a long-ranged force that propagates at the speed of light. After all, gravity is classically described by the gravitational potential $\Phi$, which has the form:
$$ \Phi \sim \frac{1}{r} $$
This is very similar to the electric potential, which also varies by the inverse of the distance. By contrast, the strong force can be modelled by the Yukawa potential, which is given by:
$$ V_\text{strong}(r) \sim \frac{e^{-r/\lambda}}{r} $$
Where $\lambda$ is the Compton wavelength of the pion (the carrier particle for the strong force), which characterizes the range of the strong interaction. Since the Yukawa potential falls off so quickly with distance, we say it is a short-ranged force. The same is true for the weak force; its carrier particles are the W and Z bosons, which have even shorter Compton wavelengths, and therefore fall off even more rapidly.
However, unlike electromagnetism, gravity is extremely weak (by around 36 orders of magnitude!) Therefore, gravity is essentially negligible for the majority of particle physics experiments, and its effects are only visible on astronomical scales. In our solar system, precision experiments can be done to measure general-relativistic effects. At Earth’s orbit, a rough estimate of the strength of gravity can be found from the solar mass $M_\odot \approx \pu{2E30 kg}$ and the Earth’s orbit $R_\oplus \sim \pu{1.5E8 km}$, giving us:
$$ \frac{2GM_\odot}{c^2 R_\oplus} \sim 10^{-8} $$
While this is small, the effects of general relativity accumulate over the centuries (particularly for Mercury, the planet closest to the Sun), leading to a phenomenon known as the perihelion precession of Mercury.
Another physical context in which it is important to consider the effects of GR is when we consider stellar collapse. When a star of sufficient mass dies, it collapses inwards, leading to a supernova and the formation of either a neutron star or a black hole. These processes are described by the Tolman–Oppenheimer–Volkoff equation, which is derived directly from general relativity. We also see quasars, which are some of the most intense radiation sources in the Universe, thought to be the result of accretion disks around supermassive black holes at the center of galaxies.
Special relativity and relativistic electromagnetism
To understand general relativity, we must first consider its “spiritual predecessor” — the theory of special relativity. Interestingly enough, the origins of special relativity have nothing to do with gravity; they actually originate in electromagnetism. Specifically, the key insight that led to the development of special relativity was a peculiar feature of Maxwell’s equations, the fundamental equations of electromagnetism. In Gaussian units (an antiquated system of units, but still commonly used in high-energy physics), they take the form:
$$ \begin{align*} \nabla \cdot \mathbf{E} &= 4\pi \rho \\ \nabla \cdot \mathbf{B} &= 0 \\ \nabla \times \mathbf{E} + \frac{1}{c} \frac{\partial \mathbf{B}}{\partial t} &= 0 \\ \nabla \times \mathbf{B} - \frac{1}{c} \frac{\partial \mathbf{E}}{\partial t} &= \frac{4\pi}{c}\mathbf{J} \end{align*} $$
Notice how the only constant that appears in Maxwell’s equations is $c$, the speed of light. But contrary to the expectations of physical, Maxwell’s equations predicted $c$ to be identical no matter the observer’s reference frame. This meant that whether you were stationary along the Earth or blasting through a window at 99% of the speed of light, the speed of light would always be the same!
The constant speed of light defied classical expectations, which assumed that velocities would add. If you were travelling at speed $v$ relative to the ground, you observed a light ray travelling at speed $c$, classical physics would predict that the speed of light with respect to the ground $c’$ would be given by:
$$ c' = v + c $$
This, of course, is in direct contradiction with the predictions of Maxwell’s equations. Einstein’s solution to resolving this paradox was profound, and extremely radical. He said that time and space are no longer absolute, but rather, they are relative concepts that are perceived differently by each observer (more generally, dependent on the chosen coordinates). Rather, it is necessary to consider four-dimensional spacetime, the fusion of space and time, as the background over which all events in the Universe occur.
Note: We won’t go over special relativity much; for more information see the special relativity guide.
An quick review of tensors for special relativity
Since relativity (both special and general relativity) demand that we must treat time and space as relative concepts in spacetime, we need to define coordinate-independent objects to describe physical quantities that “live” in spacetime — these are tensors.
Note: We will go pretty fast through tensors here and it may be a bit too fast-paced for beginners. A more gentle guide to tensors can be found on this page.
Tensors are written in the index notation that we have shown previously, but in spacetime, we typically use Greek indices (like $\mu, \nu$, etc.) to show that they exist in 4D spacetime. For instance, the 4-position (the 4D generalization of the 3D position vector) is written as:
$$ x^\mu = \begin{pmatrix} ct \\ x \\ y \\ z \end{pmatrix} = \begin{pmatrix} ct \\ \mathbf{x} \end{pmatrix}, \quad \mu = 0, 1, 2, 3 $$
Any event in the Universe — that is, anything that happens in the Universe — can be assigned a 4-position in spacetime. We can transform between two 4-positions (given by $x^\nu$ and $x’^\mu$, where the primed coordinates indicate the coordinates within another reference frame) with the Lorentz transformations, which can be written in matrix form as:
$$ \Lambda^\mu{}_{\nu} = \begin{pmatrix} \gamma & -\gamma v/c & 0 & 0 \\ -\gamma v/c & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} $$
Where $\gamma \equiv (1 - (v/c)^2)^{-1/2}$ is known as the Lorentz factor, and $\gamma \approx 1$ for “slow” objects (that is, $v \ll c$). In tensor notation, the Lorentz transformations can be written as:
$$ x'^\mu = \Lambda^\mu{}_{\nu} x^\nu $$
Note that this is actually just a special case of a more general transformation law in GR, which is given by:
$$ x^\mu = f^\mu(x^\nu) $$
Where $f^\mu$ can be some nonlinear function. It is more typical to use the differential form, given by:
$$ dx^\mu = \frac{\partial f^\mu}{\partial x^\nu} dx^\nu $$
Higher-dimensional spaces
Before we go from special relativity to general relativity, we want to first consider a description of higher-dimensional spaces. For instance, we can consider the space of a 3-sphere (that is, the surface of a 4-dimensional sphere). To start, the extrinsic description of a 3-sphere is given by:
$$ x^2 + y^2 + z^2 + w^2 = R^2 $$
We can write it in terms of intrinsic coordinates $\psi, \theta, \phi$, which are given by:
$$ \begin{align*} x &= R \sin \psi \sin \theta \cos \phi \\ y &= R \sin \psi \sin \theta \sin \phi \\ z &= R \sin \psi \cos \theta \\ w &= R \cos \psi \end{align*} $$
And where:
$$ \begin{align*} \psi &\in [0, \pi] \\ \theta &\in [0, \pi] \\ \phi &\in [0, 2\pi] \end{align*} $$
From which we can find that the metric is given by:
$$ ds^2 = R^2 d\psi^2 + R^2\sin^2 \psi d\theta^2 + R^2 \sin^2 \theta d\phi^2 $$
It can be shown that the determinant of the metric $\det g_{\mu \nu}$ is given by $g = R^6 \sin^4 \psi \sin^2 \theta$, from which we can find that $\sqrt{\det g} = R^3 \sin^2 \psi \sin \theta$. The volume element is thus given by:
$$ \begin{align*} dV = \sqrt{\det g}~ dx\, d\psi\, d\theta\, d\phi \end{align*} $$
Thus, by integrating over the volume element, we can find the total volume of a 4D sphere:
$$ \begin{align*} V &= \int dV \\ &= R^3 \int_{0}^\pi d\psi \sin^2 \psi \int_{0}^\pi d\theta \sin \theta \int_{0}^{2\pi} d\phi \\ &= 2\pi^2 R^3 \end{align*} $$
We can also consider the metric of a 3D hyperbolic space (that is, the surface of a 4D version of a hyperboloid, also known as a pseudosphere). The intrinsic coordinates $\eta, \theta, \phi$ are related to the extrinsic coordinates $x, y, z, w$ is given by:
$$ \begin{align*} x &= R \sinh \eta \sin \theta \cos \phi \\ y &= R \sinh \eta \sin \theta \sin \phi \\ z &= R \sinh \eta \cos \theta \\ w &= R \cosh \eta \end{align*} $$
Where:
$$ \begin{align*} \eta &\in [-\infty, \infty] \\ \theta &\in [0, \pi] \\ \phi &\in [0, 2\pi] \end{align*} $$
The metric can be written in the forms:
$$ \begin{align*} ds^2 &= dx^2 + dy^2 + dz^2 - dw^2 \\ &= R^2 d \eta^2 + R^2 \sinh^2 \eta d\theta^2 + R^2 \sin^2 \theta d\phi^2 \end{align*} $$
We can use the same methods to compute the volume element $dV$, but note that the total volume is infinite, since $\eta \in [-\infty, \infty]$. That is:
$$ \int dV = \infty $$
The general metric for the 3D spaces of constant curvature (which are embedded in a 4-dimensional space) can be written as:
$$ ds^2 = R^2 d\chi^2 + \frac{R^2}{k} \sin^2 (\sqrt{ k }\chi) d\Omega^2, \quad d\Omega^2 \equiv d\theta^2 + \sin^2 \theta d\phi^2 $$
Note: It is useful to note here that $i = \sqrt{-1}$ and $\sin^2 i\chi = -\sin^2 \chi$. That is to say, the hyperbolic functions are essentially the trigonometric functions with an imaginary argument.
For $k = 0$, we have flat space $\mathbb{R}^3$, while for $k = 1$ we have the 3-sphere and for $k = -1$ we have the pseudosphere. The three are related because the pseudosphere can be thought of as a sphere with imaginary radius, while flat space can be thought of as a sphere with infinite radius.
While this may all seem to be highly-abstract mathematics, there are actual applications of higher-dimensional curved spaces. For instance, cosmological models often use the following metric to describe the spatial curvature of the Universe:
$$ ds^2 = R^2 \frac{d\xi^2}{1 - k \xi^2} + R^2 \xi^2 d\Omega^2 $$
We will see later that this metric is essentially a spatial “slice” of the FLRW metric, which is the standard cosmological model of the Universe; in the FLRW metric, we typically use the letter $r$ rather than $\xi$ and incorporate a time-dependent radius, such that $R = R(t)$. In the case $k = 0$ we have a flat, spatially-infinite Universe, while $k > 0$ and $k < 0$ gives us a finite Universe. Current empirical data suggests that $k$ is very, very small for the actual Universe, meaning that the Universe has a very little 3D curvature. It is still debated whether $k = 0$ is indeed true, which would tell us whether the Universe is finite or infinite.
Finally, we will briefly mention that if we allow for the coordinates to be complex-valued, we can describe the higher-dimensional spaces often used in string theory. A 6D example of such a space is known as a Calabi-Yau manifold and is widely studied; however, it is outside the scope of this guide.
Curvature as physics
Why do we care so much about curved spaces in higher dimensions that we can’t see? It is because in GR, geometry literally is gravity. The gravitational field is the metric tensor $g_{\mu \nu}$. Mass, energy, momentum, and other matter fields together affects the gravitational field, which manifests as the curvature of spacetime. Correspondingly, the curvature of spacetime lead to deviations from straight paths in curved space, as we saw in our discussion of paths in curved spaces. As the famous physicist John Archibald Wheeler surmised:
“Spacetime tells matter how to move; matter tells spacetime how to curve.” - John Archibald Wheeler
The consequences of spacetime curvature are profound; light becomes bent, distances are stretched and shortened, and gravity slows down time. Indeed, understanding the phenomena that arise from the curvature of spacetime forms the essence of general relativity!
Show table of contents Back to all notes