Kepler's Second Law

Return to Contents

Recall that Kepler's laws assert:

1) The shape of each planet's orbit is an ellipse with the sun at one focus.

2) If an imaginary line is drawn from the sun to the planet, the line will sweep out equal areas in space in equal periods of time for all points in the orbit.

3) The ratio of the cube of the semimajor axis of the ellipse (i.e., the average distance of the planet from the sun) to the square of the planet's period (the time it needs to complete one revolution around the sun) is the same for all the planets.

In one beautiful calculation, using his Calculus, Newton showed that all of these would follow if the planets, in their motion around the Sun, obeyed his Universal Law of Gravitation. And, of course, he developed a theory of Lunar motion on the same principles. Thus, these laws about the motion of the planets, derived from empirical observation, could now be deduced.

Newton's Universal Law of Gravitation

Let us start with the Law of Universal Gravitation as Newton stated it. Given two objects separated by a distance d from each other, there is a tendency for each to accelerate towards the other, that is, each will feel its velocity change towards the other, with a rate of change of velocity that is, for each, proportional to the mass of the other, and inversely proportional to d squared, the square of the distance separating them. The precise formula is this. A body of mass is attracted towards a body of mass with a force which is equal to:

(1.1)

 

where is a unit vector ( ) directed from towards .

According to Newton, force is a vector that is equal to mass times acceleration

 

 

where the acceleration (the rate of change of velocity) is given by

 

 

where unit vector is directed along the line separating the "centers of mass" of the two bodies. Of course, its direction, as well as its strength, changes continually.

On this page we will see why (from a 3 dimensional viewpoint) that it follows from Newton's Gravitational law that each of the planets should move in a plane that contains the Sun. That allowed us to use a polar coordinate representation for the motion in the Polar Coordinates section of the Satellite Orbits Chapter. We will justify that assumption here and then we will revisit the conserved quantity that we discovered in Motion in a Gravitational Field.

If m is the mass of a planet, then in polar coordinates, its distance from the Sun, and the time rate of change of its angle measured in radians, we called the constant quantity

 

 

the angular momentum of the planet. And we saw that the angular momentum is always conserved. This is an example of a "conservation law" and we will use the 3-dimensional Cartesian representation to establish the first of those conservation laws on this page. This is called: Conservation of Angular Momentum. It is essentially Kepler's second law: if an imaginary line is drawn from a planet to the center of the Sun, that line sweeps out equal areas in equal times. In the Galilean language of the Introduction, "The area swept out increases uniformly with time."

Deduction of Conservation of Angular Momentum

Let us suppose then that there is a three-dimensional system of coordinates with the Sun at the Origin. Each point in space is then represented with three coordinates

 

 


An object (such as a planet) whose position changes with time may then be represented by a space curve (which is a

function of time)

 

 


Now according to what we saw in Satellite Orbits, the velocity of such a moving point is given by

 

 


where the "dot" indicates time derivative. And the acceleration is given by

 

 


In terms of these coordinates, Newton's Universal law may be stated (for M the mass of the Sun) as

(1.2)

 


where

 

 


is the "unit vector" in the direction of the point and the negative sign accounts for the direction of the acceleration.

Also, recall that when you "multiply" a vector by a number, it means that you multiply each of its coordinates by that same number. The velocity of a moving point is represented by an arrow emanating from the point itself. The same is true of the acceleration, when you place it at the tail of the velocity vector.

We may simplify the acceleration somewhat by writing it

(1.3)

 

In this form we see that at each instant is a negative multiple of .

Now, why does an object that moves according to Newton's Law move in a plane? For that we need to understand that each plane passing through the Origin (the Sun) can be described by an equation:

 

 

for constants A,B, and C not all 0.

Question 1: Given two points in space,

 

 

and

 

 

then the points on the line that those (distinct) points generate have the form:

 

 


For example,

 

 

Show that if P and Q satisfy the equation that is, if,

 

 

and

 

 


then all the points on this line also satisfy the equation. Explain, in your own words, why this means that the set of points that satisfy the equation (for constant A,B, and C not all 0) must, in fact, be a plane.

End of Question

Now, suppose that our "planet" moves in the curve

 

 


under the influence of Newton's Law. This means that


for each instant of time t. Suppose that at some instant of time, , the velocity

 

 

and the position

 

 

do not lie in the same line through the origin, so that .

So there is a unique plane through the Origin that contains them both. Suppose that plane is defined by the equation

 

 

Then it means that

 

 

and

 

 

Now if we call we are saying that

 

 

Therefore is some multiple of say for scalar


We will show then that for all times , we must have

 

 

and

 

 


And so the planet must remain in this plane forever. To this end, use the cross product to define the vector:

 

 

Question 2: Show that

 

 

Conclude that since

 

 

and of course,

 

 

that

 

 

and so is a constant vector.

End of Question

But this implies that since

 

 

and that means that

 

 

and


Thus the planet remains in the plane defined by the equation:

It is also easy to see that if for some time , do not lie in the same line through the origin, so that then for all future (and past) time , it must be true that do not lie in the same line through the origin since is constant. We consider the cases where, for time the velocity is a multiple of position,

 

 

or where position is a multiple of velocity

 

 

to be singular, or unusual. In such cases, the planet must move in a straight line, and not an ellipse at all.

It is not obvious that if the planet does not satisfy the singular condition for some time then it can never arrive at the center of the Sun. That is, that

 

 

for all time t. So far, we have only used the fact that gravitational acceleration is central (towards the Sun) to guarantee that the planet remains in the plane. We actually will need the particular form of Newton's acceleration to show that the planet does not eventually fall into the Sun, even if it does not satisfy the singular condition for some time .

To see this, suppose that at a later time the planet arrived at the center of the Sun, so that

 

 


We may assume that the function is continuous, and that restricted to the interval and that it is continuously differentiable. Thus we can argue that is constant

on the interval as we did. Now, if the velocity was also a continuous function of t and if was defined, then since we would have

This means that cannot be defined in such a way that it extends to be a continuous function on all of .

It is not difficult to see in fact that as since remains constant, but .

We will see later however that this strange behavior does not occur for Newton's force law if the planet does not satisfy the singular condition for some time . This is because, as we shall see later, in fact there is a positive number such that for all . If the orbit is not singular at some point, then the planet cannot fall into the Sun (in principle, after all it could still come so close that for all practical purposes it did so).

Under the circumstances that the planet never falls into the Sun, the quantity (essentially the energy, see the discussion of conservation of energy in Satellite Orbits)

(1.4)

 

is defined for all time in its orbit.

Question 3: Show that if the orbit does not satisfy the singular condition, the function

 

 

is constant for all time t. For this, you should differentiate with respect to time, and use the chain rule. This is essentially the "Conservation of Energy" calculation that we have already seen, but done now for planetary (orbital) motion. The earlier calculation we made was for the singular case -- straight up and down motion. Reason carefully how we may conclude that is constant.

End of Question

Plane planetary motion

Now we have accomplished something important. We now know that for a single planet, ignoring all others, if we could assume the Sun did not move, then the planet must move in a plane that contains the Sun. The effect of the planet on the Sun is, however, not entirely negligible. The Sun does wobble just a little because of the planet's pull. Newton therefore imagined a fictitious "center of mass" of the planet-Sun system and showed that his slightly modified equations with that center of mass at the origin (instead of the Sun) gave the Keplerian prediction. Now this wobble is in any case so vanishingly small for Earth that we may safely ignore it in our calculations, and we will. I only mention the correction for completeness.

The task ahead is now the following. We want to show that planets "sweep out equal areas in equal times." Now that we know that the motion is in a plane, we will cast the argument in the polar coordinate plane as we did in Satellite Orbits. We will then reinterpret our conservation of angular momentum result in terms of our new determinant concept and area. Now, we have with the constancy of a powerful analytic tool. So let us put it to use.

We will now restrict attention to the Euclidean plane in which we know the planet moves, and will attach, as needs be, an appropriate metric structure inherited from . In this calculation, we assume that the motion is not singular at some time, and that the planet does not "fall into the Sun" so that it is not singular for all time. We will justify the latter assumption in the next Chapter. Recall the polar coordinate description of a plane curve in Satellite Orbits: Polar Curves and their Moving Frames, and Motion in a Gravitational Field.

We defined a polar curve, in the following way. For each angle , there is a point along the curve at that angle. Its polar coordinates are: , and its Cartesian coordinates are:

 

 

where

 

 
 

 

If we connect this point to the origin (the Sun) by an imaginary line, that line intersects the unit circle, the set of points at distance 1 from the origin, in a unique point (since, as we observed, and will show later, is never 0). This point, for each , is denoted . So the curve is the projection of the curve onto the unit circle.

We represented the point on the unit circle with the symbol: , which is, apparently, nothing but a "wrapping

function" discussed in most trigonometry texts,

 

 

so finally:

 

 


The important point here is that has two Cartesian components for each angle . It is a point on the unit circle.

We also assumed that we can actually use to parameterize the curve . It is important to know that as time increases, increases. Since our quantity is constant, it follows that as long as the planet does not fall into the Sun, we may assume that this is so. The planet does not "back up" as Ptolemy thought. So we observed that differentiation was with respect to the variable , rather than with respect to time. In particular, we calculated the derivative of .

 

 


by differentiating each coordinate separately as we do for all curves. This gave a new curve that we called

 

 


We know that

 

 

and also that for each is perpendicular to .

Summing up Newton's observations so far, we have plane curves, parameterized now by the angle ,

 

 
 

 
 

 

all associated with our planet.

The pair of vectors are perpendicular to each other, and each has length 1. They defined a "moving frame" along the curve . Imagine that you ride along the orbit, and at each point , you construct the perpendicular pair: (blue) and (dark red) at that point, as pictured below:


The bright red vector is the "velocity" as we traverse the curve by covering equal angles in equal times in the counterclockwise sense.

We let itself be an increasing function of time t. That is, for the planet, the angle increases with t. So we may write the curve that describes the planet's position also as

 

 


Differentiation with respect to time, t, is denoted by a "dot" rather than a "prime", but remember that

 

 

First, we calculated the velocity,

 

 
 

 

using the chain rule and the observation that

 

 


Thus, in the coordinates of our moving frame:

 

 

Now we differentiate again to see that the acceleration is given by

 

 

or, in the coordinates of our moving frame:

 

 

We called the first coordinate

 

, the radial acceleration

 

and the second coordinate

 

, the angular acceleration.

 

We saw that in the coordinates of our moving frame:

Now when we described the acceleration also in terms of our moving frame, we saw using Newton's Law that we must have

 

 

That is, the angular acceleration must be 0! This was the basic reason for introducing these moving frames. So it follows that

 

 

Now consider the quantity associated with the planet at each point in time:

 

 

Clearly,

 

 

We concluded that the quantity is constant throughout the motion. It is a conserved quantity for the orbit in the plane. If m is the mass of the planet, we called the constant quantity

 

 

the angular momentum of the planet. And from this, we saw that the angular momentum was always conserved.

Interpretation of Angular momentum in terms of cross product

We are now in a position to interpret in terms of the cross product: .

Recall that

So

Thus from properties of the cross product,

 

 

So it follows from the fact that are orthogonal unit vectors,

 

 

In this way, we rediscover the conserved quantity . The quantity

is the angular momentum of the planet, which does not change with time.

We now wish to interpret the constancy of as the statement of Kepler's Second Law:

"If an imaginary line is drawn from the sun to the planet, the line will sweep out equal areas in space in equal periods of time for all points in the orbit."

We do not know yet that the orbit is an ellipse. That is Kepler's First Law, but we will know that soon. So the following picture is suggestive of the situation.

For that, we shall approximate the area of the ellipse using the time, t, as the parameter. And the parameter should vary from 0 to T where T is the amount of time required for the planet to make one complete revolution.

Now if this seems strange to you, it should. What do ellipses have to do with planets? Well, Kepler and Newton showed us. And we can (and often do) take such serendipitous connections with the physical world as clues about the deeper structure (and significance) of purely mathematical objects.

Having chosen the parameter interval, we must next find the appropriate "approximating function." For that, recall our diagram


and recall that the quantity Q that we associate with this interval is the "area swept out" by as t varies from 0 to T. This is the area of a growing wedge. We want to measure the value of using integration.

We argued that in fact the area of this sector is approximately equal to the area of the approximating triangle


We called the area of the triangle DA. You showed that the area of this triangle, DA, is exactly equal to

for small positive angles Dq . Thus, calculating

 

 


we saw that this was

since


Now the quantity


is the constant for the planet.

Now I claim that if we choose two angles sufficiently close, say , the area swept out between those angles between is exactly equal to

 

 


for some angle between . To see this, consider the new picture, where we choose an interval for on which is increasing or decreasing:

where

 

 

and

 

 


then the area of the sector of circle between is

 

 

and the area of the sector of circle between is

 

 


and so there is an angle between such that


by the Intermediate Value Theorem.

This means that if we use the angle as the parameter for the quantity Q, we have a simple approximating function for Q.

So we know that

 

 

It would be difficult, however, to find an anti-derivative for .

Instead, we observe that is a strictly increasing function of the time t if we traverse the ellipse as a planet would using Newton's gravitational law. So we write and now use time t as the parameter.

We see that for each partition,

 

 


of the parameter interval, with all small enough, there is a corresponding partition

 

 

and we may choose a sequence of parameter values:

 

 

such that

 

 

For each i, in the partition, there is another parameter point
such that

 

 

so that finally,

 

 

The problem is that will not equal . The best we can do with this construction is to say that there are two times

for each i such that

 

 


Thus, we do not have a simple approximating function for Q for the time parameterization, but we do have a general approximating function f, where

 

 

and so

 

 

Recall that the function of t

 

 

is constant for the planet. This fact was proven independently of the present considerations. It only requires the we assume that the acceleration is central, or towards the Sun.

Deduction of Kepler's Second Law

According to what we have said, is an anti-derivative for the constant function . Put another way, we have finally justified the assertion that . The time rate of change of area is the constant .

This is Kepler's Second Law.

Later, when we come to Kepler's Third Law, we will be interested in calculating the actual area of the elliptical orbit. Thus, since the function is another obvious anti-derivative of , we may conclude that

 

 

or

 

 

Therefore, we have measured using integration. We see that

 

 

where T is the time required for one revolution, the planetary year.

We also now know why the (constant) function , which is not a simple approximating function for Q, for the time parameterization, but is a general approximating function, may still be integrated to yield Q.

Exploration: Testing Kepler's Second Law

How does the moon "fall around the Earth?" Why do astronauts float in the Space Shuttle even when they are only a few hundred Km above the Earth? The answer to these questions is in Newton's Law of Universal Gravitation. Here, we apply it to deduce Kepler's second law. It is often cloaked in an obscure language, so that Newton's clear geometric intuition, when he explained this is often missed. We will attempt to recover that picture in the simulation on this page.

We will also "test" Kepler's Second Law. The second law states that if an imaginary line is drawn from the sun to the planet, the line will sweep out equal areas in space in equal periods of time for all points in the orbit.

You may launch a "solar probe" into orbit from the Earth when it is at the position (1,0) on the screen. That is where we assume the Earth is at launch. In terms of absolute coordinates this is a point 150 million km from the Sun. For simplicity, we use a grid for which one unit on the screen is one Astronomical Unit: that is, the mean distance from the Earth to the Sun.

When you launch the probe, you may choose its absolute direction. For that, use the "Direction" scroller.

Choose also the duration of a time interval during which you will track the probe each time you press "Continue." Type the number (between 0 and 1) in the field:

The smaller the number, the more accurate the tracking will be. Then press the button to start the probe.

The probe leaves the Earth radially at "escape" velocity, and so the absolute velocity will be obtained by combining the velocity it has because it moves with the Earth with the velocity it has with respect to the Earth. This combination is what is depicted with the green vector. This indicates the absolute direction and magnitude of motion. The position at any time of the probe is represented by the blue vector. The length of the green vector is magnified by a factor of 20 so that you may see it on the screen.

Now the probe will travel for the duration of time you set, then it will stop. Each time you want to track it for the next leg of its journey, press the Continue button:

The first few steps may look like:

As the probe traverses its orbit, a dark red ray is drawn from the Sun after each leg of the journey (not to be confused with "Step Size" which you should probably leave alone). These rays define sectors of the orbit whose areas ought to be equal, since they are computed for simulation time equal to the "Duration" you chose. That duration has nothing to do with real time, by the way. It is simply a convenient measure of time. Above, you see 5 such sectors.

At the end of each leg, the system prints an approximate area for the sector just drawn. It does the approximation using

and so is the area of a triangle, not a sector. These numbers should be approximately equal for each leg. They are the more closely, the smaller the duration, as you will see.

You will notice that the lengths and directions of the green and blue vectors change with time. This is to be expected. As the probe nears the Sun (as in the illustration), the green vector, the velocity, becomes quite large. When it is far from the Sun, its length is quite small. The blue vector behaves in the opposite way.

Now, the green vector is drawn from the probe rather than from the Sun. This is for picturesque reasons. It highlights the fact that this is the vector of motion. The vector of motion of the probe is "tangent" to the orbit it traces at each point of the motion. But notice that at each point, the two vectors: the blue vector and the green vector together determine a parallelogram. This parallogram has the following simple interpretation. If units for distance are taken as Astronomical Unit (AU), and if units for velocity are taken as: then the area of the parallelogram is twice the rate of change of area of the ellipse being swept out at any point in time, where the rate of change of area is measured in units of .

Notice that we have "magnified" for the screen the appearance of the "green" vector. So that these units do not exactly coincide with what appears on the screen.

Now this brings us to the red vector that shows up in our pictures. It is drawn, like the green one, from the probe itself. And, like the green one, it is magnified to enable us to view it. The factor of magnification that we use for it here is 100. It is the vector of acceleration, that is, of time rate of change of velocity, that the Sun exerts on the probe. Notice that it is always pointing towards the Sun.

As the probe nears the Sun, the acceleration vector becomes larger, and as it recedes, that vector becomes smaller. Notice, however, that it always points towards the Sun! Now the interpretation of that vector is a little more complicated than the interpretation of the others. It acts continually to change the velocity vector, to "pull" it towards the Sun. Since it changes the velocity, and, (recalling Galileo's observation) because it changes the velocity vector, it causes a change in position, also.

This explains how an object can "fall" around the Sun. The probe "wants" to move in the direction of the velocity vector. As it does, that vector changes direction towards the Sun in accordance with its acceleration towards the Sun. Thus, it is always "falling", but it is also "moving sideways". Now any object (like an astronaut) that is moving with the probe is subject to exactly the same accelerations. The mass does not matter. Thus, the astronaut "falls" in exactly the same way as the probe, and therefore does not accelerate towards any of its walls. (To be honest, there are "tidal accelerations" that would eventually cause some small drift, since the center of mass of the astronaut is not the same as that of the probe, but we can ignore these.)