Prévia do material em texto
M E T H O D S O F
Celestial Mechanics
DIRK BROUWER GERALD M. CLEMENCE
Director of the Observatory Scientific Director
Yale University United States Naval Observatory
New Haven, Connecticut Washington, D.C.
Academic Press
N E W Y O R K A N D L O N D O N • 1961
COPYRIGHT © 1 9 6 1 , BY ACADEMIC PRESS INC.
ALL RIGHTS RESERVED
NO PART OF THIS BOOK MAY BE REPRODUCED IN ANY FORM
BY PHOTOSTAT, MICROFILM, OR ANY OTHER MEANS,
WITHOUT WRITTEN PERMISSION FROM THE PUBLISHERS.
A C A D E M I C P R E S S I N C .
I l l FIFTH AVENUE
NEW YORK, NEW YORK 1 0 0 0 3
United Kingdom Edition
Published by
ACADEMIC PRESS INC. (LONDON) LTD.
BERKELEY SQUARE HOUSE, LONDON W . 1
Library of Congress Catalog Card Number 60-16909
First Printing, 1 9 6 1
Second Printing, 1 9 6 5
PRINTED IN THE UNITED STATES OF AMERICA
PREFACE
T h i s volume on celestial mechanics is in tended to provide a compre -
hensive background for practical applications. I t is suitable for advanced
undergradua te and graduate s tudents as well as for engineers and others
desiring to acquire some working knowledge of the subject. I t may
be used also as a reference volume by specialists. T h e presentat ion and
ar rangement have been designed to facilitate numerical work as m u c h
as possible. But we have aimed at much more than a handbook of com-
putat ion. W e have developed the subject logically from first principles,
and have tried to show the motivation for the processes employed,
and to lay the groundwork for future independen t investigations by
others .
Celestial mechanics is the branch of as t ronomy that is devoted to
the motions of celestial bodies. T h e principal force governing these
motions, at least those within our own galaxy, is gravitation, and tha t
is the only force dealt with in this book. In nearly all cases it is believed
that the observed mot ions can be completely described wi thout recourse
to non-gravitat ional phenomena , and that the numerous discrepancies
between observation and theory are for the most par t due to the in-
completeness of the existing theoretical deve lopments ; dur ing the past
two centuries, as a rule, the precision of observations has increased
faster than the precision of the calculations. But from a strictly pragmat ic
viewpoint, this belief is founded on nothing more substantial than
previous successes in dealing with the relatively rough observational
material then available. T h e vast quanti t ies of calculations requi red for
a stricter examination of the subject have deterred many investigators
from taking it up . T h e recent development of bet ter calculating machines
has already led to renewed activity in the field, in which exciting dis-
coveries doubtless remain to be made , and a desire to facilitate such
work has been our chief motive in wri t ing this book.
T h e outs tanding examples of impor tan t non-gravitat ional forces now
known are those that cause the irregular and seasonal changes in the
speed of rotation of the earth, those tha t affect the mot ions of satellites
within the ear th ' s a tmosphere , and those affecting the mot ions of comets
and the components of close binary systems. T h e only section in this
book that deals with any of these topics is tha t on a tmospher ic drag as it
affects the motion of an artificial satellite (Chapter X V I I , Section 1 4 ) .
V
vi PREFACE
We restrict ourselves to the inverse-square law of gravitation. T h e
cases where it is known to be inadequate are the motions of the lines
of apsides of the inner planets, and they can be very simply dealt wi th
by adding small corrections in accordance with the principles of general
relativity. For them the reader is referred to any general work on
relativity.
Celestial mechanics may be s tudied as a mental discipline, to gain a
general appreciation of principles and formal relations, or it may be
s tudied with a view to practical applications. I t is the latter that we have
had exclusively in view. We give all of the principal methods tha t have
been used most extensively for numerical calculations and none tha t
are not suitable for the purpose .
A good working knowledge of differential and integral calculus is
presupposed. In addition, some knowledge of college algebra, the theory
of equations, and ordinary differential equations is necessary for full
unders tanding of some of the derivations, bu t lack of it will not prevent
practical use of the methods .
T h e organization of the book is not in logical sequence, so that it
mus t be read th rough from beginning to end, bu t rather in more or less
independent chapte rs ; the order of chapters might be rearranged in
various ways wi thout hamper ing a proper unders tanding . T h e exceptions
are Chapters I and I I , which belong before X I - X V I I . Also, Chapter IV
logically precedes V, and Chapters VI - IX are connected with one another.
Chapters I I I and X - X V I I are each virtually self-contained. T h u s a
considerable variety of courses of s tudy may be laid out by making a
judic ious selection of chapters .
Chapter IV is given chiefly as prerequisi te to Chapte r V, in order
that the latter may be unders tood wi thout a full course in finite differ-
ences, and also in order to make precepts for interpolation readily
available. W e have included in Chapters V I - I X an account of those
parts of spherical and practical as t ronomy that are essential for dealing
with astronomical observations. Aberrat ion has been t reated more
completely than the other parts , on account of the casual way in which
it has been passed over by other authors .
T h e methods themselves are all given in Chapters X I - X V I I . In the
final chapter we have stressed the use of canonical variables for the
quanti tat ive solution of problems in celestial mechanics. W e have thought
to serve this purpose best by present ing a single unified approach,
leaving aside all tha t is not specifically needed. Perhaps conspicuous by
its absence is the partial differential equation of Hami l ton-Jacobi in
its usual form, which is of part icular value for the solution of problems
that permi t separation of the variables. W e prefer to emphasize the
advantages of the more general form of the equation, due to von Zeipel,
PREFACE vii
which was specially designed to serve as the basis of a powerful method
of per turbat ion theory. I t contains Delaunay 's method as a special case.
Thos e interested in other aspects of the subject will find a wealth of
additional information in E. T . Whi t taker ' s Analytical Dynamics and
other treatises.
Notat ion in celestial mechanics presents many difficulties, mostly
owing to the great n u m b e r of symbols required. I t is often impossible
to avoid using one symbol in two or more different senses wi thout
in t roducing notations that are excessively cumbersome, or so novel as
to confuse those who already have some familiarity with the subject.
W e have generally preferred the notat ions most commonly used by
those who have wri t ten most on a part icular topic, even to the extent
of occasionally using two symbols in different chapters to mean the
same thing. W e have, however, made an exception for the most commonly
occurr ing quanti t ies , such as mean anomaly, for which a single symbol
is used th roughout . In this way we have hoped to make it as easy as
possible for the reader to consult the original l i terature, wi thout imposing
an u n d u e burden on one who restricts himself to this book.
I t is our pleasant duty to express our cordial thanks to Dr . Gen- ichi ro
Hori of the depar tment of astronomy of Tokyo Universi ty. T h e first
year of his sojourn at the Yale Observatory coincided with the period
when the manuscr ip t was receiving its finishing touches. T o his careful
reading of the entire manuscr ip t we owe many corrections and improve-
ments . Dr . Hor i hasbeen equally helpful in reading the proofs.
DIRK BROUWER
G. M . CLEMENCE
February, 1961
CHAPTER I
ELLIPTIC MOTION
1. H i s t o r i c a l i n t r o d u c t i o n . T h e subject of celestial mechanics
may be said to have had its beginning in the publication by Sir Isaac
Newton of his Philosophiae Naturalis Principia Mathematica in 1687,
usually referred to as the Principia. I n this celebrated work Newton
formulated the laws of mot ion and the law of universal gravitation; and
he derived some of the most significant propert ies of planetary and
satellite motion.
T h e derivation of the famous three laws of Keple r had preceded
Newton ' s work by many years, and the historical way of present ing the
introduct ion to celestial mechanics is therefore that chosen by Newton
in Book I, sections II and I I I , where he enquires what information
concerning the force acting upon a planet may be derived from Kepler ' s
laws, which may be stated as follows:
I . T h e orbit of a planet lies in a plane that passes th rough the sun,
and the area swept over by the line joining the sun and the planet is
proport ional to the t ime elapsed.
I I . T h e orbit of a planet is an ellipse of which the sun occupies one
focus.
I I I . T h e ratio between the square of the period of revolution and the
cube of the semi-major axis is the same for all planets revolving around
the sun.
F r o m the s ta tements that the orbit of a planet lies in a plane that
passes th rough the sun and that the area swept over by the radius vector
is proport ional to the t ime, it may be concluded that the force acting
upon a planet mus t be directed along the line joining it wi th the sun.
Th i s force mus t be attractive rather than repulsive since the path of a
planet is concave toward the sun. F r o m the s ta tement that the orbit of
a planet is an ellipse with the sun in one focus it then follows that the
force that keeps a planet in its orbit mus t vary inversely as the square
of its distance from the sun. Finally, Kepler ' s th i rd law is found to
require that the forces acting upon different planets mus t be directly
proport ional to the planetary masses. Actually, since the orbits of the
principal planets lie nearly in the same plane, this reasoning cannot
1
2 ELLIPTIC MOTION
fully prove the isotropic character of gravitational attraction. T h e argu-
ment is s t rengthened appreciably by the consideration of cometary
orbits , many of which have large inclinations relative to the orbital
planes of the principal planets. Th i s , also, was considered by
Newton .
T h e s tudy of visual binary stars toward the end of the eighteenth
century enhanced the interest in the quest ion concerning the universality
of the law of gravitation. In most cases of visual binary stars nothing is
known about the motion in the line of sight, and the only information
about the orbit is to be obtained from the observed fact that the projec-
tion of the t rue relative orbit upon the plane tangent to the sky is an
ellipse in which the law of areas is obeyed but in which the focus is
not occupied by the central star. T h e question therefore arises what
law of gravitational attraction would produce orbits having this proper ty .
T h e answer is that the inverse square law is the only isotropic law that
satisfies the observations, bu t that other more complicated and noniso-
tropic laws would also be compatible with the observed facts. Investiga-
tions of this type were made before the int roduct ion of the spectrograph
for measurements of radial velocities. Wi th the aid of information
concerning the motion in the line of sight the answer might be made
more positive.
In Newton ' s t ime there was ample justification for proceeding first
with the establ ishment of the basic principles, by induct ion, from
observed phenomena . Even now an analysis of the problem along these
lines is of considerable interest and instructive. In this volume we
assume that the laws of mot ion and the law of gravitation are available
as the basic rules of celestial mechanics, and derive the consequences
of these assumpt ions . Th i s amounts to following from the beginning
the principle of the t ime-honored and fruitful method of scientific
investigation: the validity of the basic assumptions is tested by the
comparison of observation with theory. Newton stated this principle
in his "Rules of Reasoning in Phi losophy" (Principia, Book I I I ) as
follows: " I n experimental philosophy we are to look upon proposit ions
inferred by general induct ion from phenomena as accurately or very
nearly t rue, notwi ths tanding any contrary hypothesis that may be
imagined, till such t ime as other phenomena occur, by which they may
either be made more accurate, or liable to except ions."
I t is well known that observations of the motions of the heavenly
bodies have confirmed the validity of these basic assumptions to a
remarkably high degree of precision. T h e full significance of the preci-
sion of the quanti tat ive agreement between theory and observation is
evidently not realized by numerous writers of letters and pamphle ts
who year after year advocate changes in the basis of celestial mechanics .
LAWS OF MOTION 3
T h e r e is one significant exception: the in t roduct ion of the general
theory of relativity has made it necessary to look upon the Newtonian
basis of celestial mechanics as an approximat ion to a more fundamental
set of rules. T h e difficulty of the relativistic formulation and the high
degree of approximat ion achieved with the Newtonian laws are the
justification for proceeding on the basis of the classical theory. In the
few cases where modifications are required by the general theory of
relativity they may be in t roduced as small corrections to the results of
the analysis.
T h e most significant observed deviation from Newtonian theory is
the excess in the advance of the perihelion of Mercury , which is so
well established that this observational confirmation of general relativity
can no longer be seriously quest ioned. A similar bu t m u c h smaller
excess in the motion of the perihelion of the ear th ' s orbit is now fairly
well confirmed observationally, bu t not with the same s t rength as in
the case of Mercury ' s perihelion.
2* T h e l a w s of m o t i o n a n d the l a w of g r a v i t a t i o n . T h e three
laws of mot ion may be stated:
I. Every body continues in its state of rest, or of uniform mot ion
in a straight line, unless compelled by an impressed force to change that
state.
I I . T h e rate of change of m o m e n t u m is proport ional to the impressed
force, and takes place in the direction in which the force acts.
I I I . T o every action corresponds an equal and opposite reaction.
T h e Newtonian law of gravitational at traction is: Every two particles
in the universe attract each other with a force that is directly proport ional
to the p roduc t of their masses, and inversely proport ional to the square
of the distance between them.
Before under taking to apply these laws to the simplest p rob lem of
celestial mechanics, a few comments may be made .
(a) I t is assumed that coordinate systems exist in which the laws of
motion apply. Such a coordinate system is called a Newton ian frame
of reference.
(b) T h e velocity of a particle of mass m is defined by the differential
quot ient dsjdt—the limiting value of As/At as At approaches zero, As
being the distance traveled in the interval At.
(c) T h e m o m e n t u m is defined by m dsjdt.
(d) Velocity, m o m e n t u m , and force are vectors wi th components .
In a Cartesian coordinate system,
4 ELLIPTIC MOTION
Force Fr9 F.
For brevity, derivatives with respect to the t ime will frequently be
denoted by a dot. Hence the velocity components will be wri t ten xy y, z
and the components of the acceleration x, y> z.
(f) Newton ' s law of gravitationis stated for particles of mass only,
and does not apply to bodies of finite dimensions. I t can be shown,
however, (see Chapter I I I ) that bodies with spherical symmetry attract
each other as if their masses were concentrated in their respective centers.
As will be shown in Chapter I I I , for bodies whose distr ibution of mass
differs from spherical symmetry, if the distances between the bodies
are large compared with their dimensions, the mutua l attraction ap-
proaches that which would apply if their masses were concentrated
in their respective centers of mass.
In the solar system the sun and the principal planets are spheroids
with relatively small degrees of oblateness, and their mutua l distances
are large compared with their dimensions. Hence, the attraction between
the sun and planets not a t tended by satellites is found to be very nearly
the same as if the bodies were point masses, that is, their masses were
concentrated in their centers . For planets a t tended by satellites, a first
approximation will give the motion of the centers of mass of these
systems. T h e motion of the ea r th -moon system is in several respects the
most interesting of these. One might first obtain the motion of the center
of mass of the ea r th -moon system on the assumption that the mass of
this system were concentrated in the center of mass. T h e next step
would be to obtain the small correction to this mot ion due to the
distr ibution of mass within this system. T h e complete solution of this
problem requires a knowledge of the principal features of the motion
of the moon around the earth.
= Fz. ( 1 ) F F m -a-
m~d
m
Velocity
Momentum
(e) T h e second law of motion for a particle of mass may then be
stated
F = F
Since m will be considered a constant, these may be wri t ten
EQUATIONS OF MOTION 5
T h e smaller bodies in the solar system, the minor planets and meteo-
rites, may have very irregular shapes, bu t on account of their small
dimensions compared with their distances from the sun and the principal
planets, it is permit ted , well within the observational accuracy, to treat
these bodies as point masses. For comets the approximat ion may appear
to be less justified on account of their large dimensions in some instances.
However , even for comets there is good evidence of a s t rong concentra-
tion of mass in the nuclei. N o well-established deviations in cometary
orbits that may be ascribed to nonspherical mass distr ibution are known.
A m o n g the planets a t tended by satellites there are several examples
of satellites with distances from their pr imaries that are only a few t imes
the radius of the pr imary. T h e smallest known ratio is tha t of Jupi te r ' s
fifth satellite with the ratio 2.54 to 1. In such cases the effect of the
nonspherici ty of the planet upon the satellite's motion is not at all
negligible. Th i s problem is s tudied in detail in Chapters I I I and X V I I ,
also in relation to orbits of artificial satellites for which a more elaborate
solution is required than for natural satellites in the solar system.
(g) T h e r e is one further circumstance that simplifies to a considerable
extent the s tudy of the motions of bodies in the solar system. Compared
with the sun, the masses of the principal planets are small. T h e largest
mass is that of Jupi ter , about 1/1000 that of the sun. T h e next largest
mass is that of Saturn, about 1/3500 that of the sun. Hence , except in
the case of a body close to a principal planet, the attraction by the sun
is always overwhelmingly m u c h greater than the attractions by the
principal planets . T h e masses of the minor planets, comets, and meteo-
rites are known to be very small. For example, the total mass of all the
minor planets combined has been est imated to be less than 1/500 the
mass of the ear th.
I n view of these circumstances a useful first approximation to the
motion of a principal planet, minor planet, comet, or meteoric particle
may be obtained by considering only the mutua l at traction between the
sun and the body whose motion is to be studied, t reat ing both bodies
as point masses. T h i s is known as the two-body problem, which will
be studied in the following sections.
3 . E q u a t i o n s o f m o t i o n for the t w o - b o d y p r o b l e m . Consider
two point masses, ma and mbi with coordinates in a Cartesian coordinate
system fa, rjaj £a; £&, r)bi £ 6 . W h e n motions in the solar system are to be
treated, the coordinate system is chosen to be a r ight -handed system.
Denote the distance between the two masses by r. T h e force acting
between the two masses is then
p __ fr2 mamb
r2 '
6 ELLIPTIC MOTION
and the components of the force acting upon mb are
Fte = k*mamb Fbn = k*mamb -^=^», FH = ^F^-
It is seen that Fa$ = — J F 5 | , etc. , as should be the case in view of the
third law of motion. If the expressions for Fa§, Fa , ... Fh^ are taken as
the r ight -hand members of the equations of motion (1), the explicit
equations of motion are found to be
maL = k2mamb
1 F. Tisserand, "Mecanique Celeste," Gauthier-Villars, Paris, 1889.
k2mamb
F = F
1 at] — x
( 2 )
maL = k2mamb
Hence the components of the force acting upon ma are
k2maml
the value of k depending upon the units of mass, t ime, and distance
chosen. For numerical calculations the choice of uni ts is commonly
determined by those employed in the national ephemerides and in
auxiliary tables. For theoretical investigations it is common to adopt
uni ts such that k2 = 1 or to eliminate k from the equations. By paying
proper at tention to the dimensions of derived quanti t ies it will always
be possible to restore k in the final results of the analysis if desired.
Especially in view of numerical applications, it is advantageous to choose
the factor k2 in the expressions for F rather than a factor / (such as
used by T i s se rand 1 ) .
T h e line mamb will have the direct ion cosines '
MOTION OF CENTER OF MASS 7
with
r2 = (L - Lf + (va - i»T + (L - Q2-
In this problem the equations of mot ion could be wri t ten down almost
at once, and the problem is thereby reduced to that of a system of six
differential equations, each of the second order. T h e complete integrat ion
of this system requires the introduct ion of twelve constants of integra-
t ion.
4 . M o t i o n of the c e n t e r o f m a s s . Six constants of integrat ion
may be obtained by making use of the fact that the r igh t -hand member s
of the corresponding equat ions of (2) and (3) are equal bu t opposi te .
Hence , by adding in pairs, there result:
maL + = 0,
WaVa + mbVb = 0, (4)
ma£a + mbtb = 0.
U p o n integrat ing these once, we obtain
™aL + Wbkb = «i,
™arja + nihr)b = a2, (5)
™aL + mbtb = <*3>
ocly oc2, and oc3 being arbi trary constants . One further integration gives
MaL + Mb€b = *it + ft,
MaVa + MbVb = OC2t + ft, ( 6 )
"*aL + ™>blb = O^f + ft.
ft, ft, and ft being three additional arbi t rary constants .
™>bL = k2mamb
l-
(3) mB7]H = k2mamb
J
mbtb = k2mam
8 ELLIPTIC MOTION
Another form for these expressions may be obtained by introducing
£g> Vgy ta as the coordinates of the center of mass. T h e n
MaL + mb€b = (™a + ™b)i;gi
WaVa + ™bVb = (ma + ^b)Vg> (7)
™ala + Wblb = (ma + mb)£g.
Hence
2 - i - a? '3
I t is impor tant to have a clear unders t and ing of the meaning of con-
stants of integration. Some confusion may persist unless both the ma the -
matical and the physical meanings are fully grasped. In this particular
case, the interpretat ion is immediately clear. F r o m a mathematical
point of view the meaning of the constants of integration is s imply
that , whatever values be assigned to ocl9 a 2, a 3 , fSl9 j82, j83, the solution of
the problem in which these constants appear will satisfy the original
equations. F r o m a physical point of view the solution will usually be
employed to describethe motion of a particular physical system. In
this case, a particular set of values of the constants of integration will
serve to identify the system.
As an illustration, suppose the solar system to be limited to two
bodies, the sun and the earth. T h e constants fil9 jS2, ^83, divided by the
sum of the masses, furnish the coordinates of the center of mass of the
system at t = 0. T h e constants ocl9 a 2 , a 3 , divided by the sum of the
(8)
which state that the center of mass of the system has a uniform motion
in a straight line, the coordinates of the center of mass at t = 0 being
N(ma + ™>b), j8 2/(m a + j8 3/(m a + mb\ and t h e ' c o n s t a n t velocity
being given by
T h e direction cosines of the velocity are evidently
EQUATIONS FOR CENTER OF MASS 9
masses, furnish the velocity components of the center of mass of the
system.
I t is known tha t the velocity of the solar system, if de termined relative
to the nearer stars, is 19 km/sec in the direction right ascension 1 8 h ,
declination + 3 0 ° , approximately. T h e direction cosines of the apex, in
an equatorial coordinate system with the £ axis toward the nor th celestial
pole, are
0.000, - 0.866, + 0.500,
and the velocity components of the motion of the solar system in km/sec
0.0, - 16.5, + 9.5.
These are evidently the values in km/sec of
T h e integration constants ocv a 2 , a 3 , ft, ft, ft are therefore significant
only for the description of the location of the center of mass of the system
of two bodies; they are irrelevant if the motion within the system is to
be studied.
Wi th the aid of (6) it is possible to reduce the system (2), (3) from
one of the twelfth order to one of the sixth order by either of two
methods .
5* E q u a t i o n s of m o t i o n a b o u t the c e n t e r of m a s s . In t roduce
x a ~ £a £gy J a = Va ^gy %a = £ a %>gy
* 6 = £b — igy yb = Vb~ Vgy *b = £& ~ £</•
Since
— L = x b — xay Vb — Va = Jb ~ Jay £& ~ £ a = %b ~ Za>
and since
lg = 0, 7 ) , = 0, £, = 0,
Eqs . (2) and (3) may be wri t ten:
ma xa = k2mamh
maya = k2mamb
ma za = k2mamb
(9)
10 ELLIPTIC MOTION
(10)
with
mb zb = k2mamb -
r2 = (*« - xbf + (ya - ybf + (za - zbf
T h i s amounts to using the moving center of mass as origin, and com-
parison of Eqs . (2), (3) with (9), (10) shows that the equations are not
changed by this t ransformation. I t would be simple to show quite
generally tha t the equat ions remain unchanged if the new origin has any
arbitrary uniform motion with reference to the original coordinate
system and that they also remain unchanged regardless of the orientation
of the coordinate system.
Equat ions (7) now become
maxa + mbxb = 0,
WaJa + ™>bjb = °> (11)
maza + mbzb = 0.
Hence xb, yb, zb can be eliminated from (9) and xa,yayza from (10).
Th i s requires expressing r in te rms of xa> yai za alone or in te rms of
x b j Vb> z b alone. Since
mbJb = k2mamb'-
Mb
x b = klmamb
xa xb -
( 1 2 )
z a z b ~
it follows that
EQUATIONS FOR RELATIVE MOTION 11
or, if
rl = x2
a+yl + z2
ai xb
(13)
Mo k2
Eqs . (9) and (10) may be wri t ten
ya = (14)
(15)
Evidently, the solution of either (14) or (15) is all that is required. If,
by the complete solution of (14), xay yay za are obtained as functions of
the t ime and six constants of integration, the expressions for xby yby zb
can be wri t ten down with the aid of (11). Hence by the procedure
followed the reduct ion to a set of equations of the sixth order has been
achieved. These equations will not be used for the integration, bu t they
will be referred to in a later section.
6. E q u a t i o n s for the re la t ive m o t i o n . A more common procedure
is to obtain the motion of mb relative to ma. I n order to obtain these
equations, pu t
x = xb - xa, y = y b - y a , z = zb - za.
By subtract ing Eqs . (9), divided by may from the corresponding equa-
tions (10), divided by mby with
r2 = x2 + y2 + # 2 , /x = k2 (ma + mb\
the result ing equations are:
Xb —-
X — y -- z - (16)
T h e equations (16) represent the new system, reduced to one of the
sixth order. If x, y, z are obtained by integration as functions of t and
six constants of integration, the expressions for xay xby yay yby zay zb may
be obtained from
12 ELLIPTIC MOTION
which may be derived with the aid of (11).
7 . T h e i n t e g r a l s of area* I t is seen that if the first equat ion
of (16) is mult ipl ied by y> and the second by x, the r ight -hand m e m -
bers become identical. Subtract ion gives
xy — yx = 0.
T h e three equations (16) can be combined by pairs in three different
ways, as follows:
yz — zy = 0,
zx — xz = 0, (18)
xy — yx = 0.
These equations may be integrated, yielding
yz — zy = cx,
zx — xz — c2, (19)
xy — yx = c3 .
These are the three integrals of area, cly c2, cs being arbitrary constants.
I t is seen by subst i tut ion of (19) that
cxx + c2y + csz = 0. (20)
T h i s is the equat ion of a plane passing th rough the origin, which the
coordinates in the relative orbit mus t satisfy. Hence: T h e relative orbit
of a mass mb about ma lies in a fixed plane passing th rough ma. T h i s
plane is the orbital plane. T h e orientation of the orbital plane is defined
by the ratios between cl9 c2) cs.
Put , for brevity,
G = Vcl + c\ + 4
(17)
Xb =
jb
AREA INTEGRALS 13
G always to be taken positively. T h e direction cosines of the normal
to the orbital plane (20) are then
C\ C2 C3
G * G ' G '
T h e orbital plane is usually defined geometrically by the longitude of the
ascending node and the inclination. If the motion in the orbit is direct,
tha t is, counterclockwise viewed from the positive z direction, / is taken
to be in the first quadran t and the normal is taken to be on the same side
of the xy plane as the positive z axis. If the motion in the orbit is re t ro-
grade, / is taken in the second quadrant , and the normal is taken on
the side of the xy plane that contains the negative z axis. Again, viewed
from this side the motion in the orbit is counterclockwise, bu t viewed
from the positive z axis the motion is clockwise.
Z
F I G . 1. Orientation of an orbital plane with its normal OQ.
Imagine a sphere (Fig. 1) with center at the origin of coordinates.
T h e orbital plane will intersect this sphere in a great circle. T h e point
N is the intersection of this circle with the xy plane, where the body
crosses to that side of the xy plane that contains the positive z axis.
Let the normal intersect the sphere in Q. Consider the spherical triangles
QXN and QYN. T h e cosine formula gives
^ c i
cos XQ = = sin Q sin / ,
cos YQ = -Q — — cos Q sin / .
Also
/\ c3
cos ZQ = — — cos / .
14 ELLIPTIC MOTION
I t will now be shown that \cly \c2y \cz represent the projections on the
yz, zx9 and xy planes, respectively, of the area swept over by the radius
vector in a uni t of t ime. Consider first the motion in the plane of the
orbit, and let polar coordinates ry ip be used to specify the position of mb
relative to ma. T h e origin of the angular coordinate may be chosen in
F I G . 2 . Differential area.
dA/dt being the area swept over in the uni t of t ime. Now let rectangular
coordinates in the orbital plane be defined by
(#) — r cos ipy (y) = r sin ip.
T h e n
(x) = — rip sin ip + f cos ip> (y) = rip cos ip -\- f sin ipy
and it is found that
(*) C P ) - 0 0 (*) = '¥• ( 2 2 )
Let the projection of r on the yz plane be denoted by rly and the projection
of Aip by Aipx\ similarly, let the projections on the zx plane be r 2 , Aip2y
and on the xy plane r 3 , Aip3, the angles ipv ip2y 0 3 being counted from the
yy zy and x axes respectively. I t follows then that the projections of the
area of the triangle OPPx are, with the omission of the terms containing
the products ArjAipjy
any fixed direction in the plane of the orbit . In Fig. 2 , let P be the posi-
tion of mb att ime t and P x that at t ime t -f At. T h e area of the triangle
OPP1 is then
\r (r +Ar)AiP = %r2AiP + ^rArAip.
By taking the limiting value for At —• 0, there is obtained
( 2 1 )
A change to rectangular coordinates may be made with the t ransforma-
tions
y — rY cos , z — r2 cos </r2 , x = r3 cos </r3 ,
z = ^ sin , x = r 2 sin i/r2 > ^ — r3 s i n 03 >
which finally give, similarly to (22),
yz — zy = r 2 ^ ,
zx — xz = r\^i2, (24)
xy — yx = rlifj.
3
T h e sum of the squares of the three equat ions (19) is
G2 = C2 + c l + c l
— y2%2 z2y2 _J_ z2£2 _|_ X2Z2 _|_ 2̂̂ ,2 _|_ ^2^2 _ 2j,2'jte — 2ZXZX — Ixyxy
— (r2 — x2) x2 + (r2 — y2) y2 + (y2 — z2) z2 — Tyzyz — 2zxzx — 2xyxy,
with the final expression
G2 = r2 (x2 + y2 + s 2 ) - (** + yy + **) 2 . (25)
In this form, since
xx -\- yy -\- zz = rr,
it is seen that G is independent of the orientation of the coordinate
system. In the particular case that 7 = 0, G = c3 is twice the area
swept over by the radius vector in the orbital plane. In view of the
independence of G of the orientation of the coordinate system this
must hold generally, and hence
G = r2i(;.
AREA INTEGRALS 15
(23)
Hence, from (24), (23), and (19), it follows tha t
T h e projections of the areas swept over by the radius vector in the uni t
of t ime on the three planes are, therefore,
16 ELLIPTIC MOTION
T h e constant of areas for the relative orbit may be represented as a
vector having the length G and directed along the normal to the orbital
plane. T h e components cly c2y cs of this vector represent the constants
of areas for the planes perpendicular to the three coordinate axes.
8. T h e v i s v i v a i n t e g r a l . T h e latin words vis viva mean living
force, often translated kinetic energy. Equat ions (16), mult ipl ied suc-
cessively by the factors xy yy z and added, give
xx + yy + zz •
T h i s may be wri t ten
[x2 + y2 + z2)
U p o n integration this gives
(26) x2 + y2 + z2 = 2
C being a constant of integration. T h e left-hand member of (26) is the
square of the relative velocity. T h e constant C is, therefore, independent
of the orientation of the coordinate system. Equat ion (26) states that
the square of the velocity of one body relative to the other is inversely
proport ional to the distance between them, with the exception of an
additive constant .
I t is useful to write this integral in a different form. Put t ing U = ybjry
Eqs. (16) may be wri t ten
(27)
Since U is a function of the coordinates xy yy z only, i.e., not of the
velocity components xy yy zy while also the independent variable t does
not occur explicitly in U, we have
MOTION IN PLANE 1 7
dt
Integrat ion gives
±(*«+^+**) = U + C. (28)
Th i s may be wri t ten
i> V2 — U = C.
For a further discussion of the integral, see Section 15 of this chapter .
9. M o t i o n in the o r b i t a l p l a n e . Since the motion takes place in
a plane, it is permissible to introduce a coordinate system x, y, z such
that the xy plane coincides with the orbital plane. T h e n z = 0, and the
equations of mot ion become
Now, int roduce polar coordinates by
x = r cos ip> y = r sin ijj.
T h e n
x2 + y2 = r2 + r2ifj2.
These equations are a system of the second order, bu t the presence of
two constants of integration renders them fully equivalent to the system
(29), which is of the fourth order.
(29)
with
x — y =
r2 = x2 + y2.
Consequently, the integral of area and the vis viva integral may be
writ ten
rV = G,
(30)
from which it follows that , if the three equat ions (27) are mult ipl ied
successively by x, y, z and added,
dU
x x + yy + zz =
18 ELLIPTIC MOTION
T h e variable I/J occurs th rough its derivative only. Elimination of
if) yields
or
(31)
T h u s (31) may be wri t ten
= - V ^ f . (33)
gives
Th i s equation will be useful later on. In order to obtain the equation
for r in te rms of the angular coordinate ipy it is convenient to eliminate
t between (31) and the first equation of (30). T h e result is
dr dr dib
dt di/j dt
dr G
difj r2
(32)
T h e expression unde r the radical may be expressed as
Hence, if
(32) becomes
One further transformation,
or
MOTION IN PLANE 19
or
Th i s may be integrated:
I/J = arc cos a + y , (34)
y being a constant of integration. T h e expression (34) is equivalent to
a = cos (if; — y ) . (35)
But
Hence, if this expression for o is in t roduced into (35) there results
(36)
Th i s may be compared with the equation for r in polar coordinates if
the orbit is an ellipse with the origin at the focus. T h e n
(37)
co being the longitude of the perihelion, a the semi-major axis, and e the
eccentricity.
Wi th one slight further modification (36) becomes
1 cos (0 - y ) . (38)
Comparison of (37) and (38) then gives
y = co,
h 1 - e1.
(39)
20 ELLIPTIC MOTION
G = y W l - e2)
(40)
= V£p,
where p is the parameter ,
p = -
Subst i tut ion of this expression for G into the last equat ion of (39) gives
C = ~ T a - <41>
With the aid of the new constants of integration a, e> the integrals (30)
may be writ ten
r2ifj = VJip = v W l — e1),
T h e area of the entire ellipse is -nab, where b is the semi-minor axis, or
b == a V l — e2.
Let the period of revolution in the ellipse be P. T h e mean motion n is
defined as the mean rate at which the angle I/J increases dur ing a per iod;
f2 + f 2 iA 2 — \^
10. Kep ler ' s th i rd l a w . Since G = 2dA/dt, the area swept over
by the radius vector in the uni t of t ime is given by
(42)
n
Consequently, the areal velocity is
Th i s mus t equal the r ight -hand member of (42), from which
na2 = V / ^ >
F r o m these results it follows that the value of the integration constant
G in terms of the common constants a, e is
ECCENTRIC ANOMALY 21
or, with /x = n2az,
ndt = a—v*
{1 + ecos(0 - c o ) ) 2 '
the integration of which would furnish f as a function of I/J — to, from
which I/J as a function of t could be obtained. Al though this procedure is
possible, in order to accomplish our purpose it is bet ter to re turn to
Eq. (31) which, with the aid of (40) and (41) may be wri t ten
lia(l -e2)+2^r
With /x = a 3 « 2 this becomes
V - a2 + a2e2 + 2ar - r2
'a2e2 - (r - a) 2 ,
V ^ ( l — * 2) =
(44)
if / , the t rue anomaly, is in t roduced here by
/ = tfj - d > .
W e have not yet obtained an expression giving I/J or / as a function of t.
A direct attack upon this problem would be the subst i tut ion of (37)
into the first equat ion of (30), giving
or
/x = nW. (43)
Thi s is the analytical expression for Kepler ' s th i rd law.
11. T h e e c c e n t r i c a n o m a l y . So far, only three of the four
constants required for the complete integration of the Eqs . (29) have
been obtained. These three are ei ther
w, C, G,
or
to, a, e.
T h e equation for the orbit has been found in the form
T h e integration of this expression is elementary. Pu t
r — a — — ae cos u, (45)
u being an auxiliary variable named the eccentric anomaly. W e have
dr = + ae sin u du (46)
and
a2e2 — (r — a)2 — a2e2 sin 2 u.
Hence, the result of the transformation is
n dt = (1 — e cos u) du.
Thi s furnishes upon integration
u — e sin u — n (t — T), (47)
T being a constant of integration.
12. T h e m e a n a n o m a l y . F r o m Equat ion (44) for r in te rms of
the t rue anomaly, it follows that at perihelion
/ = 0 , r = a(l-e)\
at aphelion
/ = 7T, r = a (1 + e).
F rom (45) it follows also that
for u = 0, r — a (1 — e);
for u = TT, r = a (1 + e).
Hence the angles w and / are equal for
u = f = 0 and w = f = TT.
In (47) let 7 1 correspond to the instant of perihelion passage, and put
n(t-T) = I. (48)
Th i s defines the mean anomaly /, which by definition is zero at perihelion.
22 ELLIPTIC MOTION
or
na dt =
POSITION IN PLANE 23
For u = 77 it follows that / = TT.Hence all three anomalies are zero
at perihelion and equal 77 at aphelion. Equat ion (47) may now be wri t ten
u — e sin u = /, (49)
which is Kepler ' s equat ion.
T h e constant T occurr ing in (47) is the fourth constant of integration
required to define the motion in the orbital plane. In this form it would
be necessary to give the t ime of one perihelion passage. T h i s da tum,
together with the values of the constants of integration a, e, & and the
parameter / x , would define the motion in the orbit for any t ime. T h i s
form is frequently used for cometary orbits . For planetary orbits it is
more common to use as the fourth constant of integration the value
of / at the zero epoch. Let this epoch be t = t0. T h e n , if / = l0 for
t = t0, the expression for the mean anomaly is
l=l0 + n(t- g , (50)
n being given by (43) as a function of a and / x .
13. F o r m u l a s for o b t a i n i n g the p o s i t i o n in the o r b i t a l p l a n e .
In order to compute the position in the orbit at any t ime, it is necessary
to solve u from Kepler ' s equation, which is t ranscendental in u. After
u has been found, the radius vector may be obtained from (45),
r = a (1 — e cos u). (51)
In order to obtain / it is useful first to obtain r c o s / from (44), which
yields
r cos / =
cos / =
T h e n it is easily seen that
1 + cos / :
1 — cos / =
a (cos u — e),
24 ELLIPTIC MOTION
These may be wri t ten
T h e expressions (51) and (52) thus furnish the radius vector and the
t rue anomaly if the eccentric anomaly u is known. T h e r e is never any
question concerning the quadran t of / . T h e quadran t of f/2 is evidently
the same as that of u/2 since u and / go th rough 0 and IT s imultaneously.
Formula (52) is particularly useful for logarithmic calculation. T h e
factor (1 -f e)* (1 — e)~* rnay be simplified by introducing cp by
e = sin 99.
For elliptic orbits, therefore,
°<<P<2-
T h e n , if <p' = (TT/2) — cp,
1 + e I + sing?
= tan
Hence
For calculations with calculating machines rectangular coordinates
are frequently more convenient to use than polar coordinates. In order
to obtain the desired expression, introduce a coordinate system xy lying
in the orbital plane with the x axis in the direction of the perihelion.
T h e quotient of these gives
tan^ (52)
V 1 — e V 1 — sin 9
= cot = cot (
(53)
MOTION ABOUT CENTER OF MASS 25
T h e n , in t e rms of the eccentric anomaly,
x = r c o s / = a (cos u — e), (54)
y — r sin / = a V1 — e2 sin uy (55)
the former of which was derived above. T h e latter is found at once from
r2 — r2 c o s 2 / = a2 (1 — 2e cos u + e2 cos 2 u — cos 2 w + 2e cos w — e2)y
r2 sin 2 / = a 2 (1 — e2) sin 2 w.
T h e square root of the two members of this equat ion leads to no ambi -
guity of sign because
0 < u < 77 requires 0 < / < TT,
7T < U < 2TT requires n < / < 27r.
14. M o t i o n a b o u t the c e n t e r of m a s s . I t is desirable at this
stage to re turn to some of the equations that were obtained before those
for the relative motion (16) upon which the discussion in Sections 7
th rough 13 is based. I t was remarked in section 5 that the integration
of the two-body problem might be based upon either of Eqs . (14) or (15).
Compar ison of these equat ions with those of (16) shows tha t their
integration should lead to the same form of solution as tha t obtained
by the integration of (16). T h e essential difference is that /x should be
replaced by /xa for (14) and by FIB for (15).
F r o m (17) it appears that in order to express xa, ya, zay it is necessary to
replace a in the expression for x> y> z by
aa = H ; a>
ma + mb
and to change & to 6J + ?r. I n order to express xbi ybi zb) it is necessary
to replace a in the expression for xy yy zy by
a = _j a
ma + rnb
leaving all the other constants of integration unchanged. I n particular,
n remains unchanged. T h i s should give
n* al = H>a-
n2 d\ = pb
which are in agreement with the values for / x , / x a , / x b in t roduced.
26 ELLIPTIC MOTION
with similar expressions for the variables rj and £. Hence, if
U* = ^ » , (56)
Eqs . (2) and (3) may be wri t ten
™a L =
dU*
Mb I& -
dm
Mb I& -
^b
a t / * e m
mbr)b =
dVb
dU*
nib L =
dU*
Ha 9
nib L =
Kb
m a Va = -^r> m b Vb = -j^r' (57)
™a L
I t is impor tan t to note that [/* is a function of the six dependent
variables £ a , rjaj £a, £b, rjb, £ 6 , i.e., C7* is not a function of the velocity
components and the independent variable, ty is not explicitly present
in the expression for U*. As a consequence of these propert ies , the
following relation will hold:
• (58)
By mult iplying the six equations (57) successively by £ a , rjay £a, £6,
£ 6, and adding, the r ight -hand member of the sum will equal the r ight-
hand member of (58). Hence , the result may be wri t ten
™A (IJA + VaVa + LATA) + m b (£Jb + VbVb + TBTB)
Thi s equation may be integrated, giving
*« (t + Vl + O + \mb + v l + (I) =U* + C*, (59)
15. T h e e n e r g y i n t e g r a l . We now re turn to Eqs . (2) and (3), the
original equations for the problem of two bodies, with the purpose of
obtaining the vis viva integral and the integrals of area for these equa-
tions. Since in these equat ions
r2 = (ib - L ) 2 + (Vb - Va)2 + (£* - U 2 ,
it follows that
POTENTIAL ENERGY
C* being a constant of integration. Wri t ing
27
et + vi + ti = vt2>
the kinetic energy of the system, 7 1*, may be wri t ten
Hence the integral (59) may be wri t ten
T* — U* = C*. (60)
T h i s is the symbolic form of the energy equation, stating that the sum
of the kinetic energy, T*, and the potential energy, — [/*, is a constant .
T h e potential energy is counted from zero for infinite separation of
the two masses.
16. T h e p o t e n t i a l e n e r g y . In order to verify the preceding
statement , consider the amount of work that mus t be done in order to
increase the distance between two masses, ma and m&, from r 7 to r n
(rn > ri)- T h e work done if the distance is increased by an amoun t dr
is given by
D W DR.
r2
U p o n integration this becomes
which equals £/* for r = r 7 . Hence the potential energy is zero for
r = oo and is negative for a finite distance between the two bodies.
T h e function C7* is also called the force function of the system,
because the partial derivatives of [/* with respect to the coordinates
equal the components of the forces acting on the masses.
- k2mamb
For rn — oo the work done is
28 ELLIPTIC MOTION
A third name for £/* is the potential . A word of caution on the use
of the word potential is in order, since it is occasionally defined with
the opposite sign, especially in Ge rman works on physics.
17. C h a n g e to a c o o r d i n a t e s y s t e m w i t h the o r i g i n a t the
c e n t e r of m a s s . T h e energy integral obtained applies to any arbi-
trary coordinate system in which the Newtonian laws of mechanics are
valid. Suppose now that the origin be placed at the center of mass and
that therefore Eqs . (9) and (10) are used, with
x a — £a £g> J a ~ Va ên z a = Ca tg>
Xb ~ £b ig> Jb = Vb Vo' Zb ~ £g>
and
I t follows that
If now
we obtain
[<h* + A ) ,
- (oc2t + /?2), Vg
lg ( a 3 * + AO-
£a — xa
Va = ya
£b — xb '
i b = yb-
U = zb • £ a — z a
vi = *l + yl + K>
y*2 — J / 2 j
a a (*lxa + a 2 J a + a 3 * a )
- ( a ^ & + oc2yb + OC3ZB) -f
b
"X2 K + AL + a 3 ) >
_ ( a 2 + a 2 + a 2 ) <
ORIGIN AT CENTER OF MASS 29
Hence,
T*= T** + l K + m & ) F 2 . (62)
T h e energy integral for the system having the origin at the center
of mass will be
T** - U* = C**, (63)
since £/* is independent of the origin. Compar ison of T* and 7 1 **
by (61) or (62) gives
C* = C * * + i K + m 6 ) F 2 ,
showing that the total energy in the system £, 17, £ differs from tha t in
the system having the origin in the center of mass by one half thesum
of the masses mult ipl ied by the square of the velocity of the center of
mass.
I t is of interest to compare the integral (63) with that obtained for
the relative motion (26). T h e relations (17) give, upon differentiation,
Hence ,
T h e n , if
T** = imaVl + lmtVl,
it follows that
r = ^ + 2 K k ) K H H 4 (61)
the second par t of the expressions for F * 2 , F * 2 canceling each other
because
maxa + mbxb = 0, maya + mbyb = 0, maza + mbzb = 0.
If Vg is the velocity of the center of mass in the RJY £ coordinate system,
we have
30 ELLIPTIC MOTION
If this is in t roducted into (63) the result is
while (26) gave
Th i s , then, is the total energy in the motion in a coordinate system the
origin of which coincides with the center of mass.
c .
I t is seen that the two expressions are identical, provided that
Since the integration of the equations of the relative mot ion gave
it follows that
18. T h e i n t e g r a l s of a r e a . If the first of Eqs . (2) is mult iplied
by — rja and the second by +£ f f l , .we obtain, by addition,
= k2mamh (64)
Similarly, if the first of Eqs . (3) is mult ipl ied by — rjb and the second
by we obtain, by addit ion,
= k2mamb (65)
T h e r ight -hand members of the two equat ions, (64) and (65), are
opposite. Hence, by addition,
MaiLVa — VaL) + ^bUbVb ~ Vbtb) = 0.
REFERENCE TO ECLIPTIC 31
T h i s equat ion may be integrated:
MaiLVa — yJa) + ^bi^blb ~ Vb€b) = Cz • (66)
By the same procedure , two similar integrals are obtained:
Jla) + MbiVbtb - Lib) = C* 1 >
(67)
(68)
In this form the integrals (66), (67), and (68) express the constancy
of the moment s of m o m e n t u m about the three coordinate axes. Again,
it may be emphasized that these integrals hold in an arbitrary coordinate
system in which the Newtonian laws of mot ion are valid. I t is left to
the reader to examine the constants CFY CGY C£ in a coordinate system
having its origin in the center of mass, and also to compare the integrals
that apply in this case with the integrals of area (19) obtained in the
problem of relative motion.
T h e formulas derived in the preceding sections permi t the calculation
of the position in the relative orbit at any t ime in a coordinate system
the xy plane of which coincides with the plane of the orbit , the x axis
being directed toward the perihelion. I t is assumed that the three
elements ay e, l0 are known, and that in addit ion the parameter / x =
k2{ma + mh) is given.
T h e requi rements of practical calculation go beyond this . W e mus t
develop a procedure for obtaining the position in a coordinate system
having the same origin, may as the xy system, bu t an arbi trary orientation.
T w o part icular cases occur frequently in problems of planetary motion.
19. C o o r d i n a t e s r e f e r r e d to the e c l i p t i c . Let the coordinates
be designated by # ( e ) , y i e \ z(e), and let the x{e)yie) plane coincide with
the ecliptic for a given date, say 1950.0, and the xie) axis be directed
toward the mean vernal equinox of that same date.
For convenience of later applications we shall consider the more
general case in which the coordinate z is not necessarily zero. T h e change
from x, y, zy to x(e), y{e\ z l e ) can be accomplished by a linear t ransforma-
tion
^(E) = pie)x + Qle)y + Rle)g
X ~ X X
y ,(E) = P I E ) * + Q(e)y + R(e)S
Y 1 & Y J 1 Y '
(69)
•(E) = p(e)x + Q(e)y _|_ P ( E ) 5 .
32 ELLIPTIC MOTION
T h e nine vectorial orbital constants for the ecliptic are to be found as
functions of the three elements, / , Q> <JJ that define the orientation of
the xyz system relative to that of the xie)y(e)zie) system. T h e nine con-
stants are the nine direction cosines of the axes x> y, z relative to the
axes in the ecliptic sys tem; for example, the three direction cosines of
the x axis are Px™, Py
{e\ P 2
( e ) .
F I G . 3. Direction cosines.
F r o m this relationship, it follows at once that the relation may be
inverted:
X = P^x(e) + P<eye) _|_ Pl&zie),
y = Qfx™ + Q{
y
e)yie) + 0 < e , * ( e \
X 1 Y - / 1 z 7
and that the following relations will hold,
P(e)2 i Q(e)2 i P(e)2 _ 1
X ' X^X 1 X •
p(e)2 + p(e)2 + p(e)2 = 1 ?
P(e)Q(e) _L p(e)Q(e) i p(e)Q(e) — Q
a; ^ 03 ' 1/ ^ 2 / 1 z ~ z '
P(e)P(e) i 0 ( e ) 0 ( e ) 4 - Rie)Rie)
= 0
(70)
(71)
as well as all the other similar relations that may be writ ten down by
a cyclical change of x, y, z, P , Q, R.
T h e direction cosines may be obtained by spherical t r igonometry by
considering the intersections of the coordinate planes on a uni t sphere
with mn as center.
REFERENCE TO EQUATOR 33
P(e) _
X
cos CO cos Q — sin co sin Q cos / ,
n(e) =
X- X
— sin co cos Q — cos co sin Q cos / ,
n e ) = sin Q sin / ,
pie) —
y
cos co sin Q + sin co cos Q cos / ,
Qie) =
X-> y — sin co sin .Q + cos co cos Q cos / ,
V
— cos Q sin / ,
p(e) _
z
sin co sin / ,
QT = cos co sin / ,
i ? « e . = cos / .
20 . C o o r d i n a t e s r e f e r r e d to the e q u a t o r . Le t the coordinates
be designated by xy yy z and let the xy plane coincide with the mean
equator for a given date, and let the x axis be directed toward the mean
vernal equinox of the same date. T h e s tandard equator and equinox
of 1950.0 are used extensively for calculations of orbits of minor planets
and comets.
If the orbital elements Q'y CD' are referred directly to the s tandard
equator and equinox, the vectorial orbital constants for the equator ,
(72)
Suppose that we wish to obtain the direction cosines of the xy yy z axes
relative to the x{e) axis.
I n the spherical triangle xie)Nxy x{e)N = Qy Nx = co, <£ xie)Nx =
180° — / . T h e cosine formula gives
A
pie) — c o s x(e)x — c o s ^ c o s Q — s i n w $'mQ cos / .
x
I n the spherical tr iangle xie)Nyy x{e)N = Q, Ny = 90° + co, # ( e ) iVy =
180° — / , and the cosine formula gives
Q[x = c o s x ( e ) y = — s u i 6 0 c o s ^ — c o s 6 0 s ^ n * ^ c o s I*
In the spherical triangle x{e)Nzy x{e)N = Qy Nz = 90°, <£ * ( e ) JV2 =
90° — / , the cosine formula gives
== cos # ( e ) # — sin 42 sin / .
X
T h e remaining six direction cosines may be obtained similarly. T h e
complete set of nine vectorial orbital constants for the ecliptic is
34 ELLIPTIC MOTION
Px, Qx, Rx> etc., may be obtained from the formula for Px
e\ Qx
e), Rx
e\
etc., by merely subst i tut ing / ' for / , Q' for Q, and a/ for a>. I t is common,
however, to give the elements / , £?, OJ referred to the ecliptic. If this is
the ecliptic of the same date as that of the equator and equinox of the
coordinate system x9 y, z, it is evident that the change from xie\ yie), z{e)
F I G . 4 . Ecliptic and equatorial coordinates.
z(e)
F I G . 5. Orbital and ecliptic coordinates.
to x9 y, z may be obtained by a rotation of the x{e)yie)zie) system about
the x{e) axis, clockwise by the angle e, the obliquity of the ecliptic. Th i s
transformation is
x — x^^
y == ye) c o s e (̂e) s j n e >
z = yie) sin e + z(e) cos e.
REFERENCE TO EQUATOR 35
p x
= n e ) .
= QR>
Rx = ^ e ) .
Py = P ( e ) cos e
y
— P ( e ) sin e,
Qy = Q < e ) cos e - Qif s i n € j
Ry = R(e) COS 6
y
— P ( e ) sin e,
2 7
= P ( e ) sin e
y
+ P< e ) COS 6,
Qz = sin e + COS €,
= # l e > sin e
1/
+ P ( e ) COS €.
' 2
T h e complete formulas are, therefore,
p« cos co cos Q — sin co sin Q cos 7,
= — sin co cos Q — cos co sin Q cos 7,
= sin Q sin 7,
Py = (cos CD sin Q + sin co cos Q cos 7) cos e — sin co sin 7 sin e,
Qy = (— sin co sin Q + cos co cos Q cos 7) cos € — cos co sin 7 sin e,
Ry = — cos .Q sin 7 cos e — cos 7 sin e,
PZ = (cos co SIN Q + sin to cos Q cos 7) sin € + sin co sin 7 cos e,
Q* (— sin co sin Q + cos co cos Q cos 7) sin e + cos co sin 7 cos e,
= — cos sin 7 sin e + cos 7 cos e.
(73)
Since in an elliptic orbit z = 0, the actual formulas needed for the
calculation of a position are
x = Pjc+ Qxy,
y = Pyx+Qyy, (74)
= Pzx +
I t is not u n c o m m o n to introduce the eccentric anomaly directly into
these expressions. If we define
Ax = aPx, Bx = a VT^~e*Qx,
Av = aPv, Bv = a V T ^ Q V , (75)
A, = aPz, Bz = a W ^ Q Z ,
Consequently,
36 ELLIPTIC MOTION
the rectangular equatorial coordinates may also be obtained from
x = Ax (cos u — e) + Bx sin w,
y — Ay (cos u — e) + By sin uy (76)
z — Az (cos w — e) + £ 3 sin u.
Since the equatorial vectorial constants are direction cosines, the
same relations (71) hold among these constants as among the ecliptic
constants . Among the A's and B ' s the corresponding relations are
Al + A\ + A\ = a\
Bl + Bl + Bl=a*{\-e% (77)
AXBX + AvBy + ,4 A = 0.
I t is sometimes impor tant to have expressions similar to (76) for the
derivatives. T h i s requires the expression of x, y in te rms of the eccentric
anomaly. Wi th the aid of
an n
u = — = --,
r 1 — e cos u
it follows that
a2n . an sin a
x = — a sin uu = sin u = —
r 1 — e cos u '
y = a \/\ — e2 cos uu — \ / l — e2 cos w =
an Vl — ^2 cos w
1 — e cos w
Hence
x = ^ (— Ax sin w + ^ cos w),
j> == ^ (— Ay sin w + By cos w), (78)
z = — (— sin w + cos u).
I t is hardly necessary to emphasize that n should be expressed in radians
per uni t of t ime , and that the uni t of t ime used in expressing the
derivatives should be the same as that used in n.
MATRICES 37
2 1 . I n t r o d u c t i o n of m a t r i c e s . T h e expressions for the vectorial
orbital constants in t e rms of the elements show so m u c h regularity in
s t ructure that it is not difficult to devise an effective scheme for the
calculation. Nevertheless , there are advantages in a different procedure
tha t avoids the use of spherical t r igonometry altogether. T h i s procedure
involves the use of matrices, in t roduced into the mathematical l i terature
by Cayley. Matr ix algebra is an impor tan t b ranch of mathemat ics . W e
shall not treat it as such, bu t shall give here only so m u c h as is necessary
for the convenient t ransformation of coordinates.
For our purpose , a matr ix is to be considered as a rectangular array
of number s (denoted here by symbols) which is operated upon by the
use of simple rules.
Le t two matrices be given:
< < 4 *I * I
a = < < K K
< < < K K
the superscr ipt of any e lement denot ing the row, the subscript denot ing
the column.
T h e produc t ab is defined by
C = ab =
a\b\ + a\b\ + aft a\b\ + a\b\ + a\b\ a\b\ + «ft +
a\b\ + ^bl + albl a\b\ + a\bl + alb\ aft + aft + a®
a\b\+a\b\ + alb\ a\b\ + a\b\ + a\b\+a\bl + albl
tha t is, if the general e lement of a is denoted by a[, that of b by b{9
of C by c[9 the general e lement of ab is
(79)
T h e essential feature is that the rows of a are mult ipl ied by the columns
of b.
I t is not necessary that two matrices to be mult ipl ied each have the
same n u m b e r of rows and co lumns , bu t it is necessary that a has as
many columns as b has rows. T h u s ,
a\ a\
b\
a\b\ + a\b\
2 2 1
b\
= a\b\ + a\b\
< < 1 a\b\ + a\b\
38 ELLIPTIC MOTION
the product ab having invariably as many rows as a has, and as many
columns as b has. If the n u m b e r of columns of a is different from the
n u m b e r of rows of b> the p roduc t ab is not defined. So it happens easily
that the product ab may be defined, while ba is not. Even when ab and
ba are bo th defined, they are generally different from each o ther ; tha t
is, multiplication of matrices is in general not commutat ive .
If the rows and columns of a matr ix are interchanged, the new matr ix
is said to be the transpose of the old one. T h e transpose of a will be
denoted by a'. T h u s , if
a =
1 1
a2
< -\
< -\
then
< < <
< < <
Although multiplication of matrices is not in general commutat ive ,
it is associative, as in ordinary algebra. T h a t is,
abc = (ab)c = a(bc).
22 . C h a n g e of o r d e r in a p r o d u c t of m a t r i c e s . I n the applica-
tions to follow it will often be convenient to change the order in a
p roduc t of matrices. I t may be verified by means of the rule for mul t i -
plication that
ab = (b'aj, (80)
abc = (c'b'a'Y = (b'a')'c = a(c'b')' = [(bc)'a']' = [c'(ab)r\\ (81)
By application of these rules, matrices may be mult ipl ied in any order
that proves convenient, provided of course that the number s of rows
and columns are such as to make the multiplication possible.
23 . R o t a t i o n m a t r i c e s . Let two Cartesian systems x, y, z and
x\ y\ z\ having the same origin be given, and suppose the direction
cosines are known. T h e n
x' = P'xx+Q'J + R'xz,
yf = P'yx+0'yy + R'yz> (82)
z' = P'zx+Q'J + R'zz.
ROTATION MATRICES 39
Assume that the system x\y\ z' is rotated counterclockwise about
the x axis through an angle a, as seen from the positive end of the x'
axis. T h e new coordinates being x'\ y", z", we may write
x" = P'±x+Q'£ + R'fr
y" = P'y'x+Q'y'y + R'y'z, (83)
z" = P'z'x+Qf
z'y + R'z'z.
T h e relations between the coordinates x , y', z' and x'\ y'\ z" are
y" = y' cos OL + z' sin a,
z" = — y' sin a + z cos a.
If this is subst i tu ted into the left-hand member of (83), there results
with the aid of (82),
P" = P',
X X'
P'' = P' cos OL + P' sin a,
V
P" P' sin a 4 - P ' cos a,
= Q'y C 0 S a + S i n a>
0 * = - Q ; s i n a +Q'zcos a,
i ? " = R' cos a + R! sin a,
2/ ^ ' Z '
i?' sin a + i?' cos a.
Thi s transformation may be performed by the multiplication of two
matrices:
P" P" P"
x y z
Qx Qy Qz R" R" R"
X V Z
P' P'
x y
PL
Q' Q' Q'
x z~y *~ z
R' R' R'
1 0 0
0 cos oc — sin OL
0 sin OL cos OL
40 ELLIPTIC MOTION
Also, the transformation from x', y\ z to x'\ y'\ z" is given by
x y
1 0 0
0 cos a — sin a
0 sin a cos a
or equivalently,
1 0 0 x'
/ ' = 0 cos a sin a /
0 — sin a cos a z'
T h e first of these alternative forms is the one we shall use in what
follows. With this unders tanding, that the coordinates to be transformed
are wri t ten to the left of the matr ix effecting the transformation, we
shall denote the latter by p(oc):
p(oc) =
1 0 0
0 cos a — sin a
0 sin a cos a
which corresponds to a positive (counterclockwise as seen from the
positive end of the axis) rotation about the x' axis th rough an angle a.
Similarly, a positive rotation about the y' axis corresponds to the
transformation
x = x cos a — z sin a,
y" = y\
z" = x' sin a + z cos a,
or
x" y" z"
cos a 0 sin a
0 1 0
— sin a 0 cos a
and we put
q(oc) =
cos OL 0 sin a
0 1 0
— sin a 0 cos a
II * ' / * ' ii
| x" y" z' z'
GENERAL ROTATIONS 41
Again, a positive rotation about the z' axis corresponds to the t rans-
formation
x" = x' cos a + y' sin a,
y" = — sin a + y' cos a,
z" = z\
or
# y z x y z
cos a — sin a 0
sin a cos a 0
0 0 1
and we pu t
»•(<*) =
cos a — sin a 0
sin a cos a 0
0 0 1
I t is seen that
P ( ~ <*) = />'(«)> = tf'(°0> r (— a) = r ' (a).
24. G e n e r a l r o t a t i o n s of c o o r d i n a t e s y s t e m s . Any rotation
whatever of a coordinate system may be decomposed into a succession
of rotations about the various axes. Consider, for example, the t rans-
formation of coordinates in the system xy y, z to the equatorial system
Xy y, z tha t was treated in Section 20. T h i s t ransformation may be
accomplished as follows:
1. A rotation — co about the z axis, r (— co),
2. A rotation — I about the x' axis, p (— 7),
3. A rotation — Q about the z" axis, R (— Q)>
4. A rotation — e about the # ( e ) axis, p (— e).
T h e transformation is then
\\xyz\\ =
\xyz\\
cos co sin co 0
-sin co cos co 0
0 0 1
1 0 0
0 cos I sin I
0 -sin I cos /
cos Q sin Q 0
-sin Q cos .Q 0
0 0 1
1 0 0
0 cos € sin e
0 -sin e cos e
42 ELLIPTIC MOTION
In a numerical application, the last four matrices would be multiplied
first, yielding the vectorial constants in matrixform, which would then
be used as required. T h u s ,
P P P
Qx Qy Qz
Rx Ry Rz
= r(-u>)p(-I)r(-Q)p(-e),
x y z \\ = \\ x y z
P P P
x x *• y z
Qx Qy Qz Rx Ry Rz
T h e computat ion of a search ephemeris is given as an example of the
use of these rotation matrices.
Computation of an Ephemeris
I t is required to calculate a search ephemeris for Periodic Comet
Harr ington, using the orbital elements given in I .A.U. Circular N o . 1713,
as follows:
T 1960 June 28.8327 co 232?8391
e 0.559273 1211 9?1327
a 3.590373 / 8?6838
n = ka 3/:
0.01720209895
(3.590373) 3 / 2
n° = 57?29577951 x n = 0.°1448754
e° = 57P29577951 x e = 32?0440
Equinox 1950.0
= 0.002528553
a V l - e* = 3.590373 x 0.828984 = 2.976360
p p
x x x y
Q* Qy Qz
Rx Ry Rz
cos co sin co 0
-sin co cos co 0
0 0 1
1 0 0
0 cos I sin I
0 -sin / c o s /
cos Q sin Q 0
-sin Q cos Q 0
0 0 1
1 0 0
0 cos € sin e
0 -sin e cos €
GENERAL ROTATIONS 43
-0 .60406 -0 .79694 0
+0.79694 -0 .60406 0
0 0
x
1
1 0 0
0 +0.98854 +0.15098
0 -0 .15098 +0.98854
-0.48683 +0.87349 0
-0.87349 -0 .48683 0
0
-0 .60406 -0 .78781 -0 .12032
+0.79604 -0 .59714 -0 .09120
0 -0 .15098 +0.98854
0 1
1 0 0
0 +0.91744 +0.39788
0 -0 .39788 +0.91744
-0 .48683 +0.87349 0
-0 .87349 -0 .48683 0
0
1
0 1
0 0
+0.98222 -0.14411 -0 .12032
+0.13362 +0.93682 -0 .09120
+0.13188 +0.07350 +0.98854
+0.98222 -0 .08434 -0 .16772
+0.13362 +0.94163 +0.30897
+0.11388 -0 .32589 +0.93617
Check:
1 0
0 +0.91744 +0.39788
0 -0 .39788 +0.91744
0
0 +0.91744 +0.39788
0 -0 .39788 +0.91744
'\\x,y, * l l = I I * , y, 0||
ZP2 = ZQ* = ZR2 = 1
ZPQ = ZQR = ZRP = 0
P P P
Qx Qy Q,
R~ R„ R~
= \\a (cos u — e), a\/l — e2 sin w, 0||
P P P
x x y M z
Qx Qy Q,
Rx Ry Rz
44
|cos u — e, sin u, 0| |
= ||cos u — e, sin u, 0| |
ELLIPTIC MOTION
aP„ aP„ aP,
a VT^QX aVV^Qy aVY^Q,
+3.52653 -0 .30280 -0 .60220
+0.39768 +2.80266 +0.91960
-q, £|| = \\p cos a cos 8, p sin a cos 8, p sin 8||
= \\x,y,*\\ + \\X,Y,Z\\,
where X, Y, Z, the geocentric equatorial coordinates of the sun, may
be taken from the Astronomical Ephemeris. Then
tan a — VIE
tan 8 =
P = Vie + V2 + H
r = V(x2 +y2 + z2)
= a (1 — e cos u).
t 1960 June 5.0 June 15.0 June 25.0 July 5.0 July 15.0 July 25.0
t - T - 2 3 . 8 3 3 - 1 3 . 8 3 3 - 3 . 8 3 3 + 6.167 + 16.167 + 26.167
I — 3.4528 - 2.0041 - 0 . 5 5 5 3 + 0 . 8 9 3 4 + 2.3422 + 3.7910
u — 7.803 - 4 .540 - 1 . 2 5 9 + 2.028 + 5.306 + 8.563
cos u + 0.9907 + 0.9969 4-0.9998 + 0 . 9 9 9 4 + 0.9957 + 0.9889
cos u — e + 0.4314 + 0.4376 + 0.4405 + 0.4401 + 0.4364 + 0.4296
sin u - 0.1358 - 0.0792 - 0 . 0 2 2 0 + 0.0354 + 0.0925 + 0.1489
X + 1.4673 + 1.5117 + 1.5447 + 1.5661 + 1.5758 + 1.5742
Y — 0.5112 - 0.3545 - 0 . 1 9 5 0 - 0 . 0 3 4 1 + 0.1271 + 0.2872
z - 0.3847 - 0.3364 - 0 . 2 8 5 5 - 0 . 2 3 2 5 - 0.1777 - 0.1218
X 0.2764 + 0.1106 - 0 . 0 5 8 4 - 0 . 2 2 5 7 — 0.3866 — 0.5368
Y 0.8957 + 0.9264 + 0 . 9 3 1 1 + 0.9095 + 0.8625 + 0.7911
Z 0.3884 + 0.4018 + 0.4038 + 0 . 3 9 4 4 + 0.3740 + 0.3431
t + 1.7437 + 1.6223 + 1.4863 + 1.3404 + 1.1892 + 1.0374
+ 0.3845 + 0.5719 + 0 . 7 3 6 1 + 0.8754 + 0.9896 + 1.0783
i + 0.0037 + 0.0654 + 0.1183 + 0 . 1 6 1 9 + 0.1963 + 0.2213
a 0 M 9 . m 6 l M 7 . r a 6 l M 5 . m 4 2 M 2 ! n 6 2 * 3 9 . m l 3 h 0 4 . m 4
8 4 - 0 ° 0 7 4 -2° l l ' + 4°05' + 5°46' + 7°14' + 8°25'
P 1.7856 1.7214 1.6628 1.6091 1.5595 1.5126
r 1.6007 1.5887 1.5829 1.5836 1.5909 1.6048
At perihelion r = q = a(l — e) = 1.582376.
POLAR COORDINATES 45
25 . U s e of p o l a r c o o r d i n a t e s . For analytical developments it is
useful to obtain expressions for xie), y(e), z i e ) in t e rms of the polar
coordinates in the orbital plane, r a n d / . In these expressions the orienta-
tional e lements co, Q, and I will be retained. W e begin with
*<e> = p<e> r cos / + Qx
e) r s in/ ,
ye) = pie) r cos / + Q{
y
e) r s in / ,
z(e) = p(e) r c o s / + r s in/ ,
in which the explicit expression (72) for the P ' s and Q's mus t be sub -
st i tuted. I t is then found that
xie) — (j cos /cos co — r s i n / s i n to) cos Q
— (r cos / sin co + r s i n / cos co) sin Q cos 7,
ye) — ( r cos /cos co — r sin /s inco) sin Q
+ (r cos / sin co + r sin / cos co) cos Q cos 7,
#<e) = ( r cos / s in co -f- Y sin / c o s co) sin I.
These may be wri t ten
#(e) = r cos ( / + co) cos Q — Y sin ( / + co) sin Q cos 7,
ye) = r cos ( / + co) sin Q + r sin ( / + co) cos £? cos 7, (84)
(̂e) _ r s i n _|_ ^ sin I.
In x(e) and y e ) a further combinat ion of t e rms can be made by making
use of
cos 2 - + sin 2 - = 1, cos 2 - — sin 2 ^ = cos 7,
which produces
x(e) __ r c o s 2 ^ c o s (y -f- ,Q) -)_ r sin 2 ^cos ( / + to — Q),
ye) = r cos 2 ^ sin ( / + to + Q) — Y sin 2 ^ sin ( / + co — 13),
#(e) _ R s i n / sin ( / + co).
I t is frequently useful to express the a rguments in t e rms of
v = / + co + Qy the true orbital longitude;
to = co + Qy the longitude of the perihelion;
Q , the longitude of the ascending node.
46 ELLIPTIC MOTION
T h e expressions become
y
*<e>
.(e)
.(e) r cos 2 - cos v + r sin 2 - cos (v — 2Q),
r cos 2 ^ sin ^ — r sin 2 ^ sin — 2Q)y
r sin / sin (v — Q).
(85)
T h e a rgument occurr ing in £ ( e ) , / + OJ = v — Q, is called the a rgument
of the lat i tude.
If B is the lati tude of the planet above the xie)yie) plane, then
in agreement with the expression for zie) in formula (85).
T h e angle v is counted as the sum of the arcs xie)N + NP, a slight
inconvenience which could be remedied by adopt ing a depar ture point O '
in the orbital plane such that ON = x{e)N = Q. T h e use of a depar ture
point in the orbital plane is common in certain forms of planetary
theory. However, in such cases the orbital plane is in constant motion,
and the definition of a depar ture point is not as simple a mat te r as
equat ing the arcs O'N and x{e)N. I t will be found to depend upon the
solution of a differential equation.
26. R e d u c t i o n to the e c l i p t i c . Finally, it may be desired to
obtain t rue polar coordinates, i.e., the position of the planet may be
defined by the three coordinates r, V9 By in which V is the arc xie)Q.
If the inclination is small, it is evident that at all t imes V — v is small.
Th i s quanti ty is known as the reduct ion to the ecliptic. An analytical
solution of this problem is obtained from the formula in the spherical
triangle NPQ,
z(e) — r s j n ft
T h e spherical triangle NPQ on the uni t sphere gives
sin B = sin / sin ( / + co)
= sin / sin (v — Q),
tan(V-Q)
For brevity, pu t V — Q = p, v
cos / tan (v — Q)
• Q = q, and pu t
J3 =
cos / or
CALCULATION OF ELEMENTS 47
T h e equation then has the form
1 - / 3
tan/) = Y^Tptm
which is t reated in chapter II and has the solution
p - q = — (j8 sin 2q - \ fi2 sin 4q + ^ jS3 sin 6q — ...)
or, after subst i tut ion of the original quanti t ies for p> q, /?,
V-v = - tan 2 !- sin 2(v -Q)+\ tan 4 ^s in 4(v - Q) - \ tan 6 ~ sin 6(v -Q) + ...
2 2 3 2 (86)
For small values of I this formula for the reduction to the ecliptic
is rapidly convergent and in many applications only one or two te rms
of the expansion are required.
27. C a l c u l a t i o n of the e l e m e n t s f r o m the c o o r d i n a t e s a n d
v e l o c i t y c o m p o n e n t s at a g i v e n t i m e . In Section 20 the rectangular
coordinates and velocity components have been obtained from the
orbital e lements . T h e reverse problem mus t frequently be solved:
given y, z, x, y, z for a particular epoch, find the elements of the
orbit.
In order to simplify the calculations, it is desirable to eliminate FI
from the formulas. By inspection of Eqs . (16) it is seen that this can
be accomplished by using, instead of T> a variable R such that
V / x DT = DR.
Equat ions (16) then become
(87)
and / x cannot appear in any of the resultsderived from these equat ions.
Since the change from T to R as independent variable is equivalent to
pu t t ing FX = 1, all the formulas that apply to the relative motion in the
problem of two bodies derived in Sections 6 th rough 13 apply to Eqs . (87)
provided the subst i tut ion / x = 1 is made .
For orbits of asteroids and comets ma represents the mass of the
sun, which is taken as unity, and mh is pu t equal to zero; consequently
/ x = k2.
If the uni t of t ime in which T is expressed is the mean solar day, the
introduct ion of the variable r amounts to the use of a uni t of K ' 1 mean
48 ELLIPTIC MOTION
solar days. T h e numerical value of k is given in Section 31 . If the
velocity components were obtained with the uni t of t ime w mean solar
days, these values mus t first be divided by wk or, more generally, by
w\/Jx. In the following it is assumed that xy y, z have been divided by
the proper factor, and are therefore expressed in the uni t of t ime
required to eliminate / x from the formulas.
T h e problem is conveniently resolved into two parts:
(a) T h e calculation of the elements a, e, / 0 , which define the position
in the orbital plane, relative to the perihelion of the orbit . T h e first
step in the calculation is to compute for the given date, t>
r2 = x2 + y2 + z2
y
V2 = x2 + y2 + z2,
rf = xx + yy + zz.
T h e following relations are then used:
- = - - V\ (88)
a r y
e sin u = a Kf> (89)
e cos u = rV2 — 1, (90)
/ = u — e sin u. (91)
T h e mean daily motion is found from
n = Vv<a*>2 (92)
in the general case, or
n = ka-*12 (93)
for minor planets and comets. Finally,
l0 = l - n ( t - tQ) (94)
furnishes the mean anomaly at t — t0.
Equat ion (88) is the vis viva integral (26) with JJL — 1, C = — \i\2a.
Equat ion (89) follows from (46); (90) results from (45) which may be
writ ten
r f cos u = 1 —
a
in which r/a is replaced by
r
a
which follows from (83).
= 2 - rV2,
file:///i/2a
CALCULATION OF ELEMENTS 49
(96)
— Ax
= x (a/r) cos u — xaz/2 sin w,
aPy = Ay = y (a/r) cos u — yazl2 sin w,
aPz = A = z (a/r) cos u — zaz/2 sin w,
bQx
= BX = x (a/r) sin u + xa3/2 (cos u
bQy = By — y ( A L R ) s m u + ya*/2 (cos u -e),
= BZ = z (a/r) sin u + zazl2 (cos u
(97)
with
b = a cos <f> = a V l — e2.
These formulas follow from (76) and (78). For example, the first of
each set gives
Ax (cos u — e) + Bx sin u = xy
— Ax sin u -\- Bx cos u = xra1/2
y
if in the expression for x the factor r/an is replaced by ral12. Solution
of Axy Bx gives
Ax (1 — e cos u) = x cos u — xra1/2 sin w,
Bx (1 — e cos u) = x sin u + tfra1/2 (cos w — e)y
which may then be b rough t into the form of Eqs . (96), (97).
Finally, the orbital elements co, Qy I may be computed from the vec-
torial orbital constants by the formulas
sin I sin co = — Py sin e + Pz cos 6,
sin I cos co = — Qy sin e + £?z cos e,
which give 7 and co, and
sin Q = ( P y cos co — sin co) sec e,
cos Q = Px cos to — (Jz sin co,
I t may be remarked that the integral of area (25), which with /x = 1
becomes
a (1 - e2) = r2V2 - (rf)2
y (95)
could be used, with (88), to obtain a and e. T h e disadvantage is that
this would lead to an evaluation of e from 1 — e2
y which gives a weak
determinat ion of e if the eccentricity is small. T h e use of (89) and (90)
for the determinat ion of e and u avoids this loss of significant figures.
(b) T h e calculation of the orientational elements co, Qy L T h e con-
stants AxyAyyAzy BxyByyBz and subsequent ly the vectorial orbital
constants Pxy Pyy Pzy Qxy Qyy Qz are obtained from
50 ELLIPTIC MOTION
which give Q. T h e s e relations follow from the explicit expressions (73)
for the vectorial orbital constants .
T h e relations (77) may be used as a check on the constants A and B
or the corresponding relations (71) on the P ' s and Qys. Of these the most
useful is
PxQx + PyQy + PzQt = 0.
A complete check of the elements a>, Q> I would be obtained by com-
put ing the vectorial constants in matr ix form (see Section 24) after
these angles have been found.
28. A c c u r a c y of the e l e m e n t s . I t is of interest to examine the
determinat ion of e and u from Eqs . (89) and (90). T h e r e is an apparent
loss of accuracy due to the circumstance that bo th these equations have
the eccentricity as a factor. T h e max imum numerical value of the r ight-
hand members of these equations is e. Hence , if e is small, these r ight-
hand members will have fewer significant figures than the coordinates
and velocity components used in obtaining them. T h e loss of significant
figures is t ransmit ted th rough tan u to u.
Suppose that the r igh t -hand members are computed to six decimal
places. The i r max imum error is, let us say, e = 5 X 10~ 7 . T h e max imum
error in tan u and in u is therefore e - 1 e . For example, if e = 0 .01,
(89) and (90) give tan u to only four decimal places, and the uncer-
tainty in u becomes 5 X 10~ 5 radians = 0?003 — 10", while the
uncer ta inty that would normally be expected with the use of six decimal
places is of the order 0 " 1 .
T h e uncer ta inty in w, of the order e~xe, is carried forward into / and
affects any angle counted from the perihelion. I t is a reflection of the
fact that the position of the perihelion is indeterminate in the case of
circular orbits, and that the determinacy of the perihelion may be said
to be proport ional to the eccentricity of the orbit . T h i s does not, how-
ever, affect the accuracy with which the position of the planet is deter-
mined in space, as may be verified from the expressions
u = I + e sin «, rja = 1 — e cos w,
which show that the deviation from uniform circular mot ion is expressed
by t r igonometr ic functions of u that have the eccentricity as a factor.
I t is important , however, to use u in the evaluation of / in (91) and in
the calculation of the orientational elements (96), (97) to the full
n u m b e r of decimal places desired in the elements.
A similar uncer ta inty occurs in the evalution of co from (98) if / is
small, and this is carried forward into the evaluation of Q by (99).
CONSTANTS FOR EQUATOR 51
If either e or I or bo th are small, the use of the full numbers of significant
figures in u and co will assure that the sum of the angles Q + to + I,
the mean orbital longitude, is accurately determined, regardless of the
fact that all three of the angles, Q, co, and /, are individually known with
considerably lower accuracy. T h e significance of these considerations
may be illustrated by the expressions in polar coordinates (85).
29. C o n s t a n t s for the e q u a t o r . F r o m (84) the corresponding
expressions for the rectangular equatorial coordinates are readily
obtained:
x = r cos ( / + co) cos Q — r sin ( / + co) sin Q cos 7,
y = r cos ( / + co) sin Q cos e + r sin ( / + co) (cos Q cos I cos e — sin I sin e),
z = r cos ( / + co) sin Q sin e + f sin ( / + co) (cos Q cos I sin e + sin I cos e).
These formulas may be pu t into the form
x = r sin a sin (A + / + co),
y = r sin b sin (B + / + co),
z = r sin c sin (C + / + co),
if sin a, sin b, sin c and A, B, C are de termined as follows:
sin a sin A = cos Q,
sin a cos A = — sin Q cos 7,
sin 6 sin B = sin .Q cos e,
sin 6 cos B = cos cos 7 cos e — sin 7 sin e,
sin c sin C = sin ,Q sin e,
sin c cos C = cos i3 cos 7 sin e + sin 7 cos e.
Finally, if
A' = A +co ,
75' = £ +co ,
C - - - C + CO,
the expressions become
x = r sin # sin (^4' + / ) ,
y = r sin b sin (7?' + / ) ,
z = r s'm c sin (C" + / ) .
These formulas have been used extensively in the past for logari thmic
calculations. T h e constants sin a, sin b> sin c, A\ B\ C are referred to
as the constants for the equator .
52 ELLIPTIC MOTION
30. E x p r e s s i o n s in t e r m s of in i t ia l c o o r d i n a t e s a n d v e l o c i t y
c o m p o n e n t s . Let x0,y0,z0 be the coordinatesand x0yy0,z0 the
velocity components at t = t0 in the relative motion in the two-body
problem. T h e solution of the problem is
x = x (*), y = y (*), z = z (t).
T h e object of this section is to find such expressions with x0, y0, z0> x0)yQy
z0 as constants of integration.
For short intervals of t ime it may be expected that these functions of
the t ime may be developed in Taylor ' s series
x (t) = x0 + x0(t — t0) + ^xQ (t — tQf + ...
and similar expressions for y and z.
Let the differential equations
x = — jjixr*3, y = — fJLyr~3, z = — / x ^ r - 3 ,
be wri t ten
x = Fxxy y = Fxyy z = Fxz.
T h e n , by successive differentiation,
x — Fx x + Fx x = F2 x + G2 x
"x = F2x + (F2 + G2)x + G2x
= (F2 + G2FX) x + (F2 + G2)x=Fzx + G,x.
T h i s may be cont inued indefinitely. T h e same functions FP9 Gp arise
with y and z. Subst i tu ted in the Taylor ' s series there results:
x = f(t) x0 +g(t)xQi
y=f(t)y0+g(t)y0, (100)
* = f(t) * o + * ( 0 *o-
F r o m their formation it is evident that f(t)y g(t) are obtained as functions
of the t ime in the form
00
1=0
(101)
00
g(t) = 2 W - foY>
3=0
in which the coefficients a j y bj are originally expressed as functions of
INITIAL CONDITIONS 53
/xr" 3 and its successive derivatives, all for t = t0. By differentiating the
expressions (100) twice it is found tha t
x=f*o+ 8xo
= ~ J
r l x = - 7 s (/*o + £*<))•
T h u s / and g mus t satisfy the equations
/ + /x/r- 3 = 0, g+iipr* = 0.
T h e y are, therefore, part icular solutions of these differential equat ions
with the initial condit ions, at t = tQi
/(0) = + 1, g(0) = 0,
/ (0) = 0, g(0) = + 1, ( 1 0 2 )
which follow from the expressions (ICO) and their first derivatives.
A simple me thod for obtaining the expansions of / and g in power
series in t — t0 requires obtaining first the coefficients ck in the expansion
00
4 - = X c * e - '«)*•
T h e coefficients ck are obtained by repeated differentiation of /xr~~3
at t = t0. T h e derivatives may all be expressed in t e rms of r 0 , r0 by
making use of
iw = i(**+# + **)
= + J'2 + z2 + + + zz
= — —.
T h e energy integral gives
whence
T h e result ing expressions for ck are
' 1 = ~ 3 / x r - 5 ( r 0 r 0 ) ,
54 ELLIPTIC MOTION
a0 = + 1,
ai =
a2 =
0,
1 3
aB =
<*4 =
a5 =
3 7
+ 8 ^ 0 "
1 2 7-
a6 =
7 -1 2 1 7 -
M i + + ^ 3 - 2 + ...]» (103)
and similarly for g>
- + q ^ V i + ^ , - 2 + . - ] . (104) B3+2
T h e values of a0, aiy b0, bx follow immediately from the initial condi-
tions (102) of / , gy and their first derivatives. T h e results are
<2 = + y F o W - j m V +
35 15
= - y / ^ W + W ( V o ) - y A o 7 f l _ 1 ( ¥ o ) .
<4 = + y F o n ( ¥ o ) 4 - y / i V W + ^ V ^ V o ) 2
31 15
+ - —nhfcr1 + —fih-7a-2.
If this series is subst i tu ted into
/ + /Lt/r-8 = 0, i ' + j ^ = 0,
replacing / , £ by the series (101), there results
X 0' + *) 0' + 2)*, + a (t - *0)' - - X - *o)fc X < l ~ *oY>
Equat ing the coefficients of (t — t0)
j on both sides gives
INITIAL CONDITIONS 55
An example of the usefulness of the series expansions of the functions
/ and g arises in the problem of determining the orbit of a newly dis-
covered object. T h e calculations yield the heliocentric coordinates and
velocity components for a date near the mean date of the observations.
T h e next step might be the calculation of the elements, such as by the
me thod in Section 27 of this chapter . However , frequently the most
immediate need is the calculation of an ephemeris for a period of a
few weeks or mon ths . I t is evident that the series expressions for the
functions / a n d g may serve to advantage for this purpose . T h i s calcula-
tion can be made before the evaluation of the elements other than the
semi-major axis, which is present in the expressions for the coefficients.
Another use of the series is in the start of a numerical integration.
T h e form in which the coefficients a and b are given renders the
series for / and g applicable to any conic section. If the series are applied
to an elliptic orbit , a modified form may have advantages. If the nota-
tion D for djndt is in t roduced, the differential equations for / and g
may be wri t ten
T h e Taylor ' s series expansions become power series in / — / 0 ; i.e.,
increments in the mean anomaly instead of in t — t0. I n particular, the
expansion of # 3 / r 3 becomes
DJ + ^f=0, D*g + £g = 0.
If now
1 P ••
h = 0,
b. = 0,
K
f* 'o '
/ * V o -
56 ELLIPTIC MOTION
C 0 = +P3,
Cl = - *P%
CT= + \P\I + W ) - \ P \
C3= - ^ 6 9 ( 3 + 7^) + 8/»'?>
C 4 = + y /»'(! + 14? 2 + 21q*) - ±/>8(31 + 233?*) + 2p\
These te rms can easily be verified by not ing that
Dp* = —jp'+1q,
Dq> = P - pq* + p*).
Precisely the same recurrence relations hold between the coefficients
A, B, C in the expansions of / and g as were found for a, b, c. T h e
final result is
/ = 1 - M - HF - p*q(l - l0f
P*(l+5q*) pV(3 + 7?')
, / > 9 ] ( ' - ' o ) M 08(19 + 140?2) -;i + i v + 2
g = i - h •p\i-hf-
j/>5(l + 5? 2) ^ ( 3 + 7^) ,P7qW-lo)«-
For circular orbits p = 1 and q = 0, and the expansions reduce to
expansions of cos (/ — / 0 ) and sin (/ — / 0 ) respectively.
I t is necessary to obtain in this form, in addit ion to a, n = f j L l l 2 a ~ s l 2 .
Closed forms for the functions / and g may be derived by making
use of the fact that these functions are applicable also to the
coordinate system X, Y in the orbital plane, with the X axis directed
toward the perihelion. T h e n
X(t)=f(t)X0+g(t)X0,
Y(t)=f(t)Y0+g(t)Y0.
GAUSSIAN CONSTANT 57
T h e solution of / and g from these equat ions is
/ ( / ) = G - W o - YXJ9
g(t) = G~\YX, - XY0),
in which
G — X0Y0 — Y0X0i
= |>ifl(l - e 2 ) ] 1 / 2
= na\\ - e2)1'2
— nab.
I t is now convenient to express / , g in t e rms of the eccentric anomaly
by us ing
X = #(cos u — e)y X = — an sin w(l — e cos w) - 1 ,
Y = a ( l — e 2 ) 1 / 2 sin w, y = + <ro(l — e 2 ) 1 / 2 cos w(l - e cos w)"1.
Le t w 0 be the eccentric anomaly at t = £ 0, w that at t = t. T h e n it is
easily found that
XY0 — y X 0 = abn[cos (u — u0) — e cos u0] (1 — 6 cos « 0)
_ 1>
y Z 0 — Z y 0 == 0&[sin (u — w0) — d?(sin u — sin w0)].
T h e result is
/ =
= 1 -
g =
3 1 . T h e G a u s s i a n c o n s t a n t . I t was remarked in Section 2 that
the numerical value of the gravitational constant depends on the uni ts
of mass, t ime, and distance chosen. Keple r ' s th i rd law,
k2(ma + mh) = n2a3,
or, with n = ITTJP,
a?
k\ma + mb) = 4TT2^ ,
58 ELLIPTIC MOTION
is a simple relation between mass, t ime, and distance that may con-
veniently be used in practical applications of elliptic motion to find
any one of these three quantit ies when the other two are known. T h e
units of mass, t ime, and distance might be arbitrarily chosen, and then
k might be determined by observation. In particular, it might seem
logical to adopt for these uni ts the gram, centimeter, and the ephemeris
second, as is done in physics. Such a choice of uni ts would be excessively
inconvenient, however. If the gram were taken as uni t of mass, the
mass of the earth would first have to be determined in grams (which is
a geophysical, not an astronomical, problem), and then transferred to
other bodies. T h e result would run to a large n u m b e r of figures, and
no beneficial result would ensue; there is no practical problem in celestial
mechanics that requires knowledge of the mass of any body in grams.
Greater difficulty would be encountered with the uni t of distance.
Distances between bodies in the solar system can not be measured
directly in centimeters with m u c h precision. For example, the distance
from the earth to the sun in centimeters is known to only three or four
significant figures, and not as the result of a direct measurement b u t
of derivation by complicated processes.
Purely as a mat ter of convenience, therefore, theunits are chosen
such that the masses, t imes, and distances involved in astronomical
problems can be easily measured and expressed. In many applications
of celestial mechanics to the motions of bodies in the solar system, the
uni t of mass chosen is the mass of the sun, and all other masses are
expressed as fractions of it. T h u s , if ma is the mass of the sun, we adopt
ma = 1 and write m = mb/ma. T h e n
k2(l + m) = n2a\
T h e unit of t ime chosen is the ephemeris day, because the periods
of bodies in the solar system can be easily and precisely measured in
ephemeris days. T h u s , in the expression of Kepler ' s third law, P is
commonly measured in ephemeris days and n in radians per ephemeris
day.
Fur the rmore , owing to the difficulty of measur ing distances, distance
is not defined in absolute physical uni ts or any arbitrary units , bu t k is
chosen arbitrarily instead. T h e value of k is 0.017 202 098 950 000 000,
and this n u m b e r is known as the Gaussian constant . T h e uni t of distance
is therefore a derived one, obtained from the Gaussian constant and
the units of mass and t ime. I t is called the astronomical unit , often ab-
breviated to a.u. T h e dimensions of k are then M ~ 1 / 2 L 3 / 2 7 1 - 1 .
I t is impor tant to r emember that the definition of the astronomical
uni t is given by Kepler ' s th i rd law with the stated numerical value of ky
the sun 's mass as the uni t of mass, and the ephemeris day as the uni t
NOTES AND REFERENCES 59
of t ime. It is often stated that the astronomical uni t is the distance from
the sun to a body of infinitesimal mass revolving in a circular orbit
with a period of 3 6 5 . 2 4 + ephemeris days, bu t this is a description
only, not a definition. Even as a description it leaves something to be
desired.
As a mat ter of historical interest it may be noted that Gauss in tended
to choose k such that the uni t of distance would tu rn out to be half
the major axis of the ear th 's orbit . Actually, because Gauss used a
value for the ear th 's mass that we know to be too small by about
7 % , it tu rns out that half the major axis of the ear th 's orbit , as derived
from Kepler ' s th i rd law with Gauss 's value of k, is 1.000 000 03 a.u.
M u c h confusion has been caused by careless usage of the words semi-
major axis and mean distance in astronomical l i terature. In fact, Kepler ' s
th i rd law loses its geometrical significance when a th i rd body of appre-
ciable mass is in t roduced in the system. In such cases the law should
be looked upon as what it is in fact, merely a definition of the uni t of
distance. For example, owing to the dis turbing influence of the other
planets, an ellipse that has a semi-major axis of about 1.000 000 2 a.u.
gives a bet ter approximation to the ear th 's actual orbit than does the
one derived from Kepler ' s th i rd law.
N O T E S A N D R E F E R E N C E S
T h e epicyc l ica l t h e o r y o f H i p p a r c h u s w a s t h e first a t t e m p t t o represent t h e
m o t i o n s of the p lane t s b y a p u r e l y m a t h e m a t i c a l theory , w i t h o u t re ference to
phys ica l causes ; in f o r m it r e s e m b l e s the d e v e l o p m e n t s in t r i g o n o m e t r i c ser ies ,
w h i c h are still i n d i s p e n s a b l e to t h e subjec t , and w h i c h o c c u p y m u c h of th is book .
Y e t there is a vast d i f ference b e t w e e n t h e empir ica l representa t ion b y H i p p a r c h u s
and his s u c c e s s o r s a n d the m o d e r n representa t ion b y t r i g o n o m e t r i c ser ies that
satisfy t h e e q u a t i o n s of m o t i o n . In t h e f o r m e r t h e a m p l i t u d e s a n d p h a s e c o n s t a n t s
of all the e p i c y c l e s w e r e o b t a i n e d i n d e p e n d e n t l y f r o m observat ion; in t h e latter
the n u m b e r of avai lable cons tant s is s tr ict ly l i m i t e d . U n t i l the t i m e of K e p l e r
it w a s c u s t o m a r y to represent the annual parallax of a p lanet b y o n e e p i c y c l e a n d
the eccentr i c i ty of the orbit b y another . In K e p l e r ' s tabulae Rudolphinae, 1621, the
e q u a t i o n of the cen ter is d e r i v e d for the first t i m e b y an el l ipt ic formula , a n d the
m o d e r n w a y o f t rans forming f r o m h e l i o c e n t r i c to g e o c e n t r i c p o s i t i o n s w a s in tro -
d u c e d .
K e p l e r w a s led to his t h e o r y of e l l ipt ic m o t i o n b y analys is o f observa t ions of
M a r s , the o n e p lanet w i t h a suff ic iently large eccentr i c i ty and rapid m o t i o n to
require , in h i s day, m o r e than o n e e p i c y c l e to represent the eccentr ic i ty . H i s
f a m o u s e q u a t i o n has rece ived m o r e a t tent ion t h a n any o ther in a s t r o n o m y .
T h e first ful ly a d e q u a t e t r e a t m e n t of e l l ipt ic m o t i o n w a s b y N e w t o n in t h e
Principia, 1687, e m p l o y i n g geometr i ca l m e t h o d s . A deta i l ed analytical t r e a t m e n t
w a s g i v e n b y E u l e r in Theoria motuum planetarum et cometarum, \1AA, w h o also
under took the first ca lculat ion of the m u t u a l per turbat ions of Jupi ter and S a t u r n
in accordance w i t h N e w t o n ' s law of gravi tat ion.
CHAPTER I I
EXPANSIONS IN ELLIPTIC MOTION
1. I n t r o d u c t i o n . T h e methods presented in Chapter I suffice for the
calculation of the coordinates of a planet in an elliptic orbit at any t ime
from the elements of the orbit . For various applications in celestial
mechanics it is desirable to have available methods that will permi t the
expansion of the coordinates and functions of the coordinates in an
elliptic orbit in periodic series. In the movement in an ellipse all finite
and continuous functions of the coordinates re turn to the same values
after a complete revolution. Such functions are therefore developable
in periodic series in t e rms of any continuously increasing angular
variable that increases by 2TT dur ing a complete revolution. T h e angular
variables that are of particular interest in this connection are the mean
anomaly, /, the eccentric anomaly, w, and the t rue anomaly, / . These
are not the only arguments that may be considered; other arguments
have actually been used in some applications. T h e functions that present
themselves most naturally are either even or odd periodic functions of
these variables, giving rise to either cosine series or sine series. Since
it is frequently more convenient to operate with exponential series than
with tr igonometric series, it is useful to be familiar with the develop-
ments in the exponential form.
2 . E x p a n s i o n s in a F o u r i e r s e r i e s . Let f(x) be a periodic function
with the period 2-TT in x, the expansion being
00 OC
(1)
1 1
T h e n
(2)
f(x) sinpx dxy p = + 1, + 2 ... oo.
60
f(x) cos px dxy
In
f(x) dx}
FOURIER SERIES 61
a0 -- f(x) dx,
o
71
f(x) cospx dx, p = + 1, + 2 ... 0 0 .
(3)
If f(x) is an odd periodic function,
oot
f(x) = 2 ^ p sin px,
i
(4)
f(x) sinpx dx, p = + 1, + 2 ... 0 0 .
o
In the exponential form, let
+00
/ (*) = 2 A » E v i x '
—oo
(5)
where E is the base of natural logari thms, a n d i2 = — 1; then
*/(*) £ - 3 , i a : ^ , £ = — 0 0 .» + °°. (6)
T h e integral expressions for the coefficients Av, as applied to real
functions / ( # ) , are to be looked upon merely as an abbreviation for the
corresponding expression
271
f(x) (cos px — i sin px) dx.
o
Hence
A0 — ciQi
Ap = av ibp,
A-v = ap + ibp, p = + 1, + 2, ... °°.
(7)
If the periodic series is expressed in the form
+00 +00
/ (*) = X a p c ° s + X b'psin p X y (g)
—oo —oo
If f(x) is an even periodic function,
00
f(x) = a0 + 2 X av
c o s Px>
i
62 ELLIPTIC EXPANSIONS
one single relation,
*LP = a'P = <V (9)
b'_p = - b'v = - bv,
A , = "P-*',> (10)
applies for all values of p> positive,negative, and zero.
Expansions in terms of the exponentials may be writ ten in a par t i -
cularly convenient form by the introduct ion of
E" = A, E™ = Y", E*' = 0 , (11)
so that
2 cos / - A + A~\ 2 cos u = Y + Y~\ 2 c o s / = 0 + &-\
2i sin / = A — A-1, 2i sin u = Y — Y'1, 2i sin / = 0 —
3 . T h e true a n o m a l y e x p r e s s e d in t e r m s of the e c c e n t r i c
a n o m a l y . T h e equation relating the t rue anomaly to the eccentric
anomaly is
so that
and (12) becomes
Since /? as a function of e is obtained as the root of a quadrat ic equation,
there are two solutions, the second of which, fi' = e - 1 ( l + V I — e2)>
tan ; (12)
For the development of the solution it is desirable to introduce /? by
e = sin cp = (13)
(14)
(15)
with
TRUE ANOMALY 63
|8
If (12) is wri t ten in the exponential form and the notation (11) is
used, the equation becomes
(16)
(17)
(18)
F r o m this equat ion <J> may be expressed in te rms of fi and Y by
which may be wri t ten
By taking the natural logarithm of both sides of (18) there results
if = iu + In (1 - ^ y - 1 ) - l n (1 - j8Y)
= iu - OSY-1 4
f = u + 2 (p sin u sin 2u 4 sin 3M + ... j . (19)
will not be considered. I t is seen that if e is small, /? is approximately
\e, while /?' = is approximately 2 e _ 1 .
For further applications the following expansions of powers of j8 in
powers of e are useful.
64 ELLIPTIC EXPANSIONS
Equat ions similar to (12) or (15) occur also in quite different fields.
For example, the result obtained in this section was used in the derivation
of the development in series of the reduction to the ecliptic in Section 26
of Chapter I .
4 . T h e m e a n a n o m a l y e x p r e s s e d in t e r m s of the t rue a n o m a l y .
T h e expression of u in te rms of / , or of y in te rms of 0y may be subst i tuted
into Kepler ' s equat ion in order to obtain / expressed in te rms of / .
Kepler ' s equation,
may be wri t ten
/ = u — e sin w, (22)
I — u —
F r o m (21) we obtain
and developed with the binomial theorem as
Y - r - 1 = (1 - /32) (0 - p&2 + )8 2 0 3 - fi*0* + ...
- 0-1 + P0-* - £ 2 0 - 3 + £ 3 0 - 4 - ...)
= 2/(1 - £ 2 ) (sin / - p sin 2 / + jS2 sin 3 / - ...).
(23)
(Y - Y-1)
Thi s may be writ ten
Y - r - 1 = (1 - / S 2 )
= ( l - £ 2 ) -
4
y — y - 1 = —
(Y- y - 1 ) .
Th i s relation may be inverted since (15) is not changed if fi is changed
to — j8 while u a n d / a r e interchanged. Consequently, (19) becomes
u =f~2 ( jSsin/ (20)
T h e relation corresponding to (18) is
(21)
the final result may be writ ten
/ = / - 2[fi(l + coscp) s i n / - fi2(\ + coscp) sin If + j3 3(^ + coscp) sin 3 / - . . . ] .
(25)
5. I n t r o d u c t i o n of B e s s e l f u n c t i o n s . Al though the relations
obtained in the preceding sections are of considerable usefulness, of
more general importance is the development of functions in periodic
series in te rms of the mean anomaly. T h e first problem to be considered
is the solution of Kepler ' s equation in the form of an expression for u
as a Fourier series in te rms of the mean anomaly.
F r o m
u — I = e sin u\
it is evident that u — / is an odd periodic function of u and therefore
of /. I t can therefore be expanded in a Fourier series
00
e sin u = 2 ^ 6S sin si,
I
so that
- ± r .
7 7 J 0
By integrating by parts , it is found that
b„ = f — e sin u cos si -\- — [ cos si die sin «)1
L STT ^STT) v y J u = 0
T h e first part is zero and the second part may be writ ten, by subst i tut ion
of u — / for e sin w,
I f * .
b. = — e sin u sin si dL
BESSEL FUNCTIONS 65
Subst i tut ion into (23) gives
(fi s i n / - £ 2 sin If + £ 3 sin 3 / - ...). (24)
In this expression the expansion (20) of u in te rms of / m a y be subst i tu ted .
Since
: cos<p = V l — £2>
cos J/ -f cos <iw.
66 ELLIPTIC EXPANSIONS
so that
and
(28)
OO j
e sin u = 2 ^ - Js(
se) s m SU (29)
CO i
u = I + 2 X - JsM sin (30)
i s
I t is necessary to s tudy in some detail the propert ies of the Bessel
functions of the first kind, defined by (27); these may be writ ten
71
cos (stp — x sin cp) dep. (31)
Consider the integral
cos (scp - x sin cp) dtp -
271
sin (scp - x sin cp) dep.
I t is seen tha t for cp — 2TT — cp',
sin (scp — x sin 99) = sin (2sn — scp' -{- x sin 9 ' )
= — sin (scp' — x sin cp').
Hence the value of the integrand of the second integral for <p' = 2n — cp
is equal but opposite to the value of the integrand for cp. T h e integration
between 0 and TT is therefore canceled by that between 77 and 277. Conse-
quent ly an expression equivalent to (31) is
571
£-i(s<p-x shop) Jy, (32)
Again the first integral disappears, and the second integral, upon
subst i tut ion of u — e sin u for /, becomes
1 C31
bs — — cos (su — se sin u) du. (26)
SIT J o
In accordance with the conventional notation for Bessel functions, let
1 CN
Js(se) = — cos (su — se sin u) du, (27)
7 7 J o
it is seen from (33) that Js(x) is the coefficient of Esi<p in the Four ier
expansion
+00
£ix sin <p = ^ JJ^ £ s t > (34)
s=—oo
If now 5: is in t roduced by
so that
2i sin g> = z ,
T z
(34) may be wri t ten
+00
E(xl2Hz-l/z) = ^ ^ ^ (35)
s=—00
Consequently, / s ( # ) is the coefficient of # s in the expansion of
F(x, z) = £<*/2><*-i/*> (36)
in a series of positive and negative powers of z.
Thi s series for F{x, z) may be obtained by the multiplication of two
exponential series:
F(x, z) = Exz/2 • E~x/2z
T h e subst i tut ion
gives
OC - j3 = s
(37)
(38)
F(x, z)
BESSEL FUNCTIONS 67
(33) £-si<p . Eixsin<P d(p,
If this is wri t ten
68 ELLIPTIC EXPANSIONS
has only positive powers of x/2, as is evident from the two exponential
series that were mult ipl ied in order to obtain F(x, z).
For application to the expansions in elliptic motion x may be considered
real and s an integer.
Since F(x, z) does not change if x is changed to — x while z is changed
to —zy it follows from (38) that
U - x) = ( - 1)' Js(x). (40)
Similarly, F(xy z) does no t change if z is changed to — z'1, which
furnishes
J-S{x) = ( - 1)' Js(x). (41)
T h e combinat ion of (40) and (41) gives
/_s(- x) = Js(x). (42)
F r o m these relations it follows that only positive values of the index s
need be considered. For positive values of s the series (39) may be
writ ten
the numerical value of which is less than uni ty regardless or the value
of Xy provided that /3 is chosen sufficiently large. Hence the series (43)
for Js is absolutely convergent for all values of x. I t may also be concluded
that the series (38) for F(xy z) is absolutely convergent regardless of
the value of xt for all values of z excluding 0 and oo. Th i s follows from
(43)
T h e ratio of any te rm to the preceding one is
In the double series (37) the lower limit of summat ion for s has been
chosen as — oo instead of that corresponding to a = 0, which would
be equivalent to s + /? = 0. Since for negative values of s + j8 the
factorial (s + /?)! is infinite, it is seen that extending the summat ion
to s = — oo is permissible. T h i s absence of t e rms for which s + P is
negative is seen to be equivalent to the fact that the series in powers
of x/2y
(39)
BESSEL FUNCTIONS 69
if the notation
is in t roduced. Equat ing the coefficients of zs on both sides results in
\ L/.-i(*) - = ;;(*)• (45)
F r o m (44) and (45) it is seen that
Js-i(x) = £ /,(*) + /;(*),
7«+i(*) = ^ /«(*) - J » -
(46)
(47)
Changing s to s — 1 and s + 1, respectively, in (44) gives
J six) + J8+i(x)
or, with (46),
Js-4*) =
Differentiation of (35) with respect to x gives
the fact that F(x, z) has no poles in the complex plane except for z = 0
and z = oo.
Differentiation of (35) with respect to z gives
(
1 +00 +00
1 +7i)2/ s(*)* s = 2*./sM*s-1-
* ' -00 -00
By equat ing the coefficients of z8"1 on both sides of this equality thereresults
\ Us-M + Jm(*)] = * Ux). (44)
70 ELLIPTIC EXPANSIONS
Thi s procedure may be cont inued, and leads to the conclusion that
any Bessel function with a rgument x and with index s - f k> k being an
arbitrary positive or negative integer, may be expressed as a linear
combination of Js(x) and J's(x) with coefficients that are functions of the
a rgument x. An equally impor tant proper ty is that Js(x) satisfies a
linear differential equat ion of the second order with variable coefficients.
Th i s differential equation may be obtained by differentiating (45) and
eliminating / s - i ( * ) a n d J*+i(x)y
W i t h the aid of (45), this may be writ ten
[/s_2(*) - 2Js(x) + / s + 2 ( x ) ] ,
and, with (47),
Js(x) - U*) -
or
/;(*) - Js(x) = 0. (48)
Th i s differential equation may be used to define the Bessel functions.
T h e mathematical theory shows that , in addit ion to the Bessel function
of the first kind that was int roduced in the preceding section, another
class of functions satisfies this differential equation. For the applications
in this chapter only the Bessel functions of the first kind with real
a rgument x and with integral index s are needed.
An interesting proper ty of the functions Js(x) may be obtained by
multiplying the two series
£<*/2><«-i/,> = ; o W + + jz(x)z* + ; 3 ( ^ 3 + _
- J^z-i + Mx)*r* - Mx)ar* + ...
and
+ Ux)^1 + A W * " 2 + / s W * " 8 +
the coefficients of which have been expressed as Bessel functions of the
argument + x and positive index with the aid of (40), (41), and (42).
APPLICATIONS 71
T h e product of these two series is found to be
1 = t / o W ] 2 + 2[ / , (*)] 2 + 2[J2{x)Y + 2[Ux)f + ...
» (49)
= t / o W l 2 + 2 % [Js{x)Y.
1
For a real a rgument x all functions / s ( ^ ) for integral values of the
index are real. Hence it follows that no te rm in the r ight -hand member
of (49) can exceed unity. Consequent ly ,
l / 0 ( * ) l < i ; < 1/V5T ^ o .
6* A p p l i c a t i o n of B e s s e l f u n c t i o n s . T h e function Epiu, developed
in an imaginary exponential series in te rms of /, will give rise to the
expansion
Eviu = ^ AsEsiK (50)
—oo
T h e coefficients As follow from
(51)
Eviu d{E~sil).
]^{se\ s ^ 0. (52)
For s = 0, (51) gives
71 Eviu dl
£viu £-8x1 ft A* =
Integrat ion by parts gives
T h e first part in the r ight -hand member disappears, and the second part
may be wri t ten
]?-i[(s-p)u-se s i n u ] fiu
By the definition (32) this may be wri t ten
T h e exponentials in the integrand give rise to contr ibut ions different
from zero only if / > , / > + 1, or/> — 1 is zero. T h e following different
cases mus t therefore be considered:
P = 0, AQ = l;
P = ±h A0=-el2; (53)
P ^ 0 , ^ ± l , Ao=0.
T h e results (52), (53), giving
+oo t -J
E«* = A0 + p% - JrJse) E«l,
—CO
or
(54)
the symbol Z' indicating that the te rm for s = 0 is omit ted, may be
applied to t r igonometric expansions. Since (50) may be wri t ten
+00 +00
cos pu + i sin pu = ^ As cos si + i ^ As sin sly
— CO —00
it follows that
+00
cos pu = ^ As cos si
— 00
+oo f 2
= A o + P X ' 7 Js-p(se)cos sl> (55)
-oo
the constant te rm of which is determined by (53). T h e terms with equal
arguments of opposite sign may be combined:
Y* = AQ + p -Js.v(se)A\
[Js-P(se) - Ja+9(se)] cos si, (56)
cos pu — A0 + p cos si
72 ELLIPTIC EXPANSION
4 . =
) i u _ £ d > - l ) t u j d U t
or, since
dl = (1 — e cos u) du
APPLICATIONS 73
A0 being given by (53). Also
+00
sin pu = 2) As sin si
—oo
+oo f |
= P X 7 s i n 5/> ( 5 ? )
-oo A
which may be writ ten
0 0 rl 1 l
smpu = p | j / 8 - J , (w) - — j JS-V(- sin 5 /
= * X 7 + sin rf. (58)
1 A
For p — \ the special cases are
£ + 0 0 , j
COS ft = — - + ̂ ~ Js-l(s*) C0S ̂
—00
00 1
L X S
which, by (45), becomes
e ^ l
C O S M = — - + 2 2 ^ - J's(
se) c o s ^ ; (59)
z 1 ^
also,
+00 j
sin w = 2) _ Js-i(se) s m sl
-00 ^
= X 7 Us-i(se) + Js+i(se)] sin 5 / ,
1 A
which, by (44), may be writ ten
2 0 0 1
sin u = - 2) ~ /s(^) s i n (60)
These results could have been obtained, of course, by int roducing
instead of (50) the t r igonometr ic Fourier series for cos pu> sin pu.
T h e expansions of Eviu, cos pu, and sin pu are of great importance
in applications to problems in celestial mechanics. M a n y functions of
the coordinates in elliptic motion are readily expressed in periodic
74 ELLIPTIC EXPANSIONS
A0 = B0
(63)
Similarly, by (56),
00
f(u) = a0 + 2 2) « p c o s P w (64)
gives
00
f(u) = c0 + 2 cs cos ^/ (65)
with
1 0 0
c* = ~ X Pav[J*-v(se) - J8+P(se)l
C0 = aQ — eaV (66)
Also, by (58),
oc
f(u) = 2y£bp sin />u (67)
gives
00
/(«) = 2 X d»sin rf» (68)
= 7 X PhUs-vW + / , + » ] . (69)
A + S-x).
series in terms of the eccentric anomaly. T h e series (54), (56), (58) can
then be used to change to series expressed in terms of the mean anomaly.
For example, consider
+00 +00
/(«) = X B,E»» = %B9Y>. (61)
p=—00 —00
T h i s can be expanded, by application of (54), as
+ 00 +00
/(«)=£ AsE
sil =2^yl8> (62)
s=—00 —00
with
s ^
APPLICATIONS 75
/„(*) = i -
m -
m -
m
m -
m -
m =
m -
T h e following developments obtained from the general series (43) will
be useful for further applications.
76 ELLIPTIC EXPANSIONS
(b) T h e ratio between the radius vector and the semi-major axis,
with (59) becomes
= 1 — e cos uf
I sin 3/
(71)
sin 5/ sin 41
or
J's(se) cos sly (72)
cos /
cos 3/
(73)
cos 5/
T h e two cases jus t treated are examples of developments in which
the coefficients may be obtained as finite expressions in te rms of the
Bessel functions. Th i s is t rue for only a very limited n u m b e r of functions
of the coordinates in elliptic motion. Among those for which infinite
series of Bessel functions appear as coefficients, one of the most impor tan t
is the development of / in terms of /.
A few examples will now be worked out in de t a i l :
(a) T h e solution of Kepler ' s equation was found by (30):
00 j
u = / + 2 X - JM s i n si. (70)
1 s
With the expansions for Js(x) given in the present section it is then found
that , to the seventh power of e,
APPLICATIONS 77
In order to expand this logari thm in positive and negative powers of Yy
let C and x be in t roduced by
1 _ * y _ * y - i = q i - * Y ) ( 1 -xY-1)
= C(l + x2) - CxY - CxY-\
Thi s gives
Cx = ^ C(l+x2) = \f
f=l +
| sin 31
(75)
?6 sin 61 e1 sin 11.
(d) T h e logari thm of the radius vector. T h e problem is the expression
of
= In
(c) T h e equation of the center. T h e expression (19) f o r / i n t e rms of
u gives by application of (67)
CO -I 00
f = u + 2%-X PvUs-vM + Js+vM] sin si.
S = l * p=l
If the expression for u in t e rms of / is in t roduced, the series for / becomes
/ = 1 + 2 X 7 + X PUrJ.") + J«JL«)]\ (74)
S = l * ' P=l )
Wri t ten more explicitly this becomes
/ = / + \ Ui(e) + fl/oto + JM] + W - i t o + JM + •••} sin /
+ \ (U2e) + RU2e) + J3(2e)] + p[Jtfe) + Jt(2e)] + ...} sin 2/,etc.
In this series the Bessel functions and the various powers of f3 may be
replaced by expressions in powers of e. T h e result is, to the seventh
power in e,
78 ELLIPTIC EXPANSIONS
1 + * 2 2 "
Th i s is the same relation as that found between /3 and e in Eq . (13).
Hence
= i ( i + vT=l») .
T h e problem is then reduced to the expansion of
In - = - In (1 + £ 2 ) + In (1 - j3Y) + In (1 - ^SV-1)
In
(76)
- l n ( l + J 8
2 ) - / 3 r
= - l n ( l + £ 2 ) - 2 cos pu
W i t h the aid of ( 6 4 ) this becomes
In - In (1 + |32) + ej8
CO j 00
s=l
s
p=i
T h e constant t e rm in this expression, in te rms of e, is
T h e series may be expanded and yields
ln^
cos /
cos 3/
(77)
or x e
APPLICATIONS 79
An impor tant proper ty of the four functions that have been developed
in Fourier series with multiples of / as a rguments and power seriesof e
as coefficients is that the lowest power of e occurr ing in the coefficient
of a sine or cosine t e rm equals the mult iple of / in the argument . T h e
power series further progress in powers of e2 so that in a coefficient of
the cosine or sine of an odd argument only odd powers of e occur, and
in the coefficient of a te rm with an even a rgument only even powers of e
occur. Th i s proper ty is closely related to the propert ies of the expansions
of the Bessel functions. I t was first emphasized by d 'Alembert . For this
reason it was called by E. W . Brown the d 'Alember t characteristic.
I t is seen that this same proper ty will also apply to functions of the
types
(f) Expansion of (r/a) s i n / = yja — V l — e2 sin u. T h e series (60)
for sin u mus t be mult ipl ied by
£im(f-l) cos m(f — /), sin m(f — /),
n and m being positive or negative integers. On the other hand, in the
development of Fourier series in the mean anomaly of functions of the
types
Eimi\ cos mj sin m/,
the relation between the lowest power of e in the coefficient and the
multiple of / in the a rgument will be destroyed. T h e relations that
exist can however be discovered, as may be shown in the following
examples:
(e) Expansion of (r/a) cos / = x/a = cos u — e. T h e series (59) for
cos u gives, neglecting powers of e above the seventh,
cos / =
V l - e2 = 1
Extensive tabulat ions of series in elliptic motion were given by
U . J. J . Leverr ier . 2
Leverr ier ' s basic series are the series in mult iples of the mean anomaly
for
x = r--h y=f-l,
from which he derives the series for xp> yq; p, q = 0, 1, ... 7, p + q < 7.
These are used in t u rn for obtaining series for xp sin hy, xp cos hy,
with coefficients expressed as polynomials in h.
Based upon this work by Leverr ier are the extremely useful tables of
the developments of functions in the theory of elliptic mot ion of A.
Cay ley. 3 T o the seventh power of e Cayley gives
x* cos;/ , x* sin;/ , ; , p = 0, 1, . . . 7,
80 ELLIPTIC EXPANSIONS
- sin / =
i J
sin 3/
j sin 5/
5 sin 11
In cos;/ , s i n ; /
j = 0, 1, 2, ... 5; q = - 5, - 4 , . . . - 1, + 1, . . . + 4.
Newcomb ' s deve lopments 4 are in effect based on the series for
p = lr -N=I(F-I),
expressed in positive and negative powers of A = Eil. T h e n
E?P = 1 + fa •
EM = 1 + yrj
2 U. J. J. LEVERRIER, Ann. Obs. Paris 1, 343-357 (1855).
3 A. CAYLEY, Mem. Roy. Astron. Soc. 29, 191-306 (1861).
4 S. NEWCOMB, Astron. Papers, 5, 1-48 (1895).
T h e result is, again to the seventh power of e>
CALCULATION 81
and, finally,
m = 0 , l , . . . o o , w - \q\ = 0 , 2 , 4 , ....
T h e coefficients i7^(j8, y) are polynomials in j8 and y of degree m in
both /? and y . T h e y are impor tant in Newcomb ' s me thod for developing
the dis turbing function. See Chapter X I I I .
7 . C a l c u l a t i o n of the B e s s e l f u n c t i o n s . In principle, formula (44)
can be used to obtain the Bessel functions of all orders required for a
given argument , once any two of t hem are known. But this process,
a l though it is algebraically exact, leads to the rapid accumulat ion of end
figure errors if it is applied several t imes in succession. In numerical
work it is preferable either to apply formula (43) directly, or to interpolate
the functions from published tables of the Bessel functions. Still bet ter
for systematic calculation, especially with automatic calculating machines,
is to obtain J0(x) in either of the two ways ment ioned, and then to cal-
culate Js(x) by successive mult ipl icat ion of J0(x) by factors tha t are
very easily calculated.
Denot ing the ratio of Js{x) to / s _ i (# ) by ps, we have from Eq. (44)
whence we obtain the cont inued fraction
T o make use of this cont inued fraction in calculation, derive ps from
1
where
or
82 ELLIPTIC EXPANSIONS
starting with so large a value of s that for the first application it is
permit ted to pu t ps+1 = 0, and working down to pv T h e n
7 iW = Pi /oM.
J2(x) = p2 J^x),
Js(x) = Ps h(*\ etc->
and formula (49) may be used as a check. T h e largest value of s required
for the first application may be determined by trial, using the largest
value of x that is to be employed in the problem; the same value of s
will then suffice for all smaller values of x.
As an example we give in Table 1 the figures necessary for the
calculation of J8(e), e being the eccentricity of Mars , 0.0932 6685. By
application of (43) it is found that J0(e) = 0.99782 65057.
T A B L E 1
Calculation of Js(e)
s rs = 2s/e JS(e)
1 1 2 3 5 . 8 8 2 3 0 9 7 0 . 0
1 0 2 1 4 . 4 3 8 4 6 3 4 0 . 0 0 4 2 3 9 4 —
9 1 9 2 . 9 9 4 6 1 7 1 0 . 0 0 4 6 6 3 4 —
8 1 7 1 . 5 5 0 7 7 0 7 0 . 0 0 5 1 8 1 6 1 6 9 —
7 1 5 0 . 1 0 6 9 2 4 4 0 . 0 0 5 8 2 9 3 5 4 2 —
6 1 2 8 . 6 6 3 0 7 8 0 0 . 0 0 6 6 6 2 1 7 6 6 0 . 0 0 0 0 0 0 0 0 0 0
5 1 0 7 . 2 1 9 2 3 1 7 0 . 0 0 7 7 7 2 6 4 0 0 0 . 0 0 0 0 0 0 0 0 1 8
4 8 5 . 7 7 5 3 8 5 3 6 0 . 0 0 9 3 2 7 3 6 1 2 0 . 0 0 0 0 0 0 1 9 7 0
3 6 4 . 3 3 1 5 3 9 0 2 0 . 0 1 1 6 5 9 6 2 4 1 0 . 0 0 0 0 1 6 8 9 2 9
2 4 2 . 8 8 7 6 9 2 6 8 0 . 0 1 5 5 4 7 2 9 2 8 0 . 0 0 1 0 8 6 5 5 0 2
1 2 1 . 4 4 3 8 4 6 3 4 0 . 0 2 3 3 2 5 1 6 8 1 0 . 0 4 6 5 8 2 7 3 7 0
0 — 0 . 0 4 6 6 8 4 2 0 4 9 0 . 9 9 7 8 2 6 5 0 5 7
A partial check is obtained by use of (49), writ ten in the form
[J0(X)Y . [1 + 2{PI + P\PI + PIPIPI + ...)] : = 1.
T h e particular example furnishes
1 - 1.00000 00000 0
p \ = 0.00217 94149 9 2pl = 0.00435 88299 8
p \ = 0.00054 40634 7 2p\p\ = 23714 8
CALCULATION 83
Pi = 0.00024 17183 1 2p\p\pl = 5 7
Pl = 0.00013 59468 3 2p\p\plp\ = 0
Sum = [Jo(e)]~2 = 1.00436 12020 3
Me) = 0.99782 65057,
in agreement to the ten th decimal place with the value obtained from
the series.
In most applications it is necessary also to compute ]s(2e), Js(3e), etc.
T h e numerical convergence of the series for J0 and of the calculation
of the values of ps remains satisfactory even for large values of the
argument that arise with high mult iples of e.
Following is an example of the use of Bessel functions in the t rans-
formation of a rguments from the eccentric anomaly to the mean anomaly.
In the development of the series for a'/A, where 2a' is the major axis
of the orbit of Jupi ter and A is the value of the distance between Jupi ter
and Mars (assuming elliptic motion for both planets), the following
terms are found, together with the appropria te Bessel functions, cal-
culated with the eccentricity of Mars 0.09326685, that are needed in
order to calculate the coefficient of cos (/' — /) by formulas (55), (57).
All the number s are expressed in uni ts of the eighth decimal (Table 2).
T A B L E 2
Terms in a'jA Bessel functions Products
+ 396 cos (/' + 2M), -2J_3(-e) = - 3379, o,
+ 41206 cos (/' + u), - /_,(- e) = - 108655, - 45 ,
+ 2879796 cos /', -OJ^i-e) = 0, 0,
+ 23572402 cos (/' - M), Jo(-e) = -f- 99782651, + 23521168,
- 108643 cos (/' - 2M), 2M-e) =
- 9316547, + 10122,
+ 1677 cos (/' - 3M), 3 / , ( - e ) = 4- 325965, + 5,
— 17 cos (/' — 4w), 4 / . ( - e ) = - 6757, 0.
In these expressions the pr imed symbols relate to Jupi ter and the others
to Mars . Mult ip lying the pairs of numbers on a line, and adding the
products at the right, we obtain + 0.23531250 for the coefficient of
cos (/' — /). T h e same Bessel functions will suffice for the calculation
of the coefficients of sin (JV — /) and cos (JLF — I), J being any integer
or zero. For other multiples of /, different Bessel functions will be
needed.
84 ELLIPTIC EXPANSIONS
8. S o l u t i o n of K e p l e r ' s e q u a t i o n . T h e numerical solution of
Kepler ' s equation
I = u — € sin u
for u, given e and /, received m u c h attention, as was natural for one of
the earliest t ranscendental equations in as t ronomy requir ing frequent
solution. H u n d r e d s of methods have been published. Equat ion (70) is
practicablefor numerical work when the eccentricity is small. Here we
shall describe two general methods that suffice for all the needs of the
practical computer . T h e first is the method of differential correction,
which is especially useful when no calculating machine is at hand .
F r o m an approximate value of u corresponding to the given values
of e and /, a more accurate value is found by successive approximations.
We denote the approximate value of u by u0 and the initial correction
to it by Au0f so that u0 + Au0 is a bet ter approximation to u than u0.
Calculate the value of / corresponding to u0 from Kepler ' s equation
and, calling it / 0 , denote the difference / — / 0 by Al0. W e have, rigorously
l0 = u0 — e sin u0.
T h e n by Taylor ' s theorem, omit t ing the square and higher powers of
Au0> we have the approximate equation
*l« = Hr0
Au°
= (1 — e cos u0) AuQi
or
Aun = 0
u 1 — e cos u0
Now put ux = u0 + AUQ\ calculate the value of / corresponding to uv
and calling it lx pu t / — / x = Alv T h e n , by the same process as before,
we have
A
J / l
Aux = 1 1 — e cos ux'
and the procedure may be repeated as often as is desired. I t will be
found that the successive values of AUJ diminish rapidly, so that only
a few repetit ions are necessary.
For the first approximation u0 it is convenient to use the following
abridged form of (70) or (71):
u = I + e sin / + \ e2 sin 21.
KEPLER S EQUATION 85
As a numerical example we show the figures for the calculation of
u when / == 30° and e = 0.3 precisely. For convenience in using the
t r igonometr ic tables we mult iply the equat ions by the n u m b e r of
degrees in a radian, thus using 17.18873 instead of e and 2.57831 instead
of \ e2. But in forming the divisor of the expression for Au it is necessary
to use e for itself. So we have
40?82725
41.35818
41.35756
0.6537806
0.6607642
0.6607560
K
29?58959
30.00048
30.00000
+ 0?41041
- 0 . 0 0 0 4 8
1
0.77300
0.775
+ 0?53093
- 0 . 0 0 0 6 2
W h e n a desk calculating machine is at hand it is much more expedi-
tious to use another me thod that does not require any figures to be
wri t ten down if a table of sines with a rgument in decimals of a degree
or in radians is at hand . If decimals of a degree are used, mult iply e
by the n u m b e r of degrees in a radian; all angles will t hen be expressed
in degrees and decimals.
Wr i t e the formula
« = / + e sin u.
Inser t / in the products dial and e in the keyboard. T h e n , by repeated
addit ions and subtract ions various mult iples of e are added to or s u b -
tracted from / unt i l the condit ion is fulfilled tha t the n u m b e r appear ing
in the mult iplier dial, which is an approximat ion to sin u, is equal to
the sine of the angle in the products dial, which is an approximat ion
to u. At each stage it is necessary to look u p the sine of the angle that
is in the products dial, and to increase or decrease the n u m b e r in the
mult ipl ier dial in the direction indicated. For the preceding example,
the number s s tanding in the machine at successive stages may be as
follows.
Multiplier dial
0.0
0.5
0.62
0.652
0.6588
Products dial
30.00000
38.59437
40.65701
41.20705
41.32394
Multiplier dial
0.6603
0.6607
0.66075
0.6607560
Products dial
41.34972
41.35659
41.35745
41.35756
In practice the result will be approached much more rapidly than the
figures above would indicate. T h e compute r soon learns to "overshoot
sin Uj - COS Uj
86 ELLIPTIC EXPANSIONS
the m a r k " in making the successive approximations, so that the actual
n u m b e r of approximations may be reduced to three or four.
Ei ther of the methods described is applicable to a high-speed auto-
matic calculating machine, the t r igonometr ic functions being calculated
in the machine instead of being read from pr in ted tables. As with most
applications of such machines, the n u m b e r of successive approximations
required depends m u c h on the ingenuity of the computer .
9. S o l u t i o n of the e q u a t i o n s of m o t i o n in t e r m s of the m e a n
a n o m a l y . T h e expansions of the coordinates in elliptic motion in
the preceding sections have been obtained as Fourier series in which the
arguments are multiples of the mean anomaly and the coefficients are
power series in the eccentricity. We shall now treat expansions of the
coordinates that can be obtained directly from the equations of motion.
T h e procedure will be to obtain the coordinates as power series in the
eccentricity, the coefficients of which are Fourier series in the mean
anomaly.
T h e equations of mot ion of a planet in an orbit , the plane of which
coincides with the plane of reference, may be writ ten in polar coordinates:
These equations are satisfied by a circular orbit with radius a. T h e
angular motion is then uniform and v = A = nt + A0. T h e two equa-
tions give
if Gc is the value of the integral of area in the circular orbit with radius a.
Next, consider noncircular orbits that have the same period, i.e.,
orbits with mean motion n and semi-major axis a. In t roduce
(78)
Gc = a2n,
(79)
r = a{\ + p\
v — A = (80)
G = G c ( l + y ) ,
and use as independent variable
A = nt + A0.
SOLUTION OF EQUATIONS OF MOTION 87
T h e equat ions, wri t ten as
may, if the middle t e rm in the left-hand member of the first equat ion
is wri t ten
be t ransformed into
(82)
For the circular orbit these equations are satisfied by
p = y = 0.
Expansion in powers of p and y gives
g + p = 3 p 2 _ 6pp + l 0 p i __ ...
+ Y(2-6P + l2P*-20p* + ...) (83)
+ y\\ - 3P + 6P* - ...)
and, if v — A = < f , the equat ion of the center,
= - 2p + 3P
2 - V + 5p 4 - ...
+ y(l - 2p + 3p* - V + ...).
(84)
Consider p and y as small quantit ies of the first order. Ignor ing the
second powers and products of these quantit ies in the r ight -hand
members , pu t
+ PI = 2yi>
= YI — 2PV
(81)
8 8 ELLIPTIC EXPANSIONS
from which we obtain successively:
px = Zy1 — e cos (A — o>j,
— 3y x + 2e cos (A — ou),
— 3yxA + cx + 2e sin (A — # ) . (86)
(85)
Th i s solution is entirely general; e and o> are the two necessary constants
of integration for the equat ion for pv I t is seen that yx = 0 is necessary
in order to avoid a t e rm proport ional to A in the difference vx — A =
for which a periodic function of A is desired. T h e constant cx is taken
to be zero, so that the mean value of $ may be zero. For brevity we
introduce / for A — d>.
T h e next step is to subst i tute p = px + p2> 7 = 72 m t r i e r ight -hand
members , and to retain te rms of the order of the square of p± and of
the first power of p2 and y 2 . T h e result is
: 3P? +72 — 1p2.
(87)
T h e value
p1 = — e cos /
is subst i tuted in the te rms 3pl appearing in the r ight -hand members ,
giving
- P2 = + 2y 2
e2 cos 2/,
- 72 — 2p2 + e2
- f e2 cos 2/.
Integrat ion of the first equat ion gives
Subst i tut ion of this value of p2 into the second equation gives
— 3y 2 - e2 cos 2/,
<?2 = - 3 (y 2
2 sin 2/,
a constant c 2 having been omit ted in the expression for S2.
SOLUTION OF EQUATIONS OF MOTION 89
In order to avoid a secular t e rm in i 2 it is necessary to choose
Hence
I t would have been permissible to add a t e rm a cos / + b sin / to the
solution for p2. However, this would have dis turbed the d 'Alember t
character of the solution. Moreover , it is seen that the e in t roduced as
a constant of integration in px corresponds to the eccentricity, so tha t
at perihelion (/ = 0),
p = — e, P l = — e, p2 = 0;
at aphelion (/ = 7r),
p = + e, P l = + e, p2 = 0.
Any addit ion to p2 of the form a cos / would have modified the meaning
of e> and the addit ion of a sine te rm, b sin /, would have modified the
meaning of to.
Proceeding now to the determinat ion of p 3 , <^3, the equations are foundto be
D2P
~J%T + Ps = 6PiP* - 6p\ - 6y2/>! + 2y 3,
DS
-ffi = - 2Ps + 6pip2 - 4p\ - 2V2pi + 73-
T h e calculation may be arranged as in Tables 3 and 4.
T A B L E 3
Calculation for p 3
(89)
e3 cos / e3 cos 3/
+
+ 2 y 3 = + 2 y 3
3
2
^ 2
- 3
4- * + 2
4- ^ + 2
dX2 + Pz
Pi
= + 2 y 3
= + 2 y 3 4- xe3 cos /
+ 3tf3 cos 3/
- | e z cos 3/
e2 cos 2/,
e 2 sin 21.
72 =
(88)
90 ELLIPTIC EXPANSIONS
£ 3 cos / e 3 cos 3 /
- 2/>3 + y 3 =
+ f>P\P% =
— 3 y 3
— 2x
3
"2
+ 3
+ 4
+ 2
+ 1
— 2y2pl = - 1 —
- 3 y 3 + <*- 2jc) e 3 cos / + ^ e 3 cos 3/
$ 3 = - 3 y 3 A + <*- 2x) e3 sin /
A constant cz is omit ted in the expression for cfz.
T h e expression for cf3 requires y 3 = 0. T h e n , in order to obtain
p s = 0 at perihelion and at aphelion, we take x = + F - T h e final
result is then
(90)
P 3 =
3 3 =
?3 cos / - £3 cos 3/,
ez sin 3/. j £3 sin /
T h e general character of the solution has now become apparent :
only even powers of e will occur in the series for y, while odd-order
contr ibutions to p may require additional terms with a rgument cos /
in order to satisfy the condition that all contr ibut ions to p beyond p 1
vanish at perihelion and aphelion. I t may be remarked that these
addit ions to p of te rms of odd order are necessary only to make e
correspond to the eccentricity. Suppose the addit ion had not been made
to p s . Designate the e so int roduced by ev T h e expression for p would
then have become
(A)
instead of
P = e2 + (—«i + 0 e 3 ) cos/ e\ cos 21 - e\ cos 3/
(B) P — e2 + (-e- ez) cos / - e 2 cos 21 - e3 cos 3/,
T A B L E 4
CALCULATION FOR @ Z
SOLUTION OF EQUATIONS OF MOTION 91
and the transformation from (A) to (B), correct to te rms of the th i rd
order in e, would have been made afterwards by the subst i tut ion
3 3
Finally, the expansion could have been made by subst i tut ing in (82)
P = Pi + P2 + Ps + P4 + •••>
y = 72 + r 4 +
in which use is made of the circumstance that only te rms of even order
appear in y. By collecting t e rms of corresponding order the equations
could have been writ ten, at the beginning of the integration, to any
order desired:
dX2
dX2 P2 = ?>p\ + 2y 2,
d ^ + Pa = fyiPi ~ 6Pi — fyzPv dX2
d2o
4 + Pt = GpiPa + 3pa - Wplp*. + 10p 4 + 2y 4 — 6y2/>2 + 12y2p? + y\, dX2
~dX ~ ~ 2 p i '
— = — 2p2 + 3pr + y2,
— 2p 3 + 6pYp2 — 4p\ — 2y2pv
= — 2p 4 + 3p2
2 + 6 ^ 3 — \2o\p2 + 5p* + y 4 — 2y 2 p 2 + 3y 2 p
2 .
T h e calculation of the te rms of the fourth order proceeds as in Tables 5
and 6.
dto±
dX
d£«
dX
92 E L L I P T I C E X P A N S I O N S
T A B L E 5
Calculation for /d 4
e4 e4 cos 21 e* cos 4/
+ 3p°
+ i o r f
- 6y2p2
+ \1YA
+ y\
+ 2y«
=
+ 2y4
9
8
-4- —
+ 8
9
4
+ 1 5
+ 4
+ 2
- 3
+ i
_ 3
2
+ 5
3
2
- 3
+ 8
+ 8
4- ^
+ 4
4- ^
+ 4
= + 2y4 + i *
— e4 cos 2/ 4- 5e4 cos 4/
P4 + 2y4 + *«• + | « 4 cos 2/ — -| g4 cos 4/
T A B L E 6
Calculation for
e4 e4 cos 21 e4 cos 4/
+ M
+ 6pjp3
- 12pJP2
+ 5PJ
- 2y2p2
+ 3 W J
=
- 4 y 4
+ y*
1
2
+ 8
9
8
3
2
3
4
2
3
3
2
+ 2
1
2
3
4
4- ^
+ 3
+ 1
+ 1
4 - * + 2
4- ^
+ 8
= - 3 y 4 8 E — e4 cos 2/ 4- J ^ c 4 c o s 4 /
4̂ - 3 ( y 4 + 1 9 ~ 6 ~ S I N 4 1
which would have been integrated independent ly of the equat ion for £ .
T h e disadvantage of this procedure is that some of the features that
arose in the course of the integration wi th regard to the evaluation of
the successive te rms in y are eliminated. These features are essential
in applications to problems in which the constant of integration corres-
ponding to y is not be available at the start.
I t is worth not ing that the expansion
y = y 2*
2 + y 4*
4 + . . .
illustrates the development of a constant of integration, y, in te rms of
the constant e which is in t roduced in the course of the solution. T h i s
procedure was required in order to satisfy the conditions that $ shall
be a periodic function of / wi thout linear te rm. If a linear t e rm in A
had been permi t ted in & the solution would have been that of a motion
in an orbit with a mean motion different from the one prescribed.
T h e equations of elliptic motion in the plane of the orbit form a
system of the fourth order. Hence four constants are required for the
general solution. T h e expansion obtained in this section contains four
constants, a> e, to, and A0. T h e constant a appears as the constant giving
the size of the orbit and ( through n) the period, bu t the expansions of
p and $ are independent of the value of a. Hence the developments
obtained represent, by permit t ing a to vary, the complete solution of the
differential equat ions.
SOLUTION OF EQUATIONS OF MOTION 9
T h i s requires
y 4 =
T h e final results are therefore
P4 =
4 cos 21 -
e* sin 21
?4 cos 4/,
- e4 sin 4/.
(91)
T h e r e is no constant t e rm in /o 4 , and no further constant te rms should
appear in the higher approximations since the constant t e rm of r/a is
1 + 2e2> a s obtained in Section 6.
T h e calculation would have been simplified somewhat by in t roducing
for G its value
G = a2n Vl - e\
yielding for the equat ion for p
94 ELLIPTIC EXPANSIONS
10* T h e r o t a t i n g c o o r d i n a t e s y s t e m . I t was found that the
expansion of the rectangular coordinates with the aid of the Bessel
functions is considerably simpler than that of the equation of the
center. Th i s suggests that perhaps the same condit ion may apply to
the direct expansion of the solution from the differential equations.
T h e use of rectangular coordinates also opens up the possibility of
int roducing imaginary exponentials instead of t r igonometr ic functions,
which may simplify the operations. In order to obtain coordinates that
are closely related to p and <f, which are zero in the case of circular
motion, it is suggested to introduce a rectangular coordinate system
that rotates uniformly with an angular velocity n, the mean motion in
a circular orbit from which the deviations are to be obtained.
T h e original equations in rectangular coordinates for an orbit whose
plane coincides with the xy plane are
In t roduce a coordinate system XY rotat ing with an angular velocity n
in the direction of the motion of the planet. If
x =
A = nt + A0,
then
X
Y
x cos A + y sin A,
— x sin A + y cos A,
(92)
y
X X cos A — Y sin A,
X sin A + Y cos A.
(93)
Successive differentiation gives
X — x cos A + y sin A — nx sin A + ny cos A = x cos A + y sin A + nYy
Y = — x sin A + y cos A — nx cos A — ny sin A = — x sin A + y cos A — nX,
X = x cos A + y sin A — nx sin A + ny cos A + nY,
Y = — x sin A + y cos A — nx cos A — ny sin A — nX.
By subst i tut ing
— nx sin A + ny cos A = nY + n2X,
— nx cos A — ny sin A = — nX + # 2 F ,
r2 = x2 + y2'
ROTATING COORDINATES 95
(94)
Now
But these expressions are identical with the r ight -hand members of
the equations (94). T h e latter may then be wri t ten
with
F = — , r2 = X2 + Y 2 .
T h e th i rd te rms in the left-hand members may be absorbed in the
partial differential quotients in the r ight -hand members by put t ing
F' =F + \n\X2+Y%
X-2nY = ^ , (96)
V + 2nX=3£.
X - 2nY - n2X
Y + 2nX - n2Y =
(95)
X - 2nY - n2X = cos A -f - sin A,
Y + 2nX - n2Y - sin A cos A.
- cos A -f • sin A,
we obtain
X = x cos A + y1 sin A + 2nY + n2X,
Y = — x sin A + y cos A — 2wX + n2Y.
T h u s the equations may be wri t ten
96 ELLIPTIC EXPANSIONS
(98)
Hence, if
(99)
the equations are
(100)
a£ — lanrj
(97)
aij + 2an£ -
If, further, the mean longitude A is used as independent variable the
equations become
Suppose we wish to deal with elliptic orbits with mean motion n.
T h e equations may be satisfied by the circular orbit with radius a>
related to n by
^ = a?n2.
If the constant A0 is chosen properly, theX axis is directed constantly
toward the planet , and in this circular orbit we have
X = ay 7 = 0.
I n order to s tudy the deviations from circular mot ion in elliptic orbits,
we may introduce
X = a(l+ I ) , Y = ar,9
and use £ and rj as our new variables. T h e subst i tut ion into Eqs . (96) gives
ROTATING COORDINATES 97
X = r cos (v — A) = r cos $ = a(l + p) cos
y = r sin (z; — A) = r sin < f = <z(l + p) sin
Hence
_ l = (l + p ) C o s « f - 1
or, conversely,
(1 + p ) 2 = (l + t ? + v)\
P = Vl+2£ + e + V2- i
tan 6
(101)
(102)
(103)
= v — £v + £2v — i v3 — £*y + dy* + ••• •
W e note that at perihelion and at aphelion $ = 0, so that for those
points of the orbit f = p, rj = 0, and hence at perihelion (/ = 0),
€ = — e, V = 0,
at aphelion (/ = 7r),
f = + e% rj = 0.
= (1 + p ) s i n « f
= ^ + p<̂
I t is of interest to express the variables £ and rj in t e rms of the quant i -
ties p and $ in t roduced in Section 9. Since
x — r cos v, y = r sin v,
it follows from (92) that
98 ELLIPTIC EXPANSIONS
For the development of the r ight -hand members of Eqs . (100), say
to the fourth power in £ and 77, advantage may be taken of the c i rcum-
stance that these r ight -hand members are the partial differential quotients
of a single function Q. Hence it will be sufficient to develop Q in powers
of £ and r] to the fifth power and take the derivatives with respect to
£ and 77 afterwards. W e may write
We now subst i tute
f = f 1 + f 2 + & + f *
V = Vi + V2 + Vs + t?4>
and collect te rms of corresponding order in the equations. T h e results are
= 1 - f + e - e + f4 - f6
In order to obtain jQ, the te rms
should be added. Hence,
ROTATING COORDINATES 99
I t is seen that the equations for each order have the same left-hand
members , while the r ight -hand members are again functions of
and rj's tha t have been obtained in the preceding approximat ions .
T h e equations for gl9 rj1 may be solved as follows. A first integral of
the second equation is
If this is mult ipl ied by 2 and added to the first equation there results
the equat ion
% + t1 = 2Cl,
with the integral
& = 2C1 + A cos (A - B\
A and B being constants of integration. Subst i tut ion into the equation
for yields
% = - 3CX - 2A cos (A - B)
aA
V l = - 3QA + q - 2A sin (A - B).
We wish a solution in which rj1 is purely periodic with mean value zero.
Th i s requires
q = 0, q = o.
; ^ 2 + 1 0 | I \ - Y ^ i 3 .
f 3 ^ 3 + 3 ^ 2 + 3 £ 3 7 H - 6 ^ 2 — 1 2 ^ ^ !
- 1 2 ^ 2 - 5 ^ + 1 5 ^ -
— 3f 4 = - 6 ^ 3 — 3 | 2
2 + 3T ? 1 T ? 3 - rj2 + 1 2 ^ 2 — 6^2rjl
+ + 3 ^ i - 6&? :
- 3 ^ 2 H - 3£ 2 = -
100 ELLIPTIC EXPANSIONS
Fur ther , the perihelion and aphelion conditions for fx require
A = — e, B = GJ.
In t roducing the mean anomaly / = A — <2, we finally obtain
£i = — e cos /, rj1 = -\- 2e sin /.
In all further approximations the aim will be to have, at perihelion
and aphelion, £ 2 = 0, £ 3 = 0, etc.
For any successive approximation, say of £j9 rjj9 the equations will be
where Xjf Yj are the r ight -hand members which will be found to be
cosine series and sine series, respectively, in /. Th i s may be seen at
once for X2, Y2 by subst i tut ion of the results for gl9 rjv Hence Yj will
have no constant t e rm and the integral of the second equation gives
(104) -2f, = C , + J Y,d\,
Cj being a constant of integration. By adding twice this integral to the
equation for ^ we obtain
r - 6 = 2C, + * , + 2 j Y,d\. (105)
T h e constant t e rm of ^ obtained from this equation is
f,o = 2Cj + Xi0, (106)
in which Xj0 denotes the constant te rm of Xj. Subst i tut ion into (104)
gives a constant te rm in drjj/dX,
— 3C3- — 2Xj0.
Since this must be zero, Cj is determined by
Cj = — § Xj0.
If this is subst i tuted into the r ight -hand member of Eq. (106) it is
seen thaj the constant te rm in this r ight -hand member , and therefore
the constant te rm in becomes
= (107)
ROTATING COORDINATES 101
For the th i rd approximat ion we obtain
e3 cos / e* cos 3 /
3 3
3 3
4 + 3171̂ 2 = + 4
+ 4 g = - 3 - 1
- 6TLVL = + 6 - 6
e* sin I e3 sin 3/
+ 3 ^ 2 =
9 3
+ = — 2 + 2
- fiffo = - 3 - 3
+ = + 9 - 3
9 39
F 3 = + q e 3 sin / — £ 3 sin 3/.
o o
Y2
Y2dX =
e 2 sin 2/,
e 2 cos 2/,
e 2 cos 2/
X2 + 2 J F 2 ^ A = £2 cos 2/,
e 2 cos 2/,
e2 cos 2/,
• e2 sin 2/.
X3 = e3 cos / - ez cos 3/,
Th i s will hold for every approximation, and it becomes therefore u n -
necessary to obtain C, explicitly. T h e procedure is s imply to determine
the constant in ^ by (107) and to ignore the constant that arises in
DRJT/DX by (104).
For the second approximat ion we obtain
For the determinat ion of | 4 , ry4 we have Tab les 7 and 8.
T A B L E 7
C a l c u l a t i o n o f AT4
e4 e* cos 21 e* cos 4 /
= 9
8 4-- a
^ 8
9
8 + 2
3
8
+ 317x773 = 9
8
+ 2 7
8
_L 3 2
+ A
3
64
+ 1 2 # a
3
2 + 2
+ 9 — 12 + 3
— 12^172 4- -3-+ 2
3
2
15
8
5
2
5
8
+ 1 5 ^ + 2
15
2
_ 1 5 4
8 î
= 45
4 4~ 15 15
4
4- — e4
^ 64 * + 4e 4 cos 21 — W * 4 c o s 4 /
102 ELLIPTIC EXPANSIONS
£3 cos / - - e3 cos 3/,
Xz + 2jY3dX = — 3 ez cos 3/,
^3 = e 3 cos / H e 3 cos 31.
T h e te rm — § e3 cos / has been added in order to make f3 zero at
perihelion and aphelion.
C3 - 2f 8 H y 3 = e 3 cos / + e 3 cos 3/,
*?3 = £ 3 sin / H - £3 sin 3/.
ROTATING COORDINATES 103
ex sin 21 e4, sin 41
+ i
7
1 6
3
8 ^ 1 6
+ = 9
4 + 8
- 6 ^ 2 = 3
4
3
8
- n^Uii = - 6 + 3
I 9 2 _
+ 2 — + 4
9
8
+ mim = - 5 5
2
1 5 > 3 _ + 15 1 5
~ 2
Y, = + 3e4 sin 2/ - ^g1 e4 sin 41,
- f e4 cos 2/ + f|- e4 cos 41,
- f e*> cos 2/ - -W* 4 c o s 4/>
— J e4 cos 2/ + _ 6 _ 7 _ , 4 c o s 4 / >
— f e4 cos 21 + ff e4 cos 4/,
- <?4 sin 21 + M *4 s i n
I t is a useful exercise to verify that the results for £ and rj obtained
by this method agree with the series for (r/a) cos / and (r/a) sin / given
in Section 6 of this chapter .
T h e calculation of the fourth-order t e rms by subst i tut ion of the
previous solutions in their t r igonometr ic form in the r ight -hand members
X^y Y 4 is, at least for some of the terms, rather tedious. T h e use of the
imaginary exponentials is recommended . For example, if A = exp il,
£ 3 = + ^-*{A* - A - A-* + A-*).
T h e n , by multiplication,
- 6 ^ 3 = e* (A* - 2 + A-*)
T A B L E 8
CALCULATION OF F 4
11. C o m p l e x r e c t a n g u l a r c o o r d i n a t e s . T h e greater ease in
performing operations with imaginary exponentials than with t r igono-
metr ic functions suggests the use of imaginary variables in the differen-
tial equations. T h e variables that appear to have particular advantages
are
u = £ + irj = - 1,
s = £ - i-q = r-E-i(f-l) - 1,
so that
(108)
dhi du dQ
%
+ i 8 Q
dt]
dh 8Q . dQ
d\* dr/
T h e derivatives of Q with respect to u and s may easily be expressed:
dQ dQ d£ , dQ drj
du d£ du drj du
1 dQ , \ dQ
2 d£ 1 2i dV
1 / dQ . dQ\
2\d£ drj )
Equat ions (100) may then be wri t ten
By adding and subtract ing these equations it follows that
104 E L L I P T I C E X P A N S I O N S
Also,
(1A* -9A + 9A~l - 7A-*),
(1A* - 16 A2 + 18 - 16A-2 + 7A-*)
- 2 cos 21 — cos 4/j .
C O M P L E X C O O R D I N A T E S 105
8s 2 \ 0 ? dr) J
T h i s gives for the equat ions (108)
Since
it follows that
£2 + = us,
T h u s
= I (1 + «) (1 + ,) + (1 + u)^'2 (1 + s)~v\
Following G. W . Hill we introduce the notat ion
(109)
(1 + + I2 = 1 + % + ¥ + V2
= 1 + u+s + us (110
= (1 + « ) ( ! + « ) .
( I l l )
(112)
T h e principal advantage of this form is that the imaginary uni t does not
appear, and will not have to appear in the solution if the developments
are made with imaginary exponentials. An impor tan t feature of the
variables u and s is that they are conjugate complex. Hence it is necessary
only to obtain the solution for one of the two;the other can be found by
taking the conjugate.
and, similarly,
3Q _ 1 / _ a Q . 3Q\
so that
T h e equat ions become
D2u + 2Du
D2s - 2Ds =
106 ELLIPTIC EXPANSIONS
T h e n
Th i s expression is subst i tuted into the first of Eqs . (112), and the equa-
tion is split u p into separate equations for the te rms of the first order,
second order, etc., by int roducing
u = ux + u2 + us + ...
s = *i + s2 + *3 + ••••
T h e resulting equations are
D2ux + 2Dux -
D2u2 + 2Du2 (u2 + s2) =
T h e corresponding equations,
D\ - 2Dsx -
D\ — 2Ds2 -f
(«2 +
S2) = etc.,
may be writ ten down by merely interchanging u and s everywhere in
the r ight -hand members .
T h e solution desired for W ,̂ S-^y IS of the form
u, = aA + bA~\
(113)
s1 = bA + aA-1,
in which A = Eil, I — A — co and a and b are real factors to be deter-
mined.
T h e definition of the differential operator D shows that
DE**X = pEivX, D2EivX = p2EipX.
Chapter X I V contains an illustration of the use of complex rectangular
coordinates in the problem of the moon ' s motion. A detailed application
to the problem of elliptic motion is therefore omit ted, and only the
first steps are indicated.
Expansion of Q, given by (111), in powers of u and s gives
COMPLEX COORDINATES 107
D 2tt x
2Du,
= aA+ bA-1
= 2aA - 2bA-1
2 { U l = \{a + b)A + + b)A~*
0 - & . + \>)A +(! \a+\*)A->
D\
IDs,
= bA+ aA-1
= - 2bA + 2aA - 1
= j(a + b)A +
2 >
+ b)A~i
In order to satisfy the equations, either a or b may be chosen arbitrarily,
but the two constants must satisfy the relation
3a + b = 0.
We adopt
a = + - e, b = — -e,
so that the solution becomes
ux = + ±eA - | e^l"1,
I t is seen that this solution corresponds to
(A + A-1) = — e cos 7,
(114)
{A - A-1) = +2e sin /,
and is therefore in agreement with the requi rement that e shall be the
Hence
DA* = pA*>, D2Ap = p2Av.
Subst i tut ion of the solution into the equations for ux and s1 gives
108 ELLIPTIC EXPANSIONS
eccentricity of the orbit . In order to assure this for the higher approxi-
mations, conditions mus t be in t roduced at perihelion (/ = 0),
A = Ei0 = + 1, u = £ = — e, s = g = — e,
and at aphelion (/ = 7r),
A = Ein = — 1, u = g = + e, s = £ = + e.
Since the first approximat ion gives at perihelion,
ux = — e, s1 = — ey
and at aphelion,
ux = + ey sx = + e,
it is necessary that , at perihelion and at aphelion, respectively,
« 2 = W 3 = W 4 = - = °>
^2 = ^3 = ^4 = ••• = ^ '
These conditions are sufficient for the evaluation of the coefficients of
A0 in the even approximations and A1, A-1 in the odd approximations.
12. E x p a n s i o n s b y h a r m o n i c a n a l y s i s . In any practical applica-
tion of celestial mechanics the aim is finally to obtain results in numerical
form. I t is always possible, at least in principle, to solve a specific problem
requir ing extensive calculations by retaining some or all of the para-
meters involved (say the elements of an elliptic orbit) in literal form
until the final s tep. Since the invention of calculating machines and their
gradual increase in power and efficiency, it has become more efficient
to introduce numerical developments at earlier stages of a problem, and
sometimes at the very beginning, as with the numerical integration of
orbits. I t is impor tant for the worker in the field to be familiar with
both literal and numerical methods , so that he can use the one best
adapted to the case in hand; it often happens that a mixture of the two
is useful. A common example is expansion in Fourier series where the
arguments are retained as literal quantit ies while the coefficients are
expressed as number s .
T h e numerical coefficients of the series given in preceding sections
of this chapter may be calculated easily if the eccentricity is small enough,
bu t in cases where it is much greater than 0.1 or 0.2 the numerical
coefficients may be found more readily by harmonic analysis. Also,
H A R M O N I C A N A L Y S I S 109
the series developments of such functions as rm cos"/ , m and n being
any integers, may be m u c h facilitated by harmonic analysis.
Le t 8 be any independent angular variable, and F a periodic function
of 8 which it is desired to express in the form
F = \ c0 + q cos 8 + c2 cos 28 + ... + \ cn cos nd
+ Si sin 8 + s2 sin 26 + ... + sn sin nd>
the coefficients being number s . If 2n + 1 special values of 8 are chosen,
and if for each special value the value of F is calculated as well as the
values of the t r igonometr ic functions, then there result 2n + 1 linear
equat ions from which, if they are independent , the coefficients may be
calculated. T o render the equat ions independent it is sufficient to divide
the period of 8 into an even n u m b e r of equal par ts . If the initial value of
8 is taken as zero it is found that 2n parts suffice to determine all the
coefficients wi th the exception of sn.
If F(8) = F(— 6), then the series consists exclusively of a constant
and cosine te rms, in which case only the first n + 1 special values of
F, F(0) and F(7T) included, are distinct from the remaining ones. If
F(6) = — F(— 0 ) , then the series consists exclusively of sine te rms,
F(0) = F(TT) = 0, and only the first n — 1 special values of F, F(0)
and ^ ( 7 7 ) excepted, are distinct from the remaining ones.
In the general cases, where both cosines and sines are present , denote
the angle 2ir\2n by a, and take the special values of 8 equal to 0, a, 2a,
... noc,(2n — l)oc in tu rn . T h e n the solution of the equat ions is given by
1 x^ 1
ck = - 2j F(J a ) cos kjot, k = 0, 1, 2 , « ,
1 2 ^ : 1
sk = -Z/ FU0C) s i n * = 1. 2 , n — 1.
71 3=1
I t will be noticed that since sin noc = 0, there is no formula for sn.
Also it will be noticed that since cos noc = — 1, cos 2noc = + 1, etc.,
the formula for cn reduces to the sum of the special values taken with
alternating signs, divided by n.
T h e foregoing formulas, al though they are very systematic and
therefore adaptable to automatic calculating machines, are not well
suited to hand calculation because they lead to needless repeti t ion of
the same operat ions. For hand calculation it is easy to derive a m u c h
less laborious scheme for any part icular value of 2n. As an example
we give here a scheme for 2n = 8. Deno te the special values of
110 ELLIPTIC EXPANSIONS
F by F0y Fv F29 F7. T h e n the coefficients are given by the following
equations:
0.4 + ^ 4 ,
0
4 = *"o -FA,
1.5 + F»
1
5 = *"i - F »
2.6 + FE,
2
6 = F* ~F»
3.7 = ^ 3 + F„
3
7
= FS
Thi s check cannot be relied upon to the last decimal since the end-
figure errors of the coefficients are increased by the multiplications.
Both checks should always be applied.
T h e series given by a harmonic analysis may be regarded as an inter-
polation formula suitable for obtaining values of the function inter-
mediate between those specially calculated. As with any interpolation
formula, the accuracy of an interpolated value increases as the n u m b e r
of specially calculated values of the function that are taken into account
0.2 = 0.4 + 2.6,
1.3 = 1.5 + 3.7,
4 * 0 = 0 . 2 + 1.3, 2 ( * 1 + * 8 ) = j ,
4* 4 = 0.2 - 1.3, 2fo - r 3 ) = | cos 45°,
4c2 = 0.4 — 2.6, 2(s1 + s3) = sin 45°,
4s 2 = 1.5 — 3.7, 2(sx — s9)
A valuable check on the results may be obtained for the cosine
coefficients by the condit ion
-^0 = \CQ ~t~ Cl + C2 H~ ••• ~f~ \Cn'
A complete check would consist in verifying that every special value of
F is correctly given by the series.
A valuable check on the sine coefficients may be obtained by finding
the value of dFjdd for 0 = 0. T h e n
s1 + 2s2 + 3*3 + . . . + (n — 1) sn-.v
HARMONIC ANALYSIS 111
increases. T h e coefficients obtained by harmonic analysis are only
approximations, which can be made as accurate as we please by taking
n sufficiently great. I t can be shown that thecoefficient we have denoted
by ciy i < n, would, if the series were indefinitely extended, be equal to
and the value of cn is
cn ~T~ cZn ~f~ cbn "f" ••••
Hence, the lower the order of a cosine coefficient the higher the
accuracy with which it is found, with the exception of cny which is more
accurate than any of the preceding ones.
Similarly, if the series were indefinitely extended, st would represent
and again it is seen that the coefficients of lower order are de termined
with higher accuracy.
As a good working rule the n u m b e r of special values should be taken
large enough so that cn¥1 and sn may be regarded as having zero values.
I t is sometimes difficult to decide in advance how many special values
of the function will yield results of the required accuracy without
unnecessary labor. If only a few analyses are to be made, it is possible
to begin with eight special values, and to start the analysis by calculating
cn and sn_x. If either of these has appreciable values, the n u m b e r of
special values may then be doubled, t hus saving the ones already
calculated, and the test repeated. If many analyses having similar
convergence are to be performed, it will be preferable to exper iment
with an intermediate n u m b e r of special values.
Special schemes of analysis may be devised that make it possible to
increase the n u m b e r of special values only slightly, and still to save
most of the work already done. Examples of such schemes are given
by Brown and Brouwer . 4
As an example of harmonic analysis we calculate the Fourier expansion
of the function
F = + [1 - 0.6 cos (0 + 30°)] 1 / 2
with eight special values, to five decimals.
F0 = 0.69310 F± = 0.91908 F2 = 1.14018 F3 = 1.25680
F 4 = 1.23273 F5 = 1.07484 Fe = 0.83666 F 7 = 0.64842
4 E . W . B R O W N AND D . BROUWER, Tables for the development of the disturbing function
with schedules for harmonic analysis. Trans. Yale Univ. Obs. 6, Pt. 5 . , 1 4 3 ( 1 9 3 2 ) .
112 ELLIPTIC EXPANSIONS
0.4 = + 1.92583 °r = - 0.53963
1.5 = + 1.99392 0.2 = + 3.90267 ^ = - 0.15576
2.6 = + 1.97684 1.3 = + 3.89914 ? = + 0.30352
3.7 = + 1.90522 I = + 0.60838
4c0 = + 7.80181 2 (« i + c3) = - 0.53963
4c 4 = + 0.00353 2(^1 - c3) = - 0.54033
4c 2 = - 0.05101 2(h + h) = + 0.32005
4s 2 = + 0.08870 2 («i - * 3 ) = + 0.30352
1 , _
2 c 0 —
+ 0.97523
- 0.26999 = + 0.15589
C 2 = - 0.01275 ^2 = + 0.02218
c 3 = + 0.00018 *3 = + 0.00413
1, _
2 C 4 — + 0.00044
+ 0.69311 + 0.21264
Check: + 0.69310 Check: + 0.21642
I t is evident from the convergence of the coefficients that the cosine
coefficients can be relied upon to not more than four decimals and the
sine coefficients to not more than three . In other words, an interpolated
value of the function calculated from the series would be accurate to
not more than three decimals. T o obtain more accurate results it would
be necessary to repeat the analysis with a greater n u m b e r of special
values.
Whenever a function of two independent angular variables a and j3
can be expressed in the form
F(a, /?) = 2 (Ajtk cos jot cos kfi + Bj fc sin joc sin k/3 + C3 k cosja sin fyS
+ Dj k sin joc cos
where J, K = 0, 1, 2, the numerical calculation of the coefficients
can be reduced to a double application of harmonic analysis for a single
variable.
H A R M O N I C A N A L Y S I S 113
Suppose the period of a to be divided into 2m parts and the period
of fi into 2n par ts . T h e n Amn special values of F are required. Beginning
with the 2m special values corresponding to fi = 0 calculate the m + 1
cosine coefficients corresponding to j = 0, 1, 2, m> and the m — 1
sine coefficients corresponding to j = 1, 2, m — 1. T h e n repeat
the process for the 2m special values of F corresponding to another
special value of /J, then do the same for the remaining special values
of F. In this way 2n special values of the coefficients of cos joc, sin joc
will be obtained. Choosing any set of these corresponding to a single
value of j , say the 2n special values of cos 2a, analyze these with respect
to fi; the results are the coefficients A2 fc, C2 k. T h e other coefficients
are obtained in similar fashion.
T h e analysis being completed, it will usually be desirable to express
the result as a conventional double Fourier series, by replacing products
of sines and cosines by sums and differences. T h e following formulas
suffice for the purpose .
A cos joc cos kfi = + \ A cos (joc + kfi) + \ A cos (joc — A/?),
B sin joc sin kfi = — \ B cos (joc + kfi) + \ B cos (joc — kfi),
C cos joc sin kfi = + ^ C sin (joc + kfi) — \ C sin (joc — kfi),
D sin joc cos kfi — + J D sin (/a + kfi) + ^ Z) sin (;a — fyS).
W h e n either j or & is zero or both are zero, the formulas are not used,
the te rms already being in the desired form. I t is wor th not ing that
B0 k and Bj 0 do not appear. Also, if the analysis is made first with
respect to a, then D0 k does not appear, and in any case C 0 0 and D0 0
are absent. I t is also worth not ing that when j = k, both differing from
zero, the formulas yield formal values for coefficients of sin 0; these
are fictitious and should never be wri t ten down because they may lead
to errors in subsequent operations.
I t is always possible and desirable to arrange the series so that either
j or k does not take on negative values or, alternatively, so that one of
t h e m does not take on positive values, using the relations
+ \A cos (joc + kfi) = + \A cos ( — joc — kfi),
+ \ C sin (j<x + kfi) = — ijr C sin (-joc - kfi).
W h e n this has been done, then supposing j not to be negative, the
series should be arranged in blocks, so that within each block j is constant
and k ranges from its largest positive to its largest negative value.
114 ELLIPTIC EXPANSIONS
N O T E S A N D R E F E R E N C E S
E . W . B r o w n a n d C . A . S h o o k , Planetary Theory, C h a p t e r s II a n d I I I , C a m b r i d g e
U n i v . P r e s s , L o n d o n a n d N e w York , 1933 , c o n t a i n s m u c h u s e f u l in format ion
o n t h e d e v e l o p m e n t of f u n c t i o n s in e l l ipt ic m o t i o n . V a r i o u s s c h e d u l e s for h a r m o n i c
analys is are c o n t a i n e d in A p p e n d i x A of Planetary Theory, as w e l l as in E . W . B r o w n
a n d D i r k B r o u w e r , Tables for the Development of the Disturbing Function, C a m -
b r i d g e , U n i v . Pres s , L o n d o n a n d N e w York , 1933 , a n d Trans. Yale Univ. Obser-
vatory 6, P t . 5 (1932) ;
CHAPTER I I I
GRAVITATIONAL ATTRACTION BETWEEN BODIES
OF FINITE DIMENSIONS
1. I n t r o d u c t i o n . T h e law of gravitational attraction is valid for
two material particles, not for bodies of finite dimensions and with
arbitrary distr ibution of mass. I t will be shown, however, that spherical
bodies with a dis tr ibut ion of mass such that layers of equal density are
concentric spheres, attract each other as if the mass were concentrated
in their centers. Moreover , it will be shown that if the distance between
two bodies is large compared with their dimensions, the attraction
between them is sensibly as if their mass were concentrated in their
centers of mass. These results make it possible in many cases to disregard
the dimensions and the distr ibution of mass, and to treat the gravitational
attraction between two bodies as if they were material particles.
Nevertheless, there are cases in the solar system and in systems of binary
stars in which deviations from sphericity have impor tant effects. Hence
it is necessary to examine the case of gravitational at traction between
two finite bodies each having an arbi trary distr ibution of mass. Th i s
problem presents considerable complications. It is much easier to treat
the attraction between a body of finite dimensions and a material particle.
Th i s simplified problem applies to many cases that present themselves
in astronomy,and will be treated first.
2 . A t t r a c t i o n of a par t i c l e b y a b o d y of f inite d i m e n s i o n s
a n d a r b i t r a r y d i s t r i b u t i o n of m a s s . In a Cartesian coordinate
system, fixed in the sense of Newtonian mechanics , let X0y F 0 , Z 0 be
the coordinates of the center of mass E of a body M , and let X, Y, Z
be the coordinates of a particle at P with mass m. Consider at Q an
element of mass dM of the body M, and let rj, £ be the coordinates
of Q in a coordinate system with axes parallel to those of the fixed system
bu t with its origin at E. Pu t
x = X — X0, y = Y — y 0 , z = Z — Z 0 ,
so that x, y, z are the coordinates of m relative to the center of mass
of M. T h e distance A between P and Q is obtained by
A' = (* - f ) 2 +{y- jf + (z- if. (1)
115
These integrals may be pu t in a more precise form by introducing the
density, K> which mus t be considered a finite function of the coordinates
£, r), £. T h e n
dM = K(g9ri9Z)d£dvdZ,
and the components of the force are expressed as triple integrals, the
integration to be extended over the whole mass M. T h e components
then become
116 BODIES OF FINITE DIMENSIONS
Hence the three components of the force due to the attraction by dM are
Integrat ion over the entire body M yields the components of the force
Fx=fm
(M)
d£ drj d£
(2) b\ =fm
(M)
Fz =fm
J (M)
d£ drj dl
In this form three triple integrations are required. I t is possible to
obtain expressions in which only one triple integral occurs. Th i s form
is obtained by considering the partial derivatives of J 2 with respect to
x9 y, z. I t follows from (1) that
AdA = (x — £) dx + (y — rj) dy + (z — £) dzy
or
T h e force acting upon the particle m, due to the attraction by the
element of mass dMy is fm dMjA2 in the direction PQ. T h e components
of the force along the three coordinate axes are obtained by mult iplying
the force by the direction cosines of the direction PQ. Since the
coordinates of Q relative to P are £ — x> rj — y, £ — zy it appears, with
the aid of (1), that the direction cosines are
ATTRACTION BY FINITE BODY 117
In these integrals x, y> z are to be regarded as parameters that mus t
be t reated as constants in the integrations. Also, the limits of integration
are independent of these parameters . I t is permissible, in this case, to
interchange the order of differentiation with respect to a parameter and
integration, provided that everywhere within the range of integration
A 7^ 0. Th i s condit ion merely states that the particle m should not be
a part of the mass M\ it should be either exterior to M or within a cavity
in it. T h e case that the particle is a par t of M would require special
investigation.
Now define the potential U by
A (6)
Similarly, the forces acting upon the different e lements of M may be
transferred to the center of mass of M, and the resultant force will be
equal and opposite to the force acting upon m. Hence the components
may be writ ten
T h e left-hand member may be writ ten as d(A 1). Hence
(3)
Subst i tut ion of (3) into (2) gives for the components of the force
(4)
Fx=fm
(M)
(M)
Fv=fmj
F% =fin
) KM) ^
d£ drj 0%.
I t follows from (4) that the components of the force acting upon
m may be wri t ten
(7)
(8) FT F m -
1 V ~
- m Ff
u =
' (M)
d£ drj 0%, (5)
which, for brevity, will usually be wri t ten
dM
118 BODIES OF FINITE DIMENSIONS
where r is defined by
U =
r2 = x2 + y2
T h e methods used in the theory of the potential are of great importance
in the theory of equi l ibr ium of rotating fluids. I ts application to the
figure of celestial bodies is properly a province of dynamical astronomy.
Th i s subject, however, lies outside the scope of th is book.
3* L c g c n d r e p o l y n o m i a l s . For the subjects treated in this
volume it is sufficient to obtain a method of expressing U as a function
of x, y, z that is generally applicable when the distance between m and
the center of mass of M is large compared with the dimensions of the
body M. On this assumption a development in series is available that
will be presented in the following sections.
For later applications it is desirable to re turn to the original fixed
Cartesian coordinate system. T h e n U mus t be considered a function
of the differences X — X0i Y — Y 0 , Z — Z 0 , and the components of the
force on m become
(9)
T h e components of the force that determines the motion of the center
of mass of M are
(10)
T h e problem of finding the mutua l attraction between M and m has
been reduced to that of obtaining U9 defined by (5) as a function of
xy yy z. General propert ies of U are s tudied in the theory of the
Newtonian potential . T h e most fundamental of these is that for all points
not belonging to the mass M , U satisfies the partial differential equation
of the second order:
d2u , d2u , d2u
8x2 1 By2 1 dz2
This equation was first given by Laplace. I t is obtained immediately
by partial differentiation of (3). Wi th the aid of this equation it may
be shown that the potential for a body M with spherical symmetry
becomes
LEGENDRE POLYNOMIALS 119
A2 = r2-2(x£+yrj+zQ+p2
For brevity, pu t
(12)
: q — cos S, (13)
so that
A2 = r2 (1 - Iqoc + a 2). (14)
I t is evident from (13) that S is the angle POQ and that , therefore,
I ? I < i-
T h e integrand of
(15)
will now be developed as a power series in oc with the aid of the poly-
nomials of Legendre , Pn{q), defined by
= P 0 + Pi* + P2oc2 + ... + Pnoc« + ... (16)
in which Pn are polynomials in q of degree n.
T h e general expression for Pn(q) is obtained by the binomial develop-
ment of the left-hand member of (16) as
[1 - {Iqoc - oc2)}-1'2 (17)
where the usukl notation is employed for the binomial coefficients
Let p be the distance of an element of mass dM from the center of
mass of the body M. T h e n
e + v2 + z2
= p2- (ii)
I t is then unders tood that for any point of the body My p/r< 1. T h e
expression (1) for A2 may now be writ ten
120 BODIES OF FINITE DIMENSIONS
Since the r ight-hand members of (16) and (17) should be identical, the
general expression for Pn(q) is obtained by finding the coefficient of a n
in the r ight-hand member of (17). Th i s is a double series in a and q,
and in the r igh t -hand member of (16) this double series is arranged as a
single series in a with coefficients that are functions, polynomials in this
case, of q.
Collection of the coefficients of an in the r ight-hand member of (17)
gives for the general expression of Pn(q)
Po(q) = l,
=
PM = \ W -i) .
P*{q) 3?),
PM) - 30<?2 + 3),
P*{q) - 7 0 9
3 + 15?),
P*{q) =4^(
' - 315? 4 + 105?2 - 5).
These polynomials will be of frequent usefulness.
Another form of development is obtained by pu t t ing
then, in view of (13),
2q = or + a~1.
Also,
2 cos nS = o-n + <j" n,
1 - 2qoc + oc2 = (1 - oca) (1 - aa" 1 ) ,
(1 - Iqoi + a 2 ) - 1 / 2 = (1 - oca)-1'2 (1 - a a " 1 ) " 1 7 2
(18)
F rom this expression it appears that Pn is a polynomial of degree n
with only odd powers of q if n is odd and with only even powers if n
is even. Subst i tut ion of part icular values of n gives the following
polynomials of Legendre of order 0 th rough 6:
(19)
POTENTIAL—PRINCIPAL PARTS 121
2 cos (n - 4) S + ....
T h e two binomial series of which the produc t is formed in the r ight-
hand member of (19) are absolutely convergent for |a| < 1 since |a| = 1.
Hence the double series obtained is absolutely convergent for |a| < 1.
T h e following propert ies of Pn will be useful for our immediate needs:
(a) Pn(+ 1) = + 1, P«(- 1) = ( - I f . (21)
Th i s proper ty follows at once by put t ing q = ± 1 in (14) which then
reduces to the special cases
1 = 1 ± oc + a
2 ± a 3 + ...;
1 Toe
(b) \Pn(q)\ < 1 if - K q < + L (22)
Since the coefficients in the r ight -hand member of (20) are all positive,the max imum value of Pn for real values of S is the sum of the
coefficients. I t is reached for S = 0, q = + 1; this, in view of (21),
proves (22).
T h e importance of (22) for our present purpose is that it shows that
the series (16) is absolutely convergent for |a| < 1 and all values of q
that belong to real values of S. T h e proof is obtained by comparing
(16) with the absolutely convergent series for |a| < 1,
= 1 + oc + oc2 + ... + ocn + .... (23)
1 - (
T h e absolute values of the coefficients in the r ight -hand member of (16)
being at most equal to the corresponding coefficients in (23), the former
mus t also converge absolutely.
4 . T h e p r i n c i p a l p a r t s of U. Return ing now to the integral (15)
for U, subst i tut ion of (16) yields an absolutely convergent series, for
oc < 1. I t is therefore permit ted to write
T h e two series are mult ipl ied, and the coefficient of ocn yields:
• 2 cos (n — 2) S
(20)
Pn(cosS) 2 cos nS
dM. (24)
122 BODIES OF FINITE DIMENSIONS
Let this integral be writ ten
u=u0 + u1 + u2 + us + ....
T h e individual parts are to be evaluated by restoring | , rj, £ in the
integrands by using qp = (x£ + yy + #£)/r-
T h e various parts of U are as follows:
U0=£(dM = f^. (25)
Th i s is the result that would be obtained if all the mass of the body
M were concentrated in its center of mass.
But these three integrals are zero since rj9 £ are measured in a
coordinate system that has the center of mass of M as origin. Hence
f/x = 0. (26)
T h e expression for U2 is
which may be writ ten
(27)
T h e six integrals that appear in this expression are moments of the
second order which are related to the moments of inertia of the body M.
In order to arrive at a more suitable form, let I be the moment of inertia
about an axis through the center of mass, and let a, /?, y be the direction
cosines of this axis. T h e n , if 6 is the angle between this axis and the line
connecting an element dM with coordinates £, rj, £,
I = J p2 sin 2 BdM,
POTENTIAL—PRINCIPAL PARTS 123
X = Y : Z = X2 + Y 2 + Z 2
if X, Y, Z are the coordinates of any point on the surface of the ellipsoid
and oc, fi, y the direction cosines of the line joining this point with the
center. T h e last of these shows that the distance of any point X, Y, Z
from the center equals 1 / V I , which proves that the surface has no
real points at infinity since the moments of inertia of a three-dimensional
body about any axis are different from zero. T h e surface is therefore
an ellipsoid and no other form of quadrat ic surface.
By a proper choice of axes the equation of an ellipsoid may be put
in the form
AX2 + BY2 + CZ2 = + 1. (30)
where p 2 = £ 2 + rf -f £ 2 as before. T h i s integral may be developed by
making use of
p cos 6 = & + r)p + £y.
Hence
/ = J 0 2 - ( f « + ^ + ^ ) 2 ] d M ,
which may be writ ten as
J = J[(L 2 + *?2 + £ 2 ) ( « 2 + P2 + R 2 ) - (F* + + Cy)2] dM
= /[ft2 + i2) «2 + (C2 + A ]8 2 + ( £ 2 + V2) J2 ~ 2ritPy - lUyoc-l^ocfi] dM.
T h e moments of inertia A , B , C about the X, Y, Z axes, respectively,
and the products of inertia D , E, F are defined by the integrals
A = $(rj2 + £2) dM, D = fr)£ dM,
B = Stt2
+ £2)dM, E = jtfdM,
C = J (£ 2 + T)2) dM, F = fijr) dM.
T h e momen t of inertia about an axis th rough the center of mass with
direction cosines a, j3, y may, therefore, be writ ten
I = Aoc2 + Bfi2 + Cy 2 - IDfiy - 2£ya: - IFocfi. (28)
Now consider the surface of the second degree having the origin at
its center,
AX2 + BY2 + CZ2 - 2DYZ - 2EZX - 2FXY = + 1. (29)
Th i s is called the ellipsoid of inertia of the body M. Comparison between
(28) and (29) shows that the following relations will hold:
124 BODIES OF FINITE DIMENSIONS
T h e reduction from the general form (28) to that of (30) requires the
solution of an algebraic equation of the third degree. Th i s solution
locates the direction of the axes bu t leaves complete freedom in the
assignment of the XY YY and Z axes. As a rule there is a mechanical
preference for a particular choice. If the positive directions of the X
and Z axes have been decided upon, that of the Y axis follows wi thout
ambiguity if the coordinates are to form, say, a r ight -handed system.
Let it be assumed that the axes are so chosen that the equation of
the ellipsoid has the form (30). These axes are said to be the principal
axes of the body. Wi th this choice of axes we have D = E = F = 0.
T h e importance in connection with our present problem is that in this
case the last three integrals in the expression (27) for U2 vanish. T h e
three remaining integrals may be expressed in terms of A y By and C.
F rom the definition of A y By C it follows that
{ ? + y]* + t*)dM
£2dM
rfdM =
l2dM =
Thi s gives after simple reduction,
\
2
A + B - C
This expression for t / 2 must be looked upon as merely a convenient
2
C + A - B
B + C - A
A + B +C
2
Hence
(31)
For a moment , let a, j8, y be the direction cosines of the line joining
the point x, y, z with the origin. T h e n
= Aoc2 + Bp2 + c y 2 ,
which, in view of (28) and the choice of the principal axes as coordinate
axes, is the moment of inertia of the body about the line joining its center
of mass with the point x, yy z. If this is denoted by / , then (31) may
be writ ten
(32)
POLAR COORDINATES 125
short form. In all applications it is necessary to obtain for the potential
U an expression in which the coordinates of the point m appear explicitly
so that partial differential quotients with respect to these coordinates
may be readily obtained. Th i s requi rement is satisfied by the expression
(31).
A more special case is that of the spheroid, which, in addit ion to being
rotationally symmetrical about the Z axis, is also symmetrical about an
equatorial plane perpendicular to this axis. I t will be noted that the
expression (34) for U2 is the same regardless of whether the body
possesses an equatorial plane of symmetry . T h e same expression holds
for a spheroid as for a pear-shaped body that is rotationally symmetrical
about the Z axis.
T h i s simplified form of U2 may be used for the attraction of planets
upon their satellites. I t is applicable to binary star systems if one of
the components is of such small dimensions that it may be t reated as
a particle of mass. In very close pairs the tidal deformation may become
significant, destroying the rotational symmetry and requir ing the use
of the general formula (33).
5. I n t r o d u c t i o n of p o l a r c o o r d i n a t e s . Let polar coordinates be
in t roduced by
x = r cos ip cos
y = r sin I/J cos /?,
z = r sin /?,
I/J being the longitude and j8 the lati tude, the latter to be dist inguished
from the direction cosine j8 used previously. T h e expression for U2 may
then be wri t ten
(A cos 2 I/J + B sin 2 I/J) cos 2 p - C sin 2 p\
Csin2£J, [A+B + C)
(A+B + C)-
giving finally
(33)
N o w suppose that the body M is rotationally symmetrical about the
Z axis. T h e n A = B, and (33) reduces to
(34)
126 BODIES OF FINITE DIMENSIONS
6. T h e e x p r e s s i o n for U3. T h e next part of the potential is
T h e development will give rise to ten different integrals of the type
J f y d M with j + k + l = 3,
j , ky and / being zero or positive integers 1, 2, or 3.
Suppose the body is symmetrical about the XY plane and about the
ZX plane. In this case only three of the ten integrals are different from
zero, and the expression for U3 becomes
An example of a body to which this expression is applicable is a pear-
shaped body with the X axis as axis of the pear.
If the body is symmetrical about all three coordinate planes, as is,
for example, a homogeneous ellipsoid with three unequal axes, all of
the ten integrals will vanish. In this case the moments of odd order
will vanish generally and U3 = 0, U5 — 0, etc.
7* T h e e x p r e s si o n for £/ 4 . T h e general expression (24) gives
(37)
In the most general case there will be fifteen different integrals of
moments of the fourth order of the form J^jrjk^ldM. T h e expression
(37) can readily be writ ten in te rms of these integrals. T h e development
will be given here only for the case of a body symmetrical about all
three coordinate planes. In this case only six moments of the fourth
order will be different from zero, namely, those in which all three of
the exponents j , k> I are even. Th i s t rea tment will apply, for example,
to a homogeneous ellipsoid with three unequal axes. T h e n
•3(P + V2 + P)]dM. (35)
(36)
- 3 (e + v2
+ £ 2) 2] dM.
In the more special case of a spheroid, let the Z axis be the axis of
rotational symmetry . T h e n
\^dM = J V dM, J f £ 2 dM = fr]2£2 dM.
In addit ion there should be a relation between j^2rj2dM and j^dM.
Thi s relation can easily be discovered as follows. Let
P\ = ? + v2,
and introduce cylindrical coordinates p x , ip, £. T h e density K will, on
account of the rotational symmetry, be a function of p 1 and £ but
independent of ip. Hence, wi th
dM = K di px dpx dip,
| £ Y dM = | | * < / £ P J </Pl I'" sin 2 0 cos 2 ifj d^
77" /* *
4
T h e limits of the integrations with respect to P l , £ mus t be int roduced
to correspond to the particular body being considered. Similarly,
| (i2 + V
2)2 dM = | | * d£ pt dPl diP
Hence
= 2TT II K d£ p\ dpv
POTENTIAL—FOURTH ORDER 127
£ 2f 2 dM
f y </m].
(38)
rfdM
128 BODIES OF FINITE DIMENSIONS
I t then follows at once that
J ? m = J yfdM = | J (£2 + iffiM.
After introduct ion of these relations and further simplification, the
expression (38) for C74 for a spheroid reduces to
# 4 = r 4 ( s in 2 £ - ^sin 22j8),
the expression becomes
A rather unexpected simplification is that , whereas the various integrals
in (38) have different functions of the coordinates of m as factor, in
the spheroidal case one common function of these coordinates appears.
Th i s feature could be obtained from the general properties of the
Newtonian potential, which will not be examined here .
8. T h e p o t e n t i a l of a s p h e r o i d . T h e results obtained for a
spheroid may be collected in the form
dM.
(40)
with
] dM,
] dM.
If then polar coordinates are in t roduced by
dM.
(39)
z2 = r2 sin 2 jS,
POTENTIAL OF SPHEROID 129
T h e constants M , / , K tha t determine the potential of the spheroid
mus t be obtained from observational data, i.e., from the gravitational
attraction of the spheroid upon another body.
For the earth the parameter / may be derived from measurements
of the acceleration of gravity at the ear th 's surface. T h e parameter K
for the earth has tu rned out to be small, of the same order of magni tude
as coefficients of harmonics of higher order that appear in the potential
of a body with as irregular a surface s t ructure as the earth has.
In the moon ' s motion the parameter / is impor tant bu t the solar effect
on the moon ' s mot ion is so m u c h greater than the effect of the ear th 's
oblateness that the evaluation of / from the moon ' s motion presents
peculiar difficulties.
An interest ing method of obtaining / for the earth is by writ ing /
as the produc t of two factors,
3C-A C
J ~ 2 C ' MR2 *
T h e ratio (C — A)/C may be found from the observed values of the
constant of precession, while the ratio C/MR2 may be secured from the
theory of hydrostat ic equi l ibr ium of the earth. T h i s method was used
with much success by G. H . Darwin and further developed by W. de
Si t ter ; its l imitation depends on the degree of applicability of the theory
of hydrostat ic equi l ibr ium to the earth.
T h e s tudy of motions of artificial satellites has opened up the oppor t -
unity of evaluating the constants / and K, and other parameters that
enter into the ear th 's potential , by a method free from the complications
present in the older methods .
Evidently, B2 is of dimension [L 2 ] , J5 4 of dimension [L 4 ] . T h e more
commonly used parameters are / and K, related to B2i J5 4 by
in which R is the equatorial radius of the spheroid. T h e n
U - . . ] . (41)
Instead of K the parameter D defined as - 8 - BJt~* is frequently used.
T h e n the K t e rm in (41) becomes
s i n 2 ^ + sin 4 p\.
130 BODIES OF FINITE DIMENSIONS
With the availability of artificial satellites it is possible to apply to
the earth the method that has been available for the s tudy of the potential
of Saturn from its natural satellites. T h e inner six satellites of this planet,
from Mimas to Ti tan , have mean distances ranging from 3.11 R to
20.48 R. T h e motions of the nodes and pericenters of these satellites
are determined to a large extent by the oblateness of the pr imary. F rom
the observed motions for two or more satellites at different distances
from the pr imary the values of / and K (or D) may be obtained. T h e
application by H . Jeffreys to the Saturn sys tem 5 is recommended for
further reading. For the motions of the pericenter and node of a close
satellite, see Chapter X V I I .
T h e form of the potential of a spheroid given in Eq . (40) corresponds
to a general result obtained by Laplace. For a body with rotational
symmetry the general expression is
in which Pk are Legendre polynomials . In the case of the spheroid,
on account of the symmetry with respect to the equatorial plane, only
even te rms are present in the series. For bodies wi thout rotational
symmetry a generalized expansion, also given by Laplace, is available. 6
9. P o t e n t i a l for t w o b o d i e s of f inite d i m e n s i o n s . T h e develop-
ment of the potential as given in the preceding sections is applicable
to the great majority of problems in dynamical astronomy. T h e r e are
some problems, however, that require the development of the potential
for two bodies each of finite dimensions. T h e function U mus t then
be writ ten
where A is now the distance between the two elements of mass dM of
the body M and dM' of the body M'. T h e integrations are to be extended
over the masses of both bodies.
Let O be the center of mass of M , O ' that of M ' , and let O be the
origin of a Cartesian coordinate system with coordinates 77, £, and O'
that of a coordinate system with parallel axes with coordinates rj\
5 H . JEFFREYS, Monthly Notices Roy. Astron. Soc, 113, 8 1 ( 1 9 5 3 ) ; 114, 4 3 3 ( 1 9 5 4 ) .
6 F . TISSERAND, "Mecanique Celeste," Vol. 2 , Chaps. X V I - X I X . Gauthier Villars,
Paris, 1 8 9 2 .
POTENTIAL FOR TWO FINITE BODIES 131
Fur ther , let O O ' coincide with the Ox axis. If r is the distance O O ' ,
77, £ the coordinates of dM, and 77', £' the coordinates of d M ' ,
J 2 = ( r + f _ £ ) 2 + _ ^ 2 + _
Hence, if
= r 2 ( ! _ 2qoc + a 2),
the reciprocal of A may be developed by (16), giving
l+P1(q)'CL+P2(q)'cfi+J9 (42)
and, similar to the expansion used in preceding sections,
(43)
since
J £ dM =-- 0 , $£'dM'=0.
U2 =
)-2£?+riV'+ U']dMdM'.
K f - e)2
+ - V ) 2 + (£ - D 2 ] | dM dM'
T h e product terms yield nothing on account of the choice of origins.
Hence U2 may be writ ten
[ f 2 + V 2 + £ ' 2 ( V 2 + £' 2)] ««f'.
132 BODIES OF FINITE DIMENSIONS
Regardless of the orientation of the coordinate systems,
J(£ 2 +r!* + P)dM = l(A+B + C),
J ( f 2 + rj'2 + Cf2) dM' = \(A' + B' + C ) ,
if A, Bf C, A'yB', C are the moments of inertia about the principal axes.
Moreover,
are the moments of inertia of the two bodies about the line OO'. Hence ,
/ = Aa* + Bb* + Cc\
I' = A'a'2 + B'b'* + C'c'\
if a, b, c are the direction cosines of OO' relative to the principal axes
of M, and a', b', c' those relative to the principal axes of M'. T h e
resulting expression for U2 is
fM' rl
g ( ^ + B + C)-| (Aa* + Bb* + Cc2)] (44)
] •
which may be compared with (31), (32).
Th i s expression for U2 is not yetdefinitive, since the mot ion of the
center of mass of M will be determined by the components of the force
and that of M' by the components
where X, Y, Z are the coordinates of M and X\ Y\ Z' those of M'
in a fixed coordinate system. Hence the coordinates of O ' relative to
O are X' - Xy Y' — Y, Z' - Z, and
r 2 = (x - xy + (Y - Yy + ( z - zy.
T h e direction cosines that appear in (44) must now be expressed in
terms of these coordinates. Let the principal axes of M have the direction
NOTES AND REFERENCES 133
b2 =
Y') + - z'W
r*
Y') + v2(Z
-x')+npr- Y') + v3(Z - z'W
r2
Similar expressions may be wri t ten for a'2
y b'2
y c'2 by introducing
A/, \i{y v^y etc., for the direction cosines of the principal axes of M'.
T h e r e is no difficulty in principle concerning the development of the
higher por t ions in the expression (42) for U. T h e procedure is the same
as that followed for the at traction of a finite body upon a material
particle. In the most general case these higher port ions soon consist of
so numerous te rms that the development becomes difficult. For most
astronomical problems, the developments given in this chapter suffice.
If additional te rms are needed for any particular problem they may
readily be obtained by extending the procedure followed. In all such
cases it is desirable to take advantage of all the simplifications due to
symmetry and particular orientation that each individual problem may
present .
N O T E S A N D R E F E R E N C E S
F . T i s s e r a n d ' s Traite de Mecanique Celeste, V o l . 2 , c o n t a i n s an e x h a u s t i v e
t r e a t m e n t of t h e subjec t c o v e r e d i n th i s chapter . A use fu l addi t ional re ference is
T h o m s o n a n d T a i t ' s Treatise on Natural Philosophy, Part I I , n e w e d i t i o n , e s p e -
cial ly C h a p t e r V I , C a m b r i d g e U n i v . P r e s s , L o n d o n a n d N e w York , 1 8 8 3 .
a2 =
c2 --
cosines Xly fily vly A2, /x2, v2y A3, /z 3, vs relative to this fixed coordinate
system. T h e n
CHAPTER I V
CALCULUS OF FINITE DIFFERENCES
1. R e p r e s e n t a t i o n of f u n c t i o n s . A function is often represented
by an analytic expression in te rms of one or more independent variables,
indicated by symbols, which may be imagined to vary continuously over
a range of numerical values, infinite or finite. Th i s formula explicitly
prescribes a set of mathematical operations on the variables, by means
of which the function is defined for any particular values of the variables.
T h e infinitesimal calculus is concerned with the differentiation and
integration of such expressions. An alternative form for the representa-
tion of functions is the numerical form, in which numerical values of
the function are given for certain definite values of its independent
variable or variables. T h e values of the independent variable, when there
is only one, are usually wri t ten in a column, and beside each one is
placed the corresponding value of the function. Such an exhibit is called
a table. T h e independent variable is called the argument. T h e argument
is usually, bu t not always, given at equal intervals; the difference between
two consecutive arguments taken without regard to sign is called the
tabular interval, the interval of the argument , or simply the interval.
W h e n there are two independent variables the values of one (called the
vertical a rgument) may be wri t ten along the left margin and the other
(the horizontal a rgument ) across the top, the values of the function
forming a rectangular array which is known as a table of double entry.
Tables with one independent variable are called tables of single entry.
A third form in which functions may be represented is the graphical,
in which, unlike the numerical , the representation is cont inuous. Th i s
form is not, however, much used in celestial mechanics, for it is not
compatible with great precision.
T h e numerical form of a function may be preferable to the analytic
for any of a number of reasons. Tables of logari thms and of t r igonometr ic
functions are made available as a t ime-saving device because the calcula-
tion of individual values by substi tut ion of numbers into infinite series
is too laborious for the computer . T h e ephemerides of the sun, moon,
and planets are tabulated in the annual volumes of the American Ephe-
meris and similar publications because their calculation from the
expressions that form the basis of the ephemerides is not practicable
134
DIFFERENCES 135
for many who require these data. As a th i rd example we have tables
that represent numerical solutions of differential equations obtained
directly by numerical processes, that is, wi thout initial expression of
the solution analytically.
T h e more common tables of logari thms and of t r igonometric functions
are furnished with such a small tabular interval that interpolation is
rendered very easy, the process being known as linear interpolation.
Such extended tabulat ion is not always practicable, even for tables that
mus t be used frequently, and it is therefore necessary to have more
general methods of interpolation than the linear method, applicable
when linear interpolation would lead to inaccuracy. I t is useful also to
be able to differentiate and integrate functions expressed in the numerical
form, particularly to integrate those functions that cannot be integrated
analytically, or whose analytical development would require great labor.
These three operations—interpolation, numerical differentiation, and
numerical integrat ion—consti tute the calculus of finite differences,
sometimes called the finite calculus.
T h e advantage of analytic methods is that they are the only ones
that can lead to perfectly general results, that is, results known to be
valid for all values of the independent variables. T h e advantage of
numerical methods is that they are always available. Again, numerical
methods are often less laborious and less liable to error than analytic
ones, and they are capable of yielding equally great accuracy. Moreover,
celestial mechanics, like any other branch of applied mathematics , aims
at practical applications. Here numerical results are always needed, and
for many practical applications intermediary analytic developments are
entirely superfluous.
2. D i f f e r e n c e s . Table 1 gives values of the polynomial x3 —
3x — 23 for some integral values of x, with their differences.
T h e numbers f1 are obtained by subtract ing each value of / from the
one immediately below it, in succession, and the other columns are
calculated similarly. T h e number s fl are called the first differences, the
f11 are the second differences, etc. I t will be seen later that the successive
orders of differences are closely related to the successive derivatives of
the funct ion; in fact it may be noticed that the second, third, and fourth
differences are precisely equal to the corresponding derivatives in this
particular example, bu t this is not t rue in general. T h e differences of
any order are in general smaller than those of the order immediately
preceding, that is, the differences converge. If the function and all of
its derivatives are cont inuous and do not become infinite within the
range of the table, the differences converge more rapidly the smaller
136 FINITE DIFFERENCES
T A B L E 1
POLYNOMIAL
X / / /"
— 3 — 4 1
+ 1 6
— 2 — 2 5
+ 4
— 1 2
+ 6
— 1 — 2 1
— 2
— 6
+ 6
0
0 — 2 3
— 2
0
+ 6
0
+ 1 — 2 5
+ 4
+ 6
+ 6
0
+ 2 — 2 1
+ 1 6
+ 1 2
+ 6
0
+ 3 — 5
+ 3 4
+ 1 8
+ 6
0
+ 4 + 2 9
+ 5 8
+ 2 4
+ 5 + 8 7
the tabular interval. In fact we may always make the convergence as
rapid as we please by taking the interval sufficiently small (which in
this case would of course require nonintegral values of x).
A polynomial is a special kind offunction exhibit ing properties not
ordinarily encountered in celestial mechanics. A special proper ty of
Table 1 above is that the representation is exact, without any errors.
Table 2, giving the sun ' s declination as calculated on the assumption
of elliptic motion for the earth, is more representative of the functions
to be dealt with later.
T h e differences in this table converge but slowly, and only up to a
certain order, after which they begin to alternate in sign. If further
orders of differences are calculated they will be found to diverge. Th i s
behavior is caused by the errors of rounding present in / . Each value
of / , al though calculated with all possible care, is necessarily affected
by an error that may be of any size between + 0?00005 and — 0?00005.
These errors build up in the successive orders of differences until they
assume the governing role at the order where the differences alternate
in sign. T h e r e is never any advantage in carrying the differences beyond
this point, which in this example would be / I X .
Th i s property of bui lding up , possessed by the errors of rounding,
is exhibited in Tab le 3, which presents the worst possible case of
divergence of errors that can arise when the functions are cont inuous
D I F F E R E N C E S
T A B L E 2
DECLINATION OF THE SUN
137
1966 / fl /"
[V y V I I l
Feb. 19.75 — l l ! 2 4 4 1
+ 3.6921
Mar. 1.75 — 7.5520 + 1812
+ 3.8733 — 1075
11.75 — 3.6787 + 737 + 51
+ 3.9470 — 1024 — 14
21.75 + 0.2683 — 287 + 37 + 8
+ 3.9183 — 987 — 6 — 9
31.75 + 4.1866 — 1274 + 31 — 1 + 23
+ 3.7909 — 956 — 7 + 14
Apr. 10.75 + 7.9775 - ^ 2 2 3 0 + 24 + 13 — 11
+ 3.5679 — 932 + 6 + 3
20.75 + 11.5454 — 3162 + 30 + 16 — 2
+ 3.2517 — 902 + 22 + 1
30.75 + 14.7971 — 4064 + 52 + 17 — 4
+ 2.8453 — 850 + 39 — 3
May 10.75 + 17.6424 — 4914 + 91 + 14 — 5
+ 2.3539 — 759 + 53 — 8
20.75 + 19.9963 — 5673 + 144 + 6
+ 1.7866 — 615 + 59
30.75 + 21.7829 — 6288 + 203
+ 1-1578 — 412
June 9.75 + 22.9407 — 6700
+ 0.4878
19.75 + 23.4285
and are accurately calculated, and when the tabular interval is small
enough for good convergence. T h e hypothetical errors in the function
are tabulated, instead of the function itself.
Th i s array suggests a rule that can often be used to advantage in
checking calculated functions. Difference the function to the order
where the differences alternate in sign, and call this the mth order.
T h e n if any mth difference exceeds 2 m _ 1 in absolute magni tude there
is certainly an error in the function greater than half a uni t of the last
decimal. T h e converse is not t r u e ; an error greater than half a uni t
does not necessarily produce such a large effect, as will be seen in the
following section.
An exception to the rule occurs when the tabular interval is taken
so great that the differences do not converge, as for example if cos x
is tabulated at an interval of 180° beginning at 0°, bu t such cases can
138 FINITE DIFFERENCES
I II III IV v VI
€ € € € € € €
+ i
— 1
- 1
+ 1
+ 1
— 1
I
— s
+ 1
+ 1
— 1
+ 1
+ i
always be easily avoided by trial, or by sufficient knowledge of the
behavior of the function.
In extended calculations it is seldom practicable, and never efficient,
to limit the errors to half a uni t of the last decimal, bu t the rule is easily
modified to apply to such cases. If, for example, errors of 3 uni ts are
permit ted, then no mth difference should exceed 3 X 2 m , in general,
for errors of p units , the mth differences should not exceed p X 2m.
3* D e t e c t i o n of c a s u a l e r r o r s * By a casual error is meant an
error that affects only isolated values of the function, such as a typo-
graphical error or an error of calculation. Such errors, if they are not
too numerous , can easily be detected by examination of the differences.
In order to show the effect of such an error, the polynomial tabulated
in Tab le 1 is given again in Table 4 with the value of / corresponding
to x = + 1 altered by one unit .
T h e fourth differences, instead of vanishing as they did before, exhibit
a pa t te rn that is easily recognized as the binomial coefficients in the
expansion of (a — 6) 4 . If the fifth differences are wri t ten it will be seen
that they correspond to the coefficients in the expansion of (a — b)h.
If the error in the function had been p un i t s instead of 1 the pat tern
would still be easily recognized; each of the fourth differences would
merely be mult ipl ied by p. I n general, an isolated error of p units in
the function produces a characteristic pat tern in the differences of the
mth order, which corresponds to the binomial coefficients in the ex-
+ 2
— 4
— 2 + 8
+ 4 — 1 6
+ 2 — 8 + 3 2
— 4 + 1 6
— 2 + 8
+ 4
+ 2
T A B L E 3
DIVERGENCE OF ERRORS
CASUAL ERRORS 139
T A B L E 4
POLYNOMIAL WITH AN ERROR
X / z 1 /"
— 3 — 4 1
+ 1 6
— 2 — 2 5
+ 4
— 1 2
+ 6
— 1 — 2 1
— 2
— 6
+ 7
+ 1
0 — 2 3
— 1
+ 1
+ 3
— 4
+ 1 — 2 4
+ 3
+ 4
+ 9
+ 6
+ 2 — 2 1
+ 1 6
+ 1 3
+ 5
— 4
+ 3 — 5
+ 3 4
+ 1 8
+ 6
+ 1
+ 4 + 2 9
+ 5 8
+ 2 4
+ 5 + 8 7
pansion of p(a — b)m. For more general types of functions than poly-
nomials the pat tern is likely to be partly obscured by the effects of errors
of rounding, which may make it difficult to detect errors of small
magni tude , that is, errors only a little larger than the errors of round ing ;
in general, the pat tern becomes more difficult to discern the more
slowly the differences converge.
T h e rule for correcting casual errors in a function is as follows: W h e n
the characteristic pat tern is detected in the differences of the mth order,
then if m is even take the largest mth difference, divide it by the value
taken from the following table, and add the quot ient (nearest whole
number ) to the function in the same l ine; if m is odd take the mean,
without regard to sign, of the two largest consecutive mth differences,
prefix the sign of the lower, divide by the value in the table below and
add the quotient (nearest whole number ) to the function on the inter-
mediate line. These number s are, except for sign, the largest coefficients
in the binomial expansions of order m. T h e y may usefully be compared
with the numbers 2 m _ 1 , of the preceding section.
m
Divisor
2 3 4 5 6 7 8 9
+ 2 - 3 - 6 + 1 0 + 2 0 - 3 5 - 7 0 +126
140 FINITE DIFFERENCES
Table 5 is given to show the application of this rule. Th i s is a table
of five-place logari thms, with an error of a uni t purposely in t roduced
into log 50. T h e tabular interval has been chosen large enough to make
the differences slowly convergent. T h e effect of the errors of rounding
is so large in the eighth differences as almost to obscure the characteristic
pat tern, bu t the two salient features remain. T h e differences gradually
diminish for some distance above and below the largest one, and they
alternate in sign. Dividing the largest difference, + 94, by the value
from the table, — 70, gives — 1 as the correction to be applied to the
function 69898. If the table is now constructed anew it will be seen
that the new eighth differences are much smoother than before; the largest
in the region where they alternate in sign is 34.
T A B L E 5
LOGARITHMS WITH AN ERROR
X / f /" in
f
VI / V i i i
1 5 1 7 6 0 9
+ 1 2 4 9 4
2 0 3 0 1 0 3
+ 9 6 9 1
— 2 8 0 3
+ 1 0 3 0
2 5 3 9 7 9 4
+ 7 9 1 8
— 1 7 7 3
+ 5 5 0
— 4 8 0
+ 2 5 7
3 0 4 7 7 1 2
+ 6 6 9 5
— 1 2 2 3
+ 3 2 7
— 2 2 3
+ 1 0 8
— 1 4 9
+ 9 0
3 5 5 4 4 0 7
+ 5 7 9 9
— 8 9 6
+ 2 1 2
— 1 1 5
+ 4 9
— 5 9
+ 2 9
— 6 1
4 0 6 0 2 0 6
+ 5 1 1 5
— 6 8 4
+ 1 4 6
— 6 6
+ 1 9
— 3 0
+ 3 9
+ 1 0
4 5 6 5 3 2 1
+ 4 5 7 7
— 5 3 8
+ 9 9
— 4 7
+ 2 8
+ 9
— 4 2
— 8 1
5 0 6 9 8 9 8
+ 4 1 3 8
— 4 3 9
+ 8 0
— 1 9
— 5
— 3 3+ 5 2
+ 9 4
5 5 7 4 0 3 6
+ 3 7 7 9
— 3 5 9
+ 5 6
— 2 4
+ 1 4
+ 1 9
— 3 5
— 8 7
6 0 7 7 8 1 5
+ 3 4 7 6
— 3 0 3
+ 4 6
— 1 0
— 2
— 1 6
+ 2 6
+ 6 1
6 5 8 1 2 9 1
+ 3 2 1 9
— 2 5 7
+ 3 4
— 1 2
+ 8
+ 1 0
— 2 1
— 4 7
7 0 8 4 5 1 0
+ 2 9 9 6
— 2 2 3
+ 3 0
— 4
— 3
— 1 1
7 5 8 7 5 0 6
+ 2 8 0 3
— 1 9 3
+ 2 3
— 7
8 0 9 0 3 0 9
+ 2 6 3 3
— 1 7 0
8 5 9 2 9 4 2
D I R E C T I N T E R P O L A T I O N 141
Th i s example exhibits well the power of this method of detecting
casual errors . In most cases arising in practice the differences will
converge more rapidly, and the errors are likely to be larger; both of
these features conduce to ease in applying the method . T w o cautions
are needed. If the errors are so numerous that the characteristic pat terns
overlap one another the method becomes difficult if not impossible to
apply, though the large size and " raggedness" (lack of smoothness) of
the differences may nevertheless indicate that errors are present . T h e
method can be used only to detect casual e r rors ; systematic errors, such
as those caused by omit t ing significant te rms of a series expansion,
cannot be detected.
4 . D i r e c t i n t e r p o l a t i o n . Direct interpolat ion is the process of
numerical ly evaluating a function by means of a table of it, for a value
of the independent variable between two of those tabulated. Tab le 6
shows the symbolic notation used.
T A B L E 6
NOTATION
Arg. / f f"1
*-• 8 L 2
8 4 - 2
S - 3 / 2
* - i 8 * i
8 - l / 2
*o /. S 2
0
S 4
0
8
1/2 1/2
*i A 8 2 i 8 t
8 3 / 2 8 3 / 2
* 2 /. K
In addit ion /x8x is used to mean \ ( § 3 / 2 + §1/2)? means \ ( 8 i 1 / 2 + 8i / 2 ) , etc.
T h e tabular interval is denoted by w, that is, w = xx — x0, if the interval
is constant. I t is usually, bu t not always, considered that the a rgument
for which the interpolated value of / is wanted lies between x0 and xv
Call this a rgument x. T h e n n — (x — x0)/w is called the interpolating
factor, and the value of / for a rgument x may be denoted by / n . In
many of the tables in common use w is a power of 10 and it may be
expressed in any uni ts whatever, such as 1 day, 0.00001 day, 0.01
142 FINITE DIFFERENCES
degree, etc. I t is to be noted that n, on the contrary, is always a pure
number . In linear interpolation, commonly used in t r igonometr ic tables,
it is assumed that
/ n = / o + w8 1 / 2 . (1)
Since § 1 / 2 = / i — / 0 , this is equivalent to the equation
fn = / o + n(fi ~fo)
(2)
= (1 - n)fo + nfv
If / is plotted as ordinate against x as abscissa, then, since this is a
linear equation in n, and since for n = 0, fn = / 0 , and for n = 1,
fn == fly the equation represents a straight line th rough the points f0
and fv T h u s the process of linear interpolation subst i tutes for the
function a series of straight-line segments joining consecutive tabular
values. If the tabular interval is so small that the function has no
appreciable curvature between two consecutive tabular values, the result
of a linear interpolation is sufficiently accurate, otherwise not . Funct ions
that are in widespread use, such as five-place sines and tangents , can
be efficiently tabulated at such small intervals, bu t this is not practicable
for functions having a restricted application, or when many significant
figures are needed; in such cases, instead of construct ing very extensive
tables it is easier to use more general methods of interpolation.
Since two points on a curve can furnish no information about its
curvature, it is evident that more than two points are needed for a more
general method of interpolation than the linear. Wha t is done is to pass
a smooth curve through 3, 4, 5, or more consecutive tabular values,
and to assume that the curve represents the function in between the
tabular values. A smooth curve may be passed through a finite n u m b e r
of points in infinitely many ways; it is desirable to choose a way that
will give a simple analytic approximation to the function. T h e commonest
procedure is to subst i tute for the function a polynomial in powers of n.
If three points are used the polynomial contains a constant te rm and
the first and second powers of n; if four points are used the th i rd power
is needed, etc. T h e result resembles the familiar Taylor series expansion
of a function, bu t instead of the successive derivatives appearing in the
formula, the successive orders of differences occur. As an example of
the process the case of three points, or second-order interpolation, is
examined in detail.
Let the three tabular values of f(x) be f_l9 / 0 , and fx. A polynomial
of the second degree has the form A + Bx + Cx2, where A> B, and C
are constants. T h e y are to be determined by the condition that the curve
DIRECT INTERPOLATION 143
must pass th rough f_ly / 0 , and fv For x — 0 we have immediately
A = f0. For x = + 1 and x = — 1 we have
A+B + C=fl9
A - B + C=f-V
Subtract ing the second equation from the first gives
2 # = fx —f-i =A —f0 + / 0 — / - l = §1 /2 + 8 - i / 2 -
But since
S - l / 2
B
Adding the two equations gives
2A+2C=f1 +/_,.
Subst i tut ing for A its value / 0 ,
2 C = / 1 - 2 / 0 + / _ 1
= ( / 1 - / „ ) - ( / o - / _ 1 )
= 8 l / 2 - S - l / 2
Hence
/ ( * ) = / o + ( s 1 / 2 - i ^ ) * + i § ^ 2
= / 0 + S 1 / 2 * + S ^ ( * - l ) / 2 ,
or, when x is equal to «,
/„ = / 0 + S 1 / 2 « + S ^ ( « - l ) / 2 .
Th i s is one example of a formula for interpolation with second
differences. A formula including the effect of th i rd or higher differences
could be found by a similar method. T h e formula may be used in its
present form or it may be rearranged in numerous ways, making use
of the definitions of 8 and 8 2 in terms of / , or collecting terms in various
ways. One form that is particularly convenient for mental calculation is
/ n = / o + [ 8 i / 2 - i ( l - ^ ) S 0
2 ] n.
= 8 l / 2 80>
— °l/2 2 °0*
144 FINITE DIFFERENCES
Another form, known as Lagrange 's formula, is
fn = / - i ( i « 2 - i » ) + / o ( i - « 2 ) + / i ( i « 2 + *»)•
If w = ^ this takes the simple form
5. E v e r e t t ' s a n d B e s s e l ' s f o r m u l a s . Once the general principles
by which interpolation formulas may be constructed have been grasped,
it is not necessary to examine the detailed derivations of all the formulas
likely to be used in practice. Of these the most generally useful are
Evere t t ' s and Bessel's.
Everet t ' s formula is
/„=/«. + » 8!/2 + E l 8 o + E\ S 2 + El SJ + E\ S* +
where
Bessel's formula is
fn = fo + » 8 ! / 2 + + 8 D + B 3 8 ?/ 2 + fi4(§0 + 8 l ) + & 8V2 +
Lagrange 's formula has the apparent advantage that the differences
are not required, bu t whether this is a real advantage is quest ionable.
Every manuscr ipt table, and many pr inted ones, ought to be differenced
before use in order to verify the accuracy. Once the differences have
been formed it is usually efficient to use them in interpolation, because
the functions of n are needed to fewer significant figures than n itself,
whereas with Lagrange 's formula all the numerical coefficients are
needed to the same accuracy as the function to be interpolated. Fu r the r -
more, without knowing the size of the differences it is impossible to
tell how many values of / a r e needed in any part icular case, as will appear
later.
E V E R E T T A N D B E S S E L 145
T h e n u m b e r of te rms that mus t be used in any part icular application
of an interpolation formula depends on the size of the error that can
be tolerated in the result , and on the size of the differences. In most
work it is considered permissible to cut off the formula at the point
where the size of the largest neglected te rm does not exceed half a uni t
of the last decimal of the function. T h e extreme values of the Besselian
coefficientsare, approximately,
F r o m these are easily deduced the following convenient l imit ing values
of the differences that may be neglected.
Difference neglected: second third fourth fifth
Largest numerical value: 4 60 20 500
T h u s , if the fourth differences are larger than 20 and the fifth differences
smaller than 500, the first five t e rms of Bessel 's formula should be used.
T h e use of Everet t ' s formula is l imited to an even n u m b e r of t e rms
because E0 and Ex are of the same order of magni tude . T h e accuracy
of any even n u m b e r of te rms of Everet t ' s formula is precisely equal
to that of the same n u m b e r of te rms of Bessel's. T h u s the first four
te rms of Everet t ' s formula include the effect of th i rd differences, and
the first six include the effect of fifth differences. In general, Everet t ' s
formula is to be preferred when the highest order of differences taken
into account is odd, and Bessel's when it is even; the fifth te rm of
Bessel's formula may, however, be used together with the first four of
Everet t ' s when fourth differences mus t be taken into account, fifth
differences being negligible.
W h e n fourth differences are not negligible either formula may
nevertheless often be limited to its first four t e rms by utilizing a powerful
device known as the throwback, said to have been invented by Lagrange.
where
146 FINITE DIFFERENCES
It will be observed that B*/B2 = (n + l)(n — 2)/12, and this function
varies over a small range for n between 0 and 1, with an average value
of about — 0 . 1 8 4 . If we put
M2 = S 2 - 0 . 1 8 4 S 4
then M2 may be used instead of S 2 in Everet t ' s or Bessel's formula,
with an error of less than half a uni t if S 4 is less than 1000. In such
cases the throwback should nearly always be used.
Bessel's and Everet t ' s coefficients are somewhat laborious to calculate,
especially for the higher orders . T h e y have therefore been tabulated
extensively as functions of n. In practical calculation m u c h labor may
be wasted by carrying more figures in the coefficients than are actually
needed. As a general rule, the interpolation coefficients are needed to
the same n u m b e r of decimals as the n u m b e r of significant figures in the
differences they mult iply.
6. N e w t o n ' s f o r m u l a . Everet t ' s and Bessel's formulas are known
as central-difference formulas because the differences involved all lie near
the center of the port ion of the table used. Another type of formula,
due to Newton, makes use of the diagonal differences. I t may be writ ten
which is known as the backward formula.
T h e forward formula is sometimes useful for interpolating near the
beginning of a table, where the central differences may not be available,
and the backward formula may be used near the end of a table. T h e
fn = / o + w 8 l / 2
which is known as the forward formula, or
U-m/2
INVERSE INTERPOLATION 147
chief value of the formula is, however, for extrapolation in connection
with s tep-by-step integration (Section 12). I t may also be used for
extending a table, by put t ing n = — 1 in the forward formula or n = 1
in the backward one, bu t this should not usually be done for more than
one step if at all, owing to the rapid accumulation of error. T h e use
of Newton ' s formula for extending a table is exactly equivalent to
assuming the highest order of differences taken into account to be
constant. T h e result can be obtained simply by copying the value of the
highest ordered difference immediately below, and extending the table
down and to the left by successive addit ions.
7. L a g r a n g e ' s f o r m u l a for i n t e r p o l a t i o n to h a l v e s . Al though
Lagrange 's formula is not recommended for general use there is one
application where its convenience may outweigh its disadvantages, that
is, extensive interpolation to halves with a calculating machine. T h e
four-point formula, to be used when fourth differences are negligible,
takes the very simple form
/ i / 2 = ^ ( - / - i + 9 / „ + 9 / 1 - / 2 ) ,
and the six-point formula, to be used when sixth differences are
negligible, is
fxn = 2^6 ( 3 / - * ~ 2 5 / - i + 1 5 0 / o + 1 5 0 / i " 2 5 / 2 + 3 / 3 } -
8. Inverse i n t e r p o l a t i o n . Th i s name is given to the process of
finding the numerical value of the independent variable for a given value
of the function. T h e problem is often one of great difficulty if analytical
methods are employed, bu t with numerical methods it is only slightly
more t roublesome than direct interpolation. A formula may be found
simply by solving any interpolation formula for n in te rms of / w , / 0 ,
and the differences, bu t if the formula is cut off at any order of
differences, say the 7 t h , it will be found that n has to be determined
by solving an equation of the jih degree, which may be very laborious.
It is better to proceed by reiteration, a special case of successive approxima-
tions. In this powerful numerical method the formula is arranged in such
a way that by subst i tut ing an approximate value of the result into it
a more accurate value is obtained, and the process may be repeated
as often as necessary. In the present problem an approximate value of n
is found by linear inverse interpolation. Th i s value of n is used to
obtain approximate values of the interpolation coefficients, which can
148 FINITE DIFFERENCES
then be used to yield a more accurate value of n. For example, Bessel's
formula may be used in the form
n 3 1 / 2 =fn -f0 - B2 (8» + o2) - B3 S 3
/ 2 ,
where S 2 + S* should be replaced by Ml + M\ if the fourth differences
are appreciable and not too large to be th rown back. By neglecting the
te rms involving B2 and B3 in the first approximation the equation can
easily be solved for n. T h i s value of n being used to find B2 and B3>
the equation may again be solved for w, and this new value of n may
be used to get improved values of B2 and B3. T h e process should be
repeated unti l the value of n is not changed by further repetit ions.
An impor tant caution is necessary in inverse interpolation. T h e value
of n can be obtained to no more significant figures than are present
in 8 1 / 2 . Additional figures, if writ ten down, are likely to deceive the
computer into thinking that he has more accuracy than he really does.
One must always be on guard against "manufactur ing decimals, ' ' which
is justifiable only in very unusual instances. A familiar example of the
difficulty arises when a small angle is sought to be determined from
its cosine. One remedy is to obtain the first difference to more decimals,
but it is usually preferable to avoid the difficulty by a suitable t rans-
formation. Instead of the cosine, for example, the versed sine or the
haversine may be used.
9. E r r o r of a n i n t e r p o l a t e d q u a n t i t y . I t has been pointed
out that the max imum error of a value of a function that has been
calculated with all possible care is ± 0 . 5 uni t of the last decimal.
Fur thermore , all values of the error between these limits are equally
likely, which means that about half of the errors in an extended series
of values lie between the limits ± 0.25, and half lie outside these limits.
Th i s fact may be expressed by the s tatement that the probable error
is ± 0.25. A further error is in t roduced by the process of interpolation.
If second and higher differences are zero, and if the interpolated quanti ty
is rounded to the same n u m b e r of decimals as the tabular value, another
similar error is introduced. In this case the max imum error of the
interpolate is ± 1 . 0 and the probable error is about ± 0 . 3 . Slightly
greater accuracy may be obtained by keeping additional figures in the
interpolate. If this is done the error of the interpolate is only very
slightly greater than that of the tabular value. In general, however,the
additional figures are not wor th the t rouble of writ ing them down.
T h e y are not significant figures in any sense of the word ; the first extra
decimal is subject to an error of 5 units and the second is entirely
fictitious.
DIFFERENTIATION 149
In practice the second differences are usually different from zero, and
if they are neglected an additional error is introduced, which may
amount at max imum to 1/8 of the second difference. Since it is the usual
practice to neglect contr ibut ions of the second and higher-ordered
differences if they are less than 0.5, it may be stated as a general rule
that the max imum error of an interpolate is somewhat greater than
± 1 . 5 uni ts , and its probable error approaches ± 0 . 5 unit .
T h e error of a quant i ty obtained by inverse interpolation is governed
by different rules. If the errors of the tabular functions are kept within
+ 0.5, the error of any first difference is within + 1 . 0 and the error
of the result will lie in the range + 1/8, if the contr ibut ion of differences
beyond the first is inappreciable.
10. N u m e r i c a l d i f f e r e n t i a t i o n . It has been seen in Section 2
that the successive orders of differences of a function are closely related
to the successive derivatives. T o find the precise expression for the
derivatives in te rms of the differences it is necessary only to differentiate
any of the interpolation formulas as many t imes as required. We have
bu t
and hence
Similarly,
df dfdn
dn 1
... 4f df
o d2f d2f
w2 -M = -J-. e t c .
Other forms may be obtained by rearranging terms and by simple
substi tut ions. Usually a derivative is wanted either for a point of
tabulation or for a halfway point. Some of the most useful formulas
follow.
dx dn dx '
dx w
dx dn'
dx* an'
For example, differentiating Bessel's formula gives
T A B L E 7
SIN x
X / = s i n x fl /"
— 3 0 ° — 0 . 5 0 0 0 0
+ 1 5 7 9 8
— 2 0 — 0 . 3 4 2 0 2
+ 1 6 8 3 7
+ 1 0 3 9
— 5 1 1
— 1 0 — 0 . 1 7 3 6 5
+ 1 7 3 6 5
+ 5 2 8
— 5 2 8
0 0 . 0 0 0 0 0
+ 1 7 3 6 5
0
— 5 2 8
+ 1 0 + 0 . 1 7 3 6 5
+ 1 6 8 3 7
— 5 2 8
— 5 1 1
+ 2 0 + 0 . 3 4 2 0 2
+ 1 5 7 9 8
— 1 0 3 9
+ 3 0 + 0 . 5 0 0 0 0
For x = 0 and n = 0 we have
+ 10° £ = + 0.17365 + 0.00088 = + 0.17453.
In the s tudy of the infinitesimal calculus it is usually not necessary
to consider in what units a function or its independent variable may
be expressed, bu t in numerical work it is necessary to keep the uni ts
in mind at all t imes. A derivative of a function takes on different
numerical values according to the uni t employed. In the present case x
has been expressed in degrees. Hence we may say that dfjdx at x = 0
is 0.17453 per 10 degrees, or 0.017453 per degree, or 1.00000 per radian
( remembering that one radian is 57.29578 degrees), or 1.00000 for
brevity. T h e last s ta tement is already familiar.
150 FINITE DIFFERENCES
..,(3)
Table 7 illustrates the principle of numerical differentiation.
(4)
(5)
(6)
INTEGRATION 151
12. N u m e r i c a l i n t e g r a t i o n . In celestial mechanics it is necessary
to consider the numerical solution of ordinary differential equations of
the second order. In the most usual form, to which the succeeding
chapter is devoted, three s imultaneous equations of the second order
are integrated to give the rectangular coordinates of a planet or comet
as functions of the t ime. T h i s section is devoted to the consideration
of two simpler examples.
T h e method used is precisely the opposite of numerical differentiation,
where the differences of a function are used to calculate the derivatives,
and formulas have been given for the purpose. In the present problem
the derivatives of a function are calculated first, and differenced as a
check. T h e resulting table is then extended to the left by summation,
that is, by successively adding consecutive values of the derivative to
a starting value. Finally the values of the integrals are obtained from
the summat ion by means of a formula. Formulas for the purpose can
be derived by integrating an interpolation formula, considering n as the
independent variable. Those needed are given below, where the first
summat ion is denoted by and the second by n / .
(9)
(10)
As a first example consider y = J(3x2 — 3) dx. I t is known that the
result is y = xs — 3x + C, C being an arbitrary constant. T h e word
arbitrary means that no matter what constant is chosen the result is
mathematically correct. It will, however, be noticed that no numerical
value of y can be stated unless C is a known, definite quant i ty . In physical
problems the constants of integration are determined by the initial
conditions, which amount in this example simply to the numerical value
11. S p e c i a l f o r m u l a s . A particularly useful form of (3) may be
obtained by subst i tut ing for the differences their values in t e rms of the
functions. T h u s , if the fifth differences are less than 15,
n w ( ^ ) 0 = f ~ 2 - ^ i + ^ - f v (7)
and if the seventh differences are less than 70,
- / _ 3 + 9/_ 2 - 45f_± + 45/i - 9/ 2 + fy (8) 60w (-
152 FINITE DIFFERENCES
of y for some particular value of x. (For a double integration either
two values of y or one value of y and one of its first derivative are needed.)
In the present problem the initial condition is at our disposal; we shall
choose it to give the polynomial of Section 2, xz — 3x — 23 .
First are calculated some values of 3x2 — 3, which are differenced.
T h e results are in Tab le 8, where the column lf is yet to be filled in.
T A B L E 8
NUMERICAL INTEGRATION
y / f1 /" y
i
y
i i
y
in
y
— 3
— 3 0 . 5
+ 2 4
- \ 5 — 3 1 . 1 2 5
— 2
— 2 1 . 5
+ 9
— 9
+ 6
— 2 1 . 8 7 5
+ 9 . 2 5
— 9
— 1
— 2 1 . 5
0
— 3
+ 6
— 2 1 . 6 2 5
+ 0 . 2 5
— 3
+ 6
0
— 2 4 . 5
— 3
+ 3
+ 6
— 2 4 . 3 7 5
— 2 . 7 5
+ 3
+ 6
+ 1
— 2 4 . 5
0
+ 9
+ 6
— 2 4 . 1 2 5
+ 0 . 2 5
+ 9
+ 6
+ 2
— 1 5 . 5
+ 9
+ 1 5
+ 6
— 1 4 . 8 7 5
+ 9 . 2 5
+ 1 5
+ 6
+ 3
+ 8 . 5
+ 2 4
+ 2 1
+ 6
+ 9 . 3 7 5
+ 2 4 . 2 5
+ 2 1
+ 6
+ 4
+ 5 3 . 5
+ 4 5
+ 2 7
+ 6
+ 5 4 . 6 2 5
+ 4 5 . 2 5
+ 5 + 7 2
I t is evident that the values of lf obtained by summat ion will be on
the half-lines and the star t ing value is therefore needed for some half-line
value of the argument . T h e value of y for x = + 0.5, de termined by
direct substi tut ion, is — 24.375. T h e value of lf at the same point is
determined by formula (9). T h u s
- 24.375 = w | " / 1 / I + l ( + 3 ) j ,
and since we have w = 1, lf1/2 = — 2 4 . 5 . Inser t ing this value in the
table, it is now easy to extend the table as far as we please, in either
direction. F rom these values of lf the corresponding values of y may
be found by the formula. T h e y are given to the right on Table 8, with
their differences. If this table of y is interpolated to halves, the results
will be found to agree with the table in Section 2.
INTEGRATION 153
t I u cos u sin u x y
— 3 0 — 1 0 . 4 5 3 9 — 1 3 . 0 3 9 3 + 0 . 9 7 4 2 1 6 — 0 . 2 2 5 6 2 0 + 1 . 5 4 8 4 3 — 0 . 4 4 2 1 2
— 2 0 — 6 . 9 6 9 3 — 8 . 7 0 3 3 + 0 . 9 8 8 4 8 5 — 0 . 1 5 1 3 1 8 + 1 . 5 7 6 9 7 — 0 . 2 9 6 5 2
— 1 0 — 3 . 4 8 4 6 — 4 . 3 5 4 7 + 0 . 9 9 7 1 1 3 — 0 . 0 7 5 9 3 1 + 1 . 5 9 4 2 3 — 0 . 1 4 8 7 9
0 0 . 0 0 0 0 0 . 0 0 0 0 + 1 . 0 0 0 0 0 0 0 . 0 0 0 0 0 0 + 1 . 6 0 0 0 0 0 . 0 0 0 0 0
+ 1 0 + 3 . 4 8 4 6 + 4 . 3 5 4 7 + 0 . 9 9 7 1 1 3 + 0 . 0 7 5 9 3 1 + 1 . 5 9 4 2 3 + 0 . 1 4 8 7 9
+ 2 0 + 6 . 9 6 9 3 + 8 . 7 0 3 3 + 0 . 9 8 8 4 8 5 + 0 . 1 5 1 3 1 8 + 1 . 5 7 6 9 7 + 0 . 2 9 6 5 2
+ 3 0 + 1 0 . 4 5 3 9 + 1 3 . 0 3 9 3 + 0 . 9 7 4 2 1 6 + 0 . 2 2 5 6 2 0 + 1 . 5 4 8 4 3 + 0 . 4 4 2 1 2
T h i s first example differs from the integration of a planetary or
cometary or satellite orbit intwo impor tant respects. First , the results
are exact, no errors of rounding being present . Second, the values of
the function to be integrated could all be calculated in advance. In the
integration of orbits this cannot be done, for the function to be integrated
can be determined only after the integral is known, and it is necessary
to proceed from step to step by extrapolation, with successive approxim-
ations at each step. At any stage in the calculation the table of n / i s extra-
polated one step, by assuming some order of differences (usually about
the sixth) to be constant . T h e corresponding value of y is then calculated
and tested to see if it yields the extrapolated value of / ; if not, it is
adjusted. T o illustrate this process we integrate the orbit of a hypothetical
planet with negligible mass, moving under the attraction of the sun
alone from perihelion to aphelion. I t is known that the orbit is an ellipse.
Let its semi-major axis be 2 astronomical uni ts and its eccentricity 0.2.
Take the plane of the orbit as the reference plane, with the X axis directed
toward the perihelion. T h e equations of motion are
d2x __ L 2 x
where r2 = x2 + y2, and k is the Gaussian constant , 0.01720209895.
I t is known that these equations are t rue , with the stated value of
ky if Xy yy and r are measured in astronomical uni ts and t in ephemeris
days. I t has been found by experience that for an orbit of this size and
shape a tabular interval of 20 days will give the requisite degree of con-
vergence to the differences, bu t in this example an interval of 10 days
is used to save space in pr int ing the differences. T h u s we shall have
dt2 r 3 :
d2y ¥n y
dt2 r 3 '
T A B L E 9
STARTING VALUES
154 FINITE DIFFERENCES
w2 = 100 for the factor necessary in convert ing the second summat ions
of x and y into x and y. But it is somewhat inconvenient to mult iply
— x/rs and — y / r 3 by the small factor k2
y and afterward their second
summat ions by 100. I t is better to keep 11 fx and 11 fy of the same order
of magni tude as x and yy which is easily accomplished by taking for
the equat ions to be integrated,
& = (12)
where the factor w2k2 is 0.0295 9122 for a 10-day interval.
T h e origin of t ime being at our disposal, we take it to be the instant
of perihelion passage. T w o of the constants of integration are obtained
immediately from the dimensions and orientation of the orbit . W e have
xo = ro = a(\ — e) = 1-6 a.u. and y0 = 0. For the other two constants
we take zvkx0 and wky0\ wkx0 = 0 since x0 = 0, and wky0 =
wkamr-\\ — e2)112 = 0.1489745. T o start the integration it is convenient
to calculate seven consecutive values of fx and fy. T h e mean motion in
degrees per day is 57°2957795 ka~312 = 0°3484 64933, from which are
derived the values of the mean anomaly. T h e eccentric anomaly is
obtained by Kepler ' s equation, and x and y by
x = <z(cos u — e)y
y = <z(l — e2)112 sin u.
From these values of x and y we first find ry then 1/r3, then w2k2/rz
y
and finally fx and fy. F r o m these the first seven lines of Tables 10 and 11
are constructed, leaving llf and lf to be filled in later.
T h e start ing values uf0 and J / 1 / 2 are obtained by the formulas
(13)
(14)
remember ing that /JLS0 = ^ ( 8 _ ] / 2 + S 1 / 2 ) , with two similar equations for
y and yy after which the tables may be extended by inspection down
and to the left to t = 40.
T h e next step is to calculate fx and fy for t = 40. For this the values
of x and y are needed. Provisional values may be obtained by means
of formula (10) if the necessary values in the tables are extrapolated
by copying the value of / v on the two lines next below. T h e values
INTEGRATION 155
t "/. y . A J X fl
— 30 — 109729
— 3220
— 20 — 112949
— 1975
+ 1245
+ 63
— 10 — 114924
— 667
+ 1308
+ 26
— 37
— 15
0 + 1.6009638
— 57796
— 115591
+ 667
+ 1334
— 26
— 52
+ 15
10 + 1.5951842
— 172720
— 114924
+ 1975
+ 1308
— 63
— 37
— 7
20 + 1.5779122
— 285669
— 112949
+ 3220
+ 1245
— 107
— 44
+ 15
30 + 1.5493453
— 395398
— 109729
+ 4358
+ 1138
— 136
— 29
+ 8
40 + 1.5098055
— 500769
— 105371
+ 5360
+ 1002
— 157
— 21
+ 8
50 + 1.4597286
— 600780
100011
+ 6205
+ 845
— 170
— 13
+ 8
500 — 2.3933823
— 59327
+ 51314
+ 51
— 45
+ 1
+ 1
— 3
510 — 2.3993150
— 7962
+ 51365
+ 7
— 44
— 1
— 2
520 — 2.4001112
-f 43410
+ 51372
— 38
— 45
530 — 2.3957702
+ 94744
+ 51334
540 — 2.3862958 + 51253
of x and y obtained in this way are sufficiently accurate, bu t this will
not be t rue of the differences wri t ten in by inspection, which will have
to be revised after fx and fy become known, and it is more convenient
to use instead of (10) a formula that will give provisional values of x
and y in te rms of the diagonal differences already writ ten down. Such
a formula may be obtained by integrating Newton ' s backward inter-
polation formula twice. After a little manipulat ion the result is, wi th
an accuracy that will suffice for most practical applications,
*o = "/o + iV-i + S - 3 / 2 + 0.0791667 8% + 0.075 S ! 5 / 2
1 (15)
+ 0.07135 84_3 + 0.0682 S 5_ 7 / 2 + 0.065 8 ^ + 0.06 S7_9/2.
T A B L E 1 0
INTEGRATION FOR X
156 FINITE DIFFERENCES
T A B L E 1 1
INTEGRATION FOR y
t fv /!? A11 fl
— 3 0 + 3 1 3 3 1
1 0 0 9 3
— 2 0 + 2 1 2 3 8
1 0 5 1 2
— 4 1 9
+ 2 0 5
— 1 0 + 1 0 7 2 6
1 0 7 2 6
— 2 1 4
+ 2 1 4
+ 9
— 9
0 0 . 0 0 0 0 0 0 0
+ 1 4 8 8 8 4 8
0
1 0 7 2 6
0
+ 2 1 4
0
— 9
1 0 + 0 . 1 4 8 8 8 4 8
+ 1 4 7 8 1 2 2
— 1 0 7 2 6
1 0 5 1 2
+ 2 1 4
+ 2 0 5
— 9
— 1 5
2 0 + 0 . 2 9 6 6 9 7 0
+ 1 4 5 6 8 8 4
— 2 1 2 3 8
1 0 0 9 3
+ 4 1 9
+ 1 8 1
— 2 4
— 3
3 0 + 0 . 4 4 2 3 8 5 4
+ 1 4 2 5 5 5 3
— 3 1 3 3 1
9 4 9 3
+ 6 0 0
+ 1 5 4
— 2 7
— 1 1
4 0 + 0 . 5 8 4 9 4 0 7
+ 1 3 8 4 7 2 9
— 4 0 8 2 4
8 7 3 9
+ 7 5 4
+ 1 1 6
— 3 8
+ 4
5 0 + 0 . 7 2 3 4 1 3 6
+ 1 3 3 5 1 6 6
— 4 9 5 6 3
— 7 8 6 9
+ 8 7 0
+ 8 2
— 3 4
— 4
5 0 0 + 0 . 1 6 4 2 4 5 4
— 9 9 1 8 3 6
— 3 5 2 1
+ 2 1 2 8
— 7
+ 5
+ 3
— 4
5 1 0 + 0 . 0 6 5 0 6 1 8
— 9 9 3 2 2 9
— 1 3 9 3
+ 2 1 2 6
— 2
+ 4
— 1
5 2 0 — 0 . 0 3 4 2 6 1 1
— 9 9 2 4 9 6
+ 7 3 3
+ 2 1 2 8
+ 2
5 3 0 — 0 . 1 3 3 5 1 0 7
— 9 8 9 6 3 5
+ 2 8 6 1
5 4 0 — 0 . 2 3 2 4 7 4 2 + 4 9 9 3
T h e forward formula, used for integrat ing backward, may be obtained
from this simply by changing the signs of all the subscripts, and of the
coefficients of the odd-ordered te rms . T h e results of the application of
this formula are used to start a third table, which contains x, y> and
r 2 . If a table is available giving 1/r3 with a rgument r 2 nothing else need
be writ ten down; 1/r3 may be placed on the keyboard of a calculating
machine, multiplied by w2k2> the result transferred to the keyboard, and
multiplied first by — x and then by — y, giving fx and fy, which may
be entered in the integration tables. T h e computer will soon learn by
experience whether it is worthwhile to write down the value of 1/r3,
and to difference these quantit ies as it goes along, for a check. I t may
even be desirable to difference x and y if the computer is prone to make
INTEGRATION 157
- 0.0031 8 5 _ 5 / 2 - 0.003 8I3.
If the new values of x and y differ from the provisional ones by half
a un i t or more they should be corrected and the values of fx and fy
examined to ascertain whether they also need correction. If so, the
corrected values of fx and fy should be entered in the integration table,
the differences corrected, and formula (16) reapplied. In general the
tabular interval should be chosen so that the provisional values of x
and y may need to be corrected by a few units of the last decimal, and
small enough that the provisional values of fx and fy need be altered
bu t seldom.W h e n the final values of fx and fy have been obtained, the tables may
be extended to the left, giving llfx and ufy for t = 50, and the first
step of the integration is completed. T h e second step is like the first;
and the rules for any step can be reduced to the following:
1. Having a diagonal of the integration tables complete, find p ro -
visional values of x and y by formula (15).
2. Calculate fx and fy from the differential equations and write them
in the tables, filling in the differences above and to the right.
3. Tes t x and y by formula (16); correct them if necessary, and
re-examine fx and fy. T h e n complete the diagonal of the tables.
In the present example the tabular interval is so small that step 3
is hardly necessary. Th i s makes for convenience in the calculation of
a single step, bu t more steps than necessary are required and there is
no net saving in labor. Moreover , the accumulat ion of end-figure errors
is greater than necessary.
T h e complete tables are not pr inted, bu t only a few lines at the
beginning and end of the integration.
many mistakes. In any case the only check on the work as a whole is
given by the differences in the integration tables ; great care mus t be
taken to write t hem in correctly. Owing to the accumulat ion of errors
of rounding, the last decimal of x and y becomes uncertain after a few
steps of the integration, and it is usually permissible to cut it off.
Wi th fx and fy for t = 40 entered in the integration tables, the diagonal
differences above and to the right may be filled in, bu t before filling
in those to the left the provisional values of x and y mus t be tested
by the more accurate formula
(16)
158 FINITE DIFFERENCES
13. A c c u m u l a t i o n of e r r o r s in n u m e r i c a l i n t e g r a t i o n . In
general it is not possible to determine how much error is accumulated
in an integration, bu t it can be determined by the theory of errors what
probable error is to be expected after any n u m b e r of steps. For the
example of the preceding section, however, the theory of elliptic motion
makes feasible exact determinat ion of the error. T h e t rue coordinates
and velocities at aphelion are found to be x — — 2.4 a.u., y = 0,
x = 0, y = — 0.5773503 a.u. per ephemeris day. T h e theory also gives
the t ime of aphelion passage as t = 516.551259. Formulas (13) and (14)
may be used to obtain the coordinates and velocity components for
several dates, which are afterward interpolated to this t ime. T h e results
are
x = - 2.3999888, wkx = + 0.0000007, x = + 0.0000041,
y = _ 0.0000047, why = - 0.0993166, y = - 0.5773516.
I t is seen that the errors in x and y are greater than those in x and y.
Thi s might have been expected, because an error in / a t any step produces
an equal error in all the values of lf that follow, whereas the corresponding
error in 11 f increases by this amount at every step.
T h e general theory of the accumulation of errors in numerical
integration shows that after n steps the probable error of a double integral
is 0.1124 7z 3 / 2 (in units of the last decimal), which means that in an
extended n u m b e r of examples about half the errors should be larger
than this, and half smaller. In the present example n = 52, which gives
42 for the probable error of x and y. Both of the actual errors are larger
than the probable error, and this is not to be wondered at, partly because
the general theory applies only when the integration is carried through
a number of revolutions of the planet, and partly because two examples
are not sufficient for a test of any theory of errors.
Use of the above expression for the probable error will enable the
programmer to decide before starting an integration how many decimals
must be carried th roughout the work. Suppose for example that an
asteroid is to be integrated for 10 years with a 10-day interval, that
is, for 365 steps, or 183 each way if the integration is commenced in
the middle of the arc. T h e probable error after 183 steps is about 280.
Suppose further that a probable error of 0.1 second or arc, or 0.0000005
radian, is the largest that can be tolerated. Compar ing this with 280 shows
that nine decimals are necessary in the calculations.
Th i s test should always be applied before starting any integration.
Otherwise the programmer will either have intolerable errors in his
work or he will expend much useless labor manipula t ing superfluous
decimals.
SYMBOLIC OPERATORS 159
T h e general theory of the accumulat ion of errors has an impor tan t
application to the errors of the osculating Kepler ian elements of an
orbit obtained by numerical integration. I t has been shown that whereas
the mean error of the mean orbital longitude is proportional to the three-
halves power of the number of steps, the mean errors of the other five
e lements are proport ional to the square root of the n u m b e r of steps.
Th i s is t rue , whether the elements are given directly by the integration
or whether they are obtained afterwards by t ransforming the rectan-
gular coordinates and velocity components .
14. S y m b o l i c o p e r a t o r s . All of the formulas given so far in
this chapter , as well as many others, may be derived in a very elegant
and simple manner by means of symbolic operators , which we discuss
in this section. First , however, we make a few remarks about inter-
polation from a more general s tandpoint than before.
Let the values of any function f(x) be given for n discrete values of
OC • 0C~^ y OC 2) • • • j OC* y
which need not be in ari thmetical progression bu t may
be any values of x whatever. Consider the polynomial
(x — X2) (x — ff3) ... (x — Xn) ^ ^ (x — X}) (x — ff3) ... (x — xn)
(xn x±) (xn x2)... (xn xn-i)
Evidently, cp(x) = / ( # i ) for x = xl9 since all the coefficients vanish
except the one factored by / f a ) , which coefficient becomes uni ty.
Similarly, <p{x) = f(x2) for x = x2, etc. T h e polynomial cp(x) is called
Lagrange 's interpolation formula; it is a generalized form of the formula
previously denoted by the same name.
T h e quot ient
/ f a + i )
<P(X)
fa - x2) fa - * 3 ) . . . (x1 - x n ) J (x2 - xj fa - x3)... fa - xn)
(X - X±) (X - X2) ... (X — ^ n -x) rt \
+ - + (x.. - x,\ (x- - x.) ... (x- - x—*YKn)'
; / f a )
xk+l Xk
is called the first divided difference of / f a ; we denote it for brevity
by the symbolic notation [k, k + 1]. T h e second divided difference is
denoted [k, k + 1, k + 2] and is given by
[*,* + ! , * + 2] =
[* + ! , * + 2 ] - [ * , * + !]
xk+2 xk
Similarly the pth divided difference is given by
[k,k + 1 , k + p]
[* + 1, A + 2 , k + p] - [k, k + 1 , k + p - l ]
xk+p xk
160 FINITE DIFFERENCES
[ X >
1 ] = ^ - *
/ = / i + [*.
whence
Similarly,
whence
[x, 1] = [1, 2 ] + [*, 1,2] ( * - * , ) ,
and in general,
/(*) = A + (* ~ xi) [h 2] + (x - * x ) <* - * 2 ) [1, 2, 3] + ...
+ (x — xj (x — x2) (x — xn^) [1, 2 , w ] +R(x)9 (17)
where
JR(^) = (# — ^ ) (# — x2) ••• (^ — #n) [x9 1, 2, »].
Lagrange 's interpolation polynomial has the same values as f(x) for
and therefore has the same divided differences at
these points. Now cp(x) is a polynomial of the (n— l) th degree, and
the wth divided difference of such a polynomial is zero, as may be shown
by induction. Hence the same is t rue of the nth divided difference of
f(x)y and
/(*) = cp(x) + R(x).
Since f(x) = cp(x) for x — xl9 x2, xny then at these values of x9
R(x) = 0. Hence the (n— l) th derivative of R(x) is zero for at least
one value of x in the interval xx to xn.
Now, differentiating n — 1 t imes,
f(n-i) W = („ _ 1)! [1, 2, n] + R<»-u (x).
If | is the value of x for which i ? ( n _ 1 ) ( ^ ) = 0, then
/ ( n - l ) ( £ ) = ( „ _ ! ) ! [ i , 2 , . . . , » ] ,
whence
[1,2 it] =
W e may extend the meaning of divided differences to include the
case where x is not one of the discretevalues xl9 x2> xn. W e may
then write
L*' 1 J ~ * , - X '
or, with yet more brevity,
fx-f
S Y M B O L I C O P E R A T O R S 161
and
[*, 1, 2, n] = f
{n)(£)
nl
For any value of x there will be some value of £ for which this equation
holds.
W e may now write for R(x)y
f{n)(i)
R(x) = —^r (x — xi) (x - x2)... ( * - * n ) >
and this equation shows that if the values xly x2, xn are in ari thmetical
progression, R(x) has a m i n i m u m value when x lies in the middle of
the interval x1 to xn. Fu r the rmore , for any values of xv x2> xny R(x)
is smaller if x is within the interval xx to xn than if it is outside.
If we confine ourselves to represent ing the function f(x) over the
interval xx to xn by a polynomial of the (n — l ) th degree, then we use
formula (17) dropping the final t e rm R(x), and we have Newton ' s
interpolation formula.
If the function is tabulated for values of x tha t are in ari thmetical
progression, which is the case commonly found in astronomy, then there
are three conventional notations for the differences, which are shown
in Tab le 12.
T A B L E 12
NOTATIONS FOR DIFFERENCES
Central
differences
Forward
differences
Backward
differences
/-
S - 5 / 2 - 3
A2
- 3
/ - 2 ^ x
8 - 3 / 2
8 3 ,
-3/2
A
- 2 - 8 F-x
f-x
8 2
- i f-x ^ 2 f-x
8 - l / 2
8 3 ,
-1/2
-X
A3
- 2 P . K
/ » K /. A2
- i fo K
\„ h3,
1/2
J
o
J 3
—1 I 7 ! K
fx
\„
K
h3,
1/2
fx 0 fx ' 2
S 3 /2
8\
3/2
A
i
J 3
o ^ 2 K 3
/. 8 2
2
8\
3/2
/. J 2
i ^ 3
S 5/2 2
A3
i I 7 , K
K
f3 A2
2
/ a K
S 7/2
A
3
/ < /.
162 FINITE DIFFERENCES
For the central differences the subscripts are equal along a horizontal
line, for the forward differences along the forward diagonal, and. for
the backward differences along the backward diagonal. In previous
sections we have used only the notation for central differences. W e have,
for example,
and, as earlier in this chapter,
§1/2 — A ~~/o>
8 0
= S l / 2 ~ 8 - 1 / 2 = f l - 2fo +/-!>
/ x 8 0 = i ( 8 l / 2 + 8 - l / 2 ) -
If as before we denote the interval of the a rgument by w, n consecutive
tabular a rguments by and any n u m b e r within the range
x1 to xn by x, then the general interpolat ing factor 9 is defined by
x = xx + 9w,
where 9 is not necessarily between 0 and 1. Also,
x — x1 = 9wy
x — x2 = (9 — I) w, etc.,
and, using our symbolic notation,
[ U ] = ( / 8 - / ^ = ^ ,
[1,2, 3] =A\\2w\
[1, 2, 3, 4] = J s /3 ! w3, etc.
Subst i tu t ing these expressions into (17), omit t ing the remainder R,
f(Xl + 6w) =f(Xl) 4
(18)
which is known as Gregory 's formula. Except for notation, it is the same
as Newton ' s formula of Section 6 .
I t mus t be remembered that As
y for example, does not mean the cube
of A bu t instead the thi rd forward difference of / f a ) . I t is only in
consequence of the particular symbolic notat ion we have chosen that
(19) may be derived from (18) by treat ing (1 + A)6 as if (1 + A) were
being raised to the power 9 . T h e symbol A as used here is said to be
an operator.
We now introduce the new operator Ey such that E operat ing on f(x)
gives f(x + w). Symbolically,
Ef(x) = f(x + w).
Also, we have the operators
Af(x)=f(x + w)-f(x),
§/(*) =f(x + \ w ) - f ( x - \w\
/ * / ( * ) = £ [ / ( * + i « ) + / ( * - i « 0 ] -
Now since
we have
Ef(x) = / (* + «,) = (! +A)f(x),
E=l+A (20)
Also, if all derivatives off(x) exist, we may write by Taylor ' s theorem
/ ( * + W ) = / ( * ) + £ ^/<«>(x). (21)
Since e, the base of natural logari thms, raised to any power, say y,
is given by
^ = i + y + ^ + f r + - .
Eq. (21) may be writ ten symbolically as
f(x + zv)= e™Df(x). (22)
SYMBOLIC OPERATORS 163
Symbolically we may write (18) as
f(Xl + 0tv) = {l (19)
since if (1 + A)6 is expanded by the binomial theorem the result is
164 FINITE DIFFERENCES
I t is t rue that in general we do not know whether the first n derivatives
of f(x) exist since we have only the tabulated values to refer to, bu t
for purposes of interpolation we rely on the representation of f(x) by
a polynomial, the derivatives of which are known to exist.
W e may now write
Ef(x) =f(x + zv) = (l +A)f(x) = e"*>fix\
whence, symbolically,
E= 1 + A = ewD. (23)
For backward differences we have
TO = / ( * ) - / ( * - » ) •
Operat ing on both sides with E,
EVf(x) =/(* + w) -/(*) = Af{x\
whence
EP =A = E - l , (24)
or
E = rh?- (25)
N o w we can express f(x± + 8w) in te rms of the backward differences as
f(Xl + dw) = (i - ?)-7(*i)
0(6 + 1) (6+ 2)... (e + n-2)_n_,
(26)
(» - 1)!
which is known as Gregory 's backward formula.
Fur the r we have
»/(*) =/(* + iw)-/(*-i»)
= (i + j ) i / 2 / ( * ) _ ( i +j)-i/vw,
whence
S = ( 1 + J ) i / 2 _ ( i + J ) - i / « .
F rom this we find
= 1 + ^
E = I +A = (27)
= 2 Ak 8 */(*,) = 2 Bk | .8*/(*i), (29)
in which S 0 / ^ ) = f(x^) and /x8°/(#i) = Hf(xi)i a n d -4 and denote the
coefficients in the expansion of the lines above. W e write
S Ak 8 = 2 A'k 8* / (* i ) + ^ K » V ( * i ) ,
in which ^4' and fi' correspond to odd values of k, and ^4" and B"
to even values. Now, since
[|-8 + (1 + ^ S 2 ) 1 / 2 ] - 2 0 - [ | 8 - (1 + i S 2 ) 1 / 2 ] 2 0 = [— i S + (1 + i S 2 ) 1 ' 2 ] 2 0 ,
it follows that A' and B' are odd functions of 0 and ^4" and S " even
functions of 0 , and that
i:̂ 8V(*1)=̂ B;M8V(*,),
^^'8*/(*I)=Zfi;>8V(*i).
Hence we may also write
+ to) = 8*/(*i) + ZB't'ii&ftxJ
= ZB'k M s */(*!) + £ ^ ' s*/(*i). ( 3 0 )
If in the first of (30) we pu t xx -f- for xlt we have
/(*! + \w + 6w)=SA'k 8*/(*i + \w) + 2B'k'fi8
kf(x1 + \w),
which is Bessel 's formula of Section 5. T h e second of (30) is known
as the Newton-Cotes formula. In Bessel's formula we may change
8/(*i + iw) to / ( * , ) — / ( * , ) and Ssf(Xl + \w) to S 2 / (* 2 ) — 8*f(Xl), etc.,
obtaining Everet t ' s formula of Section 5.
S Y M B O L I C O P E R A T O R S 165
and, similarly,
,t = (1 + | 8 2 ) 1 / 2 - (28)
T h e n
/(*! + da) = E»/(Xl)
= [ i s + ( i + i W « » / ( * i )
166 FINITE DIFFERENCES
T h e formulas for numerical differentiation may be obtained from the
equation
1 + J = ewD = [i-S + (1 + i -8 2 ) 1 ' 2 ] 2 ,
which can be used to express D and its powers in terms of the diagonal
differences A or V or the central differences S. For example,
A2 AZ An
WD = In (1 + A) = A - y + y - ... + ( ~ l )*" 1 - ,
whence
Formulas in te rms of central differences may be obtained using
wD = 2 In [|-S + (1 + i S 2 ) 1 ' 2 ] = j ?C f c 8*,
:27G f r Sfc.
or
Formulas for numerical integration may be obtained with equal
facility. Symbolically we may write D~1f(x) for a function of which the
derivative is f(x), so that
S - V ( * ) = Sf(*)dx,
D~2f(x) = $$f(x) dx\ etc.
Also, we may write for a function whose first differences are
f(x). T h e n t
N O T E S A N D R E F E R E N C E S
A g o o d b o o k o n n u m e r i c a l ca lcu lat ion is A n d r e w D . B o o t h , Numerical Methods
( 2 n d ed . ) A c a d e m i c Pres s , N e w York , 1957. In it are p r o v e d m o s t of the f o r m u l a s
that are g i v e n h e r e , as w e l l as m a n y o t h e r s , and also g i v e n are re ferences to t h e
l i terature sufficient to e n a b l e t h e reader to p u r s u e any b r a n c h of t h e subjec t as
far as h e w i s h e s .
A n e x h a u s t i v e treatise o n the ca lcu lus of finite d i f ferences is H e n r i M i n e u r ,
Techniques de Calcul Numerique, Librair ie P o l y t e c h n i q u e C h . Beranger , Paris , 1952 .
A n e x c e l l e n t co l l ec t ion of aids to in terpo la t ion , w i t h p r e c e p t s for the ir u s e , is
Interpolation and Allied Tables, prepared b y H . M . N a u t i c a l A l m a n a c Office,
H . M . S ta t ionery Office, L o n d o n , 1956. It a lso conta ins use fu l s e c t i o n s o n n u m e -
rical di f ferent iat ion a n din tegrat ion , differential e q u a t i o n s , and prec i s ion of
ca lculat ion . A c o m p a n i o n book le t o n Subtabulation w a s i s s u e d in 1958 .
CHAPTER V
NUMERICAL INTEGRATION OF ORBITS
1. I n t r o d u c t i o n . T h e method of numerical integration is the most
powerful known in celestial mechanics for calculating the motion of any
body in the solar system for a few revolutions a round its pr imary, with
all the precision demanded by modern observations. For obtaining the
orbit for many revolutions experience indicates that analytic methods
are likely to be more efficient, except for orbits of large eccentricity
where analytic methods become progressively more difficult. T h u s ,
numerical methods have been used for most comets and for many minor
planets, while analytic methods have been applied to the eight principal
planets, to the moon, and to most other satellites and a n u m b e r of
minor planets as well. Whe the r this condition will long persist cannot
be foretold. T h e recent advances in comput ing with punched cards and
the experiments with electronic techniques now in progress will certainly
render both numerical and analytic methods much more efficient than
they have been in the past. It cannot be known as yet whether either
method will gain at the expense of the other, bu t it is certain that the
practical celestial mechanician will always profit by the use of a judicious
combination of numerical and analytic methods .
In this chapter are discussed in detail the two most common methods
of numerical integration of orbits, Cowell 's and Encke 's . These methods
owe their populari ty partly to the widespread availability of calculating
machines and partly to the availability of the rectangular coordinates
of the seven largest planets up to 1980, which have been publ ished in
a convenient arrangement with auxiliary tables by the British Nautical
Almanac Office; without these two aids these methods would probably
be less efficient than others.
In Cowell 's method no explicit use is made of a conic section as the
first approximation to the orbit . T h e equations of motion in rectangular
coordinates are integrated directly, giving the rectangular coordinates
of the dis turbed body. T h e process is analogous to that used in
Chapter IV, Section 12, with the difference that three coordinates are
needed instead of two, and at each step the attractions of the d is turbing
planets are added to that of the sun. T h e origin is usually taken at the
pr imary but this restriction is not necessary, and the center of mass
167
168 NUMERICAL INTEGRATION
of the system or of any dis turbing body may be used. T h e only restriction
is that the motions of all bodies exerting appreciable effects are supposed
to be already known relative to the chosen origin. Since no use is made
of the conic section as a first approximation the method could be applied
in systems not dominated by a single mass, such as the motion of a
satellite of a binary star. T h e only practical disadvantage of the me thod
is that the integrals contain many significant figures and change rapidly
with the t ime. In consequence the integration tables are slowly con-
vergent, which compels the use of a small tabular interval.
In Encke 's method the coordinates are not obtained directly, bu t
instead the integration gives the difference between the actual coor-
dinates and the coordinates in the osculating orbit, that is, the position
the body would have if it cont inued to move in the conic section
corresponding to the coordinates and velocity components at a particular
instant called the epoch of osculation. T h e departures from the osculating
orbit are called per turbat ions . T h e y vanish at the epoch of osculation.
T h e advantage of this method is that for dates near the epoch the
perturbat ions are small ; they can be expressed by a few significant
figures, which permits a larger tabular interval than with Cowell 's
method. T h e disadvantage is that in the course of t ime the perturbat ions
increase to a large size, which requires occasional rectification of the
orbit. T h e coordinates and velocities are determined at a new epoch
and the integration recommenced. I t seems that the difficulty might be
avoided by taking for the first approximation a conic section that
approximates the actual motion over a longer interval of t ime than does
the osculating orbit , bu t ordinarily this is not practicable for lack of
sufficient information.
A method much used dur ing the nineteenth century, and still of value,
is that of the variation of elements. In this method the quantities obtained
by the integration are the six osculating elements. T h e y change
comparatively slowly, which means that a rather large tabular interval
can be used, bu t the differential equations are more complicated in form
than the equations in rectangular coordinates.
Probably the method most elegant in conception is that of Hansen.
He starts with the osculating ellipse as Encke does, bu t instead of
perturbat ions of the rectangular coordinates he integrates the per turba-
tions of three other quantit ies. T h e fundamental one, the per turbat ion
of the mean anomaly, is given by a double integral. T h i s is the quanti ty
to be added to the mean anomaly in the ellipse osculating at the
fundamental epoch, the result being used with the other osculating
elements to calculate the longitude, latitude, and radius vector. T h e
longitude thus obtained is the actual longitude, bu t the radius vector
and the latitude require small corrections, obtained by integrating two
COWELL'S METHOD 169
equat ions of the first order. T h u s it appears that the complete solution
involves only four constants of integration, bu t this is not strictly t rue .
T h e complete solution involves three other variables, which are closely
related to the per turbat ions of the inclination and node of the osculating
ellipse bu t which are m u c h smaller in magni tude , becoming appreciable
only after many revolutions of the dis turbed body. T h e advantage of
the me thod is that it is usually possible to neglect these last three
quanti t ies (but if not, they may be obtained very simply) and that the
other three per turbat ions are smaller in magni tude than in any other
known method . T h e disadvantage is that several quanti t ies are needed
that have not been extensively tabulated (as have the rectangular coor-
dinates of the d is turbing planets), which requires several rather laborious
transformations. N o extensive application of the method has ever been
made, and it is likely that with special tables to facilitate the t rans-
formations it might be very efficient.
2 . E q u a t i o n s f o r C o w c l l ' s m e t h o d . T h e equat ions of mot ion of
two point-masses ma and mb unde r their mutua l action have been given
in Section 3 of Chapter I. T h e f - componen t s are given by
In t roduce additional point-masses mv m 2 , m3> ... into the system, and
denote any one of these by rrij. Evidently the at tract ions of mj on ma
and mb are given by equations similar to (1), and the total accelerations
of ma and mb be obtained by s u m m i n g all these attractions, giving
™a£a = k2mamb
b ^ a , mh£b = k2mbma (1)
k2mbma
k2mamb
(3)
(2)
with similar equations for rj and £, and
Pla = (L - W + (va - v,Y + (£« - Q\
Let the origin of coordinates be taken at ma, which is equivalent to
the linear t ransformation
with similar equat ions for rj and £, and
r2 = (L ~ to)2 + (Va ~ V,)2 + (£a - £&)
2.
£b f a —
x> £j f a —
xj>
170 NUMERICAL INTEGRATION
T h i s and similar equations for y and z are the fundamental equations
in CoweH's method. If ma represents the sun then it is evident that
xy yy z are the heliocentric coordinates of the body whose motion is
sought, and xjy yjy zj are the heliocentric coordinates of any otherbodies
acting on m. F rom the way in which the equations have been derived
it is seen that in (5) the first t e rm represents the action of the sun on my
the first t e rm in the parentheses represents the action of m3- on my and
the second represents the action of ntj on the sun.
T h e equations may be used equally well for satellite motion by taking
the origin at the pr imary and one of the ntj for the sun.
If the body whose motion is to be determined is an asteroid or comet,
m is pu t equal to zero. In this case the m3 are all small compared with
uni ty and the xjy yjy z3- may usually be regarded as completely known,
which permits the solution to be obtained by successive approximations.
At each step of the integration approximate coordinates of the body
are obtained by extrapolation, and these coordinates are used to calculate
w2xy w2yy w2z by a process similar to that used in the preceding chapter
except that the port ions contr ibuted by the dis turbing planets mus t be
calculated separately for each planet and added to the port ion contr ibuted
by the sun. W h e n the integration extends over a very few revolutions
the body will not depart very far from the orbit given by the ellipse
at the epoch of osculation, and unless the body approaches very close
to a major planet the planetary attractions may be calculated in advance,
from which follows
£ j £b ~ X3 X>
and put
r) = x2 + y2 + #2,
p2 = (Xj - x)2 + {y3 - y)2 + (zj - zf.
Divide Eq. (2) by ma and Eq . (3) by mb, and subtract the first from
the second. T h e result is the equation of motion of mb relative to ma,
(4) x = — k2 (ma + mb)
with similar equat ions for y and z.
Suppose that ma is taken as the uni t of mass, and that all other masses
are measured in this unit . T h e n we may pu t ma = 1 and drop the
subscript from mby and Eq. (4) may be wri t ten
(5) x = — k\\ + m)
COWELL'S METHOD 171
using the position of the body in the osculating orbit instead of in its
actual orbit . In this way much labor may be saved. W h e n the integration
extends over many revolutions, however, such an approximation is not
permissible; in such cases it is sometimes possible to use the results
of a previous integration in calculating the planetary attractions.
3* N u m e r i c a l a p p l i c a t i o n of C o w e l l ' s m e t h o d . T o illustrate the
numerical processes described in the preceding section, we set up an
integration scheme for the asteroid Ceres. T h e elements referred to the
ecliptic and mean equinox 1950.0, which are chosen for the basis of
the work, are
Epoch of osculation,
1941 January 6.0 E.T. = J.D. 243 0000.5,
Mean anomaly,
/ = 75°46 ,11794 = 75?76998,
Arc from the node on the ecliptic to the perihelion,
OJ = 71°4 ,5. , ,06,
Longi tude of node on the ecliptic,
ft = 80048'5o:'7i,
Inclination to the ecliptic,
/ = 10o35'49."00,
Eccentricity,
e = 0.0794 2668,
Semi-major axis,
a = 2.7672 3786 a.u.
F rom these are obtained
sin / = + 0.1838 9893, sin ft = + 0.9871 7554,
cos / = + 0.9829 4516, cos ft = + 0.1596 3850,
sin g o = +0 .9459 0471, cos cp = 0.9968 4071,
cos = + 0.3244 4457,
and from the obliquity of the ecliptic at 1950.0,
sin e = + 0.3978 8118, cos e = + 0.9174 3695,
172 NUMERICAL INTEGRATION
Ax = - 2.3965 7958, Ay = + 0.9984 2280, A2 = + 0.9576 8656,
Bx = - 1.2849 7379, By = - 2.2997 8838, 5 , = - 0.8179 9285.
Check: AXBX + AyBy + J 2 5 , = + 0.0000 0003.
In deriving the mean motion from the semi-major axis by Kepler ' s
th i rd law we add the mass of Mercury to that of the sun, for a reason
to be described in Section 7, which gives the mass of the sun -f- Mercury
= 1.0000 00167, whence for a 10-day interval wk = 0.1720 2101 82,
and therefore the mean motion is 0.2141 0874 66 degrees per day. Next
are calculated the coordinates and velocity components for the osculating
date, to eight decimal places.
/ = 75°46 , l i : , 94 = 1^3224 35683, u = 114007 16333,
sin u = + 0.9855 71232, cos u = + 0.1692 61179,
x = - 1.4817 2875, r 2 = 7.4530 9385, wkx = + 0.0812 3006,
y = - 2.1769 1244, r = 2.7300 3550, wky = - 0.0520 1752,
z = - 0.7201 5692, —T= = 0.0378 78256, wkz = - 0.0409 9650.
r\a
T h e most convenient way to start the integration is to calculate from
the osculating elements the coordinates and attractions for three dates
on each side of the epoch. These may require slight correction later
when the planetary at tractions are taken into account (Table 1).
T A B L E 1
STARTING DATA
Julian Date / u sin u cos u
242 9970.5 69.°34672 73P71495 + 0.959 8785 + 0.280 4162
242 9980.5 71.48781 75.90155 + 0.969 8785 + 0.243 5888
242 9990.5 73.62889 78.08160 + 0.978 4427 + 0.206 5184
243 0000.5 75.76998 80.25513 + 0.985 5712 + 0.169 2612
243 0010.5 77.91107 82.42214 + 0.991 2665 + 0.131 8732
243 0020.5 80.05215 84.58264 + 0.995 5334 + 0.094 4100
243 0030.5 82.19324 86.73667 + 0.998 3785 + 0.056 9250
whence
COWELL'S METHOD 173
— w2k2jrz
1 . 7 1 5 1 0 6
• 1 . 6 3 9 6 9 6
• 1 . 5 6 1 8 5 9
• 1 . 4 8 1 7 2 9
• 1 . 3 9 9 4 4 4
• 1 . 3 1 5 1 4 3
1 . 2 2 8 9 6 3
- 2 . 0 0 6 8 4 5
- 2 . 0 6 6 6 1 2
- 2 . 1 2 3 3 2 0
- 2 . 1 7 6 9 1 2
- 2 . 2 2 7 3 3 9
- 2 . 2 7 4 5 5 6
- 2 . 3 1 8 5 2 5
- 0 . 5 9 2 6 8 9
- 0 . 6 3 6 1 3 8
- 0 . 6 7 8 6 4 5
- 0 . 7 2 0 1 5 7
- 0 . 7 6 0 6 2 2
- 0 . 7 9 9 9 9 0
- 0 . 8 3 8 2 1 6
7 . 3 2 0 2 9 6
7 . 3 6 4 1 6 0
7 . 4 0 8 4 5 0
7 . 4 5 3 0 9 3
7 . 4 9 8 0 2 8
7 . 5 4 3 1 9 0
7 . 5 8 8 5 1 4
- 0 . 0 0 1 4
- 0 . 0 0 1 4
- 0 . 0 0 1 4
- 0 . 0 0 1 4
- 0 . 0 0 1 4
- 0 . 0 0 1 4
- 0 . 0 0 1 4
9 4 0 6 7
8 0 7 3 8
6 7 4 7 9
5 4 3 1 4
4 1 2 6 0
2 8 3 3 7
1 5 5 5 9
Since eight decimals are to be used in the integration, the last co lumn
is needed to nine so tha t the eighth will be accurate after mult iplying
by the coordinates. Next , the planetary attractions are calculated for
the same seven dates, with an extra decimal so that after being added
to the solar at traction the sum may be accurately rounded to eight
decimals. In Tab le 2, the results are given for each planet, with their
sum, the three quanti t ies for each date being the x, y, and z components .
T A B L E 2
PLANETARY ATTRACTIONS ON CERES
Julian Date Venus Earth Mars Jupiter Saturn S u m
+ 1 4 . 6 — 2 . 1 + 0 . 4 — 4 9 . 0 — 3 . 1 — 3 9 . 2
2 4 2 9 9 7 0 . 5 + 0 . 7 — 7 . 7 + 0 . 5 — 4 0 . 6 — 2 . 4 — 4 9 . 5
— 0 . 7 — 3 . 4 + 0 . 2 — 1 7 . 6 — 0 . 9 — 2 2 . 4
+ 1 4 . 3 — 0 . 5 + 0 . 4 — 4 8 . 6 — 3 . 1 — 3 7 . 5
2 4 2 9 9 8 0 . 5 + 4 . 4 — 7 . 9 + 0 . 6 — 4 1 . 1 — 2 . 4 — 4 6 . 4
+ 1 . 0 — 3 . 5 + 0 . 2 — 1 7 . 8 — 0 . 9 — 2 1 . 0
+ 1 3 . 0 + 1.1 + 0 . 4 — 4 8 . 1 — 3 . 1 — 3 6 . 7
2 4 2 9 9 9 0 . 5 + 8 . 0 — 7 . 9 + 0 . 6 — 4 1 . 6 — 2 . 4 — 4 3 . 3
+ 2 . 6 — 3 . 5 + 0 . 2 — 1 7 . 9 — 0 . 9 — 1 9 . 5
+ 1 0 . 7 + 2 . 7 + 0 . 4 — 4 7 . 6 — 3 . 1 — 3 6 . 9
2 4 3 0 0 0 0 . 5 + 1 0 . 8 — 7 . 5 + 0 . 7 — 4 2 . 1 — 2 . 3 — 4 0 . 4
+ 4 . 0 — 3 . 3 + 0 . 2 — 1 8 . 0 — 0 . 9 — 1 8 . 0
+ 7 . 7 + 4 . 2 + 0 . 4 — 4 7 . 2 — 3 . 1 — 3 8 . 0
2 4 3 0 0 1 0 . 5 + 1 2 . 7 — 7 . 0 + 0 . 7 — 4 2 . 5 — 2 . 3 — 3 8 . 4
+ 5 . 1 — 3 . 1 + 0 . 2 — 1 8 . 1 — 0 . 9 — 1 6 . 8
+ 4 . 1 + 5 . 6 + 0 . 4 — 4 6 . 7 — 3 . 1 — 3 9 . 7
2 4 3 0 0 2 0 . 5 + 1 3 . 7 — 6 . 1 + 0 . 8 — 4 3 . 0 — 2 . 3 — 3 6 . 9
+ 5 . 8 — 2 . 7 + 0 . 3 — 1 8 . 2 — 0 . 9 — 1 5 . 7
+ 0 . 3 + 6 . 8 + 0 . 4 — 4 6 . 2 — 3 . 2 — 4 1 . 9
2 4 3 0 0 3 0 . 5 + 1 3 . 7 — 5 . 1 + 0 . 8 — 4 3 . 5 — 2 . 3 — 3 6 . 4
+ 6 . 1 — 2 . 2 + 0 . 3 — 1 8 . 3 — 0 . 9 — 1 5 . 0
x y z r2
174 NUMERICAL INTEGRATION
These data together with the starting values calculated by the formulas
of Chapter IV give the first seven lines of the Integrat ion Tables 3 ,4 ,
and 5. Only enough figures of the Jul ian Date are given to suffice for
identification.
T A B L E 3
INTEGRATION FOR X
Date y fx /
997 + 256209
— 13450
998 + 242759
— 13596— 146
+ 32
999 + 229163
— 13710
— 114
+ 29
— 3
000 — 1.48190830
+ 8229586
+ 215453
— 13795
— 85
+ 29
0
001 — 1.39961244
+ 8431244
+ 201658
— 13851
— 56
+ 25
— 4
002 — 1.31530000
+ 8619051
+ 187807
— 13882
— 31
003 — 1.22910949 + 173925
T A B L E 4
INTEGRATION FOR y
Date y fy z 1 /" / 1 V
997 + 299787
+ 6178
998 + 305965
+ 5584
— 594
+ 12
999 + 311549
+ 5002
— 582
+ 8
— 4
000 — 2.17717626
— 5043084
+ 316551
+ 4428
— 574
+ 13
4- 5
001 — 2.22760710
— 4722105
+ 320979
+ 3867
— 561
+ 12
— 1
002 — 2.27482815
— 4397259
+ 324846
+ 3318
— 549
003 — 2.31880074 + 328164
COWELL'S METHOD 175
T A B L E 5
INTEGRATION FOR Z
Date y /. / /" r
9 9 7 + 8 8 5 2 9
+ 5 6 4 5
9 9 8 + 9 4 1 7 4
+ 5 3 9 6
— 2 4 9
— 2
9 9 9 + 9 9 5 7 0
+ 5 1 4 5
— 2 5 1
0
0 0 0 — 0 . 7 2 0 2 4 4 1 9
— 4 0 4 6 8 7 4
+ 1 0 4 7 1 5
+ 4 8 9 4
— 2 5 1
— 2
0 0 1 — 0 . 7 6 0 7 1 2 9 3
— 3 9 3 7 2 6 5
+ 1 0 9 6 0 9
+ 4 6 4 1
— 2 5 3
+ 1
0 0 2 — 0 . 8 0 0 0 8 5 5 8
— 3 8 2 3 0 1 5
+ 1 1 4 2 5 0
+ 4 3 8 9
— 2 5 2
0 0 3 — 0 . 8 3 8 3 1 5 7 3 + 1 1 8 6 3 9
= + 0.0822 9586,
and similarly for y and z.
T h e next step is to recalculate fx,fyyfz for dates 001 , 002, 003, using
values of x9 y9 z obtained from the integration tables. I t is found that
the provisional values of x9 y9 z require small corrections, bu t that the
values of / are not changed. T h e integration now proceeds as in the
example of the last chapter . For purposes of the integration the values
of x9 y, z obtained at each step need to be wri t ten to only six decimals,
bu t if the integration orbit is afterward to be compared with observation
it will be desirable to keep seven decimals, and to make the small
corrections needed to the extrapolated values at each s tep ; the values
of x9 y, z will then be definitive, and may be used for comparison with
observations.
Fo r the x coordinate
! / i / 2 = wk*Q + ^
176 N U M E R I C A L I N T E G R A T I O N
4 . E q u a t i o n s f o r E n c k e ' s m e t h o d . Let xQy yQy z0 be the helio-
centric rectangular coordinates of a point-mass m moving under the
attraction of the sun alone. I t is known that the orbit is given by the
equations
Xo=-k%l+m)%, y0 = -k*(l+m)%, z0=-k%l+m)^, (6)
ro ro ro
with
rl = 4+yl + *o-
Let £, 77, £ represent the increments of xQy yQy z0 produced by the
attractions of the planets. T h e n the actual coordinates x, yy z of m at
any t ime are
x = x0 + f, y = y0 + 77, z = z0 + £,
and the actual equations of motion are Eq. (5),
x = - k2(l + m) (7)
and similar equations for y and z.
Subtract ing Eq. (6) from (7) gives
(8) x — x0 = £ = k2(l + m)
with similar equations for ij and £.
T h e per turbat ions £, 17, J might be determined by direct integration
of Eq. (8). T h e term x0/rl might be calculated for every step of the
integration in advance by the laws of elliptic motion, the term x\rz
determined at each step by extrapolating £ and adding it to x0 to give xy
etc., bu t this way of proceeding would not be convenient in practice.
Since £ is a small quanti ty, # 0 /
r o * s nearly equal to x/r*y and these two
terms would have to be calculated to many more significant figures than
are needed in their difference. Encke was therefore led to seek a t rans-
formation that avoids this difficulty.
Trea t ing only the equation for £, since those for ij and £ are exactly
similar, we have
But
whence
If we put
we have
ENCKE'S METHOD 177
1 + 2 ? ,
(1 + 2q)-™,
If TJ, I are so small in comparison with x0, y0, z0 that their squares
may be neglected, then we may write
g = « r f + j w + « b C t ( 1 0 )
which is an easily calculated function, whereas 1 — (1 + 2q)~sl2 is
not . Tables might be constructed giving this latter function with
a rgument q, bu t such tables would have to be rather extensive and would
be rather inconvenient to interpolate. If q is small compared with
unity, then an approximate value of 1 — ( 1 + 2q)~312 will be given
by the first few te rms of the binomial expansion. W e have approximately
1 - ( 1 +2q)-^2 = 3q-~q2.
If we now define a function / (not to be confused with the / of the
integration tables) to be
( 9 )
r2 = x2 + y2 + z2
= (*O + £) 2 +(JO + ^ ) 2 + (*O + 0 2
= r« + 2* 0£ + 2y07, + 2z^ + ? + r,* + t,\
178 NUMERICAL INTEGRATION
which are the equations to be integrated in Encke 's method . T h e solution
involves six constants of integration, which are chosen so that the
coordinates and velocity components in the undis turbed orbit (xQy yQy z0y
x0y j>0, z0) are the same as those in the actual orbit at a part icular date,
that is, f, 77, £, £ , 7], £ are all zero. T h e date at which this is the case
is called the epoch of osculation.
Equat ions (11) are rigorous provided that q is calculated by formula
(9). In practice formula (10) will be used on starting the integration.
T h e per turbat ions f, 77, £ will gradually increase in size and when their
squares become appreciable formula (9) will be introduced.
Since xy yy z differ from x09 yQ9 z0 by the small quanti t ies f, 77, £, and
since mi is small, we may subst i tute x0y y0y z0 for x9 y, z in formulas
(11) and the error thus caused will be of the order of mult ipl ied
by its attraction on the asteroid or comet, that is, of the order of mf.
Thi s subst i tu t ion is usually made, and it permits the planetary attractions
to be calculated for a large n u m b e r of steps in advance. T h e result ing
per turbat ions are said to be accurate to the first order of dis turbing
forces. If the integration covers a sufficiently long t ime interval the
per turbat ions will become so large that the error commit ted becomes
appreciable, and then the planetary attractions may be calculated by
the rigorous formulas. T h e necessity for this may, however, be avoided
by an operation called rectifying the orbit . T h e coordinates of the
asteroid or comet are obtained at a new date by calculating accurate
values of xQy y0y z0 and adding them to f, 77, £, and the velocity components
are obtained by adding wx09 wy09 wz0 to values of w£y wr\y wt derived
from f, 77, £ by numerical differentiation. T h e values of a9 e9 ny and the
vectorial constants of the ellipse corresponding to the coordinates and
velocity components are calculated and the integration is recommenced
at the new epoch of osculation. In general it is preferable to rectify
the orbit rather than to use the rigorous formulas for the planetary
attractions, because the labor is less.
then when q is small / w i l l be close to 3, and since / changes much less
rapidly than q9 it is easy to interpolate in a table giving / as a function of q.
Equat ions (8) now become, th rough multiplication by w2
w2£ = w2k2
W2TJ = w2k2(l + m) -
w2t = w2k2{\ + m) -
(11)
ENCKE'S METHOD 179
5. N u m e r i c a l a p p l i c a t i o n of E n c k e ' s m e t h o d . T h e example
given for Cowell 's method is used, bu t with a 20-day interval, which
makes w2k2 = 0.1183 6492. We begin by calculating the coordinates
from the osculating orbit for one date before the epoch and for a number
of dates afterward (Table 6).
T A B L E 6
J . D . / u sin u cos u
2 4 2 9 9 8 0 . 5 7 1 . 4 8 8 7 5 . 9 0 2 4- 0 . 9 6 9 8 8 + 0 . 2 4 3 5 9
2 4 3 0 0 0 0 . 5 7 5 . 7 7 0 8 0 . 2 5 5 + 0 . 9 8 5 5 7 + 0 . 1 6 9 2 6
2 4 3 0 0 2 0 . 5 8 0 . 0 5 2 8 4 . 5 8 3 + 0 . 9 9 5 5 3 + 0 . 0 9 4 4 1
2 4 3 0 0 4 0 . 5 8 4 . 3 3 4 8 8 . 8 8 4 + 0 . 9 9 9 8 1 + 0 . 0 1 9 4 8
2 4 3 0 0 6 0 . 5 8 8 . 6 1 7 9 3 . 1 6 1 + 0 . 9 9 8 4 8 — 0 . 0 5 5 1 4
2 4 3 0 0 8 0 . 5 9 2 . 8 9 9 9 7 . 4 1 2 + 0 . 9 9 1 6 4 — 0 . 1 2 9 0 0
2 4 3 0 1 0 0 . 5 9 7 . 1 8 1 1 0 1 . 6 3 8 + 0 . 9 7 9 4 4 — 0 . 2 0 1 7 3
rl w2k2lr3
0
— 1 . 6 3 9 7 — 2 . 0 6 6 6 — 0 . 6 3 6 1 7 . 3 6 4 4- 0 . 0 0 5 9 2 3
— 1 . 4 8 1 7 — 2 . 1 7 6 9 — 0 . 7 2 0 2 7 . 45 3 + 0 . 0 0 5 8 1 7
— 1 . 3 1 5 1 — 2 . 2 7 4 6 — 0 . 8 0 0 0 7 . 5 4 3 4- 0 . 0 0 5 7 1 3
— 1 . 1 4 1 1 — 2 . 3 5 9 2 — 0 . 8 7 5 2 7 . 6 3 4 4- 0 . 0 0 5 6 1 1
— 0 . 9 6 0 5 — 2 . 4 3 0 7 — 0 . 9 4 5 6 7 . 7 2 5 4- 0 . 0 0 5 5 1 3
— 0 . 7 7 4 7 — 2 . 4 8 8 7 — 1 . 0 1 0 8 7 . 8 1 6 4- 0 . 0 0 5 4 1 7
— 0 . 5 8 4 7 — 2 . 5 3 3 2 — 1 . 0 7 0 4 7 . 9 0 5 4- 0 . 0 0 5 3 2 5
T h e planetary attractions for the first three dates may be obtained
by multiplyini y those in the previous example by 4. T h e addit ional
values needed are given in Table 7 in the same form as before.
T A B L E 7
PLANETARY ATTRACTIONS ON CERES
Julian Date Venus Earth Mars Jupiter Saturn S u m
2 4 3 0 0 4 0 . 5 1 4 . 0 4- 3 0 . 8 + 1 .2 — 1 8 2 . 8 — 1 2 . 6 — 1 7 7 . 4
4- 5 1 . 2 — 1 5 . 2 + 3 . 2 — 1 7 5 . 6 — 9 . 3 — 1 4 5 . 7
4- 2 3 . 6 — 6 . 8 4- 1 . 2 — 7 3 . 6 — 3 . 4 — 5 9 . 0
— 3 9 . 6 + 3 5 . 4 + 1 .3 — 1 7 8 . 6 — 1 2 . 7 — 1 9 4 . 2
2 4 3 0 0 6 0 . 5 + 3 4 . 3 — 3 . 9 + 3 . 5 — 1 7 9 . 0 — 9 . 2 — 1 5 4 . 3
+ 1 7 . 8 — 1 .9 4- 1 .3 — 7 4 . 2 — 3 . 3 — 6 0 . 3
— 5 2 . 8 4- 3 5 . 4 + 1.1 — 1 7 4 . 3 — 1 2 . 7 — 2 0 3 . 3
2 4 3 0 0 8 0 . 5 + 8 . 5 4- 8 . 4 4- 3 . 5 — 1 8 2 . 3 — 9 . 0 — 1 7 0 . 9
4- 7 .1 4- 3 . 5 4- 1 . 4 — 7 4 . 9 — 3 . 1 — 6 6 . 0
— 4 9 . 7 4- 3 1 . 2 + 0 . 8 — 1 6 9 . 9 — 1 2 . 8 — 2 0 0 . 4
2 4 3 0 1 0 0 . 5 — 1 9 . 1 + 2 0 . 2 + 3 . 3 — 1 8 5 . 4 — 8 . 9 — 1 8 9 . 9
— 5 . 5 + 8 . 7 4- 1 . 4 — 7 5 . 4 — 3 . 0 — 7 3 . 8
180 NUMERICAL INTEGRATION
W e find the starting values by the equations of Chapter IV, imposing
the condition that the per turbat ions and their first derivatives must be
zero at the epoch, and we then have sufficient information to fill in the
first three lines of the integration Tables 8, 9, and 10 which are shown
with several steps of integration added.
T A B L E 8
INTEGRATION FOR X
Date "/ y f1 /" f m
980 — 150
+ 2
000 + 12 — 148 — 13
— 74 — 11 + 4
020 — 62 — 159 — 9 + 8
— 233 — 20 + 12
040 — 295 — 179 + 3 — 6
— 412 — 17 + 6
060 — 707 — 196 + 9 0
— 608 — 8 + 6
080 — 1315 — 204 + 15
— 812 + 7
100 — 2127 — 197
— 1009
120 — 3136
T A B L E 9
INTEGRATION FOR y
Date "/ y f /"
980 — 186
+ 24
000 + 13 — 162 — 11
— 79 + 13 — 3
020 — 66 — 149 — 14 + 4
— 228 — 1 + 1
040 — 294 — 150 — 13 + 2
— 378 — 14 + 3
060 — 672 — 164 — 10 + 3
— 542 — 24 + 6
080 — 1214 — 188 — 4
— 730 — 28
100 — 1944 — 216
— 946
120 — 2 8 9 0
ENCKE'S METHOD 181
Date " / y /
980 — 84
+ 12
000 + 6
— 35
— 72 — 3
+ 9 — 4
020 — 29
— 98
— 63 — 7
+ 2
+ 6
+ 2
040 — 127
— 159
— 61 — 5
— 3
— 3
— 1
060 — 286
— 223
— 64 — 6
— 9
+ 4
+ 3
080 — 509
— 296
— 73 — 3
— 12
100 — 805
— 381
— 85
120 — 1186
T h e prel iminary values of / on the first three lines are s imply the
planetary at tractions. T h e n , for the f coordinate,
Jo
I f j L _ / ]
12 y ° ^ 2 4 0 y c
1 = 0 — — (-
1 1 2 1 - 1 4 8 > + 25><-
13) = + 12,
Yl/2 = 6 + 2 /o -r 1 2 / o = 0 + j ( --148) + j j ( - 4 ) = - 7 4 ,
and similarly for rj and £.
T h e first three lines of the integration tables being filled in, the
attractions are recomputed for the dates preceding and following the
osculating date . I t is found that some of them require corrections of
a unit , which are made and the differences corrected. T h e integration
now proceeds in the usual way. First the values of £, 77, £ are calculated
for date 040 by formula (15) of Chapter IV, then q and fq are obtained
and finally the attractions, which are added to the planetary attractions
and the results writ ten in the tables. Next , 77, £ are calculated again
by the more accurate formula (16) of Chapter IV and the former values
are corrected if necessary. I t is convenient to form a table of 77, £,
q, and fq, as they are obta ined step by s tep. Tab le 11 is given as a
example with the necessary corrections to five of the provisional per-
turbat ions already int roduced. In this example no corrections to the
provisional values of fq were found to be necessary.
T A B L E 10
INTEGRATION FOR Z
182 NUMERICAL INTEGRATION
T A B L E 1 1
ENCKE PERTURBATIONS
Date i V £ q fq
0 0 0 0 0 0 0 0
0 2 0 — 7 5 — 7 8 — 3 4 + 4 0 + 1 2 0
0 4 0 — 3 1 0 — 3 0 6 — 1 3 2 + 1 5 6 + 4 6 8
0 6 0 — 7 2 3 — 6 8 6 — 2 9 1 + 3 4 1 + 1 0 2 3
0 8 0 — 1 3 3 2 — 1 2 3 0 — 5 1 5 + 5 9 0 + 1 7 7 0
1 0 0 — 2 1 4 3 — 1 9 6 2 — 8 1 2 + 8 9 7 + 2 6 9 1
X
pa
Vi - y
Zj - z
P)
T h e data in Table 11 and in the integration tables are all in uni ts of the
eighth decimal.
6. E q u a t i o n s w i t h o r i g i n at the c e n t e r of m a s s . Equat ions (9)
of Chapter I may be extended to any n u m b e r of dis turbing point-masses
simply by writ ing
# = X k2m3
3
i
z = ^ k2m3-
3
with pf = (Xj — x)2 + (y3 — y)2 + (z3- — z)2, in which the origin of
coordinates is at the center of mass of the whole system. These equations
are simpler in form than (5) owing to the absence of the te rms that
express the action of the dis turbing planets on the sun, and they would
be more convenient to use than (5) if the coordinates of the sun and
planets referred to the center of mass of the solar system were readily
available, which is not the case. T h e general theories that give the
coordinates of the major planets as functions of the t ime are all referred
to the sun as origin. T h e slight complication of these theories, and of
the equations of motion for another body, that arises from this choice
of origin is more than compensated by the advantage that a separate
theory of the sun 's motion is rendered unnecessary.
Notwi ths tanding the inconvenience to the computer of having to
calculate the coordinates of the sun and planets referred to the center
of mass, it is sometimes advantageous to use this origin when integrat ing
the orbit of a comet or asteroid. W h e n the sun is used as center and
such a body is very distant from the sun, the te rm x3/r* of equations (5)
ORIGIN AT CENTER OF MASS 183
* = *,fV^ + S*^iV£' (I3)
Pa j Pj
and similar equations hold for y and z. These are the equations for
Cowell 's me thod referred to the barycenter. In applying them it is
necessary to obtain the barycentric coordinates xS9 yS9 zsy xj9 yjy zi from
the heliocentric coordinates of the planets. If the heliocentric coordinates
are denoted by xj9 yjy zjy then
1
Xjy
1 + m} (14)
with similar equations for the other coordinates.
T h e coordinates and velocity components at the epoch of osculation
differ from those referred to the sun. If they have been obtained from
heliocentric elements it is necessary to make the following t ransforma-
t ions before calculating the starting values for the integration. T h e
coordinates are
(15)
with similar equations for y and z. T h e velocity components referred
to the barycenter may be obtained from
(16)
with similar equations for y and z9 where the second part of the cor-
rections may be conveniently calculated by numerical differentiation of
the second part of (15) calculated for several dates.
and (11) may become much greater than (Xj — x)lp)> In the case of
the action of Jupi ter on Pluto, for example, the ratio of these two te rms
averages about 4 0 2 / 5 2 or 64. Consequent ly the former of these two
terms produces an additional significant figure in the attractions, which
shortens the tabular interval needed for good convergence of the differ-
ences. T h e use of the center of mass as origin avoids this difficulty and
permits a larger tabular interval to be used.
Separat ing the solar t e rm from the others, as was done in Eq. (5)
and denoting the barycentric (referred to the center of mass) coordinates
of the sun by xS9 ysy zsy Eq . (12) for x becomes
184 NUMERICAL INTEGRATION
Thi s is equivalent to assuming, so far as the action of mv is concerned,
that the asteroid or comet moves in an elliptic orbit around the center
of mass ofthe sun and mp. In applying Eqs. (14) and (15) the coordinates
of mp may be included, but it often happens that even this is not necessary
and that throughout the whole problem the sun 's mass may be augmented
by mp and no further at tention paid to it. For example, the center of
mass of Mercury and the sun is about 0.0000 0007 a.u. from the center
of the sun, and in most asteroid orbits so small a correction to the
coordinates of the disturbed body may be neglected.
Upon completion of the integration it will often be desired to refer
the coordinates again to the sun, which may be done by
with similar equations for y and z.
T h e equations for Encke 's method referred to the barycenter are
(17)
and similar equations for rj and £, where q and r\ are obtained from
and the x0- and xs are derived by (14).
7. I n t e g r a t i o n w i t h a u g m e n t e d m a s s of the s u n . When the
disturbed body is so far from the sun that, in Eqs . ( i2 ) , one or more
of the pj is nearly equal to psy then, denot ing such a p}- by pp and the
corresponding dis turbing mass by mpy if mp is sufficiently small we may
have
(18)
to the same n u m b e r of significant figures as are needed in the attractions.
Th i s is commonly the case with the action of Mercury on an asteroid,
or with the action of the four inner planets on Pluto. In such cases,
instead of (13) we may write
(19)
AUGMENTED MASS OF SUN 185
An augmented mass of the sun may be used in Eqs . (5) or (11) equally
as well as in (13). In such cases the te rm — xjrp is neglected in the
integration, and afterward included if necessary by the use of Eqs .
(14) and (15).
Whenever an augmented mass of the sun is used in an integration
the same augmented mass must be used in calculating the start ing values
for the integration. T h e way in which the augmented mass is to be
used depends on the elements that are chosen to correspond to the
constants of integration. Wi th either Encke 's or Cowell 's me thod the
actual constants of integration come from the coordinates and velocity
components at the epoch of osculation, and these constants are not the
same for the augmented mass of the sun as for the unaugmented mass.
F rom the way in which the coordinates and velocities were derived from
the elements in Chapter I it is evident that the only effect is to change
the relation between n, the mean motion, and a, the semi-major axis;
in applying Kepler ' s th i rd law, k2 (1 + m -f- mp) is to be subst i tuted
for k2(l + m).
I t is indifferent w rhether, on commencing operations, n is derived from
a or a from n, bu t once the decision is made all subsequent operations
must be consistent with it. If the equations of chapter I, section 27
are to be used at the end of the integration it will be more convenient
to regard a as the fundamental element. T h e n the procedure will be
governed by the initial data available. If a alone is given the course is
obvious. If a and n are both given, reject n and use the value given by
If only n is given it is to be assumed, lacking any s tatement to the
contrary, that it is referred to the sun alone. Therefore derive a from
then, using this value of <z, obtain the corrected value of n from (20).
In using the equations of Chapter I, Section 27 at the end of the
integration the augmented mass should not be used if, as is usual, the
elements are to be referred to the sun.
8 . R e l a t i v e a d v a n t a g e s of C o w e l l ' s a n d E n c k e ' s m e t h o d s . In
general it may be said that neither method is clearly superior to the
other when the work is carried out with a desk calculating machine.
Encke's method permits the use of a larger tabular interval bu t each
step requires more t ime than in Cowell 's method. For comets it is often
recommended to use Encke 's method when the comet is near the sun
k2(l + m + mD) = n2a3. (20)
k2(l + m) = n2a3, (21)
186 NUMERICAL INTEGRATION
and Cowell 's when it is far from the sun. W h e n close approaches occur
the Encke per turbat ions increase in size very rapidly, necessitating a
small tabular interval, and then the method loses all its advantages. T h e
change from either method to the other may be made without difficulty
by calculating coordinates and velocity components for a new epoch of
osculation and starting the integration anew at the new epoch.
Wi th modern large-scale calculating machinery, where the process of
integration may be carried out by operators who have little knowledge
of the art of computat ion, or it may even be made entirely automatic,
Cowell 's method is clearly superior to Encke 's . Wi th Encke 's method
the necessity for making judgmen t s regarding the formula to be used
for q, and the desirability of rectification, as well as the continually
increasing size of the per turbat ions , are complications that are easily
dealt with by a computer as they arise, bu t they cannot be easily foreseen
in advance, as is necessary with automatic methods .
N O T E S A N D R E F E R E N C E S
C o w e l l ' s m e t h o d w a s first u s e d in p r e d i c t i n g the re turn of H a l l e y ' s c o m e t
in 1910 , P . H . C o w e l l , A . C . D . C r o m m e l i n , " I n v e s t i g a t i o n of t h e m o t i o n of
H a l l e y ' s C o m e t f r o m 1 7 5 9 - 1 9 1 0 , " A p p e n d i x to Greenwich Observations 1909,
N e i l l , B e l l e v u e , E n g l a n d , 1910. In th is appl icat ion t h e authors ca lculate t h e
s e c o n d di f ferences of the c o o r d i n a t e s , w h i c h are t h e n d irect ly o b t a i n e d b y t h e
d o u b l e s u m m a t i o n , ins tead of s u m m i n g the s e c o n d der ivat ives a n d a p p l y i n g t h e
in tegrat ion formula to t h e s u m m a t i o n s , as is d o n e in th i s chapter . H o w e v e r ,
t h e y d o r e c o m m e n d u s e of the latter p r o c e d u r e , w h i c h is assoc ia ted w i t h a smal ler
a c c u m u l a t i o n of errors o f r o u n d i n g .
E n c k e ' s m e t h o d is d e s c r i b e d b y h i m in the Berliner jfahrbuch, 1857 .
J a m e s C. W a t s o n , Theoretical Astronomy, L i p p i n c o t t , 1900 , C h a p t e r V I I I ,
descr ibes several m e t h o d s of specia l p e r t u r b a t i o n s , w i t h n u m e r i c a l e x a m p l e s .
E x a m p l e s o f a n u m b e r of m e t h o d s are g i v e n in Planetary Coordinates for the
years 1960-1980, L o n d o n , H . M . S ta t ionery Office.
T h e a c c u m u l a t i o n of errors in n u m e r i c a l in tegra t ion , for t h e case w h e r e the
o n l y errors arise f r o m r o u n d i n g the e n d - f i g u r e s , w a s first treated b y D i r k B r o u w e r
in Astron. J. 46, 149 (1937 ) .
T h e a d v e n t of a u t o m a t i c h i g h - s p e e d ca lcu la t ing m a c h i n e s has p r o d u c e d w i d e -
spread interest in m e t h o d s of n u m e r i c a l in tegrat ion . M a n y n e w formulas have
b e e n i n t r o d u c e d , and C o w e l l ' s m e t h o d has b e e n red i scovered several t i m e s ,
b e i n g n o w k n o w n b y a var ie ty of n a m e s . B u t it d o e s n o t appear that any m e t h o d
is super ior to C o w e l l ' s and E n c k e ' s for general u s e in the n u m e r i c a l in tegrat ion
of orbi ts . T h e a c c u m u l a t i o n of error m a y b e r e d u c e d to an abso lu te m i n i m u m
b y caus ing the s tep- in terva l to b e automat ica l ly var ied , a c c o r d i n g to the s ize of
the dif ferences of h i g h e s t order reta ined.
CHAPTER V I
ABERRATION
1• I n t r o d u c t i o n . Because of the finite velocity of light, the apparent
position of any celestial body depends upon the motions both of this
body and of the earth dur ing the interval of t ime required for light
to travel from the body to the observer.
While the light is travelling from a celestial body to an observer on
the moving earth, bo th the body and the earth move awayfrom the
positions they occupied in space at the instant the light left the body.
T h e ray that is received at the observer is one that was emitted, at
some t ime previous to the instant of observation, in the direction toward
the position that the earth was later to occupy; consequently, by the
t ime the ray is received at the observer, the body is no longer located
in the direction from which the ray comes. Moreover , because the
earth is in motion, the direction whence the ray is coming when it
arrives at the observer differs from the direction whence it appears
to be coming; the latter (which is the direction in which the body is
actually seen) is the direction of motion of the light relative to
the observer, not its actual geometrical direction in space, and this
relative direction depends on the velocity of the observer at the
instant.
Consequently, the apparent direction in which a body is observed
on the celestial sphere is neither the actual geometric direction at the
instant of observation, nor the direction where the body was geo-
metrically located at the t ime the light left it.
T h e apparent displacement of celestial bodies from their actual
geometric directions that is caused by the progressive motion of light
in combination with the motions of the observer and of the bodies
themselves is called aberration. T h e displacement of the observed
apparent position from the actual geometric position at the instant of
observation is known as planetary aberration. T h e displacement of the
observed position from the position where the body was geometrically
located at the instant when the light left it is called stellar aberration.
Planetary aberration may be regarded as the resultant of two effects:
the stellar aberration due to the instantaneous velocity of the observer
at the instant of observation and the geometric displacement of the body
187
188 ABERRATION
in space due to its motion dur ing the interval while the light was
travelling to the earth.
In calculating the position in which a body will appear at a particular
t ime, from the geometric positions of the body and of the earth as derived
from gravitational theory, allowance mus t be made for these two effects.
Conversely, in deriving from observed positions the geometric positions
in space, appropriate corrections mus t be applied.
F I G . 1 . Aberration
In Fig. 1 let ET and PT denote the actual geometric positions of the
earth and a celestial body at the instant T when an observation is made,
and let Et and Pt denote their actual positions at an earlier instant t
when the ray that reaches the observer at t ime T left the body. T h e
geometrical path of the light-ray is along PtET. Denote by V the velocity
vector of the actual light-ray, and by v the velocity vector of the earth
at the instant T. T h e relative velocity of the light ray is represented
by Vr, the side of a parallelogram having v for its other side and V
for its diagonal. N o w draw a line from Pt parallel to v, cut t ing Vr
produced at Pa. T h e n Pa is the apparent position of the body at the
instant T. W e have further PtPaIPtET = v/V, and PtET = V(T — t),
whence PtPa — v(T — t), which is an impor tant principle often of value
in treating the stellar aberration. I t may be stated: the apparent position
of an object seen by an observer in motion is displaced from the position
in which it would be seen if the observer were at rest by an angular
amount rigorously equivalent to a linear displacement of the object
parallel to the observer 's direction of instantaneous motion, and equal
in amount to v{T — t), where v is the observer 's instantaneous velocity,
and T — / is the t ime required for light to travel from the object to
the observer.
T h e principles jus t stated are completely rigorous and general, bu t
owing to our imperfect knowledge of the distances and motions of the
STELLAR ABERRATION 189
stars, they are never rigorously applied to stellar positions in practice.
T h e distances and velocities of the stars are known with such low
accuracy that it is not practicable to calculate their motions dur ing the
interval T — t. Hence the planetary aberrat ion is never calculated for
the stars, bu t only the stellar aberrat ion. Moreover, in calculating the
stellar aberrat ion the motion of the solar system th rough space is
neglected. In consequence the positions of stars given in star catalogues
are not their t rue positions, and the proper motions are not the t rue
proper motions, bu t these discrepancies are of no consequence to
celestial mechanics ; stellar positions are used in celestial mechanics only
for defining a reference system of coordinates, and the catalogue positions
(with one small exception noted on page 197 ) are as useful for this
purpose as the t rue positions would be.
2 . S t e l l a r a b e r r a t i o n . T h e amount of the apparent displacement
due to stellar aberrat ion depends on the instantaneous velocity of the
observer at the instant the light reaches h im. In a r ight -handed
rectangular coordinate system with fixed origin and axes in fixed direc-
tions in space, the components of the velocity vector of the observer,
xy y, Zy are each the sum of the linear velocity due to the diurnal rotation
of the earth, the orbital motion of the earth relative to the sun (including
lunar and planetary per turbat ions as well as the elliptic motion) , and
the motion of the sun relative to the center of mass of the solar system
(but excluding the motion of the solar system among the stars).
Relative to parallel axes with origin at the observer, let L, B denote
longitudinal and latitudinal spherical coordinates referred to the xy plane.
T h e direction cosines of the line toward Pt whence the light actually
comes are
where L and B are the spherical coordinates of the point Pt. T h e
direction cosines of the apparent position Lay Ba are similarly
£ a = cos Ba cos L a , rja = cos Ba sin L a , £ a = sin Ba.
T h e points Pty Pa are diametrically opposi te to the points toward
which V and Vr are directed. Hence
£ = cos B cos L, rj = cos B sin L, £ — sin By
vjv, v = - vjv, - vjv,
with similar equations for £ a , rja, £a.
190 ABERRATION
W e have also
VJa =V£ + x, Varja = Vy]+y, Va£a = VC+z,
where Va£a and V£ represent direction components of the lines toward
Pa and Pt. Therefore
V x
cos Ba cos La = cos B cos L + — ,
V . y
cos Ba sin La = cos B sin L + — , (1)
-^r sin £ a = sin B + — .
Mult iplying the first of these equations by cos L, the second by sin L,
and adding, then mult iplying the first by sin L and the second by
cos L and subtracting, we obtain
V 1
-—- cos Ba cos (La — L) = cos B + — (x cos L + y sin L),
F 1
- ~ cos Ba sin (L a — L) = — — (x sin L — y cos L),
(2)
whence
Mult iplying the first of (2) by cos \ (La — L), the second by sin \
(La — L), and adding, an equation for cos Ba is obtained, which being
combined with the preceding equation for sin Ba gives after some
manipulat ion
tan (Ba - B) (4)
zcosB —ixcosL + j ) s i n L + t a n \ ( L a —L) (ycosL — xsmL)\sinB
tan (La - L) ,
— sec B(x sin L — y cos L)
V + sec B{x cos L + y sin L)' (3)
V + 2r sin B + cos L + j sin ^ + tan \ (La— L) (y cos L — x sin L)} cos 5*
T h e values for L a — L and i ? a — B so obtained are completely
rigorous and general. F rom them may be developed practical formulas
for the actual calculation of the stellar aberration in any system of
coordinates, to any desired degree of accuracy. Formulas (3) and (4)
are independent of the distance of the object, and they, or formulas
derived from them, are convenient to use when the distance of the
observed object is unknown.
If the distance is known with sufficient precision, however, it is
sometimes more convenient to treat the stellar aberration as equivalent
PLANETARY ABERRATION 191
to a linear displacement of the object. If Xu Yu Zu Rt arethe t rue
geometric rectangular coordinates and distance of the object at the
instant t when light left the body, Xay Ya, Za its apparent coordinates
at the instant of observation T, and xy y, z the coordinates of the observer
at t ime T, then we have directly, from the principles of the preceding
section, with complete rigor and generality,
Xa — Xt = — Rt,
Y a - Y t = ^Rt, (5)
Za Zt = — Rt,
in which, for perfect rigor, the origin of coordinates mus t be taken at
the center of mass of the solar system. If the travel-t ime of the light,
RtlVy is denoted by r, Eqs . (5) become
Xa — Xt = XTy
Y a - Y t = y T i (6)
Za — Zt = ZT.
3 . P l a n e t a r y a b e r r a t i o n . T h e stellar aberration, due to the
velocity of the observer, is only the displacement of the apparent position
(Pa in the diagram) from the geometr ic position Pt which the body had
when the light left it. By means of a further correction for the motion
of the body from Pt to PT dur ing the l ight-t ime, if this motion can
be determined, the geometric position of the body at the instant of
observation can be calculated. T h e combinat ion of the two effects, that
is, the displacement of the apparent position of the body from its t rue
position at the instant of observation Ty is the planetary aberration,
angle PTETPa in the diagram. Denot ing the actual coordinates of the
body at the instant of observation by X T , Y T , Z T , and developing the
quantit ies XT — Xu YT — Yu ZT — Zt in powers of r by Taylor ' s
theorem, we have
Y T - Y t = Ytr -\
ZT — Zt — ZtT -j
(7)
XT — Xt — Xtr +
192 ABERRATION
Subtract ing Eqs. (7) from (6) gives rigorous expressions for the
rectangular components of the planetary aberration,
— Xj> — {x - Xt)r -
Ya
- Y t ) r - \ YtT* - i YtT* -
— ZT = (z -Zt)r ~ \ l :tr* - \zy -
It is especially impor tan t to notice that in these equations the origin
of coordinates is not arbitrary, bu t must be taken at the center of mass
of the solar system if perfect rigor is aimed at.
Equat ions (8) are not suited for practical use, owing to the difficulty
of calculating the derivatives appearing therein. An equally rigorous and
more convenient method of calculating the planetary aberration is to
treat its two parts separately. Knowing XTy YTy ZTand the corresponding
distance RT we take rx — RT/V as the first approximation to r and
calculate Xv Yly Zly R1 for the instant T — r v T h e second approxima-
tion to r is r 2 = Ri/V> which yields a second approximation to the
coordinates, X2y Y2> Z 2 , R2. Proceeding in this way it will be found
after two or three approximations that r does not change. At this stage
we have Xty Yti Zh and the stellar aberration may then be calculated
by Eqs. (6) or Eqs. (3) and ( 4 ) , or approximately by some other method.
If the motion of the observed body is sensibly rectilinear dur ing the
t ime interval r , then in Eqs. (8) the terms involving T 2 and higher
powers of r become negligibly small, and fur thermore we have sensibly
Xt — XTy Yt = YTy Zt — ZT, whence
- X T = (x- XT) T ,
YA YT — {y YT) T ,
zn
— ZT = (z — ZT) T .
These equations may be yet further simplified for calculation by
writing rx for T , and by taking the sun instead of the center of mass
of the solar system for the origin of coordinates. In this shape they are
often used for practical calculations; in the case of comets and planets
the errors int roduced by the approximations are generally less than
O'/Ol, but for satellites the simplified equations should be used with
caution.
T h e r ight-hand members of Eqs. (9), simplified as jus t described,
express the change in the coordinates of the observed body relative to
the moving observer as origin, during the period in which the light
(8)
(9)
DIURNAL ABERRATION 193
passes from the body to the observer. T h e planetary aberrat ion is
therefore, unde r the assumptions noted, equal to this change. Hence ,
under the same assumptions similar equat ions will hold for any system of
coordinates with origin at the moving observer. For example, the planet-
ary aberrat ion in r ight ascension, a, and declination, S, is given by
AOL = — a T j ,
M = - 8 r v
( 1 0 )
which are to be applied to the geometrical place. If a geometric ephemeris
giving the right ascension and declination of the object as functions of
the t ime is available, the derivatives d and S may easily be found by
numerical differentiation.
4 . D i u r n a l a b e r r a t i o n . In the expressions of the preceding
sections the velocity components x, y, z are, in the complete theory,
those of the observer. I t is customary, however, for convenience, to
separate the stellar aberration into two parts , and to treat the port ion
depending on the orbital motion of the earth as distinct from the port ion
due to the rotation of the earth on its axis. T h e latter port ion is
called the diurnal aberra t ion; when this is t reated separately, the
remaining port ion, called the annual aberration, may be obtained
wi thout loss of rigor by using in expressions (3), (4), (6) the velocity
components of the center of the earth instead of those of the observer.
In consequence of the ear th ' s rotation on its axis, the observer is
continuously carried toward the east point of the horizon with a speed,
in meters per second.
v = 464 p cos 9?' ,
<p' being his geocentric lat i tude and p the radius of the earth at his
station in te rms of the equatorial radius. T h e constant 464 is the speed
of a point on the equator .
T h e effect of this motion is to produce a displacement of the apparent
positions of all celestial bodies toward the east point of the horizon,
on great circles passing th rough this point, expressed by
v • a
s = — sin 0 ,
6 being the angular distance of the body from the east point . Pu t t ing
for v and V their numerical values and dividing by arc 1",
s = 0''319 p cos cp' sin 0.
194 ABERRATION
T o find the effect of the displacement on the right ascension and
declination of a body, consider the spherical triangle PES formed by
the pole P , the east point of the horizon E, and the body S. T h e n 6
is the side ES; and if we put q for the angle at 5 , and h for the hour
angle of the body, the aberrat ion in right ascension and declination will
be
cos SAoc = s sin q,
AS — s cos q.
W e have also
sin 6 sin q = cos A,
sin 9 cos q = sin 8 sin h,
whence
AOL = 0"319 p cos cp' cos h sec S,
AS = 0"319 p cos cp' sin h sin S.
In using these expressions we may pu t p = 1 and the astronomical
latitude for cp' wi thout appreciable error.
T h e diurnal aberrat ion is of no consequence in differential astro-
metric work such as photographic work, because it affects all bodies
in the same region by nearly the same amount , and the small residual
effects are eliminated by the plate constants. I t must , however, be taken
into account in all measurements of the relative positions of widely
separated bodies, as in meridian astronomy.
5. C a l c u l a t i o n of the a n n u a l a b e r r a t i o n . W h e n a numerical
accuracy approaching O'.'OOl or bet ter is aimed at, the velocity
components x> y> z may be most conveniently obtained by numerical
differentiation of the ear th 's rectangular equatorial coordinates referred
to the center of mass of the solar system, and to a fixed equinox and
equator. T h e coordinates of the earth relative to the sun may be obtained
from the American Ephemeris simply by changing the signs of the sun ' s
rectangular equatorial coordinates, and the coordinates of the sun
relative to the center of mass may be derived from the data in Planetary
Coordinates.
In practice, however, this procedure has seldom been followed in the
past. Instead, the velocity of the sun relative to the center of mass was
neglected, and the earth was assumed to move in an elliptic orbit a round
the sun. It is still customary to use differentialformulas instead of the
rigorous expressions (3) and (4).
CALCULATION 195
: cos B cos Z> — sin B cos LJZ? — cos B sin L J L ,
: cos B sin L — sin B sin LzJi? + cos B cos L J L , (12)
: sin B + cos B AB.
Mult iplying the first of these equations by — sin L and the second
by cos L, and adding,
--cos BAL. (13)
Mult iplying the first of (12) by — sin B cos L, the second by — sin B
sin L, the third by cos By and adding,
— sin B cos L — sin B sin L - cos 5 — AB. (14)
Subst i tut ing for Zl l ( e», AY™, AZ™ their values (5) in (13) and (14)
we have
(15)
cos BAL =
where i ( e ) , y { e \ z(e) are referred to the ecliptic.
In order to express i ( e ) , y i e \ i ( e ) in te rms of the elliptic elements of
the ear th 's orbit , let us define
ifj, the longitude of the earth in its orbit,
r, the earth's radius vector,
co, the longitude of the earth's perihelion,
e = sin 99, the eccentricity of its orbit
its mean angular motion,
a, its semi-major axis.
If we put L for the ecliptic longitude of any body, B for its lat i tude,
and R for its distance, we have
Z ( e ) = R cos B cosL,
7<e> = R cos 5 sin L, (11)
Z<*> = sin 5 .
Differentiating,
196 ABERRATION
We then have, neglecting the earth 's lati tude,
= r cos
ye) = r sin
= 0,
(17)
Subst i tut ing these values of ift and f in (16) we find
xie) = — an sec cp (sin if; + e sin co),
ye) _ _|_ a w s e c ^ ( c o s ifj -\- e cos to) .
Subst i tut ing these values in (15) and put t ing for brevity
_ an sec cp
K — y ,
cos BAL = K (sin ifj + e sin co) sin L + /c (cos I/J + e cos to) cos L
= /c cos (0 — L) + £/C cos (to — L), (18)
= AC sin 5 sin (ip — L) + e/c sin J5 sin (to — L).
T h e last te rms of these equations are independent of the earth 's
longitude. For a particular star they are nearly constant dur ing several
centuries, and by convention they are omit ted from the formulas, being
included in the coordinates of the stars as determined by observation.
In the usual formulas we put O = <A— 180°, the t rue longitude of
the sun, and obtain
cos BAL = — K cos ( O — L),
AB = — K sin B sin ( O — L).
T h e quanti ty K is the constant of aberration, and its conventional
value is 20' '47.
and, by differentiation,
x i e ) = f cos I/J — rijj sin xjj,
y(e) — f sin ifj + rip cos I/J> (1
= 0.
By the laws of elliptic mot ion we have
n cos (pt
C A L C U L A T I O N 197
In equatorial coordinates we have, by a similar development ,
x
— = K (sin O — e sin to),
y
-p: = — K cos e (cos O — e cos co),
-p: = — /c sin e (cos O — £ cos co),
and , again omit t ing the te rms involving co,
cos SAoc = — K cos e cos O cos a — K sin O sin a, (20)
J 8 = K cos 8 cos O sin S sin a — /<: sin Q sin 8 cos a — K sin e cos Q cos S.
If we put for brevity
C = — K cos e cos O , D = — /c sin O,
c = cos a sec S, d = sin a sec S,
c' — tan e cos 8 — sin a sin 8, d' = cos a sin 8,
Eqs . (20) become
Aoc = Cc + Dd,
AS = Cc' + Dd\
(21)
which is Bessel's form, often used in calculating the annual aberrat ion
of the stars. T h e quanti t ies C and D are tabulated for every day in the
American Ephemeris.
T h e terms of the stellar aberrat ion involving GO, which affect the
catalogue mean places of stars, are of no consequence in meridian
astronomy, because the position of any object observed with a transit
circle is its apparent position, and it is usually compared with an apparent
ephemeris in which the planetary aberrat ion has been calculated by Eqs .
(10); these equat ions automatically include the effect of the elliptic term, as
it is called. On the other hand, photographic positions of celestial bodies
are obtained by referring the image of the body to the catalogue mean
places of stars in the immediate neighborhood. Such observations are
affected by the elliptic port ion of the aberrat ion and, unless due p re -
cautions are taken, appreciable errors will be produced in the elements
of the orbit. One remedy is to apply a special correction to every
published photographic observation, before making any use of it. T h e
corrections are given, in seconds of arc, by the matr ix product
cos a — sin a sin 8 I
sin a cos oc sin 8
0 cos 8
(22) |cos SAccASW = ||-f- 061 - 0."336 + 0'/027|
II
198 ABERRATION
T h e matrix on the right is usually needed for another purpose, which
makes this calculation an easy one.
T h e approximate differential methods commonly used in calculating
the annual aberration suffice for the ordinary purposes of celestial
mechanics, the order of accuracy being about O'.'Ol. T h e neglect of the
square of the displacement in the differential formulas produces errors
in the zodiacal region of the sky not exceeding O'.'Ol, provided that
the sun 's longitude and the coordinates of the object are referred to
the same equinox. T h e chief error in t roduced by the assumption that
the earth moves around the sun in an elliptic orbit arises from the
velocity of the earth around the center of mass of the ear th-moon
system. For a particular region of the sky this error takes the form of
a periodic error with period one mon th and coefficient 0'.'008. Another
similar error of period twelve years and coefficient 0'.'008 arises from
neglecting the velocity of the sun around the center of mass of the sun
and Jupi ter . T h e corresponding error involving Saturn has a coefficient
of 0'.'002. In the discussion of photographic observations of objects in
the solar system errors of this size may usually be neglected; the elliptic
port ion of the stellar aberration is the only port ion of it that needs to
be taken into account.
6. E p h e m e r i d e s * In refined work the positions that are observed
are never used directly, but instead the differences between the observed
positions and positions calculated from some theory of the motion (in
the sense observed position minus calculated position, or O — C) are
discussed. An ephemeris is a table of calculated positions with the t ime
as a rgument ; with some looseness the te rm may be applied to any set
of calculated positions. In calculating a precise ephemeris it is desirable
to treat the aberration in the same way as the observations are affected,
so as to make the two directly comparable, and different methods mus t
be used for different classes of observations.
A geometric ephemeris gives the actual position of the body at the
times indicated. I t is impossible to observe t rue geometric positions,
and hence a geometric ephemeris is useless for any precise work. I t is
chiefly employed when only a low degree of accuracy is aimed at, as
when the only purpose is to find the body.
An apparent ephemeris may be obtained by applying the planetary
aberration to a geometric ephemeris . Positions observed with a transit
circle are apparent positions. Hence ephemerides of the sun, moon, and
principal planets are usually apparent ephemerides . Phenomena such as
eclipses of the sun or of Jupi ter ' s satellites depend on the apparent
positions of the bodies concerned, and therefore apparent ephemerides
are useful in this connection also.
SPECIAL CASES 199
Observed positions obtained by compar ing an object with catalogue
places of stars in the immediate neighborhood are neither geometric
positions nor apparent positions, bu t of an intermediate class that may
be te rmed astrometric positions. T h e y are free from the principal parts
of the stellar aberration, that is, of the diurnal aberrat ion and of the
circular port ion of the annual aberration, but they are affected by the
barycentric motion of the observed object in the t ime taken for light
to travel from it to the observer, and by the elliptic port ion of the annual
aberrat ion. An astrometric ephemeris may therefore be obtained by
antedat ing the heliocentric position of the object (which may be sub -
sti tuted for the barycentr ic position with an error not exceeding O'.'Ol)for the l ight-t ime by successive approximations in the way described
in Section 3, and by then subtract ing the results of (22) from the geo-
centric ephemeris . Alternatively, the apparent ephemeris may first be
calculated and the circular port ion of the annual aberrat ion then sub -
tracted. Th i s latter method is not perfectly rigorous if the abridged
formulas are used for the planetary aberration, bu t is often sufficiently
accurate.
A fourth kind of ephemeris , called an astrographic ephemeris , has
been used to some extent. I t is easier to calculate than the astrometric
ephemeris , from which it differs only by the elliptic port ion of the annual
aberration. Before comparing it with a photographic observation the
expression (22) must be either added to the observed position or sub -
tracted from the ephemeris .
7. S p e c i a l c a s e s of a b e r r a t i o n .
(a) T h e sun. Dur ing the t ime required for light to pass from the sun
to the earth, the motion of the sun around the center of mass of the solar
system is virtually zero. T h u s , of the two parts making up the planetary
aberration one vanishes and only the stellar aberration remains. I t is
worthy of note that, contrary to the practice for the stars, the complete
annual aberrat ion is applied to the solar ephemeris . Were this not done,
observations of the sun 's apparent place would disagree with the ephe-
meris, the discrepancy in the longitude having an annual period with
a coefficient of 0'.'343.
(b) T h e moon. Dur ing the t ime required for light to pass from the
moon to the earth, the moon moves approximately 0'.'7 in geocentric
orbital longitude, with a variation of about 0"05 due to the eccentricity
of its orbit . Th i s is the full amount of the planetary aberration, the
annual aberration being canceled by the heliocentric motion of the moon
during the l ight-t ime. In fact, the aberration is not applied to the lunar
ephemeris at all; the tables of the moon are constructed so as to give
200 ABERRATION
the apparent position of the moon directly, by slightly modifying the
geometric elements of the moon ' s orbit. In some applications it is im-
portant to remember that the lunar parallax tabulated in the ephemeris
is not, strictly speaking, the t rue parallax at the t ime indicated.
(c) A satellite. W h e n a satellite is observed by photographing it
against the background of stars, the derived position is the astrometric
position. When it is observed by measur ing its distance and position
angle from its primary, the stellar aberration, being virtually the same
for both bodies, may be neglected, or if it is appreciable it may be applied
differentially. Leaving this aside, the observation connects the position
of the satellite when light left it with the position of the pr imary when
light left the primary. T o a very high degree of approximation the
satellite partakes of the heliocentric motion of its pr imary dur ing the
l ight-time of the latter, and therefore it is necessary in practice only
to correct the observation for the motion of the satellite relative to its
pr imary dur ing the t ime required for light to pass from the satellite
to the observer.
N O T E S A N D R E F E R E N C E S
T h e aberrat ion of l ight w a s d i s c o v e r e d i n 1725 b y J a m e s Brad ley , a f terwards
A s t r o n o m e r Roya l , d u r i n g a ser ies o f a t t e m p t s b y several a s t r o n o m e r s to m e a s u r e
t h e annual parallax of a star, an enterpr i se u n d e r t a k e n w i t h a v i e w to e s t a b l i s h i n g
observat iona l ly that t h e earth w a s n o t at t h e c e n t e r of t h e solar s y s t e m . Brad ley
o b s e r v e d o n D e c e m b e r 17 that g a m m a D r a c o n i s w a s at a m o r e s o u t h e r l y d e c l i n a -
t ion than w a s e x p e c t e d , a n d o n D e c e m b e r 2 0 f o u n d it t o b e still farther s o u t h ,
t h e m o t i o n b e i n g o p p o s i t e in d irec t ion f r o m t h e m o t i o n that w o u l d h a v e r e s u l t e d
f r o m parallax. H e o b t a i n e d a t e l e s c o p e for t h e p u r p o s e of s t u d y i n g t h e effect ,
and o b s e r v e d a n u m b e r of br ight stars t h r o u g h o u t a year , 1 7 2 7 - 2 8 , after w h i c h ,
b y analysis of t h e observat iona l mater ia l , h e ascer ta ined t h e character o f the
p h e n o m e n o n a n d e x p l a i n e d it correct ly .
CHAPTER VII
COMPARISON OF OBSERVATION A N D THEORY
1. I n t r o d u c t i o n . T h e word theory is used in celestial mechanics
to mean any mathematical expression from which can be derived the
coordinates of a body as functions of the t ime. Theor ies are of two types,
special and general. A special theory is one that gives the coordinates
for special values of the t ime only; a numerical integration of the equa-
tions of heliocentric mot ion of a comet or planet is an example of a
special theory. In a general theory the t ime is represented by a symbol
for which any value may be subst i tu ted at will and the coordinates
obtained for the corresponding da te ; therefore a general theory cannot
be entirely numerical in form. I t may be entirely analytic, as the lunar
theory of C . E. Delaunay, which expresses the coordinates as functions
of seven symbols corresponding to the six elements of the orbit and
the t ime ; or it may be part ly analytic and part ly numerical , as E. W .
Brown's lunar theory, in which numerical values are subst i tu ted for
some of the elements . T h e r e are also general theories in which numerical
values are subst i tu ted for all of the elements, the only quant i ty denoted
by a symbol being the t ime, such as G. W . Hill 's theory of Jup i te r ;
such theories are commonly, though somewhat inexactly, called n u m e r -
ical general theories.
Al though any value of the t ime may be subst i tuted in a general
theory, it does not necessarily follow that the result has any physical
meaning. General planetary theories usually have contained t r igono-
metr ic functions factored by the t ime, the values of which increase
wi thout limit as the t ime increases wi thout l imi t ; this characterist ic
prevents their being valid for more than a few centuries. T h e lunar
theory is free from this drawback and in form is valid for all t ime. But
the elements of the orbit and the masses of the dis turbing bodies still
have to be de termined by observation. T h e observations being limited
in n u m b e r and of limited accuracy, the theory necessarily departs further
and further from the t ru th for t imes further and further removed from
the fundamental epoch.
A theory is of no use unti l it is compared with observation, since
this is the only way of testing its validity. Fu r the rmore , no theory has
ever been constructed which was at first as accurate as the existing
201
202 OBSERVATION AND THEORY
observations, because the elements of the orbit are not sufficiently well
known at the beginning. Therefore , the initial comparison of any theory
with observation is primari ly for the purpose of improving the provisional
elements on which the theory was based; often at the same t ime the value
of some dis turbing mass or of some other astronomical constant may
be determined.
A theory usually gives the coordinates of a body referred to the center
of its pr imary as origin. T h u s , the lunar theory gives the geocentric
coordinates of the moon, the theory of Jupi ter the heliocentric coor-
dinates of Jupi ter , and the theory of Hyper ion the Saturnicentr ic
coordinates of this satellite. T h e observations, however, are always
referred to the earth as origin, and not to the center of the earth bu t
to the point where the observer stands. A comparison of observation
and theory necessarily entails one or more transformations of coordinates,
and these transformations are the subject of the present chapter.
2 . M o t i o n sof the p l a n e s of r e f e r e n c e . T h e planes of reference
most commonly used in celestial mechanics are the plane of the ecliptic
and the plane of the equator . T h e great circles in the sky where these
two planes cut the celestial sphere are the ecliptic and equator. T h e
vernal equinox (or for brevity, the equinox) is the one of the two
intersections of the ecliptic and equator that the sun passes about
March 21 . T h e ecliptic, equator, and equinox are all in continuous
motion, and hence the lat i tude, longitude, declination, and right
ascension of any celestial body all change continuously. Most of the
changes are different in different regions of the sky. W h e n they are
developed analytically in the way commonly used, it is found that two
kinds of terms appear. The re are periodic terms, which have in their
a rguments certain elements of the orbits of the earth and m o o n ; these
are called nutational te rms. T h e r e are also secular terms, which contain
powers of the t ime and are independent of the instantaneous positions
of the earth and moon ; these are the precessional te rms. It is convenient
to treat these two classes of terms separately.
If the declination and right ascension of any body are measured with
a transit circle the declination is referred to the instantaneous equator
and the right ascension to the instantaneous equinox. T h e coordinates
are termed apparent declination and apparent right ascension. These
coordinates are affected by planetary aberration, and if they are freed
from the effects of aberration they are then called t rue declination and
t rue right ascension. T h e effects of the nutat ion may be removed from
the t rue coordinates, and the coordinates are then said to be referred
to the mean equator and mean equinox of date. In addition, the effects
PRECESSION 203
of the precession over a specified interval of t ime may be removed. T h e
interval of t ime is usually taken to be that since the beginning of some
tropical year, as 1950.0, and the coordinates are then said to be referred
to the equator and equinox 1950.0. ( T h e beginning of the tropical year
is the instant when the sun 's mean longitude is 280°, and it mus t be
carefully dist inguished from the beginning of the calendar year. T h e
instant 1950.0, for example, is 1950 January 0.923 E .T . )
3 . P r e c e s s i o n . T h e instantaneous rates of change per tropical
century of the right ascension and declination of any body, due to
precession, are given by
pa = m + n sin a tan S, ^
pd = n cos a,
where a and S are the instantaneous right ascension and declination
referred to the mean equinox and equator of date, and the conventionally
adopted values of m and n are
m = 4608'/50 + 2'/79 T
= 307?233 + 0?186 T, (2)
n = 2004'/68 - 0"85 Ty
T being reckoned in tropical centuries from 1900.0.
T h e total change of the right ascension and declination of any body,
due to precession, between any two dates t0 and tl9 is given by
Aoc = [hp*dTy
J (3)
AS = I xp6dT. 1
Equat ions (3) although writ ten as definite in tegra ls , are actually
differential equations. T h e y might be solved by s tep-by-s tep numerical
integration as described in Chapter IV, but this would be rather laborious;
it is easier to proceed by successive approximations if tx — £0 is less than
a century or so, unless the body is within a few degrees of the pole.
It will usually be sufficient to evaluate m and n at the t ime
T = \{t1 + t0)y and then to subst i tute in the approximate equat ions
Aoc = (t1 — t0) (m + n s in a 0 tan 8 0 ) ,
AS = (tx — t0) (n cos a 0 ) ,
(4)
204 OBSERVATION AND THEORY
a 0 and S 0 being the values of oc and 8 at t ime t0. T h e first approximation is
a i = a o + ^ a > ^
S, = S0+A8.
Now recalculate the expressions (4), using oc1 and 8L instead of oc0 and S 0 .
T h e new values of and AS will differ very little from the first
approximation, and the mean of the two results may usually be
adopted.
In practice it will usually be possible to avoid all calculation beyond
a little interpolation, by using the special tables provided in the American
Ephemeris or the British Astronomical Ephemeris.
4 . N u t a t i o n . T h e effects of nutat ion on the right ascension and
declination of a body, unless it is wi th in a few degrees of the pole, may
be calculated by
Aoc — (cos e + sin e sin oc tan 8) AJJ — cos oc tan 8 Ae>
(6)
AS = sin e cos ocAip + sin ocAe,
where oc and 8 are the right ascension and declination of the body, e
is the obliquity of the ecliptic, and AI/J and Ae are the nutat ion in longitude
and obliquity given in the American Ephemeris for every day of the
year. Equat ions (6) are not rigorous, being derived from the differential
relations among the parts of a spherical triangle. T h e quantit ies Aoc and
AS cannot exceed 2 0 " on a great circle, and hence the equations suffice
to give these quantit ies accurate to 0"01 unless tan 8 is very large.
Owing to the smallness of Aoc and AS the tr igonometric functions are
needed to only three or four significant figures, and hence it is not
necessary to define a, 8, and e r igorously; either the apparent values
or the values at the beginning of the tropical year in which the
observation is made may be used.
In practice, right ascensions and declinations are seldom if ever
referred to the mean equinox and equator of date. Transit-circle ob-
servers usually publish apparent positions, while micrometric and
photographic observers usually publish positions referred to the mean
equinox and equator of the beginning of the tropical year of observation
(occasionally a s tandard equinox and equator such as 1900.0 or 1950.0
is used). I t is therefore convenient to combine Eqs . (6) with the reduction
for precession dur ing a fraction of a year; any additional reduction for
precession is then made for a whole n u m b e r of tropical years.
NUTATION 205
p cos e — X = m,
p sin s = n.
T h e n if we pu t for brevity
F = cos s + sin s sin oc tan S,
it is seen from (8) that
m + n sin oc tan 8 = pF — A,
whence
m + n sin a tan 8 A
r = .
p p
T h e n the sum of Eqs . (6) and (7) may be writ ten
(8)
Aoc = (T H j (m + n sin oc tan 8) + A —— — As cos a tan 8,
' ' (9) AI/J
P
If we put
Aip
A = r H — , a = m + « sin a tan 8, P
B = — As,
b = cos a tan 8,
a = n cos a,
E = A —— , = — sin a,
Eqs. (9) become
= + 56 + E,
AS = Aa' + Bb\
(10)
Wi th the elapsed fraction of the tropical year denoted by r , Eqs . (4)
for the precession become
Aoc = r(m + n sin oc tan 8),
(7)
AS — rn cos oc,
where the subscripts of oc and S may be dropped. T h e reduct ion for
precession and nutat ion combined is obtained by adding Eqs . (6) to
Eqs . (7). T h e resulting expressions may be simplified by a t rans -
formation.
Let us define two quantit ies p and A by the equations
206 OBSERVATION AND THEORY
T h e quantit ies A, B, E may be obtained from the American Ephemeris
or the British Astronomical Ephemeris, while a, b,'a\ V may be calculated
from the approximate position of the body.
I t is customary to express Aoc in seconds of t ime, by dividing a, b,
and E by 15.
It is convenient to combine the reduct ion for nutat ion and precession
since the beginning of the year with the reduction for annual aberration,
by adding Eqs . (10) to Eqs . (21) of Chapter VI . T h e quantit ies obtained,
when added to the astrometr ic place at the beginning of the year,
give the apparent place. Conversely they may be subtracted from the
apparent place to obtain the astrometric place at the beginning of the
tropical year. In all ordinary cases the quanti t ies a, 6, cy d, a', b\ c\ d'
may be calculated wi th the use of either the astrometric place or
the apparent place of the object.
5. G e o c e n t r i c p a r a l l a x . T h e horizontal parallax of any body at
a particular instant is defined as theangle at the center of the body
subtended by the ear th 's equatorial radius. I t is nearly equal to the
apparent vertical displacement of the body against the background of
stars when it is rising or setting. T h e constant of the solar parallax is
the angle subtended by the earth 's equatorial radius at a distance of
one astronomical unit , and its adopted value is 8780. Hence if the
geocentric distance to any body, expressed in astronomical units , is
denoted by r, then the horizontal parallax is given by
sin 77 = sin 8780/r. (11)
Unless the body is very near the earth the simpler formula
77 = 8780/r
may be subst i tuted for (11). T h e error in this approximation is less than
0701 if r is greater than 0.007 a.u.
W h e n a body is observed on the meridian, as is the case always with
the transit circle and sometimes with photography, the parallactic
displacement is in declination only, the right ascension not being affected.
T h e correction to be applied to the observed declination in order to
obtain the t rue declination is
AS = pir sin (<pf — S),
where p is the radius of the earth at the observer 's lati tude, expressed
in units of the equatorial radius, and q>' is the geocentric lat i tude.
GEOCENTRIC PARALLAX 207
~~ sin y
where
tan y = tan <p '/cos hy
and h is the hour angle. T h e hour angle is found from the local sidereal
t ime, 8y by
h = 6 - oc.
Formulas (12) suffice for correcting any observation for parallax,
unless the body is very near the earth. I t is, however, seldom necessary
to use them. I t is the practice of most observers to publ ish so-called
parallax factors with their observations. These factors may take either
of two forms. T h e y may be the components in right ascension and
declination corresponding to a distance of 1 a.u., in which case they
need only to be divided by the actual distance, or they may be the com-
ponents corresponding to a parallax of 1", in which case they are to
be mult ipl ied by the actual horizontal parallax.
T h e application of parallax corrections to the observed coordinates
of an object reduces them to the values that they would have if the
observations had been made from the center of the earth, and hence
this way of proceeding is appropr ia te when the observations are to be
compared with a geocentric ephemeris . I t often happens , however, that
no precise ephemeris exists, and then the observations may be compared
with theoretical positions especially calculated for the purpose . In such
cases it is advantageous, instead of correcting the observations for
parallax, to refer the calculated positions to the observer by the use of
topocentric coordinates.
Let xy yy z be the calculated equatorial rectangular heliocentric
coordinates of the observed object at the instant, preceding the t ime
of observation, when light left i t ; let Xy Yy Z be the geocentric coor-
dinates of the sun at the instant of observation referred to the same
axes; and let JC, yy z be the topocentr ic coordinates of the center of the
earth. T h e n the calculated distance, right ascension, and declination
appropriate for direct comparison with observation are given by
p cos S cos oc = x + X + x,
p cos § sin oc = y + Y + y, (13)
p sin § = z + Z + z-
If a body is observed off the meridian, the corrections to the observed
right ascension and declination are
. pTT cos cp' sin h
Aoc = - ^ , cos 6
J § = P77 s i n s i n ~ s)
(12)
208 OBSERVATION AND THEORY
T h e coordinates x, yy z are functions of the observer 's lat i tude, and
of the local sidereal t ime, given in uni ts of the seventh decimal by
A = 426.6 cos 9 ' ,
x = — A cos 0,
y = — A sin 0,
Z = — 426.6 sin cp'.
T h e sidereal t ime 9, referred to the equinox 1950.0, may be found in
decimals of a circumference by
0.79115 + 1.0027 37803 (J.D. — 243 0000.0) - A,
A being the west longitude of the observer.
6. P r e c e p t s . Chapters VI and VII contain all the principles that
are necessary for comparing observations with theory. T h e procedure
for the more common cases arising in practice may be described as
follows.
Case 1. Observed place apparent topocent r ic—Computed place
apparent geocentric. Correct the observed place for parallax by (12) or
by using the parallax factors.
Case 2. Observed place apparent topocent r ic—Computed place
astrographic topocentric . Free the observation from the circular port ion
of the annual aberration, from the nutat ion, and from the precession
since the beginning of the current year by subtract ing the results of
(10) and Chapter VI, Eq. (21). T h e n reduce the observation for precession
to the same equinox and equator as the computed place by expressions
(4) and (5) or by special tables. Free the observation from the elliptic
term of the aberration by adding the results of Chapter VI, Eq. (22).
Case 3. Observed place astrometric topocent r ic—Computed place
astrographic topocentric . Reduce the observation to the same equinox
and equator as the computed place, and add the results of Chapter VI,
Eq . (22).
Case 4. Observed place astrometric topocen t r ic—Computed place
astrometric geocentric. Reduce the observation to the same equinox and
equator as the computed place, and correct the observation for parallax
by (12) or by the parallax factors.
NOTES AND REFERENCES 209
N O T E S A N D R E F E R E N C E S
T h e correc t ion of a s t ronomica l o b s e r v a t i o n s for t h e effects of p r e c e s s i o n , p r o p e r
m o t i o n , n u t a t i o n , aberrat ion , and paral lax b e l o n g s to the subjec t of spher ica l
a s t r o n o m y , of w h i c h o n l y the barest out l ine can be g i v e n here . A va luab le treat ise
o n the subjec t is W . C h a u v e n e t , Manual of Spherical and Practical Astronomy,
2 V o l s . , L i p p i n c o t t , P h i l a d e l p h i a , 1855, repr in ted b y D o v e r P u b l i c a t i o n s , N e w
York , 1960 . P e r h a p s e v e n m o r e va luable is S i m o n N e w c o m b , A Compendium of
Spherical Astronomy, M a c m i l l a n , N e w York , 1906; a p h o t o g r a p h i c reprint has
b e e n p r o d u c e d b y U n i v e r s i t y M i c r o f i l m s , A n n A r b o r , M i c h i g a n . T h e m o s t
m o d e r n work o n the subjec t is W . M . S m a r t , Spherical Astronomy, C a m b r i d g e
U n i v . Pres s , L o n d o n and N e w York , 1944 , re i s sued C a m b . U n i v . Press , 1 9 6 0 .
CHAPTER V I I I
THE METHOD OF LEAST SQUARES
1. I n t r o d u c t i o n . In any theory of the motion of a body, either
around another body or about its axis of rotation, certain constants
appear that have to be determined by observation. W h e n nothing about
these constants is known in advance, as is the case with the elements
of the orbit of a newly discovered object, the determinat ion of them
may be difficult; in the case ment ioned an extensive literature exists
for this specific purpose . W h e n approximate values of the constants are
known, however, they may be incorporated into a theory of the motion,
which may then be used to calculate theoretical positions of the object.
A comparison of the theory with observations will show that the theory
does not represent the observations exactly. Each observation furnishes
a residual, observed place minus computed place, O — C, which is due
to three causes, and to no others. Firs t , the theory may be inadequate .
Second, the observations are affected by errors. Th i rd , the errors of
the approximate constants used in determining the computed posit ions
contr ibute to the residuals. In this chapter are discussed the two classes
of errors last ment ioned, and it is shown how the approximate values
of the constants may be improved by an analysis of the discrepancies
between observation and theory.
T h e method of analysis may be used to obtain not only more accurate
values of the orbital elements of a body, but also improved values of
any other parameter on which the observations depend. As examples
may be ment ioned the e lements of the ear th 's orbit, the masses of
dis turbing planets, the solar parallax, the constant of nutat ion, and other
astronomical constants . In any particular case it is necessary that the
observations be such that an error in the assumed value of the constant
will exert an appreciable influence.
2 . F r e q u e n c y d i s t r i b u t i o n of e r r o r s of o b s e r v a t i o n s . T h e
errors of observations are due to various causes, among which are the
optical defects of telescopes, fluctuations of the density of the at-
mosphere , and imperfections of the screws used in measuring pho to-
graphic plates. T h e errors of a series of observations are of different
210
DISTRIBUTION OF ERRORS 211
sizes. If we imagine the size of an error to be plot ted as ordinate against
the number of observations having that size of error as abscissa, the
resulting curve is called a frequency dis t r ibut ion curve or probabil i ty
distr ibution curve of errors . I n general such a curve is shaped like a
bell that is more or less symmetrical about the point of zero error ,
showing that positive and negative errors occur wi th equal frequency
and tha t small errors are more numerous than large ones.
In most cases dealt with in as t ronomy the frequency distr ibut ion of
errors is not known in advance, bu t can be de te rmined only after a
very careful investigation of the observational material , if at all. I t is,
however, possible to make an assumption about the dis t r ibut ion that
will be adequate if the errors are each the sum of a n u m b e r of
simultaneously occurring errors, each one of different origin. Such errors
are often called accidental errors as dist inguished from systematic errors,
which may have a single origin. In practice astronomical observations
are usually affected by errors of bo th kinds, and the presence of
systematic errors may invalidate the conclusions drawn. W e are, however,
forced to proceed as if the errors were accidental, at least in the first
instance, since there is no mathematical theory of systematic errors .
Regarding accidental errors, there is a theorem in the theory of
probabil i ty stating that the frequency distr ibution of a sum of r andom
variables, each wi th its own arbitrary frequency distr ibution, asym-
totically approaches the function
/(*) dx = hu-1'2 exp { - h2x2} dx (1)
if the individual random variables are independent of one another . In
this expression f(x) dx is the relative n u m b e r of cases where the r andom
variable has a value between x and x + dx, TT is the familiar constant,
the expression exp {— h2x2} means the base of natural logari thms
raised to the power — h2x2, and h is called the modulus of precision.
T h e factor hrr~l12 is in t roduced in order to make
f(x) dx = 1.
Instead of h we may introduce the mean of the squares of the random
variable x. T h e square root of this mean is called the s tandard deviation,
c j , or the r .m.s . ( root-mean-square) error. T h e relation between cr and h is
a2 = hrr-1'2 x2 exp {— h2x2}d*
212 LEAST SQUARES
T A B L E 1
DISTRIBUTION OF ERRORS
Multiple of p.e. N u m b e r of larger errors
1 5 0 0 0 or 1 in 2
2 1 7 7 4 or 1 in 6
3 4 3 0 or 1 in 2 3
4 7 0 or 1 in 1 4 0
5 8 or 1 in 1 2 0 0
6 1 or 1 in 1 0 , 0 0 0
Any marked deviation from the figures in the table is an indication
that the dis tr ibut ion is not Gaussian.
If we have a linear combinat ion of two random variables
x = axxx + a2x2i (2)
in which ax and a2 are constants , while xx and x2 have normal
distr ibutions with s tandard deviations G 1 and cr 2, the distr ibution of x
can be found as follows.
or
and
T h e frequency dis tr ibut ion (1) is said to be a normal dis tr ibut ion or
a Gaussian distr ibution. If the errors have such a dis tr ibut ion the
s tandard deviation a is called the mean error or the s tandard error.
Another parameter , m u c h used in astronomy, is the probable error,
which is defined as the quant i ty tha t exceeds half of the errors in absolute
magni tude , and is exceeded by half of them. T h e probable error (p.e.)
as thus defined has a definite meaning no mat ter what the distr ibution
of errors is. If, and only if, the distr ibution is Gaussian, then it may
be shown that the probable error is connected with the s tandard deviation
by the relation
p.e. = 0.6745a.
Table 1 shows, for a Gaussian dis tr ibut ion of 10,000 errors, the
n u m b e r that exceed various mult iples of the probable error.
in which
a* = a\a\ + a\a\. (4)
Hence the distr ibution of a linear combinat ion of two random variables
each with a normal distr ibution is a normal distr ibution, with a s tandard
deviation given by (4). T h i s principle can be extended to a linear
combinat ion of any n u m b e r of r andom variables.
3 . T h e p r e f e r r e d v a l u e of a m e a s u r e d q u a n t i t y . Let us consider
n measurements xly x2y xn of a quanti ty, and let us denote the
t rue value of the quant i ty by a. T h e n the error e,- of any measurement
Xj is Sj = Xj — a. If the errors have a normal dis t r ibut ion the probabil i ty
of any measurement xi is
^ = e x p { - ( * , - a ) V 2 o * } ,
and the joint probabil i ty for the n measurements is
fefefexp i ~ i [ { x i - a ) 2 + ( * 2 ~ a ) 2 + - + ( * « ~ a ) 2 ] ! - ( 5 )
T h e preferred estimate of a is defined as tha t which makes (5) a
max imum or
J_
2o*
-Z{Xj-af (6)
a min imum. T o make (6) a m i n i m u m it is sufficient to have
dZ
whence
= 0 or 227 (*, - A) = 0,
a = -ZXj. (7)
PREFERRED VALUE 213
| ^ </#2. (3)
I t can be shown that for
x < axxx + tfgtfg < # + dxy
T h e s imultaneous or joint probabili ty that xx has a value between
and xx + d ^ , and also x2 between x2 and ,x:2 -f rf#2 is
214 LEAST SQUARES
Note that in (8) a denotes the t rue (unknown) value of the measured
quanti ty, and not the preferred estimate of it given by (7).
If the frequency distr ibution of the errors is not a normal one we
can still find preferred estimates for a and a if the form of the distr ibution
is known. Let the probabili ty of any single measurement be f(a, xly x2,
xn)y where / is any known function. T h e n the joint probabil i ty is
nf(Xj - a),
and the preferred estimates can be found from
T h e mean error of the ari thmetic mean (7), as dist inguished from
the expression (8), may be determined as follows. Let there be n observa-
tions Xj,j = 1, 2, w, with t rue errors ejy normally distr ibuted. T h e
error of the ar i thmetic mean is
1 y
n J
which is a linear combinat ion of n random variables. If a is the mean
error of one observation, then by (4),
a2 = na2/n2 = o2\n,
whence o~e = on~ll2
y or the mean error of the a r i thmet ic mean is
the mean error of one observation divided by the square root of the
number of observations.
We have seen that if the t rue value of the measured quanti ty were
known, the mean error of one observation would be given by (8). T h e
t rue value is, however, not known; we have only the estimate given
by (7). After this estimate is obtained we can subtract it from each of
the measured values; the differences are called the residuals v. Let us
T h u s the preferred est imate of a is the ari thmetic mean of the
observations. T h e preferred estimate of a may be found from
whence
Ufa -a)2. (8)
and
WEIGHTS 215
whence
x = a -
and
Vj = 8j
Now if av is the s tandard deviation of the residuals Vj and oE of the
errors ejt we have
Since a\ = [^2]/w, it follows that
or
T h e n , denoting the mean error of the ar i thmetic mean by fi,
(9)
4 . W e i g h t s of o b s e r v a t i o n s . Usually we are compelled to act as
if the quality of different observations were the same, bu t in general
it is not. Assume that the mean errorof an observation xx is a1 and
that of x2 is cr 2 . T h e n the joint probabil i ty is given by
T h e function to be made a m i n i m u m is
denote the ar i thmetic mean of the x3- by x> as before the t rue error of
Xj by ej9 and the t rue value of the measured quant i ty by <z. T h e n
Vj = Xj — x,
8j = Xj — a.
Denot ing by [x] the sum of the xjy and by [e] the sum of the 8 j y
[e] = [x] — na = nx — na,
T h e quant i ty p, is said to be the weight of the observation xs.
5. I n d i r e c t m e a s u r e m e n t s . As was ment ioned in Section 1, the
usual cases in as t ronomy are not those where the quant i ty sought is
measured directly; more commonly the quant i ty measured is some
known function of the quant i ty sought. T h e quant i ty sought is usually
a constant whose value is already approximately known. In the simplest
case only one quant i ty is sought ; let us call this quant i ty x and the
measured quanti ty y = f(x). Suppose we have n observations y j y
j = 1, 2, n9 from which it is required to deduce the preferred value
of x. Denote the approximately known value of x by xc, the value to
be determined from the observations by x0> and the difference x0 — xc
by Ax. T h e procedure is to calculate y from xc for each observation,
obtaining y c y and then to form the differences A5y = yi—yc. Since
in general y is a function of other parameters as well as of x9 the quant i ty
y c varies from observation to observat ion; to show the variation we may
write Aij = yj—yic.
Now Ay may be expressed as a function of Ax by T ay lo r ' s expansion
(id
216 LEAST SQUARES
(10)
If we pu t px = \\o\ and p2 = \ja\y then
or, for n observations,,
I t is assumed that Ax is so small that its square and higher powers
are negligible, whence, with sufficient accuracy,
so we may find for each observation an equat ion of the form
whence
EQUATIONS OF CONDITION 217
where the quant i ty in parentheses is a known number . Equat ion (11)
is called an equat ion of condition, and Ax is called an unknown.
For brevity let us wri te (11)
aAx = Ay. (12)
As in previous sections, the condit ion for the preferred value of Ax is
d
dAx
or
E(a0Ax-Aoyf = 0,
[a2Ax] = [a Ay], (13)
whence
(14)
and finally x0 is obta ined by
x0 = xc+ Ax.
Suppose tha t of the n equations of the form (12), supposed of equal
weight, it happens that for q of them, a is a constant . T h e n these q
equat ions contr ibute to (13) a por t ion
qa2Ax = a[Ay] = Vq a[Ay]/Vq-
Comparison with (12) shows that we may write an equation of condi t ion
Vq aAx = [Ay]/Vq = Vq [Ay]/q,
or
Vq aAx = Vq Ay, (15)
which shows that the q equations may be replaced by the single equat ion
(15), Ay being the mean of the q values of Ay. I n this case the equat ion
aAx = Ay
is said to have the weight q, and it is to be mul t ip l ied by V~k before
use wi th the other n — q equat ions to obtain (13) and (14).
6. E q u a t i o n s o f c o n d i t i o n . In the general case it is desired to
determine not one bu t n unknowns from a series of observations. Le t
the observed quant i ty , such as a spherical coordinate of a body, be
denoted by x, and let Ax denote the discrepancy observed x minus
computed x (O — C) between any observation and the corresponding
Ax =
218 LEAST SQUARES
Assume for the momen t tha t Ax is not affected by errors of the first
two classes ment ioned in Section 1; then it is composed entirely of
contr ibut ions from {jjm T h e partial derivatives appearing as coefficients
in (16) can be calculated and expressed as n u m b e r s ; three significant
figures are almost always sufficient. T h e unknowns of the equations are
the which are sought to be determined. I t is known from the theory
of linear equations that n equations of the type (16) are a necessary
and sufficient n u m b e r for the determinat ion of the provided that
the equations are all independent , tha t is, provided that the determinant
of the coefficients on the left-hand side does not vanish. If the equations
are independent , values of the ^ can be found that will exactly satisfy
the n equations, and if the observations were not affected by errors
nothing more would be required. T h e presence of accidental errors in
the r ight -hand members of the equations, however, prevents the t rue
values of the ^ from being de te rmined; only approximate values can
be found, which will be nearer to or further from the t ru th according
as the errors of the observations are smaller or greater. T h e effect of
the accidental errors may be diminished by increasing the n u m b e r of
observations, and the n u m b e r of equat ions. In astronomical problems
the number of equat ions used is seldom less than 2n and it is often
m u c h greater ; when the highest precision is wanted several hundreds ,
or even thousands , of observations may be employed for the determina-
tion of a few unknowns .
W h e n the n u m b e r of equat ions is greater than the n u m b e r of un -
knowns, then in view of the errors of the observations, no exact solution
exists; it is not possible to find values of the ^ that exactly satisfy all
of the equations. If any particular values of the ijj are subst i tuted into
the left-hand members of the equations and the results severally sub-
t racted from the Ax, the remaining number s are called residuals "after
the solut ion." As has been seen earlier in this chapter the values of the
to be preferred are those that render the sum of the squares of these
residuals a m i n i m u m ; this is the principle of least squares.
I t is not immediately obvious how the equations are to be solved in
order to obtain the desired m i n i m u m sum of the squares of the residuals;
computed value. Let ch j = . 1, 2, n denote the assumed values
of the constants used in calculating the computed values, and let the
corrections to the assumed values of the constants be denoted by
so that the corrected value of any constant is c, + T h e n the equat ions
of condit ion, derived in the same way as (11), may be writ ten
L=Ax- (16)
NORMAL EQUATIONS 219
the remainder of this chapter is devoted to the exposition of an
expeditious method and, equally important , to a method for es t imat ing
the uncer ta inty in the values of the unknowns as they are finally obtained.
7* W e i g h t s of e q u a t i o n s . Every equation of condit ion may be the
result of a single observat ion; in this case if the several observat ions
are assumed to be of equal precision the equations are said to be of
equal weight. On the other hand an equat ion may express the result
of several observations, or the precision of the different observations
may be known to vary; in either of these cases it may be desirable to
assign different weights to the equat ions. If the precision of the different
observations is the same, then the weight of an equat ion may be taken
as proport ional to the n u m b e r of observations that it depends on. If
the probable errors of the observations are known varying quanti t ies ,
then the weight of a single observation should be taken to be inversely
proport ional to the square of the error. T h u s the w e i g h t y of an equat ion
based on q observations having probable errors el9 e2, eqy is given by
where k is a constant , to be chosen so as to make the weights of the
different equations vary over a convenient range, say from 1 to 9, or
from 0.1 to 0.9. I t is seldom justified to keep more than one significant
figure in p , and almost never more than two.
T h e value of k being fixed and the e3- known, then the probable error
of uni t weight, often wri t ten p.e. 1, is k1/2.
I t often happens , however, that the probable error of an observation
is not known in advance bu t mus t be derived from the equations
themselves. In such cases it may be convenient to assign a weight of
unity to each observation, in which case the weightof an equat ion is
equal to the number of observations that it depends on.
W h e n the equations of condition are of unequal weight, then the result
to be preferred is not that which renders the sum of the squares of the
residuals a m i n i m u m ; instead it is required to minimize the sum of
the squares of the numbers obtained by mul t ip ly ing each residual by
the square root of its weight.
8. F o r m a t i o n of n o r m a l e q u a t i o n s . Suppose that there are
m equations of condition involving n unknown quanti t ies , m > n\ then
the least-square criterion may be satisfied by combining the m equat ions
into n normal equat ions as was indicated in Section 5, and solving the
220 LEAST SQUARES
normal equations. Instead of mult iplying each equat ion by the square
root of its weight, it may be preferable to operate with the weight
itself, organizing the calculation so as to produce the same result as
if the square root had been used. I n this case, mult iply each equation
by its weight, and calculate the sum of all the numerical quanti t ies
entering into each equation. T h e result may be wri t ten according to
the scheme of Table 2, where for convenience the literal quanti t ies and
the sign of equality are placed at the top instead of in the equations.
All of the symbols below the line denote n u m b e r s ; the partial derivatives,
denoted by a for brevity, are pure numbers , while the Ax of Eq. (16),
denoted by C, will usually be expressed in seconds of arc, degrees, or
radians. T h e mixing of dimensions in the 2 will not cause any difficulty.
T A B L E 2
SCHEME FOR C O N D I T I O N A L EQUATIONS
u = C z
a l n Cx * 1
Pia12 Pl&lZ Pl^ln PA
a 2 1 a 2 2 a 2 n c 2
p2a21 p2a22 p2a2Z P2a2n p2C2 P2%2
aml a m n cm
Pmaml Pmam1 Pmam3 Pmamn Pm^m
T h e first normal equation, called the normal equation in £jl9 is obtained
by mult iplying the first of the original equations by pxall9 the second
by p2a2v t n e t h i rd by p3a31) etc., and forming the sum of the resulting
m equations. T h e normal equation in £ 2 is obtained by mult iplying the
first of the original equations by pid129 the second by p2^22> e t c - > a n d
summing the result ing m equat ions. Repeating this process n t imes
yields the n normal equat ions. T h e coefficients in the normal equations
should actually be calculated one at a t ime, by accumulat ing the partial
products . T h u s , the coefficient of £ 2
m the normal equation in £ x is
P\AWa\2 + P2a2\a22 + — + Pm^ml^m2>
and the absolute te rm in the normal equation in | 3 is
Plal3Cl + P2a2^2 + — + PmamBCm'
NORMAL EQUATIONS 221
I t will be observed that the quantit ies to the left of the sign of equality
in the normal equations form a square symmetr ic array about the
diagonal from upper left to lower r ight . T h u s the number s below and
to the left of the pr incipal diagonal need not be calculated.
T h e same operat ions performed on the summat ion as on the other
number s const i tute a valuable check on the formation of the normal
equations. Each result should be the sum of the n u m b e r s formed by
proceeding to the left as far as the principal diagonal, then u p the
column. If the equat ions of condi t ion are more than ten in n u m b e r it
will usually be worthwhile to carry one more decimal in the normal
equations than was present in the equations of condi t ion; if of the order
of a hundred , two extra decimals may be retained.
9. N o r m a l e q u a t i o n s . T h e normal equations may be wri t ten as
in Table 3, where the same notat ion as was used for the equations of
condi t ion is employed for the new number s .
T A B L E 3
N O R M A L EQUATIONS
tl £ 3
L = c 2
011 alz aln c t
« 2 n c 2
033 aZn c 3 £ 3
c„
T h e r e are now n equat ions in n unknowns to be solved, and the
symmetry produced by the method used to obtain t hem simplifies their
solution.
I t is known from the theory of equations tha t a system of n independent
linear equations in n unknowns has a un ique solution if, and only if,
the de terminant of the coefficients is different from zero. I n this system
of equations the de terminant of the coefficients is
a i \
a\2 a i z ••• a m
a21 a22 a23 ••' a2n
a31 a32 a33 a3n
anl an2 an3 ••• ann
222 LEAST SQUARES
and a.u = aH. T h e value of any unknown £t- is the quotient of two
determinants . In this quot ient the denominator is the de terminant of
the coefficients, wr i t ten above, and the numera tor is a de terminant that
may be obtained from the other by subst i tut ing for the zth column the
column of C 's . T h e problem is to find the most efficient method of
evaluating the quot ients of these various determinants .
Al though in a strict mathematical sense a solution may always be
obtained if the denominator determinant is different from zero, it often
happens in practice that this de te rminant , while not zero, is a very
small quanti ty, that is, small in comparison with the product of the
quantit ies in the principal diagonal of the de terminant . For example,
if the principal diagonal consists of numbers of the order of unity, then
0.01 is a small quant i ty . If this is the case, and if the numerators are
not also small, the physical interpretat ion is simply that the original
observations are not well suited to determine the quantit ies sought, as
for example when a small angle is sought to be de te rmined from its
cosine. The re will be a loss of significant figures in the results, and the
computer ' s first impulse will be to overcome this defect by carrying
additional decimals in the equations. By this means a formal solution
may be obtained to any desired n u m b e r of significant figures, bu t the
labor is likely to be wasted so far as any physical significance is concerned.
Th i s will be indicated by the large probable errors of the unknowns .
T h e computer will find it worthwhile to estimate the errors before
increasing the n u m b e r of significant figures in the equat ions, and
thereby he will often save himself m u c h useless work.
Another condition frequently arising in practice is that bo th the
denominator de terminant and some or all of the numera tor determinants
are small quanti t ies. In this case the solution is said to be nearly
indeterminate , and this happens when two or more of the unknowns
are strongly correlated; that is, a certain range of solution exists, and
the equations will be nearly satisfied for a considerable range of values
of the correlated unknowns , provided that these values have a certain
linear relation to each other. I t will generally be advisable in such cases
not to a t tempt to solve for the correlated unknowns separately, bu t to
solve instead for the linear combinat ion of them, and for only one of
a pair of the correlated unknowns themselves. N o real information is
lost by this procedure, and the computer will be saved a great deal of
labor. In addition, the physical significance of the resul ts is much more
readily seen.
These refinements of the work are discussed later. Firs t is given a
formal solution of the normal equations.
FORMAL SOLUTION 223
10. F o r m a l s o l u t i o n . T h e solution of normal equations has
received the attention of many writers, as is natural for a subject of
widespread application. T h e theory is simple and easily unders tood, and
will be found in almost any book on the theory of equations. T h e problem
is the arrangement of the actual calculations in such a form as to require
few numbers to be wri t ten, to make accessible all of the per t inent
information contained implicitly in the equations, and to provide
adequate checks on the numerical operat ions. T h e method given here
is the result of the jo int efforts of many expert computers at different
t imes. I t seems first to have been conceived in principle by M . H .
Doolittle about 1878, bu t the details have beengreatly altered and it
is not possible to credit the method as it now stands to any one person.
In accordance with the principle tha t no solution is of m u c h value
without an estimate of the errors involved, no a t tent ion is given to any
method lacking this quality. T o avoid cumbrous notat ion, a literal
solution is given in T a b l e 4 for the case of four unknowns . T h i s is
followed by a numerical example. T h e principle, once apprehended,
can easily be extended to any number of unknowns .
T A B L E 4
SOLUTION OF N O R M A L EQUATIONS
d iz £ 4
c 2 c 3
c 4 z
"ll «12 «13 014 1 0 0 0 Si
N2 <*22 «23 «24 0 1 0 0 St
033 034 0 0 1 0
Nt 044 0 0 0 1
Ni 013 014 1 0 0 0 Si
Ei 1 6» &13 ^14 1/011 0 0 0 Zilan
0*23 ^24 1 0 0
E, 1 &23 &24 «l/<*22 1/^22 0 0 ^2lld22
0*33 0*34 fx ft 1 0 £ 3 1
E3
1 &34 fi/dn fz/dn 1/0*33 0 ^zifd^z
di4t gi £ 3 1
Et
1 gi/du £3/0*44 1/0*44 ^41/0*44
sa
1 $41 $42 $43 s4i ^44
s3
1 $31 $32 $33 s3i
1 $21 $22 $23 $24 ^24
224 LEAST SQUARES
First write the normal equations in order, bu t wri te the absolute
terms along the top in a row instead of down the column. Below each
absolute te rm write 1 on the line containing the equation to which it
belongs, and 0 on the other lines. Wri te the check-sums in the r ight -hand
column. T h e y are different from those obtained in forming the normal
equations, and are
2\ = aU + #12 + #13 + au + 1>
E2 = a12 + a22 + a23 + a24t + 1,
^ 3 = #13 + #23 + #33 + #34 +
^ 4 = #14 + #24 + #34 + #44 + ! •
1. Wri te the first normal equation in the next row.
2. Mult iply every n u m b e r in row N1 by l/an to get row Ev and
verify the check-sum, placing a check-mark beside it. Th i s is the
equation by which £ x will be determined after £ 2 , £4 a r e known; it
is often called the elimination equat ion in
3. Cover up the upper line Nx wi th a strip of paper and calculate
#*22 = #22 ^12#12» e i ~ ^12»
^23 = #23 ^12#13> 221 = E2 b12Uv
^24 — #24 ^12#14>
verifying the check-sum.
4. Mult iply every number in the row just wri t ten by l/d22 to get row
Z?2, and verify the check-sum.
5. Cover up lines N1 and iV 2 and calculate
^33 = #33 ^13#13 ^23#*23> f l = = ~ b13 ^23^1 >
^34 = #34 ^13#14 ^23#*24» j2 = ^23>
^ 3 1 = 2Z — b13E1 — &23^21>
verifying the check-sum.
6. Mult iply every number in the row jus t wri t ten by l/d33 to get row
E3, and verify the check-sum.
7. Cover up lines Nlf N2, and iV3, and calculate
du = au — buau — b2id2^ — bMdM,
Si = — b u — b2Aei — bzJl>
g2 = — b2A — h\f*
274 1 = 274 buEx bME21 b3AE3V
F O R M A L S O L U T I O N 225
8. Mul t ip ly every n u m b e r in the row jus t wri t ten by l / d 4 4 to get row
E4 and verify the check-sum.
T h e steps from 1 th rough 8 consti tute what is known as the forward
solution. I t will be observed that line £ 4 gives the value of £ 4 in t e rms
of Cl9 C 2 , C 3 , and C 4 . I n the back solution similar expressions are
obtained for the other unknowns . In the scheme given, the line £ 4 is
copied on the line below and is designated as 5 4 , bu t this is done here
for convenience in notation, and in practice no advantage will be gained
by wri t ing the line twice.
9. T h e line S3 is obtained by
S31 = flld33 — ^41^34*
S32 = fJd33 — ^42^34»
S33 = I M T T — ̂43^34*
^34 = — ^44^34»
^34 - 2 ^
10. T h e line S2 is obtained by
^21 — eild22 %^23 %^24>
^22 = V ^ 2 2 — 532^23 542^24>
% = — ^33^23 %^24>
$24 = — ^34^23 ^44^24*
^24 = S21jd22 -- ^34^23 "~ ^44^24
11. T h e line Sx is obtained by
hi = l/an - %^12 ~~ •̂ 31̂ 13 %^ 1 4,
^12 = — ^22^12 ^32^13 ,y42^14>
S13 = — 523^12 — •̂ 33̂ 13 ^43^14*
SU = — •̂ 24̂ 12 •̂ 34̂ 13 ^44^14?
^14 - ^24^12 " ~ ^34^13 - ^ 4 4 <
I t will be observed that the quanti t ies s 3 4 , s 2 3 , su, s12, s 1 3 , s 1 4 are
severally identical with s 4 3 , s32, s 4 2 , s21, s 3 1 , sA1. Hence they need not be
calculated, bu t may be filled in by inspection. T h e check-sums should
be verified at each stage of the calculations.
226 LEAST SQUARES
12. I t now remains to calculate the values of the unknowns from
and to tabulate t hem with the factors which, being mult ipl ied by the
probable error of uni t weight, give the probable errors of the unknowns .
These factors are severally s]1*, s^ 2 , s^, s1^.
13. T h e values of the unknowns should now be subst i tuted into the
normal equations, which should be exactly satisfied, except for errors
of rounding.
14. Next subst i tute the values of the unknowns into the original
equations and calculate the residuals Vj. If the v5 have a Gaussian
distr ibution, then the probable error of uni t weight is given by
and the probable error of any unknown is obtained by mult iplying
by the factor previously calculated.
If the law of distr ibution of the residuals is not Gaussian, this is good
evidence that the observations were affected by some systematic effect
in addit ion to those designated ijly £2, gn. I n this case the probable
errors calculated as jus t described may be entirely illusory, and the
physical interpretat ion of the unknowns themselves is open to doubt .
T h e doubt will be the more serious, the greater the depar ture of the
residuals from a Gaussian distr ibution.
I t will be noticed that if the n u m b e r of equations of condition is
equal to the n u m b e r of unknowns (m = n), the probable error cannot
be calculated; the equations will be exactly satisfied and p.e. 1 will be
of the form 0/0. If m exceeds n by a factor of less than 2 then the
probable error is bu t weakly determined, and is entitled to little con-
fidence.
11. N u m e r i c a l e x a m p l e . T h e following example is given as an
illustration of the content of this chapter . I t should be worked th rough
as an exercise by any computer who is using the scheme for the first t ime.
A set of equations of condition, with their weights, is given in Table 5.
T h e v's are the residuals determined after the solution.
T h e first thing to be noted is that the coefficients are of different
orders of magni tude, those of £ 4 being about ten t imes the others. T h e
Sll^l S12^2 S13^3 ~t~ ̂ 14^4»
^21^1 ^22^2 ^23^3 ~f~ ^24^4>
S31^1 " I " S32^2 "f" S33^3 ~f~ ^34^4*
$41 ̂ 1 ~f~ $42^2 H~ $43^3 "f" J44^4>
p.e. 1 = 0.6745
E X A M P L E 227
p fi £ 2 £ 3 £4 = C V
1 + 1.155 + 1.507 + 2.622 — 12.29 + 9'.'16 + 0f'79
2 + 2.417 + 0.833 + 4.702 + 14.62 + 12.43 — 0.44
1 + 0.931 + 1.904 4- 1.554 + 6.10 + 5.48 + 0.41
1 + 2.955 + 0.725 + 3.324 — 10.48 + 6.98 - j - 0.44
1 4- 1.164 + 1.820 + 0.949 + 11.75 + 1.60 — 0.24
3 + 1.188 + 0.828 — 0.016 + 11.48 — 2.37 + 0.44
3 + 1.199 + 1.735 — 0.647 — 12.11 — 4.49 — 0.40
1 + 0.947 + 1.854 — 1.318 — 6.38 — 6.30 — 0.10
numerical work to follow will be simplified if they are reduced to
approximately the same size. Accordingly the several columns are
mult ipled by the factors
0.34 0.53 0.21 0.068 0.080,
each number being approximately the reciprocal of the largest n u m b e r
in the corresponding column above; this amounts to the in t roduct ion
of new unknowns r]v rj2y rj3y rjA such that
f! = 4.25 r]ly £ 3 = 2.62 773,
£ 2 = 6.62 t ? 2 , £ 4 = 0.85 t ? 4 .
T h e new equat ions of condition are given in Tab le 6. T h e product
of each equation by its weight, whenever the weight differs from uni ty,
T A B L E 6
TRANSFORMED EQUATIONS OF C O N D I T I O N
^3 C 2
+ 0.393 + 0.799 + 0.551 — 0.836 + 0.781 4- 1.688
+ 0.822 + 0.441 + 0.987 + 0.994 + 0.994 4- 4.238
+ 1.644 + 0.882 + 1.974 + 1.988 + 1.988 4- 8.476
+ 0.317 + 1.009 + 0.326 + 0.415 4- 0.438 + 2.505
4- 1.005 + 0.384 + 0.698 — 0.713 + 0.558 4- 1.932
+ 0.396 + 0.965 + 0.199 4- 0.799 + 0.128 4- 2.487
+ 0.404 + 0.439 — 0.003 + 0.781 — 0.190 + 1.431
+ 1.212 + 1.317 — 0.009 + 2.343 — 0.570 4- 4.293
4- 0.408 + 0.920 — 0.136 — 0.823 — 0.359 + 0.010
+ 1.224 + 2.760 — 0.408 — 2.469 — 1.077 + 0.030
+ 0.322 + 0.983 — 0.277 — 0.434 — 0.504 4- 0.090T A B L E 5
EQUATIONS OF C O N D I T I O N
228 LEAST SQUARES
is given immediately below the equation, and the check-sums have been
added.
T h e normal equations and their solution appear in Tab le 7, according
to the scheme described. T h e check-sums used to verify the formation
of the normal equations are not pr inted.
T A B L E 7
N O R M A L EQUATIONS AND SOLUTION
Cx c2 c 3 c 4
+ 1 . 8 5 9 + 0 . 5 4 4 + 3 . 2 3 8 + 1 . 8 6 9 2
Nx + 3 . 8 6 6 + 4 . 1 0 2 + 2 . 4 6 4 + 0 . 8 3 6 1 0 0 0 + 1 2 . 2 6 8
N2 + 7 . 2 0 8 + 1 . 4 4 8 — 0 . 5 4 5 0 1 0 0 + 1 3 . 2 1 3
N, + 3 . 0 1 7 + 1 . 7 4 7 0 0 1 0 + 9 . 6 7 6
N* + 8 . 0 4 4 0 0 0 1 + 1 1 . 0 8 2
Nx + 3 . 8 6 6 + 4 . 1 0 2 + 2 . 4 6 4 + 0 . 8 3 6 1 0 0 0 + 1 2 . 2 6 8
Ex 1 + 1 . 0 6 1 + 0 . 6 3 7 + 0 . 2 1 6 + 0 . 2 5 9 0 0 0 + 3 . 1 7 3
+ 2 . 8 5 6 — 1 . 1 6 6 — 1 . 4 3 2 — 1 . 0 6 1 1 0 0 + 0 . 1 9 7
E2 1 — 0 . 4 0 8 — 0 . 5 0 1 — 0 . 3 7 1 + 0 . 3 5 0 0 0 + 0 . 0 6 9
+ 0 . 9 7 2 + 0 . 6 3 0 — 1 . 0 7 0 + 0 . 4 0 8 1 0 + 1 . 9 4 2
E, 1 + 0 . 6 4 8 — 1 . 1 0 1 + 0 . 4 2 0 + 1 . 0 2 9 0 + 1 . 9 9 8
+ 6 . 7 3 8 — 0 . 0 5 4 + 0 . 2 3 7 — 0 . 6 4 8 1 + 7 . 2 7 2
E, = S, 1 — 0 . 0 0 8 + 0 . 0 3 5 — 0 . 0 9 6 + 0 . 1 4 8 + 1 . 0 7 9
s, 1 — 1 . 0 9 6 + 0 . 3 9 7 + 1 . 0 9 1 — 0 . 0 9 6 + 1 . 2 9 9
st
1 — 0 . 8 2 2 + 0 . 5 3 0 + 0 . 3 9 7 + 0 . 0 3 5 + 1 . 1 4 0
Sx 1 + 1 . 8 3 1 — 0 . 8 2 3 — 1 . 0 9 5 — 0 . 0 0 8 + 0 . 9 0 3
rj1 = - 0.604 ± 1.35 p.e. 1„, ^ = - 2757 ± 0.46 p.e. 1^,
V 2 = + 0.111 ± 0.73 p.e. 1„, £ 2 = + 0:'73 ± 0.39 p.e. 1^,
773 = + 1.532 ± 1.04 p.e. 1„, f3 = + 4701 ± 0.22 p.e. 1 € >
^ 4 = _ 0.030 ± 0.38 p.e. lv, f4 = - 0.'026 ± 0.026 p.e. l s t .
T h e values of the ?/s are subst i tuted into the normal equat ions, which
are found to be satisfied within the errors of rounding. T h e values of
the f's are then subst i tuted into the original equations of condition,
giving the residuals v. T h e sum of the squares of the z/s, each square
being multiplied by its weight, is 2.50, whence
p.e. l f i = 0.6745 (2.50/4) 1 ' 2 = ± 0. , ,53.
COMBINATIONS OF UNKNOWNS 229
Of the eight residuals it is seen that seven are smaller than the probable
error of uni t weight, which is not significant, because the residuals are
not all of uni t weight. T h e probable size of a residual of weight p is
± W h e n the residuals are compared wi th their probable sizes
it is found that four are larger and four smaller in absolute value. In
addit ion half of the residuals are positive and half negative, and none
is as large as twice its probable size. All of these condi t ions are more
favorable than are often encountered, and they indicate that the results
are entit led to m u c h confidence.
T h e final results are
fx = - 2."57 ± 0:'24, & = + 4"01 ± ( H 2 ,
£ 2 = + o:73 ± 0:'21, f4 = - 07026 ± 0:'014.
12. C o m b i n a t i o n s of t h e u n k n o w n s . I t is not infrequently
desired, after making the solution, to combine two or more of the
unknowns . For example, in correcting a nearly circular orbit , it may
be advantageous to use as unknowns A (e sin &) and A (e cos co), e and
w being the eccentricity and longitude of per icenter ; and it is desired
afterward to find Ae and Acoy wi th their probable errors . N o difficulty
is likely to occur in combining the unknowns themselves, bu t the
method of obtaining the probable errors is not immediately obvious.
Le t it be required to determine the probable errors of two quanti t ies ,
u and v, which are known functions of £ x and f2. T h e probable error
of u is given by
and the probable error of v by a similar expression.
Often the probable error of a linear combinat ion of the unknowns
is required. For example, suppose
" = *lf 1 + * 2 & +
the k being constants . T h e n the probable error of u is given by
p . C 1 | + $22^2 H~ ^33^3 2$i2^i^2 ^-S13^l^S H~~ 2 $ 2 3 ^ 2 ^ 3 ) 1 / 2 >
I t may occasionally be desired to find what linear combinat ion of two
unknowns , say £ x and £ 2 , is de termined with the smallest probable e r ror ;
that is, it is desired to find values of kx and k2 that will render the
expression snk\ + s 2 2& 2 + 2s12k1k2 a m i n i m u m . T h e problem as stated
is indeterminate , because kx and k2 may always be chosen so as to make
230 LEAST SQUARES
the probable error of + k2t;2 as small as we please. However, if
either of the k is fixed, then the problem becomes determinate . Suppose
kx — 1. T h e n the expression to be minimized is S-Q -\- S22k2 ~f~" 2s 1 2 ^2>
which is a m i n i m u m when k2 = — s12/s22.
In the preceding numerical example let us find what combinat ion of
£ x and | 2 is de termined with the smallest probable error. Taking k2 = 1,
and working first with the rj's, we have kx = 0.823/1.831 = 0.45 and
the combinat ion is 0 . 4 5 ^ + rj2 = — 0 . 1 6 1 . T h e probable error of
this combination is
p.e. 1, [1.831 (0.45)2 + 0.530 ( l ) 2 - 1.645 (0.45)] 1 / 2 = p.e. l r / (0.40).
W e now have
0.45 i h + i ? a = 0 . 1 0 6 f 1 + 0 . 1 5 1 f a ,
which is determined with a probable error of
p.e. ltj (0.40) = p.e. l f (0.40) (0.080) = p.e. 1,(0.0320),
whence 0 . 7 0 ^ + f 2 = — 1.07 is determined with a probable error
of p.e. 1 | (0.21) or 0711, which is considerably less than the probable
errors of either £ x or £ 2 .
13 . C o r r e l a t i o n s . Whenever a linear combinat ion of two unknowns
can be determined with a smaller probable error than either one of t hem
separately, the unknowns are said to be correlated. T w o unknowns £ x
and £ 2 are correlated whenever the cross t e rm s12 is different from zero,
or equivalently in most cases, when the cross te rm a12 of the normal
equations is different from zero. Correlations are the rule rather than
the exception in least-square solut ions; in the numerical example of this
chapter correlations exist between every pair of unknowns .
Correlations are of different degrees of magni tude . T h e stronger the
correlation, the larger the cross te rm a{j relative to ( a , ^ ^ ) 1 / 2 . F r o m the
way in which the normal equations are formed it is not possible for
to exceed (auajj)1,2
9 bu t it may approach it very closely. In such
cases difficulty will be encountered with the solution, which is nearly
indeterminate ; dur ing the numerical work it will be found that one of
the numbers d22, rf33, du becomes very small, which causes a loss of
significant figures in the elimination equations. T h e computer ' s first
impulse may be to repeat the whole calculation with additional decimals,
and a formal solution may always be obtained in this way, often at
t remendous expense of t ime and energy. Both the correlated unknowns
will, however, have large probable errors and therefore little physical
C O R R E L A T I O N S 231
meaning. I t is often possible to avoid the difficulty, wi thout the need
for extra decimals, by solving for only one of the correlated unknowns
directly, subst i tu t ing for the other a linear combinat ion of the two. T h e
best combinat ion (that is, the one determined with the smallest probable
error) cannot be de termined in advance of making the solution, and when
possible it is usually preferable to make the solution first, then to
determine the best combination to use, and to repeat the solution with
the new unknowns . A fair approximation to the best combinat ion can
often be obtained by such a choice that the cross t e rm in the normal
equations vanishes, which may be done before any solution is made,
in the following way. If the unknowns in question are £ x and £ 2 ,
subst i tute for £ x in the solution £ x + a12/an | 2 . T h i s subst i tut ion
requires all of the coefficients in the normal equat ion for £ 2 to be ap-
propriately modified by the following subst i tut ions:
for a12 write 0,
for a22 write a22 — {al2fjally
for a23 write a 2 3 — (a12/au)a13,
for a 2 4 write tf24 — (a12lan)aUi
for C2 write C 2 — (d12lan)Clyfor E2 write Z2 ~ (a12/au)I]v
T h e normal equation for ^ remains unchanged, with the exception
of the coefficient a12, and it becomes the normal equat ion for
Applying this principle to r)1 and rj2 of the numerical example, we
have as the combinat ion to be subst i tuted for rjly
Vi + ^(4.102/3.866) = V l + l.Oftfc.
Alternatively, we may subst i tute for rj2y
77X4.102/7.208) + V 2 = 0 .57^ + V t .
I t is to be noted that neither of these combinat ions is the one
determined with the smallest probable error, which can be determined
only after the solution is made ; in the present example nothing would
be gained by such a subst i tut ion, because the correlations are not very
strong.
232 LEAST SQUARES
14. N o r m a l p l a c e s . If several observations are taken dur ing a
sufficiently short interval of t ime the equations of condit ion will vary
bu t little over the interval. In such cases it is advantageous to average
the separate residuals O — C, and to form a single equation of condit ion
for the mid-epoch of the observations, giving it a weight equal to the
number of observations. A residual O — C, obtained from several
observations so combined, is called a normal place. N o explicit rule
can be given for the formation of normal places; experience and j udge -
ment mus t play a part . Such a combinat ion of observations is, however,
always permissible if the residuals in question can be regarded as varying
uniformly with the t ime. In addition, if the observations are very
numerous it may be worthwhile to fit a parabola to the residuals, and
to use the constant t e rm of the parabolic expression as the absolute
t e rm of an equat ion of condit ion.
N O T E S A N D R E F E R E N C E S
T h e m e t h o d of least squares b e l o n g s to t h e m o r e genera l s u b j e c t o f t h e ca lcu lus
of observat ions . A g o o d e l e m e n t a r y b o o k is D a v i d B r u n t , The Combination of
Observations, C a m b r i d g e U n i v . P r e s s , L o n d o n a n d N e w York , 1 9 3 1 . A s tandard
treatise is E . T . W h i t t a k e r a n d G. R o b i n s o n , The Calculus of Observations, B lackie ,
L o n d o n , 1924 .
T h e m e t h o d of s o l v i n g the n o r m a l e q u a t i o n s g i v e n i n th i s c h a p t e r is o f t e n ca l l ed
an e l i m i n a t i o n m e t h o d , b e c a u s e in t h e forward s o l u t i o n t h e u n k n o w n s are s u c c e s s -
ive ly e l i m i n a t e d to t h e p o i n t w h e r e o n e e q u a t i o n in o n e u n k n o w n r e m a i n s . O t h e r
m e t h o d s e n j o y i n g p o p u l a r i t y i n s o m e appl i ca t ions are k n o w n as s q u a r e - r o o t
m e t h o d s a n d re laxat ion m e t h o d s . T h e y appear to p o s s e s s n o advantages for t h e
t y p e of app l i ca t ion d i s c u s s e d h e r e , a n d re laxat ion m e t h o d s in part icular m a y
lead to w o r t h l e s s resul t s in s o l u t i o n s w h e r e s o m e of the u n k n o w n s are s t rong ly
corre lated . C o n s i d e r for e x a m p l e a case w h e r e t w o u n k n o w n s , say # a n d y, are s t r o n g -
ly correlated. T h e e q u a t i o n s wi l l b e satisfied over a cons iderab le range of va lues o f
x a n d y, b u t n o t if x a n d y are a l l o w e d to vary o v e r the ir ranges i n d e p e n d e n t l y
of e a c h other; c o r r e s p o n d i n g to a n y p e r m i s s i b l e v a l u e of x there is o n e a n d o n l y
o n e va lue of y sat i s fy ing t h e e q u a t i o n s . T h e e l i m i n a t i o n m e t h o d , b y d e t e r m i n i n g
the u n k n o w n s in s u c c e s s i o n , m a k e s t h e v a l u e of x, say, d e p e n d o n t h e p r e v i o u s l y
a s s igned va lue of y a n d preserves t h e neces sary relat ion b e t w e e n t h e m , w h e r e a s
t h e relat ion is n o t p r e s e r v e d w i t h re laxat ion m e t h o d s ; w i t h t h e m it m a y b e f o u n d
at the e n d that the resul ts d o n o t satisfy t h e e q u a t i o n s .
T h e p r e c e d i n g remarks are s o m e t i m e s o b j e c t e d t o o n t h e g r o u n d that c o n s i s t e n t
va lues of x and y h a v e n o m o r e "phys i ca l real i ty" t h a n t h e i n c o n s i s t e n t o n e s
o b t a i n e d b y re laxat ion. T h e o b j e c t i o n has s o m e w e i g h t . W h a t is o v e r l o o k e d
is that t h e c o n s i s t e n t v a l u e s of x a n d y are b e t t e r s u i t e d t h a n o thers for u s e in
forecas t ing future o b s e r v a t i o n s , a n d also for ver i fy ing t h e a r i t h m e t i c o f t h e
so lu t ion .
CHAPTER I X
THE DIFFERENTIAL CORRECTION OF ORBITS
1. I n t r o d u c t i o n . T h e methods presented in Chap te r s I and I I
serve to calculate the position at any t ime of a body moving in an elliptic
orbit the elements of which are known. For a body in the solar system,
the observed data from which the knowledge of its location is obtained
are
t, oc, 8,
i.e., the t ime of the observation and the geocentric spherical coordinates,
r ight ascension and declination. I t should further be specified to what
equator and equinox these coordinates are referred.
Let x, y, z be the heliocentric coordinates of the object and X, Y, Z
the geocentric coordinates of the sun, referred to the mean equator and
equinox of 1950.0. T h e geocentric coordinates of the object are
x + X, y + Y, z+Z.
Let p be the geocentric distance. T h e geocentric right ascension oc and
declination S are related to the geocentric equatorial coordinates by the
formulas
x + X = p cos oc cos 8,
y + Y = p sin a cos 8,
z + Z = p sin 8.
T h e observed a, 8 at t ime t are not immediately comparable with the
geocentric oc, 8 that appear in these formulas ; appropriate corrections
for geocentric parallax, aberration, etc., mus t be applied to the
observations or to the rectangular coordinates (see Chapters VI and VII ) .
Le t Ax, Ay, Az be small corrections to the heliocentric coordinates
of the object, AX, AY, AZ those to the geocentric coordinates of the
s u n ; then
Ax + AX = — p sin oc cos SAoc — p cos oc sin SJS + cos oc cos SAp,
Ay -{-AY = + p cos oc cos SAoc — p sin oc sin SJS + sin oc cos SAp,
Az +AZ = + p cos SAS + sin SAp,
233
234 ORBIT CORRECTION
from which it is found that
A* = — l — 1 r [ - sin oc(Ax + AX) + cos oc(Ay + AY)],
p COS 8
AS = i [ - cos a sin S(J* + AX) — sin a sin S(Ay + AY) + cos S(J# + AZ)].
If it may be assumed tha t X, Y, Z are not in need of correction,
the relations between Aoc, AS and the corrections Ac3- to any set of six
elliptic elements are
. ^\ doc dx . -%r\ doc dy .
J« = X ^ ^ + X^^>
A * ^\ dS dx A , ^ dS dy A , ^\ dS dz A
= X r x + X % + X
or
doc _ doc dx doc dy dS _ dS dx dS dy dS dz
dCj dx dCj dy dc3-
9 dc3 dx dc3- dy dc3- dz dc3
I t is more common to deal with cos 8Aoc.
In matrix form these expressions are
dx dy dz
dc1 dc± dcx
dx dy dz
dco dc2 dco
dx dy dz
dc$ dc6 dc%
For numerical calculations this plan recommends itself. T h e r e is an
obvious advantage in spli t t ing the calculation into two par ts and obtaining
the final coefficients by a matrix multiplication. T h e partial derivatives
dx/ dcjy dy/ dc3, dz/ dc3- depend on the heliocentric coordinates and
velocity components of the observed object alone, while the elements
of the second matr ix may be wri t ten explicitly
— p~x sin oc — p~x cos a sin 8
+ p~x cos oc — p~x sin a sin S
0 + p'1 cos 8
doc ~ do
— cos d —
ox dx
doc ~ dS
— cos 8 —
dy dy
0
as
dz
doc ~
—— cos o
dc±
dc
doc
dCa
cos 8
db
dc^
doc ~ dS
cos o ——
dc9
_as_
dCR
(1)
doc
cos 8
dS
dx
cos
dx
doc
8
dS
COS 8
dy
COS
dy
0
dS
0 0
dz
RECTANGULAR COORDINATES 235
T h e y can easily be computed from an approximate geocentric
ephemeris provided the geocentric distance is also furnished. T h e latter
is required in any case in order to compute the correction foraberrat ion
and in some methods for the parallax.
2 . U s e of r e c t a n g u l a r e q u a t o r i a l c o o r d i n a t e s . One of the most
common cases is tha t a table of the heliocentric rectangular equatorial
coordinates at equal intervals is available. F r o m this table the values
of xy y, z and their derivatives for any date may be obtained by numerical
processes explained in Chapter IV. Also, let the elements a, e be available.
Let the t ime interval of the table for x, yy z be w days, and let the
derivatives of x, y, z wi th the uni t of t ime w days be denoted x, y, z.
Fur ther , let n be the mean motion in radians in w days, and t the t ime
in uni ts of w days counted from an epoch chosen arbitrarily. Now
consider how the variations in the coordinates depend on the
variations of the elements. I t will be assumed that these variations are
small so that their squares and products may be ignored.
(a) A change in the mean anomaly l0 at the initial epoch. T h i s is
equivalent to an advance
At = Aljn
of the ephemeris . Hence
Ax = xAt = xAlJn,
Ay = yAt = yAlJn,
Az = zAt = zAl0/n.
(b) A change in the orientation of the orbit. T h i s may be thought of
as the resultant of rotations of the orbit, AI/J1 about the x axis, AI/J2 about
the y axis, AI)JS about the z axis. A positive rotation is considered to
be a rotation that is counterclockwise as seen from the positive direction
of the axis about which the rotation is performed. T h e n
Ax = zAifj2 — yAipz,
Ay = xAip3 — ZAI/JV
Az = yA\jj1 — XAI/J2.
(c) A change in the semi-major axis. T h e effect of such a change is
twofold: a change in scale and a change in the mean anomaly proport ional
to the t ime elapsed since the epoch. T h e first of these is obtained from
Ax _ Ay _ Az _ Aa
x y z a
236 ORBIT CORRECTION
and
x y z 2 a
By combining the two parts , the changes in the coordinates are found
to be
Ax 3 .v Aa
a
Ay = (y -\ty)
Aa
a
Az = (* - | » )
Aa
a
(d) A change in the eccentricity. T h e most convenient way of obtaining
these coefficients is to pu t
~ = Hx + Kx,
in which H and K are independent of the directions of the coordinate axes.
If xy y are the coordinates of the planet in its orbital plane with the
x axis directed to the perihelion, these expressions require
8i = Hx + Kx,
% = W + KS.
T h e solution of H and K from these equations is
y(dx/de) — 5c(dyjde)
H
K =
xy — yx
x(dy/de) — y(8x/8e)
xy — yx
Ax _ Ay _ Az _ 3 Aa_
T h e second follows from n2az = constant, which may be expressed by
2 ^ + 3 ^ = 0.
n a
Hence the change in the mean anomaly is
3 Aa
tAn = — -tn ,
2 a
R E C T A N G U L A R C O O R D I N A T E S 237
G = a2n yl—e2 = a2n cos 9?,
if e = sin 9 . For the evaluation of the numera tors , expressions in
te rms of the eccentric anomaly are useful. Wi th
, du sin u . n
u — e sin u = /,
T h u s
cos u + e sin u(2 — e2 — e cos u)
H = : -— , A. = 1 - e2 ' n(l - e2)
F r o m these expressions u may be eliminated by making use of
(x2 + y2 + z2)1/2 = r — a(l — e cos #),
+ yy + = rr — e sin w,
or
T h u s
de I — e cos w ' 1 — e cos u '
# = «(cos u — e), y = a cos cp sin w,
it follows that
y = an cos cp -
Multipl icat ion of the appropriate r ight -hand members gives
dx ± dy a2n — cos u e2 cos w — e + e cos 2 u
y de X de cos 99 1 — e cos w
a 2 n i 1 \ = (cos w + £),
cos cp
(2 — e2 — e cos w).
_ dy _dx a2 sin a
X de ^ de cos 99
cos u sin w
T h e denominator in these expressions is twice the areal velocity in
the orbital plane in w days, which may be wri t ten
2 3 8 O R B I T C O R R E C T I O N
Both H and K contain e as a factor in the denominator . T h u s if e
is small, there is a loss of accuracy in the numerical calculation. T h i s
is not a serious disadvantage. I t merely requires the calculation of rr
and of r/a(l — e2) to more significant figures than are required in H
and K. I t depends on the smallness of e how many extra significant
figures are to be used. T h e coordinates x, y, z and their derivatives
can as a rule be obtained with the necessary n u m b e r of significant
figures, and the additional amount of calculation required is negligible.
T h e combinat ion of these results gives the following expressions:
Set I
Al0 *h 4k # 3 Aaja Ae
Ax = x/n 0 + z - y
3 , .
x - - t x Hx + Kx
Ay = y/n — z 0 + x
3 , .
y ~ i t y
Hy + Ky
Az = z/n + y — X 0
3
z -~tz Hz + Kz
These are to be subst i tuted into the left-hand matrix of the matrix
product (1).
T h e small rotations AI/J19 AI/J2, AI/J3 may be considered to be the three
components in the equatorial coordinate system of a single rotation
vector Aip. Let J p , Aq, Ar be the components of the vector Aif; in a
coordinate system xyzy the xy plane of which coincides wi th the orbital
plane with the x axis directed toward the perihelion. T h e relations among
the two sets of rotation components are evidently
= P J B J p + & J q + * i B J r >
Aifj2=PyAp+QyAq + RyAr) ( 2 )
Ai/j3=PzAp+QzAq + RzAr.
T h e rotation components J p , Aq, AT can easily be expressed in t e rms
of corrections to the elements / , £?, co. Let y be the origin of longi-
tude, N be the ascending node of the orbit with the plane of the
ecliptic, A W = 90°, NX = co, the angular distance of the perihelion
from the node (see Fig. 1). Rotations Ap, Aq about OX and OY r e -
spectively are equivalent to
Ap cos co — Aq sin co about ON,
Ap sin co -f- Aq cos co about ON'.
T h e rotation about ON equals AI, while the diagram shows that the
rotation about ON' equals sin IAQ. As a result of the rotation about
RECTANGULAR COORDINATES 239
s i n i <a / I
F I G . 1. Differential correction of the elements 7, & and CJ.
I t should be noted that these formulas are independent of the choice
of the plane of reference. Hence, if Q,' oo' represent the equatorial
elements, i.e., the inclination relative to the equatorial plane, the longi-
tude of the node on the equator, and the angle from this node to the
perihelion, respectively, the relations are
AI' = Ap cos co' — Aq sin co ' ,
sin FAQ' = Ap sin co' + Aq cos co' ,
d c o ' + cos FAQ' = d r .
Other useful relations may be obtained with the aid of Chapte r I,
Sections 23, 24. For example:
11AF s i n FAQ' d c o ' + cos FAQ' 11
= | | d p d q d r || r ( — co' )
] cos co' + sin co' 0
= || d p d q d r | | — s i n co' c o s c o ' 0
0 0 1
= | | d p cos co' — d q sin co' d p sin co' + d q cos co' d r | | .
ON' the new node will be at B. T h e rotation d r about the z axis is
given by the difference between the arcs AX' and NX, if OX' is the
new x axis. But BX' — NX = d c o , AB = cos IAQ. Since AX' =
+ AX' — NX = BX' — NX+AB, or d r - + cos IAQ.
T h u s , finally
AI = Zip cos co — Aq sin co,
sin IAQ = Ap sin co + Aq cos co, (3)
d c o + cos IAQ = Av.
240 ORBIT CORRECTION
Finally,
| | J & A*p2 A^\\
= | | J p Aq Ar\\r(-co')p(-r)r(-Q')
II 1 0
= \\Ar sin FAQ' Aco' + cos / ' A Q ' 1 1
0
= \\AF — sin VAOJ' AQ' + cos / ' J o / ,
0 cos / ' + sin / '
0 — sin / ' cos / '
cos Q' + sin Q' 0
— sin Q' cos Q' 0
0 0 1
cos Q' + sin Q' 0
— sin Q' cos Q' 0
0 0 1
cos Q'AI' + sin Q' sin / ' A t /
sin Q'AI' — cos £?' sin I'Aaj'
cos / ' J o / + AQ'
where the t ransposed vector has been wri t ten to save space across the
page. T h u s
AF = cos Q'Aifj1 + sin Q'Aifj2i
sin I'AOJ' = sin Q'A\\s1 — cos Q'Aifj2, (4)
cos FAOJ' + AQ' = AI/J3.
If the eccentricity is small, the position of the perihelion will be poorly
determined. Hence there will also be a degree of indeterminateness in
both Al0 and in Aip3. Th i s is apparent from the coefficients of Al0 and
AI/JZ in Ax and Ay. For a circular orbit that coincides with the equatorial
plane,
x/y = — y/x
while z = 0. T h u s J / 0 and Aip3 cannot be separately obtained in a
solution in which these unknowns are used. T h e difficulty may be
avoided by using Al0+ Aip3 and Aip3 as unknowns . Hence:
^ Al0 - yAifj3 = -n (Al0 + A^J3) + ( - 5 - AI/J3)
^Al0 + xA^3 = J ( J / 0 + J<£3) + ( ~ \ + *)
\ A \ , + 0 ^ 3 = J {Ak+m + ( - « ) ^ a -
R E C T A N G U L A R C O O R D I N A T E S 241
— z 0
— x
Hx + Kx
Hy + Ky
Hz + Kz
While this choice of unknowns is suitable for most cases of moderately
small eccentricities, the determinateness may be increased by choosing
d p , d q , and d r as unknowns instead of Aifjly di/r 2, d ^ 3 . Subst i tu t ion of
(2) into the te rms involving these unknowns in Set I , gives:
+ zAfa -yA^3 = (Pyz - Pzy)Ap + (Qyz - Qzy)Aq + (R* - Rzy)Ar,
- ZAI/J, + XAI/Js = (Pzx - Pxz)Ap + (Qzx - Qxz)Aq + (Rzx - Rxz)Av,
+ yAfa - xAi/;2 = (Pxy - Pyx) d p + (Qxy - Qyx) d q + (Rxy - Ryx) d r .
If then the unknown d / 0 + d r is in t roduced instead of d / 0 , and eAv
instead of d r , the final expressions become:
Set III
d / 0 + d r d p d q eAx Aaja Ae
Ax= -n Pyz-Pzy Qyz-Qzy
l-(-^ + Ryz-Rzy^ x - l t x Hx + Kx
Ay= £ Pzx-Pxz Qzx-Qxz l ^ + R ^ - R ^ y-3-ty Hy + Ky
Az= -n Pxy-Pyx Qxy-Qyx )-(-^ +Rxy-RyX} z - \ t z Hz + Kz
T h i s form requires somewhat more calculation than Set I I , bu t
it has the advantage of yielding a determinate solution regardless of the
values of eccentricity or inclination. I t is r ecommended for general
application.
A X
Ax = -
n
n
Az = -
n +y
N o w — x j n — y and — y j n + x will be small , of the order of e.
T h u s the unknown di/r 3 will appear wi th small coefficients, a difficulty
in numerical applications. T h i s is avoided by int roducing eAtfjs as
unknown, the coefficients of which are obtained by dividing those of
AI/JZ by e. Consequent ly , the expressions become:
Set II
Al0 + Aifj3 A\\sx AI/J2 eAi/;3 Aaja Ae
242 ORBIT CORRECTION
I t will also be found that the coefficients of the unknowns in the
observation equations for Aoc, AS will all be of the same order of
magni tude, with one exception. T h i s exception is the coefficient of A a/a.
Thi s coefficient depends upon the interval covered by the observations.
T h e longer the interval, the more significant the par t tha t has t as a
factor will become.
T h e vectorial orbital constants that occur in the coefficients of Set I I I
define the orientation of a plane xy tha t is to be chosen to coincide as
nearly as may be with the osculating orbital plane of the planet, with
the x axis in the direction of the perihelion. I t follows that
Py PZ
Ax Ay Az\\ = \\Ax Ay Az\\ Q* Qy Q,
Rx Ry
Thi s yields the remarkably simple form.
Set IV
AIQ+AT Ap Aq eAr Aa/a Ae
Ax = - 0 0 - ( — - — y) x— Hx + K5c
n e \ n J 1 2
Ay= y- 0 0 + y~\t] Hy + Kf
Az = 0 +y - x 0 0 0
In some applications it will be advantageous to use this form. T h e
coefficients may be expressed with the aid of series in te rms of the mean
anomaly and, if the n u m b e r of observations is very great, tabulat ion
as functions of the mean anomaly may be desirable. T h e coefficient of
Aaja requires special t rea tment on account of the presence of the factor t.
T h e equivalent form given by
X
3 •
= X -In-l )-
y
3 •
-2*y
= y
may be used, / — l0 being expressed in radians. I t is obvious that whole
revolutions must be retained in the value of / — / 0 .
Tab le 1 gives expansions in te rms of the mean anomaly of the coeffi-
cients of Set IV. T o simplify the typography a common factor a has
been omit ted. T h i s factor must be applied before the coefficients are used.
RECTANGULAR COORDINATES 243
T h e series expansions in Table 1 are generally given to the th i rd
power of the eccentricity, with the exception that the parts of the coeffi-
cients of Aaja that have / — / 0 as a factor are complete to e 4 .
T A B L E I
COEFFICIENTS FOR S E T I V EXPANDED IN T E R M S OF THE M E A N A N O M A L Y
Coefficients of Al0 -f- Ar
Coefficients of Aa/a
Ax
in —
a
in
a
| sin / -
Coefficients of eAv
\ sin 3/ + e2 sin 4.
j cos 3/ — e2 cos 41 -
e3 cos 5/
Coefficients of Ae
f- e2 cos 41 + ez cos 5/
244 ORBIT CORRECTION
Coefficient of Ap
Obviously, the same procedure may be applied to the solar coordinates;
AX, AY, AZ may be expressed in te rms of corrections to the elements
of the earth 's orbit . In this case, however, several impor tant special
considerations enter. In the first place, the unknown Aaja should be
omitted. T h e astronomical uni t is used as the uni t of distance for all
measurements in the solar system, and the dimensions of the ear th 's
orbit in terms of this uni t are so well known that nothing can be gained
by the introduct ion of an unknown to correct the scale of the ear th 's
orbit . In the second place, the corrections AI/J are closely related to the
correction to the obliquity of the ecliptic and the equinox correction.
Let Ae be the correction to the obliquity of the ecliptic used in the
calculation of the equatorial rectangular coordinates of the sun, and AE
the equinox correction, i.e., the common correction that should be
applied to all r ight ascensions. F rom (4) with Q' = 0 it is evident that
Atp1 = Ae,
AI/J2 = — sin eAoj',
J03 = — AE + cos eAco'.
Instead of using Aipv Aift2, AI/JZ it may be advantageous to use
|| A+[ J0 2' 4 $ || = || d ^ J 0 2 4 £ 3 II
1 0 0
= || AI/J1 Aip2 Atp3 || 0 cos e — sin e
0 sin e cos e
Coefficient of Aq
| cos 21 - e2 cos 3/— e3 cos 41
R E C T A N G U L A R C O O R D I N A T E S 245
T h u s
or
= cos eAip2 + sin eAi/j3y
= — sin eAip2 + cos SAI/J..
= Ae,
= — sin eAE,
= — cos eAE + Aw.
I t will be noted that , as Q' = 0, Aa>' is identical with Zla/.
These represent rotations about the vernal equinox, the summer
solstice, and the nor th pole of the ecliptic, respectively. T h e principal
advantage of AI/J2', Aifj3 over Aifj2y Aifj3 is the bet ter separation between
Aw and AE. In AI/J2, AE appears with the factor — sin ey while in
Aifj3 it appears in linear combinat ion with cos eAw.
T h e relations to be used are evidently
Set V
Al'0 + AI/J3 Aifjx AI/J2 e!Aifj3 Ae'
AX = 4 0 + Z \ ( - X - - Y ) H'X + K'X
Y 1 / Y \
AY = 4 -Z 0 ±l-JL+x) H'Y + K'Y
n e \ n J
AZ = ~ +Y -X \ ( - — \ H'Z + K'Z
If Aifj2'y AI/J3 are introduced, we obtain
Set VI
Al'0+A^ Ah A^ e'J0 8 Ae'
AX= 4 0 - Y sin e + Z cos e \- (- 4 - Y cos e - Z sin e) H'X + K'X
n e \ n I
AY = 4 — Z + X s i n £ J - ( _ Z + ^ cos e\ H'Y + K't
n e \ n 1
AZ = ^ + y - Z c o s e i.(-4 + ^ 8 i n e ) H'Z + K'Z
n e \ n I
246 O R B I T C O R R E C T I O N
However, if ecliptic rectangular coordinates are in t roduced by the
relations
Xe — X,
Ye = Y cos e + Z sin e,
Z e = — Y sin g + Z cos e,
and if Ze = 0, it is seen that AX may be simplified (see AXe in Set VI I
below). If then AYe, AZe are int roduced, the result ing expressions are
Set VII
Al^+A+'3 A^ e'A^ Ae'
AXe =
AYe =
AZt, =
n'
x±
n'
0
0
Y P -X, 0
dXe
Be
dYP_
0
In Set VII the coordinates Xe, Ye are the solar coordinates referred
to the plane of the ecliptic, with the Xe axis in the direction of the
vernal equinox. For the purpose of calculating the coefficients of the
observation equations, these coefficients could be conveniently tabulated
as a function of the day of the year.
T h e complication arising from the motions of the vernal equinox
and the sun 's perigee mus t be given proper at tention if observations
covering a long interval of t ime are to be used. In many applications
the use of the s tandard equinox and equator 1950.0 is desirable. If Set V
or VI is used, and the equatorial coordinates and velocity components
are derived from the solar coordinates tabulated for this s tandard
equinox and equator, no at tention to this problem is necessary. Set VII ,
with Ze = 0, implies the use of the ecliptic of date. I t is almost necessary
then also to use the mean equinox of date. If the eight coefficients
arising in this set are tabulated as a function of the day of the year