This reprints an essay written ca. 1983, "'What Song the Syrens
Sang': How Did Einstein Discover Special Relativity?" in John
Stachel, Einstein
from "B" to "Z".
If you have read Edgar
Allen Poe's "The Murders in the Rue Morgue," perhaps you remember
the epigraph that Poe chose for this pioneer detective story:
What song the Syrens sang, or what name Achilles assumed when
he hid himself among women, though puzzling questions, are not
beyond all conjecture.1
I believe that the problem of how Einstein discovered the special
theory of relativity (SRT) falls into this category of "puzzling
questions," that "are not beyond all conjecture."2
Let me begin by explaining why.
When I started work on the Einstein Papers, there was already a
large literature on the origins of SRT compared, say, to the rather
scanty amount published on the origins of the general theory of
relativity (GRT). So I assumed that the development of SRT must
be fairly clear. However, I soon learned that the amount of work
published on the origin of SRT and GRT are just about inversely
proportional to the available primary source material. For GRT,
we have a series of Einstein's papers from 1907 to 1915, capturing
the successive steps of his search for the final version of the
theory. In addition, there is extensive contemporary correspondence
on the subject, several research notebooks, records of lectures
given by Einstein during this period, not to mention a number of
later reminiscences and historical remarks by Einstein.3
For SRT we have the paper On the Electrodynamics of Moving
Bodies, in which the theory was first set forth in 1905
in its finished form, indeed a rather polished form (which is
not to say that it bears no traces of its gestation process).
The only earlier documentary evidence consists of literally
a couple of sentences to be found in the handful of preserved
early Einstein letters (I will quote both sentences later).
We do have a number of later historical remarks by Einstein
himself, sometimes transmitted by others (Wertheimer, Reiser-Kayser,
Shankland, Ishiwara, for example), which raise many problems
of authenticity and accuracy; and some very late Einstein letters,
answering questions such as whether he had prior knowledge of
the Michelson-Morley experiment, what works by Lorentz he had
read, the influence of Poincaré, Mach, Hume, etc., on
his ideas; Einstein's replies are not always self-consistent,
it must be noted.4
Yet the urge to provide an answer to the question of the discovery
of SRT has proven irresistible to many scholars. It is not hard
to see why: A twenty-six year old patent expert (third class),
largely self-taught in physics, who had never seen a theoretical
physicist (as he later put it), let alone worked with one, author
of several competent but not particularly distinguished papers,
Einstein produced four extraordinary works in the year 1905,
only one of which (not the relativity paper) seemed obviously
related to his earlier papers. These works exerted the most
profound influence on the development of physics in the 20th
Century. How did Einstein do it? Small wonder that Tetu Hirosige,
Gerald Holton, Arthur I. Miller, Abraham Pais, John Earman,
Clark Glymour, Stanley Goldberg, Robert Rynasiewicz, Roberto
Torretti, et al., have been moved to study this question.
I shall not try to record my debts to and differences with each
of these scholars, lest this survey become even longer and more
tedious than it is already; but must at least acknowledge the
influence of their work on my own.5
I resisted the urge to conjecture for some years, but have finally
succumbed, so I can well understand the temptation.
Contrary to my original, naive expectation,
no general consensus has emerged from all this work. Given the nature
of the available documentation and the difficulty of understanding
any creative process-let alone that of a genius-this really is not
surprising. I now believe that the most one can hope to do in discussing
the discovery of SRT is to construct a plausible conjecture. Such
a conjecture will be based upon a certain weighting of the scanty
evidence we possess, based upon certain methodological hypotheses,
as well as the imagination of the conjecturer.6
There are bound to be differences of opinion in these matters. All
one can demand is that it be made clear on what methodological hypotheses
a conjecture is based, and a demonstration that the conjecture is
in accord with the available evidence when the latter is weighted
in accord with these hypotheses.
Let me emphasize that no such account can hope to encompass
those elements of the creative process that Einstein referred
to as "the irrational, the inconsistent, the droll, even the
insane, which nature, inexhaustibly operative, implants into
the individual, seemingly for her own amusement," for "These
things are singled out only in the crucible of one's own mind."
Yet one may draw courage for the type of conjecture I have in
mind from another remark of Einstein:
"A new idea comes suddenly and in a rather intuitive way. That
means it is not reached by conscious logical conclusions. But, thinking
it through afterwards, you can always discover the reasons which
have led you unconsciously to your guess and you will find a logical
way to justify it. Intuition is nothing but the outcome of earlier
intellectual experience."
I shall discuss only this intellectual, logical side of Einstein's
struggles. Before trying to reconstruct these struggles, it is well
to note that his outward existence was far from tranquil during
the period when he was developing SRT. While attending the Polytechnic
at Zurich, thanks to the support of maternal relatives, he was plagued
by the thought that he was unable to help his family, which was
in dire financial straits due to constant business reverses. He
was the only graduate in his section (VIA) not to get an academic
post, and lived a hand-to-mouth existence for almost two years,
until he got a job at the Swiss Patent Office thanks to help from
a friend's father. During this period he was under severe family
pressure to break with his fiancee, whom he only married in 1903
after his father's death. His first child was born in 1904, and
he had to support wife and child on his modest income from the Patent
Office, while his mother found work as a housekeeper. So one must
not think of Einstein as a tranquil academic, brooding at leisure
on weighty intellectual problems. Rather one must imagine him fitting
his intellectual work into the interstices of a professional career
and personal life that might have overwhelmed someone with a different
nature.
The main methodological hypothesis
guiding my conjecture was stated by Hans Reichenbach some time ago:
"...the logical schema of the theory of relativity corresponds surprisingly
with the program which controlled its discovery." To put it in more
hifalutin' terms, also due to Reichenbach, I believe that "the context
of justification" of SRT used by Einstein can shed light on "the
context of its discovery."7
This hypothesis suggests that we can learn a good deal about the
development of the theory by paying close attention to the logical
structure of its initial presentation in 1905, and to the many accounts
of the theory that Einstein gave afterwards. Of course, I have tried
not to neglect any scrap of evidence known to me, including the
pitifully small amount of contemporary documentation and the later
reminiscences. But I have given special weight to Einstein's early
papers, letters, and lectures, in which he sought to justify the
theory to his contemporaries. Intellectually, Einstein was an exceedingly
self-absorbed person, willing to go over and over the grounds for
the theory again and again. These accounts, given over a number
of years, are remarkably self-consistent. They provide evidence
for a number of conjectures about the course of development of his
own ideas, and occasionally even include explicit statements about
it. I assume that by and large memory tends to deteriorate with
time, and (worse) that pseudo "memories" tend to develop and even
displace correct recollections. So, a second methodological hypothesis
which I shall adopt is that, in case of discrepancies between such
accounts, earlier ones are to be given greater weight than later
ones. Explicit remarks that Einstein makes about the discovery of
SRT in the course of his later expositions must always be given
great weight, but the earlier he made them the greater the weight
I give to them. Of course, if some feature of Einstein's accounts
remains unchanged over many years, I take this as evidence for giving
such a point the most weight.8
It follows from these methodological assumptions that I must preface
my conjectures with a brief resume of the "logical schema of
the theory of relativity" as it was first published in the 1905
paper. In this paper, as in almost all subsequent accounts,
Einstein bases SRT on two fundamental principles: the principle
of relativity and the principle of the constancy of the velocity
of light. The principle of relativity originated in Galilean-Newtonian
mechanics: Any frame of reference in which Newton's law of inertia
holds (for some period of time) is now called an inertial frame
of reference. From the laws of mechanics it follows that, if
one such inertial frame exists, then an infinity of them must:
All frames of reference (and only such frames) moving with constant
velocity with respect to a given inertial frame are also inertial
frames. All mechanical experiments and observations proved to
be in accord with the (mechanical) principle of relativity:
the laws of mechanics take the same form in any of these inertial
frames. The principle of relativity, as Einstein stated it in
1905, asserts that all the laws of physics take the
same form in any inertial frame-in particular, the laws of electricity,
magnetism, and optics in addition to those of mechanics.
The second of Einstein's principles
is based on an important consequence of Maxwell's laws of electricity,
magnetism, and optics, as interpreted by H. A. Lorentz near the
end of the nineteenth century. Maxwell had unified optics with electricity
and magnetism in a single theory, in which light is just one type
of electromagnetic wave. It was then believed that any wave must
propagate through some mechanical medium. Since light waves easily
propagate through the vacuum of interstellar space, it was assumed
that any vacuum, though empty of ordinary, ponderable matter, was
actually filled by such a medium, to which our senses did not respond:
the ether. The question then arose, how does this medium behave
when ordinary matter is present? In particular, is it dragged along
by the motion of matter? Various possible answers were considered
in the course of the nineteenth century, but finally only one view
seemed compatible with (almost) all the known experimental results,
that of H. A. Lorentz: The ether is present everywhere. Ordinary
matter is made up of electrically charged particles, which can move
through the ether, which is basically immobile. These charged particles,
then called "electrons" or "ions", produce all electric and magnetic
fields (including the electromagnetic waves we perceive as light),
which are nothing but certain excited states of the immovable ether.
The important experimental problem then arose of detecting the motion
of ponderable matter-the earth in particular-through the ether.
No other theory came remotely close to Lorentz's in accounting
for so many electromagnetic and especially optical phenomena.
This is not just my view of Lorentz's theory, it was Einstein's
view. In particular, he again and again cites the abberration
of starlight and the results of Fizeau's experiment on the velocity
of light in flowing water as decisive evidence in favor
of Lorentz's interpretation of Maxwell's equations.
A direct consequence of Lorentz's conception of the stationary
ether is that the velocity of light with respect to the ether is
a constant, independent of the motion of the source of light (or
its frequency, amplitude, or direction of propagation in the ether,
etc.).
Einstein adopted a slightly-but crucially-modified version of this
conclusion as his second principle: There is an inertial frame
in which the speed of light is a constant, independent of the velocity
of its source. A Lorentzian ether theorist could agree at once to
this statement, since it was always tacitly assumed that the ether
rest frame is an inertial frame of reference and Einstein had "only"
substituted "inertial frame" for "ether." But Einstein's omission
of the ether was deliberate and crucial: by the time he formulated
SRT he did not believe in its existence. For Einstein a principle
was just that: a principle-a starting point for a process of deduction,
not a deduction from any (ether) theory. (I am here getting ahead
of my story and will return to this point later.) The Lorentzian
ether theorist would add that there can only be one inertial
frame in which the light principle holds. If the speed of light
is a constant in the ether frame, it must be non-constant in every
other inertial frame, as follows from the (Newtonian) law of addition
of velocities. The light principle hence seems to be incompatible
with the relativity principle. For, according to the relativity
principle, all the laws of physics must be the same in
any inertial frame. So, if the speed of light is constant in one
inertial frame, and that frame is not physically singled out by
being the rest frame of some medium (the ether), then the speed
of light must be the same (universal) constant in every
other inertial frame (otherwise the democracy of inertial frames
is violated). As Einstein put it in 1905, his two principles are
"apparently incompatible." Of course, if they really were incompatible
logically or physically, that would be the end of SRT.9
Einstein showed that they are not only logically compatible, but
compatible with the results of all optical and other experiments
performed up to 1905 (and since, we may add). He was able to show
their logical compatibility by an analysis of the concepts of time,
simultaneity, and length, which demonstrated that the speed of light
really could have the privileged status, implied by his two principles,
of being a universal speed, the same in every inertial frame of
reference.10
Now I shall begin my conjecture
about Einstein's discovery of SRT. In a 1921 lecture, Einstein stated
that SRT originated from his interest in the problem of the optics
of moving bodies. He seems to have been fascinated from an early
age by the nature of light, a fascination that persisted throughout
his life. From an essay he wrote in 1895, (at age 16), we know that
he then believed in the ether, and had heard of Hertz's experiments
on the propagation of electromagnetic waves; but he does not show
any knowledge of Maxwell's theory. In much later reminiscences,
he reports that during the following year (1895-1896) he conceived
of a thought experiment: what would happen if an observer tried
to chase a light wave? Could s/he catch up with it? If so, s/he
ought to see a non-moving light wave form, which somehow seemed
strange to him. In retrospect, he called this "the first childish
thought-experiment that was related to the special theory of relativity."
Reliable accounts inform us that during his second year (1897-98)
at the Swiss Federal Technical Institute, or Poly as it was then
called, he tried to design an experiment to measure the velocity
of the earth through the ether, being then unacquainted with either
the theoretical work on this problem by Lorentz or the experiment
of Michelson and Morley (M-M). A precious bit of contemporary documentary
evidence reinforces this later account. In a letter to his schoolmate
and friend Marcel Grossmann, written in the summer of 1901 (by then
both had graduated from the Poly), Einstein wrote:
A considerably simpler method for the investigation of the relative
motion of matter with respect to the light ether has again
occurred to me, which is based on ordinary interference experiments.
If only inexorable destiny gives me the time and peace necessary
to carry it out.
At first sight, it would seem remarkable for Einstein to have written
these words (which also show that he had not yet abandoned the concept
of the ether), if he knew about the M-M experiment at this time.
However, while still at the Poly (i.e., before 1901) he appears
to have studied Maxwell's theory (not covered in his school lectures)
on his own, perhaps from the new textbook of August Föppl (which,
in various reincarnations, such as Föppl-Abraham, Abraham-Becker,
Becker-Sauter, has stayed in print to this day). Föppl discusses
a problem which evidently made a strong and lasting impression on
Einstein, since he opens the 1905 paper with a discussion of it.
This is the problem of the relative motion of a magnet and a conducting
wire loop. If the loop is at rest in the ether and the magnet is
moved with a given velocity, a certain electric current is induced
in the loop. If the magnet is at rest, and the loop moves with the
same relative velocity, a current of the same magnitude
and direction is induced in the loop. However, the ether theory
gives a different explanation for the origin of this current in
the two cases. In the first case an electric field is supposed to
be created in the ether by the motion of the magnet relative to
it (Faraday's law of induction). In the second case, no such electric
field is supposed to be present since the magnet is at rest in the
ether, but the current results from the motion of the loop through
the magnetic field (Lorentz force law). This asymmetry of explanation,
not reflected in any difference in the phenomena observed, must
already have been troubling to Einstein. Even more troubling was
the knowledge, when he acquired it, that all attempts to detect
the motion of ponderable matter through the ether had failed. This
was an "intolerable" (his word, about 1920) situation. Observable
electromagnetic phenomena depend only on the relative motions
of ponderable matter; their explanations differ, however, depending
on the presumed state of motion of that matter relative to the hypothetical
ether; yet all attempts to detect this presumed motion of ordinary
matter relative to the ether end in failure! He later (c. 1920)
recalled that the phenomenon of electromagnetic induction compelled
him to adopt the relativity principle.
In 1938 he wrote "The empirically suggested non-existence of such
an [ether wind] is the main starting point [point of departure]
for the special theory of relativity."11
It is not clear when the significance of the failure of all attempts
to detect the motion of ordinary matter through the ether first
struck him. The letter quoted above suggests that it was after the
summer of 1901. We know from a letter to another friend, Michele
Besso, dating from early 1903, that he had decided to "carry out
comprehensive studies in electron theory." No later than that, and
quite possibly earlier, he read Lorentz's 1895 book, "Attempt at
a Theory of Electrical and Optical Phenomena in Moving Bodies."
Einstein surely learned about, the many such failures by reading
this book, since one of its main purposes was to show that such
failures were compatible with Lorentz's stationary ether theory.
His later comments suggest that study of this book (Einstein says
this is the only work by Lorentz he read before 1905) convinced
him of the essential superiority of Lorentz' approach to the optics
of moving bodies; yet it also convinced him that the Lorentz theory
was still not fully satisfactory. Lorentz could explain away the
failure to detect motion of matter relative to the ether convincingly
to Einstein in all cases but one: the M-M experiment. To explain
this, Lorentz had to introduce a special hypothesis, which to Einstein
seemed completely unconnected with the rest of the theory: the famous
Lorentz contraction. To Einstein, such an approach was not a satisfactory
way out of the "intolerable dilemma." It seemed preferable to him
to accept at face value the failure of the M-M and all similar experiments
to detect motion of matter relative to the ether. Taken by themselves,
these negative results suggested to Einstein that the relativity
principle applied to electromagnetism, while the ether should be
dropped as superfluous. There has been some confusion on this important
point, so I shall expand on it. Sometimes the case is presented
in such a way as to suggest that it was the "philosophical concept"
of the relativity of all motion, as Einstein once called it, which
was the key step in his rejection of the ether. But the concept
of a stationary ether, as well as of a moving ether, is quite compatible
with this philosophical concept of the relativity of motion: one
need only assume that motions relative to the ether in the first
case, as well as relative motions of the parts of the ether in the
second, have physical efficacy. The leading advocates of both the
dragged-along and the immovable ether concepts, Hertz and Lorentz,
respectively, both understood this and both were read by Einstein.12
By the time he gave up the ether
concept, Einstein most likely took this philosophical conception
of the relativity of all motion for granted, presumably under the
influence of his early reading of Mach's Mechanics (around
1897). What bothered him now was that no phenomenon existed that
could be interpreted as empirical evidence for the physical efficacy
of the motion of ordinary matter relative to the ether, in spite
of repeated efforts to find one. Yet the best available theory-
Lorentz's theory-could only attempt to explain away such failures.
These explanations were satisfactory, within the framework of Lorentz'
theory, in almost all known cases (i.e., for all experiments sensitive
only to order v/c), and Einstein even seems to have been
tempted to give up what we may call his physical relativity principle
(with no ether needed). But Lorentz's explanation of the M-M experiment
seemed to Einstein so artificial that he resisted this temptation,
opting for the physical relativity principle. After eliminating
the ether from the story altogether, one can simply take the results
of the M-M and similar experiments as empirical evidence for the
equivalence of all inertial frames for the laws of electricity,
magnetism and optics as well as those of mechanics. I believe Einstein
gave up the ether concept and definitely opted for the physical
relativity principle at least a couple of years before the final
formulation of SRT, perhaps even earlier. At any rate, at some point
well before the 1905 formulation of the theory, he made this choice
and adhered to it thereafter.
There was a related motive for his skepticism with regard to the
ether, which I shall now mention. Not only was Einstein working
on problems of the optics of moving bodies, he was also working
on problems related to the emission and absorption of light by matter
and of the equilibrium behavior of electromagnetic radiation confined
in a cavity-the so-called black body radiation problem. He was using
Maxwell's and Boltzmann's statistical methods, which he had redeveloped
and refined in several earlier papers, to analyze this problem.
This was itself a daring step, since these methods had been developed
to help understand the behavior of ordinary matter while Einstein
was applying them to the apparently quite different field of electromagnetic
radiation.13
The "revolutionary" conclusion to which he came was that, in certain
respects, electromagnetic radiation behaved more like a collection
of particles than like a wave. He announced this result in a paper
published in 1905, three months before his SRT paper. The idea that
a light beam consisted of a stream of particles had been espoused
by Newton and maintained its popularity into the middle of the 19th
century. It was called the "emission theory" of light, a phrase
I shall use. The need to explain the phenomena of interference,
diffraction and polarization of light gradually led physicists to
abandon the emission theory in favor of the competing wave theory,
previously its less-favored rival. Maxwell's explanation of light
as a type of electromagnetic wave seemed to end the controversy
with a definitive victory of the wave theory. However, if Einstein
was right (as events slowly proved he was) the story must be much
more complicated. Einstein was aware of the difficulties with Maxwell's
theory-and of the need for what we now call a quantum theory of
electromagnetic radiation-well before publishing his SRT paper.
He regarded Maxwell's equations as some sort of statistical average-of
what he did not know, of course-which worked very well to explain
many optical phenomena, but could not be used to explain all the
interactions of light and matter. A notable feature of his first
light quantum paper is that it almost completely avoids mention
of the ether, even in discussing Maxwell's theory. Giving up the
ether concept allowed Einstein to envisage the possibility that
a beam of light was "an independent structure," as he put it a few
years later, "which is radiated by the light source, just as in
Newton's emission theory of light."
So abandonment of the concept of the ether was a most important
act of liberation for Einstein's thought in two respects: It
allowed Einstein to speculate more boldly on the nature of light
and it opened the way for adoption of his relativity principle
as a fundamental criterion for all physical laws. I must add
a word about Einstein's use of such principles as a guide to
further research. In 1919 he explicitly formulated a broad distinction
between constructive theories and theories of principle. Constructive
theories attempt to explain some limited group of phenomena
by means of some model, some set of postulated theoretical entities.
For example, many aspects of the behavior of a gas could be
explained by assuming that it was composed of an immense number
of constantly colliding molecules. Theories of principle formulate
broad regularities, presumably obeyed by all physical phenomena,
making these principles criteria ("rules of the game") that
any constructive theory must satisfy. For example, the principles
of thermodynamics are presumed to govern all macroscopic phenomena.
They say nothing about the, micro-structure or detailed behavior
of any particular gas, but do constitute limitations on any
acceptable constructive theory of such a gas. Any theory not
conserving the energy of the gas, for example, would be immediately
rejected. Since the turn of the century, Einstein had been searching
for a constructive theory of light, capable of explaining all
of its properties on the basis of some model, and was to continue
the search to the end of his days. But, "Despair[ing] of the
possibility of discovering the true answer by constructive efforts,"
as he later put it, he decided that the only possible way of
making progress in the absence of such a constructive theory
was to find some set of principles that could serve to limit
and guide the search for a constructive theory.14
There is no contemporary evidence showing when Einstein adopted
this point of view (he first indicated it in print as early
as 1907). I believe he had done so by 1905. The structure of
the 1905 SRT paper is certainly compatible with his having done
so. It is based on the statement of two such principles, deduction
of various kinematic consequences from them, and their application
to Maxwell's electrical and optical theory.
To return to the main thread
of my conjecture, I believe that Einstein dropped the ether
hypothesis and adopted his relativity principle by 1903 or 1904
at the latest. This is by no means the end of the story. It
seemed that he must then drop Lorentz's version of Maxwell's
theory, based as it was on the ether hypothesis. With what was
he to replace it? There is good evidence suggesting he spent
a great deal of effort trying to replace it with an emission
theory of light-the sort of theory suggested by his concurrent
researches into the quantum nature of light.15
An emission theory is perfectly compatible with the relativity
principle. Thus, the M-M experiment presented no problem; nor
is stellar abberration difficult to explain on this basis.16
Einstein seems to have wrestled with the problems of an emission
theory of light for some time, looking for a set of differential
equations describing such a theory that could replace the Maxwell-Lorentz
equations; and trying to explain a number of optical experiments,
notably the Fizeau experiment, based on some version of the emission
theory. He could not find any such equations, and his attempt to
explain the Fizeau experiment led him to more and more bizarre assumptions
to avoid an outright contradiction. So he more-or-less abandoned
this approach (you will soon see why I say more-or-less), after
perhaps a year or more of effort, and returned to a reconsideration
of the Maxwell-Lorentz equations. Perhaps there was a way of making
these equations compatible with the relativity principle once one
abandoned Lorentz's interpretation via the ether concept.
But here he ran into the most blatant-seeming contradiction, which
I mentioned earlier when first discussing the two principles. As
noted then, the Maxwell-Lorentz equations imply that there exists
(at least) one inertial frame in which the speed of light is a constant
regardless of the motion of the light source. Einstein's version
of the relativity principle (minus the ether) requires that, if
this is true for one inertial frame, it must be true for all inertial
frames. But this seems to be nonsense. How can it happen that the
speed of light relative to an observer cannot be increased or decreased
if that observer moves towards or away from a light beam? Einstein
states that he wrestled with this problem over a lengthy period
of time, to the point of despair. We have no details of this struggle,
unfortunately.
Finally, after a day spent wrestling
once more with the problem in the company of his friend and patent
office colleague Michele Besso, the only person thanked in the 1905
SRT paper, there came a moment of crucial insight. In all of his
struggles with the emission theory as well as with Lorentz's theory,
he had been assuming that the ordinary Newtonian law of addition
of velocities was unproblematic. It is this law of addition of velocities
that allows one to "prove" that, if the velocity of light is constant
with respect to one inertial frame, it cannot be constant with respect
to any other inertial frame moving with respect to the first. It
suddenly dawned on Einstein that this "obvious" law was based on
certain assumptions about the nature of time always tacitly made.
In particular, the concept of the velocity of an object with respect
to an inertial frame depends on time readings made at two different
places in that inertial frame. (He later referred to this moment
of illumination as "the step.")17
How do we know that time readings at two such distant places are
properly correlated? Ultimately this boils down to the question:
how do we decide when events at two different places in the same
frame of reference occur at the same time, i.e., simultaneously?
Isn't universal simultaneity an intuitively obvious property of
time? Here, I believe, Einstein was really helped by his philosophical
readings. He undoubtedly got some help from his readings of Mach
and Poincaré, but we know that he was engaged in a careful
reading of Hume at about this time; and his later reminiscences
attribute great significance to his reading of Hume's Treatise
on Human Nature. What could he have gotten from Hume? I think
it was a relational-as opposed to an absolute-concept of time and
space. This is the view that time and space are not to be regarded
as self-subsistent entities; rather one should speak of the temporal
and spatial aspects of physical processes; "The doctrine," as Hume
puts it, "that time is nothing but the manner, in which some real
object exists." I believe the adoption of such a relational concept
of time was a crucial step in freeing Einstein's outlook, enabling
him to consider critically the tacit assumptions about time going
into the usual arguments for the "obvious" velocity addition law.
This was the second great moment of liberation of his thought.
I shall not rehearse Einstein's arguments here, but it led to the
radically novel idea that, once one physically defines simultaneity
of two distant events relative to one inertial frame of reference,
it by no means follows that these two events will be simultaneous
when the same definition is used relative to another inertial
frame moving with respect to the first. It is not logically excluded
that they are simultaneous relative to all inertial frames.
If we make that assumption, we are led back to Newtonian kinematics
and the usual velocity addition law, which is logically quite consistent.
However, if we adopt the two Einstein principles, then we are led
to a new kinematics of time and space, in which the velocity of
light is a universal constant, while simultaneity is different with
respect to different inertial frames; this is also logically quite
consistent. The usual velocity addition law is then replaced by
a new one, in which the velocity of light "added" to any other velocity
("added" in a new sense-it would be better to say "compounded with")
does not increase, but stays the same! The Maxwell-Lorentz equations,
when examined with the aid of this new kinematics, prove to take
the same form in every inertial frame. They are, therefore, quite
compatible with the relativity principle, which demands that the
laws of electricity, magnetism and optics have this property. The
presence or absence of an electric or magnetic field, is then also
found to be relative to an inertial frame, allowing a completely
satisfactory relativistic analysis of the example of the conducting
wire loop and magnet in relative motion. Within six weeks of taking
"the step," Einstein later recalled, he had worked out all of these
consequences and submitted the 1905 SRT paper to Annalen der
Physik.
This does not imply that Lorentz's equations are adequate to explain
all the features of light, of course. Einstein already knew they
did not always correctly do so-in particular in the processes of
its emission, absorption and its behavior in black body radiation.
Indeed, his new velocity addition law is also compatible with an
emission theory of light, just because the speed
of light compounded with any lesser velocity still yields the same
value. If we model a beam of light as a stream of particles, the
two principles can still be obeyed. A few years later (1909), Einstein
first publicly expressed the view that an adequate future theory
of light would have to be some sort of fusion of the wave and emission
theories. This is an example of how the special theory of relativity
functioned as a theory of principle, limiting but not fixing the
choice of a constructive theory of light.
Here I shall end my conjectures
on how Einstein arrived at SRT. To briefly recapitulate, I believe
that the first principle, the relativity principle, recapitulates
his struggles with the mechanical ether concept which led finally
to the first crucial liberation of his thought-the abandonment of
the ether. The second principle, the principle of the constancy
of the speed of light, recapitulates his struggle, once he had definitely
opted for the relativity principle, first to evade the Maxwell-Lorentz
theory by an emission theory; then to isolate what was still valid
in the Maxwell Lorentz theory after giving up the ether concept
and abandoning absolute faith in the wave theory of light. The struggle
to reconcile the two principles could only end successfully after
the second great liberation of his thought: the relativisation of
the concept of time. The resulting theory did not force him to choose
between wave and emission theories of light, but rather led him
to look forward to a synthesis of the two. This synthesis was finally
achieved, over twenty years later, in the quantum theory of fields,
to the satisfaction of most physicists, but ironically, never to
that of Einstein.
I cannot ask you to accept my conjectures after all of my warnings
at the outset of this paper, but will be content if you say
"Si non è vero, è ben trovato," "If it isn't true,
it's well contrived."
You can EXIT
to Einstein's own words on "Ether and the Theory of Relativity"
(1920 address)
Notes
1. Poe is quoting Sir Thomas Browne's
Hydrotaphia. BACK
2. A preliminary question is raised
by my use of the word "discovery." Is it better to speak of the
"discovery" or the "creation" of a theory like SRT? "Discovery"
suggests the finding of some pre-existent, objective structure,
as when we say "Columbus discovered America." "Creation" suggests
an individual, subjective act, as when we say "Tolstoy created Anne
Karenina." Neither word seems really appropriate to describe
what goes on in the scientific endeavor. Einstein apparently preferred
the word "Erfindung" (invention) to describe how scientific theories
come into being. Speaking of Mach, Einstein says: "Er meinte gewissermassen,
dass Theorien durch Entdeckung und nicht durch Erfindung
entstehen." (Einstein-Besso Correspondence (Hermann, Paris
1972), p. 191, dated January 6, 1948. BACK
3. In the study of the discovery
of GRT, therefore, one may hope to formulate conjectures which can
be either confirmed or refuted. For example: A study of Einstein's
published papers and private correspondence between 1912-1915 convinced
me that the standard explanation for his failure to arrive at the
correct gravitational field equations until the end of this period-namely,
his presumed lack of understanding of the meaning of freedom of
coordinate transformations in a generally covariant theory and the
ability to impose coordinate conditions that this freedom implied-could
not be correct (see "Einstein's Search for General Covariance, 1912-1915,"
presented at the Ninth International Conference on General Relativity
and Gravitation, July 17, 1980, in Stachel Einstein from "B"
to "Z", pp. 301338). On the basis of his study of a research
notebook of Einstein from the early part of this period, John Norton
was able to prove that Einstein already was aware of the possibility
of imposing coordinate conditions on a set of field equations, and
indeed had used the harmonic coordinate conditions (see John Norton,
"How Einstein found his field equations: 1912-1915," Historical
Studies in the Physical Sciences 14, 253 (1984). For reasons
discussed in the text, one cannot hope to confirm or disconfirm
most conjectures about the origins of SRT.BACK
4. For a survey of this material
for the period up to 1923, see J. Stachel, "Einstein and Michelson:
The Context of Discovery and the Context of Justification," Astron.
Nachricht. 303, 47 (1982). Unless otherwise noted,
quotations from Einstein are cited from this paper, which gives
the full references. [See Stachel, Einstein from "B" to "Z",
pp. 177-190].BACK
5. See Arthur I. Miller, Albert
Einstein's Special Theory of Relativity (Addison-Wesley, Reading
1981), which contains references to his earlier papers as well as
those of Holton, Hirosige and many others; Abraham Pais, 'Subtle
is the Lord. . .' The Science end the Life of Albert Einstein
(Oxford U.P., New York 1982); Stanley Goldberg, Understanding
Relativity (Birkhauser, Boston 1984); Roberto Torretti, Relativity
end Geometry (Pergamon, Oxford 1983). Earman, Glymour and Rynasiewicz
have not yet published a full account of their views; I thank them
for making available copies of several preprints on this subject.
BACK
6. A popular epigram among historians
runs: "God is omnipotent, but even He cannot change the past. That
is why He created historians."
BACK
7.See the reference in
footnote 4 for the source of the citations from Reichenbach. If
my thesis here is correct, this argues against the still widely
held view that these two contexts should be rigorously separated.
But in this paper I shall not elaborate on the wider issue.
BACK
8. For example, Einstein's
statements of the second principle of SRT, the light principle,
remained remarkably consistent throughout his lifetime (see the
discussion of this principle below). Indeed, an apparent exception
in the printed text of his article "What is the Theory of Relativity?,"
published originally in English translation in the Times
of London in 1919, proved to be based upon an incorrect transcription
of his manuscript. BACK
9. Much of the anti-relativity
literature, which still continues to grow in volume if not in weight,
is based on attempts to show that the two principles are indeed
logically incompatible. BACK
10. Sometimes
(e.g., by Pais and Goldberg), this consequence of Einstein's two
principles is asserted to be his second principle. This is incorrect
factually (Einstein's account of the second principle is one of
the most consistent features of his discussions of SRT over the
years-see footnote 8), and disturbing for several reasons: a) it
makes it impossible to explain why Einstein refers to the two principles
as apparently contradictory. There is no contradiction apparent
between the relativity principle and this deduction from it; b)
it is logically defective, since the two principles would no longer
be logically independent, as they are in Einstein's formulation;
c) most important for present purposes, this formulation deprives
us of important clues to Einstein's reasoning that led to the development
of SRT. BACK
11. Einstein to Max Talmey,
June 6, 1938. The German text reads: "Die empirisch suggerierte
Nichtexistenz einer solchen bevorzugten 'Wind-Richtung' ist der
Haupt-Ausgangspunkt der speziellen Relativitatstheorie."
BACK
12. Hertz
said: "... the absolute motion of a rigid system of bodies has no
effect upon any internal electromagnetic processes whatever in it,
provided that all the bodies under consideration, including the
ether as well, actually share the motion." (Electromagnetic
Waves, p. 246). Lorentz said:
That one cannot speak of the absolute rest of the ether, is
self-evident indeed; the expression wouldn't even have any meaning.
If I say for short, the ether is at rest, this only means that
one part of this medium is not displaced with respect to the others
and that all perceptible movements of the heavenly bodies are
relative movements with respect to the ether. [Versuch,
p. 4 (1895).] BACK
13. He was not alone
in transferring statistical methods from ordinary matter to radiation.
Planck had already done so, but Einstein did not see the relation
of his work to Planck's until after publishing his first paper on
the subject. BACK
14. See Albert Einstein,
Autobiographical Notes (Open Court, LaSalle 1979), pp.
48 (German text) and 49 (English translation).
BACK
15. One such piece of
evidence, not cited in my earlier paper (see footnote 4), has only
recently come to light. It occurs in the most complete review of
SRT that Einstein ever wrote. It was prepared in 1912 but never
published, and is still in private hands. Luckily, a copy has come
into the possession of the Einstein Archive. In it, Einstein explains
at some length the difficulties that are encountered (and presumably
these are the ones he had encountered), if one tries to explain
the results of the Fizeau experiment on the basis of an emission
theory of light combined with the relativity principle and Galilei-Newtonian
kinematics. [See The Collected Papers of Albert Einstein,
vol. 4. The Swiss Years: Writings 1912-1914 (Princeton
University Press, Princeton 1995), Doc. 1, "Manuscript on the Special
Theory of Relativity," pp. 32-36]. BACK
16. Indeed,
the earliest explanation of stellar abberation had been based on
the emission theory. BACK
17. Abraham Pais has
mentioned this in describing his conversations with Einstein.
BACK
This is the text of "'What Song the Syrens Sang':
How Did Einstein Discover Special Relativity?" as printed in John
Stachel, Einstein
from "B" to "Z" (Boston : Birkhäuser, 2002), pp. 157-169.
It is an English text of "Quale canzone cantarono le sirene: come
scopro Einstein la teoria speciale della relatività?" published
in L'Opera di Einstein (1989), pp. 21-37. Copyright ©
1989, 2002 by John Stachel.
John Stachel is Professor of Physics
Emeritus and Director of the Center of Einstein Studies at Boston
University. He has written a variety of articles on aspects of the
history of both special and general relativity and other topics, and
has edited or co-edited a number of books dealing with Einstein and
relativity.
|