| This reprints an essay written ca. 1983, "'What Song the Syrens
                  Sang': How Did Einstein Discover Special Relativity?" in John
                  Stachel, Einstein
                  from "B" to "Z". If you have read Edgar
              Allen Poe's "The Murders in the Rue Morgue," perhaps you remember
              the epigraph that Poe chose for this pioneer detective story: 
               What song the Syrens sang, or what name Achilles assumed when
                he hid himself among women, though puzzling questions, are not
                beyond all conjecture.1 I believe that the problem of how Einstein discovered the special
              theory of relativity (SRT) falls into this category of "puzzling
              questions," that "are not beyond all conjecture."2
              Let me begin by explaining why. When I started work on the Einstein Papers, there was already a
              large literature on the origins of SRT compared, say, to the rather
              scanty amount published on the origins of the general theory of
              relativity (GRT). So I assumed that the development of SRT must
              be fairly clear. However, I soon learned that the amount of work
              published on the origin of SRT and GRT are just about inversely
              proportional to the available primary source material. For GRT,
              we have a series of Einstein's papers from 1907 to 1915, capturing
              the successive steps of his search for the final version of the
              theory. In addition, there is extensive contemporary correspondence
              on the subject, several research notebooks, records of lectures
              given by Einstein during this period, not to mention a number of
              later reminiscences and historical remarks by Einstein.3 For SRT we have the paper On the Electrodynamics of Moving
                  Bodies, in which the theory was first set forth in 1905
                  in its finished form, indeed a rather polished form (which is
                  not to say that it bears no traces of its gestation process).
                  The only earlier documentary evidence consists of literally
                  a couple of sentences to be found in the handful of preserved
                  early Einstein letters (I will quote both sentences later).
                  We do have a number of later historical remarks by Einstein
                  himself, sometimes transmitted by others (Wertheimer, Reiser-Kayser,
                  Shankland, Ishiwara, for example), which raise many problems
                  of authenticity and accuracy; and some very late Einstein letters,
                  answering questions such as whether he had prior knowledge of
                  the Michelson-Morley experiment, what works by Lorentz he had
                  read, the influence of Poincaré, Mach, Hume, etc., on
                  his ideas; Einstein's replies are not always self-consistent,
                  it must be noted.4 Yet the urge to provide an answer to the question of the discovery
                  of SRT has proven irresistible to many scholars. It is not hard
                  to see why: A twenty-six year old patent expert (third class),
                  largely self-taught in physics, who had never seen a theoretical
                  physicist (as he later put it), let alone worked with one, author
                  of several competent but not particularly distinguished papers,
                  Einstein produced four extraordinary works in the year 1905,
                  only one of which (not the relativity paper) seemed obviously
                  related to his earlier papers. These works exerted the most
                  profound influence on the development of physics in the 20th
                  Century. How did Einstein do it? Small wonder that Tetu Hirosige,
                  Gerald Holton, Arthur I. Miller, Abraham Pais, John Earman,
                  Clark Glymour, Stanley Goldberg, Robert Rynasiewicz, Roberto
                  Torretti, et al., have been moved to study this question.
                  I shall not try to record my debts to and differences with each
                  of these scholars, lest this survey become even longer and more
                  tedious than it is already; but must at least acknowledge the
                  influence of their work on my own.5
                  I resisted the urge to conjecture for some years, but have finally
                  succumbed, so I can well understand the temptation. Contrary to my original, naive expectation,
              no general consensus has emerged from all this work. Given the nature
              of the available documentation and the difficulty of understanding
              any creative process-let alone that of a genius-this really is not
              surprising. I now believe that the most one can hope to do in discussing
              the discovery of SRT is to construct a plausible conjecture. Such
              a conjecture will be based upon a certain weighting of the scanty
              evidence we possess, based upon certain methodological hypotheses,
              as well as the imagination of the conjecturer.6
              There are bound to be differences of opinion in these matters. All
              one can demand is that it be made clear on what methodological hypotheses
              a conjecture is based, and a demonstration that the conjecture is
              in accord with the available evidence when the latter is weighted
              in accord with these hypotheses. Let me emphasize that no such account can hope to encompass
                  those elements of the creative process that Einstein referred
                  to as "the irrational, the inconsistent, the droll, even the
                  insane, which nature, inexhaustibly operative, implants into
                  the individual, seemingly for her own amusement," for "These
                  things are singled out only in the crucible of one's own mind."
                  Yet one may draw courage for the type of conjecture I have in
                  mind from another remark of Einstein:  "A new idea comes suddenly and in a rather intuitive way. That
              means it is not reached by conscious logical conclusions. But, thinking
              it through afterwards, you can always discover the reasons which
              have led you unconsciously to your guess and you will find a logical
              way to justify it. Intuition is nothing but the outcome of earlier
              intellectual experience." I shall discuss only this intellectual, logical side of Einstein's
              struggles. Before trying to reconstruct these struggles, it is well
              to note that his outward existence was far from tranquil during
              the period when he was developing SRT. While attending the Polytechnic
              at Zurich, thanks to the support of maternal relatives, he was plagued
              by the thought that he was unable to help his family, which was
              in dire financial straits due to constant business reverses. He
              was the only graduate in his section (VIA) not to get an academic
              post, and lived a hand-to-mouth existence for almost two years,
              until he got a job at the Swiss Patent Office thanks to help from
              a friend's father. During this period he was under severe family
              pressure to break with his fiancee, whom he only married in 1903
              after his father's death. His first child was born in 1904, and
              he had to support wife and child on his modest income from the Patent
              Office, while his mother found work as a housekeeper. So one must
              not think of Einstein as a tranquil academic, brooding at leisure
              on weighty intellectual problems. Rather one must imagine him fitting
              his intellectual work into the interstices of a professional career
              and personal life that might have overwhelmed someone with a different
              nature. The main methodological hypothesis
              guiding my conjecture was stated by Hans Reichenbach some time ago:
              "...the logical schema of the theory of relativity corresponds surprisingly
              with the program which controlled its discovery." To put it in more
              hifalutin' terms, also due to Reichenbach, I believe that "the context
              of justification" of SRT used by Einstein can shed light on "the
              context of its discovery."7
              This hypothesis suggests that we can learn a good deal about the
              development of the theory by paying close attention to the logical
              structure of its initial presentation in 1905, and to the many accounts
              of the theory that Einstein gave afterwards. Of course, I have tried
              not to neglect any scrap of evidence known to me, including the
              pitifully small amount of contemporary documentation and the later
              reminiscences. But I have given special weight to Einstein's early
              papers, letters, and lectures, in which he sought to justify the
              theory to his contemporaries. Intellectually, Einstein was an exceedingly
              self-absorbed person, willing to go over and over the grounds for
              the theory again and again. These accounts, given over a number
              of years, are remarkably self-consistent. They provide evidence
              for a number of conjectures about the course of development of his
              own ideas, and occasionally even include explicit statements about
              it. I assume that by and large memory tends to deteriorate with
              time, and (worse) that pseudo "memories" tend to develop and even
              displace correct recollections. So, a second methodological hypothesis
              which I shall adopt is that, in case of discrepancies between such
              accounts, earlier ones are to be given greater weight than later
              ones. Explicit remarks that Einstein makes about the discovery of
              SRT in the course of his later expositions must always be given
              great weight, but the earlier he made them the greater the weight
              I give to them. Of course, if some feature of Einstein's accounts
              remains unchanged over many years, I take this as evidence for giving
              such a point the most weight.8 It follows from these methodological assumptions that I must preface
                  my conjectures with a brief resume of the "logical schema of
                  the theory of relativity" as it was first published in the 1905
                  paper. In this paper, as in almost all subsequent accounts,
                  Einstein bases SRT on two fundamental principles: the principle
                  of relativity and the principle of the constancy of the velocity
                  of light. The principle of relativity originated in Galilean-Newtonian
                  mechanics: Any frame of reference in which Newton's law of inertia
                  holds (for some period of time) is now called an inertial frame
                  of reference. From the laws of mechanics it follows that, if
                  one such inertial frame exists, then an infinity of them must:
                  All frames of reference (and only such frames) moving with constant
                  velocity with respect to a given inertial frame are also inertial
                  frames. All mechanical experiments and observations proved to
                  be in accord with the (mechanical) principle of relativity:
                  the laws of mechanics take the same form in any of these inertial
                  frames. The principle of relativity, as Einstein stated it in
                  1905, asserts that all the laws of physics take the
                  same form in any inertial frame-in particular, the laws of electricity,
                  magnetism, and optics in addition to those of mechanics. The second of Einstein's principles
              is based on an important consequence of Maxwell's laws of electricity,
              magnetism, and optics, as interpreted by H. A. Lorentz near the
              end of the nineteenth century. Maxwell had unified optics with electricity
              and magnetism in a single theory, in which light is just one type
              of electromagnetic wave. It was then believed that any wave must
              propagate through some mechanical medium. Since light waves easily
              propagate through the vacuum of interstellar space, it was assumed
              that any vacuum, though empty of ordinary, ponderable matter, was
              actually filled by such a medium, to which our senses did not respond:
              the ether. The question then arose, how does this medium behave
              when ordinary matter is present? In particular, is it dragged along
              by the motion of matter? Various possible answers were considered
              in the course of the nineteenth century, but finally only one view
              seemed compatible with (almost) all the known experimental results,
              that of H. A. Lorentz: The ether is present everywhere. Ordinary
              matter is made up of electrically charged particles, which can move
              through the ether, which is basically immobile. These charged particles,
              then called "electrons" or "ions", produce all electric and magnetic
              fields (including the electromagnetic waves we perceive as light),
              which are nothing but certain excited states of the immovable ether.
              The important experimental problem then arose of detecting the motion
              of ponderable matter-the earth in particular-through the ether. No other theory came remotely close to Lorentz's in accounting
                  for so many electromagnetic and especially optical phenomena.
                  This is not just my view of Lorentz's theory, it was Einstein's
                  view. In particular, he again and again cites the abberration
                  of starlight and the results of Fizeau's experiment on the velocity
                  of light in flowing water as decisive evidence in favor
                  of Lorentz's interpretation of Maxwell's equations. A direct consequence of Lorentz's conception of the stationary
              ether is that the velocity of light with respect to the ether is
              a constant, independent of the motion of the source of light (or
              its frequency, amplitude, or direction of propagation in the ether,
              etc.). Einstein adopted a slightly-but crucially-modified version of this
              conclusion as his second principle: There is an inertial frame
              in which the speed of light is a constant, independent of the velocity
              of its source. A Lorentzian ether theorist could agree at once to
              this statement, since it was always tacitly assumed that the ether
              rest frame is an inertial frame of reference and Einstein had "only"
              substituted "inertial frame" for "ether." But Einstein's omission
              of the ether was deliberate and crucial: by the time he formulated
              SRT he did not believe in its existence. For Einstein a principle
              was just that: a principle-a starting point for a process of deduction,
              not a deduction from any (ether) theory. (I am here getting ahead
              of my story and will return to this point later.) The Lorentzian
              ether theorist would add that there can only be one inertial
              frame in which the light principle holds. If the speed of light
              is a constant in the ether frame, it must be non-constant in every
              other inertial frame, as follows from the (Newtonian) law of addition
              of velocities. The light principle hence seems to be incompatible
              with the relativity principle. For, according to the relativity
              principle, all the laws of physics must be the same in
              any inertial frame. So, if the speed of light is constant in one
              inertial frame, and that frame is not physically singled out by
              being the rest frame of some medium (the ether), then the speed
              of light must be the same (universal) constant in every
              other inertial frame (otherwise the democracy of inertial frames
              is violated). As Einstein put it in 1905, his two principles are
              "apparently incompatible." Of course, if they really were incompatible
              logically or physically, that would be the end of SRT.9 Einstein showed that they are not only logically compatible, but
              compatible with the results of all optical and other experiments
              performed up to 1905 (and since, we may add). He was able to show
              their logical compatibility by an analysis of the concepts of time,
              simultaneity, and length, which demonstrated that the speed of light
              really could have the privileged status, implied by his two principles,
              of being a universal speed, the same in every inertial frame of
              reference.10 Now I shall begin my conjecture
              about Einstein's discovery of SRT. In a 1921 lecture, Einstein stated
              that SRT originated from his interest in the problem of the optics
              of moving bodies. He seems to have been fascinated from an early
              age by the nature of light, a fascination that persisted throughout
              his life. From an essay he wrote in 1895, (at age 16), we know that
              he then believed in the ether, and had heard of Hertz's experiments
              on the propagation of electromagnetic waves; but he does not show
              any knowledge of Maxwell's theory. In much later reminiscences,
              he reports that during the following year (1895-1896) he conceived
              of a thought experiment: what would happen if an observer tried
              to chase a light wave? Could s/he catch up with it? If so, s/he
              ought to see a non-moving light wave form, which somehow seemed
              strange to him. In retrospect, he called this "the first childish
              thought-experiment that was related to the special theory of relativity."
              Reliable accounts inform us that during his second year (1897-98)
              at the Swiss Federal Technical Institute, or Poly as it was then
              called, he tried to design an experiment to measure the velocity
              of the earth through the ether, being then unacquainted with either
              the theoretical work on this problem by Lorentz or the experiment
              of Michelson and Morley (M-M). A precious bit of contemporary documentary
              evidence reinforces this later account. In a letter to his schoolmate
              and friend Marcel Grossmann, written in the summer of 1901 (by then
              both had graduated from the Poly), Einstein wrote: 
                   A considerably simpler method for the investigation of the relative
                    motion of matter with respect to the light ether has again
                    occurred to me, which is based on ordinary interference experiments.
                    If only inexorable destiny gives me the time and peace necessary
                    to carry it out. At first sight, it would seem remarkable for Einstein to have written
              these words (which also show that he had not yet abandoned the concept
              of the ether), if he knew about the M-M experiment at this time. However, while still at the Poly (i.e., before 1901) he appears
              to have studied Maxwell's theory (not covered in his school lectures)
              on his own, perhaps from the new textbook of August Föppl (which,
              in various reincarnations, such as Föppl-Abraham, Abraham-Becker,
              Becker-Sauter, has stayed in print to this day). Föppl discusses
              a problem which evidently made a strong and lasting impression on
              Einstein, since he opens the 1905 paper with a discussion of it.
              This is the problem of the relative motion of a magnet and a conducting
              wire loop. If the loop is at rest in the ether and the magnet is
              moved with a given velocity, a certain electric current is induced
              in the loop. If the magnet is at rest, and the loop moves with the
              same relative velocity, a current of the same magnitude
              and direction is induced in the loop. However, the ether theory
              gives a different explanation for the origin of this current in
              the two cases. In the first case an electric field is supposed to
              be created in the ether by the motion of the magnet relative to
              it (Faraday's law of induction). In the second case, no such electric
              field is supposed to be present since the magnet is at rest in the
              ether, but the current results from the motion of the loop through
              the magnetic field (Lorentz force law). This asymmetry of explanation,
              not reflected in any difference in the phenomena observed, must
              already have been troubling to Einstein. Even more troubling was
              the knowledge, when he acquired it, that all attempts to detect
              the motion of ponderable matter through the ether had failed. This
              was an "intolerable" (his word, about 1920) situation. Observable
              electromagnetic phenomena depend only on the relative motions
              of ponderable matter; their explanations differ, however, depending
              on the presumed state of motion of that matter relative to the hypothetical
              ether; yet all attempts to detect this presumed motion of ordinary
              matter relative to the ether end in failure! He later (c. 1920)
              recalled that the phenomenon of electromagnetic induction compelled
              him to adopt the relativity principle. In 1938 he wrote "The empirically suggested non-existence of such
              an [ether wind] is the main starting point [point of departure]
              for the special theory of relativity."11
              It is not clear when the significance of the failure of all attempts
              to detect the motion of ordinary matter through the ether first
              struck him. The letter quoted above suggests that it was after the
              summer of 1901. We know from a letter to another friend, Michele
              Besso, dating from early 1903, that he had decided to "carry out
              comprehensive studies in electron theory." No later than that, and
              quite possibly earlier, he read Lorentz's 1895 book, "Attempt at
              a Theory of Electrical and Optical Phenomena in Moving Bodies."
              Einstein surely learned about, the many such failures by reading
              this book, since one of its main purposes was to show that such
              failures were compatible with Lorentz's stationary ether theory.
              His later comments suggest that study of this book (Einstein says
              this is the only work by Lorentz he read before 1905) convinced
              him of the essential superiority of Lorentz' approach to the optics
              of moving bodies; yet it also convinced him that the Lorentz theory
              was still not fully satisfactory. Lorentz could explain away the
              failure to detect motion of matter relative to the ether convincingly
              to Einstein in all cases but one: the M-M experiment. To explain
              this, Lorentz had to introduce a special hypothesis, which to Einstein
              seemed completely unconnected with the rest of the theory: the famous
              Lorentz contraction. To Einstein, such an approach was not a satisfactory
              way out of the "intolerable dilemma." It seemed preferable to him
              to accept at face value the failure of the M-M and all similar experiments
              to detect motion of matter relative to the ether. Taken by themselves,
              these negative results suggested to Einstein that the relativity
              principle applied to electromagnetism, while the ether should be
              dropped as superfluous. There has been some confusion on this important
              point, so I shall expand on it. Sometimes the case is presented
              in such a way as to suggest that it was the "philosophical concept"
              of the relativity of all motion, as Einstein once called it, which
              was the key step in his rejection of the ether. But the concept
              of a stationary ether, as well as of a moving ether, is quite compatible
              with this philosophical concept of the relativity of motion: one
              need only assume that motions relative to the ether in the first
              case, as well as relative motions of the parts of the ether in the
              second, have physical efficacy. The leading advocates of both the
              dragged-along and the immovable ether concepts, Hertz and Lorentz,
              respectively, both understood this and both were read by Einstein.12 By the time he gave up the ether
              concept, Einstein most likely took this philosophical conception
              of the relativity of all motion for granted, presumably under the
              influence of his early reading of Mach's Mechanics (around
              1897). What bothered him now was that no phenomenon existed that
              could be interpreted as empirical evidence for the physical efficacy
              of the motion of ordinary matter relative to the ether, in spite
              of repeated efforts to find one. Yet the best available theory-
              Lorentz's theory-could only attempt to explain away such failures.
              These explanations were satisfactory, within the framework of Lorentz'
              theory, in almost all known cases (i.e., for all experiments sensitive
              only to order v/c), and Einstein even seems to have been
              tempted to give up what we may call his physical relativity principle
              (with no ether needed). But Lorentz's explanation of the M-M experiment
              seemed to Einstein so artificial that he resisted this temptation,
              opting for the physical relativity principle. After eliminating
              the ether from the story altogether, one can simply take the results
              of the M-M and similar experiments as empirical evidence for the
              equivalence of all inertial frames for the laws of electricity,
              magnetism and optics as well as those of mechanics. I believe Einstein
              gave up the ether concept and definitely opted for the physical
              relativity principle at least a couple of years before the final
              formulation of SRT, perhaps even earlier. At any rate, at some point
              well before the 1905 formulation of the theory, he made this choice
              and adhered to it thereafter. There was a related motive for his skepticism with regard to the
              ether, which I shall now mention. Not only was Einstein working
              on problems of the optics of moving bodies, he was also working
              on problems related to the emission and absorption of light by matter
              and of the equilibrium behavior of electromagnetic radiation confined
              in a cavity-the so-called black body radiation problem. He was using
              Maxwell's and Boltzmann's statistical methods, which he had redeveloped
              and refined in several earlier papers, to analyze this problem.
              This was itself a daring step, since these methods had been developed
              to help understand the behavior of ordinary matter while Einstein
              was applying them to the apparently quite different field of electromagnetic
              radiation.13
              The "revolutionary" conclusion to which he came was that, in certain
              respects, electromagnetic radiation behaved more like a collection
              of particles than like a wave. He announced this result in a paper
              published in 1905, three months before his SRT paper. The idea that
              a light beam consisted of a stream of particles had been espoused
              by Newton and maintained its popularity into the middle of the 19th
              century. It was called the "emission theory" of light, a phrase
              I shall use. The need to explain the phenomena of interference,
              diffraction and polarization of light gradually led physicists to
              abandon the emission theory in favor of the competing wave theory,
              previously its less-favored rival. Maxwell's explanation of light
              as a type of electromagnetic wave seemed to end the controversy
              with a definitive victory of the wave theory. However, if Einstein
              was right (as events slowly proved he was) the story must be much
              more complicated. Einstein was aware of the difficulties with Maxwell's
              theory-and of the need for what we now call a quantum theory of
              electromagnetic radiation-well before publishing his SRT paper.
              He regarded Maxwell's equations as some sort of statistical average-of
              what he did not know, of course-which worked very well to explain
              many optical phenomena, but could not be used to explain all the
              interactions of light and matter. A notable feature of his first
              light quantum paper is that it almost completely avoids mention
              of the ether, even in discussing Maxwell's theory. Giving up the
              ether concept allowed Einstein to envisage the possibility that
              a beam of light was "an independent structure," as he put it a few
              years later, "which is radiated by the light source, just as in
              Newton's emission theory of light." So abandonment of the concept of the ether was a most important
                  act of liberation for Einstein's thought in two respects: It
                  allowed Einstein to speculate more boldly on the nature of light
                  and it opened the way for adoption of his relativity principle
                  as a fundamental criterion for all physical laws. I must add
                  a word about Einstein's use of such principles as a guide to
                  further research. In 1919 he explicitly formulated a broad distinction
                  between constructive theories and theories of principle. Constructive
                  theories attempt to explain some limited group of phenomena
                  by means of some model, some set of postulated theoretical entities.
                  For example, many aspects of the behavior of a gas could be
                  explained by assuming that it was composed of an immense number
                  of constantly colliding molecules. Theories of principle formulate
                  broad regularities, presumably obeyed by all physical phenomena,
                  making these principles criteria ("rules of the game") that
                  any constructive theory must satisfy. For example, the principles
                  of thermodynamics are presumed to govern all macroscopic phenomena.
                  They say nothing about the, micro-structure or detailed behavior
                  of any particular gas, but do constitute limitations on any
                  acceptable constructive theory of such a gas. Any theory not
                  conserving the energy of the gas, for example, would be immediately
                  rejected. Since the turn of the century, Einstein had been searching
                  for a constructive theory of light, capable of explaining all
                  of its properties on the basis of some model, and was to continue
                  the search to the end of his days. But, "Despair[ing] of the
                  possibility of discovering the true answer by constructive efforts,"
                  as he later put it, he decided that the only possible way of
                  making progress in the absence of such a constructive theory
                  was to find some set of principles that could serve to limit
                  and guide the search for a constructive theory.14
                  There is no contemporary evidence showing when Einstein adopted
                  this point of view (he first indicated it in print as early
                  as 1907). I believe he had done so by 1905. The structure of
                  the 1905 SRT paper is certainly compatible with his having done
                  so. It is based on the statement of two such principles, deduction
                  of various kinematic consequences from them, and their application
                  to Maxwell's electrical and optical theory. To return to the main thread
                  of my conjecture, I believe that Einstein dropped the ether
                  hypothesis and adopted his relativity principle by 1903 or 1904
                  at the latest. This is by no means the end of the story. It
                  seemed that he must then drop Lorentz's version of Maxwell's
                  theory, based as it was on the ether hypothesis. With what was
                  he to replace it? There is good evidence suggesting he spent
                  a great deal of effort trying to replace it with an emission
                  theory of light-the sort of theory suggested by his concurrent
                  researches into the quantum nature of light.15
                  An emission theory is perfectly compatible with the relativity
                  principle. Thus, the M-M experiment presented no problem; nor
                  is stellar abberration difficult to explain on this basis.16 Einstein seems to have wrestled with the problems of an emission
              theory of light for some time, looking for a set of differential
              equations describing such a theory that could replace the Maxwell-Lorentz
              equations; and trying to explain a number of optical experiments,
              notably the Fizeau experiment, based on some version of the emission
              theory. He could not find any such equations, and his attempt to
              explain the Fizeau experiment led him to more and more bizarre assumptions
              to avoid an outright contradiction. So he more-or-less abandoned
              this approach (you will soon see why I say more-or-less), after
              perhaps a year or more of effort, and returned to a reconsideration
              of the Maxwell-Lorentz equations. Perhaps there was a way of making
              these equations compatible with the relativity principle once one
              abandoned Lorentz's interpretation via the ether concept. But here he ran into the most blatant-seeming contradiction, which
              I mentioned earlier when first discussing the two principles. As
              noted then, the Maxwell-Lorentz equations imply that there exists
              (at least) one inertial frame in which the speed of light is a constant
              regardless of the motion of the light source. Einstein's version
              of the relativity principle (minus the ether) requires that, if
              this is true for one inertial frame, it must be true for all inertial
              frames. But this seems to be nonsense. How can it happen that the
              speed of light relative to an observer cannot be increased or decreased
              if that observer moves towards or away from a light beam? Einstein
              states that he wrestled with this problem over a lengthy period
              of time, to the point of despair. We have no details of this struggle,
              unfortunately. Finally, after a day spent wrestling
              once more with the problem in the company of his friend and patent
              office colleague Michele Besso, the only person thanked in the 1905
              SRT paper, there came a moment of crucial insight. In all of his
              struggles with the emission theory as well as with Lorentz's theory,
              he had been assuming that the ordinary Newtonian law of addition
              of velocities was unproblematic. It is this law of addition of velocities
              that allows one to "prove" that, if the velocity of light is constant
              with respect to one inertial frame, it cannot be constant with respect
              to any other inertial frame moving with respect to the first. It
              suddenly dawned on Einstein that this "obvious" law was based on
              certain assumptions about the nature of time always tacitly made.
              In particular, the concept of the velocity of an object with respect
              to an inertial frame depends on time readings made at two different
              places in that inertial frame. (He later referred to this moment
              of illumination as "the step.")17
             How do we know that time readings at two such distant places are
              properly correlated? Ultimately this boils down to the question:
              how do we decide when events at two different places in the same
              frame of reference occur at the same time, i.e., simultaneously?
              Isn't universal simultaneity an intuitively obvious property of
              time? Here, I believe, Einstein was really helped by his philosophical
              readings. He undoubtedly got some help from his readings of Mach
              and Poincaré, but we know that he was engaged in a careful
              reading of Hume at about this time; and his later reminiscences
              attribute great significance to his reading of Hume's Treatise
              on Human Nature. What could he have gotten from Hume? I think
              it was a relational-as opposed to an absolute-concept of time and
              space. This is the view that time and space are not to be regarded
              as self-subsistent entities; rather one should speak of the temporal
              and spatial aspects of physical processes; "The doctrine," as Hume
              puts it, "that time is nothing but the manner, in which some real
              object exists." I believe the adoption of such a relational concept
              of time was a crucial step in freeing Einstein's outlook, enabling
              him to consider critically the tacit assumptions about time going
              into the usual arguments for the "obvious" velocity addition law.
              This was the second great moment of liberation of his thought.  I shall not rehearse Einstein's arguments here, but it led to the
              radically novel idea that, once one physically defines simultaneity
              of two distant events relative to one inertial frame of reference,
              it by no means follows that these two events will be simultaneous
              when the same definition is used relative to another inertial
              frame moving with respect to the first. It is not logically excluded
              that they are simultaneous relative to all inertial frames.
              If we make that assumption, we are led back to Newtonian kinematics
              and the usual velocity addition law, which is logically quite consistent.
              However, if we adopt the two Einstein principles, then we are led
              to a new kinematics of time and space, in which the velocity of
              light is a universal constant, while simultaneity is different with
              respect to different inertial frames; this is also logically quite
              consistent. The usual velocity addition law is then replaced by
              a new one, in which the velocity of light "added" to any other velocity
              ("added" in a new sense-it would be better to say "compounded with")
              does not increase, but stays the same! The Maxwell-Lorentz equations,
              when examined with the aid of this new kinematics, prove to take
              the same form in every inertial frame. They are, therefore, quite
              compatible with the relativity principle, which demands that the
              laws of electricity, magnetism and optics have this property. The
              presence or absence of an electric or magnetic field, is then also
              found to be relative to an inertial frame, allowing a completely
              satisfactory relativistic analysis of the example of the conducting
              wire loop and magnet in relative motion. Within six weeks of taking
              "the step," Einstein later recalled, he had worked out all of these
              consequences and submitted the 1905 SRT paper to Annalen der
              Physik. This does not imply that Lorentz's equations are adequate to explain
              all the features of light, of course. Einstein already knew they
              did not always correctly do so-in particular in the processes of
              its emission, absorption and its behavior in black body radiation.
              Indeed, his new velocity addition law is also compatible with an
              emission theory of light, just because the speed
              of light compounded with any lesser velocity still yields the same
              value. If we model a beam of light as a stream of particles, the
              two principles can still be obeyed. A few years later (1909), Einstein
              first publicly expressed the view that an adequate future theory
              of light would have to be some sort of fusion of the wave and emission
              theories. This is an example of how the special theory of relativity
              functioned as a theory of principle, limiting but not fixing the
              choice of a constructive theory of light. Here I shall end my conjectures
              on how Einstein arrived at SRT. To briefly recapitulate, I believe
              that the first principle, the relativity principle, recapitulates
              his struggles with the mechanical ether concept which led finally
              to the first crucial liberation of his thought-the abandonment of
              the ether. The second principle, the principle of the constancy
              of the speed of light, recapitulates his struggle, once he had definitely
              opted for the relativity principle, first to evade the Maxwell-Lorentz
              theory by an emission theory; then to isolate what was still valid
              in the Maxwell Lorentz theory after giving up the ether concept
              and abandoning absolute faith in the wave theory of light. The struggle
              to reconcile the two principles could only end successfully after
              the second great liberation of his thought: the relativisation of
              the concept of time. The resulting theory did not force him to choose
              between wave and emission theories of light, but rather led him
              to look forward to a synthesis of the two. This synthesis was finally
              achieved, over twenty years later, in the quantum theory of fields,
              to the satisfaction of most physicists, but ironically, never to
              that of Einstein.  I cannot ask you to accept my conjectures after all of my warnings
                  at the outset of this paper, but will be content if you say
                  "Si non è vero, è ben trovato," "If it isn't true,
                  it's well contrived."       You can EXIT
              to Einstein's own words on "Ether and the Theory of Relativity"
              (1920 address) 
 Notes 1. Poe is quoting Sir Thomas Browne's
              Hydrotaphia. BACK 2. A preliminary question is raised
              by my use of the word "discovery." Is it better to speak of the
              "discovery" or the "creation" of a theory like SRT? "Discovery"
              suggests the finding of some pre-existent, objective structure,
              as when we say "Columbus discovered America." "Creation" suggests
              an individual, subjective act, as when we say "Tolstoy created Anne
              Karenina." Neither word seems really appropriate to describe
              what goes on in the scientific endeavor. Einstein apparently preferred
              the word "Erfindung" (invention) to describe how scientific theories
              come into being. Speaking of Mach, Einstein says: "Er meinte gewissermassen,
              dass Theorien durch Entdeckung und nicht durch Erfindung
              entstehen." (Einstein-Besso Correspondence (Hermann, Paris
              1972), p. 191, dated January 6, 1948. BACK  3. In the study of the discovery
              of GRT, therefore, one may hope to formulate conjectures which can
              be either confirmed or refuted. For example: A study of Einstein's
              published papers and private correspondence between 1912-1915 convinced
              me that the standard explanation for his failure to arrive at the
              correct gravitational field equations until the end of this period-namely,
              his presumed lack of understanding of the meaning of freedom of
              coordinate transformations in a generally covariant theory and the
              ability to impose coordinate conditions that this freedom implied-could
              not be correct (see "Einstein's Search for General Covariance, 1912-1915,"
              presented at the Ninth International Conference on General Relativity
              and Gravitation, July 17, 1980, in Stachel Einstein from "B"
              to "Z", pp. 301338). On the basis of his study of a research
              notebook of Einstein from the early part of this period, John Norton
              was able to prove that Einstein already was aware of the possibility
              of imposing coordinate conditions on a set of field equations, and
              indeed had used the harmonic coordinate conditions (see John Norton,
              "How Einstein found his field equations: 1912-1915," Historical
              Studies in the Physical Sciences 14, 253 (1984). For reasons
              discussed in the text, one cannot hope to confirm or disconfirm
              most conjectures about the origins of SRT.BACK  4. For a survey of this material
              for the period up to 1923, see J. Stachel, "Einstein and Michelson:
              The Context of Discovery and the Context of Justification," Astron.
              Nachricht. 303, 47 (1982). Unless otherwise noted,
              quotations from Einstein are cited from this paper, which gives
              the full references. [See Stachel, Einstein from "B" to "Z",
              pp. 177-190].BACK 5. See Arthur I. Miller, Albert
              Einstein's Special Theory of Relativity (Addison-Wesley, Reading
              1981), which contains references to his earlier papers as well as
              those of Holton, Hirosige and many others; Abraham Pais, 'Subtle
              is the Lord. . .' The Science end the Life of Albert Einstein
              (Oxford U.P., New York 1982); Stanley Goldberg, Understanding
              Relativity (Birkhauser, Boston 1984); Roberto Torretti, Relativity
              end Geometry (Pergamon, Oxford 1983). Earman, Glymour and Rynasiewicz
              have not yet published a full account of their views; I thank them
              for making available copies of several preprints on this subject.
              BACK 6. A popular epigram among historians
              runs: "God is omnipotent, but even He cannot change the past. That
              is why He created historians." 
              BACK 7.See the reference in
              footnote 4 for the source of the citations from Reichenbach. If
              my thesis here is correct, this argues against the still widely
              held view that these two contexts should be rigorously separated.
              But in this paper I shall not elaborate on the wider issue.
              BACK 8. For example, Einstein's
              statements of the second principle of SRT, the light principle,
              remained remarkably consistent throughout his lifetime (see the
              discussion of this principle below). Indeed, an apparent exception
              in the printed text of his article "What is the Theory of Relativity?,"
              published originally in English translation in the Times
              of London in 1919, proved to be based upon an incorrect transcription
              of his manuscript.  BACK 9. Much of the anti-relativity
              literature, which still continues to grow in volume if not in weight,
              is based on attempts to show that the two principles are indeed
              logically incompatible.  BACK 10.  Sometimes
              (e.g., by Pais and Goldberg), this consequence of Einstein's two
              principles is asserted to be his second principle. This is incorrect
              factually (Einstein's account of the second principle is one of
              the most consistent features of his discussions of SRT over the
              years-see footnote 8), and disturbing for several reasons: a) it
              makes it impossible to explain why Einstein refers to the two principles
              as apparently contradictory. There is no contradiction apparent
              between the relativity principle and this deduction from it; b)
              it is logically defective, since the two principles would no longer
              be logically independent, as they are in Einstein's formulation;
              c) most important for present purposes, this formulation deprives
              us of important clues to Einstein's reasoning that led to the development
              of SRT.  BACK 11. Einstein to Max Talmey,
              June 6, 1938. The German text reads: "Die empirisch suggerierte
              Nichtexistenz einer solchen bevorzugten 'Wind-Richtung' ist der
              Haupt-Ausgangspunkt der speziellen Relativitatstheorie."
              BACK 12.  Hertz
              said: "... the absolute motion of a rigid system of bodies has no
              effect upon any internal electromagnetic processes whatever in it,
              provided that all the bodies under consideration, including the
              ether as well, actually share the motion." (Electromagnetic
              Waves, p. 246). Lorentz said: 
               That one cannot speak of the absolute rest of the ether, is
                self-evident indeed; the expression wouldn't even have any meaning.
                If I say for short, the ether is at rest, this only means that
                one part of this medium is not displaced with respect to the others
                and that all perceptible movements of the heavenly bodies are
                relative movements with respect to the ether. [Versuch,
                p. 4 (1895).] BACK 13. He was not alone
              in transferring statistical methods from ordinary matter to radiation.
              Planck had already done so, but Einstein did not see the relation
              of his work to Planck's until after publishing his first paper on
              the subject.  BACK 14. See Albert Einstein,
              Autobiographical Notes (Open Court, LaSalle 1979), pp.
              48 (German text) and 49 (English translation). 
              BACK 15. One such piece of
              evidence, not cited in my earlier paper (see footnote 4), has only
              recently come to light. It occurs in the most complete review of
              SRT that Einstein ever wrote. It was prepared in 1912 but never
              published, and is still in private hands. Luckily, a copy has come
              into the possession of the Einstein Archive. In it, Einstein explains
              at some length the difficulties that are encountered (and presumably
              these are the ones he had encountered), if one tries to explain
              the results of the Fizeau experiment on the basis of an emission
              theory of light combined with the relativity principle and Galilei-Newtonian
              kinematics. [See The Collected Papers of Albert Einstein,
              vol. 4. The Swiss Years: Writings 1912-1914 (Princeton
              University Press, Princeton 1995), Doc. 1, "Manuscript on the Special
              Theory of Relativity," pp. 32-36].  BACK 16.  Indeed,
              the earliest explanation of stellar abberation had been based on
              the emission theory.  BACK 17. Abraham Pais has
              mentioned this in describing his conversations with Einstein. 
              BACK This is the text of "'What Song the Syrens Sang':
              How Did Einstein Discover Special Relativity?" as printed in John
              Stachel, Einstein
              from "B" to "Z" (Boston : Birkhäuser, 2002), pp. 157-169.
              It is an English text of "Quale canzone cantarono le sirene: come
              scopro Einstein la teoria speciale della relatività?" published
              in L'Opera di Einstein (1989), pp. 21-37. Copyright ©
              1989, 2002 by John Stachel. John Stachel is Professor of Physics
            Emeritus and Director of the Center of Einstein Studies at Boston
            University. He has written a variety of articles on aspects of the
            history of both special and general relativity and other topics, and
            has edited or co-edited a number of books dealing with Einstein and
            relativity.
 
 |