Project Gutenberg

The Principle of Relativity

Einstein, Albert, Minkowski, H. (Hermann)

20 chapters · 106 pages · 37,151 words

Opens the print dialog, where you can choose Save as PDF.

Del 1Page 1 / 106

Del 1

The Principle of Relativity

THE PRINCIPLE OF RELATIVITY

ORIGINAL PAPERS BY
A. EINSTEIN AND H. MINKOWSKI

TRANSLATED INTO ENGLISH BY
M. N. SAHA AND S. N. BOSE
LECTURERS ON PHYSICS AND APPLIED MATHEMATICS University College of Science, Calcutta University

WITH A HISTORICAL INTRODUCTION BY
P. C. MAHALANOBIS PROFESSOR OF PHYSICS, PRESIDENCY COLLEGE, CALCU.

PUBLISHED BY THE

UNIVERSITY OF CALCUTTA
1920
Sole Agents R. CAMBRAY & CO.
PRINTED BY ATULCHANDRA BHATTACHARYYA,
AT THE CALCUTTA UNIVERSITY PRESS, SENATE HOUSE, CALCUTTA

TABLE OF CONTENTS
1. Historical Introduction i-xxiii
[By Mr. P. C. Mahalanobis.]
2. On the Electrodynamics of Moving Bodies 1-34
[Einstein's first paper on the restricted Theory of Relativity, originally published in the Annalen der Physik in 1905. Translated from the original German by Dr. Meghnad Saha.]
3. Albrecht Einstein 35-39
[A short biographical note by Dr. Meghnad Saha.]
4. Principle of Relativity 1-52
[H. Minkowski's original paper on the restricted Principle of Relativity first published in 1909. Translated from the original German by Dr. Meghnad Saha.]
5. Appendix to the above by H. Minkowski 53-88
[Translated by Dr. Meghnad Saha.]
6. The Generalised Principle of Relativity 89-163
[A. Einstein's second paper on the Generalised Principle first published in 1916. Translated from the original German by Mr. Satyendranath Bose.]
7. Notes 165-185
Transcriber's Note:
The plain text version of this ebook includes complex mathematical formulas. Some are simple in-line expressions like k = 1 - 1/μ^2. They may include special notations such as x^y for x to the power of y, x_{y} for x with a subscript of y, [=a] for an 'a' with a bar across the top, [.a] for an 'a' with a dot over it, [..a] for an 'a' with two dots over it. Others are more complex "ASCII Art" like this:
l l 2lc 2l
t₁ = ------ + ------ = -------- = --- β²
c - u c + u c² - u² c
Some are so complex that they must be rendered in the TeX mathematical notation, enclosed between double dollar signs, like this:
$ \beta = (1 - \frac {u^2}{c^2})^{-\frac{1}{2}} $

HISTORICAL INTRODUCTION

Del 2Page 2 / 106

Del 2

Lord Kelvin writing in 1893, in his preface to the English edition of Hertz's Researches on Electric Waves, says "many workers and many thinkers have helped to build up the nineteenth century school of plenum, one ether for light, heat, electricity, magnetism; and the German and English volumes containing Hertz's electrical papers, given to the world in the last decade of the century, will be a permanent monument of the splendid consummation now realised."

Ten years later, in 1905, we find Einstein declaring that "the ether will be proved to be superfluous." At first sight the revolution in scientific thought brought about in the course of a single decade appears to be almost too violent. A more careful even though a rapid review of the subject will, however, show how the Theory of Relativity gradually became a historical necessity.

Towards the beginning of the nineteenth century, the luminiferous ether came into prominence as a result of the brilliant successes of the wave theory in the hands of Young and Fresnel. In its stationary aspect the elastic solid ether was the outcome of the search for a medium in which the light waves may "undulate." This stationary ether, as shown by Young, also afforded a satisfactory explanation of astronomical aberration. But its very success gave rise to a host of new questions all bearing on the central problem of relative motion of ether and matter.

Arago's prism experiment.—The refractive index of a glass prism depends on the incident velocity of light outside the prism and its velocity inside the prism after refraction. On Fresnel's fixed ether hypothesis, the incident light waves are situated in the stationary ether outside the prism and move with velocity c with respect to the ether. If the prism moves with a velocity u with respect to this fixed ether, then the incident velocity of light with respect to the prism should be c + u. Thus the refractive index of the glass prism should depend on u the absolute velocity of the prism, i.e., its velocity with respect to the fixed ether. Arago performed the experiment in 1819, but failed to detect the expected change.

Del 2Page 3 / 106

Del 2 continues

Airy-Boscovitch water-telescope experiment.—Boscovitch had still earlier in 1766, raised the very important question of the dependence of aberration on the refractive index of the medium filling the telescope. Aberration depends on the difference in the velocity of light outside the telescope and its velocity inside the telescope. If the latter velocity changes owing to a change in the medium filling the telescope, aberration itself should change, that is, aberration should depend on the nature of the medium.

Airy, in 1871 filled up a telescope with water—but failed to detect any change in the aberration. Thus we get both in the case of Arago prism experiment and Airy-Boscovitch water-telescope experiment, the very startling result that optical effects in a moving medium seem to be quite independent of the velocity of the medium with respect to Fresnel's stationary ether.

Fresnel's convection coefficient k = 1 - 1/μ^2.—Possibly some form of compensation is taking place. Working on this hypothesis, Fresnel offered his famous ether convection theory. According to Fresnel, the presence of matter implies a definite condensation of ether within the region occupied by matter. This "condensed" or excess portion of ether is supposed to be carried away with its own piece of moving matter. It should be observed that only the "excess" portion is carried away, while the rest remains as stagnant as ever. A complete convection of the "excess" ether ρ′ with the full velocity u is optically equivalent to a partial convection of the total ether ρ, with only a fraction of the velocity k. u. Fresnel showed that if this convection coefficient k is 1 - 1/μ^2 (μ being the refractive index of the prism), then the velocity of light after refraction within the moving prism would be altered to just such extent as would make the refractive index of the moving prism quite independent of its "absolute" velocity u. The non-dependence of aberration on the "absolute" velocity u, is also very easily explained with the help of this Fresnelian convection-coefficient k.

Del 2Page 4 / 106

Del 2 continues

Stokes' viscous ether.—It should be remembered, however, that Fresnel's stationary ether is absolutely fixed and is not at all disturbed by the motion of matter through it. In this respect Fresnelian ether cannot be said to behave in any respectable physical fashion, and this led Stokes, in 1845-46, to construct a more material type of medium. Stokes assumed that viscous motion ensues near the surface of separation of ether and moving matter, while at sufficiently distant regions the ether remains wholly undisturbed. He showed how such a viscous ether would explain aberration if all motion in it were differentially irrotational. But in order to explain the null Arago effect, Stokes was compelled to assume the convection hypothesis of Fresnel with an identical numerical value for k, namely 1 - 1/μ^2. Thus the prestige of the Fresnelian convection-coefficient was enhanced, if anything, by the theoretical investigations of Stokes.

Fizeau's experiment.—Soon after, in 1851, it received direct experimental confirmation in a brilliant piece of work by Fizeau.

Del 2Page 5 / 106

Del 2 continues

If a divided beam of light is re-united after passing through two adjacent cylinders filled with water, ordinary interference fringes will be produced. If the water in one of the cylinders is now made to flow, the "condensed" ether within the flowing water would be convected and would produce a shift in the interference fringes. The shift actually observed agreed very well with a value of k = 1 - 1/μ^2. The Fresnelian convection-coefficient now became firmly established as a consequence of a direct positive effect. On the other hand, the negative evidences in favour of the convection-coefficient had also multiplied. Mascart, Hoek, Maxwell and others sought for definite changes in different optical effects induced by the motion of the earth relative to the stationary ether. But all such attempts failed to reveal the slightest trace of any optical disturbance due to the "absolute" velocity of the earth, thus proving conclusively that all the different optical effects shared in the general compensation arising out of the Fresnelian convection of the excess ether. It must be carefully noted that the Fresnelian convection-coefficient implicitly assumes the existence of a fixed ether (Fresnel) or at least a wholly stagnant medium at sufficiently distant regions (Stokes), with reference to which alone a convection velocity can have any significance. Thus the convection-coefficient implying some type of a stationary or viscous, yet nevertheless "absolute" ether, succeeded in explaining satisfactorily all known optical facts down to 1880.

Michelson-Morley Experiment.—In 1881, Michelson and Morley performed their classical experiments which undermined the whole structure of the old ether theory and thus served to introduce the new theory of relativity. The fundamental idea underlying this experiment is quite simple. In all old experiments the velocity of light situated in free ether was compared with the velocity of waves actually situated in a piece of moving matter and presumably carried away by it. The compensatory effect of the Fresnelian convection of ether afforded a satisfactory explanation of all negative results.

Del 2Page 6 / 106

Del 2 continues

In the Michelson-Morley experiment the arrangement is quite different. If there is a definite gap in a rigid body, light waves situated in free ether will take a definite time in crossing the gap. If the rigid platform carrying the gap is set in motion with respect to the ether in the direction of light propagation, light waves (which are even now situated in free ether) should presumably take a longer time to cross the gap.

We cannot do better than quote Eddington's description of this famous experiment. "The principle of the experiment may be illustrated by considering a swimmer in a river. It is easily realized that it takes longer to swim to a point 50 yards up-stream and back than to a point 50 yards across-stream and back. If the earth is moving through the ether there is a river of ether flowing through the laboratory, and a wave of light may be compared to a swimmer travelling with constant velocity relative to the current. If, then, we divide a beam of light into two parts, and send one-half swimming up the stream for a certain distance and then (by a mirror) back to the starting point, and send the other half an equal distance across stream and back, the across-stream beam should arrive back first.

——>u
O
A—————........
| x
|
|B

Let the ether be flowing relative to the apparatus with velocity u in the direction Ox, and let OA, OB, be the two arms of the apparatus of equal length l, OA being placed up-stream. Let c be the velocity of light. The time for the double journey along OA and back is

l l 2lc 2l
t₁ = ------ + ------ = -------- = --- β²
c - u c + u c² - u² c

where

$ \beta = (1 - \frac {u^2}{c^2})^{-\frac {1}{2}} $

a factor greater than unity.

For the transverse journey the light must have a component velocity n up-stream (relative to the ether) in order to avoid being carried below OB: and since its total velocity is c, its component across-stream must be √(c² - u²), the time for the double journey OB is accordingly

$ t_2 = \frac {2a}{\sqrt {c^2 - u^2}} = \frac {2a}{c} \beta $

so that t₁ > t₂.

Del 2Page 7 / 106

Del 2 continues

But when the experiment was tried, it was found that both parts of the beam took the same time, as tested by the interference bands produced."

After a most careful series of observations, Michelson and Morley failed to detect the slightest trace of any effect due to earth's motion through ether.

The Michelson-Morley experiment seems to show that there is no relative motion of ether and matter. Fresnel's stagnant ether requires a relative velocity of—u. Thus Michelson and Morley themselves thought at first that their experiment confirmed Stokes' viscous ether, in which no relative motion can ensue on account of the absence of slipping of ether at the surface of separation. But even on Stokes' theory this viscous flow of ether would fall off at a very rapid rate as we recede from the surface of separation. Michelson and Morley repeated their experiment at different heights from the surface of the earth, but invariably obtained the same negative results, thus failing to confirm Stokes' theory of viscous flow.

Lodge's experiment.—Further, in 1893, Lodge performed his rotating sphere experiment which showed complete absence of any viscous flow of ether due to moving masses of matter. A divided beam of light, after repeated reflections within a very narrow gap between two massive hemispheres, was allowed to re-unite and thus produce interference bands. When the two hemispheres are set rotating, it is conceivable that the ether in the gap would be disturbed due to viscous flow, and any such flow would be immediately detected by a disturbance of the interference bands. But actual observation failed to detect the slightest disturbance of the ether in the gap, due to the motion of the hemispheres. Lodge's experiment thus seems to show a complete absence of any viscous flow of ether.

Del 2Page 8 / 106

Del 2 continues

Apart from these experimental discrepancies, grave theoretical objections were urged against a viscous ether. Stokes himself had shown that his ether must be incompressible and all motion in it differentially irrotational, at the same time there should be absolutely no slipping at the surface of separation. Now all these conditions cannot be simultaneously satisfied for any conceivable material medium without certain very special and arbitrary assumptions. Thus Stokes' ether failed to satisfy the very motive which had led Stokes to formulate it, namely, the desirability of constructing a "physical" medium. Planck offered modified forms of Stokes' theory which seemed capable of being reconciled with the Michelson-Morley experiment, but required very special assumptions. The very complexity and the very arbitrariness of these assumptions prevented Planck's ether from attaining any degree of practical importance in the further development of the subject.

Del 2Page 9 / 106

Del 2 continues

The sole criterion of the value of any scientific theory must ultimately be its capacity for offering a simple, unified, coherent and fruitful description of observed facts. In proportion as a theory becomes complex it loses in usefulness—a theory which is obliged to requisition a whole array of arbitrary assumptions in order to explain special facts is practically worse than useless, as it serves to disjoin, rather than to unite, the several groups of facts. The optical experiments of the last quarter of the nineteenth century showed the impossibility of constructing a simple ether theory, which would be amenable to analytic treatment and would at the same time stimulate further progress. It should be observed that it could scarcely be shown that no logically consistent ether theory was possible; indeed in 1910, H. A. Wilson offered a consistent ether theory which was at least quite neutral with respect to all available optical data. But Wilson's ether is almost wholly negative—its only virtue being that it does not directly contradict observed facts. Neither any direct confirmation nor a direct refutation is possible and it does not throw any light on the various optical phenomena. A theory like this being practically useless stands self-condemned.

We must now consider the problem of relative motion of ether and matter from the point of view of electrical theory. From 1860 the identity of light as an electromagnetic vector became gradually established as a result of the brilliant "displacement current" hypothesis of Clerk Maxwell and his further analytical investigations. The elastic solid ether became gradually transformed into the electromagnetic one. Maxwell succeeded in giving a fairly satisfactory account of all ordinary optical phenomena and little room was left for any serious doubts as regards the general validity of Maxwell's theory. Hertz's researches on electric waves, first carried out in 1886, succeeded in furnishing a strong experimental confirmation of Maxwell's theory. Electric waves behaved generally like light waves of very large wave length.

Del 2Page 10 / 106

Del 2 continues

The orthodox Maxwellian view located the dielectric polarisation in the electromagnetic ether which was merely a transformation of Fresnel's stagnant ether. The magnetic polarisation was looked on as wholly secondary in origin, being due to the relative motion of the dielectric tubes of polarisation. On this view the Fresnelian convection coefficient comes out to be ½, as shown by J. J. Thomson in 1880, instead of 1 - (1/μ²) as required by optical experiments. This obviously implies a complete failure to account for all those optical experiments which depend for their satisfactory explanation on the assumption of a value for the convection coefficient equal to 1 - (1/μ²).

The modifications proposed independently by Hertz and Heaviside fare no better.[1] They postulated the actual medium to be the seat of all electric polarisation and further emphasised the reciprocal relation subsisting between electricity and magnetism, thus making the field equations more symmetrical. On this view the whole of the polarised ether is carried away by the moving medium, and consequently, the convection coefficient naturally becomes unity in this theory, a value quite as discrepant as that obtained on the original Maxwellian assumption.

Thus neither Maxwell's original theory nor its subsequent modifications as developed by Hertz and Heaviside succeeded in obtaining a value for Fresnelian coefficient equal to 1 - (1/μ^2), and consequently stood totally condemned from the optical point of view.

Del 2Page 11 / 106

Del 2 continues

Certain direct electromagnetic experiments involving relative motion of polarised dielectrics were no less conclusive against the generalised theory of Hertz and Heaviside. According to Hertz a moving dielectric would carry away the whole of its electric displacement with it. Hence the electromagnetic effect near the moving dielectric would be proportional to the total electric displacement, that is to K, the specific inductive capacity of the dielectric. In 1901, Blondlot working with a stream of moving gas could not detect any such effect. H. A. Wilson repeated the experiment in an improved form in 1903 and working with ebonite found that the observed effect was proportional to K - 1 instead of to K. For gases K is nearly equal to 1 and hence practically no effect will be observed in their case. This gives a satisfactory explanation of Blondlot's negative results.

Rowland had shown in 1876 that the magnetic force due to a rotating condenser (the dielectric remaining stationary) was proportional to K, the sp. ind. cap. On the other hand, Röntgen found in 1888 the magnetic effect due to a rotating dielectric (the condenser remaining stationary) to be proportional to K - 1, and not to K. Finally Eichenwald in 1903 found that when both condenser and dielectric are rotated together, the effect observed was quite independent of K, a result quite consistent with the two previous experiments. The Rowland effect proportional to K, together with the opposite Röntgen effect proportional to 1 - K, makes the Eichenwald effect independent of K.

Del 2Page 12 / 106

Del 2 continues

All these experiments together with those of Blondlot and Wilson made it clear that the electromagnetic effect due to a moving dielectric was proportional to K - 1, and not to K as required by Hertz's theory. Thus the above group of experiments with moving dielectrics directly contradicted the Hertz-Heaviside theory. The internal discrepancies inherent in the classic ether theory had now become too prominent. It was clear that the ether concept had finally outgrown its usefulness. The observed facts had become too contradictory and too heterogeneous to be reduced to an organised whole with the help of the ether concept alone. Radical departures from the classical theory had become absolutely necessary.

There were several outstanding difficulties in connection with anomalous dispersion, selective reflection and selective absorption which could not be satisfactory explained in the classic electromagnetic theory. It was evident that the assumption of some kind of discreteness in the optical medium had become inevitable. Such an assumption naturally gave rise to an atomic theory of electricity, namely, the modern electron theory. Lorentz had postulated the existence of electrons so early as 1878, but it was not until some years later that the electron theory became firmly established on a satisfactory basis.

Lorentz assumed that a moving dielectric merely carried away its own "polarisation doublets," which on his theory gave rise to the induced field proportional to K - 1. The field near a moving dielectric is naturally proportional to K - 1 and not to K. Lorentz's theory thus gave a satisfactory explanation of all those experiments with moving dielectrics which required effects proportional to K - 1. Lorentz further succeeded in obtaining a value for the Fresnelian convection coefficient equal to 1 - 1/μ^2, the exact value required by all optical experiments of the moving type.

Del 2Page 13 / 106

Del 2 continues

We must now go back to Michelson and Morley's experiment. We have seen that both parts of the beam are situated in free ether; no material medium is involved in any portion of the paths actually traversed by the beam. Consequently no compensation due to Fresnelian convection of ether by moving medium is possible. Thus Fresnelian convection compensation can have no possible application in this case. Yet some marvellous compensation has evidently taken place which has completely masked the "absolute" velocity of the earth.

In Michelson and Morley's experiment, the distance travelled by the beam along OA (that is, in a direction parallel to the motion of the platform) is 2lβ², while the distance travelled by the beam along OB, perpendicular to the direction of motion of the platform, is 2lβ. Yet the most careful experiments showed, as Eddington says, "that both parts of the beam took the same time as tested by the interference bands produced. It would seem that OA and OB could not really have been of the same length; and if OB was of length l, OA must have been of length l/β. The apparatus was now rotated through 90°, so that OB became the up-stream. The time for the two journeys was again the same, so that 0B must now be the shorter length. The plain meaning of the experiment is that both arms have a length l when placed along Oy (perpendicular to the direction of motion), and automatically contract to a length l/β, when placed along Ox (parallel to the direction of motion). This explanation was first given by Fitz-Gerald."

This Fitz-Gerald contraction, startling enough in itself, does not suffice. Assuming this contraction to be a real one, the distance travelled with respect to the ether is 2lβ and the time taken for this journey is 2lβ/c. But the distance travelled with respect to the platform is always 2l. Hence the velocity of light with respect to the platform is

$ \frac {2l}{\frac {2l\beta}{c}} = \frac {c}{\beta} $

Del 2Page 14 / 106

Del 2 continues

a variable quantity depending on the "absolute" velocity of the platform. But no trace of such an effect has ever been found. The velocity of light is always found to be quite independent of the velocity of the platform. The present difficulty cannot be solved by any further alteration in the measure of space. The only recourse left open is to alter the measure of time as well, that is, to adopt the concept of "local time." If a moving clock goes slower so that one 'real' second becomes 1/β second as measured in the moving system, the velocity of light relative to the platform will always remain c. We must adopt two very startling hypotheses, namely, the Fitz-Gerald contraction and the concept of "local time," in order to give a satisfactory explanation of the Michelson-Morley experiment.

These results were already reached by Lorentz in the course of further developments of his electron theory. Lorentz used a special set of transformation equations[2] for time which implicitly introduced the concept of local time. But he himself failed to attach any special significance to it, and looked on it rather as a mere mathematical artifice like imaginary quantities in analysis or the circle at infinity in projective geometry. The originality of Einstein at this stage consists in his successful physical interpretation of these results, and viewing them as the coherent organised consequences of a single general principle. Lorentz established the Relativity Theorem[3] (consisting merely of a set of transformation equations) while Einstein generalised it into a Universal Principle. In addition Einstein introduced fundamentally new concepts of space and time, which served to destroy old fetishes and demanded a wholesale revision of scientific concepts and thus opened up new possibilities in the synthetic unification of natural processes.

Del 2Page 15 / 106

Del 2 continues

Newton had framed his laws of motion in such a way as to make them quite independent of the absolute velocity of the earth. Uniform relative motion of ether and matter could not be detected with the help of dynamical laws. According to Einstein neither could it be detected with the help of optical or electromagnetic experiments. Thus the Einsteinian Principle of Relativity asserts that all physical laws are independent of the 'absolute' velocity of an observer.

For different systems, the form of all physical laws is conserved. If we chose the velocity of light[4] to be the fundamental unit of measurement for all observers (that is, assume the constancy of the velocity of light in all systems) we can establish a metric "one-one" correspondence between any two observed systems, such correspondence depending only the relative velocity of the two systems. Einstein's Relativity is thus merely the consistent logical application of the well known physical principle that we can know nothing but relative motion. In this sense it is a further extension of Newtonian Relativity.

On this interpretation, the Lorentz-Fitzgerald contraction and "local time" lose their arbitrary character. Space and time as measured by two different observers are naturally diverse, and the difference depends only on their relative motion. Both are equally valid; they are merely different descriptions of the same physical reality. This is essentially the point of view adopted by Minkowski. He considers time itself to be one of the co-ordinate axes, and in his four-dimensional world, that is in the space-time reality, relative motion is reduced to a rotation of the axes of reference. Thus, the diversity in the measurement of lengths and temporal rates is merely due to the static difference in the "frame-work" of the different observers.

Del 2Page 16 / 106

Del 2 continues

The above theory of Relativity absorbed practically the whole of the electromagnetic theory based on the Maxwell-Lorentz system of field equations. It combined all the advantages of classic Maxwellian theory together with an electronic hypothesis. The Lorentz assumption of polarisation doublets had furnished a satisfactory explanation of the Fresnelian convection of ether, but in the new theory this is deduced merely as a consequence of the altered concept of relative velocity. In addition, the theory of Relativity accepted the results of Michelson and Morley's experiments as a definite principle, namely, the principle of the constancy of the velocity of light, so that there was nothing left for explanation in the Michelson-Morley experiment. But even more than all this, it established a single general principle which served to connect together in a simple coherent and fruitful manner the known facts of Physics.

The theory of Relativity received direct experimental confirmation in several directions. Repeated attempts were made to detect the Lorentz-Fitzgerald contraction. Any ordinary physical contraction will usually have observable physical results; for example, the total electrical resistance of a conductor will diminish. Trouton and Noble, Trouton and Rankine, Rayleigh and Brace, and others employed a variety of different methods to detect the Lorentz-Fitzgerald contraction, but invariably with the same negative results. Whether there is an ether or not, uniform velocity with respect to it can never be detected. This does not prove that there is no such thing as an ether but certainly does render the ether entirely superfluous. Universal compensation is due to a change in local units of length and time, or rather, being merely different descriptions of the same reality, there is no compensation at all.

Del 2Page 17 / 106

Del 2 continues

There was another group of observed phenomena which could scarcely be fitted into a Newtonian scheme of dynamics without doing violence to it. The experimental work of Kaufmann, in 1901, made it abundantly clear that the "mass" of an electron depended on its velocity. So early as 1881, J. J. Thomson had shown that the inertia of a charged particle increased with its velocity. Abraham now deduced a formula for the variation of mass with velocity, on the hypothesis that an electron always remained a rigid sphere. Lorentz proceeded on the assumption that the electron shared in the Lorentz-Fitzgerald contraction and obtained a totally different formula. A very careful series of measurements carried out independently by Bücherer, Wolz, Hupka and finally Neumann in 1913, decided conclusively in favour of the Lorentz formula. This "contractile" formula follows immediately as a direct consequence of the new Theory of Relativity, without any assumption as regards the electrical origin of inertia. Thus the complete agreement of experimental facts with the predictions of the new theory must be considered as confirming it as a principle which goes even beyond the electron itself. The greatest triumph of this new theory consists, indeed, in the fact that a large number of results, which had formerly required all kinds of special hypotheses for their explanation, are now deduced very simply as inevitable consequences of one single general principle.

We have now traced the history of the development of the restricted or special theory of Relativity, which is mainly concerned with optical and electrical phenomena. It was first offered by Einstein in 1905. Ten years later, Einstein formulated his second theory, the Generalised Principle of Relativity. This new theory is mainly a theory of gravitation and has very little connection with optics and electricity. In one sense, the second theory is indeed a further generalisation of the restricted principle, but the former does not really contain the latter as a special case.

Del 2Page 18 / 106

Del 2 continues

Einstein's first theory is restricted in the sense that it only refers to uniform rectilinear motion and has no application to any kind of accelerated movements. Einstein in his second theory extends the Relativity Principle to cases of accelerated motion. If Relativity is to be universally true, then even accelerated motion must be merely relative motion between matter and matter. Hence the Generalised Principle of Relativity asserts that "absolute" motion cannot be detected even with the help of gravitational laws.

All movements must be referred to definite sets of co-ordinate axes. If there is any change of axes, the numerical magnitude of the movements will also change. But according to Newtonian dynamics, such alteration in physical movements can only be due to the effect of certain forces in the field.[5] Thus any change of axes will introduce new "geometrical" forces in the field which are quite independent of the nature of the body acted on. Gravitational forces also have this same remarkable property, and gravitation itself may be of essentially the same nature as these "geometrical" forces introduced by a change of axes. This leads to Einstein's famous Principle of Equivalence. A gravitational field of force is strictly equivalent to one introduced by a transformation of co-ordinates and no possible experiment can distinguish between the two.

Thus it may become possible to "transform away" gravitational effects, at least for sufficiently small regions of space, by referring all movements to a new set of axes. This new "framework" may of course have all kinds of very complicated movements when referred to the old Galilean or "rectangular unaccelerated system of co-ordinates."

Del 2Page 19 / 106

Del 2 continues

But there is no reason why we should look on the Galilean system as more fundamental than any other. If it is found simpler to refer all motion in a gravitational field to a special set of co-ordinates, we may certainly look on this special "framework" (at least for the particular region concerned), to be more fundamental and more natural. We may, still more simply, identify this particular framework with the special local properties of space in that region. That is, we can look on the effects of a gravitational field as simply due to the local properties of space and time itself. The very presence of matter implies a modification of the characteristics of space and time in its neighbourhood. As Eddington says "matter does not cause the curvature of space-time. It is the curvature. Just as light does not cause electromagnetic oscillations; it is the oscillations."

We may look on this from a slightly different point of view. The General Principle of Relativity asserts that all motion is merely relative motion between matter and matter, and as all movements must be referred to definite sets of co-ordinates, the ground of any possible framework must ultimately be material in character. It is convenient to take the matter actually present in a field as the fundamental ground of our framework. If this is done, the special characteristics of our framework would naturally depend on the actual distribution of matter in the field. But physical space and time is completely defined by the "framework." In other words the "framework" itself is space and time. Hence we see how physical space and time is actually defined by the local distribution of matter.

There are certain magnitudes which remain constant by any change of axes. In ordinary geometry distance between two points is one such magnitude; so that δx² + δy² + δz² is an invariant. In the restricted theory of light, the principle of constancy of light velocity demands that δx² + δy² + δz² - c²δt² should remain constant.

Del 2Page 20 / 106

Del 2 continues

The separation ds of adjacent events is defined by ds² = -dx² - dy² - dz² + c²dt². It is an extension of the notion of distance and this is the new invariant. Now if x, y, z, t are transformed to any set of new variables x₁, x₂, x₃, x₄, we shall get a quadratic expression for

$ ds^2 = g{1\;1}x{1}^2 + 2g{1\;2}x{1}x{2} +... = \sum g{i\;j}x{i}x{j} $

where the g's are functions of x₁, x₂, x₃, x₄ depending on the transformation.

The special properties of space and time in any region are defined by these g's which are themselves determined by the actual distribution of matter in the locality. Thus from the Newtonian point of view, these g's represent the gravitational effect of matter while from the Relativity stand-point, these merely define the non-Newtonian (and incidentally non-Euclidean) space in the neighbourhood of matter.

We have seen that Einstein's theory requires local curvature of space-time in the neighbourhood of matter. Such altered characteristics of space and time give a satisfactory explanation of an outstanding discrepancy in the observed advance of perihelion of Mercury. The large discordance is almost completely removed by Einstein's theory.

Again, in an intense gravitational field, a beam of light will be affected by the local curvature of space, so that to an observer who is referring all phenomena to a Newtonian system, the beam of light will appear to deviate from its path along an Euclidean straight line.

This famous prediction of Einstein about the deflection of a beam of light by the sun's gravitational field was tested during the total solar eclipse of May, 1919. The observed deflection is decisively in favour of the Generalised Theory of Relativity.

Del 2Page 21 / 106

Del 2 continues

It should be noted however that the velocity of light itself would decrease in a gravitational field. This may appear at first sight to be a violation of the principle of constancy of light-velocity. But when we remember that the Special Theory is explicitly restricted to the case of unaccelerated motion, the difficulty vanishes. In the absence of a gravitational field, that is in any unaccelerated system, the velocity of light will always remain constant. Thus the validity of the Special Theory is completely preserved within its own restricted field.

Einstein has proposed a third crucial test. He has predicted a shift of spectral lines towards the red, due to an intense gravitational potential. Experimental difficulties are very considerable here, as the shift of spectral lines is a complex phenomenon. Evidence is conflicting and nothing conclusive can yet be asserted. Einstein thought that a gravitational displacement of the Fraunhofer lines is a necessary and fundamental condition for the acceptance of his theory. But Eddington has pointed out that even if this test fails, the logical conclusion would seem to be that while Einstein's law of gravitation is true for matter in bulk, it is not true for such small material systems as atomic oscillator.

Del 3Page 22 / 106

Del 3

Conclusion

From the conceptual stand-point there are several important consequences of the Generalised or Gravitational Theory of Relativity. Physical space-time is perceived to be intimately connected with the actual local distribution of matter. Euclid-Newtonian space-time is not the actual space-time of Physics, simply because the former completely neglects the actual presence of matter. Euclid-Newtonian continuum is merely an abstraction, while physical space-*time is the actual framework which has some definite curvature due to the presence of matter. Gravitational Theory of Relativity thus brings out clearly the fundamental distinction between actual physical space-time (which is non-isotropic and non-Euclid-Newtonian) on one hand and the abstract Euclid-Newtonian continuum (which is homogeneous, isotropic and a purely intellectual construction) on the other.

The measurements of the rotation of the earth reveals a fundamental framework which may be called the "inertial framework." This constitutes the actual physical universe. This universe approaches Galilean space-time at a great distance from matter.

Del 3Page 23 / 106

Del 3 continues

The properties of this physical universe may be referred to some world-distribution of matter or the "inertial framework" may be constructed by a suitable modification of the law of gravitation itself. In Einstein's theory the actual curvature of the "inertial framework" is referred to vast quantities of undetected world-matter. It has interesting consequences. The dimensions of Einsteinian universe would depend on the quantity of matter in it; it would vanish to a point in the total absence of matter. Then again curvature depends on the quantity of matter, and hence in the presence of a sufficient quantity of matter space-time may curve round and close up. Einsteinian universe will then reduce to a finite system without boundaries, like the surface of a sphere. In this "closed up" system, light rays will come to a focus after travelling round the universe and we should see an "anti-sun" (corresponding to the back surface of the sun) at a point in the sky opposite to the real sun. This anti-sun would of course be equally large and equally bright if there is no absorption of light in free space.

In de Sitter's theory, the existence of vast quantities of world-matter is not required. But beyond a definite distance from an observer, time itself stands still, so that to the observer nothing can ever "happen" there. All these theories are still highly speculative in character, but they have certainly extended the scope of theoretical physics to the central problem of the ultimate nature of the universe itself.

One outstanding peculiarity still attaches to the concept of electric force—it is not amenable to any process of being "transformed away" by a suitable change of framework. H. Weyl, it seems, has developed a geometrical theory (in hyper-space) in which no fundamental distinction is made between gravitational and electrical forces.

Del 3Page 24 / 106

Del 3 continues

Einstein's theory connects up the law of gravitation with the laws of motion, and serves to establish a very intimate relationship between matter and physical space-*time. Space, time and matter (or energy) were considered to be the three ultimate elements in Physics. The restricted theory fused space-time into one indissoluble whole. The generalised theory has further synthesised space-time and matter into one fundamental physical reality. Space, time and matter taken separately are more abstractions. Physical reality consists of a synthesis of all three.

P. C. MAHALANOBIS.

Note A.

For example consider a massive particle resting on a circular disc. If we set the disc rotating, a centrifugal force appears in the field. On the other hand, if we transform to a set of rotating axes, we must introduce a centrifugal force in order to correct for the change of axes. This newly introduced centrifugal force is usually looked on as a mathematical fiction—as "geometrical" rather than physical. The presence of such a geometrical force is usually interpreted as being due to the adoption of a fictitious framework. On the other hand a gravitational force is considered quite real. Thus a fundamental distinction is made between geometrical and gravitational forces.

In the General Theory of Relativity, this fundamental distinction is done away with. The very possibility of distinguishing between geometrical and gravitational forces is denied. All axes of reference may now be regarded as equally valid.

In the Restricted Theory, all "unaccelerated" axes of reference were recognised as equally valid, so that physical laws were made independent of uniform absolute velocity. In the General Theory, physical laws are made independent of "absolute" motion of any kind.

Footnote 1:

See Note 1.

Footnote 2:

See Note 2.

Footnote 3:

See Note 4.

Footnote 4:

See Notes 9 and 12.

Footnote 5:

Note A.

On The Electrodynamics of Moving Bodies
By
A. Einstein.

INTRODUCTION.

Del 3Page 25 / 106

Del 3 continues

It is well known that if we attempt to apply Maxwell's electrodynamics, as conceived at the present time, to moving bodies, we are led to asymmetry which does not agree with observed phenomena. Let us think of the mutual action between a magnet and a conductor. The observed phenomena in this case depend only on the relative motion of the conductor and the magnet, while according to the usual conception, a distinction must be made between the cases where the one or the other of the bodies is in motion. If, for example, the magnet moves and the conductor is at rest, then an electric field of certain energy-value is produced in the neighbourhood of the magnet, which excites a current in those parts of the field where a conductor exists. But if the magnet be at rest and the conductor be set in motion, no electric field is produced in the neighbourhood of the magnet, but an electromotive force which corresponds to no energy in itself is produced in the conductor; this causes an electric current of the same magnitude and the same career as the electric force, it being of course assumed that the relative motion in both of these cases is the same.

Del 3Page 26 / 106

Del 3 continues

2. Examples of a similar kind such as the unsuccessful attempt to substantiate the motion of the earth relative to the "Light-medium" lead us to the supposition that not only in mechanics, but also in electrodynamics, no properties of observed facts correspond to a concept of absolute rest; but that for all coordinate systems for which the mechanical equations hold, the equivalent electrodynamical and optical equations hold also, as has already been shown for magnitudes of the first order. In the following we make these assumptions (which we shall subsequently call the Principle of Relativity) and introduce the further assumption,—an assumption which is at the first sight quite irreconcilable with the former one—that light is propagated in vacant space, with a velocity c which is independent of the nature of motion of the emitting body. These two assumptions are quite sufficient to give us a simple and consistent theory of electrodynamics of moving bodies on the basis of the Maxwellian theory for bodies at rest. The introduction of a "Lightäther" will be proved to be superfluous, for according to the conceptions which will be developed, we shall introduce neither a space absolutely at rest, and endowed with special properties, nor shall we associate a velocity-vector with a point in which electro-magnetic processes take place.

3. Like every other theory in electrodynamics, the theory is based on the kinematics of rigid bodies; in the enunciation of every theory, we have to do with relations between rigid bodies (co-ordinate system), clocks, and electromagnetic processes. An insufficient consideration of these circumstances is the cause of difficulties with which the electrodynamics of moving bodies have to fight at present.

I.—KINEMATICAL PORTION.

§ 1. Definition of Synchronism.

Del 4Page 27 / 106

Del 4

Let us have a co-ordinate system, in which the Newtonian equations hold. For distinguishing this system from another which will be introduced hereafter, we shall always call it "the stationary system."

If a material point be at rest in this system, then its position in this system can be found out by a measuring rod, and can be expressed by the methods of Euclidean Geometry, or in Cartesian co-ordinates.

If we wish to describe the motion of a material point, the values of its coordinates must be expressed as functions of time. It is always to be borne in mind that such a mathematical definition has a physical sense, only when we have a clear notion of what is meant by time. We have to take into consideration the fact that those of our conceptions, in which time plays a part, are always conceptions of synchronism. For example, we say that a train arrives here at 7 o'clock; this means that the exact pointing of the little hand of my watch to 7, and the arrival of the train are synchronous events.

It may appear that all difficulties connected with the definition of time can be removed when in place of time, we substitute the position of the little hand of my watch. Such a definition is in fact sufficient, when it is required to define time exclusively for the place at which the clock is stationed. But the definition is not sufficient when it is required to connect by time events taking place at different stations,—or what amounts to the same thing,—to estimate by means of time (zeitlich werten) the occurrence of events, which take place at stations distant from the clock.

Now with regard to this attempt;—the time-estimation of events, we can satisfy ourselves in the following manner. Suppose an observer—who is stationed at the origin of coordinates with the clock—associates a ray of light which comes to him through space, and gives testimony to the event of which the time is to be estimated,—with the corresponding position of the hands of the clock. But such an association has this defect,—it depends on the position of the observer provided with the clock, as we know by experience. We can attain to a more practicable result by the following treatment.

Del 4Page 28 / 106

Del 4 continues

If an observer be stationed at A with a clock, he can estimate the time of events occurring in the immediate neighbourhood of A, by looking for the position of the hands of the clock, which are synchronous with the event. If an observer be stationed at B with a clock,—we should add that the clock is of the same nature as the one at A,—he can estimate the time of events occurring about B. But without further premises, it is not possible to compare, as far as time is concerned, the events at B with the events at A. We have hitherto an A-time, and a B-time, but no time common to A and B. This last time (i.e., common time) can be defined, if we establish by definition that the time which light requires in travelling from A to B is equivalent to the time which light requires in travelling from B to A. For example, a ray of light proceeds from A at A-time t{A} towards B, arrives and is reflected from B at B-time t{B}, and returns to A at A-time t′_{A}. According to the definition, both clocks are synchronous, if

t{B} - t{A} = t′{A} - t{B}.

We assume that this definition of synchronism is possible without involving any inconsistency, for any number of points, therefore the following relations hold:—

1. If the clock at B be synchronous with the clock at A, then the clock at A is synchronous with the clock at B.

2. If the clock at A as well as the clock at B are both synchronous with the clock at C, then the clocks at A and B are synchronous.

Thus with the help of certain physical experiences, we have established what we understand when we speak of clocks at rest at different stations, and synchronous with one another; and thereby we have arrived at a definition of synchronism and time.

In accordance with experience we shall assume that the magnitude

$ \frac {2 \overline{AB}}{t'{A} - t{A}} = c $

where c is a universal constant.

We have defined time essentially with a clock at rest in a stationary system. On account of its adaptability to the stationary system, we call the time defined in this way as "time of the stationary system."

§ 2. On the Relativity of Length and Time.

Del 4Page 29 / 106

Del 4 continues

The following reflections are based on the Principle of Relativity and on the Principle of Constancy of the velocity of light, both of which we define in the following way:—

1. The laws according to which the nature of physical systems alter are independent of the manner in which these changes are referred to two co-ordinate systems which have a uniform translators motion relative to each other.

2. Every ray of light moves in the "stationary co-ordinate system" with the same velocity c, the velocity being independent of the condition whether this ray of light is emitted by a body at rest or in motion.[6] Therefore

velocity = Path of Light/Interval of time,

where, by 'interval of time' we mean time as defined in §1.

Let us have a rigid rod at rest; this has a length l, when measured by a measuring rod at rest; we suppose that the axis of the rod is laid along the X-axis of the system at rest, and then a uniform velocity v, parallel to the axis of X, is imparted to it. Let us now enquire about the length of the moving rod; this can be obtained by either of these operations.—

(a) The observer provided with the measuring rod moves along with the rod to be measured, and measures by direct superposition the length of the rod:—just as if the observer, the measuring rod, and the rod to be measured were at rest.

(b) The observer finds out, by means of clocks placed in a system at rest (the clocks being synchronous as defined in §1), the points of this system where the ends of the rod to be measured occur at a particular time t. The distance between these two points, measured by the previously used measuring rod, this time it being at rest, is a length, which we may call the "length of the rod."

According to the Principle of Relativity, the length found out by the operation a), which we may call "the length of the rod in the moving system" is equal to the length l of the rod in the stationary system.

The length which is found out by the second method, may be called 'the length of the moving rod measured from the stationary system.' This length is to be estimated on the basis of our principle, and we shall find it to be different from l.

Del 4Page 30 / 106

Del 4 continues

In the generally recognised kinematics, we silently assume that the lengths defined by these two operations are equal, or in other words, that at an epoch of time t, a moving rigid body is geometrically replaceable by the same body, which can replace it in the condition of rest.

Relativity of Time.

Let us suppose that the two clocks synchronous with the clocks in the system at rest are brought to the ends A, and B of a rod, i.e., the time of the clocks correspond to the time of the stationary system at the points where they happen to arrive; these clocks are therefore synchronous in the stationary system.

We further imagine that there are two observers at the two watches, and moving with them, and that these observers apply the criterion for synchronism to the two clocks. At the time t_{A}, a ray of light goes out from A, is reflected from B at the time t_{B}, and arrives back at A at time t′_{A}. Taking into consideration the principle of, constancy of the velocity of light, we have

t_{B} - t_{A} = r_{AB}/(c - v),

and t′_{A} - t_{B} = r_{AB}/(c + v),

where r_{AB} is the length of the moving rod, measured in the stationary system. Therefore the observers stationed with the watches will not find the clocks synchronous, though the observer in the stationary system must declare the clocks to be synchronous. We therefore see that we can attach no absolute significance to the concept of synchronism; but two events which are synchronous when viewed from one system, will not be synchronous when viewed from a system moving relatively to this system.

§ 3. Theory of Co-ordinate and Time-Transformation from a stationary system to a system which moves relatively to this with uniform velocity.

Del 5Page 31 / 106

Del 5

Let there be given, in the stationary system two co-ordinate systems, i.e., two series of three mutually perpendicular lines issuing from a point. Let the X-axes of each coincide with one another, and the Y and Z-axes be parallel. Let a rigid measuring rod, and a number of clocks be given to each of the systems, and let the rods and clocks in each be exactly alike each other.

Let the initial point of one of the systems (k) have a constant velocity in the direction of the X-axis of the other which is stationary system K, the motion being also communicated to the rods and clocks in the system (k). Any time t of the stationary system K corresponds to a definite position of the axes of the moving system, which are always parallel to the axes of the stationary system. By t, we always mean the time in the stationary system.

We suppose that the space is measured by the stationary measuring rod placed in the stationary system, as well as by the moving measuring rod placed in the moving system, and we thus obtain the co-ordinates (x, y, z) for the stationary system, and (ξ, η, ζ) for the moving system. Let the time t be determined for each point of the stationary system (which are provided with clocks) by means of the clocks which are placed in the stationary system, with the help of light-signals as described in § 1. Let also the time τ of the moving system be determined for each point of the moving system (in which there are clocks which are at rest relative to the moving system), by means of the method of light signals between these points (in which there are clocks) in the manner described in § 1.

To every value of (x, y, z, t) which fully determines the position and time of an event in the stationary system, there correspond a system of values (ξ, η, ζ, τ); now the problem is to find out the system of equations connecting these magnitudes.

Primarily it is clear that on account of the property of homogeneity which we ascribe to time and space, the equations must be linear.

Del 5Page 32 / 106

Del 5 continues

If we put x′ = x - vt, then it is clear that at a point relatively at rest in the system k, we have a system of values (x′ y z) which are independent of time. Now let us find out τ as a function of (x′, y, z, t). For this purpose we have to express in equations the fact that τ is not other than the time given by the clocks which are at rest in the system k which must be made synchronous in the manner described in § 1.

Let a ray of light be sent at time τ₀ from the origin of the system k along the X-axis towards x′ and let it be reflected from that place at time τ₁ towards the origin of moving co-ordinates and let it arrive there at time τ₂; then we must have

½ (τ₀ + τ₂) = τ₁

If we now introduce the condition that τ is a function of co-ordinates, and apply the principle of constancy of the velocity of light in the stationary system, we have

$ \frac {1}{2} (\tau (0,0,0,t) + \tau (0,0,0,(t + \frac {x'}{c-v} + \frac {x'}{c+v}))) $

$ = \tau (x',0,0, t + \frac {x'}{c-v}) $

It is to be noticed that instead of the origin of co-ordinates, we could select some other point as the exit point for rays of light, and therefore the above equation holds for all values of (x′, y, z, t,).

A similar conception, being applied to the y- and z-axis gives us, when we take into consideration the fact that light when viewed from the stationary system, is always propagated along those axes with the velocity √(c² - v²), we have the questions

∂τ ∂τ
---- = 0, ---- = 0.
∂y ∂z

From these equations it follows that τ is a linear function of x′ and t. From equations (1) we obtain

vx′
τ = a (t - -------- )
c² - v²

where a is an unknown function of v.

With the help of these results it is easy to obtain the magnitudes (ξ, η, ζ) if we express by means of equations the fact that light, when measured in the moving system is always propagated with the constant velocity c (as the principle of constancy of light velocity in conjunction with the principle of relativity requires). For a time τ = 0, if the ray is sent in the direction of increasing ξ, we have

vx′
ξ = cτ, i.e. ξ = a c(t - ------------ )
c² - v²

Del 5Page 33 / 106

Del 5 continues

Now the ray of light moves relative to the origin of k with a velocity c - v, measured in the stationary system; therefore we have

x′
---------- = t
c - v

Substituting these values of t in the equation for ξ, we obtain

c²
ξ = a ------------- x′
c² - v²

In an analogous manner, we obtain by considering the ray of light which moves along the y-axis,

vx′
η = cτ = a c(t - ------------- )
c² - v²

where

y
------------------ = t, x′ = 0,
√ (c² - v²)

Therefore

c
η = a ------------------ y,
√ (c² - v²)

c
ζ = a ----------------- z.
√ (c² - v²)

If for x′, we substitute its value x - tv, we obtain

v.c
τ = φ (v). β (t - -----------,
c²

ξ = φ (v). β (x - vt),

η = φ (v) y

ζ = φ (v) z,

where

$ \beta = \frac {1}{\sqrt {1 - \frac {v^2}{c^2}}} $

and

φ (v) = ac / √ (c² - v²) = a / β

is a function of v.

If we make no assumption about the initial position of the moving system and about the null-point of t, then an additive constant is to be added to the right hand side.

We have now to show, that every ray of light moves in the moving system with a velocity c (when measured in the moving system), in case, as we have actually assumed, c is also the velocity in the stationary system; for we have not as yet adduced any proof in support of the assumption that the principle of relativity is reconcilable with the principle of constant light-velocity.

At a time τ = t = 0 let a spherical wave be sent out from the common origin of the two systems of co-ordinates, and let it spread with a velocity c in the system K. If (x, y, z), be a point reached by the wave, we have

x² + y² + z² = c²_t²_

with the aid of our transformation-equations, let us transform this equation, and we obtain by a simple calculation,

ξ² + η² + ζ² = c²τ².

Therefore the wave is propagated in the moving system with the same velocity c, and as a spherical wave.[7] Therefore we show that the two principles are mutually reconcilable.

In the transformations we have got an undetermined function φ(v), and we now proceed to find it out.

Del 5Page 34 / 106

Del 5 continues

Let us introduce for this purpose a third co-ordinate system k′, which is set in motion relative to the system k, the motion being parallel to the ξ-axis. Let the velocity of the origin be (-v). At the time t = 0, all the initial co-ordinate points coincide, and for t = x = y = z = 0, the time t′ of the system k′ = 0. We shall say that (x′ y′ z′ t′) are the co-ordinates measured in the system k′, then by a two-fold application of the transformation-equations, we obtain

v
τ′ = φ(-v)β(-v){τ + ----- ξ}
c²
= φ(v)φ(-v)t,

x′ = φ](v)β(v)(ξ + vτ) = φ(v)φ(-v)x, etc.

Since the relations between (x′, y′, z′, t′), and (x, y, z, t) do not contain time explicitly, therefore K and k′ are relatively at rest.

It appears that the systems K and k′ are identical.

∴ φ(v)φ(-v) = 1.

Let us now turn our attention to the part of the ξ-axis between (ξ = 0, η = 0, ζ = 0), and (ξ = 0, η = 1, ζ = 0). Let this piece of the y-axis be covered with a rod moving with the velocity v relative to the system K and perpendicular to its axis;—the ends of the rod having therefore the co-ordinates

x₁ = vt, y₁ = l / φ(v), z₁ = 0

x₂ = vt, y₂ = 0, z₂ = 0

Therefore the length of the rod measured in the system K is l/φ(v). For the system moving with velocity (-v), we have on grounds of symmetry,

l l
-------- = ---------
φ(v) φ(-v)

∴ φ(v) = φ(-v), ∴ φ(v) = 1.

§ 4. The physical significance of the equations obtained concerning moving rigid bodies and moving clocks.

Let us consider a rigid sphere (i.e., one having a spherical figure when tested in the stationary system) of radius R which is at rest relative to the system (K), and whose centre coincides with the origin of K then the equation of the surface of this sphere, which is moving with a velocity v relative to K, is

ξ² + η² + ζ² = R².

At time t = 0, the equation is expressed by means of (x, y, z, t,) as

$ \frac {x^2}{(\sqrt {1 - \frac {v2}{c2}})^2} + y^2 + z^2 = R^2. $

Del 5Page 35 / 106

Del 5 continues

A rigid body which has the figure of a sphere when measured in the moving system, has therefore in the moving condition—when considered from the stationary system, the figure of a rotational ellipsoid with semi-axes

$ R \sqrt {1 - \frac {v^2}{c^2}}, R, R. $

Therefore the y and z dimensions of the sphere (therefore of any figure also) do not appear to be modified by the motion, but the x dimension is shortened in the ratio

$ 1: \sqrt {1 - \frac {v^2}{c^2}}; $

the shortening is the larger, the larger is v. For v = c, all moving bodies, when considered from a stationary system shrink into planes. For a velocity larger than the velocity of light, our propositions become meaningless; in our theory c plays the part of infinite velocity.

It is clear that similar results hold about stationary bodies in a stationary system when considered from a uniformly moving system.

Let us now consider that a clock which is lying at rest in the stationary system gives the time t, and lying at rest relative to the moving system is capable of giving the time τ; suppose it to be placed at the origin of the moving system k, and to be so arranged that it gives the time τ. How much does the clock gain, when viewed from the stationary system K? We have,

$ \tau = \frac {1}{\sqrt {1-\frac {v^2}{c^2}}} (t - \frac {v}{c^2}x), $

and x = vt,

$ \therefore \tau - t = (1 - \sqrt {1 - \frac {v^2}{c^2}}) t. $

Therefore the clock loses by an amount ½(v²/c²) per second of motion, to the second order of approximation.

From this, the following peculiar consequence follows. Suppose at two points A and B of the stationary system two clocks are given which are synchronous in the sense explained in § 3 when viewed from the stationary system. Suppose the clock at A to be set in motion in the line joining it with B, then after the arrival of the clock at B, they will no longer be found synchronous, but the clock which was set in motion from A will lag behind the clock which had been all along at B by an amount ½t(v²/c²), where t is the time required for the journey.

Del 5Page 36 / 106

Del 5 continues

We see forthwith that the result holds also when the clock moves from A to B by a polygonal line, and also when A and B coincide.

If we assume that the result obtained for a polygonal line holds also for a curved line, we obtain the following law. If at A, there be two synchronous clocks, and if we set in motion one of them with a constant velocity along a closed curve till it comes back to A, the journey being completed in t-seconds, then after arrival, the last mentioned clock will be behind the stationary one by ½t(v²/c²) seconds. From this, we conclude that a clock placed at the equator must be slower by a very small amount than a similarly constructed clock which is placed at the pole, all other conditions being identical.

§ 5. Addition-Theorem of Velocities.

Del 6Page 37 / 106

Del 6

Let a point move in the system k (which moves with velocity v along the x-axis of the system K) according to the equation

$ \xi = w{\xi} \tau, \eta = w{\eta} \tau, \zeta = 0, $

where w_{ξ} and w_{η} are constants.

It is required to find out the motion of the point relative to the system K. If we now introduce the system of equations in § 3 in the equation of motion of the point, we obtain

$ x = (\frac {w{\xi} + v}{1+\frac {vw{\xi}}{c^2}}) t $,

$ y = \frac {(1-\frac {v^2}{c^2})^{\frac {1}{2}} w{\eta}t} {1+\frac {vw{\xi}}{c^2}} $,

z = 0.

The law of parallelogram of velocities hold up to the first order of approximation. We can put

$ U^2 = (\frac {\partial x}{\partial t})^2 + (\frac {\partial y}{\partial t})^2 $,

$ w^2 = w{\xi}^2 + w{\eta}^2 $,

and

$ \alpha = tan^{-1} \frac {w}{w_{\xi}} $

i.e., α is put equal to the angle between the velocities v, and w. Then we have—

$ U = \frac {[(v^2 + w^2 + 2 vw \cos \alpha) - (\frac {vw \sin \alpha}{c})^2]^{\frac {1}{2}}} {1 + \frac {vw \cos \alpha}{c^2}} $

It should be noticed that v and w enter into the expression for velocity symmetrically. If w has the direction of the ξ-axis of the moving system,

$ U = \frac {v + w}{1 + \frac {vw}{c^2}} $

From this equation, we see that by combining two velocities, each of which is smaller than c, we obtain a velocity which is always smaller than c. If we put v = c - χ, and w = c - λ, where χ and λ are each smaller than c,[8]

$ U = c \frac {2c - \chi - \lambda}{2c - \chi - \lambda + \frac {\chi \lambda}{c^2}} < c $

It is also clear that the velocity of light c cannot be altered by adding to it a velocity smaller than c. For this case,

$ U = \frac {c + v}{1 + \frac {cv}{c^2}} = c $

Del 6Page 38 / 106

Del 6 continues

We have obtained the formula for U for the case when v and w have the same direction; it can also be obtained by combining two transformations according to section § 3. If in addition to the systems K, and k, we introduce the system k´, of which the initial point moves parallel to the ξ-axis with velocity w, then between the magnitudes, x, y, z, t and the corresponding magnitudes of k´, we obtain a system of equations, which differ from the equations in § 3, only in the respect that in place of v, we shall have to write,

$ \frac {v + w}{1 + \frac {vw}{c^2}} $

We see that such a parallel transformation forms a group.

We have deduced the kinematics corresponding to our two fundamental principles for the laws necessary for us, and we shall now pass over to their application in electrodynamics.

II.—ELECTRODYNAMICAL PART.

§ 6. Transformation of Maxwell's equations for Pure Vacuum.

On the nature of the Electromotive Force caused by motion in a magnetic field.

The Maxwell-Hertz equations for pure vacuum may hold for the stationary system K, so that

$ \frac {1}{c} \frac {\partial}{\partial t} [X, Y, Z] = \begin{vmatrix} \frac {\partial}{\partial x} & \frac {\partial}{\partial y} & \frac {\partial}{\partial z} L & M & N \end{vmatrix} $

and

$ \frac {1}{c} \frac {\partial}{\partial t} [L, M, N] = \begin{vmatrix} \frac {\partial}{\partial x} & \frac {\partial}{\partial y} & \frac {\partial}{\partial z} X & Y & Z \end{vmatrix} $ (1)

where [X, Y, Z] are the components of the electric force, L, M, N are the components of the magnetic force.

If we apply the transformations in §3 to these equations, and if we refer the electromagnetic processes to the co-ordinate system moving with velocity v, we obtain,

$ \frac {1}{c} \frac {\partial}{\partial \tau} [X, \beta (Y - \frac {v}{c} N), \beta (Z + \frac {v}{c} M)] = \begin{vmatrix} \frac {\partial}{\partial \xi} & \frac {\partial}{\partial \eta} & \frac {\partial}{\partial \zeta} L & \beta(M + \frac {v}{c} Z) & \beta(N - \frac {v}{c} Y) \end{vmatrix}

and

Del 6Page 39 / 106

Del 6 continues

$ \frac {1}{c} \frac {\partial}{\partial \tau} [L, \beta(M + \frac {v}{c} Z), \beta(N - \frac {v}{c} Y)] = - \begin{vmatrix} \frac {\partial}{\partial \xi} & \frac {\partial}{\partial \eta} & \frac {\partial}{\partial \zeta} X & \beta(Y - \frac {v}{c} N) & \beta(Z + \frac {v}{c} M) \end{vmatrix} $... (2)

where

$ \beta = \frac {1}{\sqrt {1 - \frac {v^2}{c^2}}} $

The principle of Relativity requires that the Maxwell-Hertzian equations for pure vacuum shall hold also for the system k, if they hold for the system K, i.e., for the vectors of the electric and magnetic forces acting on electric and magnetic masses in the moving system k, which are defined by their pondermotive reaction, the same equations hold,... i.e....

$ \frac {1}{c} \frac {\partial}{\partial \tau} (X', Y', Z') = \begin{vmatrix} \frac {\partial}{\partial \xi} & \frac {\partial}{\partial \eta} & \frac {\partial}{\partial \zeta} L' & M' & N' \end{vmatrix} $,

$ \frac {1}{c} \frac {\partial}{\partial \tau} (L', M', N') = - \begin{vmatrix} \frac {\partial}{\partial \xi} & \frac {\partial}{\partial \eta} & \frac {\partial}{\partial \zeta} X' & Y' & Z' \end{vmatrix} $... (3)

Clearly both the systems of equations (2) and (3) developed for the system k shall express the same things, for both of these systems are equivalent to the Maxwell-Hertzian equations for the system K. Since both the systems of equations (2) and (3) agree up to the symbols representing the vectors, it follows that the functions occurring at corresponding places will agree up to a certain factor ψ(v), which depends only on v, and is independent of (ξ, η, ζ, τ). Hence the relations,

v v [X′, Y′, Z′] = ψ (v) [X, β(Y - ----- N), β(Z + ------ M)], c c

v v [L′, M′, N′] = ψ (v) [L, β(M - ----- Z), β(N + ----- Y)], c c

Then by reasoning similar to that followed in §(3), it can be shown that ψ(v) = 1.

v v [X′, Y′, Z′] = [X, β(Y - ----- N), β(Z + ------ M)] c c

v v [L′, M′, N′] = [L, β(M - ------ Z), β(N + ----- Y)], c c

Del 6Page 40 / 106

Del 6 continues

For the interpretation of these equations, we make the following remarks. Let us have a point-mass of electricity which is of magnitude unity in the stationary system K, i.e., it exerts a unit force on a similar quantity placed at a distance of 1 cm. If this quantity of electricity be at rest in the stationary system, then the force acting on it is equivalent to the vector (X, Y, Z) of electric force. But if the quantity of electricity be at rest relative to the moving system (at least for the moment considered), then the force acting on it, and measured in the moving system is equivalent to the vector (X′, Y′, Z′). The first three of equations (1), (2), (3), can be expressed in the following way:—

1. If a point-mass of electric unit pole moves in an electro-magnetic field, then besides the electric force, an electromotive force acts on it, which, neglecting the numbers involving the second and higher powers of v/c, is equivalent to the vector-product of the velocity vector, and the magnetic force divided by the velocity of light (Old mode of expression).

2. If a point-mass of electric unit pole moves in an electro-magnetic field, then the force acting on it is equivalent to the electric force existing at the position of the unit pole, which we obtain by the transformation of the field to a co-ordinate system which is at rest relative to the electric unit pole [New mode of expression].

Similar theorems hold with reference to the magnetic force. We see that in the theory developed the electro-magnetic force plays the part of an auxiliary concept, which owes its introduction in theory to the circumstance that the electric and magnetic forces possess no existence independent of the nature of motion of the co-ordinate system.

It is further clear that the asymmetry mentioned in the introduction which occurs when we treat of the current excited by the relative motion of a magnet and a conductor disappears. Also the question about the seat of electromagnetic energy is seen to be without any meaning.

§ 7. Theory of Döppler's Principle and Aberration.

Del 6Page 41 / 106

Del 6 continues

In the system K, at a great distance from the origin of co-ordinates, let there be a source of electrodynamic waves, which is represented with sufficient approximation in a part of space not containing the origin, by the equations:—

X = X₀ sin Φ
Y = Y₀ sin Φ
Z = Z₀ sin Φ
L = L₀ sin Φ
M = M₀ sin Φ
N = N₀ sin Φ
lx + my + nz
Φ = ω(t - ------------ )
c

Here (X₀, Y₀, Z₀) and (L₀, M₀, N₀) are the vectors which determine the amplitudes of the train of waves, (l, m, n) are the direction-cosines of the wave-normal.

Let us now ask ourselves about the composition of these waves, when they are investigated by an observer at rest in a moving medium k:—By applying the equations of transformation obtained in §6 for the electric and magnetic forces, and the equations of transformation obtained in § 3 for the co-ordinates, and time, we obtain immediately:—

X′ = X₀ sin Φ′

v
Y′ = β(Y₀ - --- N₀) sin Φ′
c

v
Z′ = β(Z₀ - --- M₀) sin Φ′
c

L′ = L₀ sin Φ′

v
M′ = β(M₀ - --- Z₀) sin Φ′
c

v
N′ = β(N₀ - --- Y₀) sin Φ′
c

l′ξ + m′η + n′ζ
Φ′ = ω′(t - --------------- )
c

where

$ \omega' = \omega \beta (1 - \frac {lv}{c}) $,

$ l' = \frac {l - \frac {v}{c}}{1 - \frac {lv}{c}} $,

$ m' = \frac {m}{\beta (1 - \frac {lv}{c})} $,

$ n' = \frac {n}{\beta (1 - \frac {lv}{c})} $

From the equation for ω′ it follows:—If an observer moves with the velocity v relative to an infinitely distant source of light emitting waves of frequency ν, in such a manner that the line joining the source of light and the observer makes an angle of Φ with the velocity of the observer referred to a system of co-ordinates which is stationary with regard to the source, then the frequency ν′ which is perceived by the observer is represented by the formula

$ \nu' = \nu \frac {1 - cos \Phi \frac {v}{c}} {\sqrt {1 - \frac {v^2}{c^2}}} $

This is Döppler's principle for any velocity. If Φ = 0, then the equation takes the simple form

$ \nu' = \nu (\frac {1 - \frac {v}{c}}{1 + \frac {v}{c}})^{\frac {1}{2}} $

We see that—contrary to the usual conception—ν = ∞, for v = -c.

If Φ′ = angle between the wave-normal (direction of the ray) in the moving system, and the line of motion of the observer, the equation for l´ takes the form

Del 6Page 42 / 106

Del 6 continues

$ \cos \Phi' = \frac {\cos \Phi - \frac {v}{c}} {1 - \frac {v}{c} \cos \Phi} $

This equation expresses the law of observation in its most general form. If Φ = π/2, the equation takes the simple form

v
cos Φ′ = ---
c

We have still to investigate the amplitude of the waves, which occur in these equations. If A and A′ be the amplitudes in the stationary and the moving systems (either electrical or magnetic), we have

$ A'^2 = A^2 \frac {(1 - \frac {v}{c} \cos \Phi)^2} {1 - \frac {v^2}{c^2}} $

If Φ = 0, this reduces to the simple form

$ A'^2 = A^2 \frac {1 - \frac {v}{c}} {1 + \frac {v}{c}} $

From these equations, it appears that for an observer, which moves with the velocity c towards the source of light, the source should appear infinitely intense.

§ 8. Transformation of the Energy of the Rays of Light. Theory of the Radiation-pressure on a perfect mirror.

Del 7Page 43 / 106

Del 7

Since A²/8π is equal to the energy of light per unit volume, we have to regard A²/8π as the energy of light in the moving system. A′²/A² would therefore denote the ratio between the energies of a definite light-complex "measured when moving" and "measured when stationary," the volumes of the light-complex measured in K and k being equal. Yet this is not the case. If l, m, n are the direction-cosines of the wave-normal of light in the stationary system, then no energy passes through the surface elements of the spherical surface

(x - clt)² + (y - cmt)² + (z - cnt)² = R²,

which expands with the velocity of light. We can therefore say, that this surface always encloses the same light-complex. Let us now consider the quantity of energy, which this surface encloses, when regarded from the system k, i.e., the energy of the light-complex relative to the system k.

Regarded from the moving system, the spherical surface becomes an ellipsoidal surface, having, at the time τ = 0, the equation:—

$ (\beta \xi - l \beta \frac {v}{c} \xi)^2 + (\eta - m \beta \frac {v}{c} \xi)^2 + (\zeta - n \beta \frac {v}{c} \xi)^2 = R^2 $

If S = volume of the sphere, S′ = volume of this ellipsoid, then a simple calculation shows that:

$ \frac {S'}{S} = \frac {\beta}{\sqrt{1 - \frac {v}{c} \cos \Phi}} $

If E denotes the quantity of light energy measured in the stationary system, E′ the quantity measured in the moving system, which are enclosed by the surfaces mentioned above, then

$ \frac {E'}{E} = \frac {\frac {A'^2}{8\pi} S'}{\frac {A^2}{8\pi}S} = \frac {1 - \frac {v}{c} \cos \Phi}{\sqrt{1 - \frac {v^2}{c^2}}} $

If Φ = 0, we have the simple formula:—

$ \frac {E'}{E} = (\frac{1 - \frac{v}{c}}{1 + \frac{v}{c}})^{\frac{1}{2}} $

It is to be noticed that the energy and the frequency of a light-complex vary according to the same law with the state of motion of the observer.

Let there be a perfectly reflecting mirror at the co-ordinate-plane ξ = 0, from which the plane-wave considered in the last paragraph is reflected. Let us now ask ourselves about the light-pressure exerted on the reflecting surface and the direction, frequency, intensity of the light after reflexion.

Del 7Page 44 / 106

Del 7 continues

Let the incident light be defined by the magnitudes A cos Φ, v (referred to the system K). Regarded from k, we have the corresponding magnitudes:

$ A' = A \frac{1 - \frac{v}{c} \cos \Phi}{\sqrt{1 - \frac{v^2}{c^2}}} $

$ \cos \Phi' = \frac{\cos \Phi - \frac{v}{c}}{1 - \frac{v}{c} \cos \Phi} $

$ \nu' = \nu \frac{1 - \frac{v}{c} \cos \Phi}{\sqrt{1 - \frac{v^2}{c^2}}} $

For the reflected light we obtain, when the process is referred to the system k:—

A″ = A′, cos Φ″ = -cos Φ″, ν″ = ν′

By means of a back-transformation to the stationary system, we obtain K, for the reflected light:—

$ A''' = A'' \frac{1 + \frac{v}{c}\cos \Phi''}{\sqrt{1 - \frac{v^2}{c^2}}} = A \frac{1 - 2\frac{v}{c} \cos \Phi + \frac{v^2}{c^2}}{1 - \frac{v^2}{c^2}} $,

$ \cos \Phi''' = \frac{\cos \Phi'' + \frac{v}{c}}{1 + \frac{v}{c}\cos \Phi''} = - \frac{(1 + \frac{v^2}{c^2}) \cos \Phi - 2 \frac{v}{c}} {1 - 2 \frac{v}{c} \cos \Phi + \frac {v^2}{c^2}} $,

$ \nu''' = \nu'' \frac{1 + \frac{v}{c} \cos \Phi''}{\sqrt{1 - \frac{v^2}{c^2}}} = \nu \frac{1 - 2 \frac{v}{c} \cos \Phi + \frac{v^2}{c^2}} {(1 - \frac{v}{c})^2} $

The amount or energy falling on the unit surface of the mirror per unit of time (measured in the stationary system) is A²/(8π (c cos Φ - v)). The amount of energy going away from unit surface of the mirror per unit of time is A‴²/(8π (-c cos Φ″ + v)). The difference of these two expressions is, according to the Energy principle, the amount of work exerted, by the pressure of light per unit of time. If we put this equal to P.v, where P = pressure of light, we have

$ P = 2 \frac{A^2}{8\pi} \frac{(\cos \Phi - \frac{v}{c})^2} {1 - (\frac{v}{c})^2} $

As a first approximation, we obtain

A²
P = 2 -- cos² Φ
8π

which is in accordance with facts, and with other theories.

All problems of optics of moving bodies can be solved after the method used here. The essential point is, that the electric and magnetic forces of light, which are influenced by a moving body, should be transformed to a system of co-ordinates which is stationary relative to the body. In this way, every problem of the optics of moving bodies would be reduced to a series of problems of the optics of stationary bodies.

Del 7Page 45 / 106

Del 7 continues

§ 9. Transformation of the Maxwell-Hertz Equations.

Let us start from the equations:—

$ \frac{1}{c}(\rho u_{x} + \frac{\partial X}{\partial t}) = \frac{\partial N}{\partial y} - \frac{\partial M}{\partial z} $

$ \frac{1}{c}(\rho u_{y} + \frac{\partial Y}{\partial t}) = \frac{\partial L}{\partial z} - \frac{\partial N}{\partial x} $

$ \frac{1}{c}(\rho u_{z} + \frac{\partial Z}{\partial t}) = \frac{\partial M}{\partial x} - \frac{\partial L}{\partial y} $

$ \frac{1}{c} \frac{\partial L}{\partial t} = \frac{\partial Y}{\partial z} - \frac{\partial Z}{\partial y} $

$ \frac{1}{c} \frac{\partial M}{\partial t} = \frac{\partial Z}{\partial x} - \frac{\partial X}{\partial z} $

$ \frac{1}{c} \frac{\partial N}{\partial t} = \frac{\partial X}{\partial y} - \frac{\partial Y}{\partial x} $

where

$ \rho = \frac{\partial X}{\partial x} + \frac{\partial Y}{\partial y} + \frac{\partial Z}{\partial z} $

denotes 4π times the density of electricity, and (u_{x}, u_{y}, u_{z}) are the velocity-components of electricity. If we now suppose that the electrical-masses are bound unchangeably to small, rigid bodies (Ions, electrons), then these equations form the electromagnetic basis of Lorentz's electrodynamics and optics for moving bodies.

If these equations which hold in the system K, are transformed to the system k with the aid of the transformation-equations given in § 3 and § 6, then we obtain the equations:—

$ \frac{1}{c} (\rho' u_{\xi} + \frac{\partial X'}{\partial \tau}) = \frac{\partial N'}{\partial \eta} - \frac{\partial M'}{\partial \zeta} $,

$ \frac{\partial L'}{\partial \tau} = \frac{\partial Y'}{\partial \zeta} - \frac{\partial Z'}{\partial \eta} $,

$ \frac{1}{c} (\rho' u_{\eta} + \frac{\partial Y'}{\partial \tau}) = \frac{\partial L'}{\partial \zeta} - \frac{\partial N'}{\partial \xi} $,

$ \frac{\partial M'}{\partial \tau} = \frac{\partial Z'}{\partial \xi} - \frac{\partial X'}{\partial \zeta} $,

$ \frac{1}{c} (\rho' u_{\zeta} + \frac{\partial Z'}{\partial \tau}) = \frac{\partial M'}{\partial \xi} - \frac{\partial L'}{\partial \eta} $,

$ \frac{\partial N'}{\partial \tau} = \frac{\partial X'}{\partial \eta} - \frac{\partial Y'}{\partial \xi} $,

where

Del 7Page 46 / 106

Del 7 continues

$ \frac{u{x} - v}{1 - \frac{u{x}v}{c}} = u_{\xi} $,

$ \frac{u{y}}{\beta(1 - \frac{vu{x}}{c^2})} = u_{\eta} $,

$ \rho' = \frac{\partial X'}{\partial \xi} + \frac{\partial Y'}{\partial \eta} + \frac{\partial Z'}{\partial \xi} = \beta(1 - \frac{vu_{x}}{c^2}) \rho $,

$ \frac{u{x}}{\beta(1 - \frac{vu{x}}{c^2})} = u_{\zeta} $,

Since the vector (u_{ξ}, u_{η}, u_{ζ}) is nothing but the velocity of the electrical mass measured in the system k, as can be easily seen from the addition-theorem of velocities in § 4—so it is hereby shown, that by taking our kinematical principle as the basis, the electromagnetic basis of Lorentz's theory of electrodynamics of moving bodies correspond to the relativity-postulate. It can be briefly remarked here that the following important law follows easily from the equations developed in the present section:—if an electrically charged body moves in any manner in space, and if its charge does not change thereby, when regarded from a system moving along with it, then the charge remains constant even when it is regarded from the stationary system K.

§ 10. Dynamics of the Electron (slowly accelerated).

Del 8Page 47 / 106

Del 8

Let us suppose that a point-shaped particle, having the electrical charge e (to be called henceforth the electron) moves in the electromagnetic field; we assume the following about its law of motion.

If the electron be at rest at any definite epoch, then in the next "particle of time," the motion takes place according to the equations

d²x d²y d²z m ----- = eX, m ----- = eY, m ----- = eZ dt² dt² dt²

Where (x, y, z) are the co-ordinates of the electron, and m is its mass.

Let the electron possess the velocity v at a certain epoch of time. Let us now investigate the laws according to which the electron will move in the 'particle of time' immediately following this epoch.

Without influencing the generality of treatment, we can and we will assume that, at the moment we are considering, the electron is at the origin of co-ordinates, and moves with the velocity v along the X-axis of the system. It is clear that at this moment (t = 0) the electron is at rest relative to the system k, which moves parallel to the X-axis with the constant velocity v.

From the suppositions made above, in combination with the principle of relativity, it is clear that regarded from the system k, the electron moves according to the equations

d²ξ d²η d²ζ m ----- = eX′, m ----- = eY′, m ----- = eZ′, dτ² dτ² dτ²

in the time immediately following the moment, where the symbols (ξ, η, ζ, τ, X', Y', Z') refer to the system k. If we now fix, that for t = v = y = z = 0, τ = ξ = η = ζ = 0, then the equations of transformation given in § 3 (and § 6) hold, and we have:

v τ = β(t - ---- x), ξ = β(x - vt), η = y, ζ = z, c²

v v
X′ = X, Y′ = β(Y - --- N), Z′ = β(Z + --- M)
c c

With the aid of these equations, we can transform the above equations of motion from the system k to the system K, and obtain:—

(A)

$ \frac{d^2 x}{dt^2} = \frac{e}{m} \frac{1}{\beta} X $,

$ \frac{d^2 y}{dt^2} = \frac{e}{m} \frac{1}{\beta} (Y - \frac{v}{c} N) $,

$ \frac{d^2 z}{dt^2} = \frac{e}{m} \frac{1}{\beta} (Z + \frac{v}{c} M) $

Del 8Page 48 / 106

Del 8 continues

Let us now consider, following the usual method of treatment, the longitudinal and transversal mass of a moving electron. We write the equations (A) in the form

d²x
mβ² ----- = eX = eX′
dt²

d²y v
mβ² ----- = eβ (Y - --- N) = eY′
dt² c

d²z v
mβ² ----- = eβ (Z - --- M) = eZ′
dt² c

and let us first remark, that eX′, eY′, eZ′ are the components of the ponderomotive force acting on the electron, and are considered in a moving system which, at this moment, moves with a velocity which is equal to that of the electron. This force can, for example, be measured by means of a spring-balance which is at rest in this last system. If we briefly call this force as "the force acting on the electron," and maintain the equation:—

Mass-number × acceleration-number = force-number, and if we further fix that the accelerations are measured in the stationary system K, then from the above equations, we obtain:—

Longitudinal mass:

$ \frac{m}{(\sqrt{1 - \frac{v^2}{c^2}})^{\frac{3}{2}}} $

Transversal mass:

$ \frac{m}{\sqrt{1 - \frac{v^2}{c^2}}} $

Naturally, when other definitions are given of the force and the acceleration, other numbers are obtained for the mass; hence we see that we must proceed very carefully in comparing the different theories of the motion of the electron.

We remark that this result about the mass hold also for ponderable material mass; for in our sense, a ponderable material point may be made into an electron by the addition of an electrical charge which may be as small as possible.

Let us now determine the kinetic energy of the electron. If the electron moves from the origin of co-ordinates of the system K with the initial velocity 0 steadily along the X-axis under the action of an electromotive force X, then it is clear that the energy drawn from the electrostatic field has the value ∫eXdx. Since the electron is only slowly accelerated, and in consequence, no energy is given out in the form of radiation, therefore the energy drawn from the electro-static field may be put equal to the energy W of motion. Considering the whole process of motion in questions, the first of equations A) holds, we obtain:—

Del 8Page 49 / 106

Del 8 continues

$ W = \int eXdx = \int_0^v m\beta^3 vdv = mc^2 (\frac{1}{\sqrt{1 - \frac{v^2}{c^2}}} - 1) $

For v = c, W is infinitely great. As our former result shows, velocities exceeding that of light can have no possibility of existence.

In consequence of the arguments mentioned above, this expression for kinetic energy must also hold for ponderable masses.

We can now enumerate the characteristics of the motion of the electrons available for experimental verification, which follow from equations A).

1. From the second of equations A), it follows that an electrical force Y, and a magnetic force N produce equal deflexions of an electron moving with the velocity v, when Y = Nv/c. Therefore we see that according to our theory, it is possible to obtain the velocity of an electron from the ratio of the magnetic deflexion A{m}, and the electric deflexion A{e}, by applying the law:—

$ \frac{A{m}}{A{e}} = \frac{v}{c} $

This relation can be tested by means of experiments because the velocity of the electron can be directly measured by means of rapidly oscillating electric and magnetic fields.

2. From the value which is deduced for the kinetic energy of the electron, it follows that when the electron falls through a potential difference of P, the velocity v which is acquired is given by the following relation:—

$ P = \int Xdx = \frac{m}{e}c^2 (\frac{1}{\sqrt{1 - \frac{v^2}{c^2}}} - 1) $

3. We calculate the radius of curvature R of the path, where the only deflecting force is a magnetic force N acting perpendicular to the velocity of projection. From the second of equations A) we obtain:

$ - \frac{d^2y}{dt^2} = \frac{v^2}{R} = \frac{e}{m} \frac{v}{c} N \sqrt{1 - \frac{v^2}{c^2}} $

mvβc
R = ----------
eN

These three relations are complete expressions for the law of motion of the electron according to the above theory.

Footnote 6:

Vide Note 9.

Footnote 7:

Vide Note 9.

Footnote 8:

Vide Note 12.

ALBRECHT EINSTEIN [A short biographical note.]

Del 9Page 50 / 106

Del 9

The name of Prof. Albrecht Einstein has now spread far beyond the narrow pale of scientific investigators owing to the brilliant confirmation of his predicted deflection of light-rays by the gravitational field of the sun during the total solar eclipse of May 29, 1919. But to the serious student of science, he has been known from the beginning of the current century, and many dark problems in physics has been illuminated with the lustre of his genius, before, owing to the latest sensation just mentioned, he flashes out before public imagination as a scientific star of the first magnitude.

Einstein is a Swiss-German of Jewish extraction, and began his scientific career as a privat-dozent in the Swiss University of Zürich about the year 1902. Later on, he migrated to the German University of Prague in Bohemia as ausser-ordentliche (or associate) Professor. In 1914, through the exertions of Prof. M. Planck of the Berlin University, he was appointed a paid member of the Royal (now National) Prussian Academy of Sciences, on a salary of 18,000 marks per year. In this post, he has only to do and guide research work. Another distinguished occupant of the same post was Van't Hoff, the eminent physical chemist.

It is rather difficult to give a detailed, and consistent chronological account of his scientific activities,—they are so variegated, and cover such a wide field. The first work which gained him distinction was an investigation on Brownian Movement. An admirable account will be found in Perrin's book 'The Atoms.' Starting from Boltzmann's theorem connecting the entropy, and the probability of a state, he deduced a formula on the mean displacement of small particles (colloidal) suspended in a liquid. This formula gives us one of the best methods for finding out a very fundamental number in physics—namely—the number of molecules in one gm. molecule of gas (Avogadro's number). The formula was shortly afterwards verified by Perrin, Prof. of Chemical Physics in the Sorbonne, Paris.

Del 9Page 51 / 106

Del 9 continues

To Einstein is also due the resuscitation of Planck's quantum theory of energy-emission. This theory has not yet caught the popular imagination to the same extent as the new theory of Time, and Space, but it is none the less iconoclastic in its scope as far as classical concepts are concerned. It was known for a long time that the observed emission of light from a heated black body did not correspond to the formula which could be deduced from the older classical theories of continuous emission and propagation. In the year 1900, Prof. Planck of the Berlin University worked out a formula which was based on the bold assumption that energy was emitted and absorbed by the molecules in multiples of the quantity hν, where h is a constant (which is universal like the constant of gravitation), and ν is the frequency of the light.

The conception was so radically different from all accepted theories that in spite of the great success of Planck's radiation formula in explaining the observed facts of black-body radiation, it did not meet with much favour from the physicists. In fact, some one remarked jocularly that according to Planck, energy flies out of a radiator like a swarm of gnats.

But Einstein found a support for the new-born concept in another direction. It was known that if green or ultraviolet light was allowed to fall on a plate of some alkali metal, the plate lost electrons. The electrons were emitted with all velocities, but there is generally a maximum limit. From the investigations of Lenard and Ladenburg, the curious discovery was made that this maximum velocity of emission did not at all depend on the intensity of light, but on its wavelength. The more violet was the light, the greater was the velocity of emission.

To account for this fact, Einstein made the bold assumption that the light is propagated in space as a unit pulse (he calls it a Light-cell), and falling on an individual atom, liberates electrons according to the energy equation

1
hν = --- mv² + A,
2

where (m, v) are the mass and velocity of the electron. A is a constant characteristic of the metal plate.

Del 9Page 52 / 106

Del 9 continues

There was little material for the confirmation of this law when it was first proposed (1905), and eleven years elapsed before Prof. Millikan established, by a set of experiments scarcely rivalled for the ingenuity, skill, and care displayed, the absolute truth of the law. As results of this confirmation, and other brilliant triumphs, the quantum law is now regarded as a fundamental law of Energetics. In recent years, X-rays have been added to the domain of light, and in this direction also, Einstein's photo-electric formula has proved to be one of the most fruitful conceptions in Physics.

The quantum law was next extended by Einstein to the problems of decrease of specific heat at low temperature, and here also his theory was confirmed in a brilliant manner.

We pass over his other contributions to the equation of state, to the problems of null-point energy, and photo-chemical reactions. The recent experimental works of Nernst and Warburg seem to indicate that through Einstein's genius, we are probably for the first time having a satisfactory theory of photo-chemical action.

In 1915, Einstein made an excursion into Experimental Physics, and here also, in his characteristic way, he tackled one of the most fundamental concepts of Physics. It is well-known that according to Ampere, the magnetisation of iron and iron-like bodies, when placed within a coil carrying an electric current is due to the excitation in the metal of small electrical circuits. But the conception though a very fruitful one, long remained without a trace of experimental proof, though after the discovery of the electron, it was generally believed that these molecular currents may be due to the rotational motion of free electrons within the metal. It is easily seen that if in the process of magnetisation, a number of electrons be set into rotatory motion, then these will impart to the metal itself a turning couple. The experiment is a rather difficult one, and many physicists tried in vain to observe the effect. But in collaboration with de Haas, Einstein planned and successfully carried out this experiment, and proved the essential correctness of Ampere's views.

Del 9Page 53 / 106

Del 9 continues

Einstein's studies on Relativity were commenced in the year 1905, and has been continued up to the present time. The first paper in the present collection forms Einstein's first great contribution to the Principle of Special Relativity. We have recounted in the introduction how out of the chaos and disorder into which the electrodynamics and optics of moving bodies had fallen previous to 1895, Lorentz, Einstein and Minkowski have succeeded in building up a consistent, and fruitful new theory of Time and Space.

But Einstein was not satisfied with the study of the special problem of Relativity for uniform motion, but tried, in a series of papers beginning from 1911, to extend it to the case of non-uniform motion. The last paper in the present collection is a translation of a comprehensive article which he contributed to the Annalen der Physik in 1916 on this subject, and gives, in his own words, the Principles of Generalized Relativity. The triumphs of this theory are now matters of public knowledge.

Einstein is now only 45, and it is to be hoped that science will continue to be enriched, for a long time to come, with further achievements of his genius.

Principle of Relativity

INTRODUCTION.

Del 10Page 54 / 106

Del 10

At the present time, different opinions are being held about the fundamental equations of Electro-dynamics for moving bodies. The Hertzian[9] forms must be given up, for it has appeared that they are contrary to many experimental results.
In 1895 H. A. Lorentz[10] published his theory of optical and electrical phenomena in moving bodies; this theory was based on the atomistic conception (vorstellung) of electricity, and on account of its great success appears to have justified the bold hypotheses, by which it has been ushered into existence. In his theory, Lorentz proceeds from certain equations, which must hold at every point of "Äther"; then by forming the average values over "Physically infinitely small" regions, which however contain large numbers of electrons, the equations for electro-magnetic processes in moving bodies can be successfully built up.
In particular, Lorentz's theory gives a good account of the non-existence of relative motion of the earth and the luminiferous "Äther"; it shows that this fact is intimately connected with the covariance of the original equation, when certain simultaneous transformations of the space and time co-ordinates are effected; these transformations have therefore obtained from H. Poincare[11] the name of Lorentz-transformations. The covariance of these fundamental equations, when subjected to the Lorentz-transformation is a purely mathematical fact i.e. not based on any physical considerations; I will call this the Theorem of Relativity; this theorem rests essentially on the form of the differential equations for the propagation of waves with the velocity of light.
Now without recognizing any hypothesis about the connection between "Äther" and matter, we can expect these mathematically evident theorems to have their consequences so far extended—that thereby even those laws of ponderable media which are yet unknown may anyhow possess this covariance when subjected to a Lorentz-transformation; by saying this, we do not indeed express an opinion, but rather a conviction,—and this conviction I may be permitted to call the Postulate of Relativity. The position of affairs here is almost the same as when the Principle of Conservation of Energy was postulated in cases, where the corresponding forms of energy were unknown.
Now if hereafter, we succeed in maintaining this covariance as a definite connection between pure and simple observable phenomena in moving bodies, the definite connection may be styled 'the Principle of Relativity.'
These differentiations seem to me to be necessary for enabling us to characterise the present day position of the electro-dynamics for moving bodies.
H. A. Lorentz[12] has found out the "Relativity theorem" and has created the Relativity-postulate as a hypothesis that electrons and matter suffer contractions in consequence of their motion according to a certain law.
A. Einstein[13] has brought out the point very clearly, that this postulate is not an artificial hypothesis but is rather a new way of comprehending the time-concept which is forced on us by observation of natural phenomena.
The Principle of Relativity has not yet been formulated for electro-dynamics of moving bodies in the sense characterized by me. In the present essay, while formulating this principle, I shall obtain the fundamental equations for moving bodies in a sense which is uniquely determined by this principle.
But it will be shown that none of the forms hitherto assumed for these equations can exactly fit in with this principle.[14]
We would at first expect that the fundamental equations which are assumed by Lorentz for moving bodies would correspond to the Relativity Principle. But it will be shown that this is not the case for the general equations which Lorentz has for any possible, and also for magnetic bodies; but this is approximately the case (if neglect the square of the velocity of matter in comparison to the velocity of light) for those equations which Lorentz hereafter infers for non-magnetic bodies. But this latter accordance with the Relativity Principle is due to the fact that the condition of non-magnetisation has been formulated in a way not corresponding to the Relativity Principle; therefore the accordance is due to the fortuitous compensation of two contradictions to the Relativity-Postulate. But meanwhile enunciation of the Principle in a rigid manner does not signify any contradiction to the hypotheses of Lorentz's molecular theory, but it shall become clear that the assumption of the contraction of the electron in Lorentz's theory must be introduced at an earlier stage than Lorentz has actually done.
In an appendix, I have gone into discussion of the position of Classical Mechanics with respect to the Relativity Postulate. Any easily perceivable modification of mechanics for satisfying the requirements of the Relativity theory would hardly afford any noticeable difference in observable processes; but would lead to very surprising consequences. By laying down the Relativity-Postulate from the outset, sufficient means have been created for deducing henceforth the complete series of Laws of Mechanics from the principle of conservation of Energy alone (the form of the Energy being given in explicit forms).
NOTATIONS.
Let a rectangular system (x, y, z, t,) of reference be given in space and time. The unit of time shall be chosen in such a manner with reference to the unit of length that the velocity of light in space becomes unity.
Although I would prefer not to change the notations used by Lorentz, it appears important to me to use a different selection of symbols, for thereby certain homogeneity will appear from the very beginning. I shall denote the vector electric force by E, the magnetic induction by M, the electric induction by e and the magnetic force by m, so that (E, M, e, m) are used instead of Lorentz's (E, B, D, H) respectively.
I shall further make use of complex magnitudes in a way which is not yet current in physical investigations, i.e., instead of operating with (t), I shall operate with (i t), where i denotes √(-1). If now instead of (x, y, z, i t), I use the method of writing with indices, certain essential circumstances will come into evidence; on this will be based a general use of the suffixes (1, 2, 3, 4). The advantage of this method will be, as I expressly emphasize here, that we shall have to handle symbols which have apparently a purely real appearance; we can however at any moment pass to real equations if it is understood that of the symbols with indices, such ones as have the suffix 4 only once, denote imaginary quantities, while those which have not at all the suffix 4, or have it twice denote real quantities.
An individual system of values of (x, y, z, t) i. e., of (x₁ x₂ x₃ x₄) shall be called a space-time point.
Further let u denote the velocity vector of matter, ε the dielectric constant, μ the magnetic permeability, σ the conductivity of matter, while ρ denotes the density of electricity in space, and x the vector of "Electric Current" which we shall some across in §7 and §8.

Del 10Page 55 / 106

Del 10 continues

PART I
§ 2.
The Limiting Case.
The Fundamental Equations for Äther.
By using the electron theory, Lorentz in his above mentioned essay traces the Laws of Electro-dynamics of Ponderable Bodies to still simpler laws. Let us now adhere to these simpler laws, whereby we require that for the limiting case ε = 1, μ = 1, σ = 0, they should constitute the laws for ponderable bodies. In this ideal limiting case ε = 1, μ = 1, σ = 0, E will be equal to e, and M to m. At every space time point (x, y, z, t) we shall have the equations[15]
(i) Curl m - (δe/δt) = ρu
(ii) div e = ρ
(iii) Curl e + δm/δt = 0
(iv) div m = 0
I shall now write (x₁ x₂ x₃ x₄) for (x, y, z, t) and (ρ₁, ρ₂, ρ₃, ρ₄) for
$ (\rho u{x}, \rho u{y}, \rho u_{z}, i\rho) $
i.e. the components of the convection current ρu, and the electric density multiplied by √ -1
Further I shall write
f_{2 3}, f_{3 1}, f_{1 2}, f_{1 4}, f_{2 4}, f_{3 4}.
for
m{x}, m{y}, m{z}, -ie{x}, -ie{y}, -ie{z}.
i.e., the components of m and (-i.e.) along the three axes; now if we take any two indices (h. k) out of the series
3, 4), f_{k h} = -f_{k h},
Therefore
f₃₂ = -f₂₃, f₁₃ = -f₃₁, f₂₁ = -f₁₂ f₄₁ = -f₁₄, f₄₄ = -f₂₄, f₄₃ = -f₃₄
Then the three equations comprised in (i), and the equation (ii) multiplied by i becomes
$ \begin{vmatrix} & \frac{\delta f{1 2}}{\delta x{2}} & + \frac{\delta f{1 3}}{\delta x{3}} & + \frac{\delta f{1 4}}{\delta x{4}} & = \rho{1} \frac{\delta f{2 1}}{\delta x{1}} & & + \frac{\delta f{2 3}}{\delta x{3}} & \times \frac{\delta f{2 4}}{\delta x{4}} & = \rho{2} \frac{\delta f{3 1}}{\delta x{1}} & \times \frac{\delta f{3 2}}{\delta x{2}} & & + \frac{\delta f{3 4}}{\delta x{4}} & = \rho{3} \frac{\delta f{4 1}}{\delta x{1}} & + \frac{\delta f{4 2}}{\delta x{2}} & + \frac{\delta f{4 3}}{\delta x{3}} & & = \rho{4} \end{vmatrix} × $
On the other hand, the three equations comprised in (iii) and the (iv) equation multiplied by (i) becomes
$ \begin{vmatrix} & \frac{\delta f{3 4}}{\delta x{2}} & + \frac{\delta f{4 2}}{\delta x{3}} & + \frac{\delta f{2 3}}{\delta x{4}} & = = \frac{\delta f{4 3}}{\delta x{1}} & & + \frac{\delta f{1 4}}{\delta x{3}} & + \frac{\delta f{3 1}}{\delta x{4}} & = 0 \frac{\delta f{2 4}}{\delta x{1}} & + \frac{\delta f{4 1}}{\delta x{2}} & & + \frac{\delta f{1 2}}{\delta x{4}} & = 0 \frac{\delta f{3 2}}{\delta x{1}} & + \frac{\delta f{1 3}}{\delta x{2}} & + \frac{\delta f{2 1}}{\delta x{3}} & & = - \end{vmatrix} × $
By means of this method of writing we at once notice the perfect symmetry of the 1st as well as the 2nd system of equations as regards permutation with the indices, (1, 2, 3, 4).
§ 3.

Del 11Page 56 / 106

Del 11

It is well-known that by writing the equations i) to iv) in the symbol of vector calculus, we at once set in evidence an invariance (or rather a (covariance) of the system of equations A) as well as of B), when the co-ordinate system is rotated through a certain amount round the null-point. For example, if we take a rotation of the axes round the z-axis, through an amount φ, keeping e, m fixed in space, and introduce new variables x₁′ x₂′ x₃′ x₄′ instead of x₁ x₂ x₃ x₄ where x′₁ = x₁ cos φ + x₂ sin φ, x′₂ = -x₁ sin φ + x₂ cos φ, x′₃ = x₃, x′₄ = x₄, and introduce magnitudes ρ′₁, ρ′₂, ρ′₃, ρ′₄, where ρ₁′ = ρ₁ cos φ + ρ₂ sin φ, ρ₂′ = - ρ₁ sin φ + ρ₂ cos φ and f′_{1 2},...... f′_{3 4}, where

f′₂₃ = f₂₃ cos φ + f₃₁ sin φ,
f′₃₁ = - f₂₃ sin φ + f₃₁ cos φ,
f′₁₂ = f₁₂,
f′₁₄ = f₁₄ cos φ + f₂₄ sin φ,
f′₂₄ = - f₁₄ sin φ + f₂₄ cos φ,
f′₃₄ = f₃₄_{3 4},
f′_{k h} = - f_{k h} (h l k = 1, 2, 3, 4).

then out of the equations (A) would follow a corresponding system of dashed equations (A´) composed of the newly introduced dashed magnitudes.

So on the ground of symmetry alone of the equations (A) and (B) concerning the suffixes (1, 2, 3, 4), the theorem of Relativity, which was found out by Lorentz, follows without any calculation at all.

I will denote by iψ a purely imaginary magnitude, and consider the substitution

x₁′ = x₁,
x₂′ = x₂,
x₃′ = x₃ cos iψ + x₄ sin iψ, (1)
x₄′´ = - x₃ sin iψ + x₄ cos iψ,

Putting

$ - i \tan i\psi = \frac{e^{\psi} - e^{-\psi}}{e^{\psi}+e^{-\psi}} = q $,

$ \psi = \frac{1}{2} \log \frac{1 + q}{1 - q′} $ (2)

We shall have cos iψ = 1/√(1 - q²), sin iψ = iq/√(1 - q²) where -1 < q < 1, and √(1 - q²) is always to be taken with the positive sign.

Let us now write x′₁ = x′, x′₂ = y′, x′₃ = z′, x′₄ = it′ (3)

then the substitution 1) takes the form

x′ = x, y′ = y, z′ = (z - qt)/√(1 - q²), t′ = (-qz + t)/√(1 - q²), (4)

the coefficients being essentially real.

Del 11Page 57 / 106

Del 11 continues

If now in the above-mentioned rotation round the Z-axis, we replace 1, 2, 3, 4 throughout by 3, 4, 1, 2, and φ by iψ, we at once perceive that simultaneously, new magnitudes ρ′₁, ρ′₂, ρ′₃, ρ′₄, where

ρ′₁ = ρ₁, ρ′₂ = ρ₂, ρ′₃ = ρ₃ cos iψ + ρ₄ sin iψ, ρ′₄ = - ρ₃ sin iψ + ρ₄ cos iψ),

and f′_{1 2}... f′_{3 4}, where

f′_{4 1} = f_{4 1} cos iψ + f_{1 3} sin iψ, f′_{1 3} = - f_{4 1} sin iψ + f_{1 3} cos iψ, f′_{3 4} = f_{3 4}, f′_{3 2} = f_{3 2} cos iψ + f_{4 2} sin iψ, f′_{4 2} = - f_{3 2} sin iψ + f_{4 2} cos iψ, f′_{1 2} = f_{1 2}, f_{k h} = - f′_{k h},

must be introduced. Then the systems of equations in (A) and (B) are transformed into equations (A´), and (B´), the new equations being obtained by simply dashing the old set.

All these equations can be written in purely real figures, and we can then formulate the last result as follows.

If the real transformations 4) are taken, and x´ y´ z´ t´ be taken as a new frame of reference, then we shall have

(5) ρ´ = ρ [(-qu_{z} + 1)/√(1 - q²)],
ρ´u_{z}´ = ρ[(u_{z} - q)/√(1 - q²)],
ρ´u_{x}´ = ρu_{x},
ρ´u_{y}´ = ρu_{y}.

(6) e´_{x´} = (e_{x} - qm_{y})/(√(1 - q²)), m´_{r´} = (qe_{x} + m_{y})/(√(1 - q²)), e´_{z´} = e_{z}.

(7) m´_{x´} = (m_{x} - qe_{y})/(√(1 - q²)), e´_{y´} = (qm_{x} + e_{y})/(√(1 - q²)), m´{z´} = m__{z}.

Then we have for these newly introduced vectors u´, e´, m´ (with components u_{x}´, u_{y}´, u_{z}´; e_{x}´, e_{y}´, e_{z}´; m_{x}´, m_{y}´, m_{z}´), and the quantity ρ´ a series of equations I´), II´), III´), IV´) which are obtained from I), II), III), IV) by simply dashing the symbols.

We remark here that e_{x} - qm_{y}, e_{y} + qm_{x} are components of the vector e + [vm], where v is a vector in the direction of the positive Z-axis, and | v | = q, and [vm] is the vector product of v and m; similarly -qe_{x} + m_{y}, m_{x} + qe_{y} are the components of the vector m - [ve].

The equations 6) and 7), as they stand in pairs, can be expressed as.

Del 11Page 58 / 106

Del 11 continues

e′_{x′} + im′_{x′} = (e_{x} + im_{x}) cos iψ + (e_{y} + im_{y}) sin iψ,

e′_{y′} + im′_{y′} = - (e_{x} + im_{x}) sin iψ + (e_{y} + im_{y}) cos iψ,

e′_{z′} + im′_{z′} = e′_{z} + im_{z}.

If φ denotes any other real angle, we can form the following combinations:—

(e′_{x′} + im′_{x′}) cos. φ + (e′_{y″} + im′_{y′}) sin φ

= (e_{x} + im_{x}) cos. (φ + iψ) + (e_{y} + im_{y}) sin (φ + iψ),

= (e′_{x′} + im′_{x′}) sin φ + (e′_{y′} + im′_{y′}) cos. φ

= - (e_{x} + im_{x}) sin (φ + iψ) + (e_{y} + im_{y}) cos. (φ + iψ).

§ 4. Special Lorentz Transformation.

The rôle which is played by the Z-axis in the transformation (4) can easily be transferred to any other axis when the system of axes are subjected to a transformation about this last axis. So we came to a more general law:—

Let v be a vector with the components v_{x}, v_{y}, v_{z}, and let | v | = q < 1. By ṽ we shall denote any vector which is perpendicular to v, and by r_{v}, r_{ṽ} we shall denote components of r in direction of ṽ and v.

Instead of (x, y, z, t), new magnetudes (x′ y′ z′ t′) will be introduced in the following way. If for the sake of shortness, r is written for the vector with the components (x, y, z) in the first system of reference, r′ for the same vector with the components (x′ y′ z′) in the second system of reference, then for the direction of v, we have

(10) r′_{v} = (r_{v} - qt)/√(1 - q²)

and for the perpendicular direction ṽ,

(11) r′_{ṽ} = r_{ṽ}

and further (12) t′ = (-qr_{v} + t)/√(1 - q²).

The notations (r′_{ṽ}, r′_{v}) are to be understood in the sense that with the directions v, and every direction ṽ perpendicular to v in the system (x, y, z) are always associated the directions with the same direction cosines in the system (x′ y′ z′).

Del 11Page 59 / 106

Del 11 continues

A transformation which is accomplished by means of (10), (11), (12) with the condition 0 < q < 1 will be called a special Lorentz-transformation. We shall call v the vector, the direction of v the axis, and the magnitude of v the moment of this transformation.

If further ρ′ and the vectors u′, e′, m′, in the system (x′ y′ z′) are so defined that,

(13) ρ′ = ρ[(-qu_{v} + 1)/√(1 - q²)],
ρ′u′{v} = ρ(u__{v} - q)/√(1 - q²),
ρ′u_{ṽ} = ρ′u_{v},

further

(14) (e′ + im′){ṽ} = ((e + im) - i[v, (e + im])']{ṽ})/√(1 - q²).

(15) (e′ + im′){v} = (e + im) - i[u, (e + im)]{v}.

Then it follows that the equations I), II), III), IV) are transformed into the corresponding system with dashes.

The solution of the equations (10), (11), (12) leads to

(16) r_{v} = (r′_{v} + qt′)/√(1 - q²), r_{ṽ} = r′_{ṽ}, t = (qr′_{v} + t′)/√(1 - q²),

Now we shall make a very important observation about the vectors u and u′. We can again introduce the indices 1, 2, 3, 4, so that we write (x₁′, x₂′, x₃′, x₄′) instead of (x′, y′, z′, it′) and ρ₁′, ρ₂′, ρ₃′, ρ₄′ instead of (ρ′u′{x′}, ρ′u′{y′}, ρ′u′{z′}, iρ′).

Like the rotation round the Z-axis, the transformation (4), and more generally the transformations (10), (11), (12), are also linear transformations with the determinant + 1, so that

(17) x₁² + x₂² + x₃² + x₄² i. e. x² + y² + z² - t²,

is transformed into

x₁′² + x₂′² + x₃′² + x₄′² i. e. x′² + y′² + z′² - t′².

On the basis of the equations (13), (14), we shall have (ρ₁² + ρ₂² + ρ₃² + ρ₄²) = ρ²(1 - u_{x²}, -u_{y²}, -u_{z²}) = ρ²(1 - u²) transformed into ρ²(1 - u²) or in other words,

(18) ρ√(1 - u²)

is an invariant in a Lorentz-transformation.

If we divide (ρ₁, ρ₂, ρ₃, ρ₄) by this magnitude, we obtain the four values (ω₁, ω₂, ω₃, ω₄) = (1/√(1 - u²))(u_{x}, u_{y}, u_{z}, i) so that ω₁² + ω₂² + ω₃² + ω₄² = -1.

Del 11Page 60 / 106

Del 11 continues

It is apparent that these four values are determined by the vector u and inversely the vector u of magnitude < 1 follows from the 4 values ω₁, ω₂, ω₃, ω₄; where (ω₁, ω₂, ω₃) are real, -iω₄ real and positive and condition (19) is fulfilled.

The meaning of (ω₁, ω₂, ω₃, ω₄) here is, that they are the ratios of dx₁, dx₂, dx₃, dx₄ to

(20) √(-(dx₁² + dx₂² + dx₃² + dx₄²)) = dt√(1 - u²).

The differentials denoting the displacements of matter occupying the spacetime point (x₁, x₂, x₃, x₄) to the adjacent space-time point.

After the Lorentz-transformation is accomplished the velocity of matter in the new system of reference for the same space-time point (x′ y′ z′ t′) is the vector u′ with the ratios dx′/dt′, dy′/dt′, dz′/dt′, dl′/dt′, as components.

Now it is quite apparent that the system of values

x₁ = ω₁, x₂ = ω₂, x₃ = ω₃, x₄ = ω₄

is transformed into the values

x₁′ = ω₁′, x₂′ = ω₂′, x₃′ = ω₃′, x₄′ = ω₄′

in virtue of the Lorentz-transformation (10), (11), (12).

The dashed system has got the same meaning for the velocity u′ after the transformation as the first system of values has got for u before transformation.

If in particular the vector v of the special Lorentz-transformation be equal to the velocity vector u of matter at the space-time point (x₁, x₂, x₃, x₄) then it follows out of (10), (11), (12) that

ω₁′ = 0, ω₂′ = 0, ω₃′ = 0, ω₄′ = i

Under these circumstances therefore, the corresponding space-time point has the velocity v′ = 0 after the transformation, it is as if we transform to rest. We may call the invariant ρ√(1 - u²) the rest-density of Electricity.[16]

§ 5. Space-time Vectors. Of the 1st and 2nd kind.

Del 12Page 61 / 106

Del 12

If we take the principal result of the Lorentz transformation together with the fact that the system (A) as well as the system (B) is covariant with respect to a rotation of the coordinate-system round the null point, we obtain the general relativity theorem. In order to make the facts easily comprehensible, it may be more convenient to define a series of expressions, for the purpose of expressing the ideas in a concise form, while on the other hand I shall adhere to the practice of using complex magnitudes, in order to render certain symmetries quite evident.

Let us take a linear homogeneous transformation,

$ \begin{vmatrix} x{1} x{2} x{3} x{4} \end{vmatrix} = \begin{vmatrix} a{1 1} & a{1 2} & a{1 3} & a{1 4} a{2 1} & a{2 2} & a{2 3} & a{2 4} a{3 1} & a{3 2} & a{3 3} & a{3 4} a{4 1} & a{4 2} & a{4 3} & a{4 4} \end{vmatrix} \begin{vmatrix} x{1}' x{2}' x{3}' x{4}' \end{vmatrix} $

the Determinant of the matrix is +1, all co-efficients without the index 4 occurring once are real, while a₄₁, a₄₂, a₄₃, are purely imaginary, but a₄₄ is real and > 0, and x₁² + x₂² + x₃² + x₄² transforms into x₁′² + x₂′² + x₃′² + x₄′². The operation shall be called a general Lorentz transformation.

(This notation, which is due to Dr. C. E. Cullis of the Calcutta University, has been used throughout instead of Minkowski's notation, x₁ = a₁₁x₁′ + a₁₂x₂′+ a₁₃x₃′+ a₁₄x₄′.)

If we put x₁′ = x′, x₂′ = y′, x₃′ = z′, x₄′ = it′, then immediately there occurs a homogeneous linear transformation of (x, y, z, t) to (x′, y′, z′, t′) with essentially real co-efficients, whereby the aggregate -x² - y² - z² + t² transforms into -x′² - y′² - z′² + t′², and to every such system of values x, y, z, t with a positive t, for which this aggregate > 0, there always corresponds a positive t'; this last is quite evident from the continuity of the aggregate x, y, z, t.

The last vertical column of co-efficients has to fulfil the condition 22) a₁₄² + a₂₄² + a₃₄² + a₄₄² = 1.

Del 12Page 62 / 106

Del 12 continues

If a₁₄ = a₂₄ = a₃₄ = 0, then a₄₄ = 1, and the Lorentz transformation reduces to a simple rotation of the spatial co-ordinate system round the world-point.

If a₁₄, a₂₄, a₃₄ are not all zero, and if we put a₁₄: a₂₄: a₃₄: a₄₄ = v_{x}: v_{y}: v_{z}: i

q = √(v_{x}² + v_{y}² +v_{z}²) < 1.

On the other hand, with every set of values of a₁₄, a₂₄, a₃₄, a₄₄ which in this way fulfil the condition 22) with real values of v_{x}, v_{y}, v_{z}, we can construct the special Lorentz transformation (16) with (a₁₄, a₂₄, a₃₄, a₄₄) as the last vertical column,—and then every Lorentz-transformation with the same last vertical column (a₁₄, a₂₄, a₃₄, a₄₄) can be supposed to be composed of the special Lorentz-transformation, and a rotation of the spatial co-ordinate system round the null-point.

The totality of all Lorentz-Transformations forms a group. Under a space-time vector of the 1st kind shall be understood a system of four magnitudes (ρ₁, ρ₂, ρ₃, ρ₄) with the condition that in case of a Lorentz-transformation it is to be replaced by the set (ρ₁′, ρ₂′, ρ₃′, ρ₄′), where these are the values of (x₁′, x₂′, x₃′, x₄′), obtained by substituting (ρ₁, ρ₂, ρ₃, ρ₄) for (x₁, x₂, x₃, x₄) in the expression (21).

Besides the time-space vector of the 1st kind (x₁, x₂, x₃, x₄) we shall also make use of another space-time vector of the first kind (y₁, y₂, y₃, y₄), and let us form the linear combination

(23) f₂₃(x₂_y₃ - x₃__y₂) + f₃₁(x₃__y₁ - x₁__y₃) + f₁₂(x₁__y₂ - x₂__y₁) + f₁₄(x₁__y₄ - x₄__y₁) + f₂₄(x₂__y₄ - x₄__y₂) + f₃₄(x₃__y₄ - x₄__y₃_)

with six coefficients f₂₃--f₃₄. Let us remark that in the vectorial method of writing, this can be constructed out of the four vectors.

x₁, x₂, x₃; y₁, y₂, y₃; f₂₃, f₃₁, f₁₂; f₁₄, f₂₄, f₃₄ and the constants x₄ and y₄, at the same time it is symmetrical with regard the indices (1, 2, 3, 4).

If we subject (x₁, x₂, x₃, x₄) and (y₁, y₂, y₃, y₄) simultaneously to the Lorentz transformation (21), the combination (23) is changed to:

Del 12Page 63 / 106

Del 12 continues

(24) f₂₃′(x₂′_y₃′ - x₃′__y₂′) + f₃₁(x₃′__y₁′ - x₁′__y₃′) + f₁₂ (x₁′__y₂′ - x₂′__y₁′) + f₁₄′(x₁′__y₄′) - x₄′__y₁′) + f₂₄′(x₂′__y₄′ - x₄′__y₂′) + f₃₄′(x₃′__y₄′ - x₄′__y₃′_),

where the coefficients f₂₃′, f₃₁′, f₁₂′, f₁₄′, f₂₄′, f₃₄′, depend solely on (f₂₃ f₂₄) and the coefficients a₁₁... a₄₄.

We shall define a space-time Vector of the 2nd kind as a system of six-magnitudes f₂₃, f₃₁... f₃₄, with the condition that when subjected to a Lorentz transformation, it is changed to a new system f₂₃′... f₃₄,... which satisfies the connection between (23) and (24).

I enunciate in the following manner the general theorem of relativity corresponding to the equations (I)-(iv),—which are the fundamental equations for Äther.

If x, y, z, it (space co-ordinates, and time it) is subjected to a Lorentz transformation, and at the same time (pu_{x}, pu_{y}, pu_{z}, iρ) (convection-current, and charge density ρi) is transformed as a space time vector of the 1st kind, further (m_{x}, m_{y}, m_{z}, -ie_{x}, -ie_{y}, -ie_{z}) (magnetic force, and electric induction × (-i) is transformed as a space time vector of the 2nd kind, then the system of equations (I), (II), and the system of equations (III), (IV) transforms into essentially corresponding relations between the corresponding magnitudes newly introduced into the system.

These facts can be more concisely expressed in these words: the system of equations (I and II) as well as the system of equations (III) (IV) are covariant in all cases of Lorentz-transformation, where (ρu, iρ) is to be transformed as a space time vector of the 1st kind, (m - ie) is to be treated as a vector of the 2nd kind, or more significantly,—

(ρu, iρ) is a space time vector of the 1st kind, (m - ie)[17] is a space-time vector of the 2nd kind.

I shall add a few more remarks here in order to elucidate the conception of space-time vector of the 2nd kind. Clearly, the following are invariants for such a vector when subjected to a group of Lorentz transformation.

(i) m² - e² = f₂₃² + f₃₁² + f₁₂² + f₁₄² + f₂₄² + f₂₄²

Del 12Page 64 / 106

Del 12 continues

me = i(f₂₃_f₁₄ + f₃₁__f₂₄ + f₁₂__f₃₄_).

A space-time vector of the second kind (m - ie), where (m and e) are real magnitudes, may be called singular, when the scalar square (m - ie)² = 0, ie m² - e² = 0, and at the same time (m e) = 0, ie the vector m and e are equal and perpendicular to each other; when such is the case, these two properties remain conserved for the space-time vector of the 2nd kind in every Lorentz-transformation.

If the space-time vector of the 2nd kind is not singular, we rotate the spacial co-ordinate system in such a manner that the vector-product [me] coincides with the Z-axis, i.e. m_{x} = 0, e_{x} = 0. Then

(m_{x}, -i e_{x})² + (m_{y}, -i e_{y})² ≠ 0.

Therefore (e_{y} + i m_{y})/(e_{x} + i e_{x}) is different from +i, and we can therefore define a complex argument (φ + iψ) in such a manner that

tan (φ + iψ)

e_{y} + i m_{y}
= -------------------------
e_{x} + i m_{x}

If then, by referring back to equations (9), we carry out the transformation (1) through the angle ψ and a subsequent rotation round the Z-axis through the angle φ, we perform a Lorentz-transformation at the end of which m_{y} = 0, e_{y} = 0, and therefore m and e shall both coincide with the new Z-axis. Then by means of the invariants m² - e², (me) the final values of these vectors, whether they are of the same or of opposite directions, or whether one of them is equal to zero, would be at once settled.

§ 6. Concept of Time.

Del 12Page 65 / 106

Del 12 continues

By the Lorentz transformation, we are allowed to effect certain changes of the time parameter. In consequence of this fact, it is no longer permissible to speak of the absolute simultaneity of two events. The ordinary idea of simultaneity rather presupposes that six independent parameters, which are evidently required for defining a system of space and time axes, are somehow reduced to three. Since we are accustomed to consider that these limitations represent in a unique way the actual facts very approximately, we maintain that the simultaneity of two events exists of themselves.[18] In fact, the following considerations will prove conclusive.

Let a reference system (x, y, z, t) for space time points (events) be somehow known. Now if a space point A (x₀, y₀, z₀) the time t₀ be compared with a space point P (x, y, z) at the time t, and if the difference of time t - t₀, (let t > t₀) be less than the length A P i.e. less than the time required for the propagation of light from A to P, and if q = (t - t₀)/(A P) < 1, then by a special Lorentz transformation, in which A P is taken as the axis, and which has the moment q, we can introduce a time parameter t′, which (see equation 11, 12, § 4) has got the same value t′ = 0 for both space-time points (A, t₀), and (P, t). So the two events can now be comprehended to be simultaneous.

Further, let us take at the same time t₀ = 0, two different space-points A, B, or three space-points (A, B, C) which are not in the same space-line, and compare therewith a space point P, which is outside the line A B, or the plane A B C, at another time t, and let the time difference t - t₀ (t > t₀) be less than the time which light requires for propagation from the line A B, or the plane (A B C) to P. Let q be the quotient of (t - t₀) by the second time. Then if a Lorentz transformation is taken in which the perpendicular from P on A B, or from P on the plane A B C is the axis, and q is the moment, then all the three (or four) events (A, t₀), (B, t₀), (C, t₀) and (P, t) are simultaneous.

Del 12Page 66 / 106

Del 12 continues

If four space-points, which do not lie in one plane, are conceived to be at the same time t₀, then it is no longer permissible to make a change of the time parameter by a Lorentz-transformation, without at the same time destroying the character of the simultaneity of these four space points.

To the mathematician, accustomed on the one hand to the methods of treatment of the poly-dimensional manifold, and on the other hand to the conceptual figures of the so-called non-Euclidean Geometry, there can be no difficulty in adopting this concept of time to the application of the Lorentz-transformation. The paper of Einstein which has been cited in the Introduction, has succeeded to some extent in presenting the nature of the transformation from the physical standpoint.

PART II. ELECTRO-MAGNETIC PHENOMENA. § 7. Fundamental Equations for bodies at rest.

Del 13Page 67 / 106

Del 13

After these preparatory works, which have been first developed on account of the small amount of mathematics involved in the limiting case ε = 1, μ = 1, σ = 0, let us turn to the electro-magnetic phenomena in matter. We look for those relations which make it possible for us—when proper fundamental data are given—to obtain the following quantities at every place and time, and therefore at every space-time point as functions of (x, y, z, t):—the vector of the electric force E, the magnetic induction M, the electrical induction e, the magnetic force m, the electrical space-density ρ, the electric current s (whose relation hereafter to the conduction current is known by the manner in which conductivity occurs in the process), and lastly the vector v, the velocity of matter.

The relations in question can be divided into two classes.

Firstly—those equations, which,—when v, the velocity of matter is given as a function of (x, y, z, t),—lead us to a knowledge of other magnitude as functions of x, y, z, t—I shall call this first class of equations the fundamental equations—

Secondly, the expressions for the ponderomotive force, which, by the application of the Laws of Mechanics, gives us further information about the vector u as functions of (x, y, z, t).

For the case of bodies at rest, i.e. when u (x, y, z, t) = 0 the theories of Maxwell (Heaviside, Hertz) and Lorentz lead to the same fundamental equations. They are;—

(1) The Differential Equations:—which contain no constant referring to matter:—

(i) Curl m - δe/δt = C,
(ii) div e = lρ.
(iii) Curl E + δM/δt = 0,
(iv) Div M = 0.

(2) Further relations, which characterise the influence of existing matter for the most important case to which we limit ourselves i.e. for isotopic bodies;—they are comprised in the equations

(V) e = ε E, M = μm, C = σE.

where ε = dielectric constant, μ = magnetic permeability, σ = the conductivity of matter, all given as function of x, y, z, t; s is here the conduction current.

By employing a modified form of writing, I shall now cause a latent symmetry in these equations to appear. I put, as in the previous work,

Del 13Page 68 / 106

Del 13 continues

x₁ = x, x₂ = y, x₃ = z, x₄ = it,

and write s₁, s₂, s₃, s₄ for C{x}, C{y}, C{z_} (√-1)ρ.

Further f₂₃, f₃₁, f₁₂, f₁₄, f₂₄, f₃₄

for m_{x}, m_{y}, m_{z}, -i(e_{x}, e_{y}, e_{z}),

and F₂₃, F₃₁, F₁₂, F₁₄, F₂₄, F₃₄

for M{x}, M{y}, M{z}, -i(E{x}, E{y}, E{z})

lastly we shall have the relation f_{k h} = - f_{h k}, F_{k h} = -F_{h k}, (the letter f, F shall denote the field, s the (i.e. current).

Then the fundamental Equations can be written as

(A) ∂f₁₂/∂x₂ + ∂f₁₃/∂x₃ + ∂f₁₄/∂x₄ = s₁

∂f₂₁/∂x₁ + + ∂f₂₃/∂x₃ + ∂f₂₄/∂x₄ = s₂

∂f₃₁/∂x₁ + ∂f₃₂/∂x₂ + + ∂f₃₄/∂x₄ = s₃

∂f₄₁/∂x₁ + ∂f₄₂/∂x₂ + ∂f₄₃/∂x₃ = s₄

and the equations (3) and (4), are

∂F₃₄/∂x₂ + ∂F₄₂/∂x₃ + ∂F₂₃/∂x₄ = 0

∂F₄₃/∂x₁ + + ∂F₁₄/∂x₃ + ∂F₃₁∂x₄ = 0

∂F₂₄/∂x₁ + ∂F₄₁/∂x₂ + + ∂F₁₂/∂x₄ = 0

∂F₃₂/∂x₁ + ∂F₁₃/∂x₂ + ∂F₂₁/∂x₃ = 0

§ 8. The Fundamental Equations.

We are now in a position to establish in a unique way the fundamental equations for bodies moving in any manner by means of these three axioms exclusively.

The first Axion shall be,—

When a detached region[19] of matter is at rest at any moment, therefore the vector u is zero, for a system (x, y, z, t)—the neighbourhood may be supposed to be in motion in any possible manner, then for the space-time point x, y, z, t, the same relations (A) (B) (V) which hold in the case when all matter is at rest, shall also hold between ρ, the vectors C, e, m, M, E and their differentials with respect to x, y, z, t. The second axiom shall be:—

Every velocity of matter is < 1, smaller than the velocity of propagation of light.[20]

The fundamental equations are of such a kind that when (x, y, z, it) are subjected to a Lorentz transformation and thereby (m - ie) and (M - iE) are transformed into space-time vectors of the second kind, (C, iρ) as a space-time vector of the 1st kind, the equations are transformed into essentially identical forms involving the transformed magnitudes.

Shortly I can signify the third axiom as:—

Del 13Page 69 / 106

Del 13 continues

(m, -ie), and (M, -iE) are space-time vectors of the second kind, (C, ip) is a space-time vector of the first kind.

This axiom I call the Principle of Relativity.

In fact these three axioms lead us from the previously mentioned fundamental equations for bodies at rest to the equations for moving bodies in an unambiguous way.

According to the second axiom, the magnitude of the velocity vector | u | is < 1 at any space-time point. In consequence, we can always write, instead of the vector u, the following set of four allied quantities

ω₁ = u{x}/√(1 - u²_),
ω₂ = u{y_}/√(1 - u²),
ω₃ = u{z_}/√(1 - u²),
ω₄ = i/√(1 - u²)

with the relation

(27) ω₁² + ω₂² + ω₃² + ω₄² = - |

From what has been said at the end of § 4, it is clear that in the case of a Lorentz-transformation, this set behaves like a space-time vector of the 1st kind.

Let us now fix our attention on a certain point (x, y, z) of matter at a certain time (t). If at this space-time point u = 0, then we have at once for this point the equations (A), (B) (V) of § 7. If u ≠ 0, then there exists according to 16), in case | u | < 1, a special Lorentz-transformation, whose vector v is equal to this vector u (x, y, z, t), and we pass on to a new system of reference (x′ y′ z′ t′) in accordance with this transformation. Therefore for the space-time point considered, there arises as in § 4, the new values 28) ω′₁ = 0, ω′₂ = 0, ω′₃ = 0, ω′₄ = i, therefore the new velocity vector ω′ = 0, the space-time point is as if transformed to rest. Now according to the third axiom the system of equations for the transformed point (x′ y′ z′ t) involves the newly introduced magnitude (u′ ρ′, C′, e′, m′, E′, M′) and their differential quotients with respect to (x′, y′, z′, t′) in the same manner as the original equations for the point (x, y, z, t). But according to the first axiom, when u′ = 0, these equations must be exactly equivalent to

(1) the differential equations (A′), (B′), which are obtained from the equations (A), (B) by simply dashing the symbols in (A) and (B).

(2) and the equations

Del 13Page 70 / 106

Del 13 continues

(V′) e′ = εE′, M' = μm′, C′ = σE′

where ε, μ, σ are the dielectric constant, magnetic permeability, and conductivity for the system (x′ y′ z′ t′) i.e. in the space-time point (x y, z t) of matter.

Now let us return, by means of the reciprocal Lorentz-transformation to the original variables (x, y, z, t), and the magnitudes (u, ρ, C, e, m, E, M) and the equations, which we then obtain from the last mentioned, will be the fundamental equations sought by us for the moving bodies.

Now from § 4, and § 6, it is to be seen that the equations A), as well as the equations B) are covariant for a Lorentz-transformation, i.e. the equations, which we obtain backwards from A′) B′), must be exactly of the same form as the equations A) and B), as we take them for bodies at rest. We have therefore as the first result:—

The differential equations expressing the fundamental equations of electrodynamics for moving bodies, when written in ρ and the vectors C, e, m, E, M, are exactly of the same form as the equations for moving bodies. The velocity of matter does not enter in these equations. In the vectorial way of writing, we have

I) curl m - ∂e/∂t = C₁,

II) div e = ρ

III) curl E + ∂M/∂t = 0

IV) div M = 0

The velocity of matter occurs only in the auxiliary equations which characterise the influence of matter on the basis of their characteristic constants ε, μ, σ. Let us now transform these auxiliary equations V′) into the original co-ordinates (x, y, z, and t.)

According to formula 15) in § 4, the component of e′ in the direction of the vector u is the same as that of (e + [u m]), the component of m′ is the same as that of m - [u e], but for the perpendicular direction ū, the components of e′, m′ are the same as those of (e + [u m]) and (m - [u e], multiplied by 1/√(1 - u²). On the other hand E′ and M′ shall stand to E + [uM], and M - [uE] in the same relation as e′ and m′ to e + [um], and m - (ue). From the relation e′ = εE′, the following equations follow

and from the relation M′ = μm′, we have

Del 13Page 71 / 106

Del 13 continues

(D) M - [u E] = μ(m - [ue]),

For the components in the directions perpendicular to u, and to each other, the equations are to be multiplied by √(1 - u²).

Then the following equations follow from the transformation? equations (12), (10), (11) in § 4, when we replace q, r_{v}, r_{ṽ}, t, r′_{v}, r′_{ṽ}, t' by |u|, C{u}, C{ū}, ρ, C′{u}, C′{ū}, ρ′

ρ′ = (-|u| C{u} + ρ)/√(1 - u²_),
C'{u} = (C{u} - |u|ρ)/√(1 - u²),
C′{ū} = C{ū},

E) (C{u} - |u|ρ)/√(1 - u²) = σ(E + [uM]){u},

C{ū} = σ (E + [uM]){u}/√(1 - u²).

In consideration of the manner in which σ enters into these relations, it will be convenient to call the vector C - ρu with the components C{u} - ρ|u| in the direction of u, and C′{ū} in the directions ū perpendicular to u the "Convection current." This last vanishes for σ = 0.

We remark that for ε = 1, μ = 1 the equations e′ = E′, m′ = M′ immediately lead to the equations e = E, m = M by means of a reciprocal Lorentz-transformation with -u as vector; and for σ = 0, the equation C′ = 0 leads to C = ρu; that the fundamental equations of Äther discussed in § 2 becomes in fact the limitting case of the equations obtained here with ε = 1, μ = 1, σ = 0.

§ 9. The Fundamental Equations in Lorentz's Theory.

Del 14Page 72 / 106

Del 14

Let us now see how far the fundamental equations assumed by Lorentz correspond to the Relativity postulate, as defined in §8. In the article on Electron-theory (Ency., Math., Wiss., Bd. V. 2, Art 14) Lorentz has given the fundamental equations for any possible, even magnetised bodies (see there page 209, Eqn XXX′, formula (14) on page 78 of the same (part).

(IIIa″) Curl (H - [uE]) = J + dD/dt + u div D - curl [uD].

(I″) div D = ρ

(IV″) curl E = - dB/dt, Div B = 0 (V′)

Then for moving non-magnetised bodies, Lorentz puts (page 223, 3) μ = 1, B = H, and in addition to that takes account of the occurrence of the di-electric constant ε, and conductivity σ according to equations

(εqXXXIV″, p. 327) D - E = (ε - 1) {E + [uB]}

(εqXXXIII′, p. 223) J = σ(E + [uB])

Lorentz's E, D, H are here denoted by E, M, e, m while J denotes the conduction current.

The three last equations which have been just cited here coincide with eqn (II), (III), (IV), the first equation would be, if J is identified with C, = uρ (the current being zero for σ = 0,

(29) Curl [H - (u, E)] = C + dD/dt - curl [uD],

and thus comes out to be in a different form than (1) here. Therefore for magnetised bodies, Lorentz's equations do not correspond to the Relativity Principle.

On the other hand, the form corresponding to the relativity principle, for the condition of non-magnetisation is to be taken out of (D) in §8, with μ = 1, not as B = H, as Lorentz takes, but as (30) B - [uD] = H - [uD] (M - [uE] = m - [ue]. Now by putting H = B, the differential equation (29) is transformed into the same form as eqn (1) here when m - [ue] = M - [uE]. Therefore it so happens that by a compensation of two contradictions to the relativity principle, the differential equations of Lorentz for moving non-magnetised bodies at last agree with the relativity postulate.

If we make use of (30) for non-magnetic bodies, and put accordingly H = B + [u, (D - E)], then in consequence of (C) in §8,

(ε - 1) (E + [u, B]) = D - E + [u. [u, D - E]],

i.e. for the direction of u,

(ε - 1) (E + [uB]){u} = (D - E){u}

and for a perpendicular direction ū,

Del 14Page 73 / 106

Del 14 continues

(ε - 1) [E + (uB)]{u} = (1 - u²) (D - E){u}

i.e. it coincides with Lorentz's assumption, if we neglect u² in comparison to 1.

Also to the same order of approximation, Lorentz's form for J corresponds to the conditions imposed by the relativity principle [comp. (E) § 8]—that the components of J{u}, J{ū} are equal to the components of σ (E + [u B]) multiplied by √(1 - u²) or 1 / √(1 - u²) respectively.

§10. Fundamental Equations of E. Cohn.

E. Cohn assumes the following fundamental equations.

(31) Curl (M + [u E]) = dE/dt + u div. E + J

- Curl [E - (u. M)] = dM/dt + u div. M.

(32) J = σ E, = ε E - [u M], M = μ (m + [u E.])

where E M are the electric and magnetic field intensities (forces), E, M are the electric and magnetic polarisation (induction). The equations also permit the existence of true magnetism; if we do not take into account this consideration, div. M. is to be put = 0.

An objection to this system of equations, is that according to these, for ε = 1, μ = 1, the vectors force and induction do not coincide. If in the equations, we conceive E and M and not E - (U. M), and M + [U E] as electric and magnetic forces, and with a glance to this we substitute for E, M, E, M, div. E, the symbols e, M, E + [U M], m - [u e], ρ, then the differential equations transform to our equations, and the conditions (32) transform into

J = σ(E + [u M])
e + [u, (m - [u e])] = ε(E + [u M])
M - [u, (E + u M)] = μ(m - [u e])

then in fact the equations of Cohn become the same as those required by the relativity principle, if errors of the order u² are neglected in comparison to 1.

It may be mentioned here that the equations of Hertz become the same as those of Cohn, if the auxiliary conditions are

(33) E = εE, M = μM, J = σE.

§11. Typical Representations of the Fundamental Equations.

Del 14Page 74 / 106

Del 14 continues

In the statement of the fundamental equations, our leading idea had been that they should retain a covariance of form, when subjected to a group of Lorentz-transformations. Now we have to deal with ponderomotive reactions and energy in the electro-magnetic field. Here from the very first there can be no doubt that the settlement of this question is in some way connected with the simplest forms which can be given to the fundamental equations, satisfying the conditions of covariance. In order to arrive at such forms, I shall first of all put the fundamental equations in a typical form which brings out clearly their covariance in case of a Lorentz-transformation. Here I am using a method of calculation, which enables us to deal in a simple manner with the space-time vectors of the 1st, and 2nd kind, and of which the rules, as far as required are given below.

A system of magnitudes a_{h k} formed into the matrix

| a₁₁....................a_{1 q} |
| |
| |
| |
| a_{p 1}...........a_{p q} |

arranged in p horizontal rows, and q vertical columns is called a p × q series-matrix, and will be denoted by the letter A.

If all the quantities a_{h k} are multiplied by C, the resulting matrix will be denoted by CA.

If the roles of the horizontal rows and vertical columns be intercharged, we obtain a q × p series matrix, which will be known as the transposed matrix of A, and will be denoted by Ā.

Ā = | a₁₁...................... a_{p 1} |
| |
| a_{1 q}............ a_{p q} |

If we have a second p × q series matrix B,

B = | b₁₁......................... b₁_{q} | | | | b_{p 1}............. b{p q_} |

then A + B shall denote the p × q series matrix whose members are a_{h k} + b_{h k}.

2⁰ If we have two matrices

A = | a₁₁..................... a_{1 q} |
| |
| a_{p 1}........... a_{p q} |

B = | b_{1 1}.............. b_{1 r} |
| |
| b_{q 1}.......... b_{p r} |

where the number of horizontal rows of B, is equal to the number of vertical columns of A, then by AB, the product of the matrices A and B, will be denoted the matrix

Del 14Page 75 / 106

Del 14 continues

C = | c₁₁...................... c_{1 r} |
| |
| c_{p r}........... c_{p p} |

where c_{h k} = a_{h 1} b₁_{k} + a_{h 2} b_{2 h} +... a_{k s} b_{s k} +... + a_{k q} b_{q h}

these elements being formed by combination of the horizontal rows of A with the vertical columns of B. For such a point, the associative law (AB)S = A(BS) holds, where S is a third matrix which has got as many horizontal rows as B (or AB) has got vertical columns.

For the transposed matrix of C = BA, we have Ċ = ḂĀ

3⁰. We shall have principally to deal with matrices with at most four vertical columns and for horizontal rows.

As a unit matrix (in equations they will be known for the sake of shortness as the matrix 1) will be denoted the following matrix (4 × 4 series) with the elements.

(34) | e₁₁ e₁₂ e₁₃ e₁₄ | = | 1 0 0 0 |
| e₂₁ e₂₂ e₂₃ e₂₄ | | 0 1 0 0 |
| e₃₁ e₃₂ e₃₃ e₃₄ | | 0 0 1 0 |
| e₄₁ e₄₂ e₄₃ e₄₄ | | 0 0 0 1 |

For a 4 × 4 series-matrix, Det A shall denote the determinant formed of the 4 × 4 elements of the matrix. If det A ≠ 0, then corresponding to A there is a reciprocal matrix, which we may denote by A⁻¹ so that A⁻¹A = 1.

A matrix

f = | 0 f₁₂ f₁₃ f₁₄ |
| f₂₁ 0 f₂₃ f₂₄ |
| f₃₁ f₃₂ 0 f₃₄ |
| f₄₁ f₄₂ f₄₃ 0 |

in which the elements fulfil the relation f_{h k} = -f_{h k}, is called an alternating matrix. These relations say that the transposed matrix ḟ = -f. Then by f^{*} will be the dual, alternating matrix

(35)

f^{*} = | 0 f₃₄ f₄₂ f₂₃ |
| f₄₃ 0 f₁₄ f₃₁ |
| f₂₄ f₄₁ 0 f₁₂ |
| f₃₂ f₁₃ f₂₁ 0 |

Then (36) f* f = f₃₄ f₂₂ + f₄₂ f₃₁ + f₃₂ f₂₄

i.e. We shall have a 4 × 4 series matrix in which all the elements except those on the diagonal from left up to right down are zero, and the elements in this diagonal agree with each other, and are each equal to the above mentioned combination in (36).

The determinant of f is therefore the square of the combination, by Det^{½}f we shall denote the expression

Det^{½}f = f₃₂ f₁₄ f₁₃ f₂₄ + f₂₁ f₃₄·

4⁰. A linear transformation

Del 14Page 76 / 106

Del 14 continues

x_{h} = α{h1} x₁′ + α{h2} x₂′ + α{h3} x₃′ + α{h4} x₄′ (h = 1,2,3,

which is accomplished by the matrix

will be denoted as the transformation A.

By the transformation A, the expression

x²₁ + x²₂ + x²₃ + x²₄ is changed into the quadratic for m ∑ α{hk} x__{h}′ x_{k}′,

where α{hk} = α{1k} α{1k} + α{2h} α{2k} + α{3h} α{3k} + α{4h} α{4k_} are the members of a 4 × 4 series matrix which is the product of Ā A, the transposed matrix of A into A. If by the transformation, the expression is changed to

x′₁² + x₂′^2 + x₃′^2 + x′₄²,

we must have Ā A = 1.

A has to correspond to the following relation, if transformation (38) is to be a Lorentz-transformation. For the determinant of A) it follows out of (39) that (Det A)² = 1, or Det A = ± 1.

From the condition (39) we obtain

A⁻¹ = Ā,

i.e. the reciprocal matrix of A is equivalent to the transposed matrix of A.

For A as Lorentz transformation, we have further Det A = +1, the quantities involving the index 4 once in the subscript are purely imaginary, the other co-efficients are real, and a₄₄ > 0.

Del 15Page 77 / 106

Del 15

5⁰. A space time vector of the first kind[21] which s represented by the 1 × 4 series matrix,

(41) s = |s₁ s₂ s₃ s₄|

is to be replaced by sA in case of a Lorentz transformation

A. i.e. s′ = | s₁′ s₂′ s₃′ s₄′| = |s₁ s₂ s₃ s₄| A;

A space-time vector of the 2nd kind[22] with components f₂₃... f₃₄ shall be represented by the alternating matrix

(42) f = | 0 f₁₂ f₁₃ f₁₄ |

|f₂₁ 0 f₂₃ f₂₄ |

|f₃₁ f₃₂ 0 f₃₄ |

|f₄₁ f₄₂ f₄₃ 0 |

and is to be replaced by A⁻¹ f A in case of a Lorentz transformation [see the rules in § 5 (23) (24)]. Therefore referring to the expression (37), we have the identity Det^{½} (Ā f A) = Det A. Det^{½} f. Therefore Det^{½} f becomes an invariant in the case of a Lorentz transformation [see eq. (26) See. § 5].

Looking back to (36), we have for the dual matrix (ĀfA) (A⁻¹fA) = A⁻¹ffA = Det^{½} function. A⁻¹A = Det^{½}f from which it is to be seen that the dual matrix f behaves exactly like the primary matrix f, and is therefore a space time vector of the II kind; f is therefore known as the dual space-time vector of f with components (f₁₄, f₂₄, f₃₄,), (f₂₃}, f₃₁, f₁₂).

6. If w and s are two space-time rectors of the 1st kind then by w ṡ (as well as by s ẇ) will be understood the combination (43) w₁ s₁ + w₂ s₂ + w₃ s₃ + w₄ s₄.

In case of a Lorentz transformation A, since (wA) (Āṡ) = w s, this expression is invariant.—If w ṡ = 0, then w and s are perpendicular to each other.

Two space-time rectors of the first kind (w, s) gives us a 2 × 4 series matrix

| w₁ w₂ w₃ w₄ | | s₁ s₂ s₃ s₄ |

Then it follows immediately that the system of six magnitudes (44)

w₂ s₃ - w₃ s₂,
w₃ s₁ - w₁ s₃,
w₁ s₂ - w₂ s₁,
w₁ s₄ - w₄ s₁,
w₂ s₄ - w₄ s₂,
w₃ s₄ - w₄ s₃,

behaves in case of a Lorentz-transformation as a space-time vector of the II kind. The vector of the second kind with the components (44) are denoted by [w, s]. We see easily that Det^{½} [w, s] = 0. The dual vector of [w, s] shall be written as [w, s].

Del 15Page 78 / 106

Del 15 continues

If ẇ is a space-time vector of the 1st kind, f of the second kind, w f signifies a 1 × 4 series matrix. In case of a Lorentz-transformation A, w is changed into w′ = wA, f into f′ = A⁻¹ f A,—therefore w′ f′ becomes = (wA A⁻¹ f A) = w f A i.e. w f is transformed as a space-time vector of the 1st kind.[23] We can verify, when w is a space-time vector of the 1st kind, f of the 2nd kind, the important identity

(45) [w, w_f] + [w, w__f] = (w] ẇ)f_.

The sum of the two space time vectors of the second kind on the left side is to be understood in the sense of the addition of two alternating matrices.

For example, for ω₁ = 0, ω₂ = 0, ω₃ = 0, ω₄ = i,

ωf = | i_f₄₁, i__f₄₂, i__f₄₃, 0 |; ωf* = | i__f₃₂, i__f₁₃, i__f_₂₁, 0 |

[ω · ωf] = 0, 0, 0, f₄₁, f₄₂, f₄₃; [ω · ωf] = 0, 0, 0, f₃₂, f₁₃, f₂₁.

The fact that in this special case, the relation is satisfied, suffices to establish the theorem (45) generally, for this relation has a covariant character in case of a Lorentz transformation, and is homogeneous in (ω₁, ω₂, ω₃, ω₄).

After these preparatory works let us engage ourselves with the equations (C,) (D,) (E) by means which the constants ε μ, σ will be introduced.

Instead of the space vector u, the velocity of matter, we shall introduce the space-time vector of the first kind ω with the components.

ω₁ = u_{x}/√(1 - u²),
ω₂ = u_{y}/√(1 - u²),
ω₃ = u_{z}/√(1 - u²),
ω₄ = i/√(1 - u²).

(40) where ω₁² + ω₂² + ω₃² + ω₄² = -1 and -iω₄ > 0.

By F and f shall be understood the space time vectors of the second kind M - iE, m - ie.

In Φ = ωF, we have a space time vector of the first kind with components

Φ₁ = ω₂F₁₂ + ω₃F₁₃ + ω₄F₁₄

Φ₂ = ω₁F₂₁ + ω₃F₂₃ + ω₄F₂₄

Φ₃ = ω₁F₃₁ + ω₂F₃₂ + ω₄F₃₄

Φ₄ = ω₁F₄₁ + ω₂F₄₂ + ω₃F₄₃

The first three quantities (φ₁, φ₂, φ₃) are the components of the space-vector (E + [u, M])/√(1 - u²),

and further (φ₄ = i[u E]/√(1 - u²).

Because F is an alternating matrix,

(49) ωΦ = ω₁ φ₁ + ω₂ Φ₂ + ω₃ Φ₃ + ω₄ Φ₄ = 0.

i.e. Φ is perpendicular to the vector ω; we can also write Φ₄ = i[ω{x} Φ₁ + ω{y} Φ₂ + ω_{z} Φ₃].

Del 15Page 79 / 106

Del 15 continues

I shall call the space-time vector Φ of the first kind as the Electric Rest Force.[24]

Relations analogous to those holding between -ωF, E, M, U, hold among -ωf, e, m, u, and in particular -ωf is normal to ω. The relation (C) can be written as

{C} ωf = εωF.

The expression (ωf) gives four components, but the fourth can be derived from the first three.

Let us now form the time-space vector 1st kind, ψ - iωf*, whose components are

ψ₁ = -i(ω₂ f₃₄ + ω₃ f₄₂ + ω₄ f₂₃)
ψ₂ = -i(ω₁ f₄₃ + ω₃ f₄₄ + ω₄ f₃₁)
ψ₃ = -i(ω₁ f₂₄ + ω₂ f₄₁ + ω₄ f₁₂)
ψ₄ = -i(ω₁ f₃₂ + ω₂ f₁₃ + ω₃ f₂₁)

Of these, the first three ψ₁, ψ₂, ψ₃, are the x, y, z components of the space-vector 51) (m - (ue))/√(1 - u²) and further (52) ψ₄ = i(um)/√(1 - u²).

Among these there is the relation

(53) ωψ = ω₁ ψ₁ + ω₂ ψ₂ + ω₃ ψ₃ + ω₄ ψ₄ = 0

which can also be written as ψ₄ = i (u_{x} ψ₁ + u_{y} ψ₂ + u_{z} ψ₃).

The vector ψ is perpendicular to ω; we can call it the Magnetic rest-force.

Relations analogous to these hold among the quantities ωF*, M, E, u and Relation (D) can be replaced by the formula

{ D } -ωF = μψf.

We can use the relations (C) and (D) to calculate F and f from Φ and ψ we have

ωF = -Φ, ωF = -iμψ, ωf = -εΦ, ωf = -iψ.

and applying the relation (45) and (46), we have

F = [ω. Φ] + iμ[ω. ψ] 55) f = ε[ω. Φ] + i[ω. ψ] 56)

i.e.

F₁₂ = (ω₁ Φ₁ - ω₂ Φ₁) + iμ [ω₃ Ψ₄ - ω₄ ψ₃], etc. f₁₂ = ε(ω₁ Φ₂ - ω₂ φ₁) + i [ω₃ ψ₄ - ω₄ ψ₃]., etc.

Let us now consider the space-time vector of the second kind [Φ ψ], with the components

[ Φ₂ ψ₃ - Φ₃ ψ₂, Φ₃ ψ₁ - Φ₁ ψ₃, Φ₁ ψ₂ - Φ₂ ψ₁ ] [ Φ₁ ψ₄ - Φ₄ ψ₁, Φ₂ ψ₄ - Φ₄ ψ₂, Φ₃ ψ₄ - Φ₄ ψ₃ ]

Then the corresponding space-time vector of the first kind ω[Φ, ψ] vanishes identically owing to equations 9) and 53)

for ω[Φ.ψ] = -(ωψ)Φ + (ωΦ)ψ

Let us now take the vector of the 1st kind

(57) Ω = iω[Φψ]*

with the components

Ω₁ = -i | ω₂ ω₃ ω₄ |
| Φ₂ Φ₃ Φ₄ |
| ψ₂ ψ₃ ψ₄ |, etc.

Then by applying rule (45), we have

(58) [Φ.ψ] = i[ωΩ]*

i.e. Φ₁ψ₂ - Φ₂ψ₁ = i(ω₃Ω₄ - ω₄Ω₃) etc.

The vector Ω fulfils the relation

(ωΩ) = ω₁Ω₁ + ω₂Ω₂ + ω₃Ω₃ + ω₄Ω₄ = 0,

Del 15Page 80 / 106

Del 15 continues

(which we can write as Ω₄ = i(ω{x}Ω₁ + ω{y}Ω₂ + ω_{z}Ω₃) and Ω is also normal to ω. In case ω = 0, we have Φ₄ = 0, ψ₄ = 0, Ω₄ = 0, and

[Ω₁, Ω₂, Ω₃ = | Φ₁ Φ₂ Φ₃ | |ψ₁ ψ₂ ψ₃ |.

I shall call Ω, which is a space-time vector 1st kind the Rest-Ray.

As for the relation E), which introduces the conductivity σ we have -ωS = -(ω₁s₁ + ω₂s₂ + ω₃s₃ + ω₄s₄) = (- | u | C{u} + ρ)/√(1 - u²_) = ρ′.

This expression gives us the rest-density of electricity (see §8 and §4).

Then 61) = s + (ωṡ)ω represents a space-time vector of the 1st kind, which since ωω = -1, is normal to ω, and which I may call the rest-current. Let us now conceive of the first three component of this vector as the (x-y-z) co-ordinates of the space-vector, then the component in the direction of u is

C{u} - (| u | ρ′)/√(1 - u²_)
= (c_{u} - | u |ρ)/√(1 - u²)
= J{u}/(1 - u²_)

and the component in a perpendicular direction is C{u} = J{ū}.

This space-vector is connected with the space-vector J = C - ρu, which we denoted in §8 as the conduction-current.

Now by comparing with Φ = -ωF, the relation (E) can be brought into the form

{E} s + (ωṡ)ω = - σωF,

This formula contains four equations, of which the fourth follows from the first three, since this is a space-time vector which is perpendicular to ω.

Lastly, we shall transform the differential equations (A) and (B) into a typical form.

§12. The Differential Operator Lor.

Del 16Page 81 / 106

Del 16

A 4 × 4 series matrix 62) S = | S₁₁ S₁₂ S₁₃ S₁₄ | = | S{kh_} |
| S₂₁ S₂₂ S₂₃ S₂₄ |
| S₃₁ S₃₂ S₃₃ S₃₄ |
| S₄₁ S₄₂ S₄₃ S₄₄ |

with the condition that in case of a Lorentz transformation it is to be replaced by ĀSA, may be called a space-time matrix of the II kind. We have examples of this in:—

1) the alternating matrix f, which corresponds to the space-time vector of the II kind,—

2) the product fF of two such matrices, for by a transformation A, it is replaced by (A⁻¹fA·A⁻¹FA) = A⁻¹fFA,

3) further when (ω₁, ω₂, ω₃, ω₄) and (Ω₁, Ω₂, Ω₃, Ω₄) are two space-time vectors of the 1st kind, the 4 × 4 matrix with the element S{hk} = ω{h}Ω{k_},

lastly in a multiple L of the unit matrix of 4 × 4 series in which all the elements in the principal diagonal are equal to L, and the rest are zero.

We shall have to do constantly with functions of the space-time point (x, y, z, it), and we may with advantage

employ the 1 × 4 series matrix, formed of differential symbols,—

| ∂/∂x, ∂/∂y, ∂/∂z, ∂/i∂t,| or (63) | ∂/∂x₁ ∂/∂x₂ ∂/∂x₃ ∂/∂x₄ |

For this matrix I shall use the shortened from "lor."[25]

Then if S is, as in (62), a space-time matrix of the II kind, by lor S′ will be understood the 1 × 4 series matrix

| K₁ K₂ K₃ K₄ |

where K{k} = ∂S{1k}/∂x₁ + ∂S{2k}/∂x₂ + ∂S{3k}/∂x₃ + ∂S{4h}/∂x₄_.

When by a Lorentz transformation A, a new reference system (x′₁ x′₂ x′₃ x₄) is introduced, we can use the operator

lor′ = | ∂/∂x₁′ ∂/∂x₂′ ∂/∂x₃′ ∂/∂x₄′ |

Then S is transformed to S′= Ā S A = | S′{hk_} |, so by lor 'S′ is meant the 1 × 4 series matrix, whose element are

K'{k} = ∂S′{1k}/∂x₁′ + ∂S′{2k}/∂x₂′ + ∂S′{3k}/∂x₃′ + ∂S′{4k}/∂x₄′_.

Now for the differentiation of any function of (x y z t) we have the rule ∂/∂x_{k}′ = ∂/∂x₁ ∂x₁/∂x_{k}′ + ∂/∂x₂ ∂x₂/∂x_{k}′ + ∂/∂x₃ ∂x₃/∂x_{k}′ + ∂/∂x₄ ∂x₄/∂x_{k}′ = ∂/∂x₁ a_{1k} + ∂/∂x₂ a_{2k} + ∂/∂x₃ a_{3k} + ∂/∂x₄ a_{4k}.

so that, we have symbolically lor′ = lor A.

Therefore it follows that

lor ′S′ = lor (A A⁻¹ SA) = (lor S)A.

i.e., lor S behaves like a space-time vector of the first kind.

Del 16Page 82 / 106

Del 16 continues

If L is a multiple of the unit matrix, then by lor L will be denoted the matrix with the elements

| ∂L/∂x₁ ∂L/∂x₂ ∂L/∂x₃ ∂L/∂x₄ |

If s is a space-time vector of the 1st kind, then

lor ṡ = ∂s₁/∂x₁ + ∂s₂/∂x₂ + ∂s₃/∂x₃ + ∂s₄/∂x₄.

In case of a Lorentz transformation A, we have

lor ′ṡ′ = lor A. Ās = lor s.

i.e., lor s is an invariant in a Lorentz-transformation.

In all these operations the operator lor plays the part of a space-time vector of the first kind.

If f represents a space-time vector of the second kind,—lor f denotes a space-time vector of the first kind with the components

∂f₁₂/∂x₂ + ∂f₁₃/∂x₃ + ∂f₁₄/∂x₄,
∂f₂₁/∂x₁ + ∂f₂₃/∂x₃ + ∂f₂₄/∂x₄,
∂f₃₁/∂x₁ + ∂f₃₂/∂x₂ + ∂f₃₄/∂x₄,
∂f₄₁/∂x₁ + ∂f₄₂/∂x₂ + ∂f₄₃/∂x₃

So the system of differential equations (A) can be expressed in the concise form

{A} lor f = -s,

and the system (B) can be expressed in the form

{B} log F* = 0.

Referring back to the definition (67) for log ṡ, we find that the combinations lor ([=(lor f)=]), and lor ([=(lor F)]) vanish identically, when f and F are alternating matrices. Accordingly it follows out of {A}, that

(68) (∂s₁/∂x₁) + (∂s₂/∂x₂) + (∂s₃/∂x₃) + (∂s₄/∂x₄) = 0,

while the relation

(69) lor (lor F*) = 0,

signifies that of the four equations in {B}, only three represent independent conditions.

I shall now collect the results.

Let ω denote the space-time vector of the first kind

(u/√(1 - u²}), i/√(1 - u²))

(u = velocity of matter),

F the space-time vector of the second kind (M,-iE)

(M = magnetic induction, E = Electric force,

f the space-time vector of the second kind (m,-ie)

(m = magnetic force, e = Electric Induction.

s the space-time vector of the first kind (C, iρ)

(ρ = electrical space-density, C - ρu = conductivity current,

ε = dielectric constant, μ = magnetic permeability,

σ = conductivity,

then the fundamental equations for electromagnetic processes in moving bodies are[26]

{A} lor f = -s

{B} log F* = 0

{C} ωf = εωF

{D} ωF = μωf

{E} s + (ωṡ), w = - σωF.

Del 16Page 83 / 106

Del 16 continues

ω ῶ = -1, and ωF, ωf, ωF, ωf, s + (ωs)ω which are space-time vectors of the first kind are all normal to ω, and for the system {B}, we have

lor (lor F*) = 0.

Bearing in mind this last relation, we see that we have as many independent equations at our disposal as are necessary for determining the motion of matter as well as the vector u as a function of x, y, z, t, when proper fundamental data are given.

§ 13. The Product of the Field-vectors f F.

Del 17Page 84 / 106

Del 17

Finally let us enquire about the laws which lead to the determination of the vector ω as a function of (x, y, z, t.) In these investigations, the expressions which are obtained by the multiplication of two alternating matrices

f = | 0 f₁₂ f₁₃ f₁₄ |
| f₂₁ 0 f₂₃ f₂₄ |
| f₃₁ f₃₂ 0 f₃₄ |
| f₄₁ f₄₂ f₄₃ 0 |

F = | 0 F₁₂ F₁₃ F₁₄ |
| F₂₁ 0 F₂₃ F₂₄ |
| F₃₁ F₃₂ 0 F₃₄ |
| F₄₁ F₄₂ F₄₃ 0 |

are of much importance. Let us write,

(70) fF =| S₁₁ - L S₁₂ S₁₃ S₁₄ |

| S₂₁ S₂₂ - L S₂₃ S₂₄ |

| S₃₁ S₃₂ S₃₃ - L S₃₄ |

| S₄₁ S₄₂ S₄₃ S₄₄ - L |

Then (71) S₁₁ + S₂₂ + S₃₃ + S₄₄ = 0.

Let L now denote the symmetrical combination of the indices 1, 2, 3, 4, given by

(72) L = ½(f₂₃ F₂₃ + f₃₁F₃₁ + f₁₂ + F₁₂ + f₁₄ F₁₄ + f₂₄ F₂₄ + f₃₄ F₃₄)

Then we shall have

(73) S₁₁ = ½(f₂₃ F₂₃ + f₃₄ F₃₄ + f₄₂ F₄₂ - f₁₂ F₁₂ - f₁₃ F₁₃ f₁₄ F₁₄)

S₁₂ = f₁₃ F₃₂ + f₁₄ F₄₂ etc....

In order to express in a real form, we write

(74) S = | S₁₁ S₁₂ S₁₃ S₁₄ |

| S₂₁ S₂₂ S₂₃ S₂₄ |

| S₃₁ S₃₂ S₃₃ S₃₄ |

| S₄₁ S₄₂ S₄₃ S₄₄ |

= | X{x} Y{x} Z{x} -iT{x} |

| X{y} Y{y} Z{y} -iT{y} |

| X{z} Y{z} Z{z} -iT{z} |

| -iX{t} -iY{t} -iZ{t} T{t} |

Now X{x} = ½[m__{x}M{x} - m__{y}M{y} - m__{z}M{z} + e__{x}E{x} - e__{y}E{y} - e__{z}E{z_}]

(75) X{y} = m__{x}M{y} + e__{y}E{x}, Y{x} = m_{y}M{x} + e__{x}E{y_} etc.

X{t} = e__{y}M{z} - e__{z}M{y}, T{x} = m_{x}E{y} - m__{y}E{z_}, etc.

T{t} = ½[m__{x}M{x} + m__{y}M{y} + m__{z}M{z} + e__{x}E{x} + e__{y}E{y} + e__{z}E{z_}]

L{t} = ½[m__{x}M{x} + m__{y}M{y} + m__{z}M{z} - e__{x}E{x} - e__{y}E{y} - e__{z}E{z_}]

These quantities[27] are all real. In the theory for bodies at rest, the combinations (X{x}, X{y}, X{z}, Y{z}, Y{y}, Y{z}, Z{x}, Z{y}, Z{z}) are known as "Maxwell's Stresses," T{x}, T{y}, T{z} are known as the Poynting's Vector, T{t_} as the electromagnetic energy-density, and L as the Langrangian function.

On the other hand, by multiplying the alternating matrices of f and F, we obtain

Del 17Page 85 / 106

Del 17 continues

(77) Ff =| -S₁₁ - L, -S₁₂, -S₁₃. -S₁₄ |

| -S₂₁, -S₂₂ - L, -S₂₃, -S₂₄ |

| -S₃₁ -S₃₂, -S₃₃ - L, -S₃₄ |

| -S₄₁ -S₄₂ -S₄₃ -S₄₄ - L |

and hence, we can put

(78) fF = S - L, Ff = -S - L,

where by L, we mean L-times the unit matrix, i.e. the matrix with elements

| Le_{hk} |, (e_{hh} = 1, e_{hk} = 0, h ≠ k h, k = 1, 2, 3, 4).

Since here SL = LS, we deduce that,

FffF = (-S - L)(S - L) = -SS + L²,

and find, since ff = Det^{½}f, FF = Det^{½}F, we arrive at the interesting

conclusion

(79) SS = L² - Det^{½}f Det^{½}F

i.e. the product of the matrix S into itself can be expressed as the multiple of a unit matrix—a matrix in which all the elements except those in the principal diagonal are zero, the elements in the principal diagonal are all equal and have the value given on the right-hand side of (79). Therefore the general relations

(80) S{h1} S{1k} + S{h2} S{2k} + S{h3} S{3k} + S{h4} S{4k} = 0,

h, k being unequal indices in the series 1, 2, 3, 4, and

(81) S{h1} S{1h} + S{h2} S{2h} + S{h3} S{3h} + S{h4} S{4h} = L² - Det^{½}f_ Det^{½}F,

for h = 1, 2, 3, 4.

Now if instead of F, and f in the combinations (72) and (73), we introduce the electrical rest-force Φ, the magnetic rest-force ψ, and the rest-ray Ω [(55), (56) and (57)], we can pass over to the expressions,—

(82) L = - ½ ε Φ [=Φ] + ½ μ ψ [=ψ],

(83) S{hk} = - ½ ε Φ [=Φ] e__{hk} - ½ μ ψ [=ψ] e_{hk} + ε (Φ{h} Φ{k} - Φ ([=Φ]) ω{h} Ω{k} + μ (ψ{h} ψ{k} - Ψ [=ψ] Ω{h} ω{k}) - ω{h} ω{k} - εμ ω{h} Ω{k} (h₁ k_ = 1, 2, 3, 4).

Here we have

Φ [=Φ] = Φ₁² + Φ₂² + Φ₃² + Φ₄², ψ[=ψ] = ψ₁² + ψ₂² + ψ₃² + ψ₄²

e_{hh} = 1, e_{hk} = 0 (h ≠ k).

Del 17Page 86 / 106

Del 17 continues

The right side of (82) as well as L is an invariant in a Lorentz transformation, and the 4 × 4 element on the right side of (83) as well as S{k h} represent a space time vector of the second kind. Remembering this fact, it suffices, for establishing the theorems (82) and (83) generally, to prove it for the special case ω₁ = 0, ω₂ = 0, ω₃ = 0, ω₄ = i. But for this case ω = 0, we immediately arrive at the equations (82) and (83) by means (45), (51), (60) on the one hand, and e = εE, M = μm_ on the other hand.

The expression on the right-hand side of (81), which equals

[½ (m M - eE)²] + (em) (EM),

is >= 0, because (em = ε Φ [=ψ], (EM) = μ Φ [=ψ]; now referring back to 79), we can denote the positive square root of this expression as Det^{1/4} S.

Since ḟ = -f, and Ḟ = -F, we obtain for Ṡ, the transposed matrix of S, the following relations from (78),

(84) Ff = Ṡ - L, f F = -Ṡ - L,

Then is

Ṡ - S = | S{h k} - S{t k} |

an alternating matrix, and denotes a space-time vector of the second kind. From the expressions (83), we obtain,

(85) S - Ṡ = - (εμ - 1) [ω, Ω],

from which we deduce that [see (57), (58)].

(86) ω (S - Ṡ)* = 0,

(87) ω (S - Ṡ) = (εμ - 1) Ω

When the matter is at rest at a space-time point, ω = 0, then the equation 86) denotes the existence of the following equations

Z{y} = Y{z}, X{z} = Z{x}, Y{x} = X{y},

and from 83),

T{x} = Ω₁, T{y} = Ω₂, T{z_} = Ω₃

X{t} = εμΩ₁, Y{t} = εμΩ₂, Z{t_} = εμΩ₃

Now by means of a rotation of the space co-ordinate system round the null-point, we can make,

Z{y} = Y{z} = 0, X{z} = Z{x} = 0, X{x} = X{y} = 0,

According to 71), we have

(88) X{x} + Y{y} + Z{z} + T{t} = 0,

and according to 83), T{t_} > 0. In special cases, where ω vanishes it follows from 81) that

X{x}² = Y{y}² = Z{z}² = T{t}², = (Det^{1/4} S)²,

and if T, and one of the three magnitudes X{x}, Y{y}, Z{z_} are = ±Det^{1/4} S, the two others = -Det^{1/4} S. If Ω does not vanish let Ω ≠ 0, then we have in particular from 80)

T{z} X{t} = 0, T{z} Y{t} = 0, Z{z} T{z} + T{z} T{t} = 0,

Del 17Page 87 / 106

Del 17 continues

and if Ω₁ = 0, Ω₂ = 0, Z{z} = -T{t} It follows from (81), (see also 83) that

X{x} = -Y{y} = ±Det^{1/4} S,

and -Z{z} = T{t} = √(Det^{½} S + εμΩ₃²) > Det^{1/4}S.

The space-time vector of the first kind

(89) K = lor S,

is of very great importance for which we now want to demonstrate a very important transformation

According to 78), S = L + fF, and it follows that

lor S = lor L + lor fF.

The symbol 'lor' denotes a differential process which in lor fF, operates on the one hand on the components of f, on the other hand also on the components of F. Accordingly lor fF can be expressed as the sum of two parts. The first part is the product of the matrices (lor f) F, lor f being regarded as a 1 × 4 series matrix. The second part is that part of lor fF, in which the diffentiations operate on the components of F alone. From 78) we obtain

fF = -Ff - 2L;

hence the second part of lor fF = -(lor F)f + the part of -2 lor L, in which the differentiations operate on the components of F alone. We thus obtain

lor S = (lor f)F - (lor F)f + N,

where N is the vector with the components

N{h} = ½(∂f₂₃/∂x__{h} F₂₃ + ∂f₃₁/∂x_{h} F₃₁ + ∂f₁₂/∂x_{h} F₁₂ + ∂f₁₄/∂x_{h} F₁₄ + ∂f₂₄/∂x_{h} F₂₄ + ∂f₃₄/∂x_{h} F₃₄ - ∂F₂₃/∂x_{h} f₂₃ - ∂F₃₁/∂x_{h} f₃₁ - ∂F₁₂/∂x_{h} f₁₂ - ∂F₁₄/∂x_{h} f₁₄ - ∂F₂₄/∂x_{h} f₂₄ - ∂F₃₄/∂x_{h} f₃₄),

(h = 1, 2, 3, 4)

By using the fundamental relations A) and B), 90) is transformed into the fundamental relation

(91) lor S = -sF + N.

In the limitting case ε = 1, μ = 1, f = F, N vanishes identically.

Now on the basis of the equations (55) and (56), and referring back to the expression (82) for L, and from 57) we obtain the following expressions as components of N,—

(92) N{h} = - ½ Φ[=Φ]∂ε/∂x__{h} - ½ ψ[=ψ]∂μ/∂x_{h} + (εμ - 1)(Ω₁ ∂ω₁/∂x_{h} + Ω₂ ∂ω₂/∂x_{h} + Ω₃ ∂ω₃/∂x_{h} + Ω₄ ∂ω₄/∂x_{h})

for h = 1, 2, 3, 4.

Now if we make use of (59), and denote the space-vector which has Ω₁, Ω₂, Ω₃ as the x, y, z components by the symbol W, then the third component of 92) can be expressed in the form

(93) (εμ - 1)/√(1 - u²) (W ∂u/∂x_{h}),

Del 17Page 88 / 106

Del 17 continues

The round bracket denoting the scalar product of the vectors within it.

§ 14. The Ponderomotive Force.[28]

Let us now write out the relation K = lor S = -sF + N in a more practical form; we have the four equations

(94) K₁ = ∂X{x}/∂x + ∂X{y}/∂y + ∂X{y}/∂z - ∂X{t}/∂t = ρE{x} + s__{y}M{z} - s__{z}M{x_}

- ½ Φ[=Φ] ∂ε/∂x - ½ ψ[=ψ]∂μ/∂x + (εμ - 1)/√(1 - u²) (W∂u/∂x),

(95) K₂ = ∂Y{x}/∂x + ∂Y{y}/∂y + ∂Y{z}/∂z - ∂Y{t}/∂t = ρE{y} + s__{z}M{x} - s__{x}M{y_}

- ½ Φ[=Φ]∂ε/∂y - ½ ψ[=ψ]∂μ/∂y + (εμ - 1)/√(1 - u²) (W∂u/∂y),

(96) K₃ = ∂Z{x}/∂x + ∂Z{y}/∂y + ∂Z{z}/∂z - ∂Z{t}/∂t = ρE₂ + s_{x}M{y} - s__{y}M₄

- ½ Φ[=Φ] ∂ε/∂z - ½ ψ[=ψ] ∂μ/∂z + (εμ - 1)/√(1 - u²) (W∂u/∂z),

(97) (1/i)K₄ = ∂T{y}/∂x - ∂T{y}/∂y - ∂T{z}/∂z - ∂T{t}/∂t = s_{x}E{x} + s__{y}E{y} + s__{z}E{z_}

- ½ Φ[=Φ]∂ε/∂t - ½ ψ[=ψ]∂μ/∂t + (εμ - 1)/√(1 - u²) (W∂u/∂t).

It is my opinion that when we calculate the ponderomotive force which acts on a unit volume at the space-time point x, y, z, t, it has got, x, y, z components as the first three components of the space-time vector

K + (ωK)ω,

This vector is perpendicular to ω; the law of Energy finds its expression in the fourth relation.

The establishment of this opinion is reserved for a separate tract.

In the limiting case ε = 1, μ = 1, σ = 0, the vector N = 0, S = ρω, ωK = 0, and we obtain the ordinary equations in the theory of electrons.

Footnote 9:

Vide Note 1.

Footnote 10:

Note 2.

Footnote 11:

Vide Note 3.

Footnote 12:

Vide Note 4.

Footnote 13:

Note 5.

Footnote 14:

See notes on § 8 and 10.

Footnote 15:

See note 9.

Footnote 16:

See Note.

Footnote 17:

Vide Note.

Footnote 18:

Just as beings which are confined within a narrow region surrounding a point on a spherical surface, may fall into the error that a sphere is a geometric figure in which one diameter is particularly distinguished from the rest.

Footnote 19:

Einzelne stelle der Materie.

Footnote 20:

Vide Note.

Footnote 21:

Vide note 13.

Footnote 22:

Vide note 14.

Footnote 23:

Vide note 15.

Footnote 24:

Vide note 16.

Footnote 25:

Vide note 17.

Footnote 26:

Vide note 19.

Del 17Page 89 / 106

Del 17 continues

Footnote 27:

Vide note 18.

Footnote 28:

Vide note 40.

APPENDIX Mechanics and the Relativity-Postulate.

Del 18Page 90 / 106

Del 18

It would be very unsatisfactory, if the new way of looking at the time-concept, which permits a Lorentz transformation, were to be confined to a single part of Physics.

Now many authors say that classical mechanics stand in opposition to the relativity postulate, which is taken to be the basis of the new Electro-dynamics.

In order to decide this let us fix our attention on a special Lorentz transformation represented by (10), (11), (12), with a vector v in any direction and of any magnitude q < 1 but different from zero. For a moment we shall not suppose any special relation to hold between the unit of length and the unit of time, so that instead of t, t′, q, we shall write ct, ct′, and q/c, where c represents a certain positive constant, and q is < c. The above mentioned equations are transformed into

r′_{ṽ} = r_{ṽ}, r′_{v} = c(r_{v} - qt)/√(c² - q²), t′ = (qr_{v} + c²_t)/c√(c² - q²_)

They denote, as we remember, that r is the space-vector (x, y, z), r′ is the space-vector (x′ y′ z′)

If in these equations, keeping v constant we approach the limit c = ∞, then we obtain from these

r′_{ṽ} = r_{ṽ},
r′_{v} = r_{v} - qt,
t′ = t.

The new equations would now denote the transformation of a spatial co-ordinate system (x, y, z) to another spatial co-ordinate system (x′ y′ z′) with parallel axes, the null point of the second system moving with constant velocity in a straight line, while the time parameter remains unchanged. We can, therefore, say that classical mechanics postulates a covariance of Physical laws for the group of homogeneous linear transformations of the expression

-x² - y² - z² + c² (1)

when c = ∞.

Now it is rather confusing to find that in one branch of Physics, we shall find a covariance of the laws for the transformation of expression (1) with a finite value of c, in another part for c = ∞.

It is evident that according to Newtonian Mechanics, this covariance holds for c = ∞ and not for c = velocity of light.

Del 18Page 91 / 106

Del 18 continues

May we not then regard those traditional covariances for c = ∞ only as an approximation consistent with experience, the actual covariance of natural laws holding for a certain finite value of c.

I may here point out that by if instead of the Newtonian Relativity-Postulate with c = ∞, we assume a relativity-postulate with a finite c, then the axiomatic construction of Mechanics appears to gain considerably in perfection.

The ratio of the time unit to the length unit is chosen in a manner so as to make the velocity of light equivalent to unity.

While now I want to introduce geometrical figures in the manifold of the variables (x, y, z, t), it may be convenient to leave (y, z) out of account, and to treat x and t as any possible pair of co-ordinates in a plane, referred to oblique axes.

A space time null point 0 (x, y, z, t = 0, 0, 0, 0) will be kept fixed in a Lorentz transformation.

The figure -x² - y² - z² + t² = 1, t > 0... (2)

which represents a hyper boloidal shell, contains the space-time points A (x, y, z, t = 0, 0, 0, 1), and all points A′ which after a Lorentz-transformation enter into the newly introduced system of reference as (x′, y′, z′, t′ = 0, 0, 0, 1).

The direction of a radius vector 0A′ drawn from 0 to the point A′ of (2), and the directions of the tangents to (2) at A′ are to be called normal to each other.

Let us now follow a definite position of matter in its course through all time t. The totality of the space-time points (x, y, z, t) which correspond to the positions at different times t, shall be called a space-time line.

The task of determining the motion of matter is comprised in the following problem:—It is required to establish for every space-time point the direction of the space-time line passing through it.

Del 18Page 92 / 106

Del 18 continues

To transform a space-time point P (x, y, z, t) to rest is equivalent to introducing, by means of a Lorentz transformation, a new system of reference (x′, y′, z′, t′), in which the t′ axis has the direction 0A′, 0A′ indicating the direction of the space-time line passing through P. The space t′ = const, which is to be laid through P, is the one which is perpendicular to the space-time line through P.

To the increment dt of the time of P corresponds the increment

dτ = √(dt² - dx² - dy²) - dz² = dt√(1 - u²)

of the newly introduced time parameter t′. The value of the integral

∫ dτ = ∫ √(-(dx₁² + dx₂² + dx₃² + dx₄²))

when calculated on the space-time line from a fixed initial point P₀ to the variable point P, (both being on the space-time line), is known as the 'Proper-time' of the position of matter we are concerned with at the space-time point P. (It is a generalization of the idea of Positional-time which was introduced by Lorentz for uniform motion.)

If we take a body R₀ which has got extension in space at time t₀, then the region comprising all the space-time line passing through R₀ and t₀ shall be called a space-time filament.

If we have an analytical expression θ(x y, z, t) so that θ(x, y z t) = 0 is intersected by every space time line of the filament at one point,—whereby

-(∂Θ/∂x)², -(∂Θ/∂y)², -(∂Θ/∂z)², -(∂Θ/∂t)² > 0, ∂Θ/∂t > 0.

then the totality of the intersecting points will be called a cross section of the filament.

At any point P of such across-section, we can introduce by means of a Lorentz transformation a system of reference (x′, y, z′ t), so that according to this

∂Θ/∂x′ = 0, ∂Θ/∂y′ = 0, ∂Θ/∂z′ = 0, ∂Θ/∂t′ > 0.

Del 18Page 93 / 106

Del 18 continues

The direction of the uniquely determined t′—axis in question here is known as the upper normal of the cross-section at the point P and the value of dJ = ∫∫∫ dx′ dy′ dz′ for the surrounding points of P on the cross-section is known as the elementary contents (Inhalts-element) of the cross-section. In this sense R₀ is to be regarded as the cross-section normal to the t axis of the filament at the point t = t₀, and the volume of the body R₀ is to be regarded as the contents of the cross-section.

If we allow R₀ to converge to a point, we come to the conception of an infinitely thin space-time filament. In such a case, a space-time line will be thought of as a principal line and by the term 'Proper-time' of the filament will be understood the 'Proper-time' which is laid along this principal line; under the term normal cross-section of the filament, we shall understand the cross-section on the space which is normal to the principal line through P.

We shall now formulate the principle of conservation of mass.

To every space R at a time t, belongs a positive quantity—the mass at R at the time t. If R converges to a point (x, y, z, t), then the quotient of this mass, and the volume of R approaches a limit μ(x, y, z, t), which is known as the mass-density at the space-time point (x, y, z, t).

The principle of conservation of mass says—that for an infinitely thin space-time filament, the product μdJ, where μ = mass-density at the point (x, y, z, t) of the filament (i.e., the principal line of the filament), dJ = contents of the cross-section normal to the t axis, and passing through (x, y, z, t), is constant along the whole filament.

Now the contents dJ{n} of the normal cross-section of the filament which is laid through (x, y, z, t_) is

(4) dJ{n} = (1/√(1 - u²))dJ = -iω₄ dJ = (dt/dτ)d_J.

and the function

ν = μ/-iω₄ = μ√(1 - u²)) = μ(∂τ/∂t. (5)

may be defined as the rest-mass density at the position (x y z t). Then the principle of conservation of mass can be formulated in this manner:—

Del 18Page 94 / 106

Del 18 continues

For an infinitely thin space-time filament, the product of the rest-mass density and the contents of the normal cross-section is constant along the whole filament.

In any space-time filament, let us consider two cross-sections Q° and Q′, which have only the points on the boundary common to each other; let the space-time lines inside the filament have a larger value of t on Q′ than on Q°. The finite range enclosed between Q° and Q′ shall be called a space-time sichel,[29] Q′ is the lower boundary, and Q′ is the upper boundary of the sichel.

If we decompose a filament into elementary space-time filaments, then to an entrance-point of an elementary filament through the lower boundary of the sichel, there corresponds an exit point of the same by the upper boundary, whereby for both, the product νdJ{n} taken in the sense of (4) and (5), has got the same value. Therefore the difference of the two integrals ∫νdJ__{n} (the first being extended over the upper, the second on the lower boundary) vanishes. According to a well-known theorem of Integral Calculus the difference is equivalent to

∫∫∫∫ lor ν[=ω] dx dy dz dt,

the integration being extended over the whole range of the sichel, and (comp. (67), § 12)

lor ν[=ω] = (∂νω₁/∂x₁) + (∂νω₂/∂x₂) + (∂νω₃/∂x₃) + (∂νω₄/∂x₄).

If the sichel reduces to a point, then the differential equation

lor ν[=ω] = 0, (6)

which is the condition of continuity

(∂μu_{x}/∂x) + (∂μu_{y}/∂y) + (∂μu_{z}/∂z) + (∂μ/∂t) = 0.

Further let us form the integral

N = ∫ ∫∫∫ ν dx dy dz dt (7)

extending over the whole range of the space-time sichel. We shall decompose the sichel into elementary space-time filaments, and every one of these filaments in small elements dτ of its proper-time, which are however large compared to the linear dimensions of the normal cross-section; let us assume that the mass of such a filament νdJ{n} = dm and write τ⁰, τ^l for the 'Proper-time' of the upper and lower boundary of the sichel_.

Then the integral (7) can be denoted by

∫∫ νdJ{n} dτ = ∫ (τ′-τ⁰) dm_.

taken over all the elements of the sichel.

Del 18Page 95 / 106

Del 18 continues

Now let us conceive of the space-time lines inside a space-time sichel as material curves composed of material points, and let us suppose that they are subjected to a continual change of length inside the sichel in the following manner. The entire curves are to be varied in any possible manner inside the sichel, while the end points on the lower and upper boundaries remain fixed, and the individual substantial points on it are displaced in such a manner that they always move forward normal to the curves. The whole process may be analytically represented by means of a parameter λ, and to the value λ = 0, shall correspond the actual curves inside the sichel. Such a process may be called a virtual displacement in the sichel.

Let the point (x, y, z, t) in the sichel λ = 0 have the values x + δx, y + δy, z + δz, t + δt, when the parameter has the value λ; these magnitudes are then functions of (x, y, z, t, λ). Let us now conceive of an infinitely thin space-time filament at the point (x y z t) with the normal section of contents dJ{n} and if dJ{n} + δdJ{n} be the contents of the normal section at the corresponding position of the varied filament, then according to the principle of conservation of mass—(ν + d_ν being the rest-mass-density at the varied position),

(8) (ν + δν) (dJ{n} + δdJ{n}) = νdJ{n} = dm_.

In consequence of this condition, the integral (7) taken over the whole range of the sichel, varies on account of the displacement as a definite function N + δN of λ, and we may call this function N + δN as the mass action of the virtual displacement.

If we now introduce the method of writing with indices, we shall have

(9) d(x_{h} + δx_{h}) = dx_{h} + ∑{k} ∂δx__{h}/∂x_{k} + ∂δx_{h}/∂λ dλ

k = 1, 2, 3, 4 h = 1, 2, 3, 4

Now on the basis of the remarks already made, it is clear that the value of N + δN, when the value of the parameter is λ, will be:—

(10) N + δN = ∫∫∫∫ ((νd(τ + δτ))/dτ)dx dy dz dt,

the integration extending over the whole sichel d(τ + δτ) where d(τ + δτ) denotes the magnitude, which is deduced from

Del 18Page 96 / 106

Del 18 continues

√(-(dx₁ + dδx₁)² - (dx₂ + dδx₂)² - (dx₃ + dδx₃)² - (dx₄ + dδx₄)²)

by means of (9) and

dx₁ = ω₁ dτ, dx₂ = ω₂ dτ, dx₃ = ω₃ dτ, dx₄ = ω₄ dτ, dλ = 0

therefore:—

(11) (d(τ + δτ))/dτ = √( -∑(ω{h} + ∑(∂δx__{h}/∂x_{k})ω{k_})²)

k = 1, 2, 3, 4. h = 1, 2, 3, 4.

We shall now subject the value of the differential quotient

(12) ((d(N + δN))/dλ) (λ = 0)

to a transformation. Since each δx_{h} as a function of (x, y, z, t) vanishes for the zero-value of the parameter λ, so in general dδx_{k}/(∂x_{h} = 0, for λ = 0.

Let us now put (∂δx_{h}/∂λ) = ξ{h} (h_ = 1, 2, 3, 4) (13)

λ = 0

then on the basis of (10) and (11), we have the expression (12):—

= -∫∫∫∫ ∑ ω{h}((∂ξ{h}/∂x₁)ω₁ + (∂ξ{h}/∂x₂)ω₂ +(∂ξ{h}/∂x₃)ω₃ + (∂ξ{h}/∂x₄)ω₄) dx dy dz dt_

for the system (x₁ x₂ x₃ x₄) on the boundary of the sichel, (δx₁ δx₂ δx₃ δx₄) shall vanish for every value of λ and therefore ξ₁, ξ₂, ξ₃, ξ₄ are nil. Then by partial integration, the integral is transformed into the form

∫∫∫∫ ∑ ξ{h}(∂νω{h}ω₁/∂x₁ + ∂νω{h}ω₂/∂x₂ + ∂νω{h}ω₃/∂x₃ + ∂νω{h}ω₄/∂x₄) dx dy dz dt_

the expression within the bracket may be written as

= ω{h} ∑ ∂νω{k}/∂x_{k} + ν∑ω{k}∂ω{h}/∂x_{k}.

The first sum vanishes in consequence of the continuity equation (b). The second may be written as

(∂ω{h}/∂x₁)(dx₁/dτ) + (∂ω{h}/∂x₂)(dx₂/dτ) + (∂ω{h}/∂x₃)(dx₃/dτ) + (∂ω{h}/∂x₄)(dx₄/dτ)

= dω{h}/dτ = (d/dτ)(dx__{h}/dτ)

whereby (d/dτ) is meant the differential quotient in the direction of the space-time line at any position. For the differential quotient (12), we obtain the final expression

(14) ∫∫∫∫ ν((∂ω₁/∂τ)ξ₁ + (∂ω₂/∂τ)ξ₂ + (∂ω₃/∂τ)ξ₃ + (∂ω₄/∂τ)ξ₄)

dx dy dz dt.

For a virtual displacement in the sichel we have postulated the condition that the points supposed to be substantial shall advance normally to the curves giving their actual motion, which is λ = 0; this condition denotes that the ξ{h_} is to satisfy the condition

w₁ξ₁ + w₂ξ₂ + w₃ξ₃ + w₄ξ₄ = 0. (15)

Del 18Page 97 / 106

Del 18 continues

Let us now turn our attention to the Maxwellian tensions in the electrodynamics of stationary bodies, and let us consider the results in § 12 and 13; then we find that Hamilton's Principle can be reconciled to the relativity postulate for continuously extended elastic media.

At every space-time point (as in § 13), let a space time matrix of the 2nd kind be known

(16) S = | S₁₁ S₁₂ S₁₃ S₁₄ | = | X{x} Y{x} Z{x} -iT{x} |

| S₂₁ S₂₂ S₂₃ S₂₄ | = | X{y} Y{y} Z{y} -iT{y} |

| S₃₁ S₃₂ S₃₃ S₃₄ | = | X{z} Y{z} Z{z} -iT{z} |

| S₄₁ S₄₂ S₄₃ S₄₄ | = | -iX{t} -iY{t} -iZ{t} T{t} |

where X{n} Y{x}.....X{z}, T{t} are real magnitudes.

For a virtual displacement in a space-time sichel (with the previously applied designation) the value of the integral

(17) W + δW = ∫∫∫∫ (∑S{h k} (∂(x__{k} + δx_{k}))/∂x_{h} dx dy dz dt

extended over the whole range of the sichel, may be called the tensional work of the virtual displacement.

The sum which comes forth here, written in real magnitudes, is

X{x} + Y{y} + Z{z} + T{t} + X{x} (∂δx)/∂x + X{y} (∂δx)/∂y +... Z{z} (∂δz)/∂z_

- X{t} (∂δx/∂t -... + T{x} (∂δt)/∂x +... T{t} (∂δt)/∂t_

we can now postulate the following minimum principle in mechanics.

If any space-time Sichel be bounded, then for each virtual displacement in the Sichel, the sum of the mass-works, and tension works shall always be an extremum for that process of the space-time line in the Sichel which actually occurs.

The meaning is, that for each virtual displacement,

([d(·δN + δW)]/dλ)_{λ = 0} = 0 (18)

By applying the methods of the Calculus of Variations, the following four differential equations at once follow from this minimal principle by means of the transformation (14), and the condition (15).

(19) ν ∂w_{h}/∂τ = K{h} + χw__{h} (h = 1, 2, 3, 4)

whence K{h} = ∂S{1 h}/∂x₁ + ∂S{2 h}/∂x₂ + ∂S{3 h}/∂x₃ + ∂S{4 h}/∂x₄_, (20)

Del 18Page 98 / 106

Del 18 continues

are components of the space-time vector 1st kind K = lor S, and X is a factor, which is to be determined from the relation w_ẇ = - 1. By multiplying (19) by w__{h}, and summing the four, we obtain X = Kẇ, and therefore clearly K + (Kẇ)w will be a space-time vector of the 1st kind which is normal to w. Let us write out the components of this vector as

X, Y, Z, ·iT

Then we arrive at the following equation for the motion of matter,

(21) ν d/dτ (dx/dτ) = X, ν d/dτ (dy/dτ) = Y, ν d/dτ (dz/dτ) = Z,

ν d/dτ (dx/dτ) = T, and we have also

(dx/dτ)² + (dy/dτ)² + (dz/dτ)² > (dt/dτ)² = -1,

and X dx/dτ + Y dy/dτ + Z dz/dτ = T dt/dτ.

On the basis of this condition, the fourth of equations (21) is to be regarded as a direct consequence of the first three.

From (21), we can deduce the law for the motion of a material point, i.e., the law for the career of an infinitely thin space-time filament.

Let x, y, z, t, denote a point on a principal line chosen in any manner within the filament. We shall form the equations (21) for the points of the normal cross section of the filament through x, y, z, t, and integrate them, multiplying by the elementary contents of the cross section over the whole space of the normal section. If the integrals of the right side be R{x} R{y} R{z} R{t} and if m be the constant mass of the filament, we obtain

(22) m d/dτ dx/dτ = R{x_},
m d/dτ dy/dτ = R{y_},
m d/dτ dz/dτ = R{z_},
m d/dτ dt/dτ = R{t_}

R is now a space-time vector of the 1st kind with the components (R{x} R{y} R{z} R{t}) which is normal to the space-time vector of the 1st kind w,—the velocity of the material point with the components

dx/dτ, dy/dτ, dz/dτ, i dt/dτ.

We may call this vector R the moving force of the material point.

If instead of integrating over the normal section, we integrate the equations over that cross section of the filament which is normal to the t axis, and passes through (x, y, z, t), then [See (4)] the equations (22) are obtained, but

Del 18Page 99 / 106

Del 18 continues

are now multiplied by dτ/dt; in particular, the last equation comes out in the form,

m d/dt (dt/dτ) = w_{x} R{x} dτ/dt + w__{y} R{y} dτ/dt + w__{z} R{z} dτ/dt_.

The right side is to be looked on as the amount of work done per unit of time at the material point. In this equation, we obtain the energy-law for the motion of the material point and the expression

m (dt/dτ - 1) = m [1/√(1 - w²) - 1] = m (½ |w₁² + 3/8 |w₁⁴ + )

may be called the kinetic energy of the material point.

Since dt is always greater than dτ we may call the quotient (dt - dτ)/dτ as the "Gain" (vorgehen) of the time over the proper-time of the material point and the law can then be thus expressed;—The kinetic energy of a material point is the product of its mass into the gain of the time over its proper-time.

The set of four equations (22) again shows the symmetry in (x, y, z, t), which is demanded by the relativity postulate; to the fourth equation however, a higher physical significance is to be attached, as we have already seen in the analogous case in electrodynamics. On the ground of this demand for symmetry, the triplet consisting of the first three equations are to be constructed after the model of the fourth; remembering this circumstance, we are justified in saying,—

"If the relativity-postulate be placed at the head of mechanics, then the whole set of laws of motion follows from the law of energy."

I cannot refrain from showing that no contradiction to the assumption on the relativity-postulate can be expected from the phenomena of gravitation.

If B(x, y, z, t*) be a solid (fester) space-time point, then the region of all those space-time points B (x, y, z, t), for which

(23) (x - x)² + (y - y)² + (z - z)² = (t - t)²

t - t* >= 0

may be called a "Ray-figure" (Strahl-gebilde) of the space time point B*.

Del 18Page 100 / 106

Del 18 continues

A space-time line taken in any manner can be cut by this figure only at one particular point; this easily follows from the convexity of the figure on the one hand, and on the other hand from the fact that all directions of the space-time lines are only directions from B towards to the concave side of the figure. Then B may be called the light-point of B.

If in (23), the point (x y z t) be supposed to be fixed, the point (x y z t) be supposed to be variable, then the relation (23) would represent the locus of all the space-time points B*, which are light-points of B.

Let us conceive that a material point F of mass m may, owing to the presence of another material point F, experience a moving force according to the following law. Let us picture to ourselves the space-time filaments of F and F along with the principal lines of the filaments. Let BC be an infinitely small element of the principal line of F; further let B be the light point of B, C be the light point of C on the principal line of F; so that OA′ is the radius vector of the hyperboloidal fundamental figure (23) parallel to BC, finally D is the point of intersection of line BC with the space normal to itself and passing through B. The moving force of the mass-point F in the space-time point B is now the space-time vector of the first kind which is normal to BC, and which is composed of the vectors

(24) mm(OA′/BD)³ BD in the direction of BD, and another vector of suitable value in direction of BC*.

Now by (OA′/BD) is to be understood the ratio of the two vectors in question. It is clear that this proposition at once shows the covariant character with respect to a Lorentz-group.

Let us now ask how the space-time filament of F behaves when the material point F has a uniform translatory motion, i.e., the principal line of the filament of F is a line. Let us take the space time null-point in this, and by means of a Lorentz-transformation, we can take this axis as the t-axis. Let x, y, z, t, denote the point B, let τ denote the proper time of B, reckoned from O. Our proposition leads to the equations

Del 18Page 101 / 106

Del 18 continues

(25) d²_x/dτ² = - mx/(t - τ)², d²__y/dτ² = - my/(t_ - τ)³

d²_z/dτ² = -mz/(t - τ)³, (26) d²__t/dτ² = -m/(t - τ)² d(t - τ*)/dt_

where (27) x² + y² + z² = (t - τ*)²

and (28) (dx/dτ)² + (dy/dτ)² + (dz/dτ)² = (dt/dτ)² - 1.

In consideration of (27), the three equations (25) are of the same form as the equations for the motion of a material point subjected to attraction from a fixed centre according to the Newtonian Law, only that instead of the time t, the proper time τ of the material point occurs. The fourth equation (26) gives then the connection between proper time and the time for the material point.

Now for different values of τ′, the orbit of the space-point (x y z) is an ellipse with the semi-major axis a and the eccentricity e. Let E denote the eccentric anomaly, Τ the increment of the proper time for a complete description of the orbit, finally nΤ = 2π, so that from a properly chosen initial point τ, we have the Kepler-equation

(29) nτ = E - e sin E.

If we now change the unit of time, and denote the velocity of light by c, then from (28), we obtain

(30) (dt/dτ)² - 1 = (m*/ac²) (1 + e cos E)/(1 - e cos E)

Now neglecting c⁻⁴ with regard to 1, it follows that

ndt = ndτ [ 1 + ½ m*/ac² (1 + e cos E)/(1 - e cos E) ]

from which, by applying (29),

(31) nt + const = (1 + ½ m/ac²) nτ + m/ac² Sin E.

the factor m/ac² is here the square of the ratio of a certain average velocity of F in its orbit to the velocity of light. If now m denote the mass of the sun, a the semi major axis of the earth's orbit, then this factor amounts to 10⁻⁸.

The law of mass attraction which has been just described and which is formulated in accordance with the relativity postulate would signify that gravitation is propagated with the velocity of light. In view of the fact that the periodic terms in (31) are very small, it is not possible to decide out of astronomical observations between such a law (with the modified mechanics proposed above) and the Newtonian law of attraction with Newtonian mechanics.

Footnote 29:

Del 18Page 102 / 106

Del 18 continues

Sichel—a German word meaning a crescent or a scythe. The original term is retained as there is no suitable English equivalent.

Del 19Page 103 / 106

Del 19

SPACE AND TIME
A Lecture delivered before the Naturforscher Versammlung (Congress of Natural Philosophers) at Cologne—(21st September, 1908).
Gentlemen,
The conceptions about time and space, which I hope to develop before you to-day, has grown on experimental physical grounds. Herein lies its strength. The tendency is radical. Henceforth, the old conception of space for itself, and time for itself shall reduce to a mere shadow, and some sort of union of the two will be found consistent with facts.
I
Now I want to show you how we can arrive at the changed concepts about time and space from mechanics, as accepted now-a-days, from purely mathematical considerations. The equations of Newtonian mechanics show a twofold invariance, (i) their form remains unaltered when we subject the fundamental space-coordinate system to any possible change of position, (ii) when we change the system in its nature of motion, i. e., when we impress on it any uniform motion of translation, the null-point of time plays no part. We are accustomed to look on the axioms of geometry as settled once for all, while we seldom have the same amount of conviction regarding the axioms of mechanics, and therefore the two invariants are seldom mentioned in the same breath. Each one of these denotes a certain group of transformations for the differential equations of mechanics. We look on the existence of the first group as a fundamental characteristics of space. We always prefer to leave off the second group to itself, and with a light heart conclude that we can never decide from physical considerations whether the space, which is supposed to be at rest, may not finally be in uniform motion. So these two groups lead quite separate existences besides each other. Their totally heterogeneous character may scare us away from the attempt to compound them. Yet it is the whole compounded group which as a whole gives us occasion for thought.
We wish to picture to ourselves the whole relation graphically. Let (x, y, z) be the rectangular coordinates of space, and t denote the time. Subjects of our perception are always connected with place and time. No one has observed a place except at a particular time, or has observed a time except at a particular place. Yet I respect the dogma that time and space have independent existences. I will call a space-point plus a time-point, i.e., a system of values x, y, z, t, as a world-point. The manifoldness of all possible values of x, y, z, t, will be the world. I can draw four world-axes with the chalk. Now any axis drawn consists of quickly vibrating molecules, and besides, takes part in all the journeys of the earth; and therefore gives us occasion for reflection. The greater abstraction required for the four-axes does not cause the mathematician any trouble. In order not to allow any yawning gap to exist, we shall suppose that at every place and time, something perceptible exists. In order not to specify either matter or electricity, we shall simply style these as substances. We direct our attention to the world-point x, y, z, t, and suppose that we are in a position to recognise this substantial point at any subsequent time. Let dt be the time element corresponding to the changes of space coordinates of this point [dx, dy, dz]. Then we obtain (as a picture, so to speak, of the perennial life-career of the substantial point),—a curve in the world—the world-line, the points on which unambiguously correspond to the parameter t from +∞ to -∞. The whole world appears to be resolved in such world-lines, and I may just deviate from my point if I say that according to my opinion the physical laws would find their fullest expression as mutual relations among these lines.
By this conception of time and space, the (x, y, z) manifoldness t = 0 and its two sides t < 0 and t > 0 falls asunder. If for the sake of simplicity, we keep the null-point of time and space fixed, then the first named group of mechanics signifies that at t = 0 we can give the x, y, and z-axes any possible rotation about the null-point corresponding to the homogeneous linear transformation of the expression
x² + y² + z².
The second group denotes that without changing the expression for the mechanical laws, we can substitute (x - αt, y - βt, z - γt for (x, y, z) where (α, β, γ) are any constants. According to this we can give the time-axis any possible direction in the upper half of the world t > 0. Now what have the demands of orthogonality in space to do with this perfect freedom of the time-axis towards the upper half?
To establish this connection, let us take a positive parameter c, and let us consider the figure
c²_t² - x² - y² - z²_ = 1
According to the analogy of the hyperboloid of two sheets, this consists of two sheets separated by t = 0. Let us consider the sheet, in the region of t > 0, and let us now conceive the transformation of x, y, z, t in the new system of variables; (x', y', z', t') by means of which the form of the expression will remain unaltered. Clearly the rotation of space round the null-point belongs to this group of transformations. Now we can have a full idea of the transformations which we picture to ourselves from a particular transformation in which (y, z) remain unaltered. Let us draw the cross section of the upper sheets with the plane of the x- and t-axes, i.e., the upper half of the hyperbola c²_t² - x² = 1, with its asymptotes (vide_ fig. 1).
Then let us draw the radius rector OA′, the tangent A′ B′ at A′, and let us complete the parallelogram OA′ B′ C′; also produce B′ C′ to meet the x-axis at D′. Let us now take Ox′, OA′ as new axes with the unit measuring rods OC′ = 1, OA′ = (1/c); then the hyperbola is again expressed in the form c²_t′² - x′² = 1, t′ > 0 and the transition from (x, y, z, t) to (x′ y′ z′ t) is one of the transitions in question. Let us add to this characteristic transformation any possible displacement of the space and time null-points; then we get a group of transformation depending only on c, which we may denote by G{c}.
Now let us increase c to infinity. Thus (1/c) becomes zero and it appears from the figure that the hyperbola is gradually shrunk into the x-axis, the asymptotic angle becomes a straight one, and every special transformation in the limit changes in such a manner that the t-axis can have any possible direction upwards, and x′ more and more approximates to x. Remembering this point it is clear that the full group belonging to Newtonian Mechanics is simply the group G{c}, with the value of c = ∞. In this state of affairs, and since G{c} is mathematically more intelligible than G{∞}, a mathematician may, by a free play of imagination, hit on the thought that natural phenomena possess an invariance not only for the group G{∞}, but in fact also for a group G{c}, where c_ is finite, but yet exceedingly large compared to the usual measuring units. Such a preconception would be an extraordinary triumph for pure mathematics.
At the same time I shall remark for which value of c, this invariance can be conclusively held to be true. For c, we shall substitute the velocity of light c in free space. In order to avoid speaking either of space or of vacuum, we may take this quantity as the ratio between the electrostatic and electro-magnetic units of electricity.
We can form an idea of the invariant character of the expression for natural laws for the group-transformation G{c_} in the following manner.
Out of the totality of natural phenomena, we can, by successive higher approximations, deduce a coordinate system (x, y, z, t); by means of this coordinate system, we can represent the phenomena according to definite laws. This system of reference is by no means uniquely determined by the phenomena. We can change the system of reference in any possible manner corresponding to the above-mentioned group transformation G{c}, but the expressions for natural laws will not be changed thereby._
For example, corresponding to the above described figure, we can call t′ the time, but then necessarily the space connected with it must be expressed by the manifoldness (x′ y z). The physical laws are now expressed by means of x′, y, z, t′,—and the expressions are just the same as in the case of x, y, z, t. According to this, we shall have in the world, not one space, but many spaces,—quite analogous to the case that the three-dimensional space consists of an infinite number of planes. The three-dimensional geometry will be a chapter of four-dimensional physics. Now you perceive, why I said in the beginning that time and space shall reduce to mere shadows and we shall have a world complete in itself.

Del 19Page 104 / 106

Del 19 continues

Del 20Page 105 / 106

Del 20

Now the question may be asked,—what circumstances lead us to these changed views about time and space, are they not in contradiction with observed phenomena, do they finally guarantee us advantages for the description of natural phenomena?
Before we enter into the discussion, a very important point must be noticed. Suppose we have individualised time and space in any manner; then a world-line parallel to the t-axis will correspond to a stationary point; a world-line inclined to the t-axis will correspond to a point moving uniformly; and a world-curve will correspond to a point moving in any manner. Let us now picture to our mind the world-line passing through any world point x, y, z, t; now if we find the world-line parallel to the radius vector OA′ of the hyperboloidal sheet, then we can introduce OA′ as a new time-axis, and then according to the new conceptions of time and space the substance will appear to be at rest in the world point concerned. We shall now introduce this fundamental axiom:—
The substance existing at any world point can always be conceived to be at rest, if we establish our time and space suitably. The axiom denotes that in a world-point, the expression
c²_dt² - dx² - dy² - dz²_
shall always be positive or what is equivalent to the same thing, every velocity V should be smaller than c. c shall therefore be the upper limit for all substantial velocities and herein lies a deep significance for the quantity c. At the first impression, the axiom seems to be rather unsatisfactory. It is to be remembered that only a modified mechanics will occur, in which the square root of this differential combination takes the place of time, so that cases in which the velocity is greater than c will play no part, something like imaginary coordinates in geometry.
The impulse and real cause of inducement for the assumption of the group-transformation G{c} is the fact that the differential equation for the propagation of light in vacant space possesses the group-transformation G{c}. On the other hand, the idea of rigid bodies has any sense only in a system mechanics with the group G{infinity}. Now if we have an optics with G{c}, and on the other hand if there are rigid bodies, it is easy to see that a t-direction can be defined by the two hyperboloidal shells common to the groups G{∞}, and G{c}, which has got the further consequence, that by means of suitable rigid instruments in the laboratory, we can perceive a change in natural phenomena, in case of different orientations, with regard to the direction of progressive motion of the earth. But all efforts directed towards this object, and even the celebrated interference-experiment of Michelson have given negative results. In order to supply an explanation for this result, H. A. Lorentz formed a hypothesis which practically amounts to an invariance of optics for the group G{c_}. According to Lorentz every substance shall suffer a contraction
1:(√(1 - v²/c²)) in length, in the direction of its motion
l/l′ = 1/√(1 - v²/c²) l′ = l(1 - v²/c²).
This hypothesis sounds rather phantastical. For the contraction is not to be thought of as a consequence of the resistance of ether, but purely as a gift from the skies, as a sort of condition always accompanying a state of motion.
I shall show in our figure that Lorentz's hypothesis is fully equivalent to the new conceptions about time and space. Thereby it may appear more intelligible. Let us now, for the sake of simplicity, neglect (y, z) and fix our attention on a two dimensional world, in which let upright strips parallel to the t-axis represent a state of rest and another parallel strip inclined to the t-axis represent a state of uniform motion for a body, which has a constant spatial extension (see fig. 1). If OA′ is parallel to the second strip, we can take t′ as the t-axis and x′ as the x-axis, then the second body will appear to be at rest, and the first body in uniform motion. We shall now assume that the first body supposed to be at rest, has the length l, i.e., the cross section PP of the first strip on the x-axis = l^. OC, where OC is the unit measuring rod on the x-axis—and the second body also, when supposed to be at rest, has the same length l, this means that, the cross section Q′Q′ of the second strip has a cross-section l^· OC′, when measured parallel to the x′-axis. In these two bodies, we have now images of two Lorentz-electrons, one of which is at rest and the other moves uniformly. Now if we stick to our original coordinates, then the extension of the second electron is given by the cross section QQ of the strip belonging to it measured parallel to the x-axis. Now it is clear since Q′Q′ = l^· OC′, that QQ = l^· OD′.
If (dc/dt) = v, an easy calculation gives that
OD′ = OC √(1-(v²/c²)), therefore (PP/QQ) = (1/√(1-(v²/c²))
This is the sense of Lorentz's hypothesis about the contraction of electrons in case of motion. On the other hand, if we conceive the second electron to be at rest, and therefore adopt the system (x′, t′,) then the cross-section P′P′ of the strip of the electron parallel to OC′ is to be regarded as its length and we shall find the first electron shortened with reference to the second in the same proportion, for it is,
P′P′/Q′Q′ = OD/OC′ = OD′/OC = QQ/PP
Lorentz called the combination t′ of (t and x) as the local time (Ortszeit) of the uniformly moving electron, and used a physical construction of this idea for a better comprehension of the contraction-hypothesis. But to perceive clearly that the time of an electron is as good as the time of any other electron, i.e. t, t′ are to be regarded as equivalent, has been the service of A. Einstein [Ann. d. Phys. 891, p. 1905, Jahrb. d. Radis.... 4-4-11-1907.] There the concept of time was shown to be completely and unambiguously established by natural phenomena. But the concept of space was not arrived at, either by Einstein or Lorentz, probably because in the case of the above-mentioned spatial transformations, where the (x′, t′) plane coincides with the x-t plane, the significance is possible that the x-axis of space some-how remains conserved in its position.
We can approach the idea of space in a corresponding manner, though some may regard the attempt as rather fantastical.
According to these ideas, the word "Relativity-Postulate" which has been coined for the demands of invariance in the group G, seems to be rather inexpressive for a true understanding of the group G{c}, and for further progress. Because the sense of the postulate is that the four-dimensional world is given in space and time by phenomena only, but the projection in time and space can be handled with a certain freedom, and therefore I would rather like to give to this assertion the name "The Postulate of the Absolute world_" [World-Postulate].

Del 20Page 106 / 106

Del 20 continues

III