TIQM: 3.3 The Transactional Model and Relativistic Quantum Mechanics

Go back to Table of Contents
Go back to Section 3.0
Continue to Section 3.8

The Transactional Interpretation of Quantum Mechanics

by John G. Cramer

3.3 The Transactional Model and Relativistic Quantum Mechanics

The transaction model discussed in the previous section deals with the emission and absorption of photons arising from electromagnetic interactions. The model uses advanced and retarded wave functions which are solutions of the electromagnetic wave equation:

[ c]²² = [²]d²/dt². [8]

Note that this differential equation is second order in the time variable. As was shown in a previous publication of the author (Cramer, 1980), the same transaction model can be applied to the emission and absorption of massive particles, either neutral or electrically charged, e.g., electrons {footnote 16}. The only requirement for this application is that the wave equations describing the particles of interest must, like the electromagnetic wave equation, have both advanced and retarded solutions.

This requirement would seem to present a problem. The wave equation which has been the focus of most of the discussion surrounding the interpretation of quantum mechanics is the Schrödinger equation:

-(²/2m)² = i d/dt, [9]

where m is the mass of the particle described by the equation. This equation is first order in the time variable and for this reason does not have advanced solutions. Therefore, if Psi =F(r,t) is a solution of the Schrödinger equation, then Psi *=G(r,t) is not a solution, nor is a linear combination of F and G as used in the transactional model.

We must bear in mind, however, that the Schrödinger equation is ultimately not physically correct because it is not relativistically invariant {footnote 17}. It should properly be considered as the limiting case, in a restricted non-relativistic domain, of some more physically reasonable relativistically invariant wave equation, e.g., the Dirac equation or the Klein-Gordon equation. These relativistic equations, like the electromagnetic wave equation, have both advanced and retarded solutions {footnote 18}.

Considering the Schrödinger equation as a limiting case, the apparent problem created by its lack of advanced solutions can be resolved. When a suitable relativistic wave equations is reduced to the Schrödinger equation by taking a non-relativistic limit (Bjorken, 1964), the reduction procedure leads to two distinct equations, the Schrödinger equation and another equation of the form:

-(²/2M)² = -i d/dt, [10]

which is the complex conjugate or time reverse of the Schrödinger equation. This equation has only advanced solutions. Equations (9) and (10) are equally valid non-relativistic reductions of relativistic dynamics, but equation (10) is usually dropped because it has negative energy eigenvalues. From this it should be clear that F(r,t) and G(r,t) (or Psi and Psi *) are equally valid solutions of the dynamics which underlies the Schrödinger equation. It is therefore valid to use advanced solutions in the transactional model in the non-relativistic limit as if they were solutions of the Schrödinger equation.

We can also look at the need for relativistic invariance in another way. The interpretational problem of Locality (see Section 2.4) which the recent tests of Bell's inequality have brought into sharp focus is essentially a relativistic problem. If the velocity of light were infinite the locality problem would not exist: there would be no difference between local and non-local descriptions. The Schrödinger equation can be considered as the limiting case of a relativistically invariant wave equation when the velocity of light goes to infinity. Therefore it is not particularly surprising that an explicitly non-local description such as the transactional model may have intrinsic inconsistencies with the Schrödinger equation and may require certain properties of relativistically invariant wave equations. This is a subtle link between relativity and quantum mechanics which has not, perhaps, been previously appreciated.

There is another implication of relativistic quantum mechanics which should also be discussed here. In the relativistic domain the uncertainty principle of Heisenberg (CI1) must be reconsidered because of the added restrictions of special relativity (Landau, 1931; Berestetskii, 1971). In particular, introduction of the limiting velocity c imposes a new uncertainty relation on the precision with which momentum p can be measured: Delta p= h-bar /(c Delta t). This relation can be considered to arise from the fact that the position localization Delta q (assumed to initially be small) cannot spread at a rate greater than c. The distance c Delta t is therefore the maximum possible amount by which the position localization can have been broadened in a time interval Delta t to permit a smaller localization of momentum. This places a limit on the precision Delta p with which momentum p can be measured in a time interval Delta t.

There is an analogous limitation on the determination of position q which arises from another characteristic of relativistic quantum field theories. As mentioned above, the solutions to the relativistically invariant wave equations for massive particles include advanced or negative frequency solutions. When a particle is localized to a sufficiently small region of space, these negative frequency functions appear explicitly in the expansion of its position wave packet. Landau and Peierls (1931) have suggested that in order to avoid the inclusion of "physically meaningless" negative frequency solutions it is reasonable to confine position determinations to a domain which does not include such processes. This corresponds to a limiting position uncertainty of Delta q= h-bar c/W where W is the total mass-energy of the particle. This position uncertainty limit for a particle of mass m with an initially small momentum is just Delta q= hbar /mc, the de Broglie wavelength of the particle.

Berestetskii, et al (1971) justify this limit on position localization in a related but slightly different way. They interpret the negative frequency or negative energy components of the wave function as indicating the onset of particle-antiparticle production when the momentum becomes large enough to correspond to a free energy greater than 2mc². They argue that when this threshold is reached in a measurement, for example in determining the position of an electron, "the formation of new particles in a way which cannot be detected by the process itself clearly renders meaningless the measurement of the electron coordinates". Thus the broadening in momentum is cut off at this limit, leading to the hbar c/W limit on the position localization.

Landau and Peierls (1931) have argued that these relativistic limits on determinations of position and momentum irretrievably compromise the utility of these dynamical variables for measurement in the sense of non-relativistic quantum mechanics. Neither the position nor the momentum of a particle can, even in principle, be determined to arbitrary accuracy in a finite time interval nor can either be considered to have a particular value at a particular time. This would seem, in effect, to invalidate Born's statistical interpretation of quantum mechanics (CI2), in that the description of the state vector as a mathematical representation of the probability of finding a definite value of a particular observable as a result of a measurement made at a given instant is untenable. However, Bohr and Rosenfeld (1933; see also Rosenfeld, 1955) defused this problem by demonstrating that in the relativistic formalism of quantum electrodynamics, in which the field quantities are not represented by point functions but by functions of space time regions, there is no discrepancy between limits imposed by the relativistic uncertainty principle and the physical possibilities of measurement.

To put it slightly differently, this is a non-problem. The relativistic limits on the precision with which dynamic variables can be measured do indeed make these variables less directly relevant to the coordinates of relativistic particles. However, this invalidates neither their use nor their usefulness. In a related non-relativistic case the angle of rotation theta remains a valid and sometimes useful dynamical variable although its measurable value is rendered completely uncertain by quantization of the conjugate angular momentum variable. The specification of a dynamical variable "at a given instant" as considered by Landau and Peierls (1931) is neither needed nor desirable. In the relativistic formalism it is integrals over spacetime regions rather than point values which lead to the predictions of observables. In this context, the atemporal and non-local character of the transactional interpretation as discussed below provides a natural way of describing the atemporal collapse of the state vector to some localized value of a dynamic variable. In fact, the notion that the state vector collapses to a particular value of a variable "at a given instant" is inconsistent with the transactional description. See Section 4.3, for example, for discussion of this point.

Another problem which raises some concern about the statistical interpretation of quantum mechanics in the relativistic domain is the observation that the solutions of the field equations cannot always be used to construct a relativistically invariant or positive definite probability density (Bjorken, 1964). Thus, while differential field equations and their solutions remain an indispensible feature of relativistic quantum field theory (Bjorken, 1965), the problem of the identity of these solutions (see Section 2.1) is made more severe because the statistical interpretation, at least in its simplest form, has proved inadequate. The state vector cannot be identified as a simple carrier of probability in the relativistic domain.

The naive statement of the statistical interpretation is clearly insufficient in the relativistic domain, particularly when applied to space-time regions where no measurement is actually made. Further, some of the formal procedures of non-relativistic quantum mechanics, e.g., the integration of wave function products over a large volume of space at a fixed time, are manifestly inconsistent with special relativity. However, the development of relativistic quantum theory has resulted in a formalism with calculational procedures which are appropriate to the relativistic domain. A generalized form of the statistical interpretation is implicit in these procedures for calculation of observables and matrix elements. It is therefore our view that the statistical interpretation should be (and has been) generalized for the relativistic domain rather than discarded.

The scope of the present paper is limited to the interpretation of quantum mechanics in the low-velocity non-relativistic limit, the arena where almost all of the previous discussion on the interpretation of quantum mechanics has been focussed. For this reason we will not further discuss the interpretation of relativistic quantum mechanics. However, we are not aware of any new interpretational problems which are added by a fully relativistic quantum field theory beyond those associated with particle creation and space-time delocalization as just discussed. The transactional interpretation of quantum mechanics presented in the next section is based on solutions of relativistically invariant differential field equations, is fully consistent with special relativity, and seems to accommodate these additional features of a relativistic quantum theory in a very natural way. We are therefore confident that the interpretation presented here, perhaps with minor embellishments, is appropriate for the interpretation of a fully relativistic theory of quantum mechanics.

3.4 The Transactional Interpretation

Now we are prepared to specify the premises of the transactional interpretation (TI). In doing this we will use a framework which is as similar as possible to the description of the Copenhagen interpretation given in Section 2.0. We will similarly use five principle elements which we enumerate here:

{TI1} The uncertainty principle is as in {CI1}. It is a consequence of the fact that a transaction in going to completion can project out and localize only one of a pair of conjugate variables from the offer wave. This will be discussed further in Section 3.8.
{TI2} The statistical interpretation is unchanged from {CI2}. It is a consequence of the fact that the "echo" received by the emitter in initiating the transaction follows the Born probability law P=*. This will be discussed further in Section 3.8, where the character of randomness in QM is examined.
{TI3} All physical processes have equal status. The observer, intelligent or otherwise, has no special status. Measurement and measuring apparatus have no special status, except that they happen to be processes which connect to observers. The "wholeness" of CI3 exists, but is not related to any special character of measurements but rather to the connection between emitter and absorber through the transaction. The "complementary" concept of CI3 likewise exists, but like the uncertainty principle is just a manifestation of the requirement that a given transaction going to completion can project out only one of a pair of conjugate variables.
{TI4} The fundamental quantum mechanical interaction is taken to be the transaction, as defined in the preceding section. The state vector of the quantum mechanical formalism is a real physical wave with spatial extent and is identical with the initial "offer wave" of the transaction. The particle (photon, electron, etc.) and the collapsed state vector are identical with the completed transaction. The transaction may involve a single emitter and absorber or multiple emitters and absorbers, but is only complete when appropriate quantum boundary conditions are satisfied at all loci of emission and absorption. Particles transferred have no separate identity which is independent from the satisfaction of these boundary conditions. The correspondence of the state vector with "knowledge of the system" of {CI4} is a fortuitous but deceptive consequence of the transaction, in that such knowledge must follow and describe the transaction.
{TI5} A distinction is made between observable and inferred quantities. The former are firm predictions of the overall theory and may be subjected to experimental verification. The latter, particularly those which are complex quantities, are not verifiable and are useful only for interpretational and pedagogical purposes. It is assumed that both kinds of quantities must obey conservation laws, macroscopic causality conditions, relativistic invariance, etc. Resort to the positivism of {CI5} is unnecessary and undesirable.

In summary, the TI adopts the first two elements of the CI and is also able to accommodate aspects of CI3. It drops the assertion of CI4 that the solutions of a simple second order differential equation relating mass, energy, and momentum are somehow related to "knowledge", and instead employs a usually neglected solution of that equation to construct the emitter-absorber transaction of TI4. The TI drops the positivism of CI5 because the positivist curtain is no longer needed to hide the nonlocal backstage machinery.

It should also be pointed out that the substitution TI4 for CI4, in giving objective reality to the state vector, colors all of the other elements of the interpretation. Although the uncertainty principle (TI1) and the statistical interpretation (TI2) are formally the same as in the CI, their philosophical implications, about which so much has been written from the viewpoint of the Copenhagen interpretation, may be rather different.

3.5 The Transactional Interpretation and the Formal Notation of QM

Comparison of the formal notation used in quantum wave mechanics with the description of the transaction model as discussed in Section 3.2 shows an excellent correspondence between the two. Once the TI is in firm conceptual grasp, the TI description of quantum processes can be perceived in the mathematical procedures and notations used in the calculation of observables. In particular, the QM formalism makes extensive use of the operation of complex conjugation. For simple systems this operation is equivalent to the operation of time reversal (Wigner, 1950) which transforms retarded waves into advanced waves {footnote 16}. Thus Psi * is the advanced confirmation wave (CW) equivalent of Psi , the retarded offer wave (OW). As mentioned above, Psi Psi * is the offer-confirmation wave echo which the emitter receives from a particular direction. Similarly,

_vol ^*dv [11a]

is the sum of all such OW-CW echoes from all possible locations in space.

Let us consider other examples. A QM "overlap integral" of the form

_vol ₁₂^*dv [11b]

can be interpreted in the TI as representing an average over all space of the "echoes" which an emitter sending out OW Psi ₁ receives from all possible absorbers sending back CW's which confirm transactions involving a final state described by Psi ₂. Moreover, the calculation of an expectation value of the variable x which is deduced from a given wave function Psi by the operator X, so that X = x, has the form:

^{<x> =} _vol ₂^* X ₁ dv. [11c]

This can be viewed as an average over space of the possible values of x which the operator X projects from the components of the OW which appear in the completed transaction. This interpretational approach can also be applied to other aspects of the QM formalism.

From one point of view the transactional interpretation is so apparent in the Schrödinger-Dirac form of the QM formalism, with its combinations of normal and time-reversed waves, that one might fairly ask why this obvious interpretation of the QM formalism had not been previously made. No one can, of course, explain why something did not occur in the history of the development of quantum physics, but several relevant observations can be made:

The Schrödinger equation, which has been the focus of interpretational investigations from the 1920's to the present, does not have advanced solutions because it involves only the first time derivative (as discussed above in Section 3.3), and so the association of ^* with advanced solutions of the wave equation is by no means obvious;
After Heisenberg devised CI4, concern about the nonlocality of quantum mechanics was effectively quenched for a long period and considerations of interpretational problems were directed elsewhere, and so it is only recently that concern about the nonlocality of the formalism has re-emerged because of the Bell inequality tests; and
The relativistic wave equations which do have advanced as well as retarded solutions have been treated with suspicion because they do not uniquely describe the state of a given system {footnote 18}.

Further, the "purpose" of the advanced solutions was thought to have been found with the discovery of the antimatter counterparts of "normal" fermions (e.g., positrons, antiprotons, etc). Therefore other "purposes" for the advanced wave solutions were not sought, and they were not associated with the Psi

^* of the formalism. Appendix A.5 discusses the few previous attempts to use advanced waves in QM interpretation.

3.6. Identity and Complexity in the Transactional Interpretation

In dealing with the problem of identity, TI2 gives the SV the same meaning as did CI2, as the medium for describing the probability of various possible quantum events. TI4 deals more directly with the problem of identity and deals with it in a way which is quite different from that of CI4. It asserts that the state vector is a real physical wave which is generated by the emitter and travels through space to the final absorber and also to many other space-time loci and to many other potential absorbers. The state vector is the "offer wave" which initiates the transaction. Because of the quantum mechanical boundary conditions the transaction is only completed between a single emitter and absorber in a one-quantum event. The particle itself (photon, electron ,etc.) is not identical with the state vector but with the completed transaction, of which the state vector is only the initial phase.

Schrödinger's {footnote 11} original attempt to interpret the state vector in a way similar to this was unsuccessful because of two problems:

It was found that quantum mechanical waves exhibit nonlocal or action-at-a-distance behavior when they are interpreted as real waves physically present in space; and
It was found not to be possible to describe a particle as a "wave packet" which remained in a tight group envelope as it propagated.

The TI deals with (1) directly because it is explicitly nonlocal, and moreover the components of the SV which travel in directions other than that of the eventual absorber do not have to "disappear" because they are only virtual in the sense that they transfer no energy or momentum and participate in no transaction. And (2) is not a problem for the TI because it is the formation of the transaction which localizes the energy and momentum transfer. The SV itself is therefore not required to stay in a tight packet in order to account for the particle-like behavior of the quantum event.

Since the TI considers the SV to be physically present in space it must deal directly with the problem of complexity. However, the transaction model is able to do this because at each point of the completed transaction at which there is a physical interaction, e.g., at the emitter and absorber loci, there is also a superposition of an advanced and a retarded wave of equal amplitude. Since + ^* = 2Re(), the collapsed SV becomes real, and there is no residual imaginary part of the SV to require explanation. The reader is referred to Section 4.4 for a detailed example showing the real net amplitude from a transaction. In regions of space where there was no interaction, the SV is permitted to be complex, but these components are "virtual" and non-interactive since they produce no transfer of energy or momentum. Thus, the TI has restored algebraic reality to the description of the microcosm.

3.7 Collapse and Nonlocality in the Transactional Interpretation

In the TI the collapse of the state vector is interpreted as the completion of the transaction started by the OW and the CW exchanged between emitter and absorber. The emergence of the transaction from the SV does not occur at some particular location in space or at some particular instant of time, but rather forms along the entire four-vector which connects the emission locus with the absorption locus (or loci in the case of multiple correlated particles). The transaction employs both retarded and advanced waves which propagate, respectively, along positive and negative lightlike (or timelike) four-vectors. Since the sum of these four-vectors can span spacelike and negative timelike or lightlike intervals, the "influence" of the transaction in enforcing the correlations of the quantum event is explicitly both nonlocal and atemporal.

Fig. 5 shows an example of such combinations of four-vector for a two photon transaction corresponding to an event in the Freedman-Clauser experiment. Note that although all of the waves in the transaction lie along lightlike world lines, the "influence" which enforces the correlations between the two polarization measurements spans a spacelike interval and is therefore nonlocal. This nonlocality is an explicit feature of the TI arising from the use of advanced waves.

Schrödinger (1935), in analyzing the EPR paradox, concluded that at least part of the problem lay in the way time is used in quantum mechanics (in the context of the CI). The CI treats time in an essentially classical non-relativistic way, and as we have seen in Section 2.5, this leads to inconsistencies with relativity or causality in any non-subjective CI description of collapse in, for example, the Freedman-Clauser experiment. The root of the inconsistencies lies in the implicit assumption of the CI that the SV collapse occurs at a particular instant at which a particular measurement is made and "knowledge" is gained, that before this instant the SV is in its full uncollapsed state, and that there can be a well-defined "before" and "after" in the collapse description. In the TI the collapse, i.e., the development of the transaction, is atemporal and thus avoids the contradictions and inconsistencies implicit in any time-localized SV collapse.

Further, the TI description does not need to invoke arbitrary collapse triggers such as consciousness, etc., because it is the absorber rather than the observer which precipitates the collapse of the SV, and this can occur atemporally and nonlocally across any sort of interval between elements of the measuring apparatus. This will be discussed further in the context of gedanken experiments in Section 4.

Continue to Section 3.8
Go back to Section 3.0
Go back to Table of Contents