12-Trying to find ways out

As the myth about the “role of observer”, “the loss of objective reality” , that was initiated by the use of the word "measurement" in Copenhagen interpretation spread out not only among physicists but in all areas of human thinking because of its philosophical consequences(I will discuss them in “reflections”) , the majority of physicists accepted the Copenhagen interpretation as it is without bothering about the philosophical consequences. They thought that the world at microscopic scales doesn’t have to be fully imaginable with our terminology since our terminology (wave. particle) reflects our experience in macroscopic world. They thought “we have to live with this mystery without further making lengthy philosophical discussions on it”. They said “Copenhagen interpretation gives us a framework where we can make our calculations and compare the experimental results with the calculations. That is what one can all want from a theory. All these philosophical discussions about the nature of duality and about the role of the observer in measurement process don’t affect how we do our work so why bother about these things. Our framework is enough for our purpose of doing science. “

Although majority gave up to discuss these issues, some took the problems of Copenhagen interpretation serious and tried to find a cure for the iatrogenic problem 1 by proposing alternatives to Copenhagen interpretation.

 

Alternative interpretations

Hidden parameters

In a quantum measurement we get different final states for the wave function although we start with the same initial wave function. Following was proposed to explain this situation: The wave function is not the only entity. There is an unobservable parameter that is different each time at the beginning. This unobservable difference in the initial conditions reveals itself in the measurement as different outcomes.

The question arising then are: what are these hidden parameters? what determines its values? Do and how the values of these hidden parameters change in time? ...

The entanglement of distant particles presents a problem for hidden variable theories. The entanglement of photons in EPR paradox means that there cannot be two separate independent hidden variables for each photon since the outcomes of spin measurements are correlated. (see chapter 8)

Thus if there is a hidden variable in this case , it must possess nonlocal properties. A successful hidden parameter theory is Bohm's pilotwave theory that we will discuss as next.

 

Bohm's pilotwave theory

One alternative to conceal wave and particle aspects is to assume that waves are like force fields guiding the pointlike particle. This approach was originally the idea of De Broglie and is known as Pariser school or pilotwave approach. The most successful of these theories is Bohm’s theory. It consists of two axioms 2.

1.The wave function evolves according to the Schrodinger equation.

2. There is a pointlike particle. The velocity of the particle is determined by the classical forces and by the quantum potential that is defined via the wave-function at the location of the particle according to a definite mathematical relation. Thus once the initial position of the particle is known, the wave function determines completely the trajectory of particle. (The particle position is the hidden variable )

The Bohm’s theory successfully predicts the statistical distribution of the bright dots in the double slit experiment it has however some peculiar aspects.

For one particle the quantum potential is function of spatial coordinates like an ether filling the space. And the particle position is the hidden parameter. But particle position is a local parameter. We said that Bells inequality violation of which was verified experimentally implies that hidden parameters have to be nonlocal. Where is the nonlocality in the Bohms theory? The nonlocal nature of Bohms theory becomes apparent if one considers more then one particles . The quantum potential is than not a function of ordinary 3 dimensional space anymore but a function of 3n variables for n particles. This leads to an instantaneous correlation of motions of particles however distant they are from each other.

It is criticized mainly because of this nonlocality. I think however that nonlocality is a fact (see chapter 8) that can not be explained away by considerations like “although there are nonlocal correlations there is no possibility of signal transfer at superluminal speeds since the quantum events have a random nature” . Or by stating that wave function is merely a mathematical construction 3, 4. Therefore it is unjust to eliminate Bohm's model merely because of the inherent nonlocality. We think however there is not enough direct evidence for the existence of pointlike entities since the bright dots on the scintillating screen can well be interpreted as transition of the wave-function from a spatially extended form to a localized form around a nucleus as Schrodinger suggested. In chapter 13 We will rediscuss the objections against Schrodinger's viewpoint mentioned in chapter 6 and try to show that they are unjustified prejudices or pseudoproblems and that they are not unsolvable.

Contrary to the assumption for fermions (material particles like proton electron etc) Bohm's model assumes bosons(photon graviton etc.) as fields without pointlike entity. It argues that despite the missing of pointlike entities the pointlike hits observed in photon detection process can be understood as a consequence of instability and an extreme sensitivity to initial conditions where depending on initial conditions only one of the atoms in the screen absorbs the photon 5. As I will discuss in chapter 14 and chapter 15 I think that a similar kind of instability occurs with electron beams when interactin with the atoms in the scintillating cristall so that we don’t need the pointlike entities for fermion fields either .

If you have read until here and chapter 13 and chapter 14 then Join the discussion on Bohm's theory

 

Everetts many worlds interpretation

One of the suggestions that at first look seem to eliminate subjectivism and indeterminism altogether is Everetts many worlds interpretation.6, 7 According to this view, in a measurement all the probabilities that are not realized in our universe are in truth realized in other parallel universes. There are two ways to imagine these parallel universes. Either a universe splits into many universes at the very moment of measurement i.e. these universes are created at this moment or all these universes already exist together with the common history until the very moment of measurement but they take different histories after the measurement. In The first view a measurement is some very special event class creating new universes, But like the Copenhagen interpretation many worlds interpretation cannot define the objective circumstances that lead to such a special event. The problem is similar to the problem in the objectivist version of Copenhagen interpretation where the role of observer is eliminated but no other consistent criteria (objective criteria) is proposed for defining the conditions when the unitary evolution of the wave function is interrupted . The second view with already existing parallel universes imply that for any possible slightest difference in the history and in the future , a separate universe already should exist at the very beginning. Even in this case the measurement remains as a special event class in the sense that it is the only event class that creates different results in different universes while non measurement events occur with exactly same course in parallel universes. Thus many world interpretation cannot give the objective criteria defining the conditions that leads to such a splitting of histories.

There is another difficulty with many world view. Let me explain it:

Lets assume that we make measurements of spin (internal angular momenta of elementary particles) of electrons . Electron's spin can have at any chosen arbitrary direction 2 values: up or down. We can prepare a particle with a desired spin alignment by an initial measurement and let for example all up’s of the first measurement enter in a second spin measurement apparatus. We can align then the axis of the second measurement apparatus relative to the first so, that that we have a probability of 3/4 for spin up 1/4 for spin down in the second measurement. From now on we refer to the second measurement as "measurement" .

Figure 12 two succesive measurements of spin with Stern Gerlach measuring devices with a nonzero relative alignement angle between the magnetic fields of the two measuring devices

If we obtain spin up in our universe, in the other universe the result must be spin down. or vice versa. We need 2 universes for one measurement . One measurement is not reliable data to speak about probabilities, so we prepare 20 particles with the same spin state using the first Stern-Gerlach device.

We make 20 measurements. In each universe we have a different combination of outcomes an example is:

A possible combination of the outcomes in an arbitrary chosen sample universe

Number of

Measurement

Outcome

1

d

2

d

3

u

...

17

d

18

u

19

d

20

d

 

There are 220 possible combinations. 220 (roughly 69 billion) universes must be involved. Since the relative alignement of the two measurement apparatus is the same in each universe under consideration we should expect for each universe that the ratio of (number of spin up outcomes / number of spin down outcomes) should be close to 3 ( ¾ divided by ¼). especially for large number of measurements . Of course it doesn't mean that there are no odd universes with more downs and less ups. Ours could be one of these. For small number of measurements it is not so unlikely for a universe to be an odd universe. However it should become less and less probable for a universe to remain an odd universe as the number of measurements increase. Although the number of the ODD universes increase with increasing number of measurements since the number of ALL possible universes to be considered increase the ratio (number of ODD universes / number of ALL universes) should go to 0 if number of measurements increase. We expect that in the limit of very large numbers in every universe the ratio of (number of spin up outcomes / number of spin down outcomes) asymptotically approaches the value 3

The problem in many worlds view is that, since each possible combination of outcomes occurs in only one universe, the number of universes with more downs then ups is always (independent of the number of measurements) the same as the number of universes with more ups then downs. In other words each possible combination is equally probable as the other one for a chosen universe. There is no way that the ratio 1/3 (which is determined by the relative alignment of the axis of the two measurement apparatus which is the same in all the considered set of universes because it was prepared before any measurement namely before any splitting of the histories) can reflect itself in every considered universe.

If you have read the paragraphs on many world interpretations, then Join the discussion

  

Multi-valued logic

One suggestion was to change the rules of logic. Conventionally a logical variable can be either true or false. It was proposed to allow values between true and false for logical variables. But of course such a fundamental change with far reaching consequences can be undertaken only if there is no other way to solve the problem. (This should not be confused with practical applications of fuzzy logic).

 

Consistent histories

The consistent histories approach takes the results of measurement events as facts and tries to find an history for the evolution of the wave function that is consistent with the outcomes of these measurements 8.

To illustrate this approach lets consider the following experimental arrangement in figure 13

 

Figure 13 - wave packet |a > entering the beam splitter B1, wave function splitting into two parts |c > and |d >, two detectors E (detects d) and F(detects c) .

 

A photon wave packet a enters a beam splitter B1 (an half silvered mirror that reflects half of the incident light and lets the other half pass through) The ordinary Quantum mechanics tells us that the wave function of even a single photon is split into two parts |c > and |d > . The photon is detected either by detector E or by detector F so that the photon undergoes a transition either to the state |e > or the state |f > . The detection by detector F implies that the wave function part |d> must suddenly disappear during the detection and join the part |c > in the detector F to create |f >. Thus superluminal currents must be associated with the detection process as discussed in chapter 6 if one interprets the wave function as a real physical field.

The consistent histories approach avoids this difficulty by saying simply that if we detect the photon with detector F we have to conclude that the wave function part |d > never existed and that the whole photon must have been reflected by the beam splitter B1. If it is detected by detector E the whole photon must have been passed through the beam splitter. If the experiment is conducted many times, in average half of the photons are detected by E the other half by F . This approach don’t try to answer the question why this photon is reflected and the other one passes through so that it doesn’t remove the indeterministic, probabilistic aspect but the indeterminacy is merely shifted from the position measurement event (namely the detection) to what happens in the beam splitter. Although not removing the probabilistic aspect it seems to avoid at first look the problems of nonlocality. But this is only apparent. The problem emerges if we conduct the experiment by adding a second beam splitter B2 (figure 14)

Figure 14- wave packet |a> entering the beam splitter B1, wave function splitting into two parts |c > and |d >, two mirrors (one for |c > and one for |d > to direct each beam towards a second beam splitter B2 and two detectors E (detects d) and F (detects c) behind B2.

The consistent histories approach says that if the photon is detected by either detector E or detector F it must have been in a superposition of c and d on the way. The following histories are consistent :

|a > ---> 1/Ö 2 (|c > + |d >) -----> |f > (detection by F)

|a > ---> 1/Ö 2 (|c > - |d >) -----> |e > (detection by E)

but not for example |a > ---> |c > -----> |f >

on the other hand the history |a > -----> |c > -----> 1/Ö 2 (|e > + |f >) is also consistent. But since the photon can be detected by only either E or by F, but not by both detectors simultaneously , this history is not physically realized.

Now here is the problem :

If the photon enters the beam splitter B1 it doesn’t know if a second beam splitter is placed on its way. In other words we don’t expect that the presence of the beam splitter B2 should have any influence on what happens in the beam splitter B1. But exactly such an influence is assumed in consistent histories approach. If the beam splitter B2 is present, the beam splits into two parts in beam splitter B1 but if B2 is absent it is either reflected by B1 or passes through B1 as a whole.

Thus neither the indeterminacy nor the effect nonlocal influences can be avoided in this approach. The nonlocal influences are much more strange than the nonlocal correlations due to the entanglement of distant particles.

If you have read the paragraphs on consistent histories then  Join the discussion on the subject

 

Environment induced decoherence

This approach focuses on the collapse process during the measurement and tries to describe the collapse process mathematically. It uses the so called the density matrix representation. The offdiagonal elements in the density matrix represent the quantum correlations without classical analogy. During the collapse process the offdiagonal elements vanish. Zurek tried to show that if one takes the effect of the environment on the combined system quantum system + measuring device one can understand how the offdiagonal elements in density matrix vanish because of decoherence so that we are left with a classical probability distribution given by the diagonal elements. 9,10,11 .The superposition of dead and alive states of the Schordingers cat is unstable . It either becomes a dead cat or an alive cat very rapidly. (in about 10-19 seconds) The vanishing of the offdiagonal elements is however only one aspect of the collapse process. At the end of the measurement only one of the diagonal elements is realized. Which one of the diagonal elements is realized can be predicted only probabilistically where the probability is given by the value of the diagonal element. Thus although decoherence describes while a superposition of dead and alive states of the Schrodinger’s cat is unstable it doesn’t provides us the answer if the final state will be the dead state or alive state. The decoherence approach assumes the existence of a macroscopical measuring device. The indeterministic transitions occur however in purely microscopical events like spontaneous emission and radioactive decay. Thus although decoherence is useful to understand how some properties of the macroscopic world emerge 11 it cannot explain the mechanism of the apparently indeterministic crucial step that, so I think, is the same independent if there is a macroscopic device present or if we have a purely microscopical process like spontaneous emission.

The recently popular approach is to combine the consistent histories approach with decoherence hoping to explain how inconsistent histories are eliminated from real world by the mechanism of decoherence. The combined interpretation is called decoherent histories. I don't think the two approaches can remove each others deficiencies when combined.

If you have read the pharagraphs on decoherence then Join the discussion on decoherence

 

Time dependent perturbation theory(TDPT)/Quantum electrodynamics QED

It may appear to the knowledgeable reader surprising that I will discuss QED and TPDT under the heading alternative interpretations. Everyone believes that they are extensions of ordinary quantum mechanics without a new interpretation. Yet there is an important terminology shift that has occurred and that is not widely recognized as such .

Time dependent perturbation theory was developed in order to understand the time evolution of the wave function under following conditions: There is a static potential and a time dependent potential namely a perturbation that is small compared to the time independent potential is superposed to this static potential. One develops the solution in terms of the eigenstates of the static potential using approximation methods for calculating its effects of the perturbation on the wave function.

TDPT allowed a description of the interaction of radiation with matter where the radiation is considered as a small contribution of oscillating potential in addition to the static Hamiltonian. To explain the terminology shift lets consider the following example:

The calculations showed that a radiation of frequency w could effect a system of charged particle(s) in two ways: 12

1.1) the radiation is absorbed by the charged particle(s) and makes a transition to an higher energy state, the energy of which is about hw greater then the energy of the initial state. Of course this can happen only if there is an empty higher energy eigenstate with this energy. Thus a portion of radiation of the energy hw is absorbed by the particle.

1.2)The radiation causes the particle to radiate its energy away. The particle emits a portion of radiation that has the same frequency w as the incident radiation and makes a transition to a lower energy state the energy of which is about hw lower then the energy of the initial state. This can happen only if there is an empty lower energy state with corresponding energy. This process is called induced emission.

Now consider the following situation:

Figure 15 - a system with the energy eigenstates |1> |2> and |3> equidistantly located on the energy scale where the difference between two neighboring states is hw . Initially the system is in the intermediate state |2 > and a photon of the energy hw is present.

Lets assume that the system is in the state |2> initially and radiation of the frequency w is present . According to the Schrodinger equation alone, both processes described above occur simultaneously because of the linearity of the equation and the contribution of both the upmost and lowest states should increase continuously simultaneously . So that the contribution of |2 > decreases and the system proceeds towards a superposition of |1> and |3> in the form a|1> + b|3>. But in reality the system doesn’t do that but it either makes a transition to |1> by emitting a photon or a transition to |3> by absorbing a photon. So is the prediction of Schrodinger equation wrong?

Now here comes the trick. If the same experiment is conducted many times one finds that the ratio of the number of upwards transitions to the number of downwards transitions gives the ratio of the squared coefficients in the superposition namely b2/a2.

Thus b2 is interpreted as the probability that an upwards transition occurs and a2 is the probability that a downwards transition occurs. Doing so one believes being consistent with probability wave interpretation. However this is an important terminology shift since the word “occurs” replaces the word “observed” in contrast to Copenhagen interpretation. This trick is not justified if we hold strictly to the Copenhagen interpretation. According to the Schrodinger equation the system develops towards a superposition and one should be able to observe of the superposition of |1> and |3> under these conditions. It is a simple fact that such a superposition is not observed . The logical consequence we should draw from this fact is that we have to declare the Schrodinger equation as not being valid when emission and absorption of radiation occur. Instead to overcome this difficulty the terminology was adapted itself to the facts. The reason was that statistical description was sufficient for practical purposes and no one was interested to investigate the detailed mechanism just because of the conceptual problems.

Notice that in TDPT the electromagnetic field is not considered as quantized.

Later the field quantization techniques were developed (see chapter 10). The concept of photon, that was born as a vague concept in order to explain the black body radiation and photo electric effect, established itself and got a sound theoretical basis. The interaction of radiation with matter was described as interaction between quantized fields. The resulting theory is called Quantum electrodynamics(QED) . All interactions with electromagnetic field is considered as absorption and emission of photons. Even the interaction with static field is considered as exchange of so called virtual photons 13 The field quantization has legitimated the terminology of the TDPT namely that “either this or that transition can occur but not both simultaneously” by asserting that a photon can either be emitted or absorbed but not both can happen simultaneously or by saying that half of the photon can not be emitted or absorbed.

In QED, processes are symbolized by Feynmen graphs (Figure 16) :

Figure 16 - Feynmen graphs for Photon emission, Photon absorption, Pair creation.

One can assign probabilities to such elementary processes. A complex physical process in general can be described by consideration of all the possible elementary sub processes relevant for the overall process.

Measurement is an objective physical process that can be described (although only probabilistically) by considering the corresponding Feynman graphs. Measurement doesn’t appear in the axioms of QED as a separate concept and therefore doesn’t play any central role in QED. Consider for example the position measurement. During position measurement the electron is rapidly slowed down by emission of radiation the so called Bremsstrahlung. The electron emits large number of photons and looses energy after each emission so that the Feynmen graph can be given as below.

Figure 17 - Feynmen graph for Bremsstrahlung

Thus the position measurement in double slit experiment has a probabilistic but objective (observation/observer independent) description in QED

In summary the terminology of TDPT and QED is objective-probabilistic contrary to the terminology of Copenhagen interpretation which is subjective-probabilistic. There is no such a thing as the role of the observer in QED. This important terminology shift seems to be unfortunately widely unrecognized by the people who are concerned with interpretations question.

 

If you have read the paragraphs on the terminology difference between QM and QED/TDPT, then Join the discussion on the subject

 

 

Next chapter

Notes and references

1 Iatrogenic is a term used to describe the complications that arise as a consequence of improper medical treatment. Philip Wallace thinks (as I agree) that similarly philosophical problems related to quantum mechanics is not inherent to the theory but merely consequence of misinterpretations. See Wallace 1996

2 Bohm 1993 p. 28,29

3 Chiao 1993

4 Stenger 1995 p.140

5 Bohm 1993 p.259.

6 Everett 1957

7 Everett 1973

8 Griffith 1999

9 Zurek 1982

10 Zurek 1991

11 Zurek 1993

12 Baym 1969 p.255-281

13 The coulomb force as emerging by a superposition of longitudinal and time-like photons , the book in BU,Physics dept..