Foundations of Probability and Physics: Proceedings of the Conference

PQ-QP: Quantum Probability and WItite Noise Analysis Volume XIII ^ ^ Proceedings of the Conference Foundations of p ro...

Author: A.. Khrennikov | A. Khrennikov

124 downloads 1202 Views 5MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

PQ-QP: Quantum Probability and WItite Noise Analysis Volume XIII

^ ^ Proceedings of the Conference

Foundations of p robability and physics Edited by A Khrennikov

World Scientific

^ ^ Proceedings of the Conference

foundations of Probability and physics

P Q - Q P : Quantum Probability and White Noise Analysis Managing Editor: W. Freudenberg Advisory Board Members: L. Accardi, T. Hida, R. Hudson and K. R. Parthasarathy

PQ-QP: Quantum Probability and White Noise Analysis Vol. 13:

Foundations of Probability and Physics ed. A. Khrennikov

QP-PQ Vol. 10:

Quantum Probability Communications eds. R. L. Hudson and J. M. Lindsay

Vol. 9:

Quantum Probability and Related Topics ed. L. Accardi

Vol. 8:

Quantum Probability and Related Topics ed. L. Accardi

Vol. 7:

Quantum Probability and Related Topics ed. L. Accardi

Vol. 6:

Quantum Probability and Related Topics ed. L. Accardi

PQ-QP: Quantum Probability and White Noise Analysis Volume XIII

Proceedings of the Conference

foundations of probability and physics Vaxjo, Sweden

25 November - 1 December 2000

Edited by A Khrennikov University of Vaxjo, Sweden

|5% World Scientific m

Jersey'London'Singapore* New Jersey • London • Singapore • Hong Kong

Published by World Scientific Publishing Co. Pte. Ltd. P O Box 128, Farrer Road, Singapore 912805 USA office: Suite IB, 1060 Main Street, River Edge, NJ 07661 UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE

British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library.

FOUNDATIONS OF PROBABILITY AND PHYSICS PQ-QP: Quantum Probability and White Noise Analysis - Vol. 13 Copyright © 2001 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording or any information storage and retrieval system now known or to be invented, without written permission from the Publisher.

For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required from the publisher.

ISBN 981-02-4846-6

Printed in Singapore by World Scientific Printers (S) Pte Ltd

V

Foreword With the present proceedings of a conference on "Foundations of Probability and Physics" we continue the QP series — the first volume of which appeared more than twenty years ago. The series had its origin in proceedings of conferences and workshops on quantum probability and related topics. Initially published by Springer-Verlag, World Scientific has now been the publisher for about ten years. Much has changed in the world of quantum probability in the last two decades. Quantum probabilistic methods became a mature subject in mathematics and mathematical physics. The number of well-established scientists who have turned their scientific interest to the field of quantum probability is impressively increasing. Scientifically and numerically strong schools of quantum probability evolved in the past years. Moreover, the highly interdisciplinary character of quantum probability became more and more evident. Especially, the close connections to white noise analysis aroused the interest of classical and quantum probabilists and stimulated mutual exchange and cooperation fruitful for both parties. Taking into account this development, during the previous QP conferences we discussed comprehensively and in detail the future profile and main goals of the series. Some changes in the alignment and the objectives of the series resulted from these discussions. First of all the new title reflects the intention to unify white noise analysis and quantum probability. It is important and essential to bring together classical and quantum probabilists, and the success of the World Scientific journal "Infinite Dimensional Analysis, Quantum Probability and Related Topics" shows that such an alliance will benefit both parties. Furthermore, we should be open to a wide audience of scientists and to a broad spectrum of themes. The present volume represents such a field being not very closely connected to quantum probability and white noise analysis but of general interest to the readership of the series. Future volumes of the series will include proceedings of conferences or workshops, lecture notes of schools but also monographs on topics in quantum probability and white noise analysis. Finally, we would like to thank all former editors of the series for their excellent job they did. We especially appreciate the enthusiastic commitment of Luigi Accardi who initiated the series and was the responsible editor for many years. Wolfgang Freudenberg

\

VII

Contents

Foreword

v

Preface

xi

Locality and Bell's Inequality L. Accardi and M. Regoli

1

Refutation of Bell's Theorem G. Adenier

29

Probability Conservation and the State Determination Problem S. Aerts

39

Extrinsic and Intrinsic Irreversibility in Probabilistic Dynamical Laws H. Atmanspacher, R. C. Bishop and A. Amann

50

Interpretations of Probability and Quantum Theory L. E. Ballentine

71

Forcing Discretization and Determination in Quantum History Theories B. Coecke

85

Interpretations of Quantum Mechanics, and Interpretations of Violation of Bell's Inequality W. M. De Muynck

95

Discrete Hessians in Study of Quantum Statistical Systems: Complex Ginibre Ensemble M. M. Duras Some Remarks on Hardy Functions Associated with Dirichlet Series W. Ehm Ensemble Probabilistic Equilibrium and Non-Equilibrium Thermodynamics without the Thermodynamic Limit D. H. E. Gross An Approach to Quantum Probability S. Gudder

115 121

131 147

Innovation Approach to Stochastic Processes and Quantum Dynamics T. Hida

161

Statistics and Ergodicity of Wave Functions in Chaotic Open Systems H. Ishio

170

Origin of Quantum Probabilities A. Khrennikov

180

Nonconventional Viewpoint to Elements of Physical Reality Based on Nonreal Asymptotics of Relative Frequencies A. Khrennikov

201

"Complementarity" or Schizophrenia: Is Probability in Quantum Mechanics Information or Onta? A. F. Kracklauer

219

A Probabilistic Inequality for the Kochen-Specker Paradox J.-A. Larsson Quantum Stochastics. The New Approach to the Description of Quantum Measurements E. Loubenets

236

246

Abstract Models of Probability V. M. Maximov

257

Quantum K-Systems and their Abelian Models H. Narnhofer

274

Scattering in Quantum Tubes B. Nilsson

303

Position Eigenstates and the Statistical Axiom of Quantum Mechanics L. Polley

314

Is Random Event the Core Question? Some Remarks and a Proposal P. Rocchi

321

Constructive Foundations of Randomness V. I. Serdobolskii

335

ix

Structure of Probabilistic Information and Quantum Laws J. Summhammer

350

Quantum Cryptography in Space and Bell's Theorem /. Volovich

364

Interacting Stochastic Process and Renormalization Theory Y. Volovich

373

xi

Preface This volume constitutes the proceedings of the Conference "Foundations of Probability and Physics" held in Vaxjo (Smoland, Sweden) from 25 November to 1 December, 2000. The Organizing Committee of the Conference: L. Accardi (Rome, Italy), W. De Muynck (Eindhoven, the Netherlands), T. Hida (Meijo University, Japan), A. Khrennikov (Vaxjo University, Sweden) and U. V. Maximov (Belostok, Poland). The purpose of the Conference (tentatively the first of a series) was to bring together scientists (physicists as well as mathematicians) who are interested in probabilistic foundations of physics. An emphasis was made on both theory and experiment, the underlying objective being to offer to the physical and mathematical scientific communities a truly interdisciplinary Conference as a privileged place for a scientific interaction among theoreticians and experimentalists. Due to the actual increased role of probabilistic foundations in physical applications (Einstein-Podolsky-Rosen correlation experiments, Bell's inequality, quantum information, computing and teleportation) as well as the necessity to reconsider foundations at the beginning of new millennium, the organizers of the Conference decided that it was just the right time for taking the scientific risk of trying this. Since the creation of Statistical Mechanics, probabilistic description plays more and more important role in physics. The new crucial step in the development of the statistical approach to physics was made in the process of the creation of quantum mechanics. The founders of quantum theory recognized that quantum formalism could not provide the description of physical processes for individual elementary particles. The understanding of this surprising fact induced numerous debates on the possibilities of individual and probabilistic descriptions and relations between them. These debates are characterized by the large diversity of opinions on the origin of quantum stochasticity. One of the viewpoints is that 'quantum stochasticity' differs from 'classical stochasticity'. So quantum (statistical) mechanics could not be reduced to classical statistical mechanics. This viewpoint implies convential interpretation of quantum mechanics. By this interpretation we could not use objective realism in quantum description of reality. The very fundamental physical quantities such as, for example, position and momentum of an elementary particle could not be considered as properties of the object, the elementary particle. The elementary particle can be in a state that is superposition of alternatives. Only the act of a measurement gives the possibility to choose between these alternatives.

xii

We recall historical roots of the origin of such a viewpoint, namely the idea of superposition. In fact, the whole 'quantum building' was built on two experimental cornerstones: 1) the experiment on photoelectric emission, 2) the two slit experiment." The first experiment definitely demonstrated that light has the corpuscular structure (discrete structure of energy). However, the second experiment demonstrated that photons (corpuscular objects), do not follow the standard CLASSICAL STATISTICS. The conventional rule for the addition of probabilistic alternatives: P =

P1+P2

is violated in the interference experiments. Instead of this rule, probabilities observed in interference experiments follow to quantum rule for the addition of probabilistic alternatives: P = Pi + P2 +

2T/P1P2COSO.

Thus in general the classical rule is perturbed by the cos 0-factor. The appearance of NEW STATISTICS induced the revolution in theoretical physics: reconsideration of the role of all basic elements of the physical theory. The common opinion was (and is) that quantum probabilistic rule could not be explained by purely corpuscular model. To explain this rule, we must apply to wave arguments, (see, for example, Dirac's book* for the detailed analysis of the roots of quantum mechanical formalism). This implies the wave-particle dualism and Bohr's principle of complementarity. This was the crucial change of the whole picture of physical reality (at least at micro-level). We underline again that all these revolutionary changes had the purely probabilistic root, namely the appearance of the new probabilistic rule. We also underline that the founders of quantum mechanics, in fact, did not provide deep probabilistic analysis of the problem. Instead of this, they analysed other elements of the physical model. And such an analysis induces the new description of physical reality that we have already discussed, namely 'quantum reality'. We will never know the real reasons of such a development of the a Of course, we must also mention that the necessity for a departure from classical mechanics was shown by experiments demonstrating the remarkable stability of atoms and molecules. The forces known in classical electrodynamics are inadequate for the explanation of this phenomenon. However, quantum mechanical explanation of such a stability is, in fact, based on the same arguments as the explanation of the photoelectric effect. b P. A. M. Dirac, The Principles of Quantum Mechanics (Claredon Press, Oxford, 1995).

xiii

theoretical study of the results of experiments with elementary particles at the beginning of the last century. It might be that one of the reasons was the absence of the mathematical theory of probability: A. N. Kolmogorov proposed the modern axiomatics of probability theory only in 1933. During the round table at this conference, Prof. T. Hida and Prof. I Volovich pointed out to the fundamental role of direct contacts between physicists and mathematician in the creation of new physical theories. It may be that the absence of the direct collaboration between quantum physical and probabilistic communities was the main root of the absence of deep probabilistic analysis of quantum behaviour. Debates on foundations of quantum mechanics were continued with a new excitement in the connection with Einstein-Podolsky-Rosen (EPR) paradox. Unfortunately the probabilistic element played the minor role in the EPR considerations. There was used (in a rather formal way) the notion of probability one in the formulation of the sufficient condition to be an element of physical reality. A new probabilistic impulse to debates on foundations of quantum mechanics was given by Bell's inequality. However, we must recognize that Bell's probabilistic considerations were performed on the formal level that could not be considered as satisfactory (at least from the point of view of mathematician). It may be that this absence of the deep probabilistic analysis of the EPR and Bell arguments was one of the main reasons to concentrate investigations in the direction of nonlocality and no-go theorems for hidden variables. The main aim of the conference "Foundations of Probability and Physics" was to provide probabilistic analysis of foundations of physics, classical as well as quantum (in particular, the E P R and Bell arguments). The present volume contains results of such analysis. It gives the general picture of probabilistic foundations of modern physics. Foundations of probability were considered in the close connection to foundations of physics. We demonstrated that probability plays the fundamental role in models of physical reality. It seems to be impossible to split probabilistic and physical problems. On one hand, many important problems that looks as purely physical are, in fact, just probabilistic problems. On the other hand, the right meaning of probability can be found only on the basis of physical investigations. Such a meaning depends strongly on a physical model. The conference and the present volume give the good example of the fruitful collaboration between physicists and mathematicians, stimulate research on the foundations of probability and physics, especially quantum physics. We would like to thank Swedish Natural Science Foundation, Swedish Technical Science Foundation, Vaxjo University and Vaxjo Commune for fi-

XIV

nancial support that made the Conference possible. We would also like to thank Prof. Magnus Soderstrom, the Rector of Vaxjo University, for support of fundamental investigations and, in particular, this Conference. Andrei Khrennikov International Center for Mathematical Modelling in Physics and Cognitive Sciences University of Vaxjo, Sweden December, 2000.

1 LOCALITY A N D BELL'S I N E Q U A L I T Y

LUIDGI ACCARDI, MASSIMO REGOLI Centro Vito Volterra Universita di Roma "Tor Vergata", Roma, Italy Email: accardi ©volterra. mat. uniroma2. it We prove that the locality condition is irrelevant to Bell in equality. We check that the real origin of the Bell's inequality is the assumption of applicability of classical (Kolmogorovian) probability theory to quantum mechanics. We describe the chameleon effect which allows to construct an experiment realizing a local, realistic, classical, deterministic and macroscopic violation of the Bell inequalities.

1

Inequalities among numbers

In this section we summarize some elementary inequalities among numbers, which correspond t o different forms of the Bell inequality one meets in t h e literature. Since some confusion have arosen about t h e mutual relationships among these inequalities, in particular their (in)equivalence and the cases of equality, such a s u m m a r y might not b e totally useless. L e m m a (1) For any two numbers a,c € [—1,1] t h e following equivalent inequalities hold: \a±c\
(1)

Moreover equality in (1) holds if a n d only if either o = ± l o r c = ± l . P r o o f . T h e equivalence of t h e two inequalities (1) follows from the fact t h a t one is obtained from the other by changing t h e sign of c and c is arbitrary in [-1,1]Since for any a, c 6 [—1,1], 1 ± ac > 0, (1) is equivalent to \a ± c\2 = a2 + c 2 ± 2ac < (1 ± ac)2 = 1 + a2c2 ± 2ac and this is equivalent t o a 2 ( l - c 2 ) + c2 < 1 which is identically satisfied because 1 — c 2 > 0 and therefore a2(l-c2)+c2 < l-c

2

+ c2 = 1

(2)

Notice t h a t in (2) equality holds if and only if a2 = 1 i.e. a = ± 1 . Since, exchanging a and c in (1) the inequality remains unchanged, the thesis follows.

2

Corollary (2) For any three numbers a,b,c € [—1,1] the following equivalent inequalities hold: \ab ± cb\ < 1 ± ac

(3)

and equality holds if and only if b = ± 1 and either a = ± l o r c = i l . Proof. For b e [-1,1], \ab±cb\ = \b\-\a±c\<\a±c\

(4)

so the thesis follows from Lemma (1). In (3.4) equality holds if and only if b = ± 1 , so also the second statement follows from Lemma (1). Lemma (3). For any numbers o, a', b, b', c e [—1,1], one has \ab - bc\ + \ab' + b'c\ < 2

(5)

\ab + ab' + a'b' -a'b\ < 2

(6)

In (5) equality holds if and only if b, b' = ± 1 and either a o r c = ± 1 . Proof. Adding the two inequalities in (3) one finds (5). The left hand side of (6) is < than \ab-ba'\

+ \ab' + l/a'\

(7)

and replacing a' by c, (7) becomes the left hand side of (5). Therefore (6) holds. If b, b' = ±1 and either a or c = ± 1 equality holds in (3) hence in (5). Conversely, suppose that equality holds in (5) and suppose that either \b\ < 1 or | V | < 1. Then we arrive to the contradiction 2 = \b\ • \a - a'\ + \b'\ • |o + a'\ <\a-

a'\ + \a + a'\ < (1 - aa') + (1 + aa!) = 2 (8)

So, if equality holds in (5), we must have |6| = \b'\ = 1. In this case (5) becomes \a-a'\

+ \a + a'\=2

(9)

and we know from Lemma (1) that the identity (4.1) can take place if and only if either a or a' = ± 1 .

3

Corollary (4). If a,a',b,b',c £ {-1,1}, then the inequalities (3), (6) and (5) are equivalent and equality holds in all of them. Proof. From Lemma (1) we know that the inequalities (1) and (2) are equivalent. Prom Lemma (3) we know that (3) implies (5). Choosing b' = a in (5), since a = ± 1 , (5) becomes \ab — cb\ < 1 — ac which is (3). The left hand side of (6) is a(b + b') + a'(b' - b)

(10)

In our assumptions either (b + b') or (b' - b) is zero, so (4) is either equal to \a(b+b')\ = \b + b'\=2 or to \a'(b'-b)\

= \b-b'\

=2

Corollary (5). If a,b',c G (—1,1), then the inequality (5), hence a fortiori (6), is strictly weaker than (3). Proof. We have already proved that that (3) implies (5), hence (6). On the other hand (5) is equivalent to \ab - bc\ < (1 - ac) + (1 + ac - \ab' + b'c\

(11)

B y L e m m a ( l ) , 1+ac— \ab'+b'c\ > 0 and equality holds if and only if | b' | = land either a or c is ± 1 . From this the thesis follows. 2

The Bell inequality

Corollary (1) (Bell inequality) Let A,B,C,D be random variables defined on the same probability space (f2, J-, P) and with values in the interval [—1,1]. Then the following inequalities hold: E(\AB - BC\) < 1 - E(AC)

(1)

E(\AB + BC\) < 1 + E{AC)

(2)

4

E(\AB - BC\) + E(\AD + DC\) < 2

(3)

where E denotes the expectation value in the probability space of the four variables. Moreover (1) is equivalent to (2) and, if either A or C has values ± 1 , then the three inequalities are equivalent. Proof. Lemma (1.1) implies the following inequalities (interpreted pointwise on fi): \AB - BC\ < 1 - AC \AB + BC\ < 1 + AC \AB - BC\ + \AD + DC\ < 2 from which (1), (2), (3) follow by taking expectation and using the fact that |£(-?0I < .Ed-X^). The equivalence is established by the same arguments as in Lemma (1.1). Remark (2). Bell's original proof, as well as the almost totality of the available proofs of Bell's inequality, deal only with the case of random variables assuming only the values + 1 and —1. The present generalization is not without interest because it dispenses from the assumption that the classical random variables, used to describe quantum observables, have the same set of values of the latter ones: a hidden variable theory is required to reproduce the results of quantum theory only when the hidden parameters are averaged over. Theorem (3). Let Sa , 5c , 5^ , 5^ be random variables defined on a probability space (£l,F, P) and with values in the interval -1,+1]. Then the following inequalities holds: £(5«5< 2 >) - E(SWSP)

< 1-

E(SWS^)

(4)

E(SMS12))

< 1+

E(S^SW)

(5)

E(sWsi2))

+ E(SWsi2))

- £ ( 5 « 5 < 2 ) ) + E(S^S{2))

Proof. This is a rephrasing of Corollary (2).

+ E(S^S{2))

<2

(6)

5

3

Implications of the Bell's inequalities for the singlet correlations

To apply Bell's inequalities to the singlet correlations, considered in the EPR paradox, it is enough to observe that they imply the following L e m m a (1) In the ordinary three-dimensional euclidean space there exist sets of three, unit length, vectors a, b, c, such that it is not possible to find a probability space (Q,, T, P) and six random variables SXJ (x = a, 6, c, j = 1,2) denned on ($7, J-, P) and with values in the interval [—1, +1], whose correlations are given by: E(SW-SM)

= -x-y

;

x,y = a,b,c

(1)

where, if x = (xi,X2,X3), y = (2/1,1/2;2/3) are two three-dimensional vectors, x • y denotes their euclidean scalar product, i.e. the sum x\yi + X2J/2 + ^32/3R e m a r k . In the usual EPR-type experiments, the random variables qti) qU) qii) represent the spin (or polarization) of particle j of a singlet pair along the three directions a,b,c in space. The expression in the right-hand side of (1) is the singlet correlation of two spin or polarization observables, theoretically predicted by quantum theory and experimentally confirmed by the Aspect-type experiments. Proof. Suppose that, for any choice of the unit vectors x = a,b,c there exist random variables Si as in the statement of the Lemma. Then, using Bell's inequality in the form (2.5) with A = s£ 1} , B = s f ) , C = S ^ ) , we obtain E(SWsl2))

+ E(S12)SW)\

< 1 + E{S
(2)

Now notice that, if x = y is chosen in (1), we obtain E{SP

• SM) =-x

• x = - \\x\\2 = ~l

;

x = a,b,c

and, since Si J S i ' = 1 this is possible if and only if Si1' = -Sx2>> (x = a, b, c) P-almost everywhere. Using this (2) becomes equivalent to:

\E{SPSI*>) + E(S^SW)\

<1-

E(S^S^)

or, again using (1), to: \a-b + b-c\ < 1 + o - c

(3)

6

If the three vectors a, b, c are chosen to be in the same plane and such that a is perpendicular to c and b lies between a and b, forming an angle 9 with a, then the inequality (3) becomes: cos9 + sin0 < 1

;

0 < 0 < TT/2

(4)

But the maximum of the function of 6 i—> sin 9 + cos 9 in the interval [0, n/2] is \/2 (obtained for 9 = 7r/4). Therefore, for 0 close to 7r/4, the left-hand side of (4) will be close to \/2 which is more that 1. In conclusion, for such a choice of the unit vectors a, b, c, random variables Sa , S^ ',Sc ,Sc as in the statement of the Lemma cannot exist. Definition (2) A local realistic model for the EPR (singlet) correlations is defined by: (1) a probability space (fl, T, P) (2) for every unit vector x, in the three-dimensional euclidean space, two random variables Sx ,SX defined on fi and with values in the interval [—1, +1] whose correlations, for any x, y, are given by equation (1).

Corollary (3) If a, b, c are chosen so to violate (4) then a local realistic model for the EPR correlations, in the sense of Definition (2), does not exist. Proof. Its existence would contradict Lemma (1). Remark. In the literature one usually distinguishes two types of local realistic models - deterministic and stochastic ones. Both are included in Definition (2): the deterministic models are defined by random variables Sx with values in the set{—1, +1}; while, in the stochastic models, the random variables take values in the interval [—1,+1]. The original paper [7] was devoted to the deterministic case. Starting from [9] several papers have been introduced to justify the stochastic models. We prefer to distinguish the definition of the models from their justification. 4

Bell on the meaning of Bell's inequality

In the last section of [8] (submitted before [7], but published after) Bell briefly describes Bohm hidden variable interpretation of quantum theory underlining

7

its non local character. He then raises the question: ... that there is no proof that any hidden variable account of quantum mechanics must have this extraordinary character ... and, in a footnote added during the proof corrections, he claims that: ... Since the completion of this paper such a proof has been found

mIn the short Introduction to [7], Bell reaffirms the same ideas, namely that the result proven by him in this paper shows that: ... any such [hidden variable] theory which reproduces exactly the quantum mechanical predictions must have ... a grossly nonlocal structure. The proof goes along the following scheme: Bell proves an inequality in which, according to what he says (cf. statement after formula (1) in [7]): ... The vital assumption [2] is that the result B for particle 2 does not depend on the setting a, of the magnet for particle \, nor A on b. The paper [2], mentioned in the above statement, is nothing but the Einstein, Podolsky, Rosen paper [11] and the locality issue is further emphasized by the fact that he reports the famous Einstein's statement [12]: ... But on one supposition we should, in my opinion, absolutely hold fast: the real factual situation of the system S2 is independent of what is done with the system Si, which is spatially separated from the former. Stated otherwise: according to Bell, Bell's inequality is a consequence of the locality assumption. It follows that a theory which violates the above mentioned inequality also violates ... the vital assumption needed, according to Bell, for its deduction, i.e. locality. Since the experiments prove the violation of this inequality, Bell concludes that quantum theory does not admit a local completion; in particular quantum mechanics is a nonlocal theory. To use again Bell's words: the statistical predictions of quantum mechanics are incompatible with separable predetermination ([7], p.199). Moreover this incompatibility has to be understood in the sense that: in a theory in which parameters are added to quantum mechanics to determine the results of individual measurements, without changing the statistical predictions, there must be a mechanism whereby the setting of one measuring device can influence the reading of another instrument, howevere remote. Moreover, the signal involved must propagate instantaneously,... 5

Critique of Bell's "vital assumption"

An assumption should be considered "vital" for a theorem if, without it, the theorem cannot be proved.

8

To favor Bell, let us require much less. Namely let us agree to consider his assumption vital if the theorem cannot be proved by taking as its hypothesis the negation of this assumption. If even this minimal requirement is not satisfied, then we must conclude that the given assumption has nothing to do with the theorem. Notice that Bell expresses his locality condition by the requirement that the result B for particle 2 should not depend on the setting a, of the magnet for particle 1 (cf. citation in the preceeding section). Let us denote Mi (M2) the space of all possible measurement settings on system 1 (2). Theorem (1) For each unit vector x in the three dimensional euclidean space (1 6 R 3 , I a; |= 1) let be given two random variables Sx , Sx (spin of particle 1 (2) in direction x), defined on a space D. with a probability P and with values in the 2-point set {+1, —1}- Fix 3 of these unit vectors a, b, c and suppose that the corresponding random variables satisfy the following non locality condition [violating Bell's vital assumption]: suppose that the probability space Cl has the following structure: !) = A x M , x M 2

(1)

so that, for some function Fj1', F^2' : A x Mi x M2 -»• [-1,1], Sal) (w) = F a (1) (A, mi, m 2 )

(S^

Sa2)(u) = Fa(2)(A, m i , m 2 )

(Sa2) depends on mi)

depends on m 2 )

(2) (3)

with mi € Mi,m2 € M.2 and similarly for b and c. [nothing changes in the (2) proof if we add further dependences, for example Fa may depend on all the 4 1 } (w) and F 0 (1) on all the SX2\LJ)}. Then the random variables Si', S^2', Sc satisfy the inequality I (SMS™)

- (S™SW)

|< 1 - (S^SM)

(4)

;

(5)

If moreover the singlet condition <5( 1 )-S( 2 )) = - 1

x = a,b,c

is also satisfied, then Bell's inequality holds in the form \(Sa^si2))-{S^S^)\
+ (sWS^)

(6)

9

Proof. The random variables Sa', S^ , Sc satisfy the assumptions of Corollary (2.3) therefore (4), holds. If also condition (5) is satisfied then, since the variables take values in the set {—1, +1}, with probability 1 one must have

SP = -SW and therefore (S^S^)

= -{S^S^).

(x = a,b,c)

(7)

Using this identity, (4) becomes (6).

Summing up: Theorem (1) proves that Bell's inequality is satisfied if one takes as hypothesis the negation of his "vital assumption". From this we conclude that Bell's "vital assumption" not only is not "vital" but in fact has nothing to do with Bell's inequality. REMARK. Using Lemma (14.1) below, we can allow that the observables take values in [—1,1] also in Theorem (1). REMARK. The above discussion is not a refutation of the Bell inequality: it is a refutation of Bell's claim that his formulation of locality is an essential assumption for its validity: since the locality assumption is irrelevant for the proof of Bell's inequality it follows that this inequality cannot discriminate between local and non local hidden variable theories, as claimed both in the introduction and the conclusions of Bell's paper. In particular: Theorem (1) gives an example of situations in which:

(i) Bell's locality condition is violated while his inequality is satisfied. In a recent experiment with M. Regoli [4] we have produced examples of situations in which: (ii) Bell's locality condition is satisfied while his inequality is violated. 6

The role of the counterfactual argument in Bell's proof

Bell uses the counterfactual argument in an essential way in his proof because it is easy to check that formula (13) in [7] paper is the one which allows him to reduce, in the proof of his inequality, all consideration to the A-variables (Sa in our notations, while Bell's -B-variables are the Sa ^ in our notations). The pairs of chameleons (cf. section (10), as well as the experiment of [4] provide a counterexample precisely to this formula.

10

7

Proofs of Bell's inequality based on counting arguments

There is a widespread illusion to exorcize the above mentioned critiques by restricting one's considerations to results of measurements. The following considerations show why this is an illusion. The counting arguments, usually used to prove the Bell inequality are all based on the following scheme. In the same notations used up to now, consider N simultaneous measurements of the singlet pairs of observables (S^, S%), (S£, S*), (S%, 5*) and one denotes S3XV the results of the v-th measurement of S°x (j = 1,2, x = a, b, c, v = 1 , . . . , N). With these notations one can calculate the empirical correlations on the samples, that is

u

(and similarly for the other ones). In the Bell inequality, 3 such correlations are involved.

(slsl),

{slsD,

{slsD

(2)

Thus in the three experiments observer 1 has to measure 5* in the first and third experiment and S* in the second, while observer 2 has to measure Sjj in the first and second experiment and S* in the third. Therefore the directions a and b can be chosen arbitrarily by the two observers and it is not necessary that observer 1 is informed of the choice of observer 2 or conversely. However the direction c has to be chosen by both observers and therefore at least on this direction there should be a preliminary agreement among the two observers. This preliminary information can be replaced it by a procedure in which each observer chooses at will the three directions only those choices are considered for which it happens (by chance) that the second choice of observer 1 coincides with the third of observer 2 (cf. section (15) for further discussion of this point). Whichever procedure has been chosen, after the results of the experiments one can compute the 3 empirical correlations

^ 2) ^ 1) ) = ^E^ 1 ) (^ 2 ) )^ 2 ) ^ 2 ) )

<4>

11 JV

(5)

where pj ' means the j - t h point of the 3-d experiment etc. If we try to apply the Bell argument directly to the empirical data given by the right hand sides of (3), (4), (5), we meet the expression

Jj E&WWto?) - ± E^^pf )5f (Pf) N

(6)

j=i

J=I

from which we immediately see that, if we try to apply Bell's reasoning to the empirical data, we are stuck at the first step because we find a sum of terms of the type

si^sPip^-sUip^sfHpV)

(7)

to which the inequalities among numbers, of section (1), cannot be applied because in general (8)

More explicitly: since the expression (x.) above is of the form ab — b'c with a, b, b', c € {±1}, the only possible upper bound for it is 2 and not 1 — ac. Even supposing that we, in order to uphold Bell's thesis, can introduce a cleaning operation [3], (cf. [4]), which eliminates all the points in which (8) is not satisfied, we would arrive to the inequality

jf E^frf) W>) - jf E ^ f W (*f) j=i

3= 1

<

i-^E^W^fef)

(9)

j=i

and, in order to deduce from this, something comparable with the experiments we need to use the counterfactual argument, assessing that (2h

^ 1 , ( p 9 ) ) = -s< a >(P a) )

(10)

12

But in the second experiment S^ ' and not Sc ' has been measured. Thus to postulate the validity of (10) means to postulate that: the value assumed by Sjj in the second experiment is the same that we would have found if Sc and (2)

not S^ had been measured. The chameleon effect provides a counterexample to this statement. 8

The quantum probabilistic analysis

Given the results of section (5), (6), (7), it is then legitimate to ask: if Bell's vital assumption is irrelevant for the deduction of Bell's inequality, which is the really vital assumption which guarantees the validity of this inequality? This natural question was first answered in [1] and this result motivated the birth of quantum probability as something more than a mere noncommutative generalization of probability theory; in fact a necessity motivated by experimental data. Theorem (2.3) has only two assumptions: (i) that the random variables take values in the interval [—1, +1] (ii) that the random variables are defined on the same probability space Since we are dealing with spin variables, assumption (i) is reasonable. Let us consider assumption (ii). This is equivalent to the claim that the three probability measures Pab,Pac,Pcb, representing the distributions of the pairs (Sa ,Sl '), (Sc , 5 ^ ), (Sa ,SC ) respectively, can be obtained by restriction from a single probability measure P, representing the distribution of the quadruple si1], s f \ s f \ SJ?\ This is indeed a strong assumption because, due to the incompatibility of the spin variables along non parallel directions, the three correlations

(spsP)

, <s«s<2>> , (s^sP)

(i)

can only be estimated in different, in fact mutually incompatible, series of experiments. If we label each series of experiments by the corresponding pair (i.e. (a, 6), (6, c), (c, a)), then we cannot exclude the possibility that also the probability measure in each series of experiments will depend on the corresponding pair. In other words, each of the measures Pa,b, Pb,c, Pc,a describes the joint statistics of a pair of commuting observables (Si1}, s f } ) , (S^, s f >),

13

(Sa ,Sc ) and there is no a priori reason to postulate that all these joint distributions for pairs can be deduced from a single distribution for the quadruple roU)

c(l)

o(2)

Q(2)I

We have already proved in Theorem (2.3) that this strong assumption implies the validity of the Bell inequality. Now let us prove that it is the truly vital assumption for the validity of this inequality, i.e. that, if this assumption is dropped, i.e. if no single distribution for quadruples exist, then it is an easy exercise to construct counterexamples violating Bell's inequality. To this goal one can use the following lemma: Lemma (1). Let be given three probability measures ±abi *aci -* c6 on & given (measurable) space (S1,f) and let S^, si1], S^, SJp be functions, defined on (Q,J-) with values in the interval [—1,-1-1], and such that the probability measure Pab (resp. Pcb,Pac) is the distribution of the pair (Sa ,Sl ) (resp. ( ^ 1 } , ^ 2 ) ) , ( S i 1 } , ^ 2 ) ) ) . For each pair define the corresponding correlation Kab:={SW,S^):=Jsa^S^dPab

and suppose that, for e,e' = ± , the joint probabilities for pairs Ki

••= P(Si1] = e • S™ = e')

satisfy: p + + _ p— xy

xy

.

p+- _ p-+

>

xy

M

xy

P? = Px = 1/2

(o\ \^I

(3)

then the Bell inequality \Kab - Kbc\
(4)

is equivalent to \p:b+-pb+c+\+p^+<\

(5)

Proof. The inequality (4) is equivalent to W

-

2P

ab" ~ *P&+ + 2P+-1 < 1 - 2Pa+c+ + 2 P + -

(6)

14

Using the identity (equivalent to (3)) *•-.xy

0

":xy

(')

the left hand side of (4) becomes the modulus of

2(^t+-^r )-2(nt + -nr) = 2 (*s + -f +pav) -2 (pbt+-\+nr) = 4(pav-nt+)

(8)

and, again using (7), the right hand side of (6) is equal to 1 - 2 ( P + + - 2 + Pac+ ) = 2 - 4P++

(9)

Summing up, (4) is equivalent to

\Kb+-Kc+\
-PaV

(io)

which is (5). Corollary (2). There exist triples of Pab,Pac,Pcb on the 4-point space { + 1 , - 1 } x { + 1 , - 1 } which satisfy conditions (1), (2) of Lemma (1) and are not compatible with any probability measure P on the 6-point space {+1,-1}X{+1,-1}X{+1,-1}.

Proof. Because of conditions (1), (3) the probability measures Pab, Pac, Pcb are uniquely determined by the three numbers

p:b+,p++,px+€io,i}

(ii)

Thus, if we choose these three numbers so that the inequality (5) is not satisfied, the Bell inequality (4) cannot be satisfied because of Lemma (1). 9

The realism of ballot boxes and the corresponding statistics

The fact that there is no a priori reason to postulate that the joint distributions of the pairs ( S ^ s f 0 ) , (si1],sf}), ( S ^ S ^ ) can be deduced from a single distribution for the quadruple Sa ,Sc ,Sl ' ,Sc , does not necessarily mean that such a common joint distribution does not exist.

15

On the contrary, in several physically meaningful situations, we have good reasons to expect that such a joint distribution should exist even if it might not be accessible to direct experimental verification. This is a simple consequence of the so-called hypothesis of realism which is justified whenever we are entitled to believe that the results of our measurements are pre-determined. In the words of Bell: Since we can predict in advance the result of measuring any chosen component of o
{SW=ai}, where ai,bi,a properties

[Sf^h],

[^ 1} = Cl ]

= ± 1 are equal to the relative frequency of the sextuples of

[S™ = ai] , [Si1] = h] , [SP = Cl ] , [SM = - 0 l ] , [S<2> = -bl]

, [S(2) = _ C l ]

and, since we are confining ourselves to the case of 3 properties and 2 particles, the above ones, when a\,bi,c\ vary in all possible ways in the set {±1}, are all the possible configurations in this situation, the counterfactural argument is applicable and in fact we have used it to deduce the joint distribution of sextuples from the joint distributions of triples.

16

10

The realism of chameleons and the corresponding statistics

According to the quantum probabilistic interpretation, what Einstein, Podolsky, Rosen, Bell and several other who have discussed this topic, call the hypothesis of realism should be called in a more precise way the hypothesis of the ballot box realism as opposed to hypothesis of the chameleon realism. The point is that, according to the quantum probabilistic interpretation, the term predetermined should not be confused with the term realized a priori, which has been discussed in section (9.): it might be conditionally dediced according to the scheme: if such and such will happen, I will react so and so.... The chameleon provides a simple example of this distinction: a chameleon becomes deterministically green on a leaf and brown on a log. In this sense we can surely claim that its color on a leaf is predetermined. However this does not mean that the chameleon was green also before jumping on the leaf. The chameleon metaphora describes a mechanism which is perfectly local, even deterministic and surely classical and macroscopic; moreover there are no doubts that the situation it describes is absolutely realistic. Yet this realism, being different from the ballot box realism, allows to render free from metaphysics statements of the orthodox interpretation such as: the act of measurement creates the value of the measured observable. To many this looks metaphysic or magic; but load how natural it sounds when you think of the color of a chameleon. Finally, and most important for its implications relatively to the EPR argument, the chameleon realism provides a simple and natural counterexample of a situation in which the results are predetermined however the counterfactual argument is not applicable. Imagine in fact a box in which there are many pairs of chameleons. In each pair there is exactly an healthy one, which becomes green on a leaf and brown on a log, and a mutant one, which becomes brown on a leaf and green on a log; moreover exactly one of the chameleons in each pair weights 100 grams and exactly one 200 grams. A measurement consists in separating the members of each pair, each one in a smaller box, and in performing one and only one measurement on each member of each pair. The color on the leaf, color on the log, and weight are 2-valued observables (because we do not know a priori if we are measuring the healthy or the mutant chameleon). Thus, with respect to the observables: color on the leaf color on the long and weight the pairs of chameleons behave exactly as EPR pairs: whenever the same observable is measured on both elements of a pair, the results are opposite. However, suppose I measure the color on the leaf, of one element of a pair and the weight of the other one and suppose the answers I

17

find are: green and 100 grams. Can I conclude that the second element of the pair is brown and weights 100 grams'! Clearly not because there is no reason to believe that the second member of the pair, of which the weight was measured while in a box, was also on a leaf. From this point of view the measurement interaction enters the very definition of an observable. However also in this interpretation, which is more similar to the quantum mechanical situation, the counterfactual argument cannot be applied because it amounts to answer "brown" to the question: which is the color on the leaf, if I have measured the weight and if I know that the chameleon is the mutant one? (this because the measurement of the other one gave green on the leaf). But this answer is not correct, because it could well be that inside the box there is a leaf and the chameleon is interacting with it while I am measuring its weight, but it could also be that it is interacting with a log, also contained inside the box in which case, being a mutant, it would be green. Therefore if we can produce an example of a 2-particle system in which the Heisenberg evolution of each particle's observable satisfies Bell's locality condition, but the Schroedinger evolution of the state, i.e. the expectation value (•), depends on the pair (a,b) of measured observables, we can claim that this counterexample abides with the same definition of locality as Bell's theorem. 11

Bell's inequalities and the chamaleon effect

Definition (1) Let S be a physical system and O a family of observable quantities relative to this system. We say that the it chamaleon effect is realized on S if, for any measurement M of an observable A £ O, the dynamical evolution of S depends on the observable A. If D denotes the state space of S, this means that the change of state from the beginning to the end of the experiment is described by a map (a one-parameter group or semigroup in the case of continuous time) TA : D->D Remark. The explicit form of the dependence of TA on A depends on both the system and the measurement and many concrete examples can be constructed. An example in the quantum domain is discussed in [3] and the experiment of [4] realizes an example in the classical domain. Remark If the system S is composed of two sub-systems S\ and 52, we can also consider the case in which the evolutions of the two subsystems are different in the sense that, for system 1, we have one form of functional dependence,

18 Tjj , of the evolution associated to the observable A and, for system 2, we have another form of functional dependence, Tjj'. In the experiment of [4], the state space is the unit disk D in the plane, the observables are parametrized by angles in [0,2n) (or equivalently by unit vectors in the unit circle) and, for each observable S i of system 1

and, for each observable S„ of system 2

where Ra denotes (counterclockwise) rotation of an angle a. Let us consider Bell's inequalities by assuming that a chamaleon effect

is present. Denoting E the common initial state of the composite system (1,2), (e.g. singlet state), the state at the end of the measurement will be

Now replace Sx

by ;

g(j) . = g{j) o T ( j ) "x

x

-"-x

Since the Sx take values ± 1 , we know from Theorem (2.3) that, if we postulate the existence of joint probabilities for the triple 5„ ' ,S^ ',Sc , compatible with the two correlations E(si1}S^2)), E(si1}S^2)), then the inequality

\E(S^si2))

- E(S^si2))\

<1-

E(S^S^)

holds and, if we also have the singlet condition E{S£\TWp)S?\TWp))

= -l

(1)

then a.e. and we have the Bell's inequality. Thus, if we postulate the same probability space, even the chamaleon effect alone is not sufficient to guarantee violation of the Bell's inequality. Therefore the fact that the three experiments are done on different and incompatible samples must play a crucial role.

19

As far as the chameleon effect is concerned, let us notice that, in the above statement of the problem the fact that we use a single initial probability measure E is equivalent to postulate that, at time t = 0 the three pairs of observables

(^U2)) , (sMa>) , (^U1}) admit a common joint distribution, in fact E. 12

Physical implausibility of Bell's argument

In this section we show that, combining the chameleon effect with the fact that the three experiments refer to different samples, then even in very simple situations, no cleaning conditions can lead to a proof of the Bell's inequality. If we try to apply Bell's reasoning to the empirical data, we have to start from the expression

~ E^W^sfcr^) - 1 E^crJV)^(if Pf) 3

(1)

3

which we majorize by

^N E W'^P^iT^p])

- SW(TJ V ) s f (tf V )

(2)

3

But, if we try to apply the inequality among numbers to the expression

SPiT^S^iTiW)

- S?\TWp»)sl2\T!;%»)\

(3)

we see that we are not dealing with the situation covered by Corollary (1.2), i.e.

\ab -cb\
(4)

because, since si2)(T^)^S^(T^Py)

(5)

the left hand side of (4) must be replaced by \ab-cb'\ whose maximum, for a, b, c,b' € [—1, +1] is 2 and not 1 — ac.

(6)

20

Bell's implicit assumption of the single probability space is equivalent to the postulate that, for each j = 1 , . . . , N P]=P"

(7)

Physically this means that: the hidden parameter in the first experiment is the same as the hidden parameter in the second experiment This is surely a very implausible assumption. Notice however that, without this assumption, Bell's argument cannot be carried over and we cannot deduce the inequality because we must stop at equation (2). 13

The role of the single probability space in CHSH's proof

Clauser, Home, Shimony, Holt [9] introduced the variant (2.6) of the Bell inequality for quadruples (a,b), (a,b'), (a',b), {a',b') which is based on the following inequality among numbers a, b, b', a € [—1,1] \ ab + ab'+ a'b - a'b' |< 2

(1)

Section (1) already contains a proof of (1). A direct proof follows from \b + b'\ + \b-b'\<2

(2)

because | ab + ab' + a'b - a'b' | = | a(b + b') + a'{b - b') | <\a\-\b

+ b' \ + \a' \-\b-b'

\<\b + b' \ + \b-b'

\<2

The proof of (2) is obvious. Remark (1) Notice that an inequality of the form \a1b1+a2b'2

+ a'3b3~a'4b'4\<2

(3)

would be obviously false. In fact, for example the choice c.1 = b\ = a2 = b'2 = a'3 = 63 = b'4 = 1

;

would give I o-ih + a2b'2 + a'3b3 - a'4b'4 \= 4

a 4 = —1

21

That is: for the validity of (1) it is absolutely essential that the number a is the same in the first and the second term and similarly for a' in the 3-d and the 4-th, b' in the 2-d and the 4-th, b in the first and the 3-d. This inequality among numbers can be extended to pairs of random variables by introducing the following postulates: ( P I ) Instead of four numbers a, b, b', a g [—1,1], one considers four functions o(l) c(2) o(l) o(2)

°a J°b ' °a' ' "-V

all defined on the same space A (whose points are called hidden parameters) and with values in [—1,1]. (P2) One postulates that there exists a probability measure P on A which defines the joint distribution of each of the following four pairs of functions

{&,&),

(#>,S), {S<$\SP), {S$\SP)

(4)

Remark (2) Notice that (P2) automatically implies that the joint distributions of the four pairs of functions can be deduced from a joint distribution of the whole quadruple, i.e. the existence of a single Kolmogorov model for these four pairs. With these premises, for each A € A one can apply the inequality (1) to the four numbers

and deduce that

I S£\\)S12\\) + SW{\)S$\\) + S«(A)Sf (A) - S$\\)S™(\) |< 2 (5) From this, taking P-averages, one obtains

I <slM2)) + (^1}42)> + < ^ 2 ) > - <s£W> i=

(6)

I J(SW{\)S12\\) + SW{\)S
22 S$\\)Sl2\\)

- S$\\)Si?>(\)

I dP(X) < 2

(8)

Remark (3) Notice that in the step from (6) to (7) we have used in an essential way the existence of a joint distribution for the whole quadruple, i.e. the fact that all these random variales can be realized in the same probability space. In EPR type experiments we are interested in the case in which the four pairs (a, b), (a, &'), (a',b), (a',b') come from four mutually incompatible experiments. Let us assume that there is a hidden parameter, determining the result of each of these experiments. This means that we interpret the number Sa (A) as the value of the spin of particle 1 in direction a, determined by the hidden parameter A. There is obviously no reason to postulate that the hidden parameter, determining the result of the first experiment is exactly the same one which determines the result of the second experiment. However, when CHSH consider the quantity (5), they are implicitly doing the much stronger assumption that the same hidden parameter A determines the results of all the four experiments. This assumption is quite unreasonable from the physical point of view and in any case it is a much stronger assumption than simply postulating the existence of hidden parameters. The latter assumption would allow CHSH only to consider the expression

SPiWfHXi) + S«(A2)42)(A2) + 5^(A3)5f (A3) - 5^(A4)4)(A4) (9) and, as shown in Remark (1.) above the maximum of this expression is not 2 but 4 and this does not allow to deduce the Bell inequality. 14

The role of the counterfactual argument in CHSH's proof

Contrarily to the original Bell's argument, the CHSH proof of the Bell inequality does not use explicitly the counterfactual argument. Since one can perform experiments also on quadruples, rather than on triples, as originally proposed by Bell, has led some authors to claim that the counterfactual argument is not essential in the deduction of the Bell inequality. However we have just seen in section (7.) that the hidden assumption as in Bell's proof, i.e. the realizability of all the random variales involved in the same probability space, is also present in the CHSH argument. The following lemma shows that, under the singlet assumption, the conclusion of the counterfactual argument follows from the hidden assumption of Bell and of CHSH.

23

Lemma (1) If / and g are random variables defined on a probability space (A, P) and with values in [—1,1], then

(fg) •= I fgdP = - i JA

if and only if P{fg = - i ) = i Proof.

If P(fg > - 1 ) > 0, then

/ fgdP = -P(fg

= -1)- /

JA

\fg\dP > -P(fg

= -1)-P(fg

> -1) > - 1

Jfg>-1

Corollary (2) Suppose that all the random variales in (x.3) are realized in the same probability space. Then, if the singlet condition: (SPSW)

= -1

(1)

is satisfied, then the condition SW = SM

(2)

(i.e. formula (13) in Bell's '64 paper) is true almost everywhere. Proof. Follows from Lemma (1) with the choice f = Sx , g = Si'.

Summing

up: if you want to compare the predictions of a hidden variable theory with quantum theory in the EPR experiment (so that at least we admit the validity of the singlet law) then the hidden assumption, of realizability of all the random variables in (3) in the same probability space, (without which Bell's inequality cannot be proved) implies the same conclusion of the counterfactual argument. Stated otherwise: the counterfactual argument is implicit when you postulate the singlet condition and the realizability on a single probability space. It does not matter if you use triples or quadruples. 15

Physical difference between the CHSH's and the original Bell's inequalities

In the CHSH scheme: (a,b),

(a',b'),

(a,b'),

(a',b')

24

the agreement required by the experimenters is the following: - 1 will measures the same observable in experiments I and III, and the same observable in experiments II and IV; - 2 will measure the same observable in experiments I and II, and the same observable in experiments III and IV. Here there is no restriction a priori on the choice of the observables to be measured. In the Bell scheme the experimentalists agree that: - 1 measures the same observable in experiments I and III, - 2 measures the same observable in experiments I and II - 1 and 2 choose a priori, i.e. before the experiment begins, a direction c and agree that 1 will measure spin in direction c in experiment II and 2 will measure spin in direction c in experiment III (strong agreement) The strong agreement can be replaced by the following (weak agreement): - 1 and 2 choose a priori, i.e. before the experiment begins, a finite set of directions c\,..., CK and agree that 1 will measure spin in a direction choosen randomly among the directions c\,..., CK in experiment II and 2 will do the same in experiment III In this scheme there is an a priori restriction on the choice of some of the observables to be measured. If the directions, fixed a priori in the plane, are K, then the probability of a coincidence, corresponding to a totally random (equiprobable) choice, is

p{*$ = 42A) = X > # =«; 42A =«) = £ h = h a=l

a=l

This shows that, contrarily than in the CHSH scheme, the choice has to be restricted to a finite number of possibilities otherwise the probability of coincidence will be zero. From this point of view we can claim that the Clauser, Home, Shimony, Holt formulation of Bell's inequalities realize a small improvement with respect to the original Bell's formulation. Reproduction of the E P R correlations by t h e chameleon effect Consider a classical dynamical system composed of two particles (1,2). Let S denote the state space of each of the particles and suppose that at time t = to (initial time) the state /ij, of particle 1, and the state /U°J OI particle 2, coincide:

H° = A=ti

(1)

25

Starting from time to, the two particles begin to move in opposite directions and, after a time interval of length T, two independent and non communicating experimenters simultaneously perform a measurement on each particle. Experimenter 1 (resp. 2) can choose among three different measurements, corresponding to the observables

SWSW.SW

(resp. 5 ( 2 ) , 5 f , ^ ) )

(2)

of particle 1 (resp. particle 2). We suppose that both particles satisfy the chameleon effect, described by the following: (1). Let S be the state space of a dynamical system u, let 7 be a set and, for each x € I, let be given a function DEFINITION

Sx : S -> R ;

x € I

(3)

representing an observable of the system. The system S ;

tell

(4)

depends on the measured observable Sx. In our case we consider only two instants of time, the initial one and the one when the measurement takes place, and we omit time from our notations. Moreover, in our case we have two particles and each particle is far away from the other one hence it can only feel the interaction with the measurement apparatus near to it. So, combining the locality principle with the chameleon effect, we conclude that, if experimenter 1 (resp. 2) chooses to measure the observable Sx (resp. Sy ) then particle 1 (resp. 2) will evolve according to the dynamics T1>x

(resp. T2lV)

(5)

In our case the variables x, y can be any element of the set {a, b, c}. Suppose that experimenter 1 chooses to measure and experimenter Let /^ti (resp. /j,2) denote the final state, i.e. the state at the time when the measurement occurs, of particle 1 (resp. 2). Condition (3.1) is then equivalent to ^iTaVi = T276Va

(6)

26

The empirical correlations of the measurements will then be

i

£ 5(1)(/x1)5f ( / i ^ C O i - T2>2)

(7)

where J^(-) is a <5-like factor keeping into account the fact that only the configurations satisfying condition (6) give a non zero contribution to the correlations. Now suppose that the state space S is the real line R . Thus the empirical correlations (7) are

na,b = Z J J 5 « ( m ) 5 f (M2)^(T1;aV1 -

T^^d^d^

(8)

where Z is a normalization constant. With the change of variables T ^ V i =: Ai ;

T~^2 =: A2

(9)

(8) becomes z j J 5W(T 1 , a A 1 )^ 2) (T 2 , b A 2 )<5(A 1 - X2)dTha(X1)dT2,b(X2)

(10)

Now introduce the notations S^\TiiX\j)=:S^(\j);

j = l,2;

x = a,b

with these notations, supposing as always possible, that T[i0(Ai),T2 (10) becomes Z j j S^{X1)S{b2\x2)8{Xl

- X2)T{
(11) 6 (A 2 )

> 0,

=

Z JSi1\X)si2)(X)Tla(X)Tib(X)dX Now let us make the following choices: A 6 [0,2vr] «• supp S<j) C [0, 2TT]

(12)

Z = (27T)"1

(13)

27

SW(\)

T'b = V^

(14)

na(A) = ^ | c o s ( A - a ) |

(15)

= sgn (cos(A - x)) ;

(16)

S™ = -S™

With these choices, the correlations (8) become I-2TT

( S ^ f i f }> = -

I

sgn (cos(A - a)) sgn(cos(A - 6 ) ) - | cos(A - a)\d\ Jo

= —

(17)

4

/ sgn (cos(A — b)) cos(A — a)d\ = — cos(b — a) = —a • b

which are the EPR correlations. References 1. L. Accardi, Phys. Rep. 77, 169-192 (1981). 2. L. Accardi, Urne e camaleonti. Dialogo sulla realta, le leggi del caso e la teoria quantistica. (II Saggiatore, 1997). Japanese translation, Maruzen (2000), russian translation, ed. by Igor Volovich (PHASIS Publishing House, 2000), english translation by Daniele Tartaglia, to appear 3. L. Accardi: On the EPR paradox and the Bell inequality Volterra Preprint N. 350 (1998). 4. L. Accardi, M. Regoli, Quantum probability and the interpretation of quantum mechanics: a crucial experiment,Invited talk at the workshop: "The applications of mathematics to the sciences of nature: critical moments and aspetcs", Arcidosso June 28-July 1 (1999). To appear in the proceedings of the workshop, Preprint Volterra N. 399 (1999) 5. L. Accardi, M. Regoli, Local realistic violation of Bell's inequality: an experiment, Conference given by the first-named author at the Dipartimento di Fisica, Universita di Pavia on 24-02-2000, Preprint Volterra N. 402 6. L. Accardi, M. Regoli, Non-locality and quantum theory: new experimental evidence, Invited talk given by the first-named author at the Conference: "Quantum paradoxes", University of Nottingham, on 4-05-2000, Preprint Volterra N. 421 7. J. S. Bell, Physics 1, 3, 195-200 (1964). 8. J. S. Bell, Rev. Mod. Phys. 38, 447-452 (1966).

28

9. J. F. Clauser , M.A. Home, A. Shimony, R. A. Holt, Phys. Rev. Letters 49, 1804-1806 (1969); J. S. Bell, Speakable and unspeakable in quantum mechanics. (Cambridge Univ. Press, 1987). 10. J. F. Clauser, M. A. Home, Phys. Rev. D 10, 2 (1974) 11. A. Einstein, B. Podolsky, N. Rosen, Phys. Rev. 47, 777-780 (1935). 12. A. Einstein in: Albert Einstein: Philosopher Scientist. Edited by P.A. Schilpp, Library of Living Philosophers (Evanston, Illinois, 1949).

29 R e f u t a t i o n of B e l l ' s T h e o r e m

Guillaume A D E N I E R Louis Pasteur University, Strasbourg, France. E-mail: [email protected] Bell's Theorem was developed on the basis of considerations involving a linear combination of spin correlation functions, each of which has a distinct pair of arguments. The simultaneous presence of these different pairs of arguments in the same equation can be understood in two radically different ways: either as 'strongly objective,' that is, all correlation functions pertain to the same set of particle pairs, or as 'weakly objective,' that is, each correlation function pertains to a different set of particle pairs. It is demonstrated that once this meaning is determined, no discrepancy appears between local realistic theories and quantum mechanics: the discrepancy in Bell's Theorem is due only to a meaningless comparison between a local realistic inequality written within the strongly objective interpretation (thus relevant to a single set of particle pairs) and a quantum mechanical prediction derived from a weakly objective interpretation (thus relevant to several different sets of particle pairs).

1

Introduction

Bell's Theorem 1 exhibits a peculiar discrepancy between any local realistic theory and Quantum Mechanics, which leads to empirically distinguishable alternatives. The quandary is that neither local realistic conceptions nor Quantum Mechanics are easy to abandon. Indeed, classical physics and common sense are usually based upon the former, while the latter is rightly presented as the most successful theory of all times. Several experiments have been done, all but a few2 show violations of Bell inequalities. 3 Yet, the ideas brought forth by Bell's Theorem are so disconcerting that there is still incredulity, not to mention antipathy, evoked by the verdict. The purpose of this article is to provide a refutation of this theorem, within a strictly quantum theoretical framework, without the use of outside assumptions. 2 2.1

The E P R B gedanken experiment Spin observables and singlet state

Bell's theorem is usually based on a didactic reformulation of the EPR (Einstein, Podolsky and Rosen 4 ) gedanken experiment, due to D. Bohm. 5 In this EPRB gedanken experiment, a pair of spin-| particles with total spin zero is produced such that each particle moves away from the source in opposite directions along the y-axis. Two Stern-Gerlach devices are placed at opposite

30

points (left and right) on the y-axis, and are oriented respectively along the directions u and v. The Hilbert space associated with the entire EPRB system is H = 7ih <8>HR, where T^L and HR are the Hilbert spaces associated with each Stern-Gerlach device respectively. The spin observable has two counterparts in this new product space H as CTL-U
•v

=

I R ,

=

I L ® a • v,

(1)

(2)

where I I and IR are the identity operators of ~Hh and % R . Contrary to the observables a • u and a • v which are mutually non commuting when u ^ v, these new observables ox • u and OR • v do commute, reflecting the fact that the Stern-Gerlach devices are arbitrarily far from each other, and are thus measuring distinct subsystems. The product of these two observables is therefore also an observable and can be understood as a spin correlation observable corresponding to the joint spin measurement of both Stern-Gerlach devices. Its eigenvectors are | £ L , U ) | £ R , V ) , with corresponding eigenvalues £L-£R> where each e is either + 1 or —1. In an EPRB gedanken experiment, the source produces particle pairs with zero total spin, represented by the singlet state

M = ^ [l+' n > ® !->n> - !->n> ® l+'n>]>

(3)

where n is an arbitrary unitary vector which can usually be ommited since the singlet state is invariant under rotation. 6 2.2

Statistical properties and hidden-variables

The expectation value of a spin observable for the singlet state \ip) is zero: (#r-u(8>lR|V>) MIL®

= =

0, 0,

(4)

whatever u and v, as follows from the rotational invariance of the singlet state. Likewise, the expectation value of the spin correlation observable 6'7 is E*(u,v)

=

M(ofu)(o*-v)M

(5)

=

-u-v,

(6)

which depends only on the relative angle between u and v.

31

In a local realistic hidden-variables model, a single particle pair is supposed to be entirely characterised by means of a set of hidden-variables, which are symbolically represented by a parameter A, so that the measurement result on the left along u can be written as A(u,A), and the result on the right along v as B(v,\). Although the hidden-variables model is supposed to be fully deterministic, it must also be capable of reproducing the stochastic nature of the EPRB gedanken experiment expressed in Eqs. (4) and (6). For that purpose, the complete state specification Aj of any particle pair with label i must be a random variable: 1,s its complete state Aj is supposed to be drawn randomly according to a probability distribution p. Consider a set of N particle pairs {i = 1,... ,N}, the mean value of joint spin measurements for this set is : 1

N

M"(u,v) = - ^ A ( u , A i ) B ( v ! A i ) . 3

(7)

The 'CHSH' function

In order to establish Bell's Theorem, a linear combination of correlation functions c(a, b) with different arguments 9 is considered, once when these correlation functions are expectation values E^{a,v) given by Quantum Mechanics; i.e., Eq.(6), and once when they are mean values M p ( u , v ) given by local hidden-variables theories, Eq.(7); then the results are to be compared. A well known choice of such a linear combination is the CHSH (Clauser, Home, Shimony and Holt 1 0 ) function, written with four pairs of arguments: S = |c(a,b) - c ( a , b ' ) + c ( a ' , b ) + c(a',b')|.

(8)

The exact meaning of the simultaneous presence of these different arguments in a CHSH function must be clarified. Basically, there are two possible interpretations, the strongly objective interpretation and the weakly objective interpretation: 11,12 Strongly Objective Interpretation implies that all correlation functions are relevant to the same set of N particle pairs. As such they cannot be relevant to actual experiments but rather with what result would have been obtained if measured on the same set of N particle pairs along different directions. Weakly Objective Interpretation implies that each correlation function is actually to be measured on distinct sets of N particle pairs, that is, for each pair only one joint spin measurement is to be executed.

32

The CHSH function was actually developed specifically for experimental convenience, 10 and many experiments have been done (the most famous being Aspect's 1 3 ), obviously invoking the natural interpretation, namely the weakly objective one. Nevertheless, the strongly objective interpretation must also be considered, since it remains a possible interpretation a priori, and since the choice between strong and weak objectivity is not made at all explicit in many papers, including Bell's. It must be stressed that these interpretations are radically different, not only epistemologically, but also physically. Indeed, the strongly objective interpretation pertains to a single set of N particle pairs characterised by the corresponding set of parameters {A; ; i = 1 , . . . , TV}; whereas the weakly objective interpretation pertains to no less than 4 sets of N particle pairs. The fact is that a finite set of N particle pairs characterised by {A;} can't be identically reproduced, either theoretically (for each complete state A; of any particle pair i is a random variable, as defined in Section 2.2), or empirically (for the experimenter has no control over the complete state of a particle pair in a singlet state). Hence, in the weakly objective interpretation, these four sets are necessarily four different sets of particle pairs 7 ' 14 respectively characterised by four different sets of hidden-variables parameters {Ai,j}, {^2,i}, {^3,i} a n d

{A4,J. The difference between each interpretation can therefore be embodied in the number of degrees of freedom of the whole system. Let / be the degrees of freedom of a single particle pair. In the strongly objective interpretation the degrees of freedom of the whole CHSH system is then Nf, whereas in the weakly objective interpretation it is 4 times as large, that is, 47V/. Thus, before initiating Bell's analysis, one has to choose explicitly one interpretation and stick to it. 4

Strongly objective interpretation

4-1

Local realistic inequality within strongly objective interpretation

The local realistic formulation of the CHSH function within strong objectivity is written OP

^strong

M " ( a , b ) - M " ( a , b ' ) + M' > (a',b) + M' , (a',b') ,

(9)

which (using Eq. 7) becomes after factorisation a summation where each term can have two values 2 ' 7 A(a, Xi) \B(b, Xi) - B(b', Xi)] + A(a', Xt) [l?(b, A<) + B(b', A*)] = ±2,

(10)

33

so that the most restrictive local realistic inequality within the strongly objective interpretation is : Strong < 2-

(11)

This is the well known generalised formulation of Bell's inequality due to CHSH. 10 It must be stressed once more, however, that this inequality has been established only within the strongly objective interpretation, which means that each expectation value is relevant to the same set of N particle pairs. Hence, this result cannot be compared directly with results from real experimental tests, where in fact mean values from four distinct sets of N particle pairs are measured. 4-2

Quantum mechanical prediction within strongly objective interpretation

The quantum prediction for the CHSH function within the strongly objective interpretation is written strong = l ^ ( a , b ) - E * ( a , b ' ) + E+(a!,b) + E*(a',h')\.

(12)

This equation is usually directly evaluated by replacing each expectation value by the scalar product result of Eq. (6). This, unfortunately, is all too hasty. Indeed, in order to understand better the quantum mechanical meaning of equation (12), it is advantageous to take a step backward using equation (5): ^strong

(V>|(a L .a)(|(<7L.a)(<7R.b')|t/>) + (y>|(<7L.a')(|(<7L.a')(<7R

.b')|V) •

(13)

The four spin correlation observables in this equation are non commuting observables (this can be shown by calculating the commutator of ((7L.U)(
(14)

even if R, S,... are non commuting observables. However, as was stressed by d'Espagnat, 11,16 quantum mechanics is only a weakly objective theory, and expectation values given by quantum mechanics are also weakly objective statements, that is to say, statements relevant to observations, so that when

34

R, 5 , . . . are non commuting observables, the expectation values cannot be simultaneously relevant to the same set of N systems: each expectation value is necessarily relevant to a distinct set of JV systems. Therefore, the only possible meaning of equation (13) is weakly objective, not strongly objective as desired. Of course, this does not imply that Quantum Mechanics cannot provide any meaning at all for the CHSH function; it implies only that this meaning cannot be strongly objective. Since the local realistic inequality SgtT0 cannot be compared with any strongly objective prediction given by Quantum Mechanics, Bell's Theorem cannot be verified with a strongly objective interpretation given to the CHSH function. Hence, there is no choice but to rely on the weakly objective interpretation in order to compare hidden-variables theories and Quantum Mechanics. 5 5.1

Weakly objective interpretation Quantum mechanical prediction within weakly objective interpretation

It was shown in Section 3 that strong objectivity and weak objectivity pertain to different physical systems. This difference should therefore appear in the relevant equations. Indeed, the correlation expressed in Eq. (6) is relevant to spin measurements performed on particles that once constituted a single parent particle. Yet, two particles issued from two distinct parents never have interacted with each other, so that spin measurements performed on such particle pairs can not be correlated. Hence, if left and right spin measurements are performed on two distinct sets of N particle pairs, instead of the same set, there should be no correlation, and this property should appear in a generalised spin correlation function (i.e. generalised to the case of spin measurements performed on different sets of particle pairs). This can be easily done within a quantum theoretical framework by means of a distinct EPRB space for each set of N particle pairs. Let Hj be the EPRB Hilbert space associated with the jth set of particle pairs. In this Hilbert space, the EPRB gedanken experiment is represented by the singlet state \ipj) (see Section 2),

|V;) = ^[l+>;®|->;-|->;®|+>,-].

(15)

The whole CHSH experiment with the four sets of particle pairs can be expressed then in terms of a new tensor product space W1234 = %i ® %2 ® %3 ® "HA in which the state vector is 1^1234) = |Vl) ® 1^2) ® |^s) ® |^4>-

(16)

35

The counterparts of observables in 7^1234 are obtained as in Section 2.1. For instance, the observable pertaining to the right Stern-Gerlach device for the 2nd set of particle pairs is a2,R - u = Ii ®

(CTR

• u) <8> I3 ® I4,

(17)

where Ij is the identity operator of the EPRB space Hj. Hence, the expectation value of the product of two spin observables, the first belonging to the fcth set and the second to the Zth set, is Eft{u,

V) = (V>1234|(1234),

(18)

and this is the generalised expectation value of spin correlation observables that was sought. The expectation value for measurements performed on the same set (k = I) of particle pairs is already known, Eq. (6), and E^k(u, v) should provide the same result. Indeed, using Eqs. (16) and (17) leads to <(u,v) =

-u) • K - v)\rpk) = - u v ,

(19)

but when k ^ I, the result is quite different: J3*(u,v) = (V-fcKot - u ^ X V - z I K -v)hM = 0,

(20)

in accord with Eq. (4). There are indeed no correlations between two sets of particle pairs, as stipulated in the beginning of this section. Now, contrary to what was done in Section 4.2, it is possible to proceed here in full accord with the quantum mechanical postulates, because the spin correlation observables as the one given in Eq. (17), are mutually commuting, so that a linear combination of these commuting observables is an observable as well. The CHSH experiment can therefore be described by a new observable Sweak = (<7l,L • a)(ai,R • b ) - (
+(o-3,L-a')(
(21)

and the quantum prediction for the CHSH function within a weakly objective interpretation is therefore obtained by calculating the expectation value of the observable 5 wea k when the system is in the quantum state 1^1234) : Sweak = (^1234|5weak|V'1234) ,

(22)

which using Eqs. (17), (18), and (19) is S L k = S f 1 ( a , b ) - ^ 2 ( a , b ' ) + ^ 3 ( a ' , b ) + E/ 4 (a',b') .

(23)

36 This equation is not ambiguous (as was Eq. 12): it is a linear combination of expectation values, each relevant to a distinct set of N particle pairs. This equation is therefore weakly objective, as requested. Finally, using Eq. (19), yields weak

(24)

a • b - a • b ' + a' • b + a' • b '

with a well known maximum equal to (25)

max(5*Bak)=2>^.

This numerical result is indeed the one given in the literature, the only difference here being the fact that the meaning of this result is unambiguously weakly objective. Quantum Mechanics, which is a weakly objective theory, n provides a clear answer to the CHSH function understood as a weakly objective question. 5.2

Local realistic inequality within weakly objective interpretation

The last step consists in comparing the quantum prediction S^eak with its local realistic counterpart S^eak. As was stressed in Section 3, the j t h set of particle pairs must be characterised by a distinct set of hidden-variables parameters [Xji; j = 1 , . . . ,N}. Hence, to the generalised expectation value of the spin correlation observable Eq. (18) corresponds the generalised mean value of joint spin measurements: 1 N M£(u,v) = - J > ( u , A M ) B ( v , A M ) ,

(26)

which is a priori capable of reproducing not only the k — I prediction, Eq. (19), but also the k ^ / prediction, Eq. (20). The local realistic CHSH function with a weakly objective interpretation is therefore 9P

= M f t f o b ) - M 2 " 2 (a,b') + M 3 ' 3 (a',b) + M 4 ' 4 (a',b') ,

(27)

weak

and that is explicitly 5

i 1

weak = b

N

E

[ ^ ( a , A M ) £ ( b , A M ) - >l(a,A 2li )B(b',A 2ii ) +A(a!, \3,i)B(h,

A3,i) + A{B!, A 4 ,i)B(b' l A4]i)

]

(28)

37

This expression is to be compared with the one pertaining to the strongly objective interpretation (Section 4.1), which contained terms that could be factored. Here, since each term is different from the others, no factorisation is possible; i.e., there is no way to derive a Bell inequality7—this is not the first time this fact has been noticed, unfortunately, no conclusion was drawn then. Yet, this fact cannot be ignored, for it has been shown in Section 4 that Bell's Theorem cannot be demonstrated within a strongly objective interpretation. Here, the only local realistic inequality that can be derived is obtained by considering—as was done with Eq. (10)—the possible numerical values of each term of the summation in Eq. (28), for which the extrema are +4 and -4, so that the narrowest local realistic inequality that can be derived from Eq. (28) is nothing but ^eak<4-

(29)

This most restrictive local realistic inequality (which can also be found in Accardi 17 ) is not incompatible with the quantum mechanical prediction, as the maximum of S„ e a k is 2-\/2. This shows that experiments intended to test Bell's Theorem were unfortunately not testing the strongly objective inequality, Eq. (11)—which is a Bell inequality—, but this weakly objective one, Eq. (29), since all experimental tests necessarily are executed in a weakly objective way, due to the irreducible incompatibility between spin measurements. As was stressed by Sica 18 and Accardi, 17 a local realistic inequality is nothing but an arithmetic identity, and inequality (29) is definitely too lax to be violated by experimental tests. 6

Conclusion

It was shown that Bell's Theorem cannot be derived, either within a strongly objective interpretation of the CHSH function, because Quantum Mechanics gives no strongly objective results for the CHSH function (see Section 4.2), or within a weakly objective interpretation, because the only derivable local realistic inequality is never violated, either by Quantum Mechanics or by experiments (see Section 5.2). It was demonstrated that the discrepancy in Bell's Theorem is due only to a meaningless comparison between S^trons < 2 and 5^ e a k = 2\/2, where the former is relevant to a system with Nf degrees of freedom, whereas the latter to one with 4Nf (see Section 3). The only meaningful comparison is between the weakly objective local realistic inequality 5^ e a k < 4 and the weakly objective quantum prediction S„ e a k = 2^/2, but these results are not incompatible. Bell's Theorem, therefore, is refuted.

38

References 1. J. S. Bell, Physics 1, 195 (1964). 2. F. Selleri, Le grand debat de la mcanique quantique (Champs Flammarion, Paris, 1986). 3. A. Aspect, Nature 398, 189 (1999). 4. A. Einstein, B. Podolsky, and N. Rosen, Phys. Rev. 47, 777 (1935). 5. D. Bohm, Phys. Rev. 85, 166 (1952). 6. D. Greenberger, M. Home, A. Shimony and A. Zeilinger, Am. J. Phys. 58, 1131 (1990). 7. A. Bohm, Quantum Mechanics, Foundations and applications (SpringerVerlag, New York, 1979). 8. J. S. Bell, in Proceedings of the international School of physics 'Enrico Fermi', course IL: Foundations of quantum mechanics (Academic, New York, 1971), p. 171. 9. J. S. Bell, Epistemological Letters, p. 2 (July, 1975). 10. J. F. Clauser, M. A. Home, A. Shimony and R. A. Holt, Phys. Rev. Lett. 23, 880 (1969). 11. B. d'Espagnat, Veiled Reality: An Analysis of Present Day Quantum Mechanical Concepts, (Addison-Wesley, 1995). 12. B. d'Espagnat, http://arXiv/abs/quant-ph/9802046. 13. A. Aspect, J. Dalibard, and G. Roger, Phys. Rev. Lett. 49, 1804 (1982). 14. A. Khrennikov, http://arXiv/abs/quant-ph/0006017. 15. J. von Neumann, Mathematical Foundations of Quantum Mechanics (Princeton University Press, 1955). 16. B. d'Espagnat, Conceptual foundations of Quantum Mechanics, (W.A. Benjamin, Massachusetts, 1976). 17. L. Accardi, http://arXiv/abs/quant-ph/0007005. 18. L. Sica, Opt. Commun., 170, 55 (1999).

39 P R O B A B I L I T Y CONSERVATION A N D T H E STATE DETERMINATION PROBLEM S. AERTS Free University of Brussels Triomflaan 2, Brussels, Belgium E-mail: [email protected] The problem of finding an operational definition for the wave vector is briefly examined from a historical point of view. Led by an old idea of Feenberg, we integrate the one dimensional probability conservation equation to obtain a closed formula that determines the state vector in the spinless case. The formula that determines the state does not depend on the (real) potential, external fields having their influence on the state only through the time derivative of the probability density function in position space. We apply the method to the simple case of a free Gaussian wave packet. Some problems regarding the operational status of the quantities involved are discussed.

1

Introduction

It is well known that Heisenberg constructed the matrix formulation of quantum mechanics by keeping in close accordance with what might be labelled the 'principle of operationality'. Roughly, one can describe this principle as a determination to introduce only measurable quantities. Schrodinger, more concerned with 'anschaulichkeit' than operationality, introduced rather unscrupulously the concept of a wave function. He initially interpreted the wave function as a charge density in space, but this interpretation is difficult to extend to several particle problems a . The interpretation that would stand the test of time, as testimonied by it being awarded the Nobel prize in 1954, was due to Born. In analogy with the theory of electro-magnetic radiation, in which the intensity is the square of the amplitude, Born took the step to interpret the intensity of an electro-magnetic wave in a given region of space as proportional to the relative frequency of a photon detection in that region and the probabilistic interpretation was born. However, this correspondence still doesn't make it an operational quantity, as for every density p(x, t) there are infinitely many 4>(x,t) such that, with ip(x,t) = ^p{x,t).el^x't\ we get ip*(x,t)ip(x,t) = p(x,t). The problem is then to find suitable functions that we can approximate experimentally in a statistical way, that in some well chosen combination yield the same information as the complete wave function. In order to make the question mathematically more precise, Prugovecki 2 introa

For a rescue attempt of the original Schrodinger interpretation, see Dorling1.

40

duced the notion of "informational completeness". A family T = {Oi\i € 1} of bounded operators on a Hilbert space ~H is called informationally complete iff for every two density operators p and p' the equality Tr(pOi) = Tr(p'Oi) implies p = p''. This definition implies that the set of expectation values of an informationally complete set of operators, allows only one state operator from which the expectation values could have been derived. What characterizes such a set? In a classical statistical framework, we can calculate all macroscopic quantities from a single density function p(p, q) in phase space. Hence, by analogy, one is naturally led to the following interesting question, originally due to Pauli 3 : Is it sufficient to know the probability density functions of position and momentum to determine unambiguously the quantum mechanical state of the physical system? In the quantum mechanical case, it is sufficient to know the wave function in coordinate space ip(x,t), since the corresponding wave function for the same system in momentum space ip(p,t) is given by its Fourier transform. Hence we can phrase the problem in a more mathematical way: is it possible to determine a square integrable function uniquely from both its modulus and the modulus of its Fourier transform? Possibly the first non-trivial counterexamples came from Bargmann b who constructed explicit examples of wave functions V'l and ip2, that give rise to the same probability distributions for position and momentum, but give a different probability distribution for a third operator that does not commute with the position or momentum operator. This leads to the remarkable conclusion that the wave function in its coordinate representation contains more information than the corresponding probability densities in position and momentum together. Due to Bargmann we know the answer to be negative in a physically relevant way c and what is now commonly referred to as the Pauli problem is either the problem of determining the set of states that share the same modulus and the modulus of their Fourier transform, or the problem of finding a set of observables that are informationally complete. The problems are related but not identical, and we prefer to refer to the first version of the problem as the Pauli problem, and to the second as simply the state determination problem. It seems much more work has been done on the state determination problem, which isn't surprising given the fact that the Pauli problem is a special case of it. With the exception of the production of counterexamples such as Bargmann's, the first instructive results regarding the Pauli problem were obtained only in

Bargmann never seems to have published these results himself, and as a result, little reference is given to his work in the literature. However, the examples can be found in Reichenbach4. c T h e problem re-appeared unaltered in the 1958 edition of Pauli's book, more than a decade after the first counterexamples.

41

1978 by Corbett and Hurst 5 . In their paper they construct physically important classes of functions that are uniquely determined by their position and momentum distributions. However, they also show there exist dense subsets of states that are not uniquely determined by their position and momentum distributions and, as a consequence, any state can be approximated, in norm, by a non-unique state. Extensions, comments and counterexamples to their work can be found in Friedman 6 and Pavicic 7 . Nevertheless, the complete characterization of the set of states that share modulus and the modulus of their Fourier transform is still open. As for the state determination problem, we can split the work into those who were primarily concerned with establishing a set of observables that is informationally complete (or disproving a certain set to have this property), and those that set out to characterize such sets. The first group includes Feenberg 8 (1933), Moyal 9 (1949 ), Gale, Guth, and Trammell (1968) 10 , Band and P a r k 1 1 1 2 13 (1970-1971), and many more 1 4 15 16 . We will not go into the reconstruction of the state by placing the entity in different potentials, a method pioneered by Lamb 1 7 and one that inspired many similar approaches such as Wiesbrock 18 and Weigert 19 nor will we mention the vast literature pertaining to the measurement of the Wigner distribution, known as phase-space tomography. However, concerning the characterization of informationally complete sets we cannot help but make the following elementary remarks. Suppose we have a non-trivial (i.e., not a multiple of the identity) self-adjoint operator A that commutes with every member of a set of operators S in a Hilbert space 7i. It is well known that the one parameter family of unitary operators exp(itA) also commutes with every element of <S. Now take any xj) that is not an eigenvector of A. For any observable in S, the state ipt — exp(itA)tp gives the same expectation value for this operator, whatever numerical value t has. But if t ^ s it follows that ipt ^ Vs (for the relation of this with superselection rules, see Wick, Wightman and Wigner (1952) 2 0 , Emch and Piron (1963) 21 and Piron 2 2 ). Hence S is not an informationally complete set of observables. So a necessary condition for a set of observables to be informationally complete is maximality in the sense of Dirac, in other words, that there be no other non-trivial operator that commutes with every member of the set. However, this is far from sufficiency. As Bush and Lahti 2 3 have shown, it is easy to derive d from the considerations above that no commuting set of observables is informationally complete! Maximal commuting sets of observables serve as a means of state preparation, not state identification. This means that, at least for for continuous variables, the Pauli set {P, Q} is in a certain sense the minimal set that one could possibly hope to be informationally complete (although Bargmann has shown this in general not One arrives at this result by allowing A to be a member of S.

42

to be the case). 2

Conservation of Probability

What we will present in this article is an elaboration on the reasoning followed by Feenberg. Consider the time-dependent Schrodinger equation in tp with a r e a l e potential V and using the shorthand tp for ip(r, t): ~ = -h/2imV2tp +^rVip at in Multiply by tp* and add this to the complex conjugate of the above equation multiplied by ip. After some elementary vector operator manipulation, we find what is commonly known as the conservation law of probability:

Substitution of the 'polar representation' of the wave vector

iP(r,t) = y/fafie*'™ (ip assumed real) into the former equation yields a second order partial differential equation, which is in fact a Fokker-Planck equation with zero diffusion coefficient and the phase serving as a a potential:

Feenberg's argument is a uniqueness result based on this last equation. It amounts to showing that any two phase functions that satisfy this equation and some gentle boundary conditions differ by at most a constant. His 1933 thesis is hard to get hold of, but the argument was (erroneously 1015 ) extended by Kemble 24 to three spatial dimensions in his much easier to find handbook on quantum mechanics . What we will do here, is go back to the original one dimensional idea, but rather than trying to establish a uniqueness result, we will show that in this simple case a solution can be obtained by direct integration. 3

Determination of the phase function

So p and ip satisfy the conservation law as given by the last equation. Rewriting this equation in one dimension, evaluated at a specific time instant t = to gives us: e The imaginary part of a complex potential can be used to mimic creation and annihilation effects. Although this is sometimes a useful approximation, such results violate the continuity equation, and for a more reliable analysis, one should really use a second quantized theory.

43

, ,<9V , dp(x,t0)dip p{x to) +

mtdp{x,t)

—dx—Tx + -n{—m-]t^

' w

=°

Assume for the time being that p(x, t0) ^ 0, and divide the equation by p(x, t0): d2(p ~dtf

+

dinp(x,t0) dx

dip m dlnp(x,t). ~5x~+ J{ dt

h=t0

_ ~

Assuming po{x) and its time derivative to be known functions, we can solve for the unknown phase
As all quantities are evaluated at the same time instant t = to, we will not bother to give further notational reference to this fact. In what follows, we will also abbreviate (with abuse of language) ( a i n P( x ' f )) f = t o a s dtlnp(x). Applying these transformations the equation becomes: ^

+ f(X)(f> = g(X)

So we have transformed the second order partial differential equation into an ordinary, first order, linear differential equation with a source g(x) at a fixed time instant. The solution of the homogeneous equation is: (x) = 4>h{x)(c + $x g(s)p(s)ds). We have to integrate this result once more to get
/ =

rr

4>h{r)(c+ I J

p~(7)[c+J

= J (c+-J 4

g(s)p(s)ds)dr J

P(s)dtlnp(s)ds]

dtP(s)ds)W)

Validity and range of applicability

The solution is seen to be a two parameter family of curves, one for every value of the constant c, and one for every lower limit, say x$, of the r integration ' . The result of changing the lower integration limit is only the addition •'The lower limit of the s integration is absorbed in the constant c.

44

of an overall constant to tp(x,t). Because we know the quantum mechanical expectation values and probabilities to be invariant under such an addition, we set this constant equal to zero. The value of the constant c can potentially affect the phase in a more profound way. Depending on the particular p(r, t) used, / pfriy m i g n t diverge when p(r, t) is zero for some value(s) of r or, even worse, for some Ar. First of all, we assumed in our derivation that p(r, t) ^ 0, but this restriction can easily be removed. Indeed, suppose we have n places xn where the density does equal zero. A solution ipi is then obtained for each interval ]x{, Xi+\ [ by means of our equation. The total solution ip is obtained by pasting all the ipi together by requiring continuity of if; and V^- 9 • Now continuity of ip and VV> implies continuity of their respective complex conjugates and hence of p and Vp. If we are to infer the phase from actual data, it seems reasonable to require (p also to be continuous. In fact, the conservation equation requires it to be twice differentiable. If any cutting and pasting is necessary to obtain the solution, we can easily see that the constant c should be the same for any two pasted pieces. Hence, if the cut is applied at a pole, c has to be zero h for

V(x,t0)

= y/p(x,t0)exp(i—

rx /

rr

fo . /

dtp(s,t0)ds)

Note that the state does not contain reference to the potential. External fields will show up in the state indirectly as a consequence of the time dependence of p. The assumptions that underlie the derivation of the equation are: a spinless, one dimensional particle that acts under a real potential V, being prepared in a pure state. In short, all that is required for a particle to obey the one dimensional dynamical Schrodinger equation. However restricted this class is, it does include many examples that can be found in standard textbooks on quantum mechanics. Comparing the result we have found to those in the literature, we find the closest match with a result obtained by Gale, Guth and Trammel 1 0 . They apply the definitions of p(r) and j(r) to show that knowing these is sufficient for the determination of the phase. They then discuss a gedanken experiment 9

This continuity demand is in fact a necessity because the validity of the equation of probability conservation (and a fortiori of the Schrodinger equation) requires xjj and Vi/> to be continuous. A notable but unproblematic exception is that of an infinite potential step. h t h e value of c might be non-zero in applications where the continuity equation only expresses conservation of the probability flux in some intermediate region, the boundaries (possibly at infinity) containing 'sinks' or 'sources' of probability.

45

for establishing the probability current by measuring the expectation of the velocity and argue, by means of this experiment and an intuitive argument, that the current j(r) equals p(r) < v(r) > for some r inside a small space region that is supposed to contain the particle. Our result was obtained by a direct integration and, as a consequence, is exact. It is however difficult to extend to higher dimensions because of two reasons. The first is the fact that the expression for the probability current in the presence of a vector potential becomes J(x,£) = Re{ip*(x,t)[p/m— (q/mc)A]ip(x, t)} and, depending on the form of the vector potential, it is not obvious to what function of the phase this corresponds. If the vector potential corresponds to a uniform magnetic field, or in absence of a vector potential (in which case one can transform the equation into a Poisson equation) one can solve the continuity equation by employing standard techniques. However, one then encounters a second problem. Providing an initial value for the phase (which is unproblematic as the phase is only determined within an additive constant) is no longer sufficient, instead we need an initial boundary function. Hence we have to resort to other principles to determine the phase on such a boundary in order to solve the problem. Of course, the principle of conservation may still serve the purpose of reducing the family of admissible functions for the phase of the amplitude. We will now illustrate the principle by applying it to a Gaussian wave packet. Later we will expound a few operational issues regarding the quantities involved in the solution given above. 5

Evolution of a Gaussian Wave Packet

The full, time dependent wave function for a free Gaussian wave packet is:

*c.o = <MA*)Sr''<
-x2/4(Ax)l + ik0x - ik2,Ht/2m J 1 + iht/2m(Ax)20

From this we easily calculate p(x,t): p(x,t)

=

tp{x,t).ip*(x,t) iv,,

/A

N2W-,

h2t2

N-,-1/2

r

-(x + k0ht/m)2

.

Now assume we did not know the wave function, only the probability density and its time derivative at some time instant t — 0. In an abbreviated

46

form (with easy identification of the coefficients) we can write the probability as:

*,,) =*

tf)-/»«p[-JE±|£]

+

At time t = 0 this gives us: p(x,0) — aexp(—^-) The derivative of p with respect to the time parameter:

**'•«» -

4i< 1 + 6 , 2 >~ 1 / 2 e x p <-|r^)>]'= CX

, 2a

=

X2

exp

~ ~d (~~j)

So the phase becomes:

~2d-hJ m fx

=

=

f

p(r,0)

fr

S

J

„2

2

V

• v

sexP(--)dsexP(-)d, v^

r2

kohm T~x m n kox

which is precisely the desired phase of the wave function at t = 0 6 Operational Issues Expounding Feenberg's uniqueness result, Reichenbach points out that we can recover the phase by numerical computation if we know p(x, to) and dtp(x, t) \t=t0 • In order to establish these quantities, Reichenbach outlines the following procedure 4 . We take an ensemble A of identically prepared systems such that the ensemble can be properly described by a pure state if>. Now select at random two sub-ensembles from A, say B and C. For each system in B we measure at the time to the value of a;. As the results will vary, we obtain in this way a distribution p(x,to)- Likewise, for each system in C, we we measure at the time ti the value of x, obtaining a distribution p(x,ti). The quotient p(x,t0) - p(x,h) h — to

47

is then supposed to approximate dtp(x,t) for t € [to,h] if the interval [to,h] is chosen sufficiently small. The wave function can then be obtained through numerical approximation and represents the state of the systems that are left untouched in the original ensemble A. There is a problem with Reichenbach's procedure for determining these quantities that is of equal concern to our method. Despite the fact that it is entirely possible to position the detector wherever one wants it to be, hence effectively controlling x in p(x,t), it is an annoying peculiarity of quanta that one cannot determine when a detection will take place. One places a detector and simply waits for a detection count to happen. The problem seems related to what Mielnik has called "the screen problem" in a provocative and enlightening paper by the same name 2 5 . As Mielnik points out "experimentalists perform a lot of experiments, but none resembling an instantaneous check of particle position". Indeed, a measurement setup typically consists of a source, that what is emitted undergoes a series of transformations (i.e., an optical bench or a potential) and is subsequently detected by a fixed detector, or a set of fixed detectors. If we are to describe operational means of measuring densities at some time instant, we will have to do so by such a typical setup. To produce anything remotely satisfactory, we will need a few assumptions. A first assumption is that if a particle is detected at some time instant to in position x, the intricate mechanism between the measurement apparatus and the particle that is responsible for its detection does not depend on to and in this sense, has no effect on the value of p(x,t). However unnatural the assumption might be from a physical point of view, it seems to underlie the statistical interpretation of fn \^{x, t)\2dV as an instantaneous localization probability of the system in a state ip in a space region fi and at a time instant t. In so far as our analysis depends on this assumption, so does the standard interpretation of quantum mechanics. The next assumption is that we are able to control the release of the particle in a certain state within a sufficient small time interval At such that, within this small time interval, the density can reasonably be approximated by a linear function. This can be achieved by placing a shutter mechanism behind the source. Naturally, the shutter opening time has to be substantially less than the coherence time of the particle. A sufficiently short opening time can only be established by experiment, and one can never be quite sure if there would still be more oscillations on a much shorter time scale. A density function with a larger variation will be harder to approximate as it requires a shorter shutter opening time and hence will result in a lower detection rate. The wave packet then participates in the transformations we may have set up (optical bench, Stern-Gerlach,...) and is detected. The time interval between the shutter release and the detection time is noted together with the position of the detector.

48

After many of such recordings, we gather all the data to reconstruct p(x,t). How many samples do we need? Well, if the samples were taken at equidistant At and Ax, we could do a Fourier synthesis and apply the Shannon-Whittaker sampling theorem. However, due to the non-equidistant spreading of the tn, (at best following some statistical pattern), we need Frame Theory (Duffin and Schaeffer26) to reconstruct band limited signals / from irregularly spaced samples {f(tn)}. The derivative with respect to time can then be derived from the reconstructed signal and the phase derived by means of the proposed equation. Acknowledgments The author wishes to acknowledge a helpful discussion with John Corbett regarding the subject of this paper. References 1. J. Dorling, Schrodinger, Centenary celebration of a polymath , eds. C.W. Kilmister (Cambridge, 1987). 2. E. Prugovecki, Int. J. Theor. Phys., 16, pp 321-331, (1977). 3. W. Pauli, Encyclopedia of Physics, Vol V, p.17 (Springer-Verlag, Berlin, 1958). 4. H. Reichenbach, Philosophic Foundations of Quantum Mechanics, (University of California Press, 1948). 5. J.V. Corbett, C.A. Hurst, J. Austral. Math . Soc, B20, 182-201, (1978). 6. C.N. Friedman, J. Austral. Math . Soc, B30, 298, (1987). 7. M. Pavicic, Phys. Lett. A, 122, 280, (1987). 8. E. Feenberg, The Scattering of Slow Electrons in Neutral Atoms, Thesis, Harvard University, (1933). 9. J.E. Moyal, Proc. Cambridge Phil. Soc, 45, 99, (1949). 10. W. Gale, E. Guth, and G.T. Trammell, Phys. Rev. A, 165, 1434-1436, (1968). 11. W. Band, J. Park, Found. Phys., 1, No 2, pp 133-144, (1970). 12. J. Park, W. Band, Found. Phys., 1, No 4, pp 339-357, (1971). 13. W. Band, J. Park, Am. J. Phy. , 47, pp 188-191, (1979). 14. A. Royer, Phys. Rev. Lett, 55, pp 2745, (1985). 15. A. Royer, Found. Phys., 19, 3, (1989). 16. W. Stulpe, M. Singer, Found. Phys. Lett, 3, 153, (1990). 17. W. E. Lamb, Phys. Today, 22(4), 23, (1969). 18. H.-W. Wiesbrock, Int. J. Theor. Phys., 26, pp 1175, (1987). 19. S. Weigert, Phys. Rev. A., 45, pp 7688-7696, (1992).

49

20. G.C. Wick, A.S. Wightman, E.P. Wigner, Phys. Rev., 88, pp 101-105, (1952). 21. E.C. Emch, C. Piron, J. Math . Phys., 4,pp 496-473, (1963). 22. C. Piron, Helv. Phys. Acta, 42, pp 330-338 ,(1969). 23. P. Bush, P.J. Lahti, Found. Phys., 19, pp 633, (1971). 24. E.C. Kemble, New York, MacGraw-Hill, (1937). 25. B. Mielnik, Found. Phys., 24, 8, pp 1113-1129, (1994). 26. R.J. Duffin, A.C. Schaeffer, Trans. Amer. Math. Soc., 72, 341-366 (1952).

50 EXTRINSIC A N D INTRINSIC IRREVERSIBILITY I N P R O B A B I L I S T I C D Y N A M I C A L LAWS

H. ATMANSPACHER Institut fur Grenzgebiete der Psychologie und Psychohygiene, Wilhelmstr. 3a, D-79098 Freiburg, Germany, E-mail: [email protected] and Max-Planck-Institut fur extraterrestrische Physik, D-85740 Garching, Germany R. C. BISHOP Institut fur Grenzgebiete der Psychologie und Psychohygiene, Wilhelmstr. 3a, D-79098 Freiburg, Germany, E-mail: [email protected] A. AMANN Universitatsklinik fur Anasthesie, Leopold-Franzens- Universitat, Anichstr. 35, A-6020 Innsbruck, Austria E-mail: [email protected] and Institut fur Allgemeine, Anorganische und Theoretische Chemie, Abteilung fur theoretische Chemie, Leopold-Franzens- Universitat, Innrain 52a, A-6020 Innsbruck, Austria Two distinct conceptions for the relation between reversible, time-reversal invariant laws of nature and the irreversible behavior of physical systems are outlined. The standard, extrinsic concept of irreversibility is based on the notion of an open system interacting with its environment. An alternative, intrinsic concept of irreversibility does not explicitly refer to any environment at all. Basic aspects of the two concepts are presented and compared with each other. The significance of the terms extrinsic and intrinsic is discussed.

1

Introduction

The relation between reversible, time-reversal invariant laws of nature and the irreversible behavior of empirical systems has been a long-standing problem in physics. In most standard approaches, fundamental dynamical laws such as in Newton's, Maxwell's, Einstein's or Schrodinger's equations describe the temporal evolution of isolated systems. Irreversible dynamical laws are typically regarded as emerging from the interaction between systems and their environment, i.e., from considering open systems. In contrast to this "extrinsic" conception of irreversibility, there is a group

51

of scientists who insist that some kinds of irreversibility are "intrinsic", i.e., some kinds of irreversible laws are fundamental. On this view, mainly advocated by Prigogine and colleagues in Brussels and Austin, the switch from extrinsic to intrinsic irreversibility goes along with a switch from particular kinds of deterministic descriptions to particular kinds of probabilistic descriptions. In general, the two viewpoints are considered to be distinct, sometimes even entirely incompatible. It is the main goal of this contribution to show that there are both differences and similarities between them. As a consequence it does not make too much sense to prefer one of them at the expense of the other. It is much more interesting to explore whether particular aspects of each of the two views can be constructively related to each other in order to increase our insight into the issue of irreversibility. In the following, both conceptions will be presented to some detail and compared. It is suggested that the distinction of ontic and epistemic categorial frameworks for some problems associated with irreversibility is particularly useful when focusing on a conceptual discussion. Such a distinction serves to clarify both common and distinct aspects of extrinsic and intrinsic irreversibility, and it helps to frame a number of open questions concerning them. In Section 2, ontic and epistemic descriptions are briefly introduced. We use an algebraic framework for this introduction since this has proven fruitful in related problem areas. Section 3 outlines some basic issues with respect to the ontic states of closed quantum systems and their time-reversal invariant dynamical evolution. Subsequently, two ways to conceive of extrinsic irreversibility are described. In one of them epistemic states are represented by (reduced) density operators, in the other they are represented by probability distributions of pure states. Section 4 presents the intrinsic conception of irreversibility. One major line of research in this regard deals with transformations from invertible K-systems to non-invertible exact systems, the other uses the concept of rigged Hilbert spaces to extend the state of a system beyond Hilbert space. Section 5 summarizes the main points and indicates some open questions. 2 2.1

Ontic and epistemic descriptions General issues

Can nature be observed and described as it is in itself, independent of those who observe and describe - that is to say, nature as it is "when nobody looks"? This question has been debated throughout the history of philosophy with no clear answer either way. Each perspective has strengths and weaknesses, and in each

52

epoch has had its critics and proponents. In contemporary terminology, the two perspectives can be distinguished as the topics of ontology and epistemology. Ontological questions refer to the structure and behavior of a system as such, whereas epistemological questions refer to knowledge (or information) about systems. In philosophical discourse it is considered a serious fallacy to confuse these two types of questions. For instance, Fetzer and Almeder emphasize that "an ontic answer to an epistemic question (or vice versa) normally commits a category mistake" 1 . Nevertheless, such mistakes are frequently committed in many fields of research when addressing subjects where the distinction between ontological and epistemological arguments is important. The ontic/epistemic distinction refers to states and properties of a system as such or in its relation to observers, hence it is an ontological distinction. 0 In physics, the rise of quantum theory with its interpretational problems was one of the first major challenges to the ontic/epistemic distinction. The BohrEinstein discussions in the 1920s and 1930s serve as a famous historical example. Einstein's arguments were generally ontically motivated; that is to say, he emphasized a viewpoint independent of observers or measurements. By contrast, Bohr's emphasis was generally epistemically motivated, focusing on what we could know and infer from observed quantum phenomena. Since Bohr and Einstein never made their basic viewpoints explicit, it is not surprising that they talked past each other in a number of respects 2 . Examples of approaches trying to avoid the confusions of the Bohr-Einstein discussions are Heisenberg's distinction of actuality and potentiality 3 , Bohm's ideas on explicate and implicate orders 5 , or d'Espagnat's scheme of an empirical, weakly objective reality and an objective (veiled) reality independent of observers and their minds 5 . Further terms fitting into the ontic side of these distinctions are latency 6 , propensity 7 , or disposition 8 . See also Jammer's discussion of these notions, including their criticism and additional references 9

A first attempt to draw an explicit distinction between ontic and epistemic descriptions for quantum systems was introduced by Scheibe 10 who himself, however, strongly emphasized the epistemic realm. Later, Primas developed this distinction in the formal framework of algebraic quantum theory 1 1 . The basic structure of the ontic/epistemic distinction, which will be made more precise below, can be roughly characterized as follows (for more details, the reader is referred to 1 1 ' 1 2 ): "On the other hand, the distinction between ontological and epistemological problems can be considered as epistemological insofar as both areas represent fields of (philosophical) knowledge.

53

Ontic states describe all properties of a physical system exhaustively. ("Exhaustive" in this context means that an ontic state is "precisely the way it is", without any reference to epistemic knowledge or ignorance.) Ontic states are the referents of individual descriptions, the properties of the system are treated as intrinsic •properties}' As an important example, ontic states refer to closed systems; they are empirically inaccessible. Typically, their temporal evolution (dynamics) is reversible and follows fundamental, deterministic laws. Epistemic states describe our (usually non-exhaustive) knowledge of the properties of a physical system, i.e. based on a finite partition of the relevant phase space. The referents of statistical descriptions are epistemic states, the properties of the system are treated as contextual properties. Epistemic states refer to open systems; they are, at least in principle, empirically accessible. Typically, their temporal evolution (dynamics) follows irreversible laws. The combination of the ontic/epistemic distinction with the formalism of algebraic quantum theory provides a framework that is both formally and conceptually satisfying. Although the formalism of algebraic quantum theory is often hard to handle for specific physical applications, it offers significant clarifications concerning the basic structure and the philosophical implications of quantum theory. For instance, the modern achievements of algebraic quantum theory make clear in what sense pioneer quantum mechanics (which von Neumann implicitly formulated epistemically 13 ) as well as classical and statistical mechanics can be considered as special cases of a more general theory. Compared to the framework of von Neumann's monograph 1 3 , important extensions are obtained by giving up the irreducibility of the algebra of observables (not admitting observables which commute with every observable in the same algebra) and the restriction to locally compact phase spaces (admitting only finitely many degrees of freedom). As a consequence, modern quantum physics is able to deal with open systems in addition to isolated ones; it can involve infinitely many degrees of freedom such as the infinitely many modes of a radiation field; it can properly consider interactions with the environment of a system; superselection rules, classical observables, and phase transitions can be formulated, which would be impossible in an irreducible algebra of observables; there exist infinitely many representations inequivalent to the Fock ''In a more technical terminology, one speaks of "observables" (mathematically represented by "operators") rather than properties of a system. Prima facie, the term "observable" has nothing to do with the actual observability of a corresponding property.

54

representation; and non-automorphic, irreversible dynamical evolutions can be successfully incorporated and even derived. In addition to this remarkable progress, the mathematical rigor of algebraic quantum theory in combination with the ontic/epistemic distinction allows us to address a number of unresolved conceptual and interpretational problems of pioneer quantum mechanics from a new perspective. First, the distinction between different concepts of states as well as observables provides a much better understanding of many confusing issues in earlier conceptions, including alleged paradoxes such as those of Einstein, Podolsky, and Rosen (EPR) 1 4 . Second, a clear-cut characterization of different concepts of states and observables is a necessary precondition to explore new approaches, beyond von Neumann's projection postulate, toward the central problem that pervades all quantum theory: the measurement problem. Third, a number of much-discussed interpretations of quantum theory and their variants can be appreciated more properly if they are considered from the perspective of an algebraic formulation. One of the most striking differences between the concepts of ontic and epistemic states is their difference concerning operational access, i.e. observability and measurability. At first sight it might appear pointless to keep a level of description which is not related to what can be operationalized empirically. However, a most appealing feature at this ontic level is the existence of first principles and fundamental laws that cannot be obtained at the epistemic level. Furthermore, it is possible to rigorously deduce (e.g., to "GNSconstruct"; cf. 12>15) a proper epistemic description from an ontic description if enough details about the empirically given situation are known. These aspects show that the crucial point is not to decide whether ontic or epistemic levels of discussions are right or wrong in a mutually exclusive sense. There are always ontic and epistemic elements to be taken into account for a proper description of a system. This requires the definition of ontic and epistemic terms to be relativized with respect to some selected framework within a set of (hierarchical) descriptions (see 16 for details and examples). The problem is then to use the proper level of description for a given context, and to develop and explore well-defined relations between different levels. These relations are not universally prescribed; they depend on contexts of various kinds. The concepts of reduction and emergence are of crucial significance here. In contrast to the majority of publications dealing with these topics, it is possible to precisely specify their meaning in mathematical terms. Contexts, or contingent conditions, can be formally incorporated as topologies in which particular asymptotic limits give rise to novel, emergent properties unavailable without those contexts (see 15 for more details). It should also

55

be mentioned that the distinction between ontic and epistemic descriptions is neither identical with that of parts and wholes nor with that of micro- and macrostates as used in statistical mechanics or thermodynamics. The thermodynamic limit of an infinite number of degrees of freedom provides only one example of a contextual topology, others are the Born-Oppenheimer limit in molecular physics or the short-wavelength limit for geometrical optics. These examples indicate that the usefulness or even inevitability of the ontic/epistemic distinction is not restricted to quantum systems. It plays a significant role in the description of classical systems as well. More specifically, it has been shown in detail that for systems exhibiting deterministic chaos the distinction of ontic and epistemic descriptions is necessary if category mistakes and corresponding interpretational fallacies are to be avoided 17 . 3 3.1

Breaking Time-Reversal Symmetry: Extrinsic Irreversibility Time-Reversal Symmetry in Closed Systems

Let us start with a closed quantum system which can be considered without any reference to an environment. The pure state of such a system is an extremal positive linear functional on a C*-algebra A. The state € A*, where A* is the dual of A, is then called an ontic state of the closed system. If a Hilbert space representation of A is possible, <j> can be represented as a state vector ip G %, characterized by the expectation values < ip\Aip > of all observables A € A. Under particular conditions, the dynamics of is given by the time-reversal invariant Schrodinger equation. In the traditional Hilbert space representation, the algebra A of observables is irreducible; there are no commuting observables. Due to the Stone-von Neumann theorem, every representation of the canonical commutation relations is then equivalent to the Schrodinger representation. In the more general setting of a Fock space (sum of tensor products of one-particle Hilbert spaces), the same holds for Fock representations. A restriction of characterizes an individual, undivided whole not consisting of subsystems with their own ontic states. This is the level of description to which the notions of quantum nonlocality or quantum holism apply. Since the concept of an environment does not make sense for ontic states of closed systems, it is illegitimate to speak about their entanglement or interaction with another state. If one introduces a distinction (Heisenberg cut) to create subsystems in

56

a closed system, then these subsystems in general are open. For example, one can then consider an object entangled and/or interacting with its environment. The epistemic state r] of those subsystems can be represented in two conceptually different ways. 3.2

Density Operators as Non-Pure States

The first, more or less familiar representation of an epistemic state n is given by a (reduced) density operator D 6 M*, where M* is the predual of a W*algebra M of contextual observables. The expectation value of D is given by Tr.DM for observables M E M. The epistemic state n represented by D is a non-pure state. EPR-correlations between subsystem and environment are generic if the contextual algebra of observables is non-commutative. The term "contextual observables" derives from the fact that their construction requires the selection of a context defined by a subset of "relevant" observables B E B C A and a reference state (e.g., vacuum state, KMS state) distinguished by some appropriate stability condition. This context induces the weak closure of B and gives rise to a contextual topology in M.. If the context is known well enough, then the GNS representation is a powerful constructive tool to implement a proper contextual topology (see, e.g., 15 ). The dynamics of D is of Schrodinger type plus dissipative terms (e.g., a master equation), so that the time-reversal invariance of the Schrodinger equation can be broken 18 ' 19 . 3.3

Probability Distributions of Pure States

If the epistemic state r\ of an open system is approximately pure by a clever dressing of object and environment (b indicates bare objects and environments and d indicates dressed objects and environments), ri0i,j <8> Henv

= Hgbj <8>

nenv,

7] can be represented (estimated) by a probability distribution fj, of pure states. (A dressing procedure is clever if it minimizes EPR-correlations between object and environment, or if it maximizes the "integrity" of both object and environment 20 .) Hgbj is the proper Hilbert space for an approximately pure epistemic state 77. Although 77 can be uniquely extended to a normal state on M (represented by a density operator), the pure states and their distribution fi themselves do not make sense on M. The "relevant" observables are elements of a C*-subalgebra B C A.

57

The dynamics of p is of Schrodinger type plus stochastic terms (e.g., an Ito/Stratonovic equation), so that the time-reversal invariance of the Schrodinger equation can be broken. The stochastic aspect of the time evolution (of approximately pure states of the object) originates from the fact that the (initial) state of the environment cannot be determined and therefore must be treated as a stochastic variable. Starting from an initial pure state pa, one gets time-evolved states pt,u, where co is the stochastic variable. First steps of such an approach toward single open quantum systems, not based exclusively on decompositions of density-operator dynamics, were proposed in 2 1 ' 2 2 . For a large class of stochastic dynamics of approximately pure states of objects, one ends up with one particular distribution p^ of pure states in the limit t —> oo independently of the initial conditions (such dynamical objects are called ergodic). Splitting the underlying C*-algebra B into two subsystems with two C*-subalgebras B\ and B2, B = B\ ® B2, is then admitted under particular conditions. In an ideal situation all those pure states onto which the probability measures pt extend are product states with respect to the tensor product B = B\ ® $2- This situation never arises in practice, but "most" relevant pure states can be product states or almost product states, if the dressing tensorization is chosen appropriately 23 . 3-4

Dynamics of Measurement: a Simple Example

Any dynamical description of measurement has to start from a proper decomposition of a system into a dressed object and its dressed environment. It is crucial to keep in mind that such a decomposition is a logical precondition for the dynamics of measurement insofar as the Hamiltonian of the composed system needs to be written as a sum H = Hobi®l

+ l®Hmy+Hint.

(1)

An illustrative heuristic example has been extensively discussed by Primas 2 4 . Consider the simple case of a two-level quantum object (spin 1/2 system) with the Hamiltonian h

3

^ o b j ~ Tj/^yGu,

(2)

a sufficiently nontrivial boson field environment 3

-Henv = ^2^2ujka*kl/akv,

(3)

58

and an interaction 3

Hint = ^

<7„ (g> A„ ,

(4)

^kuOtkv + C.C.

(5)

where Av = ^ k

If such a decomposition has been properly carried out (cf. Sec. 3.3), then it is possible to derive the expectation values M(t) a(t)

= =

<Xt\A\Xt>

(6) (7)

with respect to the (approximate) product state

* t = v- t obj ®xr-

(8)

Corresponding to the product state \Pt, the C*-algebra of intrinsic observables in the composed system of dressed object and dressed environment is A = A0hi ®-4env

(9)

Aohi is the C*-algebra of 2 x 2 matrices and ^4 env is the C*-algebra of intrinsic observables of an environment with infinitely many degrees of freedom. The equations of motion for the expectation values M(t) and a(t) are given by: M(t)

=

M(t) x ft + M(t) x a(t),

(10)

"*!/(*)

=

-UkOLkv + -^>~kvMv{t) .

(11)

They describe the feedback between object and environment. More precisely, they describe the polarization M of the object under the influence of the environment and the motion of the environment observable a (boson operator) under the polarizing influence of the object. The solution of the second equation, referring to the observables of the environment (or the measuring system,

59 respectively) has a retarded and an advanced part: aTke; = fj

exp(-iLjkt)akl/{0) i r 2Xk" exp(-iuk(t

=

(12)

exp(-wt(t-s))M„(s)ds (t < 0),

(13)

s))Mv(s)ds

exp(-iujkt)akv(t) i

+

(t > 0),

-

f°

9 ^ /

A bidirectionally deterministic system can be described in terms of a superposition of a backward deterministic (forward non-deterministic) and a forward deterministic (backward non-deterministic) process which are equally relevant a priori. Selecting one of these solutions and disregarding the other requires the time inversion symmetry of the compound system to be broken. For this purpose, one can apply the principle of causality (past-determinacy, error-free retrodiction, no anticipation) as a "heuristic" argument for the selection of the retarded solution. It has been argued that the retarded, i.e., the backward deterministic, forward non-deterministic, solution is a K-flowc on a state space with infinitely many degrees of freedom 24 . In the simplest case, the relaxation time for this K-flow is the time constant r„ of an exponentially decaying correlation function (for details, see 24 ) Kv = ivexp(-\t\/Tv).

(14)

At this point we are still at the level of description of intrinsic observables needed for the specification of initial conditions of the K-flow. Conceptually, this K-flow represents a stochastic process which corresponds to chaos in the sense of Wiener 25 rather than chaos in the sense of Kolmogorov and Sinai (i.e., a dissipative dynamics). By introducing a context via a reference state, with respect to which stability in a particular sense (hopefully more general than thermal equilibrium) can be checked, one can proceed to (GNS-constructed) contextual observables. 3.5

General Features of Extrinsic Irreversibility

The breaking of time-reversal symmetry in the framework of extrinsic irreversibility corresponds to the conceptual transition from closed systems with c

Note that K-flows or K-systems play an important role in one of the approaches of intrinsic irreversibility (see Sec. 4.1). It would be interesting, but exceeds the scope of this paper, to explore the question of whether the process of measurement as described here can be conceived as intrinsically irreversible. In this respect, see, e.g., 2 6 .

60

ontic states to open systems with epistemic states. Such a transition can be understood by dividing a closed system into open, more or less EPR-correlated subsystems (e.g., object and environment), and by selecting a subset of "relevant" observables. The proper state concepts are epistemic. There are then two different statistical representations for different epistemic state concepts. A /^-statistical representation expresses a probability distribution of pure states, whereas the usual /^-statistical representation focuses on reduced density operators. The interaction of the open subsystems is described by dynamical laws different from the time-reversal invariant dynamics of a closed system. Breaking the time-reversal invariance of a unitary group evolution generates two semigroups, which can be endowed with two arrows of time opposite to each other. It should be pointed out that the forward arrow cannot be selected by physical reasons alone. Extra-physical arguments such as consistency with experience, causality, etc. must be invoked. 4

Breaking Time-Reversal Symmetry: Intrinsic Irreversibility

In contrast to the extrinsic concept of irreversibility, there is an alternative concept of intrinsic irreversibility, mainly advocated by Prigogine and collaborators (more recently also by Bohm). They propose describing states of any system generically with distributions p (i.e., probability distributions or density operators). The claim is that the state p of systems beyond a particular degree of complexity evolves irreversibly by itself, i.e., without any relationship to an environment. There are essentially two lines of research pursuing this proposal. 4-1

A-Transformation

from K-Systems

to Exact Systems

The notion of the A-transformation has been developed by Misra, Courbage and Prigogine in the 1970s. It is essentially based on the theory of ergodic systems. In particular, the concept of Kolmogorov systems, briefly K-systems, is of central significance in this context. Definition 1 27 : Let (X, A, n) be a normalized measure space and let S : X —> X be an invertible transformation such that S and 5 _ 1 are measurable and measure preserving. The transformation S is called a K-automorphism if there exists a cr-algebra A0 such that the following three conditions are satisfied: (i)S-1(A0)cA0; (ii) the cr-algebra f l ^ L o ' - ' " " ^ 0 ) i s trivial (i.e., contains only sets of measure

61

1 or 0); (hi) the smallest cr-algebra containing \J™=0Sn(Ao) is identical to A. Another way to characterize (classical) K-systems is by way of the existence of positive Ljapounov exponents, equivalent to a strictly positive KolmogorovSinai entropy. The properties of K-systems imply mixing and ergodicity. Ksystems are invertible transformations, hence their deterministic dynamics, given by p(t) = Ut p(0), is reversible (Ut is a unitary evolution operator acting on p). A standard example is the (2-dimensional) baker transformation. Another important class of mixing systems refers to so-called exact systems. Definition 2 27 : Let (X\A,p) be a normalized measure space and let S : X —t X a, measure preserving transformation such that S(A) £ A for each A £ A. If l i m ^ o o = p(Sn(A)) = 1 for every A € A, p(A) = 1, then S is called exact. Exact systems are represented by non-invertible transformations, hence their stochastic dynamics, given by p(t) = Wt p(0), is irreversible. Wt is a semigroup evolution operator acting on a distribution p rather than p. For instance, an exact system obtained from the baker transformation is the dyadic transformation S(x) = 2x (mod 1). A theorem by Rokhlin 28 says that every exact system is the factor of a K-system. This means that K-systems can be transformed into exact systems by their projections (or "factors", see 2 7 ). More generally, a factor of a Ksystem can be obtained by restriction to dilating fibers or unstable manifolds. Hence it is intuitively clear that the invertibility of a K-system gets lost by its transformation into an exact system. According to Misra et al. 2 9 ' 3 0 ; the relations between the two kinds of dynamics Ut and Wt and the two state concepts p and p are provided by a similarity transformation A according to Wt =

AUtA-1 p = Ap.

Wightman's question 31 as to the meaning of p in his review of30 gets an immediate answer if one applies Rokhlin's theorem to construct A (cf. 3 2 ) . The transformed distribution p is the projection of p onto a dilating subspace. This can easily be seen for the examples of the baker transformation and the dyadic transformation. In the more complicated case of continuous-time nonlinear (hyperbolic) systems, the corresponding procedure would be a projection onto the unstable manifolds, i.e., those directions along which the Lyapunov expo-

62

nents are positive and add up to the Kolmogorov-Sinai entropy (cf. 33>34). As an important conceptual feature, such projections select a time direction. A crucial formal feature associated with the irreversibility due to Wt is that a properly constructed A (and hence A[/ ( A _ 1 ) preserves the positivity of the state distributions only for positive times. A conceptual discussion of this point can be found in 3 5 . For a more detailed, formal account of the role which positivity preservation plays in the transformation between irreversible semigroups and chaotic dynamics see 36 and references given there. 4-2

Rigged Hilbert Space Representation

Intrinsic irreversibility has also been implemented in an approach based on an extension of the usual Hilbert space representation of the state of a system. This approach makes use of the so-called rigged Hilbert space (RHS) construction first introduced by the Russian mathematician Gel'fand and his collaborators 37 . Roberts 3 8 and Bohm 3 9 independently showed how Dirac's formalism could be justified with complete mathematical rigor in a RHS. By the end of the 1970s, it turned out that some basic physical problems of Hilbert space quantum mechanics, notably in the context of decaying states or resonances, could be clarified in terms of RHS (40 and references therein). Very briefly, a RHS (Gel'fand triplet) can be understood as follows. Let * be an abstract linear scalar product space and complete * with respect to two topologies. The first topology is the standard norm topology yielding a separable Hilbert space. The second topology r$ is defined by a countable set of norms IMU = \ A & 0 ) n ^ € #, n = 0,1,2,...

(15)

where (f> e $ and the scalar product is given by (<(>, , (A + 1) V ) , n = 0 , 1 , 2 , . . .

(16)

where A is the Nelson operator A =J2iXi41- The Xi are operators representing the observables for the system in question and are the generators for the Nelson operator. Furthermore the operator A + 1 is a nuclear operator and ensures that $ is a nuclear space (cf. 42>39). An operator is nuclear if it is linear, essentially self-adjoint and its inverse is Hilbert-Schmidt. An operator A-1 is Hilbert Schmidt if A'1 = XiPi where the Pt are mutually orthogonal projection operators on a finite dimensional vector space and J2iPi < °° > Pi denoting the eigenvalues of Pi39. We then have the Gel'fand triplet of spaces $C^C$

X

(17)

63

where $ x is the dual to the space $. The Nelson operator fully determines the choice of function space when it comes to choosing a realization of the space $. However, there are many different inequivalent irreducible representations of an enveloping algebra of a Lie group used to generate a Nelson operator describing physical systems. Therefore further restrictions on the choice of function space for a realization of $ are required. The particular characteristics of the physical context of the system being modeled provide some of these restrictions, analogous to the situation for GNS constructions in the transition from C*- to W*-algebras in algebraic quantum mechanics 23 . Additional restrictions may be required due to the convergence properties desired for test functions in $ and <J>X. Bohm and colleagues applied the RHS approach to intrinsic irreversibility in the context of scattering and decay phenomena 40 ' 43 . Antoniou and Prigogine 44 extended the approach to broader contexts. The core idea in both versions is that a unitary group operator Ut = exp(-iHt), —oo < t < oo, generated by a Hamiltonian H, under very general circumstances, may be extended from W to $ x (restricted to $ ) . For scattering processes, $ is the intersection of the Hardy class functions with the Schwarz class functions. Because of continuity and completeness requirements, Ut : $ x —> $ x (Ut : $ — > $ ) can be extended to the upper half plane $+ (restricted to $+) for positive times and to the lower half plane $ x ($_) for negative times 4 3 . The extension of Ut to $ x (restriction to $) forms two semigroups because the extension (restriction) cannot be defined for replacement of t with —t. Thus, semigroup evolution falls out of the analysis quite naturally in the RHS framework. 4-3

General Features of Intrinsic

Irreversibility

In the intrinsic conception of irreversibility, states of a system are generically represented by distributions in a suitable state space, where pure states are S functions. The trajectories of individual points are either (1) considered irrelevant because empirically inaccessible (as in the A-transformation approach) or (2) make minimal contributions to the collective behavior of the system when a sufficient number of Poincare resonances are present (as in the RHS approach). For systems beyond a particular degree of complexity (K-systems, Poincare resonances, etc.), the dynamics of the system is governed by irreversible evolution laws regardless of interactions with an environment. While the A-transformation approach has only been applied to the baker map, the RHS approach has been applied to nonlinear maps, Friedrich models, d

The dual space * x is the space of linear functionals acting on elements of <£> and its topology is induced by the choice of T* and includes distributions among its elements.

64

scattering experiments and other decay phenomena. In the latter approach, exact Golden Rules for decay and survival probabilities and their rates can be derived in agreement with experimental observations 43 . In both approaches the transition from reversible to irreversible dynamical evolution laws is achieved by breaking the time-reversal symmetry in specific ways leading to two semigroups. The time direction of the semigroups, however, is not given by either the A-transformation or RHS approaches. Physical considerations alone are insufficient to select the forward arrow and one must appeal to consistency with experience, causality or other criteria. 5

Summary and Open Questions

There are two basic points at which extrinsic and intrinsic notions of irreversibility coincide. The first is that both notions explicitly break the timereversal symmetry of reversible dynamical laws. This is clearly the case for the standard, external view, in which the transition from fundamental, reversible laws to contextual, irreversible laws corresponds to the transition from ontic states of closed systems to epistemic states of open systems. But even for the alternative, intrinsic view "irreversibility is an emergent feature" 4 5 . In the framework of the A-transformation, the time-reversal symmetry of K-systems is broken, leading to irreversible, exact systems. In the RHS representation, a similar symmetry breaking is achieved by the transition from Hilbert space to the rigging spaces $ and $ x . The breaking of time-reversal symmetry always produces two semigroups which can be endowed with opposite temporal directions. Selection criteria must be used to select one of these two directions for a preferred mode of description. In both extrinsic and intrinsic approaches, there is no such criterion available based on physical reasoning alone. The selection is based on extra-physical arguments such as causality, experience, and others. This second point of agreement between extrinsic and intrinsic irreversibility raises the interesting question of what conditions the "proper" direction of time has to satisfy. It could be argued that up to the condition that it is the same for all physical systems, the selection is arbitrary . There are two basic points at which extrinsic and intrinsic notions of irreversibility apparently differ. One of them concerns the role of the environment, the other has to do with the state concepts used in the two approaches. Briefly speaking, the role of the environment and the distinction of different state concepts is crucial in the standard framework of extrinsic irreversibility. The conceptual framework of the formalisms refering to intrinsic irreversibility neither (1) explicitly contains the concept of an environment nor (2) distinguishes

65

between different state concepts. These observations do not necessarily imply that intrinsic irreversibility really can dispense with points (1) and (2). It is likely that the two points play crucial roles even though they do not explicitly appear in the formalism and its usual interpretation. The projection (factorization) which is the crucial part of a A transformation can be considered as the selection of an exact subsystem of the original K-system. Obviously, the A-transformation is not universal but contextdependent. Conceptually, the irreversible evolution of p — Kp due to Wt could then be attributed to the restriction of the K-system to an exact subsystem. This might lead to interesting analogies with aspects of extrinsic irreversibility, if the subsystem cannot be described as a closed subsystem. Concrete empirical applications of the A-transformation are not yet available. They would be necessary to check the significance of a physical environment which is not explicit in the formalism. Concerning the distinction between ontic and epistemic state concepts, it is clear that the approach of intrinsic irreversibility starts at the level of distributions rather than points. In the space of distributions, 5 functions are special cases that could be related to points in a state space underlying the distribution space considered. In this way, a connection between distributions as epistemic states and points as ontic states is possible. The general claim in the A-transformation framework of intrinsic irreversibility, though, is that ontic states in the sense of phase points are meaningless or irrelevant since they are empirically inaccessible. But is it justified to consider ontic states as generally irrelevant because they are empirically inaccessible? Reversible fundamental laws refer to ontic states, and it is not easy to formulate physics without them. The monographs by Ludwig 46 , which consistently avoid any ontic elements, are an illustrative example. Moreover, special techniques to break symmetries often enable a unique derivation of irreversible contextual laws if the fundamental laws plus contexts are known. This also holds for the symmetry breaking used to derive intrinsic irreversibility from time-reversal invariant evolution in the A-transformation approach. The empirical inaccessibility of ontic states notwithstanding, one should therefore not dismiss their overall relevance too quickly. In the RHS approach, there is no contradiction with the formal arguments in the case of extrinsic irreversibility insofar as the extension of Ut from V. into $ x leads from reversibility to irreversibility. In this case, irreversibility is a feature arising during the transition from states in % to states whose state space is defined with respect to contexts. In the algebraic framework of Sec. 3,

66 such contexts are reflected by a contextual topology on M.. As mentioned in Sec. 4.2, physical contexts may not be known sufficiently well to determine $ x uniquely. The physical examples used to demonstrate the significance of the RHS formulation (e.g., decay) suggest that a physical environment is inevitable, although this is not explicit in the formalism. The relationship between ontic and epistemic states in the RHS approach is more subtle than in the A-transformation approach. As Petrosky and Prigogine argue 4 7 , 4 8 , the presence of a sufficient number of Poincare resonances in so-called large Poincare systems (LPS) rapidly convert the smooth, infinitely differentiable trajectories of the phase space points into random walks. Though the trajectories are not considered to be empirically inaccessible, their effects are limited to the formation of higher and higher orders of correlations as the dynamics evolves. The phase space points can represent ontic states, but the correlations also have an ontic status. Correlations very rapidly come to dominate the dynamics of all collective modes of behavior of LPS (e.g. the approach to equilibrium) as the correlations diffuse throughout the system. In this way the effects of individual points and trajectories become irrelevant to the dynamics of the whole and, thus, one can argue that the distribution description is an ontic description of the system's behavior. In this way, the distinction between ontic and epistemic states might be a powerful conceptual tool even at the level of distributions alone. There is a conceptual difference between a probability distribution conceived as a distribution over an ensemble of individual pure states (as in the /^-statistical representation) and a probability distribution conceived as an individual whole. The latter concept is sometimes indicated in the context of intrinsic irreversibility and can be considered as an ontic version of the former (cf. the notion of relative onticity 1 6 ). For instance, continuum mechanics requires a formulation which needs ontically interpreted, "holistic" distributions from the very beginning, since its description in terms of an ensemble of points would violate basic physical laws. Among the adherents of intrinsic irreversibility it is claimed that the "holistic" concept of a distribution as a whole entails predictions, e.g., related to the dynamics of correlations in large systems, which cannot be obtained with the concept of a probability distribution of individual pure states. This claim particularly refers to situations far from thermal equilibrium. Based on Gallavotti's approach, which describes systems far from equilibrium in terms of SRB-measures 49 , i.e., in an ensemble description, this claim may become testable (see also 50 for a brief discussion). After all, it is possible to view the intrinsic approach to irreversibility as emphasizing the relative importance of the advanced level of complexity

67

of systems with nontrivial correlations over environmental effects. While extrinsic irreversibility addresses the importance of an environment, intrinsic irreversibility should not primarily be understood as focusing on the neglect of such an environment (e.g. the environment may be a necessary condition for the existence of the dynamics). Instead, it is perhaps more appropriate to understand intrinsic irreversibility as irreversibility intrinsic to the dynamics of a system given a particular degree of its complexity. Acknowledgments Helpful comments by L. Accardi, L. Ballentine, H. Narnhofer, and I. Volovich during the discussion of this contribution at the conference are much appreciated. We are grateful to H. Primas for remarks on an earlier version of this paper. References 1. J.H. Fetzer and R.F. Almeder, Glossary of Epistemology/Philosophy of Science (Paragon House, New York, 1993), p. lOOf. 2. D. Howard: Space-time and separability: problems of identity and individuation in fundamental physics. In Potentiality, Entanglement, and Passion-at-a-Distance, ed. by R.S. Cohen, M. Home, and J. Stachel (Kluwer, Dordrecht, 1997), pp. 113-141. 3. W. Heisenberg: Physics and Philosophy (Harper and Row, New York, 1958). 4. D. Bohm: Wholeness and the Implicate Order (Routledge and Kegan Paul, London, 1980). 5. B. d'Espagnat: Veiled Reality (Addison-Wesley, Reading, 1995). 6. H. Margenau: Reality in quantum mechanics. Phil. Science 16, 287-302 (1949), here: p. 297. 7. K.R. Popper: The propensity interpretation of probability, and quantum mechanics. In Observation and Interpretation in the Philosophy of Physics - With special reference to Quantum Mechanics, ed. by S. Korner in collaboration with M.H.L. Pryce (Constable, London, 1957), pp. 6570. [Reprinted by Dover, New York, 1962.] 8. R. Harre: Is there a basic ontology for the physical sciences? Dialectica 51, 17-34 (1997). 9. M. Jammer: The Philosophy of Quantum Mechanics (Wiley, New York, 1974), pp. 448-453, 504-507. 10. E. Scheibe: The Logical Analysis of Quantum Mechanics (Pergamon, Oxford, 1973), pp. 82-88.

68

11. H. Primas: Mathematical and philosophical questions in the theory of open and macroscopic quantum systems. In Sixty-Two Years of Uncertainty, ed. by A.I. Miller (Plenum, New York, 1990), pp. 233-257. 12. H. Primas: Endo- and exotheories of matter. In Inside Versus Outside, ed. by H. Atmanspacher and G.J. Dalenoort (Springer Berlin, 1994), pp. 163-193. 13. J. von Neumann: Mathematische Grundlagen der Quantenmechanik (Springer, Berlin, 1932). English translation: Mathematical Foundations of Quantum Mechanics (Princeton University Press, Princeton, 1955). 14. A. Einstein, B. Podolsky, and N. Rosen: Can quantum-mechanical description of physical reality be considered complete? Phys. Rev. 47, 777-780 (1935). 15. H. Primas: Emergence in exact natural sciences. Acta Polytechnica Scandinavica M a 9 1 , 83-98 (1998). See also Primas, Chemistry, Quantum Mechanics, and Reductionism (Springer, Berlin, 1983), Chap. 6. 16. H. Atmanspacher and F. Kronz: Relative onticity. In On Quanta, Mind, and Matter. Hans Primas in Context. Edited by H. Atmanspacher, A. Amann and U. Miiller-Herold (Kluwer, Dordrecht, 1999), pp. 273294. 17. H. Atmanspacher: Ontic and epistemic descriptions of chaotic systems. In Computing Anticipatory Systems: CASYS 99. Edited by D. Dubois (Springer, Berlin, 2000), pp. 465-478. 18. E. Fick and G. Sauermann: Quantenstatistik dynamischer Prozesse Ha: Antwort- und Relaxationstheorie (Harri Deutsch, Thun, 1986). 19. R. Kubo, M. Toda, and N. Hashitsume: Statistical Physics II (Springer, Berlin, 1985). 20. H. Primas: The Cartesian cut, the Heisenberg cut, and disentangled observers. In Symposia on the Foundations of Modern Physics. Wolfgang Pauli as a Philosopher, ed. by K.V. Laurikainen and C. Montonen (World Scientific, Singapore, 1993), pp. 245-269. 21. A. Amann: Structure, dynamics and spectroscopy of single molecules: a challenge to quantum mechanics. J. Math. Chem. 18, 247-308 (1995). 22. A. Amann and H. Atmanspacher: Fluctuations in the dynamics of single quantum systems. Stud. Hist. Phil. Mod. Phys. 29, 151-182 (1998). 23. A. Amann and H. Atmanspacher: C*- and W*-algebras of observables, their interpretation, and the problem of measurement. In On Quanta, Mind, and Matter. Hans Primas in Context. Edited by H. Atmanspacher, A. Amann and U. Miiller-Herold (Kluwer, Dordrecht, 1999), pp. 57-79. 24. H. Primas: Induced nonlinear time evolution of open quantum systems.

69

25. 26.

27. 28.

29.

30. 31. 32. 33. 34.

35.

36.

37. 38. 39.

In Sixty-Two Years of Uncertainty, ed. by A.I. Miller (Plenum, New York, 1990), pp. 259-280. N. Wiener (1938): The homogeneous chaos. Am. J. Math. 60, 897-936 (1938). C M . Lockhart and B. Misra: Irreversibility and measurement in quantum mechanics. Physica A 136, 47-76 (1986). Cf. H. Primas, Math. Rev. 87k, 81006 (1987). A. Lasota and M.C. Mackey: Chaos, Fractals, and Noise (Springer, Berlin, 1995). V.A. Rokhlin: Exact endomorphisms of Lebesgue spaces. Izv. Akad. Nauk SSSR Ser. Mat. 25, 499-530 (1964); transl. in Am. Math. Soc. Transl. 39, 1-36 (1964). B. Misra: NonequiUbrium entropy, Lyapounov variables, and ergodic properties of classical systems. Proc. Ntl. Acad. Sci. USA 75, 1627-1631 (1978). B. Misra, I. Prigogine, and M. Courbage: From deterministic dynamics to probabilistic descriptions. Physica A 98, 1-26 (1979). A. Wightman: Review of Misra, Prigogine, and Courbage 30 . Math. Rev. 82e, 58066 (1982). Z. Suchanecki: On lambda and internal time operators. Physica A 187, 249-266 (1992). H. Atmanspacher and H. Scheingraber: A fundamental link between system theory and statistical mechanics. Found. Phys. 17, 939-963 (1987). H. Atmanspacher: Dynamical entropy in dynamical systems. In Time, Temporality, Now, ed. by H. Atmanspacher and E. Ruhnau (Springer, Berlin, 1997), pp. 325-344. R.W. Batterman: Randomness and probability in dynamical theories: on the proposals of the Prigogine school. Philosophy of Science 58, 241-263 (1991). I. Antoniou, K. Gustafson, and Z. Suchanecki (1998): On the inverse problem of statistical physics: from irreversible semigroups to chaotic dynamics. Physica A 252, 345-361 (1998). I.M. Gel'fand and N.Ya. Vilenkin: Generalized Functions, Vol. 4 (Academic, New York, 1964). Russian original published 1961 in Moscow. J.E.Roberts: The Dirac bra and ket formalism. Journal of Mathematical Physics 7, 1097-1104 (1966). A. Bohm: Rigged Hilbert space and mathematical descriptions of physical systems. In Lectures in Theoretical Physics IX A: Mathematical methods of theoretical physics. Edited by W.E. Brittin, A.O. Barut and M. Guenin (Gordon and Breach, New York, 1967), pp. 255-317.

70

40. A. Bohm and M. Gadella: Dirac Kets, Gamow Vectors, and Gelfand Triplets. Lecture Notes in Physics, Vol. 348, ed. by A. Bohm and J.D. Dollard (Springer, Berlin, 1989). 41. E. Nelson: Analytic Vectors. Annals of Mathematics 70, 572-615 (1959). 42. F. Treves: Topological Vector Spaces, Distributions and Kernels (Academic Press, New York, 1967). 43. A. Bohm, S. Maxson, M. Loewe, and M. Gadella: Quantum mechanical irreversibility. Physica A 236, 485-549 (1997). 44. I. Antoniou and I. Prigogine: Intrinsic irreversibility and integrability of dynamics. Physica A 192, 443-464 (1993). 45. T. Petrosky and I. Prigogine: The Liouville space extension of quantum mechanics. Adv. Chem. Phys. XCIX, 1-120 (1997), here p. 71. 46. G. Ludwig: Foundations of Quantum Mechanics Vols. 1/2 (Springer, Berlin, 1983/1985). 47. T. Petrosky and I. Prigogine: Poincare resonances and the extension of classical dynamics. Chaos, Solitons & Fractals 7, 441-497 (1996). 48. T. Petrosky and I. Prigogine: The Extension of Classical Dynamics for Unstable Hamiltonian Systems. Computers & Mathematics with Applications 34, 1-44 (1997). 49. G. Gallavotti: Chaotic dynamics, fluctuations, nonequilibrium ensembles. CHAOS 8, 384-392(1998). 50. D. Ruelle: Gaps and new ideas in our understanding of nonequilibrium. Physica A 263, 540-544 (1999).

71

I N T E R P R E T A T I O N S OF P R O B A B I L I T Y A N D Q U A N T U M THEORY

Department

L. E. B A L L E N T I N E of Physics, Simon Fraser University, BC V5A 1S6, Canada e-mail: [email protected]

Burnaby,

There is a peculiar similarity between Probability Theory and Quantum Mechanics: both subjects are mature and successful, yet both remain subject to controversy about their foundations and interpretation. I first present a classification of the various interpretations of probability, arguing that they should not be thought of as rivals, but rather as applications of a general theory to different kinds of subject matter. An axiom system that makes conditional probability the fundamental concept is put forward as being superior to Kolmogorov's axioms. I then discuss the relevance to quantum theory of the various interpretations of probability, the applicability of classical probability theory within quantum mechanics, and the relations between the interpretation of probability and the interpretation of quantum mechanics.

1

Introduction

There are many connections between Probability Theory and Quantum Mechanics, the most notable being that Quantum Mechanics uses Probability Theory in its fundamental interpretation, not merely as a technique. But I wish to concentrate on a more peculiar similarity. Although both subjects are mature and successful, both remain subject to controversy about their foundations and interpretation. There may be even more interpretations of probability than there are of quantum theory. Can one bring some degree of order to this subject? Probability Theory, being a branch of mathematics, is defined by a set of axioms. So it can legitimately be applied to any entity that satisfies those axioms. Most of the interpretations of probability can be viewed as applications of the formal theory to different subject matters. It is therefore misguided to argue over which is the correct interpretation. Most of them are correct within their appropriate domain of application. But it is still reasonable to ask whether there is a general, overarching form of Probability Theory, of which all the various interpretations can be seen as special cases, applied to special subject matters. I shall propose such a classification of the various interpretations of probability. To do so, it is necessary to overlook small differences and to lump closely related interpretations into a few broad categories. I expect this classi-

72

fication to be controversial, but I believe that it is a step in the right direction. I shall consider only theories that are based on the same, or equivalent, sets of axioms. Hence generalizations such as negative probabilities are not included in this scheme, although I shall briefly refer to them later. After describing the major categories of interpretation of probability, I will discuss the relevance of each to quantum mechanics.

2

Interpretations of Probability

Many different interpretations of probability are examined in detail by T. L. Fine.1 I propose to overlook many of the fine differences, and hence classify them into a few major groups, shown in Figure 1. References to most of the authors named in Fig. 1, and critical analyses of their ideas, are given by Fine.1

2.1

The Theory of Inductive Inference

I propose that the Theory of Inductive Inference be taken as the master theory, and that all other interpretations be regarded as special cases, applicable in more restricted contexts. This point of view was expressed most completely by E. T. Jaynes in his book Probability Theory: The Logic of Science? which unfortunately was not completed during his lifetime. Within this interpretation, probability is assigned to propositions. The notation P(A\C) is to be read as the probability of A under the condition C. Probability is regarded as a logical relation among propositions that is weaker than entailment. Inductive logic reduces to deductive logic in the limit of probability values 0 and 1. Probability is an objective relation, and should not be confused with degrees of belief. The propositions to which probability is assigned may have any particular content. If we specialize to propositions about repeated experiments we obtain the Ensemble-Frequency theory. If we specialize to propositions about personal belief we obtain Subjective probability. If we specialize to propositions about indeterministic or unpredictable events we obtain the Propensity theory. Although P(A\C) is a logical relation between proposition A and the conditioning information C, it is not merely a formal, syntactic relation. The content (meaning) of A and C must be invoked to evaluate P(A\C). There is no magic formula to translate arbitrary information into probabilities. Jaynes has given solutions to this problem in some important special cases (symmetry groups, marginalization), but there is, as yet, no general solution.

73

The Logic of Inductive Inference (E. T. Jaynes, R. T. Cox, H. Jefferys) P(A\C) is the probability that proposition A is true, given the information C.

Ensemble and Frequency (Kolmogorov, Bernoulli, von Mises) Measure on a set; Limit frequency in an ordered sequence.

Propensity (K. R. Popper) P{A\C) is the propensity for event A to occur under the condition C.

Subjective and Personal (de Finnetti, L. J. Savage, I. J. Good) Incomplete knowledge; Degrees of reasonable belief.

Figure 1: Classification of the interpretations of Probability.

2.2

Ensemble and Frequency Theories

One of the most common interpretations of probability is as a limit frequency in an ordered sequence. The ratio of the number n of occurrences of a particular type in a sequence of N events, n/N, is identified with the probability. This interpretation is useful in analyzing repeated experiments, but it has the

74

difficulty that in a random sequence the ratio n/N need not have a limit. The ensemble interpretation is a generalization of the frequency interpretation, in which probability is identified with a measure on a set that need not be ordered. It is closely associated with Kolmogorov's axiom system, which will be discussed later. 2.3

Subjective Probability

Subjectivism has its place, and subjective probability provides an excellent way to describe degrees of reasonable belief. But in science, subjectivism can be like a virus, and we must guard against its infection. In general, the probability P(A\C) expresses an objective relation between A and C, determined by the totality of the information C, and not by anyone's personal opinions. Jaynes tried to ensure objectivity through the pedagogical device of introducing a robot that is programmed to reason consistently using only the information that is given to it. But even Jaynes sometimes slipped from objective to personal probabilities in his examples, without apparently being aware of doing so. Indeed, the contamination of Inductive Logic Probability by subjectivism may have been a major barrier to its acceptance. 2.4

Propensity

Propensity is a form of causality that is weaker than determinism.3'4 Generally speaking, probability expresses logical relations, rather that causal relations. (Recall the old saying: Correlation does not imply causality.) However, causality is a special kind of logical relation, and propensity theory deals with just that special case. The propensity interpretation of probability is natural in situations, such as those described by quantum mechanics, in which events can not be predicted with certainty from their antecedents. 3

The Axioms of Probability

The axioms of probability theory can be given in several different forms, however those given by R.T. Cox 5,6 are particularly convenient. Axiom 1. 0 < P{A\B) < 1 Axiom 2. P{A\A) = 1 Axiom 3. PhA\B) = 1 - P(A\B) Axiom 4. P(AkB\C) = P(A\C) P{B\AkC) Here the notation is as follows: ->A means "not A"; Ak,B means "A and J5"; A\/ B means "either A or B".

75

Axiom 2 states that the probability of a certainty (A, given A) is one. Axiom 1 states that no probabilities are greater than the probability of a certainty. Axiom 3 expresses the notion that the probability of non-occurrence of an event increases as the probability of its occurrence decreases. It also implies P{->A\A) = 0; an impossibility (not A, given A) has zero probability. Axiom 4 is the least intuitive. The probability of both A and B (under some condition C) is equal to the probability of A multiplied by the probability of B given A. The probabilities of negation (->A) and conjunction (A&B) each require an axiom. However, no further axioms are required to treat disjunction because AV B = -i(-iA&-ii?); in words, "A or B" is equivalent to the negation of "neither A nor B". This allows us to deduce a theorem: P(A V B\C) = P(A\C) + P(B\C) - P{AkB\C).

(1)

If A and B are mutually exclusive then we obtain P{AV B\C) = P(A\C) + P(B\C),

(2)

which is often taken to be an axiom, and may be used in place of Axiom 3. Several remarks about these axioms are in order. First, the notion of randomness plays no fundamental role in the theory. Hence we need not enquire whether our variables and events are random as a prerequisite to applying probability theory. Second, these axioms are not arbitrary. They are uniquely determined (apart from formal changes that do not affect the content) by conditions of plausibility and consistency (see Cox 5 and Jaynes 2 ): (i) The probability of A on some given evidence determines also the probability of "not A" on the same evidence. (ii) The probability on given evidence that both A and B are true is determined by their separate probabilities, one on the given evidence, and the other on that evidence plus the assumption that the first is true. (iii) If a complex proposition can be composed in more than one way [ex.: (A&B)&C, or A&c{Bb,C)\ then all ways of computing its probability must lead to the same answer. Notice that in (i) and (ii) only the existence of certain connections are assumed, but not their mathematical form. The consistency condition (iii) then leads to the mathematical forms of the axioms. Therefore, anyone who proposes an inequivalent alternative to Cox's axioms (such as allowing negative probabilities) has an obligation to explain how and why he departs from these conditions of plausibility and consistency.

76

Finally, a very important remark: All probabilities are conditional. The use of the single-variable notation P{A), instead of P(A\C), is permissible only if the conditional information C is obvious from the context, and is unchanging throughout the problem. Many fallacies and paradoxes follow from ignoring this principle. 3.1

Kolmogorov's axioms

If the fundamental axioms that define Probability Theory are those given above, then what is the status of Kolmogorov's well-known axioms? According to Kolmogorov's axioms, probability is assigned to subsets of a universal set fi, with the following rules: (i) p(n) = I (2) P(f) > 0 for any / in il. (3) If / i , - - - / « a r e disjoint then P(f) = S j / j , where / is the union of fir" fn(4) If/* —> 0 (the empty set) then P(fi) -> 0. The answer, I believe, is that Kolmogorov's axioms provide a mathematical model of probability theory (defined by Cox's axioms) on the theory of measurable sets. A mathematical model is useful because it reduces the consistency of one theory to that of another. (A familiar example is the algebra of complex numbers, which can be modeled by the algebra of ordered pairs of reals.) Thus any doubts about the consistency of Probability Theory may be laid to rest because of the existence of Kolmogorov's model. There are several objections to taking Kolmogorov's axioms as a foundation for Probability Theory, rather than merely as a model: • The universal set Cl is often fictitious. The propositions to which probabilities are assigned are not subsets of a set. • Conditional probability is relegated to secondary status, while the mathematical fiction of "absolute probability" is made primary. • Probability theory and Measure theory are distinct subjects. The interesting problems of one are not closely related to the interesting problems of the other. For example, measure theory deals mostly with infinite sets, culminating with the construction of non-measureable sets, which have no probabilistic interpretation. But in probability theory one seldom needs to consider an infinite number of conjunctions and disjunctions. On the other hand, the important problem of translating qualitative information into probabilities has no measure-theoretic analog.

77

4 4-1

Probability in Quantum Mechanics Relevant and Irrelevant Interpretations of Probability

Which of the interpretations of probability are relevant to quantum mechanics? The ensemble-frequency interpretation is obviously relevant, and widely used, in discussing the statistics of repeated experiments on similarly prepared states. Indeed, the standard description of an idealized experiment is: (1) prepare a state; (2) measure an observable of the system; (3) repeat the previous two steps until sufficient statistical data has been accumulated; (4) compare the relative frequencies of this data with the probabilities predicted by quantum theory. The propensity interpretation is in accord with the ensemble-frequency interpretation whenever it is applied to repeated experiments, but it also allows one to make meaningful statements about individual events. The propensity interpretation is more natural when one considers time-dependent states, and hence time-dependent probabilities. Consider the following examples. (i): A source produces s = 1/2 particles polarized at an angle 4> relative to some coordinate axis. A Stern-Gerlach magnet has its field gradient axis oriented at an angle 8. What is the probability that such a particle, incident on the apparatus, will emerge with spin "up"? The formal answer is, of course, p = {cos[(9 — <j))/2}}2 , but what does this mean? According to the propensity interpretation it means: The propensity (chance) of the particle emerging with spin "up" is p. According to the ensemble-frequency interpretation it means: In a long run of similar experiments the fraction of particles emerging with spin "up" will be (approximately) p. (ii) Now let the magnet be re-oriented in some arbitrary manner before each particle is released, so that 6 is different in each case. According to the propensity interpretation we say nearly the same thing: The propensity (chance) has a different value, p = p$, in each case. But in the ensemble-frequency interpretation one must conceptually embed each event in an imaginary long run of experiments having the same value of 6, in order to make a frequency statement.

78

(iii) Suppose next that the polarization direction <j> of the particles is unknown. Can it be inferred from the data of (ii)? In the ensemble-frequency interpretation the answer would appear to be: No. A long run of events for each value of 0 would be necessary to estimate p$ as a frequency, and hence to determine its dependence on 6. In the propensity interpretation the answer is: Yes. Bayesian inference (equivalent to maximum likelihood if the prior probability distribution for <> / is uniform) can determine the most probable value of <j>, even if there is only one event for each value of 9. I have never seen a coherent exposition of QM based on a subjective interpretation of quantum probabilities as representing knowledge*. This point (which has also been argued at length by Popper 8 ) is worth emphasizing because the interpretation of probabilities as knowledge seems to be a tenet of the Copenhagen interpretation. Two persons (with limited knowledge of QM) might have different "reasonable" beliefs about the position of the electron in the hydrogen atom, and those beliefs could be represented by subjective probabilities. But such "ignorance" probabilities have nothing to do with |*/>(a0|2 from the Schroedinger equation. |V'(a;)|2 is an objective propensity, not a subjective degree of belief. The so-called Uncertainty principle, AxAp > h/2, has nothing to do with subjective knowledge or ignorance. Its meaning is that in any physical preparation of a state, the values of x and p will not be reproducible, the widths of their distributions being related by the inequality. The widths Aa; and Ap are objective, predictable, and measurable parameters, which should not be called "uncertainties". Indeed, the name "Indeterminacy principle" is preferable to "Uncertainty principle". 0 Subjective probabilities can occur in the information games that are played in quantum communication theory. Consider a typical example. Bob prepares some quantum state, but keeps it secret. He tells Alice only that it is one of four (usually nonorthogonal) possible states, and she must try to infer what the hidden state is from a measurement. Alice's incomplete knowledge of that hidden state can be expressed as a subjective probability. Suppose also that Bob tells Carol that the unknown state is one of three possibilities. Carol's knowledge is different from Alice's, and hence her subjective probability will be different. But both of these subjective knowledge probabilities are quite distinct from the objective quantum probabilities (propensities) " W h e n I once heard Heisenberg speak (about 1964), he used the term Indeterminacy principle. In his early writings he used the words Ungenauigheit (inexactness), Unbestimmtheit (indeterminacy), and Unsicherheit (uncertainty) with various shades of meaning.

79 that would be calculated by solving Schroedinger's equation for Bob's state preparation apparatus. I suspect that the subjective "knowledge" interpretation of QM probabilities came about by accident; the founders of QM may have believed (erroneously) that probability can only be a measure of knowledge/ignorance. Max Born has written that Heisenberg did not know what a matrix was when he was inventing what later became known as matrix mechanics. It is therefore not very radical to suppose that the founders of quantum mechanics had an inadequate understanding of probability. 4-2

Fallacies in the use of Probability

Unsound arguments to the effect that "classical" probability theory does not apply to QM are woefully common. Before examining an actual argument to that effect, let us first consider a simple classical paradox. The Bookie's Paradox A bookie needs to fix the odds on a star track runner, who has a 60% chance of winning any race that he enters. There is a race in Paris and a race in Tokyo scheduled on the same day, so he cannot enter both, and we do not know which he will enter. What is the probability that he will win at least one of these races? Let A = (winning in Paris), and let B = (winning in Tokyo). Clearly A and B are mutually exclusive events, so P{A\JB) = P{A) + P(B). The probability of his winning at least one race is 0.6 + 0.6 = 1.2. But this is absurd, since 1.2 > 1. The paradox is resolved by taking account of a principle that was noted in Sec. 3: All probabilities are conditional. The notation P{A), instead of P(A\C), is permissible only if the conditional information C is obvious from the context, and unchanging throughout the problem. Let us, therefore, be more precise about the conditions involved. Let Ep = (entering in Paris), and let ET — (entering in Tokyo). Then clearly we have P(A\EP) = 0.6 ; P(B\EP)=0 P(A\ET)

= 0 ; P(B\Er)

= 0.6

80

Additivity, P(A V B\C) = P(A\C) + P{B\C), holds for the same condition C in all terms. But P{A\Ep) and P(B\ET) are not additive by any valid rule, so the absurd conclusion, reached above, followed only from an erroneous application of probability theory. Double-slit Fallacy A common fallacy about 2-slit experiment is of exactly the same form. The experiment consists of three parts: (a)

Open slit # 1 , close slit # 2 . The arriving at the point X on the screen (b) Open slit # 2 , close slit # 1 . The arriving at X is now P2(X). (c) Open both slits # 1 and # 2 . The arriving at X is Pi2(X).

probability of a particle is Pi(X). probability of a particle probability of a particle

Now passage through slit # 1 and through slit # 2 are mutually exclusive, so we deduce Pu{X) = Pi(X) + P2(X), which is empirically false. It is then concluded (fallaciously) that "classical" probability theory does not apply in quantum mechanics. The above reasoning embodies essentially the same fallacy is does the Bookie's paradox, and it is resolved similarly by paying proper attention to the conditional nature of the probabilities. Let condition C\ = (slit # 1 open, slit # 2 closed). Let C2 = (slit # 2 open, slit # 1 closed). Let C3 = (both slits open). We observe empirically that P(X\Ci) + P(X\C2) ^ P(X\C3), (due, of course, to interference). But this fact is is fully compatible with classical probability theory. 4-3

Quantum Probabilities

Quantum probabilities are not essentially different from classical probabilities, but like quantum theory itself, they do require some care in their interpretation. H. Jefferys 7 remarked that the probability statements of quantum mechanics are incomplete because, "a probability is always relative to a set of data, and the data are not specified." In our terminology, Jefferys is saying that all probabilities are conditional, and the conditions need to be specified to

81

make the probability statement meaningful. This can be accomplished through a propensity interpretation of quantum probabilities, with proper attention being given to the basic concepts of measurement and state preparation. When that is done, it can be demonstrated 9 ' 10 that quantum probabilities obey all of the axioms of "classical" probability theory. The demonstration is straight forward, but too lengthy to review here, so I shall only remark on some conceptual points. (a) The standard formula: P(A=an\^) = |(a„|*)| 2 , where A\an) = an\an), should be read as: The probability (propensity) for a measurement of the dynamical variable A to yield the value an, conditional on the preparation of the state * , is |(a„|*)| 2 . Note that the propensity is conditioned by the physical process of state preparation, and not by anyone's beliefs or opinions. (b) One can also calculate the probability of a measurement result, conditioned by state preparation and the results of other measurements^ P(B=bm\(A=an)kV). However, it is necessary that the measurement processes be described dynamically as an interaction between the object and the apparatus. Simplistic application of the Projection Postulate is liable to give an incorrect answer.11 (c) No difficulties of principle arise if the probabilities are conditioned on actual events of state preparation and measurement. But assigning probabilities to hypothetical unmeasured values is not always possible. This problem is encountered if we try to introduce joint probability distributions for (unmeasured values of) non-commuting observables, and require the marginal distributions to agree with the quantum probabilities of the individual observables. In the case of position and momentum, we would like to have a joint distribution P(x,p) that satisfies: P(x,p) > 0, Jp(x,p)dp=\(x\*)\2, Jp(x,p)dx

= \(p\V)\2.

(3) (4) (5)

There are infinitely many solutions to this problem,12 but there is no apparent physical reason for any one of them to be preferred. However, in the case of angular momentum, where we might seek a joint distribution P(Jx,Jy,Jz) for the three angular momentum components, it is

82

not difficult to show that no such a function can yield the quantum probabilities of the three components as marginals. However, this has more to do with Kochen-Specker 13 difficulties (the impossibility of assigning values to all quantum observables, consistent with all the relevant constraints) than to probability theory. There is no case in which a quantum probability is well defined but violates an axiom of classical probability theory. 5

Conclusions

In this paper I have suggested a scheme whereby all the major interpretations of probability are unified, with the separate interpretations now seen as applications of the general theory to particular subject matters. That such different ideas as ensemble-frequency theories, propensity theory, and subjective degrees of reasonable belief can all be encompassed within a single framework is both useful and surprizing. Because they can all be described by the same mathematical axioms, it is easy to switch from one kind of probability to another, as may be appropriate in a particular problem. But on the other hand, one can ask why such different things as frequencies, propensities, and degrees of belief should necessarily obey the same axiom system. This question should stimulate further foundational research. For the case of degrees of reasonable belief this work has already been completed by Cox,5'6 who showed that certain conditions of plausibility and consistency determine the axioms essentially uniquely. "Essentially unique" means subject only to formal transformations that do not alter the content of the theory. Therefore, any alternative inequivalent system of plausible reasoning could be shown to suffer from some degree of inconsistency. Khrennikov 14 has studied limit frequencies outside of any theory of probability, imposing only a condition of stabilization: that in a long sequence the frequencies should approach a limit. He has found many different cases to be possible, some of which lie outside of probability theory. It will be interesting to see whether these new logical possibilities are realized in nature. If not, then his stabilization condition will have to be supplemented by other conditions. The greatest need for more foundational research is in the case of propensity. Although it clearly can be described by the axioms of probability theory, it is not yet clear why it must be so described. Although I have dealt only with versions of probability theory that are derivable from the same axioms, I expect that the classification of interpretations (Fig. 1) may also be useful for generalized theories, such as those that admit negative probabilities.15 For such generalizations, we should ask which of the interpretations do they support. Can such generalized probabilities be

83

interpreted as frequencies? As propensities? As degrees of belief? Or must they be given some entirely new interpretation? There are connections between the interpretations of probability and of quantum mechanics. This must be so because quantum mechanics does not predict events, but only the probabilities of events. If one adheres exclusively to a frequency interpretation of probability, then one is bound to assert that a quantum state describes only an ensemble of similarly prepared systems. If, on the other hand, one adopts a propensity interpretation of probability, then it becomes possible to make meaningful probability statements about an individual system. However the empirically testable content of those statements can be realized only by measurements on an ensemble of similarly prepared systems. Thus the frequency interpretation is not made obsolete by the propensity interpretation, but merely broadened. The subjective interpretation of probability can be used in some situations, such as when the observer is not fully informed about the state preparation procedure. But it is never correct to interpret \ip\2 as representing knowledge (except, perhaps, in the trivial case in which the observer's knowledge is complete and in perfect accord with reality).

References 1. T.L. Fine, Theories of Probability, an Examination of Foundations (Academic Press, New York, 1973). 2. E.T. Jaynes, Probability Theory: The Logic of Science (Cambridge University Press, forthcoming); an incomplete version of this work is available electronically at http://bayes.wustl.edu/ 3. K.R. Popper in Observation and Interpretation ed. S. Korner (Butterworths, London, 1957). 4. K.R. Popper, Realism and the Aim of Science (Hutchinson, London, 1983). 5. R.T. Cox, The Algebra of Probable Inference (Johns Hopkins University Press, Baltimore MD, 1961). 6. R.T. Cox, Am. J. Phys. 14, 1 (1946). 7. H. Jefferys, Scientific Inference (Cambridge University Press, Cambridge, 1973), sec. 10.31 8. K.R. Popper, Quantum Theory and the Schism in Physics (Hutchinson, London, 1982). 9. L.E. Ballentine, Quantum Mechanics - A Modern Development (World Scientific, Singapore, 1998), Ch. 1.5, 2.4, 9.6 10. L.E. Ballentine, Am. J. Phys. 54, 883 (1986). 11. L.E. Ballentine, Found. Phys. 20, 1329 (1990).

84

12. L. Cohen, in Frontiers of Nonequilibrium Statistical Physics, ed. G.T. Moore and M.O. Scully (Plenum, New York, 1986), pp. 97-117. 13. S. Kochen and E.P. Specker, J. Math. Mech. 17, 59 (1967). 14. A. Khrennikov, Nonconventional approach to 'elements of physical reality' based on nonreal asymptotics of relative frequencies. Proc. Conf. Foundations of Probability and Physics, Vaxjo-2000 (WSP, Singapore, 2001). 15. A. Khrennikov, Interpretations of Probability, (VSP, Utrecht, 1999).

85

FORCING DISCRETIZATION A N D D E T E R M I N A T I O N IN Q U A N T U M HISTORY THEORIES BOB COECKE Imperial College of Science, Technology & Medicine, Theoretical Physics Group, The Blackett Laboratory, South Kensington, LondonSW7 2BZ; and Free University of Brussels, Department of Mathematics, Pleinlaan 2, B-1050 Brussels; E-mail: [email protected] We present a formally deterministic representation for quantum history theories where we obtain the probabilistic structure via a discrete contextual variable: no continuous probabilities are as such involved at the primal level.

1

Introduction

In this paper we propose and study a model for history theories in which the probability structure emerges from a finite number of contextual happenings, any next happening having a fixed chance to occur under the condition that the previous one happened. Although this model cannot have a canonical mathematical status since it has been proved that this type of representation in general admits no essentially unique "smallest one" 8 u , it provides insight in the emergence of logicality in the "History Projection Operator" setting 1 4 , and it illustrates how deterministic behavior can be encoded beyond those interpretations of quantum history theories that are interpretationally restricted by so-called consistency or quasi-consistency (e.g., approximate decoherence). The particular motivation for this "paradigm case study" finds its origin in structural considerations towards a theory of quantum gravity 4 ' 1 5 ' 1 9 . As argued in 1 6 , although the relative frequency interpretation of probability justifies the continuous interval as the codomain for value assignment, "... in the quantum gravity regime standard ideas of space and time might break down in such a way that the idea of spatial or temporal 'ensembles' is inappropriate. For the other main interpretations of probability — subjective, logical, or propensity — there seems to be no compelling a priori reason why probabilities should be real numbers." Our model should be envisioned as a deconstructive step unraveling the probabilistic continuum as it appears in standard quantum theory, reducing it explicitly to a discrete temporal sequence of (contextual) events. The as such emerging temporal sequence is then easier to manipulate towards alternative encoding of contextual events, e.g., in propositional terms. It also enables a separate treatment of internal (the system's) and external (the con-

86

text's) time-encoding variable. Although quantum history theories are currently most frequently envisioned in a context of so-called decoherence we prefer to take the minimal perspective that a history theory is a theory that deals with sequential quantum measurements but remains essentially a dichotomic propositional theory. This is formally encoded in a rigid way in the History Projection Operatorapproach 14 . We also mention recently studied sequential structures in the context of quantum logic, of which references can be found in 1 0 , resulting in a dynamic disjunctive quantum logic, which provides an appropriate formal context to discuss the logicality of history theories. A general theory on deterministic contextual models can be found in 8 . Note here that what we consider as contextuality is that in a measurement there is an interaction between the system and its context and that precisely this interaction to some extend may influence the outcome of a measurement. A lack of knowledge on the precise interaction then yields quantum-type uncertainties *. Besides this interpretational issue, classical representations are important since we think classical, so even without giving any conceptual significance to the representation, it provides a mode to think deterministically in terms of determined trajectories of the system's state, without having to reconcile with concrete non-canonical constructs like pilot-wave mechanics. 2

O u t c o m e determination via c o n t e x t u a l m o d e l s

We will present the required results in full abstraction such that the reader clearly sees which structural ingredient of quantum theory determines existence of contextual models. For details and proofs we refer t o 8 . Let B(M?) denote the Borel subsets of M? . Definition 1. A 'probabilistic measurement system' is given by: (i) A set of states £ and a set of measurements £; (ii) For each e e £ an outcome set Oe € B(W), a a-field B(Oe) of Oe-subsets and (Kolmogorovian) probability measures Pp<e : B(Oe) -> [0,1] for eachp 6 £ . The canonical example is that of quantum theory with every Hilbert space ray ij) representing a state, every self-adjoint operator H representing a measurement with its spectrum OH C K as outcome set where the a-structure B(OH) is inherited from that of B(R) and with probability measures P^,tH{E) •= (tp\PEtp) where PE denotes the spectral projector for E G B{OH) • In benefit of insight and also for notational convenience we will from now on assume that the measurements e £ £ are represented in a one to one way by their outcome sets Oe — note that whenever £ can be represented by points of W it then suffices to consider W x w' = W+v' in stead of W to fulfill this assumption,

87

taking Oe x {e} as the corresponding outcome set. We stress however that the results listed below also hold in absence of this assumption 81 '". Definition 2. A 'pre-probabilistic hidden measurement system' is given by: (i) A set of states £ and a set of measurements £ ; (ii) Sets O C B(W) and A that parameterize £, i.e., £ = {eA,o|A £ A,0 £ O}, and each e £ £ goes equipped with a map O . We can represent {<£A,O|A £ A} as ipo : £ x A -> O : (p, A) H-> P ( £ ) : (p,E) >-> {A|y 0 (p, A) £ E} where 7>(A) denotes the set of subsets of A. The core of this definition is that given a state p £ £ and a value A € A we have a completely determined outcome tpo [p, A). These pre-probabilistic hidden measurement systems encode as such fully deterministic settings. Definition 3. Whenever for a given pre-probabilistic hidden measurement system (Y,,£(0, A), {<po}oeo) there exists a a-field B(A) of A-subsets that satisfies \J0e0{AAo(p,E)\(p,E) £ £ x B(0)} C B(A), it defines a 'probabilistic hidden measurement system' if a probability measure p : B(A) —> [0,1] is also specified. The condition on A A requires that all AAo(p, E) are 23(A)-measurable, such that to all triples (p, O, E) we can assign a value PPto(E) := p(AAo(p, E)) € [0,1]. As such, any probabilistic hidden measurement system defines a measurement system. The question then rises whether every probabilistic measurement system (MS) can be encoded as a probabilistic hidden measurement system (HMS). The answer to this question is yes8% 4.2, Theorem 1,2 3: There always exists a canonical HMS-representation for A = [0,1], B(A) = B([0,1]) (i.e., the Borel sets in [0,1]) and pu([0,a]) := a, i.e., uniformly distributed — the proof goes via a construction using the Loomis-Sikorski Theorem 1 7 ' 2 0 and Marczewski's Lemma 13 . It makes as such sense to investigate how the different possible HMS-representations for different non-isomorphic pairs (B(A),p,) are structured — below it will become clear what we mean here by non-isomorphic. First we will discuss an example that illustrates the above; it traces back to 1 and details and illustrations can be found in 2 ' 8 " . Consider the states of a spin-1 entity encoded as a point on the Poincare sphere £ 0 ( = C^/C) C E 3 . Then any pair of antipodically located points of £ 0 encodes mutual orthogonal states, as such encodes mutual orthogonal one-dimensional projectors and thus a (dichotomic) measurement. Let p £ £ 0 , let (a, ->a) be a pair of mutual orthogonal points of £ 0 and let A be the diagonal connecting a and -a:], i.e., xp £ [a,A], we set a]) —> [0,1] : [a, (1 — x)a + x--> x , i.e., uniformly distributed,

88

we obtain exactly the probability structure for spin-| in quantum theory .a An interpretational proposal of this model could be the following:1'2'3 Rather than decomposing states as in so-called hidden variable theories, here we decompose the measurements in deterministic ones — the probability measure fi should then be envisioned as encoding the lack of knowledge on the interaction of the measured system with its environment, including measurement device. We now introduce a notion of "relative size" of HMS-representations, justifying the use of "smaller". Given a er-algebra6 and probability measure H : B —> [0,1] denote by B/n the [0,1] : [E] H-> H(E) again by fi. Given two pairs (B, /x) and (B1, //) consisting of separable cr-algebras and probability measures on them set: • (B u) < (B' u') & /

3

f

:B

^

~* B'^''

a n in

J e c t i v e c-ni°rphism

We call {B,n) and (B',fi') equivalent, denoted (B,fi) ~ (B',fi'), whenever in the above / is a c-isomorphism. Given two MS (£,£) and (E',£') we set:

{

3s : S -> E ' , 3t :£-+£', both bijections Ve 6 £, 3 / e : B(Oe) -> B(Ot(e)), a cr-isomorphism Vp E E , V e E £ : Ps(p),t(e)

° fe =

P

P,e

Via this equivalence relation we can define a relation < M S between classes of measurement systems M and M1 as M <MSM' if for all (E,£) € M there exists (E',£') 6 M' such that (E,£) ~ M S (S',£"'), i.e., if M is included in M' up to MS-equivalence. We can then prove the following: (i) (B,/i) ~ (B',ii') if and only if (B>Ai) < (B',n') and {B',ft') < {B,ft) — " , 3, Lemma 1; thus, the equivalence classes with respect to ~ constitute a partially ordered set (poset) for the ordering induced by < ; we will denote 8

"As shown i n 6 , 9 this deterministic model for spin-^ in R 3 can be generalized to R 3 -models for arbitrary spin-N/2 . The states are then represented in the so called Majorana representation 18 5 ' , i.e., as N copies of So . Correct probabilistic behavior is then obtained by introducing entanglement between the N different "spin-^ systems". fc I.e., a "pointless" cr-fleld. In particular, it follows from the Loomis-Sikorski theorem 1 7 ' 2 0 that all separable
89 the set of these equivalence classes by M , a class in it will be denoted via a member of it as [B, n]. (ii) When setting M H M S := {M[B{K),ii\ \ [B(A),n] £ M } where M[B(A),fi] stands for all HMS with B(A') and \i' such that (S'(A'), fi') £ [B(A),/j] , we have that (B(A),/i) < (B'(A'),M') B,ndM[B(A),n] <MS M[B'(A'),n'] are equivalent 8i \ 3, Theorem 2. This then results in: Theorem 1. (M, <) and (M H M S ,< M S ) are isomorphic posets. One of the crucial ingredients in (ii) above and also in the proof for general existence with A = [0,1] is the following: when setting A M ( E , £ ) := {(B(Oe), Pp,e)\p € £, e G £} , we obtain that £, £ admits a HMS-representation with B(A) and \i if and only if AM(E, £) < (B(A),n), where the order applies pointwisely to the elements of A M ( E , £ ) 8 t , 4.2, Theorem 1. Using this and Theorem 1 above we can now translate properties of M to propositions on the existence of certain HMS-representations. We obtain the following: (i) (M, <) is not a join-semilattice, thus: In general there exists no smallest HMS-representation. As such we will have to refine our study to particular settings where we are able to make statements whether there exists a smallest one, and if not, whether we can say at least something on the cardinality of A. (ii) One can prove a number of criteria on A M ( E , £ ) that force (B(A),fi) ~ (S([0,1]), /i„) as such assuring existence of a smallest representation. Among these the following. Let Mfinite := {(B(X),^) € M J X is finite}. / / ^•finite Q A M ( £ , £ ) than A cannot be discrete. It then follows for example that quantum theory restricted to measurements with a finite number of outcomes still requires A = [0,1]. (iii) Let MJV := {(B(X),(i) 6 M | X has at most N elements} . J / A M ( £ , £ ) C M^r then there exists a HMS-representation with A — N . Thus, quantum theory restricted to those measurements with at most a fixed number N of outcomes has discrete HMS-representation. (iv) / / A M ( E , £ ) = MAT then there exists no smallest HMS-representation. Neither does it exist when fixing the number of outcomes. So there is no essentially unique smallest HMS-representation for ./V-outcome quantum theory. Although there exists no smallest and as such no canonical discrete HMSrepresentation we will give the construction of one solution for dichotomic (or propositional) quantum theory, i.e., N = 2, since this will constitute the core of the model presented in this paper. We will follow8"2, to which we also refer for a construction for arbitrary N. Let us denote the quantum mechanical probability to obtain a positive outcome in a measurement of a proposition or question a on a system in state p as Pp(a) — the outcome set consists here of "we obtain a positive answer for the question a", slightly abusively denoted

90 as a itself, and "we obtain a negative answer for the question a", denoted as -ia. Set inductively for A € N : c . <pa(p, X):=\

{

aa

iff P (n\ > A- 4- V * - 1 i(.Vc.(p W Z ^ + U=i 2>

^ -ia otherwise One verifies that for p,(X) := ^x we obtain the correct probabilities in the resulting HMS-model. This provides a discrete alternative for the above discussed E 3 -model for spin-i . The model, including the projection xp remains the same although we don't consider [a, ->a] as A anymore. Let A e A ' : = N . Set x„ := ( 1 - £)a+ (£)--i • For xp <E [a,x$[U[x$,x$[U[x%,x£[U... U = a [a;2A-i'~lQ;] w e se ^ f'a&ty > anc ^ PaiP'ty = ~}<x otherwise. Then, for p'0 := B(N) —»• [0,1] : {A} >-> ^ we obtain again quantum probability. Geometrically, this means that the values of A £ A, as compared to the first model where they represents points on the diagonal, i.e., a continuous interval, or, again equivalently, decompositions of an interval in two intervals, we now consider decompositions of an interval in 2A equally long parts, of which there are only a discrete number of possibilities. We refer t o 8 " for details and illustrations concerning. 3

Unitary, ortho- and projective structure

In the above discussed E 3 models, rotational symmetries where implicit in their spatial geometry. However, in general the decompositions of measurements over p: B(A) —> [0,1] go measurement by measurement so additional structure, if there is any, has to be put in by hand. It is probably fair to say that these contextual models only become non-trivial and useful when encoding physical symmetries within the maps tpa in an appropriate manner. For sake of the argument we will distinguish between three types of symmetries that can be encoded, namely unitary, ortho- and projective ones. i. Unitary symmetries: When considering quantum measurements with discrete non-degenerated spectrum we can represent the outcomes {OJ}J by the corresponding "eigenstates" {pi}i via spectral decomposition, i.e., there exists an injective map B(Oe) -t P(E) for each e € £. Then, specification of

< a i | + \a2 >< a2\

196 and \bi >< b\\ + |&2 > < b2\, where {|a; >}j=i,2 and {\bi >}i=i,2 are two orthonormal bases in E. Let (p be a state (normalized vector belonging to E). We can perform the following operation (which is well defined from the mathematical point of view). We expend the vector

}i=i,2 •

+p2\b2>,

(34)

where the coefficients (coordinates) Pi belong to G. As the basis {\bi >}i=i,2 is orthonormal, we get (as in the complex case) that: \p1\2 + \p2\2 = l.

(35)

However, we could not automatically use Born's probabilistic interpretation for normalized vectors in the hyperbolic Hilbert space: it may be that Pi $. G + (in fact, in the complex case we have C = C + ) . We say that a state ip is decomposable with respect to the system of states {|6j >}i=i,2 (S-decomposable) if Pi G G+ .

(36)

In such a case we can use Born's probabilistic interpretation of vectors in a hyperbolic Hilbert space: Numbers q; = \Pi\2,i = 1,2, are interpreted as probabilities for values B = bi for the G-quantum state tp. We now repeat these considerations for each state \bi > by using the basis {\o>k >}*=i,2- We suppose that each \bi > is ^-decomposable. We have: |&i > = / ? n k > +Pi2\a2 >, |&2 > = & i | a i > +p22\a2 > ,

(37)

where the coefficients Pik belong to G+. We have automatically: |/?n| 2 + |/?i 2 | 2 = l, |/?2i|2 + |/? 22 | 2 = l .

(38)

We can use the probabilistic interpretation of numbers p n = |/?n| 2 ,pi2 = |/3i2|2 and p 2 i = |/32i|2,P22 = \P22? • Pik is the probability for a - ak in the state \bi > . Let us consider matrices B = (Pik) and P = (pik)- As in the complex case, the matrix B is unitary: vectors u\ = (Pn,Pi2) and u2 = (p2i,P22) are orthonormal. The matrix P is double stochastic. By using the G-linear space calculation (the change of the basis) we get

Foundations of Probability and Physics

Read more

Foundations of probability and physics. Proceedings Vaxjo, 2000

Read more

Quantum Probability and Related Topics: Proceedings of the 30th Conference

Read more

Foundations Of Modern Probability

Read more

Foundations of Modern Probability

Read more

Logical Foundations of Probability

Read more

Foundations of Modern Probability

Read more

Logical Foundations of Probability

Read more

Foundations of Modern Probability

Read more

Foundations of Modern Probability

Read more

Logical foundations of probability

Read more

Nomic Probability and the Foundations of Induction

Read more

Nomic Probability and the Foundations of Induction

Read more

Nomic Probability and the Foundations of Induction

Read more

Foundations of the Theory of Probability

Read more

Foundations of the theory of probability;

Read more

Foundations of quantum physics

Read more

Foundations Of Quantum Physics

Read more

Philosophical Foundations of Physics

Read more

Foundations of Nuclear Physics

Read more

Philosophical Foundations of Physics

Read more

Foundations of Quantum Physics

Read more

Foundations of Physics

Read more

Foundations of Quantum Physics

Read more

Foundations of Physics

Read more

Foundations of quantum physics

Read more

Probability Foundations of Economic Theory

Read more

Mathematical Physics - Proceedings Of The XI Regional Conference

Read more

Mathematical Physics: Proceedings of the 12th Regional Conference

Read more

Mathematical Results In Quantum Physics: Proceedings of the QMath11 Conference

Read more

Recommend Documents

Foundations of Probability and Physics

Foundations of probability and physics. Proceedings Vaxjo, 2000

Quantum Probability and Related Topics: Proceedings of the 30th Conference

Foundations Of Modern Probability

Preface Some thirty years ago it was still possible, as Lo`eve so ably demonstrated, to write a single book in probabil...

Foundations of Modern Probability

Logical Foundations of Probability

Seite 1 von 12 [reposted March 24, 1997] Boardman's Notes: The Criteria of Adequacy of An Explication, illustrated by ...

Foundations of Modern Probability

Preface Some thirty years ago it was still possible, as Lo`eve so ably demonstrated, to write a single book in probabil...

Logical Foundations of Probability

Foundations of Modern Probability

Foundations of Modern Probability Olav Kallenberg Springer Preface Some thirty years ago it was still possible, as L...

Foundations of Modern Probability

...