The foundations of quantum mechanics: historical analysis and open questions - Cesena 2004; Cesena, Italy, 4 - 9 October 2004

The Foundations of Quantum Mechanics Historical Analysis and Open Questions - Cesena 2004 editors Claudio Garola • Arca...

Author: Claudio Garola | Arcangelo Rossi | Sandro Sozzo

6 downloads 248 Views 17MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

The Foundations of Quantum Mechanics Historical Analysis and Open Questions - Cesena 2004

editors Claudio Garola • Arcangelo Rossi • Sandro Sozzo

The Foundations of Quantum Mechanics Historical Analysis and Open Questions - CBsBna 2DD4

This page is intentionally left blank

The Foundations of Quantum Mechanics Historical Analysis and Open Daestions - Cesena 2D04 Cesena, Italy

4 - 9 October 2004

editors

Claudio Garola Arcangelo Rossi Sandro Sozzo University of Lecce, Italy

\fc World Scientific NEW JERSEY • LONDON • SINGAPORE • BEIJING

• SHANGHAI

• HONG KONG • TAIPEI • CHENNAI

Published by World Scientific Publishing Co. Pte. Ltd. 5 Toh Tuck Link, Singapore 596224 USA office: 27 Warren Street, Suite 401-402, Hackensack, NJ 07601 UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE

British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library.

THE FOUNDATIONS OF QUANTUM MECHANICS Historical Analysis and Open Questions — Cesena 2004 Copyright © 2006 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording or any information storage and retrieval system now known or to be invented, without written permission from the Publisher.

For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required from the publisher.

ISBN 981-256-852-2

Printed in Singapore by World Scientific Printers (S) Pte Ltd

SCIENTIFIC COMMITTEE Silvio Bergia and Giorgio Dragoni (University of Bononia) Claudio Garola and Arcangelo Rossi (University of Lecce) Vincenzo Fano and Gino Tarozzi (University of Urbino) Franco Pollini (Commune of Cesena)

ORGANIZING COMMITTEE Lucio Rizzo and Sandra Sozzo (University of Lecce) Isabella Tassani (University of Urbino)

SECRETARY Maria Concetta Gerardi (University of Lecce)

SPONSORING INSTITUTIONS

M M * WAITS "rrvrvoRVM

University of Bononia UNWERSITA

University of Lecce

UNIVCmiA' DB5U5TUDI Dl URBINO C ^ I C BO

University of Urbino

Commune of Cesena

Physics Department of the University of Bononia

i lUI/'SBJ ffa'l H.I-rf.JI'ji J/fll.i

•Ptpitftwicuiy. Ji 7 ' u y j

Physics Department of the University of Lecce Dtpartimento di Fiska

wepim

Cesena Interuniversitary Research Center on Philosophy and Foundations of Physics

This page is intentionally left blank

CONTENTS Introduction C. Garola, A. Rossi and S. Sozzo

1

If Bertlmann had Three Feet A. Afriat

18

Macroscopic Interpretability of Quantum Component Systems R. Ascoli

23

Premeasurement versus Measurement: A Basic Form of Complementarity G. Auletta and G. Tarozzi

40

Remarks on Conditioning E. G. Beltrametti

48

Entangled State Preparation in Experiments on Quantum Non-Locality V. Berardi and A. Garuccio

61

The First Steps of Quantum Electrodynamics: What Is It That's Being Quantized? S. Bergia

68

On the Meaning of Element in the Science of Italic Tradition, the Question of Physical Objectivity (and/or Physical Meaning) and Quantum Mechanics G. Boscarino

80

Mathematics and Epistemology in Planck's Theoretical Work (1898-1915) P. Campogalliani

92

On the Free Motion with Noise B. Carazza and R. Tedeschi

103

Field Quantization and Wave/Particle Duality M. Cini

110

vm Parastatistics in Econophysics? D. Costantini and U. Garibaldi

118

Theory-Laden Instruments and Quantum Mechanics S. D 'Agostino

130

Quantum Non-Locality and the Mathematical Representation of Experience 142 V. Fano On the Notion of Proposition in Classical and Quantum Mechanics C. Garola and S. Sozzo The Electromagnetic Conception of Nature and the Origins of Quantum Physics E. A. Giannetto

156

178

What We Talk About When We Talk About Universe Computability S. Guccione

186

Bohm and Bohmian Mechanics G. Introzzi and M. Rossetti

197

An Objective Background for Quantum Theory Relying on Thermodynamic Concepts L.LanzandB. Vacchini

210

The Entrance of Quantum Mechanics in Italy: From Garbasso to Fermi M. Leone andN. Robotti

225

The Measure of Momentum in Quantum Mechanics F. Logiurato and C. Tarsitani

238

On the Two-Slit Interference Experiment: A Statistical Discussion M. Minozzo

248

Why the Reactivity of the Elements is a Relational Property, and Why it Matters V. Mosini

260

IX

Detecting Non Compatible Properties in Double-Slit Experiment Without Erasure G. Nistico

274

If You Can Manipulate Them, Must They Be Real? The Epistemological Role of Instruments in Nanotechnological Research A. Rebaglia

281

Mathematical Models and Physical Reality from Classical to Quantum Physics A. Rossi

293

Complex Entanglement and Quaternionic Separability G. Scolarici and L. Solombrino

301

Mach-Zehnder Interferometer and Quantitative Complementarity C. Tarsitani and F. Logiurato

311

Antonio Gramsci's Reflection on Quantum Mechanics /. Tassani

320

The Role of Logic and Mathematics in the Heisenberg Formulation of Quantum Mechanics A. Venezia

335

Space-Time at the Planck Scale: The Quantum Computer View P. A. Zizzi

345

Three-Dimensional Wave Behaviour of Light F. Logiurato, B. Danese, L. M. Gratton and S. Oss

359

INTRODUCTION

The Conference entitled The Foundations of Quantum Mechanics. Historical Analysis and Open Questions - Cesena 2004 was held in Cesena (Italy) from October 4 to October 9, 2004, and was the fourth of a series that began with a Conference in Camerino (October 31-November 3, 1988) and continued with two Conferences in Lecce (October 5-8,1993, and October 13-16, 1998). All Conferences had the same title, in order to underline their ideal continuity. Indeed, they all were conceived as interdisciplinary meetings among Italian researchers (physicists, logicians, mathematicians, historians and epistemologists) concerned with the history, the structures and the foundational problems of quantum mechanics (QM). As we have already stressed in the Introductions to the previous Conferences, the idea of grouping together scholars coming from different disciplines was suggested by the huge number of open mathematical, philosophical, historical and epistemological questions in QM, which have raised the interest of many researchers having unlike cultural backgrounds. The main aim of the Conference was then comparing the various perspectives, favouring reciprocal understanding and setting up a common language that could help crossed fertilization of ideas. Of course, this aim has been only partially achieved, but every issue of the Conference has registered some significant progress. New proposals for the solution of old and more recent problems have been forwarded, and the propensity to compare different disciplinary perspectives has become more widespread. This is witnessed also by the papers collected in the present volume, some of which clearly have an interdisciplinary character. All papers are published under the responsability of the authors, but the editors have made the effort to revise and discuss many of them with the authors in order to offer a better and more understandable final product. To this end, we also comment briefly on all articles in this Introduction, so that the reader may have an overall perspective on the content of the book. In order to facilitate readability, the papers will be grouped here, rather subjectively, into three main classes. In the first class we collect "technical" papers that propound new solutions or viewpoints within the framework

1

2

of QM, or some generalization of it. In the second class we group those papers that have an "interdisciplinary" character, i.e., discuss QM problems introducing perspectives or notions pertaining to different disciplines. In the third class we collect the papers having a more historical, philosophical or epistemological character. Let us begin with the first class, the class of "technical" papers. The paper by A. Afriat argues against the common belief that conservation accounts for quantum correlation. The author notes that people who uphold this thesis actually have in mind conservation plus an additivity condition, and that quantum correlation has nothing to do with time, so that it should follow from the additivity condition only. Afriat recognizes that this actually occurs whenever a compound system made up by two component subsystems is considered. Yet he shows - by analyzing an entangled state of a compound system made up by three component subsystems that additivity alone is not sufficient to account for quantum correlation in the general case (we add here that the author's reasonings can be modified in order to apply to classical mixtures, attaining similar conclusions, so that they seem to be independent of the specific, non classical features of quantum correlation). The paper by R. Ascoli consists of three interconnected contributions. In the first of them the author argues that three main physical paradigms are introduced in QM whenever the interactions between quantum systems and their macroscopic environments (in particular, measurements and measuring processes) are discussed. The three paradigms have increasing complexity and the author's treatment shows that according to the third paradigm the boundary between a quantum system and its environment can be shifted by constructing a quantum model of a part of the environment. This shifting expresses the general idea of the Universality of Physics, which also guides the second and third contributions in the paper. In the former, Ascoli propounds possible definitions of semimacroscopic and macroscopic interpretability of quantum subsystems. This proposal introduces the latter contribution, which reports two theorems stated by the author some years ago and still unpublished. The theorems consider a quantum system undergoing an external process T and supply conditions for semimacroscopicity and macroscopicity of the environment of T. These conditions provide a satisfactory support to the aim at Universality of Physics quoted above, since they show that quantum models of the measurement satisfying final component system semimacroscopicity or macroscopicity can always be obtained.

3

The paper by E. Beltrametti deals with the notion of conditional probabilities in statistical physical theories. To this end, the author refers to the very general notion of convexity model, which encompasses the standard classical and quantum frames, and also includes operational (or unsharp) QM. Beltrametti considers in particular a special kind of convexity model called operational probability theory (OPT). The OPT frame indeed generalizes the standard classical case and also the standard and operational quantum schemes, introducing an enlargement of the usual class of random variables in order to encompass indeterministic features. In this general frame a plurality of conditional probabilities can be introduced, which mirrors the non uniqueness of the way in which a joint measurement of two observables can be performed. The author shows that the standard choice of the conditional probability in QM can be ascribed to the so-called Luders-von Neumann recipe, which rests on a strong, not always realistic, idealization of the measurement process. The paper by V. Berardi and A. Garuccio considers the experiments that have been conceived in the last decade in order to test Einstein's locality via Bell-type inequalities by using pairs of correlated photons emitted by spontaneous parametric down conversion processes. These experiments can be divided in two classes: those in which the emerging photons have the same linear polarization (type I) and those in which the polarizations are orthogonal (type II). The authors focus their attention on type I experiments and note that photon pairs may exist in the quantum state that is introduced which travel along the same channel and reach only one of the final polarizers. This implies that the conditions that allow one to deduce the inequality obtained by Clauser and other authors, which is commonly used to test Einstein's locality, are not fulfilled. The aforesaid inequality must then be replaced by another inequality, which is explicitly written down by Berardi and Garuccio but cannot be violated by the joint transmission probabilities predicted by QM. Hence, the authors argue that type I experiments actually cannot discriminate between QM and local realism. They also show that this conclusion can be supported by further theoretical arguments and criticize some attempts at circumventing the above difficulties by introducing ad hoc assumptions. The paper by B. Carazza and R. Tedeschi analyzes the effect of a thermal background on the free motion of a classical mesoscopic particle, modeling the external noise through a longitudinal force field. The authors show in particular that, asymptotically, the position values are dispersed with a mean square deviation which increases with the time elapsed after

4

the detection of the particle, while the mean square deviation of the velocity values vanishes after an initial growth. More generally, the authors' results suggest that some quantum features may be simulated by resorting to an external noise, which at first sight seems to confirm the old idea that nonrelativistic QM can be interpreted as describing a kind of Brownian motion. However, Carazza and Tedeschi point out in their conclusions some technical features that disprove this hypothesis. The paper by M. Cini provides a syntesis of some previous articles in which the author carried on a research program whose origin can be traced back to Wigner and Feynman. Cini generalizes the formalism of classical statistical mechanics in phase space, introducing an uncertainty and a discreteness postulate which imply mathematical constraints on the set of variables in terms of which any physical quantity can be expressed. These constraints entail that all variables must be represented by Dirac q-numbers and allow the author to recover the Wigner function obtained from the standard wave function of the state. By considering such a function as a pseudoprobability which can assume also negative values, one can then eliminate the problematical Schrodinger waves from QM. The author generalizes this approach to quantum field theory by applying the same procedures with simple changes and imposing Einstein's quantization to the states of a classical field. Within this generalized perspective the quantization of quantum variables is a consequence of the existence of field quanta, so that the formal rules of nonrelativistic QM follow from firstly quantizing quantum field theory. The wave-particle duality can thus be interpreted as reflecting the dual nature of the quantum field as a unique physical entity, objectively existing in ordinary three-dimensional (or fourdimensional, relativistic) space. The paper by G. Introzzi and M. Rossetti provides an overview on the de Broglie-Bohm model. The authors firstly resume the essentials of the model, in which a quantum potential plays a basic role, and briefly explain why the model can be considered empirically equivalent to standard QM. Secondly, they summarize the modern approach to the de BroglieBohm model, usually known as Bohmian mechanics. This approach avoids the introduction of a quantum potential and provides an instructive picture of physical reality that highlights the similarities and the differences between Bohmian and Newtonian mechanics. The treatment of the double slit experiment according to Bohmian mechanics is then summed up by the authors, who remind that it yields predictions about the wave function that do not differ from the predictions of standard QM. Moreover, Bohmian

5 mechanics suggests an interesting generalization of the Bohr complementarity principle, firstly propounded by Greenberger and Yasin in 1988 and successively confirmed by photon and neutron interference experiments. Finally, Introzzi and Rossetti briefly discuss some relevant features of the de Broglie-Bohm model, as causality, determinism, realism, nonlocality and holism. They point out in particular that a Lorentz-invariant formulation of the model is still lacking and that the model is realistic as long as the only measured quantities are positional ones, while all other variables are contextual and therefore not realistic. The paper by L. Lanz and B . Vacchini deals both with the foundations of QM and the quantum theory of non-equilibrium systems. Regarding the foundations of QM, Ludwig's point of view is endorsed, according to which an axiomatic approach to quantum theory must be based on the prerequisite that an objective description of statistical experiments can be given in terms of a phenomenologically established "pretheory" (not necessarily classical mechanics). The still debated problem of measurement arises if one forces, may be too naively, this objective pretheory inside quantum theory itself, taking the latter as the ultimate theory. The authors conjecture that termodynamic concepts should have a basic role in the formulation of the pretheory and consider a formalism accounting for non-equilibrium termodynamics, introducing the required objectivity elements by classical fields representing "state parameters" for local equilibrium systems. Recalling that Zubarev's approach to non-equilibrium quantum termodynamics provides a deterministic dynamics for these parameters inside quantum field theory, Lanz and Vacchini argue however that one cannot expect a general validity of such a method, just because of the evidence of the physical role of microsystems. Thus, they discuss how a breakdown of a deterministic regime for state parameters can be linked with the emergence of microsystems as the seeds of a stochastic situation: QM then appears, already in a framework which completes it, as the basic tool for the description of the stochastic regime. F. Logiurato and C. Tarsitani present two papers in this book. In the first paper (which is relevant, in particular, for a critical approach to the didactics of QM) they note that the complementarity and the uncertainty principles, though fundamental in QM, are still rather vaguely and imprecisely stated in some textbooks. The authors focus on the uncertainty principle, point out some common ambiguities in its enunciations in the literature, and observe that an operational definition of momentum is often neglected. Therefore they propose a definition based on the measurement

6

of the "flight time" of the particle, and show that it allows one to deduce the de Broglie relation from the wave function instead of postulating it and to obtain the momentum distribution amplitude as the Fourier transform of the wave function at the initial time. Logiurato and Tarsitani then apply their definition to the single slit diffraction experiment and find again the above results in this special case, establishing a connection between the deduction of the uncertainty principle by means of experiments and by the method of Fourier transforms. They note however that both Heisenberg and Bohr adopted in their thought experiments definitions of Ax and Apx which do not agree with the standard definitions of these quantities as variances. Therefore they propose to take this difference into account by distinguishing two kinds of uncertainty relations in the literature. In their second paper (in which the names of the authors appear in reverse order) C. Tarsitani and F. Logiurato consider a problem that has been recently debated in the literature, i.e., the existence of doubleslit experiments that provide "which-way" information without momentum transfer to the physical objects under examination, which makes it difficult to explain the loss of interference effects. The authors consider a MachZehnder interferometer and show that also in this case one can introduce observable magnitudes that allow one to define the notions of visibility (the capability of recognizing the interference effects), predictability (the capability of predicting which path the object will choose) and distinguishability (the capability of inferring which path the object has chosen after having gone through the interferometer). By using these magnitudes one can provide a quantitative formulation of the wave-particle dualism and the Bohr complementarity principle. Moreover, the operators corresponding to them do not commute and are linked by inequalities that resemble Heisenberg's uncertainty relations. Hence, one can give a mathematical expression of the relationship between two fundamental principles of QM, which thus appear as two sides of the same coin. One can also generalize, at least in twodimensional Hilbert spaces, the complementarity relation, and explain the interference loss by means of uncertainty relations involving incompatible observables that depend on the actual experimental conditions and may not coincide with position and momentum. The paper by G. Nistico briefly reviews the ideal double-slit experiment proposed by Englert, Scully and Walther (ESW), that aims to detect which slit each particle passes through (briefly, to detect the WS property), then measuring the point of impact on the final screen. Of course, whenever repeated experiments are performed, no interference pattern ap-

7

pears on the screen. Nistico then focuses his attention on the erasure phenomenon, exemplified by ESW by constructing another property of the system, incompatible with the ESW property, that can be measured without destroying interference but losing knowledge about the WS property. The author wonders indeed whether there are physical properties that are incompatible with the WS property but can be detected, for a given state of the physical system, without erasing WS knowledge. His answer to this question is positive, as proved in some previous articles. In addition, Nistico shows in his paper that, if the dimension of the Hilbert space of the physical system is suitably chosen, an ideal experiment can be contrived which makes it possible to detect a property incompatible with the WS property not only without erasure but also without correlation with this last property. The paper by G. Scolarici and L. Solombrino considers the evolution of a compound quantum system from the viewpoint of quaternionic quantum mechanics (QQM). It is well known that in complex quantum mechanics (CQM) a state of the whole system undergoes unitary evolution, while the evolution of the reduced density matrices that can be associated (via partial trace) to the component subsystems is generally not unitary. The authors show in a special case (evolution from a separable to an entangled state of a physical system made up by two s p i n - | subsystems) that the evolution of the states of the component subsystems can be described in QQM by quaternionic unitary maps, and that the non-unitary maps describing evolution in CQM can be obtained as complex projections of the corresponding quaternionic maps. Furthermore, Scolarici and Solombrino provide a description of the final state of the whole system which strongly suggests that this state should be considered separable in QQM. This establishes a relevant difference between CQM and QQM, and deserves further research (we add that such a difference could also constitute an argument in favour of QQM). Let us come now to the second class, the class of "interdisciplinary" papers. The paper by D . Costantini and U. Garibaldi propounds a general approach to equilibrium probability distributions (generalized Polya distributions) in the framework of which various kinds of uniform probability distributions on the occupation vectors can be recovered, as MaxwellBoltzmann's statistics or Gentile's parastatistics. It is well known that only limiting cases of parastatistics are needed in QM for describing equilibrium probability distributions of elementary particles, i.e., the Bose-Einstein

8

and the Fermi-Dirac statistics, and these statistics can be easily obtained as particular cases within the model propounded by the authors. However, there are other fields of research in which non-uniform distributions apply (that generalize, in some sense, Gentile's parastatistics) which are also described within the Costantini and Garibaldi approach. In particular, there are economic agents in econophysics whose correlated behaviours are described by typically non-uniform probability distributions. The authors provide an example ("the ants of Kirman") and an application to stock price dynamics in order to illustrate this point, and conclude that, more generally, the behaviour of economic agents may by characterized by high correlation that can possibly be handled with an adequate probabilistic description. The paper by C. Garola and S. Sozzo inquires the notion of proposition in classical mechanics (CM) and QM. The authors note that the term proposition usually denotes an element of standard quantum logic (QL) in QM, and that the physical interpretation of such propositions is problematical since they cannot be associated with sentences of a predicate calculus, following known logical procedures. Indeed, the elementary sentences of this calculus should attribute physical properties to individual samples of physical systems, which may be meaningless because of nonobjectivity of physical properties within ortodox QM. Garola and Sozzo show that this difficulty can be removed by adopting the SR (semantic realism) interpretation of QM propounded by one of them, since physical properties are objective according to this interpretation and a unified perspective can be adopted for introducing propositions both in QM and in CM (more generally, in any physical theory T ) . One can thus construct a simple first order predicate calculus C(x) with classical (Tarskian) semantics, associating a set of physical states (called physical proposition) with every sentence of C(x). The set Vf of all physical propositions is partially ordered, and contains a subset Vy of testable physical propositions whose order structure is determined by the criteria of testability established by the theory T. In particular, V? is a Boolean lattice within CM, while it is a standard QL within QM. Hence, QL can be interpreted as a structure formalizing the properties of the notion of testability within QM, and this interpretation proves to be equivalent to a previous pragmatic interpretation of QL provided by one of the authors. Moreover, it sheds some light on the concept of quantum truth underlying standard QM, showing that it does not conflict with the classical notion of truth. Finally, the authors point out that the above results can be embodied within a more general perspective

9

which considers states as first order predicates of a broader language with a Kripkean semantics. The paper by M. Minozzo propounds a new statistical discussion of the two-slit interference experiment for the ideal situation in which particles are sent sequentially (i.e., one after the other) through the interfering barrier toward the screen. The author's treatment is carried out adopting the standard axioms of Kolmogorov's probability theory, and fully exploits the sequential nature of the experimental observations. Altough a "classical" purely particle toy model is presented to explain the interference pattern and the non-additivity paradox which arises when comparing the interference pattern with the patterns obtained by closing one slit at a time, the main point of the contribution resides in the analysis of the actual experimental observations from a rigorous statistical point of view. Minozzo aims to show that, when the analysis is carried out correctly, at variance with some statistical investigations in the literature, interference experiments can be explained using standard statistical tools, without introducing waves and, in particular, without using QM. His contribution neither tries to reproduce the standard theoretical results of QM nor to provide any definite physical theory: rather, it proves, according to him, that QM is not the unique theory that may explain experimental observations and that, moreover, it has some well defined limits. Of course, it remains to establish whether the toy model mentioned above provides a convincing physical explanation of the experimental data. The paper by V. Mosini upholds that that existence of relational properties in many branches of science strongly suggests to revise the standard notion of realism, introducing a dynamic picture of reality which may change with the progress of scientific knowledge. This perspective was firstly worked out by Margenau, mainly bearing in mind QM, in his 1950 book entitled The nature of physical reality, but it was ignored or criticized by his contemporaries. Yet, some relevant features of Margenau's philosophy have been reproposed independently by several authors in the last twenty years. Mosini argues that the deep reason underlying this r e appearance is the widening of the domain of existence of relational properties in different areas of science, for new properties of this kind emerge whenever the increase of scientific knowledge leads scholars to consider more and more complex systems. As an instance, Mosini discusses in some details the case of chemical valence, showing that it is a relational property, since the same element may display different valences in its different compounds, and explaining how this may occur.

10 The paper by P. Zizzi deals with space-time at the Planck scale. As other authors, Zizzi assumes that space-time has a discrete structure at this scale, but adds the issue of quantum information propounding a Quantum Computer View (QCV) according to which each pixel of the Planck area encodes a qubit. The set of qubits forms a quantum memory register, and the information stored in the memory is processed by a network of quantum logic gates (unitary operators) that is part of quantum spacetime itself, since it describes its dynamical evolution. Zizzi claims that this model implies some interesting features of quantum space-time: reversibility of dynamical evolution, nonlocality of space-time itself at the Planck space, etc. She, however, points out some problems in her model and suggests how they can be solved. In particular, she maintains that the nonlinearity of the macroscopic level can be obtained from the linearity at the Planck scale level by considering self-organizing models. She also thinks that macroscopic irreversibility can be reconciled with the reversible dynamical evolution at the Planck scale by assuming Wheeler's picture of "space-time foam". At the end, Zizzi conjectures that micro-causality is missing at the Planck scale, but stresses that, notwithstanding its weird features, space—time seems to be able to compute its own dynamical evolution by quantum evaluating recursive functions. Finally, let us comment on the third class, the class of historical, philosophical or epistemological papers. The paper by G. Auletta and G. Tarozzi considers the three forms of duality that gave rise, historically, to the main conceptual problems in QM: (i) waves versus particles; (ii) deterministic versus stochastic dynamics; (iii) local measurements versus nonlocal correlations. The authors note that some connections between (i) and both (ii) and (iii) have already been established in the literature. Then, they provide some simple examples that lead them to argue that (ii) can be reduced to a more fundamental complementarity between "premeasurement" and "measurement". Since a further connection can be established between this complementarity and (iii), the authors conclude that all forms of duality in QM can be considered as different expressions of a unique fundamental complementarity. The paper by S. Bergia aims to recall scholars' attention on some interpretational problems in quantum electrodynamics (QED) and quantum field theory that were already discussed by Jordan and Dirac, and are yet still open. Bergia deals in particular with QED, and considers the positions of the above authors with reference to three basic issues: the difference between the quantization of the electromagnetic and the matter fields, the

11 dichotomy between waves and light quanta, the question about what quantizing a field actually means. Bergia observes that Dirac explicitly stressed the difference between "light waves" and "de Broglie or Schrodinger waves", and intended to quantize the electromagnetic field only. He was aware that no quantum mechanical wave equation exists for the photon, while QED instead introduces "complete armony between the wave and the light quantum description of the interaction". Moreover, Dirac built up the theory from the light-quantum point of view, proceeding in a direction opposite to Jordan's, who instead intended to apply QM to the Maxwell field itself. Bergia accepts the conclusion, propounded by other authors, that Jordan merely proved that the energy states of the field were quantized, which has nothing to do with the quantization of substantial Maxwell's fields. On the contrary, Dirac early succedeed in introducing a variable Nr that can be identified with the number of energy quanta in the state r arising from the quantization of a wave field. Finally, Bergia observes that Dirac also prefigured the commutation rules holding between number of photons and phase operators, and tries to reconciliate Feynman's insisting on the particle behaviour of light with Dirac's perspective. The paper by G. Boscarino considers the Einstein-Podolsky-Rosen (EPR) well known definition of "elements of physical reality" as a contradictory attempt to use an empiricist and operationalist conception of reality in order to criticize QM's claim to be a complete theory. The author maintains that Bohr's position, which affirms the completeness of QM by philosophically expliciting just the same empiricist and operationalistic conception as the most adequate to view quantum facts, is more consistent. According to Boscarino, however, an appropriate discussion of QM must be made, contrary to both Einstein and Bohr, by making explicit a different philosophical interpretation, going back to the old classical philosophicalepistemological tradition called by him "Italic" (which was diffused mainly in Magna Grecia, in ancient Italy, by Pythagorians). This interpretation was at the basis of classical physics, which maintains that one must distinguish between empirical measurements of physical quantities and physical properties in se: these have reality, even though they are not completely known. Thus, empirical measurements are only partial clues and incomplete proofs of their reality (cp. the distinction between relative and absolute space and time according to Newton). Only in this way it is possible to avoid contradictions and ambiguities of philosophical nature, not only as Bohr's complementarity, but also as EPR's "empirical" definition of elements of physical reality.

12 The paper by P. Campogalliani stresses a methodological point in science studies, opposing the neopositivistic approach: the presence of a third dimension beyond empirical and analytical knowledge, the dimension of metaphysical knowledge. This is a priori, interpretative and analogical, and also influences mathematics, which is usually identified with pure analytical knowledge, through imaginative efforts. This point of view, which enhances the role of informal thought also inside the formal one, is applied by the author to Planck's elaboration of radiation theory (1898-1915), where also the mathematical developments were influenced by conceptual models and analogies. In particular, the introduction of the statistical treatment was not justified, as in Boltzmann, by a mechanical interpretation of the second law of termodynamics, but by a microscopic hypothesis on natural radiation which only formally converged with the former, since it was conceptually very different. Planck in fact, according to his realistic metaphysics, aimed at a larger unification of physics and causal explanation, both before and after the introduction of quantum of action and discontinuity. Hence he considered the statistical approach, though elaborated in terms of phase space and sophisticated as Boltzmann's, as only temporary and provisional. The paper by S. D'Agostino starts from the known thesis that observation and experiments are theory laden in physics and studies the relationships between a primary (physical) theory and the instrumental theories that must be necessarily worked out whenever one wants to test the primary theory. Since every new physical theory includes a number of older physical theories (that can be obtained as special cases, or bringing to a limit appropriate parameters) in its so-called correspondence area, the author argues that instrumental theories associated with a given primary theory actually belong to its correspondence area. When this general viewpoint is applied to quantum measurements, the "objectification process" can be interpreted in physical terms according to the author. Indeed, D'Agostino conceives this process as a consequence of the physical process of recoursing to an objective instrumental theory as a procedure for testing the primary theory, i.e., QM. This should avoid, in particular, any need of interpreting objectification as following from the disturbance that is inherent in a measurement process. The problem remains, however, as acknowledged by the author, of reducing hard facts to instrumental theories apt to test primary theories, since the author rejects inductivist, ontological, or simply justificationist (as Popper's falsificationism) views, in favor of a more sophisticated semantic modelistic approach.

13 The paper by V. Fano proposes to compare the old question of the conflict between the qualitative character of experience and the numerical character of mathematical physics (notwithstanding the fact that the predictions of the latter refer to the former) with this violation of Bell's inequality. Indeed, also this violation breeds a conflict between the common sense locality (or factorizability principle) and the nonlocality following from the mathematical formalism of QM. Since the violation is empirically confirmed, the author tends to consider it as favouring an empiricist conception of the relation between experience and mathematical physics. In fact, Bell's inequality violation can neither be reduced to a mere statistical correlation without a common cause, nor to a mere consequence of the mathematical representation: rather, it seems a hybrid between the empirical reality and the scientific representation derived from it, in agreement with the empiricist outlook. This suggests that empiricism can give a satisfactory reply to the problem of the relation between experience and mathematics in general. Indeed it interprets the latter as a result of a process of idealization of experience, thus opposing platonism and differing from critical materialism (which makes reality beyond experience interact with supervenient physiological human structures, so creating mathematical representations) and operationalism (which reduces experience and mathematics to mere operations without any cognitive value). The paper by E. Giannetto traces back the origins of QM to the electromagnetic conception of nature, contrary to the mechanistic current view which finds out them in the extension of atomism and corpuscolar dynamics from matter to radiation (cp. Einstein). According to the author, only the electromagnetic conception, with its field view, could instead explain the peculiar statistical character of quantum phenomena: these were indeed introduced by Planck in his generalized thermodynamical approach as discontinuities of the universal electromagnetic field (cp. Larmor), expression of a "new mechanics" founded on electromagnetism both in relativity and in quantum physics (cp. Poincare). This conception agreed with Heisenberg's final formulation of QM which, because of its operational character, concentrated on electromagnetic more empirically detectable quantities, conceiving mechanical quantities, that are generally more indeterminate, as derived from the former. Giannetto's final argument is a frankly philosophical and also ethical one: the electromagnetic view favours an attitude towards nature, conceived as a dynamical and living reality, that is more respectful than the mechanistic one, which instead conceives nature as inert, dead and mechanical.

14

The paper by S. Guccione makes an effort to find out a minimal condition for the definition of Computable Universe, given that a general definition of Computable Universe is not available. The author firstly notes that it is necessary to consider a physical theory as a continuum expressed by real numbers, not limiting it to natural numbers as the computable "Turing machine" which constitutes the first model of computability. Then, Kreisel's definition of computability is adopted by Guccione, who adds to it a uniform law condition which should allow one to describe all physical processes. Notwithstanding this, it seems that theories still exist (in particular quantum gravity), which are incomputable even though they may be considered complete according to a suitable semantics. On the other side, a "theory of everything" should be computable as such, but it is not yet available (apart superstring theory, which candidates to this role). Then we must limit our discussion to universal constants in physical theories, inquiring whether they are computable or only measurable (that is, more and more approximable through different calculation recipes, not through a single rigorous one). Moreover, the physical constants seem to be not really constant, or at least, as the velocity of light, only conventionally constant. A more informative (though incomputable) theory on the universe could be quantum gravity, which, according to Hawking, may even recover, by quantum fluctuations, the information lost in black holes. Anyway, as constants may actually be slowly variable quantities and their measurability is not computability, we must content ourselves of a non perfectly uniform nature (according to Kreisel). So, the effective procedure may vary with the constant, and there may be also many procedures for a single constant. The paper by M. Leone and N. Robotti contributes to the historical reconstruction of the entry of QM in Italy. It underlines the mainly instrumentalist approach of the first Italian researchers on QM, Garbasso and Brunetti, who worked on spectroscopy (an area of research which was then very advanced in Italy), after the explicit rejection of the new theory by the authoritative physicist Corbino. The experimentalist-instrumentalist approach by Garbasso is confirmed by his subsequent attempt to interpret the results he had obtained by applying the new theory (in particular, the Stark-Lo Surdo effect) in terms of Thomson's classical theory of atom. On the contrary, Brunetti's contribution was not followed by any attempt at reinterpreting her results classically, even if her attitude was also basically instrumentalist. Anyway, the full entry of QM in Italy was due, some years later, in 1919, to Fermi. As confirmed by some documents and manuscripts in Pisa and Chicago archives, Fermi quite independently approached the

15 new physics, both from a theoretical and from an experimental viewpoint. He contributed to it in his undergraduate years in Pisa with theoretical accuracy, freedom of mind and experimental competence, well before his 1926 epoch-making contribution to quantum statistics. The paper by A. Rebaglia faces the crucial problem of the role of experiments at nanoscale (even at a single particle level), where all phenomena are, without exceptions, to be explained in quantum mechanical terms. Following Hacking's thesis, according to which we can have an evidence of reality of entities in science through their efficacy in actively manipulating with success other known realities, the author attributes a reality not only to empirical entities but also to theoretical entities, thus opposing a mere registration view of scientific truth. As an example, the wave function and the superposition of individual particle positions are real entities, since they contribute to the working, in particular, of a nanotechnological device as the scanning-tunnelling microscope, for which they are in fact theoretical starting points. Notwithstanding the appeal of this active transformative view of science, according to which we can also reproduce, by creating new devices, the creative processes of nature, the author recognizes that the problem remains of still distinguishing two different activities: first, experimenting on entities to get evidences; second, creating new technological devices by reproducing natural processes for practical use, so also giving reality to purely theoretical entities. The paper by A. Rossi underlines how the transition of the physical object from substance to function, which is to be traced back to the rise of modern physics (relativity and quanta), was made by maintaining the role of properties as constituents of any physical system, irreducible, as such, to mere measurements. In particular, the formalization of QM by von Neumann in terms of operator calculus in Hilbert space was not derived from empirical information acquired through measurements, but preceded its physical interpretation, and its application could be extended well beyond the empirical interpretation of projectors adopted by von Neumann himself. Even more, Dirac distinguished formal quantum properties from classical ones in algebraic terms, irreducible to mere measurements, though he left ambiguous the physical interpretation of his abstract formalism (particularly of his "delta-function"). The way out from the ambiguity consists in opposing the frank acknowledgement of the existence of physical properties (as Dirac's himself magnetic monopoles or non-simultaneously measurable properties in QM), which are not accessible (empirically decidable) and reducible to instrumental operations and theories, either to pragmatist re-

16

ductions or to falsificationist conceptions. The latter in particular, though avoiding to reduce physical theories and properties to mere instrumental ones, yet still consider them as quite empirically decidable (falsifiable). The paper by I. Tassani aims to analyse Gramsci's thought about microphysics as a part of a wider plan of reconstructing the cultural and philosophical European climate between the two world wars, when QM, as relativity theory (RT), was conceived and spread. Tassani stresses that QM was, contrary to RT, a difficult topic for philosophical discussion, as it was non-visualizable, complex and contrasting with common sense. In particular, these difficulties were marked in Italy, where a dominating idealistic philosophy either was not interested in new physical theories or tended to interpret them in purely immaterialistic and subjectivistic terms. Gramsci instead, though he shared the idealistic criticism of being a kind of superstiction to the positivistic exaltation of science, duly evaluated the scientific knowledge as a part of human labour and culture, thus overcoming immaterialism and subjectivism in terms of critical realism and intersubjectivity. He thus met the most advanced reflection on QM, even if he had little access to the literature since he was prisoner in fascist jails. He also underlined the relevance of linguistic considerations for avoiding paradoxical conceptions (so agreeing with some typical neopositivistic attitudes) and the impossibility of eliminating from science (inter)subjective conditions. It is then relevant to remind that Bukharin, the Sovietic exponent of historical materialism, in his contribution to 1931 London International Congress on History of Science claimed that science is only conditioned by economical-social factors, criticizing the subjectivistic intromissions of supposed religious origin. On the contrary Gramsci, stressing the role of subjective hypotheses together with experiments, anticipated, at least in part, Kuhn's historical-critical view of science. The paper by A. Venezia opposes Heinsenberg's matrix mechanics to Schrodinger's wave mechanics by assuming that the mathematical equivalence of the two formulations of QM, asserted by many authors, is contradicted by some differences existing also between the subsequent reformulations of the two perspectives (apart from the derivation of Schrodinger equation from Heisenberg's commutation rules made by Weyl using mathematical tools, as group theory, alternative to differential equations). The differences are both mathematical and logical, but the latter are more important. Indeed, the quantum logic (QL) which is more linked to Schrodinger's wave mechanics, because of its axiomatic organization and continuity, is the Birkhoff and von Neumann projectors algebra. Instead the QL which is

17

more linked to Heinseberg's matrix mechanics is an intuitionistic QL which is irreducible to the previous one, according to the author, since it rejects the law of the excluded third. In order to stress his point, Venezia proposes to derive mathematics from logic, and analyzes some of Heisenberg's reasonings on indetermination principle and commutation rules as a QL founded on the refusal of the law of excluded third. This logic is, in his opinion, more appropriate to QM than Birkhoff and von Neumann's QL. Finally, we insert at the end, out of alphabetical order, an article by F. Logiurato and other authors, which cannot be included in the above classification since it has mainly didactical purposes. Indeed, it describes a simple experimental apparatus which allows one to observe light diffraction and interference in the space and not only on a two-dimensional screen. Its aim is to illustrate the ondulatory aspects of light in the double-slit experiment used in many textbooks in order to introduce the wave-particle dualism and the Bohr complementarity principle. The pictures that have been obtained are suggestive and we have chosen them to close this book. We have thus completed the presentation of the papers collected in this volume. We would like to conclude by thanking all the Institutions that have sponsored the Conference and granted financial support to the publication of these proceedings. In particular, the Universities of Bononia, Lecce and Urbino, the Commune of Cesena, the Physics Departments of the Universities of Bononia and Lecce, the Cesena Interuniversitary Research Center on Philosophy and Foundations of Physics. Claudio Garola Arcangelo Rossi Sandro Sozzo

IF BERTLMANN HAD THREE FEET ALEXANDER AFRIAT Dipartimento di Filosofia, Universita di Urbino, via Saffl 9 1-61029 Urbino It is argued that perfect quantum correlations cannot be due to additive conservation. Dr. Bertlmann likes to wear two socks of different colours. Which colour he will have on a given foot on a given day is quite unpredictable. But when you see that the first sock is pink you can already be sure that the second sock will not be pink. Observation of the first, and experience of Bertlmann, gives immediate information about the second. [1] Most interesting features of quantum mechanics have to do with coherence (in other words with interference, with phase), which will not, however, be at issue here at all. Coherence is brought out with respect to different bases, but here the same (product) basis is adhered to throughout. It is often claimed that conservation accounts for quantum correlations (by which perfect quantum correlations will be meant). The underlying intuition is well expressed by Bertlmann's socks, or by the fact that the distribution of wine over two glasses can be worked out—provided one knows the total amount in both—by a measurement on one of them. Or consider a conservative classical Hamiltonian H =T + V(q), where T is kinetic energy and the potential V depends only on position. Conservation means that exchanges of kinetic and potential energy along a trajectory satisfy H0 = T + V, where H0 is the total energy of the motion. Kinetic energy will then be a function only of position, so that at any stage of the motion T(q) = H0 — V(q) can be deduced from the potential; the two energies are perfectly correlated. Or take two free classical particles, each one subject only to the influence of the other, with initial momenta p0 and p'0. Even if they collide their total momentum will remain TT = p0 +PQ', the momentum p' = -n — p of the primed particle can always be derived from the momentum p of the other. Such instances of additive conservation are paradigmatic. Quantum correlations are similar, especially at a given instant, and with only two subsystems; but they have nothing to do with conservation. When the contrary is claimed it seems that additive conservation is meant; but that can be broken up into two logically independent parts: 1. conservation; and 2. an

18

19 'additivity' condition, presently to be defined and denoted (A). Quantum correlations can have nothing to do with time, which has everything to do with conservation; so what is fundamentally at issue is additivity. I will argue that an additivity condition can be constructed to account for quantum correlations with two subsystems, but only with two; where there are more, quantum correlations are too strong to be explained by additivity. An explanation that only works in a restricted special case should be viewed as no explanation at all; so quantum correlations have nothing to do with additivity. Take three socks (on an equal number of feet) rather than two: once the pink sock is found on one foot, we know the remaining socks are on the other feet, but we cannot infer where the blue one is. With three glasses a measurement on one glass only tells us how much wine is in the other two together, not how much is in the third. Triorthogonal decompositions appear to go beyond the knowledge available in the above cases, and indeed to tell us where the blue sock is, or how much wine is in the third glass. Consider the triorthogonal decomposition*

W = £Ue« = w,®w2®w3,

(i)

m

where the Hilbert spaces Tir =span{|a, r ),|aij),...} ("span" denotes the closed span) have the same dimensionality, and {a\ | a;r) = 8tj (r = 1,2,3). The statef | &)

determines

k U H O H ^ )

a

trijective

or

one-to-one-to-one

correlating the bases

flo^)},

2

{\a J},

correspondence and

{|o£)},

m = l,2,.... To make the correspondence observable and give rise to correlations, we can construct the self-adjoint operator A = A'+A2+A3:W->W, where

Al=Al®I®I:H^H A2=I®A2®I:H->H Ai=I®I®A1:H^7i, and the three (maximal) operators Ar have the form

* According to Schmidt's theorem (Schmidt 1907), every vector in the tensor product of two Hilbert spaces can be given a biorthogonal decomposition (see also von Neumann 1932 pp.228-32, and SchrOdinger 1935). But almost no (see Clifton 1994) vectors in the tensor product of three Hilbert spaces admit a triorthogonal decomposition (see also Peres 1995). 1 Which is considered primitive or 'given'; its physical origins—who knows what conditions may have produced it—are not at issue.

20

ZXK>tehwr->w, m

r = 1,2,3. The operator Ar establishes a one-to-one correspondence A,r <-+1 aj"), A[ *->1 aj). A3r <-> | aj),... between eigenvalues and basis vectors, thus extending to the three spectra Ar ={A,r,A2,...} the aforementioned trijective correspondence between the bases (again r = 1,2,3). The discovery of an eigenvalue therefore selects one in both of the other two factor spaces. This will be particularly surprising if we require that Al+A^ + A^=A

(A)

for all m (so that A\&} = A|!^)); for then the entire system possesses an amount A of the physical quantity 21 represented by A, whose exact distribution over all three subsystems would be determined by a measurement on any one of them. We expect this with two subsystems, maybe not with three. Consider the Cartesian product A = A1 xA2 xA3 ={(A\,A 2 2 ,A\)} of the spectra, and the subset A ( A ) ={(A 1 ,,A 2 2 ,A 3 J ):A',+A 2 2 +A 3 3 =A}cA Lv

\A)

m

m

m '

m

m

J

m

satisfying condition (A). The discovery of an eigenvalue A^ (the value s = l,2 or 3 of the superscript is chosen by the experimenter, that of the subscript by nature) will determine a subset A

,

(A;m =n)

={(A1,,A\,A\):Al,+A22+A3]=A;m' = n}cAa), Lx

m

m

m '

m

7

m m

J

{*)

which would be a singleton if there were only two subsystems. The triorthogonal decomposition (1) determines another subset of A(A), namely A(1) = {(A^, A^, A3,)} c A(A). Here the discovery of the same eigenvalue A* would select the triple

We would have A m = A m and A„ , \l)

*•*>

(l;m =n)

= A,, , , with two subsystems, and (A;m =n)

J

only then. This means that the correlations due to the triorthogonal decomposition, being stronger than those due to (A), cannot be attributed to such an additivity condition.

21 The matter can also be seen as follows. A vector |<£) belonging to eigenspaces corresponding to eigenvalues Arr that add up to A will be an eigenvalue of A corresponding to A; in other words conditions ( | a i , 1 > « 1 | ( 8 ) | a ^ ) K . | ® l < 1 > « 3 | ) | * ) = |*) and A^ + A^ + A ^ A together imply that A|
> 1 for the subspace

(A; m =n )

Q

,

(A;m =n )

*

=span{|a',)|a22)|a33):A1,+A2J+A3, = A ; m s = n } A

"

m

f l

m ' '

m '

m

m

m

-*

determined by A*. But with a triorthogonal expansion, the two measurements A and As determine a single product |a£,)|

[email protected]

2') in particular (external) (sharp) Measuring Process ( / Measurement) 3 n = {Ili)

Vi pf = tr (IliW)

Figure 2. PARADIGMS 2 and 2'.

Paradigm 2 concerns "External Processes", in which one collection c splits into a family (d) of collections (here the index set A does not play an important role, because there is a physical separation of the channels and not only a cybernetic one as in Paradigm 1): the process does not only lead to a family (pY) °f numbers, but it even leads to a family (Ti(W) = p^Wi) of states, so that it is described by a countable family T = (Ti) of operations (see Ref. 12). The process is described from a "phenomenological" point of view, that is without reference to any explanatory account. The concept may refer to classical as well as to quantum physics. An explicit reference to this concept within the treatments of foundations seems to appear only in the last decades and not by many authors: this absence has certainly not contributed to the clarification of the basic concepts

28

of theoretical physics. Ludwig has attached importance to this concept in a paper going back to 1961, 9 but he has not emphasized it in his later books. The concept has been specially developed by Polish authors, and it is particularly dealt with in the 1976 book by Edward Brian Davies, 10 under the name of (discrete) instrument. Paradigm 2' concerns the important particularization to "(External) (Sharp) Measuring Processes", in which the outcome probabilities {pf) in the channels are those of the measurement of some decomposition of the identity 77 = (77j). Thus we carefully distinguish a Measurement (Paradigm 1), in which nothing is said about the final states of the measured objects, which may well be destroyed, and an (External) Measuring Process, in which the final states (p^Wi) of the objects in the different channels are specified. We think that the confusion between these two concepts is one of the most important sources of many disasters in the discussion of measurement in quantum theory, but even the confusion with the concept of Theory of Measurement (see the next Paradigm 3) cannot at all be neglected. Paradigm 3 concerns "External Quantum Models", as carefully described by Hellwig and Kraus (1969) n within the Ludwig school. Such models may provide a description of (i) a Measurement as described by Paradigm 1 (ii) an External Process, in particular an (External) Measuring Process, as (phenomenologically) described by Paradigms 2 and 2'. For a given quantum system S the (macroscopic) environment A (Apparatus) is described by a quantum model of that environment: as emphasized at the beginning, the quantum description of a system supposes a macroscopic environment measuring some attributes (77j) of the system, hence here a quantum model of the macroscopic environment A supposes a further macroscopic environment "reading" some attributes (77^) of the model of the environment A . We divide the description in three steps: (1) composition and unitary evolution: after unitary evolution of the system composed by S and A , the two systems are left in some final (partial) states Ws and WA. (2) reading: a first cybernetic involvement, transition from physical to cybernetic reality as in Paradigm 1 applied to A, occurs when a family 77 = (77^) of attributes of A is "read". In fact, when

29

applying Paradigm 1 to A, the index set A which is supposed to label the composite samples, hence the samples of S as well as the samples of A, is partitioned in a family (At) of subsets according to the result of the measurement of (IIA) on A. Thus, as in Paradigm 1, observed frequencies arise, to be identified with the outcome probabilities [pf ) of the measurement, which here may be referred to S as well as to A, because the two system have the same set A of labels.

(Finally measured) External Quantum Model (HA,\J, IIA, WA) of the environment of some system described in (H , Y^JJS) - in t h e M e a s u r e m e n t of some 7 7 5 = (Ilf) w i t h result (p^) - in an E x t e r n a l Process described by T = (Tf)

= (tr (Ilf

Ws))

step (1) composition and free evolution of the System S and the Environment A step (2) reading on S step (3) conditioning on A Further Environment —> WA \

<— Environment: -* ___

ws®wA=-w —• w=uwu*

/ Ws

(1) <- System: -> Vi

(nA)

i -

N

1

W /trs

|(2) |A-(A;)|-

tr^ s~s Ws= £ P

Tf (Ws) =PYSWf

*(3)

=tiA((Is®

nf)W)

— A

- Actually the model also induces a final state W (before reading) through W =tr'sW. „,

_ . Ts Tb

for the environment

describes an External Process allows an External Quantum Model with pure initial state

- Corollary: As a model of Measurement induces

W.

(Theory of Measurement) the model o

A

- the prescribed probabilities through (p^ ) = (tr (Z7V* W )) - a possibly prescribed (External) Measuring Process through

Figure 3. PARADIGM 3. The pointed downarrows express an interaction between physical and cybernetic reality.

30

(3) conditioning: a second (reverse) cybernetic involvement, transition from cybernetic to physical reality, is, as a consequence of reading, partitioning the collection of the samples of S, which was left in the state Ws, in subcollections, whose states (Ti(W) = pY Wf) are the final states in the different channel of a suitable external process as described by Paradigm 2 (this partitioning is called by Ludwig a demixture). In this way the action on S of "reading" on A is by no means physical on the samples of S, but cybernetic on their collection, which is split in subcollections: thus one state evolves in a family of states. As proved by Hellwig and Kraus, any external process, which has to be suitably and carefully defined, allows this kind of quantum model (even under request of a pure initial state WA of the external system). Here a very important fact is that any such model also provides a final ("before reading") state WA of A. Of course it is also important to emphasize that, according to Paradigm 1 applied to the "reading" of (IJA) on A, no statement about the final states of A in the channels "after reading" is provided. (A statement on these states of A would require a quantum model of the "further environment" — hence a further-further environment also, measuring the further environment — and, in agreement with the above statement of Hellwig and Kraus, any prescribed family of final states in the channels of A after reading would allow such a quantum model of the further environment.) The Theory which provide:

of Measurement

concerns important particularizations

(i) Quantum Models of Measurement (Paradigm 1): actually, according to the above statement of Hellwig and Kraus, for any given Measurement, as considered in Paradigm. 1 there always exist plenty of quantum models of the environment providing the same prescribed outcome probabilities (j>Y ) . (ii) Quantum Models of Measuring Processes (Paradigm 2'): actually any Model of Measurement, as considered within item (i), necessarily also describes a Measuring Process (Paradigm 2'), that is provides final states (pf Wf) of the given System S in the channels, which may even be arbitrarily prescribed, due again to the above statement of Hellwig and Kraus.

31 2. Semimacroscopic and Macroscopic Interpretability of Quantum Component Systems We display here a quite preliminary account of the subject. In particular the choice of some concepts and terms may certainly be improved.

2.1. Quantum

Component

Systems

With reference to Paradigm 3, the question of macroscopic interpretability of quantum component systems already appears after step (1) above, which (for any state Ws of the given system S ) provides (independently of the final measurement of (11^)), the final "before reading" state WA of the external system A. Thus here the relevant structure is the pair {HA,WA)

with

VV"4 = {WA\WS

In general we propose as appropriate ponent system a pair (H, W)

structure

with H a Hilbert space and

G

W{HS)}.

for a quantum

com-

W C W(H).

Thus we give the next definition. Definition 1. Let H be a Hilbert space and W C W(H) a subset of the set of the density operators in H (that is V W G W W > 0, t r W = 1). We say that the pair (H,W) is an Hilbert space with states. The purpose of this Section is to propose an answer to the question: what may it mean macroscopic interpretability of a quantum component system described by (H, W) ? (Actually a weaker concept of semimacroscopic interpretability is also introduced). The answer has to consist in definitions, not in theorems: it is a choice. The propositions that are produced here relate the choice to different concepts and may only emphasize the opportunity of the choice. The interest of the choice is further emphasized by the application in the unpublished Theorems reported in Section 3. As the very question involves both Boolean (macroscopic) and quantum structures, the appropriate physico-mathematical framework appears to be the Jauch-Piron axiomatization.

32

2.2. Hilbert Lattice and States of Physical Systems

Structures

Definition 2. For a given finite or countable set T we call Boolean system with phase space V a system whose attributes are described by the Boolean lattice Lr generated by T and whose set of states W r consists of the probabilities p = (pi)ier on r . We call Si the probability concentrated on i £ T, so that Vp e W r P = YliPrfiThe next lemma collects some useful statements which may easily be derived within the Jauch-Piron framework. Lemma 1. Let a physical system be described within a Hilbert space H. Let II = (i7,)jGr he an orthofamily of projectors (unnecessarily a decomposition of the identity, it may well be ] [ \ /7,- ^ 1H) Then (with V{-) and W(-) introduced at the beginning of Section 1 the following conditions are equivalent: (Al)

the system is described by the family IIH = (IIiH)ier °f "superselected" subspaces; (A2) the lattice of the attributes is Ln = \Ji&vV{niH) (Def. of LJJ); (A3) the set of the states is Wn = 5 o l J i e r W ( i I i f f ) = {W e W(H) | W = '^li LTiWIIi}, that is any state is a mixture of states described in the IliH's (with co convex hull, co its closure) (Def. of Wn),' (A4) the set of the vectors representing the pure states is [Ji LTiH. Let these conditions be satisfied. Then the next conditions are equivalent: (Bl) the Ili's are one-dimensional, (B2) with reference to Def. 2, the an identification Ly = LJJ; (B3) with reference to Def. 2, the an identification W r = W77

so that WiGT WiJiH = {Ili}; mapping i £ V —> Ili £ II enables mapping i £ F —> Ili £ II enables (through YliPi^i — > YliPi^i)-

2.3. Semimacroscopic Interpretability of Quantum Component Systems The concept is introduced by the next easy proposition. Proposition 1. Let (H,W) be a Hilbert space with states (Def 1) and let n = (77i)igr be an orthofamily of projectors. Then the following conditions are equivalent, and when satisfied we say that n expresses a semiboolean (semimacroscopic) structure for (H, W ) :

33

(1) W C Wn = c o U i e r % H = {W e W(H) | W = £ , 7 7 ^ 7 7 ; } , i/iot is any state is a mixture of states described in the ntH 's (or: W "is objectified" by II) (informally: no Schrodinger cat occurs); (2) W is reduced by II; (3) [W, 77] = 0 (i.e. WWG W Vi G r [W, 77<] = 0). 2.4. Macroscopic of Quantum

Interpretability Component Systems

We have now to introduce the concept of Boolean (macroscopic) structure of a quantum component system: it is a particularization of the concept of semiboolean (semimacroscopic) structure, which is not so immediate. The next technical definition enables simplifying the notation. Definition 3. Let H be a Hilbert space, K a Hilbert subspace. Let CK, CJJ be the spaces of the linear operators in the respective Hilbert spaces K C H. Let W G CK- Then we consider as an identification the injective mapping L : CK —> £H '• W —• iW = W © 0\(H-K) (whose inverse is t'1 : W —• W\K). We need the following definition. Definition 4. Let (H,W) be a Hilbert space with states (Def. 1) and let n = (/7j)ier be an orthofamily of projectors. At least when H expresses a semiboolean structure for (H, W) (Prop. 1), we define

re = {i G r | p™ ? {o}} = {i G r | tr (77, w) jt {o}} = {i G r | ntw ± {o}} as the index subset Fe C F effective for (H, W) and we define 77e < /7"|r« (i.e. Vi G T e nze < Ui) as the "part" of 77 effective for (77, W), through neH=

Y, I m ^ 7 7 | r » = (7777- f] K e r 7 7 | r e ^ ) , wew wew

so that (with the identifications of Definition 3) the following conditions are equivalent: 77 expresses a semiboolean (semimacroscopic) structure for (77, W); 77e expresses a semiboolean (semimacroscopic) structure for (77, W). In analogy to the above Proposition 1, we may now introduce the concept of Boolean (macroscopic) structure for a quantum component system by means of the next Proposition 2.

34

Proposition 2. Let (H,W) be a Hilbert space with states (Def. 1) and let II = (/Zj)jer express a semiboolean structure (Prop. 1). Then, with reference to Def. 4 and with the identifications in the last part of Lemma 1 and in Def. 3, the following conditions are equivalent, and when satisfied we say that actually II expresses a Boolean (macroscopic) structure for (H,W): (1) the "part" LTe of II effective for (H,W) consists of one-dimensional projectors, that is the non-zero part of the reduction of (H, W) may be identified to a reduction to one-dimensional components (by IIe); (2) with reference to Def. 2 the mapping i e T e —> IIei € LTe enables an identification Lpe = L[je; (3) with reference to Def 2 the mapping i £ P —> 77ej £ IIe enables an identification Wr e = VV/7e/ (4) with reference to Def. 2 the mapping i S Te —• LTei € IIe induces the inclusion W C Wr= • 2.5. Semimacroscopicity and Macroscopicity of External Quantum Models of the Environment We may now answer the question posed at the beginning of this Section 2, by giving the next Definition 5. Definition 5. For a given quantum system described in Hs let (HA, U, nA, WA) (with nA = (nAi)) be an External Quantum Model (the definition is implicit in Fig. 3 of Sec. 1 and in its explanation). Then, with WA= {WA= trs(V(Ws® WA)V*) | Ws G WHS} and with reference to Propositions 1 and 2 respectively, we say that: • (final) (quantum) component system semibooleanity (semimacroscopicity, objectivity) is satisfied (by the External Quantum Model) whenever LTA expresses a semiboolean structure for (HA,W ); • (final) (quantum) component system Booleanity (macroscopicity) is satisfied (by the External Quantum Model) whenever LTA expresses a Boolean structure for (HA,\V ) , that is, equivalently, the mapping i —> IIAei — an identification — an identification — an inclusion

(troughp^

enables

e

Lr = LJJAH, Wr« = WIJAC, W C Wr»

= ZiPFSi

~^WA

=

^pf*UA%).

35 We may give a more concrete form to the concept of macroscopic interpretability of a quantum model of the environment by introducing, through the next definition, the concept of effective (final minimal) Boolean Model of the environment, as the appropriate model of the place where the single events which are registered by experimental physicists as well as the events of ordinary life occur. Definition 6. For a given quantum system undergoing a (discrete) Measurement or External Process (see Sec. 1, Paradigms 1 and 2) with index set T, we call effective (final minimal) Boolean Model of the environment the Boolean system (Def. 2) that has phase space

re = {ier|#^o}cr. Then we may reexpress the second part of Def. 5 by means of the next statement. Corollary 1. For a given quantum system described in Hs undergoing a (discrete) Measurement or External Process with index set T, an External Quantum Model {HA,U,IIA,WA) (with UA = (IIAt)) of the environment satisfies (quantum) component system Booleanity (macroscopicity) (Def. 5) if and only if the mapping i —> IIAei enables an identification (of Boolean lattices, Lpe = LnAC, equivalently Wr« = WnAe) for the lattices of the attributes • Lp« of its effective (final minimal) Boolean Model and • LnAe of its "effective final" External Quantum Model

(HA,u,nAe,wA).

2.6. Comments

on this Section

2

Coming back to the questions posed at the beginning of this paper, at this point we have given an answer to the question: what does it mean macroscopic interpretability of a quantum component system? Yet, we have not examined to what extent this requirement may be satisfied, neither have we approached the question: does universality of quantum theory require that macroscopic interpretability of a quantum component system be an environmental (model independent) property? A contribution to the treatment of these problems is provided by the Theorems of the last Section 3.

36

3. Two Environmental (model independent) theorems (Camerino 1988) 3.1. Content

of this

Section

The main content of the theorems of this Section 3 had been produced at the first of these Meetings, Camerino 1988,* and also reported in some later Meetings (see Refs. 2 and 3). Section 2 here, whose necessary introduction is Section 1, in turn provides an appropriate introduction to these theorems: the previous lack of such an introduction may be one reason why they have not yet been published. We refer here to an "External Process", which may be in particular an "(External) Measuring Process", as described by a family T = (Ti) of operations in agreement with Paradigms 2 and 2' of Section 1. Each of the next Definitions 7 and 8 provides equivalent conditions under which we say that the environment of T satisfies semibooleanity (semimacroscopicity), respectively Booleanity (macroscopicity); Theorems 1 and 2 provide necessary and sufficient conditions on the operations performed by T on the given system, in order that the environment of T should satisfy semimacroscopicity, respectively macroscopicity. Concerning the proofs, which are not reported here, we only remark that the proofs of the implications (b) => (a) concerning the conditions in each of the definitions are more complex than the proofs of the theorems, especially for Definition 8 (as it uses some non-immediate statements which may be found in a previous paper by the author, together with another author 13 ). 3.2. A Condition

for

Semimacroscopicity

Definition 7. Let a given quantum system described in H undergo an external process described by T = (Tj)i e r. We say that the environment of T satisfies semibooleanity (semimacroscopicity, ojectivity) whenever, equivalent^, (a) all external quantum models (H^,\3, LT^,WA) ofT (with any initial state W^) satisfy component system semibooleanity, (b) some external quantum model (HA,U,II'A,WA) of T with pure initial state WA satisfies component system semibooleanity. Theorem 1. Let a given quantum system described in H undergo an external process described by T — (Ti)j e r- Then the following conditions are equivalent:

37

(1) T has orthogonal images, that is

(

y Im Ti(W) I is an orthogonal family of sub spaces of H; wew /i€r

(2) the environment ofT satisfies semibooleanity. (In particular the conditions are satisfied if T is unitarily equivalent to a measuring process of the first kind, that is if T — VT°V*, with V unitary operator and T° = (T?)ier such that Vi Im T?(W) C n{H with (i7») maximal family of orthogonal projectors). 3.3. A Condition

for

Macroscopicity

Definition 8. Let a given quantum system described in H undergo an external process described by T = (Tj)i £ r- We say that the environment of T satisfies Booleanity (macroscopicity) whenever, equivalently, (a) all external quantum models (H^,\J, IJ^, W^) ofT (with any initial state WA) satisfy component system Booleanity, (b) some external quantum model (HA,XJ, nA, W^) ofT with pure initial state WA satisfies component system Booleanity. Theorem 2. Let a given quantum system described in H undergo an external process described byT= (Tj)j £ r- Then in the case of finite-dimensional H the following conditions are equivalent, the first one implying the second one in any case: (1) T is unitarily equivalent to a perfect measuring process, that is T = — Tn = (T/I)i^r

VTnV*,

perfect measuring process

(i.e. Vier WeW Tf{W) = niwni II = (/7j)jgr decomposition of the identity in H) — V unitary operator in H; (2) the environment of T satisfies Booleanity. 3.4. Comments

on the

Theorems

In these Theorems, specially in Theorem 2, necessity (which is easier to be proved and does not depend on the dimensionality of H) appears to provide a satisfactory support to the aim at Universality of Physics outlined at the beginning of this contribution.

38

Indeed the claim to Universality specially refers to Measurement, as described by Paradigm 1 of Section 1, where no final states of the given system are prescribed, and requires that the boundary of the given system with the environment, even if not suppressed, may yet in any case be shifted. Now, as a consequence of the statement reported at the end of Paradigm 3, it is always possible to construct an external quantum model with pure initial state WA for the (External) Measuring Process with any prescription of final states for the given system after measurement (i.e. of

T = (r4)). Therefore, according to the above Theorems 1 and 2, by choosing a Measuring Process whose final states for the given system after measurement satisfy the conditions (1) of these Theorems, quantum models of Measurement satisfying final component system semibooleanity (semimacroscopicity, objectivity), or even Booleanity (macroscopicity) are always obtained. This means that models of Measurement which exclude superpositions of alive and dead cats always exist, provided one accepts the basic premise that has been stated at the beginning of this contribution, that Physics is not World-encompassing, that in particular a Theory of Measurement, like the theory of any External Process, requires a "further measuring external system" which measures the (external) measuring system. Thus the aim at Universality of Physics in a more general, but weaker form turns out to be met by the statement that for any Measurement there exist External Quantum Models satisfying the requirement of macroscopic interpretability. A stronger form of the aim at Universality of Physics arises by considering that a truly physical requirement concerning semimacroscopicity (analogously for macroscopicity) of the environment of some external process demands that any possible experiment on the environment should exclude superpositions. This requirement in its turn implies a quantum structure, that is a quantum model of the environment; as there exists plenty of such models, everyone providing the possibility of a lot of different experiments, a truly physical requirement needs to be satisfied in any model of the environment: then we may say that the environment of T satisfies semimacroscopicity (likewise for macroscopicity). Such requirements may appear very strict: still, according to the deeper statements expressed within Defs. 7 and 8, they are satisfied by the External Processes that fulfil the conditions (1) of the above theorems: thus the aim at Universality of Physics in a more particular, but stronger form turns

39 out to be met by the statement t h a t for the External (in particular Measuring) Processes that satisfy the conditions (1) of the above theorems, semimacroscopicity (respectively macroscopicity) is an environmental (model independent) property. T h e latter statement provides an answer to the second question posed a t the beginning of this contribution. Let us still briefly consider sufficiency in the above theorems. T h e object of the theorems being an External Process described by a family T — (Ti) of operations, it t u r n s out t h a t the requirements expressing semimacroscopic or macroscopic interpretability of the environment, which is the "instrument" in Davies's terminology, considerably restrict the family T of operations the instrument may produce. In particular semimacroscopicity of the "instrument" implies that the final states of the given system have to lay in orthogonal subspaces; true macroscopicity of the "instrument" even implies, under the assumption of finite-dimensional Hilbert space, that the process has to be unitarily equivalent to a perfect non-destructive measurement performed on the given system. References 1. R. Ascoli, in I Fondamenti della Meccanica Quantistica, Analisi storica e Problemi aperti, Camerino 1988, G. Cattaneo and A. Rossi eds. (Editel, Commenda di Rende, 1991). 2. R. Ascoli, poster at the Biannual Meeting of the I.Q.S.A., Castiglioncello, 1992. 3. R. Ascoli, in Symposium on the Foundations of Modern Physics 1993, P. Bush et al. eds. (World Scientific, Singapore, 1993). 4. R. Haag, in The Physicist's Conception of Nature, J. Mehra ed. (Reidel, Dordrecht, 1973). 5. C. Piron, Foundations of Quantum Physics (Benjamin, Reading, MA, 1976). 6. G. Ludwig, An Axiomatic Basis for Quantum Mechanics, Vol 1 (Springer, Berlin, 1987). 7. J. Von Neumann, Mathematical Foundations of Quantum Mechanics (Princeton University Press, Princeton, New Jersey, 1955). 8. N. Bohr, Phys. Rev. 48, 696 (1935). 9. G. Ludwig, in W.Heisenberg und die Physik unserer Zeit, F. Bopp ed. (Braunschweig, 1961). 10. E. B. Davies, Quantum Theory of Open Systems (Academic Press, London, 1976). 11. K. E. Hellwig and K. Kraus, Comm. Math. Phys. 11, 214 (1969). 12. K. Kraus, Lecture Notes in Physics, Vol 190 (Springer, Berlin, 1983). 13. R. Ascoli and R. Urigu, Int. J. Theor. Phys. 36, 1691 (1997).

PREMEASUREMENT VERSUS MEASUREMENT: A BASIC F O R M OF C O M P L E M E N T A R I T Y

G.AULETTA* Gregorian University, Rome, Institute of Philosophy, University

and of Urbino

G. T A R O Z Z I t Institute

of Philosophy,

University

of

Urbino

Traditionally, three forms of duality were found in quantum mechanics: between the wave—like and the corpuscular behaviour, between unitary dynamics and measurement, and between locality and non-locality. We show that connections between these three forms exist. Moreover, we point out that the fundamental duality of quantum mechanics, to which all the other ones can be led, is that between measurement and premeasurement, and develop a formalism in terms either of projection operators or unitary transformations that can be used for both complementary features.

1. Introduction Three forms of duality have given rise historically to the main conceptual problems of quantum theory. The first one, is the duality between two classically incompatible descriptions, the wave—like and the corpuscular behaviour, which represents the empirical basis of the complementarity principle. The second one is the coexistence between two mutually exclusive forms of dynamics, the one, deterministic and reversible, ruled by the Schrodinger equation and the other, probabilistic and irreversible, consisting in a random jump to a given measurement result. This second form of duality was seen as an inexplicable problem, especially because one could not find a common form and a conceptual framework for these two types of dynamics, generating the most controversial problem of quantum mechanics. The third duality is a consequence of the quantum-mechanical *E-mail: [email protected]. tE-mail: [email protected].

40

41

Figure 1.

Relationships between the different forms of quantum dualities.

treatment of correlated systems. We have on the one hand the empirical result of a measurement, which, due to its intrinsically local nature, seems only to depend on the local experimental context that we have chosen here and now, and, on the other hand, the theoretical description of quantum correlations strongly suggests that this result could have been determined by a previous measurement on another distant system that was entangled with the system we are dealing with. It appeared very early an evident connection between the wave-particle duality and the measurement problem. Indeed, measurement was seen as the procedure through which the corpuscular behaviour could manifest itself, and, on the other hand, one considered the unitary evolution ruled by the Schrodinger equation as expressing the changes in time of the superposed and wave-like behaving quantum system. Moreover, some authors 1 saw in the possible attribution of a form of reality to the wave function an opportunity to overcome both the duality wave-like/corpuscular behaviour and the reduction postulate of von Neumann.

42

In a recent paper 2 we have shown that a deep interconnection exists also between entanglement and wave-like behaviour in a complementarity experiment, so that the wave/particle duality and the duality between non-local correlations and local interactions can be reduced to a common conceptual root. Here, we propose the following three results. (1) We show that the the dynamical duality can be reduced to the more fundamental complementarity between premeasurement and measurement, and (2) that there is also a further form of connection between this duality and the one in the EPR problem. This paper will indeed point out the fact, which, in our opinion, has not been sufficiently underlined in the literature, that preparation and entanglement are two strictly related concepts, as well as, on the other hand, measurement and local interactions. Finally, (3) we stress that any other form of the above dualities can be led to the latter, in particular taking into account that in this way we can use the same basic formalism for describing all these different forms of complementarity. 2. T h e Use of P r o j e c t i o n O p e r a t o r s The interconnection between measurement and entanglement can be shown by the fact that both operations can be formalized by using projection operators. Suppose for instance that two 2-level systems, say systems 1 and 2, are initially in the factorizable state I * ) = ^ ( | 0 ) 1 | 0 ) 2 + |1}1|1)2 + |0)1|1)2 + |1)1|0)2),

(1)

or in short |*>=i(|00>+|ll>+|01)+|10».

(2)

Furthermore, let us consider the entangled state |M>E)=i=(|00>+|ll)).

(3)

Then, the projection operator PE =

\*E)(*E\

= \ (|00> <00| + 111) (111 + |00 > (111 + |11 > (00|)

(4)

applied to 1\Er) gives the (unnormalized) entangled state

£a|*>=i(|00)+|ll» OC|#E>.

(5)

43

In the case of a projection measurement, suppose that the initial state of the system is | ^E) • Then it suffices to apply the projection operator P0 = 100) (001 on this state in order to obtain the bit 100): PO|*B)

= - ) = 100) (001 (100)+111)) = ^|00).

(6)

However, 100) can be written as |00>=JV(|tt B > + ! * £ ) ) ,

(7)

|^)=^(|00)-|11»,

(8)

where

and N is a, normalization constant. This is interesting to the extent to which it clearly shows that a measurement result can be considered, from a certain point of view, as a superposition of entangled states. This points out that, from a global point of view, measurement results have the same form of superpositions of entangled states, whereas locally a measurement result, being the information that we have actually received, represents a random selection among the eigenstates potentially contained in the initial entanglement. 3 This already shows the deep link between measurement/premeasurement and local events/non-local correlations. Now, it suffices to apply again the projector PE to 100) in order to recover the initial state

p B |oo) oc|* B >(*a|(1*1*)+1*£» = \*E).

(9)

One of us has shown in a recent paper that entanglement and measurement are the only two allowed operations on a qubit, 4 what seems to be confirmed by Zeilinger and coworkers, who have opened a new path for understanding and implementing quantum computation by combining these two methods: 5 both entanglement and measurement concur to one—way quantum computation.

44

3. The Use of Unitary Operators We can also use unitary operators for describing both the entanglement process and the measurement. Let us introduce the following basis:

100) =

H0 0

w

, |oi) =

H0 1

, 110) =

(°\1

, 111) =

0

(10)

0

\o)

\o)

Ho W

It is well known that a factorized input state

I*)

1

V2

(|0>1 + |1> 1 )|0> :

(11)

which can describe the initial situation in which a system in superposition and an apparatus in some default state are still uncoupled, can be transformed in the entangled state \^E) by using a simple transformation that is called controlled not (CNOT), which is such that V„

'V5

|00)+|10»

V2

(12)

(|00)+|11».

Eq. (12) can be rewritten in matrix form as follows

0 100 1000 0010 V2 0001

0

w

0 0

Obviously, the inverse operation ^ N O T , which is equal to Uc to the state | ^E) gives our initial factorized state, that is 0 100 1000 00 10 0001

V2

0 0

1 0

(13)

r,

applied

(14)

W

Suppose, on the other hand, that we have the initial entangled state | ^E) , and wish to recover the state

100) =

(15)

45

which is a typical measurement result. In this case, we apply a "measurement" unitary operator such that ^MI*J3>=l

(16)

00) )

;ten as 1

71

"10 0 - 1 " 0 1-10 01 1 0 .10 0 1 .

1

71

0 0

0 0

(17)

w

It is easy to show that the inverse operation, given by U^, applied to the state 100) gives | ^ B ) , too. In fact,

71

1 0 01" 0 110 0-110 - 1 0 0 1.

0 0

\lj

1

~7i

0 0

(18)

w

4. Premeasurement, Measurement, and Information Given two systems in a factorized state, it is always possible to entangle them. Any premeasurement must in general provide the necessary coupling between the object system and the apparatus, and this coupling, in quantum-mechanical terms, is an entangled state. However, this situation is deeply different from a measurement. In fact, by measuring we expect an answer from the system to a given question. Premeasurements can be conceived to a certain extent as injunctions, measurements are questions. For this reason, no unitary evolution will ever be able to account for an abrupt change of the state vector from an arbitrary superposition (or entanglement) to one of its components. Of course, given a certain superposition state, it is always possible to build an unitary operator that brings it to one of its components — and we have provided an example. However, this assumes that one already knows a priori which superposition the system is in, i.e. that one already knows the initial state. In other words, there is no way to find a unitary transformation which provides the observer with the information that represents the final outcome of the measurement process. In order to clearly understand this conclusion, we can try to apply the CNOT transformation to the other possible factorized state which can

46

describe the initial situation in which a system in superposition and an apparatus in some default state are still uncoupled, that is to the factorizable state |*')=-^=(|01>+|11».

(19)

In this case, we have:

l "0 1 0 0' 1000 1 0 0 0 1 0 V2\ 1 . 0 0 0 1.

f)

w

1

1 1

~V2

(20)

w

that is, we obtain the entangled state

1 ,. *'£)=-^(|01)+|10)).

(21)

However, if we apply the "measurement" transformation to this entangled state, we obtain 1

7i

"10 0 - 1 " 0 1-10 011 0 .10 0 1 .

1 f°\ 1

I

/o\ 0 1

w w

(22)

that is, we obtain the state 101), which is a "lying" measurement, because the state of the object system and the state of the apparatus would be anticorrelated. On the other hand, if have the same initial superposition (3) and we want to obtain the result |11), instead of 100), we need to introduce another unitary operator, as follows 1

71

'10 0 01 .10

0 1" 1-10 1 0 0 -1.

1

7!

f1} 0 0 \l)

0 0

(23)

w

The previous analysis stresses that we must every time build a specific unitary operator in order to formally express what happens in a measurement, whereas this is not the case for entanglement implementing. 5. Conclusions This suggests an interesting way to consider quantum mechanics. The duality premeasurement/measurement can be seen as a duality between what

47

follows from the general laws of quantum mechanics and what does not. 3 Entanglement does follow, measurement events do not. This is the reason why any measurement result can be considered also, from a certain point of view, a superposition or entangled state (see Eq. (7)). What follows from the general laws of quantum mechanics, is the global unitary evolution of the entangled object system plus apparatus (plus environment). Instead, what does not follow is a random local jump by means of an information selection. This brings to our main conclusion: the duality between non-local correlations and local interactions can be led to complementarity between premeasurement and measurement. Since, as we have pointed out in the Introduction, (1) a connection between the dual dynamics of quantum systems and the wave/particle duality has already been established, as well as (2) a connection between the latter duality and EPR correlation/destruction of EPR correlation, and (3) we have shown here the existence of a link between the duality premeasurement/measurement and entanglement/local reduction, we can conclude that all forms of duality in quantum mechanics can be considered as different expressions of a unique fundamental complementarity. Moreover, since the complementarity between premeasurement and measurement allows one to use a single and relative simple formalism, and also conceptually is very enlighting about the relationships between entangled states and local interaction, we may take this form of complementarity as the basic one. References 1. G. Tarozzi, in Open Questions in Quantum Physics, G. Tarozzi and A. van der Merwe eds. (Dordrecht, Reidel, 1985). 2. G. Auletta and G. Tarozzi, Foundations of Physics Letters 17, 889 (2004). 3. G. Auletta, in Proceedings of the I Workshop on the Relationships Between Science and Philosophy, G. Auletta and M. Leclerc eds. (Rome, Pontifical Gregorian University Press). 4. G. Auletta, Foundations of Physics 35, 787 (2005). 5. P. Walther, K. J. Resch, T. Rudolph, E. Schenck, H. Weinfurter, V. Vedral, M. Aspelmeyer, and A. Zeilinger, Nature 434, 169 (2005).

R E M A R K S O N CONDITIONING*

E. G. B E L T R A M E T T I Department of Physics, University of Genoa and Istituto Nazionale di Fisica Nucleare, Sezione di Genova E-mail: [email protected]

The paper deals with the notion of conditional probabilities in statistical physical theories. Particular attention will be paid to the notion of conditional probability in a generalization of the standard probability theory, to be called operational probability theory. Such a generalization is based on an enlargement of the usual class of random variables, so encompassing indeteterministic features. The connection to the quantum frame rests on the fact that the operational probability theory hosts an extension of quantum mechanics.

1. Introduction The notion of conditional probabilities, and the one of correlations, play a crucial role in any physical statistical theory: in this paper we compare classical and nonclassical frameworks focusing attention on the related notion of joint measurement of two or more observables in some state of the physical system. In the next section we recall the main features of states and observables in the so-called convexity models which are general enough to encompass both the classical and the quantum frames: the former adopts a convex set of states which is a simplex and a family of observables having a deterministic nature, while the latter adopts a convex set of states which is not a simplex and the observables are not deterministic. Within the convexity models it appears interesting to focus attention on a frame which preserves the classical simplex structure of the set of states but allows observables that need not have a deterministic nature. This frame, to be called opera*The origin of this paper is in a draft in collaboration with S. Bugajski, conceived as a first step toward a joint paper. After the death of S. Bugajski (March 2003) the present author came back to that draft: he hopes that his dear friend should have agreed on this version of the paper.

48

49

tional probability theory (OPT for short), can be traced back to [1] and has been studied in a number of papers [2-5]: it will be summarized in Section 2. It is known that such a frame hosts a canonical extension of quantum mechanics [2], The classical standard case will be reviewed in Section 3 where a number of properties characterizing the conditional probability will be reviewed. In Section 4 we examine to what extent these properties hold true in the OPT frame. Some typical quantum features will emerge, and we will be faced with a potential plurality of conditional probabilities: a fact that mirrors the non uniqueness of the way in which a joint measurement of two observables can be performed. The standard quantum frame is considered in Section 5: the popular choice of the conditional probability goes back to the so-called Luders-von Neumann recipe which rests on a strong, not always realistic, idealization of the measurement process.

2. Convexity models By convexity model we understand a quite general frame which rests mainly on the convex structure of set of states: this idea is reminiscent of the old approach proposed by Ludwig and his school [6]. Let S be the set formed by the states of the physical system under discussion: a minimal requirement for S is the convexity, which translates the basic physical operation of forming mixtures of states. A state is called pure when it cannot be expressed as a (nontrivial) mixture: the pure states are thus extreme elements of the convex set S and we write fi for the set they form. It seems to be a general property of physical theories the fact that the nonpure states can be expressed as convex combinations of pure states: we mirror this fact by assuming that S is the convex hull of f2. The intuitive idea of an observable corresponds to specifying, for every state of the physical system, the probability distribution of its outcomes on some measurable space, say (E,B(E)), where B(E) denotes the Boolean algebra of the measurable subsets of E (in the sequel we shall just write S for this measurable space). This can be formalized by denning an observable A as an affine map A : S -t M1+(H) where Mf(E) denotes the family of the probability measures on S. Such a map is uniquely specified by its restriction to f2; notice, however, that in this general context, where determinism is not assumed, an observable need not map a pure state into a Dirac measure on the outcome space.

50

Given A : S -> Mf(E) and a measurable subset X of 5, the quantity (Aa)(X), thought of as a function of a £ S, determines an affine function S -»• [0,1] which is called an effect, and denoted by the pair (A,X). Also this function is uniquely specified by its restriction to fi, hence by a function of fi into the segment [0,1], namely by a fuzzy subset of fi. We write £(S) for the set of all effects, namely all affine functions S -> [0,1]; when explicit reference to an observable is not needed we write a, b,... for its elements. The set £(S) carries the natural ordering a < & O- a(a) < b(a) for every a £ S, as well as a notion of partial addition: if 0 < a(a) + b(a) < 1 for every a £ S then the function S 3 a -»• a(a) + b(a) is again an effect, denoted a + b, and called the (partial) sum of a and b. Two effects are said to be orthogonal when their sum exists. The mentioned algebraic structure of £ (S), that fits the notion of effect algebra [7,8], makes it possible to define effect-valued measures (EV measures). A map E : 13(E) -> £(S) is an EV measure on B(E) if EE is the unit function (on S) and for every family X{ of disjoint elements of B(E) we have E(\Ji Xi) — *£li EXt. The notion of EV measure provides an alternative way of characterizing the observables. Indeed every observable A : S -> M ^ S ) defines the unique EV measure EA • B(E) ->• £(S) by (EAX)(a) := (Aa){X) for every a € 5, X G B(E), and, conversely, every EV measure E : B(E) ->• £(S) defines the unique affine map AE : S -> M*(E) by (AEOL)(X) :— (EX)(a). The general frame summarized above characterizes what we call a convexity model, a name justified by the basic role played by the convexity of the set of states. As better specified in the next Section, it encompasses the standard classical frame which adds for S the requirement of being a simplex, and for the observables the deterministic requirement of having no dispersion on pure states. Indeed, the first requirement makes S representable as the set M{l"(fi) of the probability measures on fl, and the second requirement forces an observable A : M^(Q,) -» M{I"(H) to map Dirac measures into Dirac measures, thus corresponding to a measurable function fi -4 S. The effects become represented by functions fi -> {0,1}, hence by (sharp) subsets of fi: the standard classical events are thus reproduced. Also the standard quantum frame is encompassed by the notion of convexity model. The set S of states is now the convex set Su of the density operators on a separable, complex Hilbert space Ti, with the set of pure states corresponding to the set of the one-dimensional projectors, and the nonsimplex nature of S-H mirrors the nonunique decomposition of mixtures into pure states. The observables are restricted to the projection-valued measures on the reals so that they correspond to the self-adjoint operators

51 on %. Denoting by A the self-adjoint operator associated to the observable A : S-u ->• M 1 + (R), the pair (A,X), X € B(R), determines, by the spectral decomposition of A, a projection operator P | x , and the probability that the observable A takes value in X, when the state of the system is D £ Su, is given by Tr(DP^ x) . The effects, represented by the projection operators, form a particular effect algebra: the projection lattice of H. The notion of convexity model includes also the so-called operational, or unsharp, quantum mechanics [9]: the set of states is S-H as before, while the effects now correspond to the positive operators and the observables are positive-operator-valued measures on the appropriate outcome spaces. We come now to the OPT frame, the one to be mainly considered in the sequel. It is a convexity model in which the set S of states preserves the simplex structure Mf(Q.) of the standard classical case, but the family of observables is now enlarged by dropping out the deterministic requirement that an observable A : Mf (fi) -y Mf (S) should map Dirac measures into Dirac measures. An effect (A, X), X € B(S), now becomes represented by a function of M+ (fi) into the segment [0,1]: explicitely, (A, X)fj, := (A/j,)(X), H € M{*"(fi). This function is uniquely determined by its restriction to the pure states so that the effects can be viewed as fuzzy subsets of H. It is known [2] that the quantum scheme (both the standard and the operational one) can be extended in the OPT frame based on the set of states M*(rin). There is an affine surjection R : M+(fi w ) -> Su which is one-to-one on the pure states and many-to-one on the mixtures, the counterimage of the density operator D £ Su being the family of all the convex decompositions into one dimensional projectors admitted by D. Any quantum observable A : Sy, -> M{I"(S) (if we refer to the standard quantum frame then E should be the real line) has the classical representative A o R : Mx+(fi-H) ->• M 1 + (S) which reproduces all the statistical properties of A. Clearly, not every observable on M^(VLy) need be the classical representative of a quantum observable.

3. The classical frame As already sketched in the previous section, the classical frame adopts for the set fi of pure states the structure of a measurable space, and all singletons {u>}, u> € fi, are assumed to be measurable. The set of all states, pure states and mixtures, is then identified with the convex set M+(fi) of all the cr-additive probability measures on fl. The pure states correspond

52

to the Dirac measures, namely the probability measures concentrated at a point of fi, to be denoted Su, w € ft. The set M+(ft) thus embodies the structure of a simplex whose extreme points correspond to the elements of fi: this meets the classical feature that the nonpure states have a unique convex decomposition into pure states. A random variable taking values in some measurable space 5 is usually denned as a measurable function F : fi -» S. This notion mirrors the deterministic requirement that a random variable must take definite values on pure states, namely that it has no dispersion on pure states. Notice that a measurable function F : Q, -* S extends in a natural way to the afnne map Ap : M+(fi) -> Mf(E), denned by (AFfi)(X)

:= niF-1 (X)),

X € B(E),

(1)

which will be called the distribution functional of F and corresponds to the notion of observable introduced in the previous Section. The pair (F, X) can be viewed as a two-valued experiment, which simply states whether the outcome of F does or does not fall in X. It will be called an event: the event that does (does not) occur when the outcome of F does (does not) fall in X. Clearly it is uniquely determined by the subset F~l{X) of fi: in other words the events are represented by the elements of the Boolean algebra B(U) of all the measurable subsets of fi. With some abuse of notation the subset F~X(X) will be sometimes denoted by the pair (F, X). When we refer to elements of B{Q) without special attention to the random variable they come from we will write a, b,... and say that fi{a), (J, £ A/1+(fi), is the probability of occurrence of the event o when the state of the physical system is / j . The deterministic nature of the frame here adopted means that when the state is pure, say 6U, then Su(a) = 0,1. Writing C(b\a; /x) for the conditional probability of b given a in the state (i, the standard classical recipe reads C(b\a;,) = ^

.

(2)

Let us recall a number of relevant properties of this definition. (i) If C(b\a; /x) = fi(b) then also C(a\b; fx) = n(a): in such a case the probability of occurrence of one of the two events is not affected by the occurrence of the other and we can speak of two mutually independent events. (ii) C(-|a;/i) belongs to M+ (fi) since we have C(0|a;/x) = 0, C(fi|a;/z) = 1, C(b U c\a; fj,) = C(b\a; /i) + C(c|a; /x) if b n c = 0. Hence C(-|a; fj) is a new

53

state of the physical system, to be called the conditioned state and denoted ^a\ so that we can write C(b | a;/x) = l^a\b). (iii) If we write /x = X^w» <^,> 0 < w, < 1, 53,-Wj = 1, for the convex decomposition of fj, into pure states (the countability of the decomposition is, however, unessential) the conditioned state takes the explicit form Ma) i as easily checked by noticing that ^2 Wi 6Wi (a) 6Ui (b) = Y,wi i

6

"i (o n 6) = /*(o n 6).

(4)

i

If the state \i is pure, say <5W, then Eq. (3) gives $£' = 6U, a property that qualifies the map fi i-> / / " ' as nondisturbing. We have in particular C(6|a;<$w) = 5W(&) for every pure state 5U, u £

fi,

(5)

which expresses the independence of any two events in a pure state, namely the absence of correlations in pure states. (iv) The Bayes law is met: C(6|o;Ai)/i(o) = C(o|6;/i)Ai(6).

(6)

(v) In presence of ordered events we have C(b\a; n) = 4 ^ r

if & C a, (and C(b\a; //) = 1 if a C 6),

(7)

M<0 hence, in particular, C(a\a;n) = l,

(8)

which can be read as a repeatability property: the occurrence of a is certain in the conditioned state / / " ) . (vi) The conditional probability of Eq. (2) is naturally related to the notion of joint observable. Let i*i : ft -> Hi and F2 : fi ~> H2 be two random variables: the measurable function i*!^ : fi —• Si x S2 defined by .Fi,2(k>) := (Fi(ui),F2(u>)) is their joint random variable. For every Xi £ 5 ( 5 0 and X 2 e B(S 2 ) we get F ^ 1 ^ x X2) = Ffl(X1)nFf1(X2),

54

and the distribution functional AFl 2, when acting on pure states, takes the product form (AnJuHXi

x X2) = (AFJUXXJ.)

• (AFJU)(X2).

(9)

Leaving aside the notation a, 6 for the events and denoting them by (Fi ,Xi), (F2,X2) we can rewrite Eq. (2) as C ( (

F 2

, X

s )

| (

f i

, X

1

) ; r i = ^ ^ M

:

(10)

the conditional probability, in the state /i, of the event (F2,X2) given the event (F\,Xi) can be read as the ratio between the joint probability of the two events and the probability of occurrence of the conditioning event. The existence of the conditioned state, expressed by the property (ii), is a hint for a popular picturing of the conditioning: what we might call the sequential picture. One imagines a temporal sequence in which the physical system, in the prior state fi, first enters some apparatus where the occurrence of the conditioning event o is checked, and then it emerges from that apparatus in some posterior state n^: the conditional probability of the event b given a is then viewed as the probability of occurrence of the event b in this posterior state /j,^. The sequential picture of conditional probabilities alludes to a strong idealization of the measurement process; the various instrumental devices that can actually be used in a laboratory to check the occurrence of a given event need not fit with such an idealization, and they may affect in different ways the state of the physical system. Thus, as long as one refers to actually used measurement devices, the notion of conditioned state becomes blurred, and the sequential interpretation of conditional probabilities becomes ambiguous, if not untenable. Despite of this, the sequential interpretation became a tenet, especially in quantum mechanics. 4. Conditioning in operational probability theory In the framework of the operational probability theory any two observables Ai : M1l"(fi) -»• M ^ (Si) and A2 : Mf(Q.) - • M+(H 2 ) admit a joint observable. Recall that an afnne map Ait2 • M+(f2) - • M+(Si x E2) represents a joint observable if, for every fj, G Mj + (fi), Xi € B(Ei), X2 G B(E2), we have (Aifi)(Xi) = (Ai, 2 /i)(Xi x S 2 ) and (A2fi)(X2) = (Ali2fi)(SixX2), namely if (Aifi)(Xi) and (A2fi)(X2) are marginal distributions of (Alt2fj,)(Xi xX2). Contrary to the standard classical case, however, a joint observable of Ax

55

and A2 need not be unique when none of them is deterministic. In any case, among the joint observables there is always the product observable A\ El A2 : Mf(Q) ->• M+(Ei x E2) denned on pure states by (At H A2 6u)(Xi x X2) = (AiSuXXi)

• (A2SU)(X2),

(11)

and extended to nonpure states by linearity. The conditional probability that the observable A2 takes values in X2 given that the observable A\ takes values in Xi, in the state fi £ M^~ (ft), will now be defined as C((A,,X,)|M„Jf,),rt-(^-f^').

(12)

This definition captures, we believe, the genuine statistical meaning of conditional probability. It mirrors the classical property expressed by Eq. (10) but now the nonuniqueness of the joint observable Ai,2 gives rise to a plurality of conditionals, a fact that appears not surprising in view of the nonuniqueness of the way of performing a joint measurement of two observables. Thus we have to speak of the conditional probability with respect to a specified joint observable: to outline this fact we will sometimes denote the conditional probability under discussion by C((A2,X2) \ (Ai,Xi);fi;Ai,2). Let us now examine some features of the above conditioning. As in the classical case the condition C((A2,X2) \ (Ai,Xi);fi) = (A2/J,)(X2) implies C((Ai,Xi) \ (A2,X2);(i) = (Ai(j)(Xi): this condition thus fits with the natural notion of mutual independence of the two effects. Clearly the Bayes property (see Eq. (6) of Sec. 3) C((A2,X2)

| (AuX1);fi;Al,2)

• (A1^)(X1)

=

= C((A 1 ,X 1 ) | (A2,X2);fx;Ah2)-(A2ti)(X2).

(13)

is met for every choice of the joint observable A%i2. The possibility of viewing the conditional probability of Eq. (12) as the probability of the conditioned effect (A2,X2) in some conditioned state occurs if the product A\ $3A2 is chosen as the joint observable. Indeed, writing P — Yli wi ^w; (0 < ttTj < 1, £ \ wt — 1) for the convex decomposition of fj, into pure states, we have C((A2,X2) | (AuX^nAiBAi) where

= (A2^'X^)(X2),

(14)

56

To prove this statement just notice that

(A2^X^)(X2)

E

= (AJ){Xi)Ylwi , A K7I A i

• W.*)(*i) • (A2SUI)(X2) =

\fv

v

(yli/iXXO Y

,

(Al ®A2 fl)(Xl

X X2)

(^M(*i)

(16) The map /J, i-+ ^(^i' x i) specified above is nondisturbing since it leaves fixed the pure states: if /x is pure, say Su, then 6^' 1 = Su. Thus we see that the properties of the standard classical case quoted in items (ii) and (iii) of Sec. 3 are preserved in the OPT frame when we refer to the product Ai G3 A2 as the joint observable. Notice that the equality dl *' l } = 8W implies (see Eq. (14)) C((A2,X2) | (AuX^S^A^At)

= (A26U)(X2).

(17)

which expresses the fact that any two effects (Ai,Xi), (A2,X2) become mutually independent in a pure state with respect to the choice Ai K A2 for the joint observable. The standard classical property (see Eq. (10)) of no correlations in pure states is thus recovered in the OPT frame when we refer to the joint observable Ai E3 A2. However, two effects need not be independent in a pure state when we refer to different choices of the joint observable: in this way the typical quantum phenomenon of correlations in a pure state manifests itself in the OPT frame through the existence of joint observables that do not have the product form, hence thanks to the nonuniqueness of the joint observable. The so-called quantum entanglement here appears connected to the way in which two observables are paired together to form a joint observable: in this sense the entanglement cannot be viewed as a property pertaining only to a state [10,11]. Let us stress that the occurrence of a conditioned state having the nondisturbing nature expressed by Eq. (15) rests crucially on the simplex nature of the convex set of states Mj + (Q). We have indeed the following theorem. Theorem 4.1. If S is the convex hull of fi := {uii,u2, ••••} then a map of the form ^2wi ciui

S 3 a = ^2wiC0i i-> — i

Wi

^i

Ci

>

0 < Wi ,ct < 1, y^Wj = 1

i

(18) exists only if S is a simplex.

57

Proof. Suppose S is not a simplex, so that the minimal cardinality of the set Cl of its extreme elements is 4. If there are just 4 extreme elements then they have to be " coplanar", and by a proper ordering of them, the segment (wi, 0J2) will intersect the segment (W3,CJ 4 ) in one point a which will admit the two convex combinations a = Wi U1+W2 W2 and a = W3 U3+W4 W4. A nondisturbing state transformation would move the first convex combination along the segment ( ^ i , ^ ) and the second convex combination along the segment (^3,604) thus producing two different elements of S. If the cardinality of fi is greater than 4 then it is always possible to pick up 5 extreme elements, say wi,a;2,W3,a;4,W5 in such a way that the segment (wi,W2) intersects the triangle {u)3,U4,u)§) in one point a which will admit the two convex combinations a = W\ a>i + u>2 ^2 and a = W3 0J3 +W4UJ4+W5 UJ5. A nondisturbing state transformation would move the first convex combination along the the segment (u>i,w2) and the second convex combination along the triangle (LJ3,W4,W5) thus producing again two distinct elements of S. • We finally remark that in the OPT frame the conditional probability defined by Eq. (12) does not fulfill a property analogous to the one expressed by Eq. (7). Indeed, the condition (A2,X2) < (Ai,Xi), which explicitly reads (A2IJ,)(X2) < (Aifjb)(Xi) for every // £ M 1 f (fi), does not imply C((A2,X2) I (A1,X1)^;Ah2) = { ^ j j f ^

(19)

nor (A2,X2) > {A1,X1) implies C((A2,X2) | {A^X^n;Ah2) = 1- In fact, the property of Eq. (7) rests on the deterministic nature of the standard classical case. Notice that also the repeatability condition expressed by Eq. (8) is not preserved in the OPT frame.

5. Quantum conditioning As in Sec. 2, we write A for the self-adjoint operator of 7i associated to the quantum observable A : Su ->• M^"(R), and P j x for the projection operator defined by the pair (A,X); when explicit reference to this pair is not needed we simply write P (or P\,P2,.. if different projectors are called into play). These projection operators are representatives of two-valued observables, hence of quantum events. The traditional recipe expressing the quantum conditional probability

58 C(P2 | Pi; D) of P 2 given P x in the state D £ Su reads C(P2|Pi,£)-

^(£>A)

,

(20)

which can be rewritten as

C(ft|J\;2>)='&(fl W ft),

D(A) =

J^L

(21)

hence as the probability of "occurrence" of P 2 in the conditioned state D(Pl\ This is usually referred to as the Liiders-von Neumann rule for the quantum conditioning, a rule based on the sequential picture. Notice that the condition C(P2 \ P\]D) = Ti(DP2), which states that the probability of occurrence of (the event associated to) P 2 is not affected by the occurrence of Pi, does not imply C(Pi | P2;D) — Tr(DPi); only in case Pi and P 2 commute does the above condition recover the symmetry property. Thus the conditional probability expressed by Eq. (20) fails to give rise to a natural notion of mutual independence of two events. The map of Su into itself defined by D \-> D^p^ does not leave the pure states (namely the one-dimensional projectors) fixed: such a map does not belong to the family of nondisturbing state transformations previously discussed. Actually, a nondisturbing map on S-u would be prevented by Theorem 4.1 of Sec. 4. This implies that the conditional probability C(P2 \ Pi; D) need not collapse into the probability of occurrence of P 2 in the state D when the latter is a pure state: in other words, correlations can appear in pure states. The Bayes property, that now would read Tr(£) Pi P 2 Pi) = Tr(Z)P 2 PiP2)), does not hold true, except the case in which Pi and P 2 commute. We have however the analogue of the classical property expressed by Eq. (7): C(P2 | P i ; D ) = ^ ~ \ if P 2 < Pi (andC(P 2 | PX;D) = 1 if P x < P 2 ). ir(.L'Pi) (22) Actually, this property is sufficient to imply Eq. (20), as shown in [12]. In particular we have the repeatability property C(Pi | Pi; D) = 1 which can be read by saying that the occurrence of Pi is certain in the conditioned state £)( P l ). The conditional probability of Eq. (20) can be connected to a notion of joint probability distribution only in the case of commuting observables.

59 Coming back to the more detailed notation PA. X- f° r t n e projection operator associated to the observable Ai and to Xi €
I PAUXI;D) = ^{D^pf^\

(23)

where the numerator T r ( D P j x^A^ x2) *s *n ^ ac * a J o m * probability distribution since PA x PA x is a projection operator. Clearly, this joint probability distribution has not the product form discussed in Sections 3, 4. A typical example occurs when H has the tensor product form "Hi (gi?^ and A\, A2 are, respectively, observables on V.1, % : in this case A\ ® h, h ® Ai (where I\, 72 stand for the identity operators of %\,'H.2) axe commuting operators of %! ® H2 and their joint probability distribution takes the form )=

^{DPA i®A2,Xly.X2

)•

(24)

This joint probability corresponds, for instance, to the polarization correlations in the experiments related to Bell inequalities and the so-called Einstein-Podolsky-Rosen issue. The fact that the quantity in Eq. (24) is not the product of probability distributions is responsible for the possible occurrence of correlations between the two observables, even when D is a pure state. In the jargon of quantm mechanics this is usuall referred to as the entanglement between the subsystems of a compound system. As well known, the problem of defining some physically meaningful notion of joint probability distribution in the case of noncommuting observables encounters unsolved difficulties. Thus, inside the usual quantum formalism, the conditional probability is not related to a notion of joint probability, hence to a notion of joint observable, despite the fact that the physical notion of conditioning and of correlation between two observables calls into play some idea of joint measurement. The fact that the Liiders-von Neumann rule of quantum conditioning cannot be associated to any notion of joint observable has its root in the fact that the numerator Tr(DPAi XiPA2 X2PAi Xi) in the r.h.s. of Eq. (20) defines a function, say / , on R 2 which is not a measure on R 2 for arbitrary D € 5 ^ , so that it cannot represent a joint probability. Indeed, one can easily verify that / need not be additive on disjoint subsets of R. The extension of standard quantum mechanics into the OPT frame, sketched at the end of Sec. 2, might offer a way to conjecture a quantum conditional probability which avoids some shortcomings of the Liiders-von Neumann one. Given two quantum observables Ai, A2 and a quantum state

60

D, consider the classical representatives A\ o R, A?. ° R and an element /i in the counterimage of D under R. Then, given a joint observable £?ij2 of Ai o R and A2 o R, we are led to define the conditional probability

When D is a pure state the choice of /x is unambiguous since the map R is one-to-one on pure states. When D is not a pure state the counterimage of D under R is not a singleton and the conditional probability defined above inherits a dependence on the statistical content coded in fj. through its convex combination into pure states, a statistical content which is not coded in the quantum state D. A similar emergence of the role of the statistical content of a state in the characterization of classical and nonclassical correlations has been pointed out in [10,11,13]. References 1. 2. 3. 4. 5. 6.

E.B. Davies and J. T. Lewis, Commun. Math. Phys. 17, 239 (1970). E. G. Beltrametti and S. Bugajski, Journal of Physics A 28, 3329 (1995). S. Bugajski, Int. Journal of Theor. Phys. 35, 2229 (1996). S. Gudder, Demonstratio Mathematica 31, 235 (1998). S. Bugajski, Mathematica Slovaca 51, 321 and 343 (2001). G. Ludwig, Die Grundlagen der Quantenmechanik: Springer, Berlin (1954); English edition: Springer, New York (1983). 7. R. J. Greechie and D. J. Foulis, Int. Journal of Theor. Phys. 34, 1369 (1995). 8. E. G. Beltrametti and S. Bugajski, Journal of Physics A 38, 3020 (1997). 9. P. Busch, M. Grabowski and P. J. Lahti, Operational Quantum Physics, Springer, Berlin (1995). 10. E. G. Beltrametti and S. Bugajski, Int. Journal of Theor. Phys. 42, 969 (2003). 11. E. G. Beltrametti and S. Bugajski, Int. Journal of Theor.Phys. 44, 827 (2005). 12. G. Cassinelli and N. Zanghi, II Nuovo Cimento 73 B, 237 (1983) and 79 B, 141 (1984). 13. E. G. Beltrametti and S. Bugajski, Int. Journal of Theor.Phys. 43, 1793 (2004).

E N T A N G L E D STATE P R E P A R A T I O N I N E X P E R I M E N T S O N Q U A N T U M NON-LOCALITY*

V. BERARDI, A. GARUCCIO Dipartimento Universita

E-mail:

Interateneo di Fisica, e Politecnico di Bari, and INFN - Sezione di Bari Via Orabona 4, 1-70126 Bari (ITALY) [email protected], [email protected]

In the last decade a growing number of experiments for testing local realism via Bell-type inequalities using spontaneous parametric down-conversion (SPDC) photon sources have been performed. In this short paper we demonstrate that experiments based on the use of TYPE-I SPDC sources cannot discriminate between quantum mechanics and local realism, since TYPE-I SPDC sources do not allow to define correctly dichotomic observables.

The possibility of producing correlated photon sources using spontaneous parametric down-conversion (SPDC) has opened a new realm for testing Einstein's locality via Bell's inequality. 1-4 By this technique a laser beam impinges on a birifrangent crystal and, when some suitable phase matching condition is satisfied,5 two correlated, down-converted photons are emitted, either collinearly or along different paths. Two different types of SPDC processes are possible: i) TYPE-I, in which the emerging photons have the same linear polarization; ii) TYPEII, in which the emerging photons polarizations are orthogonal one to each other. Now, let us focus our attention on experiments carried on by using TYPE-I sources. This approach was proposed by Ou, Hong and Mandel (see Ref. 3) and although it has been already demonstrated that the quantum state of the correlated pair cannot be treated as if it were merely •This work is partially supported by EUROPEAN UNION under project INTAS-01-2122.

61

62

obtained by a combination of the quantum states pertaining to the single particles, 6 only for the sake of simplicity and with reference to Fig. 1, let us consider two TYPE-I down converted photons emitted, in the same polarization state (say \H)), along different paths by an SPDC crystal pumped by a suitable laser beam. Both photons are reflected by a mirror and then one of the two photons goes through a TT/2 polarization rotator (namely a half-wave plate set at ±45 degrees) and emerges in state \V), whilst the other one crosses a compensation plate (usually a quartz plate). The two photons reach then a beam-splitter (BS) from opposite sides. After the BS and before the detectors there are two polarizers oriented at angles Oi and #2, respectively.

Minor

Figure 1. Schematic of the experimental apparatus: SPDC is the Spontaneous Parametric Photon source, BS is the BEAM SPLITTER, P the glass PLATE, R the Polarization ROTATOR, \H) and \V) are the photons states and the - sign indicates photon absorbtion at the polarizer.

Thus, after the BS, the state of the emerging pair is given by:

IVO = VTHTV |if t ) \V2) + VRHRV iyjRvTu |ffi) \VY) + WTVRH

\VI)

\H2) +

\V2) \H2),

(1)

where: RH, R V and TH, Ty are the beam-splitter reflectivity and transmissivity, respectively, with RH + TH — Rv + Ty = 1 ; \Ht) [\Vi)] is the polarization state along the H-direction [V-direction] for the photon in the i-th output channel of the beam-splitter. The quantum state \ip), expressed by Eq. (1), has been used for testing Einstein's locality by means of the well-known Bell-type inequality,

63

obtained by local realism together with some ad-hoc assumptions, like fairsampling and/or no-enhancement hypotheses: 7 ' 8 B(91,9[;92,9'2)

= P(91;92)-P(91;92) l

+

P(9'1;92)

l

+P(6[;6 2)-P(9 1;oo)-P(cx>;92),

(2)

and -1
(3)

In Eq. (2), P{9\;92) is the joint detection probability of a photon pair, when the beams emerging from the beam splitter pass through linear polarizers, set at angles 9\ and 92, respectively. P(9i;9'2), P{9'1;92), P{8'\]92) have a similar meaning. P(#i;oo) and P(oo;#2) are the corresponding probabilities when either one of the linear polarizer is removed. However - and this is the key point of this short communication - the local realism expressed by inequality (3) cannot be tested by using the quantum state (1). This is due to the fact that the inequality obtained by Clauser et al.7,8 can be deduced only if a binary choice between the transmission and absorption in a polarizer is assumed. However, when a down-conversion photon source is used so as to obtain a correlated pair, as the one described by the quantum state expressed in Eq. (1), the choice at each polarizer is not dichotomic. Besides the probabilities analysed by Clauser et al.7'8 it must be also considered that the two photons can travel along the same channel and reach only one of the two polarizers. For every choice of the polarizer orientations 9\ and 02 Clauser et al.7 introduced four probabilities P(9i±;92±). For instance, P ( # i + ; 0 2 - ) represents the probability that the photon which travels along channel 1 is transmitted through polarizer 1 and the photon which travels along channel 2 is absorbed by polarizer 2. As a consequence, the correlation function can be written as: E(91;92) = P{9l+;92+)

- P(0 1 + ;0 2 _) + P ( 0 i _ ; 0 2 + ) + P ( ^ _ ; 0 2 _ ) .

(4)

If we assume a dichotomic choice between transmission and absorption in a polarizer, the following relations hold: P(61+;62+)

+ P(0 1 + ;0 2 _) + P(91-;92+)+P(e1--,e2_)

P(91+;92+)

+ P(0 1 + ;0 2 _) = P ( 0 1 + ; o o + )

P(91+;92+)+P(91_;92+)

= P(oo+;0 2 + )

P(oo + ;oo+) = 1,

= 1

(5) (6) (7) (8)

64

where P(9i+; oo+), P(co+;#2+) and P(oo + ;oo+) are the detection probabilities with one, the other or both linear polarizers removed, respectively. Using Eqs. (5)-(8), Eq. (4) becomes: E(9i;92)

= 4P(0 1 + ;9 2 + ) - 2P(61+;oo+)

- 2P(oo + ;9 2 + ) + 1.

(9)

It is worth noting that only the terms corresponding to cases of double transmission appear in the above equation, which, besides the quantum efficiency of the detector, can produce a detectable signal outcome. This allows us to transform Bell's inequality \E(9V,92) - £(
+ E(9[;9'2)\ < 2

(10)

into inequality (3). But it follows from Eq. (1) that some photon pairs can travel along the same channel, reaching only one of the two polarizers. Thus, this quantum state does not satisfy all Eqs. (5)-(8). As a matter of fact, 12 to account for such occurrence, Eq. (5) and Eq. (8) must be replaced by: P(91+;92+)

+ P ( 0 i + ; 0 2 - ) + P(0i-;02+) + .P(0i-;02-) = ^

P(co+;oo+) = i ,

(11) (12)

which, in turn, implies that Eq. (9) must be replaced by: E(91;92)=4P(O1+;02+)-2P(O1+;oo+)2P(oo_;0 2 + ) + i .

(13)

Consequently, inequality (3) must be replaced, in the case of TYPE-I SPDC sources, by: -\
(14)

It is easy to prove that the inequality in Eq. (14) cannot be violated by the quantum-mechanical joint transmission probabilities for the correlated photon pairs described by Eq. (1). For ideal polarizers and detectors these probabilities are given by: 1

P(9i;92) = - [cos9isen92 + sen9icos92] =

Isen2(01+
P(0i; oo) = Jsen26»i + ^cos29x = - ,

2

= (15) (16)

65

if we assume, as one does in the performed experiments, j?- = -^- = 1. In fact, for example, the maximum value of the observable B according to the quantum-mechanical predictions expressed by Eqs. (15) and (16) is BQM

=\ ( V 2 - I ) < \

(17)

and, therefore, any SPDC TYPE-I photon source cannot give a quantum state suitable to test quantum mechanics vs. local realism, not even in the case of ideal behavior of polarizers and detectors. This result is in contradiction with the claimed locality violation in this class of experiments, but it is in complete agreement with the theoretical results on the subject. In fact, the state \ip) in Eq. (1) can be written in the factorized form ^^(s/T^^+ijR^^))

(y/TV\V2)-iy/R^\Vi))

and it has been proved that factorizable states always satisfy Einstein locality or, equivalently, Bell-type inequalities. 9 This theoretical consideration seems to be decisive in proving that the results discussed here are reliable. Also Santos pointed out in 1991 (although in the case of atomic cascade sources) that a great care must be taken with the selection of a sub-ensemble of emerging pairs. 10 More specifically, the joint detection probabilities have been computed not only by us, but also by all the authors of the papers in Refs. 1-4, by using the state vector (1), i.e. referring to the total set of produced photon pairs, without selection of any sub-ensemble. Our approach differs from the others since we deduce a new Bell-type inequality that can be applied explicitly in this case, whereas the authors quoted above use an inequality which does not correctly represent the locality in their experiments. The reasons why the Clauser and Home 8 approach fails in describing the limits of locality in these kind of sources has been discussed elsewhere. 11 Furthermore, it has been observed that in a Bell-type inequality (3) only joint probabilities appear. Hence, to overcome the problems related to the use of the quantum state in Eq. (1), it may seem "reasonable" to cut away the last two terms in it. If this procedure is applied in this case, the result will be the entangled (but not normalized) state M = \ (l-ffi) \Vi) + \Vt) \H2)),

(19)

and even using such a state no violation of locality can occur. Only if the normalization of the quantum state to the subset of photon pairs travelling

(18)

66

in both channels is imposed, the wave function becomes \^) = ^=(\H1)\V2)

+ \V1)\H2)),

(20)

which may lead to the violation of a Bell-type inequality. It is worth noting that, in this case, this operation is not at all correct in principle. In fact, the selection of the sub-ensemble of the coincidence pairs is neither a measuring process nor a state preparation. It is not a measuring process because the measurements are made behind the polarizers, hence they give a factorized state of two photons in a well defined polarization state. It is not a state preparation since other pairs of photons are travelling along the two channels actively contributing to the single photon detection rates. Moreover, it was argued in 199212 that the state in Eq. (19) can be reproduced in a hidden-variable local realistic model for physical correlated systems. If the quantum operation of normalization of the probability amplitudes is imposed on the state in Eq. (19), the state in Eq. (20) can be obtained, but any chance of physical and local interpretation of the model is lost and this in turn can lead to errors when quantum mechanics is compared to local realism. This result shows the critical importance of the probability interpretation of the wave function. In fact quantum mechanics can be obtained starting from local realism, but only after the normalization step, which allows the raising of correlation functions values and, consequently, the violation of Bell's inequality. It can be concluded that even discarding from the theoretical description the photon pairs which travel along the same path, experiments using parametric TYPE-I SPDC are not be reliable in testing Einstein's locality. After the criticism whose historical development we summarized in this contribution, AG proposed to use a new kind of SPDC sources, named TYPE-II. 13 In fact TYPE-II sources are somewhat immune from the arguments developed throughout this contribution and have been subsequentlyused for testing Bell's inequality.14 However, as a final remark, we note that even the use of TYPE-II SPDC sources does not prevent the experiment from being seriously flawed by the well know detector efficiency loophole. References 1. 2. 3. 4.

Z. Y. Ou and L. Mandel, Phys. Rev. Lett. 61, 50 (1988). Y. Shih and C. O. Alley, Phys. Rev. Lett. 61, 2921 (1988). Z. Y. Ou, C. K. Hong and L. Mandel, Optics Commun. 67, 159 (1988). S. M. Tan and D. F. Walls, Optics Commun. 71, 235 (1989).

67 5. P. G. Kwiat, K. Mattle, H. Weinfurter, A. Zeilinger, A. V. Sergienko, and Y. Shih, Phys. Rev. Lett. 75, 4337 (1995), 6. T. B. Pittman, D. V. Strekalov, A. Migdall, M. H. Rubin, A. V. Sergienko and Y. H. Shih, Phys. Rev. Lett. 77, 1917 (1996). 7. J. F. Clauser, M. A. Home, A. Shimony and R. A. Holt, Phys. Rev. Lett. 23, 880 (1969). 8. J. F. Clauser and M. A. Home, Phys. Rev. D 2 3 , 526 (1974). 9. V. Capasso, D. Fortunato and F. Selleri, Int. J. Theor. Phys. 7, 319 (1973). 10. E. Santos, Phy. Rev. Lett. 66, 1388 (1991). 11. A. Garuccio and V. Berardi, Found, of Phys. 33, 657 (2003). 12. L. De Caro and A. Garuccio, Found. Phys. Lett. 5, 393 (1992). 13. A. Garuccio, Annals of the New York Academy of Science 755, 632 (1995). 14. P. G. Kwiat, K. Mattle, H. Weinfurter, A. Zeilinger, A. V. Sergienko and Y. Shih, Phys. Rev. Lett. 75, 4337 (1998).

THE FIRST STEPS OF QUANTUM ELECTRODYNAMICS: WHAT IS IT THAT'S BEING QUANTIZED? SILVIO BERGIA Department of Physics, University of Bologna, via Irnerio 46 40126 Bologna, Italy Quantum field theory, and in particular quantum electrodynamics (QED), are milestones in the history of twentieth century physics. Not all interpretative problems presented by them, however, can be said to have been settled once and for all. In this paper the first papers on QED by P. Jordan and P. A. M. Dirac are briefly analysed, with the intent to show that their authors confronted from the beginning the essential of those problems, that their answers were not unambiguous, and that there are still some open questions concerning the subject matter. These aspects have already been discussed by various authors. I will re-propose them here as they seem to be still waiting for a conclusive word, hoping that this presentation will stimulate further studies along these lines.

1. Introduction Traditionally, in the meetings of these series, devoted to an analysis of the foundations of quantum mechanics, the attention concentrated on the interpretative problems presented by the theory as codified in the so-called Gottingen-Copenhagen interpretation. I hardly need to stress that it becomes possible to deal in critical terms with the contents of specific chapters of physics only referring to their history; on the other hand, the historical analysis of a given chapter often allows one to catch and single out aspects worthy of critical reflection. This of course means dealing with historical developments, for instance those which allow one to find a thread leading from the famous 1935 paper by Einstein, Podolsky e Rosen to the experiments of the Aspect group in the Eighties. Given the above, even though the historical analysis extended in time up to fairly recently, it did generally not extend in scope much beyond that sphere. Yet, everyone will agree that the history of twentieth century physics does not end up with that important chapter, and would probably subscribe that other chapters - that of elementary particle physics, to give but an obvious example should deserve a comparable attention, although they may not seem, at least at first sight, to offer an immediate occasion of critical reflection.

68

69 Quantum field theory - quantum electrodynamics in particular - has on the other hand not only been the basic theoretical instrument for the development of this and other chapters of physics, but also a field of research that seems to have already stimulated critical studies which deserve attention. And in fact the development of quantum electrodynamics and of field theory in general has received a great deal of attention by several authors. I will here recall in the first place the studies carried on by Arthur Miller,1 Sylvan Schweber2 and Tian Yu Cao.3 In Italy there has been no comparable body of studies (a notable exception is a contribution by Rossi4 on Pascual Jordan). There is however more than a signal that the process has started, even if we are not yet in the presence of a systematic approach, but rather of isolated attempts determined by the need to outline the background of the work of authors such as Fermi. I am here referring, in particular, to a fine essay by Marcello Cini.5 It is by no means occasional that Cini's paper refers to the work of authors such as those mentioned above, although, at the same time, it begins to single out themes worthy of further critical investigation. A lively association, A.I.F., is operating in Italy in the field of the history of physics in connection with its institutional goal, the teaching of physics at the secondary school level. In the last four years, in particular, the winter schools of the association have paid particular attention to the development of twentieth century physics. The intent was to pave the way for an appropriation of the subject by the secondary school teachers following the courses, at least as far as items such as nuclear and cosmic ray physics were concerned. Quantum electrodynamics and quantum field theory in general were also dealt with to some extent. My intention here is to concentrate on the first steps of quantum electrodynamics. It is what I did presenting the subject in the occasion of those schools, but the stress will be here on a quite different aspect: while the attention there fell on topics that would be possible to transmit, at least in perspective, to secondary school students, I will concentrate here on subjects that might, and perhaps ought to, be of interest to people concerned with the foundations of physical theories, with particular attention to some basic aspects that were pinpointed by the authors who first tackled the subject, such as Pascual Jordan and Paul A. M. Dirac. Referring to each of them there is something worth stressing in any attempt at reconstructing the trend of reflections in the field. The first one is the difference that was drawn between the quantization of the electromagnetic and of the matter fields, to the extent that quantization of the latter was not considered

70

at all by authors such as Dirac. The second has to do with the question as to what, in the electromagnetic case, had to be considered as the basic quantities: waves or light quanta? The third has to do with the very basic further question: what does quantizing a field really mean? These aspects have already been discussed by Schweber, Cao and Cini. I will re-propose them here in an extremely synthetic way, devoting a short section to each of them, in the hope that this presentation will stimulate further studies along these lines, given, as I will argue, that there are still some open questions concerning some of them. 2. Particles as described by wave equations and the e.m. field described in terms of light quanta: one and the same problem? I will give here for granted the processes which, within a couple of years, i.e., from 1925 to 1927, led to the formulation of quantum mechanics in its two versions, Heisenberg matrix mechanics and Schrodinger wave mechanics, to the conclusion that they were just two versions of one and the same theory, and to the standard interpretation of the latter, the one we call the GottingenCopenhagen interpretation. I may have to fill in some details when necessary. Not every interpretative problem was however to be considered settled, in particular as far as what we call the wave-particle duality is concerned. The general viewpoint was that the question could not be considered on the same footing when dealing with particles described by wave equations or with the electromagnetic field described in terms of light quanta. In the latter case there existed a field, accessible to direct measurements, the electromagnetic field of classical electrodynamics; but this was not the case for the material particles described by quantum-mechanical wave equations. Wave fields were in fact associated with them in a definite way, and gave rise to observable effects", but it soon turned out that they propagated, in general, in configuration rather than in real space, and, most of all, that they just carried information on the probability distribution of the particles they described, and, as such, were not accessible to a direct determination. The difference between the two cases was stressed by Dirac in his own way in his first basic paper on the subject:6 "It should be observed that there is a difference between a light wave and the de Broglie or Schrodinger wave associated with the light-quanta. Firstly, the light-wave is always real, while the de Broglie wave associated with a light-quantum moving in a definite direction must be taken to involve an imaginary exponential. A more important difference is that their intensities are to be interpreted in a different way. The number of a

Experiments by C. J. Davisson and L. H. Germer, and by G. P. Thomson.

71 light-quanta per unit volume associated with a monochromatic light-wave equals the energy per unit volume of the wave divided by the energy (271 h )v of a single light-quantum. On the other hand a monochromatic de Broglie wave of amplitude a (multiplied into the imaginary exponential factor) must be interpreted as representing a2 light-quanta per unit volume for all frequencies"b. And, from the very beginning, as to the system to be quantized Dirac refers strictly to the electromagnetic field. It is therefore all the more surprising that his paper be considered as the one that introduced what has become known as "second quantization": namely Dirac would have quantized Schrodinger's wave function - itself an object of a quantum nature - showing that in this way to the wave become associated corpuscles, the field quanta. This is one of the issues that deserves further attention by historians of physics. Nor the meaning of a wave function quantization seems to have as yet received sufficient attention. The above arguments do not provide an answer to a question that rises spontaneously: what, if any, is the wave equation describing quantummechanically the light-quanta? The answer that would sound obvious d'Alembert's equation - unfortunately does not seem to be the right one. Nor there seem to exist other viable wave equations. I have in fact found in the literature no answer to the objection raised against the conclusion drawn by Akhieser and Berestetski in their treatise:7 after introducing a "photon wave equation" in momentum space and its Fourier transform f(r), and shown that the latter can be normalized in the usual way, they conclude that its squared modulus "cannot be interpreted as the probability density for finding the photon at a given space point". The reason? In the authors' words, the circumstance that "the presence of a photon can be established only by its interaction with charges. This interaction is determined by the electromagnetic field vectors E and H at a given point. The latter, however, are not determined by the wave function f(r) at that point, but by its values in all of space"c. Still, it is customary to deal with the paradoxical situation arising when trying to describe double slit experiments in terms of photons having recourse to probability amplitudes. To give but an example, I will quote from an altogether fine book,8 a text for a physics course for science and engineering students: "The mathematical formalism that resolves the paradox is to represent each particle as a probability amplitude y/(x,y,z,t) [•••]• When an event can occur in several alternative ways (such as one path through slit A and another through slit B) the probability amplitude for the event is the sum of the separate probability b c

Dirac, op. cit., p. 247. Akhiezer and Berestetski, op. cit., p. 17, where one can find further details.

72

amplitudes .. ,"d. And this even if it is generally acknowledged that the squared modulus of the four-potential A, which through the term j • A determines the coupling of photons to charged matter, does not represent a probability density. A brief - though very preliminary - conclusion. The question was addressed by Dirac, who made a definite choice: quantization of wave fields means in fact quantization of (only) the electromagnetic field. This is not, of course, today's conclusion: matter fields are subject to quantization as well. Dirac had however spotted one central point: the two cases are not on the same footing, since there is no physical field associated to massive particles. On the other hand, the naive attempt to identify a quantum-mechanical wave equation for the photon does not seem to work, in other words photons cannot be dealt with at the level of first quantization; in particular, at that level, there is no legitimate treatment of interference effects. The correct treatment should arise from QED. I will add a short final comment on this point. Dirac, who did not try to write down a photon wave equation, was then on the right track. 3. What comes first: The wave or the particle? About this point, the second author considered among the founders of quantum electrodynamics, Pascual Jordan, seemed to have a very clear idea from the beginning. To his eyes, the problem was that presented by "the vexing problem of Einstein's light quanta"6. And he was quite clear about what he meant with that and the way to follow in order to solve it: one should be able to explain the corpuscular properties of the electromagnetic field (the existence of the light quanta) and of its interactions with charged particles, "by applying quantum mechanics to the Maxwell field itselff. Already in his 1925 work with Born it was stated that the electric and the magnetic fields should be regarded as dynamical variables, represented by matrices and subject to quantization rules. A little later there appeared the so-called "Dreimannerarbeit", by Born, Heisenberg e Jordan,9 in which matrix mechanics obtained its final formulation. The last chapter of it is due to Jordang. I will come back on Jordan's argument given there in the next section. d

Orear, op. cit., p. Quotation in Cao, op. cit., p. 159. f ibidem. g "... in these two papers [...] the parts concerning quantum electrodynamics, that is the quantum theory of the electromagnetic field, which in turn provides a model for the quantum theory of fields in general - according to direct testimonies by Jordan in his correspondence with Born and to subsequent papers of his, and as later historiography has fully confirmed - is mainly Jordan's work": Rossi, op. cit., pp. 103-104 (translation mine).

e

73

It must, I believe, come as a surprise that this was not quite Dirac's attitude. In the Introduction of the paper, the reader is invited to consider "an atom interacting with a field of radiation"11; and it is anticipated that she/he will be shown how the assumption that certain variables become q-numbers "gives lightquantum properties to the radiation"1. Dirac seems therefore to hold on radiation a viewpoint similar to Jordan's, or, in Cao's words, to share a wave ontology. He will indeed come back to this set-up in the last section, but, as he tells from the Introduction, in the body of the article he will "build up the theory from the light-quantum point of view"J, that is, as correctly commented by Cini, starting from the opposite point of view. His general attitude seems to be well represented by the sentence: "There is thus a complete harmony between the wave and the light-quantum description of the interaction"11. An alternative way to pose the same question would perhaps be: what is light made of? Not everyone seems to agree that nature does not make a definite choice, i.e., that there is an essential duality. To consider an influential example, just consider what Richard Feynman writes in his semi-popular account of QED:10 "Newton thought that light was made of particles - he called them corpuscles - and he was right (but the reasoning that he used to come to that decision was erroneous). We know that light is made of particles because we can take a very sensitive instrument that makes clicks when light shines on it, and if the light gets dimmer, the clicks remain just as loud - there are just fewer of them [...] each little lump of light is called a photon [...] You might wonder how it is possible to detect a single photon. One instrument that can do this is called a photomultiplier ..."' "I want to emphasize that light comes in this form particles. It is very important to know that light behaves like particles, especially for those of you who have gone to school, where you were probably told something about light behaving like waves. I'm telling you the way it does behave - like particles. You might say that it's just the photomultiplier that detects light as particles, but no, every instrument that has been designed to be sensitive enough to detect weak light has always ended up discovering the same thing: light is made of particles""1. I shall gladly leave to the reader the task of describing - or should I say designing (if possible) - a device capable of making a click when a signal from a h

Dirac, op. cit., p. 244. ' ibidem, pp. 244-245. J ibidem, p. 245. k ibidem. 1 Feynman, op. cit., p. 14. m ibidem, p. 15.

74

radio source emitting in the realm of long waves - say a hundred meters - is picked up by a receiving antenna. I would rather like to stress that the duality wave-particle has simply disappeared, so that even the question "what comes first?" seems to have lost meaning: there are no waves whatsoever, neither "probability waves" nor - so it would seem - electromagnetic waves. This conclusion will probably sound outrageous, and - I believe - needs some comment. I will argue, in my final remarks, that what seems to me an appropriate reading of Feynman's argument will, at the same time, provide support to the claim alluded at above that a correct quantum treatment of interference effects can only be obtained in QED. But, no doubt, Feynman's statements deserve further attention by experts in the field. 4. What does it really mean quantizing a field? In the final chapter of the "Dreimannerarbeit" Jordan took up Einstein formula for the mean value of the energetic fluctuations of electromagnetic radiation described by Planck's law, =hv<E>+

-f^f (8xv2/c3)vdv and showed that it could be derived starting from the description of cavity radiation as a set of independent harmonic oscillators subject to the quantization condition qp - pq = ih • I • Cao, however, appropriately stresses that "the concept of the quantization of a field has two meanings: (i) the quantization of field energy (or, more generally, of the mechanical motion of the field), a quantization that was similar to the quantization of the mechanical motion of particles dealt with in quantum mechanics; (ii) the quantization of the field as a "substantial entity", for "the field cannot be regarded as a synonym for the energy of the field, but the former is related to the latter as an owner is to his possessions". Cao's conclusion, to be shared, is that "what Jordan could 'prove' was merely that the energy states of the field were quantized, a result that was exactly what he had presupposed but had nothing to do with the quantization of substantial Maxwell fields themselves."" Let us now turn to Dirac. In the Introduction of his paper he considers an electromagnetic field confined in a Planckian cavity, in order to be able to deal "Cao, op.cit., p. 160.

75

with a discrete set of degrees of freedom, and writes that energy and phase of each component can be considered as dynamical variables describing a radiation field, which can be assumed to form a couple of canonically conjugated variables. To deal with the system in quantum-mechanical terms, it is necessary to assume that the energy E and the phase Qt of each component obey the quantization rules [er,Es]=ihSIS

(1)

Energy and phase become in this way "q-numbers"; this is the assumption that according to Dirac, as already recalled, "gives light-quantum properties to the radiation". I will skip the entire central part of the chapter0 except for one point that bears on the matters under discussion, namely the fact that, in connection with the circumstance that one is dealing with a set of particles, their number Nt in a component is introduced, and it is claimed that a certain procedure, which I shall not try to reproduce, N and 6 become, in turn, canonical variables. Both Schweber and Cao subscribe that this implies the validity of the quantization rules [et,N,]=ihSa,

(2)

which thus would replace the previously formulated ones, even though Dirac does not write them explicitly. It is of course important to recall that the iV can have only integer eigenvalues. A general comment concerning both sets is that the status of 0r as a hermitian operator is doubtful;1213 I will come back on this point. A more specific comment concerns the question addressed by this section: is thus quantization of the field properly achieved? The answer seems to be that Eqs. (1), once more, means quantizing the field energy, the essential novelty being introduced by Eqs. (2). In the body of the article, as previously recalled, Dirac's purpose was to "build up the theory from the light-quantum point of view". In the final section, he comes back to his initial viewpoint, privileging the field over the particles, and states, without further comment, that "We can, as explained in sec. 1, suppose the field to be described by the canonical variables N , 6X. • ."pHowever, in the two contexts one is not dealing with the same physical system: Nr was born as a number of particles perturbed by an atom (!), now it 0 p

I have dealt with it in some details in my A.I.F.-school lecture (Bergia"). Dirac, op. cit., p. 252

76

should be identified with the number of energy quanta in the state r arising from the quantization of a wave field. Doubts about the legitimacy of this identification have been raised by Cao. One can nevertheless say that Dirac had the result at hand, since he might have passed from the variables ET and #r to Nt and &r within the ambit of his initial set-up; and it is what he would do in a paper published in the same year.14 5. Potentiality and up-to-dateness of Dirac's work I would like to come back to a question left open in the last section: that concerning the (impossibility to associate a hermitian operator to the phase of a wave. It has been taken up anew in more recent times, with results that it is worthwhile recalling in this context, as it is in Dirac's work under exam that the premises for these developments were set. In a section titled "The mode phase operator" of his treatise on the quantum theory of light,15 R. Loudon writes: "In the classical theory of light waves, it is convenient to write the complex electric field as a product of a real amplitude and a phase factor [...] It is similarly convenient in quantum mechanics to make a separation into amplitude and phase factors. To do this, it is necessary to introduce the concept of phase into the quantum-mechanical description of the field. [...] There is in fact no unique prescription for the way in which the separation should be accomplished in quantum mechanics and there is a corresponding degree of arbitrariness in the definition of the quantum-mechanical phase operator. The main considerations are that the quantum-mechanical phase should have the same significance as the classical phase in the appropriate limit, and that the phase should be associated with hermitian operators so that it is (at any rate in principle) an observable quantity" (emphasis added)q. Loudon proves next that the operators cos0 -—(exp(/0)+exp(- i(/>)\ sen^ =—{exp(i'0)- exp(- ifi)} fulfil these requirements, and that the following commutation rules with the particle number operator are satisfied: |7V,c6s^J=-/sen^ |7v\sen0]=j'c6s0. q

Loudon, op.cit., p. 141.

77

The comment that follows is particularly relevant: these rules, writes the author, "show that the number and phase operators do not commute and it is therefore not possible, in principle, to set up states of the radiation field that are simultaneous eigenstates of two of the operators"' [...] The results of measurements of the amplitude and phase are governed by the uncertainty relations: ArcAcos^ > — (sen^) AnAsen^> — (cos^)| • Loudon provides specific examples: in the case of a definite number n of photons, the uncertainties in the sine and cosine are such that the phase can have any value between 0 and 2n , i.e., it is completely undetermined. Miller asks himself if "the complete harmony between the wave and the light-quantum description of the interaction", looked for by Dirac, might have "influenced Bohr's thoughts toward complementarity"8. It seems to me that the influence could have rather been exerted by Dirac's prefiguration, although in an incomplete and not quite correct form, of commutation rules holding between number and phase operators. Left aside specific questions about complementarity, there does not seem to exist doubts as to the fact that number (corpuscles) and phase (waves) appear both necessary for a quantum-mechanical description of a radiation field. How are we then to take Feynman's statement that light is made of particles? A recent new edition of his semi-popular account of QED I referred to was reviewed by A. Zee', who is quite explicit on several points, but does not directly comment on this crucial issue. He writes, however, that many of the readers of the book "may be legitimately puzzled [...] for the absence of the wave function that figures so prominently in other popular discussions of quantum theory". The point is - and Zee duly stresses it - that Feynman, who does not even hint at the thing, was in fact trying to transmit to near laymen the essential content of his path integral formulation of quantum mechanics,16 and that this formulation is equivalent to Schrodinger's but does not need waves - which does not necessarily mean that they, in the form of electromagnetic waves, do not exist.

' ibidem, p. 144. s Miller, op.cit., p. 23. ' In can be found in http://theory.itp.ucsb.edu/~zee/feynman.html

78

Hopefully, this will settle things about Feynman's puzzling statement. Which I would not have quoted inasmuch as not strictly pertinent to this attempt at a historical reconstruction, were it not for the fact that it contributes to outline the background for the claim alluded at above that a correct quantum treatment of interference effects in optics can only be obtained in QED. The path integral formulation of quantum mechanics, applied to the propagation of photons, appears in fact to offer a way out from the difficulty connected with concerning the very concept of a photon wave function. But the other formulations do not. The question arises if QED can provide a description of interference phenomena in optics without having recourse to any kind of first quantization description of photons. The answer is of course yes. The essential point is that the (quantized) four-potential A is expanded into operators which create and annihilate photons corresponding to particular modes of the field, but that these modes correspond to solutions of the classical Maxwell equations written in terms of the A field. Thus, even if its squared modulus, as recalled above, does not represent a probability density, there is a relation between the wave field A, which, apart from subtleties relating to gauge freedom - it is perhaps worth repeating it continues to exist, and the detection of photons, the latter taking place where there is a current density j whose coupling to the wave field is determined by

JA. As a final comment concerning these matters, let me just state that Dirac, who did not try to write down a photon wave equation, was then on the right track. References 1. A. I. Miller, Early Quantum Electrodynamics: a Source Book (Cambridge, University Press, 1994). 2. S. S. Schweber, QED and the Men Who Made It: Dyson, Feynman, Schwinger and Tomonaga (Princeton University Press, 1994). 3. T. Y. Cao, Conceptual Developments of 20th Century Field Theories (Cambridge University Press, 1997). 4. A. Rossi, in Quanti Copenhagen? - Bohr, Heisenberg e le Interpretazioni della Meccanica Quantistica, I. Tassani ed. (II Ponte Vecchio, Cesena, 2004). 5. M. Cini, in Conoscere Fermi nel Centenario della Nascita - 29 Settembre 1901-2001, C. Bernardini and L. Bonolis eds. (Edizioni Scientifiche SIF, 2002). 6. P. A. M. Dirac, Proceedings of the Royal Society of London A114, 243 (1927).

79

7. A. I. Akhiezer and V. B. Berestetsky, Elements of Quantum Electrodynamics (Oldbourne Press, London, 1962); A translation of A. I. A. and V. B. B., Kvantovaya, Electrodinamika (Moskva, Gosudarstvennoe Izdatelstvo Technico-Teoreticheskoi Literaturi, 1953). 8. J. Orear, Physics (Macmillan Publishing Co., Inc., New York, 1979). 9. M. Born, W. Heisenberg and P. Jordan, Zeischrift fur Physik 35, 557 (1925). 10. R. P. Feynman, QED - The Strange Theory of Light and Matter, (Princeton University Press, 1985). 11. S. Bergia, La Fisica nella Scuola, Quaderno 17, Anno XXXVIII n. 4 Supplemento, 70 (2005). 12. T. Akioglu and E. Tepedelenlioglu, J. Phys. A: Mat. Gen. 33, 6357 (2000). 13. P. Carruthers and M. M. Nieto, Reviews of Modern Physics 40, 411 (1969). 14. P. A. M. Dirac, Proceedings of the Royal Society of London A114, 710 (1927). 15. R. Loudon, The Quantum Theory of Light (Clarendon Press, Oxford, second edition, 1983). 16. R. P. Feynman and A. R. Hibbs, Quantum Mechanics and Path Integrals (McGraw-Hill, 1965).

ON THE MEANING OF ELEMENT IN THE SCIENCE OF ITALIC TRADITION, THE QUESTION OF PHYSICAL OBJECTIVITY (AND/OR PHYSICAL MEANING) AND QUANTUM MECHANICS GIUSEPPE BOSCARINO Sortino

(Siracusa)

It is questioned: Is quantum mechanics a new science or a new (or rather old) philosophy of physical science? It is shown that Einstein's attempt in his article of 1935 to bring the concept of "element" from the classical (we call it Italic) philosophical-epistemological tradition, which goes under the names of Pythagoras Parmenides, Democritus, and Newton, into quantum mechanical theory is unclear, inadequate and contradictory.

1.

Introduction

"The best clue of the reality of phenomena, the one that alone is enough, is the success of the prediction offuture phenomena on the ground of past or present phenomena, whether the prediction is based on a reason or hypothesis so far confirmed or it is based on a custom constantly observed". To a community of specialists in foundations of quantum mechanics (hereafter QM), this quotation might appear as a paraphrase of the criterion of physical reality enunciated by Einstein in his famous article of 1935 written together with B. Podolsky and N. Rosen: "Can Quantum-Mechanical Description of Physical Reality Be Considered Completel". It recalls Einstein's criterion of physical reality, which he holds in agreement with the notion of physical reality of both classical mechanics and quantum mechanics. It seems as if one is quoting, perhaps in a somewhat inappropriate way, Einstein himself. Yet, it is not so! The quotation is drawn from a short writing by the great philosopher Leibniz entitled: "On how to distinguish real phenomena from imaginary phenomena", where, however, it is inserted in a completely different setting and, in our opinion, within a tradition of thought on the concept of physical reality that has completely different nature and greater epistemological consistence. In fact, what in Einstein's article is said on the meaning of element of physical

80

81 reality seems to us not very clear, far from the classical physical tradition, recalled in the article in an indistinct way (which one?), which we think goes under the names of Phythagoras, Parmenides, Democritus, Euclid, Archimedes and Newton, who certainly are its greatest interpreters. For this reason, we think, in our modest opinion, that we can altogether accept Boniolo's conclusion, although the issue deserves further investigation both from an epistemological and philosophical, and even from a sociological, viewpoint. He rightly writes: "This time of the dispute on the foundations of quantum mechanics is characterized by its being merely philosophical: it is the typical instance of a fight between two different and incompatible Weltanschauungen. What we want thus to emphasize is the mere fact that many discussions on the foundations of physics do not regard either the models or the images of the world, but the intuitions of the world. And being aware of that leads to deal with and discuss the problems according to correct guiding principles. For example, if this were taken into proper account there would be no such quaint mathematical and physical excesses as those by which many researchers try to solve the so-called EPR's paradox".2 But let us proceed in order, citing first Einstein's criterion of physical reality and the setting where it is found in order to better understand its epistemological and philosophical significance , then citing the setting where Leibniz' criterion of physical reality mentioned above is found. This will allow us to grasp the meaning of 'element' of classical physics as rigorously expressed by the great philosopher Leibniz (by the term 'philosopher' we mean the physicist, mathematician, logician, linguist, metaphysician, and so on conceived as a whole according to the best Italic tradition on the meaning of "philosophy"), by comparing it with the meaning of Einstein's 'element' who, in spite of his effort to refer to the classical tradition and its language, remains entangled in the confusion of the concept of physical reality of QM as usually interpreted first by Bohr in his response to Einstein's article, then by Heisenberg and later by the copious literature (for example d'Espagnat, to mention one). 2.

On the meaning of 'element'. The principle of physical reality and the criterion of physical objectivity

(1) In the article cited Einstein writes:

82 "Any serious consideration of a physical theory must take into account the distinction between the objective reality, which is independent from any theory, and the physical concepts with which the theory operates. These concepts are intended to correspond with objective reality, and by means of these concepts we picture this reality to ourselves"? Thus for Einstein on one hand there is an "objective reality" considered independent of the subject that builds the theory, on the other there are the "physical concepts", which are our representations of this "objective reality". Now, in our opinion, there is a confusion here, which is then repeated in the literature dealing with this matter and in the literature dealing with the so-called "realism of theories". In fact, one thing is the assertion of principle, which has metaphysical nature (in the sense "of that which is neither formally decidable nor empirically verifiable"), of the existence of a physical reality independent of the subject that builds the theory, which belongs to the field of individual and collective beliefs, and as such subject to external influences of various nature (cultural, religious, social, economic, etc.), another thing is the so-called physical objectivity that the theory works out or reconstructs from the empirical or sensible reality, of an entirely different kind, which is thought to arise either from common structures belonging to the knowing subjects or from the agreement of different subjects on the common methods of elaboration and control of the theory, which therefore are not object of demonstration but either of discovery or argumentation, in any case of metatheory, in the latter case subject, like the first, to the external influences of various nature mentioned above. Kant believes that these common structures are found within the knowing subjects, in structures that he calls transcendental; others in social structures or practices;4 others in particular epistemic practices. Since Einstein's discourse develops from these last, let us try to look into it more carefully in order to find out if the meaning of EPR's 'element' is that of classical physics according to the names we have indicated: "The elements of the physical reality cannot be determined by a priori philosophical considerations, but they must be founded by an appeal to results of experiments and measurements. A comprehensive definition of reality is, however, unnecessary for our purpose. We shall be satisfied with the following criterion which we regard as reasonable. If without in any way disturbing a system, we can predict with certainty (i.e., with probability equal to unit) the value of a physical quantity, then there exists an element of physical reality corresponding to this physical quantity. It seems to us that this criterion, while far from exhausting all possible ways of recognizing a physical reality at least

83

provides us with one such way, whenever the conditions set down in it occur. Regarded not as a necessary, but merely as a sufficient, condition of reality, this criterion is in agreement with classical as well as quantum-mechanical ideas of reality"? Here Einstein makes an assertion of principle about the meaning of 'physical objectivity', or 'physical meaning' or 'physical being', as it is otherwise termed, when he regards it as founded on results of experiments and measurements. However, as in much literature on this subject, 'physical objectivity', which has empirical nature and regards experiments, measurements and observations, is mistaken for 'physical meaning', which regards the physical being, i.e. the rational conditions that single it out. Boniolo rightly increases the number of assumptions specified by M. Jammer in EPR's article.6 Anyway, these assertions have nothing to do with physics, they belong to meta-physics, that is to a different meaning of metaphysics, to a different semantic level, to the field of assertions on "physical assertions", to the field that medieval logicians called 'second intension'. Then again, the statement "there exists an element of physical reality corresponding to this physical quantity" shows another semantic shift as Einstein jumps from the probability value equal to unit, that is from a number, which is assigned to a value or measure of a physical quantity, hence to another number, to the existence of an element of physical reality without having first defined: (1) what is meant by 'element', (2) whether the physical reality is the one independent of the subject or the objective one built by the theory, (3) what is meant by "existence of an element of physical reality", where there appear only numbers, which in the theory correspond to physical quantities, without having first either assumed or defined them, thus jumping from those that Newton would have called "relative, apparent and sensible" , that is measures or numbers, to the "absolute, true, mathematical", that is the 'physical objective', as the element corresponding in the theory to physical quantities can be. D'Espagnat attempts to clear up the meaning of 'element' by calling it a 'property' of the physical system7 but he fails to specify that not always a 'physical property' is a 'physical quantity', as Newton well knew when he wrote in his Principles that "positions properly have no quantity"? For him, they are instead properties of places, hence of the bodies occupying them. In EPR's article a position is assumed as a physical quantity and so is in Bohr's replying article.

84

In short, EPR use a language that claims to be classical when they refer to such terms as 'element' and 'physical quantity', but the philosophy that leads them is actually a mixture of nominalism, phenomenism or operationalism, since numbers, which are names of we do not know what, found the objectivity of 'physical quantities' or 'physical properties'. The latter, which for Newton can only be "absolute, true, mathematical" should not be confused with relative quantities, on which he incisively writes: "Wherefore relative quantities are not the quantities themselves, whose names they bear, but those sensible measures of them (either accurate or inaccurate), which are commonly used instead of the measured quantities themselves. And if the meaning of words is to be determinedfdefiniendae] by their use, then by the names time, space, place, and motion, their measures [mensurae sensibilies] are properly to be understood; and the expression will be unusual, and purely mathematical, if the measured quantities themselves are meant. On this account, those violate the accuracy of language, which ought to be kept precise, who interpret these words for the measured quantities: nor do those less defile the purity of mathematical and philosophical truths, who confound real quantities with their relations and sensible measures [vulgaribus mensuris]".9 Here "mathematics" and "philosophy" are for Newton synonyms of "rational", so it is true that instead of the "absolute, true and mathematical" in human things we use relative measures, but then he writes: "in philosophical disquisitions we ought to abstract from our senses and consider things themselves, distinct from what are only sensible measures of them". Clearly enough, Newton is aware that:(l) at a certain epistemological level, the battle on the meaning of "physical objectivity" is "philosophical", determined by a priori philosophical reasons, which Einstein tries to elude; (2) assuming sensible measures, i.e. numbers, without having first assumed and defined the rational conditions that single out physical quantities entails falling into a coarse philosophical empirism disguised under the name of "science". Experiments and measures can only test the existence of physical magnitudes or properties but they cannot found or mean11 them in their objectivity or physical meaning, which can only have rational nature. In fact, either measures, i.e. names, are only names of we do not know what, and therefore one can only state the existence of names without the physical thing, or one unjustifiably jumps from the existence of the particular thing, i.e. measures, to the existence of the universal thing, i.e. the physical magnitude, of which we can only have the name, not the measure.

85

This is precisely what empirists (there only exist particular things), nominalists (there only exist names), idealists (there only exist ideas) reproached to the coarse realism in general, which pretends to assert the existence of universal things starting from the universal thing, to whose name there corresponds no particular thing or measure, thus contravening its own philosophy founded on the criterion of the existence of things only, object of the senses and measurement operations. In this respect Bunge, epistemologist of assured competence, sharply writes: "Every empirically testable theory must be interpreted before we can hope to test it: we must know what it is about — and this is all that is meant by 'meaning'. In short, meaning is necessary though insufficient for testability, and the latter is sufficient though unnecessary for meaning". "Far from being experiment which determines the meaning of theoretical symbols, it is theories, and only theories, that enable us to interpret empirical operations" } 3 "The most popular criterion of reality is the one of measurability: "to be is to be measurable." Like most popular belief about science, this one is inadequate: measurability is very often indirect and is always dependent on theory — remember that our predecessors made accurate measurements of properties of nothings, such as the caloric, and that hundreds of our contemporaries are probably measuring properties offictive particles. In any case, every physical theory presupposes the philosophical hypotheses that there are physical objects (mind-independent things), that most of them are imperceptible (Hertz, 1894), and that some of them are knowable if only in part (Thomson, 1963). Should these hypotheses be dropped we would turn to introspection and mysticism"}A Bunge himself, however, does not distinguish between the principle of physical reality, in the sense we have already indicated, and, let us say, the criterion of physical meaning and physical objectivity, which are partly of metasemantic or meta-physic nature, partly of meta-methodological nature, consequently the former defines the meanings of physical meaning, the latter the methods of control and test of physical objectivity, which concern social practices and measurement theories. But now let us confine ourselves to delving, by a brief analysis, into Leibniz' text, where, in our opinion, a tradition of thought flows together with its own way of conceiving physical objectivity and the so-called principle of physical reality, whose finest expression was, however, given by Newton, the utmost interpreter of classical physics.

86

(2) In giving the conditions of real existence of an entity Leibniz not only gives the sufficient condition but also the necessary one. "What exists is either a being or a possible ",15 he writes. Then he adds: "In order for a being or a possible to be able to exist it must be non-contradictory". Moreover, in order for an existing being to be real it must satisfy some clues, the best being the one indicated at the beginning of this writing. In any case, he then concludes: "With no argument at all is it possible to demonstrate in an absolute way that bodies exist and nothing can prevent us from thinking that the objects of our mind are well-ordered dreams that are deemed by us as true and practically equivalent to the true ones thanks to their correspondence"'.17 Leibniz thus recognizes to the success of predictions the meaning of "clue" of physical objectivity rather than that of incontrovertible proof, while to rationality, aimed at avoiding contradiction, the absurd, the impossible, he recognizes the function of foundation of physical objectivity in the sense of physical meaning, leaving to the field of personal and collective beliefs, with no possibility of proof or demonstration, the assertion of the existence of bodies outside our mind or rational reconstruction. In Leibniz we do find expressed that epistemological philosophy which is at work in Newton, where comes to maturity a rationalistic tradition of thought which is the one that I call the Italic tradition, since it was started in the Italic School by Pythagoras in the 6th century B.C. and then in the course of centuries worked out, revived and developed by Parmenides, Democritus, Euclid, Archimedes and Galilei. 3.

On the meaning of 'element' in the science of Italic tradition

Now, it is on the meaning of 'element' that according to this tradition of thought the so-called "physical objectivity" and "philosophical meaning" can be built, and during a long and dense philosophical debate the founding "elements" of 'element' were laid, on which, however, we will only dwell shortly, further details on this matter being available in our more thorough historical, philosophical and epistemological reconstruction.18 It was the Italic Phythagoras of Samos who first enunciated the rational nature of the element, denying the name of 'element' to objects of empirical nature such as water, air, fire, etc., against the Ionic tradition of thought started, according to Diogenes Laertius, by Thales of Miletus.

87

The priority given by the Pythagoreans to the rational element, rather than to observations and measures, emerges from the following testimony by Aristotle: "They (the Pythagoreans) seek the reason and the cause not by referring to what is object of observation but by bringing back by force the phenomena to certain reasons and opinions of theirs and attempting in this way to harmonize and bring them to an orderly whole"}9 Thanks to the rational or theoretical element (Pythagoras first coined the term "theory") they are able to find out that the morning star and the evening star refer to the same object, the planet Venus, beyond any observations or appearances, otherwise they would have had to invent Bohr's principle of complementarity. The rational element must obey the logical principles of non-contradiction (Parmenides) and of the contraries (Alcmaeon and the Pythagoreans). Of a thing it cannot be said that it is and is not, for example it is a wave and not a wave, i.e. a particle, because that would be contradictory; it can only be said that once it appears as a wave and another time it appears as a particle; giving the name of 'being' to observations or measures, which now are in a way and then in another way, would mean hypostatising appearances, that is contradiction or, in Parmenides' words, the nothing, or the relative, with no possibility of discussion, criticism and growth of knowledge. According to Pythagoras, every property dichotomizes a being into two classes contrary and complementary, so that if an individuum belongs to one class it cannot belong "at the same time" to its complementary. With respect to this, the physicist Notarrigo writes: "According to these elementary rules of logics, or if you wish of "linguistic conventions", and logics is nothing but this, it is not possible that something is at the same time "wave and particle ", since according to the formal definitions of wave and particle, which are given in "classical physics" but are "not" given in "quantum physics" (referring only to the sensorial impressions we received when watching the sea waves and a falling stone), the intersection of the two classes is empty, i.e. it is the "nothing"?® Of Alcmaeon the Pythagorean, it can be said that he gave the epistemological manifesto of the Italic tradition of science when he wrote: "Contraries are the principles of beings"21 and then: "Of invisible things, of mortal things gods have immediate certainty, men have to proceed through clues (tekmairesthai)" } 2 Science is, therefore, a not-definite reconstruction of ours of empiric reality (the definite one belongs to the gods); observations and measures are only tests (tekmarion) of what we reconstruct starting from clues or signs (semeion).

88

Beyond names, appearances, observations and measures, which are changeable and relative, true physical objectivity can only be given by the idea, the concept, which founds its physical meaning, according to the testimonies. This is what Democritus, another great interpreter of this tradition, said: "Democritus calls elements ideas"P "Democritus says that ideas are the principles of things" P "Democritus says that the criterion ofjudgements for the scientific research is the concept"}5 To build true scientific objectivity, science must then build first its elements starting from 'principles', which are idealizations of properties abstracted from the surrounding empirical world, that are assumed to be independent of the knowing subject, thanks to the clues or signs given by the elementary physical operations we can accomplish through them; they then must be tested in their daring consequences by precise measures and rigorous observations worked out from the theory itself, whose elements are the founding structure. The intersubjectivity and apparent objectivity of measures and tests, although apparently verifying assertions of the theory, cannot undermine the physical meaning, which in any case remains of rational nature. And it is not the metaphysical assertion of the existence of the world external to the knowing subject that gives physical meaning to elements (Bunge, for example, writes: "No external reference, no physical meaning" in op. cit. note 12, p. 26), neither does the empirical objectivity of measures or the intersubjectivity of scientific practices, but only the observance in our physical assertions of the Parmenidean denial: "You shall never constrain to be that which is not", Fr. 7. Physical properties, which we measure or observe, cannot be created from nothing. That of which is said "is and is not" can only be apparent and relative. According to this rationalistic tradition, as Vailati well writes, it is not the facts that challenge the theory, but the theory that challenges the facts, and, we add, it speaks the language of being beyond that of appearances, often deceptive, mutable, relative and sensible.26 Thus "physical objectivity and physical meaning" are not given by the brute, empirical fact, surreptitiously raised in the Ionic or empiristic tradition to ontological thing, but by our logico-linguistic reconstruction, whose elements are built through precise "principles" from clues then tested in their consequences by rigorous methodological rules and social practices. Newton, who finds inspiration in the Pythagorean and Democritean philosophy, in his "Definitions" builds the 'elements' of physical science, or physical objectivity, starting from his principles, which are idealizations of physical properties, such as density, volume, velocity, time, space, etc.

89

In his definition of "quantity of matter" there is no vicious circle, since density and volume are its principles, its primitive ideas or primitive idealizations, abstracted from physical operations we perform and observe. "Air of double density, in a double space, is quadruple in quantity; in a triple space, sextuple in quantity. The same thing is to be understood of snow and fine dust or powders ,that are condensed by compression or liquefaction; and of all bodies that are by any caused whatever differently condensed".21 In his idealization of density and volume, Newton ignores whether there is a medium between the parts; their logical intersection is the physical body, "the element mass". "I have no regard in this place to a medium, if any such there is, that freely pervades the interstices between the parts of bodies. It is this quantity that I mean hereafter everywhere under the name of body or mass"?% This magnitude, or idealized description of element of physical reality, shows its value of physical truth, its physical existence, through clues which are the measures of its weight, and is tested through experiments by the fact that it is proportional to its weight. 'The same is known by the weight of each body; for it is proportional to the weight, as I have found by experiments on pendulums, very accurately made, which shall be shewn hereafter".29 4.

Quantum mechanics: science or philosophy? The new meaning of 'element'. Conclusions

While in Einstein the name of 'element' of the tradition of classical physics is maintained, but its meaning is changed, surreptitiously assuming in principle an operationalistic philosophy, denying the meta-physical meaning of "physical meaning" or "physical objectivity", thus hoping to strike philosophy off physics, mistaking the meaning of "physical objectivity", meta-semantic by nature, for the meaning of "physical objectivity", meta-methodological by nature, in Bohr the name of 'element', and its meta-semantic meaning, is suppressed along with the rationalistic philosophy it implies and changed with what he calls "a feature of individuality completely foreign to classical physics.",30 which implies "a new image of natural philosophy", evidenced by the new quantum facts, where the "physical object" can no longer be separated from the physical apparatus of measure. This new 'element', now called a new "feature of individuality", constitutes the basis of the new science, where the semantic meaning is confused with the empirico-methodological meaning, the concept with the measure, the rational

90 objectivity with the methodological intersubjectivity, the rational essence with the appearance, the thing with the name. Bohr, however, is more consistent than Einstein. In fact, if measures determine physical meaning and the formalism of QM is in agreement with them and vice versa, then QM is complete for it mirrors what is physically meaningful. Einstein's contradiction is, then, only apparent as it arises from his applying to new facts, the quantum ones, a philosophy inadequate to them. In short, Bohr appears more consistent because he considers the question having philosophical nature, which instead Einstein wants to avoid or hide, and ensuing from "new quantum facts". We are at this point faced with the following question: Is QM a new science or a new philosophy of physical science? Our impression, discussed in a writing published together with Notarrigo (31) , is that we are faced with a new philosophy, or perhaps an old philosophy of science, the empiristic or phenomenistic or nominalistic or operationalistic one, you name it, with all the confusions, frailties and contradictions that the centuryold critical rationalism of Italic tradition has detected and denounced. We are not facing a new science with a new philosophy, but traditions of thought whose lot is decided each time by events external to them. But this would take us into the field of sociology of science and its historical dynamics, which requires a completely different interpretation. References 1. G. W. Leibniz, De modo distinguendi phaenomena realia ab imaginariis, 1707 (G. VII, p. 319). 2. G. Boniolo, "Modelli, immagini e intuizioni del mondo", in Dove va la scienza. La questione del realismo, F. Selleri and V. Tonini eds., Ed. Dedalo, Bari, 1990, pp. 257-258 (The translation from Italian is ours). 3. A. Einstein, B. Podolsky and N. Rosen, Phys. Rev. 47, 1935, p. 777. 4. In The Open Society and its Enemies Popper writes that objectivity is closely related to the social aspect of the scientific method, i.e. to the fact that science and objectivity do not result (and cannot result) from the efforts made by a single scientist, but by the cooperation of many scientists, and therefore objectivity can be defined as the intersubjectivity of the scientific method. 5. A. Einstein, B. Podolsky and N. Rosen, op. cit., pp. 777-778. 6. G. Boniolo, op. cit., p. 256. See also M. Jammer, The Philosophy of Quantum mechanics, Wiley, New York, 1974, p. 185. 7. B. d'Espagnat, Conceptual Foundations of Quantum Mechanics, AddisonWesley, Reading, MA, 1976.

91 8. I. Newton, Philosophiae Naturalis Principia Mathematica, trans. A. Motte (1729), rev. F. Cajori, University of California Press, Berkeley, 1934. 9. Ibid. 10. Ibid. 11. On the confusion between to test and to interpret or mean which is usually made in the reading of quantum correlations, see A. Rossi, "Information and State Correlations from Classical to Quantum Physics: the Foundations Issue", in The Foundations of Quantum Mechanics. Historical Analysis and Open Questions, Lecce 1998, C. Garola and A. Rossi eds., World Scientific, Singapore, 2000, p. 369. 12. M. Bunge, Foundations of Physics, Springer Verlag, Berlin, New York, 1967, p. 57. 13. Ibid, p. 28. 14. /tof.,p. 58-59. 15. G. W. Leibniz, op. cit. 16. Ibid. 17. Ibid. 18. G. Boscarino, "Le forme e i mutamenti della scienza. Oggettivita scientifica e tradizioni di pensiero", Mondotre-La scuola italica 5, 2004. 19. Aristotle, De caelo, II, 13. 20. G. Boscarino and S. Notarrigo, La meccanica quantistica: scienza o filosofia?, Ed. Laboratorio, Sortino, 1997, p. 38. 21. Aristotle, Metafisica A 5, 986a, 40. 22. D.K. B 1. 23. D.K. 68, Democritus, B 57. 24. Ibid.,B 57; Ibid, B 111. 25. See in this connection the brilliant and profound essay in epistemology by G. Vailati, "II metodo deduttivo come strumento di ricerca", Scritti, Seber, Leipzig-Firenze, 1911. 26. J. Newton, op. cit. 27. J. Newton, op. cit. 28. J. Newton, op. cit. 29. N. Bohr, Phys. Rev. 48, 1935, p. 697. 30. G. Boscarino and S. Notarrigo, op. cit.

MATHEMATICS AND EPISTEMOLOGY IN PLANCK'S THEORETICAL WORK (1898-1915) PAOLO CAMPOGALLIANI University of Padova, Department of Physics "G.Galilei" 35131 Padova (I)

Via F.Marzolo,

8

It is quite easy to verify that, in the continuous theoretical evolution of Planck's quantum construction from 1898 till 1915, mathematics plays an evident basic role. There are, firstly the formal analogy with Boltzmann's mechanical theory of gas, and secondly the formal analogy with Boltzmann's probabilistic theory of gas, giving to mathematical algorithms a particular investigative and ideative power. But, in this context of discovering some aspects of the quantum world, Planck's epistemology and natural philosophy also plays an equally creative powerful role. So one finds the real key of this theoretical progress, only in a strong dialectic relation between these elements inextricably bound by reciprocal influence.

1. Mathematics and the third dimension of science Following a usual neopositivistic point of view, a suitable model, for representing the scientific knowledge, consists in a bidimensional surface, a plane, where one dimension means the analytical knowledge and the second dimension the empirical knowledge. Consequently the role of mathematics in science, particularly in physics, would be essentially considered of logico-analytical nature. As is well known, the research in philosophy and history of science carried out in the last century, inequivocally demonstrated the absolute insufficiency of this model and the necessity of introducing in this metaphore of scientific knowledge, another dimension. This third dimension consists of presuppositions, paradigms, images of the world and, generally, of any a priori assumption [1]. So it is possible to understand the complexity of the role of mathematics in physics, which obviously is not purely analytical but also investigative, heuristic, constructive, moulding the shapes of the world, briefly a "metaphysical" role. After this brief epistemological introduction, one can put the question discussed in this paper, in a concise way: what is the role of mathematics in Planck's theoretical work regarding the birth of quantum physics?

92

93 More precisely and explicitly: what kind of relation links mathematics with the thought inherent in his philosophy of nature, his epistemology and generally his image of the world, in the years between 1898, the date of formulation of the natural radiation hypothesis, and 1915-16 when a general theory of quantum physics appears? One will conclude that, while the engagement of mathematics in the description of phenomenological models seems very poor and narrow, on the contrary its role in the field of investigating the physical world beyond the observable phenomena and corrispectively in constructing a new image of the world, is very remarkable. Furthermore, in this historical case, one does not deal, as for example happened in the birth of classical mechanics in Newton's theoretical work, with a symbiotic link between formal and informal thought; but rather one deals with a never ending face to face dialogue between epistemology and mathematics. In this dialogue, there are some elements of epistemology, some aspects of the a priori image of the world, guiding the function of mathematics intrinsic in the theoretical construction, never exercising a complete control on it. On the other hand, there are some formal aspects of mathematics suggesting a continuous change in the general interpretative frame, drawing finally some important new shape of the image of the physical world, never achieving a complete inclusion of it [2]. Conclusively, in this transition from classical to quantum physics, the role played by the mathematical algorithm is crucial, nevertheless it is impossible to actually understand Planck's theoretical work, if we limit our study to the pure analytical apparatus of the theory. 2. Mathematics and epistemology In this never ending dialogue between informal and formal thought, a particular importance is taken on by the principle of disorder, formulated in 1898 as the hypothesis of natural radiation, and later repeatedly modified and enlarged in its original meaning. The role of this principle could be compared to a bridge, to a privileged channel of communication between formal and informal thought. It carried this particular function for more than a decade, supported in this role by its vague and also partially obscure nature. So the more suitable way for investigating the non analytical valence present in the mathematical algorithm of this theoretical work, will be achieved if we follow the evolutive steps of this principle [3].

94

At this point, it turns out to be profitable to briefly remember some basic statements of Planck's epistemology. These statements, which obviously are placed in the third dimension and mantain an evident exchange with the mathematical apparatus in a dynamics of reciprocal influence, are schematically the following: a) realism; b) the trend of unification of physics; c) the causality. a) Realism During his whole scientific life, Planck always maintained a strong realistic position about the goal of scientific knowledge. This statement, was sometimes expressed in explicit disagreement with prevailing positivistic conception of those decades. Actually Planck thinks that "beyond the sensible world there exists a real world independent of human thought" and furthermore that scientific theoretical representations build a third world, "the world represented in physical science: the image of the physical world" [A]. According to the German scientist this third world aims not only to "describe the phenomenal world in the simplest possible manner " [5], but also to grasp and understand the physical reality beyond fragmentary phenomena. The supremacy of the first aim, leads to a cautiously descriptive research, instead the supremacy of the second aim leads to a daringly "metaphysic" research. The history of physics, thinks Planck, proves the fertility of a realistic conception. In fact the image of the physical world, developed by theoretical research, moves throughout history, more and more away from the sensible world. This process is a consequence of growing abstraction and, at the same time, of growing proximity to the real world. Here it is profitable to consider that mathematics carries out a basic inventive role in this process of abstraction and of discovering the real world. b) The trend of unification of physics According to Planck, the history of physics inequivocally manifests some other evidence. - There is a clear transition from separate different fields toward an interconnected unitary picture. Therefore the increasing distance from the sensible world, the increasing proximity to the real world and the trend toward unification of physics are substantially different aspects of the same historical evolution.

95

- One finds the deepest split in classical physics in the unresolved dichotomy between reversible and irreversible processes: "[...] the difference between reversible and irreversible processes is much deeper than the difference, for example, between mechanical and electrical processes [...]. This difference, in the future [...] will acquire a fundamental role in the next image of the physical world" [6]. - The quantum novelty appears in the core of a programme of research regarding irreversibility. Then the feature of the quantum reality, at least in the decades here examined, is interpreted not in contrast but going further than the classical physics, in agreement with the trend to overcome this persistent split. Also in this case, in the laborious research of connecting reversibility and irreversibility, the mathematical algorithm plays a basic role of a local and general heuristic kind. c) Causality Planck asserted a causal conception of nature during his whole scientific life, and consequently sustained the deterministic character of the laws of physics. Nevertheless, even some years before the birth of quantum physics and the appearance of quantum of action, his theory of radiation and irreversibility, shows a statistical treatment inherent in some physical quantities. This statistical approach, for Planck, does not reveal the intrinsic nature of the second law of thermodynamics, but rather indicates the existence of a new and unknown microscopic reality. More explicitly, the statistical approach is a temporary necessity in the presence of the microscopic world which should follow an unknown but certainly deterministic lawfulness. So the introduction of the hypothesis of natural radiation, in the year 1898, immediately shows a conception deeply different from Boltzmann's thought. In fact, for Boltzmann statistics make up for the lack of knowledge in the measure of microscopical quantities, for Planck statistics make up for the lack of knowledge of microscopic lawfulness required by irreversibility. Later, when the grounds of quantum mechanics appear, Planck does not believe it possible to leave his general deterministic point of view, even if he already left the deterministic nature of the second law of thermodynamics. So in the quantum physics of the first two decades Planck considers the detachment from a classical rigorous deterministic description leading to an approximate statistical treatment, probably as a temporary necessity.

96

In this critical evolution, the mathematical thought really plays a crucial role because through it one carries out the appearance of the statistical principle of disorder, which will achieve an open, flexible and continuous dialogue between formalism and epistemology. 3.

Mathematics and the principle of disorder

For attacking our problem it is convenient, as already said, to follow the evolutive steps of this principle. Schematically these steps are the following: - the pre-quantum period: the appearance of the hypothesis of natural radiation and the consequent "demonstration" of the second law of thermodynamics; - the quantum novelty: the appearance of h, the quantum of action; - the principle of subdivision of phase space in elementary cells. 3.1.

The pre-quantum period

First of all, it is very important to take two basic aspects into account; that is absolutely necessary to be able to understand how Planck's theoretical work is open to receiving the quantum novelty. The first aspect, of epistemological nature, belongs, as one said, to the deterministic character of the physical laws: consequently, till the year 1915, Planck maintains an absolute conception about the second law of thermodynamics, ruling out any possible statistical antientropic fluctuations. The second aspect, is mostly of mathematical nature, and concerns the remarkable subdivision between rapidly variable quantities and slowly variable quantities. For example, the dipole momentum of resonators, belongs to the first kind and is not suitable for measurement, the energy of resonators belongs to the second kind and is suitable for measurement [7]. So this mathematical subdivision of physical quantities in two groups, gives rise to a way of establishing a proficuous formal analogy with a gas of molecules, entities which are not singularly observable nor measurable. Therefore, here one finds the creative power of mathematical language: mechanical theory of gases about irreversibility for Boltzmann, and electromagnetic theory of radiation about irreversibility for Planck, are put in a relation of similarity, where, nevertheless, the epistemology of Planck suggests some important transformations and different interpretations. Now, it is better to stop a little, to punctually remember some well known agreements of this analogy:

97

•

there exists a primary correspondence between the rapidly variable magnitudes, e.g. a Fourier component of the radiation field, and the kinematic magnitudes of every gas molecule; • consequently, the hypothesis of natural radiation [8] for the field arises, which corresponds to the hypothesis of molecular chaos for the gas: "the energy of radiation is spread in a completely irregular way on every partial oscillation from which the ray is constituted" [9]; • but in this correspondence between these two hypotheses, one finds a deeply different conception. While, the hypothesis of molecular chaos and the Htheorem do not have any absolute validity and are compatible with statistical fluctuations, the hypothesis of natural radiation maintained an absolute validity, for holding the absolute character of the second law of thermodynamics. • It follows that some initial conditions in the interaction between radiation and resonator are excluded, and therefore some solutions of the relative equation are prohibited. That implies that the resonator is partially unrecognized in its microscopic legality. • So the temporal evolution of resonator energy cannot admit an instantaneous description because it is not an elementary process [10]. But, while the rough description of temporal evolution of gas distribution function indicates an ignorance of molecular trajectories, the rough description of temporal evolution of resonator energy indicates an ignorance of microdynamics. • Therefore a single resonator has a temperature and also a definite entropy, and both are expressions of this microscopic disorder [11]. Briefly, in this way, the creative mathematic thought suggested by the formal analogy together with the guide of deterministic epistemology, produced a flexible theory, open to receiving some microscopical novelty. 3.2. Quantum novelty Now, Planck's specific interpretation of entropy of a single resonator as a precise sign of an unknown microstructure, gives rise to a conceptual milieu suitable to grasp the quantum novelty contained in the h constant appearance. Indeed mathematical formalism, also for cavity resonators, suggests adopting Boltzmann's principle and then applying complexions calculus. At that purpose, it is convenient to expound sequentially some reflections.

98 As is well known, in autumn 1900 [12], in the presence of experimental data for infrared wavelengths, Planck abandoned the previous assumption d2S a from which he could deduce Wien's spectral law. — dU7 U d2S a and deduced the famous dU2 U(p + U) correct black body spectral function going on the suspicion of being below probability considerations. Indeed, from that formula, one obtains for resonator entropy: Then he assumed the new formula

S = or

U.

U

1 +u_log

fi)

(1)

- %

Following Rosenfeld's reconstruction, while Wien's displacement law implies j3 = bv , one can write:

S = flrlog

H'u'

^ (2) V bv

bv. where it is possible to guess a combinatorial computation, with Stirling's approximation. •

Now the mathematics, suggested by the formal analogy with mechanical theory of gases, along with the new experimental data, induces Planck to adopt Boltzmann's principle [13].

In this new way of discovery, corresponding to a new formal analogy, the principle of disorder guided the German scientist to conceive the probability according to a different interpretation. Temporal disorder of a single resonator, that one found in its time average secular equation, now becomes the same fact as a spatial disorder in the instantaneous distribution of energy elements among many resonators of the same frequency. So not only entropy, but also probability and complexions now express a measure of disorder. In fact, from formula (2), one deduces the entropy SN of TV resonators:

99 NU'

SN = orlog

N + bv

N+^' bv

(3) NU

NN

NU

{ bv

If we write NU- P £, subdividing the energy of N resonators in P elements £ = b V = h V , one obtains:

SN = alog

(N + P)N+P NNNPT}P

(4)

In the logarithmic argument, one can evidently see the count of numbers of distribution of P elements £ among N resonators. So the mathematical computations, suggested by the formal analogy with the probabilistic theory of gases, give rise to the appearance of quantum of energy £ and more specifically of quantum of action h. •

So, in the birth of quantum physics, the appearance of the universal constant h in the physical quantities regarding the resonator, for Planck will show two different faces of meaning.

The first, as repeatedly sustained, is suggested by his deterministic epistemology and consequently not completely unexpected: the presence of £ hv in the formal expression of entropy and energy, inequivocally proves the unknown elementary nature of the resonator in interaction with radiation. The second face of meaning needs to be considered more extensively. 3.3. The principle of subdivision in elementary cells Planck's evolutive discovery of this new face of meaning of the constant h, in the years following the birth date 1900, results strongly influenced by some aspects of his whole natural philosophy. The h constant shows some particular features: it is an absolute, universal quantity and, moreover, as an action, is a relativistic invariant. So Planck's realistic epistemology and the conviction in the trend of unification of physics, probably insinuated in his flexible image of the physical world, a suspicion about the possibility of finding, in this constant, something much more general that the simple fact strictly regarding cavity resonators and radiation.

100 From this point of view, the principle of disorder, becomes, with the discovery of h, the principle of elementary disorder, aiming to generate a general principle of the new quantum world. •

The first step of this conceptual evolution, regards the resonator: the new meaning of h consists in subdividing the phase plane of the resonator in elementary regions of definite size.

With regard to this, in the fourth Part of his Lectures of 1906, one can read "the quantum of action reveals a new meaning, i.e. the size of an elementary region of phase plane of a resonator [...] the elementary regions could not be considered arbitrarily small, because its size is finite and definite by the value of the quantum of action h" [14]. That is obviously connected to the awareness that the existence of the constant h, in black body theory, implies the non validity of the mechanical equipartition principle (the last principle would be equivalent to h=0). •

Planck's conceptual attention is now particularly focused on absolute universal character of h: here one should find the key for grasping a general basic principle of the new physics.

One should remember that the principle of disorder already induced a microstate to be conceived in a roughly way: that was a consequence of the impossibility of exactly distinguishing microstates inside a little volume. So the phase space already has a granular structure, of arbitrary size for gas, of definite elementary size h for resonators. The arbitrary size of the cells, in the space of a few years, becomes a definite elementary size dictated by h constant for any physical systems. There is the appearance of Nernst's theorem which implies the absolute value of entropy. Behind the phenomenological meaning of this thermodynamic theorem, one has to go deep into the microscopical meaning of entropy signified by Boltzmann's principle: "The hypothesis of quanta assumes that the size of an elementary region of probability is not infinitely small, but finite:

j"Jdqdp = h . [...] The hypothesis of quanta does not regard energy but action. The essential core of this hypothesis is the size of the elementary region of probability, the quantum of action. [...] the quantum of action is basic also for non-periodic and nonstationary processes" [15].

101 Here one finds the complete general meaning of A: indeed the absolute value of entropy implies the absolute value of probability and therefore the definite elementary size of the cell dictated by h: "For the hypothesis of quanta as well as the heat theorem ofNernst may be reduced to the simple proposition that the thermodynamic probability of a physical state is a definite integral number, or, what amounts to the same thing, that the entropy of a state has a quite definite, positive value, which, as a minimum becomes zero [...]. For the present, I would consider this proposition as the very quintessence of the hypothesis of quanta " [16]. This subdivision of phase space, in elementary cells with size defined by h constant, could be reasonably called Planck's principle. •

So Planck's principle, integrated in a statistical approach to the unknown quantum world, constitutes a universal basis for quantizing a quite general physical system [17].

This theory, making the phase space structure explicit in many different important and meaningful physical cases, as, for example, the hydrogen atom and ideal monoatomic gas, characterizes Planck's theoretical work, in those years, as a construction aiming not so much at a phenomenological description as rather at an abstract mathematic search for the absolute unifying aspect deeply included in different physical phenomena. Closing remarks In conclusion the following points seem reasonable: -Planck's theoretical work is really understandable if we do not limit our study to the final product, but also extend our investigation to its evolution, particularly to the discovery process; - in this evolution, mathematics plays a very important heuristic, ideational role; - i n this evolution, epistemology and natural philosophy also play an important part; - finally, to understand the rationality intrinsic in this theoretical work, it is not enough to simply place epistemology and mathematics together, rather one must put these elements into a dialectic relation, inextricably bound by reciprocal influence. References 1. With regard to this metaphore, see, for example, G. Holton, Thematic origins of scientific thought: Kepler to Einstein (Harvard University Press, Cambridge, 1988); L'immaginazione scientifica (Einaudi, Torino, 1983),

102

2.

3. 4.

5. 6. 7.

8. 9. 10.

11. 12.

13.

14.

15.

16. 17.

particularly the paper L'analisi tematica delpensiero scientifico; La lezione di Einstein chap. 7 (Feltrinelli, Milano, 1997). P. Campogalliani, "Planck's quantum theory of the first two decades: historico-critical reflections", in The foundations of Quantum Mechanics, C. Garola and A. Rossi eds., pp. 83-97 (World Scientific Publishing, 2000). Ibidem, pp. 88-89. M. Planck, "L'immagine del mondo nella fisica moderna", in M. Planck, La conoscenza del mondo fisico (a cura di E. Bellone), p. 206 (Bollati Boringhieri, Torino, 1993). Ibidem, pp. 206-207. M. Planck, L'unita dell'immagine fisica del mondo, in op. cit., pp. 47-48. M. Planck, "Uber irreversibile Strahlungsvorgange", S.B. Preuss. Akad. Wiss., Mitteilungen 1-5, in Physikalische Abhandlungen und Vortrdge Band I, pp. 493-600, see also the summarizing paper in M. Planck, La teoria della radiazione termica (a cura di P. Campogalliani), pp. 65-119 (F.Angeli, Milano, 1999). M. Planck, Mitteilung 4, in op. cit. Ibidem, It. transl. in C. Tarsitani, // dilemma onda-corpuscolo, pp. 211-217 (Ed. Loescher, 1983). M. Planck, "Entropie und Temperature strahlender Warme", Annalen der Physik 4, pp. 719-737 (1900_, It. transl. in M.Planck, La teoria della radiazione termica cit., pp. 121-136. Ibidem, p. 126. M.Planck, "Uber eine Verbesserung der Wienschen Spektralgleichung", Verhandlungen der Deutschen Physilkalischen Gesellschaft 2, pp. 202-204 (1900), It. transl. in La teoria della radiazione termica cit., pp. 145-147. M.Planck, "Zur Theorie des Gesetzes der Energie-verteilung in Normalspectrum", Verhandlungen der Deutschen Physikalischen Gesellschaft 2, pp. 237-245 (1900), It. transl. in La teoria della radiazione termica cit., pp. 149-157. M. Planck, Vorlesungen uber die Theorie der Warmestrahlung (Verlag J. A. Barth, Leipzig, 1906), It. trasl. in La teoria della radiazione termica cit., § 150 e §166, pp. 210-212, p. 219. M.Planck, "Die Gesetze der Warmestrahlung und die Hypothese der elementaren Wirkungsquanten in Die Theorie der Strahlung und der Quanten", Verhandlungen auf einer von F. Solvay einberufenen Zusammenkunft (30 October - 3 November 1911) pp. 77-94 (A. Eucken, Halle 1914). M. Planck, The Theory of Heat Radiation, Engl, transl. Morton Masius, Preface to second edition pp. VII, VIII (Philadelphia, 1915). M. Planck, "Die Quantenhypothese fur Molekeln mit mehreren Freiheitsgraden", Physikalische Abhandlungen und Vortrdge, Band II, 1 Mitt. pp. 349-360, 2 Mitt. pp. 362-375 (1915).

ON THE FREE MOTION W I T H NOISE

B. C A R A Z Z A * Department of Physics, University of Parma, Parco Area delle Scienze, 7/A 143100 Parma, Italy E-mail: [email protected]. it R. T E D E S C f f l t Department of Physics, University of Parma, Parco Area delle Scienze, 7/A 143100 Parma, Italy E-mail: [email protected]. it

We analyze the effect of a thermal background on the free motion of a classical mesoscopic particle. The external noise is modelled through a longitudinal force field. It turns out that, asymptotically, the position values are dispersed with a mean square deviation growing with the time elapsed after the detection of the corpuscle. On the contrary the mean square deviation of the velocity values vanishes after an initial growth.

1. Description of a free m o t i o n w i t h noise We investigate the otherwise free motion of a charged small body, say a metallic colloid or a heavy ion, subject to a thermal background modelled by a longitudinal force field. The corpuscle is considered in classical non relativistic terms. We have in mind practical applications, but our results may also concern a conceptual scene by comparing the fate of our corpuscle with that of a twin companion (without external noise) living in the quantum realm. The field we consider obeys the equation of motion: - - A F = -47reVp-47r^J||,

* I.N.F.N. Sezione di Cagliari, Italy. tl.N.F.N. Gruppo Collegato di Parma, Italy

103

(1)

104

where e is the coupling constant, ep(r) the assumed spherically symmetric charge distribution, eJ\\ the longitudinal part of the current density. After decomposition on discrete plane waves basis the Hamiltonian reads:

n

\ £

(akak 1 2m

+ <5kak) + ^

V r,3 t-j

£

(ake-ikR + akeikR) +

^

k2c

-ik-R

K

-ake

ik-R\

(2)

where R is the centre of mass coordinate and the normal mode coordinates a k obey the Poisson brackets relation {ak,aki} = —ifcc<5kk< if the field is described classically, whereas if the field is quantized the a k become operators obeying the commutation relation [a k ,ct k ,] = Hkc6w Our operators are defined as the usual ones except for a multiplying factor, since we do not wish to suggest their values in advance. The field variables are statistically defined, and may be considered in the classical as well as in the quantum scheme. In the first case the underlying mechanism becomes more transparent, but we should specify the average values and the correlation function of the various mode coordinates. 1 ^ 4 The chief assumption will be that the phase for each mode is randomly distributed. The matter can be handled more easily within a quantum description which we hence adopt. It suffices now to say that the related density operator is diagonal in the a k a k eigenstate basis. Further we state that the quantity: 2 ( " k " k + «kQ;k) =/(fc)

(3)

is an isotropic function of the wave vector. The bar in the previous equation indicates the statistical average, i.e., the trace of the density matrix times the operators oversigned. Ignoring the "drift" on account of the condition R
g£kf>(cte—

t Akct 1

ale

(4)

5>3?C
ikct

k

ikct

ale )

105

Solving the first of them:

and taking the statistical average, we have: (P - mV) = 0.

(6)

But

fdri fdT2jrjJrp(k)p{k')(ailie-ikcT'-

- aleikcTA

f a k e - i f c c ^ - a^e**"*)

is in general a non null quantity. To compute it we adopt the variables £ = n + T2, TJ — TI — r 2 . Then, evaluating the sums in the continous limit the right member becomes: e2

[2t

r+t

— / d£ /

roo

dr}

;-

k2p2(k)dk[ail.al

+ alaitiJcos(kcr]),

(7)

which in the point-like approximation, i.e., p(k) = 1, implies: (P - m V ) 2 =

1 / fe/(fc) sin{kct)dk.

(8)

If the integrand is an analytical function we know, by virtue of the Riemann Lebesgue lemma, that the integral above is asymptotically zero. But to draw a better conclusion about the dispersion of velocity values we need a well denned expression of f(k). The obvious candidate is the Planck distribution, but for the sake of simplicity we assume an easier form with the same analitycal properties, namely: / ( * ) = 6e~Xk,

(9)

where the constant 9 is dimensionally an energy. The function f(k) represents a white noise with a cut off which can be intended as peculiar to the power spectrum 5,6 or as a remedy to our crude point-like approximation, assuming in this case the meaning of a form factor. From the expression above we get: 4 e26

(ct/A) 2

(p-"V) a = ;*-^AcM2 [l +?(ct/A) ; ' 2]2 ,

do)

which shows that for reasonable values of the cut off length A the mean square deviation of the velocity, after a initial increase, rapidly vanishes.

106 Taking for granted that the velocity is well defined, apart from the first instants, we solve the second equation of motion obtaining:

R = R 0 + V t mc\ - i ^ ^L6W z *dr£(c —' J0 k \

—ikcr

- 4e ifccr )

(11)

Averaging then the previous result we get: (12)

R - (Ro + Vt) = 0, and calculating the second central moment we have: 2 e29 [R - (Ro + Vt)Y = -ttari

'(!)•

(13)

It may be noticed that the expression above is an even function of time. We could go on computing further moments both for velocity and position distributions, which will give us an idea of how these functions look, but the calculations become more and more cumbersome. Hence, being content with this first glance at the matter and with some confidence in the applicability of the central limit theorem, we suppose that the gaussian:

Pr(R,t)

(x-x0-

=Mr(t)exp

Vt)2 + (y- y0)2 + (z - z0)2 4 e26 2 rttan'

•K

m (?

(14)

"(?)

where the initial velocity is assumed along the x axis, together with an analogous gaussian for the velocity, does represent the state of affairs. The quantity Mr{t) in the previous expression is the time dependent normalization factor. Both distributions refer to the ensemble of all the events in which a corpuscle of the same kind endowed with velocity x = V is found at the position R = Ro. The interval t is the time elapsed since the detection. In what follows we shall especially discuss the position distribution. Since we are particularly interested in the long time development, we will consider the asymptotic form of (14) which is: P = [2vr7|i|]~ 3/2 exp

(R-Ro-V*)2' 2 7 |t|

with: 6 7 =

mcmc' The P distribution obeys the diffusion equation:

at

+ V • V P - 7 A P = 0.

(15)

107 2. The probability distribution of position and velocity These results should be joined with a prescription of how to perform a direct measurement of the velocity. To this end we imagine a conceptual experiment with a one dimensional potential barrier of height U and infinitesimal thickness, infinitely extended in the two other dimensions. Counting the percentage of events in which the particle passed through the barrier and varying the height U the partition function for the normal component of the velocity can be determined. With the barrier immediately near the source, that is in an istantaneous measurement at the initial time, the equation (10) tells us that we will determine the exact value since there are no fluctuations. In a delayed measurement, with the barrier placed at some distance from the source, the same formula indicates that at first we may find a few particles outside the barrier even if U > (m/2)V2. Moreover, since the times of arrival are dispersed, the measurements should take a finite duration. Let us now concentrate on the position distribution (15). If £ seconds after the detection at Ro an experimentalist finds the body at some position R, he concludes that the velocity is (R — Ro)/£. The average values for the quantity so defined are:

\(R-Ro)2A2

= ^

+ 7/t'

lio;

and so on. The velocity distribution, operationally defined in the manner described above, is simply obtained by a change of variable and reads : 3/2

(

U

~

V

)

2

*

It is gratifying to note that (Xi - XiY • (ut - Ui)2 =

2 7

(18)

is a constant of motion. For practical purposes we may adopt the distributions (15) and (17) together with an operational definition of velocity through the time of flight measurement technique. In that case it is pleasing to deduce the information we need from a unique representative function, instead of from the expressions considered above. This can be done by setting: W = P1'^™,

(19)

108

together with the definition of the differential operators:

(20)

"> = lkIt is easy to verify that (W\&iW)

= Vi,

(21)

and that {(Wuiy\(uiW))=V?+>y/\t\. (22) The ( ) brackets indicate, as in the standard formalism, the integration over all space. What we have done for a free particle can be repeated mutatis mutandis for the unbound motion of a particle in a central potential well behaved at infinite distance. Supposing we found the particle when it was at the periastre, we imagine that, since asymptotically the orbits are straight lines, the incoming and outcoming velocities will still be well defined. The external noise may possibly modify the deflection angle. 3. Speculations on the spatial probability distribution in presence of a potential It is interesting to see what happens to the spatial distribution of a particle with null velocity V = 0 when its path is constrained by an absorbing or reflecting barrier. The effect of the barriers may also be simulated by an attractive potential. Writing:

where now A is a real quantity, the diffusion equation is:

A or:

l£

= V ilAVA)

'

'

8A A~ = 7 ( V A • T7A + AAA).

(23)

Considering it as a forward Fokker-Planck equation and taking the average with the backward one (the right member of which is reversed in sign) we obtain that the probability distribution is constant in time. But the square of the diffusion velocity may have a meaning, and, as it can be easily seen, the kinetic energy density gained from the background is: y72VA • VA

(24)

109 It is reasonable to suppose in the aforesaid situation that after a long time A2 (R) will reach a stationary form. We guess this seeking the distribution for which the average energy is an extremum, together with the condition:

j A2dv = l. To this end we write the functional: ^2VA--VA + A2U(r)-SA2^dT, (25) / ( where £ is a Lagrange multiplier to ensure the normalization condition and/or the potential scale. The Euler-Lagrange equation that follows has the form of an old acquaintance. 4. Conclusions The idea that non relativistic quantum mechanics could be understood in terms of a kind of Brownian motion was pursued in the past, supposing the diffusion coefficient equal to H/m.7'8 Our results suggest that some quantum features may be simulated by resorting to an external noise, thus confirming, at least at first glance, that such a hypothesis might be plausible. But a deeper thinking leads to deny this plausibility, since, even if we admit a universal noise spectrum, the diffusion coefficient times the mass is still dependent on the kind of particle, even for a point-like particle of the same charge. References 1. 2. 3. 4. 5. 6. 7. 8.

A. Einstein and L. Hopf, Ann. d. Physik 33, 1095 (1910). A. Einstein, Ann. d. Physik 47, 879 (1915). M.V. Laue, Ann. d. Physik 47, 853 (1915). M.V. Laue, Ann. d. Physik 48, 668 (1915). J.R. Carson, B. S. T. J. 10, 379 (1931). N. Wiener, Acta Math. 55, 17 (1930). E. Nelson, Phys. Rev. 150, 1079, (1966). D. Kershaw, Phys. Rev. 136, 1850, (1969).

FIELD QUANTIZATION AND WAVE/PARTICLE DUALITY MARCELLO CINI Universita La Sapienza, Roma, Italy INFN, Sezione di Roma, Italy In spite of the recent extraordinary progresses of experimental techniques it does not seem that, after more than seventy years from the birth of quantum mechanics, a unanimous consensus has been reached in the physicist's community on how to understand the "strange" properties of quantons, the wavelike/particlelike objects of the quantum world. In this paper I will present a derivation from first principles of the Wigner representation of quantum mechanics in phase space which eliminates altogether from the theory the Schrodinger waves and their questionable properties. This approach leads to the conclusion that the wave/particle duality has nothing to do with "probability waves", but is simply the manifestation of two complementary aspects (continuity vs. discontinuity) of an intrinsically non local physical entity (the quantum field) which objectively exists in ordinary three dimensional space.

1. Introduction In spite of the fact that the recent extraordinary progresses of experimental techniques make us able to manipulate at will systems made of any small and well defined number of atoms, electrons and photons - making therefore possible the actual performance of the gedankenexperimente that Einstein and Bohr had imagined in order to support their opposite views on the meaning of quantum mechanics - it does not seem that, after more than seventy years from the birth of the theory, a unanimous consensus has been reached in the physicist's community on how to understand the "strange" properties of quantons, the wavelike/particlelike objects of the quantum world. Unfortunately, we cannot know whether Feynman would still insist in maintaining that "It is fair to say that nobody understands quantum mechanics". We can only discuss if, almost twenty years after his death, some progress towards this goal has been made. I believe that this is the case. I will show in fact that, by following the suggestions of Feynman himself, some clarification of the old puzzles can be achieved. This paper discusses a derivation from first principles of the Wigner representation of quantum mechanics in phase space which eliminates altogether from the theory the Schrodinger waves and their questionable properties. This approach leads to

110

Ill

the conclusion that the wave/particle duality is no longer a puzzling phenomenon. The waves of quantum mechanics are not probability waves. The wave/particle duality is instead only the manifestation of two complementary aspects (continuity vs. discontinuity) of an intrinsically non local physical entity (the quantum field) which objectively exists in ordinary three dimensional space.

2. The Representation of the Irreducible Randomness of Quantum Reality in Phase Space If randomness has an irreducible origin in the quantum world its fundamental laws should allow for the occurrence of different events under equal conditions. The language of probability, suitably adapted to take into account all the relevant constraints, seems therefore to be the only language capable of expressing this fundamental role of chance. The proper framework in which a solution of the conceptual problems discussed above should be looked for is, after all, the birthplace of the quantum of action, namely phase space. It is of course clear that joint probabilities for both position and momentum having sharp given values cannot exist in phase space, because they would contradict the uncertainty principle. Wigner however, in order to represent Quantum Mechanics in phase space, introduced the functions called after his name1 as pseudoprobabilities which may assume also negative values, and showed that by means of them one can compute any physically meaningful statistical property of quantum states. It seems reasonable therefore to consider these functions not only as useful tools for computations, but as a framework for looking at Quantum Mechanics from a different point of view. A further step along this direction was made by Feynman,2 who has shown that, by dropping the assumption that the predictions of Quantum Mechanics can only be formulated by means of nonnegative probabilities, one can avoid the use of probability amplitudes, namely waves, in quantum mechanics. This program has been recently carried on3 by generalizing the formalism of classical statistical mechanics in phase space with the introduction of two postulates (uncertainty and discreteness), which introduce mathematical constraints on the set of variables in terms of which any physical quantity can be expressed. I briefly sketch here the two main steps of my argument, referring for details to the original paper.

112

My starting point is the joint probability distribution in the phase space Pa(l'P) °f a classical statistical mechanics dispersion-free ensemble characterized by a definite value a of a given variable A(q,p). The ensemble is defined by the requirement that all its systems have the value a of the variable A. Therefore the ensemble average < . > a of A 2 must satisfy a =a 2

(1)

A suitable formalism for achieving our goal is to express A(q,p) in terms of the "characteristic variables" C(k,y) = e^^Xkq+yp) (where k,y are the variables of the dual phase space)4

A(q,p) = j]dydka(k,y)C(k,y)

(2)

We have Pa(q,p) = < 5(q-q) 5(p-p)> a = (27th)"2 jjdy dk ei-'^X^+YP)

C a (k,y)

(3)

where the "characteristic function" Ca(k,y) represents the ensemble average <e (i/h)(kq+yp) >a

Condition (1) implies that Ca(k,y) must satisfy the equation Jjdy dh a(h-k, y-x) C a (h, y) = a C a (k,x)

(4)

where a(k,x) is the double Fourier transform of the function A(q,p). The first step of our approach consists now in selecting, among all the possible dispersion-free ensembles a the one in which the variable B conjugate to A is completely undetermined. With this constraint we obtain immediately that the characteristic function must satisfiy, in addition to (4), also the equation dy dh (ky-hx) a(h-k, y-x) C a (h,y) = 0

for all k,x.

(5)

Eq. (5) yields therefore the formal expression of a "classical uncertainty principle", representing the condition to be fulfilled by classical ensembles having the property that when a given variable A has the value a its conjugate variable B is undetermined. Only the distribution functions of these ensembles

113

are invariant under canonical transformations. Up to now we are still in the domain of classical statistical mechanics. The second, essential, step is to introduce the quantum into this scheme. This is done by imposing the fulfilment of a second postulate, based on the assumption that the real founding stone of quantum theory is the experimental fact that physical quantities exist (the action of periodic motions, the angular momentum, the energy of bound systems..) whose possible values form a discrete set, invariant under canonical transformations, characteristic of each variable in question. This means that we should request that a belongs to a discrete spectrum independent of the phase space variables. This feature can only be ensured if eq. (4), which yields a continuous spectrum a for the eigenvalues of the classical variable A, is modified to become a true Fredholm homogeneous integral equation with a nonseparable kernel, allowing for the existence of a discrete set of eigenvalues OCJ. I have shown3 that the simplest way to do this is to replace the separable kernel of eq. (4) with a nonseparable kernel: Jjdy dh a(h-k, y-x) g(ky-hx) Q(h, y) = oci Q(k,x)

(6)

where Q(k,x) is now the quantum characteristic function of the ensemble with A=a.[. Similarly eq. (5), expressing the uncertainty principle between A and B should be changed into dy dh (h-k, y-x) f(ky-hx) Q (h,y) = 0

for all k,x.

(7)

However, the only way to obtain (6) (7) from (3) and (5) is to replace the classical characteristic variables C(k,x) obeying the standard rule of multiplication of exponentials with quantum variables C(k,x) having the property (l/2)[C(k,x) C(h,y) + C(h,y)C(k,x)] = g(ky-hx)C(k+h,x+y)

(8)

and to replace their classical Poisson bracket with the Quantum Poisson Bracket {C(k,x), C(h,y)}QPB

= f(ky-hx) C[(k+h), (y+x)]

(9)

The determination of the functions f(X) and g(^.) is easily done3 by imposing the condition that both relations (8) and (9) should be invariant under the canonical transformations generated by the QPB's.

114 The two constraints (8) (9), however, cannot be satisfied by ordinary commuting numbers. This means that, if we want to allow for the existence of discrete values of at least one variable L we are forced to represent all the variables A by means of noncommuting Dirac q-numbers.This means that the mathematical nature of the entities needed to represent the quantum variables is a consequence of the physical assumption of the discreteness of quantum variables and not viceversa, as the conventional view of reality underlying the conventional axiomatic formulation of Quantum Mecchanics assumes. It is remarkable that the quantum variables C(k,x) with the properties (8) and (9) turn out to have the same exponential form of classical statistical mechanics where the classical variables q and p are replaced by quantum variables q and p satisfying the commutation relations [q,p]=ih

(10)

of the standard variables of Quantum Mechanics. From the solution of equations (6) (7) one immediately obtains (by simple Fourier transform) the pseudoprobability Wj(q,p) corresponding to the joint probability Pi(q,p) of the classical ensemble. It is easy to show that this pseudoprobability coincides with the Wigner function obtained from the standard wave function of the state. It is important to mention that all pseudoprobabilities satisfy the condition j]dq dp Wi(q, p) Wi(q, p) = (27*)"1

(11)

which expresses the uncertainty principle in the reformulation of quantum theory in phase space. It is remarkable that this principle is given by an equality , thus eliminating the ambiguity of the Heisenberg inequality. 3. Field Quantization in Phase Space and Wave/Particle Duality These results however leaved some conceptual problems still open. First of all, once the Schrodinger waves have been eliminated from Quantum Mechanics, how does one generalize its principles to Quantum Field Theory? One should not forget that, historically, QED was invented by Dirac5 by submitting "first quantized" Schrodinger amplitudes to the procedure of "second quantization". If no "first quantized" probability amplitudes exist any more how does one proceed? And, secondly, isn't one throwing away the baby with the dirty water

115 by forgetting that after all a quantum field must still show some of the wavelike properties of its classical limit? A second paper6 has been therefore devoted to answer to these questions, leading to the conclusion that: (a) one should not start from nonrelativistic quantum mechanics in order to formulate quantum field theory, but viceversa; (b) the wavelike behaviour of the quanta of a quantum field is, as already Pascual Jordan had understood in 1925,7 a straightforward consequence of imposing the Einstein property of discreteness to the intensity of a classical field - clearly a nonlocal physical entity - which exists objectively in ordinary three dimensional space. It is appropriate to recall that for Jordan, in fact, it is quantization which brings into existence particles, both photons and electrons. According to him, therefore, rather than trying to explain phenomena like diffraction and interference of single particles as properties of "probability waves" one should simply view them as primary properties of the field of which they represent the quanta. "These considerations show - we read in his paper "On waves and corpuscles in quantum mechanics"8 - that the quantized field is equivalent, in all its physical properties and especially with respect to its intensity fluctuations, to a corpuscular system (with a symmetric eigenfunction)" The derivation of Wigner functions from the principles of uncertainty and discreteness illustrated in the previous paragraph provides the formalism for deducing the kind of wave/particle duality suggested by Jordan (and forgotten by the physicist's community since then) by simply imposing Einstein's quantization to the states of a classical field represented by means of statistical ensembles in the phase spaces of its normal modes. Following the procedure sketched in the previous paragraph, we introduce a classical statistical ensemble for each radiation oscillator r of the field's normal modes defined by the constraint that the intensity Nr(q,p) has with certainty a given value v r . All the equations (3) (4) (5) (6) remain valid, provided the variable A with its value a is replaced by the intensity N with its value v and the conjugated variable B is replaced by the corresponding phase 0 of each normal mode (we omit from now onwards the index r). Our procedure of field quantization will be based on the Einstein assumption of the existence of discrete field quanta. More precisely we assume that the spectrum of the quantum variable N of each field oscillator should be discrete. Eqs. (7) (8) (9) (10) (11) remain unchanged and express now the result that, the quantum variables should be represented by means of non commuting quantities (Dirac's q-numbers). Quantization is therefore now a consequence of the physical property of the existence of field quanta, and not viceversa.

116

The field's states with a given number of quanta can now be represented by going from the quantum variables q, p to the Dirac complex variables a, a* expressed in terms of each wave's intensity N and phase 8 by means of their standard expressions a*=Nmexp(-iO/b)

a = exp(i0/h)Nm

<12>

The eigenvalue equations (6) (7) can be rewritten for the characteristic functions Cn((3 ,(3*) expressed in terms of the new variables P ,P* related to k,x and h,y by means of the same relations (12). These equations can be solved to give the eigenvalues v n of the quantum variable N and their characteristic functions C n (p ,p*) yielding vn=n+(l/2).

(13)

This result is expected, but remarkable, because it has been obtained by solving our new integral equations without any reference to Schrodinger wavefunctions. It is also easy with this formalism to treat the field's coherent states, as well as the processes of emission and absorption of photons from a source to reproduce the results obtained by Dirac in his seminal paper on the foundations of quantum electrodynamics. It turns out of course that the absorption rate is proportional to n r and the emission rate to n r +l (Einstein's laws). 4. Conclusions The reversal of the order of quantization from non relativistic quantum mechanics to quantum field theory gives a clear physical foundation to the mathematical nature of all quantum variables. The basic formal rules of quantum mechanics follow in this way from the Einstein postulate of the existence of field's quanta. The main conceptual result of this approach is therefore the clarification of the basic notion of wave/particle duality, which follows from this postulate, and simply reflects the dual nature of the quantum field as a unique physical entity objectively existing in ordinary three dimensional space (or ordinary four dimensional relativistic space, when is the case). The advantages of this approach are numerous. First of all, the paradoxes typical of the wave-particle duality disappear. Secondly, by establishing a (pseudo)probabilistic formalism from the beginning, this approach eliminates the conventional hybrid procedure, which consists of a first stage in which the theory

117 provides a deterministic evolution of the wave function followed by a hand made construction of the physically meaningful probability distributions, of describing the dynamical evolution of a system. Finally, one might view the elimination of Schrodinger waves from quantum theory, in analogy with the elimination of aether in the theory of electromagnetism, as a straightforward application of Occam's razor. References 1. E. Wigner, Phys. Rev. 40, 479 (1932). 2. R. P. Feynman, in Quantum Implications, B. J. Hiley and F. D. Peats eds. (Routledge & Kegan, London, 1987). 3. M. Cini, Ann. of Phys. 273, 199 (1999). 4. Moyal, Math. Proc. Cambridge Phil. Soc. 45, 99 (1949). 5. P. A. M. Dirac, Proc. Roy.Soc. A 114, 243 (1927). 6. M. Cini, Ann. of Phys. 305, 83 (2003). 7. M. Born, W. Heisenberg, P. Jordan, Zeit.f.Phys. 35, 557 (1926). 8. P. Jordan, Z. Phys. 45, 765, (1927).

PARASTATISTICS IN E C O N O P H Y S I C S ?

D. COSTANTINI* Laboratorio

di fisica e statistica medica, Dipartimento University of Genoa, via Dodecaneso 33, 16146 Genova, Italy

di

Fisica,

of

Genoa,

U. G A R I B A L D I 1 ' IMEM-CNR,

c/o Departement of Physics, University via Dodecaneso 33, I6I4.6 Genova, Italy

Gentile jr. introduced the notion of "intermediate statistics", that is, statistics different from the quantum ones. The lack of particles for which parastatistics hold does non imply that probabilistic behaviors that in some sense generalize the quantum cases cannot find any application in other fields of research. In econophysics there are economic agents whose correlated behaviors are different from those governed by quantum statistics. In some sense they can represent the examples Gentile was looking for. Yet, at odds with Gentile's suggestion, they appear "superbosonic".

1. Introduction In 1940 Gentile 1 introduced the notion of "intermediate statistics", that is statistics different from the quantum ones. Till now no particles obeying the so called "parastatistics" have been found. The only equilibrium probability distributions governing the behaviour of elementary particles are the uniform distribution on all the occupation vectors, i.e., the Bose-Einstein statistics, and the uniform distribution on the occupation vectors satisfying the principle of Pauli, i.e., the Fermi-Dirac statistics. 2 Gentile's idea was to study uniform distributions on the occupation vectors constrained by a generalized maximum occupation number d, where d = 1 would be the FD case and d = 00 the BE case. The lack of particles for which parastatistics hold does non imply that * E-mail: [email protected] t E-mail: [email protected].

118

119

probabilistic behaviors which in some sense generalize the quantum cases cannot find any application in other fields of research. In econophysics, a discipline half way between Physics and Economics, there are economic agents whose correlated behaviors are different from those governed by quantum statistics. In some sense they can represent the examples Gentile was looking for. At odds with Gentile's suggestion, they are not uniform distributions, but generalized Polya distributions. Furthermore, they are not "intermediate statistics" between the FD and BE cases, but they appear "superbosonic" in character. In the present paper we give a short survey on some economical applications of these notions.

2. Equilibrium Probability Distribution To begin we recall a result to be used in our approach to equilibrium probability distributions. Let Y\, Y a n d nj — # { ^ i = 3,i = 1, • - , " } . Thus rij is the occupation number of j , i.e. the frequency of the j-value in the first n member of the sequence of random variables. Then, n is the occupation vector, i.e. the vector of the occupation numbers, of the first n members of the considered sequence. If the probability function P{.|.} satisfies the condition of exchangeability and invariance (see3) and i > n, then P{Yi=j\n}

=*

^ = 2l±*lj = l,2,...,d, (1) a +n a+n where pj = P{Yi = j} is the initial probability of j , <x,- = pja, and a = J2j=iPja = S j = i aj- Here a and {Pj}j=i,..,d are d parameters to be fixed in order to obtain the value of the probability. We define a = (a\,..., ay) and call ctj the initial weight of j . Now we consider a system S consisting of N elements and d cells. The system-state is the occupation vector N ={Ni,..., Nj,..., Nd), with Ej=i Nj = N, that may be seen as the set of all the individual descriptions E = (Xi = j\,...Xn = jn,...,XN = j N ) , with, for all n, j n e {1,2,...,d}, whose occupation numbers are N\,..., Nj,..., Nd- We denote by £(d and j\f(d,N) t n e s e t Qf a u individual descriptions and the set of all occupation vectors of <S, respectively. Whenever the system-state is N , we look for the probability of destroying an element in the k-th cell, for short in k, and of creating another element in j . To this end, we introduce three conditions. The first of them is the following.

120

C (general condition). If N is the initial state of the system, then destruction and creation probabilities are exchangeable and invariant. As a consequence of condition C, in order to destroy or create an element we have to fix the free parameters in Eq. (1). The two following conditions make this clear. D C (destruction condition). For the destruction sequence D\, D2, .. •, Dm, applied to the initial occupation vector N . the parameter a takes the value aD = —N, while pk = -rr-, or, briefly, a^ = —NkOf course m < N. If we limit ourselves to the destruction of one element only, it follows from D C that the probability that the element is destroyed in k is Nk

P{Dl=t;N}

=

^i

=

^i

=

»

(2)

where the symbol N placed after the semicolon in Eq. (2) reminds that the free parameters have been determined after having taken into account the state to which the destruction applies. Due to D C , the destruction process is fully determined by the initial occupation vector. We agree to consider only closed systems, that is systems whose size cannot change. If this is the case, only a certain number of destructions followed by the same number of creations can occur, hence creation probabilities may be regarded as symmetrical to destruction probabilities. The last condition is C C (creation condition). For the creation sequence C\, C2, •••, Cm, applied to the initial occupation vector N, the parameter a is left free (and NNwe put ac = a + N), while pj = -~, or, briefly, ctj = Q-rrIt follows that the probability that an element is created in j is P{C1=j;N,a,,a} = ^

= ^ ± f .

(3)

According to C C the creation process is not fully determined by N but it depends also on ctj and a. While P{D\ = /c;N} is universal, P{Ci = k;N,aj,a} depends on the correlation between individuals. This correlation is positive if, for all j , otj is positive, it is negative if, for all j , ctj is negative. In the latter case | aj | must be integer, and it is the maximum occupation number for j .

121

2 . 1 . Creations

in

physics

We now consider what happens by assuming a uniform initial distribution, ct that is by putting ay = — for all j in Eq. (3). In this case, the initial weights of all cells are equal, and Eq. (3) becomes -i + Nj P{Ci=j;N,a,d}=-d a + N T,

d

By putting c = — we have a

c _ 1 + Nj _ 1 + cNj dc~l+N d + cN'

P{Ci=j;N|C,d}=

Eq. (3) holds for negative values of a too. 3 Hence in general, the range of c is the non negative real line plus the set of negative numbers such that — is an integer, that is {—1, — ^, — | , . . . } U [0,oo]. Again — can be lcl lcl regarded as the maximum occupation number of a cell. In physics only creation probabilities with c equal to 1,0 and —1 are taken into account, that is l + Nj fore d+ N 1 d

P{C1=j;N}

^

forc = 0

d-N

.

(4)

for c = — 1

These are respectively the creation probabilities associated with BoseEinstein particles, with Maxwell-Boltzmann particles, and with FermiDirac particles, respectively. Obviously, when Fermi-Dirac creations are considered, Nj can only be 0 or 1 for all j . The intermediate cases suggested by Gentile correspond to values in the set — 1 < c < 1. 2.2. Unary transitions

and related

probabilities

We focus now on the probabilities of the most elementary dynamical events that may occur in <S . The events we are interested in are unary transitions. Any such transition can be regarded as the juxtaposition of two different steps: the first desroys one element in some cell, say k, while the second

122 creates one element in some other cell, say j . In general k need not be different from j . The procedure considering a destruction followed by a creation can be extended to any number of destructions and creations. Let 0,1, ...,t,... be a discrete sequence of time points and t and t + 1 two subsequent points in the sequence. When the occupation vector of S at the time point t is N , we suppose that during the time interval [t, t+1) the system loses an element belonging to k and immediately after gains an element belonging to j . When such a change happens, the system firstly undergoes a transition from N to Nfc = N — e(k) = (JVi,..., Nk — 1,..., Nd), and then from Nk to N j = N - e(k) + e(j) = (Wi, ...,Nk - 1,...,Nj + 1,..., Nd). Taken together, these steps only change the occupation numbers of k and j , that become Nk — 1 and Nj + 1, respectively, but not the size of the system that remains N. However it is worth noting that after the first virtual step, since the state of the system is Nfc, its size is N — 1, while after the second virtual step the state is N j and its size is again N. In other words, starting from N , we can reach Nj. passing through the intermediate state Nfc. It must be stressed that the two steps, though logically independent, are stochastically dependent. For C D and C C , the probability of these steps are

^{N.IN} = f and P{Ni|Nfe} = "'^'J?

,

(5)

respectively. The Kronecker function 6k,j, defined as 5k,j = 1 if k = j and 5k,j = 0 if k ^ j , accounts for the case in which the creation occurs in the same cell in which the destruction has occurred. Hence the transition probability we are interested in is exactly determined by the two probabilities in Eq. (5). When the state of the system at the time point t is N , the probability of a transition leading to Nj. can be determined by means of Eq. (5). One gets P{X(t + 1) = Ni|X(t) = N } = f

• as+ff_SfJ

(6)

which determines the probabilities of all transitions that the process might undergone. 2.3. The detailed

balance

conditions

In order to study the stochastic dynamics of S, we consider a unary transition supposing that N is the occupation vector of the system at t and N '

123 is the occupation vector at t + 1, after the unary transition has occurred. Let P{X(t + 1) = N'|X(«) = N }

(7)

be the probability that at t the system undergoes a transition from N to N ' . We shall work on this probability in order to locate the equilibrium probability distribution of the process. A probability distribution 7r(N) defined on ff(d>N) is an equilibrium probability distribution for S when, whatever may be the initial state N(0), lim Pr{X(i) = N(*)|X(0) = N(0)} = TT(N),

(8)

t—>oo

that is, 7r(N) does not depend neither upon time nor upon the initial state of the system. A probability distribution on the states of S is stationary if it does not change with time. If a set of states is ergodic, for this set there exists a stationary distribution. Furthermore if this set is aperiodic, then the stationary distribution and the equilibrium probability distribution coincide. We aim at locating the stationary distribution 7r(N) of the homogeneous Markov chain whose transition probability is given by Eq. (7). The probabilistic time evolution of the system is given by the ChapmanKolmogorov equation, that is P{X(t + 1) = N } = ] T P{X(t + 1) = N|X(i) = N'}P{X(t) = N ' } , N'

with t = 0,1,2,... This equation may be written as P{X(t + 1) = N } - P{X{t) = N } =

J2P{X(t + 1) = N|X(t) = N'}P{X(t) = N'} N'

- J ^ P { X ( i + 1) = N'|X(t) = N}P{X(i) = N } N'

This is the discrete "Master Equation". When for any pair N ' ^ N , the equality P{X(t + 1) = N|X(t) = N'}P{X(i) = N ' } = = P{X(t + 1) = N'|X(t) = N}P{X(t) = N } holds, then P{X(t

+ 1) = N } = P{X(t)

= N } = TT(N).

(9)

124 This equality asserts that the distribution 7r(N) does not change with time. Hence a distribution satisfying Eq. (9) is stationary. The set of Eqs. (9) expresses the detailed balance between pairs of occupation vectors belonging to the same ergodic set. Roughly speaking, the meaning of Eqs. (9) is that the probability flux from N to N ' equals that from N ' to N. 2.4. The equilibrium

probability

distribution

The results in Sec. 2.4 ensure that a distribution is stationary if it satisfies the detailed balance equations. We suppose that exchangeability and invariance hold for the transition probability of <S. As a consequence, we have P{X(t + 1) = Ni|X(«) = N } = ^

•^ ~ ^ j .

(10)

The term Sk j does not appear in Eq. (10) because the detailed balance conditions are trivially satisfied if 5k,j = 1. Because of Eq. (6), the probability of the transition in the opposite direction from N^. to N is Nj + 1 ak + Nk-l P{X(t + 1) = N|X(t) = N{} rTTT-^r< f c /= _ - ^ N ' a + N-1

(11)

Putting N and Nj. in Eqs. (9) we have P{X(t + 1) = NJ|X(t) = N}7r Q , p (N) = = P{X(t + 1) = N|X(«) = N£}7r Q , p (N£).

(12)

Instead of P{X(t) = N } and P{X(t) = Njk} we have written directly 7r a,p(N) and 7r a)P (N^.), since we know that this is the equilibrium probability distribution. The subscripts a and p = (pi, ...,Pd) recall the dependence of this distribution upon the individual correlation and the initial probabilities. By using Eqs. (10), (11), and (12), we get 7ra,p(NJfc) _ Nk 7Ta,p(N) Nj + 1

+ N, ak + Nk-l' aj

{

'

If there is a probability distribution on M^d'N*> satisfying Eq. (13), this is the equilibrium probability distribution we are searching for. It is immediate to check that this is the generalized Polya distribution, i.e. AN

d

a[Ni]

125

Hence if the Markov chain that we are considering is governed by the transition probability in Eq. (10), its equilibrium probability distribution is given by Eq. (14). When the equilibrium probability distribution is reached, keeping constant the destruction probability but considering as creation probabilities those in Eqs. (4), the equilibrium distributions for <S are:

N + d-1 N

Bose-Einstein:

c = 1,

Fermi-Dirac:

c=-l, ("J

Maxwell-Boltzmann: c = 0,

—-j

,

(15)

d~

The Bose-Einstein statistics is an uniform probability distribution on j\f(d,N)_ rp^g F e r m i _ j ) j r a c statistics distribution is an uniform distribution on N(d'N*> when the vectors of this set has occupation numbers equal to 0 or 1. The Maxwell-Boltzmann statistics is the symmetric multinomial distribution on all occupation vectors. As a consequence this statistics is an uniform distribution on £(d
126

consequence, the statistical description of the system (the occupation vector) spends the biggest part of time in percentages near 1 or 0, while only a small fraction of time is spent in percentage close to 1/2. Considering a long period of time and plotting the percentage of ants arriving at one sources (the other percentage is obtained by difference), one reaches an Ushaped histogram, that is a series of frequencies the highest of which are located either near to 1 or to 0, while only a few frequencies are located on percentages near to 1/2. This is what entomologists have observed: in an apparently symmetrical situation ants behave in an unsymmetrical mode if the observation time is short. Kirman noted that this behaviour is very similar to the herding behaviour observed in assets markets as well as in other economical situations. The analysis of the behaviours just described made by Kirman is based on a Markov chain that is similar to the chain in Eq. (6) (the slight differences are commented on in Ref. 7). The equilibrium probability distribution of the Markov chain on the continuum limit N —> oo turns out to be the Beta Distribution (1/2,1/2) -x-ll2(l-x)-l/2,Q<x
(16)

that is [/-shaped, and fits well the observed frequencies. We can obtain a better result considering that the motion of the ants is ruled by Eq. (6), when the number of cells is 2, and the initial weights of the creation probability are a\ = a2 = - . It follows straightforward from Eq. (14) that the exact equilibrium distribution is the Polya distribution

with Ni + N2= N, and xW = x(x + l)...(x + k - 1). The Polya distribution is discrete and finite on the domain N\ = 0,1,..., N. Now, we recover Kirman's result by observing that in the continuum limit N —> oo the Polya distribution P{Ni,N2\ 1/2,1/2} tends (in distribution) to the Beta Distribution (1/2,1/2). What is essential for us is that ants are described by cti = - , or Cj = 2, that is they are twice more correlated than bosons. If ants were "bosonic", that is a* = 1 = c, the resulting distribution would be uniform. If they were independent (that is a, = oo", c = 0) the resulting distribution would be the simmetric binomial. Indeed the tendency to collapse in the same cluster is so high that the equilibrium distribution is [/-shaped, and the mean value is the less probable.

127

We have given elsewhere7 an interpretation of the motion of ants in terms of "herd vs. rational" behavior. The two attitudes are proportional to the weight of the herd (in this case N — 1) vs. the sum of the initial weights (in this case a = 1). 4. A n Application to Stock Price Dynamics: Gibbs' Limit The second economic application deals with price increments or returns. Following Ref. 5, we consider a stock market with N agents, labeled by 1< i < N, trading in a single asset, whose logaritm of the price at time t is x(t). During each time period, an agent may choose to buy, to sell or not to trade. The demand for stock of agent i is represented by a random variable $ j , which can take values {+l(bull),0(neutral), — l(bear)}. The aggregate excess demand for the asset at time t is then D (t) = J2i=i ^* (*)• Let us assume that the price return is proportional to D(t), i.e.: Ax (t)=x(t-l)-x

(t) =

-D(t)

where rj is the excess demand needed to move the percentage return of one unit. Here we pose rj = 1 for the sake of simplicity. In order to evaluate the distribution of returns we need the joint distribution of {<$>i (t)}i=1 N. The possible strategies of agents are three, d = 3, that is bullish, bearish, neutral. The state of the system is N(t) = (N+(t),N-(t),N0(t)). The excess demand is D(t) = N+(t) — N-(t). The parameters a+, a_, ao, a = a+ + a _ + a o , associated with the three strategies, determine the transition probability of the chain. In this scheme the economic interpretation is apparent. At each step one agent has the possibility to change strategy. For positive values of the parameters each chosen agent tends to join the majority; for negative values the behavior of the chosen agent tends to be at odds with the actual majority's behavior; when the absolute values of the parameters tend to oo, agents are not influenced by the environment. The equilibrium distribution is the Polya distribution P(N)

= P(N+,N-,N0)

An = -[Fr]

J]

T-T

a[Ni] -fT

i=+,-,0

*'

with the constraint N++N_+N0 = N. The effective demand J2iLi $* (*) N+(t) — N-(t) is a function of N. Now we introduce the "thermodynamic limit" ao —> oo, JV —> oo, ^ = x = cost. In this case the Polya distribution factorizes, i.e. P(N+=a,

N_=b,

N0 =

N-a-b)->P(a)P(6)

128

where

p

v=^{TW\^)b~Ne9Bin{f3>x)

a+ = a, a_ = 8. The limit corresponds to the increase in the number of agents and the initial propensity to be "neutral", conserving the mean number of "bulls" and "bears"

a + p + a0 that are surrounded by a "reservoir" of neutral agents providing new active agents, or absorbing them. The moments of the equilibrium distribution of the excess demand are functions of a and x- In i&ct, E (a) = ax, Var (a) - aX (1 + x ) ,

Kurt(a) = -U+

(17)

\

).

We note that the kurtosis of the negative binomial is large for small a. Hence, E (Ax) =E(a-b)

=

{a~j3)X-

Whenever a and b are independent, we obtain: Var (Ax) = Var (a - b) = (a + (3) x (1 + x) /» N Var(afKurt(a) + Var(b)2Kurt(b) ! Kurt (Ax) = — — ^ ^ 2 (Var (a) + Var (b)) 1 6+ T,

<* + / H

x(x + i)

In general we have three equations, that connect the moments of the excess demand distribution to the three parameters a, /3, x that specify the model. Hence we can then estimate a, f3, x from the mean, the standard deviation and the kurtosis of data. If the market is stable, E (Ax) = 0, a = [3. A comparison is presented in Ref. 4, and typical values of a and /? are a = (3 = .25, that is c = 4. If this rough estimate is correct, agents in stock markets are twice more correlated than Kirman's ants.

129 5.

Conclusion

Our approach deals with elementary particles in a simple unified manner. T h e values of the parameter c allow us to get rid of the exclusion principle a n d / o r t h e notion of (in)distinguishability. A particle statistics is a n equilibrium probability distribution. Moreover the only probability equilibrium distribution t h a t q u a n t u m physics takes into account are the uniform distribution resulting from Eq. (14) when c = ± 1 , or equivalently a* = ± 1 . As is well known, when c = 0, hence on —» oo, one obtains the statistics of classical particles, t h a t holds only in the case of low density. T h e r e is no room in the realm of elementary particles for parastatistics. As we have shown, parastatistics may be used profitably in economics or, better, in econophysics. However, the correlations appearing in these cases are not intermediate between c = ± 1 , rather they are "superbosonic". Moreover the correlation is not universal as t h a t describing the behaviour of bosons and fermions. T h e behaviour of economic agents is not fixed once and for all, b u t may vary with the context in which the agents act. This gives rise to a lot of possibilities. In conclusion, we can say t h a t a correlated behavior of the elements of a system does not exclusively characterize q u a n t u m particles. T h e behaviour of economic agents too may be characterized by a high correlation t h a t can be handled with an adequate probabilistic description.

References 1. G. Gentile jr., "Osservazioni sopra le statistiche intermedie", II Nuovo Cimento XVII, 10 (1940). 2. M. A. Penco, "Le statistiche intermedie di G. Gentile j . " , Atti del XVI Con. Naz. di Storia della Fisica e dell'Astronomia (Como, 1999) (Univ. Milano, Milano, 2000). 3. D. Costantini and U. Garibaldi, "The Ehrenfest model: from model to theory", Synthese 139 (1), 107 (2004). 4. U. Garibaldi and M. A. Penco, "Ehrenfest's urn model generalized: an exact approach for market participation models", Statistica Applicata 12, 249 (2000). 5. R. Cont and J. P. Bouchaud, "Herd behavior and aggregate fluctuations in financial markets", Macroeconomic Dynamics 4, 170 (2000). 6. A. Kirman, "Ants, rationality and recruitment", The Quarterly Journal of Economics 108, 137 (1993). 7. U. Garibaldi, M. A. Penco and P. Viarengo, "An exact physical approach for market participation models", in Heterogeneous agents, interactions and economic performance, R. Cowan and N. Jonard eds. (Springer, Berlin, 2002).

THEORY-LADEN INSTRUMENTS AND QUANTUM MECHANICS SALVO D'AGOSTINO Universita "La Sapienza", piazza Aldo Moro 5 00158 Roma Starting from the theses that in physics observations and experiments are theory-laden and that physical theories include theories in a so-called Correspondence Area, I argue that an experiment is a process aimed to establishing or non-establishing the relationship between the theory under test and a theory in its Correspondence Area. In conclusion, I argue that the well known QM wave-function collapse can be mathematically associated with a mapping from the Hilbert space to the vectorial or tensorial space which represents the physical process above, i.e., that of confronting the theory under test with a theory in its Correspondence Area.

1. Theory-neutral Experiments as a Paradigma of Classical Physics In his book Between Experience and Metaphysics, the Polish epistemologist Stefan Amsterdamsky presents an original view on the revolution which was at the origin of physics as an empirical science in the 17th and 18th Centuries. According to him, this revolution was supported by a philosophy whose fundamental tenet was the conception of an ideal observer, i.e., an observer who reacts by his feelers (the senses) to the states of the outside world without interfering in any way with the observed object and/or the measuring instrument. According to Amsterdamsky, the widespread introduction into scientific practice of measuring instruments in the 17th and 18th Centuries was supported by an analogous philosophy, the theory-neutral instrument conception, in substantial accord with the ideal observer philosophy. The epistemologies of "theoryneutral instrument" and "ideal observer" ruled almost all the experimental approaches to scientific research from the 17th to the 19th Centuries. [Amsterdamsky 1975, p.l passim, and p. 49 passim]. Clearly, Amsterdamsky's thesis reverses the traditional understanding of the processes of the origin and growth of physics as an empirical science, because, in his view, these processes were not originated by the mere transition from speculative to empirical thinking, as usually intended, but this transition was

130

131 conditioned by the assumptions of an ideal observer and of a theory neutral instrument. Let me add that these assumptions were validated and reinforced in the Nineteenth Century by Gauss and Laplace's statistical theories of errors, because they thought that the theoretical control of measurement errors eliminated the instrumental impact on measurements, and allowed them to grasp the "true" and "real" measure of quantities [D'Agostino 1996]. Due to problems with Quantum Mechanics (QM) measurement, it would be difficult to overestimate the role that the concept of direct experience played in the intellectual revolution of the 17th Century and in the science of the following two Centuries [Amsterdamsky 1975, p. 70]. It seems consequent to think that the difficulties that the overall philosophy of empiricism faces in our times spring from changes in the concept of experience, especially after the past Century's extended studies in psychology and cognition [Amsterdamsky 1975, p. 69]. Let me also underline Amsterdamsky's significant passage, "if by observational sentences we mean such sentences which can be accepted (either ultimately or provisionally) without referring to any assumption or theory, then there are no such sentences". And he adds that "scientific instruments are always constructed on the basis of some theory, therefore the statements of that theory determine the meaning of observational statements reporting the results of experiments performed by means of the instrument" [Amsterdamsky 1975, p. 110]a. In support of a more general view of the empirical process in physics, epistemologists, physicists, and historians of science have significantly illustrated the point that no observation can be theory-neutral. It is well known that Max Planck and Albert Einstein, among physicists, N. R. Hanson [Hanson 1958], W. V. O. Quine [Quine 1966] and K. R. Popper [Popper 1969, p. 128], among philosophers, opposed the net distinction between observational and theoretical terms. Following the actual difficulties of QM measurement, physicists have sometime adopted the thesis of a disturbance in the form of an interaction between "observer and instrument" and/or "instrument and system", thus discrediting the concept of an "ideal observer". Because the disturbance thesis was frequently introduced without success as a tool for justifying the difficulties

"When Galileo claimed that by means of the telescope he had constructed, he was able to discover mountains on the moon and spots on the sun, and that these observations could not be reconciled with Aristotelian cosmology, the controversy centred not only on the cosmological theories, but also on the optical theory of the instrument, which was seriously questioned by Galileo's opponents" [Amsterdamsky 1975, p. 79].

132 in the QM measurement, I am here interested to explore alternative possibilities as offered by a recent study concerning some measurement features of microphysics [Dalla Chiara and Toraldo di Francia, 1973; 1999]. I summarize here the various arguments presented in this study with special attention on the relationship between measurement accuracy and definition of physical quantities. According to the authors, the attribution of a given measure to a physical quantity Q depends on a theory of the accuracy of the instrument used in the experiment [Dalla Chiara and Toraldo di Francia, 1973, p. 1-20]. For example, the result of an experiment, aimed to measure an electron radius, can be accepted if the apparatus accuracy is limited, say, to the order of 10"8 cm. The quantity "electron radius" is therefore defined within the 10" cm accuracy. This definition is meaningless if one shifts to an instrument with 10" ' cm accuracy, because it is impossible to define the electron borders within this order of accuracy (borders are cloudy and foggy). As a result of these considerations, let us state in general that instrumental accuracy is to be taken into account when defining physical quantities, and that two quantities can be considered equal if their measured accuracy is kept within a definite interval. Let us consider this as a necessary condition for the equality of quantities. It can be observed, however, that not all equal and equally accurate measurements do belong to the same quantity (e.g., to an electron radius), and, therefore, that the condition above is only a necessary but not a sufficient conditionb. In fact, it is evident that only a definite group of instruments are eligible for measuring a given quantity Q in a given theory. It follows that this theory and the choice of an appropriate instrumental category are to be taken into account when defining equality of quantities. Therefore, both accuracy and definition of an appropriate instrumental category are to be considered as necessary and sufficient conditions for measuring a physical quantity0. 2. The Correspondence Principle; Role and Coverage of Instrumental Theories Let us now refer to the largely discussed topic of the so-called Correspondence Principle (CRP) and the related Correspondence (CR) [Hanson 1963; Jammer 1966; Amsterdamsky 1975, p. 145 passim; Fadner 1985; D'Agostino and b

c

An historical example is given by the Nineteenth Century measurement of the fundamental constant c. Both W.Weber and J.C. Maxwell measured c with the same accuracy, but they gave a completely different attribution to it [D'Agostino, 1996]. This point is often neglected, because the role of an instrument in a measuring process is usually limited to the final sequence of the process, e.g., by taking into account only its counting or output registrations [D'Agostino 2005].

133 Orlando 1990]. In discussing CR, I will summarily refer in this paper to ideas presented by Amsterdamsky [Amsterdamsky 1975, p. 93 passim]. I argue that an old theory represents the limiting case of the new theory TT , in the sense of belonging to its CR area, to which TT* "reduces" by bringing appropriate parameters to a limit [Fadner 1985, p. 831-832]. In other words, according to an usually accepted definition of CRP, TT* admits the old theory as a "sub-theory" whose statements T can be mapped onto a part of TT . As usual, let us call this sub-theory TCR, since it is a theory in the CR area of TT . Notice that the mapping presents only a syntactical correspondence between T , the statements of the new theory, and T , whereas the semantics of T might differ widely from that of the terms of TT*. Many instances of this semantic difference are illustrated in the historical development of physics [Fadner 1985, p. 830; Petersen 1968]. For instance, Newton's second law, that can be included in the CR area of the laws of motion of General Relativity (GR), and successfully operationally applied in launching spaceships and artificial satellites, is semantically different from the GR law [Amsterdamsky 1975, p. 12; D'Agostino and Orlando, 1990]. As another example, it is usually accepted that the semantics of Newtonian mass does not coincide with the GR meaning of mass [Amsterdamsky 1975, p. 145 passim; Fadner 1985, p. 836]. 3. The Problem of the Empirical Test of one Theory with the Help of Another Taking the start from Bunge's statements that "... theory and experience never meet head-on" [Bunge 1973, p. 236], and that instrumental theories represent a theory-experiment interface [Bunge 1973, p. 202], I wish to illustrate the thesis that an experiment entails a comparison between theories. The problem then arises of the relationship between the theory to be tested (let us call it the primary theory) and the instrumental theory, in the sense of analysing their respective coverages, i.e., their syntactic and semantic extensions. It is evident that their coverages cannot be of equal extension (equally-extensive), since the fault must be avoided of introducing syntactic and semantic loops that nullify the concept itself of theory testing. To solve this problem, the above thesis can be specified as follows. AA thesis. Instrumental theories are included in the so called "Correspondence Area" of the primary theory [D'Agostino 1983, p. 181-82]. In fact, the following arguments hold.

134

(a) An instrumental theory TA cannot have equal extension as the primary theory TT , because a logical flaw (petitio principii) would otherwise occur. Hence, TA and TT are different theories. (b) Because of (a), one meets again the above problem of testing either TT* orT A . (c) However, if my AA thesis is accepted, the requirement (a) is satisfied and the objection (b) is circumvented, since theory TA is semantically different from TT , although syntactically mapped on it. Among many historical cases confirming my AA thesis, an interesting one is presented by Heinrich Hertz's celebrated 1888 experiment on electromagnetic waves. In order to use his circular antenna (Kreiss) as an instrument apt to detect the waves, Hertz was confronted with the necessity of understanding the behaviour of his Kreiss, i.e. to find the theory TA that supported its usage. Remarkably enough, this theory is the circuital theory of electric current, a well known case of a CR theory (Tcr), in the context of Maxwell's electromagnetism [D'Agostino 1975, pp. 306-307; Morando 1998, p. 327 passim]. By detecting electromagnetic waves, Hertz confirmed Maxwell's general electromagnetic field theory, by including his TA in its CR area: in symbols, TA <->Tcr. 4. Pluri-correspondence in Relativity and Quantum Mechanics. Niels Bohr's Complementarity The above AA thesis on instrumental theories needs to be better specified when applied to General Relativity (GR) and QM. This is due to the interesting feature of Pluri-Correspondence (Pluri-CR), i.e., the theories' property of giving origin to various CR sub-theories through the selection of appropriate parameters. Let us follow Bunge's analysis. General Relativity goes over into Special Relativity for vanishing gravitation (equivalently for flat space), but it goes over the classical theory of gravitation CG (of Newton and Poisson) for weak static fields and slow motion (Actually there is a third limit, namely for a vanishing matter tensor)... [ Bunge 1973, p. ISA passim].

As to Dirac's quantum theory of the electron, Bunge explores two entirely different "limits" for CR [Bunge 1973, p. 184 passim]. I mentioned Bunge's points on Pluri-CR because I consider it at the origin of some of the difficulties of the quantum theory of measurement. In fact it can be connected to the problem of selecting classical CR theories as appropriate instrumental theories in

135 the measuring process. I think that my TA<->Tcr thesis can contribute to a better understanding of the nature of these difficulties. I shall illustrate the problem by presenting a short description of the wellknown "gedanken Experiment" that Bohr opposed to Einstein's claim of falsifying QM. It is well known that Bohr contended to Einstein that, if the position of the electron on the slide can be localised by fixing the screen and the time of the event is contemporarily determined by measuring with a clock the passage-time of the electron (or photon) through the same slide, then the space-time coordinates of the event are precisely determined, but at the cost of loosing any information on the momentum-energy exchanges of the particle. Reciprocally, if an instrument in the shape of a spring-balance is introduced, then the momentum-energy exchange can be exactly determined at the cost of losing any knowledge of the coordinates time-position of the event. Notice that this interpretation of Bohr's Complementarity Principle, known as the "Pauli version" [Tarozzi 1992, p. 35] connects Complementarity with Bohr's Correspondence principle11. In fact, a theory of the space-time localization of elementary particles, and, reciprocally, a wave- theory of energy-momentum exchanges, are theories in the Pluri-CR area of Quantum Theory (QT) [Bunge 1973]. Therefore, as a result of coupling a micro-system S with two macroscopic instruments A and B (a meter-clock and a balance, respectively) one obtains a de-coupling of two QT entangled states of the whole systems into two product states, described by corpuscular and wave-theories, respectively, both in the Pluri-CR areas of QT. The two states were entangled at a theoretical level, but a physical meaning (semantics!) can be attributed to the quantities only in their respective CR areas, and their values found only by measurement. The coupling of S with classical instruments produced the de-coupling of the QM state known as a decoherence-effect. The soundness of the above qualitative interpretation of Bohr's one-slide "gedanken Experiment" can be further confirmed by resorting to the appropriate mathematical symbolism of "ket" and "bra" state-vectors in Hilbert space.

Actually, Bohr's introduction to his Complementarity Principle in 1927 presents strong connections with his Correspondence Principle [Petersen 1968, p. 246; D'Agostino 1985]. He derived his view of Complementarity in 1927, following years of reflections on the possibility of deriving QM laws in analogy with classical laws. Eventually, he excluded that analogical similarity implied no more than a formal correspondence between laws belonging, so to speak, to different sides of an abyss [Jammer 1996]. Whereas it was this conclusion on Correspondence which opened Bohr's route to Complementarity, this part of his philosophy was almost totally ignored by Bohr's scholars.

136 According to Mittelstaedt, in order to obtain the transformation one must bring "the interference terms to vanish in the joint system S+A ", and by this step, called "the objectivation process", the observables of the two systems which are supposed to possess "eigenvalues", become objective properties, measured in the experiment [Mittelstaedt 1976, p.103-104]. As known, Mittelstaedt's "objectivation" can be formally described as the "reduction of the wave function", an axiom of the Copenhagen philosophy "whose essentially ad hoc nature is at the origin of the serious difficulties of the quantum theory of measurement" [Mittelstaedt 1976, p. 103-104]. However, whereas for Mittelstaedt the "objectivation process" is an ad-hoc abstract process with no physical meaning (Mittelstaedt 1976, p. 103), quite differently, in my view this process presents a physical interpretation6. In fact, I argue that it represents the formal expression of the confrontation TA—»Tcr, i.e., the recourse to an "objective" instrumental theory as a procedure for testing theory TT. The "reduction of the wave function" loses thus its "ad hoc" character, being a consequence of the physical process of confronting an instrumental theory with a CR "reduced" theory Tcr. Thus, macroscopic (non necessarily classical) instrumentation [D'Agostino 2004] produces de-coherence effects, and the various psychological interpretations "a la von Neumann" of the wave-collapse in QT measurement are shown as obsolete1. I find here convenient to follow Mittelstaedt's presentation of the measuring process [Mittelstaedt 1976, pp. 100 passim] as a transformation of the projection operator Py associated with the Hilbert state vector ly> into a statistical operator Wy, as follows. Let us indicate by ly> the Hilbert state vector of the joint system S+A at the end of the interaction between S and A in QM. Then, iy> = SiCi ly4 > with ly;> = IAi>li>,

e f

I am indebted to Arcangelo Rossi for discussions on this topic. Clearly, the view of physics resulting from my theses above is completely foreign to the so called psychical or psychological interpretation of the QM measurement process, whose ascendant is usually traced back to the well known Johann von Neumann's book Mathematical foundations of Quantum Mechanics. Although a critical examination of von Neumann's work is not the aim of the present paper, I argue that some of von Neumann's foudational assumptions - among them, his idea of perception as an extra-physical process, from which his epistemological theses are derived - are presented in the form of unconsistent arguments.

137 where IA,> is a state vector of the apparatus A with eigenvalue Aj and li> a state vector of the system S. Hence, ly> = ZiCilAi>li>. Let us indicate with Py the projection operator Py = iy>
y = Zi Ci Ci* lys x y j l .

Then, the "objectivation" consists in the transformation : Py —>Wy. By this step those observables of the joint system S+A, whose eigenstates are characterized by IAi>li>, become objective properties (Mittelstadt 1976, p. 103). In fact, through "objectivation" one obtains a mixture represented by the statistical operator Wy. This mixture expresses the statistical dispersion of measures, the result of a measurement process. 5. Conclusive Observations Starting from the now frequently accepted view of instrumentally-laden observations, let me resume shortly the main arguments in favour of my AA thesis, i.e., the assumption that instrumental theories belong to the CR area of the primary theory. In my specification of instrumental theories, following Amsterdamsky and, partially, Bunge, I argued that the relationship between CR theories and primary theories is only syntactic, in the sense that CR theories are only formally mapped on a limited area of the primary theory. Let us call it & feeble conception of CR. Let me observe that the strong conception of CR, i.e., the syntactic and semantic correlation between primary and CR theory, has been criticized by many epistemologists and historians [Hanson 1963; Jammer 1966; Fadner 1984, p. 836; D'Agostino 1998, pp. 151-166]. This view is also contradicted by the historical development of science, because it conflicts with the logic of research8. Notice that the rejection of the strong view of CR does not imply that CR theories are semantically empty theories, but only that they have a semantics of their own. In fact, CR theories are very often operational theories [Fadner 1984, p. 836], and instrumental operations have a semantics of their own. Only in the justification logic, CR is accepted in a conventional role consistent with the view of the accumulation of scientific knowledge.

138 As regards the often discussed point of a disturbance inherent in the measurement process, either caused by the observer on the observed system, or by the instrument on the system, I argue that this thesis is discredited by the assumptions of instrumental theories, and of theory-laden experiments, because a reciprocal disturbance implies that its agents are theory-neutral objects. Let me also note that if one abandons the disturbance thesis and accepts that experiment is a theory-to-theory comparison, Popper's view of an asymmetry between empirical verification and confutation of theories is somehow justified. In fact, an experimental confutation represents the impossibility of including the instrumental theory within the CR area of the primary theory. In other words, the instrumental behaviour is reluctant to its inclusion within the theory, behaving as if it represented theory-neutral hard facts. This implies that (pace Popper) in a theory-to-theory confrontation the confutation role of an experiment proves stronger than confirmation . My AA thesis thus concerns in a first instance measurements, but I deem it can be extended in general to the instrumental impact on experiments and observations, [Rossi 2000, p. 370-371] i.e., to the physical (empirical) interpretations of theories, by considering that in general any observation through instruments of a so-called observable implies the intermission of instrumental theories [Amsterdamsky, quoted above]. Another remarkable point of the AA thesis is that it reverses the common understanding of the role of the factual in the inductive conception of theory construction. For in the latter conception hard facts represent the basis for theory generalization, whereas in AA hard facts are theory-resistant experiments, and, as such, they play momentarily a role against theory, hopefully waiting for becoming instrumental theories. As a short note on the frequently debated theme of scientific realism, I argue that a realistic view of physics is not contradicted by the assumptions that instrumental theories are to be accounted for in the idea itself of a realistic scientific knowledge. The Semantic Realism Approch [Garola 2000, pp. 215216], which also distinguishes the semantic definition of truth by means of a model from its ontological conception, presents many aspects in favour of my argument above, The idea that knowledge implies the conditions themselves that allow its possibility has a respectable neo-Kantian tradition. It is recently

The Polish epistemologist Amsterdamsky recognizes that the statement that every observation is theoretical in character "constitutes an achievement of Popper' falsificationalism" (Amsterdamsky 1975, p. 95). However, let me point out that Popper's philosophy implies that the historical study of science is to be limited to the context of justification.

139 vindicated by Petitot [Petitot 2003]. What is important is that theoretically-laden experiments do not forbid a judgement on the adequacy or non-adequacy of theories with regard to experimental results.' References 1. S. Amsterdamsky, 1973, Between Experience an Metaphysics, Boston Studies, vol. XXXV. 2. G. Auletta, 2001, Foundations and Interpretation of Quantum Mechanics, World Scientific. 3. C. Chevalley, 1989, "De Bohr et Von Neumann a Kant; L'Ecole allemand de logique quantique", in: L'Age de la Science, n. 2, Epistemologie, Ed. O. Jacobs; 1993, "Niels Bohr's Words and the Atlantis of Kantianism", in: J. Faye and H. Folse (eds.), Niels Bohr and Contemporary Philosophy, Reidel. 4. S. D'Agostino, 1975, "Hertz's Researches on Electromagnetic Waves", Historical Studies in the Physical Sciences 6, pp. 267-269. 5. S. D'Agostino, 1983, "Strumenti scientifici e teorie fisiche: considerazioni storico-critiche sulla storia della strumentazione scientifica", in: G. Tarozzi (ed.), Gli Strumenti nella Storia e nella Filosofia della Scienza, Istituto per i Beni Artistici, Culturali, Ambientali della Regione Emilia-Romagna, Bologna 1983, pp. 173-182. 6. S. D'Agostino and L. Orlando, 1990, "II criterio di corrispondenza e la genesi della teoria gravitazionale einsteiniana", in: F. Bevilacqua (ed.), Atti del XI Congresso Nazionale di Storia della Fisica. Gruppo Nazionale di Coordinamento per la Storia della Fisica del CNR, pp. 111-122. 7. S. D'Agostino, 1995, "Note per una storiografia della visione microscopica", in: S. Marconi (ed.), Scritti in Onore di Corrado Maltese, Edizioni Quasar. 8. S. D'Agostino, 1985, "Strumenti Elettromeccanici e Concezioni Meccaniciste nella Storia deU'Elettrodinamica", Epistemologia VIII, pp. 119-137. 9. S. D'Agostino, 1985, "The Problem of the Link between Correspondence and Complementarity in Niels Bohr's Papers 1925-1927", Rivista di Storia della Scienza, n. 2 (3), pp. 369-390. 10. S. D'Agostino, 1996, "Absolute Systems of Units and Dimensions of Physical Quantities...", in: Physis, Vol. XXXIII, Fasc. 1-3, NS, pp. 5-51. 11. S. D'Agostino, 1998, "A Controversial Role for 'Correspondence' in Theoretical Physics and Quantum Mechanics", in: C. Garola and A. Rossi (eds.), 2000, pp. 151-165.

The concept of theoretical adeguacy in the context of a novel view of the logical-empiricism tradition has been analysed by van Fraassen (van Fraassen 1985).

140 12. S. D'Agostino, 2004 , "Un problema delle teorie classiche e quantistiche: esiste una logica della strumentazione?", in: I. Tassani (ed.), 2004, pp. 135148. 13. M. L. Dalla Chiara Scabia and G. Toraldo Di Francia, 1973, "A Logical Analysis of Physical Theories", Rivista del Nuovo Cimento, s.2, vol.3, pp.l20. 14. M. L. Dalla Chiara Scabia and G. Toraldo Di Francia, 1999, Introduzione allafilosofia della scienza, Laterza. 15. B. D' Espagnat, 1976, Conceptual Foundations of Quantum Mechanics, Benjamin, 1976. 16. H. Dingier, 1928, Das Experiment, sein wesen und seine Geschichte, Reinhardt, Munchen, 1928. 17. W. L. Fadner, 1985, "Theoretical support for the generalized correspondence principle", American Journal of Physics 53 (9), pp. 829837. 18. V. Fano, 1996, " Definizioni operative ed esperienza possibile", in: V. Fano (ed.), Fondamenti e Filosofia della Fisica, Societa Editrice "II Ponte Vecchio", pp. 277-296. 19. L. S. Feynman, 1965, The Feynman Lectures on Modern Physics, AddisonWesley. 20. A. Fine, 1986, The Shaky Gaim. Einstein's Realism and the Quantum Theory, The University of Chicago Press. 21. P. Galison, 1987, How Experiments End, The University of Chicago Press. 22. C. Garola and A. Rossi (eds.), 2000, The Foundations of Quantum Mechanics. Historical Analysis and Open Questions (Lecce 1998), World Scientific. 23. C. Garola, "Is Quantum Mechanics Contextual?", in: C. Garola and A. Rossi (eds.), 2000, pp.207-218. 24. M. Jammer, 1966, The Conceptual Development of Quantum Mechanics, McGraw-Hill Book Co. 25. N. R. Hanson, 1963, The Concept of the Positron, Cambridge University Press. 26. S. Haroche, 1995, "Mesoscopic Coherence in Cavity QED", II Nuovo Cimento 110 B, N. 5-6, pp. 545-556. 27. P. Mittelstadt, "The concept of substance in Quantum Theory", in: P. Mittelstadt, 1976, pp. 119-121. 28. P. Mittelstadt, Philosophical Problems of Modern Physics Vol.XVIII, Boston Studies, Reidel P. C. 29. A. Morando, 1998, "Galileo Ferraris e la nascita deH'ingegneria elettrica moderna", Physis Vol. XXXV, n.s. Fasc. 2, pp. 291-399. 30. J. von Neumann, 1955, Mathematical Foundations of Quantum Mechanics, translated from the German 1932 edition by Bayer T.R., Princeton University Press.

141 31. J. von Neumann, 1998, Giovanni Boniolo (ed.), I fondamenti matematici delta meccanica quantistica, Italian translation of von Neumann 1955, II Poligrafico. 32. A. Petersen, 1968 "On the philosophical significance of the correspondence argument", Boston Studies in the Philosophy of Science Vol. V, pp. 242252. 33. J. Petitot, 2003, "II razionalismo critico italiano", Nuova Civilta delle Macchine, a. XXI, n. 4; Poincare' Henri, 1950 La Science et I'Hipothese, Flammarion, p. 111. 34. K. R. Popper, 1969, "Problemi, scopi e responsabilita della scienza", in: D. Antiseri (ed.) Scienza e Filosofia, Einaudi. 35. W. V. O. Quine, "Due dogmi deU'empirismo", in: Autori Vari, Ilproblema del significato, Roma. 36. H. Reichenbach, 1954,1 fondamenti Filosofici della QM, Einaudi. 37. A. Rossi, "Information and State Correlation from Classical to Quantum Physics: the Foundations Issue", in: C. Garola and A. Rossi (eds.), 2000, pp. 369-380. 38. M. O. Scully et al, 1991, "Quantum Optical Test of Complementarity", Nature 341, pp. 111-116. 39. G. Tarozzi, 1992, Filosofia della Microfisica Vol. 1, Ace. Naz. di Scienze Lettere e Arti, Modena, Mucchi. 40. I. Tassani (ed.), 2004, Quanti Copenaghen? Bohr, Heisenberg e le Interpretazioni della Meccanica Quantistica, Societa Editrice "II Ponte Vecchio", Firenze. 41. Y. Ahshanov, "Definability and Measurability in Quantum Theory", in: T. Bastin (ed.), Quantum Theory and Beyond, Cambridge University Press, 1971.

QUANTUM NON-LOCALITY AND THE MATHEMATICAL REPRESENTATION OF EXPERIENCE VINCENZO FANO Istituto di Filosofia, Universita di Urbino Four possible solutions of the Kantian problem "how the mathematisation of experience is possible?" are presented: Platonism, critical materialism, operationism and empiricism. Then the experimental violation of Bell's inequality is discussed. To avoid the proof of Bell's inequality, it is possible to deny different conditions, but experiments support only the refutation of factorizability as a whole. It is argued that this implies a confirmation of the empiricist's point of view.

1. The Problem" I presuppose that qualia exist (Nagel, 1974), i.e. that they are not a mere illusion, as maintained by eliminativists (Dennett, 1991). Moreover it is possible to speak about qualia in the language of folk psychology, identifying them as what appears in first person experience. I assume as well that qualia are an irreplaceable source of knowledge, i.e. that we learn from first person experience something that cannot be known otherwise (Nagel, 1974, Jackson, 1982). This knowledge can be expressed only in third person language, and is therefore neither private (Wittgenstein, 1953), nor infallible (Ryle, 1949). In spite of this one cannot deny that some true statements like "red is not green" can be justified neither analytically, nor by natural science (Quine, 1953). In the second place, I assume that natural science refers its empirical predictions to experience15 and not to an object independent of experience. This agrees with the statement that holds that experience is not a mere illusion. The problem of whether the abstract objects of science - for instance magnetic fields - are either connected with the realm of experience, or independent realities, or useful fictions remains open.

a

b

I must thank Alexander Afriat, Mario Alai, Gennaro Auletta, Mauro Dorato, Walter Pisent, Federica Russo and Federico Laudisa for their sharp suggestions. By "experience" I mean the system of qualia.

142

143 From the preceding presuppositions, one can formulate the following problem: experience appears to be without any numerical character0, whereas many scientific predictions are numerical values deduced from mathematically structured theories. Then one cannot understand how it is possible for the predictions of science to refer to experience. To sum up, between the two following statements there is a conceptual strain: 1. experience is not an illusion, and is an irreplaceable source of knowledge; 2. the predictions of natural science refer to experience. From the apparent non-numerical character of experience and statement 1. it is in fact possible to deduce that: 3. experience does not have a numerical character. For if either experience were an illusion, or it were not an irreplaceable source of knowledge, one could affirm nothing about its nature. Therefore, afortiori one could not say that it is not numerical. Since it is obvious that: 4. many predictions of science are numerical values deduced from mathematically formulated theories, one cannot understand how it is possible that 2. holds. 2. Possible solutions It is possible to imagine many solutions to the problem posed in the preceding paragraph. A. Platonism. One can deny premise 1., by stating that experience is essentially an illusion; therefore it cannot be a source of knowledge. One can also maintain that experience is not an illusion, and yet that it is not a reliable source of knowledge (a strange position). In either case one cannot also maintain 2. Indeed it is possible to replace it with: 2'. the predictions of science refer to a theoretical object, whose structure derives from an idealization independent of experience. But with 2'. any direct comparison between theory and experience becomes impossible, since predictions of science refer to a theoretical object and not to experience. Indeed from the Platonic viewpoint, only a mathematical representation of the latter is a source of knowledge. In general the Platonic

c

Although experience appears endowed with order relations, like "bigger than", and with an intuitive partial metrics - this is twice bigger than that -, it is not clear how one can obtain numbers from these intuitions.

144 approach does not give an answer to the problem of the passage from the deceptive experience to its objective mathematical representation11. B. Operationalism. It is possible to deny the cognitive value of experience, as in the preceding case, by replacing 2. with a different concept: 2". the predictions of science do not refer to experience, but to the results of a set of operations. According to this point of view a set of operations causes a pointer to indicate a number, which should be the same as that predicted by theory. In this perspective a comparison between the sensorial appearance of a number on an experimental apparatus and in the books of the theoretical physicist is established (Bridgman, 1927). Note that this correspondence is not a knowledge about experience, but an element of a pragmatic process. C. Critical materialism. It is possible to keep both 1. and 2., by replacing 3. with the following statement: 3'. experience does not have a numerical character, but one can represent numerically, at least partially, the knowledge that derives from it. For instance, the statement "the table is taller than the chair" - which comes from experience - can be mathematized by saying that "the table is x centimetres taller than the chair". According to what I have called the materialist point of view, this is possible because experience is produced by the contact between an unknowable object and the human form of perception. This form makes mathematization possible. In other words, the quale is the result of the contact between a material object - which we cannot know completely - and the neurophysiological structure of our sensorial apparatuses. The latter gives the sensorial object the form which makes its mathematical representation possible. (This point of view needs a good psychophysical explanation for the determination of the quale from physical processes, based on the idea that mental states supervene strongly over the physical ones). This thesis is reasonable also because the neurophysiological structures that constitute experience are, at least partially, those that produce mathematical formalism6. D. Empiricism. As in the preceding case, we replace 3. with 3'., but we do not presuppose any object beyond the realm of perception. That is, we perceive real objects, and not a sensorial representation of them. (This position needs a good explanation of the occurrence of sensorial illusions.) Furthermore, d e

For instance, Galileo says that the book of nature is written in geometrical characters. Critical materialism is similar to Quine's perspective. But probably Quine would not accept the strong supervenience of the mental over the physical. See also the physiological criticism of Helmholtz.

145 according to empiricism, mathematical structures are not generated by the human neurophysiological apparatus independently of experience, but they are the result of a process of abstraction and idealization that stems from experience. (From this point of view, strong supervenience of the mental over the physical is not assumed. Therefore the best explanation of the genesis of mathematical idealities is the psychological and not the psychophysical). Then a partial mathematical representation of knowledge, which comes from experience, is possible, because both mathematics and that knowledge are rooted in the same realm of experience'. In this point of view 2. is substituted by: 2 ' " . The predictions of science refer to the result of a set of operations relative to a theoretical object constituted by abstraction and idealization of experience. If 2 " ' . holds one retains in the empiricist approach a few aspects of Platonism and operationalism. 3. Bell's inequality The experimental violation of Bell's inequality implies a conflict between our common sense image of the empirical world and its mathematical representation in quantum mechanics. A detailed analysis of the epistemological meaning of the violation could help to evaluate the different proposed solutions to our problem. Let us consider a composite system of two spin-l/2 particles, which we shall name 1. and 2., prepared in the singlet state whose state is described in the following way:

¥=-L(l + ®2_)--l-(l_®2 + ).

(1)

where 1+ is the state of particle 1. with spin +Vi, 1_ is the state of particle 1. with spin -W, 2+ is the state of particle 2. with spin +V2; 2_ is the state of particle 2. with spin -¥1. The state *F represents a situation where the outcomes l + ,2_ and 1_,2+ both have probability Vi. We consider the possibility of measuring four observables, namely the spin component of particle 1. projected onto two possible directions a, a', and the spin component of particle 2. projected onto two possible directions /?,/?'. With obvious notation, we indicate the values of the observables in the following way:

In a sense this is a phenomenological point of view.

146

5^,5^.,5o,5i. According to quantum mechanics the four observables can only have values ±1 in the unity given by Y2h . Now we assume that there are stochastic hidden variables A. such that: p{S\,S2y

I A,x,y)

= p{S\lX,x)p(S2y

I X,y).

(2)

Where p is a probability measure, x varies on a, a and y on /?, f¥. (2) is the socalled "factorizability condition", which expresses that the result on particle 1. depends neither on the orientation of the measurement apparatus on the particle 2., nor on its result, and vice versa. Then, let us define the following correlation coefficients on an ensemble of N couples of particles prepared in the same state (1): l " c(S',S2.) = lim N -> o o _ V s{"Sl"

' "

Nti

'

1 "

c(Sl,S},) = \imN -> °°—J^Sia"Sp

c(Sl.,S}) = \imN

^-^SlS2;

where the superscript n indicates the progressive number of the N couples of particles prepared in the singlet state. If the factorizability condition (2) holds, it is easy to show that the following inequality must hold as well: \c(Sla,S;) + c(SlS}.)+c(Sla„S>) - c(Sl„S2fi, \<2

(3)

This is the well known Bell inequality in the formulation by Clauser and Home. For certain orientations of the measurement apparatuses, quantum mechanics predicts a violation of (3); and till now experiments have supported quantum mechanics. This means that we have to abandon the factorizability condition (2). 4.

Factorizability analysed

It has been emphasized (Jarrett, 1984) that condition (2) is equivalent to the following:

147 p(S\ly,X) p{S\lX,y,S2y)

= p(SlxlX)

(4a)

= p(S[ly,X),

(4b)

together with the same equations with the superscripts 1 and 2 inverted. (4a) affirms that the result on particle 1. does not depend on the orientation of the measurement apparatus on particle 2., whereas (4b) affirms that the result on particle 1. - given the orientation of the measurement apparatus on 2. - does not depend on the result on particle 2. Shimony called (4a) "parameter independence" and (4b) "outcome independence". To prove that (4a) and (4b) together with the same ones with the superscripts 1 and 2 inverted - are equivalent to (2), note that the first term of (4a) is the same as the second term of (4b). If the factorizability condition could not hold, we have to eliminate either parameter independence or outcome independence. We assume that particle 2. is measured as first. Then the violation of Bell's inequality can be explained in two different ways (Maudlin, 1994, pp. 82-84): I. (Standard) before reaching the measurement apparatus, particle 2. flips a coin to decide the value of its spin. Then immediately after the measurement it communicates the orientation of its apparatus and the result of the measurement to the other particle. Thus the other particle can conform to the prediction of quantum mechanics, taking into account the orientation of its measurement apparatus. (This is a grossly metaphoric representation of what happens according to standard quantum mechanics with the collapse of wave function). II. (Bohmiari) At the moment of the constitution of the singlet state, when the two particles are still in contact, the values of spin for all possible orientations of measurement apparatus of both particles are determined. When particle 2. undergoes measurement it communicates only the orientation of its measurement apparatus to particle 1., so that the latter can conform to the prediction of quantum mechanics, taking into account the orientation of its measurement apparatus. (This is a grossly metaphorical representation of what happens according to Bohmian mechanics.) It is clear that the standard interpretation violates outcome independence and not parameter independence, whereas Bohmian interpretation violates parameter independence and not outcome independence. Indeed in model I. if one changes the setting of the measurement apparatus 2. this has no effect on the result on 1., since 2. communicates with 1. only after the first measurement. Therefore outcome independence but not parameter independence is violated. On the contrary, in model II. the results are already determined at the beginning,

148 whereas the orientation of the setting is relevant. In spite of this, interpretations I. and II. both recover the experimental evidence and the prediction of quantum mechanics. It seems that it is impossible to discriminate experimentally between parameter and outcome independence. Indeed, if one measures the spin of particle 2. as first, then it is impossible to determine experimentally the influence of the sole orientation of the measurement apparatus 2. on the result of particle 1., since the result of particle 2. is already determined. To obtain this, one would have to measure particle 1. as first. But, in this case, one could not test parameter independence (4a) with superscripts 1 and 2 inverted. One could also prepare a great number of equal couples in state (1); and on some of them one could measure particle 1., on the others particle 2., as first. But the doubt would always remain as to whether the orientation of the first measurement would influence the outcome of the second. An analogous argument holds for outcome independence. Therefore only the interpretations and not the experiments discriminate between outcome and parameter independence. It is also clear that the violation of parameter independence makes it possible to send superluminal signals from particle 1. to particle 2., whereas the violation of outcome independence does not. Therefore, in order to avoid superluminal signals, it seems1 sensible to prefer an explanation of kind I., similar to the one of standard quantum mechanics. Then, we arrive at the conclusion that, if outcome independence is refused, it is no longer possible to prove Bell's inequality, so that the experimental violation of the latter is explained. On the other hand, it has been shown (Maudlin, 1994, p. 95) that the factorizability condition is also equivalent to the following: p(S\lS2y,Z) = p(SlxIX) p{S\lS)Ay) = p(S\lS]A),

(5a) (5b)

together with the same equations with the superscripts 1 and 2 inverted. (5a) affirms that the result on particle 1. is independent of the result on particle 2., without taking into account the orientation of the measurement apparatuses, whereas (5b) affirms that the result on particle 1. is independent of the orientation of the measurement apparatus on particle 2., given the result on g

It is noteworthy that the Bohmian approach in general explains why it is impossible to send superluminal signals.

149 particle 2. To prove the equivalence of (5a) and (5b) - together with the same ones with the superscripts 1 and 2 inverted - with (2), note that the first term of (5a) is the same as the second term of (5b). We shall call (5a) "result independence" and (5b) "orientation independence". With respect to the interpretation of the standard and Bohmian kind the situation now is that in standard quantum mechanics, orientation independence but not result independence is violated. Indeed if the initial probability to find 1 on particle 1. is Vi, and we consider, for instance, all cases in which the result on particle 2. is 1, the initial probability on 1. does not change, because it is influenced solely by the result and the orientation of 2. On the contrary, if one chooses a set of results with the setting on particle 2. determined, result 1 on particle 1. will no longer have probability Vi. For model II. too result independence is not violated, whereas orientation independence is violated11. As for outcome and parameter independence, it seems that it is impossible to discriminate experimentally between result and orientation independence. Moreover the violation of orientation independence allows superluminal signals to be sent, whereas the violation of result independence does not. We have no reasons to prefer the decomposition of the factorizability condition in parameter and outcome independence or in result and orientation independence. Therefore we are compelled to accept that the factorizability condition as a whole is violated. To sum up, we have good reasons to believe that factorizability condition (2) is violated experimentally and result independence (5a) is not violated, whereas we cannot know if parameter independence (4a) is violated. We report here these three conditions for the convenience of the reader. p(Sx,S2y/A,x,y) p{S\ly,X)

= p(Sx/A,x)p(S2y/A,y)

(2)

= p{S\lX)

(4a)

P(S\IS],X) = P(S\IX).

(5a)

5. Quantum non-locality and the proposed solutions of the problem In the preceding paragraphs we discussed two problems: that of the correlation between the numerical predictions of mathematical physics and the qualitative If the GRW approach is developed only for measurement, it violates outcome independence but not parameter independence, Butterfield et Al., 1993. To my knowledge, there is no discussion of the relation between GRW and result/orientation independence.

150 nature of experience and that of the conflict between the apparent local nature of phenomena and quantum non-locality. In our common sense world we tend to think that whenever there is a statistical correlation, there must be a common cause that explains it. For instance, if every time we go to the swimming-pool we meet Mr. Smith, and for the most part we meet Mrs. Jones as well, then we formulate the hypothesis that they always have a phone-call agreement before. That is, if: p(A)p(B)>

p(A&B),

and A occurs a long way from B, the apparent structure of our experience suggests that there is a common cause C such that: p(A/C)p(B/C)

= p(A&B),

and C occurs in the past with respect to A and B in a place where A and B were spatially contiguous. If experience is an irreplaceable source of knowledge (premise 1.), then the local character of phenomena - expressed by the abovementioned common cause principle - receives part of its justification from the qualitative nature of experience, so that we may utilise the problem of quantum non-locality to test the four interpretations proposed of the relation between mathematics and experience. Indeed the experimental violation of Bell's inequality derives part of its philosophical interest from the conflict between common experience and mathematical physics that it entails. We moved from another example of incompatibility between experience and mathematical physics, that is, the one between the numerical character of the predictions of the latter and the qualitative character of the former. Therefore it is possible that a discussion of Bell's inequality would be relevant for our problem. We are thus going to discuss how the different solutions to our first problem would interpret the experimental violation of the factorizability condition. A. Platonism. The violation of the equation (2) confirms again that experience is not a reliable source of knowledge. Only the mathematical representation of experience is truthful, since therein, it is possible that factorizability would be violated'. B. Operationalism. As with the Platonist, neither is the operationalist worried about the violation of factorizability, since he does not believe that 1

For instance, according to Howard, 1989 as a consequence of the violation of Bell's inequality we have to modify our ontology.

151 experience has a cognitive value. It is in fact our experience that provides us with the intuition that a statistical correlation needs an explanation through a common cause. The Platonist believes that mathematical representation gives a truthful image of reality, whereas perception is deceptive; on the contrary, the operationist ascribes no representative value to mathematics, since experimental results are the product of a set of operations. In conclusion, the violation of factorizability needs no further explanation1. C. Critical materialism. According to materialism, there are many different mathematical representations, as a consequence of different possible constitutions of experience due to different kinds of neurophysiological apparatuses. The choice to measure spin and to ascribe certain mathematical properties to this observable is a product of a particular neurophysiological configuration of our sensorial apparatus and neocortex structure. Therefore the impossibility to find a common cause that explains the violation of factorizability could be the effect of the mathematical representation of the world. Then, it would be possible that future different neural constitutions of experience would lead to different theories - with different observables - which neither predict, nor allow the violation of factorizabilityk. D. Empiricism. In this perspective a neurophysiological explanation of the constitution of experience does not exist, since, even if it is possible that the mental supervenes over the physical, there are no strict psychophysical laws. Then, though uncertain and theoretically compromised, experience is a genuine source of knowledge. Moreover mathematical structures derive from processes of abstraction and idealization, which come from experience. Finally the real object is not something beyond experience, but a reasonable construction based on experience. Therefore it is impossible to state definitively that something is objective. Hence it is possible to interpret quantum non-locality as partially real and partially due to the mathematical representation of phenomena1. 6. Critical evaluation In order to put forward the discussion of the four preceding perspectives an analysis is necessary of the distinction between the orientation of the measurement apparatus and the outcome of the measurement. First of all we note that the setting of an apparatus for the measurement of the spin is neither a part j

This is the position of Fine, 1989 and van Fraassen, 1989. This point of view, although not maintained explicitly by anyone - to my knowldge - is in accordance with a general subjectivistic interpretation of quantum mechanics. In a certain sense this is Bohr's point of view.

152 of the process of preparation, nor an actual measurement. Orientation is something that follows preparation, but precedes measurement. Moreover through orientation it would be possible to send a signal from one particle to the other. Since a signal is not a mere causal connection, but a causal connection produced by an external agent, and the agent can modify the orientation, one concludes that the setting of the measurement apparatus implicitly involves an unavoidable element concerning the presence of an external structure capable of intervening in the measurement. One could observe that also the preparation of a system implicitly involves an external agent, since there is someone who chooses how to prepare the system. But there is an important difference with respect to the preceding case, since systems are all prepared in the same state, whereas orientation can be modified each time. Therefore the orientation is essentially connected with the agent during the measurement; this is the only truly external element in the actual measurement process. Given that the various settings of the measurement apparatus are reciprocally incompatible, in the Bell-type experiments orientation plays a role analogous to that played by the choice of measuring position or moment in Bohr's discussion about complementarity and indeterminacy. The presence in standard quantum mechanics of incompatible observables makes clear the role of choice in the measurement process. On the other hand in the outcome as such the agent is absent, since after the choice of orientation, external structures do not play any further role. On the basis of the preceding analysis, one could maintain the four following theses: Platonism confirmation. If, beside the factorizability condition (2), the result independence (5a) were also violated, then Platonism would have a confirmation. Orientation, in fact, introduces an external element, while Platonism states the objectivity of the mathematical representation with respect to the common sense intuitions. Hence a statistical correlation between results - independently of the setting of the measurement apparatus - would confirm that non-locality is autonomous with respect to external choices. Critical materialism confirmation. If, beside the factorizability condition (2), parameter independence (4a) were also violated, then critical materialism would receive a confirmation. Materialism emphasizes, in fact, the human element in the constitution of experience and in the mathematical representation of it. The setting of the measurement apparatus is the place where an external structure intervenes in the

153 process of measurement. If the latter were so relevant as to modify the outcome on the other particle, one would have a confirmation of materialism. Empiricism confirmation. The violation of the factorizability condition (2) only, without the violation of parameter independence (4a), and that of result independence (5a), confirms empiricism. Empiricism emphasizes, in fact, the impossibility of distinguishing definitively the contribution of the neurological apparatus of the observer from that of the physical system. If there are neither any arguments favouring a mere statistical correlation between results - i.e. violation of (5a) - nor arguments favouring the influence of the orientation as such on the outcome of the other particle - i.e. violation of (4a) - then one has good reasons to consider mathematics connected directly with experience and not with the neurophysiological structure of the observer. Irrelevance for operationalism. For operationalism, the violation of the factorizability condition (2), the violation of parameter independence (4a) and the violation of result independence (5a) are all epistemologically irrelevant. Operationalism, in fact, confines itself to considering a correspondence between elements of experience, without committing itself on the cognitive value of the latter relative to the physical system or to the observer. Operationalism neither affirms that sensorial intuition is erroneous with respect to mathematical representation of physical reality (Platonism), nor that mathematics is something added to experience in the constitution of physical reality (critical materialism), nor that mathematical representation and physical reality have a common origin in experience (empiricism). Therefore whether it is the observer - violation of parameter independence (4a) - or results as such - violation of result independence (5a) - or both that are in conflict with experience is not relevant. 7. Concluding remarks First of all it should be noted that the four theses do not imply the definitive validity of empiricism, but only prove that empiricism receives a confirmation from the analysis of quantum non-locality. Moreover the discussion of quantum non-locality shows that the latter is neither a mere statistical correlation between results on distant particles without a common cause, nor a causal connection between the setting of a measurement

154 apparatus on one particle and the outcomes on the other, but a statistical correlation between the outcomes and the orientations of two distant particles. Furthermore it is necessary to underline that quantum non-locality does not impose the existence of action-at-a-distance. The latter would surely be present if it held that making or not making the measurement on a particle would modify the results on the other (Redhead, 1987, pp. 113ff.). This is not the case in quantum mechanics. On the other hand, since non-locality is not a mere relation between the results on the two particles, one concludes that it is not an altogether objective phenomenon independent of the mathematical representation of the physical reality. Moreover, since quantum non-locality is not a causal connection between the setting of the measurement apparatus on one particle and the outcome on the other, one cannot deduce that non-locality is due above all to the kind of mathematical representation chosen by scientists. Being certain that only the whole factorizability condition is violated, one argues that quantum non-locality is a hybrid phenomenon, partly due to physical reality, partly to the scientific representation of the latter. This amphibious character of non-locality confirms the empiricist's perspective, which holds that it is not possible to distinguish definitively between what is part of the physical system and what is part of the representation of the latter. References 1. P. W. Bridgman, The Logic of Modern Physics (The McMillan Company, New York, 1927). 2. J. Butterfield, G. N. Fleming, G. C. Ghirardi and R. Grassi, "Parameter Dependence in Dynamical Models for State-Vector Reduction", International Journal of Theoretical Physics 32, (1993). 3. D. Dennett, Consciousness Explained (Little Brown, Boston, 1991). 4. A. Fine, "Correlations Need to Be Explained?", in Philosophical Consequences of Quantum Theory, J. T. Cushing and E. McMullin eds. (University of Notre Dame Press, 1989). 5. B. van Fraassen, "The Charybdis of Realism: Epistemological Implications of Bell's inequality", in Philosophical Consequences of Quantum Theory, J. T. Cushing and E. McMullin eds. (University of Notre Dame Press, 1989). 6. D. Howard, "Holism, Separability, and the Metaphysical Implications of the Bell Experiments", in Philosophical Consequences of Quantum Theory, J. T. Cushing and E. McMullin eds. (University of Notre Dame Press, 1989). 7. F. Jackson,"Epiphenomenal Qualia", Philosophical Quarterly 32, (1982). 8. J. Jarrett, "On the Physical Significance of the Locality Conditions in the Bell Argument", Nous 18, (1984).

155 9. T. Maudlin, Quantum Non-locality and Relativity (Blackwell, Oxford, 1994). 10. T. Nagel, "What is it Like to Be a Bat?", Philosophical Review 83, (1974). 11. W. V. O. Quine, "Two Dogmas of Empiricism", in From the Logical Point of View (Harvard University Press, Harvard, 1953). 12. G. Ryle, The Concept of Mind (Hutchinson, London, 1949). 13. L. Wittgenstein, Philosophische Untersuchungen (Basil Blackwell, Oxford, 1953).

ON THE NOTION OF PROPOSITION IN CLASSICAL AND QUANTUM MECHANICS

C. GAROLA Dipartimento

di Fisica dell'Universita and Sezione Via per Arnesano, 73100 Lecce, Italy E-mail: [email protected]

Dipartimento

di Fisica dell'Universita and Sezione Via per Arnesano, 73100 Lecce, Italy E-mail: [email protected]

INFN,

s. sozzo INFN,

The term proposition usually denotes in quantum mechanics (QM) an element of (standard) quantum logic (QL). Within the orthodox interpretation of QM the propositions of QL cannot be associated with sentences of a language stating properties of individual samples of a physical system, since properties are nonobjective in QM. This makes the interpretation of propositions problematical. The difficulty can be removed by adopting the objective interpretation of QM proposed by one of the authors (semantic realism, or SR, interpretation). In this case, a unified perspective can be adopted for QM and classical mechanics (CM), and a simple first order predicate calculus C(x) with Tarskian semantics can be constructed such that one can associate a physical proposition (i.e., a set of physical states) with every sentence of C(x). The set V? of all physical propositions is partially ordered and contains a subset Vj. of testable physical propositions whose order structure depends on the criteria of testability established by the physical theory. In particular, Tip turns out to be a Boolean lattice in CM, while it can be identified with QL in QM. Hence the propositions of QL can be associated with sentences of C(x), or also with the sentences of a suitable quantum language CTQ (X) , and the structure of QL characterizes the notion of testability in QM. One can then show that the notion of quantum truth does not conflict with the classical notion of t r u t h within this perspective. Furthermore, the interpretation of QL propounded here proves to be equivalent to a previous pragmatic interpretation worked out by one of the authors, and can be embodied within a more general perspective which considers states as first order predicates of a broader language with a Kripkean semantics.

156

157 1. Introduction It is often maintained in the literature on the foundations of quantum mechanics (QM) that the lattice of propositions of quantum logic (QL) a is a logical calculus which is different from the classical logical calculus and specific of QM (see Ref. 1 for a review on this subject till the early seventies; for a more recent perspective, together with an updated bibliography, see, e.g., Refs. 2 and 3). Yet many scholars do not accept this view and argue that QL is a mathematical structure with a physical interpretation, not a new logic (for an explicit statement of this position see, e.g., Ref. 4). In our opinion, the unsettled quarrel between the positions above finds its roots in a specific feature of the standard interpretation of QM, that is, nonobjectivity of physical properties. Because of this feature, there are sentences attributing physical properties to samples of a given physical system that are meaningful or meaningless (i.e., have or have not a truth value, respectively) depending on the state of the object, and also sentences that are meaningless in any case, even if they belong to the natural language of physics (a known example of these is the statement "the particle x has position r and momentum p at time t"). Hence the propositions of QL cannot be connected in a direct way with sentences of this kind, following standard procedures in classical logic (CL), which makes their logical interpretation problematical (in particular, QL seems to introduce a new mysterious concept of quantum truth5)b. The above difficulties cannot be removed as long as nonobjectivity is maintained to be an unavoidable feature of QM. Nevertheless most physicists accept nonobjectivity, basing this acceptance on well known no-go theorems (the most famous of which are probably Bell's 6 ' 7 and Bell-KochenSpecker's 7 - 9 ). It has been proven in a number of papers by one of the authors, however, that these theorems, which are mathematically well established, rest on assumptions which follow from implicitly adopting an a

For the sake of brevity, we simply call quantum logic here the formal structure that is called in literature concrete, or standard, (sharp) quantum logic,3 together with its standard physical interpretation. b A rather recent investigation on the concept of proposition has been done by Redei 2 . Within Ridei's analysis physical properties, or sentences about probabilities of properties, are directly taken as elementary sentences of a logical language, and propositions are identified with equivalence classes of (elementary or complex) sentences, each class containing all sentences which are equivalent with respect to a quantum concept of truth. Our analysis here considers a different kind of elementary sentences and introduces various kinds of propositions. The lattice of R&iei's propositions is then isomorphic, in QM, to the lattice of all testable physical propositions introduced here (Sec. 6).

158

epistemological position which is suitable for classical physics but contrasts with the operational philosophy of Q M . 1 0 - 1 6 To be precise, they assume the simultaneous validity of a set of empirical physical laws in which the observables that appear in some laws are incompatible with the observables that appear in other laws, so that it is impossible, according to QM, to check whether all the laws of the set hold simultaneously. This suggests that the simultaneous validity assumption should be dropped in QM: but, then, the no-go theorems cannot be proved. It follows that the nonobjectivity of physical properties can no more be classified as a logical necessity, but only as a (legitimate) interpretational choice, and alternative interpretations of QM in which objectivity of properties is restored become possible. An interpretation of this kind has then be constructed by one of us, together with other authors (semantic realism, or SR, interpretation 1 1 - 1 3 , 1 5 ' 1 7 , 1 8 ). The SR interpretation preserves the mathematical apparatus and the statistical interpretation of QM, and yet considers every elementary sentence attributing a physical property to a given individual physical object as meaningful (though its truth value may be empirically accessible or not, depending on the state of the object). Because of objectivity, the SR interpretation avoids the difficulties of the standard interpretation pointed out above, so that physical propositions can be introduced in QM associating them to sentences of a suitable classical predicate calculus. This allows us to propound in this paper a general scheme based on classical logic for the introduction of physical propositions in physical theories, which can then be particularized to classical mechanics (CM) and to QM. Our scheme explains, in particular, how QL can be obtained by using a testability criterion for selecting a suitable subset in the set of all physical propositions, and shows that a notion of quantum truth can be derived from the classical notion of truth as correspondence (as explicated rigorously by Tarski's semantic theory 19,20 ). In order to favour a better understanding of the above results, let us describe the content of the present paper in more details. In Sec. 2 we construct a classical first order predicate calculus C(x), with monadic predicates and one individual variable only, in which a classical (Tarskian) notion of truth is adopted, and associate a family of individual propositions, parametrized by the interpretations of the variable, with every (open) sentence of C(x). In Sec. 3 we define physical propositions, introduce the truth value certainly true on C(x) (which adds without contradiction to the standard values true Jfalse), and study some properties of the poset (V*,C.) of all

159

physical propositions. In Sec. 4 we conclude the general part of the paper by introducing the subset VT C V* of all testable physical propositions, which is basic for the analysis of measurement processes in the framework of specific physical theories (as CM and QM). In Sec. 5 we specialize the notions introduced in the previous sections to CM. We show that, if suitable axioms (which are justified by the intended interpretation) are introduced, the concepts of individual proposition, physical proposition and testable physical proposition can be identified, which provides a very simple scheme that explains why people usually say that "classical mechanics follows classical logic" (which is however a misleading statement in our opinion). In Sec. 6 we show that the different kinds of propositions introduced in the general part cannot be identified in QM, and introduce some specific axioms which are supported by the broad existing literature on QL. These allow us to construct a quantum language CTQ{X), based on C(x), which is such that the set of all physical propositions associated with its sentences coincides with Vj, and can be identified with the set of all propositions of QL. It follows that every proposition of QL can be associated with a sentence of a suitable first order predicate calculus, as in classical logic, and that the set of all propositions of QL is selected on the basis of a criterion of testability, which is tipically physical and shows the empirical character of the lattice structure of QL. In Sec. 7 we use the interpretation provided in Sec. 6 in order to look deeper into the concept of 'quantum truth'. We show that this concept directly follows in our approach from the concept of certainly true introduced in the general part, hence it does not conflict with the classical concept of truth. This provides a satisfactory unification of notions that are usually regarded as incompatible. In Sec. 8 we discuss the relations between the semantical interpretation of QL provided in Sec. 6 with the pragmatic interpretation propounded by one of us in a recent paper. 21 We show that the two interpretations can be easily translated one into the other, and that they are intuitively equivalent. In Sec. 9 we briefly comment on our approach from a general logical perspective. We note that individual and physical propositions can be considered as propositions in a standard sense in CL if states are considered as possible worlds (modal interpretation of QL). This interpretation is however problematical, and we briefly sketch a possible alternative which refers to

160

the broader language introduced by one of us, together with other authors, in some previous papers. 17 ' 18 2. T h e language

C(x)

The formal language that we want to construct in this section is a simplified and modified version of the more general language introduced in some previous papers 17,18 with the aim of formalizing a sublanguage of the observative language of QM. The alphabet of C(x) consists of an individual variable {E, F,...} of monadic predicates called properties, a set {->, A, V} of logical connectives and a set {(, )} of auxiliary signs. The formation rules for sentences, or well-formed formulas (wffs), of C(x) are the standard (recursive) formation rules for wffs of a classical first order predicate calculus, in which -i, A, V denote negation, conjunction and disjunction, respectively. We denote by cj>{x) the set of all wffs of C(x), and by £(x) the set of all elementary sentences (or atomic wffs) of C(x). The semantics of C{x) consists of a family of Tarskian semantics parametrized by a set S of states. Every S € S is associated with a universe Us of physical objects. An interpretation of the variable a; is a mapping p : (x, S) € {x} x S —>• ps(x) € Us- For every S € S and E € £, an extension extsE c Us is defined. The atomic wff E(x) is true in the state S for the interpretation p iff ps(x) € extsE, false otherwise. The truth value of molecular wffs of C(x) is then defined following standard (recursive) truth rules in Tarskian semantics. For every interpretation p and state S, we call assignment function the mapping a^ : (#) —• {T, F} (where T stands for true and F for false) which associates a truth value with every wff of C(x) following the truth rules mentioned above. The intended interpretation of C{x) is anticipated by the terminology that we have adopted. States are defined operationally as classes of physically equivalent preparation procedures (briefly, preparations) and properties as classes of physically equivalent (ideal) registration procedures (briefly, registrations)0. The universe Us consists of samples of a prefixed physical system fi prepared according to any preparation in S. Whenever an interpretation p and a state S are given, an elementary sentence, say E{x), of C{x) states a (physical) property E of the physical object c

The notion of physical equivalence is not trivial and requires a careful analysis of the notions of preparation and (ideal) registration procedure.18 We do not insist on this issue here for the sake of brevity.

161

ps(x) € Us (by abuse of language, we often avoid mentioning the interpretation p in the following, and briefly say that E(x) attributes the property E to the physical object x in the state S). It must be stressed that the intended interpretation of C(x) provided here implies that the semantics of C(x) is incompatible with QM whenever the standard interpretation of QM is adopted. Indeed, within this interpretation QM is maintained to be a semantically nonobjective (or contextual) theory, which implies that the extension extsE is not denned for every property E. Hence the general scheme for propositions in physical theories propounded in this paper is based on an explicit acceptance of the SR interpretation of QM mentioned in the Introduction, which is semantically objective (we have already noted in the Introduction that the possibility of such an interpretation follows from a criticism of the implicit assumptions underlying the no-go theorems that should prove that QM is necessarily a nonobjective theory). It must also be stressed that the operational definition of properties as classes of registrations makes every elementary wff E(x) E 4>(x) testable, in the sense that a physical procedure exists that, under specified physical conditions, allows one to check empirically the truth value of E{x). Yet, it is important to observe that this check does not reduce in all theories to registering a physical object x in the state S by means of a registration in E. There are indeed physical theories, as QM, in which the registration usually modifies the state 5 in an unpredictable way, so that the obtained result refers to the state after the registration, not to S. In these theories the empirical accessibility of the truth values of E(x) is then restricted to a proper subset of states which depends on E (see Sec. 7). Let us introduce now some further definitions. Firstly, two binary relations of logical preorder < and logical equivalence = can be defined on 4>{x) by following standard procedures in classical logic, i.e., by setting, for every a{x),fi{x) e (j>{x),

a(x)<0(x) for every pe1l,Se

S,a%(a(x))

iff = T implies aps(/3(x)) = T,

and a(x) = P(x)

iff

a(x) < /3(x) and 0(x) < a(x).

It is then easy to see that the partially ordered set (briefly, poset) (4>(x)/s ,<) (where < denotes, by abuse of language, the order canonically induced

162

on <j>(x)/= by the preorder < defined on (j>{x)) is a Boolean lattice (the Lindenbaum-Tarski algebra of £(#)). Secondly, let TZ be the set of all possible interpretations of x. Then, we associate an individual proposition P^(;r% with every pair (p,a(x)) E n x 4>{x), defined as follows. | aps(a(x))=T}.

4 ) = {5£5

(2.1)

The definition of p£(x) implies that aps{a{x))=T

Seppa{x).

iff

Furthermore, one easily gets that, for every elementary wff E(x) € <j)(x), P"E(X)={S£S

while for every a(x),/3(x)

I Ps(x)eextsE],

(2.2)

E <j>{x) one gets *C(*)= s\Paixy

Pa(x)A0(x) Pa{x)V0(x)

=

(2-3)

n

(2-4)

=

Pa(x) ^ / 3 ( i ) ' Pa(x)

U

(2-5)

^3(x)

(where \ , n, U denote set-theoretical subtraction, intersection and union, respectively). Let C denote set-theoretical inclusion and let V be the set of all individual propositions associated with sentences of cj>{x) whenever p is fixed. Then, Eqs. (2.3), (2.4) and (2.5) imply that also the poset (V, C) is a Boolean lattice. Thirdly, by using the definitions of logical order, logical equivalence and proposition introduced above, we get a(x) < P{x)

iff for every p £ 11, ppa(x)

C

pp0{x),

a{x) = P(x)

iff for every pen, ppa(x)

=

ppp{x),

which show that the logical relations on (x) imply set-theoretical relations on every set V of individual propositions. 3. T h e poset of physical propositions The intended interpretation of C{x) introduced in Sec. 2 suggests to associate a set of states with every sentence of C{x), to be precise the set of states which make this sentence true whatever the interpretation of the variable may be. Thus, for every a(x) E <j)(x) we define a physical proposition Pfa(x)' as follows. Pfa{*) = {SES\VPEn,

aps(a(x)) = T}.

(3.1)

163 By using the definitions introduced in Sec. 2, we then get

p'aw = {S€S\vP€ii,S€

p£(x)} = npifa{x).

(3.2)

We denote by V* the set of all physical propositions associated with wffs of £(x), that is, we put Vf = {Pfa(x) I «(*) e (x)}-

(3-3)

For every a(x) € <j){x) we can now introduce the notion of "true with certainty" by setting: a{x) is certainly true in S

iff

S 6 Paixy

The new notion thus follows from the standard notion of truth introduced in Sec. 2 and applies to open wffs of {x) also allows us to introduce the new binary relations of physical preorder and physical equivalence on (j){x). For every a{x),(i{x) E (j>(x), we put a(x)~i0(x)

iff

pfa(x)

C

pf0(x),

a(ar)«0(aO

iff

pfa{x)

=

pf0{x).

By comparing the definitions of -< and ss with the definitions of < and =, respectively, one gets a(x) < j3(x)

implies

a(x) -< 0(x),

a(x) = (3(x)

implies

a(x) « fi(x),

that is, logical preorder implies physical preorder and logical equivalence implies physical equivalence. The converse implications do not hold in general, in the sense that one cannot prove that they hold without introducing further assumptions. We come back on this issue in Sees. 5 and 6. Let us come now to the set V* of all physical propositions. This set is obviously partially ordered by set-theoretical inclusion, but the properties of the poset (V*,C) depend on the specific physical theory that is considered. In particular, one cannot generally assert that (V?, C) is a Boolean lattice, as (Pp, C). However, some weaker features of it can be established. Indeed, let a(x),f3(x) € 4>(x). Then, the following statements hold.

164

(0 *£„(,) CS\p£ ( x ) , (") Pa(x)A0(x) —Pa(x) nP0(x)' ( Ui )pi( i «)V«*)2Pi(x) U Pj(x) (note that, generally, neither S \ pfa{x)

nor pfa{x)

belong to Vs;

U p^x)

statement (ii) shows instead that paix\ ^Pgix\ belongs to V?). Let us prove statements (i), (ii) and (in). By using Eqs. (3.2) and (2.3), we get

PL(X) = n p
=

n

pPa(x)A0(x)

= n

p(Pa(x)

R

P$(x))

=

= OV^s)) n (ry£ ( x ) ) = Pfa{x)r\pf0{xy Finally, by using Eqs. (3.2) and (2.5), we get Pa(,x)V0(x) ~ npPa(x)V0(x)

=

n

p(Pa(x)

U

P/3(x)) -

3 (n P ^ ( x ) ) u (n„pg(x)) =Pfa{x) u ^ ( i ) . To close up, we note that the definitions of -< and « on (x)/~ by the preorder -< defined on (x)) is orderisomorphic to (V?, C). 4. The general notion of testability The intended physical interpretation of C{x) suggests that a sentence of C{x) can be classified as empirically decidable, or testable, iff it can be associated with a registration procedure that allows one (under physical conditions to be carefully specified, see Sec. 2) to determine its truth value whenever an interpretation p of the variable x is given. Since all elementary sentences are testable, one is thus led to define the subset T(X) Q <j>(x) of all testable wffs of (f>{x) as follows. {x) | 3Ea G £ : a(x) = Ea(x)}.

(4.1)

The subset Vj- C V* of all physical propositions associated with wffs of (J>T(X) will then be called the set of all testable physical propositions. More formally,

H = {P{{X) e ?f I «(*) e M*)Y

(4-2)

165

Of course, CJ>T(X) is preordered by the restrictions of the preorders < and -< denned on {x) to it. For the sake of simplicity, we will denote preorders and equivalence relations on <J>T(X) by the same symbols used to denote them on (j>(x). Hence, the logical preorder < implies the physical preorder -<, and the logical equivalence = implies the physical equivalence sa also on 4>T(X). We thus get two preorder structures, (<J>T{X), <) and ((/>T(X), -<), and two posets (T(x)/=,<) and (T(£)/R;,~0- The latter, in particular, is isomorphic to (Vj>, Q)We shall see in the next sections some further characterizations of the foregoing posets within the framework of specific theories. 5. Classical mechanics (CM) It is well known that in classical mechanics (CM) all physical objects in a given state S possess the same properties. This feature of CM can be formalized here by introducing the following assumption. CMS. For every S 6 <S and E e £, either extsE — Us or extsE = 0. It follows from assumption CMS that, for every interpretation p € 1Z, Ps{x) 6 extsE iff extsE = Us, and ps(x) £ extsE iff extsE = 0. Therefore, the assignment function o-ps does not depend on the specific interpretation p. More explicitly, for every interpretation p and state S, ( <rps{E(x))=T { aps(E(x))=F

iff iff

extsE = Us extsE =
o-"s(E(x) A F(x)) =T crps(E{x) A F(x)) =F

iff iff

extsE = Us = extsF extsE ^ extsF

(where E,F e £), etc. Since aps does not depend on p, neither the individual proposition ppaix\ depends on p, and we can omit writing the index p in both symbols. Thus, for every p £1Z, the individual proposition associated with a(x) E (f>(x) is given by pa{x)

= {SeS:

as(a(x))

= T}.

(5.1)

More explicitly, we have PE(X) = {SeS: PE(X)AF(X) = {S eS:

extsE

extsE

= Us},

= Us = extsF}

= pE{x)
(5.2) (5.3)

166

etc. The set V of all individual propositions associated with wffs of C(x) obviously does not depend on p, and will be simply denoted by V. Because of the above specific features, the general notions introduced in Sees. 2, 3, 4 particularize in CM as follows. For every a(x),(}(x) € 4>{x), and S E S, as{a{x))=T a(x)<0{x)

iff iff

S€paix), Pa(x)QP0(x),

a(x)=P{x)

iff

pa(x)=p0(x).

It also follows from the general case that the Lindenbaum-Tarski algebra (<j)(x)/=, <) of C(x) is isomorphic to the Boolean lattice of individual propositions (V, C), so that the two lattices can be identified. Coming to physical propositions, we get, for every a(x) € <j){x), Pfa{x)=Pa(x),

(5-4)

and, therefore, V* = V. Thus, the set of all physical propositions coincides in CM with the set of all individual propositions, and the notions of true and certainly true also coincide. Furthermore the intended physical interpretation suggests that every sentence of the language C{x) is testable in CM. This inspires the following assumption. CMT. The set of all testable sentences of the language C(x) coincides in CM with the set of all sentences of C(x), that is, T(X) — (j>(x) in CM. Assumption CMT implies that V^ = Vf = V, whence CP£,C) = (P,C). More explicitly, the poset of all testable physical propositions of a physical system Q, coincides with the poset of all individual propositions of its language £(x), and has the structure of a Boolean lattice. This result explains, in particular, the common statement in the literature that "the logic of a classical mechanical system is a classical propositional logic" ? This statement is however misleading in our opinion, since it ignores the conceptual difference between individual, physical and testable physical propositions, that coincide in CM only because of assumptions CMS and CMT.

167 6. Quantum mechanics (QM) We have stressed in Sec. 2 that our semantics (hence the general scheme in Sees. 2, 3 and 4) is unsuitable for QM whenever the standard interpretation of this theory is accepted. As anticipated in the Introduction and in Sec. 2, we therefore adopt in the present paper the SR interpretation of QM worked out by one of the authors and by other authors in a series of articles, 11 ~ 1 3 ' 1 5 , 1 7 , 1 8 according to which extsE can be defined in every physical situation (we show in Sec. 7 that the new perspective also allows us to elucidate the concept of quantum truth underlying the standard interpretation of QM). At variance with CM, it may then occur in QM that 0 7^ extsE ^ lis, so that the assignment function aps generally depends on the interpretation p. The formulas written down for the general case cannot be simplified as in Sec. 5. In particular, Vf ^ V, assumptions CMS and CMT do not hold, and v£cVf. In order to discuss how the general case particularizes when QM is considered, let us briefly remind the mathematical representations of physical systems, states and properties within this theory. Let fi be a physical system. Then, fi is associated with a separable Hilbert space V. over the field of complex numbers. Let us denote by (£(H), C) the poset of all closed subspaces of 7i, partially ordered by settheoretical inclusion, and let A C £(H) be the set of all one-dimensional subspaces of H. Then (in absence of superselection rules) a mapping

(6.1)

exists which maps bijectively the set <S of all pure states of fi onto A (for the sake of simplicity, we will not consider mixed states in this paper, so that we understand the word pure in the following)d. In addition, a mapping X:EE£^

xiE) € C(U)

(6.2)

exists which maps bijectively the set £ of all properties of fi onto C(Jl). The poset (£(H), C) is characterized by a set of mathematical properties. In particular, it is a complete, orthocomplemented, weakly modular, atomic lattice which satisfies the covering law. 2 2 - 2 4 We denote by -1, fn\ and l!U orthocomplementation, meet and join, respectively, in (£(H), C) (it "It follows easily that every pure state S can also be represented by any vector \r/>) £ f(S) G -4, which is the standard representation adopted in elementary QM. Moreover, a pure state 5 is usually represented by an (orthogonal) projection operator on tp(S) in more advanced QM. However, the representation ip introduced here is more suitable for our purposes in the present paper.

168

is important to observe that T fH coincides with the set-theoretical intersection fl of subspaces of C(H), while ^ does not generally coincide with the set-theoretical complementation ', nor iyj coincides with the set-theoretical union U). Furthermore, we note that A obviously coincides with the set of all atoms of (£(%), C). Let us denote by -< the order induced on £, via the bijective representation x, by the order C defined on C(V). Then, the poset {£,-<) is orderisomorphic to (£(%), C), hence it is characterized by the same mathematical properties characterizing (C(H), C). In particular, the unary operation induced on it, via x, by the orthocomplementation defined on (£(%), C), is an orthocomplementation, and {£, -<) is an orthomodular {i.e., orthocomplemented and weakly modular) lattice, usually called the lattice of properties of fi. By abuse of language, we denote the lattice operations on (£, -<) by the same symbols used above in order to denote the corresponding lattice operations on (£(H), C). Orthomodular lattices are said to characterize semantically orthomodular QLs in the literature. 3 The lattice of properties (£, -<) is a less general structure in QM, since it inherits a number of further properties from (£(%), C), and can be identified with the concrete, or standard, sharp QL mentioned in Sec. 1 (simply called QL here for the sake of brevity). A further lattice, isomorphic to (£, -<), will be used in the following. In order to introduce it, let us consider the mapping

9:Ee£—>SE

= {SeS\

tp(S) C * ( £ ) } e C(S),

(6.3)

where £(<S) = {SE \ E e £} is the range of 8, and generally is a proper subset of the power set V(S) of S. The poset (£(<S), C) is order-isomorphic to (£(%), C), hence to (£, -<), since

169

because of the analogous result holding in (£(H), C) e . Basing on the above definitions, we now introduce the following assumption. Q M T . The poset (V^C) of all testable physical propositions associated with statements of <J>T{X) (equivalently, with atomic statements of C{x)) coincides in QM with the lattice (£(S), C) of all closed subsets of S. Assumption QMT is intuitively natural, and can be justified by using the standard statistical interpretation of QM. We do not insist on this topic here for the sake of brevity. We note instead that assumption QMT implies that the posets ( 0 T ( ^ ) / « , - < ) and (Vj,, C), on one side, and the lattices (£(S),C), {£{%), C), (£,-<) on the other side, are order-isomorphic. Therefore also the operations of meet, join and orthocomplementation on (0T(:E)/RS, <) and (Pj., C) will be denoted by the symbols r{x), (PUX))^S\PI(XV f

f

(6-4)

P a(x)n4(x)=P a(x)®4(xy

(6-5)

Pfa(x)UPf0(x)^Pl{x)wP0{xy

(6-6)

The isomorphisms above allow one to recover QL as a quotient algebra of sentences of C{x). They, however, make intuitively clear that associating the properties (or 'propositions') of QL with sentences of C{x) is not trivial. The association requires indeed selecting testable wffs of {x) and the lattice operations of QL. To this end, let us note that statements (i), (ii) and (iii) in Sec. 3, e Whenever the dimension of V. is finite, the lattice {C{H), C) and/or the lattice (C(S), C) can be identified with Birkhoff and von Neumann's modular lattice of experimental propositions, which was introduced in the 1936 paper that started the research on QL. 25 This identification is impossible if the dimension of H is not finite, since {C{H), C) and (£(S), C) are weakly modular but not modular in this case. Birkhoff and von Neumann's requirement of modularity has deep roots in von Neumann's concept of probability in QM according to some authors.2

170

if compared with Eqs. (6.4), (6.5) and (6.6), respectively, yield, for every a(x)J(x) e<j>T{x), pL(x)^S\pfa{x)D{pi(x))\

P«(x)v/3(*) 5P f aix) Upf0{x) C pfa{x)

(6.7)

&pf0(x).

(6.9)

Eq. (6.8) shows that, if a(x) and /3(x) belong to <j>r{x), then a(x) A0(x) belongs to CJ>T(X), and establishes a strong connection between the connective A of £{x) and the lattice operation |fj) of QL. Eqs. (6.7) and (6.9) establish instead only weak connections between the connectives -• and V, from one side, and the lattice operations -1 and iyj, from the other side. Hence, no simple structural correspondence can be established between C(x) and QL. One can, however, obtain a more satisfactory correspondence between the sentences of a suitable language and the 'propositions' of QL by using a fragment of C{x) in order to construct a new quantum language JCTQ(X), as follows. First of all, we consider two properties E,F € £ and observe that, since the mapping \ introduced in Eq. (6.2) is bijective, E and F coincide whenever they are represented by the same subspace of C(H). This implies that the following sequence of equivalences holds. PEM=PF(X)

%

E = F

^

E{x)nF{x)

iff

E(x) = F(x).

It follows in particular that every equivalence class of <J>T(.X)/~ contains one and only one atomic wff of C{x). Since the set £{x) of all atomic wffs of C(x) (Sec. 2) belongs to 4>T(X), we conclude that the correspondence that maps every a(x) € <J>T(X) onto the atomic wff Ea(x), the existence of which is guaranteed by Eq. (4.1), is a surjective mapping. Moreover, this mapping maps all physically equivalent wffs of (J>T{X) onto the same atomic wff o f f (a;). Secondly, let us consider the set (j>^(x) of all wffs of C(x) which either are atomic or contain the connective A only. Because of Eq. (6.8), the proposition associated with a wff a(x) A ft(x) of this kind belongs to VT, hence a(x) A f3(x) belongs to <J)T{X), SO that <j>^{x) C 4>T(X). Then, let us introduce a new connective ->Q (quantum negation) which can be applied (repeatedly) to wffs of cj>/\(x) following standard formation rules for negation connectives. We thus obtain a new formal language CTQ{X), whose set of wffs will be denoted by <J>TQ(X). We adopt the semantic rules introduced in Sec. 2 for all wffs of <j)/\(x) C (f>Tq(x), and complete the semantics of £TQ {%) by means of the following rule.

171

Q N . Let a(x) e <J>TQ{X) and let a wff Ea{x) e £{x) exist such that a{x) is true iff Ea{x) is true. Then, -TQ{X), an elementary wff Ea(x) exists such that a(x) is true iff Ea(x) is true. This conclusion has the following immediate consequences. (i) One can define, for every interpretation p of the variable x and state S, an assignment function T | : <J>TQ{X) —> {T,F}. Hence, a logical preorder and a logical equivalence relation (that we still denote by the symbols < and =, respectively, by abuse of language) can be defined on <J>TQ(.%) by using the definitions in Sec. 2 with <J>TQ{X) in place of <j){x) and Tg in place of o-ps. (ii) One can associate a physical proposition with every a(x) e (J>TQ (Z) by using Eq. (3.1) with r | in place of aps. Hence a physical preorder and a physical equivalence relation (that we still denote by the symbols -< and « , respectively, by abuse of language) can be defined on (PTQ(%) by using the definitions in Sec. 3 with 4>TQ{X) in place of (j>(x) (one can also show that « coincides with = on (J>TQ(X)). (iii) The notion of testability introduced in Sec. 4 can be extended to CTQ{X) by using Eq. (4.1) with (J>TQ(X) in place of (j>{x), obtaining that all wffs of TQ{X) coincides with V^. It follows from (ii) and (iii) that {$>TQ{X)I~,-£) is isomorphic to the lattice {Vj., C), so that these two order structures can be identified. The set of connectives defined on CTQ {X) can now be enriched by introducing derived connectives. In particular, a quantum join can be defined by setting, for every a(x),p(x) e <J>TQ{X), a(x) VQ /3(x) = ^Q(->Qa(x)

A

-Q/3(X)).

(6.10)

It is then easy to show that the following equalities hold. < , « ( , ) = (^(x))" 1 .

(6- 11 )

Pfa(X)^(X)=Pfa{X)^P0(Xy

( 6 - 12 )

p{(x)vQ(3(x)=Pfa(x)®P0{xy

(6-13)

The equations above establish a strong connection between the logical operations defined on <J>TQ(X) and the lattice operations of QL. Hence, a structural correspondence exists between £TQ(X) and QL, and the latter can be recovered within our general scheme also by firstly considering the

172

set of all elementary wffs of C{x), and then constructing CTQ{X) and the quotient algebra {4>TQ{X)I~,-<). It is now apparent that the semantic rules for quantum connectives have an empirical character (they depend on the mathematical representation of states and properties in QM and on assumption QMT) and that they coexist with the semantic rules for classical connectives in our approach (the deep reason of this is, of course, our adoption of the SR interpretation of QM). In our opinion, these conclusions are relevant, since they deepen and formalize a new perspective on QL that has been propounded in some previous papers 1 0 - 1 8 and is completely different from the standard viewpoint about this kind of logic. To conclude, let us observe that a further derived connective —> can be introduced in

<J>TQ(X)

by setting, for every a(x),/3(x)

G

Q <J>TQ{X),

a{x) - • 0{x) = {~>Qa{x)) VQ (a(x) A /?(*)).

(6.14)

w One can thus recover within £TQ(%) the Sasaki hook, the role of which is largely discussed in the literature on QL. 2 ' 3,24 7. Quantum truth The general notion of certainly true introduced in Sec. 3 is denned for all wffs of C(x). Yet, according to our approach, only wffs of <j)r{x) can be associated with empirical procedures which allow one to check whether they are certainly true or not. Whenever a(x) € ^T(X), the notion of certainly true can be worked out in order to define a verificationist notion of quantum truth (Q-truth) in QM, as follows. Q T . Let a(x) 6 <J>T{X). Then, we put:

a(x) is Q-true in S € S iff 5 € pLx\', a{x) is Q-false in S 6 S iff S e (pfa{x))L; a(x) has no Q-truth value in S e <S (equivalently, ct(x) is Q indeterminate in S) iff S e S \ (pLx\ u (Pari))"1)It obviously follows from definition QT that a(x) is Q-true in S iff it is certainly true in S. Definition QT can be physically justified by using the analysis of the notion of truth in QM recently provided by ourselves26 and successively deepened by one of us. 21 We only note here that it is equivalent to defining a wff a(x) e <j>{x) as Q-true (Q-false) in 5 iff: (i) a(x) is testable;

173

(ii) a(x) can be tested and found to be true (false) on the physical object x without altering the state S of x. The proof of the equivalence of the two definitions is rather simple but requires some use of the laws of QM (see again Refs. 21 and 26). It is apparent that the notions of truth and Q-truth coexist in our approach. Indeed, a wff a(x) e (f>{x) is Q-true (Q-false) for a given state S of the physical system iff it belongs to <J>T{X) and it is true (false) independently of the interpretation of the variable x (equivalently, iff it belongs to T{X) and can be empirically proved to be true or false without altering the state S of x). This realizes an integrated perspective, according to which the classical and the quantum conception of truth are not mutually incompatible. 21,26 ' 27 However, definition QT introduces the notion of Q truth on a fragment only (the set <J>T{X) C {x)) of the language C{x). If one wants to introduce this notion on the set of all wffs of a suitable quantum language, one can refer to the language CTQ{X) constructed at the end of Sec. 6. Then, all wffs of TQ{X) are testable, and definition QT can be applied in order to define Q-truth on CTQ{X) by simply substituting 4>TQ(X) to (f>r(x) in it. Again, classical truth and Q-truth may coexist on £TQ{X)

in our approach.

Let us close this section by commenting briefly on the notion of truth within standard interpretation of QM. Whenever this interpretation is adopted, the languages C{x) and CTQ{X) can still be formally introduced, but no classical semantics can be defined on them because of the impossibility of defining, for every S 6 S and E 6 S, extsE (see Sec. 2). One can still define, however, a notion of Q-truth for CTQ{X). Indeed, one can firstly introduce a mapping x : a{x) G TQ(X) —• Ea G £ by means of recursive rules, as follows. For every a(x) € TQ(X), For every a(x),fi(x) € 4>TQ(X),

xhQ<*(x)) = E£, x(a(x) A Pi.x) = EafibEp.

Then, one can associate a physical proposition va,x\ € C(S) with every a(x) € <J>TQ{X) by settingp f a , x \ = 0(Ea). Finally, one can define Q-truth on 4>TQ{X) by means of definition QT, independently of any classical definition of truth. It is apparent that the above notion of Q-truth can be identified with the (verificationist26) quantum notion of truth whose peculiar features have been widely explored by the literature on QL (in particular, a tertium non datur principle does not hold in CTQ{X)). Hence, the interpretation of QL as a new way of reasoning which is typical of QM seems legitimate. But

174

this widespread opinion is highly problematical. Indeed, whenever S is given, some wffs of <J>TQ(X) have a truth value, some have not, quantum connectives are not truth-functional and the notion of truth appears rather elusive and mysterious. 5 Accepting our general perspective provides instead a reinterpretation of the notion of truth underlying the standard interpretation of QM, reconciling it with classical truth, and allows one to avoid the paradoxes following from the simultaneous (usually implicit) adoption of two incompatible notions of truth (classical and quantum). 8. The pragmatic interpretation of QL The definition of Q-true in S as certainly true in S for wffs of <pr(x) in Sec. 7 suggests, intuitively, that the assertion of a sentence a(x) of 4>T(X) should be considered justified in S whenever a[x) is Q-true in S, unjustified otherwise. This informal definition can be formalized by introducing the assertion sign h and setting h a(x) is justified (unjustified) in S iff a(x) is Q-true (not Q-true) in S. The set of all elementary wffs of T(X), each preceded by the assertion sign r-, can be identified with the set of all elementary assertive formulas of the quantum pragmatic language CQ introduced by one of the authors in a recent paper 21 in order to provide a pragmatic interpretation of QL f . The set ip® of all assertive formulas (afs) of CQ is made up by all aforesaid elementary afs plus all formulas obtained by applying recursively the pragmatic connectives N, K, A to elementary afs. For every S E <S a pragmatic evaluation function ITS is defined which assigns a justification value (justified/unjustified) to every af of ip][ and allows one to introduce on tp^ a preorder -< and an equivalence relation as following standard procedures. More important, a p-decidable sublanguage CQD of CQ can be constructed whose set <$\D of afs consists of a suitable subset of all afs of tp^ which have a justification value that can be determined by means of empirical procedures of proof (in particular, all elementary afs of ip® belong to ^D)CQD can then be compared with the quantum language CTQ(X) introduced f

It must be noted that the pragmatic interpretation of QL has some advantages with respect to the interpretation propounded in Sec. 6. In particular, it is independent of the interpretation of QM that is accepted (standard or SR), while our interpretation in this paper follows from adopting a classical notion of truth, hence from accepting the SR interpretation of QM.

175 at the end of Sec. 6 by constructing a one-to-one mapping r of onto 9ADI as follows. For For For For

every every every every

<J>TQ(X)

E(x) E 4>TQ{X), T(E(X)) =\- E(x), ot(x) e (J>TQ{X), T(-IQOL(X)) = JV h a{x), a(x),fi(x) e (J>TQ(X), r(a(x) A/3(a;)) = h a(x)K V- /?(x), a(x),/3(x) € <J>TQ{X), T(OL(X) VQ /3(X)) = h a{x)A h fl(x).

Indeed, it is rather easy to show (we do not provide an explicit proof here for the sake of brevity) that the mapping r preserves the preorder -< and the equivalence relation RS (in the sense that a{x) -< /3(x) iff r(a(x)) -< r(/3(x)), and a{x) « /?(a:) iff T(O:(:E)) « r(/3(a;))). Moreover, the wff a(x) 6 <J>TQ(X) is Q-true iff the af T(a(a;)) 6 ^D is justified, which translates a semantic concept (Q-true) defined on the language £TQ(X) into a pragmatic concept (justified) defined on the pragmatic language CQD. Bearing in mind our comments at the end of Sec. 6, we can summarize these results by saying that QL can be interpreted as a theory of the notion of testability in QM from a semantic viewpoint, a theory of the notion of empirical justification in QM from a pragmatic viewpoint. The two interpretations can be connected, via the mapping T, in such a way that Q-true transforms into justified, which is intuitively satisfactory. 9. Physical propositions and possible worlds The formal language C(x) introduced in Sec. 2 is exceedingly simple from a syntactical viewpoint, even if it is very useful in order to illustrate what physicists actually do when dealing with QL. Its syntactical simplicity has forced us, however, to set up a somewhat complicate semantics, in which, in particular, states are formally treated as possible worlds of a Kripkelike semantics. A less intuitive but logically more satisfactory approach should provide an extended syntactical apparatus, simplifying semantics. This could be done by enriching the alphabet of C(x) in two ways: (i) adding a universal quantifier (with standard semantics); (ii) adding the set of states as a new class of monadic predicates of C(x). Let us comment briefly on these possible extensions of C(x). Firstly, let (i) only be introduced. Then, a family of individual propositions can be associated with the quantified wff (Vx)a(x), and a proposition P(vx)a(x) — Dpp^/j can be associated with it. Hence, we get Pfa{x) = P(V*)a(z)

176

which provides a satisfactory interpretation of the physical propositions introduced in Sec. 3 and of the related notion of certainly true. Second, let us note that considering states as possible worlds is a common practice in QL, 3 but it doesn't fit well with the standard logical interpretation of possible worlds. In order to avoid this problem, one could introduce (ii), as one of us has done, together with other authors, in several papers. 17,18 In this case, states are not considered possible worlds, propositions as denned in the present paper are not propositions in the standard logical sense (rather, an 'individual proposition' associated with a wff a(x) is the set of all states which make a sentence of the form S(x) ->• a(x) true in a given interpretation of x, while a 'physical proposition' is a set of 'certainly yes' states which make a sentence of the form (\/x)(S(x) -> a(x)) true). We do not insist here on this more general scheme, and limit ourselves to observe that it is compatible with a standard Kripkean semantics, which can be enriched by introducing physical laboratories in order to characterize the truth mode of empirical physical laws in more details and connect the notions of probability and frequency.17,18 Yet, of course, an approach of this kind would make much less direct and straightforward the interpretation of QL that we have discussed in this paper. References 1. M. Jammer, The Philosophy of Quantum Mechanics (Wiley, New York, 1974). 2. M. Redei, Quantum Logic in Algebraic Approach (Kluwer, Dordrecht, 1998). 3. M. Dalla Chiara, R. Giuntini and R. Greechie, Reasoning in Quantum Theory (Kluwer, Dordrecht, 2004). 4. D. Aerts, in Quantum Physics and the Nature of Reality, D. Aerts and J. Pykacz eds. (Kluwer, Dordrecht, 1999). 5. B. C. van Praassen, in The Logico-Algebraic Approach to Quantum Mechanics, Vol. I, C. A. Hooker ed. (Reidel, Dordrecht, 1975). 6. J. S. Bell, Physics 1, 195 (1964). 7. N. D. Mermin, Rev. Mod. Phys. 65, 803 (1993). 8. J. S. Bell, Rev. Mod. Phys. 38, 447 (1966). 9. S. Kochen and E. P. Specker, J. Math. Mech. 17, 59 (1967). 10. C. Gaxola and L. Solombrino, Found. Phys. 26, 26, 1329 (1996b). 11. C. Garola, in Quantum Physics and the Nature of Reality, D. Aerts and J. Pykacz eds. (Kluwer, Dordrecht, 1999). 12. C. Garola, Found. Phys. 30, 1539 (2000). 13. C. Garola, Found. Phys. 32, 1597 (2002). 14. C. Garola, Found. Phys. Lett. 16, 599 (2003). 15. C. Garola and J. Pykacz, Found. Phys. 34, 449 (2004). 16. C. Garola, Int. J. Theor. Phys. 44, 807 (2005). 17. C. Garola, Int. J. Theor. Phys. 30, 1 (1991).

177 18. C. Garola and L. Solombrino, Found. Phys. 26, 1121 (1996a). 19. A. Tarski, in Semantics and the Philosophy of Language, L. Linski ed. (Urbana, University of Illinois Press, 1944). 20. A. Tarski, in Logic, Semantics, Metamathematics, A. Tarski ed. (Oxford, Blackwell, 1956). 21. C. Garola, quant-ph/0507122 (2005). 22. G. W. Mackey, The Mathematical Foundations of Quantum Mechanics (Benjamin, New York, 1963). 23. C. Piron, Foundations of Quantum Physics (Benjamin, Reading, MA, 1976). 24. E. Beltrametti and G. Cassinelli, The Logic of Quantum Mechanics (Addison-Wesley, Reading, MA, 1981). 25. G. Birkhoff and J. von Neumann, Ann. Math. 37, 823 (1936). 26. C. Garola and S. Sozzo, Found. Phys. 34, 1249 (2004). 27. C. Garola, quant-ph/0510199 (2005).

THE ELECTROMAGNETIC CONCEPTION OF NATURE AND THE ORIGINS OF QUANTUM PHYSICS ENRICO A. GIANNETTO Department of 'Scienze delta Persona', University of Bergamo, Piazzale S. Agostino2, Bergamo 24129, Italy The rise of quantum physics is analyzed by outlining the historical context in which different conceptions of Nature (mechanistic, thermodynamic and electromagnetic ones) were in competition to give a foundation to physics. In particular, electromagnetic conception roots of quantum physics are shown: since Larmor's first trials to Poincar6's and to Heisenberg's new mechanics.

1. Introduction 1.1. Conceptions of Nature As well known, in the late XlXth century physics was no more mechanics only, but also thermodynamics and electrodynamics. This new situation implied the problem of the very foundations of physics, and the correlated issue of the hierarchical relations among these different physical disciplines [1]. There were at least four different «fighting» conceptions of Nature. The socalled Energetic conception of Nature, which was looking at energy as the fundamental unifying concept of physics and had its most important proponents in Georg Helm (1851-1923) and Wilhelm Ostwald (1853-1932). The Thermodynamic conception of Nature, which had energy, entropy and system as fundamental concepts and was looking at thermodynamics as the real foundation block of physics. Its major exponents were Pierre Duhem (18611916) and Max Planck (1858-1947). The Mechanical conception of Nature, which was the most conservative one as searching for a mechanical reduction of the other physical disciplines and of all the physical concepts in terms of mass, space and time by means of the models of material point and action at-a-distance forces. Hermann von Helmholtz (1821-1894), Heinrich Hertz (1857-1894) and Ludwig Boltzmann (1844-1906) were the most representative scientists of this perspective. The Electromagnetic conception of Nature, based on the concepts of field, energy and charge was looking at electromagnetism theory as the foundation

178

179 level of the other physical disciplines. Among the physicists who gave the most relevant contributions to this perspective there are: Hendrik Antoon Lorentz (1853-1928), Joseph Larmor (1857-1942), Wilhelm Wien (1864-1928), Max Abraham (1875-1922) and Henry Poincare (1854-1912). The electromagnetic conception of Nature has deep roots in the history of mankind and certainly has been developed by the elaboration of the Brunian-Leibnizian physics and tradition. On one side, it has been developed within the German physics or Naturphilosophie, on the other side mainly within English physics. Electromagnetism had shown that physical reality was not only inertial and passive matter, but also dynamical, active electromagnetic field, irreducible to a mechanical matter model. Furthermore, Maxwell equations present vacuum solutions, that is, in absence of charged matter: electromagnetic field exists even when there is no matter. Thus, the possibility of a new non-dualistic view of physical reality was considered: if matter cannot exist without electromagnetic field and electromagnetic field can exist without matter, electromagnetic field could be the only physical reality and matter could be derived from the field. 1.2. Electromagnetic Conception of Nature and Relativity Usually, the electromagnetic conception of Nature has been considered as superseded by the developments of XXth century physics. However, a deep historical inquiry shows that the electromagnetic conception of Nature is at the roots of both the relativistic and quantum transformations of physics. Concerning relativity, the 1900, 1902, 1904 and (5 June) 1905 papers written by Poincare [2] show as special relativity dynamics derived from, and was a first realization of, the electromagnetic conception of nature. Einstein's (30 June) 1905 paper was only an incomplete mechanistic version of this new dynamics. This historical recognition is also fundamental to understand the first reception of special relativistic dynamics in all countries, and in particular in Italy. A first complete presentation of this new dynamics appeared in the July 1905 paper written by Poincare and published in 1906 [3]. In this paper the new dynamics was presented as an invariant one by the Lorentz-Poincare transformation group, and it was derived by Maxwell's theory of electromagnetism and contained also a theory of gravitation (absent in Einstein's 1905 paper). The starting point was electromagnetic self-induction phenomenon related to the so-called radiation reaction. When a charged particle is submitted to the action of an electromagnetic field, it is accelerated and it irradiates. This

180 radiation modifies the field and the new field modifies the acceleration of the particle, which again irradiates and so on. In this way, the electromagnetic field depends on all the time derivatives of position up to the infinite one. This means that there is also a contribution to the field force proportional to the acceleration, the coefficient of which involves an electromagnetic mass, that is an electromagnetic contribution to the particle inertia. At this point, the question was: is it possible that mechanical (inertial and gravitational) mass was not a primitive concept and indeed is wholly due to this electromagnetic effect? Poincare, among other scientists, realized that this was the case also for non-charged matter as long as is constituted by charged particles: that is mechanical mass was nothing else than electromagnetic mass, and electromagnetic mass is not a static fixed quantity but depends on velocity. Mass is so related to the electromagnetic field energy by the today well-known (now considered from a mechanistic and not electromagnetic perspective) equation: m = Ee.m.fieId/ c 2 . If mass is nothing else than electromagnetic field energy and charge can be defined, via Gauss' theorem, by the electric field flux through a certain space surface, matter can be completely understood in terms of the electromagnetic field, and it has also active and dynamical features beyond the passive and inertial ones. If mass must be understood in terms of the electromagnetic field, mechanics must be derived by electromagnetism theory which becomes the fundamental theory of physics. If mass changes with velocity, Newtonian mechanics is no more valid and must be modified. The new mechanics must have the same invariance group of electromagnetic theory, that is the LorentzPoincare transformation group, to which a new relativity principle and a new gravitation theory (even gravitational mass changes with velocity) must also be conformed. 2. Electromagnetic Conception of Nature and Quantum Physics The rising of quantum physics is conventionally related to the works of Planck during the years 1899-1900 [4]. However, Joseph Larmor, within an electromagnetic conception of Nature, was working to understand the atomic structure of matter in terms of the electromagnetic field at least since 1893 [5]. After leaving the idea of a "vortex atom", he considered the electrons as vortices into the sea of the electromagnetic field: this idea lead him to what, many years later, was called a "quantum atom". Electrons as rotations into the electromagnetic field constitute stable, stationary non-radiant configurations of atoms: these configurations correspond to given discrete values of the conserved

181 angular momentum. Radiation is emitted or absorbed by atoms by impulses only when these configurations change in respect to the minimal total energy. Thus, emission of radiation and loss of energy were not related to the absolute translations of the electron as an accelerated, charged material particle, but to the relative changes (within the atoms) of the inertial rotational motions constituting electrons (in any stable state the change of velocity in a period is zero). This idea furnished an explanation of atomic spectra and even a prediction of the Zeeman effect. This electromagnetic conception of the atomic matter structure, that is the recognition of these atomic matter structures within the electromagnetic field, Larmor understood, would be also the key to the calculus of specific heats in terms of internal energy and equal partition of energy within the kinetic theory of gases. Planck wanted to show the universality of thermodynamics and its second principle showing that it holds also for electromagnetic phenomena. Planck was forced to use Boltzmann's statistical thermodynamics concept of entropy, but showed that thermodynamics cannot be reduced to mechanics because heat is not only disordered matter motion but also electromagnetic radiation and that thermodynamics could be deduced from electromagnetism theory too. In 1900 Planck introduced discrete values of energy as heuristic tool within statistical thermodynamics of radiation to fit black-body radiation distribution experimental data. That is, energy was treated by Planck not as a continuous mathematical variable, but discrete: E = n h v , where n is an integral number and so energy is given by an integral multiple of the product of a universal constant h = 6.55 10"27 erg . sec with the physical dimension of an action and the radiation frequency. Planck's words made reference to "energy elements" (Energie-eletnenten), but Planck did not want to introduce an essential discontinuity within Nature but only to solve by the mathematical artifact of discreteness the problem to fit experimental data: he did not want to modify classical physics or to make a revolution. In 1899 Planck had already introduced this constant naming it "b" and not "h", it did not denote an action and it was a constant in the different theoretical context of finding an absolute system of natural units of measure. The first actual physical meaning to this constant was given not by Einstein, but by Larmor in 1902 within his electromagnetic conception of Nature [6]. Following Larmor, Planck's constant was not related to a mathematical artifact but had to be interpreted in terms of the relationship between matter and (ether) electromagnetic field, that is as the ratio between matter energy (given by electromagnetic field energy) and radiation frequency. Planck's constant, for

182 Larmor, was a quantum of the conserved angular momentum to be related to atomic electrons considered as vortices within electromagnetic field. Larmor proposed also to leave the abstract oscillator model of matter used by Planck and to take count of the actual electromagnetic nature and origin of matter. This implied to use the simple idea of 'elementary receptacles of energy', that is of cells in the phase space of physical systems. This idea was deduced from the consideration of the nature of radiation, constituted by discrete elements given by short trains of simple undulations. The phase space reformulation of Planck's problem lead to the discreteness of the atomic conserved angular momentum from which was deduced the discreteness of energy. J. W. Nicholson in 1912 [7] explored this explanation of the atomic structure and his work was the starting point of Niels Bohr's model. From Larmor's perspective, from the electromagnetic conception of Nature, the discrete, discontinuous, quantum nature of matter and radiation is easily understood because matter is derived from the fundamental physical reality given by the electromagnetic field. Thus, electromagnetic field must present wave but also corpuscular aspects to explain the origin of matter, and matter particles must present corpuscular but also wave aspects as long as they derive from the electromagnetic field. Bohr [8] reconsidered Nicholson's model but completely changing its meaning: atom was no more understood in terms of the electromagnetic conception of Nature but in terms of an axiomatic approach in which the meaning of Planck's constant is no more given by the electromagnetic nature of the atomic matter structure but by an abstract quantum of mechanical action. Bohr followed Arnold Sommerfeld's perspective [9] which presumed to understand all the things in terms of an a priori assumed and unexplained constant, that is Planck's constant: electromagnetic as well as thermodynamic and mechanical models were considered to be no more suitable because electromagnetic field theory as well as thermodynamics and mechanics must be reformulated in order to fit experiments and to overcome the problem of their incompatibility. However, Sommerfeld and Bohr seem not to understand that their interpretation of Planck's constant was mechanical and this put mechanics at the fundamental level of physics, restating a new mechanistic perspective. It happened something like to the procedure of axiomatization which led to the loss of electromagnetic meaning of the light velocity constant c in the mechanistic version of relativity dynamics given by Einstein. The meaning variance of a revolutionary item (c as well as h), together with the change in its "title" ("Universal Constant"), is a well known process which leads to a restoration, to a

183 dogma to be understood "mechanically" and to a myth of the foundations of a new religion as well as a new scientific theory. From Larmor's perspective, Planck's statistical thermodynamics of electromagnetism implied that classical electromagnetism continuous variables lose meaning and cannot be precisely determined, but only probabilistically just in order to derive matter corpuscles from the electromagnetic field. In 1905-1906 Einstein [10], as well as he had done with Poincare's new electromagnetic relativistic dynamics, by criticizing Planck noted the discontinuous and probabilistic character of radiation but inverted Larmor's perspective and introduced the quanta of light to reduce electromagnetism (as a statistical theory) to corpuscular mechanics. In 1911-1912, from an electromagnetic conception of Nature, Poincare [11] showed that these new characters of light and electromagnetic field cannot be understood in terms of the old corpuscular mechanics, and, on the contrary, these changes within electromagnetic theory imply a new mechanics. Indeed, if mechanics has to be built on electromagnetism and electromagnetism must be changed, then also mechanics must be modified: there must be a new "electromagnetic dynamics". From this perspective, electromagnetism cannot be reduced to mechanics, but, on the contrary, mechanics must be modified again and in more radical way by the relativistic electromagnetic dynamics: mechanics must be intrinsically probabilistic even for only one material particle, because the origin of matter is electromagnetic and electromagnetic radiation is discontinuous. Poincare's new electromagnetic discontinuous mechanics based on a discontinuous electromagnetic action was mathematically very difficult for the other physicists and was not understood at all: it was the first form of a new revolutionary "electromagnetic quantum mechanics". Only after many years, in 1925, Heisenberg [12] stated the necessity of, and posed the basis for, a new quantum mechanics: his starting point was not the electromagnetic conception of Nature, but an operational perspective. Heisenberg showed that at the atomic or microphysical level the only measurable variables were the electromagnetic variables of frequency and intensity of electromagnetic radiation absorbed or emitted by electrons within atoms. From this point of view, mechanical variables, as long as they are not directly measurable and cannot be objects of absolute experimentation, intuition or visualization at the atomic microphysical level, must be redefined in terms of such measurable electromagnetic variables. This implied, as then stated in 1927 by Heisenberg himself [13], a fundamental indeterminacy of mechanical variables. If physical reality is only what can be experimentally measured, from

184 Heisenberg's perspective the electromagnetic conception of Nature can be deduced without any aprioristic assumption. Its deduction follows merely from the request of an operational definition of physical variables at the microscopic level. Unfortunately, this original derivation and foundation of quantum mechanics has been completely forgotten and removed. It was for ideological reasons that mechanics must be maintained independent from electromagnetism and at the foundation level of the physical sciences. This priority of mechanics is related to the mechanistic conception of Nature. Considering Nature and the other nonhuman living beings as machines, that is as inert and passive matter, is the precondition to avoid any ethical problem in respect of Nature and the other nonhuman living beings and to the complete violent dominion over, and exploitation of, Nature and the other living beings. References 1.

E. Giannetto, Saggi di stone del pensiero scientifico (Sestante, Bergamo 2005). 2. H. Poincare, Revue de Metaphysique et Morale 6, 1 (1898); H. Poincare, Arch. Need. 5, 252 (1900); H. Poincare, La Science et I'Hypothese (Flammarion, Paris, 1902); H. Poincare, Bulletin des Sciences Mathematiques 28, 302 (1904); H. Poincare, Comptes Rendus de I'Academie des Sciences 140, 1504 (1905). 3. H. Poincare, Rendiconti del Circolo Matematico di Palermo 21, 129 (1906). 4. M. Jammer, The Conceptual Development of Quantum Mechanics (McGraw-Hill, New York, 1966); M. Planck, Berliner Berichte 18 (May), 440 (1899); M. Planck, Verhandlungen der Deutschen Pysikalischen Gesellschaft 2 (14 December), 237 (1900), Engl, transl. in The Old Quantum Theory, D. ter Haar ed., (Pergamon Press, Oxford, 1967). 5. J. Larmor, part I abstract, in Proc. Roy. Soc. 54, 438 (1893); part I, in Phil. Trans. Roy. Soc. 185, 719 (1894); part II abstract, in Proc. Roy. Soc. 58, 222 (1895); part II, in Phil. Trans. Roy. Soc. 186, 695 (1895); part III abstract, in Proc. Roy. Soc. 61, 272 (1897); part III, in Phil. Trans. Roy. Soc. A190, 205 (1897); J. Larmor, Phil. Mag. (5) 44, 503 (1897); J. Larmor, Aether and Matter (Cambridge University Press, Cambridge, 1900); B. Giusti Doran, "Origins and Consolidation of Field Theory in NineteenthCentury Britain: From the Mechanical to the Electromagnetic View of Nature", Historical Studies in the Physical Sciences 6 (Princeton University Press, Princeton, 1975). 6. J. Larmor, "Theory of Radiation", Encyclopedia Britannica 8 (vol. XXXII of the complete work), 120 (1902), Black, London. J. Larmor, Reports Brit.

185

7. 8. 9. 10. 11.

12. 13.

Assoc. Adv. Sci. 1902, 546 (1903) (abstract of a paper presented at the Belfast meeting); J. Larmor, Proc. Roy. Soc. London A83, 82 (1909); J. Larmor, Preface (1911) to The Scientific Papers of S. B. McLaren (Cambridge University Press, Cambridge, 1925). J. W. Nicholson, Monthly Notices of the Royal Astronomical Society 72, 49, 139, 677, 693, 729 (1912). N. Bohr, Phil. Mag. 26, 1, 476, 857 (1913). A. Sommerfeld, Physikalische Zeitschrift 12, 1057 (1911). A. Einstein, Annalen der Physik 17, 132 (1905); A. Einstein, Annalen der Physik 20, 199(1906). H. Poincare, Comptes Rendus de I'Academie des Sciences 153, 1163 (1911); H. Poincare, Journal de Physique theorique et appliquee' s. 5, t. 2, 5 (1912); H. Poincare, Revue scientifique s. 4,1.17, 225 (1912); R. Dugas, Histoire de la mecanique (Griffon, Neuchatel 1955), Engl, transl. by J. R. Maddox, A History of Mechanics (Dover, New York 1988). W. Heisenberg, Zeitschrift fur Physik 33, 879 (1925); M. Born, W. Heisenberg and P. Jordan, Zeitschrift fur Physik 35, 557 (1926). W. Heisenberg, Zeitschrift fUr Physik 43,172(1927).

W H A T W E T A L K ABOUT W H E N W E TALK A B O U T UNIVERSE C O M P U T A B I L I T Y SALVATORE GUCCIONE Istituto di Fisica Teorica dell'Universita di Napoli, Mostra d'Oltremare, pad. 20, Napoli 80125 Email: [email protected]

lost in time, lost in space, and in meaning. (The Rocky Horror Picture Show)

In the present work we will not follow the road of searching for a general definition of Computable Universe, but rather we will limit ourselves to advance a modest proposal regarding some adequate minimal conditions for the definition of Computable Universe. (In the present work we will have to do with only one Universe. In other words we will not treat Computability in parallel Universes).

1. Section 1 A. It is possible to propose a thesis according to which physical universe is viewed as a (the?) computer: ".... no time, no space, and no law. The building element is the elementary 'yes, no' quantum phenomenon. It is an abstract entity. It is not localized in space and time." ([1], p. 570; but see also [2], [3], [4]). B. It is possible to propose a thesis, less strong, according to which physical processes can be viewed as computations (see, e. g., [5]). Thesis B is obviously less strong than thesis A, as, if we consider thesis A valid, then thesis B is also valid; whereas, if we consider thesis B valid, validity of thesis A does not follow. C. It is possible to propose a third thesis, according to which all physical theories, as such, are computable. D. It is possible, finally, to propose a thesis according to which all today physical theories are computable. It should be observed, en passant, that theses B, C, D do not seem to contain the strange assertion: "no laws". (Not even, so to say, computational laws?), "no time" and "no space" (anyway, what about meaning?).

186

187 It is clear that here "computability" means "effective computability", or - to those who accept the Turing-Church thesis - "calcolability using a Turing machine". More generally, having to do with physical processes, and usually with physical theories (with the exception of the "no law" in [1]), we should ask: what do we mean when we talk about computable physical theory! In my opinion, the definition proposed in 1974 by Kreisel [6], remains the most adequate definition of computable physical theory notwithstanding the fact that it is a feeble thesis , as observed in [7], where it was proposed to reinforce it by adding a requirement - epistemological in character - christened with the expression : "uniformity condition"([7], p. 161). Computability/non computability of today physical theories (or also of physical theories as such) has been discussed by various authors: see, e. g., [8], [9], [10], [11], [12], and overall [13], [14], [15], and [16]; in fact, these last four works report results of great interest in favour of the thesis that it is possible to find in Quantum Mechanics elements of non computability. And now an observation: when referring to physical theories, we usually speak of real numbers. For example, following Einstein in the second appendix of its "The Meaning of Relativity", we see that a continuum with a finite number of dimensions is necessary both in Newton mechanics and in Relativity. It is possible to assert that the continuum is at the basis of almost all theories of today physics. This is the reason why we talk of real numbers. It should be observed that this contrasts with the immaterial vision of Wheeler [1], which, however, will not be analysed here. (It is worthwhile to note, however, that partisans of the so called strong hypothesis of artificial intelligence never looked for, to my knowledge, [17], to merge their strong hypothesis in the extra strong vision of Wheeler). At this point it is useful to remember that Turing-computability has to do with natural numbers. It appears therefore necessary to proceed from Turing machines to computational effective procedures on real numbers: see, e. g., the definition given by Gzegorczyk in terms of recursive functionals [18], [19] or the - equivalent- definition by Pour-El and Richards [20], or the notion of computability on real numbers by L. Blum et al. [21].

188 2. Section 2 In the previous section we proceeded from the question: "is the Universe computable?" to questions - perhaps more precise from an epistemological point of view - regarding computability of single physical theories. We should now first underline that, in the light of our present knowledge, nothing seems to forbid that of two theories treating the same range of phenomena one is computable (according to the definition previously given), whereas the other is not computable. Geroch and Hartle, for example, in the work previously quoted, distinguish the formulation of a theory from its possible implementation through an algorithm (computability); in addition they identify, within a specific approach to Quantum Gravity, a counterexample to the hypothesis (generally implicitly assumed by the community of physicists) of computability of every physical theory. Geroch and Hartle, however, warn us that "one mathematical formulation of the theory may provide no algorithm for implementing the theory and yet another formulation does" ([16], p. 348). Note that in the previous assertion Geroch and Hartle clearly appear to accept Quine's doctrine of under-determination of theories to data: "the doctrine that natural science is empirically under-determined; under-determined not just by past observations but by all observable events " ([22], p. 313). Note that, as underlined by Quine in the same work, the doctrine of underdetermination of theories should not be confused with , e. g., the so called Duhem-Quine thesis, regarding which see also, e. g., [23]. At this point is appears necessary to introduce the notion of completeness of a physical theory for at least two reasons. The first reason is quite obvious: it can be expressed saying that physical science, by its own nature tends to elaborate complete theories. Even if this assertion is contrasted by some epistemological currents, irrealistic in character, more interested in the so called sociology of Science than in the logical structure of Science. The second reason is that Quinian doctrine of under-determination of scientific theories to data shows all its force when referred to complete theories. In fact, it could be argued that non complete theories could be completed in such a way that they would not be empirically under-determined. The relationship between completability and under-determination thus appears a key point of great epistemological interest. In addition, in the case treated here, a further element of interest is the addition to this key point of the element of computability.

189 The number of questions is therefore increased: one could, for example, ask if theories demonstrated to be uncomplete regarding a same range of phenomena, even if all computable and completely covering the designated phenomenological field, are reciprocally computable. In other words, the problem of the computability of the, so to say, reciprocal junctions of various theories would arise (see, e. g., [24]). But what do we mean here by completeness of a scientific theory? The notion of completeness in science has generated many dicussions (see, e. g., [25]). We remind, for example, that Einstein considered uncomplete theories both the quantum mechanics and the relativistic theory of gravitation. The first results uncomplete as it does not fulfill the condition that "every element of the physical reality must have a counterpart in the physical theory" ([26], p. 777), a condition which, in a way, is close to condition of logical completeness a la Tarski; whereas the second is uncomplete as it does not fulfill the condition that: "every field theory must be constructed only by means of the primitive notion of field" ([27], p. 76). (For a detailed analysis of Einstein's position on this question, see, e. g., [28], [29], [30]). In any case it is important to underline that no matter which reasonable definition of completeness for a scientific theory we consider, its possible non computability (i. e. the non computability of its mathematical apparatus) leads in any case to a situation of undecidability of the theory, via Goedel's first theorem of uncompleteness [31]. (Note that Geroch and Hartle, for example, have demonstrated the non computable case previously mentioned via a theorem of undecidability due to Hanken [32]). This, given any reasonable (from a physical point of view) semantic could lead to the incompleteness of the theory with regards to those semantics. We say "could lead" and not "leads" as Goedel's theorem, a rigore, deals with axiomatics and coherent theories. 3. Section 3 Considering what has been said in previous sections, a definition of computability of the Universe in terms of computability of physical theories which are complete and non reciprocally contradictory should face at least the two following orders of problems: a) problems related to theories which have been demonstrated to be non computable; b) the problems here previously characterized with the expression: "junction's computability". Essentially, we are concerned with the availability of "a unified theory of everything", i.e. a theory unifying Gravitation and all the Nuclear Forces,

190 Electromagnetism and the Quantum Mechanics. In addition such theory of everything should result computable. In any case, it is well known that we do not possess such a theory (computable or non computable) even if today promising candidates to such a role (independently from the problem of computability) are the so called theories of super strings. We will not follow here the way of searching for a general definition of Computable Universe, but will limit ourselves to follow a more modest road with the aim to propose a possible definition of the minimal computability of the Universe. It will be useful, to this aim, a short digression_on the notion of physical constants and on some recent works which question the same constancy of the so called physical constants. First let us recall the words by Levy-Leblond: "In most formulae of physics or, more generally, in most theoretical analyses of any physical phenomenon, there appears one or more physical constants. Some of these play an essential and pervasive role in physics. They are variously called general or fundamental or universal physical constants" ([33], p. 87). Among these constants we recall, as an example, the velocity of light, designated as c, the gravitational constant designated as G, Planck's constant designated as h, the constant of fine structure, designated as a and the charge of electron designated as e. The cosmological constant, generally designated as A, has some peculiar aspects. It was introduced by Einstein in 1917 [34], after his attempt to apply his formulation of general relativity to the Universe as a whole. His philosophical guideline was that the Universe is static and the introduction, in his equations, of the cosmological constant (which is not contradictory with but not generated within the mathematical structure) allowed a static Universe. (As regards the cosmological constant, we will limit ourselves to recall chapter V of the already quoted book by Barrow [9] with its rich bibliography relative not only to the cosmological constant but to physical constants in general, as well as the fundamental article of Weinberg [35] and the more recent work by Krauss and Turner [36]). In the present note we will consider only the more traditional physical constants and face two questions. First question: are the values of these constants measurable numbers according to the definition of Geroch and Hartle (16)? We recall that the notion of measurable number according to Geroch and Hartle is the following: "Regard number w as measurable if exists a finite set of

191 instructions for performing an experiment such that a technician given an abundance of unprepared raw material and an allowed error e is able by following those instructions to perform the experiment yielding ultimately a rational number within e of w." ([16], p. 542). Obviously, the instructions depend from e and prediction within a progressively decreasing error would require materials always new, new instructions, new ideas (see [16], p. 549 and [12]). Second question: are the values of such constants computable real numbers? Where, to indicate a number as computable real we will follow the definition: "Roughly speaking a computable real is one which can be effectively approximated to any desired degree of precision by a computer program given in advance. Thus a number n is computable since there exist finite recipes for computing it. When more precision is desired the computation may take longer, but the recipe itself does not change" ([20], p. 13). We note first that, whereas concerning computable numbers an increasing rational approximation to a real number can be obtained using the same recipe and only at the expenses of longer computation, for measurable numbers increasing approximation may require new recipes. Geroch and Hartle present examples of physical constants which are, so to speak, traditional and are measurable: for example the constant of fine structure a is measurable (see, e.g. [37]). In any case, a rigore, it is not given for granted that Geroch and Hartle measurability (G-H measurability) is a property of all physical constants. Let us consider the example of the velocity of light, which is particularly meaningful. In general, c is used to indicate the one way velocity of light, whereas we can only measure the velocity of light along a closed path. The constancy of one way velocity is directly deduced from two axioms of Einstein Special Relativity [38]. Then we attribute to the one way velocity of light the same numerical value (this one measurable!) as to the velocity of light along a closed path (this is the reason why conventionalism is mentioned in Special Relativity: Reichenbach and Gruembaum Thesis [39]. Also, for other points of view see, e.g., [40], [41], [42]). Geroch and Hartle ([16], p. 544) assert that any computable number is measurable and this seems acceptable (even though the example of velocity of light invites to caution). But certainly the opposite is not true. As they have previously demonstrated that the set of computable numbers is not coincident with the set of measurable numbers. The information that we have on computability / non computability of physical constants is almost nihil (even considering only those constants which, following Barrow, we have called

192 traditional). From the point of view of these notes this is an unpleasant hole. In fact we suggest here to take seriously into consideration the following definition. Definition 1. We will call computable physical theories in feeble sense a physical theory for which the values of all its constants are computable real numbers. Then a theory of everything computable in feeble sense could, with some precision, tell us something about computability of the Universe. Obviously it is not contradictory to envisage a theory completely devoid of constants. Such a theory, perhaps would have pleased Einstein as it, in addition to the disturbing dualism field / particle, would radically reduce everything to the shape of the field equations. However, presently, no theorical approach seems to lead to such theory. We will therefore for the moment avoid this point to face another problem: what if physical constant were not really constant? 4. Section 4 The so called constant of fine structure a represents, roughly speaking, a measure of the electromagnetic attraction between photons and electrons: its expression, in function of other physical constants is or = e2/ h c / 2 7t, where e represents the electron charge, h the Plank's constant and c the velocity light. Well, there is experimental evidence that ^increases, although slowly, in the cosmic time t ([43], [44], [45]). We said that a would grow slowly: in fact according to experimental results in the last 6-10 billion years there would have been a variation Aa/a - - 0,72 +/ - 0.18 x 105. A behaviour, however, that would be decidely of bad taste for a constant. It has been discussed if this variation depends exclusively from one of the constants e, h, c (see, e. g., [46], [47]) and it has been suggested (on the basis of thermodynamics of black holes [48]) a test which would lead to the control of the variability of the velocity of light c (but the problem of conventionality or not of the one way value of velocity of light would remain unresolved). Now a short digression regarding the argument based on the thermodynamics of black holes [48]. At the 17nth annual International Conference on General Relativity and Gravitation (GR 17 Conference) held in Dublin, July 18-24 2004 Hawking gave a lecture on his new calculations regarding Black Holes Information Loss.

193 In his controversial lecture - controversial mainly for the use of the mathematical technique known as the Euclidean path integral method in the place of the "more straightforward Lorentian approach to gravity" [49] (and on this subject I would advise to read the stimulating paper by R. L. Oldershaw: "The new Physics - Physical or Mathematical Science?" [50]) Hawking asserts that, in contrast with his own statements during the last 30 years, black holes do not destroy information. This is the text of the press release at GR17 for physicists and reporters: "One of the most intriguing problems in theoretical physics has been solved by Professor Stephen Hawking of the University of Cambridge. He presented his findings at GR17, an International Conference in Dublin on Wednesday 21 July. Black holes are often though of as being region of space into which matter and energy can fall and disappear forever. In 1974 Stephen Hawking discovered that when one fused the ideas of quantum mechanics with those of general relativity it was no longer true that black holes were completely black. They emitted radiation now known as Hawking radiation. This radiation carried energy away from the black hole which meant that the black hole would gradually shrink and then disappear in a final explosive outburst. These ideas led to a fundamental difficulty, the information paradox, the resolution of which is to be revealed in Dublin. The basic problem is that black holes, as well as eating matter, also appear to eat quantum mechanical information. Yet the most fundamental laws of physics demand that this information be preserved as the universe evolves. The information paradox was explored and formalised by Hawking in 1975. Since then, many have tried to find a solution. Whilst most physicists think that there must be a resolution of the paradox, nobody has really produced a believable explanation. In fact, seven years ago the issue prompted Hawking, together with Kip Thome of Caltech, to make a wager against John Preskill also of Caltech, that the information swallowed by black holes could never be recovered. On Wednesday, Hawking conceded that he has lost the bet. The way his new calculations work is to show that the event horizon, which is the surface of the black hole, has quantum fluctuations in it. These are the same uncertainties in position that were made famous by Heisenberg's uncertainty principle and are central to quantum mechanics. The fluctuations gradually allow all the information inside the black hole to leak out, thus allowing us to form a consistent picture. The information paradox is now unravelled. A complete description of this work will be published in professional journals and on the web in due course."

194 At the time of the present meeting (Cesena, 4-9 October 2004) comments to the Hawking talk are already available ([49], [51]-[53]; but see also the interesting paper by A. N. St. J. Farley and P. D. D'Eath [54]). Going back to our topic, we now face functions which are not constant but acquire different values depending on variation of t. Now a first question: could the values in cosmic time t of those pseudoconstants which would be slowly variable constants (CLV) (such as fine structure constant or) be measurable numbers according to Geroch and Hartle? (G-H measurable)? In any case we give the following definitions: Definition 2. A CLV t-funcion will be said G-H measurable if any of its values is G-H measurable. Definition 3. A CLV t-function will be said G-H computable if any of its G-H measurable values results computable. Of course in Definition 3 we use the term computable in the sense of PourEl and Richards [17] . In addition, following Geroch and Hartle, ([16], p. 544) we will hold that any computable number is a G-H measurable number. We could then say that: Definition 4. A theory will be said to be G-H computable in feeble sense if all its CLV t-function are G-H computable. We thus find a conclusion that under many aspects sounds familiar with Kreisel definition [6] but is more feeble in that it refers only to fundamental constants present in a theory and not to "any real number which is well defined (observable) according to the theory ([6] , p. 11). On the other side, the minimal definition (in the sense that it is hardly possible to ask less to a definition of computability for a scientific theory) of computable physical theory results sufficiently ample to inglobe physical constants which in reality result slowly variable (in this paper we will not discuss the notion of slowliness in variability). Finally, our Definition 4, is, as Kreisel definition in [6], of non uniform nature (following the ideas of Kalmar [55]. In fact we have said that: "Given a theory T for each fundamental constant of the theory there exists an effective procedure (in the sense of [10]) such that ".

195 To reinforce the definition we could ask (according with [7] that: " given a theory T there is an effective procedure (in the sense of [20]) such that for each fundamental constant of the theory we have ".As you can see it is once again question of permutation of logical quantifiers. References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12.

13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30.

J. A. Wheeler, International Journal of Theoretical Physics 81, 557 (1982). S. Lloyd, Nature 406, 1047 (2000). Y. J. Ng, Physical Review Letters 86, 2946 (2001). S. Lloyd, Physical Review Letters 88, 2946 (2002). S. Wolfran, Physical Review Letters 54, (1985), 735 (1985). G. Kreisel, Synthese 29, 11 (1974). S. Guccione, G. Tamburrini and S. Termini, AGORA' 17, 159 (1998). R. Penrose, The Emperor's New Mind (Oxford U. P., 1989). J. D. Barrow, Theories of Everything (Oxford U.P., 1989). J. Earman, A Primer on Determinism (D. Reidel Publ., 1986). S. Guccione, 10th International Congress of Logic Methodology and Philosophy of Science, Abstracts, 532 (1995). S. Guccione, in The Foundation of Quantum Mechanics: Historical Analysis and Open Questions, C. Garola and A. Rossi Eds. (World Scientific, 1999). P. Benioff, J. Math. Phys. 11, 2253 (1970). P. Benioff, /. Math. Phys. 12, 360 (1971). A. B. Komar, Phys. Rev. B133, 542 (1964). R. Geroch and J. B. Hartle, Foundations of Physics 20, 533 (1986). S. Guccione, EPISTEMOLOGIA, (2004, in press). A. Gzegorczyk, Fundamenta Mathematicae 42, 168 (1955). A. Gzegorczyk, Fundamenta Mathematicae 44, 61 (1957). M. Pour-El and I. Richards, Computability in Analysis and Physics (Springer Verlag, 1989). L. Blum, in Lectures in Complex Systems, E. Jen. Ed. (Addison Wesley, 1990). W. V. O. Quine, Erkenntniss 9, 313 (1975). A. Derecin, and S. Guccione, EPISTEMOLOGIA VIII, 77 (1985). S. Guccione, Boston Studies in the Philosophy of Science 47, 237 (1981). R. Schlege, Completeness in Science (Appleton Century-Croft, 1967). A. Einstein, B. Podolski and N. Rosen, Phys. Rev. 47, 111 (1935). A. Einstein and N. Rosen, Phys. Rev. 48, 73 (1935). R. de Ritis and S. Guccione, Fundamenta Scientiae 5, 103 (1984). R. de Ritis and S. Guccione, Fundamenta Scientiae 8, 383 (1987). R. de Ritis and S. Guccione, EPISTEMOLOGIA XVI, 97 (1993).

196 31. K. Goedel, Monatsh. Mat. Rev. 38, 173 (1931). 32. W. Hanken, in World Problems, W. W. Boone, F. B. Cannonito and R. Lindon Eds. (North Holland, 1973). 33. J. M. Levy-Leblond, II Nuovo Cimento 7, 187 (1977). 34. A. Einstein, Sitzungsber Preuss Acad. Win. Phys. Math. Kl, 142 (1917). 35. S. Weinberg, Rev. Mod. Phys. 61, 1 (1989). 36. L. M. Krauss and M. S. Turner, GRG 27, 1137 (1995). 37. B. N. Taylor, W. H. Parker and D. N. Langerberg, Rev. Mod. Phys. 8, 375 (1969). 38. A. Einstein, Ann. Phys. Leipzig 17, 891 (1905). 39. H. Reichenbach, The Philosophy of Space and Time (Dover, 1956). 40. D. Malament, Nous 11, 293 (1977). 41. R. de Ritis and S. Guccione, GRG 17, 596 (1985). 42. R. de Ritis and S. Guccione, Fundamenta Scientiae 8, 57 (1987). 43. J. K. Webb et al, Phys. Rev. Letters 82, 884 (1999). 44. A. Songaila and L. L. Cowic, Nature 398, 667 (1999). 45. J. K. Webb et al, Phys. Rev. Utters 87, 1 (2001). 46. J. Barrow and J. Magueijo, Phys. Letters B443, 104 (1998). 47. S. K. Lamoreaux, Nature 416, 803 (2002). 48. P. C. W. Davies et al., Nature 418, 602 (2002). 49. C. Seife, Science 305, 586 (2004). 50. R. L. Oldershaw, Am. J. Phys. 56, 1075 (1988). 51. P. Rodger, PHYSICS WEB, (22 July 2004). 52. J. Baez, UTTP://MATH:UCR.EDU./HOME/BAEZ/WEEK207.HTLM, (25 July 2004). 53. C. Seife, Science 305,934 (2004). 54. A. N. St. J. Farley and P. D. D' Eath, arXiv:gr-qc/0407086Vl, Univ. of Cambridge, U.K., (23 July 2004). 55. L. Kalmar, in Constructivity in Mathematics, Heiting Ed., 72 (1979).

BOHM AND BOHMIAN MECHANICS GIANLUCAINTROZZI Dipartimento di Fisica Nucleare e Teorica, Universita di Pavia, Via Bassi, 6 27100, Pavia, Italy MARCO ROSSETTI Dipartimento di Fisica Nucleare e Teorica, Universita di Pavia, Via Bassi, 6 27100, Pavia, Italy

The standard, or Copenhagen, formulation of quantum mechanics postulates that the complete specification of a quantum state is given by the corresponding state vector (completeness). A different approach is possible, assuming instead the incompleteness of the theory. Additional parameters, called "hidden variables" since they are not empirically known, would be needed to completely characterize the quantum state. The knowledge of these hidden variables would allow the precise determination of the values for the observables of the quantum system. In 1952 David Bohm, starting from such an assumption, has proposed a hidden variables formulation of quantum mechanics that is empirically equivalent to standard quantum mechanics, but offers a more rational and coherent picture of reality. Bohm's model integrates the ordinary quantum theory by introducing particle coordinates as hidden variables. Therefore, particles are distinguishable and describe trajectories in space or in configuration space that are causally determinate. In this context it is possible to explain double slit experiments and interference phenomena in terms of particle trajectories. Quantum probabilities become epistemic: the probabilistic nature of physical predictions is not an intrinsic characteristic of nature, but depends on our ignorance of the exact value of the hidden variables. Since Bohm's interpretation is, both epistemologically and ontologically, a natural extension of classical mechanics to the quantum domain, the visualization of physical processes is still possible, and the corresponding picture of reality is more intuitive. Bohm's interpretation clearly also presents limits and weaknesses: a Lorentz invariant formulation of the model is still lacking, and all the observables result to be dependent on the global context (contextuality), with the only exception of position observables.

1.

The de Broglie-Bohm interpretation of quantum mechanics

The objective coexistence of a quantum wave and the associated particle is the fundamental physical assumption in the causal interpretation of quantum mechanics suggested by Bohm. In the de Broglie-Bohm (d.B.B.) model (or

197

198 interpretation) each particle is always associated to a pilot wave guiding it. The opposite in not true: there could be waves without an accompanying particle, called 'empty waves'. There are physical situations in which the wave associated to a particle splits in different waves with negligible spatial superposition. One of these waves, carrying the particle, is the pilot wave; the remaining waves are, by definition, empty waves. These empty waves do carry energy and momentum indeed, and are physical waves by all means: if an empty wave meets a particle at a later time, it will influence the particle trajectory, becoming a pilot wave again. The d.B.B. model requires, for the complete characterization of a system of N quantum particles, the specification of the corresponding wave function and, in addition, of the hidden variables, represented by the positions of the particles belonging to the system. Particles are supposed to be really existing in space, distinguishable and traveling along trajectories. These trajectories can not be exactly known for a specific particle, because the spatial positions are the hidden variables of the model. The causal interpretation of quantum mechanics is the evolution of the pilot wave concept, originally proposed by de Broglie, and is due to D. Bohm,1 J. P. Vigier and J. B. Hiley.3 It is based on the same dynamical law of quantum mechanics (the Schrodinger equation) and is empirically equivalent to standard quantum mechanics since both interpretations assume the probability density to be equal to p = \y/\2. For a single particle, Bohm writes the wave function y/ in a polar form as -S(x,t)

yr=R{x,t) e" , (1) with action s(x,t) and amplitude R(x,t) both real, and with R positive. Assuming a density probability given by p = lyJ,2=R2 and inserting the polar wave function into the Schrodinger equation, it is straightforward to get the following equations: following equations:

M(w+v{r)j*LriU at dt

2m {

\2m m)

dt

,

(2)

R J \

mJ

The classical limit (h—>0) for Eq. (2) gives

^ + Yll dt

2m

+ V(r)^0

,

(4)

199

which is the classical Hamilton-Jacobi equation for a single particle with momentum p = V5(r), influenced by a potential v(r)- The corresponding Newton equation is: m ^ = -VV(5) dt Eq. (3) is the continuity relation for an ensemble of identical particles, distributed with density p = R2, where

(6)

, = YM

m is interpreted as the velocity of a massive particle (m being the mass), moving along a trajectory normal to the surface with constant action S, in a potential field V\r). Therefore, the continuity Eq. (3) can be written: ^ + V(/w) = 0(7) at The term that was neglected when considering the classical limit of Eq. (2) defines the quantum potential:

fi(r)=-*L™.

(8)

W

2m R Therefore, the quantum Hamilton-Jacobi equation for a single particle is:

?S+W)L

+ V(r)+Q(r) =

0-

(9)

at 2m and the equation describing the causal evolution of the hidden positional variables, corresponding to the Newton equation for a classical particle, is: m—

= -V(V + Q) •

(10)

dt The force due to the classical potential is not the only one present in this case: there is also a quantum force due to the quantum potential Q and determined by the amplitude R of the wave function. The wave function is therefore responsible for both the density probability and the quantum force acting on the particle. The conditions granting identical predictions and thus empirical equivalence of the d.B.B. and the standard interpretation of quantum mechanics are the following. • Wave function satisfying the Schrodinger equation. • Density probability given by p = U 2 . • •

Particle momentum equal to p = VS(r) • Particle position determined within the precision limit imposed by Heisenberg's uncertainty relations.

200

As a final remark, it should be noted that a Lorentz invariant formulation of the d.B.B. causal interpretation has not been proposed yet. The particle positions are expressed in term of an absolute time, common to the entire quantum system. 2. Bohmian mechanics In spite of the importance of the quantum potential Q for the d.B.B. interpretation, it turns out to be possible to obtain a causal formulation of quantum mechanics without using such a concept (that Bohm himself defined "rather strange and arbitrary"4). Modern approaches to the d.B.B. interpretation, known as Bohmian mechanics,5 do not use the quantum potential. A system of N particles is described by the wave function y/(X,t), where X =(Xt,...,XN)e R3N and XK denotes the position of the &-th particle. The wave function y/(X,t) satisfies the time-dependent Schrodinger equation ih^-

= H¥,

(11)

of

where H is the non relativistic Hamiltonian operator. In the case of spinless particles, H is given by 2mk where

Vt=4--

(13)

k

Furthermore, y/(X,t) equation

8Xk determines the particle motion, controlled by the motion

dxk _ n im(/v^)

—

dt

mk

;

y/ y/

(,A,,..., AN)

(14)

*• >

According to Bohmian mechanics, a system of N non relativistic particles is completely determined by Eqs. (11) and (14). Even if classical and Bohmian mechanics share common characteristics (see for instance the time evolution given by Eq. (10)), there are also substantial differences as well. Bohmian differential equations are first order equations, while Newtonian mechanics is characterized by second order equations. As a consequence, particles positions and velocities are not independent in Bohmian mechanics. The specification of initial particle positions is therefore sufficient to completely define a Bohmian system, while both initial positions and velocities are needed for the complete specification of a Newtonian system.

201 3. Double slit interference As emphasized by R. Feynman in his lectures, a double slit experiment shows all the mysteries and paradoxes of quantum mechanics. Interference patterns could be observed by using microscopic particles in experimental setups equivalent to the double slit devices used in optics to reveal the wave properties of light, as originally suggested by Young. A point like source emits waves/particles travelling to a screen with a double slit. After the two slits, the waves/particles continue their motion until they reach a second screen, where they are detected as point like spots on the second screen. Interference patterns have been obtained by using electrons (Jonnson, 1961; Merli, Missiroli e Pozzi, 1974; Lichte, 1986; Tonomura, 1989), neutrons (Zeilinger, 1988), helium atoms (Carnal e Mlynek, 1991) and fullerenes (Zeilinger, 2002). The experiments described by P. G. Merli, G. F. Missiroli and G. Pozzi,6 and subsequently by A. Tonomura et al.,1 are realized by using electrons emitted one by one, instead of a beam. By using a very low intensity current (equivalent to one electron reaching the second screen every 0.04 seconds) it is indeed possible to obtain the quantum interference pattern, even if there is no possible interference among the electrons. Clearly, the wave-like interference effect is not a collective property, but has to be attributed to each and every electron sent through the double slit, and impinging on the second screen.

Figure 1. Double slit interference using single electrons (from A. Tomonura): (a) 8 electrons; (b) 270 electrons; (c) 2000 electrons; (d) 6000 electrons.

The standard interpretation of quantum mechanics is unable to explain the interference pattern emerging as single electrons accumulate on the second screen, since there is no possible interaction among these electrons (sent one by

202

one through the double slit, so that only one electron is crossing the apparatus, at any time). There is no explanation for the fact that a single electron does behave as a part of an ensemble of many electrons, even if there is no possible interaction among the electrons emitted by the source, each at a different time. On the contrary, the single particles double slit experiment is easily explained in the context of the d.B.B. interpretation: the single electron trajectory is causally defined by the position of the electron within the slit and the quantum potential. During the experiment, the boundary conditions for the device are unchanged and therefore the quantum potential will remain the same. Each electron will feel a quantum force depending on its initial position within the slit and the shape of the quantum potential Q (both of which are time independent), and will contribute to the overall interference patter predicted by the theory. The electron motion is totally independent of the trajectories followed by former or later electrons, but it is determined by the quantum potential which is the same for all the electrons belonging to the ensemble. The same interference pattern would result from the overlapping of many single electron images, each collected in one of many different double slit devices with the same geometry (and thus with the same quantum potential), eventually located far apart from each other. Adding up the results from each device, an interference pattern equal to the one produced by the same number of electrons sent one by one in a single double slit will emerge. In order to analyze the diffraction of a single particle through a double slit, let us consider a source Sl, a screen P with two slits A and B centered at (0,±y) and a second screen S2. The particle beam is described by a plane wave emitted by the source Sl and reaching the two slits. Each slit generates a Gaussian wave that propagates after the first screen, and the two identical waves overlap on the second screen S2 • There is no classical potential in the region between Sl and S2. A time independent quantum potential is present, according to Eq. (8). At a given time, the wave function at position (x,y) is, neglecting a normalization factor, represented by: iy{x,y,t) = [\//A{x,y,t)+y/B(x,y,t)] •

(15)

The wave function can be factorized in two orthogonal components. The particle motion along the x and the y axis are independent, and the quantum potential is just a function of the v variable: Q = -, 2mR where f(y,t)

d2R

d2R

dx

V

is a plane wave

2 +

9V(y.Q 2m f(y,t)

d2y

(16)

203

The shape of the quantum potential Q for a double slit is shown in Fig. 2.

Figure 2. Double slit quantum potential (as seen from the second screen).

The single particle trajectory can be obtained by integrating mx = VS from a specified position x. While the x component of the velocity V is uniform, the y component y is determined by the force F = -^(17) dy resulting from the quantum potential. The trajectories are initially divergent from the slits, due to the repulsive effect of the central peak of the quantum potential Q. Subsequently the particles propagate in a spatial region where Q is flat and there is no force acting upon the particles. Therefore they move with almost uniform speed, and a small transversal component of velocity V • Due to the force F, particles can cross potential gaps, ending up in an adjacent potential trench (see again Fig. 2). The pattern of trajectories reaching the second screen S2 clearly shows an interferential structure, with a high central peak (high trajectory density), followed by two minima (absence of trajectories), then two lower maxima and so on. Assuming an initial density probability distribution |^ 0 | for the particles, the final density probability distribution corresponds to the quantum probability density U 2 , as shown in Fig. 3.

204

Figure 3. Possible particle trajectories through a double slit device.

The double slit experiment shows an additional property of the de Broglie-Bohm model: nonlocality. If one slit is closed at a specific time, the wave function changes instantaneously, and therefore a different pattern, corresponding to one slit diffraction instead of a double slit interference, appears on the second screen 5 2 . A particle, even if localized far away from the slit that has just been closed, is instantaneously affected by the change of the wave function guiding the particle. This is a clear indication of the nonlocal character of the d.B.B. model. The nonlocal behaviour, originally seen in the d.B.B. interpretation and related to the nonlocal structure of the quantum potential Q, is also a feature of the modern formulation of the theory, namely the Bohmian mechanics. In fact, according to Bohmian mechanics, the velocity of each particle is dependent on the positions of all the other particles of the ensemble, as shown by Eq. (14) or, equivalently,9 by dXk

„ -

h

V.y/

-

(18)

The connections between a particle and all the others belonging to the same ensemble imply the existence of nonlocal properties. This characteristic was initially considered as a defect of the d.B.B. model. On the contrary, it has been realized at a later time that nonlocality is a property of any possible formulation of quantum mechanics, and that the d.B.B. interpretation has the great advantage of showing it clearly: "That the guiding wave, in general case, propagates not in ordinary threespace but in a multidimensional-configuration space is the origin of the notorious 'nonlocality' of quantum mechanics. It is a merit of the de BroglieBohm version to bring this out so explicitly that it cannot be ignored".1

205

The double slit experiment has also suggested a generalization of the Bohr complementarity principle, proposed by D. M. Greenberger and A. Yasin in 1988. n As a particular case of a double slit device, let us consider the possibility of reducing arbitrarily the size of slit A, while the other slit B remains unchanged. As we decrease the size of A, the probability of having a particle crossing the first screen through B increases. This fact could be described by saying that, by reducing the size of one slit, we increase our knowledge of the trajectory, or corpuscular behaviour, of the quantum system. In the limit where A has a null dimension, we know for sure that the particle crossing the first screen went through the slit B. The complementary information about the wavelike behaviour of the system is therefore completely lost. Let us consider a wave function y/ = y/ + ys B , where y/ and y/ are arbitrary waves with amplitude R A and R B and the intensity is defined as / = |^y | . The wave-like behaviour of the quantum system, related to the relative intensity of the interference peak, is characterized by 7 D

max

= /

max

~ +

l

™n /

min

2R

ARB

= R

A

+

R

.

(19)

B

while the particle-like character ("which path") of the quantum system is characterized by R p =

l ~ Rl . *1 + Rl

(20)

The Greenberger-Yasin relation connects these two quantities: P2 + D2 = 1 . (21) A single experiment could display both the wave-like and the particle-like behavior of a quantum system simultaneously. This result is more general than Bohr's complementarity since the only two possible outcomes of an experiment are, according to Bohr, exclusively wave-like (z? = 1; f = 0 ) or exclusively particle-like (D = 0 ; P = l ) . The Greeenbeger-Yasin relation has been confirmed by neutron interference12 and in optical experiments.13 4. Features of the d.B.B. model 4.1. Causality Bohm's formulation clearly has been proposed in order to re-establish the causality principle in quantum mechanics. In fact, the d.B.B. interpretation describes particles moving along classical trajectories, defined by Eq. (10),

206

m— = -V(V + Q) • dt This equation is known as "quantum Newtonian equation", since the potential (V + 2) c o n t a ins a quantum potential term. The force acting on the particle (cause) is correlated to the particle motion (effect). Therefore, the causality principle is clearly effective. The probabilistic aspects of quantum mechanics do emerge, as related to a causally defined ensemble of possible trajectories (see Fig. 3). 4.2. Determinism A theory is considered to be deterministic if the specification of the initial value of all the relevant variables of the system is sufficient to calculate the past values and to predict the future values of such variables for any arbitrary value of time. This formulation of determinism also implies that it is possible, for an arbitrary time, to assign a value to all the variables characterizing the system. The time evolution of the wave function, controlled by the Schrodinger equation, is deterministic. Quantum mechanics, however, is a non deterministic theory because of the probabilistic nature of the predictions for the values of the observables of a quantum system. It is not possible to formulate exact predictions (such as "this particle will decay in 15 seconds"), but only probabilistic ones ("the half-life of this particle is 20 seconds"), in the context of quantum mechanics. The Copenhagen interpretation considers probabilities as intrinsic features of quantum mechanics: all the predictions about the outcomes of a quantum measurement can only be expressed in terms of probabilities. As a consequence, standard quantum mechanics turns out to be a non deterministic theory at an ontological level. The d.B.B. model would require, for the complete definition of a quantum system, not only the specification of the probability density, but also the definition of the initial positions of all the particles belonging to the quantum system ("hidden variables" of the model). Precise trajectories are therefore defined at any time, but they are not empirically known, for the initial positions are to be considered hidden variables. Since particle positions can not be experimentally determined with an accuracy better than that given by the probability density M 2 , the d.B.B. interpretation is unable to produce deterministic predictions, exactly as standard quantum mechanics. The d.B.B. model is indeed deterministic at an ontological level (since the equation of motion is the quantum equivalent of Newton's classical equation), but it remains a probabilistic model with respect to the possible predictions about the results of

207

a quantum measurement. In this case, probabilities are epistemic (i.e., linked to the limits of the observer's empirical knowledge of the quantum system) rather than ontological (i.e., related to the intrinsic features of the model). 4.3. Realism Realism is usually defined by an ontological and an epistemic hypothesis: there is an external reality, existing independently of any observer ("ontological"); it is possible to have direct access to this external reality ("epistemic"). The d.B.B. interpretation describes a quantum system in terms of really existing particles, with a precise (even if not known) location in space at any time, and really existing waves, either guiding the particles or "empty". Furthermore, the result of a measurement is considered to be independent of the observer, who simply registers a physical result, as in classical physics. Hence, the d.B.B. interpretation has to be considered a realistic model, as long as the only measured quantities are positional ones. All other variables are contextual (see Sec. 4.6) and therefore not realistic. These variables are sometime defined as "quasi-real".14 4.4. Nonlocality and holism The quantum potential Q connects the dynamical variables of all the particles belonging to a quantum system, independently of their spatial distance, and it is responsible for all the nonlocal and holistic features of the Bohmian model. In 1966 J. Bell15 demonstrated the impossibility of formulating a local hidden variables theory equivalent to standard quantum mechanics. Nonlocality was thus recognized as a fundamental property of any possible hidden variable theory, instead of a peculiar feature of the d.B.B. model. To be precise, nonlocality was at first considered a rather strange feature of this model, but later on it was understood that also the Copenhagen interpretation of quantum mechanics has a nonlocal character (nonlocality is not inconsistent with special relativity theory because the lack of knowledge about hidden variables values prevents using such information to send super-luminal signals). If a system has nonlocal properties, it has to be considered as a whole, which cannot be divided into smaller parts and is irreducible to the sum of his constituents. This property (holism) is evident in the d.B.B. model and is also present, but less recognizable, in standard quantum mechanics.

208 4.5. Lorentz-invariance A Lorentz-invariant formulation of the d.B.B. model has not been provided yet: in the equations of the model, the particle coordinates are expressed as functions of an absolute time, common to the entire system. The lack of a formulation consistent with special relativity is, along with contextuality (see Sec. 4.6), the major limit of Bohm's proposal. 4.6. Contextuality Hidden variables are added to standard quantum theory in order to achieve a complete knowledge of the system. Such variables are required to have a precise (even if unknown) value, regardless to the fact that it has been determined by a measurement or not. A further requirement is the possibility to perform reliable measurements: if a precise value is ascribed to a variable, a measurement performed on the system for such a variable should have the expected outcome, regardless of the the details of the measurement. This has been stated by J. S. Bell10 as "non contextuality principle": all possible measurements of an observable should give the same result, independently of all other measurements performed on different observables at the same time. J. S. Bell15 in 1966 and S. Kochen and E. P. Specker16 in 1967 demonstrated that the two requirements above are irreconcilable: hidden variable theories do violate the non contextuality principle enunciated by Bell. Contextuality implies that the value experimentally attributed to a variable in a quantum system does not depend just on that variable, but on the entire experimental context. Therefore, the attempt to complete standard quantum mechanics in a deterministic way could only be done at the cost of accepting the contextual character of such models. Not all variables are, however, contextual: for each quantum system there is at least one complete set of noncontextual compatible variables. They are the only variables that could be measured in an objective way, independently of all other measurements performed on the system at the same time. In the d.B.B. interpretation, positional variables (not to be confused with the hidden variables represented by particle positions) are the only noncontextual observables of the model. References 1. 2. 3. 4.

D. Bohm, Phys. Rev. 85, 166 (1952). D. Bohm and J. P. Vigier, Phys. Rev. 96, 208 (1954). D. Bohm and J. B. Hiley, Phys. Rep. Ill, 93 (1989). D. Bohm, Wholeness and Implicate Order (Routledge, New York, 1980).

209 5. D. Diirr, S. Goldstein and N. Zanghi, in Bohmian Mechanics and Quantum Theory: An Appraisal, J. T. Cushing, A. Fine and S. Goldstein eds. (Kluwer, Dordrecht, 1996). 6. P. G. Merli, G. F. Missiroli and G. Pozzi G., Am. J. Phys. 44 (3), 307 (1974). 7. A. Tonomura, J. Endo, T. Matsuda, T. Kavasaki and H. Ezawa, Am. J. Phys 57,117(1989). 8. P. R. Holland, The Quantum Theory of Motion (University Press, Cambridge, 1993). 9. D. Diirr, S. Goldstein and N. Zanghi, in Experimental MetaphysicsQuantum Mechanical Studies in Honor of Ahner Shimony, R. S. Cohen, M. Home and J. Stachel eds., Boston Studies in the Philosophy of Science (Kluwer, Dordrecht, 1996). 10. J. S. Bell, Speakable and unspeakable in quantum mechanics (Cambridge University Press, Cambridge, 1987). 11. D. M. Greenberger and A. Yasin, Phys. Lett. A128, 391 (1988). 12. H. Rauch, in Proceedings of the 3rd. International Symposium on the Foundations of Quantum Mechanics, M. Kobayashi Ed. (Physical Society of Japan, Tokyo, 1990). 13. P. Mittelstaedt, A. Prieur and R. Schieder, Found. Phys. 17, 891 (1987). 14. A. Fine, in Bohmian Mechanics and Quantum Theory: An Appraisal, J. T. Cushing, A. Fine and S. Goldstein eds. (Kluwer, Dordrecht, 1996). 15. J. S. Bell, Rev. Mod. Phys. 38, 447 (1966). 16. S. Kochen and E. P. Specker, Jour. Math, and Mech. 17, 59 (1967).

A N O B J E C T I V E B A C K G R O U N D FOR Q U A N T U M THEORY RELYING O N T H E R M O D Y N A M I C C O N C E P T S

L. L A N Z A N D B . V A C C H I N I Dipartimento

di Fisica dell'Universita di Milano and INFN, Via Celoria 16, 1-20133, Milano, Italy

Sezione

di

Milano,

We come back to the rooting of quantum theory in an objectively given phenomenological context, as it was first sustained by Bohr and later taken by Ludwig as basic motivation of his axiomatic approach. It is shown that the question of compatibility of an objective phenomenological context with present day quantum theory can be answered in a positive way if also thermodynamic concepts are taken into account for a quantum description of macroscopic systems. A formalism is recalled accounting for non equilibrium thermodynamics, by introducing classical fields describing local equilibrium in the quantum field context. Also a non deterministic dynamical evolution of these classical fields appears as an important possibility, linked to the breakdown of the method appropriate for the deterministic case. In this connection a refinement of this method is indicated, which with some additional condition leads to the concept of microsystem and to quantum theory for particles interacting with a macrosystem.

1. The role of a phenomenological pretheory In discussions about foundations of quantum mechanics some macroscopic level is generally invoked, often referred to as a classical level, to be described by some classical physics as opposite to a microscopic level, related to more accurate physical investigations touching the particle level which nature displays at a sufficiently small space scale and to which quantum mechanics applies: then an embarrassing question immediately arises, i.e. where the border lies between macroscopic and microscopic level. The rooting of quantum description inside a classical macroworld goes back to Bohr; a rather subtle answer and settlement of this question was given by Ludwig,1 whose approach actually leads to a richer mathematical structure than usual textbook quantum mechanics, based on states as statistical operators and transformations of states as affine maps on a linear space generated by the statistical operators. Let us mention that this same mathematical framework imposes itself when quantum mechanics is challenged with

210

211 problems in the realm of communication and evaluation of information.2 Often two utterly different ways of thinking about physics appear intertwined: one way as a theory aiming at the description of the nature of things, the other way as a theory associated with Galilean experiments. Ludwig's approach is a systematic development of the second standpoint. The most profound motivation to research leads to a mixture of these two attitudes, escaping whenever possible from the second to the first. The progress of science just comes from both ways taken together, but when difficulties appear one has to follow the second one. Any experiment about a given theory is a setting identified and controlled in a phenomenological way or known in the context of a pretheory, separated form the theory the experiment is about. The formalism of quantum mechanics in its generalized form arises in Ludwig's approach as a mathematical representation of items given inside a pretheory and used as parts in an experiment which gives evidence of a physical object, relevant for the experimental setting but not of relevance for the things composing such setting. Quantum theory fits very well and seems to be a very simple realization of Ludwig's mathematical scheme; the latter is however much more general. Also classical mechanics for classical point-like atoms is compatible with it. This must be expected since classical atomism, though disproved by experiment is a conceptually consistent scheme. To the phenomenological pretheory objectively given state parameters are generally associated, showing in many situations, but not necessarily always, a deterministic time evolution inside suitable time intervals. Generally indeed this phenomenological level does correspond with macroscopic properties of systems, while the result of an experiment performed using them is often linked with microphysical components. The key point for a proper understanding of Ludwig's approach however implies being aware of the fundamental role also of phenomenological thermodynamics with its may be inelegant phenomenological artefacts like containers, walls, external fields and its obvious local equilibrium indexes like temperature, velocity, chemical potentials, not to be linked a priori with mechanical properties of classical or quantum components. While it was obvious that thermodynamics was necessary in order to supplement the physics of a continuum in classical physics, it is believed that one can dispense with it when the physics of interacting quantum fields is considered. This is based on a persistent mechanicistic prejudice, by which quantum fields are merely a replacement of collection of fundamental particles, thus promoting any model of interacting quantum fields to a fundamental mechanical model which has to involve only mechanical quan-

212

tities. Nonrelativistic quantum mechanics is to a certain extent compatible with the existence of a system composed by a macroscopic part, to be described at a microphysical level, interacting with a microphysical system consisting of one or few particles, thus substantiating this fictitious representation of macro and micro systems; in the relativistic situation due to non conservation of mass this becomes actually inconsistent. The contribution is organized as follows: in §2 objective state parameters are introduced with their deterministic time evolution; in §3 it is suggested how a breakdown of determinism can be linked to the emerging role of a microsystem; §4 is devoted to conclusions. 2. Deterministic dynamics of objective states To represent the objective phenomenology of a system let us associate to it a set of quantum fields, here for simplicity a scalar field ip(x) confined inside a region fi C R 3 , together with a Hamilton operator H and a countable set M of linearly independent local or quasi-local observables Aj(x), built with the fields V>(x), V>t(x) an< ^ quasi-local in the sense that A,(x) only depends on ^ ( y ) , V^(y') for |x — y| C r o , |x — y'| « r o and also H is obtained by a quasi-local density. The length ro can be taken to characterize the short distances limit of the physical description, its typical space scale being A >• r 0 . A simple but already meaningful model is fluidodynamics, with a Hamiltonian density \ / n d?y ffl(x)^t(y) U(\x — y|)V'(y)^(x), with V(r) a short-range potential with range ro, the relevant observables being mass, momentum and energy densities. A set of state parameters consists of functions Q (x) such that the operator

IIOEJ/ACM-W

(!)

is essentially self-adjoint with a point spectrum and exp[—<$(C)] is trace class. Then starting with the reference generalized Gibbs state e-*(0 ™<1 = — *TT (2) C Tre-*(0 entropy is introduced by 5(C) = —kTrw^logw^ and a partition function can be defined by Z(Q = T r e - * ^ . A deterministic time evolution of these parameters can be defined if one succeeds in constructing a solution pt of the Liouville von Neumann equation carrying only one family (# of state parameters t' < t, where such a family is defined assuming that the

213

state (2) is equivalent to pt relatively to the relevant observables Tr Aj(x){pt - u/Ct) = 0

\/Aj(x) e M,Vt.

(3)

Due to the fact that the selection of relevant variables in M is generally not invariant under time evolution, pt must be represented as pt = z^e-MV+^W

Zt = Tr c -[*«.)+*«)],

(4)

S(t) being a correction not involving relevant variables. Indicating by U\a the unitary time evolution determined by the Hamiltonian one has

pt = Uyt0

= z^e-K^oWtotM],

(5)

In a formal way one can trivially rewrite Z^*o$(£t0) in terms of $(Ct) by

<*(Ct 0 ) = *(Ct) - / * * ' ^7P4*(0]

(6)

CivWAjto +

^wMic)

then one expects that also S(to) should have a similar form, so that

dt

' f dZ*ui' ^ ( ^ i ( x ) + Kf-W>lj(x)

s(t) = j2 f •

J—oo

(7)

JQ

where Qt', t' < t, describe the preparation of the system. One expects that their choice should be significant only for t' € [to — r,to], T being a typical time interval related to the duration of the preparation procedure. Further analysis of models is however necessary to establish whether the most obvious choice Qti = 0 for t' for to
Pt =

+ ie-*tt«) + e-*W Jt + Je-*(C*)Jt

:

—

:

1

—

—

(8)

Tfc[e-*«.) + 5eT*(C) + e -*(CO$t + 5 e -*(C.)5t] where: S=

rk Jo

due-uW^+sWs(t)e+u*^\

(9)

214

and thus depends on times t' < t due to expression (7) of S(t). Furthermore one can set in evidence at r.h.s. of equation (8) the reference generalized Gibbs state e -*(Ct)

=

<10)

** S7*i« writing pt = * C . + & B c . + M t + & M ; t = * 1 + Tr (Sw^ + w
+

f

(4)>

( n )

where T(t) is a traceless operator given by: f (t) = -y(t)[SwCt + tDCtJt

+

J ^ J t _ $(t Tr(Ju) C( + wit J t + Ji&Jt)] ( 12 )

with 7(f) = 1/[1 + Tr(JiDC( + wQt J* +
^ W = EOt'WA'W + ^ f Wii(x)

(is)

i where x indicates localization. Such localization is lost, and the irrelevant contribution increases with increasing time in the tail part T[t) of

S = S(t,t-T)

+ T(t)

(14)

where S(t,t-T)=

f d% f 5 d u e - » l * K ' > + ^ f dfUtt,
(15)

215

and f(t)

=[<&[' due-u&^+SW Jn Jo

f 'df^ffCtl(x)e+0*^> J-oo

(16)

due to the time evolution map, for a time interval t — t' > r . For r large enough, only the general representation in terms of the local fields V>(x), V^(x) can be given, and taking into account that relevant variables leave n particles subspaces invariant, one has the general representation:

t(t) = Y,fd\...[ a

X 4>]l(m)ftM

Jn

•••

Jn

dV I d\a...f d%

ftiVaWita)

Jn

Jn

• ••IpfaWiZlWatiVl,-

• • > Vocta,- • • , 6 )

(17) o' a t(jji,...,»;o)^a>---)€i) being for large t increasingly complicated complex valued functions describing the n body dynamics of the system. To calculate expectations of local observables O(x), such as A,(x) and A,(x), one considers expressions of the form: Tr(d(x)f(i)wCtft(i)),

Tr6(x)(f(*)^t+^"f^)),

(18)

the following typical approximation makes the job: in (17) one can replace the integration region D, by Q,\UJS(X) with w^(x) the sphere with centre in x and radius S C fi- If one calculates (18) with this modified T(t), due to locality of relevant variables Aj(x) in term of which ui£t is built and also locality of Aj(x) and Aj(x), the contribution in which only one factor T(t) or T* (t) arises are negligible due to the off-diagonality in x representation induced by the modified T(t), while for terms with both T(t), T*(t) the factorization •&(6(x)f(t)«; C t ft(*)) « Tv(d(x)wCt)Tr(f(t)wctft(t))

(19)

occurs. In this way if (8) is used to calculate the expectations of local observables, S can typically be replaced by the head part S(t, t — r ) of (14); only the factor 7(f) at r.h.s. of (12), where it affects the irreversible contribution to the dynamics, still depends on T(t). By the very definition of the state parameters these irreversible corrections strictly vanish for all relevant

216

variables; then one can reasonably expect that Tr A,(x)f (t) is a small correction to Tr Aj(pc)w£t. This is just the way in which phenomenological local equilibrium thermodynamics of a fluid, ruled by Navier Stokes equation can be derived in a quantum field approach, as was initiated by Zubarev 4 and further developed by Morozov and Roepke. 5 Even if no systematic effort has been done in this direction, one can expect that a deterministic evolution of the state parameters can be constructed by the following procedure. One starts with a zero order dynamical evolution C?t(x) solving the evolution equation of an ideal fluid, then taking the memory term with (j t (x) = Cjt(x) finds the first order dissipative and irreversible corrections to the evolution equations of the relevant expectations, this step also implies a first principle evaluation of phenomenological coefficients such as viscosity and thermal conductivity. Then one has to calculate a correction to the state parameters Cjt(x) = C°t(x) + Cjt(x)i since state parameters are linked to expectations or relevant variables. Looking in this way to Tr Aj (x)p4 one goes beyond the usual linear approximation of thermodynamics of irreversible process and a more sophisticated investigation of the subject can be envisaged, which should yield an increasingly precise deterministic evolution of the state parameters, also involving more precise evaluations of phenomenological coefficients and an increasing number of them. All this means a matching between a certain deterministic phenomenology of systems and some more or less fundamental quantum field model. Such a matching has been obtained relying on thermodynamic concepts. It provides a way to face the macro-objectification problem, whose relevance and difficulty inside the usual axiomatics of quantum mechanics has been pointed out by Ghirardi and coworkers.6 Let us observe that in this way, by some well understood phenomenology, by maybe empirical knowledge of phenomenological coefficients and by the solution of classical evolution equations (e.g. Navier Stokes equation), one can bypass, if we limit the interest to the dynamics of the expectations of the relevant variables, the generally extraordinarily complex problem of full quantum field dynamics. This just happens since we put in the foreground the basic quantum field balance equation, e.g. in interaction picture, while we leave buried in the underground of the theoretical description the dynamics of the Fock-space states of the system. No explicit use of microsystems is strictly necessary nor useful. In a sense this situation is strongly reminiscent of an ancient and bright piece of physics: the parametrization of the solution of the Boltzmann equation by local equilibrium state parameters

217 by which Chapman and Enskog succeeded in deriving phenomenological fluidodynamics from the Boltzmann equation, taken as the fundamental equation for the dynamics of the molecules composing the system, without solving explicitly such equation: though collisions between molecules was the leading underlying idea, only the link to thermodynamics allowed the extraction form the theory of the relevant physics. Now interacting fields are the dynamical setting, the context is much more general and amenable to relativity and, as we shall see, something of the underlying microstructure acquires an autonomous phenomenological evidence, much stronger than molecules did inside Boltzmann's atomistic effort.

3. A way t o m i c r o s y s t e m s By the previous consideration one can understand that a simple phenomenological description, e.g. fluidodynamics, can hold for long times, with absence of memory effects, as it is in fact observed, despite the long time complex structure which is displayed by (7). However this is not the whole story: one can expect that the typical uniformity of the factors aat(r]i,... ,T)a,(,a,. ..,£i) in (17) that allowed the simplification fi —> fi\cj,5(x), can be influenced by some, possibly not at all trivial, driving process, correlating the localization points JJ, £ in (17) with the localization points of some relevant variables: then a new feature is disclosed inside the previously described phenomenology. Let us investigate what happens if the simple behaviour we have considered before with respect to the uniform distribution of the integrand in (17) inside the space fl x fi x . . . x Cl 2a times, is analyzed in a more accurate way. We assume this simple behaviour for all variables r), £, but one, e.g. TJI for times t > i. Then (1) can be represented as:

f(t)=

[ dVft(V)Dt(r,)

(20)

the operator

Dt(n) = £ / d% ... / &na f d%...[

d%

Jn Jn Jn Jn x ft(7)2) . . . ft{r)a)ftt,a) • • • ft(,2)ft(,i)aat(r), • • • ) Wan sen * • • -si ) (21) a

being a generalized annihilation operator in the sense that Dt(ri)N = (N + l)Dt(r}) \/rj. If (21) is evaluated replacing the integration region f2 -»•

218

fi\wa(x) but not in (20) one obtains TtiAjWfWwbPit))

- Tr(Aj(X)w
[ d3r) f dS [TrAjtoftWw^ir]') Jn Jn

(t)w
=

- Tr(i ; (x)iD c t ) " L ^ f a K ^ f a ' ) ) ] x TiiDMw^btir,'))

(22)

and similarly for Aj(x). The expression gt{f],T)') = Tr(JDt(r/)i()ft£)](r/')) defines a positive operator g\ ' on L2(£l) by: ( 5 t ( 1 ) / ) W = fdPn'gt^vViv')-

(23)

Let us represent it in terms of orthonormal eigenstates gjt{vi) and positive eigenvalues Xj(t): f dV9t(v,v')fW) Jn

= T,X^9Jt(v) j

[ d3V'g*jt(v')f(v')Jn

(24)

Setting Jn equation (21) becomes T r ( i ( ( x ) f ( ^ C t f t ( f ) ) - T r ( i ; ( x ) ^ C J T r ( f (t)«) C l ft(i)) = £

A,- (*) Tr i , ( x ) ^ ^ - Tr ( i , (x)u>c,) Tr ($ t u> c , ^ t )

(25)

j

where the typical mechanism by which T(t)w^tT^(t) contributes in a negligible way to expectations of local observables can also be stated as: Tr AtWtyMit

« Tr(i,(x)ii; Ct ) T r ( ^ t t « ) C | ^ )

(26)

and similarly for A,(x). Let us now assume that an atypical behaviour can be associated with a selected subset of creation operators for which a coherent dynamics prevents the decay of correlations described by (26). As a consequence, together with the macrostate u)Qt also the new states:

will have their role, for the dynamics at times t > i; as macrostates perturbed by a particle in the state gatLet us observe that representations of T(t) different from (20) are conceivable: instead of

219

^{r}) a destruction operator ip{r]), can be put in evidence, so that a hole will have the role of microsystem; more generally products tpHviWiw) •••V't(7?r)1/'(£s) •••V'feOVKCi) could be considered, so that composed structures (more particles and holes) arise as microsystem. We have concentrated on the simplest situation. To summarize, the last term in (8), which is a kind of shadow subcollection as long as dynamics is ruled by (10) (deterministic case) is decomposed at time i as in (10) and from a time i on we shall consider pt = U\pi , where pi according to (8) is represented as k =

%

+

fft> Ct -ft(^ + f(t-)

=

^

^

+

^ = * c , + f(i)flc,ftffl

m = rv-^m

(2g)

(29)

t

1 + Tr f (t) + Tr T(i)w(lTKi)

(30)

where we skip the explicit expression of T(t), essentially depending on S(i,i — T). By (28) the statistical operator pj is represented in terms of a new reference statistical operator wj, given by (29), which shows that a part of the previous shadow collection has been associated to the previous reference state WQ corrected by t(t), which is the traceless part of T(i). By our previous analysis wj can be represented as a stochastic generalization of u>Cr:

w-t = \fwCol + Y^ Kl ^ 7 ^ * " 7t

A

f-^f>0,

Af + ^ ^ = 1

(31) Thinking about the time evolution for t > t, the first subcollection at r.h.s. of (31) is associated with pursue for t > t of deterministic evolution of state parameters; the other subcollections might evolve in different ways, due to the perturbing particle in its different states gai. Then one can expect that for t > i a more adequate parametrization can be given by tuning Q to the different components of the mixture at r.h.s. of (31) so that to describe this stochastic time evolution of the system, the following stochastic generalized

220

Gibbs state will be introduced for t > t:

a

1±

TrV'at^t>4V'at

a

(32) the state parameters C,t and C,at will have at t = i the common value £j and (32) will coincide with (31). For t > i we can write a solution of the Liouville von Neumann carrying the new parameters by an obvious generalization of the treatment given in §2: pt = U\p-t =

(ftcG-M^x) + Hfnd3x

f(t)exp[- "£jn +E x exP [ - E ^

[Jnd3vUmHv) d3x

- Jn A jf*df

0«t(x)ii(x) + E ^

ACtWtkri)

f_dt'

d 3 x

|7(Z4<>(x)i;(x))] -^(UkaArtWHr)))

£ dt'

- J d3r, J* dt> ^(Kt-Cf

^.(UiCjafWAjW)}

(r,)i>{r}))

+ Ujr® (33)

for £ = F the time integrals disappear, and one recover the coefficients Cj<*f = Cif, and the functions # ( x ) = f | f ,
221

shadow collection appears and if it were inequivalent to wt for t > i a new stochastic reference state must be introduced showing a more complicated particle structure and thus leading to a stronger stochasticity. In this way interaction of a microsystem with a system can produce more complicated microsystems: not only decoherence happens but also increasing or varying complexity can occur, whenever behind the deterministic or less stochastic wt the shadow collection SwtS^ becomes significant. Due to the larger set of parameters entering wt a larger set of fixing conditions is necessary: now we shall indicate how by these conditions quantum mechanics is recovered. We assume that in order to determine wt in the simplest stochastic case one has to choose a suitable set M ^ of one particle observables and extend to these observables the former assumption (3): IV Aj (x) (^ - % ) = 0 Tv A(pt - wCt) = 0

VAj (x) e M, W VieAf

{1)

(34)

,Vt.

The one particle observables A € M^ are more appropriately given in the momentum description: A = Jd3pdV&Hp)A^(P,P')a(pl),

(35)

correspondingly also i/S0 in wa are replaced by ag = J d3pa(p)g*(p), 1, [6 s ,aJ] T = 1. Indicating by: wg <- = (1 ±

9 i 9 lTagWQa'g)

\\g\\ =

3 6

one has: Tr(AwgX) = (
(37)

with

Agl) ( P , p') = g(P) J d3V~g*(V)AMfo,p')+j A A& (p, r,)g(r,)g* (p') (38) Let us assume for simplicity that Tr agw^ag
222

choose observables (35) which are non diagonal in p representation to a degree high enough so that \Tr(Agwi;)\
= T r ^ , A^w[x)

(39)

a

w\ is a positive trace class operator with trace less or equal than one. One has \{t) = 1 — Tr w\ ' with \{i) the weight of the deterministic situation. The operator w\ in one particle Hilbert space expresses a universal feature of all physical systems whose phenomenology is covered by our objective description: by wt -> w\ ' all objective elements associated with wt have been erased. Inversely by the spectral decomposition of w\ ' one obtains the one particle states gatip) and the weights \at\ then by (34) one gets Cat and Q and finally the whole wt can be constructed. Open questions are: i) the precise characterization of M^; ii) to study the time evolution of all the parameters CtXat, gt,gat(p),h,Kt starting with TtAl{-K)pt = jtTv{Al{^)wt)

Vi,(x)eM

(40)

Tr Apt = 4- Tc(Awt) V i € M ^ at where pt has the from (33). By our discussion the dynamics of a system is finally reduced to the construction of a solution of (34) and (40): microsystems become the natural ingredient to pursue the description of a microsystem when the parametrization in terms of the sole state parameters £t would become insufficient. To catch something quantum mechanical inside this dynamical context that could appear completely extraneous to usual quantum mechanics let us think about the most simple contribution to the l.h.s. of (40) Tr Apt = - \ Tr A[H, pt] coming from the part of pt given by (33) which is related to ft (n) or better to the corresponding agt: here 2

the leading term in the expression of gat(p) is given by e _ ^ 2 ^ ^ _ t o ^ a t 0 ( p ) ! m being the mass of the Schrodinger field VKx) introduced in §2. Beside this free dynamics of a particle with wave function gat{p), also its collisions with the remaining local equilibrium system will have a major role, relating

223

this problem to irreversible dynamics of a microsystem interacting with a local equilibrium macrosystem: then the structure (33) could be useful to treat such problem without assuming a Markovian structure of dynamics. The choice of M^ is in a sense complementary to M: one has to look for observables which are possibly insensitive to the macrostates WQ , but sensitive to the microphysical additional structure of wt given by the operators ag ag. In typical treatments of a system composed by a macrosystem <S interacting with a my, one represents the states of the compound system in a Hilbert space % = 'Hi® Tis and taking g 6 Hi and w^ statistical operator on Us, all operators ,4 given by (34) can be taken and one removes in an artificial way the problem we are now meeting. Eq.(37) indicates an eventually more refined treatment: the term Tr(AgW{) should be small enough with respect to ( g ^ 1 ) ^ ) , by some orthogonality of Ag with respect to W(: then starting with w\' as a zero order statistical operator of the microsystem, a recursive construction of the full wt can be started.

4. Conclusions and outlook By the presented construction microsystems emerge as stochasticity seeds inside an objective framework primarily based on local quantum field observables Ai(x) 6 M and on classical fields Cjt(x) having the role of local equilibrium thermodynamic indexes. They are related to the expectations of the relevant variables as long as their evolution is deterministic. The pretheory we have mentioned in the introduction is no longer a purely phenomenological framework but becomes the deterministic sector of our description: it could be treated by replacing, at least to a certain extent, the phenomenological attitude with a more first principle one. In 7 this was initiated in the case of fluidodynamics. A breakdown of the deterministic regime can occur and contextually, just in order to describe a stochastic behaviour, an additional structure is pointed out, that can be associated with a microsystem: its quantum mechanical evolution becomes a necessary ingredient in order to face the more complicated situation. In this way the quantum mechanical nonlocality looses its apparently paradoxical aspects, just as was claimed by Ludwig. The purpose of this paper was to show why this seeds can be expected and how formally one can become aware of them inside a context that is usually treated as strictly deterministic. That basic difficulties of quantum mechanics can be overcome with thermodynamic concepts was indicated among the different open alternatives in this context also in a recent general essay.8

224 Acknowledgments This work was supported by INFN a n d by MIUR under F I R B . References 1. G. Ludwig, Foundations of Quantum Mechanics (Springer, Berlin,1983). 2. A. S. Holevo, Statistical Structure of Quantum Theory, Lecture Notes in Physics, Vol. m67 (Springer, Berlin, 2001). 3. L. Lanz, B. Vacchini and O. Melsheimer, in Quantum information, statistics, probability -celebration of Holevo's 60th birthday, O. Hirotaed. (Rinton Press, New York, 2004). 4. D. N. Zubarev, Non-equilibrium statistical thermodynamics (Consultant Bureau, New York, 1974). 5. D. N. Zubarev, V. Morozov and G. Roepke, Statistical mechanics of nonequilibrium processes (Akademie-Verlag, Berlin, 1996). 6. A. Bassi and G. C. Ghirardi, Phys. Rep. 379, 257 (2003). 7. E. Vitali, Degree thesis (University of Milan, Milan, 2005). 8. V. AUori, M. Dorato, F. Laudisa and N. Zanghi, La Natura delle Cose (Carocci, Roma, 2005).

THE ENTRANCE OF QUANTUM MECHANICS IN ITALY: FROM GARBASSO TO FERMI MATTEO LEONE, NADIA ROBOTTI Department of Physics, University of Genoa Via Dodecaneso 33,1-16126 Genoa

The first steps of quantum mechanics in Italy will be here discussed, through the use of the available archives and printed sources. As it will be shown, this development was closely linked with a spectroscopy tradition of research, whose major protagonists were three physicists working in Tuscany during the first two decades of the century, namely Antonio Garbasso, who worked in Arcetri (Florence) on the theoretical basis of the recently discovered Stark Effect (1913-14); Rita Brunetti, in Arcetri as well, who made use of the quantum theory in order to explain the X-rays emission (1918-20); and, finally, the young Enrico Fermi, who paid attention to the quantum theory since his days at the Scuola Normale Superiore in Pisa (1918-22).

Introduction As is relatively well known, before 1920s Enrico Fermi's intervention, quantum ideas had been largely neglected by the Italian physics. Many indicators corroborate this contention, notably the glaring fact that very few contributions on modern physics topics were published by the Italian physics literature during the first two decades of the century. As of 1912, the Italian Physical Society journal {Nuovo Cimento) had published only two review papers on quantum theory [1] [2], one of whom being quite critical in its stance toward the subject [1]. Its author, Orso Mario Corbino (who eventually became director of Rome Institute of Physics), emphasized indeed that "the hypothesis of finite variations of the contents of molecular energy according to whole multiples of the quantum £ — hv deeply repels all our mechanical conceptions". Moreover, as of 1912, no research paper based on quantum hypotheses had been yet published on Nuovo Cimento. This backward state of art somewhat changed the following year when spectroscopy became an important topic of quantum mechanics through the discovery of an important quantum effect, the so-called Stark effect. More importantly, and unexpectedly, the Italian physics gave meaningful contributions to both the experimental set-up and the theoretical interpretation of

225

226

this effect. The role played by these - and other - researches in allowing the quantum mechanics to enter Italian physics will be here addressed. 1.

Garbasso, the Stark effect, and the problematic birth of quantum mechanics in Italy

In 1913 the Italian physicist Antonio Garbasso (1871-1933) left his position at the University of Genoa for the University of Florence to become director of its Institute of Experimental Physics. During the preceding years he had been carrying out works on electromagnetic waves, mirages, X-rays and, most importantly, on spectroscopy topics. In 1906, he had indeed published an important theoretical spectroscopy textbook: Vorlesungen iiber theoretische Spektroskopie (Leipzig: J.A. Barth). At the University of Florence, Garbasso met the young assistant Antonino Lo Surdo (1880-1949). Lo Surdo had been pursuing research on terrestrial physics at Florence, but now Garbasso set him to investigating the Doppler effect in spectra produced by the retrograde rays, i.e. by the positive ions which were always present in discharge tubes in the vicinity of the cathode. Lo Surdo thus turned his attention to the radiation being emitted in the dark space close to the cathode. At first he used standard discharge tubes, but he soon found it advantageous to use long, thin ones between 1.5 and 4 millimeters in diameter.

Figure 1. Antonio Garbasso (1871-1933). Source: [29],

227

Lo Surdo's intent was to look for a Doppler effect in the positive retrograde rays, but in the summer of 1913 he observed a far more significant phenomenon. As he recalled at the end of that year: Since last summer, while studying the Doppler effect due to the positive retrograde rays close to the cathode by means of a discharge tube placed obliquely to the slit of a spectroscope, I observed that the hydrogen [spectral] lines appear to be resolved into several components.... I [subsequently] discovered that this phenomenon also makes its appearance when the tube was placed perpendicularly to the slit. This was, therefore, a new phenomenon [3]. The nature of this unexpected phenomenon and its explanation under a quantum-mechanical framework were discovered a few months later. 1.1. Stark effect Since 1906 the German physicist Johannes Stark (1874-1957) had studied the effect of an electric field on spectral lines experimentally. His decision to do so was influenced by his interest in quantum theory, in particular, by the possibility of interpreting an electric analogue of the Zeeman effect (where a split of spectral lines was obtained out of intense magnetic fields) within its framework. Former analyses had been based exclusively on classical electrodynamics, which did not predict such an electric analogue effect. By contrast, Stark believed that the new quantum theory might permit it. In the fall of 1913, i.e. a few months after the unexpected (and up to that time unpublished) Lo Surdo's observation, Stark was able to carry out a systematic study of the influence of an electric field on spectral lines, through a specially assembled canal-ray discharge tube. By this work, he discovered that a transverse electric field caused hydrogen spectral lines to split into several components. He presented his results at a meeting of the Prussian Academy on November 20, 1913 [4]. The following month, Stark discovered the longitudinal effect as well, that is, the splitting of spectral lines when observed parallel to the direction of the electric field [5]. The discovery of this Stark effect was instrumental in gaining for Stark the Nobel Prize in Physics in 1919. On December 4, 1913, Stark's letter, where the discovery was announced, was published in Nature [6]. Lo Surdo read the letter and understood immediately that he too had observed the same effect. He realized that with his discharge tube, with its particular geometry, he had produced a strong electric field - of the same order of magnitude as Stark had - in the region where he had observed the splitting of the hydrogen spectral lines [7].

228

1.2. Theoretical explanation Niels Bohr, a few months after his landmark 1913 paper on the atomic spectra [8], proposed the first successful theoretical explanation of the Stark effect. In a paper that he published in the March 1914 issue of the Philosophical Magazine, he stated that "it seems possible on ... [my atomic] theory to account for some of the characteristic features of the recent discovery by Stark of the effect of an electric field on spectral lines...." [9]. According to Bohr's theory, an electric field causes the elliptical orbit of the electron to precess and to change its eccentricity. Bohr concluded from his analysis that only two stationary electronic orbits were allowed: "the orbits simply consist of a straight line through the nucleus parallel to the axis of the field, on each side of it." Thus, Bohr found that the change in frequency Av is proportional to the amplitude of the electric field, as Stark had found, and is given by:

Av = —-.— E(nl-nf)

An em where E is the electric field, h Planck's Constant, e and m the charge and mass of the electron, and n, and n2 the quantum numbers of the stationary states between which the electron undergoes a transition when emitting the spectral line." 1.3. Antonio Garbasso's theory Independently of Bohr, Garbasso proposed a theoretical interpretation of the Stark effect (re-named by Garbasso as "Stark - Lo Surdo phenomenon") at a session of the Accademia dei Lincei on December 21, 1913, whose content was published in Physikalische Zeitschrift, Rendiconti of the Accademia and Nuovo Cimento [10]. This was the first research paper based on quantum hypotheses to appear on the journal of the Italian Physical Society. Garbasso's interpretation too was based on Bohr's atomic theory and was similar to Bohr's. Garbasso learned about Bohr's theory of the Stark effect from a letter that Bohr published in Nature on January 15, 1914, where Bohr announced the publication of "a paper on the influence of electric and magnetic a

A completely satisfactory theoretical interpretation of the Stark effect was found only after Arnold Sommerfeld (1868-1951) generalized Bohr's theory in 1916 by introducing the "Sommerfeld conditions" to describe systems of more than one degree of freedom. His student, the Russian-born Paul Epstein (1883-1966), and independently the German astrophysicist Karl Schwarzschild (18731916), then used the Sommerfeld conditions to explain the Stark effect in hydrogen. They showed that in a transition of the electron from an initial state (quantum numbers ki,k2, k3) to a final state (jii, n2, n3), the line splitting Av in the first-order Stark effect is given by Av = —-—[(/i, +n2 +n3X«2 - » i ) - ( * i +k2+k3Xk2 v>n em

-*,)]-

229

fields on spectral lines, which will appear shortly in the Philosophical Magazine" [11]. Garbasso then wrote to Bohr on January 19, stating that: From the last issue of Nature I see that you have applied your marvelous theory of spectral analysis to the consideration of the Zeeman and Stark phenomena, respectively. I am looking forward to your paper with great interest; I have myself tried to extend your theory in this direction, however, as you will see, with but little success. I contented myself with a preliminary calculation, since, from experiments done by my assistant, Dr. Lo Surdo, the matter seems to be very complicated experimentally [12]. FUTUS

(Italian) i«n Wt«o 1. 19I4

Phya. Inat. Itr it.Hochicul* <:•»

.1

J

.«

[ ?

c =L

'

->»hr j « « h r t « r M»zx i a l l « g » I kUH d«r l » t t t a u ;iujnmer dor U»tujg ers«h« i c h , iaaa dl« Ihre «miuieraofa#ne

rttap. ~C*ri'.':K'!,.dH Vf'ftliomtjIiB

ent^d^eDi :'U

f h e o r i * d«r dpak^

.1:; dwuti.it

h.iboa.

L'[: !.iil'<- ;' H a t v.irSLlcht Lt::t) .'(:ciJ_

.110 U i t i t p

-.lotaotv-; uudzudnhlltfu,

--'-I,.

I w h wlp J l o ijoln*:i werkiuu, =uit XeiiK-i' .;ri>di>«il

loh (tab© Ullch b o ^ n V t "«tinolitoiit;,

wtt ••liior v v l i a u f i 0 '»u

* « i l »u« v«rvuoti»u vun »iu»a A»_

»iat»ut«tt von u u r , i u r r u L)x. t « Surde, <JLl« ;i«ott» »otmlnt • x y a r l i i w u l a l l

umhx ••JlfAit.li't

IU t l e l j l .

Figure 2. Antonio Garbasso to Niels Bohr, January 19, 1914 [12],

In analogy with Bohr, Garbasso had discovered that when an electric field E was present, the original hydrogen line was accompanied by a couple of spectral lines - symmetrical of the original line - whose width infrequencyAv was proportional to the electric field. Garbasso deduced an expression for this change in frequency that differed from Bohr's only by a factor of 2: l 2x2em X2 ' Bohr responded on February, 7, 1914, pointing out that Garbasso had made an error in his calculation that had led him to underestimate the difference in frequency between the components of the split lines [13] (figure 3). When

230

Bohr's aforementioned paper was in press, the Physikalische Zeitschrift published Garbasso's one where his calculations were briefly reported [10]. In a footnote added shortly before the press, Bohr wrote that "the arguments of Garbasso are stated very briefly, but seem of a type similar to those of the present paper" [9].

X> >, /f/f

^L — > • * , lit ~.-JL4~^v4.

Figure 3. Niels Bohr to Antonio Garbasso, February 7, 1914 [13].

Notwithstanding Garbasso's minor error, his intervention here was unprecedented: all Italian physicists until then had worked on subjects in classical physics; Garbasso was the first Italian physicist to use the quantum hypothesis in his theoretical research. As emphasized by Giovanni Polvani (then director of the Institute of Physics in Milan) in his historiographical paper on one hundred years of Italian scientists (1839-1939), "Garbasso, better than anybody else [in Italy], was ready to take advantage of the new Bohr's conceptions" [14]. Thus, in order to explain the Stark effect, Garbasso adopted Bohr's model of hydrogen atom before Bohr himself did. However, Garbasso's favorable attitude toward Bohr's model failed to open wide the door of Italian physics to quantum mechanics. The intellectual environment among Italian physicists was not yet ready to such a revolutionary change and, ironically, among the physicists that were not yet ready to accept quantum mechanics figures Garbasso himself! A few months after his theoretical paper, he wrote indeed a paper where he supported a revised version of the classical J.J. Thomson's atomic model, formerly suggested by German theoretical physicist Woldemar Voigt (1850-1919). As stressed by Garbasso:

231 It is of the utmost importance, from the logical viewpoint, that J.J. Thomson's model - from whom Lorentz developed his theory of Zeeman effect, forbids any influence of the electric field upon the emission process. [...] Since 1901 Voigt attempted to generalize Thomson's atom. At that time he supposed that the cubic density of charge was a function of the radius vector rather than constant. From this hypothesis follows that, when an electric field is present, each line gives rise to a pair of lines whose elements are displaced toward the same side [...] with respect to the original line. However, Stark and Lo Surdo discovered symmetrical configuration as regards Balmer series lines. Voigt has recently [1914] took up again his calculations, by supposing a potential shaped as

+b2)c + —k2c3

where a, b, and c are the components of electron displacement [15]. By this ad hoc choice of potential, Garbasso showed in his paper that a revised Thomson's model might account for the experimental observations of a simultaneous action of both an electric field and a magnetic one upon the H a hydrogen spectral line. No mention is made of Planck's theory and the "new Bohr's conceptions". When the first world war broke, the full entrance of the quantum mechanics in the Italian physics was not yet an accomplished fact. 2.

Brunetti and the study of process of X-ray emission

The "Stark - Lo Surdo phenomenon" became the focus of interest at Arcetri in the prewar years and during the conflict. Around 1915, the Arcetri Institute was "a prolific center for [physical] studies, and the spectroscopy - i.e. the most immediate tool for studying the major contemporary issues - was kept there in great esteem" [16]. Soon after Lo Surdo's discovery and Garbasso's theoretical papers, further experimental works on this subject were indeed carried out by other Garbasso's assistants. Among the topics covered were electric field effect on more lines of Balmer series, polarization conditions, possible regularities with the number of order of the series lines, and Stark effect in helium. Much more importantly, one among the young Arcetri experimental physicists namely, Rita Brunetti (1890-1942) - had a key role in making Garbasso's occasional use of quantum mechanics not an isolated effort. Rita Brunetti had obtained her degree in experimental physics from the University of Pisa in 1913 with a thesis in spectroscopy. Two years later she was appointed as assistant in Arcetri under Garbasso supervision. Under the war, Garbasso went to the front as a voluntary lieutenant of the Corps of Engineers, where he set up an efficient phono-telemetric service. During his - and most of

232

Arcetri assistants' - leave, Brunetti took over the management of the institute. In 1928 she eventually became the first Italian woman in acquiring the direction of an Institute of Physics (in University of Cagliari). Her relevance as regards the full entrance of quantum mechanics topics in Italian physics lies in the fact that after her original works on the Stark effect, she worked - between 1919 and 1920 - on the process of X-rays emission through the help of quantum hypotheses. As a consequence of these researches, she discovered a post-cathode emission caused by the electronic ejection (1920) and she verified the emission law of the radiations characteristic of various metals. She eventually arrived at a theoretical interpretation of dependence of continuous background polarization of the applied voltage (1926). Since 1918, she had - very cautiously - hinted at the 1913 Bohr's quantization of energy in her paper concerning the behavior of high frequency spectra in a magnetic field [17]. While she readily set limits to the use of quanta in physics - it was the first time for the last five years since an Italian physicist looked at the quantum mechanics. Quantum theory obtained the deserved consideration only one year later, when Brunetti wrote her first paper on X-rays emission (1919) [18]. In this paper, Brunetti adopted a definite position in favor of the quantum theory and the collection of experimental data tended toward this goal. As an example, in her introduction she wrote: The theory concerning the mechanism of emission of radiant energy out of an oscillator has several interesting applications for the field of high frequency radiations. According to this theory, the oscillator might emits etheric waves only in such a way that the energy expended for their formation is a whole multiple of an elementary quantity whose value is hv, if v is the frequency of the emitted waves and h = 6.55 x 10~27 is a universal constant. In Brunetti's approach the quantum mechanics is seen as a very efficient theoretical tool for explaining the spectroscopic data. No attention is paid to the conceptual problems posed by this tool. By this approach, Brunetti shared both Garbasso's instrumentalist methodology and the experimentalist tradition of the Italian physics. Most importantly, Brunetti shared Garbasso's interest for spectroscopy issues. However, differently of Garbasso, she did not turn back to the old models, as shown by her 1921 review paper on the atomic nucleus, where Bohr's theory is held in high esteem [19]. At the beginning of the new decade, skepticism on quantum mechanics was still dominant among the Italian institutes of physics in Italy, however Garbasso's and Brunetti's efforts were soon to be vindicated by a young student that was going to graduate in physics a few kilometers away from Arcetri, at the prestigious Scuola Normale Superiore of Pisa. His name was Enrico Fermi.

233

3.

Fermi, the "quantum mechanics propagandist"

In 1918, Enrico Fermi (1901-1954) won a fellowship of the Scuola Normale. He spent four years at the University of Pisa, gaining his doctor's degree in physics in 1922, with Luigi Pucciand, then director of the Institute of Physics and former Garbasso's assistant in Arcetri. Although Puccianti had brought important contributions to the field of experimental spectroscopy when he worked in Arcetri, as of his Pisa directorship he was no longer involved in modern physics research. As there was a substantia] lack of courses on quantum mechanics and other modern physics subjects in Italian universities, one may wonders where did Fermi find instruction and guidance. The answer is that, as it is well known, Fermi was largely self-taught since his high school years. The early - and lonely - attention paid by Fermi to modern physics topics, and in particular to spectroscopy and quantum mechanics literature, is supported by a set of recently surfaced juvenile notebooks, currently preserved at the Domus Galilaeana in Pisa (together with other Fermi's manuscripts and notebooks relative to the scientific activity carried out by Fermi during his life in Italy) [20]. One of these notebooks, entitled "Riassunto di memorie di fisica" {Summary of papers on physics), contains a collection of summarized papers written by other researchers and original papers by Fermi himself in his younger years (figure 4) [21]. Among the others, figure papers by Einstein, Richardson, Sommerfeld, Laue, Debye and Levi-Civita. Most importantly, figures a summary of the famous Bohr's 1913 Philosophical Magazine paper on atomic spectra [8].

Figure 4. Enrico Fermi's Italian language synopsis of 1913 Bohr's paper on atomic spectra [21].

234

Another juvenile notebook, preserved at the Enrico Fermi Collection, University of Chicago, throws further light on the extent of Fermi's knowledge on the old quantum mechanics subject. This notebook, titled "Alcune teorie fisiche" (Some physical theories), was written between July 12, 1919 and September 29, 1919, i.e. shortly before his second year as physics student [22]. Its 102 pages are packed with notes and bibliographies on several branches of modern physics; e.g., chapter 2 (pp. 29-57) is devoted to the electronic theory of matter, whose bibliography has an extended section on spectroscopy (papers by Voigt, Stark, Bohr, Garbasso, etc.); chapter 3 (pp. 58-66) is devoted to Planck equation and blackbody radiation. On January 30, 1920 he wrote to his close friend Enrico Persico (letter preserved by the Niels Bohr Library in Washington, D.C.) that he had "took up again the study of the progresses happened in physics during the war". Furthermore, Puccianti had charged him to deliver lectures on quantum mechanics. Among the subjects to be touched by Fermi in his lecture was the "Stark - Lo Surdo phenomenon". "Little by little", Fermi wrote, I am becoming the most influential authority at the Institute of Physics. And more than that, one of these days / am going to deliver a conference on the quantum theory - about which I am always a propagandist - before a gathering of tycoons [emphasis added] [23]. As of May 1920 he had a complete mastery of the Bohr-Sommerfeld model, as shown by another letter to Persico where he hinted at his solution of a theoretical spectroscopy problem [24]. During the same year he likely carried out an in depth study of Sommerfeld's Atombau und Spektrallinien, that eventually became for several years the reference textbook on quantum mechanics. During the fall of 1920, Fermi's fellows at Pisa (Franco Rasetti and Nello Carrara) came to recognize his immense superiority in the knowledge of mathematics and physics and "henceforth regarded him as their natural leader, looking to him rather than to the professors for instruction and guidance" [25]. During his third and fourth Scuola Normale years, Fermi published his first theoretical papers. These ones were mostly devoted to electromagnetism and relativity problems, such as the first one (January 1921) dealing with the inert mass of a rigid system of electric charges [26]. The first lasting contributions of Fermi to quantum mechanics derive from a group of papers on analytical mechanics which Fermi completed during his stay in Gottingen, one year after the graduation. In particular, P. Ehrenfest, who had delved deeply into the foundation of statistical mechanics, was impressed by 1923 Fermi's proof of the ergodic theorem [27]. At last, in 1926, while teaching theoretical physics at the

235 University of Florence, he published his celebrated paper on the statistical mechanics of particles obeying the Pauli exclusion principle (fermions) [28]. This work immediately won him international fame, since Sommerfeld recognized its revolutionary significance for understanding the properties of conduction electrons in metals and many other phenomena. Quantum mechanics was no longer a forbidden topic in Italy.

Figure 5. Some of the protagonists of the entrance of quantum mechanics in Italy from a 1925 photograph: E. Fermi, N. Carrara, F. Rasetti (left to right) and R. Brunetti (2nd row) [30].

4.

Conclusion

As it is well known, the quantum theory originated in an empirical attempt to bring the then current theories of black-body radiation into line with experiment. Of vastly greater significance were the successes which the postulate e = hv had - during the first decade of 1900s - to explain other phenomena, such as the photoelectric effect and the specific heats of solids. As we have shown, these major developments were largely extraneous to Italian physics, as the Planck's formula apparently "deeply repelled" to Italian physicists conceptions. Thus, Italian Physics delayed in tackling quantum mechanics for several years, and when things changed, the abroad debates on black-body radiation, photoelectric effect, etc. played no role. As a matter of fact, quantum mechanics entered Italian physics only in 1913, i.e. when this field was seen as a fruitful tool to deal with experimental spectroscopy subjects. This spectroscopy tradition of research had indeed an old and glorious heritage, dating back to Padre Angelo Secchi stellar spectroscopy and to what was likely the first professional society devoted to astrophysics, namely the Societa degli Spettroscopisti Italiani (1871).

236

Among the major protagonists of this tradition during the first two decades of 1900s - as regards the laboratory spectroscopy - was Garbasso and his Arcetri school. He was indeed first in applying the 1913 Bohr's theory of atomic spectra with the goal of explaining the recently discovered Stark effect. This spectroscopy school kept flourishing during the second decade of the century through Garbasso's assistants, such as Lo Surdo, who improved the experimental techniques for detecting the Stark effect, and Brunetti, who was successful in using quantum mechanics to explain X-rays spectra. Finally, signs of this tradition are also present in the early works carried out by Fermi, whose landmark papers allowed quantum mechanics to fully enter Italian Physics during the first years of 1920s. This contention is indeed supported by some of his juvenile notebooks, preserved both at the Domus Galilaeana in Pisa and at the University of Chicago, where spectroscopy issues and Bohr's theory receive a wide coverage. Acknowledgments We are grateful to Enrico Fermi Collection, University of Chicago, for access to notebook "Alcune memorie fisiche", to AIP Center for History of Physics Niels Bohr Library for access to Fermi-Persico correspondence and to the Accademia Nazionale delle Scienze "detta dei XL" for access to the Archive for History of Quantum Physics. References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14.

O.M. Corbino, N. Cimento 17, 256 (1909). O.M. Corbino, N. Cimento 3, 368 (1912). A. Lo Surdo, Rend. R. Ace. Lincei 22, 624 (1913). J. Stark, Ann. d. Phys. 43, 965 (1914). J. Stark, Ann. d. Phys. 43,983 (1914). J. Stark, Nature 92, 401 (1913). M. Leone, A. Paoletti, N. Robotti, Phys. perspect. 6, 271 (2004). N. Bohr, Phil. Mag. 26, 1 (1913). N. Bohr, Phil. Mag. 27, 506 (1914). A. Garbasso, Rend. R. Ace. Lincei 22, 635 (1913); N. Cimento 6, 338 (1914); Phys. Zeit.. 15, 122 (1914). N. Bohr, Nature 92,554 (1914). Garbasso to Bohr, January 19, 1914, Archive for History of Quantum Physics (hereafter AHQP). Bohr to Garbasso, February 17, 1914, Bohr Scientific Correspondence (2,5), AHQP. G. Polvani, Atti SIPS, 669 (1939).

237

15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30.

A. Garbasso, N. Cimento 9, 376 (1915). Z. Ollano, N. Cimento 19, 221 (1942). R. Brunetti, N. Cimento 16, 5 (1918). R. Brunetti, N. Cimento 18, 266 (1919). R. Brunetti, N. Cimento 22, 215 (1921). M. Leone, N. Robotti, C.A. Segnini, Physis 37, 501 (2000). E. Fermi, Notebook N2, Fermi Archives, Domus Galilaeana, Pisa, Italy. E. Fermi, Notebook "Alcune memorie di fisica", Enrico Fermi Collection, University of Chicago. Fermi to Persico, January 30, 1920, AIP Center for History of Physics Niels Bohr Library. Fermi to Persico, May 30, 1920, AIP Center for History of Physics - Niels Bohr Library. F. Rasetti, in E. Fermi, Note e Memorie (Collected Papers), Vol. 1, Ace. Lincei - Univ. Chicago Press, Rome - Chicago 1961, p. 55. E. Fermi, N. Cimento 22, 199 (1921). E. Fermi, N. Cimento 25, 267 (1923). E. Fermi, Rend. R. Ace. Lincei 3, 145 (1926); Z. Physik 36, 902 (1926). R. Brunetti, N. Cimento 10, 129 (1933). C. Bernardini, L. Bonolis, Conoscere Fermi, Societa Italiana di Fisica, Bologna 2001, p. 338.

THE MEASURE OF MOMENTUM IN QUANTUM MECHANICS FABRIZIO LOGIURATO Department of Physics, University ofTrento, Trento, Italy

38050 Povo

CARLO TARSITANI Department

of Physics, University of Roma 'La Roma, Italy

Sapienza'

The de Broglie relation p - h/X is often used in the heuristic deduction of the Schrodinger equation. Yet, this relation does not appear among the postulates of quantum theory. Actually, in most textbooks the physical definition of the quantum concept of momentum is often neglected. In this paper we show that the definition of momentum as derived quantity, operationally founded on the typical measurement of the so called "flight time", not only fits very well with the physical principles of the quantum theory, but can also help to avoid common ambiguities in the enunciation of Heisenberg's uncertainty principle.

1

Introduction

Complementarity and uncertainty principles are considered as the two chief "pillars" by which the stately quantum edifice is sustained. This edifice appears to be at the same time grand and solid, after so many decades of theoretical, experimental, technological successes. However, it is worth noticing that the two great principles which form the foundations of the theoretical framework, if thoroughly analyzed, appear to be still today somewhat vague and uncertain [1]. We will focus our attention in this paper on the uncertainty principle (UP) and on the "meaning variance" of its different enunciations". In particular, we will only refer to the well-known relation between position and momentum uncertainties. Usually, physics textbooks introduce UP either by describing the classical thought-experiments by Heisenberg and Bohr [3, 4], or by carrying out a formal demonstration (for the one-dimensional case) based on the Fourier theorems or, a

Indeed, under the label "uncertainty principle" one may find not only different physical statements, but also different formal expressions for the functions of the observables, for the relations with the physical states and for the various kinds of measurements [2], Let's take, for instance, the common statement: "it's impossible a simultaneous and indefinitely accurate measurement of a pair of noncommuting observables". Actually, two incompatible observables may share a subset of eigenstates. If the system is in a state belonging to that subset, the values of both the observables can be simultaneously measured as accurately as we want!

238

239 more generally, on the operators' algebra [5, 6]. Ax and Apx are obviously linked to the dispersion of the measured values of the two quantities. It is thus presumed that the measurements are made on an ensemble of identically prepared systems, so that UP is obtained by measuring each time just one of the two observables. Now, the question we want to answer is the following: which is the connection between the experimental procedure we adopt to verify UP, Fourier transforms, and the Heisenberg-Bohr thought experiments? In particular, how the measurements of momentum are performed? Let's consider for instance the well-known experiment of diffraction by a single slit (Fig. 1). A beam of particles is directed towards a wall with a hole in it whose diameter we denote by d. Each particle is prepared in the same state: its momentum has an absolute value p, and its direction is orthogonal respect to the wall. Everybody knows that on a screen, parallel to the wall and at distance L from it, we observe the typical diffraction pattern. Therefore, if we consider the particles as classical corpuscles, we are forced to admit that, while passing through the hole, the momentum of the particles acquires a transverse component.

Fig. 1. The diffraction experiment: the transverse component of the momentum of each particle is measured by means of the position detected on me screen.

The direction of the momentum of the great majority of particles is between the angles 9 and - 9, where 9 indicates the position of the first minimum of the b

In other terms, we can imagine to split the ensemble of identically prepared systems in two equal parts: for the first part we make an "ideal" (infinitely precise) measurement of the position, and, for the second, we make an ideal measurement of the momentum.

240

diffraction pattern. For small 0, we can assume px=p-WL),

(1)

where x is the position of a particle on the screen and L the distance between the wall and the screen. For each particle, the x-coordinate immediately after the hole will be known with an uncertainty Ax -d, and we can assume that the xcomponent of the momentum will acquire an uncertainty Apx » psin 6. So far, our description of the process is based on a particle-model of the quantum system. However, in order to obtain UP we need to introduce the wave model. From wave theory we know that, for the first minimum of the diffraction pattern, we have X = dsinO. So, by means of the de Broglie relation, it is easy to deduce the relation: AxApx « h.

(2)

Let's focus our attention on the quantity "momentum" in the relation (2). First, we have defined it in a purely classical way. Second, its measurement is indirect: actually, we can only measure positions and/or times. In the case of the single slit experiment, we don't need time measurements: we already know the absolute value of/?, so must measure just positions. According to Popper and Ballentine [7, 8], the demonstration of UP presupposes the assumption that, for each particle, both position and momentum are measurable so accurately that the product of the respective uncertainties can be less than h. Indeed, for each particle j which has gone through the hole and it is arrived to the screen, we have A^ = d, and Apx ~ 0. Indeed, this is a consequence of the fact that we have performed a classical measurement of the momentum, based on the measurement of two positions. In order to avoid this apparent paradox, we must make a sharp distinction between the concepts of state preparation and data prediction. In fact, UP states that it is impossible to find an ensemble of systems in a dispersion-free state both for momentum and for position (that is in a state in which all particles prepared with the same momentum can be detected at the same position). By taking into account the single particle j , if we know that AjX ~ d, we can predict that Ajpx ~ h/d. In other terms, after having measured the position of a single system with an uncertainty Ajx, our knowledge of the value of momentum is affected by an uncertainty not less than Apx ~ h/d. Often UP is stated as such: "it is impossible to know simultaneously and with exactness both the position and the momentum of a particle" [9]. However we have shown the ambiguity of this expression: if we don't know position we cannot know momentum. We could state UP in a more correct fashion by saying: "Once the position of a particle is measured, the value of its momentum

241 can be predicted with an uncertainty not less than that established by UP (and vice versa)"; or by saying: "It is impossible to prepare a system or an ensemble of systems which have simultaneously well defined values of position and momentum". Also Feynman, in his Lectures [10], uses an ambiguous statement of UP. However, it's worth noticing that he also writes: Sometimes people say quantum mechanics is all wrong. When the particle arrived from the left, its vertical momentum was zero. And now that it has gone through the slit, its position is known. Both position and momentum seem to be known with arbitrary accuracy. It is quite true that we can receive a particle, and on reception determine what its position is and what its momentum would have had to be gotten there. That is true, but that is not what the uncertainty relations refers to [...] the fact that it went through the slit no longer permits us to predict the vertical momentum. We are talking about a predictive theory, not just measurements after the fact. So we must talk about what we can predict0.

2

The definition of momentum in classical and quantum physics

Let us adopt an "operational" definition of momentum [11]. In classical physics, the momentum of a particle is deduced by the measure of its velocity. The average velocity v^ of the particle is obtained by observing its positions x and XQ in two different instants t and £0dWe cannot adopt the same method in quantum theory because of the wellknown fact that the first observation changes the system's state. However, this shortcoming can be easily avoided by the following procedure. Let Ax be the uncertainty with which we know the initial position of the particle (or the dispersion of the positions of an ensemble of particles, each prepared in the same identical state). Let's also assume that the wave function ^ W . for t = 0, is centered on the origin of the spatial coordinates. If, at the time t, we observe the position of the particle at the point x, its velocity will be vx = x/t, so that its momentum will be px - mx/t. If we suppose to be able to measure both x and t with arbitrary precision, the uncertainty on the measured value of the momentum is Spx - mAx/t. Therefore, for large /, we can reduce the uncertainty of the momentum as much as we want, and the effect of the uncertainty on the initial position is negligible. Let's now calculate the probability distribution P(px) of the momentum after a set of measurements on an ensemble of identically prepared systems. Being px = mx/t, the probability P(px)dpx that the value of the particle's momentum lays between px and px + dpx is equal, according the "flight time"

c d

Also Heisenberg specifies [4]: "the uncertainty relation does not refer to the past". If the particle is free, the average velocity is equal to the instantaneous velocity.

242

technique, to the probability p{x)dx that the particle will be revealed, at the time t, between x and x + dx. The "propagator" for the free particle is given by the following expression: K(x,t;x',0)--

2niht

exp

im{ x — x') 2ht

(3)

The probability amplitude that a particle will be observed in x at the time t, is Hx,t)=

U(x,t;x',0)po(x')dx'.

(4)

From (3) it follows that y/(x,t)

Iniht

exp

2%t

Mi^**-2"" \lf (x')dx<.

(5)

0

According our operational definition of momentum, we can write, for large t, 2

mdx p(x)dx -2Kht

U-^(x«-2xx')

¥o(x')dx'

= P(px)dPx.

(6)

Putting px = mx/t, we obtain: P(px)dpx

_ dpx inh

f

.imx'2 ip x x\

,

(7)

If the initial amplitude y/o(x) is different from zero only in a small region Ac centred on the origin, for times t -» °°, the ratio mAx2/2ht tends to zero: the contribution to the integral of the first exponential will be negligible. Thus, the distribution of momentum is equal to the square modulus of the amplitude:

V2wft J

\

h )

Voix')dx',

(8)

Now,
This proof is essentially due to Feynman [12]. It is worth noticing that, when we passed from the distribution (7) to the amplitude (8) we omitted a phase factor. For a complete proof, see the old book by Kemble [13]. Its first version is due to Kennard [14].

243

1 ^1K%

ilni VA

.

(9)

the corresponding probability amplitude 0(px) for the momentum will be the Dirac's function S(px - h/A), so that the momentum assumes the single value px = h/L All the measurements of momentum, performed on an ensemble of particles identically prepared in the state (9), would always give the same value. The values of momentum for the state (9) don't show any dispersion. Then the amplitude (9) represents a momentum eigenstate whose corresponding eigenvalue is given by the de Broglie relation. In contrast with the great majority of textbooks, there is no need to postulate the de Broglie relation: it can be deduced from the wave functionf. However, we must observe that the above measurement of the momentum eigenstate is an "ideal" measurement. Indeed, the eigenstate (9) is represented by a function which is defined in the entire space and whose square modulus is constant and different from zero. Therefore, the measurement process that we have described so far cannot be performed. Actually, we deal with wave trains. In this case, we can easily show that by increasing the length of a finite wave train, we obtain a narrower and narrower distribution, which tends to a Dirac's "delta function". Let us choose, for instance, as initial probability amplitude, the following finite and normalized wave train of length I: 1 (2m ^ —;=exp —— x \if I x l< +/ if \x\> -I J 0 We obtain, as its Fourier transform, ,jXp '

)=M±[sml[{2ff/Z)-(px/h)y[(2x/X)-(px/h)]).

(11)

\ hi 7t

For / -* co the function (10) becomes gradually narrower around the value px = h/X, where it has a maximum which goes to infinity as the square root of L. Differently from the state (9), in the new state both the function of position (10) and its Fourier transform (11) are normalized and correspond to concretely realizable physical states8. f

For instance, Sakurai introduces the relation in a rather formal way by means of the infinitesimal translation operator and the analogy with the function that generates the corresponding canonical transformation of analytical mechanics [6]. 8 Obviously also the time requested to perform the measurement tends to become infinite, as we can see from equation (5).

244

3

The diffraction by a single slit and the Fourier transform

Let's come back to the single slit diffraction experiment. For the particle at the slit the position probability amplitude is constant: \d~y^if

\x\
0

ix\>d/2

y

The Fourier transform of (12) is: (Px) =(dn.7Ch)m{^\Px{dl2K)}l\px{dl2h)]} .

(13)

Let's analyze the diffraction pattern revealed by the screen. Let's assume that the incident beam is made of plain waves. From the classical wave model, on Fraunhofer's condition (L »d) and for small angles (sin#= 0), the intensity of the waves is: l(d) = I(O){sm{{Kda)6\l[{xdlX)0\}2,

(14)

where 7(0) is a factor corresponding to the central maximum intensity [15]. Hence, the position probability density will be (0= x/L): p(x) = |^0)| 2 {sin[(;r d/A)(x/L)]/ [(nd/A)(x/L)] } 2 .

(15)

We know that, in the classical particle model, the transverse momentum px, which is acquired by the particle while it is going through the hole, is px = p-x/L. By observing on the screen the particle's point of arrival, we will obtain and indirect measurement of its transverse momentum. Therefore the momentum distribution for an ensemble of particles is the following: \(0)|2 {sm[px(d/2h)]/\px(d/2h)]}2.

(16)

The connection between equation (16) and the Fourier transform is now evident: the momentum probability amplitude corresponds to the Fourier transform (13). So, the connection is established between the deductions of UP from Heisenberg's experiment and from the method of Fourier transforms. The diffraction pattern on the screen is deeply connected with the Fourier transform of the wave amplitude on the slit: it is nothing else than its square modulus. This is a widely known result in classical optics [16], but it is often missing in quantum mechanics textbooks. The "time of flight" and the "point of arrival" measurement techniques are equivalent. Both, in fact, adopt the classical particle model for quantum systems. Let t be the time spent by the particle to go from the slit to the screen. Given that

245

px = mx/t = m(L/t)(x/L)=pix/L),

(17)

for the time of flight we get: t = mXLIh.

(18)

Since Ax ~ d, and since the wavelength X has the same order of magnitude, for L» X the first term in the exponential in the expression (7) is negligible. Again, we obtain the momentum distribution amplitude as the Fourier transform of the position wave function. The condition L » X is certainly satisfied if we adopt the Fraunhofer's approximation11. We have just seen that the dispersions Ax and Apx are connected by the Fourier's transforms. Yet, why the "classical" formulation of UP as Ax Apx > h/2 and the relation (2) of the first section appear so different? The answer is simple: as Uffink and Hilgevoord [18] have noticed, the reason is that both Heisenberg and Bohr, in their though-experiments, adopt for Ax and Apx definitions which cannot be interpreted as variances. According to Uffink and Hilgevoord, a good quantitative measurement of the uncertainty cannot be given by variance. In fact, in the single slit experiment, by applying such a definition to the distributions \y/(x)? and \<j> (px)\2, we get Ax = d/^/Vl and Apx = °°, so that the uncertainty on momentum does not depend on the width of the slit. The reason is that the function \i//(x)\2 is characterized by sharp boundaries and goes discontinuously to zero. Therefore Uffink and Hilgevoord are forced to introduce a different definition for uncertainties. For Braginsky and Khalili [2], the difficulty does not depend on the above definition: it is sufficient to consider more realistic measurements by eliminating the discontinuities of the distributions. According to their point of view, if we adopt the definition of uncertainty as "variance" and, at the same time, if we take a slit with "smooth" boundaries (we describe the distribution I i/Kx)\2 as a Gaussian function), we get again the usual expression for UP. However, we must underline a conceptual difference. In the usual expression of UP, the uncertainties do not refer to the measured values of the observables but are regarded as intrinsic features of the prepared state. On the contrary, the relations (2) can be interpreted as uncertainties produced by the detector in the measurement process of the single particle state. By measuring position with an uncertainty AxD, we create an uncertainty on the previous value of momentum which is at least ApxD = h/2AxD. So, we can symbolically distinguish the two kinds of uncertainty relations by introducing two different

h

For further details and for some applications of the classical definition of momentum in quantum mechanics see Ref. 17.

246

notations': Sxvbspxv>hl2 4

AxDApxD>h/2.

(19)

Conclusions

A clear presentation of uncertainty relation position-momentum is deeply influenced by the use of the wave and the particle model. In quantum mechanics, if we want to define physical concepts by means of an operational approach, momentum can only be defined in terms of position and time measurements. Therefore the operational definition must be essentially "classical". For this very reason, we are forced to adopt a definite model of the system we deal with. We think that this example show how, by paying more attention to the way the concepts are used in quantum physics, we could avoid misconceptions and prevent the confusion between verbs like to predict, to prepare and to measure. Moreover, by using the classical definition of momentum we can get both the de Broglie relation and what is already known from classical optics: the diffraction pattern, interpreted as the particle distribution amplitude, is just the Fourier transform of the particle amplitude at the slit. The deep connection between the heuristic deduction of UP by means of the usual thought experiments and its formal deduction by means of Fourier transforms is often missing in textbooks. Acknowledgements It is a pleasure to thank Prof. Stefano Oss for his kind support, and Dr. B. Danese, Dr. S. Defrancesco and Dr. L. M. Gratton for valuable discussions. References 1. M. Jammer. The Philosophy of Quantum Mechanics (Wiley, New York, 1974). 2. M. G. Raymer, Am. J. Phys. 62, 986 (1994). 3. W. Heisenberg, Z. Phys. 43,172 (1927).

' A new class of measurements, which combines the uncertainties (19), can be defined [2]. We perform simultaneous measurements, which are of course inaccurate, both of the position and of the conjugate momentum for each member of the identically prepared set of systems. It is possible to show that the squares of the statistical dispersions sum up with the squares of the uncertainties due to the measurements. Then it is easy to show the validity of the relation Ax]»DApi»*) - ^> s o m at the inferior limit of the product is two times the usual one.

247

4. W. Heisenberg, The Physical Principles of the Quantum Theory (University of Chicago Press, Chicago, 1930). 5. S. Gasiorowicz, Quantum Physics (Wiley, New York, 1974). 6. J. J. Sakurai, Modern Quantum Mechanics (Addison-Wesley, Reading, MA, 1985). 7. K. R. Popper, Quantum Theory and the Schism in Physics, from the Postscript to the Logic of Scientific Discovery (Routledge, London, 1992). 8. L. E. Ballentine, Rev. Mod. Phys. 42, 358 (1970). 9. M. Alonso and E. J. Finn, Fundamental University Physics, Vol. 3 (Addison-Wesley, Reading, MA, 1972). 10. R. P. Feynman, R. B. Leighton and M. Sands, The Feynman Lectures on Physics, Vol.3 (Addison-Wesley, Reading, MA, 1989). 11. P. W. Bridgman, The Logic of Modern Physics (Macmillan C , New York, 1927). 12. R. P. Feynman and A. R. Hibbs, Quantum Mechanics and Path Integrals (Mc Graw-Hill, New York, 1964). 13. E. C. Kemble, The Fundamentals Principles of Quantum Mechanics (Dover, New York, 1958). 14. E. H. Kennard, Phys. Rev. 31, 876 (1928). 15. F. A. Jenkins and H. E. White, Fundamentals of Optics (Mc Graw-Hill, New York, 1957). 16. P. M. Duffiuex, The Fourier Transform and Its Applications to Optics (Wiley, New York, 1983). 17. F. Logiurato, The Concept of Momentum in Classical Physics and Quantum Mechanics, unpublished (2005). 18. J. B. Uffink and J. Hilgevoord, Found. Phys. 15, 925 (1985).

ON THE TWO-SLIT INTERFERENCE EXPERIMENT: A STATISTICAL DISCUSSION

M. M I N O Z Z O Department

of Economics, Finance and Statistics, University Via A. Pascoli, 06100 Perugia, Italy E-mail: [email protected]

of

Perugia,

In the last decades, many two-slit interference experiments have actually been performed as sequential experiments by sending as few as possible particles at a time through some interfering apparatus. In this work, under the usual axioms for probability theory of Kolmogorov, a novel purely particle statistical approach to the analysis of these interference experiments is proposed by explicating the sequential nature of the experimental observations.

1. Introduction The origins of the two-slit experiment go back to Young's experiment in 1803, in which a beam of visible light separated by two pinholes combines to form an interference pattern. At the beginning of the present century interference experiments were also performed for X-rays, in 1912, and for beams of electrons, in 1927. More recently, the groups of Tsuchiya et al. x and of Tonomura et al. 2 , e. g., carried out, for photons and electrons respectively, two-slit experiments in which not beams, but particles, one after the other, were sent through the interference device. Since their discovery, all these and other interference phenomena have always been regarded as the strongest empirical evidence in support of wave-like theories in quantum physics. Indeed, these phenomena, together with diffraction phenomena, are the only direct evidence of the wave-like behaviour of light and matter. Despite purely particle realizations of the two-slit experiment have been performed, up to now only a few purely particle-based explanations have been attempted, and in the main-stream theoretical developments interference experiments are always explained by appealing to some sort of wavelike entities, might these be called waves, fields or strings. In this work, which is a development of ideas already presented by the author elsewhere, 3 after formulating in Sec. 2 our ideal purely particle two-slit experiment in a

248

249 Barrier

Source

II U fj

x

Screen

y

IT

(b)

Figure 1.

(c)

Two-slit ideal experimental set-up.

sequential statistical setting under the usual axioms for probability theory of Kolmogorov, we consider a particle-based toy model in Sees. 3 and 4 which explains the interference pattern in a purely statistical fashion, without using any wave-like mathematics or concept. In Sec. 4 we also show that this model accounts for the non-additivity paradox which usually arises by comparing the interference pattern with the patterns obtained by closing one slit at a time. 2. The Two-Slit Interference Experiment Let us introduce the classical idealized version for particles of the two-slit interference experiment. For our purposes we will not make any distinction between different types of particles, such as photons, electrons or neutrons. A particle could be whatever entity with the property of being well localized in a small region of space and of moving from one place to another in a continuous way without splitting anywhere. As usual, despite real experiments are three-dimensional phenomena, our discussion will focus on the usual two dimensional geometrical section. The two-slit ideal experimental set-up (see Fig. 1) consists basically of three different elements: a source of particles, a barrier with two slits and a screen. The whole two-slit interference experiment is then made up of three distinct experiments: a first experiment with both slits open, a second experiment with only the first slit open, and a third experiment with only the second slit open. Consider first the experiment with both slits open. When the experiment starts, particles are sent out of the source sequentially, one after the other, towards the barrier. Whereas many stop against the barrier, some particles find their way through the slits and go to finish on the screen where their position is recorded. When enough particles have arrived on the screen, the histogram of their positions will start to resemble the clas-

250

sical interference pattern described by the contour (a). On the other hand, considering the experiment with only the first slit open, sending a particle after the other towards the barrier and onto the screen we obtain an empirical histogram which approximates something alike to the contour (b), whereas with only the second slit open we obtain something alike to the contour (c). We come here to the essence of the problem. Whereas the histograms relative to the experiments with only one slit open are in accordance with what we would have expected, the histogram obtained with both slits open, against our common sense experience, is not the uniform mixture of the other two, and shows instead a pattern which is similar to the distribution of intensity of a wave. According to orthodox views, it is in this sense that light, as well as matter, behaves "sometimes like a particle and sometimes like a wave", and it is in this sense that we observe the so-called phenomenon of the "self-interference" of a single particle. From a purely particle viewpoint, the "non-additivity", in the sense of non uniform mixture, of the three histograms seems to give rise to a paradoxical situation. In fact, this is in contrast with what we would have expected assuming that the barrier acts only as a static selection rule. However, all three experiments are actually performed sequentially, in a long single run, and this assumption is equivalent to assume that before each particle is sent out of the source the whole experimental set-up can be considered as if it had been completely resetted in the same identical initial conditions, that is, it is equivalent to rule out all the interactions that could have occurred (in particular those between the particles and the barrier) up to then. On the other hand, the non-additivity paradox disappears as soon as we abandon the hypothesis of a barrier acting as a static selection rule and we recognize that the three histograms are obtained in three distinct experiments in which different, dynamic or not, effects can take place. Thus, let us consider the following modeling framework. Each of the three experiments can be represented on its own probability space ($7, J-, P) where the set Q represents the set of all possible realizations w G f l o f the experiment. For each of these probability spaces, to account for the position of the particles at the barrier we consider a stochastic process (Xn). The position at the barrier of the nth particle in a given experimental realization u> is represented by xn = Xn(u>). To represent the position of the particles at the screen we consider a stochastic process (Yn) assuming values in the extended real line R U o o . For any given experimental realization OJ, we assume yn = Yn(u>) equal to oo if the nth particle does not pass the barrier, and equal to some real value, representing the position at the screen of the

251 particle, if the nth particle does indeed pass the barrier. For this framework we use the following ordinary interpretation of the concept of probability. Let us imagine to have a very large, ideally infinite, number of ideal laboratories in the same identical conditions, where, in each of them, we have an experimental set-up identical to the one in Fig. 1. Also imagine that in each laboratory it is performed a single run of length N of one of the three experiments, that is, N particles are sequentially sent out of the source one after the other. Then the probability P(Xn < x), x 6 R, for some given n = 1 , . . . , N, will represent the proportion of the laboratories in which the nth particle in the run had a position xn < x. For the experiment with both slits open, the distribution of the position of the particles at the screen in a given experimental realization u £ Q. of length N is given by the empirical distribution function, for y < oo, F(y;N) = # { n : n < N,Yn{u) < y}/#{n : n < N,Yn{w) < oo}, where # denotes cardinality. Considering a very long, ideally infinite, run, the form of the interference pattern is represented by the limiting empirical distribution function F(y) = limjv^oo F(y; N), for y < oo. For the other two experiments with only the first or the second slit open respectively, we similarly define empirical distribution functions and limiting empirical distribution functions, for which we have, for y < oo, F'(y) = lim^v^oo F'(y; N) and F"(y) = limjv-^oo F"(y; N). Of course, these, finite or limiting, empirical distribution functions, representing the proportion of particles that in a given run, among all particles that passed the barrier, have a position Vn < y, are random quantities and not probabilities.

3. A Dynamic Model with Memory Effects at the Barrier To show the possibilities of our approach we shall now discuss, with no attempt to provide any physical theory, a toy model in which the barrier is allowed to dynamically interact with the particles. For the experiment with both slits open, let us suppose that the emission of particles by the source is governed by a stochastic process such that particles arrive at the barrier independently and identically uniformly distributed over a finite interval covering the two slits. Denoting with m' and m" the position of the middle of the two slits and with 2v'0 and 2V'Q their widths, we assume that Xn, n = 1,2,..., are independent and identically uniformly distributed over a finite interval [—6, b] well covering [m' — v'0, m" + V'Q]. To specify the interaction between the particles and the barrier, we assume that at each instant of time n = 1,2,..., each slit can be in one

252 of three states: 0, 1 and 2, say. Denoting with S'n and S% the states of the two slits at time n, the barrier as a whole can be described, at any given instant of time n, by the pair (S^S^), which can be in one of the 9 combinations (0,0), (0,1), (0,2), . . . , (2,2). Then particles close to the barrier are assumed to be subjected to two effects taking place before (attraction) and after (redirection) the barrier respectively. To describe the attraction effect, we imagine that for each slit a funnel, with a width depending on the state of the slit, "captures" the particles coming from the source. Since the net result of the funnels would be that to capture some particles that were originally meant to hit the barrier and to funnel them through the slits, the total effect of the funnels can just be described by a fictitious widening of the slits. Denoting with Wn and W£, n = 1,2,..., the fictitious half widths of the two slits, we assume the simple linear relationships W'n — v'0 + cS'n and W£ = V'Q + cS%, where c is some real constant. Since the two slits can be in three different states, the half widths of the two slits can take three different values which we denote with v'0, V[, V'2, and V'Q, V", V'^. (TO avoid to take account of what happens when the two widenings overlap, we will deal with widenings small compared to the distance between the two slits.) Particles passing the barrier are also subjected to a redirection effect. Looking again at the net result, this effect is responsible for the position of the particles at the screen. For the first slit (and similarly for the second slit) we assume that if the nth particle is meant to pass through it and its position at the barrier is Xn, then its position at the screen is given by f Xn + d((Xn - m')/W^, In

S4),

\Xn-d{{m'-Xn)IW^S'n),

where the "drift" function d(Xn,S'n), (*J3,

if

(Xn - m') > 0,

if (Xn-m')<0,

{

>

for 0 < Xn < 1, is defined by _

3 *(y»,^)=u«2 sB; ,v. ( i + (, n 2^ < - i )^ ),

if S'n = 0, «« o if s'n := ,1,2.

(2)

From Formula (1), it is easy to see that the regions of the real line where the first derivative of d(Xn,S'n) with respect to Xn is close to 0 correspond to the locations on the screen where the particles accumulate more. To describe how particles interact with the barrier, indicating with S'n = [rri - W'n; m' + W'n\ and S£ = [m" - W%;m" + W^}, n = 1 , 2 , . . , the (fictitious) openings of the two slits, and assuming to start the experiment (with both slits open) at time n = 1 with the barrier in state S[ = 0 and <S" = 0, we make the following assumptions:

253

i) i£XnGS'n, then 3 , + 1 = S'n - 1, and S^+1=SH + 1; ii) if Xn e S£, then 5 ; + 1 = S'n + 1, and S^'+i = S% - 1; iii) if X n 0 5 ; U 5;', then S ; + 1 = 54, and S^'+i = <^'We also assume that if the states of the two slits cannot decrease or increase, because already equal to 0 or 2, then they remain the same. Considering, e. g., assumption i) (assumptions ii) and iii) having a similar interpretation), if the nth particle passes through the first slit, then the (n + l)th particle will find the state of the first slit decreased by a unit and the state of the second slit increased by a unit. 4. Interference Pattern and Non-Additivity Paradox Now, for the experiment with both slits open, let us define, for the position of the nth, n = 1,2,..., particle at the barrier, the conditional distributions U!(x) = P(Xn < x\Xn e S'n,S'n = i) and U?(x) = P(Xn < x\Xn e S^S^ = j), for i = 0,1,2 and j = 0,1,2. Then, considering the transformation determined by formula (1), we define the conditional distributions of the position of the nth particle at the screen as G'^y) = P(Yn are symmetric, centered around the middles m' and m" of the respective slits, and, apart from g'0(y) and g'o(y), also bimodal. Then considering the first slit we have P(Yn
=

i,S';=j)

P{Yn
for i,j = 0,1,2, and, similarly, for the second slit we have, for i, j = 0,1,2, P{Yn
=

i,S^=j)

S':, S'n = i, S% =j) P(Xn e S^\S'n = i, S£=j) = G'J(y)2v'>/2b.

With these conditional distributions we can then write, for i, j = 0,1,2,

__ P(Yn
= i,S'Jj =3)

Gi(y)^+^'(«)^]=^[GI(yM+^(y)«}q, 1

J

254

Figure 2.

States of the two slits for the experiment with both slits open,

and so, for i, j = 0,1,2,

P(Yn
= i,s': = j)

= P(Yn < y\Xn eS'nU S';, S'n = i, S': = j)

p(xn es'nusz\s'n = i,sz = j)P(s'n = i,s;' = j) = K + v'!)-1 [G'SlK + Gi'{y)v'>} (2„< + 2x,;)(26)- 1 P(<S; = », S^ = j). Moreover, for i, j = 0,1,2, we can also write

p(xn e s; us;, s'n= i, s';=j)=p{xn

€

s; us;'|<s;= i, s':=j)p{s'n= i, s;=j)

= l(2v> + 2v'>)/(2b)]P(S>n =

i,S';=j).

Thus, considering that the evolution of the state of the barrier (S'n, S'^) is as described in Fig. 2, and so that the actually admissible states for the barrier are only the states (0,0), (0,1), (0,2), (1,0), (1,1), (2,0), the distribution of a particle at the screen, for a given n = 1,2,..., is given by

p(Yn
es;us;)

(3)

= i, S<; = j)

E L o E ' I O P{Xn &S'nU S", S'n = i, S» = j)

E t o E 2 :^ [G'dvX + G'!(y)v'!] PQS; =», s ; = j) E L E?=OK + <)P(S; = i, ^ = 3) Let us now consider to evaluate the distribution of a particle at the screen for n large. The evolution of the state of the barrier forms a discrete time, discrete state, homogeneous Markov Chain. Starting with the barrier in state (S[ = 0,<S" = 0), after some particles have passed the barrier, the Markov Chain will have left forever, with probability one, every transient state for the set of non-transient states at the bottom of Fig. 2. For this set

255

of non-transient states, for every n = 1, 2 , . . . , the (homogeneous) transition probability matrix is given by

P=

P02,02 P02,ll

0

Pll,02 P l l . l l

Pll,20

.

0

P20,ll

P20.20.

where, for i',i",j',j" = 0,1,2, piV/Jlj,^P(S^+1 = j',S^+1 = j"\S'n=i',Si:= i"), are the probabilities of transition from state (i',i") to state (j',j"), which, depending on the fictitious widths of the slits, are given by P02,oa = P ( - x ' » g K -v'Q;m' +v'Q}{J[m" -v a ';m"+v' 2 \) = l-(v'0+v'2)/b, , , ,, p02Ai = P(Xn€[m'~v o]m'+v^) + P(Xn€[m '-v^m +v^)==(v'0+v^)/b, Pu,02 =

P(Xn€[Tn'-v'1;m'+v'1])=v'1/b,

Pn,n^P{Xn^[m'-v[;m'+v'1]U{w,"-v'{;m"+v'{})=l-(v[+v'{)/b, Pn.ao = P(Xn e [ m " - < ; m"+v'(}) =,v[/b, P20,u = P(Xn G [m'-v'2; m'+v'2})+P(Xn P20,20 =

6 [ m " - i # ; ro"+i#]) = (v2+v%)/b,

P(Xn$lm'-v2;m'W2}\J[m"-vZ-m"Wo})='L-{v2+vZ)/b.

Moreover, for these non-transient states, the limiting probabilities of recurrence n02 =\imn-f00P(S'n = 0 , 5 ^ = 2), 7r u =lim„_ >00 P(54 = 1,5^' = 1) and 7T2o = limn^ooP(<54 = 2, S^ = 0) can be obtained by solving the usual system of equations 7T02 = 7T02P02,02 + 7TllPll,02,

7Tn = 7T02P02,11 + ^ " l l P l l . l l + ^20^20,11,

7I"20 = 7TllPll,20 + 7T20P20,20,

7T02 + 7Tll + ^20 = 1-

Thus, finally, considering n —> oo, for the distribution of a particle at the screen for the experiment with both slits open we have lim P(Yn < y\Xn e S'n U S£)

(4)

n—»oo

^Ng^(^)+i1+^G^(j/)+
= i}/#{n:n
= i},

and, for y < oo, for j = 0,1, 2, Gj'(y;iV)=#{n:n
256

Then, the empirical distribution function of the positions of the particles at the screen for a run of the experiment of length JV is given by, for y < oo, p(v. l2/

'

m - ( W # { " ••n
S^} l

S£}

'

_ E L o E J t o ( l / ^ ) # { " = n < JV, Yn < y, Xn £ S'n U S£, S'n = i, S£ = j} E l o E - = O ( W ) # { « :n
S», 5 ; = i, S» = j }

'

where, for i, j = 0,1,2, for y < oo, and for n < N, (l/JV)#{n : y„ < y, X„ S 5 ; U 5;', 5 ; = i, S'^ = j } _ # f n : X , < y , ^ e ^ , ^ = i , ^ / = j } # { n : ^ < y , J k €%',.%=», $'=.7}

^ " \

#{n:sk =

+G'J(y;N)

itS»=j}

JV # { n : X n G 5 ; ' , 5 ; = i,SZ=j}\ / # { n : S ' n = i,S^=j} #{n:S^ = i,S»=j} J\ JV

and, for i,j = 0,1,2, for y < 00, and for n < JV, (l/iV)#{n : Xn e S'n U S^'.s; = i,S£ = j} ' # { n :XneS'nU S£, S'n = i, S£ = j } \ / # { n : S ; = i, S^ = j} #{n:S^=i,SZ=j] J\ JV Now, considering an infinite run of the experiment, the distribution of the positions of the particles at the screen is given, for y < 00, by the limiting empirical distribution function F{y) = limN-^oo F(y; N). In general, different infinite realizations would lead to different limiting empirical distribution functions, however, for the present model it is easy to see, by a repeated application of the strong law of large numbers, 4 that F(y) = linin-^oo P(Yn < y\Xn € S'n U S'^), almost surely, that is, for a set, of infinite realizations of the experiment, having probability one. So, for a given single run of the experiment of length JV, with JV large enough, we would expect the contour of the histogram of the positions of the particles at the screen to assume a pattern close to the probability density (v'Q + V 2 > 0 2 + K + <)""ll + («2 + V0 )71'20 An evaluation of the empirical distribution function F(y; JV) has been obtained by simulating a single run of the experiment. For ml = —2.1, m" = 2.1, v'0 = 0.1, V'Q = 0.1, c = 0.2, and Xn, n = 1,2,..., independently and identically uniformly distributed over [—3,3], Fig. 3 (left) shows the

257

Figure 3. (Left) Histogram of the positions y„k, k = 1 , . . . , 10000, of the particles a t the screen for the simulated experiment with both slits open. (Right) Histograms (shown together) of the positions ynk, k = 1 , . . . , 5000, of the particles at the screen for the two simulated experiments with only the first or the second slit open respectively.

histogram of the simulated positions of a sequential record of 10,000 particles arrived at the screen. Let us note that this histogram, which follows the pattern of the density (6), has been obtained by grouping together all the positions of the particles, from the 1st to the 10,000th, arrived sequentially at the screen in a single run, and not, in particular, by grouping together 10,000 values y^ (for the particles actually arrived at the screen) obtained by simulating a large number of runs of the same given length N. Let us now consider the experiment with only the first slit open. Starting the experiment with S[ = 0, and considering that, if a particle passes through the open slit, that is, if Xn e S'n, then S^+1 = S'n — 1, and that, if Xn 0 S'n, then <S^+1 = S'n, for this experiment we have that S'n = 0, n = 1,2,.... Thus, for n = 1, 2 , . . . , for the distribution of a particle at the screen we have P(Y„ < y\Xn € S'n) = P(Y„ < y\Xn £ S'n,S^ = 0) = G'0(y). On the other hand, the empirical distribution function of the positions of the particles at the screen for a run of the experiment of length N, for y < oo, is given by F'(y, N) = #{n: n < N, Yn < y, Xn e S'n}/#{n :n
(7)

Whereas, considering a run of infinite length, for the limiting empirical distribution function F'(y) = limjv—oo F'(y; N), y < oo, of the positions of the particles at the screen we have F'(y) = G'0(y), almost surely. Finally, consider the experiment with only the second slit open. Starting the experiment with S" = 0, and considering that, if Xn € S£, then S^'+i =

258

S% - 1, and that, if Xn <£ S%, then S%+1 = S%, we have that S£ = 0, n = 1,2,.... So, for n = 1,2,..., P(Yn < j/|X„ e 5;') = P(Yn < y\Xn e Sn,S„ = 0) = Go(2/)- Then the empirical distribution function of the positions of the particles at the screen for a run of the experiment of length TV, for y < oo, is given by F" (y;N)=#{n:n<

N, Yn
S;'}/#{n: n < N, Xn € S^}

_ #{n:n
(8)

= 0} _ ~„ = 0} -^(V^)-

And, considering a run of infinite length, for the limiting empirical distribution function F"(y) = limjv->oo F"(y; N), y < oo, of the positions of the particles at the screen we have F"(y) — G'^y), almost surely. Thus, performing an infinite run with each of the three experiments we would obtain, for every p e [0,1], F(y) ^ pF'(y) + (1 — p)F"(y), (—oo < y < oo), almost surely. In other words, considering a very long typical realization from each of the three experiments, we do not have to expect the empirical distribution function of the positions of the particles at the screen obtained with both slits open to be a mixture, and in particular a uniform mixture with p = 1/2, of the empirical distribution functions obtained with only the first or the second slit open respectively. Keeping the same parameter values used for the generation of the data in Fig. 3 (left), but considering that now only one slit is open, we numerically evaluated by simulation the empirical distribution functions F'(y; N) and F"(y; N). Fig. 3 (right) shows together the histograms of the positions of two sequential records of 5,000 particles arrived at the screen obtained by simulating a single run from each of the two experiments with only one slit open. 5. Conclusions In our investigation, even if we have not been concerned with any specific real set of experimental observations, emphasis has been given to the modeling of actual experimental data and not to the reproduction of the standard theoretical results of quantum mechanics. Indeed, the present investigation, although not in disagreement with classical published data, would suggest for the gathering and analysis of new experimental data from freshly made purely particle sequential experiments. For these experiments it would then be possible to investigate, among other things, also sequential properties (and so to discriminate between different models) that cannot be accounted for by the classical calculus of wave functions of quantum mechanics. For instance, under our model, even if the positions of the particles

259

Figure 4. Plot of the positions y„k versus ynk_1 of successive particles at the screen for the simulated experiment with both slits open.

at the barrier are not sequentially dependent, the dynamic interaction of the barrier with the particles, when the two slits are b o t h open, implies t h a t they are nevertheless dependent at the screen. Analysing the simulated sequential d a t a of Fig. 3 (left), in Fig. 4 it is shown the plot of ynk versus ynk_1, t h e positions of two successive particles at t h e screen, and the p a t t e r n of empty and filled crossings indicates t h a t there is a dependence at the first time lag. Let us observe t h a t even if actual experimental observations would not show any temporal dependence, this confutation could only be discussed in our framework and not with the q u a n t u m mechanical calculus. Whereas in this work we showed how to tackle interference experiments (similarly we would proceed for diffraction experiments), in Ref. 5 it is shown how to account in a purely particle sequential statistical fashion for t h e correlation experiments associated to the Bell's inequalities and t o the Einstein-Podolsky-Rosen-Bohm gedankenexperiment. References 1. Y. Tsuchiya, E. Inuzuka, T. Kurono and M. Hosoda, Advances in Electronics and Electron Physics 64A, 21 (1985). 2. A. Tonomura, J. Endo, T. Matsuda, T. Kawasaki and H. Ezawa, Am. J. Phys. 57, 117 (1989). 3. M. Minozzo, Proceedings of the 4th World Congress of the Bernoulli Society, Vienna, 1996, p. 332. 4. A. N. Shiryayev, Probability (Springer, Berlin, 1996). 5. M. Minozzo, in The Foundations of Quantum Mechanics: Historical Analysis and Open Questions, C. Garola and A. Rossi eds. (World Scientific, Singapore, 2000).

WHY THE REACTIVITY OF THE ELEMENTS IS A RELATIONAL PROPERTY, AND WHY IT MATTERS VALERIA MOSINI Dipartimento di Chimica, 'La Sapienza', and Centre for the Philosophy of Natural and Social Sciences,

LSE

In this paper I discuss the role of relational properties, first detected in quantum mechanics, in other branches of science, most specifically, in chemistry. The widening of the domain of existence of relational properties calls for a revision of the notion of realism, which can, and should, be modelled on that formulated by Henry Margenau back in the 1950s.

1. Introduction In 1950 Henry Margenau, professor of physics at Yale and one of the founders of Philosophy of Science, published a book entitled The nature of physical reality. The aim of the book was to offer a way out of the long-standing controversy on the interpretation of science that opposed realists to empiricists, the former seeing science as an approximately true account of reality, the latter as a convenient, albeit possibly fictional, means to describe and predict the phenomena. The disagreement between the two camps on the interpretation of science was grounded on an even deeper disagreement about the subject matter of science: an objective, mind-independent, physical reality for the realist, a subjective collection of sense data for the empiricist. Accordingly, the realist saw the aim of science as consisting in the one-to-one mapping between scientific concepts and observational statements, while the empiricists refused to make claims that went beyond observations. Realists and empiricists, however, agreed on one thing: the existence of a sharp dividing line between the observer and what is observed. Margenau claimed that the assumption of separability between the observer ('spectator') and what is observed ('spectacle'), that he dubbed 'spectatorial doctrine', is a misleading oversimplification because the experience of reality is an inextricable mixture of external stimuli that act on our senses while being acted upon by our intellect. Hence the elements of reality are not wholly external to, but partly constructed by, us, according to metaphysical principles that are not fixed, but revisable, as shown by the several conceptual re-shuffling recorded by the history of science, which acted as turning points in

260

261

our world-view. Margenau claimed that the spectatorial doctrine is one such principle, and that evidence gathered mainly, but not exclusively, in the microscopic domain, suggested to abandon it, showing that quantum mechanics, far from being an anomaly within physics, represented the "culmination of methods long present in natural science" (Margenau: 1950, v). Notwithstanding Margenau's prestige in the Academic world, his (1950) book failed to impress his contemporaries and, in fact, met with a rather cold reaction. In the British Journal for the Philosophy of Science the book was described as framed within "an antiquated and obscure theory of knowledge", and Margenau was accused of being over-preoccupied "with the pseudo-problem of Reality" (Hutton: 1951, p. 81); in the Philosophical Review the book was called "an ill-conceived project", expressed "in confused language" by an author who displayed "a rather immature philosophical technique", and, therefore, achieved "scant results" (Smart: 1951, p. 411-13); even Philosophy of Science (of which Margenau was one of the editors) was luke-warm (Werkmeister: 1951). Accordingly, Margenau's position went largely unnoticed, and the controversy that opposed realists to empiricists has been re-iterated in exactly the same terms as it had been stated before the publication of the book (see, for instance, Smart: 1968, and Boyd: 1973 for the realists, and Hesse: 1967, and Laudan: 1984 for the empiricists). In this paper I defend Margenau's position by showing that the argument he used against the spectatorial doctrine-the existence of relational properties, whether between things, or between things and the spectator, applies also outside the boundaries of quantum mechanics, for instance, in chemistry. I also claim that the initial reaction to Margenau's book was utterly inadequate, as shown by the fact that subsequent developments in quantum mechanics-most notably the discovery of entanglement-brought the question of relational properties to prominence (see, for instance, Teller: 1986 and Priest: 1989). Hence the negative reaction to the publication of The nature of physical reality was, in all likelihood, to be attributed to the philosophical community of the mid-nineteenth Century not being prepared to comprehend and appreciate an epistemology that was much ahead of its time. 2. Margenau on the nature of physical reality Margenau's discussion of what physical reality amounts to began with considerations of what-to his mind-it does not: reality is not identical with truth, given that reality is a material notion, truth a formal notion that can only be established within a conventional framework. Margenau also denied that reality is a set of facts given by sensory experience, and, denouncing the "lack of

262

analytical depth peculiar to perceptions" (Margenau: 1950, p. 103), warned against relying solely on the deliverances of the senses, irrespective of how solid the evidence they provide appears to be. Following Schlipp (1935), he criticised the empiricists' account of reality as the no-further-analysable set of data given by perception, claiming that the passage from sensation to thought is gradual, continuous, and, above all, complex, and would be best described as an "imaginative supplementation". Against Kant's dualism of noumena and phenomena, Margenau claimed that reality and the experience of it are one and the same thing, which is brought about in a single event, and represents a "unique adventure". To sum up, for Margenau, cognition is neither imposed by perception, as the empiricists claimed, nor pre-determined by the categories of the mind, and, therefore, inescapable, as Kant claimed. However, cognition is not free, but constrained by metaphysical principles or "correspondence rules" (see also Margenau: 1935). These rules have no content of their own, as shown by the fact that they cannot be properly stated except by reference to the terms they relate to; moreover, their range of applicability cannot be determined a priori, and each rule becomes clear, albeit in a circular way, in its habitual application. Margenau noted that the practical invariability with which the cognitive processes is performed confers on constructs objectivity and the semblance of uniqueness. However, when even the simplest act of cognition, reification, is considered in the scientific domain, it looses the deceptively immediate character it has in the domain of everyday experience: the abstraction embedded in the rules of correspondence becomes evident, and so does the fact that different degrees of abstraction are required, for instance, to interpret colour in terms of wavelength, to posit electric and magnetic fields, or to construct the wavefunction of microscopic entities. The difficulty in finding "common elements between the various rules of correspondence, for instance, between the "tree rule", the "wavelength rule", and the "electron rule", led many positivists to deny their presence altogether" (Margenau: 1950, p. 81). The lack of common elements between the various rules of correspondence, however, is counterbalanced by an important similarity: the fact that all scientific constructs satisfy the same requirements of stability, fertility, consistency, and empirical verifiability, whatever the rule implied to arrive at them. Constructs that have been corroborated by testing may be labelled "verifacts"; however, their ontological status is not due to "their referring to something real: on the contrary, they denote something real because they have been found valid" (Margenau: 1950, p. 292). Accordingly, the state function "is a property of an electron just as truly as the blue colour is a property of the sky" (Margenau: 1950, p. 68). Margenau pointed out that the history of science showed that even metaphysical

263

principles that appeared definitive have, at times, been abandoned, and that this happened because the principle in question led to shaping scientific constructs that did not satisfy one or more of the requirements cited above. "In view of such evidence" nobody can maintain "that the method now in vogue has any likelihood of being ultimate" (Margenau: 1950, p. 76), and it should be accepted that the metaphysical principles currently used might have to be amended or abandoned in the future, and the spectatorial doctrine is one such principle. This is because, however glorified by the success of Newtonian mechanics it was, the spectatorial doctrine started to face difficulties in the nineteenth century, when classical electrodynamics described the interaction between two charged particles as the effect of the field generated by one particle on the other. In so doing, electrodynamics impressed "upon interacting charges an artificial subjectobject distinction... forbidding all questions on the fate of the field-producing charge, it surrounded the subject charge with an impenetrable barrier to understanding" (Margenau: 1950, p. 37). The case of classical electrodynamics showed that the terms subject and object and, by extension, spectator and spectacle, are used in a conventional, not in an absolute, sense. Quantum mechanics proved even more detrimental for the spectatorial doctrine, because of the uncertainty in the value of the observables of quantum systems: "the space-time-matter complex degenerated into probabilities for perception. The spectacle has begun to involve the spectator" (Margenau: 1950, p. 46). To sum up, on Margenau's account, the physics of the nineteenth and twentieth centuries showed that assuming sharp separability between things, and explaining phenomena as resulting from the action of a 'subject' on an 'object', whether both things of external experience-as in electrodynamics-or mind and matter-as in quantum mechanics, is misleading. On his account, the distinction between subject and object is always arbitrary: his criticism of the spectatorial doctrine refers to the question of the nature of the properties of things, not to the question of a possible role of consciousness in determining the outcome of events. 3. Things, and their properties The spectatorial doctrine is a generalisation of the assumption of separability between things, which is, in turn, predicated on the assumption that things are fully characterised by intrinsic properties. These are properties that are always displayed independently of anything else, on the basis of which things can be defined, whether as particulars, or as universals. Within philosophical discourse, the assumption that things are fully characterised by intrinsic properties has been

264

present from the start: it underpinned the Presocratics' search for the substance(s) from which the Cosmos originated, made Plato's Forms eternal and changeless (Ross: 1953), and provided the minimal conditions for Aristotle's essentialism (Owens: 1951). Given Plato's and Aristotle's overwhelming influences throughout the Middle Ages, it is no surprise that the assumption in question became a pillar of scholastic philosophy, to be later subsumed in the great epistemologies of the Modern Times. It was not until the 1930s that Victor Lenzen, a philosophically minded physicist, challenged the view that the properties of things are intrinsic, suggesting that, if some properties appear to be always displayed, this is because the conditions for them to be manifested obtain as part of the natural order. So it is, for instance, of weight, colour, size, and shape, because of the existence of gravity and light on earth, and of human beings being sighted (Lenzen: 1931). Lenzen's own account of properties was that some are "given", some just "possible" (in the sense of not being directly accessible), and this is due to the fact that properties result from "unions of particular qualities, complexities and relationships" (Lenzen: 1931, p. 16), which may or may not obtain. Lenzen's point had important epistemological consequences. This is because, if things are defined only on the basis of their given properties, the related concepts are fixed; this option provides "an essential feature of the classical concept of substance....the assumption of the self-determination and independence of substance" (Lenzen: 1931, p. 285). If, in fact, things are defined on the basis of possible, as well as given, properties, the related concepts "admit of transformation" and reflect the "observed fact that the physical characters (of things) are inter-dependent" (Lenzen: 1931, p. 15). Margenau begun his discussion of the nature of properties by quoting, and endorsing, Lenzen's position. He ackowledged that the first phase of the cognitive process, reification, assumes that things are "carriers of observable properties" (Margenau: 1950, p. 172), but stressed that this assumption actually goes against the reality of cognition, where "properties are postulated first and somehow settle upon a construct" (Margenau: 1950, p. 173). Nonetheless, as far as everyday practice is concerned, no harm is done by assigning objects intrinsic properties whose numerical value can be uniquely assessed. The same cannot be said, however, of the properties of microscopic objects, which "take on different values on different occasions and are yet in another sense unique" (Margenau: 1950, p. 175). These properties should be seen "not so much as attributes of the system but as entities determined by the physical operation performed on it" (Margenau: 1950, p. 343). Hence Margenau suggested replacing the notion of property with that of observable, and to distinguish "possessed observables", which have unique value, from "latent observables", which scatter when

265

repeatedly observed while having a determinate probability distribution. Notably, not all observables of microscopic things are latent: charge and mass, for instance, are not; they are often regarded as parameters rather than as proper observables. Moreover, there exist states of quantum systems-called eigenstatesin which the latent observables of microscopic objects assume sharp values as possessed observables: hence, the classical and the quantum domains cannot be sharply separated from one another on the basis of things displaying, respectively, possessed or latent observales. This fact, together with the realisation that quantum properties come into being as a consequence of the operations perfomed on the system, suggests that "the modern physicist can no longer countenance simple realism" (Margenau: 1950, p. 343), and that a more elaborate epistemology should be elaborated. Lenzen's and Margenau's discussions of the nature of properties did not receive attention in philosophical quarters, and the idea that things are endowed with intrinsic properties survived unchallenged and unaltered even the introduction of the notion of dispositional properties (see Carnap: 1936, and Goodman: 1955), which have been construed as intrinsic properties with a special relationship to subjunctive conditionals (Mackie: 1962), irrespective of whether they are attributed a non-dispositional basis (Armstrong: 1968) or not (Mellor: 1974). In scientific quarters, however, the introduction of relativity theory showed that space and time, previously thought of as intrinsic properties, independent of one another, are, in fact, interrelated, and must be jointly specified (see, for instance, Sklar: 1974). Furthermore, developments in quantum mechanics, particularly the discovery of entanglement,1 showed that the observables of quantum systems do not represent intrinsic properties and should, in fact, be viewed as relations. As I spell out in the next section, also the most important chemical property, the reactivity of the elements, is a relational propert; this realisation is not without epistemological consequences. 4. Chemical reactivity: intrinsic or relational property? The most important chemical property is the reactivity of the elements, which, even since the mid-1860s, came to be denoted by the term valency (Russell: 1971). In discussing what kind of property valency is, I refer to the theoretical framework provided by a textbook of general chemistry. 1

The fact that correlations between the observables of two separate quantum systems previously joined together exist, such that latent observables of one system become possessed observables because of measurements performed on the other, spatio-temporally separated, system (Aspect etal.: 1982).

266 a) Chemical valency is neither an intrinsic nor a relative property. I start by considering the elements belonging to Group 1 of the Periodic Table, the alkali metals: lithium, sodium, potassium, rubidium, and cesium. The behaviour of the elements of the same Group is expected to vary slightly and progressively along the Group. Upon combination with oxygen, the elements of the first Group are expected to form compounds by the generic formula Me20 (as in the case of Lithium), and Me202 (in the case of sodium, potassium, and rubidium). Notably, the metals Me are monovalent both in the compounds of generic formula Me20 and Me202. However, potassium, rubidium, and cesium also form the so-called superoxides of generic formula Me02 where the metals Me are tetravalent. Hence, with the exception of lithium and sodium, the elements of Group 1 display variables valencies. The same can be said of all other elements, and is particularly evident in the case of the halogens, which, with the exception of fluorine (always monovalent), form compounds of as many generic formulae as HX, X 2 , HXO, HX0 2 , HX0 3 , HXO4; in the first three compounds the element X is monovalent, and, respectively, tetra-, penta-, and epta-valent in the following ones. The fact that most elements display variables valencies shows that valency is not an intrinsic property. Before discussing if it is a relational property, I need to consider the possibility that valency be a relative property, namely a property defined with respect to a reference point.2 To explain why a given element displays variable valencies upon reaction with the same element it is necessary to introduce the notions of oxidation number and that of redox reaction. The notion of oxidation number rests on the assumption that, when two elements of different electronegativity are bound together, the bonding electrons are fully transferred, as it was, to the more electronegative element. (Hence, the oxidation number of homopolar compounds is zero.) The acquisition/loss of electrons upon reaction on the part of an element is expressed by the oxidation number bearing the sign minus when the elements acquire, the sign plus when they loose, electrons. When the elements that take part in a reaction change their oxidation number (as in all cases listed above), they are said to undergo a redox reaction. The factor that determines whether a given redox reaction occurs or not is the difference in the redox potential of the reactants. Assuming two reactants only, as in the case of the metals of Group 1 and oxygen, the element that displays the higher redox potential is reduced while the other one is oxidised. Let us assume that the reactions take place at 273° K and 1 atm, these conditions being identified as 'standard' in chemistry textbooks As in the case of younger or brighter, for instance, which reduce to intrinsic properties, such as age and refractive index.

267

(see, for instance, Lide: 2000). The fact that some elements assume variable oxidation numbers when reacting with the very same element is explained by taking into account that, although the standard redox potential of each element Eo is fixed, the redox potential in non standard conditions E is not. It varies with the temperature T and with the relative concentrations of the reactants [r] and of the products [p] according to Nerst's equation: nF

[r]

(where n is the number of moles and F is Faraday's constant). As Nerst equation shows, in non-standard conditions, which represent almost the totality of chemical reactions, the redox potential of an element may well differ from its value in 'standard' conditions. This fact, in turn, may alter the energetic balance towards the formation of products that, on the basis of the values of the standard potentials only, should not be formed. In other words, in non-standard conditions, the redox potential depends upon parameters, temperature, and concentration of the reactants and products, which do not belong to the reacting elements themselves. This consideration explains why the reactivity of the elements is not a relative property. To address the problem of why not all the elements belonging to the same Group display the same valencies, it is necessary to abandon the approach followed so far, and bring in some simple considerations from quantum chemistry. b) Chemical valency is a relational property. The valence-bond theory represented the first application of quantum mechanics to the question of the chemical bond (Heitler and London: 1927). Like Lewis' theory, it considered the chemical bond as a localised interaction consisting in the sharing of two electrons between two atoms and, when applied to heteroatomic and unsaturated compounds, it faced problems. In the first case, it created ambiguity as to the extent in which the bond in question was to be regarded as covalent or as ionic, in the second case, it gave incorrect predictions of the number of isomeric substitution products to be expected. By contrast, another theory borne out of the application of quantum mechanics to the question of the chemical bond devised at the time, the molecular-orbital theory (Hund: 1927), gave unproblematic results when applied to heteroatomic and unsaturated compounds and to those with unpaired electrons (see, for instance, Mulliken: 1932). This state of affairs brought about the gradual replacement of the valencebond theory with the molecular-orbital theory, a transition that was almost completed by the 1950s. Hence, in this paper, I refer to the question of the chemical bond in the terms set by the molecular-orbital theory.

268 Recall that molecular orbitals are represented as linear combinations of atomic orbitals: V= CA
(1) an

where and CA and eg are the coefficients of the atomic orbitals (pA d
269

principal quantum number n are grouped together in the same shell. The electrons that take part in chemical reactions are those that occupy the outer shell, and they are called valence electrons; it is to these electrons that reactivity considerations apply. The molecular-orbital theory may be seen as an extension of the Aufbau process and differs from it in that it feeds electrons to molecular, rather than to atomic, orbitals. Restricting the discussion, for the sake of simplicity, to homonuclear diatomic molecules of generic formula A-B, with A being indistinguishable from B, the combination of two atomic orbitals results in two molecular orbitals, which may be written as follows: \|/=N±[(p A (2s)±(pB(2s)];

(2)

the energy of the orbitals is given by the secular equation: [y/Yiy/dT E(ifr)=-L

(3)

W2dT

and this has two solutions, E + and E", the former having higher, the latter lower, energy than the original pair of degenerate atomic orbitals. In the ground state of the molecule, the lower energy orbital, occupied by the bonding electrons, provides the bonding molecular-orbital; the higher energy orbital remains unoccupied and provides the antibonding molecular orbital. The energy difference between the initial pair of degenerate atomic orbitals and the bonding molecular orbital explains the stability of the molecule; its value is taken to correspond to the energy associated with the chemical bond(s) that have been formed. Consider now a heteroatomic diatomic molecule, such as, for instance, lithium hydride; this compound results from the combination of elements of respective electron configuration ls 2 2s (lithium) and Is (hydrogen), and is expected to display electron configuration la 2 2a 2 . 3 In fact, theoretical calculations show that the compound's inner shell is virtually identical with that in the free lithium atom and that the bonding molecular orbital 2a is given by the following expression: 2a = 0.323(p2sLi + 0.231(p2pLi+0-685(pH

(Ransil: 1960).

(4)

In other words, the 2a molecular orbital, which is expected to be simply a linear combination of the singly occupied valency orbital 92sLi a n d 9H> actually contains an appreciable amount of 2p character. Molecular orbitals that arise The symbols c? and n, are used in analogy with the atomic orbitals s and p, to designate, respectively, molecular orbitals that are unchanged by rotation around the molecular axis, and molecular orbitals that, having positive and negative lobes separated by a single nodal plane, change sign on rotation by half a turn.

270

from combining atomic orbitals of different symmetry are called hybrid orbitals; the evidence in their favour is overwhelming, and comes from x-ray diffraction and other spectroscopic methods, which provide information on molecular shapes, and from thermochemical data which provide information on bond energies. Notably, hybrid orbitals result from the combination of atomic orbitals of different symmetry; the functions that describe them are stationary solutions for the Schroedinger equation of the isolated atoms with non-fixed angular momentum, and, as such, they do not represent possible orbitals for the atoms in question. In other words, hybrid orbitals require that atoms interact to come about: this fact settles the question of the reactivity of the elements being a relational property. 5. Relational properties and scientific realism Let us take stock, and recall that, when the theory of valency was first formulated, the realisation that most elements display different valencies in their different compounds puzzled the majority of the chemists. A famous case was that of Kekule, who maintained that, as a "fundamental property of the atom", valency should "be as constant and invariable as the atomic weight itself (Kekule: 1864, p. 510). In the last decades of the nineteenth century, however, the number of elements that were found to display different valencies in their different compounds grew so much that the idea of variable valencies became broadly accepted. It is important to stress that such acceptance came on the basis of evidence alone, and with no theoretical justification. This had to wait until the 1930s, when the formulation of the molecular-orbital theory (Hund: 1927) turned the previously bewildering idea of variable valencies into the distinctive trait of chemistry: "it is the existence of mutual effects between pairs of atoms that gives chemistry its intrinsic interest" (Coulson: 1952, p. 11). The chemistry case discussed here shows that the range of relational properties extends outside the domain of quantum mechanics, and in a way that involves no observer dependency, as some (see, for instance, Everett: 1957) claimed to be the case in the collapse of the wavefunction upon measurement. In this respect, the chemistry case may be said to resemble the electrodynamics case, in which the subject/object distinction did not involve a role for consiousness, and was introduced to have, on the one side, the 'subject'-particle creating the field, and, on the other side, the 'object'-particles affected by the field. The case of entanglement is similar to the chemistry and electrodynamics cases in its revealing properties whose relational character is strictly between things and involves no role for consciousness. Given the relational character of the

271 properties of chemical elements, charged particles, and entangled quantum systems, the definition of those entities is context dependent. This consideration calls for a revision of the definition of scientific realism away from the naive picture of science providing the faithful mirror image of an immutable reality consisting in things endowed with intrinsic properties, towards a picture of science accounting for different interactions, which emerge in different contexts, and are bound to change as science, and technology, advance. Notably, a view of this kind has nothing to do with social constructivism, which holds that experimental practices and natural phenomena are "bound together, so that assessment of the one is ultimately an assessment of the other" (Pickering: 1984, p. 113), and that the choice of experimental, as well as theoretical, practices, far from being 'dictated' by the phenomena, is mainly driven by opportunities for further experimental, or theoretical, investigations. On the contrary, as the chemistry case discussed here, the electrodynamics case, and entanglement in quantum mechanics show, the existence of relational properties is dictated by the phenomena and, therefore, is perfectly compatible with realism, albeit not with naive realism. The existence of relational properties points to a dynamic picture of reality, one that the progress of scientific knowledge modifies. Interestingly, versions of realism that emphasise the role of relational properties in natural phenomena, and the provisional character of the scientific description of reality have been advanced, over the last twenty years or so, from within quantum mechanics (Teller: 1986, and Priest: 1989), chemistry (Ramsey: 2000), and the whole of science (Giere: 1999). The re-appearence of the salient features of Margenau's philosophy of science-the relational character of properties, the revisable status of the metaphysical principles that inform science-almost half a century after its formulation, and in positions that make no reference to The nature of physical reality, suggests that those positions were reached independently of Margeanau. This fact should be taken as providing independent confirmation for Margenau's version of realism, which was characterised by its regarding quantum mechanics as the catalyst for conceptual and epistemological revisions, and for a great deal of re-thinking, in the whole of science. Ackowledgments I am grateful to Ron Giere for helpful discussion. References 1. A. Aspect and G. Roger, Physical Review Letters 48, 91 (1982).

272

2. D. M. Armstrong, A materialist theory of the mind (Routledge, London, 1968). 3. R. Boyd, (1973), Nous 7, 1 (1973). 4. R. Carnap, Philosophy of science 3, 420 (1936). 5. C. A. Coulson, Valency (Oxford University Press, Oxford, 1952). 6. H. Everett, Reviews of Modern Physics 29, 454 (1957). 7. R. Giere, Science without laws (University of Chicago Press, Chicago, 1999). 8. N. Goodman, Fact, Fiction, and Forecast (Bobbs Merrill C , Indianapolis, 1955). 9. W. Heitler and F. London, Zeitschriftfur physik 44, 475 (1927). 10. M. Hesse, The Encyclopedia of Philosophy 4, 404, P. Edwards ed. (McMillan, New York, 1967). 11. Hund, Zeitschrift fur physik 40, 724 (1927). 12. E. H. Hutton, in British Journal for the Philosophy of Science 2, 81 (195152). 13. F. A. Kekule, Comptes Rendus 58, 510 (1864). 14. J. Kim, Philosophical Studies 41, 51 (1982). 15. L. Laudan, in Science and Reality, J. Cushing et al. eds., (Notre Dame University Press, Notre Dame, 1984). 16. V. F. Lenzen, The nature ofphysical theory (J. Wiley, New York, 1931). 17. D. R. Lide, Handbook of chemistry and physics (Boca Raton, London, 2000). 18. J. L. Mackie, Truth, probability, and paradox (Clarendon Press, Oxford, 1972). 19. H. Margenau, Philosophy of Science 2, 48 and 164 (1935). 20. H. Margenau, The nature of physical reality (Mac Graw Hill, New York, 1950). 21. H. Mellor, Philosophical Review 83, 157 (1974). 22. R. S. Mulliken, Physical Review 40, 45 (1932). 23. J. Owens, The doctrine of being in Aristotle's metaphysics (Pontificial Institute of Mediaeval Studies, Toronto, 1951). 24. A. Pickering, Studies in History and Philosophy of Science 15, 85 (1984). 25. G. Priest, British Journal for the Philosophy of Science 40, 29 (1989). 26. R. J. Puddephatt, The Periodic Table of the elements (Clarendon Press, Oxford, 1973). 27. G. L. Ramsey, in Of minds and molecules, N. Bushan and S. Rosenfeld eds. (Oxford University press, Oxford, 2000). 28. B. J. Ransil, Review of modern physics 32, 239 (1960). 29. W. D. Ross, Plato's theory of ideas (Clarendon Press, Oxford, 1951). 30. C. A. Russell, The history of valency (Leicester University Press, Leicester, 1971).

273

31. P. A. Schilpp, Philosophy of Science 2, 128, (1935). 32. L. Sklar, Space, Time, and Spacetime (University of California Press, Berkley, 1974). 33. J. J. C. Smart, The Philosophical Review 60, 411 (1951). 34. J. J. C. Smart, Between science and philosophy (Random House, New York, 1968). 35. P. Teller, British Journal for the Philosophy of Science 37, 71 (1986). 36. W. H. Werkmeister, Philosophy of Science 18, 183 (1951).

D E T E C T I N G N O N COMPATIBLE P R O P E R T I E S I N DOUBLE-SLIT E X P E R I M E N T W I T H O U T E R A S U R E

G. NISTICO Dipartimento di Matematica, Universita della Calabria Via P. Bucci 30b - 87036 Rende (CS) Italy Istituto Nazionale di Fisica Nucleare, Italy E-mail: [email protected] In this work we show that in double-slit experiment properties incompatible with the Which Slit property can be detected without erasing the knowledge of which slit each particle passes through.

PACS numbers: 03.65.Ca, 03.65.Db, 03.65.Ta 1. Introduction In the ideal double-slit experiment proposed by Englert, Scully and Walther (ESW), 1 - 3 the detection of which slit each particle passes through is performed together with the measurement of the point of impact on the final screen. As expected, no interference appears on the final screen. A different set-up of ESW experiment makes it possible the detection of another property of the system, incompatible with the Which Slit property, again with the final impact point; but in so doing interference is restored, so that the knowledge of which slit the particle passes through is definitively lost: this phenomenon has been called erasure. In this work we face the problem of finding properties incompatible with the Which Slit property, whose detection does not erase which slit knowledge and it is performed together with the measurement of the final impact point. We begin in section 2 by establishing the necessary theoretical apparatus. The problem at issue is expressed in theoretical terms as problem (V). In section 3 we present an ideal experiment which makes it possible, in a particular state vector \I>, the detection of a property incompatible with the Which Slit property without erasure and without correlation with this last property.

274

275

2. Formalization of the problem We consider a physical system which consists of a localizable particle which we describe according to Heisenberg's picture. Let the observable position of the centre-of-mass be represented, at time t, by an operator Q'*' of a suitable Hilbert space Hi. Let our particle be endowed with further degrees of freedom, related to spin or similar, described in a second Hilbert space HH , in such a way that the complete Hilbert space is H = Hi <S> Hn • Let us suppose that the Hamiltonian operator H is essentially independent of the degrees of freedom described in Hn, so that we may assume the ideal case that H = Hi® In, where Hi is a self-adjoint operator oiHi- In general, if M {An) denotes a linear operator of Hi (Hn), by the same symbol without index / (H) we denote the linear operator A — Ai ® In (A = 1/ ® An) acting on the whole space H = Hi® HnThe projection operator identifying the Which Slit (WS) property "the particle passes through slit 1" has the form E = Ei ® In, where Ei is the localization projection operator of Hi which localizes the particle in slit 1 at the time t\ of the crossing of the slits' support. We may assume, without losing generality, that the property "the particle passes through slit 2" is represented by E\ ® In, where E'j = 1/ — Ei. Given any interval A on the final screen, the event "the particle hits A" is represented, like E, by a localization projection operator F(A), but relative to a different time t2 > h, so that in general [F(A),E] ^ 0. Therefore, it is not generally possible to measure the WS property and the final impact point directly.

I I I slit)

slit 2

Figure 1.

FINAL SCREEN -

Which slit detector

276

However, if for a given state vector \& a projection operator of the kind T = 1/ T j exists such that equation T\I> = E^ holds, then it is possible to detect which slit each particle hitting the final screen passed through. Indeed, since [T, E] = 0, the joint event T A E can be measured, being represented by the projection operator TE = ET. This allows us to compute the conditional probabilities p(T \ E) and p(E | T) by means of the formulas n(T P{

I Fs; '

=

P(TAE) p(E)

=

(* 1 TE*) <* | £ * )

'

I ™ _ P(EAT) ^ I ' p(T)

pn(F

_ (* I TEV) ~ (tf | T*> '

Since T * = £ * implies T E ' * = T * = i ? * , we have to conclude that p(T | E) = p(E | T) = 1, so that the occurrence of outcome 1 (resp., 0) for T detects the passage of the particle through slit 1 (resp. 2). 4 Furthermore, such a WS detection by means of T can be performed together with the measure of the final impact point because [T, F(A)] = 0. Indeed, though [F(A),E] ^ 0, the condition H = Hi <S>ln ensures that F(A) must have the form F(A) = F/(A) ln, so that [T, F(A)] = 0. In this sense we qualify projection T as WS detector, without entering the debate about the causes of the loss of interference. 5-10 E x a m p l e 1. In the thought experiment of ESW the physical system consists of an atom in a long lived excited state, e.g. rubidium in state 63^3/2, whose centre-of-mass position is described in Hilbert space Hi. The further degrees of freedom, described in Hu, concern with a pair of cavities 1 and 2, placed as shown in fig. 1. The cavities are resonators for the electromagnetic field, tuned at a microwave frequency in such a way that whenever the excited atom enters cavity 1 or 2, it decays emitting a photon. The event "a photon is revealed in cavity 1 (resp., 2)" is represented by a projection operator Tn = |1)(1| (resp., T'n = |2)(2|) of Hn- In this experimental situation the complete state vector of the particle is \& = Tm[i>-\. ® |1) + ^2 ® |2)], where ipi,ip2 € Hi are state vectors respectively localized in slit 1 and 2 when the particle crosses the two-slit support, i.e. Eii>\ = ipi, Eifo = 0. A WS detector is represented by the projection operatorT = l j ® | l ) ( l | ; indeed (l/|l)(l|)# = (E&ln)*, i.e. T * = EX, trivially holds. The possibility of this kind of detection is not exclusive of the WS property. We can introduce the following more general definition. Definition 1.

A projection operator Y of H is called a detector of a

277

property G = Gj ljj with respect to the state vector \& if (i) [Y, F(A)] = 0, VA (ii) [Y, G) = 0 and F * = Gtf. A measurement of Y detects G in exactly the same way a measurement of the WS detector T detects the WS property E. E x a m p l e 2. In their thought experiment, ESW found (with respect to the state vector \I> of example 1) a detector YESW = 1/ | + ) ( + | of another property GESW ~ IV'+XV'+I ® Iff, where ip+ = l/v^C^i + ^2) and |+) = (l/\/2)(|l) + |2)). Conditions (i) and (ii) in definition 1 are trivially satisfied. Such a property GESW is incompatible with the WS property E because [E, GESW] 7^ 0. As shown by ESW, if GESW is detected on each particle, then the distribution of particles for which YBSW = 1 exhibits interference (fig. 2) on the final screen, and this forbids to assign each particle with the slit it passed through: we detect GESW, but the WS property is erased. 1 ' 2 Distribution of particles for which Y= I

I II I Figure 2.

Toinl disti iliDTic m ofp.lllklfs

FINAL SCREEN -

Erasure

In the present work we seek for the possibility of detecting a property G = Gj 0 l j incompatible with the WS property E, but without erasing WS knowledge provided by a WS detector T. This possibility can be realized if with respect to the same state vector \& there exist both a WS detector T of E and a detector Y of G; moreover, it must be required that [Y, T] = 0, so that Y and T can be measured together, yielding simultaneous detection of E and G. Condition [Y, -F(A)] = 0 will be automatically satisfied if Y has the form Y = 1/
278

(V) Given the WS property E = Ej ® In, we have to find: a projection operator Gi ofHj, a projection operatorTn of Tin, a projection operatorYn of Tin, and a state vector \1/ g Hi ®Hn, such that the following conditions hold: (C.l) [E,G}^0,i.e[Ei,GI}^0I, (C.2) [T,Y] = 0,i.e[Tn,Yn] = 0,r, (C.3) T * = EV, (C.4) Y$ = GV, (C.5) 0 ^ £ # ^ # , 0 ^ Gtf ^ * . Condition (C.5) excludes solutions of (C.1)-(C4) corresponding to the noninteresting case that \I> is eigenvector of E or G. 3. A 'gedanken experiment' solution In Ref. 12 it is proved that if dim(Hi) = 2, then no solution of (V) exists. In Ref. 13 a solution of (V) is given with dim(Hn) = 2 and dim(Hi) = 4. Such a solution is characterized by a direct correlation between the nondisturbing detections of E and G. This trivial character turns out to be shared by every solution of (V), if dim(Hi) < 4, independently of the dimension of Tin-12 Therefore, for non-correlated solutions we have to look at double-slit experiments in which dim(Hj) > 6 (odd dimensions are excluded to deal with symmetrical slits only). The following ideal experiment shows that it is sufficient to take just dim(Hi) = 6 to find non-correlated solutions. Our ideal apparatus exploits the same physical principles used by ESW to design their thought experiment. Therefore, the system is the excited atom of example 1, which can travel towards the slits. For each slit we choose 3 non-overlapping regions, up (u), centre (c) and down (d) which decompose that slit (see fig. 3). The further degrees of freedom, described by means of Hu, concern with four (rather than 2 as in example 1) micromaser cavities A, B, C, D, located as in fig. 3. By An we denote the projection operator of HE representing the event "a photon is revealed in cavity A". Actually, we can define four projections An, Bn, Cn, Dn so associated to cavities A, B, C, D, and we shall denote their respective eigenvectors relative to eigenvalue 1 by \a), \(3), \j) and |<5). In this experimental situation there is a correlation between the presence of the emitted photon in one of the cavities and the passage through the corresponding region (cavity A with region u U c of slit 1, and so on). To describe these

279 correlations the state vector of the entire system must be * = - L {(Vtf + W)\a) + tf\{3) + W + r2)\S) + V2d|7)} ,

(1)

where ij)f, i/jf and ipf are normalized state vectors of Hi respectively localized in region u, c and d of slit i. These six vectors form an orthonormal set. Then we take the Hilbert space Hi for describing the centre of mass of the atom as the space generated by them. The correlations can be exploited to define a WS detector as T = 1/ ® (An + Bn), which satisfies T$> = E^. Our problem is now to find a property G = Gi ® 1 n incompatible with E,

1

Figure 3.

Distribution of particles for which Y—l

I I =-

Total tistribution of pariicles

c

Ideal apparatus for detecting both E and G

which can be detected by means of a detector Y — 1/ ® (An + Ca), without renouncing to WS knowledge provided by T. We take G/ as the projection operator whose matrix representation with respect to the orthonormal basis

W.Vf.Vtf.V'M,^} of Hi is

Gi

1 4

1 4

0

1 4

1 4

0

1 4

1 4

0

1 4

1 4

0

0

0

1

0

0

0

1 4

1 4

0

1 4

1 4

1 4

1 4

0

1 4

1 4

0

0

280 We notice t h a t a Hilbert space Hi of dimension 4 would be sufficient to describe the state vector ^ in (1), but without the six dimensions obtained by splitting the regions in front cavities A and C we could not describe the operator G / . By straightforward calculations 1 2 it can be verified t h a t all conditions ( C . 1 ) - ( C 5 ) are satisfied. Therefore, from the knowledge of which cavity the photon is revealed in, we can infer b o t h which slit the a t o m comes from and whether it possesses either property G or G' = 1 — G, according t o the following scheme. Cavity A B C D

=>• slit 1 and G =>• slit 1 and G' => slit 2 and G => slit 2 and G'

T h u s , the detection of property G is attained without erasing W S knowledge. No correlation between the non-disturbing detections of E and G occur in this thought experiment. Indeed, none of the equations as Y ^ = T\&, y * = YT$, 7 ' * = T * , T * = TY$>, ..., describing correlation holds.

References 1. M. O. Scully, B.-G. Englert and H. Walther, Nature 351, 111 (1991). 2. M. O. Scully and H. Walther, Phys. Rev. A39, 5229 (1989). 3. B.-G. Englert, J. Schwinger and M. O. Scully, in New frontiers in quantum electrodynamics and quantum optics, A. O. Barut ed. (Plenum, New York, 1990). 4. G. Nistico and M. C. Romania, J. Math. Phys. 35, 4534 (1994). 5. E. P. Storey, S. M. Tan, M. J. Collett and D. F. Walls, Nature 367, 626 (1994). 6. B.-G. Englert, M.O. Scully and H. Walther, Nature 375, 367 (1995). 7. E. P. Storey, S. M. Tan, M. J. Collett and D. F. Walls, Nature 375, 368 (1995). 8. H. M. Wiseman and F. E. Harrison, Nature 377, 584 (1995). 9. U. Mohrhoff, Am.J.Phys. 64, 1468 (1996). 10. B.-G. Englert, M. O. Scully and H. Walther, Am. J. Phys. 67, 325 (1999). 11. N. Bohr, in Albert Einstein: Philosopher-Scientist, P. A. Schilpp ed. (Library of Living Philosophers, Evanston, 1949). 12. G. Nistico, ArXiv, quant-ph/0409092 (2004). 13. G. Nistico and A. Sestito, J. Mod. Opt. 51, 1063 (2004).

IF YOU CAN MANIPULATE THEM, MUST THEY BE REAL? The epistemological role of instruments in nanotechnological research ALBERTA REBAGLIA Ingegneria dell'Informazione,

Politecnico di Torino, Corso Duca degli Abruzzi 24, Torino, Italia

«So far as I'm concerned, if you can spray them then they are real» (Hacking, 1983, p.23). This statement embodies a well-known key point in Ian Hacking's contemporary reading of scientific realism: scientific instruments assume a fundamental role in characterizing the ontological scenario to believe in. This paper focuses on the challenges of nanotechnology to this standpoint. Scanning tunnelling microscopy, as opposed to traditional microscopy (from optical to electron microscope), is not an imaging but a "touching and rearranging" technique. It requires a deep appraisal of epistemological ideas such as "representing" and "intervening", "knowing" "natural" entities and "creating" "artificial" ones.

1. Introduction Richard Feynman -in his now famous lecture, There is Plenty of Room at the Bottom. An Invitation to Enter a New Field of Physics, delivered on December 29, 1959 at a meeting of the American Physical Society at the California Institute of Technology [1] - offered the perspective of exciting new discoveries if one would be able to fabricate materials and devices at the atomic scale, where all phenomena are believed to be explained in quantum mechanical terms. The invention of the scanning tunnelling microscope (STM) in 1981 by Gerd Binnig and Heinrich Rohrer at IBM's Research Laboratory in Zurich [2] a has been an epoch-making event enabling researchers to directly observe atoms and to build structures to the nanoscale (one-billionth of a metre). As a tool for atomic image observation, the STM addresses the basic philosophical problem concerning the status of scientific theories and of experimental evidence. Discussing this issue constitutes the first aim of this paper. To achieve this purpose, we refer to the philosophy concerned with the

Binnig and Rohrer were awarded the Nobel Prize for Physics in 1986 for their invention of the scanning tunnelling microscope.

281

282 practices involved in the technical applications of science. Principally, the central features Ian Hacking proposed for entity realism will be appraised. As a tool for manipulating individual atoms, the STM also deals with a further momentous problem: what characterises its ability to operate in the nanoregion is the unavoidable requirement of a bridge joining the classical (macro) and quantum (sub-micro) scenarios. Shrinkage of device working volume till to the nanometre scale is not only a process of dimensional ultra-miniaturisation, but requires as well a deep change in our understanding of its operational principles due to the rise of significant quantum effects. In this sense the STM device opens a remarkable new world where distinguishing human-made objects from natural entities could become a problematic task. This topic will be taken into account in the final part of the paper. 2. In search of a demarcation between given experience and mental creation In his influential book Representing and Intervening [3] Ian Hacking claims «Experimenting on an entity does not commit you to believing that it exists. Only manipulating an entity, in order to experiment on something else, need do that» (p.429, emphasis added). Undoubtedly, this thesis epitomises the Canadian philosopher's most peculiar and intriguing point of view. Experimental activities, according to Hacking, indeed provide evidence for scientific realism; but not in the usual expected way, which traditionally rests on their task of testing hypotheses about physical entities. The so-called "new philosophy of science" has advanced a long list of epistemological arguments in order to disprove this mostly accepted opinion: a point of view largely debated in the XXth centuryb. According to Ludwig Wittgenstein's Tractatus [5], scientific theories, such as any other set of linguistic statements, can only describe some "state of affairs" but they cannot represent their very correspondence to reality. The notorious distinction between what language could say and what it might show is the basis of Wittgenstein's claim that there is no causal link delineating an outer reality: «The law of causality is not a law but the form of a law» (6.32). In physics we have laws of the causality form and we could construct the network of scientific statements out of figures of this particular (causal) kind -as well as of any other b

The "new philosophy of science" is the conceptual horizon to which belongs also Hacking's academic life, being him the editor of Scientific Revolutions, a well known collection in which Feyerabend, Kuhn, Popper, together with Putnam, Laudan, Shapere and the same Hacking, debate such critical items as incommensurability between scientific paradigms or scientific realism [4].

283

kind. We are justified in saying that a «causal» network can describe nature, but this statement asserts nothing about the world. Hacking takes the most epistemically strong conception that adequately moves within the boundaries of Wittgenstein's view: there is no reason to believe that any theoretical term depicts "the essence of the world". After all, phlogiston, caloric and ether took part in successful research programs. But if we can reliably manipulate such theoretical entities as convenient tools, this "shows" their reality. As stated by Hacking, what convinces us of the existence of physical entities does not rely on the explanatory merits of the theory that describes their dynamics. Neither does a coherent response we could obtain from our scientific instruments, since observational sentences are seriously infected by theory: as Norwood Hanson underlines, observation is theory laden [6]. According to Wittgenstein and the "new" philosophers of science, as Hacking emphasises, experimenting on entities could not be used to learn their real, objective properties; nevertheless experimenting with entities provides grounds for belief in their existence. In his intervention-based realism the key point is that entities, which in principle cannot be observed, nonetheless can be manipulated. 2.1. Hacking's scientific entity realism. PEGGYII Hacking's chief example involves electrons. Their manipulability, he emphasises, is a criterion for believing in their existence, in spite of the fact that experimental practices don't give us a theory-independent access to "real" features of those entities. Scientists are persuaded that electrons must be real since well-defined, stable and repeatable causal properties, distinguishing those particles in a very direct and tangible way, can be successfully employed to achieve other results. «We are completely convinced of the reality of electrons when we regularly set to build -and often enough succeed in building- new kinds of devices that use various well understood causal properties of electrons to interfere in other more hypothetical parts of nature» [3], p.433. PEGGY II is the peculiar apparatus that Hacking takes as an illustration of his point of view. It is a laser gun, built at the Stanford Linear Accelerator Centre in the late 1970s, used to produce a beam of circularly polarised light, which is directed on a gallium arsenide crystal. When light impinges on it, the crystal emits a large number of linearly polarised electrons, which can be employed as tools to check parity violation in weak neutral currents. Parity expresses the relation between the directions of the spin-vector of an electron and its travelling path in the beam. Parity violation was found in weak charged current and the proposed investigation hoped to find similar violation in

284

neutral ones. Scientists may disagree with the outcomes of the experiment and the distinctive traits of some element involved, such as the boson particle that is assumed to carry the weak forces, may be considered controversial; however, the existence of electrons, which are employed as tools in the experiment, in Hacking's opinion, is hardly contestable. Using directly unobservable entities such electrons as instruments implies that we confer the highest possible degree of belief in their existence: if they were not real, it would be quite amazing to pretend to use them reliably. 2.2. The Scanning Tunnelling Microscope According to Hacking's argument, we are precluded from being realists about entities that cannot yet be manipulated and exploited to investigate other entities. With regard to this conceptual approach, it is of scarce relevance that the STM -the revolutionary device developed by IBM researchers that allows for the first time to image and rearrange structures at the nanoscale- gives scientists the way to observe (and not only to detach) small particles at the atomic, and sub-atomic, level0. The really significant fact is that the tunnelling of electrons through the vacuum (as predicted by quantum mechanics) is centrally involved in the STM, as a tool to be employed in producing an image on the computer screen giving a rendition of the topology of a given surface with sub-micrometer resolution. We may ask ourselves if, in spite of the peculiar challenges that "seeing" on a nanoscale may present, we can legitimately consider the STM a new exemplary case that Hacking could assume for substantiating his "entity realism". First proposed by George Gamow in 1928, the tunnel effect is one of the most spectacular, paradoxical results due to quantum mechanics. According to quantum laws, low energy subatomic particles can tunnel through a potential barrier even if its actual gap, in classical mechanics, prevents them from jumping over: they simply disappear and then reappear on the other side. The tunnel effect explains phenomena like alpha decay: the alpha particle of the element radium can move itself away from the atomic nucleus and through the outside of the nucleus, so appearing in a place where energy cannot move it. Philosophical thinking from Wittgenstein to Hacking states that applying a theory in order to explain the behaviour of an entity does not provide us with knowledge about its

c

In 1993, Donald M. Eigler, at IBM, using the STM tried to move several dozen cobalt atoms on a copper surface into an ellipse-shaped ring (major axis of 20 nanometers, minor axis of 10 nanometers) and found that the electrons in the nanoscale range waved with an intensity just like that on the surface of water, directly demonstrating that they produce a wave pattern as predicted by quantum mechanics.

285

reality. The eigenvalues obtained by solving the relevant Schrodinger equation give us an estimate of the probability that particles might pass through a potential wall and base their strange conduct on the double nature (particlewave) of any quantum entity. But we can infer their reality neither by this elusive property nor by experimenting on the tunnel effect using laboratory practices detached from our quantum theoretical knowledge. Nevertheless, inducing and controlling quantum tunnelling phenomena provides technical applications such as the STM. A probe tip, which ideally is atomically sharp, is moved over a surface, a small voltage differential having been placed between the two parts. The potential barrier -isolating probe from surface- should prevent any electron flow, unless a tunnel effect occurs. The tunnelling electrons give rise to a small electrical current, whose intensity can be measured with great precision. Quantum theory predicts that the tunnelling current is very sensitive to the tip/surface distance, being proportional to the inverse of the distance squared. If one scans the tip across the surface, the distance will vary and so will the current: a feedback control of the tip's location allows measuring this distance. «Typically, a topographic image is produced by running the tip back and forth over the sample surface such that, by means of an electronic feedback loop, the tip is moved up or down to keep the tunnelling current -and consequently the tip's distance above the surface- at a constant value», as David Baird and Ashley Shew explain (Probing the History of Scanning Tunnelling Microscopy, in [7], p.5). The correlation between tip position and current intensity can thus be used to produce the image of the sample surface. «One can compare STM to Braille reading or the way the tumblers in a lock 'read' the key shape. STM relies on the phenomenon of electron tunnelling to image surfaces* (ibidem). It becomes possible to manipulate electrons in a way rather uncommon with respect to basic quantum mechanical laws, and to use them as tools to achieve topographic information about the surface we are "seeing" by means of the STM. Following Hacking's claim that some entities are rightly believed to exist when they have attained the status of experimental devices, we can say that exploiting the motion of a wave packet, which can pass through a potential barrier, implies an acceptance of its reality. This application of Hacking's view to the STM allows us to assign the same ontological status to macro-objects and particle-wave entities. This is an embarrassing way to characterize "reality", at least for those scientific realists who possess a notion of reality like that criticized by Paul Feyerabend: «Those of them who pay attention to the results of anthropologists and classical scholars may admit that immaterial entities did

286 appear and that Gods did make themselves felt; they may admit that there are divine phenomena. But, they add, such phenomena are not what they seem to be. They are 'illusions' and, therefore, do not count as indicators of reality» [8], p.246. Using the change in the intensity of the tunnelling current as a tool to obtain images with a resolving power of one atom prevents realists from asserting that there is a significant difference between theoretical entities (such as the wave function of the electron) and observable ones (like subatomic particles). And this conclusion is not reached from the realistic point of view, according to which some theoretical entities become visible, but from the instrumental line of reasoning, which stresses that a reliable exploitation of something implies supposing it is real. When we use, as a tool for "quasi-visual" inspection, the finite probability that the wave function of an electron tunnels through the potential barrier it encounters, according to Hacking we must be completely convinced of the reality of the wave function of an electron truly spread out (not just hidden or unmeasured). The wave function can no longer be regarded as describing «divine» phenomena that «do not count as an indicator of reality», according to Feyerabend's quotation. The "real world" is incredibly rich; it consists of countless kinds of things, depending on the different ways in which reality constructs its qualities responding to the plurality of our inquiries. 3. In search of a demarcation between given experience and technological manipulation The STM gives a still deeper challenge to the idea of human beings answering to an independent authority called Nature (a point of view strictly related even to the instrumentalist position). The key problem is to distinguish "real entities" from artefacts of the observational procedures. Hacking's argument depends on a thesis about experimental practices that endorse the notion of laboratory investigation as attempts to create physical effects, instead of passively observing natural phenomena. According to Hacking, we see with a microscope. Our confidence in the reality of what we see does not settle on the image itself but is a consequence of the various ways that we interact and interfere (via some causal links well defined by theories) with the specimen so observed. «Don't just peer, interfere*, he urges in [9], p.308. However, thinking of scientific explanations as dependent upon "intervening" instead of relying on "representing" does not prevent us from separating attempts to learn how things really are and efforts to design and assemble artificial entities. Underlining, in Hacking's words, that we do not see

287

through a microscope but we see with it should in any way maintain the difference between two quite distinct kinds of "intervention" on nature. The first is intervening as arranging a suitable laboratory experiment to isolate some factual effect, taking advantage of our ever increasing technical ability to use some physical entity as a tool. The second is intervening as the practical and tangible result of any theoretical investigation enabling to conceive and design artificial devices whose performances seem only limited by the need of avoiding what physical laws forbid. Hacking devotes all his attention to experimentation as the use of something understood in nature in order to prove something that is not. Consequently, for him experimentation considered as the use of something understood in nature in order to design a new artificial device does not seem so philosophically crucial. Even if experimenting on micro-entities cannot directly establish their existence and nature, we are now gaining the means to construct them directly. And if we can construct them, they are real! Clearly the carpenter is sure that the table he has made is "real", and the same holds for the nanotechnologist manipulating atoms into new arrangements. To directly detect something is equivalent to getting information about its characteristics by observations and instruments, and both are theory laden. On the contrary, we know so completely the properties of what we ourselves have made that we can be sure even of the causal links that relate the final products to the procedures and tools we used to obtain it. This is the conceptual background from which Hacking's considerations start: experimental apparatus is nothing else than a peculiar kind of man-made equipmentd. 3.1. The Scanning Tunnelling Microscope as a tool for manipulating individual atoms To push resolution down to the microscale of the individual atom could seem a chimera due to the uncertainty principle, a fundamental tenet of quantum mechanics. In accord with Heisenberg's equations, no optical reflection microscope -however accurate- can be useful to observe an atom, since we have to illuminate it with a very short wavelength beam which induces unpredictable d

Hacking's position about the thesis advanced by those authors that regard natural sciences and their experimental apparatus as social constructs is chiefly depicted in [10]: «I try to make sense of the claim that something can be both real and a social construction*, p.68 For the purposes of this paper, it seems worthy of note to underline that «Kant was the great pioneer of construction*, [10], p.41, and that a deep historical background to the movement of constructivism relies on the Gianbattista Vico's thesis (we better understand what we ourselves have made).

288

shifts of the particle, so that no decision about its position can be assumed with certainty. This is true for free atoms, but does not apply for the case of atoms embedded in a solid. The photon sent to determine the position of an individual atom might nudge it, according to the uncertainty principle, but if it belongs to an organised structure the neighbouring atoms will push it back into place. Thus to observe and even manipulate individual atoms in a piece of material seems possible in principle. Feynman was the first to underline, in his famous talk, that -in spite of its counterintuitiveness- manipulating atoms (that are typically a few tenths of a nanometre in size) does not need new physics. «I am not inventing anti-gravity, which is possible someday only if the laws are not what we think. I am telling you what could be done if the laws are what we think; we are not doing it simply because we haven't yet gotten around to it» [1], p.4. Nevertheless, we cannot use traditional microscopes to "see" atoms, as a consequence of several defects in their structure and optics, which further degrade the theoretical limit of their resolving power (set by the wavelength of the illuminating light). In the visible region of the electromagnetic spectrum performances are well beyond what would be required. Scientists and technicians interested in increasing resolution in observing and studying material surfaces have introduced and developed several devices which adopt, for target illumination, beams of energised particles (like electrons or ions), aiming to shorter and shorter wavelengths. Basically, all these instruments derive from Braggs' fundamental work [11]. We can characterize these instruments as "model based", since what they detect (a beam deflection angle, averaged on a relatively large area) is used to identify some unknown average parameter (e.g. lattice constant) of a theoretical model (crystal arrangement) whose general structures are supposed in line with our previous knowledge. The desired information about the surface structure of the crystalline material observed is indirectly deduced from the identified model. The STM is a quite different kind of device in many ways. Using the vacuum tunnelling of electrons to study the surfaces of materials is quite different from using a particle beam and look for its behaviour after deflection: it involves moving a tip over a surface to obtain local, direct topographic information about it. To produce an image of the topography of a surface using a tactile practice can be considered as a perceptual ability untill we contemplate macro-dimensions. But the image produced on a computer screen by running the scanning tip back and forth over the surface in order to keep the tunnelling current is the result of active manipulation, in a manner consistent with quantum mechanics. The quantum explanation of the tunnelling process, developed in 1983, shows that it is not just a question of "feeling" the topography of the

289 underlying surface, but rather a result of the overlap -with the greatest proximity- of electron orbitals of the upper atom in the sharp tip and of some other atoms on the sample surface. In the STM case, we cannot touch without interacting. We cannot observe without interacting. 3.2. Shaping the world atom by atom What we could simply define as "seeing with a scanning tunnelling microscope" is not appropriately described as "gathering any kind of experimental evidence". The STM allows us to manipulate unobservable entities, that opens significant new perspectives, not included in the mere interference with unobservable entities to get observable results. In his talk, Feynman already envisaged a time when atoms could be rearranged to order6. The STM is an essential research tool in nanotechnology to pursue this goal. The tunnelling current used has enough energy to manipulate atomsf. D. M. Eigler and E. K. Schweizer, IBM researchers, carried out an experiment in which the STM was used to position 35 individual xenon atoms on the surface of a low-temperature single crystal of nickel to spell out the letters I B M s . Besides creating in this odd way their company's logo, they -as other research groups- have created "artificial" molecules, an atom at a time. Using nanotechnologies, scientists are abandoning the traditional assumption that they understand and explain a nature that is simply given. Instead, they embrace the project of remodelling or transforming it. In the wake of Feynman's lecture -a defining moment for nanoscience- they are taking a "bottom-up" approach to experimental research rather than the traditional "topdown" approach, which involves successive miniaturisation of macrooperations'1.

e

f

g h

According to Feynman, «it would be, in principle, possible (I think) for a physicist to synthesize any chemical substance that the chemist writes down. Give the order and die physicist syndiesizes it. How? Put the atoms down where the chemist says, and so you make die substance* [1], p.]2 By expanding the principle of the STM, Binnig and his group developed also the Atomic Force Microscope in which the atomic force between the probe and die sample surface is used in place of the tunnelling current. The now well known image was first published in 1990 in die journal Nature «Nanotechnology should be recognized as a basic technology common to all atoms, bits and genomes (materials, data and genetic engineering) as it may result in die convergence of traditional top-down technology (miniaturization) with the newly developed bottom-up technology. The development of new properties incorporated in nanoscale structures -as well as functional materials and devices through the bottom-up approach- is now swiftly beginning to take shape and is no longer just a dream of the future. In short, a paradigm shift in advanced technologies through nanotechnology is steadly developing* [12], p.2.

290 Controlling matter at atomic or molecular levels means tailoring the fundamental properties belonging to phenomena. To build electronic devices using atom-by-atom engineering signifies manipulating and keeping stable the interaction between atoms and molecules; it means to explore the same scale dimension at which all natural material and systems establish their foundation. This new methodological paradigm makes it clear that the usual divisions between "basic science" and "engineering" are no longer applicable. Also, in principle, distinguishing physical effects from artefacts becomes quite ambiguous1. For example, a great challenge for nano-manufacturing technologies that will support tailor-made products having functionally critical nanometre scale dimensions is to be shaped using self-assembly. Manipulation of nano-structures using the STM requires a very long time, so the ultimate solution seems to be self-assembly, the most fundamental process for forming a functional and living structure'. Conceptually, moving from small to larger size and creating new matter by combining atom with atom, one atom at a time, molecule to molecule is an imitation of natural processes. To mime nature is the ability that has marked several technical developments, so we might think that pursuing this goal is really nothing new. Nevertheless, science-based technology is a result of scientific abstraction and symbolism; it is far from being largely perceptionbased and it makes it possible to control nature on a hitherto unimaginable scale and even to take its place in governing phenomena. So, in exploring bottom-up, self-organising and self-assembling routes, nanoscience implies radical change in the philosophical analysis of what we consider technological and what we define ' To keep a valid criterion of demarcation between natural and artificial for what concerns nanotechnologies turns out to be a quite problematic undertaking. As Gregor Schiemann admits: «Building upon the narrow sense, I proposed an epistemic criterion according to which an object is natural if it is impossible -using all available scientific methods at a given time- to ascertain that it was produced by human action. This criterion makes it possible to distinguish analogously to the Turing-test of artificial intelligence- between natural and artificial components of most nanotechnological processes and products. Given the multifariousness of the relationship between nanotechnology and nature, there are cases where it becomes problematic to distinguish between the two. I assume, however, that these cases are exceptions. Nanotechnological objects are mostly hybrids of nature and art; only in a few cases would they be said to be wholly natural because their artificial origin could no longer be confirmed.», Nanotechnology and Nature. On Two Criteria for Understanding Their Relationship, in [ 14] pp.77-96. Many interesting suggestions about the provocative challenges nanotechnology addresses to the philosophy of science and philosophy of technology can be found in the Special Issues on Nanotech Challenges jointly published by Techne and Hyle (ref. [13] and [14]). ' Recent progresses in this field is reviewed in [15]. It has been calculated that, if a device has a feature size of 5 nanometers and a scanning tip can move 10 atoms per second, it will take about 6 months to build 1012 devices on an 8-inch wafer.

291 natural. Artefacts produced by nanotechnologies and natural objects articulate analogous dynamics; they can be regarded as parts of a structurally identical whole. 4. Conclusion The breakthrough that technology provided with the development of the STM is closely connected with the dissolution of two conceptual guidelines that seem to be essential in our everyday thinking. The use of the tunnel effect as a tool for inspection purposes introduces quantum wave function and superposition of possible position states of a particle as elements of physical reality (undoubtedly, a quite disturbing intrusion). Moreover, to improve technological applications at the nanoscale with the aid of the STM, in order to realise very special processes of "material forming and machining", means to mobilise for this aim uncommon but effective self-assembly phenomena. And since this emergence of order at the atomic level applies both to natural dynamics (e.g. in molecular physics) and to small systems created employing nanotechnologies as well, the difference between natural and man-made products fades away. Due to both these points, troublesome problems arise in the philosophical foundations of nanoscale research and applications. It becomes difficult to provide bases for maintaining a difference between experimenting on entities (properly interpreting the way in which STM interacts with the sample) and constructing some artefact (manipulating the same elementary building blocks that nature employs and leaving them to self-organise and reproduce). Upsetting questions, which venture to open further puzzling queries in the never-ending dilemma about what "reality" is. Acknowledgments I am extremely grateful to Thomas Nickles for his improving comments and suggestions, as well as to Riccardo Zich for his useful remarks. References 1. 2. 3. 4.

R. Feynman, "There is Plenty of Room at the Bottom: An Invitation to Enter a New Field of Physics", Engineering and Science, February (1960). G. Binnig, H. Rohrer, "Scanning tunnelling microscopy", IBM Journal of Research and Development, 30, 355-369 (1986). I. Hacking, Representing and Intervening (Cambridge University Press, 1983). I. Hacking ed., Scientific Revolutions (Oxford University Press, 1981).

292

5. 6. 7.

8. 9. 10. 11. 12. 13. 14. 15.

L. Wittgenstein, Tractatus Logico-Philosophicus (Routledge & Kegan Paul, London 1922,1955). N. R. Hanson, Patterns of Discovery: An Inquiry into the Conceptual Foundations of Science (Cambridge University Press, 1958). D. Baird, A. Nordmann, J. Schummer and A. E. Schwarz, eds., Discovering the Nanoscale, International Conference (Darmstadt Technical University 2003). P.K. Feyerabend, Conquest of Abundance (University of Chicago Press, 2000). I. Hacking, "Do we see through a microscope?", Pacific Philosophical Quarterly 62, 305-322 (1981). I. Hacking, The Social Construction of What? (Harvard University Press, 2000). W. H. Bragg, Concerning the nature of things; six lectures delivered at the Royal Institution (G. Bell, London, 1925). N. Ikezawa, "Nanotechnology: Encounters of Atoms, Bits and Genomes", Nomura Research Institute Papers 37, December 1 (2001). "Nanotech Challenges", Techne Special Issue Part I, 8 (2), Winter, 2004; Part II, 8 (3), forthcoming. "Nanotech Challenges", Hyle Special Issue Part I, 10 (2) (2004); Part II, 11 (1) (2005). Z. L. Wang, "Self-assembled nanoarchitectures of polar nanobelts/ nanowires", Journal of Materials Chemistry 15,1021-1024 (2005).

MATHEMATICAL MODELS AND PHYSICAL REALITY FROM CLASSICAL TO QUANTUM PHYSICS ARCANGELO ROSSI* Dipartimento di Fisica dell'Universita di Lecce, Via per Arnesano, I - 73100, Lecce, Italy The concept of physical object in twentieth century's physical theories (relativity and quanta) turns from intuitive-substantial to formal-functional, that is to a relatively invariable properties system well beyond the intuitive properties "carrier" typical of classical physics (CP). Anyway, the autonomous heuristic fecundity of this formal mathematical model neither was fully shown before the twentieth century nor, moreover, is independent of the properties it correlates. In quantum mechanics (QM) the turn to such purely functional correlation, which is nevertheless irreducible to mere information related to measurements, is historically evidenced by the fact that the formalization of the theory as an operator calculus in Hilbert space came before its physical interpretation. Dirac, moreover, asserted the irreducibility of quantum properties to mere measurements, just because this simply pragmatic reduction is poor and ambiguous in front of a mathematical formulation of linguistically expressible (even though not always experimentally decidable) properties. Neither a falsificationist interpretation of scientific theories, which makes them empirically decidable even though irreducible to mere instrumental theories, fits the purpose. It is then safer, and in better agreement with historical testimony, to admit not only the existence of properties which are measurable and reducible to instrumental operations, or at least falsifiable, but also the existence of properties which are physically inaccessible or empirically untestable, since properties of this kind are described and fruitfully implied by the mathematical model.

1. Introduction The concept of physical object was radically modified through the shift from classical to modern physics (relativity and quantum theories).1 The transformation of the physical object from substance to function which was theorized by Cassirer2 as typical of modern science in general, even since Galileo, was instead historically determined and clearly expressed by the twentieth century's physics scientific revolutions. The fact that this transformation has been identified with the essence of modern science by Cassirer, and even before him, implied its misunderstanding

E-mail: [email protected].

293

294 as an affirmation of vague empiricism and anti-ontological phenomenalism, supposed characteristic of modern science in general in front of ancient and medieval natural philosophy.3 On the other side, as history of modern science cannot be identified with empiricism and phenomenalism in general, even less the transformation from substance to function can be identified with a positivistic stereotype: for, it must be more correctly traced back to the rise of relativity and quantum theories, which were, in any case, neither merely phenomenological nor anti-ontological theories. A more precise definition of that transformation is then required in the light of the effective historical process of scientific change then taking place. Only after the birth of relativity and quantum mechanics (QM) it had been possible to completely reduce the physical system to a non-empirical, mathematical invariant, no longer built up on the base of physical analogies with pre-established models derived from concrete physical objects (as, for example, rigid bodies or oscillating systems), but rather as a functional structure. This structure is obtained by extending the properties of those physical objects themselves, in a generalized way, to new formal connections of functional type, even unconceived before. These are irreducible to intuitive representations (in particular, of geometrical type) of those physical objects themselves whence they have nevertheless been derived in the last instance. The objective reference nuclei of new relativity and quantum laws are in fact no longer traditional intuitive "carriers" of properties, but become functional connections, more or less stable, of those properties. In short, the physical system is no more an object independent of its properties in the new physics: instead it becomes a functional correlation of properties fully expressible in mathematical terms, such as the relativistic invariants of Lorentz transformations or the invariants of Hilbert continuous transformations.1 Of course, the above remarks do not mean that there was no tendency in classical physics (CP) to reduce physical systems to mere mathematical functional connections of properties. Rather, it means that such tendency, though present, was overcome by the opposite prevailing tendency to interpret the physical system by tracing it back to its analogical representations of substantive character, through true processes of identification with concrete reference physical objects. Lagrangian and Hamiltonian formalizations of CP (as later Dirac made explicit by showing that Hamiltonian mechanics was the basis for his formalization of QM4) were no doubt strongly oriented towards a purely functional representation of physical bodies, so meant as pure mathematical correlations of properties (typically, space-time position and quantity of motion): but, in fact, they anyway identified those correlations with intuitive realities

295

irreducible to mere formal correlations, such as material dots, rigid bodies, oscillating systems. The point is that the physical object was then not yet liable to be reduced, notwithstanding existing tendencies in that sense, to a purely formal cluster of properties, no longer appealing, for the development of knowledge, to concrete physical analogies beyond the mathematical representation. The reason of this was the enormous heuristic efficacy of the physical analogy, which appeared, at that moment, irreplaceable. Only afterwards the mathematical model acquired a new efficiency in suggesting new developments of knowledge and succeeded in replacing the heuristic function of the old physical analogy, so appearing as a formalism and language which was nevertheless heuristically fertile.1 2. States and properties The fading away of the heuristic role of substantive physical analogies that previously appeared as a necessary presence in physics does not imply, in modern physics, the end of the distinction between states and properties, which is instead still necessary. In fact, it is not fit, though the tendency is well represented,1 to reduce those terms to one while calling in question substantive analogies and physical models in general, and then the correlated conception of physical object. The mathematical functional connection in which the new concept of physical object or system (the former simply meant as individual sample of the latter) consists is just a function which can change, but owes its relative invariance, stability and nomological legality and objectivity to its constituting elements. These constituting elements are the properties rather than the states of the system itself. Indeed, if properties (such as being red, having a space-time position or a mass, a quantity of motion, a charge) distinguish one from another by their distinct, though correlated definition domains (quite different anyway from their effective measurements), states, according to the standard interpretation of QM, only express the quantity of information which is effectively accessible to us relatively to the properties themselves. Thus, a "pure state" is just meant in QM as a maximum accessible information, which cannot be simply identified with the objective reality of properties as such, as already stressed by Einstein et al. to declare the "incompleteness" of QM.5 Identifying the two terms leads one to renounce that minimal realism of properties which is semantic rather than ontological and does not accept to identify reality (meant as an ensemble of properties each of which can be known even if not all properties, although they are real, can be conjointly known) with its effective knowledge. The latter is indeed limited to the information content, even maximal, but not necessarily exhaustive, that we can actually attain. A property is then what characterizes the system, just meant as a functional, more or less variable

296 nucleus of properties, a state on the contrary expresses the quantity of information we have about the system, that is the set of values that the properties which constitute the system in fact reveal step by step (even if a notion of state in terms of preparations can be introduced which allows one to define physical objects also as acts of preparation6). To renounce the distinction between states and properties means to accept the point of view of standard or "orthodox" interpretation of QM, according to which the object of knowledge is substantially identified with its effective knowledge (following the "philosophy of observables" that Bohr objected to Einstein7). A more realistic and modest view of knowledge counteracts this one in terms of a semantic realism of properties. This, however, also counteracts the ontological view that was dominating in CP, which conceives physical systems as in se or absolute objects, the so called properties "carriers" independent of the properties themselves. These instead, whenever they are not identified with the states of the system, can give much more consistency than the states to the system itself, which is then simply meant as a functional, more or less stable, correlation of properties. 3. Von Neumann's formalization of Quantum Mechanics It follows from Sec. 2 that the turn in the conception of physical system took place in terms, on one side, of irreducibility to elementary atomic objects, according to an idea of system as functional correlation, and, on the other side, of irreducibility of the system's properties to the mere information obtained by measuring them. This fact is evidenced, in particular in QM, by the most important non-relativistic mathematical formulation of the theory. This was formulated by von Neumann as an operator calculus in Hilbert space which completely preceded its physical interpretation.8 According to Jammer,9 it represented, from this point of view, a unique case in history of physics, also in view of the unquestionable success it achieved. The formal structure of the mathematical correlations of the theory came before their physical interpretation and application rather than being drawn out from particular measurements, limited to the information relative to accessible empirical values of the correlated properties. So, to be accepted, the formalization of QM realized by von Neumann had to respect some peculiar formal characters of quantum properties even before waiting for concrete empirical confirmations, as it preceded them. These characters can be identified in particular with the non-commutativity of quantum properties, or with their intrinsically probabilistic character. In any case, they are introduced independently of any intuitive representation of objects constituting their imaginary "carriers" as completely separated individual objects, avoiding,

297 at the same time, to identify them with the effective information (in many cases, as we have seen, irremediably limited) which can be obtained on them by means of measurements.8 True, von Neumann, in agreement with the widely shared "orthodox" interpretation to which he himself contributed, supposed, in the last instance, that the concepts of state and property collapsed, introducing an operational reduction of observable properties to the knowledge of them resumed in the state.8 Notwithstanding this, a consequence was inevitably intrinsic to the mathematical model adopted by him, which contradicted that reduction: the existence of "superselection rules" which put in question the so called "projection postulate", according to which the measurement of an observable property defining the state of the system, by representing the maximum accessible information on it in certain conditions (its eigenvalue), verificationistically also exhausts the meaning of the measured property itself. In any case, those rules were derivable in Hilbert theory well before and independently of von Neumann's controversial physical interpretation.9 Moreover, von Neumann's interpretation left crucial concepts as probability and measurement open to further different interpretations. In particular, the concept of measurement was largely ambiguous and complex, since the tendency to collapse properties with states, thus totally reducing concrete physical properties to mere experimental measurement procedures, appeared in it most evident, extremely controversial and to be still demonstrated in agreement with all the relations expressed by the axiomatically adopted mathematical Hilbert formalism.9 4. Dirac's formalization beyond pragmatism and falsificationism Which was then the effect of the new trend of mathematical modeling in QM in absence of substantive reference models? It was a proliferation, a true panoply of formal models in order to account for quantum properties in conceptually and formally different (though quantitatively coincident or equivalent) ways. Then, different mathematical formal correlations were built which were empirically indistinguishable in the last instance. The above remark is confirmed by the fact that, soon after and independently of von Neumann, Dirac advanced a different, very compact and smart formalization.10 This was anyway criticized by von Neumann for its lack of rigor, in particular because it introduced functions which were considered "improper" and liable to conflicting physical interpretations, as the famous

298 "delta-function" (though afterwards Schwartz reformulated its definition in a rigorous and univocal way with his theory of distributions9). Analogously, after the first more physical and epistemological interpretations of the theory by Heisenberg, Bohr and Schrbdinger, further formulations were given, some of which more operational and formal in style, as Jordan's, Weyl's, Wigner's and Stapp's so called "S-matrix". Other formulations, in terms of "second quantization", or Feynman's "path integrals",9 exalted the role of probability well beyond the previous statistic and probabilistic interpretations, considering any quantum phenomenon as a discrete probabilistic one, even if apparently purely ondulatory and continuous.11 Dirac, however, contrary to von Neumann, strongly affirmed the irreducibility of quantum properties, functionally correlated, to their measurements. More specifically, he upheld the distinction between classical and quantum properties, that he called "c-numbers" and "q-numbers", respectively. The properties of both kinds were unidentifiable with mere measurements: rather, these were characterized by algebraic formal properties, such as the non-commutativity of the latter. Moreover, in Dirac's formalization even more sharply than in von Neumann's, the power of mathematics (just as in the case of the famous "delta-function" mentioned above) made ambiguous, in its strong abstractness, the physical interpretation, as the problem of interpretation could even less be solved through concrete physical analogies and substantive models.12 Moreover, the appeal to instrumentation as a "deus ex machina" of the reduction of properties to states of the system in order to overcome the difficulty (notwithstanding the irreducibility of properties to simple measurements, as seen above), though entrusted not only to simple empirical procedures but also to true instrumental theories and theories of instrumentation, does not look sufficient. We must in fact add other theories beyond instrumental theories and theories on instrumentation to give a physical meaning to phenomena, theories which don't define the system's properties as mere measurements or instrumental interventions. Only by acknowledging this new degree of freedom one can indeed avoid two reductive converging views: pragmatism and falsificationism. Pragmatism is, even in its most rigorous operationalist expressions, contradicted by the fact that instrumental intervention and measurement alone are often poor and ambiguous, at variance with the mathematical formulation, provided that this is linked to specific properties expressed by the formalism through their correlations (anyway not always experimentally decidable, though formally well expressible).

299 Popper's falsificationism, in its turn, appears contradictory. Indeed it admits that there are facts untranslatable into immediate experimental data, consisting in properties expressed by theories which are irreducible to instrumental theories. On the other side, though acknowledging the speculative and creative character of these theories, it also admits that they are in fact in science always empirically decidable (falsifiable) through their experimental consequences.15 As a counterexample, one could quote the magnetic monopoles mathematically introduced by Dirac,16 whose instrumental tests did not succeed in yielding non ambiguous evidence or refutation,17 notwithstanding Popper's belief in the possibility of deciding any question in science on the base of falsificationism. There is in fact within falsificationism a kind of subtle dogmatism, since it attributes the capacity of selecting (deciding on) even the most abstract and speculative scientific theories to a unique decision method based on mere empirical consequences. Thus, we conclude that it would be better, in agreement with historical evidence though contrary to standard or "orthodox" QM, to admit that there are further properties besides those that are measurable, or reducible to experimental operations, or at least falsifiable. To be precise, properties which are described or fruitfully implied by a mathematical model taken as starting point, although not always physically accessible and empirically or instrumentally decidable (such as properties which are not precisely measurable simultaneously to others in QM, or Dirac's magnetic monopoles). In fact, as underlined by Galison,18 it is impossible to say in general when experiments end, because there is no absolutely certain general decision rule in testing physical theories: these are not always strictly decidable or reducible to mere instruments or operations, though they may be empirically and experimentally quite fertile. References 1. S. D'Agostino, Physis 40, 219 (2003). 2. E. Cassirer, Substance and Function and Einstein's Theory of Relativity (Dover, New York, 1953). 3. E. Mach, The Science of Mechanics (Chicago University Press, Chicago, 1893). 4. P. A. M. Dirac, Proc. Royal Soc. A109, 642 (1925). 5. A. Einstein, B. Podolsky and N. Rosen, Physical Review 47, 777 (1935). 6. C. Garola and S. Sozzo, here. 7. N. Bohr, Physical Review 48, 696 (1935). 8. J. von Neumann, Mathematical Foundations of Quantum Mechanics (Princeton University Press, Princeton, N. J., 1955).

300

9. M. Jammer, The Philosophy of Quantum Mechanics (John Wiley & Sons, New York, 1974). 10. P. A. M. Dirac, The Principles of Quantum Mechanics (Clarendon Press, Oxford, 1930). 11. R. P. Feynman, QED. The Strange Theory of Light and Matter (University of California Press, Berkeley, Los Angeles, 1985). 12. O. Darrigol. From c-Numbers to q-Numbers. The Classical Analogy in the History of Quantum Physics (University of California Press, Berkeley, Los Angeles, 1992). 13. S. D' Agostino, here. 14. P. W. Bridgman, The Logic of Modern Physics (The Macmillan Company, New York, 1927). 15. K. R. Popper, The Logic of Scientific Discovery (Basic Books, New York, 1959). 16. P. A. M. Dirac, Proc. Royal Soc. A133, 60 (1931). 17. H. Kragh, Studies Hist. Phil. Sci. 12 (1981). 18. P. Galison, How Experiments End (The University of Chicago Press, Chicago, 1987).

COMPLEX E N T A N G L E M E N T A N D Q U A T E R N I O N I C SEPARABILITY*

G. SCOLARIClt Dipartimento di Fisica dell'Universita di Lecce and INFN, Sezione di Lecce, 1-73100 Lecce, Italy L. S O L O M B R I N O * Dipartimento di Fisica dell'Universita di Lecce and INFN, Sezione di Lecce, 1-73100 Lecce, Italy

We consider the evolution of an entangled state of a simple compound system made up by two spin \ systems both in complex and in quaternionic quantum mechanics. We show, by using a recent remarkable result on quaternionic maps by Kossakowski, that the initial and final matrices associated with a component subsystem can be seen as the complex projections of quaternionic pure states connected by a unitary evolution operator. Furthermore, the state of the compound system looks like a separable state in quaternionic quantum mechanics.

1. Introduction The essential difference in the concept of state in classical and quantum mechanics is clearly pointed out by the phenomenon of entanglement, which may occur whenever the product states of a compound quantum system are superposed. Entangled states play a key role in all controversial features of QM; moreover, the recent developments in quantum information theory have shown that entanglement can be considered a concrete physical resource that it is important to identify, quantify and classify. The usual techniques devised to this end rely on the concept of density matrix. 1,2 The state of a quantum system £ whose state space has finite dimension n can be represented by a n x n quantum density matrix p, or equivalently, by an Hermitian, positive (i.e., all its diagonal matrix 'Partially supported by PRIN "Sintesi". tE-mail: [email protected]. tE-mail: [email protected]

301

302

elements, in any basis, must be nonnegative) operator of trace class (in particular, of unit trace; for the sake of brevity we do not distinguish between operators and their representing matrices in the following). Any external action which changes the state of E can be represented as a mapping As of the state space into itself,3 hence As must be positive (i.e., it must preserve positivity of operators). Yet, positivity is a necessary but not sufficient condition for a given map As to describe a physical process. In the case of open system dynamics, one usually asks for a more stringent condition than positivity, namely complete positivity. Essentially, the requirement that As is completely positive (CP) guarantees that (for any n) the map As I n , where I n is the identity map acting on the states of another n-level system E n , preserves the positivity of all states of the compound system E + E n . The physical argument supporting complete positivity is that one cannot exclude that the system E might have interacted in the past with another n-level system E„. In this case one should consider the two systems together, even though only one of them has a non-trivial evolution described by the map A E , while the other is dynamically inert (note that, if As is not CP, the only states of E + E n that may develop negative eigenvalues under AE ® I n are the entangled states). More generally, the completely positive maps are positive maps satisfying the condition that their tensor multiplication is again positive; they can be characterized as the convex set generated by the maps of the form:2 P^SpS^

(1)

where S is a linear operator and * denotes the Hermitian conjugation. However, unitary evolution of compound systems may lead in standard QM to an evolution of the density matrices associated (via partial trace) with the component subsystems that neither is unitary not described by CP maps. The role and the physical interpretation of the maps which are positive but not CP are still under investigation.4 Some hints in this direction can be provided by a recent result on quaternionic maps. 5 Following the suggestions in Ref. 5, we study in this paper an entangled state of a system made up by two 2-levels subsystems and discuss the evolution of this particular state in standard (complex) quantum mechanics (CQM) as well as in quaternionic quantum mechanics (QQM), where a Hilbert space over the skew-field Q represents the set of states of the physical system. The plan of the paper is the following. Firstly, we collect some basic notations and results on QQM (Sec. 2). Secondly, we recall and quickly

303

illustrate a general result by Kossakowski 5 on quaternionic maps and their complex projections (Sec. 3). Thirdly, we apply these concepts to study a simple situation, considering an entangled state of a physical system made up by two different spin | systems: we show by means of a simple exercise that the evolution of the subsystems in QQM can be suitably described by quaternionic unitary (hence, completely positive) maps, and that the complex (not completely positive) maps acting on the reduced density matrices in CQM are just the complex projections of the former (Sec. 4). Finally, we provide a quaternionic description of the state of the compound system as a separable state (Sec. 5) . 2. Density matrix formalism in quaternionic Hilbert spaces We recall here some basic notations. A (real) quaternion is usually expressed as q = qo + qii + qii + q$k where qt G 1 (I = 0,1,2,3), i2 = j 2 = k2 = - 1 , ij = -ji = k. The quaternion skew-field Q is an algebra of rank 4 over R, non commutative and endowed with an involutory anti-automorphism (conjugation) such that q-+q = qo-qii-

q2j - q$k

In a (right) n-dimensional vector space Q" over Q, every linear operator is associated in a standard way with a n x n matrix acting on the left. Moreover, in analogy with the case of vector spaces over C, one can introduce the concepts of unitarity, Hermiticity and so on. The density matrix p* associated with a pure state | / ) belonging to a right quaternionic Hilbert space ~hfi is defined by

Pf = l/X/l

(2)

and is the same for all (normalized) ray representatives. The definition of the density matrix associated with mixed states is given in a standard way. Denoting by Re TrA the real part of the trace of the linear operator A (notice that the real trace enjoys the cyclic property: 6 KeTrAB = ReTrBA), the expectation value of a quaternion self-adjoint operator A can be expressed in terms of pf as follows7 (A)f = (f\A\f)

= ReTr(A\f)(f\)

= ReTr(APf).

(3)

304

The time evolution equation for p. reads 7 ^pf

= -[H,pf],

(4)

where H is the quaternionic anti-Hermitian Hamiltonian operator. Hence, by using Eq. (4) and the cyclic property, one obtains the time evolution equation for {A)f.

!<">'= R e T r {(w+[iUl)*}=(£+lS-A]),

•

(5)

Moreover, for every linear operator A and density matrix p, let us put A = Ac + jA and p = pc + j'p where Ac, A, pc,7> are complex matrices (hence, Ac and pc are the complex projections of the quaternionic operators A and p, respectively); from Eq. (3) it follows that the expectation value {A)f may depend on i or p only if both A and p~ are different from zero. Indeed, one easily obtains (A)f = Re Tr(Ap) = Re Tr (Acpc ~ A*p)

(6)

where * denotes complex conjugation. As a consequence of Eq. (6) it follows that quaternionic physical effects can be revealed on quantum systems described in a quaternionic Hilbert space only if both the corresponding state and the observable are represented by genuinely quaternionic matrices. On the contrary, if an observable 0 is represented by a complex Hermitian matrix, its expectation values cannot depend on the quaternionic part fp of the state p = pc + j'p ; moreover, the expectation value predicted in the standard (complex) Quantum Mechanics for the state pc coincides in such case with the one predicted in Quaternionic Quantum Mechanics for the state p, since Tr(Opc)

= ReTr(Opc)

=

ReTr(Op).

3. Complex and quaternionic maps In the complex as well as in quaternionic quantum mechanics, the unitary evolution of a pure state p is described by the CP map ACp:p-+p,=UpUi, 2

(7) 2

where U is unitary, so that p = p implies p' = p'. However, as we already observed in the Introduction, if p is a state of a compound system, the evolution of the density matrices associated

305

(via partial trace) with its subsystems is described in CQM by positive but, generally, not CP (hence, not unitary) maps, which only preserve the Hermiticity of the p's. In particular, in the two-dimensional case, it has been proven8 that any positive map must have the form A = A1CP + A2CPoT,

(8)

where T denotes the transposition operation T : P - • PT

and A?CP (I = 1,2) is a CP map. Maps of the form (8) are called decomposable. Furthermore, a remarkable result, due to Kossakowski,5 states that any complex decomposable map (of a complex density matrix) can be seen as the projection of a corresponding quaternionic completely positive map. We can illustrate these results in a special case which generalizes (7), as follows. Let U = R + jS be a quaternionic operator, let p be any complex density matrix and let us associate the quaternionic CP map p —> UpU* to U. By a direct computation one immediately obtains UpU* = RpR) + S*pTST + j(Sptf

- R*PTST),

(9)

Since, trivially, ST = (5*)^, the mapping p -¥ RpRi + S*pTST has just the form (8). Conversely, given the map

p^Rptf

+ SpTSl,

that has the form (8), it can be seen as the complex projection of the quaternionic CP map p -> UpW associated with U = R + jS*. Our example shows that a new physical meaning can be attributed to decomposable maps if an evidence in favour of QQM is provided. In the next section we apply the above results to a simple evolution pattern, obtaining some interesting consequences, even though in a very particular case. 4. Two spin | s y s t e m s in ?£ Q Let us consider a compound system made up by two (different) spin | subsystems, denoted by 1 and 2, in standard QM, let ~HC be the complex Hilbert space of the whole system, and let us suppose that the compound system evolves unitarily as follows,

|+,-)->a|+,-) + 0|-,+>,

306

where a and /3 are complex numbers such that |a| 2 + |/3| 2 = l. The density matrices corresponding to the pure separable state |+, —) and to the pure entangled state a\+,—)+fi\—,+) are given by /0000\ 0100 Pc(0) = 0 0 0 0

Voooo/ and fO 0 Pc® = 0 \0

0 0 0\ \a\2 a{3* 0 0a* \0\2O 0 0 0)

respectively. The density matrices of the subsystems can be obtained by taking the partial traces of pc (0) and pc (t) with respect to the spin variables 2 and 1 respectively:

pS'd) =

$•*•<«>-(?

0

l/5|2

(10)

and PP(O)

=

:i)-^-r:

a •

(11)

Thus, since p[? (t) ^ pc (t) (I — 1,2), we see that the dynamics inherited by each subsystem makes it evolve from a pure to a non pure state. The dynamical evolution of the subsystems cannot be reduced to the dynamics described by a unitary evolution operator on the spaces of the two subsystems. Prom a purely algebraic point of view, unitary maps are indeed congruence transformations (see (7)), and no transformation of this kind (because of the Sylvester theorem 9 ) can connect a semidefinite operator PQ (0) with Pc (£), which is positive definite. Now, let us come to QQM. In order to discuss the same physical system in a quaternionic Hilbert space, we denote by p^> (0) and p® (t) the quaternionic density matrices of the subsystem Z, and we put pW (0) = p^' (0), p(» (f) = pg> (t)+jp{l) (t), where p£}(0) and p^ (t) are given by Eqs. (10) and (11), while pr> (t) is a still unknown complex matrix. We also observe that in l-ft as well as in 'Hc the spin observables are represented by the

307

Hermitian matrices 7

Then, as we already noted in Sec. 2, it follows from Eq. (6) that the expectation values of the spin observables do not depend on the purely quaternionic part of the state. Now, let us suppose that in QQM the evolution is described by the quaternionic unitary operator U:

whose adjoint operator is given by

It is then easy to verify by a straightforward calculation that pU(t)

= tfp(«(0)tft = Uf$Ho)U* = £){t)

+j

( ^ "M)

(14)

and pW(t) = C7p(2)(0)C/t

=

U^(0)W = £){t)+j

(_°^ M )

.

(15)

Recalling Eq. (9), we can conclude that the complex decomposable map Pc (0) —• Pc (*) acting on the density matrix p^.' (0) associated with subsystem I at t = 0 actually is the complex projection of a unitary (hence, CP) quaternionic map. Notice that the (right) eigenvalues of both p^\t) and p^2\t) are 0 and l, 10 ' 11 so that they are semidefinite operators, as one expects because of Eqs. (14) and (15), and of the quaternionic generalization of Sylvester's law. 12 We stress once again that the quaternionic pure states p^ (t) are physically indistinguishable from the complex ones p^(t), as long as we limit ourselves to consider spin variables only. 5. Quaternionic description of the compound system It is well known that in the description of compound systems in quaternionic quantum mechanics the usual definition of Kronecker product of matrices does not hold, and also the standard definition of tensor product of Hilbert

308

spaces cannot be used, owing to the non-commutativity of the skew-field Q (in order to overcome this difficulty, a concept of tensor product of quaternionic Hilbert modules has been proposed, 13 which allows one to describe compound systems on a mathematically well-founded basis; unfortunately, the results obtained in this way do not agree in the complex limit with those of standard quantum mechanics 14 ). Anyway, in the particular case described in Sec. 4 all matrix elements of the quaternionic matrices p^ (t) and U commute, since they belong to C(l, j). This suggests to resort all the same to the usual Kronecker product to describe the compound system. Then, let us calculate the state p(t) = pW (£) <8> p^> (t) and the evolution operator U = U ®U . We obtain:

p(t) =

\aP\2 0 0 M'\ 0 \a 4 -|a/3| 2 0 0 - \a0\2 \Pf 0

M2 0

j m

0

Ml /

(0 |a|2-|/3|20 \ 2 -|a| 0 0 -|a|2 |/3| 2 0 0 |/3| 2

\0

+

2

\a\2-\p\20

(16)

J

and

u=

\P\3 -\<*\jJ " K\P\j 0\3-\<*\j)

/M2 Mi

\a/3\ -N2j \<*P\jl/3|2J V-l/31 2 |a/3|

l/?|2 \ -\a/3\j 1 |2 • — \ot\ J -\<*0\j \a0\ -H2 J \a/3\ \0\2j

(17)

Of course, p(t) = pW(t) ® p(2\t) = (U ® ^)(p^ 1} (0) ® ^ 2 ) (0)) , i.e., the quaternionic density matrix in Eq. (16) has (by construction) the formal structure of a separable state. We are well aware that the above result does not hold in general, since it strongly depends on the form provided by Eq. (13) of the evolution operator U. Actually, it is easy to verify by a direct computation that

309 the most general quaternionic unitary evolution operator U' : p ^ ( 0 ) p(')(t)

and

4 ° ( ° ) ->Pc°(*)

reads

C/' =

j/31113 |a|n 4 y '

where ui, u 2 , U3, U4 are unimodular quaternions satisfying U1U3 + U2W4 = 0. Then, if a different choice is made (for instance u\ = i = —v.2 and 1*3 = j = u 4 ) one obtains that the matrix elements of such operator do not commute any more. Anyway, although we obtained it by an heuristic calculation, Eq. (16) provides a quaternionic density matrix that can be associated with the compound system (in the sense that the complex projections of the partial traces give the correct density matrices of the subsystems) and that it enjoys an interesting property. Indeed, if we consider the partial transpose of p, that is the density matrix pT* (t) obtained from (16) by interchanging the indices referring, say, to the second subsystem:

/M2 0 0

PT2 (*) =

z

\-\ap\ 0

M2\

0 |a/3| 2

a \aP\' l/?|4

0

+ \aPY

l«|2-|/3|20 \ Mo 0 -\a\ l/5|20 0 -|/?|2

/0

3 M

0

)

2

l/?l 2

0

(18)

J

and we solve the corresponding quaternionic eigenvalue problem, 10,11 we see that its eigenvalues are 0 and 1, hence pTs (t) is a positive state. In the realm of complex QM, whenever the subsystems are 2-dimensional, this property is actually a necessary and sufficient condition for a density matrix to describe a separable state. 2 Hence, we get a further argument which supports (together with the formal structure of p{t)) the conjecture that the density matrix (16) actually describes a separable state in QQM. In conclusion, our research has pointed out a puzzling situation, in which the same state of a physical system is entangled in CQM, while it seems to be separable in QQM. We hope that further investigations (on quaternionic

310 maps, and more generally, on QQM) will contribute to throw new light on this problem.

References 1. V. Gorini, A. Kossakowski and E. C. G. Sudarshan, J. Math. Phys. 17, 821 (1976). 2. M. Horodecki, P. Horodecki and R. Horodecki, "Mixed-state entanglement and quantum communication", arXiv: quant-ph/0109124 (2001). Also reprinted in Quantum Information: An Introduction to Basic Theoretical Concepts and Experiments by G. Alber, T. Beth, M. Horodecki, P. Horodecki, R. Horodecki, M. Rotteler, H. Weinfurter, R. Werner and A. Zeilinger (Springer Tracts in Modern Physics, 2001). 3. K. Kraus, Ann. Phys. 64, 311 (1971). 4. V. I. Man'ko, G. Marino, E. C. G. Sudarshan and F. Zaccaria, Phys. Lett. A 327, 353 (2004). 5. A. Kossakowski, Rep.-Math. Phys. 46, 393 (2000). 6. D. Finkelstein, J. M. Jauch, S. Sciminovich and D. Speiser, J. Math. Phys. 4, 136 (1963). 7. S. L. Adler, Quatemionic Quantum Mechanics and Quantum Fields (Oxford University, New York, 1995). 8. S. L. Woronowicz, Comm. Math. Phys. 51, 243 (1976). 9. R. A. Horn and C. R. Johnson, Matrix Analysis (Cambridge Univ. Press, 1985). 10. S. De Leo and G. Scolarici, J. Phys. A 33, 2971 (2000). 11. S. De Leo, G. Scolarici and L. Solombrino, J. Math. Phys. 4 3 , 5815 (2002). 12. F. Zhang, Lin. Alg. Appl. 251, 21 (1997). 13. A. Razon and L. P. Horwitz, Acta Appl. Math. 24, 141 (1991). 14. S. P. Brumby, G. C. Joshi and R. Anderson, Phys. Rev. A 51, 976 (1995).

MACH-ZEHNDER INTERFEROMETER AND QUANTITATIVE COMPLEMENTARITY CARLO TARSITANI Department

of Physics, University of Roma Roma, Italy

'LaSapienza'

FABRIZIO LOGIURATO Department of Physics, University of Trento, 38050 Trento, Italy

Povo

The complementarity principle is often stated as the impossibility to perform an experiment by which both the effects of the wave-like behaviour and the effects of the particle-like behaviour of quantum objects can be simultaneously tested. For instance, in the Mach-Zehnder two-way interferometer, the effect of interference is destroyed whenever we insert a device that allows us to know which of the two paths has been chosen by each photon. Interference effects and "which-way" experiments are mutually exclusive. In the present paper we focus our attention on an ideal situation in order to show the validity of a quantitative representation of complementarity principle, which has been recently introduced. On this basis, we develop a simple conceptual analysis of the intimate connection between complementarity and uncertainty principles.

1

Introduction

According to the most common interpretation of the Bohr's complementarity principle, it is impossible to check out, at the same time and with the same experimental apparatus, both the effects of the wave-like behaviour and the effects of the particle-like behaviour of quantum objects. The experimental conditions that allow us to check the wave properties of an object are incompatible with the experimental conditions that allow us to check its particle properties, and vice versa [1]. For instance, in the well-known double-slit experiment we can set up the experimental apparatus either to observe the interference pattern or to ascertain which of the two slits each object passed through, but we cannot obtain both results with the same experimental arrangement. If we insert a device that lets us know the path of the object, we destroy the interference pattern, that is the most evident effect of the object's wave-like properties. Now, the fact that the interference pattern disappears is often attributed to the unavoidable perturbation of the object's momentum that is introduced by the device by which we obtain the which-path information. From the quantitative point of view, it is commonly stated that the amount of the perturbation cannot 311

312 be smaller of what is predicted by Heisenberg's relations [2-4]. Indeed, also Heisenberg's uncertainty principle is often interpreted in terms of the "uncontrollable" perturbation involved in any measurement process. This point of view seems to be shared also by Feynman in his well-known description of the double-slit experiment with electrons [5]. If we want to ascertain the hole by which each electron passed through, we can put a light source just behind the slits and observe the photons scattered by the electrons. It's easy to verify that, as a consequence of the uncertainty relations, any localization of the electron disturbs their momentum enough to destroy the interference effects. In few words, as Feynman says, "trying to watch the electrons we have changed their motions" [5]. However, some years ago, Scully, Englert, and Walther envisaged a thought experiment whose importance from the conceptual point of view cannot be underestimated. In fact, the disappearance of the wave-like effects in this experiment cannot be explained in terms of momentum's perturbation [6,7]. The authors even say that the uncertainty relations have in general nothing to do with such experiments: interference disappears because the measurement process creates an entangled state for the compound system "measuring apparatus plus measured object". From this point of view, the complementarity principle would be more fundamental than the uncertainty principle. Obviously, even a collimated beam of atoms incident upon a two-slit arrangement will show an interference pattern. Now, for atomic beams, we can use two maser cavities in order to ascertain by which hole each atom is going to pass through. In fact, each atom, if it is prepared in an excited state, on traversing either one of the cavities spontaneously emits a microwave photon by which we can get the "which-way" information. Now, it is possible to show that no net momentum is transferred to the atom during its interaction with the cavity fields. Therefore, it is impossible to call into play the disturbing action of detectors in order to explain the loss of the interference effects. Some experiments that are conceptually equivalent to the two-slit experiment we have just described have been actually performed. For instance, it has been performed an interefometric experiment with photons, where the role of the two slits is has been played by two entrapped atomic ions. In this case, the which-path information is stored in the atoms' orbital state [8]. Rauch's interferometric experiments with neutrons have the same interest. Here, the which-path information is inscribed in the spin state of the neutrons [9]. An optical analogue of such experiments is the following. One can insert along one of the arms of a Mach-Zehnder interferometer a system that rotates the photons' polarization state. If the polarization states of the photons that take different paths are orthogonal to each other, it is possible, by a polarization measurement

313 effected at the end of the interferometer, to go back to the path of each photon. Obviously, the interference effect is lost. The paper by Scully, Englert and Walter - and their claim that complementarity is a more general concept than uncertainty - has given rise to a long-standing debate in the literature. Several authors have attempted to reestablish an equal status for the two principles, by introducing hidden effects that Scully and his collaborators would have neglected and that would cause the requested disturb for the momentum. However, the results of these attempts are uncertain and their interpretation has never been clearly established with a complete agreement [10-13]. The debate has made an actually relevant step forward thanks to a contribution by Englert in which he formulates a quantitative representation of complementarity [14]. The final clarification of the question is due to Bjork and his collaborators [15]. In their paper a proof can be found of the close relationship between the complementarity principle and the uncertainty relations. According to these authors, "which-way" experiments involve observables that are in general neither position nor momentum. By introducing new observables, it is still possible to attribute the loss of interference to a peculiar uncertainty relation. 2

Predictability and visibility in a Mach-Zehnder interferometer

We now describe in detail a typical Mach-Zehnder interferometer experiment (Fig. 1). Let us assume that a beam of "particles", that are initially in the momentum eigenstate |a) is directed towards the beam splitter BS1.

Fig. 1. The Mach-Zehnder interferometer.

The state of each particle is transformed in a superposition of the reflected state and the transmitted state. The mirrors Ml e M2 reflect the particles towards

314

the second beam splitter BS2. The arrangement of the beam splitters and of the mirrors is such that, if the reflection and transmission coefficients of the two beam splitters are equal, all the particles are detected by the upper WWD. Let us describe the process from a quantitative point of view. The unitary evolution of the states | a) and | b), due to the 50-50 beam splitters, is the following:

l«}-*^>.

\b)^^b)

+ i\a)).

(1)

The evolution due to the mirrors is the following: |fc)-Ha>-

|fl)->«|*).

(2)

More in general, let us assume that the transmission and reflection coefficients of the first beam splitter are different from each other. Then, the initial state evolves as follows: \a)^>cosa\a) + is'mc^b).

(3)

Let's put c + = c o s « and c_=sina (obviously c + + c f = l ) . The system's total evolution, using the simplified notations v+=-{c+ + c_)/y2 and v_ = (c+ - c _ ) / V 2 , will be: \a)—> c+\a) + ic_\b) -* ic+\b)- c_\a) —> -^—r=(c++c_)\a)

+ -j=(c+-cj\b)=v+\a)

+ iv_\b).

(4)

Now, following Greenberger and Yasin [16], we introduce the visibility Vas a function of the intensities of the maxima and the minima of the interference pattern: T/ _

max

min

max

min

/C\

It is evident that V can be considered as a measure of the system's wave properties and that the above intensities will be proportional to the probabilities of finding the system either along the path a or the path b. Therefore, V will be: \r __ r m a x

f^min

max

+ Pn rniin

(£L\

From (6) and (4) we can deduce that, for our experimental arrangement, the visibility is: V = v2+ - vt = sin 2a = 2c+c_. (7)

315 Let's now introduce the function predictability P [16]. Indeed, we could perform a measurement in order to discover the path taken by each system, but we have also some chance to guess the result of such a measurement. If c+>c_, the relation that quantifies our capability to predict the right result is the following: P = ^l=cos2a 1/2

= cl-c2. +

(8)

"

Then, it's easy to show that: P2+V2 = \. (9) It's worth noticing that, whenever we can predict with certainty the chosen path (P = 1), the visibility goes to zero (V= 0). Vice versa, if the predictability is zero (P = 0), we get the maximum value for the visibility of the interference pattern (V=l). We can thus interpret the relation (9) as a quantitative expression of the complementarity principle. It's important to underline the fact that for mixed initial states (in our case, for mixtures of the states |a) and |fo)), the more general inequality P2 + V2 < 1 holds [14]. Let's now investigate on the link between the expression (9) and the uncertainty relations. We introduce the Hermitian operator A defined in a two dimensional Hilbert space with orthonormal eigenstates |A+) (eigenvalue A+ = l) and | A - ) (eigenvalue A _ = - l ) . Let's assume that the observable A refers to the path that the system actually chooses immediately after the first beam splitter. The eigenvalue +1 corresponds to the superior path and the eigenvalue - 1 to the inferior one. Then we get: (A) = c2+-c2_. (10) So, the predictability P is equal to the mean value (A) . This equality can also be deduced in the following way: we put |a) = |A+), |£) = | A - ) and A=|A+)(A+|-|A-)(A-|; if we define the state \y/x) as | Y\) - cos o\ a) + i sin «j b), we can calculate ( ^ |A\ %) = P. For the variance /(AA)2V we get: ((AA)2) = ( A 2 ) - ( A ) 2 =\-(c2+-cl)2

= \-P2.

(11)

Now we introduce the operator B with orthonormal eigenstates |.B+) (eigenvalue B+=\) e|fi-) (eigenvalue B_ =-1). In order to connect B with interference and visibility, let's assume that it refers to the path chosen by the system at the end of the interferometer, just beyond the second beam splitter. We

316 assign to the observable the value 1 if the system is detected along a, and the value - 1 if it is detected along b. The mean value of B will be: (B) = vl-vl

(12)

Its variance will be: ((AB)2) = (B2)-(B)2

= 1-(V+2-V2)2 = 1-V2.

(13)

Let's discuss the form of the operator B. In terms of projection operators, it can be written as B = |B +)(fi +\-\B-)(B-1. The observable B refers to a whichpath measurement at the end of the interferometer, that is to the measurement of A when the system is in the state |v 2 ) = v + |a) + iv_|fc). Since the variances that appear in the uncertainty relations must refer to the same state, we can look for an operator B such that (^ 2 |A|^ 2 ) = (^|.e|^;}, (y/2\{AA)2\y/1) = (y/\\(AB)2\y/x). If UR and UBS indicate the evolution operators for the mirrors and for the beam splitters, respectively, we can notice that («\U+RUlsAUBSUR\¥l) = {¥l \B\Wl),

\B±) = U+RU+BS\A±).

Therefore we can represent B in the basis of the eigenvectors of A. Leaving aside constant phase factors, a possible choice is the following: |B+> = (|A+> + i|A->)/V2,

|B-} = (|A+)-i|A-»/V2.

(14)

Notice that A and B are incompatible: if the system is in an eigenstate of A, we have no information about the value of B, and vice versa. So, for the observables A and B, an uncertainty relation must hold. In fact, by multiplying the variances of A and B, we obtain: ((AA) 2 )((AB) 2 ) = (1 - P2)(l - V2) = V2P2.

(15)

We can conclude that there is a close relationship between complementarity and uncertainty principles: both can be expressed in a quantitative form by means of the functions V and P. Moreover it's always possible to define noncommuting operators that refer to the path or to interference, for which the uncertainty relations hold. In general the uncertainty relations do not involve the observable momentum. This is the main difference from what is suggested by Bohr's classical thought experiments3.

a

In (15) the variances are calculated for a state that minimizes their product. For a more detailed analysis of the connection between the Robertson-Heisenberg relations and the visibility and predictability operators, see the paper by Bjork and collaborators [15].

317

3

Distinguishability and visibility in a Mach-Zehnder interferometer

In the above discussion, the measurements of the observables A and B were effected on the same set of identically prepared systems. The variances were calculated starting from separate sharp measurements or of A or of B. Let's now analyze the effect on visibility (on the interference pattern), when the path of each system is known with some uncertainty. In other terms, we imagine to effect simultaneous unsharp measurements of the noncommuting observables for each member of the set [15, 18]. For the sake of simplicity, we assume that also the first beam splitter is 5050. Let's suppose that just behind the first beam splitter there is a device R whose initial state is | z - ) . If the system goes downwards, R keeps its initial state, if the system goes upwards R takes the state | R +), which can be not orthogonal to | z - ) . The entangled evolution of the system and of the device is described as follows: | a ) U - > - ^ | a ) + i|*>j|z->- \y1) = ^(\a)\R+) ->

+ i\b)\z-))^

-^(i\b)\R+)-\a)\z-))->

- W2) = -\\a)([z-) + \R+))-{\b)([z-)-\R+))-

(16)

Let R be represented in a two-dimensional Hilbert space whose basis is

fl*+>.|*-»Let's assume that \R+) can be represented in a simplified form with real and positive coefficients: | R +) = cos S\z+) + sin 6\z-)By measuring the device's state we will get some information about the path of the system. For instance, the system could be a spin 1/2 "particle" whose initial state is | z - ) , and we can imagine that, if it goes upwards, its spin would be rotated by the device. We can assign to the observable A the value +1, if the systems takes the state | z +) , and the value -1, if it takes the state | z - ) . However, if \R+) and | z - ) are not orthogonal, the path will be known with some uncertainty. The probabilities Pa and Pb to find the system, at the end of the interferometer, respectively along the superior path or along the inferior one, will be:

318

P.=(v2\a)(a\y,I)=±(l Pb = {v2\b){b\v2)

+

{z-\R+))=\
+ *™W-

= \(\-{z-\R+))=\(l-™e).

0?) (18)

If the chosen path is perfectly distinguishable ((z-|/?+) = 0), Pa and Pb are equal: the interference is not visible. On the contrary, if ( z - | # + ) = l, the device's state (the system's spin) does not change: we have no information about the path and all the systems are detected along a. The visibility, defined by the relation (6), now is: Vs=\Pa-Pb\ = (z-\R+) = sm6.

(19)

Let's also define a D, in order to quantify the distinguishability of the paths [15]: D = (l-|(z-|fl+>|2//2=n-sin20)1/2.

(20)

So, one can easily demonstrate the following relation: Vs2+D2=l.

(21)

In this case, the detailed analysis of the relationship between simultaneous measurements and uncertainty relations is rather complicated: the reader can find it in the literature [15]. If the first beam splitter is not 50-50, optimal simultaneous measurements can be effected; the product of the uncertainties will be: ((AA) 2 )((AB) 2 ) = ( 1 - W ) 2 . (22) One can see that the product only depends on the state | yfx), on the visibility (7), and on the predictability (8). 4

Conclusions

We have shown how, in the case of a Mach-Zehnder interferometer, it is possible to introduce observable magnitudes, that allow us to define quantitatively the notions of "predictability", that is the capability of predicting which path the object will choose, of "distinguishability", that is the capability of inferring which path the object has chosen after having gone through the interferometer, and the "visibility", that is the capability of recognizing the interference effects. By means of such magnitudes we can obtain a quantitative formulation of the wave-particle dualism and of Bohr's complementarity principle. The operators corresponding to predictability, distinguishability and visibility are incompatible and are linked by inequalities that resemble Heisenberg's uncertainty relations. It

319 is thus possible to give a mathematical expression to the close relationship between the two fundamental principles of quantum mechanics and it is also possible to find a way to verify their equivalence. As shown by Bjork and his collaborators, complementarity is not more fundamental than uncertainty: the two principles can be considered as the two sides of the same coin. A generalized complementarity relation can be formulated for any system defined in a two-dimensional Hilbert space. However, there is no need to express this relation in term of position and momentum observables. The thought experiment by Englert, Scully and Walter - together with the carrying out of its various experimental versions - shows that the effect of uncertainty relations on the interference loss does not always involve position and momentum, but, in general, it involves other noncommuting observables, depending on the actual experimental conditions. References 1. M. Jammer, The Philosophy of Quantum Mechanics (Wiley, New York, 1974). 2. W. Heisenberg, Z. Phys. 43, 172 (1927). 3. W. Heisenberg, The physical principles of the quantum theory, University of Chicago Press (1930). 4. S. Gasiorowicz, Quantum Physics (Wiley, New York, 1974). 5. R. P. Feynman, R. B. Leighton and M. Sands, The Feynman Lectures on Physics, Vol. 3 (Addison-Wesley, Reading, MA, 1989). 6. M. O. Scully, B. G. Englert and H. Walther, Nature 351, 111 (1991). 7. M. O. Scully, H. Walther, Phys. Rev. A39, 5229 (1989). 8. U. Eichmann et al, Phys. Rev. Lett. 70, 2359 (1993). 9. H. Rauch, Cont. Phys., 27, 345 (1986). 10. P. Storey et al, Nature 367, 626 (1994). 11. B. G. Englert, M. O. Scully and H. Walther, Nature 375, 367 (1995). 12. P. Storey et al, Nature 375, 368 (1995). 13. H. Wiseman and F. Harrison, Nature 377, 584 (1995). 14. B. G. Englert, Phys. Rev. Lett. 77, 2154 (1996). 15. G. Bjork et al, Phys. Rev. A60, 1874 (1999). 16. D. M. Greenberger and A. Yasin, Phys. Lett. A128, 391 (1988). 17. J. J. Sakurai, Modern Quantum Mechanics (Addison-Wesley, Reading, MA, 1985). 18. M. G. Raymer, Am. J. Phys. 62, 986 (1994).

ANTONIO GRAMSCFS REFLECTION ON QUANTUM MECHANICS

ISABELLA TASSANI Istituto di Filosofia, University of Urbino Via Saffi 9, Urbino, Italy As the first step of a wider historical reconstruction of the reception of quantum mechanics in the nineteenth-century philosophy, we are going to consider Antonio Gramsci's philosophy. He asks himself about the nature of quantum objects, if their existence depends on the act of measuring by the experimenter and if this kind of relationship can be interpreted as an argument in favour of an immaterialistic philosophy. We will remark how an idealistic interpretation of quantum mechanics found a fertile field in the Italian culture, characterized by an antiscientific attitude and at the same time needing to find in science a term of comparison.

1. Introduction The appearance of quantum mechanics aroused different reactions in the circle of the European culture at the beginning of the twentieth century and did not fail to go towards manifestations of real hostility from physicists who were linked to the picture of the world offered by classical physics and the nineteenth-century experimental methodology; they had no inclination to a kind of physics which seemed to them abstract, too mathematized, not viewable. Inside the so-called "Copenhagen-school", the founders of the theory did not hide difficulties to elaborate a new way of considering nature and sometimes they even suggested that philosophy would have to stimulate a part of this conceptual renewal. Surely, neopositivists were the most inclined philosophers to accept this challenge; many of them were able to fully understand the new theory and sometimes, as in the case of Hans Reichenbach, to make their innovative and decisive contribution to the elaboration of its interpretation [31]. In the neokantian movement, Ernst Cassirer suggested an interpretation in Kantian terms of quantum mechanics, which will be received by his followers or authors inclined to evaluate the new theory from the same perspective [11], [35]. On the other hand, except for Kantian and neopositivist traditions, the reactions were sporadic and less incisive than those which accompanied the

320

321 appearance of the theory of relativity; in fact its fundamental concepts seemed immediately to have a philosophical pertinence [13]. Quantum mechanics, instead, showed itself to be a complex theory, not viewable and often in full contrast with notions of common sense.3 In Italy such difficulties seemed particularly marked for philosophical reflection, because Italian scientists had neither shared in the elaboration of the formal apparatus of quantum mechanics nor realized any work of mediation between their scientific task and cultural popularization outside the confined sphere of scientific research [2], [17], [18], [34]. The aim of analysing Antonio Gramsci's thought about microphysics is part of a wider plan of reconstructing the cultural and philosophical European climate in the age of the appearance of quantum mechanics; we are going to pay particular attention to the philosophers' capability of appropriating conceptual novelty of this physical theory. 2. Gramsci's criticism of positivism and his conception of science In the sphere of philosophical reflection on science realized by Italian intellectuals in the first decades of the twentieth century, Gramsci was an element bearer of novelty.b In fact, as is well known, the intellectual climate of that age was dominated by the idealistic approach which shared the philosophy of Benedetto Croce and Giovanni Gentile, and by a widespread distrust towards science, which often became a real antiscientific culture0. Such hostility had its a

b

c

In [2], p. 30, Agazzi underlines "the relative poorness of epistemological discussions about quantum physics", compared to those about relativity; in fact, Italian mathematicians had prepared adequate mamematical instruments only to understand the latter. The fact that Gramsci had no part in the prevailing Italian culture in the three decades of the nineteenth-century is emphasized by Rossi, who singles out the characteristics of a similar culture in irrationalism, in the criticism of science and its union with technology, in the nostalgia towards the past ([33], p. 56). See [2] and [21], where Garin points out that the question of the role of the intellectuals in contemporary society was a subject that "was troubling" the European culture; so, Gramsci's reflection is surely not limited only to Italy's point of view (pp. 291 ff.). For the well-known critical attitudes of Croce towards science, see [12]. The stereotype that ascribes to the philosophical idealism, at the same time as the advent of fascism, the responsibility of the diffusion of an antiscientific culture in Italy, is analysed in [2]; Agazzi proves how idealism played in effect only a secondary role in causing the extinction of some phases of the previous Italian philosophy of science; after all, he points out that "such a kind of antipositivistic and, in the specified sense, also "antiscientific" reaction was diffused in those years in the whole world at philosophical level, such were the instrumentalistic, pragmatistic and conventionalistic interpretations of scientific knowledge and even the depreciations of formal logic" (p. 25); finally, Agazzi notices that inside Italian idealism we can find also more favourable attitudes towards science, for example that of Ugo Spirito, who appreciated it as a humanistic knowledge. The Italian panorama between the first and the second war was then more diversified than is usually known, and, according to the author, the debates occurred inside actualism, or in controversy with

322

origin in a misleading identification of general scientific empirical results with the representation of science offered by the nineteenth-century positivism. The idealistic criticism indeed remarked a narrowness of an activity, such as the scientific one is, which eventually limits itself merely to considering empirical data, even though translated into general laws, but however not able to embrace the large movements of thought in which subjectivity expresses itself. Positivism, after all, had offered an extremely limited image of scientific concern, either reducing it to a collection or a cataloguing of empirical datad, or linking it strictly to the idea of progress as evolution, which was badly suited to the historical and cultural context of the first decades of the new century. After all, all over Europe a criticism of positivism had been expressed not only by the strictly neoidealistic circle (Croce, Gentile) or by the "spiritualistic" one (Bergson), but also by authors that had taken an active part in elaboration of scientific theories, such as Poincare6. Gramsci's criticism of positivistic science is connected to this wider European perspective, which is developed by him on the basis of the double point of view of idealism and Marxism [21]. In fact, positivism is considered by Gramsci a kind of superstition, as it transmits a false image of science and even assumes the form of a reactionary ideology. The main mistake made by positivists lies also in a hypostatization of scientific prediction, based on mechanical causality, and in consideration of the latter as a methodological criterion, which we can apply also to different sciences from the natural ones,

d

c

it, ended in a gnosiological background which was very useful to epistemology (p. 31). For Leonetti the controversy between idealists and positivists was exhausted in the twenties, leaving its place to that part of Gramscian reflection which was more oriented in political sense [30]. That is the Gramscian criticism of positivism, which is oriented to an "abstract classifying, to methodologism and to formal logic"; see [24], p. 1467 (Q 11, § 45, 57 bis). Rossi points out that the controversy with the positivistic representation of progress had formed, in the three decades of the century, from very different ideas: "Ideas taken from Gentile and Croce, Bergson and Mach, James and Poincare, Nietzsche and Sorel acted in conjunction — even if used with different aims — as well as themes from intuitionism, conventionalism, pragmatism, historicist idealism, from the actualistic one and the magical one" ([30], p. 55). A similar point of view is supported by Garin: "We were wrong if we isolated the movement of ideas which was prevailing in Italy between the end of the nineteenth century and the first decades of the twentieth one, considering it as a "provincial" episode, and bringing it closer to some aspects of the French culture (Sorel, Bergson) or, perhaps, to the North American one (James), but separating or, at the worst, opposing it to the contemporary developments of the philosophy of life, of German historicism and even of Husserl. A criticism of science, the distinction and the antithesis between science of nature and science of life, between life and forms and so on, are themes circulating everywhere symmetrically" ([21], p. 354). As for the criticism of the formalism of logic made by Poincare\ see [2], pp. 28 f.

323

beginning, for example, from the historical ones. But, Gramsci concludes, "it is the concept itself of "science", [...] which requires to be critically destroyed. It is taken root and branch from the natural sciences, as if it were the only science or the science par excellence, as decreed by positivism" . In other words, even though Gramsci admits that positivistic exalting of science is the outcome of a bourgeois ideology, he does not extend his refusal to science as such; on the contrary, he realizes the importance of scientific method in the distinction between facts and ideological elements and desires a more authentic knowledge of scientific data and methods. Such a knowledge can be pursued by means of a wider appropriation of essential scientific notions, popularized by means of "scientists and serious students, and no longer by allknowing journalists and the self-opinionated self-taught of this world" ([24], p. 1459 (Q 11, § 39, 53 bis); [26], p. 295). Therefore, despite the fact that Gramsci is never completely free from his idealistic development, he still fully understands and exalts the deep value of science, not only for the general human knowledge, but also as a mean of emancipation of Italian culture8. Science continues to be a superstructure, for Gramsci, but it distinguishes itself from the others because it contains in itself the method for a critical distinction between ideological and factual terms. 3. Gramsci and microphysics To fully understand Gramsci's reflection on microphysics, we have first of all to consider the means by which he learned the novelty introduced by new physics — taking into account the years from 1925 to 1927 as those in which the main contributions on the formal structure of quantum mechanics were published11 — and to evaluate which books he had read during his imprisonment. Gramsci was arrested on 8 November 1926, as a result of "exceptional measures" adopted by the fascist dictatorship, and submitted to a regime of isolation; during February of 1927 he obtained a permit to receive books and

' [24], p. 1404 ( g 11, § 15, 26); [23], p. 438. The controversy with the deterministic view of economic-social structures is a topic which has been present in Gramsci's philosophy since his early works ([21], pp. 303 ft). 8 In Gramsci's opinion science not only is the bearer of theoretical values, but also of historicpolitical ones: for an analysis of the revolutionary function of culture in Gramsci, see [21], pp. 297 f.; for an examination of the crisis of the culture of the Italian middle classes, [32], pp. 235-244. We take into account only the contributions on quantum formalism and not all of Bohr's previous works on the structure of the atom, or that of Planck, going back to the beginning of the century. In spite of the publication of the results in international reviews, in Italy Enrico Fermi was the first who originally contributed to the study of the atomic structure, and only since 1923 [18].

324

reviews, and only later he was allowed to write inside his cell'. He began a series of irregular readings, from what he was receiving from friends and relatives; he also fulfilled some translations from German to Italian as a relaxing exercise. However reading, which at the beginning had seemed to him an antidote and a valid self-defence against the brutishness determined by prison and isolation, ended up very soon by appearing to him an empty intellectual exercise, because it had no definite aimJ; so Gramsci began the draft of Quaderni del carcere, to which he devoted himself from February 1929 to 1935, when he was forced to suspend the work due to the bad state of his health. The plan of work was at first divided into four parts, dedicated to the following subjects: a research on the history of Italian intellectuals, a study of comparative linguistics, an analysis of Pirandello's theatre and an essay on serial stories [22]. This project was successively modified many times and was not always kept to by Gramsci, because his health worsened, or because of the modification of his requirements and interests, and finally because of the impossibility to find the books he needed for his research. So Gramsci's reflection on science was placed inside a wider critical reconstruction of the Italian culture of his time. Gramsci's clear awareness to be at the beginning of a new age did not escape his attention, in spite of his imprisonment. Paradoxically, the restrictions seemed to have intensified his sensibility towards wider themes and his acuteness of mind in catching suggestions from the scientific European debate\ although they were filtered from second-hand sources; in fact they were not examined closely through a systematic enquiry. For a more detailed analysis, we can ask what books by means of which Gramsci heard of new scientific theories were. In numbers 8, 9 and 11 of his Quaderni del carcere, he cites many times The nature of the physical world by Eddington [15] (in the French edition of 1929 [16]), and then the "booklet" by Giuseppe Antonio Borgese [6], defined as a "small book in which G. A. Borgese ' Gerratana specifies that Gramsci obtains a permit to read newspapers, starts a double subscription to the prison library and has the right to eight volumes a week; besides he receives books and reviews from the outside and may write two letters a week ([22], p. XV e LXI). ' In a letter of December, 11, 1926, sent to Piero Sraffa, Gramsci asks his friend to send him some books, to face the boredome generated by the reclusion ([25], 1, p. 44). So he decides to extend his research to an analysis of Italian intellectuals, from an impartial point of view (fiir ewig). k In contrast with the presumed Gramscian provincialism, Garin objects that Gramsci is really "a man who shares the drama of the post-war period and of the Russian revolution, curious of any kind of books and teachings, who lived between Moscow and Vienna during the crucial years, and not in peripheral backgrounds, but in contact with the main characters of the world-history" ([21], p. 344).

325 speaks about the new trends of scientific opinion (Eddington) and announces that they dealt a blow to historical materialism" ([24], p. 985 (Q 8, § 77, 25 bis, March 1932)1. Borgese was a figure of prominence in the intellectual Italian panorama at the beginning of the nineteenth-century"1; his small volume, Escursione in terre nuove [6] (published in 1500 copies in 1931), was a report on a journey to Oxford, made on the occasion of the 7th International Congress of Philosophy, during the month of September 1931; therefore, it seems a medley of scientific and philosophical argumentations, on the one hand, and of descriptions of landscapes and personal suggestions on the other. Borgese makes many relevant comments on European and Italian philosophical culture, with a disenchanted view, but able to target some elements of novelty, above all in science. On this matter, it is worth noting that Gramsci's scientific knowledge is the outcome of the report of a literary man, with whom he shares the Crocian development, however completely independent of the provinciality of a large part of contemporary Italian culture. Borgese recognizes "the most peculiar sign of the modern era" in the "aspiration after discontinuity", and cannot fail to observe how contemporary philosophical culture seems a sort of Scholasticism, which perpetuates the past, completely unaware of the novelties occurring in other spheres of knowledge: For three or four centuries people have spoken without respite of revolutions: there is neither form of thought which does not aspire to subvert the past, or almost any man who does not hold in him the myth of a conversion; indeed, this aspiration after discontinuity is the most peculiar sign of the modern era. However, a lot of talk about revolutions does not exclude that there have been and there are revolutions.

1

The volume [16] is part of the books deposited in "Gramsci's Fund" with prison-marks of the gaol of Turi (dating back to the period November 1930-March 1933). In a letter of August, 31, 1931, in [25], 2, p. 62, Gramsci asks his sister-in-law, Tatiana Schucht, to send him "a book on physics by a well-known English writer, [...] I think it was Eddington", and another by Sir James Jeans, The universe around us [27], published in Italian translation in 1931 [28]. So Gramsci comments: "Jeans is a pure physicist; Eddington, however, accepts idealism in science". For an exact dating of the Notebook 8 — in which the physicists are quoted — we refer to [24], pp. 2365 f, as well as to [20], in which Francioni suggests an interesting reconstruction of the Gramscian working method and a more precise dating from that realized by Gerratana; in particular, the Notebook 8 was written between November 1931 and May 1932; inside, we recommend the paragraphs: 170 (Scientific ideologies) and 176 (The new science), written in November 1931, and finally the paragraph 177 (The "objective" reality), dateable between November and December 1931 (pp. 140-146). Both Eddington and Borgese are mentioned again in Notebook 11, § 36, written between August and the end of 1932.

m

G. A. Borgese (1882-1952) was literary critic, writer, contributor to newspapers, university teacher of German literature. Because he refused to take the oath required from the fascist regime, he was forced to move to the U.S.A., where he lived from 1931 to 1949. He wrote many travel books, in which naturalistic descriptions are interlaced with the historical-political dimension.

326 What happened in physics and in natural sciences in the last few years is certainly one of the most important events of the century; and it will not pass without profound consequences on any moral science, philosophy and religion. At Oxford I thought it was not sufficiently spoken about, or, as usual, without sufficient discernment; on the contrary so many reports or debates, in different philosophical fields, seemed to me faint, tired and scholastic things, without any interest for the soul. In the news I gave, I tried to point out only what has real value, what makes us imagine a tomorrow. This revolution or turning point is very recent and its fame did not become common even in the nearby fields. One of the most named philosopher still used to say at Oxford, in September 1930, that "tout dans la nature est determine: il n'y a pas d'effets sans cause". And shortly before, a philosopher of ours, one of the most recent and subtlest, had written that everything in the physical world happens according to an inexorable causal chain. Many people still talk about earth or sky as if nothing important had happened after Galileo and Newton, after the Angel had spread such a large wing ([6], pp. 10 ff.).

Borgese has the impression of failing the philosophers' pride, of the diffusion of a possibilism by which the construction of systems is improbable, of the disappearance of great philosophical personalities who embody the spirit of the ages: "In this world of problematic and descriptive philosophy, in which nature is physics and soul is history, it becomes more and more improbable to face a violent formula which opens the mind, in a heroic thinker who embodies it" ([6], p. 26). Nevertheless, Borgese remarks how at the philosophical congress of Oxford nothing seemed to him so interesting as the question posed in the first plenary session: "Is the recent progress in physics metaphysically important?" ([6],pp.35ff). Borgese expressly quotes Eddington, understanding the aspects which have revolutionized physics, that is the questioning of the spatio-temporal concepts by Einstein and Minkowski and the representation of matter offered by Rutherford: "Between 1905 and 1908 Einstein and Minkowski introduced fundamental changes in our ideas of time and space. In 1911 Rutherford brought about the greatest change in the idea of matter from Democrito's time". However, Eddington continues, while Einstein's ideas seemed immediately revolutionary, those of Rutherford did not make a stir [...]. They were instead ahead of the time of the great subversion. Rutherford, discovering the vacuum inside the atom, pulverizes — if there were a sufficient word for such radical demolition — the solidity of nature and shakes up the traditional assumption that things are more or less as they appear to the senses. The atom is as discontinuous as the solar system is. "If we eliminated all the unfilled space in a man's body and collected all his protons and electrons into one mass, the man would be reduced to speck just visible with a magnifying glass" ([15], pp. 1 f.). [...] Here, men in the street, theosophers, spiritualists, emanators of ectoplasm, yogis, fakirs and even conjurers [...], all a multicoloured group surrounds the new physics and dares to ask: but are we really sure that, all things considered, that little fragment exists?

327 Should one not imagine that a further analysis, a more penetrating inquiry, will dissolve even this last leftover of definite existence into empty space? "Matter leaves universe" ([6], pp. 40 f.)

About consequences of the new physics, Borgese concludes: Men will have to find new words, that is new sentiments, for the new things that they have now in mind. [...] As for microphysical phenomena, on the infinitely little one to which nowadays many people so passionately play attention, someone could say that "they can not be considered to exist independently of the subject that observes them". So all the reality passes to thought. Jorgen Jorgensen, a participant at the Congress of Oxford, could say that "once all the ideas existing until now on the physical world have been dashed, we have only a magnificently well working symbolism left, but the exact meaning of which, if we suppose that it has one, nobody has yet revealed" ([6], pp. 47 f.).

Borgese then recalls expressly Eddington's discussion on Heisenberg's principle of indeterminism and the refusal of causality introduced by it, emphasizing the year 1927 as the beginning of the new era of physics and of the downfall of positive and mechanical sciences ([6], pp. 55 ff.). Gramsci takes up again Borgese's references to Eddington, commenting on them in the light of his philosophy; as an example, Gramsci points out how in the new image of matter, in which the space between protons and electrons is much wider than we would expect — such a thing made a deep impression on Borgese — "there is no meaning", because "there would be no change in ratios and relationships, things would stay just as they are" ([24], p. 1451 (Q 11, § 36, 49), p. 1043 (Q 8, § 170, 52 bis); [26], p. 286)n. For Gramsci, in these elucubrations "we are dealing with mere word-play, with science fiction, not with a scientific or philosophical thought. It is a way of posing the question that is fit only for creating fantasies in empty heads" ([24], p. 1451 (Q 11, § 36, 49 bis); [26], p. 286). In other words, while Borgese had interpreted Eddington and modern physics in a mere immaterialistic and subjectivistic way0, instead Gramsci estimates the problems of modern physics as questions regarding language; so, on one side he plays down the difficulties in the representation of microscopical reality, but on the other he grasps entirely the linguistic, between others, dimension of the matter. In fact, he does not hesitate to conclude (in 1932):

" Eddington's sentence, to which Gramsci refers, is drawn from [16], p. 20: "L'atome est aussi poreux que le system solaire. Si dans le corps d'un homme nous eliminons tuot l'espace depourvu de matiere et que nous reunissions ses protons et electrons en une seule masse, 1'homme serait r6duit a un corpuscule a peine visible a la loupe". The Italian translation in Notebooks is made by Gramsci; Borgese quotes the original edition, widi a little different translation ([6], p. 41). 0 An idealistic interpretation of physic, conceived under Eddington's guidance, can be also found in [14], p. 55, in which De Giuli states: "Idealism finds in science the best confirmation of itself.

328 In Eddington's physics and in many other manifestations of modern science the surprise of the ingenuous reader depends on the fact that the words used to indicate certain facts are modified to denote arbitrarily quite different facts. A body remains "massive" in the traditional sense even if the "new" physics demonstrates that it is comprised of one million parts of matter and 999,999 parts vacuum. A body is "porous" in the traditional sense and does not become so in the sense of the "new" physics even after Eddington's claim. [...] The glosses of the various Borgeses in the long run will serve only to reduce the subjectivistic conceptions that allow trivial playing around with words in this way to a state of ridicule ([24], pp. 1451 f. ( g 11, § 36, 49 bis); [26], pp. 286 f.).

The universe around us, by James Jeans [27] (appeared in Italian translation in 1931 [28]) contributes to suggest Gramsci's correct interpretation of Eddington's words; in it the author explains the experiments and the model of atom proposed by Rutherford, emphasizing that all the universe is filled by vacuump. However, the immaterialistic conclusions are an outcome only of Borgese's personal interpretation, which is exposed by Gramsci. In other words, with an approach like a neopositivistic one, Gramsci fully realizes the need not to allow that philosophical speculation follows suggestions of play-words, maintaining precise terms, logical and conceptual rigour. But he notices above all that common language, moulded according to the macroscopic reality, very soon shows itself to be inadequate to describe the microscopic one ([24], p. 1454 (Q 11, §36, 50)). A similar linguistic mistake is made — in Gramsci's opinion — by Borgese, when he quotes the expression of the participant to the Congress of Oxford, Jorgensen; according to him the infinitely small phenomena "can not be considered independent of the subject that observes them"; on this matter also Mario Camis writes in turn that "these are words which give rise to quite a number of reflections and, from completely new standpoints, bring back into play the great problems of the subjective existence of the universe and the meaning of sensorial information in scientific thought" ([8], p. 131; quoted in [24], p. 1452, p. 1048; [26], p. 287). If we had to interpret these observations in a mere literal way and not metaphorically — Gramsci explains — physical phenomena would in fact not even have been observed, but "created"; they would be an issue of the experimenter's subjective experience, like works of art

p

As Gerratana specifies, in [24], p. 2901 (note 6, § 36 of Q 11), Gramsci wants to read the work by Jeans after the recommendation made by Mirkij; the volume [28] is part of the "Gramsci's Fund", with prison-marks (Turi III). Experiments made by Rutherford are described in [28], pp. 116-119; [29], pp. 112-119, where Jeans states: "As we pass the whole structure of the universe under review, from the giant nebulae and the vast interstellar and internebular spaces down to the tiny structure of the atom, little but vacant space passes before our mental gaze. We live in a gossamer universe; pattern, plan and design are there in abundance, but solid substance is rare" (p. 114).

329 ([24], p. 1454 (Q 11, § 36, 51)). Instead Camis himself — even though he was aligned to "a way of thinking about the 'new' physics" prevalent among British scientists in particular — implicitly ends up by explaining that the expression quoted by Borgese should be understood in a merely metaphorical sense (Ibid., p. 1452, p. 1456; [26], pp. 287 f)q. Again Gramsci fully realizes the problem of subjectivism and of solipsism implied by a possible interpretation of quantum theory, but he tends to reduce their importance assigning these ill-fated results to a rough misunderstanding by Borgese and his sources: If it were true that the infinitely small phenomena in question cannot be considered as existing independently of the subject who observes them, they would in fact not even be "observed", but "created" and would fall into the same domain as the pure imaginative intuition of the individual. The question of whether the same individual can create (observe) the same fact "twice" would also have to be posed. One would not even be dealing with "solipsism" but with witchcraft, with demiurgic powers. It would not be these (non-existent) phenomena but rather these imaginative intuitions that would, like works of art, be the subject of science. [...] But if, on the other hand, despite all the practical difficulties inherent in different individual sensitivities, the phenomenon did repeat itself and could be objectively observed by various scientists independently of one another, what would the assertion quoted by Borgese mean except that a metaphor was being used to indicate the difficulties inherent in giving a description and an objective representation of observed phenomena? It does not seem difficult to explain mis difficulty: 1) with the lack of literary ability of scientists who, up to now, have been didactically trained to describe and represent only macroscopic phenomena; 2) with the insufficiency of common language, which has also been fashioned for macroscopic phenomena; 3) with the relatively slight development of these sub-microscopic sciences, which are awaiting a further development of their methods and criteria in order to be understood by many people through the channels of literary communication [...]; 4) one must always bear in mind that many sub-microscopic experiments are indirect, chain ones whose result "is seen" in the results and not in the act itself (as in the experiments of Rutherford). One is, in any case, dealing with the initial and transitory phase of a new scientific era, which — together with a great intellectual and moral crisis — has produced a new form of "sophistry" ([24], p. 1454 (Q 11, § 36, 51 e 51 bis); [26], pp. 289 f.).

In other terms, then, Gramsci realizes that solipsim and subjectivism are implicit in quantum theory, at least as it has been conceived and interpreted, even if he thinks that these results are due to the literal use and meaning of expressions which are instead to be understood in a mere metaphorical sense; equally suitably he underlines the need to develop concepts and forms of

q

Camis — taken note of the difficulties imposed on the experimentation from the "minuteness" of the methods of inquiry required from the physical and medical sciences — comments only: "We feel like asking what are the objective conclusions which in fact can be drawn from such subjective intuitions as are those here mentioned" ([8], pp. 131 f.).

330

language able to describe microscopical reality, in the same way as Bohr used to invite people to devise a new theory of knowledge. So Gramsci, far from yielding to the temptation of accepting immaterialism, finds other reasons able to guarantee objectivity to a scientific knowledge not founded on an undoubted observation of phenomena: intersubjectivity or communication between scientists on the results of their independent experiments. This is the only criterion that can guarantee that observed phenomena do not emerge from individual speculation. Finally, he emphasizes another interesting aspect, that will be developed by later historiography. After his assertion that quantum theory is in an initial phase and is fated to be improved, the author draws attention to the subjectivistic and "irrationalist" results to which it seems to lead; he thinks that they are issues of "the intellectual and moral crisis" of the society in which this scientific theory has been conceivedr. Such a kind of sociological model in the approach to the history of science — Kuhn is a forerunner of it — has been used by Paul Forman to reconstruct the cultural climate that gave rise to quantum mechanics, with similar remarks to those already made in short by Gramscis. Another very meaningful remark developed by Gramsci concerns the ageold controversy about the existence of an external world; on this matter neopositivists, that in philosophy were following a similar approach to that of empirical sciences, maintained that it was impossible to make asserts endowed with an empirical meaning on the question of realism. On this topic all of them were true to Carnap who, since 1928, in Der logiche Aufbau der Welt, had considered either realism or idealism as pseudoquestions [9], [10]. After few years, Gramsci independently draws similar conclusions, asking himself if science can give any "certainty" on the objective existence of an external world. His clearness of thought is revealed when he states:

' The attention to the social-historical dimension in the history of science and the idea that the origin of the crisis of modern physics, such as other single fields of science, found in the historical development of the capitalistic society was a theme that Gramsci and Bukharin shared; his paper [7] is quoted and discussed in many passages of the Notebooks. s Forman considers the birth of quantum mechanics and of its subjectivistic and non causal interpretation as an outcome of the decadence of a precise historical period, corresponding to the one that was in Germany between the two wars; during the Republic of Weimar, in fact, the triumph of different forms of irrationalism happens; a kind of neoromantic philosophy (Lebensphilosophie) is diffused, such as — in the name of life and of its individuality and hostility to mechanistic theory, to exact sciences and their technical applications — denies in the first place the validity of the causal law [19]. Gramsci anticipates many topics which will be successively developed by historians of science not influenced directly by his thought [4] [5].

331 One may maintain it is an error to ask of science as such the proof of the objectivity of reality, since this objectivity is a conception of the world, a philosophy and thus cannot be a scientific datum. [...] "Objective" means this and only this: that one asserts to be objective, to be objective reality, that reality which is ascertained by all, which is independent of any merely particular or group standpoint. But, basically, this too is a particular conception of the world, an ideology. [...] But if scientific truths themselves are not conclusive and unchangeable, then science too is a historical category, a movement in continual development. Only that science does not lay down any form of metaphysical "unknowable", but reduces what humanity does not know to an empirical "not knowledge" which does not exclude the possibility of its being known, but makes it conditional on the development of physical instrumental elements and on the development of the historical understanding of single scientists. If it is so, what is of interest to science is then not so much the objectivity of the real, but humanity forging its methods of research, continually correcting those of its material instruments [...], in other words culture, the conception of the world, the relationship between humanity and reality as mediated by technology. In science, too, to seek reality outside of humanity, understood in a religious or metaphorical sense, seems nothing other than paradoxical. Without humanity what would the reality of the universe mean? ([24], pp. 1455 ff. {Q 11, § 37, 51 bis, 52, 52 bis); [26], pp. 291 f.).

Gramsci's considerations conclude, then, not only with the assertion of the impossibility for science to state the reality of an external world, but also with the reduction of scientific knowledge to an argument about man: "Science too is a superstructure, an ideology"; this is demonstrated by the fact that in some periods it had not any priority-role which we assign nowadays to it, as a result of a real "infatuation"'. On this matter Gramsci also criticizes the approach followed by Bukharin, delegate of the Sovietic Union at the second international Congress of History of Science and Technology (held in London in 1931), who represents a source for him but also a critical target". In fact, the soviet delegate maintains that the subjectivistic conception prevailing in modern philosophy and science has a religious origin, that can be ascribed to the Bishop Berkeley and to his idea of the esse est percipi. On the contrary, Gramsci criticizes Bukharin's conception, asserting that the whole question of external reality is misleadingly formulatedv. 1

For the criticism of the "abstract and impersonal fetishism of science" and the "deification of the corresponding categories", see [7], p. 28, 20; [24], p. 1458 (Q 11, § 39); [26], p. 295. " According to what is specified by Gerratana, [24], p. 2765, Gramsci had received in prison, at the end of August 1931, the proceedings of the Congress of History of Science and Technology (London, 29"' June-3"1 July 1931) [7] ([25], 2, p. 62). v In [7], pp. 11 f., Bukharin writes: "Nearly all the schools of philosophy, from theologising metaphysics to the Avenarian-Machist philosophy of "pure description" and renovated "pragmatism", with the exception of dialectical materialism (Marxism), start from the thesis, considered irrefutable, that "I" have been "given" only "my" own "sensations". This statement, the most brilliant exponent of which was Bishop Berkeley, is quite unnecessarily exalted into a new gospel of epistemology". Gramsci points out mat religion cannot wander from the idea of an independent reality; in fact, in Italy positivism was absorbed by the religious culture to refute subjective idealism [24], p. 894 (Q 7, § 47, 73 bis); [3].

332

It is true that believing in the objective existence of an external world is by now a kind of metaphysical credo that positivistic science shares with common sense; but it remains right to address the issue, not as an object of scorn, but as an opportunity for a correct historicist interpretation: Man knows objectively in so far as knowledge is real for the human race historically unified in a single unitary cultural system. [...] There exists therefore a struggle for objectivity (to free oneself from partial and fallacious ideologies) and this struggle is the same as the struggle for the cultural unification of the human race. [...] Up to now experimental science has provided a basis on which a cultural unity of this kind has reached its furthest extension. This has been the element of knowledge that has contributed most to unifying the "spirit" and making it more universal. It is the most objectified and concretely universalized subjectivity ([24], p. 1416 sg. (Q 11, § 17, 31 bis); ibid., pp. 1075 ff., pp. 1455 ff., [23], pp. 445 f.).

However, positivism gave a misleading image of science, not only exalting excessively its value, but above all not realizing that it cannot be identified with a mere collection of empirical data; more appropriately, instead, it has to be defined as a historical category, which arises from the union of empirical data and hypotheses: Science never appears as a bare objective notion — it always appears in die trappings of an ideology; in concrete terms, science is the union of the objective fact with a hypothesis or system of hypotheses which go beyond the mere objective fact. It is true however that in this field it is relatively easy to distinguish the objective notion from the system of hypotheses by means of a process of abstraction that is inherent in scientific methodology itself ([24], p. 1458 (Q 11, § 38, 53); [26], p. 293).

If then it is true that science is a historical category and that scientific objectivity is historically "dependent on the theoretical activity" of man ([3], p. 401), however this does not mean that scientific knowledge is completely determined by the subject, because scientific method and the possibility of rectifying our knowledge, submitting them to a continuous empirical check, safeguard it from an absolute levelling on subjectivity. The positivistic debate on the role of hypotheses, on the relationship between theoretical and observative terms and on the so-called theory-ladeness will be successively examined more closely, but in agreement with so many clear Gramscian remarks". In conclusion, Gramsci distinguishes himself from the intellectual climate of his time and, even in isolation and by reading few books on the new physics, he can emancipate himself from a mere idealistic and subjectivistic interpretation of quantum mechanics, that would have found a fertile field in

Boothman blames Gramsci because he was not entirely aware of how observations are so theoryladen [4]; but, in our opinion, Boothman's sentence is compromised by a kind of anachronism.

333

which to grow. He can do this, on the other hand, knowing by intuition and great clearness of thought many meaningful aspects that will characterize the epistemological debate of the following decades. References 1. 2. 3. 4.

5. 6. 7.

8.

9. 10. 11. 12. 13.

14. 15. 16. 17.

18.

E. Agazzi (ed.), Lafilosofia delta scienza in Italia nel '900, Franco Angeli, Milano (1987). E. Agazzi, "Fasi e forme della filosofia della scienza italiana nel '900", in [1], 15-41 (1987). M. Aloisi, "Gramsci, la scienza e la natura come storia", Societa, VI, 3, 385-410(1950). D. Boothman, "Gramsci, Croce e la scienza", in R. Giacomini, D. Losurdo and M. Martelli, Gramsci e VItalia. Atti del convegno internazionale di Urbino, 24-25 gennaio 1992, La citta del sole, Napoli, 165-186 (1994). D. Boothman, "General Introduction", in [26], XIII-LXXXVII (1995). G. A. Borgese, Escursione in terre nuove, Meschina, Milano (1931). N. Bukharin, 'Theory and practice from the standpoint of dialectical materialism", in Science at the Cross Roads, Kniga, London, 1-23 (1931); now in J. Needham (ed.), Science at the Cross Roads, Frank Cass, London, 9-33 (1971). M. Camis, "Scienze biologiche e mediche: Gosta Ekehorn, On the principles of renal function, Stockolm, 1931", Nuova Antologia, 1 novembre, 128-133 (1931). R. Carnap, Der logiche Aufbau der Welt, Weltkreis, Berlin (1928). R. Carnap, Scheinprobleme in der Philosophic Weltkreis, Berlin (1928). E. Cassirer, Determinismus und Indeterminismus in der modernen Physik, Goteborgs Hogskolas Arsskrift 42 (1937). B. Croce, Logica come scienza del concetto puro, Laterza, Bari (1909). S. D'Agostino, "La relativita generale nel dibattito degli anni venti fra neokantiani ed empiristi logici. Annotazioni su recenti studi einsteiniani", Physis. Rivista internazionale di storia della scienza, XXXIV, 3, 643-658 (1998). G. De Giuli, "Scienza e idealismo", Rivista di Filosofia, XXII, 1, 53-56 (1931). A. S. Eddington, The Nature of the Physical World, Cambridge University Press, Cambridge (1928). A. S. Eddington, La nature du monde physique, Payot, Paris (1929). V. Fano, "How Italian Philosophy Reacted to the Advent of Quantum Mechanics in the Thirties", in G. Tarozzi and A. van der Merwe (eds.), The Nature of Quantum Paradoxes, Kluwer, Dordrecht, 385^101 (1988). V. Fano, "La riflessione degli scienziati sulla meccanica quantistica in Italia fra le due guerre", in G. Cattaneo and A. Rossi, / fondamenti della

334

19.

20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32. 33. 34.

35.

meccanica quantistica. Analisi storica e problemi aperti, EditEl, Commenda di Rende, 105-118 (1991). P. Forman, "Weimar Culture, Causality and Quantum Theory, 1918-1927: Adaptation by German Physicists and Mathematicians to a Hostile Intellectual Environment", Historical Studies in the Physical Sciences, 3, 1115(1971). G. Francioni, L'ojficina gramsciana. Ipotesi sulla struttura dei Quaderni del carcere, Bibliopolis, Napoli (1984). E. Garin, Intellettuali italiani del XX secolo, Editori Riuniti, Roma (1974). V. Gerratana, "Prefazione", in [24], XI-XLII (1975). A. Gramsci, Selections from The Prison Notebooks, Lawrence and Wishart, London (1971). A. Gramsci, Quaderni del carcere, Einaudi, Torino (1975). A. Gramsci, Letters from Prison, Columbia University Press, New York, vol. 1 and vol. 2 (1994). A. Gramsci, Further Selections from The Prison Notebooks, Lawrence and Wishart, London (1995). J. Jeans, The universe around us, Cambridge University Press, LondonNew York (1929). J. Jeans, L'universo intorno a noi, Laterza, Bari (1931). J. Jeans, The universe around us, Cambridge University Press, LondonNew York (1960). A. Leonetti, Note su Gramsci, Argalia, Urbino (1970). H. Reichenbach, Philosophical Foundations of Quantum Mechanics, University of California Press, Berkeley (1944). C. Riechers, Antonio Gramsci. Marxismus in Italien, Europaische Verlagsanstalt, Frankfurt (1970). P. Rossi, "Antonio Gramsci sulla scienza moderna", Critica marxista, 2, 14, 41-60 (1976). G. Tarozzi, "Introduction: The Italian Debate on Quantum Paradoxes", in G. Tarozzi and A. van der Merwe (eds.), The Nature of Quantum Paradoxes, Kluwer, Dordrecht, 1-50 (1988). C. F. Weizsacker, Zum Weltbild der Physik, Hirzel Verlag, Stuttgart (1960).

THE ROLE OF LOGIC AND MATHEMATICS IN THE HEISENBERG FORMULATION OF QUANTUM MECHANICS ANTONIO VENEZIA Gruppo di Storia della fisica, Dipartimento di Scienze Fisiche, Universita Federico II, Napoli e-mail: [email protected]

In this paper, by means of a logical and linguistic analysis of Heisenberg's work, the properties of a logical model suitable for quantum mechanics are obtained. This model is an alternative to traditional quantum logic, because it uses an intuitionist negation. It is able to justify the passage from the problem of conjugate variables measurement to the mathematical formalization (commutation rules) of matrix mechanics.

1. Two Quantum Mechanics Formulations In 1925 Heisenberg [1] developed a first coherent "method to treat quantum theoretical data". Starting from the methodological premise that only the observable variables (for example atomic spectra) are to be considered, he resolved some special problems, as the one-dimensional harmonic oscillator (energy calculation and comparison with Kramers-Born method) and rotator (electron rotating around nucleus and comparison with Goudsmit-Kronig-Honl formulas). In the same year, Max Born and Pascual Jordan [2] proposed the first mathematical formulation of the Heisenberg method, showing how the observable variables could be represented by matrices of a non-commutative algebra. For this reason, the new mechanics was called matrix mechanics. In 1927 Heisenberg [3], in publishing his uncertainty relations, explained physically why it is impossible to make simultaneous measurements of conjugate variables, i.e. variables represented by non-commuting matrices (for example momentum and velocity of the electron). In 1926 Erwin Schrodinger [4] published four papers proposing an alternative theory to Heisenberg's mechanics, known as wave mechanics. He derived the whole theory starting from a differential equation considered as an axiomatic principle. His formulation was closer to the traditional mathematics of continuum used in classical mechanics, and was the expression of the same ideal,

335

336

according to which it is sufficient to know the initial conditions to predict the evolution of a physical system without any limitation (the ideal opposite to that expressed by the uncertainty principle). In order to reach this purpose, Schrodinger introduced a complex wave function ys, that started a long debate" about its physical interpretation. At the end of the 1920s, there were several attempts to unify the two formulations made by Schrodinger, Weyl, Dirac and von Neumann. From an historical standpoint, some authors, for example F. A, Muller [5], consider the supposedly demonstrated equivalence between matrix mechanics and wave mechanics a "myth". They pointed out that the first Schrodinger's demonstration [6] was at least incomplete. Moreover, in 1975, Heisenberg [7], even if admitting the mathematical equivalence, considered different the physical interpretations of the two formulations. Actually, both from a mathematical and logical standpoint, the next reformulations will maintain some differences already seen in the early two. From a mathematical point of view, for example, Dirac [8] unified the two theories by means of a more abstract formalism (Hilbert space), having in common with Schrodinger's theory the axiomatic organization and the choice of mathematical continuum. Heisenberg, Born and Jordan's matrix mechanics was considered a particular application (the so-called Heisenberg representation) of a more general theory. On the contrary, Weyl [9], by reformulating the theory, started from the uncertainty principle and showed how the Schrodinger equation was a consequence of the Heisenberg commutation rules when a mathematics (i.e. group theory) alternative to differential equations was used. From a logical point of view too, it is possible to recognize two distinct paths linked to the early two formulations. First, in 1936, Birkhoff and von Neumann [10] proposed a non-classical logic for quantum mechanics, characterizing it as a non-distributive logic. They described the quantum property of a physical system in terms of lattices of projectors (that admit only the eigenvalues 1 and 0) and the logical operations of conjunction (A), disjunction (v) and negation (-.) in terms of, respectively, intersection, direct sum and orthocomplement among these lattices. In this way, they could conclude their paper showing that the distributive law of conjunction vs. disjunction didn't hold true and proposed a weaker form called modular law. In the 1960s this approach was the object of several studies and until now it was the predominant one. But it was not the only one. a

From Born probabilistic interpretation of ^(1926) to von Neumann projection postulate (1932) essential to solve the problem of (f collapse.

337

In the 1970s a real alternative to this approach was built by authors such as A. Fine [11], J. Bell and M. Hallett [12], Y. Gauthier [13], J. V. Corbett and M. Adelman [14], according to which quantum logic is a non-classical logic not because the distributive law fails, but because the law of the excluded third doesn't hold true: an intuitionist characterization of quantum logic started. The main criticism against Birkhoff and von Neumann projectors algebra concerns the role of negation and the problem of defining the orthocomplement for infinite-dimensional Hilbert subspaces or for finite but open subspaces. In this last case, a Heyting algebra is required to generalize the concept of the projectors to operators (called effects) with eigenvalues included between 0 and 1. In the detailed analysis of these proposals, we showed that the formal problems of intuitionist quantum logic remain unsolved and we pointed out the lack of a clear reference to the theory's basis. In the next section, we'll show how it is possible, by means of a logicallinguistic analysis of Heisenberg's papers, to link the intuitionist approaches to matrix mechanics, as the Birkhoff and von Neumann approach is linked to Schrodinger's mechanics. 2. Premises to a New Approach to Quantum Logic Starting from the study [16] of the several approaches to quantum logic and from the A. Drago papers [17] on the bases of classical physics, it is possible to review the intuitionist program for quantum logic by means of the following assumptions. In a physical theory we can distinguish three parts: (a) experimental laws, (b) mathematical formalism (differential equations, symmetries etc.) (c) principles. Logic regards only point (c), i.e. the organization of theory's principles. Therefore, in contrast to other authors that try to characterize quantum connectives only by means of experimental examples, we'll consider the linguistic analysis of these principles sufficient to characterize primitive logical connectives and to define the syntax. Glyvenko [18] showed that this is enough to distinguish the classical logic, in which the double negation affirms, from the intuitionist logic without the law of the excluded third. In contrast to the traditional approaches to quantum logic that started from the mathematics of the Hilbert space to derive a logical calculus, here we'll show that the inverse path, in which logic precedes mathematics, is possible.

338 Finally, by agreeing with A. Drago [17], we'll consider the revision of some classical physical theories in terms of a non-classical logic, in which the double negation doesn't affirmb. 3. The Logic of the Heisenberg Formulation In order to reconstruct the birth of matrix mechanics, as it was later called, we are going to consider some Heisenberg papers or letters written between 1925 and 1928. In the 1925 paper, Heisenberg built his quantum theoretical kinematics and mechanics starting from a methodological principle about the operational meaning of physical quantities. He identified measurement as the main problem of the new theory and he looked for a method to solve this problem. His idea was to explain atomic emission spectra not in terms of orbits (as in the old quantum theory), but only by means of the experimental data, such as frequencies and intensity of light emitted or adsorbed by matter. Heisenberg writes ([1], p. 879) that the rules of the old quantum theory are not supported by physical evidence, unless we accept to found the theory on the hope that non-observable quantities could become observable in the future. I have underlined a double negation included in the author's reasoning. Heisenberg doesn't affirm, by collapsing this double negation in an affirmative statement, that "the rules of old quantum theory are supported by physical evidence on the basis of the hope ...". This affirmative statement links the evaluation of a physical theory to a future event, which Heisenberg cannot definitively prove or disprove. Neither can he say that the rules of the old quantum theory have physical evidence, because, for example, position and revolution time of the electron are not observable. Nor can he say the contrary, because the energy is observable and other quantities could be observable in the future. Thus, the double negation is essential to express this semantic ambiguity and it is not reducible to the corresponding affirmative statement as in classical logic or in traditional quantum logic. At the end of his reasoning, Heisenberg can only suggest a method of research: to reformulate a quantum mechanics with only observable quantities. After defining the field application of the theory, Heisenberg gives it a formal representation. He clearly chooses discrete variables.

b

By considering non-classical logic also in classical physics, it is possible to avoid the problem of stating if Quantum Logic is empirical or not, an intrinsic issue of Birkhoff and von Neumann approach.

339 On November 23, 1926, in a letter to Pauli ([19], p. 357) Heisenberg writes that "It is not possible that the world is not discrete ... if space-time is discrete, the velocity in a point has no significance because in order to define the velocity in that point, we need a latter point infinitively near the former: this is impossible in a discrete world". Here is evident the negation of continuous space-time. Moreover we have again a reasoning by double negation. Heisenberg starts from the statement "it is not possible that the world is not discrete". At this point of the argument he doesn't consider this statement equivalent to "the world is discrete", because he has no physical evidence of it. Then, he admits by hypothesis the possibility of space-time discreteness and obtains an operative and verifiable consequence (the impossibility to define the velocity in a point). Only with this conclusion does the initial choice of discrete variables have meaning. Then it is clear that the formal representation adopted by Heisenberg is alternative to classical analysis (used by Schrodinger), because Heisenberg declares explicitly that it is impossible to perform the limit of the incremental ratio that defines velocity. Already in the 1925 paper, by reformulating the classical theory of radiating electron, Heisenberg argued that the concept of electron orbit has not physical meaning. In order to represent the emitted radiation in terms of Fourier series expansion, he writes ([20], p. 125) that "it is always possible to find the quantum theoretical equivalent for a quantity x(tf ...while an essential difficulty arises when we consider two quantities x(t) and y(t) and we try to represent the product x(t) y(t); in the classical theory x(t)y(t) is always equal to y(t)x(t), but in quantum theory this is not true". The failure of the commutative rule was already known to mathematicians for some types of matrix algebras. So it was natural to use this mathematical formalism to develop the new quantum theory. So we are left to consider what Heisenberg called the quasi-equality (the inequalities were introduced by Weyl following Pauli's suggestion): XP-PX=ih/2n (1) where the matrices X and P represent respectively position x and momentum p of a particle. Let us note that from a formal point of view the role of the imaginary number is essential to obtain this quasi-equality. However, even if now Heisenberg has a general rule for the complex matrices, he has not still solved his starting problem of the measurement of physical observables. In order to do that, he needs to have real numbers to compare with the measurement results. This last fundamental achievement is

340

obtained by introducing the concept of measure uncertainty (in the statement of the uncertainty principle)0. Historically there are at least two distinct Heisenberg's statements for this principle. The first is in paper [3]: "The more precisely the position is determined, the less precisely the momentum is known in the same time and vice-versa" (A) The second statement is in paper [26]: "It seems a general law that it is not possible to determine position and velocity simultaneously by absolute precision" (B) These statements were considered by Heisenberg ([27], pp. 15-19) a "semiquantitative" argument with the same mathematical content of Kennard's relations'*. Thus, the statements (A) and (B) introduce the mathematical formulation of quantum mechanics and are suitable for a logical analysis. A detailed analysis of the proposition (A) in terms of intuitionist logic was discussed in a previous paper [28]. Here we want to discuss the content of the second statement (B). The proposition (B) expresses an impossibility by means of a double negation6. The former is clearly "not possible". The latter is included in the word "absolute"="not relative". In fact, according to Heisenberg's insistence of referring the physical theory only to operational quantities, we have to consider the relative precision instead of the ideal "absolute precision". We want to show that this double negation doesn't semantically coincide with the corresponding affirmative proposition and thus the law of the excluded third fails in its underlying logic. Let R be the predicate "measurable on the physical system S by relative precision". Then R(x) means that "position x is measurable on the physical system S by relative precision". The same formalism holds for momentum p. By mean of these definitions, (B) becomes: c

d

e

Heisenberg and Bohr spoke of uncertainty relations. First A. E. Ruark [22] introduced the expression "uncertainty principle". However, Heisenberg [3] used the word principle several times in his 1927 paper, with the meaning of "methodological principle" and not "axiomatic principle" as Schroedinger intended his equation. On the other hand, Heisenberg didn't have an incontrovertible experimental proof to substain the evidence of his principle. For this experimental proof we had to wait for the work of Kaiser, Werner and George 1983 [23], Uffink 1985 [24], Nairz, Andt, and Zeilinger, 2001 [25]. Kennard relations (1927) translated (A) and (B) in the mathematical formula for the product (Ax/lp) of position and momentum uncertainties. Even if Heisenberg didn't formally recognize this double negation, he expressed fully its meaning in a letter to Pauli (Feb 23, 1927; [21], pp. 376), by comparing quantum limitations to thermodynamical ones. In fact, thermodinamical limitations are formulated by the disequality of the machines thermal efficiency and are expressed by the double negation of the impossibility of perpetual motion ( see A. Drago [17]).

341 -,-,(R(x)*R(p)) (2) where the symbols "->" and " A " denote respectively the negation and the conjunction'. The corresponding affirmative proposition of (2) is: (R(x)AR(p)) (3) Statement (3) says that "it is true that we can measure x and p by relative precision". This statement, without further specifications, is neither true, nor false. It is not true because when Ax Aph/n, the measurement is possible. The same consideration holds for the negation of (3); in fact, the statement ^(R(x)AR(p)) (4) means that it is not possible to measure x and p by relative precision; as for statement (3), without further specifications, it is neither true nor false. In conclusion, statements (3) and (4) are semantically indeterminate, while (2) is true. Thus, in order to express the uncertainty principle, as Heisenberg formulized it, a logic without the law of the excluded third is needed. 4. The "Synthetic" Aspect of Heisenberg's Argument Up to now the attempts made to axiomatically organize quantum theory starting from the uncertainty principle have failed. One of the last proposals made by Bub [29] in 2000 showed the necessity to start from more general principles to derive the whole theory. In an indirect way this conclusion highlights that Heisenberg, who instead started from the uncertainty principle, used an alternative to the axiomatic organization. In order to clarify this alternative, let us sum up the logical structure of Heisenberg's work as illustrated in the previous section: 1. the measurement problem of physical observables is formulated by means of a double negation in order to criticize the old quantum theory; 2. the formal representation of quantum observables as discrete variables and the complex matrices formalism follow from the spectra observation; 3. the quasi-equality (expressing the non-commuting property of conjugate variables) are derived by means of the matrices rules. 4. by means of a double negation a semi-quantitative statement (the uncertainty principle) solves the starting problem and introduces the mathematical formulas of the uncertainty relations. f

Let us note the crucial role of negation and conjunction in the formalizing the principles of the two important scientific revolutions of 1900. The reformulation of simultaneousness (both in relativity and in quantum mechanics) has required a rigorous redefinition of the logical conjunction.

342

It's possible to show [28] that this logical organization is a general feature that other authors have adopted both in classical physics (see Drago [17]) and in quantum physics (see T. F. Jordan [30]). In particular, this kind of organization was first proposed and formalized by L. Carnot [31] as an alternative to Newton's analytical and deductive method and for this reason was called "synthetic method". In his mechanics and analysis, L. Carnot started from an operationally founded problem expressed by a double negation. He defined a formal system by introducing an auxiliary variable in order to solve the starting problem. Once he obtained a general rule, he suppressed the auxiliary variable. Thus, he reconsidered the starting system and applied the new rule to the main problem. In Heisenberg and T. F. Jordan case, the auxiliary variable is the imaginary number, necessary to formalize the complex matrices calculus. This variable disappears by means of the square modulus in the uncertainty formula (defined as standard deviation) in order to express the result of a measurement. Thus, L. Carnot mechanics is the real classical counterpart to Heisenberg's theory rather than Newton mechanics (linked to Schrodinger formulation by axiomatic organization and the mathematical continuum). L. Carnot's theory lay between geometry and mechanics, and introduced a mathematical technique less powerful but more operational than differential equations. This technique was founded upon the concept of geometrical motions8 ("virtual invertible displacements"), which can be considered the first group of symmetry in classical physics. Following a similar path, Heisenberg formulated in 1926 [33] the first non-geometrical symmetry of modern physics, by applying the methods of group theory (permutations). Wigner [34] extended this technique to geometrical symmetries (rotations). This link between the theories of Heisenberg and L. Carnot suggests the possibility of reformulating the whole matrix mechanics using the mathematical technique of symmetries; this is a path until now not well explored, except for Wigner, Weyl and T. F. Jordan's attempts of the last century. Conclusions The historical analysis of Heisenberg's formulation of the uncertainty principle can be considered the starting point of a new approach to quantum logic. The logical-linguistic analysis of his papers can justify the modem intuitionist approach to quantum logic in terms of theory's foundations. In order g

C. Gillispie [32] pointed out that the concept of geometrical motion is included in the idea of thermical cycle developed by S. Carnot.

343

to achieve these results, the present analysis has made some assumptions about the basis of quantum mechanics (relationship among principles, mathematics and experiments) and about two kinds of physical theory's organization. Suggestions were made for a complete reformulation of the theory. References 1. W. Heisenberg, Zeit. fur Phys. 33, pp. 879-893, (1925); It. trans, in [20], 2. M. Born, P. Jordan, Zeit. fur Phys., 34, p. 858, (1925). 3. W. Heisenberg, Zeit. fur Phys. 43, pp. 172-198 (1927). 4. E. Schrodinger, Ann. Phys. 79, 361-376; 79, 489-527; 80,437-490; 81, 109139, (1926). 5. F. A. Muller, Stud. Hist. Phil. Mod. Phys., 28, 35-61; 28, 219-247 (1997). 6. E. Schrodinger, Ann. Phys. 79, 734-756 (1926). 7. W. Heisenberg, Bemerkungen iiber die Entstehung der Unbestimmtheitsrelation (1975); It. trans, in [20], p. 105. 8. P. A. M. Dirac, The Principles of Quantum Mechanics, Oxford (1930). 9. H. Weyl, The theory of groups and quantum mechanics, Dover Pub. (1931). 10. G. Birkhoff, J. von Neumann, Ann. of Math. 37, 823-843 (1936). 11. A. Fine, "Some conceptual problems of Quantum Theory", in R. S. Colodny Paradigms and Paradoxes, Pittsbourgh, pp. 3-31 (1972). 12. J. Bell, M. Hallett, Philosophy of Science 49, 355-379 (1882). 13. Y. Gauthier, Int. Jour. ofTheor. Phys., 22, n. 12,1141-1152 (1983). 14. M. Adelman, J. V. Corbett, Appl. Categ. Struct., 3, n. 1, 79-104 (1995). 15. A. Venezia, La logica della Meccanica Quantistica: analisi storico-critica, Tesi di Laurea in Fisica, Universita Federico II, A. A. 1999-2000, Napoli. 16. A. Venezia, "I diversi approcci alia Logica Quantistica", in E. Schettino (ed.): Atti XX Congr. Naz. St. Fis. e Astr., CUEN, pp. 423-450 (2001). 17. A. Drago, Le due opzioni, La Meridiana, Molfetta (1991). 18. V. I. Glyvenko, Acad. Roy. Belg. Bull. Sci. (5) 15, 183-188 (1929). 19. W. Pauli, Wissenschaftlicher Briefwechsel mit Bohr, Einstein, Heisenberg u.a.I-II-III, K. von Meyenn, ed., Springer-Verlag, Berlin (1979). 20. W. Heisenberg, Lo sfondo filosofico della fisica moderna, a cura di G. Gembillo e E. Giannetto, Sellerio, Palermo (1998). 21. W. Pauli, Quantentheorie, in Handbuch der Physik, H. Geiger, K. Scheel (eds.) 23, Springer-Verlag, Berlin (1926). 22. A. E. Ruark, Bulletin of the American Physical Society 2, p. 16 (1927). 23. H. Kaiser, S. A. Werner and E. A. George, Phys. Rev. Lett. 50, 560 (1983). 24. J. Uffink, Physics Letters 108 A, 59-62 (1985). 25. O. Nairz, M. Andt, A. Zeilinger, Quantum Phys., quant-ph/0105061 (2001). 26. W. Heisenberg, Forschungen und Fortschritte 3, 83 (1927). 27. W. Heisenberg, Physical Principles of Quantum Theory, Chicago (1930).

344

28. A. Drago, A. Venezia in C. Mataix and A. Rivadulla (eds.): Quantum Physics and reality, Ed. Complutense, pp. 249-266, Madrid (2002). 29. J. Bub, Stud, in Hist, and Phil, of Mod. Phys. 31B, 75-94 (2000). 30. T. F. Jordan, Quantum Mechanics in simple matrix form, Wiley (1985). 31. L. Carnot, Reflexion sur the metaphysique du calcul infinitesimal (1813). 32. C. Gillispie, Lazare Canot Savant, Princeton U. P. (1971). 33. W. Heisenberg, Zeit. fur Phys. 38,411-426 (1926); 41, 239-267 (1927). 34. E. Wigner, Zeit. fur Phys. 40, 883-892 (1927).

SPACE-TIME AT THE PLANCK SCALE: THE QUANTUM COMPUTER VIEW PAOLA A. ZIZZI* Dipartimento

di Matematica Pura ed Applicata, Universita di Padova, via Belloni 7, 35131 Padova, Italy

We assume that space-time at the Planck scale is discrete, quantised in Planck units and "qubitised" (each pixel of Planck area encodes one qubit), that is, quantum space-time can be viewed as a quantum computer. Within this model, one finds that quantum spacetime itself is entangled, and can quantum-evaluate Boolean functions which are the laws of Physics in their discrete and fundamental form.

1. Introduction What is "space-time" at the Planck scale? Once we understand that, we will be able to formulate the theory of Quantum Gravity, the theory which should reconcile General Relativity and Quantum Mechanics. In fact, it is widely believed that at the Planck scale the quantum aspects of gravity become relevant. Moreover, it is generally assumed that at the Planck scale, space-time is not any longer a smooth manifold, but has a discrete structure. There are two main approaches to quantum gravity that assume quantum space-time to be discrete: Loop Quantum Gravity [1,2] (and spin foams [3]), and String (and M) Theory [4,5]. Other interesting approaches are non-commutative geometry [6], Causal Set Theory [7] and kinds of discrete models of space-time at the Planck scale, like lattice versions of loop quantum gravity [8,9,10], and Cellular Networks [11,12]. In our particular approach to quantum gravity, we assume discreteness of space-time at the Planck scale, and we also include the issue of information, (more precisely quantum information [13,14,15,16]). In fact, as it was suggested by Wheeler (the "It from bit" proposal) [17], information theory must play a relevant role in understanding the foundations of Quantum Mechanics. Wheeler's view is shared, in particular, by Zeilinger (who associates bits with elementary systems, i.e. two-level systems, and claims that the world appears quantised because information is quantised) [18]. E-mail: [email protected].

345

346

As it was first realized by Feynmann, a quantum computer can be exponentially more powerful than a classical one in simulating a quantum system. This line of thought is what we call here the "Quantum Computer View" (QCV). We believe that the QCV is universal, and thus can be extended to the "description" of quantum space-time itself. Approaches similar to ours, still encompassing the QCV, are those of Lloyd [19], and Jaroszkiewicz [20]. Our approach is closely related to Loop Quantum gravity and spin networks. Spin networks are relevant for quantum geometry. They were invented by Penrose [21] in order to approach a drastic change in the concept of space-time, going from that of a smooth manifold to that of a discrete, purely combinatorial structure. Then, spin networks were re-discovered by Rovelli and Smolin [22] in the context of Loop Quantum Gravity. Basically, spin networks are graphs embedded in 3-space, with edges labeled by spins and vertices labeled by intertwining operators. In loop quantum gravity, spin networks are eigenstates of the area and volume-operators [23]. We interpret spin networks as qubits when their edges are labelled by the spin-1/2 representation of SU (2). In this context, we use the quantum version [24,25] of the Holographic Principle [26,27,28]. In our model, quantum space-time is discrete, quantised in Planck units, and each pixel of Planck area encodes a qubit. This is a quantum memory register. To process the quantum information stored in the memory, it is necessary to dispose of a network of quantum logic gates (which are unitary operators). The network must be part of quantum space-time itself, as it describes its dynamical evolution. The quantum memory plus the quantum network form a quantum computer. In the QCV, some new features of quantum space-time emerge: i) The dynamical evolution of quantum space-time is a reversible process, as it is described by a network of unitary operators. ii) During a quantum computational process, quantum space-time can be in an entangled state, which leads to non-locality of space-time itself at the Planck scale (all pixels are in a non separable state, and each pixel loses its own identity). iii) As entanglement is a particular case of superposition, quantum spacetime is in a superposed state, which is reminiscent of the Many-Worlds interpretation of Quantum Mechanics [29]. iv) Due to superposition and entanglement, quantum space-time can compute a Boolean function for all inputs simultaneously (massive quantum

347

parallelism). We argue that the functions which are quantum-evaluated by quantum space-time are the laws of Physics in their most fundamental, discrete and abstract form. v) By scratch space management, we find that at the Planck scale it is possible to compute composed recursive functions of maximal depth. The paper is organized as follows. In Sec. 2, we discuss the new concepts of event in quantum space-time, and its quantum information nature. In Sec. 3, we introduce the Quantum Computer View of quantum space-time at the fundamental level. In Sec. 4, we analyze the possibility of quantum space-time being in a superposed/entangled state. In Sec. 5, we investigate about a possible quantum network chosen by Nature. In Sec. 6, we illustrate how space-time can quantum-evaluate Boolean functions at the Planck scale. In Sec. 7, we investigate about a unitary evolution of quantum space-time Sec. 8 is devoted to the conclusions. 2. Qubitisation of quantum space-time The very concept of event should be revised in the context of quantum spacetime. In fact, the definition of event as a point in a four-dimensional smooth manifold becomes meaningless once space-time is assumed to be discrete, and quantized in Planck units. If the minimal length is assumed to be the Planck length: lp = 10~33 cm and the minimal time interval is assumed to be the Planck time: tp —10 sec, it follows that an event in quantum space-time is an extended object without structure. In the QCV, the quantum event encodes quantum information. The (classical) holographic principle [26,27,28] claims that it must be possible to describe all phenomena within the bulk of a region of space of volume V by a set of degrees of freedom which reside on the boundary, and that this number should not be larger than one binary degree of freedom per Planck area. All this can be interpreted as follows: each unit of Planck area (a pixel) is associated with a classical bit of information.

348 At the Planck scale, however, where quantum gravity takes place, we argue that the encoded information should be quantum, and the holographic principle should be replaced by its quantum version [24,25]. In the quantum version of the holographic principle, a pixel encodes one quantum bit (qubit) of information. (A qubit is a linear superposition of the logical states 0 and 1, namely: \Q\ = a\o) + b\l), where a and b are complex numbers called probability amplitudes, such that U + \b\ =1). The necessity of the quantum version of the holographic principle follows directly from loop quantum gravity. In loop quantum gravity, non-perturbative techniques have led to a quantum theory of geometry in which operators corresponding to lengths, area and volume have discrete spectra. Of particular interest are the spin network states associated with graphs embedded in 3-space with edges labelled by spins

;=o,l,i,|,... and vertices labelled by intertwining operators. If a single edge punctures a 2-surface transversely, it contributes an area proportional to [23]: Let us consider the edges of spin networks in the spin -1/2 representation of SU(2): they are 2-level systems, and can be thought as qubits. In mathematical terms, the group manifold of SU(2) can be parameterized by a 3-sphere with unit radius. In fact, the most general form of 2 x 2 unitary matrices of unit determinant is: f a b^ a\2+\b\2=l U= -b where a and b are complex numbers. For example, the action of the unitary SU(2) matrix

j_

U0t=-j=a where o 2 is the Pauli matrix:

+ i
-r 0,

349

on the edge states

I \ and + I \ respectively, gives the equally 2/ 1^

superposed states

4i

2'

2/ When a surface is punctured by such a superposed state, a pixel of area is created, which encodes one qubit. The elementary pixel can then be viewed as the surface of a unit (in Planck units) sphere in three dimensions. The pixel is punctured (simultaneously) in the poles by an edge in the superposed state of spin down and spin up. Equivalently, a qubit corresponds to the surface of the 3dimensional unit sphere, where the logic states 0 and 1 correspond to the poles. This is the so-called Bloch sphere. There is clearly an analogy between the spin networks approach to quantum gravity and our Quantum Computer View of quantum space-time. 3. Quantum space-time: is it a quantum computer? Having assumed that space-time at the Planck scale encodes quantum information, the latter must be processed to give rise, as an output, to the universe as we know it. If so, quantum space-time is not just a quantum memory register of n qubits: it is the whole thing, a quantum memory register plus a network of quantum logic gates. In other words, space-time at the Planck scale must be in such a quantum state to be able to evaluate those discrete functions which are the laws of Physics in their discrete and most fundamental form. We may interpret that quantum state as the state of a quantum computer which is computing Boolean functions. But doing so, we should assume that at the Planck scale space-time is in a superposed/entangled state. In fact, any efficient quantum algorithm relies on superposition and entanglement of qubits. In quantum computation, superposition and entanglement are very important, because they allow quantum parallelism: the possibility to compute exponentially many values of a function in polynomial time. 4. Is quantum space-time in a superposed/entangled state? If the qubits encoded by pixels were superposed, the surface embedding a region of space would "exist" in many different states simultaneously. This would be a quite weird wave-like aspect of quantum space-time itself. Superposition is one characteristic feature of quantum mechanics, but we should be aware of the fact

350

that once applied to quantum space-time, it spoils the latter of its usual attributes. We think that the idea of a superposed state of qubits associated to pixels fits quite well in the Many-Worlds interpretation of Quantum Mechanics, obviously restricted to the micro-domain of space-time itself, more precisely at the fundamental level. Do the pixels of Planck area encode qubits which are entangled to each other or not? In the affirmative, space-time itself would be spoiled of locality, at the Planck scale. In other words, two quantum events might be described by a single quantum state, each event losing its own identity. This would be a quite weird feature of quantum space-time, but it cannot be discarded a priori, because entanglement is a very peculiar feature of the quantum world. Let us consider a finite number N of pixels pt (i =1, 2...N) each one encoding one qubit I Q\. (notice that the number of pixels of area of a certain surface S is equal to the number of punctures made by spin network' edges in the 1/2-representation of SU(2) onto S). The N qubits span a Hilbert space of dimension 2 . The standard basis for one qubit is: |o), ll) • The dual basis for one qubit is:

The most general one qubit state is: \Q\ = a|o) + b\l) where a and b are complex numbers such that: U + \b\ = 1. A 2-qubits state can be either non entangled (product state of two qubits) or entangled (a non-separable state). The non entangled basis for two qubits is: |00),|01),|10),|11). An example of non entangled two qubits state is the product of one dual basis vector and the qubit 10):

^(|o) + l 1 ))|o)-^d°o)+|io))The entangled basis for 2-qubits (Bell states, maximally entangled) is:

|VP±) = ^ ( | 1 0 ) ± I 0 1 > ) i*±>=^dii>±|oo>).

351 5. The quantum network of Nature Let us suppose that all the N qubits encoded by N pixels are initially in state|000 0) • They form a quantum register of size N, but that is just storage of quantum information. To be able to perform quantum computation, the qubits of the memory must be manipulated by some unitary transformations performed by quantum logic gates (the number of the gates is called the size of the network). Now, to make a superposition of two qubits, it is necessary to dispose of the Hadamard gate:

J_ ft n i -i 4i and to entangle two qubits, it is necessary the controlled-NOT (or XOR) gate: H=

(\ XOR =

0 0 0^1

0 1 0 0 0 0 0 1

0 0 1 0 V In the case of n qubits, we need the Walsh-Hadamard transformation: Hn =H®nLet us see how it works in the case of two qubits. Let us write the standard basis in vector notation: |0) =

|i) =

v°y

vly The action of the Hadamard gate on the ket I o) is:

ft r fi\ ; -I v°/

(0

1 vv°y

= ^(|o)+|i)>

and on the ket |l\ is:

i r (0

j_

i

4i W°y

-i

=^do>-|i»

vl/ Consider a quantum register of size two in state |00) • The action of the Hadamard gate H on the first qubit gives the superposed state:

If we take the superposed state as the control qubit (c), and the second qubit of the memory as the target qubit (t), the action of the XOR gate is:

352 XO/?:_^(|0) +

|1)) (C) |0) (() ^-^(|00) + |11))

which is an entangled state of two qubits. A quantum memory register of size n is a collection of n qubits. Information is stored in the quantum register in binary form. The state of n qubits is the unit vector in the 2"-dimensional complex Hilbert space: C 2 ® C 2 ®...®C 2 , n times. As a natural basis, we take the computational basis, consisting of 2" vectors, which correspond to 2 " classical strings of length n: |0)®|0)®...®|0)s|00...0) |0)®|0)®...®|l)s|00...l)

|l)®|l)®...®|l)s|ll...l). In general, we will denote one basis vector of the state of n qubits as: \X1)®\X2)®...®\XH)

where xl,x2,...,x

= \X1X2..JCH) = \X),

is the binary representation of the integer x, a number

between 0 and 2" . The general state is a complex unit vector in the Hilbert space, which is a linear superposition of the basis states:

E c il*)' where c, are the complex amplitudes of the basis states |A, with the condition: V | c I2 =\. i

To perform computation with n qubits, we have to use quantum logic gates. A quantum logic gate on n qubits is a 2" x 2" unitary matrix U. Initially, all the qubits of a quantum register are set tolo) • By the action of the Walsh-Hadamard transform, the n input qubits are set into an equal superposition: 1

2"-l

-prZI*>V2 *=o At this point the very computation can start.

353

6. Quantum function evaluation at the Planck scale The quantum computation of Boolean functions f is implemented by unitary operators U . In the case of bijective functions / : {0,l}" —> {0,l}", which are reversible, it always exists a unitary operator \J such that:

Uf:\x)->\f(xj), where I x) stands (for brevity) for the input register, namely

V2 *=o The quantum computation of non bijective functions f: {0,l}" —> {0,l}m, (which are non reversible) requires (at least) two registers, in order to guarantee the unitary of U (reversibility of the computation): a register of size n to keep a copy of the arguments of f, and a second register of size m, to store the values of f:

Uf\x)\y) =

\x)\y®f(x)),

where © stands for addition mod 2 m . Notice that in general, (for non trivial functions) the states M U © / ( x ) ) a r e entangled. Moreover, the quantum computation of f on a superposition of different inputs, produces f(x) for all x in a single run (quantum parallelism):

2»l°HI»|/«>' X

X

But we cannot get all values of f(x) from the entangled state V I x\\ f(x)) a s a n v measurement on the first register will yield one particular*value x', and the second register will then be found with the value f(x'). It is possible, however, to compute some global properties of f(x) in a single run. As we already said, both superposition and entanglement are necessary for quantum computation. But it is not obvious that quantum information stored in quantum space-time is exploited to perform quantum computation. It depends on which kind of quantum network (if any) Nature has chosen. The question is: what should be computed by quantum space-time? The answer is: the global properties of Boolean functions, as in a quantum computer. In our case, we argue that the output of quantum computation would be the global structure of the Laws of Physics. Some extra registers (called scratch space) are also needed to store intermediate results. In longer calculations (for example in computing composite functions) this leads to a large amount of "garbage" (or "junk")

354 qubits, which are not relevant to the final result. In order not to waste space, these "junk" qubits must be re-set to |o) and the scratch space can then be "recycled" for further computations. Scratch space management was proposed by Bennett [30,31]. Let us suppose we have to calculate a composite function of depth d. Without scratch space management, the computation would need d operations, and would consume d-1 junk registers. With scratch space management, the computation will need 2d-l operations, and d-1 scratch registers. For example, the computation of a composite function of depth d=2, f(x)=h(g((x))), would need 3 operations, and one scratch register, which can be reused in further computation:

\x,0,0)-^^\x,g(x),0)-^-^\x,g(x),h(g(x)))-^^\x,0,f(x)), Where U ,Uh are the unitary operators implementing the quantum computations of functions g and h respectively, and the suffix numbers refer to the registers operated on. The last step of the computation is just the inversion of the first step and un-computes the intermediate result. The second register can then be reused for further computations. As we have seen, the number of required scratch registers, increases linearly with the depth of the composite function which has to be quantum computed. This fact will be very useful to our purpose. We can imagine the boundary surface S enclosing a volume V of space, as a collection of N pixels of Planck area, each encoding a qubit. Thus S is a quantum memory register of N qubits. If all N qubits are initially set tojo), as always before any computation, the original register can be thought as the product of several registers: |o) lo) lo) ...Jo) where registers x, y, z...w have respectively size n, m, k,..., r such that n + m + k...+ r=N. The initial quantum state I*?)e C 2 of S is then:

l*Ho)J°),l<Mo) w Suppose that register Ifj) has the smallest size, for example n=2. This size is very close to the Planck scale, as for n=2, it is: ~ p. The register |o) can be set to an equal superposition of basis states by the action of the Walsh-Hadamard transform H® which acts locally on it:

Now the quantum state of S is:

l*H*M,|o) t |o>w.

355 where I x) stands for

3

}ZI*>In our case, the quantum computation of a function f: {0,l}" —» {0,1}""" can be implemented by a unitary operator such that:

tf/:EI*)l0),->£!*./<*)) only if the second register y has the right size to accommodate f, i.e., m=N-n, and there are no other registers available. However, if the computation of f produces n' junks bits which fill a scratch register of size n', a second register of size n', has to be provided. The best way to solve this problem, is to take a smaller first register x to enable scratch space management. Moreover, if f is a composite function f(h((g(l(....(x))))) of depth d, the original register of size N must be partitioned in such a way that there are d-1 scratch registers available. So, in order to compute highly composite functions, the first register (storing the argument) must have the smallest possible size, to leave room for the needed number of scratch registers. In particular, if n=l (the Planck scale), the available scratch space has size N-l, and the highest level of composition for f is d=N when d-1 scratch registers, of one qubit each, sum up to the original register of size N. Thus, the quantum computation of highly composite functions must be performed close to the Planck scale, and the output (some global property of f) is obtained at macroscopic scales. According to inflationary cosmological theories, the cosmological horizon has at present a radius R = 1060/,, , thus its surface area is A ~ 10120/p, that is an area of 10120 pixels, each one encoding one qubit. In the QCV, the cosmological horizon's surface can be interpreted as a quantum memory register of N = 10120qubits. Thus, space-time at the Planck scale can compute a composite function of maximal depth
356 It follows that, in the quantum computer view, the dynamical evolution of quantum space-time itself is a reversible process. This sounds like a paradox, as far as we think of quantum space-time as a pre-space-time with almost all the same characteristics of classical space-time, which is the seat of irreversibility. Irreversibility might be just an emergent feature at larger scales. One should be able, however, to figure out what it means reversibility of quantum space-time itself. The simplest answer leads us back to Wheeler's "space-time foam" [35], made up of virtual black holes (and wormholes). Like all virtual processes, also this one takes place by virtue of the time-energy uncertainty relation, (which at the Planck scale is saturated). A quantum black hole of Planck mass, comes into existence out of the vacuum, and then evaporates in Planck time, releasing a quantum of Planck energy back to the vacuum. As this "virtual" process is due to quantum fluctuations of the vacuum, which are non-dissipative [36], it can be considered a reversible process, unless a measurement takes place. But virtual particles cannot be probed. 8. Conclusions The QCV of space-time at the Planck scale relies on linear concepts like superposition and entanglement. Thus, this view cannot be extended to the macroscopic domain, where space-time is described by the non linear equations of General Relativity. To understand how, from the linearity of the Planck scale level we obtain the non linearity of the classical macroscopic level, it might be useful to consider self-organizing models and related technicalities. This is what we call emergence of classicality and complexity (our classical world emerges as one which is complex). As we have seen, in the QCV, quantum space-time looks like having a reversible dynamical evolution. But what does it mean that space-(time) evolves in time, and moreover in a reversible manner? As we have seen, this paradox can be solved by assuming Wheeler's picture of "space-time foam" which however excludes time flow at the Planck scale. Thus, both non linearity and irreversibility, which have no home in the QCV, should be emergent features of space-time. In the QCV, also locality is lost: "space-time" itself is non local at the Planck scale, due to the entanglement of pixels/qubits. This is very much on line with Penrose's argument, stating that the theory emergent from spin networks should have a fundamentally non-local character [37].

357

As far as causality is concerned, it is a more subtle point. However we believe that, because of non-locality due to entanglement of pixels, microcausality is missing at the Planck scale, at least in its usual form. Finally, despite all these weird features, space-time at the Planck scale seems to be able to compute its own dynamical evolution, by quantum evaluating recursive functions. Acknowledgments I wish to thank G. Peruzzi e G. Sambin for useful discussions. References 1. C. Rovelli, "Loop Quantum Gravity", gr-qc/9710008 (1997). 2. C. Rovelli and L. Smolin, "Loop representation of quantum general relativity", Nucl. Phys. B133, 80 (1990). 3. J. C. Baez, "Spin Foam Models", Class. Quant. Grav. 15, 1827 (1998). 4. J. C. Schwarz, "Introduction to Superstring Theory", hep-th/0008017 (2000). 5. M. J. Duff, "M-Theory (The Theory Formerly Known as Strings), hepth/9608117(1996). 6. A. Connes, Non Commutative Geometry (Academic Press, S. Diego, 1994). 7. L. Bombelli, J. Lee, D. Meyer and R. Sorkin, "Space-time as a causal set", Phys. Rev. Lett. 59, 521 (1987). 8. R. Gambin and J. Pullin, "A rigorous solution of the quantum Einstein equations", Phys. Rev. D54, 5935 (1996). 9. R. Loll, "Nonperturbative solutions for lattice quantum gravity", Nucl. Phys. B444, 619 (1995). 10. M. Reisenberg, "A Left-Handed Simplicial Action for Euclidean General Relativity", Class. Quant. Grav.U, 1730 (1997). 11. M. Requardt, "Cellular Networks as Models for Planck-Scale Physics", J. Phys. A31, 7997 (1998). 12. M. Requardt and S. Roy, "(Quantum) Space-Time as a Statistical Geometry of Fuzzy Lumps and the Connection with Random Metric Spaces", Class. Quant. Grav. 18, 3039 (2001). 13. J. Preskill, "Quantum Information and Computation", Lecture Notes for Physics 229 (California Institute of Technology, 1998). 14. M. A. Nielsen and I. L. Chuang, Quantum Computation and Quantum Information (Cambridge University Press, UK, 2000). 15. A. Ekert, P. Hayden, and H. Inamori, "Basic Concepts in Quantum Computation", quant-ph/0011013 (2000). 16. A. Steane, "Quantum Computing", Rept. Prog. Phys. 61, 117 (1998).

358 17. J. A. Wheeler, "It from Bit", in Sakharov Memorial Lectures on Physics, Vol. 2, L. Keldysh and V. Feinberg eds. (Nova Science, New York, 1992). 18. A. Zeilinger, "A Foundational Principle for Quantum Mechanics", Found. Phys. 29, 631 (1999). 19. S. Lloyd, "Universe as quantum computer", Complexity 3(1), 32 (1997). 20. G. Jaroszkiewicz, "The running of the universe and the quantum structure of time", quant-ph/0203020 (2002). 21. R. Penrose, "Theory of quantised directions", in Quantum Theory and Beyond, T. Bastin ed. (Cambridge University Press, 1971). 22. C. Rovelli and L. Smolin, "Spin networks and quantum gravity", Phys. Rev. D52, 5743 (1995). 23. C. Rovelli and L. Smolin, "Discreteness of area and volume in quantum gravity", Nucl. Phys. B442, 593 (1995). 24. P. A. Zizzi, "Holography, Quantum Geometry, and Quantum Information Theory", Entropy 2, 39 (2000). 25. P. A. Zizzi, "Quantum Computation toward Quantum Gravity", Gen. Rel. Grav. 33, 1305 (2001). 26. G. 't Hooft, "Dimensional reduction in quantum gravity", gr-qc/9310026 (1993). 27. G. 't Hooft, "The Holographic Principle", hep-th/0003004 (2000). 28. L. Susskind, "The world as a hologram", hep-th/9409089 (1994). 29. H. Everett III, "Relative State Formulation of Quantum Mechanics", Rev. Mod. Phys. 29, 454 (1957). 30. C. H. Bennett, IBM J. Res. Develop. 17, 525 (1973). 31. C. H. Bennett, SI AM J. Comput. 18, 766 (1989). 32. P. A. Zizzi, "The Early Universe as a Quantum Growing Network", grqc/0103002 (2001). 33. P. A. Zizzi, "Ultimate Internets", gr-qc/0110122 (2001). 34. S. Lloyd, "Computational capacity of the universe", quant-ph/0110141 (2001). 35. J. A. Wheeler, Geometrodynamics (Academic Press, New York, 1962). 36. E. Nelson, "Quantum Fluctuations", Princeton Series in Physics (Princeton University Press, 1985). 37. R. Penrose, "Afterword", in The Geometric Universe, Science, Geometry, and the Work of Roger Penrose, S. A. Huggett et al. eds. (Oxford University Press, 1998).

THREE-DIMENSIONAL WAVE BEHAVIOUR OF LIGHT FABRIZIO LOGIURATO BENIAMINO DANESE LUIGI M. GRATTON STEFANO OSS Department of Physics, University ofTrento, 38050 Povo Trento, Italy We describe a simple experimental apparatus which allows one to observe the wave properties of light in a new way. This apparatus makes it possible to introduce and illustrate, in a very suggestive way, some fundamental principles of quantum theory.

1

Introduction

Quantum theory is introduced in many books by means of an example widely recognized as paradigmatic: the double-slit experiment (see, e.g., Refs. 1-4). Light, after travelling and behaving as a wave, manifests itself on the detection screen as a stream of corpuscles. According to Feynman, it is absolutely impossible to explain this phenomenon in any classical way. In his opinion this is the "heart" of quantum mechanics, "in reality, it contains the only mystery" [I]. However, in experiments emphasizing the wave nature of light, diffraction and interference patterns are shown only in the last part of the light path. What is going on in the space between the slits and the detection screen is only sketched in the figures. We developed a simple apparatus where diffraction and interference patterns are not only observed at the final screen position as in traditional experiments, but also in a three-dimensional environment [5, 6]. In this paper we give a few examples of how our apparatus may be used to illustrate the wave properties of light.

2

Experimental setup and results

Many simple techniques have been adopted in the past to visualize light rays. For instance, light diffusion from chalk powder or from smoke. The technique we adopt here is based on light diffusion from water droplets produced by an ultrasonic mist-makera immersed in water. Vibrations at ultrasonic frequencies a

Further informations about mist-makers may be found, for instance, at: http://www.phvslink.com/estore/cart/UltrasonicMistMaker.cfm 359

360

of a ceramic electrode inside the mist-maker generate ultrasounds that break the surface of the liquid and nebulize the water. This technique produces a continuous and homogeneous fog which allows the formation through its whole volume of very stable luminous patterns. To minimize turbulences and to assure high homogeneity of the fog along the light path, the mist-maker is placed in a box with transparent walls, such as an aquarium. A black piece of fabric covers the walls of the box through which no vision takes place, to avoid disturbing reflections. As coherent light source we use a 10 mW HeNe laser. The laser wave length is X = 0.6828 p n . The slits belong to Pasco optical kit OS-9165. The photographs are taken with a digital D70 reflex camera. The equivalent sensitivity is set to 200 ISO. Exposure times ranges from 1/30* to Vi sec, with various f/values. In Figure 1 we see various images. In each of them HeNe laser light impinges on a single slit, and the slits in different images have different widths. The dependence of the extent of the diffracted beam on the width of the slit is clear: the narrower the slit the broader the intense (0th order) central beam.

Figure 1. The experiments of diffraction from a single slit. The images correspond to slits of decreasing width (from left toright):80 flm, 40 \im, 20 um. We think that these images are a really beautiful illustration of the famous experiment of the single slit by which Heisenberg introduces the uncertainty relations [7, 8]. In fact, if we regard light as a stream of corpuscles (the photons), it follows that the narrower the slit, the higher the space localization of each photon in the light beam, and the larger the uncertainty of the momentum acquired by it.

361 In Figure 2 two further interesting images are compared. In the left one, the light beam impinges on a screen with two slits. The beam that passes through the screen forms the well-known interference fringes of the classic Young experiment in the space. This experiment provided the definitive demonstration of the existence of wave properties of light in 1802. In the resulting series of maxima and minima we can distinguish two patterns: the enveloping pattern due to the light diffraction through each slit and, inside the envelope, the interference pattern of the light coming from the two slits [9]. We may compare the Young experiment of the double slit with the diffraction from the single slit (right image). In the latter, the slit has the same width as each slit in the Young experiment. It can be noted immediately that the interference pattern from two slits is not the sum of two diffraction patterns from single slits. This phenomenon cannot be explained if one adopts only the classic corpuscular model of light. Hence, the photos in Figure 2 support effectively the undulatory counterpart of Feynman's two-slits experiment, by which this author introduces the wave-corpuscle dualism and Bohr's complementarity principle [1].

Figure 2. Left: Young experiment of two slits. The width of the slits is 40 urn, and their separation is 125 urn. Right: diffraction from a single slit of the same width.

362

References 1. 2. 3. 4. 5. 6. 7. 8. 9.

R. P. Feynman, R. B. Leighton and M. Sands, The Feynman Lectures on Physics, Vol. 3 (Addison-Wesley, Reading, MA, 1989). T. Hey and P. Walters, The New Quantum Universe (Cambridge University Press, Cambridge, 2003). E. H. Wichmann, Berkeley Physics Course, Vol. 4, Quantum Physics (Wiley, New York, 1971). B. D'Espagnat, Conceptual Foundations of Quantum Mechanics (W. A. Benjamin, Reading, MA, 1976). F. Logiurato, L. M. Gratton and S. Oss, The Physics Teacher, in print (2005). F. Logiurato, L. M. Gratton and S. Oss, submitted to Physics Education (2005). W. Heisenberg, The Physical Principles of the Quantum Theory (University of Chicago Press, Chicago, 1930). M. Alonso and E. J. Finn, Fundamental University Physics, Vol. 3 (Addison-Wesley, Reading, MA, 1972). F. A. Jenkins and H. E. White, Fundamentals of Optics (Mc Graw-Hill, New York, 1957).

This volume provides a unique overview of recent Italian studies on the foundations of quantum mechanics and related historical, philosophical and epistemological topics. A gathering of scholars from diverse cultural backgrounds, the conference provided a forum for a fascinating exchange of ideas and perspectives on a range of open questions in quantum mechanics. The varied nature of the papers in this volume attests to the achievement of that aim with many contributions providing original solutions to established problems by taking into account recommendations from different disciplines.

The Foundations of Quantum Mechanics

ISBN 981-256-852-2

www.worldscientific.com

The Foundations of Quantum Mechanics: Historical Analysis And Open Questions, Cesena 2004

Read more

The foundations of quantum mechanics. Historical analysis and open questions - Cesena 2004

Read more

Scientific american (October 2004)

Read more

php|architect (October 2004)

Read more

Foundations Of Quantum Mechanics

Read more

Foundations of Quantum Mechanics

Read more

Foundations of quantum mechanics

Read more

«Если», 2004 № 9

Read more

«Если», 2004 № 9

Read more

«Если», 2004 № 9

Read more

Новый Мир ( № 9 2004)

Read more

Sound on Sound (October 2004)

Read more

Greg Egan - Foundations 4 - Quantum Mechanics

Read more

Egan, Greg - Foundations 4 - Quantum Mechanics

Read more

Новый Мир ( № 4 2004)

Read more

Mathematical Foundations of Quantum Mechanics

Read more

Conceptual foundations of quantum mechanics

Read more

Philosophic Foundations of Quantum Mechanics

Read more

Discovery Science: 7th International Conference, DS 2004, Padova, Italy, October 2-5, 2004. Proceedings

Read more

Mathematical Foundations of Quantum Mechanics

Read more

Philosophic foundations of quantum mechanics

Read more

Mathematical Foundations of Quantum Mechanics

Read more

Mathematical Foundations of Quantum Mechanics

Read more

Philosophic Foundations of Quantum Mechanics

Read more

Mathematical Foundations of Quantum Mechanics

Read more

Philosophic Foundations of Quantum Mechanics

Read more

2004)

Read more

Comparative Genomics: RECOMB 2004 International Workshop, RCG 2004, Bertinoro, Italy, October 16-19, 2004, Revised Selected Papers

Read more

Algorithmic Learning Theory: 15th International Conference, ALT 2004, Padova, Italy, October 2-5, 2004. Proceedings

Read more

Algorithmic Learning Theory: 15th International Conference, ALT 2004, Padova, Italy, October 2-5, 2004. Proceedings

Read more

Recommend Documents

The Foundations of Quantum Mechanics: Historical Analysis And Open Questions, Cesena 2004

The Foundations of Quantum Mechanics Historical Analysis and Open Questions - Cesena 2004 editors Claudio Garola • Arca...

The foundations of quantum mechanics. Historical analysis and open questions - Cesena 2004

Scientific american (October 2004)

SPACE DISK DYNAMICS • LET THE INTERNET RUN EVERYTHING Genetic Junk and the Secrets of Complexity OCTOBER 2004 WWW.SCIAM...

php|architect (October 2004)

OCTOBER 2004 VOLUME III - ISSUE 10 TM www.phparch.com The Magazine For PHP Professionals This copy is registered to...

Foundations Of Quantum Mechanics

Foundations of Quantum Mechanics Dr. H. Osborn1 Michælmas 1997 1 A LT EXed by Paul Metcalfe – comments and corrections...

Foundations of Quantum Mechanics

FOUNDATIONS OF QUANTUM MECHANICS JOSEF M. JAUCH University of Geneva, Switzerland A 'V ADDISON-WESLEY PUBLISHING COMP...

Foundations of quantum mechanics

...

«Если», 2004 № 9

«Если», 2004 № 9

«Если», 2004 № 9