ADVANCES IN IMAGING AND ELECTRON PHYSICS VOLUME 109
EDITOR-IN-CHIEF
PETER W. HAWKES CEMES/L,uboratoire d’ Optique Electronique du Centre Natiord de la Recherche Scientifique Toulouse, France
ASSOCIATE EDITORS
BENJAMIN KAZAN Xerox Corporation Palo Alto Research Center Palo Alto, California
TOM MULVEY Department of Electronic Engineering and Applied Physics Aston University Birmingham, United Kingdonz
Advances in
Imaging and Electron Physics EDITED B Y
PETER W. HAWKES CEMES/Lahorutoirc d’Optique Electronique du Centre Nutiond cle la Recherche Scientifique Toirlouse, France
VOLUME 109
@
ACADEMIC PRESS A Harcourt Science and Technology Company
San Diego
San Francisco New York London Sydney Tokyo
Boston
This book is printed on acid-free paper. @ Copyright 0 1999 by Academic Press All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. The appearance of the code at the bottom of the first page of a chapter in this book indicates the Publisher’s consent that copies of the chapter may be made for personal or internal use of specific clients. This consent is given on the condition, however, that the copier pay the stated per-copy fee through the Copyright Clearance Center, Inc. (222 Rosewood Drive, Danvers, Massachusetts 01923), for copying beyond that permitted by Sections 107 or 108 of the U.S. Copyright Law. This consent does not extend to other kinds of copying, such as copying for general distribution, for advertising or promotional purposes, for creating new collective works, or for resale. Copy fees for pre-1998 chapters are as shown on the title pages: if no fee code appears on the title page, the copy fee is the same as for current chapters. 1076-5670/99 $30.00 ACADEMIC PRESS A Harcourt Science and Technology Company 525 B Street, 1900, San Diego, California 92101-4495, USA http://wuw.apnet.com Academic Press 24-28 Oval Road, London NW1 7DX, UK http://www.hbuk.co.uWap/ International Standard Serial Number: 1076-5670 International Standard Book Number: 0-12-014751-3 Typeset by Laser Words, Madras, India Printed in the United States of America 9 9 0 0 0 1 0 2 0 3 B B 9 8 7 6 5 4 3 2 1
CONTENTS
CONTRIBUTORS . . . . . . . . . . . . . . . . . . . . . . . PKEFACE. . . . . . . . . . . . . . . . . . . . . . . . . FORTHCOMING CONIRIHUTOKS . . . . . . . . . . . . . . . . . . .
ix xi ...
Xlll
Development and Applications of a New Deep Level Transient Spectroscopy Method and New Averaging Techniques P L A M ~V.~ N KoI.~.vA N I I M. JAMAL. DEEN I. Introduction . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
2
11. Review of the Deep-Level Transient Spectroscopy Method . . . . .
6
I l l . Averaging and Recording of Digital DLTS Transient Signals . . . . IV. Feedback Circuits and Experimental Setup for CC-DLTS and CR-DLTS V. Constant-Resistance DLTS in Enhancement Mode MOSFETs . . . , nce DLTS i n Depletion Mode MOSFETs . . . . . VII. Constant-Resistance DLTS in Junction Field-Effect Transistors . . . VIII. Conclusions and Areas for Future Research . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . Appendix A: Magnitude Errors . . . . . . . . . . . . . . . Appendix B: Time Constant Errors . . . . . . . . . . . . . Appendix C: Noise Sources and Signal-to-Noise Ratio (SNR) in the DLTS Transients . . . . . . . . . . . . . . . . . . . . Appendix D: Electrical Circuit of the Pseudo-Logarithmic Generator . Appendix E: Electrical Circuits of the Feedback Circuit . . . . . . Appendix F: Listing of a Template for a DLTS Measurement Program Appendix G: Listing of a Template for a DLTS Analysis Program . . Appendix H: Radiation-Induced Defecls in Silicon . . . . . . . . List of Acronyms , . . . . . . . . . . . . . . . . . . . List of Symbols . . . . . . . . . . . . . . . . . . . .
39 61
72 83 96 1 I9 122 135 140 141 145 146 148 Is2 155 156 158
Complex Dyadic Multiresolution Analyses J:M. LINA, P. T~JRCOTTL A N 0 B. GO[ll A K D I. Introduction . . . . . . . . . . . . . . . . . . . . . . 11. The Spline Example . . . . . . . . . . . . . . . . . . . 111. Multiresolution and Wavelet . . . . . . . . . . . . . . . . V
163 164 I67
vi
CONTENTS
IV . Daubechies' Wavelets . . . . . . . . . . . . . . . . . .
i70
V . Symmetric Daubechies Wavelets . . . . . . . . . . . . . .
175
VI . The Phase of SDW Scaling Function . . . . . . . . . . . . . VII . The Mallat Algorithm with Complex Filters . . . . . . . . . .
176 178
VIII . Restoration from the Phase . . . . . . . . . . . . . . . .
180
1X. Image Enhancement . . . . . . . . . . . . . . . . . . . X . Complex Shrinkage . . . . . . . . . . . . . . . . . . .
183 187
XI . Conclusion . . . . . . . . . . . . . . . . . . . . . .
193
References . . . . . . . . . . . . . . . . . . . . . .
196
. . . . . . . . . . . . . . . . . . . . .
197
Reading List
Lattice Vector Quantization for Wavelet-Based Image Coding MlKHAlL SHNAIDLR A N D A N D R ~P W . PAPIJNSKI I. I1 . 111. IV .
Introduction . . . . . . . . . . . . . . . . . . . . . . Quantiation of Wavelet Coefficients . . . . . . . . . . . . . Lattice Quantization Fundamentals . . . . . . . . . . . . . Lattices . . . . . . . . . . . . . . . . . . . . . . .
V . Quantization Algorithms for Selected Lattices . . . . . . . . . VI . Counting the Lattice Points . . . . . . . . . . . . . . . . V11. Scaling Algorithm
. . . . . . . . . . . . . . . . . . .
VIII . Selecting a Lattice for Quantization . . . . . . . . . . . . .
IX . Entropy Coding of Lattice Vectors . . . . . . . . . . . . . . X . Experimental Results
. . . . . . . . . . . . . XI . Conclusions . . . . . . . . . . . . . . . . . . Appendix A: Cartan Matrices of Some Root Systems . . References . . . . . . . . . . . . . . . . . .
. . . . . . . .
. . . . . . . . . .
200 201 213
218 229 233 242 244 250 254 258 260 261
I~uzzyCellular Neural Networks and Their Applications to Image Processing TAUYANG I . Introduction . . . . . . . . . . . . . . . . . . . . . . 11. Fuzzy Cellular Neural Networks
. . . . . . . . . . . . . . 111. Theory of Fuzzy Cellular Neural Networks . . . . . . . . . . IV . FCNN as Computational Arrays . . . . . . . . . . . . . .
266 275 290 327
CONTENTS
Vii
V . Embed Linguistic Statements into FCNN . . . . . . . . . . . VI . Learning Algorithms of FCNN . . . . . . VII . Generic Algorithm for FCNN . . . . . . VIII . Applications of Discrete-Time FCNN . . . 1X . Conclusions and Future Work . . . . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
360 37.5 401 407 420
References . . . . . . . . . . . . . . . . . . . . . .
427
.
447
Index . . . . . . . . .
This Page Intentionally Left Blank
CONTRIBUTORS
Numbers in parentheses indicate the pages on which the author’s contribution begins
M. JAMAL DEEN( l ) , School of Engineering Science, Simon Fraser University, Vancouver, British Columbia, Canada V5A 1 S6; Department of Electrical and Computer Engineering, McMaster University, Hamilton, Ontario, Canada L8S 4K1. BEKNARD GOULARD (163), Network for Computing and Mathematical Modeling, Centre de Recherches Mathematiques, Univ. de Montreal C.P. 6 128 Succ. Centre-Ville, Montreal, Quebec H3C 357, Canada PLAMEN Ko1.w ( I ) , School of Engineering Science, Simon Fraser University, Vancouver, British Columbia, Canada V5A I S6 JEAN-MARC LINA(l63), Network for Computing and Mathematical Modeling, Centre de Recherches Mathematiques, Univ. de Montreal C.P. 6128 Succ. Centre-Ville, Montreal, Quebec H3C 357, Canada ANDREW P. PAPLINSKI (199), School of Computer Science and Software Engineering, Monash University, Australia
MIKHAIL SHNAIDER ( I99), Motorola Australian Research Centre PAu TURCOTTE ( 163), Network for Computing and Mathematical Modeling, Centre de Recherches Mathematiques, Univ. de Montreal C.P. 6128 Succ. Centre-Ville, Montreal, Quebec H3C 357, Canada
TAOYANG(265), Electronics Research Laboratory and Department of Electrical Engineering and Computer Sciences, University of California at Berkeley, Berkeley, CA 94720, U.S.A.
ix
This Page Intentionally Left Blank
PREFACE The themes of the four substantial contributions to this volume are all new to the series, though some aspects of image coding, which is the subject of the third chapter, have been examined here in the past. We begin with an account of a new method of deep-level transient spectroscopy and a related averaging technique developed by P.V. Kolev and M.J.Deen. These are extremely important for the design of semiconductor devices, the properties of which are influenced by deep-seated imperfections in the crystal structure-impurities, lattice defects and interactions between these in particular. Deep-level transient spectrosopy is a sensitive method for investigating these deep-lying defects. In the past, it has not been universally accepted but the authors have made significant progress and they argue convincingly that results obtained by this technique are reliable. Their chapter, which is long enough to be regarded as a monograph on the subject, covers the subject very fully, starting with a review of the technique and continuing with a detailed description of signal processing and the new approach. It is then applied to enhancement-mode and depletion-mode MOSFETs and to junction field-effect transistors. Specialized topics are examined in eight appendixes. The next two contributions are concerned with different aspects of wavelets. The first, by J.-M. Lina, P. Turcotte, and B. Goulard, considers certain questions of the highest interest, both practical and intellectual: What use are complex wavelet.. and what is the role of symmetry? Wavelets have been known and studied (though not under that name) for more than a century, but for many decades they were little more than a mathematical curiosity; the earlier books on image processing pointed out the attractive features of the Haar functions, for example, but they were rarely used in practice. With the work of Ingrid Daubechies in particular, the utility of a whole class of such functions was finally recognized and this chapter records and explains an extension to the theory, in which compactness of the support, orthogonality and symmetry are rendered compatible by the introduction of complex-valued scaling functions. The role of the phase thereby introduced is examined carefully. This is a very fascinating development and I am delighted that the authors agreed to explain it in the pages of AIEP. The second contribution on wavelets, by M. Shnaider and A.P. Paplihski, deals with the use of wavelets in vector coding. This type of coding, in which each codeword corresponds to a set of signal values (or image grey-levels) and not to a single value or grey-level), suffers from the rapid growth of the code-book as the size of the set increases. Such coding is, however, extremely
xi
xii
PREFACE
efficient and a considerable amount of research continues to be devoted to finding ways of circumventing this handicap. Lattice theory is a very promising approach, and in this chapter the authors present very fully the necessary background knowledge and show that the combination of lattice quantization and wavelets is indeed highly effective. The final contribution, which is again long enough to be regarded as a monograph on its subject, describes fuzzy cellular neural networks and their use in image processing. The controlled imprecision of fuzzy set theory, even if some probability theorists deny the need for a new terminology, has generated much new thinking, in control engineering for example, and also in signal and image processing. Here, the theory of fuzzy cellular neural networks is presented in great detail. The connection between such ideas and mathematical morphology is explored and applications in image processing are presented. This very full account of a fairly new aspect of image processing will, I hope, be widely used and in so fast-moving a research area, doubtless need to be complemeted by a new contribution on the subject in a few years time. As usual, I thank the authors very sincerely, on behalf of all readers of these volumes, for agreeing to present their ideas in a way that makes them accessible to a wide range of non-specialists and I conclude with a list of chapters promised for future volumes. Peter W. Hawkes
FORTHCOMING CONTRlIlUTlONS
L. Alvarez Leon and J. -M. Morel (vol. 1 1 I ) Mathematical models for natural images D. Antzoulatos Use of the hypermatrix W. Bacsa (vol. 110) Interference scanning optical probe microscopy
N. D. Black, R. Millar, M. Kunt, F. Ziliani and M. Reid Second generation image coding N. Bonnet Artificial intelligence and pattern recognition in microscope image processing G. Borgefors Distance transforms A. van den Bos and A. Dekker Resolution 0. Bostanjoglo (vol. 110) High-speed electron microscopy S. Boussakta and A. G. J. Holt (vol. 1 1 I ) Number-theoretic transforms and image processing P. G. Casazza Frames
J. A. Dayton Microwave tubes in space
E. R. Dougherty and D. Sinha Fuzzy morphology J. M. H. Du Buf Gabor filters and texture analysis
R. G. Forbes Liquid metal ion sources
E. Forster and F. N. Chukhovsky X-ray optics ...
Xlll
xiv
FORTHCOMING CONTRIBUTIONS
A. Fox The critical-voltage effect
M. J. Fransen (vol. 1 1 1) The Z r O N Schottky emitter M. Gabbouj Stack filtering A. Gasteratos and I. Andreadis (vol. 110) Soft morphology W. C. Henneberger (vol. 112) The Aharonov-Bohm effect
M. I. Herrera and L. Bru The development of electron microscopy in Spain K. Ishizuka Contrast transfer and crystal images C. Jeffries Conservation laws in electromagnetics M. Jourlin and J. -C. Pinoli Logarithmic image processing E. Kasper Numerical methods in particle optics A. Khursheed Scanning electron microscope design
G. Kogel Positron microscopy K. Koike Spin-polarized SEM
W. Krakow Sideband imaging A. van de Laak-Tijssen, E. Coets and T. Mulvey Memoir of J. B. Le Poole
L. J. Latecki Well-composed sets C. Mattiussi The finite volume, finite element and finite difference methods
FORTHCOMING CONTRIBUTIONS
S. Mikoshiba and F. L. Curzon Plasma displays
R. L. Morris Electronic tools in parapsychology J. G. Nagy Restoration of images with space-variant blur
P. D. Nellist and S. J. Pennycook Z-contrast in the STEM and its applications
M. A. O’Keefe Electron image simulation
G. Nemes Phase-space treatment of photon beams B. Olstad Representation of image operators M. Omote and S. Sakoda (vol. 110) Aharonov-Bohm scattering
C. Passow Geometric methods of treating energy transport phenomena E. Petajan HDTV
F. A. Ponce Nitride semiconductors for high-brightness blue and green light emission
J. W. Rabalais Scattering and recoil imaging and spectrometry
H. Rauch The wave-particle dualism D. Saldin Electron holography G. E. Sarty (vol. 1 11) Reconstruction from non-Cartesian grids G. Schmahl X-ray microscopy J. P. F. Sellschop Accelerator mass spectroscopy
xv
xvi
FORTHCOMING CONTRIBUTIONS
S. Shirai CRT gun design methods
T. Soma Focus-deflection systems and their applications
I. Talmon Study of complex fluids by transmission electron microscopy S. Tari (vol. 1 1 I ) Shape skeletons and greyscale images J. Toulouse New developments in ferroelectrics
T. Tsutsui and Z. Dechun Organic electroluminescence, materials and devices
Y. Uchikawa Electron gun optics
D. van Dyck Very high resolution electron microscopy J. S. Villarrubia Mathematical morphology and scanned probe microscopy L. Vincent Morphology on graphs
N. White Multi-photon microscopy J. B. Wilburn Generaked ranked-order filters
C. D. Wright and E. W. Hill Magnetic force microscopy
ADVANCES IN IMAGING AND ELECTRON PHYSICS VOLUME 109
This Page Intentionally Left Blank
.
ADVANCES IN IMAGING AND ELECTRON PHYSICS VOI . 109
Development and Applications of a New Deep Level Transient Spectroscopy Method and New Averaging Techniques PLAMEN V . KOLEV AND M . JAMAL DEEN
.
School of’ Etigineering Scic.nc.e Sitnon Frciser Univur.~itv.Witicouv~.r. British CCJ/Mtllhlri. ccinrrdei V5A 1% Dc.prirtrneni of’ Electriid cind Cornlnrter Engineering. McMcisfer Uniwrsiry. Haniilrori. Ontririo. Crinrrcki LKS 4 K I
I . Introduction . . . . . . . . . . . . . . . . . . . . . . . A . Importance of Impurity Characterization . . . . . . . . . . . . B . Deep-Level Transient Spectroscopy . . . . . . . . . . . . . . C . Goals of This Article . . . . . . . . . . . . . . . . . . . D . Organization of the Ailicle . . . . . . . . . . . . . . . . . I1. Review of the Deep-Level Transient Spectroscopy Method . . . . . . . A . Introduction . . . . . . . . . . . . . . . . . . . . . . €3. Defects, Impurities and Energy Levels . . . . . . . . . . . . . C . Generation-Recoinbination Statistics . . . . . . . . . . . . . . D . Detection of the Emission of the Trapped Charge . . . . . . . . . E . Determination of the Deep Level Parameters . . . . . . . . . . . F . Conventional Deep-Level Transient Spectroscopy (DLTS) . . . . . . . G . Main Stages in the DLTS Experiment . . . . . . . . . . . . . H . A New Classification Scheme for DLTS Methods . . . . . . . . . 1. Other Methods and Comparisons with DLTS . . . . . . . . . . . 111. Averaging and Recording of Digital DLTS Transient Signals . . . . . . . A . Introduction . . . . . . . . . . . . . . . . . . . . . . B . Technical Overview of the DLTS Experiment . . . . . . . . . . . C . Pseudo-Logarithmic Averaging . . . . . . . . . . . . . . . . D . Continuous Time Averaging . . . . . . . . . . . . . . . . E. Applications . . . . . . . . . . . . . . . . . . . . . . F . Conclusions . . . . . . . . . . . . . . . . . . . . . . IV . Feedback Circuits and Experimental Setup for CC-DLTS and CR-DLTS . . . A . Introduction . . . . . . . . . . . . . . . . . . . . . . B . Feedback Circuit and Details on the Setup for CC-DLTS . . . . . . . C . Experimental Setup for CR-DLTS . . . . . . . . . . . . . . D . Illustrations . . . . . . . . . . . . . . . . . . . . . . E . Conclusions . . . . . . . . . . . . . . . . . . . . . . nce DLTS in Enhancement Mode MOSFETs . . . . . . . A . Introduction . . . . . . . . . . . . . . . . . . . . . . B . Overview of the DLTS Techniques Applied to FETs . . . . . . . .
2 2 3 4 5 6 6
7 II IS
21 24 25 34 35 39 39 39 43 52 55 60 61 61 63 65 61
72 72 72 73
1 Viilurnc 109 ISBN 0-1? ~ 0 1 4 7 5 1 ~ 3
ADVANCES I N IMAGING AND IiLECTIION I’HYSICS Copyright 0 1999 h y Acadcinic Prc% All right* of rcproductioii ti1 m y t o w reaervcd. ISSN 1117h-Sh70/99 $ 3 0 00
2
PLAMEN V . KOLEV AND M . JAMAL DEEN
C . Theoretical Background . . . . . . . . . . . . . . . . . . D . Experimental Results . . . . . . . . . . . . . . . . . . . E . Conclusions . . . . . . . . . . . . . . . . . . . . . . VI . Constant-Resistance DLTS in Depletion Mode MOSFETs . . . . . . . . A . Introduction . . . . . . . . . . . . . . . . . . . . . . B . Theoretical Background . . . . . . . . . . . . . . . . . . C . Experimental Results and Discussions . . . . . . . . . . . . . D . Conclusions . . . . . . . . . . . . . . . . . . . . . . nce DLTS in Junction Field-Effect Transistors . . . . . . A . Introduction . . . . . . . . . . . . . . . . . . . . . . B. Theoretical Background . . . . . . . . . . . . . . . . . . C . Experimental Results and Discussion . . . . . . . . . . . . . D . Conclusions . . . . . . . . . . . . . . . . . . . . . . VIII . Conclusions and Areas for Future Research . . . . . . . . . . . . . A . Conclusions . . . . . . . . . . . . . . . . . . . . . . B . Areas lor Future Research . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . Appendix A: Magnitude Errors . . . . . . . . . . . . . . . . . Appendix B: Time Constant Errors . . . . . . . . . . . . . . . Appendix C: Noise Sources and Signal-to-Noise Ratio (SNR) in the DLTS Transients . . . . . . . . . . . . . . . . . . . . . . . . C . I . Noise Sources . . . . . . . . . . . . . . . . . . . C.2. Noise Sources in CT-. CC-. and CR-DLTS Transients . . . . . . C.3. The Role of the Averaging Techniques for SNR Improvement . . . Appendix D: Electrical Circuit of the Pseudo-Logarithmic Generator . . . . Appendix E: Electrical Circuits of the Feedback Circuit . . . . . . . . Appendix F: Listing of a Template for a DLTS Measurement Program . . . . Appendix G: Listing of a Template for a DLTS Analysis Program . . . . . Appendix H: Kadiation-Induced Defects i n Silicon . . . . . . . . . . List of Acronyms . . . . . . . . . . . . . . . . . . . . . List of Symbols . . . . . . . . . . . . . . . . . . . . . .
75 71 83 83 83 84 87 95 96 96 97 99 118
119 119 120
122 135 140 141
141 143 144 145 146 148 153155 156 158
I . INTRODUCTION A . Importance of Inipurity Cliuructerization
The rapid advances in semiconductor technology during the last few decades are closely related to the success in achieving significant increase in semiconductor material purity (Sze. 1983)'. The ability to detect and measure the properties of a very small amount of impurity atoms of structural defects in the semiconductor material and in the active regions of semiconductor devices I
References are listed at the end of this article i n alphabetical order .
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
3
is of fundamental importance for this progress. The electrical properties of these impurities or defects are of particular interest for both the performance (Kwan and Deen, 1998; Raychaudhuri er al., 1996a, b; Kwan et al., 1997) and reliability (Kwan et al., 1996; Raychaudhuri et d., I996b; Deen and Raychaudhuri, 1994; Deen and Quon, 1991) of semiconductor devices (Graff, 1995; Pantelides, 1992; Deen and Zhu, 1993e; Sze, 1983, Zhu et al, 1992). Shallow impurities in semiconductors generally contribute extra free carriers, electrons, or holes. By intentionally incorporating shallow impurities in the semiconductor material, the type and magnitude of the conductivity of the material is controlled. The properties of the shallow impurities related to the host semiconductor material are considered to be well understood. Still, even for the best-known semiconductor, silicon, some details of the interaction of the shallow impurities with the host atoms were only recently found (Karasyuk et a/., 1994). Other imperfections in the crystal structure, such as other impurity atoms, lattice point defects and impurity-defect complexes, are referred to as deep centers. Their role is primarily to control the generation, recombination, and lifetime of the current carriers. Despite the significant progress in the last two decades, deep centers have proven to be far more difficult to investigate than shallow impurities. In many cases, the physical nature of the center causing the appearance of a deep level is poorly understood or unknown (Pantelides, 1992).
B. DeepLevel Trmsient Spectroscopy
Deep-level transient spectroscopy (DLTS) (Lang, 1974a) is a well-established research technique for characterization of electrically active centers deep in the semiconductor bandgap (Blood and Orton, 1992; Schroder, 1990). It is known for its high sensitivity and direct relation to the measured properties of the defects. In the last two decades, many variations of the method were developed and adapted for studying the defect properties of a variety of materials and devices. Still, new modifications and further improvements of the already existing DLTS techniques continue to be reported (Anand et al., 1992; Bosetti et al., 1995; Chretien e r a / . , 1995; Doolittle and Rohatgi, 1992; Hacke and Okushi, 1997; Istratov and Vyvenko, 1995; Lossen et a/., 1996; Martin, 1995; Ozder et al., 1996; Rancour, 1995; Shaban, 1996). Despite the large variety of modifications, the method is not yet accepted as a standard characterization technique in the semiconductor industry. There are many reasons for this limited acceptance. First, in order to extract the properties of the traps, there is need for a wide variation in sample temperature. Second, a standard describing the settings and parameters of the measurement
4
PLAMEN V . KOLEV AND M. JAMAL DEEN
instrumentation has not been established. Third, the wide variety of test structures and the dependence of'the signal magnitude on the size of the test device prevents the establishment of a standardized approach that is convenient for industrial applications. With the continued scaling-down in device dimensions in semiconductor integrated circuits and the emergence of new technologies, such as SiGe heterojunction bipolar technology, silicon-on-insulator (Sol), porous silicon, thin-film transistors, or copper metallization for VLSl and ULSI, the importance of accurate measurement and control of the defects that introduce deep levels will progressively increase. Therefore, steps toward further improvement and refinement of the DLTS method and instrumentation have important practical applications.
C'. Goals of This Chapter This chapter first describes new and improved digital techniques for transient data processing that offer better sensitivity and more effective data storage. This approach is applicable for all variations of DLTS and can be easily adapted to other experiments involving recording and analysis of transient signals. Also, it opens up opportunities for further development of the group of isothermal DLTS techniques that rely on the analysis of the transient decay for extraction of the characteristic time constant and not on the thermal scan. Second, a new feedback circuit is described that allows for fast and sensitive operation in one very attractive and technically challenging variation of the method-constant capacitance (CC-)DLTS. It is important to note that this variation produces a signal with a magnitude that is independent of the area of the device and, therefore, the method is better suited for routine parametric control in the semiconductor industry. Third, a new technique, termed constant-resistance (CR-)DLTS is presented. This new technique is well-suited for measurements of field-effect transistors (FETs) regardless of their size and without compromise to the sensitivity of the measurement. Unlike CC-DLTS, because the sensitivity is dependent of the gain of the transistor (thus. of the channel width-to-length ratio and not of the active area), it allows for sensitive measurement of very small, deep-submicron devices, which are the basic transistors in the advanced microelectronics circuits and systems. For corroboration of this technique, the results are compared to those obtained from conventional DLTS and CC-DLTS measurements. While the method has been applied to three different types of field-effect transistors, it can be easily used for a wider range of FETs. Illustrations are made with measurement of proton- and neutron-induced damages in metal-oxidesemiconductor (MOS) FETs and silicon and germanium junction FETs. The possibilities for measurement of interface trap density in the active interface
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
5
of regular MOSFETs and for defect profiling using the new technique are also demonstrated. In this chapter the emphasis is on the development of semiconductor metrology and instrumentation using mixed analog, digital and software engineering. The experimental results are used mainly for illustration of the new techniques and are not self-contained and complete studies. D. Orgcrnizution of' the Chupter
In Sect. 11, the basics of the DLTS method are introduced and various techniques and instrumentation are described. The existing large variety of DLTS makes any attempt at classification a very complex task. Nevertheless, an attempt to define the criteria that can be used for classification of the DLTS techniques is made. Following these criteria, a classification scheme is demonstrated with examples using well-known and less popular DLTS techniques. One potential benefit of this classification is the identification of techniques and conditions that allows the reader to quickly tailor the setup to fit the specific properties of the device or material under investigation. In Sect. 111, two complementary digital signal processing techniques that are well suited for DLTS applications are presented. These techniques are new in the processing of digitized DLTS transients and can be used in virtually any DLTS experiment. Furthermore, their application can be easily extended to the data processing in other physical experiments that produce a transient signal. A mathematical model is presented and error analysis is made of pseudologarithmic averaging, a technique which is less well-known in digital signal processing. Because of the substantial increase of signal-to-noise ratio (SNR) and efficient data reduction, these techniques may increase interest in the isothermal DLTS techniques (Akita et al., 1993; Kim et al., 1993; Kiyota et al., 1992; Okushi and Tokumaru, 1980, 1981; Yoshida et al., 1993). The latest developments in the feedback circuit used for CC-DLTS are presented in Sect. IV. This improved feedback circuit is essential for the successful implementation of the new CR-DLTS technique. The speed of the feedback is demonstrated by comparison of recorded traces from standard capacitance-transient DLTS, CC- and CR-DLTS. Guidelines are given for design of the feedback circuit and its setup during CC- and CR-DLTS measurements. The sensitivity achieved using the feedback circuit is demonstrated with measurements of interface-trap density of submicron MOSFETs. A description of the new CR-DLTS technique is given in Sect. V. This new technique is similar to the conductance DLTS, but it is more sensitive and does not require simultaneous measurement of the transconductance g,,,, or surface mobility ps of the transistor for calculation of the trap concentrations. An important advantage is that the DLTS signal is independent of the transistor
6
PLAMEN V. KOLEV AND M. JAMAL DEEN
size, which allows for measurements of very small transistors. In this chapter, the technique is demonstrated with measurements of submicron enhancementmode MOSFETs. The body effect on CR-DLTS is demonstrated and CR-DLTS is compared with CC-DLTS by using a multi-transistor structure containing 400 transistors connected in parallel. In Sect. VI, the new CR-DLTS technique is demonstrated with measurements of radiation-induced traps in buried channel MOSFETs, which are used as CCD output amplifiers. These devices exhibit a unique structure that offers extended opportunities for studying the spatial distribution of the defects. In addition to the normal front-gate mode of operation, the back-gate mode of operation is demonstrated, and this mode is applicable for studying the channel-substrate p-n junction. The results are compared with CC-DLTS data. Complementary measurements using front-gate and back-gate operation of CR-DLTS can be useful to resolve the difficulties in the analysis of DLTS data from structures with symmetrical p-iz junctions. In Sect. VII of this article, the CR-DLTS technique is applied to study virgin and fast neutron irradiated junction field-effect transistors (JFETs). The technique is demonstrated with measurements of three groups of devices: commercially available discrete silicon JFETs; virgin and high-level neutron-irradiated silicon JFETs made by a specific monolithic technology (Citterio et ul., 1995); and commercially available discrete germanium ychannel JFETs. The CR-DLTS is demonstrated to be a very sensitive and area independent technique applicable for measurement of a wide range of deep level defect concentrations. Comparisons are made with the CC-DLTS and standard capacitance DLTS. Possibilities for defect profiling in the channel are also demonstrated. Section VIII summarizes the accomplishments described in the chapter and recommends directions for future work. It also gives references to follow in future research in this field and proposes some attractive applications of the developed system. At the end of the chapter, an extended list with references and appendixes, which include schematics of electronic blocks, program listings and mathematical transformations, is provided for readers.
11. REVIEW OF THE DEEP-LEVEL TRANSIENT SPECTROSCOPY METHOD A. Introduction
In this section, we review the basics of deep-level transient spectroscopy (DLTS), including methods for DLTS signal detection and data processing. First, the kinds of imperfections that can be studied with DLTS are described
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
7
and some important results of the generation-recombination statistics are reviewed. Then, several detection techniques, based on the effect of trapped charge on measurable electrical parameters, are presented. Next, the standard capacitance-transient DLTS method is introduced. This is followed by a general description of the main stages in DLTS experiments. This definition of the stages in DLTS helps the reader to understand better the options for design of an experiment and serves as the basis for a classification scheme. Section I1 concludes with a demonstration of how a given technique could be classified according to this new classification scheme; a few examples are given for illustration. B. Defects, Impurities and Energy Levels
1. Shallow Impurities The explosive growth of the electronic industry during the last few decades is based on a significant increase in accumulated experience in manipulating the properties of crystalline semiconductors. This manipulation is done by tight control of the purity and perfection of the crystal lattice and intentional incorporation of a t o m extrinsic to the host semiconductor material. Shallow impurities in semiconductors introduce minor perturbations in the lattice, creating bound states in the bandgap of the host material very close to the band edges. They generally contribute extra charge carriers -electron or holes. The primary role of intentionally incorporating shallow impurities in the semiconductor material is to control the type and magnitude of the conductivity in the material. The ionization energy E,!,is the amount of energy that the foreign atom needs to release a free current carrier in the host material. It is clear that for conductivity due primarily to the extrinsic atoms, this ionization energy must be less than kT ( k - Boltzmann constant; T - absolute temperature) or much smaller than the bandgap energy E , of the host material, which determines the amount of intrinsic free carriers at a given temperature. For example, if electron conduction is required, the extrinsic atom must “donate” an electron and therefore is called a donor. Similarly, if hole conduction is required, the extrinsic atom must “accept” an electron from the host material, thus creating a free hole, and the extrinsic atom is called an acceptor. In Table I are listed some experimentally determined (using thermal and optical methods) ionization energies of shallow impurities in silicon and germanium (Kohn, 1957). As Si and Ge are group IV on the periodic table, most donors are from group V, which substitute for the Si or Ge atoms and have one remaining electron in the upper valence shell from which an electron can be easily released to contribute to electron conduction. Similarly, the elements from group 111 are acceptors, as they require an extra electron to
8
PLAMEN V. KOLEV AND M. JAMAL DEEN TABLE 1 IONIZATION
ENERGIES OF SHALLOW
LMPURITIES IN
E,,, in Si l e v ] Impurity element
Donors Li P As Sb Bi Acceptors B Al Ga In
si AND Ge
E , / , in Ge [eVl
Thermal
Optical
0.033 0.044 0.049 0.039 0.069
0.045 0.053 0.043
0.0120 0.0127 0.0096
0.04s 0.057 0.065 0.16
0.046 0.067 0.071 0.154
0.0 104 0.0 I02 0.0108 0.01 12
-
Thermal
~
complete the upper valence shell and taking this electron from the lattice, they will produce a free hole. In general, the presence of shallow-level impurities is well understood: the potential in the crystal lattice is essentially Coulomb-like and the impurity states are very closely related to the states of the hydrogen atom (Milnes, 1973; Pantelides, 1992). The properties of the shallow impurities related to the host semiconductor material were essentially determined both theoretically (effective-mass theory) and experimentally, using mainly optical absorption techniques, by the end of the 1950s. However, even for the best-known semiconductor, silicon, details are still being revealed (Karasyuk, 1994). 2. Deep-Level Stotes: Electron, Hole Trups and Recombination Centers Other impurities and point lattice defects such as vacancies, self-interstitials, anti-site defects etc., induce more severe local perturbations of the potential in the crystal lattice, creating bond states more localized and deeper in the bandgap. Imperfections in the crystal structure, such as impurity atoms, lattice points defects and impurity-defect complexes, are referred to as deep centers (Jaros, 1982; Milnes, 1973; Pantelides, 1992; Sze, 1983). As opposed to shallow impurities, they act primarily as charge carrier traps or recombination centers and, thus, they control the lifetime of current carriers. When a center in the forbidden energy gap interacts mainly with electrons from the conduction band, it is defined as an electron trup; in the case in which the center interacts mainly with holes in the valence band, it is defined as a hole trup; in the case in which both types of carriers can interact with the center, it is called a recombination center (see Fig. 1 and Miller et ul., 1977; Sah et al., 1970; Schroder, 1990; Sze, 1983). Since the probability for electron or hole emission
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
9
FIGUREI . Illustration of relaxation processes; a) direct bmdgap recombination; b) relaxation pmcesses involving a deep level.
is exponentially dependent on the energy separation to the corresponding conduction or valence band, the most effective recombination centers are those with energy levels around the midgap that have nearly equal probabilities for electron or hole emission. For the same reason, at relatively low temperatures, deep levels above the midgap usually act as electron traps (or donor-like) and those below the midgap as hole traps (acceptor-like) (Jaros, 1982; Milnes, 1973; Schroder, 1990; Sze, 1983). However, there are many exceptions to this rule at higher temperatures. Deep centers have proven to be far more difficult to investigate than shallow impurities. In many cases the physical nature of the center causing the appearance of a deep level is poorly understood or unknown (Pantelides, 1992). As a common characteristic,bulk-induced defects exhibit one or more well-defined discrete energy levels and often are called deep levels2 (Jaros, 1982; Li and Sah, 1982b; Milnes, 1973; Pantelides, 1992; Pearton et al., 1992; Schrober, Here and elsewhere, we use the short form deep levels to stand for deep level miters.
10
PLAMEN V. KOLEV AND M. JAMAL DEEN
1990; Sze, 1983). In contrast, surface related defects usually generate a continuum of energy levels spread over the entire bandgap (Blood and Orton, 1992; Nicolian and Brews, 1982; Klausmann, 1981; Sze, 1983) and are commonly referred to as ititeqhce t r q u . In addition to the shallow impurities and deep centers, the semiconductor lattice may contain extended defects, such as dislocations, stacking faults, precipitates, and grain boundaries. In most cases, they behave electrically as large-concentration deep centers and, in general, crystals free of dislocations are desired for electronic and optoelectronic applications (Pantelides, 1992).
3. I$ects of the Deep Centers on Device Perjormance As previously mentioned, deep levels control primarily the lifetime of the excess charge carriers. In devices where long carrier lifetime and stability are desired, deep centers usually are unintentionally incorporated and have a negative effect. All of the following may be attributed to deep centers: leakage currents in p-n junctions (Chen et al. 1984); Schottky barriers and related devices (Milnes, 1973); minority carrier lifetime degradation (Hamilton er LII., 1979); charge losses in CCD cells (Murowinski et a/., 1995) or DRAM capacitors; reduced efficiency and degradation of solar cells (Rohatgi, 199 I ; Rohatgi et d., 1993; Schott et a/., 1980) and high electron mobility transistors (HEMTs) (Meneghesso et ail., 1996); contact quality (Pantelides, 1992); performance degradation in avalanche photodiodes (Zhao et al., 1996; Ma et nl., 1994); resonant tunneling diodes (Deen, 1993a, b; Ng et a/., 1991; Ma er a/., 1992); and bipolar transistors and low-frequency noise (Chen et a / , I998a, b; Deen et d.,199Sa, b; Ng er d., 1992; Murowinski e t a / . , 1993a, b; Doan et ul, 1997). In VLSI technology, as the dimensions shrink and die area increases, defect density must be decreased appropriately and, thus, very low deep-center concentrations must be detected and eliminated (Rancour, 1995). Furthermore, the new technology trends are for using deep trenches filled with a dielectric that has quite different mechanical properties from those of the host semiconductor. As a re:wlt, consecutive thermal treatment may generate stress-induced lattice defects (Pantelides, 1992). Annealing of these defects (Johnson and Herring, 1991) is not a simple task, because another important requirement, the need to keep the p-12 junctions shallow, calls for lowering the temperature and shortening the time of the thermal processing. In a number of models, native defects at the interface were recently attributed to the Fermi-level pinning observed at Schottky contacts (Pantelides, 1992). For optoelectronic devices, nonradiative transitions through deep levels may substitute for the radiative transitions, thus degrading photon yield by up to a few orders of magnitude (Pearton et d . , 1992). Therefore, deep-level control is of particular importance for III-V devices and especially for solid-state
DEVELOPMENT A N D APPLICATIONS OF A NEW DLTS METHOD
I1
lasers, such as GaAs/AICaAs lasers (Lang, 1989). Recent success in the development of SiGe hetero-junction devices is particularly exciting for the future of the silicon-based electronics- this may provide two properties missing in crystalline silicon microelectronics, but inherent in Ill-V devices: bandgap engineering and light emission. Because of the lattice mismatch between Si and Ge, thin Ge,Sil-, layers may generate a significant amount of deep levels, which can lead to a significant deterioration of some desired device properties, mainly carrier mobility (Lang, 1989). Studying deep levels is also important for optimization of Ill-V devices (Gotz et al., 1994; Gotz e t a / . , 199621, b, c). On the other hand, in devices where short lifetime is beneficial or recombination is light emitting, such as in fast switches or light-emitting diodes (LEDs), respectively, deep centers can be introduced intentionally. For example, nitrogen and oxygen introduced in GaP are responsible lor the radiative recombination in commercial LEDs, and gold-doping is used to reduce the switching time in fast Si-based bipolar devices (Pantelides, 1992). Another example of the possible benefits of deep levels is that of Cr-doping used to reduce the conductivity and obtain semi-insulating GaAs substrates. Nevertheless, in these technologies control of non-intentional contaminants is required, as they can impede the role ofthe desired impurity. In all cases, the ability to identify the deep centers and to measure their concentration, and sometimes their spatial distribution (impurity profile), is a necessary requirement. C. Generation-Recoinhiriation Stutisticx
I . PheriomerioloRical Overview In an ideal semiconductor material, there are no allowed energy states inside the bandgap. As already outlined, they are created by incorporation of shallow impurities and deep defect states. The latter are also referred to as traps, generation-recombination (G-R) centers, deep levels, deep centers. deep impurities, deep imperfections, etc. Although in many cases the use of a specific term is justified, this abundance of names might sometimes be confusing. Further complication arises from the fact that the same defect state can be a trap or a G-R center, depending on the temperature and the energy level in the bandgap. In Fig. 2 are defined electron and hole traps, and a recombination center (Miller et ul., 1977). The probability for a given transition is illustrated by the width of the arrow defining this process. From Fig. 2 is clear that the complete description of a particular defect does not define it only as a trap for electrons or holes but as a deep level with specific capture cross sections for electrons CT,~and for holes o ~ Then ,. the capture rate coefficients cIIand cll are defined as (Hall, 1952; Shockley and Read, 1952) =
(u,,)
and
('/1
= ol,(u,,)
(1)
13
PLAMEN V . KOLEV AND M. JAMAL DEEN
FI(;IIRE2. Definition of electron trap. hole trap, and recombination cciiIci-.
where ( u , ~and ) ( u / ) )dcnote the average thermal velocities for electron and holes, respectively. Considering that the capture rate itself equals the capture coefficient multiplied by the concentration of the free carriers, that is, c,!n for electrons. it is obvious that ;I given defect with specific capture cross sections nI, and mil can be a trap or recombination center depending on the free carrier concentrations. 2. Some Results ,from Shoc.lilc.\,-ReLid-Hull (SRH) Theor!, Let us consider the simple case in which only one kind of deep center exists in the material. A deep ccntcr may be occupied by an electron or hole. The
concentrations of deep centcrs occupied by electrons n I , and those occupied by holes 11,. must equal the total concentration of the deep centers A',, that is, N , = )it p i . When free electrons and holes are generated and trapped, the electron concentration in the conduction band 17, the hole concentration in the valence band p , n , , and p , are functions of time. Thc time rate of change of these concentrations is described by Shockley-Read-Hall (SRH) theory (Hall, 1952; Shockley and Read. 1052). The emission rute for electrons e,, represents the number of electrons, c n i i k d from charged deep cenkrs per second. Similar is the definition of the emission rate for holes el,. The ccrpture rutes c,, and cI, are defined in the preceding by using Eq. ( I ) . In gener;ll, the rate of change of n , or p i is described by nonlinear differential equations. Either of these equations can be linearized and solved in two cases: I ) in the reverse-biased space charge region, where the free carrier concentrations n and p are small and can be neglected; and 2) in the quasi-neutral region, where 17 and 17 are
+
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
13
essentially constant. For the last case, the solution for n ( t ) gives (Schroder, 1 990).
where n ( t ) is the concentration of the deep center occupied by electrons at t = 0, en and e , are the emission rates for electrons and holes, c,,n and c p p are their respective capture rates, and r = l/(en c,,n e , c,p). Equation (2) is difficult to solve without additional simplifications because e,,, el,, c,,, and c,’ are not known, and n and p vary with time and distance in the material. However, some important simplifications can be made for extrinsic semiconductors in the case of one-sided p-n junction or Schottky barrier (Miller et al., 1977; Schroder, 1990). In an n-type semiconductor, to a first approximation, p can be neglected. If we consider a Schottky diode on an n-type substrate with electron traps in the active region, then at zero bias the capture rate dominates the emission rate and the steady-state concentration is n , = N , . After applying a reverse bias pulse, within several picoseconds, the free electrons are swept out of the space-charge region (SCR) and emission dominates since ~ , 2 ~ 0. n Then, with r,,, = l/e,,, Eq. (2) reduces to
+
+ +
(3 1
However, at the edge of the space-charge region (the so-called Debye tail) the electron concentration is not negligible, c,,n is not zero, e,7 is not constant, and so the time dependence of n ( r ) can be non-exponential if the contribution of that effect to the total transient is not small enough. When the diode is pulsed back to zero bias, the free electrons flow back in the space-charge region, and if the previous emission pulse was long enough to empty most of the electron traps, then capture dominates, and the concentration n ( t ) is given by n,(t)= N t
-
[ N , - n,(O)]exp
(
2
--
where r,,, = I/c,,n and n,(O) is the initial steady-state concentration. Considering equilibrium conditions, the principle qf detuiled balutzce requires the rates of emission and capture processes lo be balanced. Thus (Schroder, 1990),
14
PLAMEN V. KOLEV AND M. JAMAL DEEN
where no (the index
denotes equilibrium values) is
and
nTo =
1
+
NT exp[(E, - Ep)/kTI
where E,. is the conduction band energy level, E , is the deep-center level energy, T is the absolute temperature and k is Boltzmann’s constant. Using Eq. (7) and Eq. (5) we obtain
The emission rate for holes e , , can be similarly obtained. An important assumption is then made. The deviation from equilibrium state is considered small, the new state is referred to as quasi-equilibrium, and the new notz-equilibrium emission und cupture rutes ure considered equul to their equiIibriutn values. This is expressed as en = c , , n ~ and
where
n = N,. exp and
[
e p = c,,p~
(9)
E,. - E l -
[
P I =N,,exp -~ E1,,J]
In the reverse-biased junction where a strong electric field exists, this is certainly a poor approximation and capture cross sections determined under these conditions generally do not give reliable results (Schroder, 1990). Nevertheless, the assumption is commonly made and the accuracy of all results is contingent on the limits of this uncertainty. Considering Eqs. (9) and ( 1 O), the expression for the emission rate e,, of electrons trapped in centers with energy level El below the conduction band is e,, = (T,v,,N,.exp
(
Z).
-~
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
15
where N , is the effective density of electron states in the conduction band and A E , = E , - E,. Here and later in the text we omit the brackets around u , , , ~ ] for brevity. Similarly, for hole traps in p-type semiconductors, we obtain
where N , is the effective density of hole states in the valence band and A E , = E, - EL,.The expressions for the emission rate-equations (12) and (13)-are the fundamental equations in the DLTS method. As the sample material is known, u,, and N , . (or uI, and N,,, respectively) can be easily calculated over a wide range of temperatures. Then it remains to determine experimentally the characteristic defect parameters CJ?, (or 0,)) and A E , . Each one of these equations links two unknown parameters of a deep center with two measurable variables -emission rate and temperature. Therefore, one has to measure the dependence of the emission rate from the temperature in order to obtain the deep-center parameters. Alternatively, the capture cross section can be determined from the capture process kinetics (Henry et al., 1973; Pals, 1974; Wang and Sah, 1985), the emission rate can be measured, and A E , determined at a constant temperature (isothermal measurement). However, for a wide range of deep levels, the emission rate at room temperature is too fast or too slow in order to be accurately measured conveniently with sensitive instrumentation. Considering the exponential dependence of the emission rate on the temperature, as indicated in Eqs. (12) and (13), the relative distance from the corresponding band edge, and the speed of most frequently used instrumentation, the useful temperature range can be estimated to be 50-450 K. This means that control of the sample temperature and, most likely, cooling well below 0°C might be required even for isothermal measurement of the deep-level parameters. D. Detection of the Emission of the Trcrpped Churge
I . Capacitance Trunsient For simplicity, the case of a one-sided abrupt p-n junction or Schottky barrier will be considered in the following sections. The specifics of MOS structure will also be discussed where applicable. Let us consider the space-charge region (SCR) of a reverse biased p+-n junction or a Schottky barrier on n-type semiconductor (Miller et al., 1977; Stolt and Bohlin, 1983). Figure 3 shows only the SCR in the lightly doped ti-type material, because the SCR spreads mainly on this side of the junction. In ideal n-type material, the SCR contains only the charge of the ionized donors. Let us assume that donor-like deep levels are present in the SCR and that they are charged with trapped electrons, for
16
PLAMEN \/. KOLEV A N D M. JAMAL DEEN
space
transsition quasi-
VI,
+++
+++
Vn
f
+ ++ + +
tran-
- _ _ -E,_ ++ + ++ Ed ~
- . +++ - . h + ~- ~ .-E~ . - . OOOOOE,
)---I -w-
I
-
b
-w-
I
-
example, after a pulse to zero or a slight forward bias. Applying reverse bias again will sweep out the free electrons in a few picoseconds and for 0 < x < y in the SCR, the charge trapped in the deep levels will be added to that of the ionized donors. Within that part of the SCR the electronic transition will be entirely an emission process as there are no free electrons and the capture rate will be zero (Sze, 1983). The situation is more complicated in the transition region at y < .r < w where both generation and recombination processes contribute to the equilibrium occupation of the deep-level traps (Miller et d., 1977). One should also remember that the transition to the quasi-neutral region in the bulk is not abrupt, but has some width determined from the diffusion of free electrons into the SCR. This transition is also known as the Debye tail region (Miller et al., 1977; Rockett and Peaker, 1981; Sze, 1983). When the width of the SCR is much larger than the width of the transition region, the emission process described by Eq. ( 1 2) will dominate and the junction capacitance can be used to monitor this process. With a reverse bias, the traps inside 0 < x < y cannot be charged with electrons, so it must be alternated with a bias at which electron capture will dominate. This is usually accomplished by applying for some short period of time a slight forward or zero bias, which is commonly referred to as a trap $fling pulse (Schroder, 1990). During this filling pulse, the capture rate dominates because of the large concentration of free electrons. In Fig. 4, this is the second event in the sequence. After this pulse (event 3 in Fig. 4), the bias is set back at V R but the capacitance is below its quiescent value (event 1 in Fig. 4) because of the compensation effect of the trapped majority carriers, electrons in this case. Next, at the moment = O+ these trapped electrons will be emitted at a rate defined by Eq. (12) and after a sufficiently long time the system will restore to its quiescent state
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
due to thcrmal emission I> 0
reverse hias I
majority carriei "filling" pulsc t=O FlGuRe
17
hcginning of transienl
4. Capacitance transient due to a majority carrier trap in
ii
I=
o+
pi
-11
diode.
with the corresponding steady-state capacitance value. This relaxation of the system can be observed through the capacitance transient A C shown in the central part of Fig. 4 and corresponding to event 4. When the trap in the ti-type material is acceptor-like. the situation can be more complicated (Miller P t a/., 1977). As it is shown in Fig. 5 , a forward bias pulse can be used to inject minority carriers (event 2 in Fig. 5) in order to populate the deep levels with holes, which can be subsequently emitted (event 4 in Fig. 5 ) . This emission of holes is defined by Eq. (13) and can be monitored as a capacitance transient A C shown in Fig. 5 . From Figs. 4 and 5, it is clear that the sign of A C indicates whether the emission is from either a majority or a minority carrier trap. In the case of a voltage bias
0 qLllcaccnl
reverse hias t <0
t
I ra
due 10thermal emission of holes t>o
AC
-
0 minority carrier in,jection pulsc
heginning ol transicnl t = O+
t=o
FIGURE5 . Capacitance transient due to
ii
minority carrier trap in a p + - t t diode.
18
PLAMEN 'V. KOLEV A N D M. JAMAL DEEN
or photoinjection pulse, a complication arises from the fact that the pulse introduces both majority and minority carriers in the depleted region, thus two capture processes govern the population of the deep levels. Also, deep levels near the midgap may emit carriers to both conductance and valence bands (Miller ct d., 1977). Eixperiments with variation of the injection level or other modifications (Kolev et d., 1998c) may help to reduce this ambiguity. Nevertheless, because of these complications, it is generally accepted that the DLTS method based on capacitance-transient measurements is mostly a majority-carrier method (Blood and Orton, 1992). Its strength is that each deep level can be distinguished through its unique set of activation energy and capture cross section, and can be studied independently (Benton. 1990). One important requirement in order to obtain a simple relation between the measured capacitance transient AC and the emitted charge is A C << C, where C is the total capacitance of the diode. It can be shown (Schroder, 1990) that this condition is satisfied when the deep-level concentration is N, << N,/ where N,/ is the donor concentration. Then, the expression for time dependence of the capacitance C ( t ) toward its steady-state value Co is simplified to
Assuming complete trap filling and using Eq. (14), it is simple to show that AC/C = N,/2N,/. Therefore, for sensitive measurements of trap concentrations orders of magnitude below the shallow impurity concentration, the ability to measure very small relative changes in the capacitance of the diode is required.
2. Current Trmsient Another means to detect the charge emission from the traps in the previous cases is to measure the current transient (Borsuk and Swanson, 1981; Sah, 1976; Tokuda and Usami, 1983). Indeed, the electrons emitted from the traps cause a current transient (Schroder, 1990), which is given by
where Wo is the width of SCR from which the current transient orig'm a t e s and I , is the leakage current that is eliminated in a differential measurement mode (Wilmshurst, 1990). In the foregoing expression, the validity of the approximation N , << N,! was assumed. Equation (15) is not a simple function of T,,, because the time constant appears both i n the numerator and in the denominator. When the second term in the denominator is small compared
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
19
to unity, it can be neglected and Eq. (15) regains the usual exponential time dependence. The main advantage of measuring the current transient is that the detection instrumentation can be faster than typical capacitance meters, and the current transient can be obtained from structures with relatively small area. However, using the current transient as a probe for detection of the emitted charge from the traps has two major disadvantages as compared to the capacitance transient technique. First, current response is related to the change in the trap occupancy with charge through rfQ/dt while the capacitance transient is directly proportional to AQ. Second, there is no distinction between emission of majority or minority carriers. For these reasons, current transient measurements are used for samples with large leakage currents (Day et al., 1979b) or with small areas (Hawkins and Peaker, 1986) because these samples cannot be measured using the capacitance transient technique.
3. Chiirge Trunsierzt The charge emitted from the traps can also be measured directly. A circuit for this type of measurement is proposed in (Farmer et ul., 1982). This circuit is essentially an analog integrator of the current transient previously described. Therefore, the integration time constant of the integrator t p = R1:Cl.. tnust be much larger than the largest time constants of measured transient T,,,.Then, neglecting the second term in the denominator of Eq. (15) and considering an integrator with a time constant t p >> T , ~ ~ the , , emitted charge can be detected as a voltage transient (Farmer et al., 1982; Schroder, 1990), which is given by
Coinparing Eq. (16) with Eq. (15), we see that t,,,is eliminated from the denominator in the exponential prefactor, thus simplifying the analysis (Thurzo and Lalinsky, 1982). Another advantage is the simpler instrumentation needed to measure fast decaying transients.
4. Voltage Trrrnsient When the volume of the space charge region can be held constant by applying a voltage to compensate for the emitted charge, this voltage is directly proportional to the emission of the trapped charge. The simplest method to keep the volume of the SCR constant is to maintain the capacitance C of the diode 1985; Goto et ~ d . 1973; , constant by adjusting the reverse bias (DeJule et d., Johnson, 1979; Johnson ef al., 1979; Miller, 1972; Shiau et d., 1987a; Kolev. 1992). The compensation voltage needed to maintain the capacitance constant
20
PLAMEN V. KOLEV AND M. JAMAL DEEN
is a voltage transient (DeJule et ul., 1985; Schroder, 1990), and is given by
where E,Y is the dielectric permittivity of the semiconductor, A is the diode's area, and Vf,; is the built-in junction potential. Unlike the expression for the capacitance transient, Eq. (17) is valid even when the condition N , << N,! is not satisfied. In other words, Eq. (17) can be used even for large trap concentrations comparable with those of the shallow level impurity. Another important advantage of using this method for detecting the emitted charge is that it does not depend on the area of the diode, as the signal depends on A2/C2 according to Eq. (17). A similar area independence also exists when the traps are located at the interface of a MOS structure. Because of their continuous distribution in the bandgap, interface traps are measured with their energetic density of states per unit area fl;,[eVp'cm-2]. When the decaying transient V c ( t ) is measured at two delay times t I and t 2 with t 2 > T I , and the capture cross section is considered independent of the energy and temperature, the interface trap density is given as
Disadvantages of this method are considered to be the reduced speed and sensitivity of the measurement because of the instrumentation needed to maintain the capacitance constant (Schroder, 1990). However, the recent developments of the feedback circuit (Shiau et ul., 1987a; Kolev, 1992), which is used in coiijunction with a fast capacitance meter, have effectively eliminated this disadvantage. One should note that even using this detection method often does not produce ideal exponentially decaying transients because the approximations used to obtain Eqs. (12) and ( 13) are not valid -the reason is a considerable capture rate in the Debye tail (Rockett and Peaker, 1981). Besides the constant-capacitance technique, we also note other methods used to keep the volume of the SCR constant. The constant-resistance method uses the dependence of the threshold voltage (or pinch-off voltage) of a field-effect transistor on the volume of the SCR and the balance of the total charge within it (Kolev and Deen, 1998). Another technique, convenient for bipolar transistors, could possibly use the Early voltage as a tool to probe charge emission from the traps localized in the collector-base SCR. In general, virtually any device where the SCR can affect some device parameter may offer an opportunity to compensate for the charge emitted from the traps localizd inside the SCR by adjusting the applied voltage. Then this compensation voltage should display a
DEVELOPMENT A N D APPLICATIONS OF A NEW DLTS METHOD
21
transient described by Eq. ( I 7) or another expression related to Eqs. ( I 2) and ( 13). Therefore, further developments in this technique might be expected.
5. Other Methods In the preceding methods the test structure was a one-sided abrupt p-n junction or Schottky barrier diode and the detection of the change in the trapped charge was based on the measurement of some electrical parameter of the diode. However, there are about 200 different effects i n semiconductors relating their electrical, optical, magnetic, acoustical, thermal and mechanical properties. Many of these may be related to the charges trapped in the deep-level centers in the bandgap and thus may be used to probe its changes. For example, the transverse component of the acousto-electric voltage generated by a surface acoustic wave (TAV) has been used as such a probe (Tabib-Azar, 1989; TabibAzar et id., 1987). The Hall effect has also been used (Kackwalla and Miller, 1987). In both cases, the new technique was used to study semi-insulating GaAs where the concentration of free carriers is too low for standard DLTS measurements. It can be speculated that deep levels may play a crucial role in the appearance and magnitude of some effects. Therefore, finding a convenient method to study deep levels may have a significant practical value. In principle, this is a largely unexplored area and the limited application of other detection methods can be attributed to both the limited knowledge of the influence that charge emission from deep levels has on any particular semiconductor effect, and the lack of suitably developed instrumentation. E. Determinution qf' the DecpLevel Pcirumeters
1. Uncertuititirs in Determinution ojthe IleepLevel Purameters
There are three major deep-level parameters 01' interest: energy level position, capture cross section, and concentration (Benton, 1990; Chen and Milnes, 1980; Grimnieiss, 1977; Rohatgi et a/., 1983). Sometimes, it is also important to know the spatial distribution of a trap in the bulk of the semiconductor (Li and Sah, 1982b; Rockett and Peaker, 1982; Shiau rt ul., 1987b; Stievenard and Vuillaume; 1986) or the energetic distribution of interface traps in the bandgap (Johnson et ul., 1978; Klausniann, I98 I ; Soncini et ul., 1990; Stannard, 1978). Finding the energy-level position and the capture cross section is most important as these are major steps in the process of trap identification (Benton and Kimerling, 1982; Chen and Milnes, 1980; Kimerling et ul., 1981a; Milnes, 1983; Sze, 1983; Walker and Sah, 1973). However. these parameters may not suffice and a close look at the production history of the test structure may be required (Benton, 1990).
22
PLAMEN ’4.KOLEV A N D M. JAMAL DEEN
Equations ( 1 2) and ( 1 3) for the emission rate of the trapped charge describe a thermally activated proce The standard approach is to measure the emission rate at different temperatures, to construct an Avrheizius plot (i.e., log (I/re,,,/,)vs l/T), and to determine the activation energy and the exponential prefactor. It seems straightforward to consider the energy-level difference to the corresponding band edge A E , as the activation energy and from the prefactor to determine the capture cross section a,,,,,. Unfortunately, this is not precisely true for several reasons. First, the exponential prefactor in Eqs. ( 1 2) and ( 1 3) contains thermally dependent entities. N,. is dependent on temperature as T3’’ and u,, as T”’, which is easily accounted by plotting log(1/T2r,,,,,,) vs 1/T. However, a,L,/, and its thermal dependence are not known. For determination of a,,,,) it is better to use the filling pulse technique (Brotherton and Bicknell, 1982; Henry et ul., 1973; Lang, 1974a; Pons, 1984; Pals, 1974; Schroder, 1990; Zylbersztejn, 1978) at several temperatures and to find its thermal dependence. Often this gives an additional term in the exponent (Miller et a/., 1977; Schroder, 1990) and, as a result, the Arrhenius plot does not accurately give the energy-level position or the capture cross section. Other complications arise when a,,,,, and A E , are electric-field dependent (Couturier et ul., 1989; Lang et al., 1980; Komarov et al., 1994; Martin et ul., 1981; Sah et ul., 1970; Vincent et d., 1979). When thermodynamic considerations are taken into account, it is shown (Engstrom and Ahm, 1983; Schroder, 1990) that the thermal activation energy is related to enthalpy, whereas the energy difference in Eqs. (12) and ( 1 3) is related to entropy. This difference can be accounted for by introducing a degeneracy factor g in these equations, but this degeneracy factor is not well known for the deep levels (Lang, 1974a; Lang et ul., 1980; Schroder, 1990). There is a large variety of techniques used to investigate deep levels by measuring their effect on some property of the host semiconductor material (Deen, 1993b; Groeseneken et ul., 1984; Klausmann et al., 1986; Kuhn, 1970; Milnes, 1973; Miller er ul., 1977; Sah et ul., 1970; Schroder, 1990). Here we concentrate on the technique that investigates the transient phenomena during emission of the charge trapped at deep levels or interface traps.
-
-
2. The “Rate Window” Concept
For measurement of the activation using the Arrhenius plot technique, one has to obtain the relation between the time constant of the measured transient and the temperature. Considering the fact that the transient signal itself is frequently very noisy and is often superimposed on some background constant level that is orders of magnitude larger than the magnitude of the transient, monitoring a fast signal decay (usually in the millisecond range) is not a trivial measurement task. A major breakthrough occurred in 1974, when Lang introduced the so-called “rate window” concept and applied it to the method that he
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
23
I
a
2
the system produces a peak when the tirnc constant o f the transient signal equals that of the sytem’s filter
II
1 1
FIGURE 6. The concept for a time-conslant filter known as “rate window”.
named deep-level transient spectroscopy or DLTS (Lang, 1974a). In essence, this is an electronic filter for a decaying transient signal with a particular time constant, as selected by the user (see Fig. 6). The filtering property is independent of the magnitude of the tested signal. When the sample’s temperature is varied, the emission rate of each trap with energy in the bandgap varies according to Eq. ( 1 2) or Eq. ( 1 3). When the emission rate is within the “rate window,” that is, the time constant of the decaying transient equals that of the system filter, then the event is indicated by appearance of a peak in the system output signal. When the range of variation of the temperature is large enough, each trap will be indicated by a characteristic peak depending on the specific properties of the trap -activation energy and capture cross section. Thus, the output signal will look like a spectrum produced by an optical measurement. For this reason the method was named trcinsietzt spectroscopy, and a plot of the output signal vs temperature is called a DLTS .spectrum. In principle, it is enough to change the system time-constant filter just once and to repeat the measurement, because two data points for the Arrhenius plot can define a straight line. Frequently, more measurements are needed in order to improve the accuracy of the measurement.
3. The Digital Alternative Although the method for detection of the time constant of the measured signal already described here was historically implemented first, even then, it was realized that the signal information is contained in the time varicitiorz of the decaying transient. This means that, if the decaying transient is recorded itself and without noise, then “no other experimental method can possibly provide superior raw data” (Miller et ul., 1977). Indeed, the ability “to see”
24
PLAMEN V. KOLEV A N D M. JAMAL DEEN
the recorded transient without noise allows the emission rate to be exactly determined at aJixed temperature by simply calculating the slope of' the signal on a semilogarithmic plot. This can be combined with filling pulse methods for measurement of the capture cross section (Schroder, 1990) and the energy level position can be determined at a fixed temperature (Okushi and Tokumaru, 1980), thus avoiding all the uncertainties associated with the thermal dependence of the capture cross section or energy level position (Katsube et d., 1981). We note this digital alternative in particular, as the new digital averaging techniques described in Section 111 are used to obtain and efficiently record the transient, and to reduce significantly the noise. F. Conventional Deep-Level Trutisietit Spectroscopy (DLTS)
As mentioned here, in real instrumentation the capacitance decay waveform is corrupted with noise; indeed. it is sometimes even below the noise level. The essence of DLTS is the extraction of the waveform parameters in some automated manner from the noisy signal (Blood and Orton, 1992; Lang, 1974a; Schroder, 1990). This is a correlation technique, which is a signal processing method where the input signal is multiplied by a reference signal, the weighting function w ( t ) , and the product is filtered (averaged) by a linear filter. In the conventional DLTS method (see Fig. 7), w ( t ) is a Ooxccrr functiori. The capacitance transient is monitored in two moments with delay times tl and 12 from the start of emission pulse at = 0, and the difference is averaged and plotted vs the temperature T . The setup of delay times tl and t2 determines the rate window I l s of' the system and r is given by (Lang, 1974a)
filling pulses
synchronization
r - -
I
I I
I
t y
I
I 1
crycooler
L - - - - J
1
I
T-
FIOIIKF. 7. Setup for conventional capacitance-transient DLTS
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
2s
1
Temperature [ K l (:I)
FIGURE8. a) DLTS spectra with different rate windows dcfned with I / / . b) Arrhenius plots of ihe traps in a).
As the temperature changes, the time constant for a given center r,l, changes according to eqs. ( 12) or ( I 3) and, at some temperature T when T , ~ , ,= ~ 5, the output signal of the box-car averager reaches a peak corresponding to that center. In Fig. 8a are shown examples of DLTS scans with different rate windows. By doing measurements with several rate windows, one obtains data points for an Arrhenius plot (Fig. 8b), from which the activation energies A E , and the effective capture cross sections are extracted. Then, with [ I , t? and A E , known, the impurity concentrations are calculated from the peak magnitudes of the spectra. The great convenience of the DLTS spectrum is that during the experiment the spectrum gives qualitative information on the relative concentrations and energy levels of several traps (Schroder, 1990).
G.
Mciiti Sfngrs
in the DLTS Experiment
Here, a more general view on the DLTS experiment will be presented. The logical sequence of events will be followed in order to distinguish and define the objectives at each measurement stage. The methods and instrumentation used to achieve these objectives will be discussed. 1 . Excitation (Filling Pulse)
The main objective o f t h e excitation is to fill all traps with carriers and to avoid re-emission of the trapped charge, thus ideally t z , ( O ) = N T . The first condition is satisfied by increasing substantially the capture rate as compared to the emission rate. As the capture rate is dependent on the concentration of free carriers (Hall, 1952; Shockley and Read, 1952). it is obvious that the excitation techniques are designed specifically to attract free charge in the vicinity of the deep levels. The simplest and probably the most effective technique is a bias pulse, which attracts free carriers from the quasi-neutral bulk of the
26
PLAMEN V. KOLEV AND M. JAMAI, DEEN
semiconductor into a region that can be subsequently depleted. Because the reaction of the deepest levels is relatively slow, at room temperature and below then the second condition is satisfied by applying relatively short tilling pulses (Miller e f a/., 1977). Therefore the name fifil/itzg pulse in most cases means simply a forward, accumulation, strong inversion, or zero bias pulse (Wang and Evwaraye, 1976; Lang, 1979; Chen etal., 1981; Tredwell and Viswanathan, 1980). When free carriers are not available, as in semiinsulating GaAs or other, heavily compensated compound semiconductors, they must be created in some way and attracted near the deep levels. Most frequently, the excitation pulse is a direct (Chantre et al., 1981; Dubecky, 1989; Hurtes et u/., 1978; Rhee and Bhattacharya, 1982) or indirect (Abele r t ul., 1987; Kremer et ul., 1987) “band-gap” light beam that generates electron-hole pairs near the traps. As both types of current carriers are created, there are complications related to electronhole annihilation (Yoshie anld Kamihara, I983a, b; Burd and Braunstein, 1988) and re-trapping of the emitted charge (Balland et a/., 1986~1,b; Kremer et ul., 1987; Tin et a / . , 1987). Th’e need to create significant concentrations of free carriers requires intensive light that can be supplied by a laser beam, which was used, for example, to excite free carriers in high-purity, detector grade Ge (Blondeel et a/., 1997) and Si (Li and Li, 1994). Other methods used to excite free carriers are electron beam excitation using a scanning electron microscope (SEM) (Breitenstein, 1982; Heydenreich and Breitenstein, 1986; Petroff and Lang, 1977) and acousto-electric RF pulses (Tabib-Azar, 1989; Tabib-Azar et d., 1987). Depending on the magnitude of the applied bias pulse, the intensity or wavelength of the light beam, or the focus bias of the electron beam, the excitation can be nonselective (large-signal) or selective. Selective excitation occurs when only a small fraction of the traps i n the volume observed during the emission pulse are populated. For example, when the filling pulse magnitude is small and very close to the reverse bias during the emission pulse, the Fermi level scans only a fraction of the interface traps. Thus, the acquired signal originates only from traps within a few kT above the steady-state Fermi level, which is a powerful tool for investigating the interface trap density (Johnson, 1979; Klausmann, 1981; Eades and Swanson, 1984a, b; Kumar and Iyer, 1983; Schulz and Johnson, 1977, 1978; Soncini et ul., 1990; Wang and Evwaraye, 1976). An alternative technique is to combine two consecutive filling pulses with slightly different magnitudes in pairs, and to consider the difference in the corresponding emission pulses. This is a differential rather than selective population technique and is known as double-correlation or DDLTS (Johnson et a/., 1979; Lefkvre and Schulz 1977a, b). This technique is useful for obtaining the deep-level depth profile in the depleted region. For horizontal profiling, an electron beam in a SEM can selectively populate only
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
27
a fraction of the traps in some particular area of the device (Petroff and Lang, 1977j.
2. Detection (Emission Pulse) Detection techniques are based directly or indirectly on the transient phenomena discussed here i n Subsect. 4. Their main objective is to convert the charge emission from the deep levels into a measurable electrical transient signal that is almost exclusively a voltage. For example, in the conventional DLTS technique, the capacitance transient (Stievenard et al, 1985) is converted into a voltage transient by a custom-built (Lang, 1974b; Wagner er d., 1980; Breitenstein, 1982; Miller and Potterson, 1975; Misrachi et al., 1980) or a commercial3 capacitance meter. Capacitance meters are most commonly used for DLTS measurements. In our opinion, the latest model from the Boonton Electronic Corp., the Boonton 7200, is the best available at this time.4 Sometimes, a commercial capacitance meter can be modified to fit the DLTS application better (Chappell and Ransom, 1984; Nicolian and Brews, 1982; Kolev, 1992). Commercial capacitance meters are also part of commercial DLTS systems.' A compromise is made in selecting the test frequency of the capacitance meter. The need both to integrate more pulses in a short time interval for better sensitivity and to obtain a larger signal limits the low threshold frequency at about I MHz. At the same time, the bulk resistance of the material and its associated series resistance, phase delays in the cables connecting the capacitance meter with the test structure inside the cryocooler, and skin effect all require the test frequency to be kept not too high, usually below 10 to 20 MHz. It seems that 1 MHz, which is a standard frequency for high-frequency capacitance-voltage (C-V) measurements in the MOS industry, has become widely accepted. The test level is most frequently 15mV,,_,,, but it can be selected at higher levels of up to 100mV,,-,,.6 A current transient is detected with a fast, large-gain amplifier, such as EG&G PAR 181 current sense amplifier (Borsuk and Swanson, 1980) or a custom-made circuit (Wang and Sah, 1985). Detection of the current transient has been calculated to be about 10 times more sensitive than detection of the capacitance transient (Rancour, 1995j. However, as mentioned earlier, current transient spectroscopy (CTSj (Borsuk and Swanson, 1980; Tokuda
'
Application Note IM-00 I , Boonton Electronics Corporation, 2.5 Eastmans Road. Parsippany, NJ 07054-0465. USA. Also, Irzstructiori Mriiriitrl / o r Moc/c,/ 7213 Cripar.itoric.c~Mc,rc.r, Boonton Electronics Corporation, Parsippany, NJ ( IW3). Applicate Note 1M-001, Boonton Electronics. Catalog SULA TECHNOLOGY Deep Level Spectrometer. P.O. Box 963, Ashland, OR 97520 Application Note IM-00 I . Boonton Electronics.
'
28
PLAMEN V. KOLEV AND M. JAMAL DEEN
and Usami, 1983; Wessels, 1976) has two major disadvantages. First, there is no distinction between emission of majority and minority carriers. Second, the signal magnitude is dependent on the time constant. In addition to the transient current, the displacement current caused by the expansion of the SCR and leakage currents is present, and these currents create a dc level that is difficult to eliminate. Suppressing the parasitic transients associated with the transition from filling to emission pulse further complicates the measurement. In general, current transients are more difficult to analyze than capacitance transients. To be precise, only capacitance and current transients are direct methods for observation of the charge emission from the deep levels. Charge measurements are in fact processing of the transient current with a simple integrator circuit (Farmer et al., 1982; Kirov and Radev, 1981). The integration performed by a circuit similar to that shown in Fig. 9 eliminates the appearance of time constant in the denominator of Eq. (IS), and then the more convenient expression can be used. In addition, the instrumentation is simpler and less expensive than a fast, sensitive capacitance meter or nonsaturating current sense amplifier. Potential problems for the switching from tilling to emission pulse can be eliminated, for example, by applying the methods described by Gardner (1987) and Mego (l986), and the circuit can be further improved using analog memory circuits (Kolev, 1992; S h i m et ul., 1987a; Vitanov and Kolev, 1986). Just as the charge transient is a result of electronic processing of the current transient, the voltage transient is obtained when the capacitance is maintained constant by an electronic feedback circuit (Dejule et ul., 1985; Goto et al., 1973; Johnson, 1979; Johnson et ul., 1979; Miller, 1972; Shiau et d., 1987a; Kolev, 1992), which is used in combination with a fast and sensitive capacitance meter. Unlike the charge transient circuit, this feedback circuit does not process the signal but it chariges the experimental conditions, that
w
DLTS signal
F I G ~ J R9.E Integrator circuit for charge transient rneasurements.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
29
is, the applied bias. Because of the very high gain of the feedback circuit ( > 1 ,OOO,OOO), the actual capacitance change needed to produce the com-
pensation voltage is negligibly small and the term “constant-capacitance” is completely justified. In Fig. 10, a setup for constant-capacitance measurements is shown. The reference capacitor C , is set to be equal to the sample’s capacitance during the filling pulse, and C , is set to balance the sample capacitance during the emission pulse. Thus, when the switch S W alternates between two reference capacitors C , and C , connected to the differential terminals of the capacitance meter, the feedback amplifier OA produces a bias applied to the sample, which is precisely the one needed to balance the corresponding reference capacitor. For dc and low frequencies, this feedback loop is negative, but for high frequencies the delay introduced primarily by the capacitance meter makes the feedback positive, causing large oscillations in the feedback loop. In order to avoid these oscillations, a lowpass RC filter is integrated into the feedback amplifier. This filter is adjustable, because the setup depends on the specific effective gain of the sample dV/dC, whose gain may vary with the temperature during the measurement. During the filling or emission pulse, the capacitance is largely stable. If for some reason it changes, a compensation voltage is immediately produced by the feedback amplifier to compensate for the change. Thus, the charge emitted from the deep levels is precisely balanced by the voltage applied to the structure. This constitutes the voltage transient. Probably the most important consequence of maintaining a constant volume of the depleted region is the avoidance of the requirement that N r << N,/ (Eq. (17) is valid for arbitrary N , ) . This feature eliminates a severe restriction on sample purity and effectively solves the problems associated with a nonexponential transient analysis (Crowell and Alipanahi, I98 1 ; Istratov and Vyvenko, 1995; Shaban, 1996; Murray et ul., 1986; Phillips and Lowney, 1983; Okushi and Tokumaru,
c,
Cf
# j! -sw
.
+
bias -external input
sample
Hi capacitance ineter
test
LO diff
-
u c -3 .r r Ja
AV
ZEi
go feedhack circuit
FIGURE10. Setup for constant-capacitance voliage transient meastireinents CCVT.
30
PLAMEN V. KOLEV A N D M. JAMAL DEEN
1980; Wang and Sah, 1984; Hanak et ul., 1990; Kirchner et al., 1981 ). Furthermore, it allows deep-level profiling to be implemented easily and regardless of the relative trap concentration compared to the doping concentration N,,. In (Shiau et al., 1987b), it was demonstrated that the constant-capacitance technique can be used for profiling simultaneously both the dopant and several deep traps by scanning, the depleted region while varying the reference capacitor C,. Similar scanning of the emission pulse level was also used for energy-resolved interface trap measurements (Tredwell and Viswanathan, 1980). Although the constant-capacitance technique is an indirect probe method, it is directly related to the emitted charge from the traps. There are many other indirect detection techniques that have much more complicated relations to the emitted charge. Conductance DLTS (Chen et al., 1981; McLarty et al., 1988~1, b) detects the transient of the threshold voltage caused by charge emission from traps in the depleted region of a field-effect transistor by measuring the corresponding changes in the transconductance g,n. For the same purpose, rather than the transconductance, the variation in the drain current can be measured (Hawkins and Peaker, 1986). Although a less sensitive technique, it is possible to measure the small-signal conductance transient in high-resistivity samples and to relate this signal to the emitted charge from the traps (Fourches, 1991). Photovoltage (Lagovski et ul., 1992; Monemar and Grimmeiss, 1982; Mooney, 1983) and Hall voltage (Kachwalla and Miller, 1987) measurements have also been used in DLTS techniques to detect the charge emitted from the traps. 3. Aivruging The main objective of the averaging process is to improve S N R and possibly to reduce the number of data points (Kolev et al., 1998a). There are two main approaches at this stage of the experiment, analog and digital techniques. In analog techniques, the averaging is performed simultaneously with the next step -transient analysis. For this reason, as a representative of the analog methods, here only the averaging function of a single boxcar averager will be discussed. The equivalent circuit of a boxcar channel is simple-it is just a lowpass filter with a time constant T,. = RC and a switch S W (Fig. 1 I ) . With the switch permanently closed, the response (Wilmshurst, 1990) u , of the lowpass filter is
where u,,, is the input voltage. If u,, = 0 then the response of a voltage step is similar to that shown in Fig. 1 lb. If the switch S W is closed periodically
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
(h)
31
(c)
Ficurte I I . Averaging performed by a boxcar channel: a) equivalent circuit; b) transient with SW closed permanently; c ) transient with SW closed only during the intervals Ti,.
with a period 7 , only for a short time T,,, which is called aperture, then the response is similar to that in Fig. I Ic. A dashed line in Fig. 1 l c shows the effective time constant 7(.(,~.,which has been increased T / , / T , , times. This allows for a substantial increase in SNR. But more important is the fact that the sampling can be synchronized easily with the exciting pulse to give the averaged value in some particular moment after the beginning of the emission transient. In that way, by varying this delay time and waiting long enough for the averaging to become close to the actual value, the whole transient can be extracted from noise. When the signal at the output of the detection instrumentation is digitized, then most frequently, a preselected number N of transients are averaged arithmetically by the system controller. Here, it is useful to remember that the SNR improves as f i ,so in most cases this number is set below 300. A new combination of averaging techniques, which is a major contribution in this field. is described in Sec. 111.
-
4. Trunsient Analysis The ultimate goal of the DLTS experiment is extraction of the trap parameters from the components of the decaying transient. This is the objective of the transient analysis. Again, there are two main groups of analytical techniques, analog and digital. Hi$torically, analog methods were implemented first. The filtering function of the “rate window” is a linear operation (Miller et a/., 1977) over the
32
PLAMEN V. KOLEV AND M. JAMAL DEEN
emission rate e ( r ) described by
where w ( r ) is known as a weightjunction. The boxcar averager (Lang, 1974a), lock-in amplifier (Kimerling, 1976) and exponential correlator (Miller et al., 1975) functions are the most commonly used. They are shown in Fig. 12. A good comparison has been reported in (Dmowski and Pidro, 1987; Miller e t d . , 1975). The boxcar function has a superior selectivity for traps with closely spaced time constants, but it has the worst SNR. At the opposite end, the exponential correlator has a superior magnitude sensitivity but very poor selectivity for time constants. The lock-in amplifier’s performance is between that of the boxcar averager and exponential correlator. Sometimes (Day et 01.. 1979b; Miller er d., 1975), the lock-in amplifier is considered to be equivalent to a boxcar averager with special setup. The preference in using a lock-in amplifier is given because it is an easily available and inexpensive instrument. I
transient signal I
I I I
W(t)
hoxcar avcragcr
-I
I
I
I
I I
bL.( 1)
I
exponcntial correlator
I
FIGURE12. Three most popular weight functions used for SNR improvement and transient analysis of analog DLTS signals.
DEVELOPMENT AND APPLICATIONS 01’ A NEW DLTS METHOD
33
The main advantages of the analog methods are their high sensitivity and continuous control during the experiment. The main disadvantage is the necessity to conduct more than one thermal scan in order to collect data for the Arrhenius plots of the traps. Other analog techniques rely on the analysis of the shape of the spectra (Hasegawa, 1985; Langfeld, 1987; Okushi and Tokumaru, 1981 ; Stannard et nl, 198 1 ), analog signal processing (Mangelsdorf, 1959; Toinokage et al., 1982), or they use more sophisticated weight functions to obtain better time-constant selectivity (Atanassov, 1983; Crowell and Alipanahi, 198 1). With advancements in computer and digital electronics, popularity of digital methods has increased. For transient analysis, spectral analysis DLTS (SADLTS) (Morimoto et al., 1987, 1988), Fourier transform analysis (Ikeda and Takaoka, 1982; Okuyama et LII., 1983; Weiss and Kassing, 1988), nonlinear least square fitting (Hanak et nl., 1990), the modulation function method (Valeur, 1978), the method of moments (Kirchner et al., 1981; IkossiAnastasiou and Roenker, I987), the correlation method of linear predictive modeling (Shapiro et ul., 1984), the inverse Laplace transform (Nolte and Haller, 1987), etc. are used. Isothermal methods are also gaining popularity, as the measurement is conducted at a few fixed temperatures (Farmer et al., 1982; Henry et nl, 1985; Yoshida et d., 1993; Suno et ul., 1997; Okushi and Tokumaru, 1981). The main advantages of the digital methods are that they record the whole transient and that only one thermal scan is necessary for transient analysis. The main disadvantages are, in general, lower sensitivity and lower resolution of the magnitude. In transient analysis, the problem often is to separate the contribution to the transient response of two or more deep levels with comparable emission rates because they are signatures of different types of defects. Assuming that the transient signal is a sum of exponents with different magnitudes and time constants, the solution of this problem, in principle, is to perform inverse Laplace transform, which produces a sum of delta functions with parameters the time constants of the traps (Nolte and Haller, 1987). Unfortunately, a well-behaved numerical method for inverse Laplace transform that can be used for DLTS does not exist. The main reason for this is that the exponential functions that are basic in the Laplace transform are not orthogonal along the real axis. In other words, the contribution of each exponent cannot be calculated in a simple way, as is done, for example, in the case of Fourier transform spectroscopy methods where the basic functions are sinusoids with constant magnitudes. Still, an accurate algorithm for inverse Laplace transform was proposed to be adapted for DLTS analyses in (Nolte and Haller, 1987). There is an interesting discussion of the forenientioned problem, which is illustrated with simulation results, showing that the rate window techniques used for analog transient analysis have inherently limited resolution, while the inverse Laplace
34
PLAMEN V. KOLEV AND M. JAMAL DEEN
transform has theoretically unlimited resolution. Unfortunately, the proposed algorithm for inverse Laplace transform is extremely sensitive to noise, which in practice limits the resolution obtained from a DLTS signal with a typical SNR to only about two times better values compared to that obtained using the best rate window technique. As the averaging techniques proposed in Sec. III offer unprecedented improvement in the SNR of the raw transient, this can be translated into a very high energy resolution by applying inverse Laplace transform for the transient analysis.
H. A New Classijcution Scheme fiw DLTS Methods
Since its introduction in 1974, a large number of variations and modifications of the DLTS method have been reported (for example, see the references in (Schroder, 1990), Sec. VII)i. Few and limited attempts for classification have been made (Blood and Orten, 1992; Schroder, 1990). In the previous sections, the main stages in the DLTS experiment and a large variety of techniques for cach stage were presented. Obviously, when they are combined as appropriate. their diversity is very large. There are four main stages of the experiment that should be noted when classifying a technique: excitation, detection. averaging, and transient analysis. For a detailed classification, the magnitude or the selectivity of the excitation pulse should be noted. For example, the conventional DLTS technique introduced by Lang ( 1974a) is a: ( 1 ) large pulse, bias excitation, ( 2 ) capacitance detection, (3) boxcar averaging, and (4) Arrhenius plot transient analysis technique. The acoustoelectric DLTS (Tabib-Azar, 1989; Tabib-Azar et ol., 1987) is an: ( I ) acoustoelectric excitation, (2) transverse acousto-electric voltage (TAV) detection, (3) lock-in amplifier averaging, and (4) Arrhenius plot transient analysis technique. The transient spectroscopy based on the Hall effect (Kachwalla and Miller, 1987) is an: (1) over bandgap light excitation, (2) Hall voltage detection, (3) digital averaging, and (4) Arrhenius plot transient analysis technique. From thc foregoing, it is clear that for designing an appropriate technique, most important are the choices of ( 1 ) and ( 2 ) , while (3) and (4) depend more on the availability of instrumentation and, sometimes, personal preferences. It is also obvious that (3) and (4) can be fixed and only (1) and (2) can be varied to fit the specific properties of the investigated material. In this sense, the choice of ( 1 ) and (2) in most cases can be considered specific to the experiment, while the choice of (3) and (4) refers to the data processing methods used. The following sections in this chapter describe a new feedback circuit and a new detection technique, thus both are related to ( 2 ) , and new averaging techniques, which improve and enrich the choice for (3).
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
35
I. Other Methods rid Coinpcirisons with DLTS Besides DLTS, there are many other techniques for characterization of deeplevel traps located in the bulk of the semiconductor or at the interface between two materials. In the silicon industry, for example, charge pumping (Groeseneken et a/., 1984) is more popular than DLTS because this method can be applied as a wafer level reliability test. Originally, the charge pumping techniques were designed to give the total interface trap concentration N ; , of a MOSFET. With modified bias pulses, which allow the traps at the Si-SiO? interface to be selectively populated, D;, and (T,,,,, can be measured and D;, sensitivity up to IO'eV-'crn-' can be achieved. The method is applicable 0.2 eV from the midgap of Si and mainly for interface traps of more than is restricted only to a specific group of devices. The DLTS is applicable to a wider range of devices and offers alternative methods for energy resolution, allowing D;, to be measured. In addition, DLTS using MOS devices can measure traps closer to the midgap and is not restricted only to the interface region. Charge pumping can be used to obtain spatial profiles of N ; , along the MOSFET channel (Kwan et u/., 1996; Li and Deen, 1990, 1992). The DLTS can also be used for measurement of spatial profiles of a particular discrete trap or D,, along the channel (Johnson, 1979; Kolev and Deen, 1998; Petroff and Lang, 1977). Significant advantages of the charge-pumping techniques are its technical simplicity, high sensitivity for small gate area devices, and no need for temperature variation in large temperature interval. As compared to the current DLTS, it may have higher sensitivity and simplicity because the integration of the trap response occurs in situ, that is, the signal is an averaged response of the interface traps and only a dc current meter is required. Drain current DLTS requires a fast current meter or a feedback circuit plus a voltmeter and additional hardware/software averaging and filtering. In general, most of the selective trap filling, charge-pumping techniques used for energy resolved detection are applicable with some form of DLTS as well, which can supply additional information. For investigation of the interface traps at the Si-SiO2 interface, quasi-static capacitance-voltage (C-V ) method (Ku hn, 1970) and high-low frequency C- V method (Nicolian and Brews, 1982) are still very popular and are often used as routine characterization techniques in VLSl development and production. For these methods, there is an industry standard and high quality instrumentation is readily available. The D;, sensitivity for a good system is usually around 10"',v- I - 2 . Like the charge-pumping technique, change in temperature is not required and the integral response of the interface traps is measured. However, there is an inherent energy resolution because only the response of those traps in the vicinity of a few k T / q around the Fermi level are detected. It can be compared with a DLTS that uses the Fermi level as a probe to obtain
-
-
36
PLAMEN V. KOLEV AND M. JAMAL DEEN
energy resolution (Johnson, 1979; Klausmann, 198I , 1986), but it also needs surface potential calculations in order to relate the gate bias to the Ferini level at the silicon surface. Both C-V methods and small-signal DLTS are sensitive to the surface potential fluctuations. Large signal DLTS with selective trap tilling can be used to avoid the surface potential fluctuation or the fluctuation can be extracted from a curve fitting of AV,(V,) (Klausmann, 1981). The quasi-static method scans almost the entire bandgap without any interruptions in the midgap region and only a MOS capacitor is needed. The DLTS requires a minority carriers source for scanning both sides of the bandgap and fails around the midgap because of generation of current carriers opposite to those emitted from the interface traps. The C-V methods require a high quality electrometer, are sensitive to leakage currents, and apply to MOS structures. 10’eV-’cmP’), is less The DLTS method can be more sensitive (about sensitive to the leakage currents, and can be applied to various test structures. Combining quasi-static C-V and DLTS methods gives more complete results for D,, because DLTS covers ranges of the bandgap where uncertainty i n calculation of the surface potential affects strongly the accuracy of the quasistatic measurements. The Conductance-voltage (G-V)method (sometimes called impedance spectroscopy) (Nicolian and Brews, 1982) is based on the measurement of the parallel conductance component G,, of a MOS capacitor a s a function of bias and frequency. The conductance G,, results from the interface capture and emission of carriers, and is a measure for interface trap density. The conductance method is highly sensitive, IO’eV-’cm-* and gives the capture cross section for majority carriers, but it is tedious, time consuming, and gives Di, close to the bandgap edges, but not in the midgap region. It is also sensitive to the surface potential fluctuations. With elaborate procedures, the effect of these fluctuations can be estimated by fitting curves to the experimental data. A similar approach is used in the small-signal DLTS (Klausmann, 1981). Because of similar sensitivities, the conductance and conventional DLTS methods are considered to complement each other; for time constants faster than I ms, the conductance method is used and for slower time constants, DLTS is used (Klausmann, 1981). Similar to the conductance method, the transconductance method can be used for MOS transistor measurements (Collet, 1975). Except for the conductance method, the methods listed here are designed to measure the deep levels in the surface region of the device. For bulk deep levels, thermally stimulated capacitance (TSCAP) and current (TSC) measurements were used before the introduction of DLTS (Buehler and Phillips, 1976; Nicolian and Brews, 1982). They are compared as a dc analog of DLTS and because of their low sensitivity ( n , > 0 . I N d ) and limited range ( A E > 0.3 eV), they have been largely replaced with DLTS (Schroder, 1990). For the special case of measurement of charge-coupled devices (CCDs), charge transfer
-
-
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
37
efficiency can provide valuable information about the deep levels (Hardy et a/., 1998a, b; Murowinski et ul., 1995). Optical methods such as photoluminescence (PL) can be applied only to those impurities for which radiative recombination has been observed (Schroder, 1990). Because of the high sensitivity of detection instrumentation, PL can be used even when radiation is dominated by another recombination mechanism. Compared to DLTS, PL allows easier impurity identification, but is more difficult for concentration measurement. Nonradiative bulk and surface recombination complicate correlation of a given PL spectral line with the concentration of the impurity corresponding to that line. The sample’s volume detected by PL is determined by the absorption depth of the exciting laser light and the diffusion length of the minority carriers, and this does not allow for deep-level profiling. In some cases, PL can complement DLTS as a tool for investigation of the shallow levels but that is complicated because of the different test structures used for PL and DLTS. The PL has the advantage of being a nondestructive, contactless method, which allows deep-level mapping on large area samples. It also requires complex and very expensive instruments such as argon ion laser, helium refrigerator, spectrometer, and photon counting and detection system (Schroder, 1990). Recently, low-frequency noise measurements of current or voltage noise spectrum have also been used for characterization of deep levels (Citterio et al., 1996; Deen, 199%; Deen and Raychaudhuri, 1994; Deen and Zhu, 1993; Deen et a/., 1993; Deen et al., 1995a, b, c, d; Deen and Quon, 1991; Jones, 1994; Kolev et NI., 1997c; Murowinski et al., 1993a, b, 1995; van der Ziel, 1986). The total noise is regarded as a superposition of noises that originated from several noise sources (Citterio et id., 1996; Deen, 1993b). The thermal and shot noise are considered to be fundamental sources whose contribution can be minimized by appropriate device design and operating conditions such as bias and temperature. Added to those sources are excess noise components such as flicker ( I /f), generation-recombination (G-R), and random telegraph signal (RTS) noises. These three sources are common for inany devices, and their spectral densities and amplitude distributions are well defined (van der Ziel, 1986). The intensity of the G-R noise is proportional to the concentration of the electrically active deep levels and at a given temperature, it peaks at some characteristic frequency, depending on the activation energy of the deep level. The variation of the peak with the temperature then gives the Arrhenius plot. At some other temperature, the actual peak may be above the observation limit of the measuring instrument and then the results may be incorrectly interpreted. This is a Fermi level probe method and can be compared to the small-signal variations of the DLTS. In Table 11, some of the most popular techniques for interface and bulk deep-level measurements are summarized.
TABLE I1 COMPARISON OF DLTS WITHDIFFERENT METHODS Method
DLTS methods
w 03
C-v methods charge pumping G-V method Low-freq. noise method CCD's charge TSCAP TSC
Bulk sensitivity
-
109
NIA
Surface sensitivity
-
E,
109
10"'
NIA
- 109
NIA
NIA
NIA
Energy range
108
NIA NIA
Tempscan needed
- 0.1
-- +
eV midgap E, 0.1 eV E, -0.1 eV E, t 0 . 1 eV E, 0.1 eV midgap E,, f0.05 eV midgap E, - 0.1 eV midgap
-
- 0.05
eV midpap (minority cc.) AE > 0.3 eV AE > 0.3 eV
-
Test device
Deep levels
Interface states
Optical properties
space-charge r e g i x ; resistoi; MOS structure; MOS capacitor MOS transistor MOS capacitor
+
-
E, NIA
data for u
+ +
+ +
a few points
same as DLTS
a few points
CCD
+ +
same as DLTS same as DLTS
+
NIA
+
+ +
+
+
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
39
111. AVERAGING AND RECORDING o t DIGITAL DLTS TRANSIENT SIGNALS
I . Introduction Since the introduction of deep-level transient spectroscopy (DLTS) in 1974 by Lang, the method has been refined many times with new techniques for improving sensitivity and accuracy, adapting DLTS to the specific properties of new types of samples, and simplifying procedures for measurement and data analysis. At present, DLTS is composed of a large group of different measurement techniques applied to a variety of test devices and materials, and is considered to be among the most accurate and reliable tools for investigating the properties of electrically active point defects in semiconductors (Benton, 1990; Schroder, 1990). Because of the great variety of DLTS techniques and numerous possibilities for their combination, any attempt at a detailed classification easily becomes a formidable task. Nevertheless, in all these techniques, there are several common features identifying the technique as of DLTS type. First, it is a time domain measurement of a relaxation process, most often the emission of charges trapped in electrically active centers. Second, the process is considered to be thermally activated and the validity of the Shockley-Read-Hall theory (Shockley and Read, 1952; Hall, 1952) is assumed. Third, there is periodic alternation between filling of the traps with charge (filling pulse) and emission of the trapped charges (emission pulse). This last feature is used for synchronized measurement and integration -a basic method used for signal recovery from noise in electronic instrumentation (Wilmshurst, 1990). The increased sensitivity, allowing detection of a signal immersed in noise, is the major advantage of DLTS over the previously used thermally stimulated current (TSC) (Driver and Wright, 1963), or thermally stimulated capacitance (TSCAP) (Carballes and Lebailly, 1968) methods for the study of deep levels in semiconductors (Sah et ul., 1969; 1970; Sah, 1976).
2. Technicul Ovendew of the DLTS Experiment The DLTS experiment can be regarded as a sequence of several steps. The first step is to populate the traps in the investigated volume with charges. The second step is to detect the change of the trap occupancy with electrons as a measurable change in some electrical parameter of the test structure. In classical DLTS, this is the small-signal high-frequency capacitance. The next steps are synchronous detection and averaging of this signal in order to improve the SNR. There are two main approaches at this stage of the experiment-use of either analog or digital signal processing methods (Fig. 13).
40
PLAMEN V. KOLEV AND M. JAMAL DEEN
I
+";: integratoi
amplifier
digital inethods
correlator
FIGURE13. Classification scheme of the signal processing of DLTS transients. The shadowcd areas indicate whcre the averaging techniques presented in this chapter belong.
1. Analog Methods The noisy transient signal can be measured and integrated over many pulses using analog instruments such as a boxcar averager (Kosai, 1982; Lang, I974a), a lock-in amplifier (Kimerling, 1976), or an exponential correlator (Miller et ~ l . 1975). , In the classical setup, two boxcar channels are used to measure the signal at two different times in the transient, and these times define a time constant window of the instrumentation (Lang, 1974a; Miller er ul., 1977). The averaging is performed by two boxcar channels and the difference of the signals from these channels vs the sample temperature is recorded, giving a DLTS spectrum. When the emission time constant of a deep level coincides with that of the instrumentation while scanning the temperature of the sample, the output signal indicates this coincidence with a peak in the spectrum. To apply the Arrhenius relationship to determine the energy level and the capture cross section of a trap, several temperature scans are needed using different settings of the time constant window of the instrumentation. Similarly, it is necessary to change the pulse frequency when using a lock-in amplifier, or the reference time constant when using an exponential correlator, and to repeat the temperature scan in order to obtain enough data for the Arrhenius plot technique. In all these cases, the averaging technique simultaneously achieves two different goals, increasing the SNR and analyzing the transient parameters. One serious disadvantage of the analog techniques is the need to perform more than one temperature scan. This practical difficulty can be eliminated, for example, by using more complex (and more expensive) instrumentation, or by analyzing the shape of the DLTS spectrum (Hjalmarson and Samara, 1988; Steele, 1986; SLIarid Farmer, 1990), instead of just finding the peak
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
41
temperature. Other disadvantages of the analog methods are their inherent limitations i n resolving closely spaced energy levels of defects, and difficulties associated with the analysis of nonexponential transients. Nevertheless, the analog methods are simple to implement, they produce an analog signal in real time during the experiment, and plotting this signal vs the temperature gives spectra that are available for immediate interpretation. 2. Digital Metliocls While the analog methods produce an integrated response to the transient signal, digital methods record the whole transient as a set of data points. The noisy signal is first digitized and then processed using various digital techniques (Asada and Sugano, 1982; Chang et d . , 1989; Doolittle and Rohatgi, 1992; Hanak et ul., 1990; Henini rt ul., 1985; Holzlein et NI., 1986; IkossiAnastasiou and Roenker, 1987; Ikeda and Takaoka, 1982; Jack et cil., 1980; Jervis et i l l . , 1982; Kirchner et al.. 1981; Losson et al., 1996; Morimoto et ul., 1987, 1988; Okuyama et al., 1983; Shapiro et nf., 1984; Valeur, 1978; Weiss and Kassing, 1988; Zitti et d., 1989). Analysis of the transient parameters can take place during the experiment, or data can be stored for analysis later. In common with the analog methods described here, the transient can be digitally correlated with a boxcar, lock-in or exponentially decaying function, and used to build a set of DLTS scans (Doolittle and Rohatgi, 1992). Alternatively, the transient can be analyzed using spectral analysis DLTS (SADLTS) (Morimoto et al., 1987, 1988), Fourier transform analysis (Ikeda and Takaoka, 1982; Okuyama et af., 1983; Weiss and Kassing, 1988), nonlinear least square fitting (Hanak et d., 1990), the modulation function method (Valeur, 1978), the method of moments (Kirchner et d., 1981; Ikossi-Anastasiou and Roenker, 1987), the correlation method of linear predictive modeling (Shapiro et d., 1984), or other digital methods described, for example, in Bevington and Robinson (1992). As the whole transient is recorded, it is necessary to perform only one experimental temperature scan, and this greatly reduces the time needed to perform the experiment. This is a major advantage over the analog methods, in addition to the extensive possibilities for analysis. However, digital methods have disadvantages as well. Because the whole set of data points representing the DLTS transient is recorded for each temperature, the total number of stored data points can be very large compared to that recorded using analog methods. Consider for example a digitizer fast enough to study transients with time constants of below 0.5 ms. Commonly available 12bit analog-to-digital converters (ADC) with conversion time of below 10 ps, which interface directly to a PC, are suitable for that purpose and their cost is below ten dollars. If operated at rate of 33.3 kHz, the time spacing between the data points would be 30 ps and, For simultaneous observation of slow transients with time constants in the tens of milliseconds range, there will be several
42
PLAMEN V. KOLEV A N D M . JAMAL D E N
thousand samples for one transient recording. For example, if the emission pulse is 90 ms, there will be 3000 samples for each transient recording. If the transient is recorded in I " intervals over a temperature range of 50-350 K , there will be a total of 900,000 samples, which occupy 1.8 Mbytes of disk space (each 12-bit sample occupies two bytes). The problem, however, is not so much the required disk space, but rather the difficulties in processing this large amount of data, as many of the digital methods involve iterative numerical calculations. The need to transfer 6 kbytes of data in just 90 ins is one more obstacle when the system is based on IBM PC and the processor operates in real mode, known also as MS-DOS mode. Another disadvantage of the digital methods, as compared to analog methods, is the decreased SNR for long delay times. Assuming constant noise, as the transient progresses, the signal decay leads to decreased SNR at the input of the signal processing apparatus. The analog methods have various ways of dealing with this problem. The boxcar technique enables one to compensate for the decreased SNR by increasing the sampling aperture of thc channel that is recording the second time delay point (Kirchner et nl., 1981; Day et id., 1979b). In the lock-in technique, an integral of the transient is processed and the high SNR at the beginning of the transient is averaged with the low SNR at the end. The correlator reference function, which multiplies the noisy signal transient before integration, also decays along with the signal, thus minimizing the influence of the low SNR at the end of the transient. However, in a digital system, the sampling aperture of the ADC remains constant, and so the SNR decrease in the tail of the transient cannot be compensated for. In general, both analog and digital DLTS signal processing techniques are mainly focused on the transient parameter extraction and on the ability of the method to resolve closely spaced energy levels of defects. Averaging to improve the SNR is considered important, but is a secondary task. While in the analog techniques these two tasks are merged, in the digital techniques, the transient analysis is separate and usually performed ufter some simple digital averaging is done. Most often, this is the multiple time averaging of a selected number of successive transient recordings (Doolittle and Rohatgi, 1992; Morinioto et a/., 1987, 1988; Okuyama et al., 1983; Hanak et al., 1990; Kirchner et ul., 1981) and this number is usually in the range of 100-300. This technique has three disadvantages. First, there is a need to allocate a memory buffer large enough to accornmodate the selected number of transients. In the preceding example it takes more than 300 kbytes, or approximately half of the conventional memory of an IBM PC. Second, after averaging, the buffer is cleared for processing of the new transient, and the old information is lost, thus making it more difficult to maintain high SNR. Third, the progress of the summed transient is not monitored as it accumulates in the allocated buffer and, if anything goes wrong and some adjustment and a restart are needed,
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
43
one is not aware of the problem until the full sequence of the selected number of transients has been accumulated. In our approach (Kolev et d., 1997a), we consider averaging of the noisy DLTS signal as a sepparatc task from the extraction of the transient parameters. We present two different, but complementary averaging techniques offering an increased time resolution for short delay times, improved SNR for long delay times, and more efiicient data storage. In addition, the transient and the DLTS scan are available i n real time for continuous observation and data analysis after the acquisition of each single transient.
C. Pseudn-LoRciritlimic Awrcigirig 1 . Theory
For detailed observation of an exponentially decaying signal, short sampling intervals are needed at the beginning of the transient, while much longer intervals are sufficient at the end. At the same time, high SNR of the processing instrumentation is much more important at the end of the transient, where the signal is weak, as compared to the beginning of the transient, where the signal is strong. One way partly to fulfill these requirements is to use a high sampling rate and short emission pulse at high temperatures, and to lower the rate and increase the emission pulse width at low temperatures (Okuyama et d., 1983; Hanak et al., 1990). This is easily done by using a programmable digital scope as a digitizer. Although this approach does not allow the simultaneous monitoring of both fast and slow transients, it is still possible to optimize the time resolution setting of the measurement system for a given time constant of interest. A much better way to satisfy the time spacing requirements is to use sampling with a logarithmic time base. This approach is not new in DLTS experiments. Morimoto et a / . (1987, 1988; Ikeda and Takaoka, 1982) described selecting from the measured 512 data points a set of 195 data points having almost logarithmic time distribution. The data points in (Wang et a/., 1993) also appear to have a logarithmic time spacing. To our knowledge, the best published work on this subject is described in (Doolittle and Rohatgi, 1992) where a high, but constant ADC sampling rate is used, all sampling results are transferred to the computer memory, and pseudo-logarithmically spaced data points are selected by specially written software. The pseudo-logarithmic storage scheme in (Doolittle and Rohatgi, 1992) uses a logarithm with base 2 because of the ease of unsigned integer division by 2" in binary arithmetic (Fernandez and Ashley, 1990). This simplicity is equally advantageous in hardware logic circuits and is used in our scheme as well. Another. probably less obvious advantage is the fact that the function 2" is closer to ~ x p ( n )
44
PLAMEN V . KOLEV A N D M. JAMAL DEEN
than the commonly used 10”. In our system, the sampling rate is 100 kHz and, therefore, the sampling intervals At are spaced 10 ps apart. If the data is averaged over time intervals t,,, with each one twice as long as the preceding one, the second averaging interval is 2At, and the n‘” averaging interval starts at t,, = (2” - 1)At. (22) If, for convenience, we number the intervals starting from zero, which is a commonly used standard i n programming and digital electronics, for 2” >> 1 the index n can be expressed as
n
%
log, t , ) / ( A t ) .
This represents a logarithmic dependence of the interval index on the interval time length, as the sampling interval At is fixed by the constant sampling rate of the ADC. Unfortunately, the pseudo-logarithmic storage scheme proposed in (Doolittle and Rohatgi, 1992) has Feveral disadvantages. First, it is still necessary to allocate very large memory buffers, as the data points are selected cfter the full set of transient sampling points is recorded into the computer memory. Sampling each 10 ps, there are 131,072 sampling points that are stored in a 256-kbyte buffer. From this large set of data, only 768 data points are selected and used, while the remaining 130,304 points are discarded. Second, more buffers are needed for the multiple time averaging of successive transients as described in the preceding. iind the result is available for observation after the selected number of transients is averaged. Third and most important, there is a substantial decrease in the SNR, especially in the tail of long transients, where the data point intervals are in the millisecond range, because the data points are selected from just 10 ps long sampling intervals. In addition, recording of 768 data points for each degree in the 50-350 K range still requires almost 0.5 Mbytes disk space. Also, direct memory access (DMA) transfer into memory blocks larger than 64 kbytes is complicated as it requires continuous initialization of the DMA controller for crossing the memory page boundary (Royer, 1987). The main difference between our pseudo-logarithmic averaging scheme and that described in Doolittle and Rohatgi (1992) is the method of obtaining the data points. In Doolittle and Rohatgi; ( I 992), the data points are srlected from the large set of sampling points, but in our system, the data points are Liverciges o j the satnpling points inside the averaging intervals, which increase in a pseudo-logarithmic manner (Austin, et d., 1976). In this way, ~ 1 1 1measured samnples are used. Although our electronic circuit implementation is different, the averaging principle (Fig. 14) is the same as in (Austin, et al., 1976). The
DEVELOPMENT A N D APPLICATIONS OF A NEW DLTS METHOD
45
M = 5 for all groups ti = groups nunibcr 0 ADC sampling A date point
sanigJletl trilnsient n
Ill
11=0;
,I=
I
j
It
I
0 FIGURE
10
=5
I
I
I
I
30 Time [sampling intcrvalsl
20
I
I
I
I
I
/
40
.
=3 .
.
.
_
_
, 50
14. Pseudo-logarithmic time averaging scheme. After each 5 data points the number 2” doubles as II increases by 1 .
of averaged samples
logarithmic time spacing algorithm already described gives only about three data points per decade. To increase this number, the length of the averaging interval in the pseudo-logarithmic averaging scheme is doubled after a preselected number M of averaging operations, thus producing a linearly spaced train of M averaging intervals. In this way, the pure logarithmic time spacing is mixed with linear time spacing, and hence the name p.seud[~-loRarithrnic. The sampling interval remains the same, 10 ps in our system, and the result is a sequence containing ( N I ) groups of averaging intervals m,,. Inside the group n , each interval m,, has the same length, but it is twice as long as an interval from the preceding group rn,,-~.or half of the time of an interval from the succeeding group m , , + ~Because . the maximum number M of averaging intervals m,,inside each group is the same for all the groups, the time lengths of the groups change in the same way. Therefore, we have ( N I ) groups, each one containing M equal averaging intervals m,,, but with the time length of the groups increasing as 2”.
+
+
2. Error Anulysis At first glance, this scheme for averaging of the sampling points may be understood as an approximation of the exponentially decaying function with a set of straight lines connecting the obtained data points (piecewise linear approximation). Indeed, the obtained set of data points does not contain explicitly any information about the time intervals between the data points -this information remains hidden in the summing results from which the data points were obtained. However, the data points themselves were obtained by summing of
46
PLAMEN V. KOLEV AND M. JAMAL DEEN
the true experimental signal, sampling each A t , and dividing the result by the number of the samples in the averaged interval. For a given group with an index n , the largest error is always expected to occur in the first of the averaging intervals m,, with m = 1, where the signal varies more rapidly as compared to its variation during the subsequent intervals from the same group with nz > 1. Therefore we need to consider only the worst case when the averaging is done from ts,,,M = At.M.(2" - 1) to te,,,M = At.M.(2" - I ) At.2". The result of the averaging process is assigned to the middle of the averaging interval defined as tnz,,,M = At.M.(2" - 1 ) At.2"-'. Thus, the error function Err(t) can be obtained by subtracting the magnitude obtained by averaging of the exponentially decaying function in this interval from the exact magnitude in the midpoint rm,,,M as
+
+
Err(t)= C .exp
.exp
I?-[
2"
.xexp
At
[-T . i ]
(24)
1=I
where C is the transient magnitude at the beginning of the transient. When Err(r) is plotted vs r for B given set of A t and M , this function exhibits one sharp maximum at some tin+, which dominates short time constants for small t i . At long time constants, this maximum is compensated for by another sharp minimum that begins to dominate for large n at some tnr-< tnl+. Away from these peaks, the error function is close to zero. We find these particular t,,,+ and r,,,- by setting the first derivative vs r equal to zero and solving numerically (Appendix A) with convenient choice of the initial guess value for r
-Cat
where y , , , ~is M ( 2 P - 1). Next, we replace the values obtained for tm+ and tnl-in Eq. (24), which gives the maximal errors Err(r,,,+) and Err(r,l,-) when using the averaged values in the mrr interval with m = 1. This function was normalized relative to the initial magnitude C and plotted against t,,,+and r,,,- in Fig. 15 with parameters n and M for two values of A t . There are two distinct reasons for the errors in this pseudo-logarithmic scheme.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
47
10
I
L
b 0.0 I
0.0 1
0.1
I
10
100
L,,, lmsl FIGURE15. Normalized absolute magnitude and time coiistant errors plotted vs the time constant t,,,,, at which these errors occur. The straight lines indicate independence from the gi-oup number n or number of equal averaging intervds inside the group M and dependence only on the sampling rate. Inverted triangles indicate sampling with I MHz rate and M = 10. The right line with hollow symbols gives the errors in determining the time constant error sampling with 100 kHz.
First, the limited speed of the ADC gives an error that is strictly dependent on the sampling interval Af, in our case 10 ps, and it does not depend on the number of the averaged samples 2” or the number of the equal intervals inside the group M . This gives the straight line, delnonstrdtes increased error for short time constants. Obviously, this error is not inherent to the pseudo-logarithmic averaging, and can be decreased only by using faster ADCs. The second type of error is more pronounced for longer time constants and it depends much more on M than on 2”. Figure 15 demonstrates that for time constants longer than 1 ms the errors are below 0.1% if the groups are divided into five or more averaging intervals. Calculations with a 10 times faster ADC sampling rate show a 10 times reduction of the errors for fast time constants. For transient analysis, it is attractive to use a boxcar function obtained simply by subtracting two data points with time delays tl and f 2 . Because each data point represents an average with time width W, it is interesting to evaluate the error in defining the rate window. We follow the procedure outlined in (Day et d., 1979b) with a slight modification. Instead of integral averages, we use summing averages in order to model the actual function of our circuit accurately, and to account for the limited rate of the ADC. Furthermore, we subtract only the first subintervals in two adjacent groups where the maximal error is expected to occur. The normalized output S(t) is
48
PLAMEN V. KOLEV AND M . JAMAL DEEN
then given by
We find the rate window by differentiating S ( r ) with respect to r and setting the result equal to zero. The solution can be found numerically (Appendix B) to give r,,,,. We calculated r,,, for variations of 17 and M and compared the values with those obtained using the Lang's expression using the midpoints of the averaged intervals, which is
The normalized error in determining rcn= (rkax- rmax)/rmax is also shown in Fig. 15. One should mention that because of the logarithmic dependence from r on the Arrhenius plot errors for r in the range of 1 % can be ignored (Day et a / . , 1979b).
3. Ir~i~~lernrritcrtion There are two possible ways to implement the averaging scheme described here. It might be possible to use the same hardware as described i n (Doolittle and Rohatgi, 1992). Instead of selecting only 768 data points from the whole set of sampled points, fast software could be used to change the length of the averaging interval, perform the actual averaging of the sampled points inside the interval, and store the corresponding data points. Equation (24) could give the starting point for such an implementation. There are two potential problems with this approach. First, there may be some d processing during the first three groups, when there is no averaging or only 2 or 4 samples are averaged per data point. In processor real mode, fast data transfer can only be done with direct memory access (DMA). As the DMA process actually "steals" cycles from the PC processor, an intensive DMA transfer can considerably slow down program execution (Royer, 1987). Thus, the DMA transfer can cause timing conflict with the real-time averaging routines. Therefore, we give preference to the hardware implementation shown in Fig. 16. In our circuit, one can distinguish four functional blocks: an ADC, a pseudologarithmic pulse generator, an averager, and an interfacing block. The ADC
DEVELOPMENT A N D APPLICATIONS OF A NEW DLTS METHOD
49
FIOIIKF.16. Block diagram reprcscntation of pseudo-logal.ithmic time hase averaging circuit.
operates continuously at a fixed rate of 100 kHz. The pseudo-logarithmic pulse generator (Appendix C) is constructed from programmable counters and shift registers. I t incorporates a programmable divider of the clock frequency, which is set at the beginning of the transient t o divide by 2", because for the first group 11 = 0. Of course, i n practice this means no division ;it all, because 2" = I . When the counter of linear pulses m reaches a predetermined number M , it is cleared to start counting again, and this produces a logarithmic pulse 1 1 , thus changing the setup of the programmable divider to 2 ' . This doubles the time between each one of the next M pulses. When the linear pulse counter reaches M again, ti increments by one, and this sets up the divider t o divide the clock pulses by 2'. This sequence is repeated until the logarithmic pulse counter reaches the software-programmed number of groups N . The averager is made by combining an accumulator, shift register, and a prograininable down counter. The end-of-conversion (EOC) signal from the ADC triggers an adding operation that adds the result from the current conversion to the stored sum of the previous conversions. This accumulation of conversion results is repeated over the whole length of the averaging interval. When the new linear pulse arrives, it stores the accumulated result into the shift register, which must be large enough to accommodate the whole sum. Then the accumulator is cleared for storing the new sum while the result in the register is divided by 2" by being shifted right IZ bits by the down counter. As the down counter is programmed by the same logarithmic pulse counter H , which is programming the clock frequency divider to divide by 2", the accumulated results are always correctly divided.
50
PLAMEN V. KOLEV AND M. JAMAL DEEN
After the averaging operation is over, an interfacing circuit signal requests the DMA controller of the PC to transfer the averaged result to the specified memory location. After the last result from the last octave is transferred to the PC memory, the logarithmic pulse counter is cleared and ready for the next transient. Simultaneously, the interfacing circuit produces a signal that triggers a hardware interrupt routine for performing the second averaging technique that will be described in this section. The operation of the whole circuit is synchronized by a 16 MHz system clock, and the ADC sampling frequency is obtained by dividing the system clock frequency by 160. The maximum allowed numbers for N and M depend on the hardware circuit. In our system, N,,,,, = 12 and MI,,i,X= 16, which gives a total of 208 pulses (13 x 16, because N starts from zero). This is enough to store transients of up to I .3 s long, with 10 ps and 20 ys resolution, and with 32 data points in the tirst 0.5 ms. In addition, we have a programmable delay of up to 160 ps i n order to compensate for possible slow response of the capacitance meter. The intervals in the last group are 40.96 ins long, each one averaging 4096 J4096 or 64 times samples. Therefore, the expected SNR improvement is (Wilmshurst, 1990). However. i t refers to points inside the averaging interval and not for the whole transient, and this implies that the SNR improvement will be less for low-frequency noise. If we use the averaging scheme proposed in Doolittle and Rohatgi (1992), then each data point from the last group (the second half of the transient) would be just one selected ADC sampling result out of 4096 ADC samples. In our averaging scheme that same data point represents an average of 4096 ADC samples. This gives a major advantage of our averaging technique for long delay times over the previously used technique. Another advantage is storage efficiency. Because of significant SNR improvement, even long transients can be recorded with just 208 points and the disk storage space for a DLTS scan in the range 50-350 K occupies less than 125 kbytes. Other advantages are the reduced size of the required buffers (in our system we allocate just two 420-byte buffers) and the large amount of available time for the microprocessor to process and display the results while the measurement is in progress.
-
4. Demonstrcitiori In order to demonstrate the benefits of using the proposed pseudo-logarithmic averaging scheme, we recorded a real DLTS signal when our system was operated in constant-capacitance mode. We used an analog memory feedback (Kolev, 1992), which eliminates the distortion of the voltage transient introduced by the integration of the voltage step from the filling to the emission pulse, and provides a stable zero baseline (Shiau er al., 1987a). This feedback was used in combination with a Boonton 7200 capacitance meter. The measured sample is a p-Si MOS capacitor with insufficient annealing after
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
0
100
200
300
400
so0
51
600
Delay time [ins]
FIGLJR~, 17. a) Original signal sampled each 0.2 nis with 12-bit resolution: h) same signal aftcr multiple time averaging ol’ 32 convxutivc transients.
the ion implantation for correction of the threshold voltage. More details can be found elsewhere (Kolev, 1992). The traces i n Fig. I7 were recorded using DMA transfer and a commercially available data acquisition board. Trace a) appears in the same way as the signal seen on the oscilloscope screen, and trace b) was obtained with multiple time averaging of 32 consecutive transients. This trace gives an idea of the expected improvement in SNR using a simple averaging technique. It should be noted that real-time observation of these traces on the computer screen was impossible. Also, we demonstrate our system mainly with a relatively weak DLTS signal and with 60 Hz electromagnetic interference present, in order to demonstrate better the complementary action of both averaging techniques presented in this section. For the same reason, we intentionally recorded transients longer than those normally used in DLTS. In Fig. 18 are shown four traces. The trace denoted with k = 0 ( k will be defined later) was obtained using only pseudo-logarithmic averaging. At first glance, the advantage of using this technique alone is not obvious. Although the noise magnitude inside the averaging intervals is significantly reduced, the trace shape is substantially different from the expected exponentially decaying transient. The reason for this unsatisfactory result is because the pseudo-logarithmic averaging technique is used inside the averaging intervals and, therefore, it cannot suppress low-frequency noise components. However, note the transition to longer averaging intervals around 320 ins and its effect on the curve shape. In order for the forementioned digital averaging circuit to operate correctly, it must be assured that essentially all the averaged
52
PLAMEN V. KOLEV AND M. JAMAL DEEN
transients are inside the limits of the ADC. We use a digital-to-analog converter to display the averaged data points on the oscilloscope screen, which helps us to detect ADC input overloading.
D. Contiiiuous Time Avercigiizg 1. Theot:v
The pseudo-logarithmic averaging proposed here is a very efficient technique for reducing the number of data points and for SNR improvement at relatively high frequencies and large delay times. However, it needs to be complemented with another averaging technique that can suppress low frequency noise components and improve the SNR at the beginning of the transient. We found that continuous time averaging (Wilmshurst, 1990) is a very convenient technique for these purposes. In addition, it has the advantage of allowing continuous transient display after each pulse and the size of the allocated memory buffer is independent of the number of averaged transients. The continuous time averaging mode is similar to the running average formed by a low-pass filter. In this mode, the result of the ltist n , transients is available at any time for display and for other data processing. Unfortunately, it is not convenient to apply the low-pass filter directly to the: multiple time averaging scheme because the data points it averages are consecutive in time. In contrast, multiple time averaging processes the whole set of transient data points in parallel, that is, each data
0
100
200
300 400 500 Dclay time [ins]
600
FIOIJRI~, 18. The signal of trace a) in Fig. I7 after only pseudo-logarithmic time averaging for k = 0. Fork > 0 , pseudo-logarithmic time averaging is combined with continuous time averaging with tinic constants 2'. The inset shows good time resolution during first 2 nis with k = 3.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
53
point is averaged with corresponding data points from the other transients that are at the same delay point from the start of the transient. In this way, the data points to be averaged are separated by one or more pulse periods and are not consecutive in time. Therefore, we need a method to adapt the low-pass filter function to be applied separately for each data point of the transient. For a simple asymmetrical first order low-pass filter consisting only of a resistor R and capacitor C, the voltage increment of u[) is (Wilnishurst, 1990)
where T , = RC is the filter time constant. Let us consider u,,, digitized at short sample intervals 6 t . Then in discrete form, Eq. (28) becomes
where n,& is the digital time constant cotresponding to the analog time constant RC. Therefore, in order to implement a lowpass filter function on each data point of the transient. we have to replace the simple summing and dividing algorithm with W) + 141 (uirr - V O ) / ? ~ / (30)
+
7
which means, that the value u()of the data point stored in the computer memory buffer is updated after each new transient with a fraction of its difference from the corresponding data point v i I r from the new transient. If the starting value stored in the buffer is zero, then in the beginning of the averaging process the difference is large, and the value in the buffer quickly grows as a new value is added to it after each new transient. As the value stored in the buffer approaches that of the new transient, the growth rate decreases and eventually when the stored value is nearly equal to that o f t h e new transient, the growth terminates. After this, the value i n the buffer becomes largely stable and it reflects only changes in the incoming transient that are sustained long enough to be comparable to the digital time constant. This evolution of the transient stored in the computer memory buffer is continuously monitored on the computer screen where each data point looks like it was produced by a separate “virtual” boxcar channel. This is not a surprise, as a real boxcar channel performs exactly the same averaging operation. Of course, our “virtual” boxcar channel differs from the real one. Each “virtual channel” has a fixed time delay and aperture. Fortunately, the pseudo-logarithmically averaged interval length, or the “virtual aperture,” is conveniently self-adjusted depending on the time delay, new “channels” can be easily created, and selecting the averaging time constant t i , & can be fully automated and varied throughout the experiment. Because 6f remains constant, only t i I is varied. For convenience, i n our system rzl is represented as 2“ and the actual change
54
PLAMEN V. KOLEV AND M. JAMAL DEEN
of the digital time constant is made by selecting k . The SNR increases with the square root of the number of averaged transients so it is expected that the SNR will depend linearly on k . 2. Demonstrution The results of the combined action of both techniques (pseudo-logarithmic and continuous time averaging with variation of k ) are shown in Figs. 18 and 19. A high time resolution in the beginning of the transient is seen in the inset in Fig. 18, and a very good SNR is demonstrated in the inset in Fig. 19. The trace with k =: 2 seems almost identical to that with k = 3 but in fact it contains more noise at low frequencies comparable to the inverse of the pulse period. Note the difference in the second half of the transient. Also, the trace with k = 4 in Fig. 19, compared to the trace with k = 3 in Fig. 18, appears more noisy because the scale of the vertical axes is different. The noise fluctuations in the inset in Fig. 19 can be estimated to be in the range of 10-30 pVppp.Compared with the noise magnitude as seen in Fig. 17a, which is about 10 mV,-,, SNR improvement is substantial. The 12-bit resolution of the ADC (or that of the pseudo-logarithmic circuit) is always much better than the vertical resolution of the computer screen; therefore, quantization errors are not visible. In addition, the 12-bit resolution can be effectively boosted hy the averaging process to 16-bit resolution, because the computer operates with 16-bit “words” (Wilmshurst, 1990; Doolittle and Rohatgi, 1992).
k = 4
0
100
200
300 400 so0 Dclay time [ins1
600
F i w w 19. Comhined pseudo-logarithmic and continuous time averaging with different time constants 2k. The inset shows very good SNR at the end of the transient with k = 10. Input signal is seen in Fig. 17.
5s
DEVELOPMENT A N D APPLICATIONS OF A NEW DLTS METHOD
3. ltiipleriientmtion The continuous time averaging routine' is activated by a hardware interrupt immediately after the transfer of the last pseudo-logarithmically averaged data point of each transient. For each data point of the received transient, the subprogram shifts left 4 bits, which is equivalent to multiplying by 16. Next, it subtracts the corresponding 16-bit value, stored in the reference memory buffer, then shifts right k bits and finally adds the result to the old buffer value to be stored as a new value. This is exactly the low-pass filter algorithm implied in Eq. (30). Because the averaging is performed over 16-bit digits, the digital time constant can be as large as 2"-1 or 32,767, which means the maximum increment or decrement lo the 16-bit value stored in the buffer is just one bit. However, in practice this is a very inconvenient choice. Consider, for example, a relatively fast transient recorded during a 20-ms-long emission pulse period. There will be just 50 averaging steps per second, and to change the stored value e times will require 32,767s/S0 or roughly 1 1 min. To reach the new value within 1 o/o error, one has to wait almost I hr. Besides, a 20-inslong emission pulse period does not allow the pseudo-logarithmic averaging technique to be used efficiently. E. Applications
I . Loizg Trmsierits The averaging techniques suggested here are not limited only to DLTS measurements. The main goals achieved by these techniques are substantial SNR improvement and the efficient reduction of the number of data points. In DLTS, these techniques should be considered as preprocessing steps followed by the actual transient parameter analysis, which can use all of the digital processing methods listed earlier (Doolittle and Rohatgi, 1992; Morimoto et ul., 1987, 1988; Ikeda and Takaoka, 1982; Okuyama et ul., 1983; Weiss and Kassing, 1988; Hanak et ul., 1990; Valeur, 1978; Kirchner et ml., 1981; IkossiAnastasiou and Roenker, 1987; Shapiro rt nl., 1984). Furthermore, because of the reduced number of data points and increased SNR, it can be expected that these techniques will give better results. Alternatively, as demonstrated in Doolittle and Rohatgi ( 1992), weight functions well known in analog methods, such as boxcar, lock-in amplifier. and exponential correlator, can also be used for transient analysis. When SNR is not of concern, the boxcar function is particularly attractive because it gives Free for download at ht~p://www.ensc.sfu.ca/C;~.adStudcnts/kolcv/DLTS.htmlor http://ww~.GeoCities.~~n~SiliconValley/Piiie~/~X~9.
at
56
PLAMEN V. KOLEV AND M. JAMAL DEEN
higher resolution of the peaks. In our system, we rely mostly on the use of the boxcar function with time delays t? = 2tl by simply subtracting two selected data points with the same index m but from different groups IZ and n 1. The corresponding DLTS scan is displayed in an inset on the computer screen during the thermal scan, overlapping the transient display. The time delays can be varied and the resulting DLTS scans can be seen in real time. Thus both the transient signal and the DLTS scan are updated on the computer monitor after each pulse, similar to what it would be if displayed on an oscilloscope screen. Whereas the data points for the Arrhenius plots can be obtained during the experiment, the analysis of a given trap can start as soon as the scan of the temperature interval corresponding to the measurable trap time constants is completed. Figure 20 shows several transients recorded for each degree of temperature change. The magnitudes of the traces were adjusted to fit on the same chart and are not to scale. This adjustment was made in order to demonstrate the benefits of the very high SNR, which allows for making obvious the observed time constant changes. These results suggest the option of using isothermal analysis DLTS (Okushi and Tokumaru, 1980; Yoshida et LII. 1993), which can minimize the effects of thermal dependencies of the capture cross-section and transient magnitude. For this purpose, we need to determine the emission rate without relying on its temperature dependence; for example, by displaying the transient on a logarithmic scale. Figure 21 shows several transients at the same temperature (47 K), but with different settings of the reference capacitor and the corresponding different reverse steady-state bias voltage across the sample. For comparison, we
+
ion - implanted
\
-
o
,
I
*
.
ioo
zoo
I
I
400 Delay tiinc 1ni\1 300
I
8
5oo
.
I
6on
FIGURE 20. Temperature variation of the transient time constant as ohserved during m a surements. The curves demonstrale the need for better temperature resolution at low temperatures. Magnitude is not to scale. The temperature was maintained with accuracy better than +/-0. I K.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
57
-=--;y
-.
ion-i m plm ted p-MOS capacitor at 47 K
4v RC
.sv OV
0
200 400 Delay time [ins]
600
Ficitmf; 2 I . Time constant variation at fixed temperature of 47 K and variable reverse bias 0 to 8 V. The time constants calculatcd from the slope 0 1 the trace vary lrom 200 i n s at zero bias to 158 nis at 4 V. Note the departure froiii the exponential hchavior at 8 V bias during the first 5 0 nis. For comparison is shown a transient obtaincd by difierenliating a squarc wave with an RC high-pass tiller is a150 shown.
also recorded a true exponential transient with the magnitude and the time constant adjusted to be close to the real DLTS signal. This true exponential transient was obtained by differentiating a square wave with an appropriate RC filter. The slope of the traces gives time constants decreasing from 200 ms to IS8 ms. The decreased transient time constants, corresponding to low values of the reference capacitor or higher reverse bias demonstrate the effect of field-enhanced emission (Couturier et d., 1989). It is important to note here that the change in the slope, and the corresponding change in the transient time constant, can be easily monitored at a fixed temperature. This allows for appropriate adjustment of the measurement setup during the experiment. For example, by adjusting the voltage and monitoring the slope of the trace, we can avoid the conditions of field-enhanced emission, and at the same time, obtain a large signal that improves measurement sensitivity.
2. DLTS Spectro In the measurements in Figs. 18-21, the pseudo-logarithmic settings were N = I 1 and M = 16, the temperature variation was limited to between 40 and SO K, and all transients were associated with just one trap. Indeed, with less than two transient recordings per second and the digital time constant set to 29, about 600 s were needed to obtain good averaging results. Clearly, these settings are not very convenient for a full-range thermal scan, and this setup is
58
PLAMEN V. KOLEV AND M . JAMAL DEEN
better suited for isothermal measurements when a significant noise reduction is essential. When the signal level is high compared to the noise, we set the digital time constant to 24 through 2', and then we can store transients up to 1.3 s long for the full range of the thermal scan. For scanning temperature measurements of noisy transients, we use a setup of N = 8 and M = 10, which is enough to record a 50 ms long transient with just 9 0 points. With about 20 continuous time averages per second, this setup allows us to use digital time constants of up to 2"', depending on the temperature scanning rate. For example, if the setup is 2", then to change the values of the stored transient e times takes less than a minute. Recording the transient at each degree temperature, and with a temperaturc scanning rate of 0.8 K/min or less, there is still enough tirne lor averaging, because the time constant of the transient varies far less than e times for each degree of temperature change. This is especially true at temperatures above 100 K where the DLTS signal varies slowly with the temperature. Using this setup and the simple boxcar technique already described, a series of six DLTS spectra were obtained (Fig. 22). There are five well-defined peaks present. The shape va.riation of the large peak at the highest temperature suggests the existence of a sixth peak, which is dominated by the large peak. This peak appears more on the curves corresponding to a short time constant setup, and it is a rather strange result because it is well known that the DLTS peak resolution improves for setups of long time constant windows (Doolittle and Rohatgi, 1992). However, further analysis shows that this false peak can
p-MOS capacitor
f =
50
I ; 2 : s; 10; 20; 37
I00
Ills
150 200 Ternpcrature [ K ]
250
300
FIC~UKE 22. Series of six boxcar DLTS spectra with channel delay ratio 12 = 211 obtained from onc thermal scan. The change in the background level seen at 100 K is lor r = I ms and r = 2 ins. The change in the magnitude for longer time constants is probably due to temperature variation of the capture cross section.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
59
be associated with signal overloading, which affects the first half of the stored transient more than the second half. Figure 23 shows the Arrhenius plots obtained using the same data file. As outlined in Benton ( 1990),obtaining the energy level positions and capture cross sections is only the first step toward impurity or defect identification. We limit the scope of this section to demonstration of the averaging techniques. The trace in Fig. 21 with lowest influence of the field-enhanced emission is fitted well by an exponentially decaying signal with a time constant of about 200 ms. In the rightmost Arrhenius plot in Fig. 23 for the trap around 50 K, this time constant corresponds to 47.6 K. This again supports the need to record more than one transient for each degree temperature at low temperatures. Using a setup of tl = 10.3 ms and t? = 20.4 ms for the rate window, we have found the peak temperatures of the traps appearing in Fig. 22. The recorded transients at these temperatures are shown in Fig. 24. Signal overloading of the transient recorded at 251 K is seen at the beginning of the trace for delay times of less than 5 ms. This is the reason for the false peak appearance on the low-temperature side of the large peak in Fig. 22. Therefore, the observation of the whole transient can easily prevent the incorrect interpretation of the shape variation mentioned here. In the inset of Fig. 24 are shown details of the same transients during the first 2 ms. This can further improve the rejection of incorrect data for transient analysis. One simple way to check the transient distortions at the beginning of the transient that are difficult to see is to display the transient on a logarithmic time scale. X
4
2
SO
I00
I so
200
250
I/kT l e v - ' ]
23. Arrhenius plots obtained from the scan data file used in Fig. 22. The energy level positions of the hole traps are in eV ahove the valence hand a s indicated. Note the good linearity of the data points. FlGLIRF
60
PLAMEN V. KOLEV AND M. JAMAL DEEN 2,s 251 K 2.0 53 K 1x9 K
5 .o I
0.S
Delay liinc Inis]
61 K
I42 K
0.0 0
I0
20
30
40
so
Delay lirnc [ins1
FIGLIRE 24. Series of transients at the peak temperatures for sctup f 1 = 10.3 ins and 12 = 20.4 nis of the traps in Fig. 21 (same data file). Details of the first 2 ms are shown in the inset. Note the overloading at 25 I K .
In Figure 25 is shown the same series of transients as in Fig. 24, but on a logarithmic time scale. Here one considers the data above 0.4 ms because below this limit the recorded data points reflect the capacitance meter recovery process. The switching time of the feedback circuit is below 1 ys and it does not introduce any significant delay. Another possible source of relatively slow recovery time is high series resistance of the sample. From the point of view of demonstrating the averaging techniques, it is interesting to note the smooth shape of the curves during instrument recovery. This is possible using only very high time resolution with a good SNR in the beginning of the transient capture. One of the curves in Fig. 25 (at 53 K) is presented as a set of data points in order to demonstrate their even spacing on a logarithmic time scale. For comparison, another CC-DLTS trace recorded with the same setup is also shown. The measured sample was a junction field-effect transistor subjected to neutron irradiation. This trace proves that the feedback and the biasing circuit are not the source for the distortion in the remaining traces. F. Conclusions
In this section, a new approach to the digital signal processing of DLTS signals by separating the noise and data reduction from the transient analysis was introduced. The combined action of two complementary digital averaging techniques to improve the DLTS digital signal processing was demonstrated. Pseudo-logarithmic time averaging is efficient in reducing the number of
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
61
2.5
2.0
-> 1.5 -E
5 1.0 0.5
0.0
I
0 .I
" I
I0
SO
Delay time I iris] f k i l l R E 25. The data of Fig. 24 plotted on a seinilogarithmic scale to display details ;it the beginning of the transient. The signal before 0.4 nis is distorted because of the instrument recovery after the tilling pulse. The dots in the trace at S3 K show the data points evenly spaced on a logarithmic scale. For coinparison is shown ii CC-DLTS trace obtained irom neutron irradiated junction field effect transistors and reduced 25 times i n mngnitude.
processed data points and improving the S N R for high-frequency noise components and for the transient tail. Continuous time averaging is well suited for improving overall SNR, for continuous data display and processing, and it is more efficient in using the computer resources. The described combination of hardware and software tools for implementation of these techniques supplies continuously fresh data and does not require any synchronization with the main computer program. The proposed techniques allow one to combine the powerful transient analysis of the digital DLTS methods with the sensitivity and convenience of the analog methods. Furthermore, the developed averaging techniques can be easily adapted to data processing in many other experiments, where the sampling is performed at a fixed rate, but the results can be adequately presented on a logarithmic scale.
Iv. FEEDBACK CIRCUITS AND EXPERIMENTAL SETUP FOR CC-DLTS AND CR-DLTS
A. Iiztrodiution Since the introduction of DLTS by Lang (1974a), a large variety of modifications and improvements have been reported. When a capacitance meter is
62
PLAMEN V. KOLEV AND M . JAMAL DEEN
used, there are two possible modes of operation. In constant-voltage mode, one can measure the capacitance transient (CVCT). The second way is to keep the capacitance constant by using a feedback circuit (Pals, 1974; Goto et a/., 1973; Johnson, 1982; Miller, 19'72; Li and Sah, 1982a; DeJule et al., 1985; Klausmann, 1986; Shiau e t a / . , 1987a; Kolev, 1992; Kolev and Deen, 1998) and to measure the voltage transient (CCVT). Combining DLTS and CCVT modes has many advantages. As Johnson (1979) has shown, because the capacitance is constant, the depletion layer width also remains constant and, therefore, the change of the net charge trapped in interface states is directly proportional to the measured voltage transient for any interface trap density. Furthermore, the proportionality factor does not depend on the temperature, doping concentration, or doping profile. With double correlation technique, CCVT can be used for both deep imperfection profiles measurements Lefkvre and Schulz (1977a) and energy-resolved interface trap measurements (Johnson, 1979). In the classical setup, shown in Fig. 26, the capacitance of the DUT is forced to be equal to that of the reference capacitor connected at the time. This is done by using a large-gain compensation amplifier OA in the feedback loop to adjust the bias voltage A V across the sample. In practice, for this purpose an integrator with a sufficiently long time constant r = HlCl is used to prevent any oscillation in the system, by reducing the total gain in the feedback loop for high frequencies. The cut-off frequency of the integrator is set depending on the properties of the sample at the quiescent point. However, the reduced speed of the feedback amplifier causes the voltage transitions between the filling and emission pulses to be integrated, thus distorting the correct signal. The small
F K ~ O K26. E Classical setup for constant-capacitance measurements. Thc feedback loop contains: DUT, capacitancc meter, and a high-gain compensation amplifier OA with cut-off gain . P is alternating the connection between the refercnce for high frequencies set by R L C I Signal capacitor for the filling pulse Cf and that for the emission pulse C,. Shown is also the voltage applied to the DUT containing fast (before moment t) and slow transients.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
63
voltage transient, which compensates for the change in the trap occupancy with charge, is outweighed by a large transient caused by the integration of the voltage step. In Fig. 26 this occurs mostly before the moment t . The distortion is significant for low-density impurities and for measurements with short delay times (before t in Fig. 26). Taking into account that most often the voltage step is in the range of hundreds of millivolts or even several volts, while the voltage transient caused by charge emission from the traps is usually below I mV, the distortion is a serious problem that significantly reduces the useful part of the recorded transient. The large transient is difficult to account for because the effective gain d C / d V of the DUT is generally unknown and temperature dependent. A very good solution to this problem was proposed by Shiau et ul. ( 1 987a). In addition to the integrator in DeJule et (11. (1985), an additional local feedback loop, which the authors call a “memory circuit,” was introduced. Its purpose is to store the bias applied on the sample at the end of each pulse and to apply this bias as a baseline at the beginning of the next cycle. Thus the slow integrator needs to follow only the transient caused by the traps and the summing circuit is used to combine the total bias voltage. Unfortunately, the sensitivity reported in Shiau et d.(1987a) (about 1/500 of the dopant density) is too low for measurement of interface state densities at the Si/SiO;! interface. Following the basic idea proposed in Shiau et a/. (1987a), a new “memory circuit” has been created with some improvements. The sensitivity has been increased up to 1 OWs of the dopant density, and measurements with delay times as short as SO/IOO ps are still possible. Furthermore, the concept of alternating the feedback circuit configuration synchronously with the sequence of filling and emission pulses is the technical basis for the new variation of the DLTS technique that we called constant-resistance DLTS. R. Feedbuck Circuit urid Detuils on the Setup , f . r CC-DLTS
The feedback circuit is shown in Fig. 27 and it is similar to the circuit in Shiau et al. ( 1 987a). It consists of an integrator OA, a summing amplifier C, and four sample and hold amplifiers S&H-I through S&H-4. Using these sample and hold amplifiers, two parallel analog memories are built, each one providing the corresponding baseline voltage for filling or emitting pulses at the input of the summing amplifier. We call them “emission pulses memory” (S&H- 1 and S&H-3 in Fig. 27) and “filling pulses memory” (S&H-2 and S&H-4 in Fig. 27). Only the noise introduced by the “emission pulses memory” is important for DLTS applications. I n order to reduce this noise, two RC lowpass filters R2C2 and R3C3 are applied in the “emission pulse memory.” The time constants of the integrator R I C I and the lowpass filters are adjusted with respect to the boxcar averager delay times. Further noise reduction is achieved
64
PLAMEN V. KOLEV AND M. JAMAL DEEN
Y
1
CJ
bb
0
I
i
0 I
1
R
.3 a .-0
CJ
E * X Y r
-
SWITCHING CONTROL FIGURF. 27. Improved feedback circuit-details. Shadowed area denotes the “filling pulse memory.” which can he omitted when the overloading during the filling pulse can he ignored. w
by using commercially available sample and hold amplifiers. They are used in unity gain mode and are connected with large storage capacitors CI and C5 (-2 pF high quality type, e.g., polystyrene). These capacitors are connected via -I kR resistors (not shown in Fig. 27) to further increase acquisition time.’ As a result, the acquisition time rises to several hundred milliseconds or more, and it can be adjusted by varying the resistance in series to the storage capacitors Cq and Cg. In the hold mode, because the input current of the reading amplifier in the sample and hold circuits is very low, the voltage drop across these resistors can be neglected. The rise of the acquisition time does not affect the system performance, but the noise reduction for low-frequencies is significant. An additional benefit is the reduced distortion when large time constant windows are used. I n Shiau ef a / . (1987a) the distortion is reported to be 1 mV per second, while in our circuit it is less than 7 pV/s and can be neglected in most cases. With this control of acquisition time, an ordinary S&H circuit behaves much like a boxcar channel. Alternatively, a boxcar channel could be used to provide the baseline for the summing amplifier (Vitanov and Kolev, 1986). As it was explained in the previous section, the purpose of the feedback circuit is to bias the DUT in s,uch a way that its capacitance is equal to the corresponding emission reference capacitance C,. A miniature electromechanical relay or a fast, low-capacitance diode is used for alternating C, and C f at the reference terminals. Another solution is to replace these reference capacitors with a varactor diode, biased separately from the DUT. The varactor diode Atitdog
Device.c Data-Acqui.sitiori Dntcihook, Vol. 1, pp. 14-31 (1985).
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
65
replaces both capacitors, and its capacitance is alternated simply by applying voltage pulses to the external bias input of the capacitance meter or to an external biasing circuit.’ The varactor diode is easy to switch and adjust, but it is neither an ideal switch nor a high-quality capacitor. As an alternative, the reference capacitors can be selected to be high-quality variable air-capacitors, but the switching device is slow (Reed-relay or mercury relay), or non-ideal, and operates over a limited voltage range (a diode or an analog semiconductor switch). Additional complications arise when the variation of the reference capacitor used for defect profiling is automated, because then a bank of reference capacitors combined frequently by a slow relay-matrix is used. The problem is that the relays in this matrix have their own parasitic capacitances. Inevitably, some compromises are needed and the choice is made on a case-by-case basis.
C. Experimental Setup for CR-DLTS In the setup for CR-DLTS shown in Fig. 28, the sample transistor is connected as a voltage-controlled resistor between the test terminals of a Boonton 7200 capacitance meter. On the differential terminals a reference resistor R,, is connected that has a low conductance at the I MHz test signal frequency. In our case, this resistor is about I M R . The capacitance meter is used as a highsensitivity amplifier of the difference in the channel conductance of the FET sample transihtor
Sam ple transistor
-
%-
F K i U R E 28. a) Block diagram of the setup for CR-DLTS; h) connection for hack-gate driving available for four-terminal devices; c) connection of two devices for simultaneous measurements (expandable to more).
’
Application Note IM-00 I , Boonton Electronics Corporation, 25 Eastmanh Road, Parsippany, NJ 07054-0465 USA; Insrruction Munutrlfiw Model 72B C a p a d o w e Meter, Boonton Electronics Corporation, Parsippany. NJ 97520.
66
PLAMEN V . KOLEV AND M. JAMAL DEEN
and the conductance of the reference resistor. This difference is detected by a phase-sensitive detector (PSD) and, through the conductance analog output of the capacitance meter provided on the rear panel, then applied to a feedback circuit. An important property of the circuit is its dynamic configuration in response to the driving pulses. Essentially, the feedback output is connected to the gate of the sample FET by the switch SW only during the emission pulse. During the filling pulse, the feedback loop is open and at the FET gate a constant bias from an external source is applied. During the emission pulse with the feedback loop closed, a high-gain compensation amplifier OA provides for almost exact matching of the channel conductance of the tested transistor to that of the reference resistor R,,f by continuously adjusting the gate voltage V , . As the value of R,.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
67
making the memory circuit operate much like a boxcar channel. From this description, it is obvious that the function of the analog memory can be replaced with a boxcar channel sampling just before the end of the emission pulse. As seen in the circuit diagram on Fig. 27, the transient part of the gate voltage is separated from the total gate bias, forming a DLTS signal with a zero base line. This allows the signal to be easily amplified and digitized. The baseline voltage, which is close to the pinch-off voltage, is also available at the output of the analog memory circuit and can be recorded as a function of the temperature during the DLTS thermal scan. This feedback circuit allows for extraction of transients in the microvolts range from the total gate bias, which can be several volts. This property of the feedback is crucial for successful implementation of the new CR-DLTS technique. We further increase the sensitivity of the new system by using advanced digital signal processing techniques for noise reduction and for storage of the transient at each degree of temperature with a limited number of data points (Kolev et ul., 1997a). This technique is described in the next section.
D. Illustrritions
In Fig. 29 are shown three traces from three different DLTS modes but for the same device (neutron-radiation damaged JFET) and at the same temperature.
.
CT-DLTS CC-DLTS CR - DLTS
\
\
\
\
i I 0.1
~
~
-
- ~-
80
60
-k V
40
20
0
I0 Dclay time Inis] 1
FIGURE 29. Transients obtained liom capacitsnce-transient (CT-DLTS). constant-capacitance (CC-DLTS), and constant-resistance (CR-DLTS) methods using the same device at the same temperature. The CR-DLTS trace below 800 ps shows the recovery of the capacitance meter following its overloading during the filling pulse.
68
PLAMEN V. KOLEV AND M. JAMAL DEEN
These traces are very similar for delay times above 800 ps up to 1.3 s. The CRDLTS trace shows the recovery of the capacitance meter from its overloading during the filling pulse. Also, CT- and CC-DLTS traces below 100 ps show a slight distortion, which is caused by the low-pass filter of the conditioning amplifier. All traces deviate from purely exponentially decaying transients that is readily seen when they are plotted on semilogarithmic axes. This deviation is most likely caused by carrier trapping i n the Debye tail of the space-charge region (Noras, 1982). One problem often encountered both in CC-and CR-modes of operation is the variation of the effective gain of the measured device with temperature; i n general, it increases at low temperatures. While this dependence does not need to be considered for quantitative DLTS measurements, it may still adversely influence the results. The problem is that this gain change modifies the total gain of the feedback loop in both methods, and thus may affect the condition for appearance of oscillations i n the feedback loop (see Fig. 30). Setting the integrator’s time constant ( K I C I in Fig. 27) to a large value ensures that oscillations are avoided, but it simultaneously distorts the data at the beginning of the recorded transient. Therefore, a compromise is often made to adjust this time constant to a value that is just large enough to prevent the oscillations. Unfortunately, because of t h e variation of the effective gain of the device under test, the feedback circuit, which seems stable at some temperature, may start to oscillate at lower temperatures, causing complete data loss below this critical temperature. Note that in Fig. 30 the fading oscillations at the beginning of the traces increase at low temperatures. Obviously, the oscillations were nearly
microwavc n-MOSFET W/I. 60 pm/0.8 PI11 filling pulse -2 V substrate hias 0 V
0 .I
I
I0
100
I000
Dclay time Inis]
FiC;uw 30. CR-DLTS transients of a microwave MOSFET that was stressed with hok carriers. Note the fading oscillations at the beginning of the traces at low temperatures. For illustration of the data point density, each point is represented with a dot in the trace taken at 100 K.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
69
avoided leaving only negligible distortion, but the setup of the integrator time constant was too close to the critical value. Figure 3 1 demonstrates the sensitivity obtained using the new feedback circuit. I t allows for recording of voltage transients with a magnitude below 0.1 niV. Because the usual transconductance of the transistors in CR-DLTS mode contributes more to the total gain of the feedback loop than the effective gain of most capacitance structures in CC-DLTS mode, constant-resistance measurements are in general more sensitive. This allows us to obtain specific “features” of the spectrum that help in determining the capture cross section using the Arrhenius plot technique. Then, as described i n Johnson (1982), it is possible to obtain the interface trap density shown in Fig. 32. For calculation of the trap density, the capture cross section was assumed independent of temperature and energy in the range of a few kT (where kT = thermal energy) (Johnson, 1982). This is an approximation, but as the variation of the capture cross section gives only a minor shift in the obtained interface-trap density spectra on the energy scale, in many cases this result is acceptable. For example, in Fig. 33 an increased interface trap density of a transistor located on the same die is demonstrated following hot-electron stressing for 100 h and at maximum substrate current. As can be seen, the trap concentration increases toward the band edge i n agreement with commonly accepted U-shape distribution found using other methods. When the traps for minority charge carriers are of interest, in this case the traps for electrons, the filling pulse must drive the surface beneath the
I
n-MOSPET W/L 2 pm/0.2 pni substrate bias 0 V
I
-40 Q I I
I 50
~~~
~~~
~~
t=
s 1116
____ t = 1 0 m s
1
1
I00
1 so
200
250
Tcniperature [ K ] FI(jIIKk 3 I. CR-DLTS spectra obtained from a virgin microwave transistor using the new feedback circuit. This is a demonstration of the good sensitivity of the measurement: the magnitude of the signal is just a few tens of microvolts.
70
PLAMEN V. KOLEV AND M. JAMAL DEEN
10"
n - M O S F E T W / L 2 p m / 0 . 2 pin substrate bias 0 V ------ -. r
-
= 5 Ins
--
r, I
E ' 2
-.
-i10"'
- -. -.
z = 10ms . - z = 3 5 Ills
-
>,
Y
-
.-
n
i 10'0.1
0.4
0.3
0.2
E-E,
[eV]
FIGURE 32. Interface trap density obtained from three different spectra on the same device. As the energy-resolution increases at low temperature, we see a better coincidence of the curves. Repeating feature in the curves result from variations of the interface trap density.
I
0. I
I
0.2 E-E,
I
0.3 [eV]
I
0.4
F I G ~ I R33. E Interface-trap density lor holes of a microwave transistor on the same die as the transistor ineasured in Fig. 32 after hot-electron stressing. The capture cross section was obtained for three temperatures using the Arrhenius plot technique and then was interpolated as shown in the tigure.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
0. I
0.2 E,
-
0.3 E lev]
71
0.4
FIGURE 34. Interfxe-trap density for electrons of the microwave transistor in Fig. 3 3 , after hot-electron stressing. Note the difference with Fig. 33.
gate into strong inversion in order for these traps to be populated. In Fig. 34 the interface trap density for electrons of the same microwave transistor as i n Fig. 33 is shown after hot-electron stressing. The filling pulse was set at 2V, which is enough to obtain strong inversion at all temperatures of the DLTS scan. One important remark has to be made about the difference between the measurement of traps for majority and for minority carriers. When measuring majority carrier traps, the space-charge region collapses during the tilling pulse and the majority carriers flow into it and accumulate at the interface. The capture rate is high and both interface and bulk traps are tilled. During the subsequent emission pulse, the traps will emit their charge to join the majority carriers in the substrate and both interface and bulk traps will be detected. When minority carriers are injected from the source and drain during the tilling pulse, the SCR remains essentially intact, because the inversion layer at the interface screens the electric field from penetration into the substrate. Therefore, traps for minority carriers will be charged only at the interface and during the emission pulse only these traps will respond by emitting their charge, while bulk traps for minority carriers located in the SCR will not be detected. If these traps are of interest, the strong inversion pulse applied at the gate during the filling pulse has to be combined with injection of minority carriers from the source and drain into the substrate by forward biasing the corresponding n+-p junctions. Then the bulk traps for minority carriers in the
72
PLAMEN V. KOLEV AND M. JAMAL DEEN
substrate can be populated and measured. While charges emitted from the traps for minority carriers nn the SCR, which can contribute to the threshold voltage will be compensated, there will be no such compensation for the traps in the remaining areas of the source and drain p-n junctions. Then the variation of the volume of SCR will cause the traps for majority carriers also to be charged and detected, which may complicate the interpretation of the measurement results. This is the problem that was encountered before in the measurement of traps for minority carriers using diode structure and conventional capacitance-transient DLTS. E. Conclusions In this section, new feedback circuits for CC-DLTS and CR-DLTS have been presented. I t was demonstrated that with the aid of these circuits, it is possible to record voltage transients without compromising either the speed or the sensitivity of the measurement. Comments on technical difficulties when voltage transients are measured were presented. The feedback circuit operation was illustrated with measurements of the interface-trap density in MOSFETs and transient traces of a neutron-radiation damaged JFET.
V. CONSTANT-RESISTANCE DLTS
IN
ENHANCEMENT MODEMOSFETS
A. ltitroduction
During the last two decades, deep-level transient spectroscopy (DLTS) (Lang, 1974a) has proven to be among the most sensitive and accurate techniques for analyzing deep levels in semiconductors. The standard capacitance DLTS setup (Lang, 1974b, Miller et ul., 1977; Kimerling, 1976) uses p-n or Schottky diodes as test structures and the emission 0 1 trapped charge from the deep levels in the forbidden gap is monitored by measuring the corresponding change in the depletion capacitance of the diode. A very high SNR can be achieved by averaging the measurement over many pulses, as this represents an application of the well-known phase-sensitive detection (PSD) technique used for signal recovery from noise (Wilmshurst, 1990).The significant improvement in SNR offered by the PSD technique made DLTS more successful than the previously used thermally stimulated capacitance method. In standard capacitance DLTS instrumentation, there is already one level of PSD employed inside the capacitance meter. However, it has a substantially lower time constant that is designed to filter only capacitance changes which are much slower than the frequency of the test signal, usually set at 1 MHz.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
73
For practical reasons, sensitive measurements are done by using relatively large area structures. Considering the best commercially available capacitance meters, which are fast enough to respond to capacitance changes below the 1 ms range, it is possible to resolve capacitance variations in the range of -0.1 fF with the aid of sensitive DLTS processing instrumentation. Therefore, for detection of trap concentrations in the range of 10"' cmP3 in test structures with background doping of 10'' cmp3, the total depletion capacitance of the structure should be -10 pF. In practice, this can usually be obtained using relatively large area test structures. The various technologies for fabricating large area diodes are rarely coniplicated, and this has often been considered an advantage of the DLTS method. Nevertheless, the method was adapted for a number of other test devices that are available during electronic device fabrication or devices whose properties are of practical interest. Among these, the field-effect transistor (FET) has attracted the most attention, because MOSFETs are the workhorse of integrated circuits.
B. Overview of' the DLTS Trchtziyues Applied to FETi When relatively large FETs are available, the standard capacitance DLTS technique can be easily modified by reverse biasing of source and drain regions, and measuring the gate-substrate MOS structure (Wang, 1980). Furthermore, unlike the simple MOS structure, the existence of the source and drain in a MOSFET, as large reservoirs of minority current carriers, allows measurements of minority carrier traps (Wang, 1977, 1979) even though data interpretation can be complicated by simultaneous trapping and emission o f majority carriers (Wang and Evwaraye, 1976). Further modifications of the DLTS technique for measurement of FETs are needed because of the small size of the devices, resulting in a small total gate-substrate capacitance. Indeed, with the gate-substrate capacitance in depletion only a few or even less than one fF,measurements of capacitance transient due to charge emission from the traps located beneath the gate become practically impossible. More complications arise when the FET is fabricated on insulating or semi-insulating substrate, such as silicon-on-sapphire, or FET on GaAs or other compound semiconductor material substrate. Then, the high series resistance of the depleted semiconductor material in the channel requires correction factors to be added to the measurement (Broniatowski, r t ~ d . ,1983) before analyses are made. Moreover, for thin film transistors, the thin silicon epilayer limits the change of the depletion region width and the corresponding capacitance change. Combined with the large series resistance in the depleted channel, this results in a large free-carrier transient,
74
PLAMEN V. KOLEV AND M. JAMAL DEEN
which complicates the analysis of a much smaller transient due to the trapped carriers. To avoid these difficulties, the conductance DLTS (Chen et al., 1981) was proposed for measurements of MOSFETs. In this method, the small-signal ac channel conductance is used to monitor the emission transient of the trapped carriers, while the back-gate bias pulses are used to control trap tilling and emission. For trap concentration profiling, the back-gate bias controls the depletion width and bias pulses on the gate are used for controlling the population of trapped carriers. In Chen rt ( I / . (198 I), the ac channel conductance was measured at zero dc bias of drain to source using an HP 4271B LCR meter, which employs the PSD principle as previously mentioned and can detect 0.01% conductance changes. However, for calculation of trap concentration, the surface free carrier mobility was assumed constant over the entire measured temperature range. In addition, knowledge of the flat-band voltage, doping concentration, and oxide capacitance was required. Later, conductance DLTS was cornbined with C-V measurements to calibrate the theoretical characterization model for investigation of ion-implanted GaAs FETs (Golio rt al., 1984). The model was needed to eliminate common assumptions in other simpler characterization techniques, and with this more rigorous treatment, free-carrier, shallow-level donor. trap and mobility protiles were obtained. Another modification of the DLTS technique that is convenient for FET investigation is based on observation of the drain current transient caused by the effective change in the threshold voltage due to trap carrier emission (Hawkins and Peaker, 1986). Instead of measuring the small-signal conductance as in Chen et a/. (198 1 ) and Golio et a / . (1984) a small drain-source bias was applied and the resulting current transient Al,, was measured, while keeping the gate bias V,q constant. This method benefits from using the gain of the measured transistor, requires simpler instrumentation, and can potentially reach lower time constants, as compared to the small-signal ac conductance DLTS. However, the sensitivity can be lower because of the elimination of PSD noise reduction, and for trap concentration measurements, the FET transconductance under the same DLTS conditions must be monitored. In Hawkins and Peaker (1986), this method was also called a conductance DLTS, which may create confusion with the small-signal method described here. Essentially the same drain current transient technique combined with surface mobility (and therefore, transconductance) measurements was used later (Haddara et d., 1993; McLarty et a/., 1988a,b), and the method was correctly referred to as a current DLTS. In this section, we suggest a new DLTS variation, the constant-resistance (CR-)DLTS. The preference over the name constant-conductance, which is also correct, was given in order to distinguish it from another popular abbreviation, namely CC-DLTS, which usually stands for the constant-capacitance
DEVELOPMENT A N D APPLICATIONS OF A NEW DLTS METHOD
75
technique (Johnson et ill., 1978). This new technique is similar to the conductance DLTS (Chen et d., 1981), but instead of measuring the small-signal ac conductance directly, the result is applied as an input signal to an electronic feedback circuit. Some idea for this setup was found in Collet (1975), but it was not used there for DLTS measurements. The feedback circuit serves to provide the voltage needed to compensate for the charge losses from the traps during the emission pulse. Thus, additional measurement of the surface mobility or transconductance is avoided, and the benefit of using the PSD noise reduction technique is combined with simple and straightforward analysis. The new technique is also similar to the constant capacitance DLTS in the way it compensates for the trapped carrier emission by adj listing the applied bias. C. Theowtical Biickgrounrl Here we consider the case of a donor-like level in an n-channel MOSFET. A similar fortnalistn can be derived for acceptor-like levels in a p-channel transistor with appropriate changes to the notation. In general, with a nonzero source-substrate bias V H S ,the threshold voltage V l in the linear region is given as (Deen and Yan, 1990; Sze, 1983; Tsividis, 1987; Yan and Deen, 1991, 1989) (31) V I . = V F H 2qfi - Qo/CO.,,
+
where V p B is the flat-band voltage, 9~ is the Fermi potential in the substrate, and QH/Cl,,, is the gate voltage required to compensate for the charge of the ionized acceptor atoms in the space charge region QR given by
In the foregoing equations, E,\ is semiconductor permittivity, y is the electronic charge, Nn is the effective substrate doping concentration, and C,,, is the oxide capacitance per unit area defined as
c,,.,= f,,, ~
T,,,
(33)
with T,,, being the oxide thickness and E,,, the gate oxide permittivity. When there are charges trapped in impurities or defects centers in addition to the shallow dopant N n , their total charge has to be added to the ionized acceptor . all the traps are atoms. If the total donor-like trap concentration is N T ~and charged, which is the case after accumulation or filling pulse at the moment t = 0, the initial threshold voltage ( V I , ) ;can be written using Eq. (31) as
76
PLAMEN V. KOLEV AND M . JAMAL DEEN
where A and B are defined as
and
In Eqs. (35) and (36), the Fermi potential is assumed to remain approximately constant, independent of the temperature in the temperature interval around the DLTS peak. The trapped charge N T ( f ) varies with time during the emission pulse as (37) N T( r ) = N T Te x p ( - t / d , where 5 is the time constant of the traps according to Shockley-Read-Hall statistics (Shockley and Read, 1952; Hall, 1952). After a sufficiently long time, all trapped charges (holes in this case) are emitted to the valence band, N , = 0, and the threshold voltage ( V T ) f with all traps discharged and neutral becomes (VT)j = A BJNA. (38)
+
The total change of the threshold voltage A V T can be obtained by subtracting Eq. (34) from Eq. (38) and is given by
As N T T
<< N A , then
Eq. (39) can be reduced to
In our system, this is exactly the voltage change that is compensated by the feedback circuit when the emission pulse is long enough, and this is the niagnitude of the recorded voltage transient. Because of system delay, in practice. only the part of the voltage transient after this delay is measured and recorded. Therefore, it is useful to express the time variation of A V T ( t ) using Eqs. (37) and (40) as
Our feedback system provides a separate output where the total voltage needed to maintain the drain-source small-signal AC conductance equal to
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
77
some selected reference value C,/ can be measured and recorded. If this reference conductance is small enough, as in our case it is G,?/ > 1 pS, then thic voltage is approximately equal to the threshold voltage. The time variation of the threshold voltage can be found by combining Eqs. (34). (38) and (41) to Five
or
We should note that, since the feedback circuit provides the threshold voltage and its changes directly without ever measuring explicitly the drain-source current, there is then no need for the transistor size or surface mobility to be measured, which is a necessary part of the conductance or current DLTS methods. Furthermore, if the trap concentration and interface trap densities are the same, the voltage transient should be independent of the MOSFET dimensions. I n other words, even deep submicron transistors should provide a DLTS signal that is large enough to obtain good SNR. This is no surprise, given the fact that the small total amount of trapped charge beneath the gate of the minimum size transistor has to be balanced by a correction voltage applied across a relatively small gate-substrate capacitance.
D. Experimental Results We illustrate the new CR-DLTS technique with some results from measurements of n-channel MOSFETs with 0.8 pm channel length, the minimum dimension available for our test chip design. Here, it should be noted that the sample transistor gain is an effective part of the common feedback loop, and therefore it may be expected to affect circuit sensitivity. Howcver, to our observation, the large gain of the capacitance meter effectively compensates for the sample transistor's gain, and in practice, the transistor's size affects only the setting of the time constant of the feedback circuit integrator. At low temperatures, where surface mobility increases, for samples with large W/L ratio it was often better to reduce the capacitance meter gain rather than to increase the integrator time constant, because the latter distorts the recorded transient for short delay times. The CR-DLTS scan of the sample MOSFET presented in Fig. 35 shows a peak at -70 K, which can be associated with an energy level at about 0.1 1 eV 1 x 1OP1' cm2. This above the valence band and capture cross section m peak arises from a signal with continuous energy distribution due to interface states. At this stage, the associated trap cannot be identified with confidence.
-
78
PLAMEN V. KOLEV AND M. JAMAL DEEN - I00
n-channel MOSFET 0 . 1 1 eV
L = 0.8 pni W = 22 ~ r n t , = s 111s t? = I 0 111s
>
- -50 I
> a
0 SO
I so Teniperatiire [ K ]
2.50
-
FIGLIRE 35. CR-DLTS scan of an n-channel MOSFET. The peak at 70 K appears for all possible combinations of 11 and t? settings and from the Arrhenius plot, the corresponding energy level is 0. I I eV above the valence band and the capture cross section is (T I .2 x lo-” cm’.
-
-
but it may be related to the (Fe-B) complex (Benton and Kimerling, 1982). At higher temperatures, it appears that there are two other “peaks,” but they cannot be related with confidence to discrete energy levels because they are wider than what is normally obtained. There were no intentional contaminations or damage of our samples, and the measured trap concentrations are very low. This in turn produces, a very weak signal below the instrumentation noise level, and this weak signal can be detected only after advanced digital averaging (Kolev et al., 1998a). However, the noise features can be identified easily and are distinguished from the peaks by varying the delay times and observing the changes in the scan. The increased magnitude of the signal at lower temperatures corresponds to the increased interface trap density toward the valence band edge. This result is in agreement with the well-known Ushape energy distribution of the interface traps, but contradicts other DLTS measurements on p-type MOS capacitors (Johnson et al., 1978). At present, there is no satisfactory explanation of this result. In Fig. 36 is shown a CR-DLTS temperature scan of a minimum size MOSFET available from our test design. The device in Fig. 36 was selected in order to demonstrate a discrete trap energy level, and it was not on the same test chip as the device described in Fig. 35. The parameters of the hole trap causing the peak at -100 K are: energy level at about 0.12 eV above the valence band and capture cross section of about 6 x lo-’’ cm2. It is possibly related to Mn impurity atoms (Benton and Kimerling, 1982). We emphasize here more the demonstration of this CR-DLTS method, rather than the results.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
79
-0 3 0. I2 eV
ii-channcl MOSFET
-> -0.2 E
I
> 4
-0. I
4.0
I so
so
250
Ten1peraru rc [ K 1 FIGURE36. CR-DLTS scan of a miniinum size 11-channel MOSFET. The peak below 100 K is related 10 a discrete deep level at 0.12 eV above thc valence band with a capture cross scction (T 6 x lo-” cin2.
-
-
- I .11,
11 -
MOSFET W = 2 pi11 L = 0 . 3 pin 3 ms/6 111s
I so Tempcraturc [ K I
SO
150
FIGURE 37. Two CR-DLTS scans of an n-channel deep submicron MOSFET. The magnitudc 35. The peak below 100 K 0 I x 10 - 15 cm2.
is comparable with that of the 0.8 pm x 22 v i n device shown in Fig. corresponds to a deep level 0.1 3 eV above the valence band with
-
-
A similar deep level appears in the CR-DLTS scans shown in Fig. 37 at 0.13 eV above the valence band and capture cross section (T 1 x lo-’’ cm2. The measured device is a 2 pm x 0.3 pm n-MOSFET, which has an entirely different fabrication history from the devices in our test chips. In this figure, the interface trap density decreases toward the valence band edge, which result
-
80
PLAMEN V. KOLEV AND M. JAMAL DEEN
-150
r n
S0
-
MOSFET W
=
2 pin L = 0 3 pin ~
-IV
1 so
Temperature [ K ]
Ft~unk. 3R. Body effect in CK-DLTS. The substrate bias affects the surface and the bulk t r a p in different ways. The only par! of the DLTS scan with -1 V substrate bias that is larger than thc scan measured with zero bias is the peak corresponding to a bulk trap. At the same tiine. the signal related to the surface traps is decreased due to the rcduced area of the channel. which is still tinder gatc control. This effect is more pronounced for short-channel devices.
is in agreement with the previous DLTS measurements of p-MOS capacitors. Here it is interesting to note that both scans exhibit very similar features when the shift toward lower temperatures for slower rate window setups is taken into account. This supports the idea that we are measuring the actual variation in the interface trap density with different energy levels inside the bandgap. In Fig. 38 are presented results from measurements of the same device as in Fig. 37, but with variation of the source-substrate bias. The thermal scan with -1 V substrate bias has a lower magnitude compared to that obtained without substrate bias. The only exception is the peak associated with the hole trap at 0.13 eV above the valence band. At the same time the surrounding peaks have significantly reduced magnitudes. Increasing the reverse substrate bias leads to a decreased gate control over the space-charge region near to the interface and a reduced number of interface traps that can participate in the charge capture and emission processes. At the same time, the width of the space charge region beneath the gate increases, and more bulk deep levels can respond to the filling pulse applied to the gate. This can explain the increased magnitude of the peak corresponding to the deep level in the scan with - I V substrate bias. The differences in Fig. 38 between both DLTS scans demonstrate the possibility to distinguish the effect of the surface traps from the bulk traps by performing measurements with variations of the substrate bias.
DEVELOPMENT A N D APPLICATIONS OF A NEW DLTS METHOD
81
It is useful to compare the new CR-DLTS method with some other DLTS technique. In our case, the constant capacitance DLTS method is very convenient. It uses almost the same instrumentation, except that the reference structure is a variable air capacitor, the input of the feedback circuit is connected to the capacitance output, and the total bias is applied to the sample through an external bias circuit.’” In a comparison (Hawkins and Peaker, 1986) of the proposed new technique to the standard capacitance, DLTS was made by using a “fat” transistor, that is, a transistor with relatively large gate area that allows for capacitance measurements. As the constant capacitance DLTS signal is also area independent, one might expect that this is an unnecessary requirement in our case. However, the “gain” of the MOS capacitor structure i n the feedback loop rlC/dV is very low for small area devices, and the SNR is very poor, thus preventing sensitive measurements. For this reason, we compared the two DLTS modes by using 400 MOSFETs connected in parallel, each of W / L = 22 pn/O.8 pin. The CR-DLTS measurements were done in the manner already described here. For the CC-DLTS measurements, we used a 1 MHz decoupling filter to connect the source and drain to the substrate at all frequencies except I MHz. This filter has reduced the electromagnetic interference that affects the CC-DLTS more than CR-DLTS. In Fig. 39 are presented the results from the comparative measurements with variation o f the substrate bias. Comparing the two CR-DLTS scans. we see a substantial increase in the peak value at higher substrate bias, which indicates a bulk trap. A good agreement was found between the results obtained with CR-DLTS and CC-DLTS, despite some discrepancy above 200 K. We attribute the positive values in the CC-DLTS scan to possible problems with the biasing circuit and different averaging parameter settings of the DLTS processing and recording circuit. As both measurements are performed using the same device, similar results should be expected. The hole trap is at 0.28 eV above the valence band and its capture cross section is -8 x IO-’” cm2. Differences between the parameters determined using CR-DLTS (2) ,ind CC-DLTS are 7 MeV and -2 x cm2, and this difference can be Gtttributed to the possible effect of field-enhanced emission. because the two types of measurements are performed with different reverse biases applied to the substrate. In Fig. 40 are shown the Arrhenius plots of the traps from the previous ligures. Recording the whole transient lor each degree change in temperature allows us to obtain large sets of data points for each trap. Note the good linearity and low dispersion of the data points. The traps in Figs. 35 and 36 have similar energy levels, but their very different capture cross sections prevent I” Applicnlioii Note IM-001. Boonton Electronicb Corporation. 25 Eastinans Road, Patsippany, NJ 07054-0465. USA.
82
PLAMEN V. KOLEV AND M. JAMAL DEEN
1
400 n - M O S F E T s i n parallel
CR DLTS ( I ) C C DLTS
I
70
I20
I
170 220 Temperature [ K ]
I
270
FIGURE 39. Comparison between CR-DLTS with different substrate bias and CC-DLTS by ineasuring 400 MOSFETs in parallel. Substrate bias was zero for CR-DLTS ( I ) and -2 V for CR-DLTS (2). Note the increased noise magnitude of CC-DLTS due to lower gain of the samplc i n the feedback loop.
7
m-
t-
s
-c
3
1
FIGIJRE 40. Arrhenius plots of the peaks in the previous figures. Bold circles refer to the trap in Fig. 35. bold triangles refer to the trap in Fig. 36, and bold squares are for the trap in Figs. 37 and 38. The two overlapped leftniost plots correspond to the trap in Fig. 39. Data obtained from CR-DLTS ( 2 ) is denoted with crosses, and from CC-DLTS is denoted with open circles. The difference in the determined energy level position is about 7 MeV and the differencc i n the capture cross section is 8 x 10-16cni2.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
83
us from attributing them to just one type of defect. A similar situation exists for the trap in Fig. 36 compared to the trap in Figs. 37 and 38. They have approximately the same capture cross section, but the measured difference of 20 MeV i n the energy level position is well above the margin of error for o u r system, which we estimate to be just a few milli-electron volts (meV). The small difference in the parameters of the trap in Fig. 39 can be attributed to the effect of field-enhanced emission and is not necessarily due to inaccurate measurements. E. Conclusions In this section, we presented a new DLTS technique -constant resistancc DLTS -suitable for submicron MOSFET measurements. It does not require additional measurements of the sample transconductance g,,, or the surface mobility ps, is very sensitive, and can operate at very low channel current levels, thus i t is unlikely to degrade the sample FET. Because the DLTS signal is independent of the device area, CR-DLTS allows measurements of minimum size transistors without loss of sensitivity.
VI. CONSTANT-RESISTANCE DLTS
IN
DEPLETION MODEMOSFETS
A . Introduction 1 . Chcrrge-Coupled Devices rind Reading MOSFETs
Charge-coupled device (CCD) image sensors (Murowinski and Deen, 1993; Jaggi and Deen, 1995) are used widely in optical astronomical telescopes because of their high sensitivity, linearity, and low noise. They are particularly attractive for space-based astronomy because of their low power consumption and high reliability. During operation in space, these CCD-based imagers are subjected to various sources of radiation damage. Among these, protoninduced damage is of particular interest to us because in the prqjected 5-year mission of the Lyman Far-UV Spectrographic Explorer (FUSE), the imagers are expected to encounter a large number of energetic protons (Hardy et NI., 199th; Kolev et al., 1997c, 1 9 9 8 ~Murowinski ; and Deen, 1994; Murowinski; e t ul., 1995, 1993a,b). The charge signal induced in the CCD imaging array is converted to a voltage by a low-noise amplifier. The key device in these amplifiers is a buried-channel MOSFET, which has a significant effect on the overall performance and reliability of the CCD imager. Because of this, the radiation-induced damage in these transistors is of particular importance to the reliability of the CCD imagers that are to be used in space.
84
PLAMEN V. KOLEV AND M. JAMAL DEEN
Output MOSFETs are specially designed depletion-mode lightly doped drain (LDD) devices that operate as source followers with floating gates. The charge from the CCD array is dumped onto the gate of the transistor, and it produces a voltage inversely proportional to the capacitance of the floating node. Therefore, for greatest sensitivity, it is desirable to reduce the capacitance of the gate a s much as possible. This can be achieved by minimizing the gate area, and by using the LDD structure, which reduces the overlap capacitance between the gate and the source and drain diffusions. Typical CCD charge conversion transistors have channel dimensions of 50 pm x 10 pm and the depths of the implanted channels are in the range of several microns. For our devices, using a C-V technique, the implanted channel effective doping concentration is measured to be 1 . 1 x 10” cmp2, and this is in a good agreement with the value calculated from technology process modeling programs (Kim et LII., 1990). The deep-channel design reduces the low frequency noise associated with surface and near-interfacial traps, but it makes it practically impossible to apply traditional capacitance transient DLTS (Wang and Evwaraye, 1976).
2 . CR-DLTS Method Applied fiir MeLisuretnmt of’ Depletion Mode FETs
In this section we demonstrate the benefits of using the new CR-DLTS method for studying depletion-mode MOSFETs. Although the DLTS setup used is technically the same as the setup described in Section IV, there is a ma-jor difference when it is applied to measurements of depletion-mode transistors. If we consider enhancement and depletion-mode n-channel transistors, the gatecontrolled space-charge regions have different types of conductivity -p-type substrate in the enhancement mode transistor and n-type implanted channel in the depletion-mode device. As DLTS is a majority carrier method, the enhancement device is well suited for investigation of the hole traps, while the depletion-mode transistor is better used for measurement of the electron traps. In the case when the doping concentrations of the implanted channel and the substrate are comparable, it is possible to measure the trap distributions on both sides of the implanted channel-substrate p-n junction. B. Theomticul Background The buried channel MOSFETs are more complicated for analysis than the corresponding enhancement mode surface channel devices (Tsividis, 1987). Here we limit our discussion to an n-type buried channel MOSFET. For simplicity, we neglect the LDD regiorts and the depletion regions around the n t doped source and drain. Figure 41 shows part of a buried n-channel MOSFET with the channel fully depleted. The vertical structure can be regarded as a combination of a MOS
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
85
FIC~LIM 41. Part of a CCD 11-MOSFET wilh the implanted channel fully depleted. For simplicity. the space-charge region around the ni region and the LDD extenaions are not shown.
capacitor and an adjacent p-n junction diode. For front gate operation, the source-substrate (p-n diode) reverse bias voltage v.7~is held constant, and therefore the width of the two space-charge regions of the channel-substrate p-n junction also remains constant. At the same time, the gate-source (MOS capacitor) bias is pulsed between a channel enhancement voltage (trap tilling pulse) and a channel depletion voltage (emission pulse). During the filling pulse, when the gate-source voltage is constant, the gate-controlled part of the implanted channel is saturated with electrons flowing from the source and drain regions, and the electron traps are filled. During the emission pulse, a feedback circuit continuously adjusts the gate-source voltage in order to maintain a very weak conducting channel inside the buried channel. This conducting channel is maintained at the boundary between the gate-controlled and substrate-controlled parts of the channel, that is between the p-n diode and the MOS capacitor in Fig. 41, while the rest of the channel is fully depleted. This condition, when the two depleted regions inside the implanted channel meet, is defined as pinch-off, and it is shown in Fig. 41. The voltage to achieve channel pinch-off is denoted in Fig. 41 with Vc-8= V c ; ~ pwhere , the gate-substrate voltage V(;B includes the constant voltage V.slj. We use in our DLTS setup a meinory feedback circuit (see Section IV), originally designed and used for a constant-capacitance DLTS mode of operation (Johnson rt L L ~ . , 1978, 1979).
86
PLAMEN V . KOLEV A N D M. JAMAL DEEN
In general, V C ~ pcan be found by solving Poisson’s equation. Tsividis p the form (1987) suggests an expression for V c ~ of
where V p is the “pinch-off voltage.” For the case when the implanted channel has a substantially higher doping concentration NI compared to that of the substrate N A B ,the pinch-off voltage is given by
where
In Eq. (46), the flat-band voltage V F B= @MS
-
Qil/Cll.yand
where y is the body effect coefficient of an unimplanted device on the same substrate and y~ is the depth of the implanted channel. The remaining symbols have their usual meanings (Tsividis, 1987). When there are charges trapped in deep impurities or defect centers, in addition to the shallow dopants (NI - N A B ) in the implanted channel, their total charge has to be added to the ionized shallow impurity atoms in Eq. (46). If the total acceptor-like trap concentration is N T T and all the traps are charged, then (NI - N A B ) in Eq. (46) has to be replaced with (NI - N A B - N7.7.). The time variation of the trapped charge is N T ( t ) = N T T exp(-r/s)
(48)
where t is the emission time constant of the traps according to the ShockleyRead-Hall statistics. After a sufficiently long time, all the trapped charges will be emitted and then the space charge in the depleted implanted channel will consist only of donors and acceptors. With constant V S ~if ,the interface trapped charge is neglected and the deep impurity concentration is much lower than the effective doping concentration in the implanted channel, then most of the terms in Eq. (46) are constant during the emission pulse, and only the third term is changing with time. The total change of V p o will be
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
87
This reflects the fact that the volumes of all space charge regions and body coefficients remain constant during the trap emission, which is a good approximation for low trap concentrations. The time variation of V p , using Eq. (48)
In our DLTS system, the feedback circuit compensates for the trapped charge loss and this compensation voltage balances the change in the pinch-off voltage AVp,(t). We measure this change directly and without reference to the transistor width-to-length ratio W / L or free carrier bulk mobility p. The preceding expressions were derived for the case of N / >> N A B ,but it is possible to obtain corresponding expressions for the more common case with comparable concentrations N / and N A B , again without prior knowledge of W I L or p. If the trap concentration is low enough, the volume of the space-charge regions will remain essentially constant. This can greatly simplify the calculations and one can obtain expressions similar to Eqs. (49) and (50),except that the effective width of the gate controlled part of the implanted channel y,’ should replace the total implanted channel thickness y/. By adjusting the source-substrate bias we can vary this effective width y/’ and thus we can achieve depth profiling of the traps. Another commonly used DLTS profiling technique, selective trap filling, is also available (Lang, 1974a; Svensson er ill., 1987). C. Experimentill Results mid Discussions We have investigated three test chips, each containing a set of IS buried nchannel MOSFETs. One set was subjected to 5.0 x 1 0’ protonskm’ at 1 .O MeV, and another to 9.7 x 10” protonskm’ at 1 .O MeV. The third set of devices were undamaged. These are the same devices that were studied in (Murowinski of d., 1995, 19931). Here we present first the CR-DLTS results from the measurement of the 20 pm x SO pm device, since these results are typical. Fig. 42 shows CRDLTS scans of these devices. As expected, the transistor with the largest dose of radiation exhibits the largest DLTS signal. I n this spectrum, five pe‘‘I k:5 are observed. In the spectrum of the device with a moderate dose of radiation, one can distinguish only four peaks and the magnitude of the signal relates to that of the other damaged device in approximately the same ratio as the proton dose. The CR-DLTS scan of the undamaged device has a much lower magnitude and no peaks can be seen. The slight rise in A V,y at low temperatures that is visible in this scan is most likely due to the interface traps at the Si-SiO2 interface, which can also be detected with this technique (Johnson et al., 1978). Our DLTS system records the transient at each degree of temperature as a set of data points (Kolev et al., 199th) Figure 43 shows six DLTS spectra obtained from one temperature
88
PLAMEN V. KOLEV A N D M. JAMAL DEEN
0.7 0.6
-2
0.5 0.4
&
0.3
0.2 0. I
0.0 50
200
I 50
I00
250
Temperiiture [K]
Ficii~w42. CR-DLTS spectra of two proton irradiated devices compared to the spectrum of an undainaged device. The source-substrate bias was - 16 V and the litling pulse was 0.6 V. 0.4 T = I ; 2 : 5: 10; 20: 37 I l l \
0.3
0.2
0. I
0.0
-0.11
I
so
'
'
"
' I00
"
"
I
'
"
I 50
"
200
"
"
' 250
Tcmperature [ K ]
FIGLIRI: 43. Six CR-DLTS spectra obtained from one tentperature scan of the heavily daim aged devicc at different rate window settings of I , 2. 5 , 10, 20, and 37 ms.
scan for six selected rate windows. The scans obtained for time constants I , 2, and 5 ms become negative in the range 100-200 K . Reasons for this behavior could be the nonexponential transients or a high series resistance (Weiss and Kassing, 1988; Schroder, 1990). From the CR-DLTS data file, which was used to generate the spectra in Fng. 43, the data points for the Arrhenius plots shown
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
89
in Fig. 44 were obtained. Good linearity was obtained except for the data of the partly overlapped peak at 65 K. The electron trap energy levels were found to be 0.098, 0.138, 0.194, 0.256, and 0.392 eV below the conduction band. The corresponding capture cross sections were 4.6 x IO-", 6.3 x lop1', 1.2 x lop1", cm2. From these parameters, it is d and 3.4 x 8.5 x tify with confidence any of the traps with those listed in Benton and Kimerling ( 1 982). Our later analyses, described in this work, have demonstrated that the values obtained for these traps are significantly affected by field-enhanced emission; to indicate that, we present these values in parentheses. Figure 45 shows two DLTS spectra obtained with different settings of the source-substrate bias VSS. Reducing V.78 causes a larger part of the implanted channel to be under gate control, and therefore, an increased number of traps that participate in the filling and emission processes. This is reflected in Fig. 45 with an increased DLTS signal magnitude AV,?. Reducing V,SRalso shifts the boundary between the substrate- and gate-controlled part of the channel deeper into the channel. In the common case where the traps have a nonuniform spatial distribution, then this can give a different change in the magnitude o f AV,c for each deep level. In Fig. 45, the level denoted with E3 changes very slowly as compared to E2, E4, and E5. This could indicate that traps at E3 are concentrated near the surface beneath the gate, so that increasing the depth of the gate controlled part of the channel does not include a substantial new m o u n t of the defects identified by E3. I n Fig. 46 we present again front-gate CR-DLTS spectra with variation of the source-substrate bias, but measure another device on the same test chip. The increased relative magnitude, substantial variation of the temperature of the peak, and the shape of E4 can be an indication that this peak might result from a combination of some other peaks and not nece rily be attributed to a separate deep level. The profiling technique introduced with the DLTS method (Lang, 1974a) is commonly available i n most of the DLTS experiments. The idea is to populate selectively the traps in the investigated space charge region by varying the magnitude of the filling pulse (Lang, 1974a; Svensson and Willander, 1987). I n Fig. 47 we present several CR-DLTS scans with variation of the filling pulse magnitude. As all the voltages are relative to the source, and the pinchoff voltage is about -8 V, increasing the filling pulse magnitude toward -7 V actually decreases the active volume in the channel and the amount of traps that can be populated, resulting in a proportional decrease of the DLTS signal. Furthermore, one can see a slight shift of all the peaks toward higher temperatures (for thc same rate window setup), and this is equivalent t o increasing the apparent time constant of the traps. In fact, at a more negative filling pulse magnitude, only those traps that are close to the p-n diode/MOS capacitor boundary inside the implanted channel (see Fig. 41) can be populated.
90
PLAMEN V. KOLEV A N D M . JAMAL DEEN
8
6
h
$
4
G
2
0 50
0
I00
1 so
200
I/kT Lev-’] FIGURE 44. Arrhenius plots using the CR-DLTS data file of the heavily damaged device, which received the dose of 2.7 x lo9 protonskm’ 1 .o
MOSFET SO pi11 x 20 pin T = 7.44 ms
E5 substrate bias
0.8
E2 _______
0.6
E
Y
Y Q
0.4
0.2
0.0
50
I00
1 so
200
2.50
Teinperaturc [K]
FIGLIRE 45. Transient magnitude dependence on the gate controlled volunie of the buried channel. A decreased substrate bias allows a larger part of the channel to be controlled by the gate.
91
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD 7
0.5
0.4
1 -
MOSFET 3 I pin x 1 3 pin
su hstrate hi as
0.5 -
MOSFET 3 I pin x I0 p i n . 7 = I5 m s filling pulse
0v
~
-I
v
_ _ _ _ _ ... . .....
~~~~~
-3V -7 v
i
I
92
PLAMEN V. KOLEV AND M . JAMAL DEEN
During the emission pulse, these traps experience a weaker electric field compared to those close to the Si-SiOz interface and the emission time constant of these traps will be less affected by the electric field. Therefore, for reduced field-enhanced emission and correct trap identification, one should use the data obtained with a filling pulse of magnitude close to the threshold voltage. As the DLTS signal magnitude, and therefore sensitivity, is also reduced, then one has to determine some optimal filling pulse voltage for the transient analysis. This can be done by comparing the data from several scans with different filling pulse magnitudes. Alternatively, one can vary the filling pulse and monitor the time constant of the recorded transient at some fixed peak temperature. We found that for our samples, setting the filling pulse at -3 V is a good compromise between reasonable sensitivity and lower electric field requirement during the emission pulse. In Fig. 48 are shown the Arrhenius plots and the activation energies of the traps. The results for E2, E3, and E5 are very close to those reported in Svensson and Willander (1987); Benton and Kimerling ( 1982); Tokuda et ul. ( 1979) and more recent works (Privitera et ~ 7 / . , 1996; CofSa et ul., 1997; Svensson et d., 1993, 1995; Awadelkaritn et ul., 1986). There is an electron trap at 0.16 eV below the conduction band associated with the oxygen-vacancy complex (Tokuda et ( I / . , 1979; Privitera et a/., 1996; Coffa er a/., 1997) and our data for E2 is very close to this value. Others (Svensson and Willander, 1987; Benton and Kimerling, 1982; Svensson et d., 8
6
-
t 4 c
2
0 0
50
100
I so
200
I/kT [eV-l] FICUKE48. Arrhenius plots obtained from the data recorded with -3 V filling ptilac magnitude. The source-substrate bias was -8 V. The capture cross sections obtained froni the intercepts lor the traps EI-E5 are IO-'"cm', 3 x 10-1scm2. 4 x 10-"cm2, 3 x I O ~ ' " c m ' , and 10 - I s ciii', respectively.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
93
1993. 1995; Awadelkarim et a]., 1986) have reported for this defect a level of 0. I8 eV. More consensus exists for the defects giving levels at 0.23 eV and 0.43 eV (Svensson and Willander, 1987; Benton and Kimerling, 1982; Privitera et al., 1996; Coffa et d., 1997), which corresponds to our results for E3 and E5. A double charged divacancy is considered to give the level at 0.23 eV and a singly charged divacancy at 0.43 eV. I t was recently suggested that the level at 0.43 eV could also be a combination of a singly charged divacancy at 0.41 eV and a phosphorus-vacancy complex at 0.44 eV (Privitera er al., 1996; Coffa et al., 1997). Unfortunately, most of the experiments in the literature are limited to liquid nitrogen temperatures and defect E 1 giving a level at 0.1 1 eV could not be identified with confidence. The parameters are close to those listed for an Mn impurity (Benton and Kimerling, 1982), but the low relative concentration of this defect prevents reliable identification. The origin of E4 is also not well established. Similar results for a trap at 0.30 eV are reported as a minor level defect after 0.6 MeV ion implantation with Si (Svensson et ul., 1995) or 2 MeV electron irradiation (Svensson and Willander. 1987; Awadelkarim et d . ,1986). However, the peculiar behavior of the peak associated with this trap demonstrated in Fig. 45 requires extra precaution in the trap identification. Similar peak broadening was observed in Svensson and Willander (1987) and it was suggested that the peak at 0.30 eV may result from overlapping of two closely spaced peaks. In this work, we emphasize the various options for DLTS measurements that the new CR-DLTS technique offers rather than the trap identification. It should also be noted that, unlike the quoted references where the studies are carried out on specially prepared pf-n and nf-p samples, we use depletion-mode MOSFETs with dimensions and channel doping designed for other purposes. By comparing the peak heights with the noise, the magnitude resolution in the spectrum at -7 V in Fig. 47 and i n the spectrum of a less exposed device in Fig. 42 can be estimated to be in the range of a few tens of microvolts. With the oxide thickness about 400 A and y,’ estimated to be in the range of several microns, by using Eq. (46) we estimate the concentration resolution to be in the range of a few times lo9 cmP3. This sensitivity is about three orders of magnitude below the effective doping concentration in the channel, and it is about two orders of magnitude worse than that obtained in our more recent results (Kolev er ul., 1998b) which is typical for advanced DLTS systems (Schroder, 1990). However, when we take into account the size of the measured device (50 pm x 20 pm),and consider the absolute number of traps into the active volume of the channel, to the depth of several microns, our estimate for the sensitivity corresponds only to a few tens of traps. This high absolute sensitivity is not new in the DLTS (Breitenstein, 1982), and we benefit froin the special design of our devices, which is aimed at the highest possible sensitivity of the gate voltage vs gate charge changes (Hardy et ul., 1998b).
94
PLAMEN V. KOLEV AND M. JAMAL DEEN
While addressing the issue about the maximal absolute sensitivity of the DLTS and in particular of the variation presented here, one should remember that single traps were observed by analyzing the so-called random telegraph signal (RTS). This high sensitivity was demonstrated for both MOSFETs (Kirton and Uren, 1986) and JFETs (Kandiah, 1994). Our best results are in the range of 20-30 detected traps. It may be possible for the sensitivity to be further enhanced by using specially designed devices and longer integration times and thus, only a few traps to be detected as demonstrated before for a scanning DLTS system (Rreitenstein, 1982). but still the RTS technique will be more sensitive and simpler. However, one should remember that this technique works well only when a few traps well separated in space and in energy are present and it becomes very complex (if not impossible) to perform RTS analysis in all other cases. Because of the effective mechanism needed to maintain constant volume of the space charge regions employed by the new technique, the upper limits of trap concentrations that can still be measured with confidence can be very high, comparable to the doping concentration in the channel. This suggestion is based on estimates made for CC-DLTS (Shiau et al., 1987a), which setup has a similar effect on the space charge volume of the test capacitor. Similarly, the same limits should be valid for the new technique as well, if the variation of gate-source and gate-drain capacitors is small compared to the gate-channel capacitance. An attempt was made to operate the CR-DLTS setup in back-gate mode. The gate bias was fixed a1 large negative values (-17 V and -20 V), and the filling pulse was varied. In Fig. 49, we see that a more negative filling pulse (-4 V) reduces the effect of hole trapping and emission and correspondingly, the magnitude of the large negative peak H I . The filling pulse magnitude also has a clear effect on the background level of E3, which we suggest is effect of magnitude variation of an overlapping negative peak denoted as H2. A constant-capacitance (CIC-)DLTS scan was obtained from the measurement of a large implanted channel-substrate p-n junction formed by connecting in parallel all available transistors on the chip, and the results are also presented in Fig. 49. After scaling, this CC-DLTS scan coincides for the most part with the CR-DLTS (2) scan obtained with a less negative filling pulse. The only large deviation is the large peak H 1. These results demonstrate the simultaneous participation of electron and hole traps in capture and emission processes. For CC-DLTS, this situation is identical to measurement of a dual-sided p-n junction, which is difficult to analyze. Although CR-DLTS appears to be similar to CC-DLTS, it has a major advantage. By varying the gate bias, one can control the active volume and total number of electron traps contributing to the DLTS signal. Also, one can vary the magnitude of the filling pulse applied to the substrate and thus control the trap filling on both sides of the
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
0.3 -
MOSFET SO pin x 20 pin T = 7.44 ins
\
-0.3 Sub. CK-DLTS ( I ) Sub. CR-DLTS (2) -0.5 -
9s
~~~
S-B CC-DLTS
~~
I
,
,
I
t
I
\
\
I
I I
‘,v//
x I .I
H2 0.362 I
,
,
,
,
,
,
,
,
,
junction. When these possibilities are combined with the front-gate operation, there are better opportunities for separation of the contribution of the hole and electron traps in the total DLTS signal. As an illustration, we found the activation energies for both the large and the small negative peaks suggested i n Fig. 49. Our results for H1 and H2 are presented in Fig. SO and they agree with those obtained for irradiation generated hole traps in p-Si at +0.22 eV and +O.36 eV above the valence band (Benton and Kimerling, 1982; Tokuda rf ol., 1979; Coffa ef ul., 1997) According to these references, H2 is associated with single positively charged divacancies, and HI is considered to be due to carbon-oxygen complexes.
D. Conclitsions
In conclusion, we have demonstrated how the new and improved DLTS technique can be applied to study defects in the low gate capacitance MOSFETs that were used as charge conversion output transistors in CCD imagers. The technique is sensitive, independent of the transistor size, and does not require measurement of the transconductance or free carrier mobility. This DLTS technique offers extended opportunities to study the trap distribution in the depth of the implanted channel of output transistors in CCD imagers. When the front-gate mode of CR-DLTS operation is combined with the back-gate mode,
96
PLAMEN V. KOLEV AND M. JAMAI, DEEN
50
60
I0
80
90
I00
I/kT [ e V ' I
FIG~IR 50. ~ ! Arrhenius plots for the large negative peak and for suggested smallcr peak using the same data as used for the CR-DLTS (2) spectra in Fig. 49.
there are additional opportunities for separating the contribution of electron and hole traps to the total DLTS signal.
VII. CONSTANT-RESIST,~NCE DLTS IN JUNCTION FIELD-EFFECT TRANSISTORS A . I n t roduction 1. Junction Field-Efect Transistors (JFET)
Despite the early invention of the JFET by Shockley in 1952 and its demonstration one year later (Dacey and Ross, 1953), this type of FET has found limited applications when compared to the MOSFETs or bipolar transistors. The JFETs are used for amplifiers, switches, voltage-controlled resistors, constant current sources, mixers, and oscillators as discrete devices or in the input stage of operational amplifiers (Evans, 1980). The attractive properties of JFETs are high-input impedance, low noise, and negative temperature coefficient at high current levels. These transistors are considered to be very prospective for application as front-end amplifiers in liquid krypton and liquid argon calorimeters for high energy physics experiments (Citterio et d., 1996). Large arrays containing many thousands of JFETs are expected to operate in a high radiation environment in some experiments. Therefore, radiation-induced
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
97
damage is of particular interest in determining the reliability of these transistors. Previously, the effect of neutron and gamma radiation on the dc and noise characteristics of JFETs produced by buried layer monolithic technology was reported (Citterio et d., 199.5, 1996). I n this section, we continue the investigation employing the new CR-DLTS technique. We also demonstrate the new technique with measurements of coininercially available silicon and germanium JFETs that were tested “as received.” B. Tlieor-etic(iIBrickground Figure 5 I shows a cross-section of an n-channel JFET build using buried layer monolithic technology. Details on this technology can be found elsewhere 1996). As indicated, the top and the bottom sides of the gate (Citterio el d., are connected internally. At zero or low gate bias, the source and drain regions are connected together with the conductive channel and form a p-ti junction with the top and bottom gate. When this p-n junction is biased in the reverse direction, the depleted regions extend mostly in the channel area, since the doping concentrations in the gates are much higher than in the channel. The voltage at which the depleted regions on both sides of the channel merge is defined as a pinch-off voltage.
FlCimfi 5 I . Cross-section of a silicon n-channel JFET built by huried layer monolithic technology. The channel thickness 20 was esliniated irom nie;isurenient o f thc pinch-of1 voltage to he about 1 pin. The epitaxial layer thickness was 5 pin. The clashed line indicates where ii conductive channel appears, which connects source and drain and provides access to the gale-channel capacitors shown on both sides of the channel. Gate-source and gate-drain capacitors are also \hewn.
98
PLAMEN V. KOLEV AND M. JAMAL DEEN
Let us consider an n-channel transistor. For a one-sided p-n junction between the gate and the conductive channel, and for uniform channel doping concentration N D , the pinch-off voltage is given by Sze (1983)
where q is the electronic charge, ES is the semiconductor dielectric permittivity, and a is the channel depth (Fig. 51). When electron traps with concentration N T T are present in the channel, then after sufficiently long positive filling pulse is applied to the gate all traps will be filled with electrons. The preceding equation then becomes
where V,(O) denotes the pinch-off voltage at the beginning of a negative emission pulse, for example at moment t = 0. During the emission pulse, the trapped electrons will be emitted, and the concentration of the occupied traps will vary as N T ( t ) = N T T exp(-tlr), (53) where t is the emission time constant for electrons according to ShockleyRead-Hall statistics (Shockley and Read, 1952; Hall, 1952). After a sufficiently long time, all electrons will be emitted from the traps and the pinch-off voltage will be described by Eq. (51). By combining Eqs. (51), (52), and (53), one can find the time variation of the pinch-off voltage
which shows a direct relation between the charge emitted from the traps and the change of the pinch-off voltage. As usual, it is assumed (Chen et al., 198 1; Hawkins and Peaker, 1986; McLarty et al., 1988a, b) that in the vicinity of a DLTS peak the concentration of the ionized donors N D will remain constant. We have designed a system that maintains the forementioned condition and at the same time measures AV,(t) directly. It is important to note that AV,(t) is independent of the transistor dimensions, and only the channel depth a is important for measurements of the trap concentration. In addition, the carrier mobility, which may vary with the temperature, does not have to be taken into account as there is no explicit current measurement in this system.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
99
C. Experimental Results and Discussion
1. Silicon JFETs Two types of Si JFETs were measured. The first type are devices fabricated using buried layer monolithic technology (Citterio et al., 1995, 1996). Several devices are incorporated on the same die with 5- and 7-pm channel lengths and widths ranging from 350- 11400 pm. One set of devices was irradiated with 1 MeV neutrons at a fluence of 4 x lOI4 neutrons/cm2. A second set of devices was left intact for comparison. Only the irradiated devices produced measurable DLTS transients. Figure 5 2 shows selected CR-DLTS spectra with different rate window settings. As explained in Section 111, the rate window is set by subtracting two data points with different time delays (Kolev et al., 1998a). Three peaks indicating electron traps denoted with E l , E2, and E3 are observed. The variation of the peaks is discussed later in this section when we compare the new CRDLTS technique with constant-capacitance and standard capacitance-transient DLTS. Because we record the entire transient, there are many points available to obtain Arrhenius plot data. With a large trap concentration, the signal is strong and no additional time averaging is needed. This allows the measurement of long transients, limited only by our DLTS system hardware to a maximum of 1.3 s. (Kolev et al., 1998a). Arrhenius plots obtained from a single thermal 12 10
8
7 €
I
6
M
2
Q
4
2
0
40
120 200 Temperature [ K ]
280
FIGURE5 2 . CR-DLTS spectra of JFET 54-480 x 7 pm2 with various time constant windows. The magnitude of the defect denoted with E3 is approximately twice that of defect E2.
100
PLAMEN V. KOLEV AND M. JAMAL DEEN
40
70
I00 I/kT
130
I60
reV-11
Ftcirlitr: 53. Arrhenius plots obtained l’rom ii data lile containing I .3 s long transicnts obtained using the new digital signal processing techniques. Fifty percent of the data points ohtainetl lor E3 were removed for clarity. In weak electric tield when field-enhanced emission is reduced, then the corresponding values are El -0.164 eV, E2 -0.225 eV, and E3 -445 eV.
scan are shown in Fig. 53. It is clear that the magnitude variation is relatively weak and does not affect the accuracy of measurement of the activation energy-the data points are fitted well with straight lines in the Arrhenius plots with a maximal standard error of 1.3 MeV. However, this accuracy in finding the activation energy does not indicate the same accuracy of determination of the energy level because of the large error contributed by the field-enhanced emission (Lang, 1974a; Kolev et a/., 1998a; c; Yamasaki et a/., 1979: Vincent et a/., 1979; Couturier et a/., 1989) or by the inherent limitation of the rate window technique to separate defects with similar emission rates (Nolte and Haller, 1987). The traps observed in these Si JFETs are consistent with previous observations (Asom P t al., 1987; Barnes, 1979; Benton, 1990; Benton and Kimerling, 1982; Bosetti e f a/., 1995; Chantre and Kimerling, 1986; Chen and Milnes, 1980; Coffa et d., 1997; Doolittle and Rohatgi, 1992; Evwaraye and Baliga, 1977; Kimerling, 1977; Kimerling et a/., 1981b; Kolev et al., 1 9 9 8 ~ Li ; and Li. 1994; Privitera el a/., 1996; Song et ul., 1986; Su e t a / . , 1990a,b; Tokuda and Usami, 1976; Walker and Sah, 1973; Wang et ( I / . , 1975; Yatnasaki et ( I / . , 1979). Trap El in our notation has the characteristics of an interstitial (i) oxygen-vacancy complex Oi-V often referred to as an A-center (Chen and Milnes, 1980; Li and Li, 1994; Privitera et d., 1996; Su et al.. I990a,b; Tokuda and Usami, 1976; Wang et al., 1975). It is an electron trap situated
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
101
at approximately 0.17 eV (Asom et d . , 1987; Chantre and Kimerling, 1986; Evwaraye and Baliga, 1977; Li and Li, 1994; Song et al., 1986; Su et al., 1990a; Walker and Sah, 1973; Wang et al., 1975) below the conduction band cdge E'., but also has been reported at 0.18 eV (Barnes, 1979; Benton, 1990; Kimerling, 1977; Kimerling et d . , 198 1 b; Yamasaki, 1979) and at 0.16 eV (Chen and Milnes, 1980; Tokuda and Usami, 1976). I n our devices, El appears at 0.161 f0.003 eV, which agrees with some recent results (Bosetti et al., 1995; Coffa et d., 1997; Privitera et NI., 1996). Another defect at the same energy level as the A-center is attributed to a substitutional (s) carbon related to interstitial carbon C,-Ci (Asom et al., 1987). No lowering of the activation energy caused by field-enhanced emission was observed. Several devices were tested from the same set under various bias conditions and employing different DLTS techniques. Indeed, when only the traps close to the edge of the depletion region are populated during the filling pulse, we observed a slight increase of the activation energy to about 0.164 eV. However. the value of 0.17 eV frequently reported in the literature remained outside the 2 MeV margin of the experimental error. Our data processing techniques have been carefully tested (Kolev et NI., 1998b) and do not produce any significant distortion of the recorded transient or error in defining the rate window, which can explain this deviation. We performed detailed analysis of data obtained for both directions of the thermal scan, which also gives less than a 2 MeV difference. This value is also consistent with the previously reported results from noise measurements (Citterio et 01. 1996) of the same samples. Earlier (Kolev et al., 1998a), we measured the A-center level at 0.166 eV in lightly doped (- 1 0 1 2 c t n ~ 'n-Si ) of buried-channel MOSFETs after 1 MeV proton 10"'protons/cm2. bombardment to a fluence of Defect E2 at 0.221 eV is consistent with the double-charged divacancy (VV ) (Asom et ~ l . 1987; , Barnes, 1979; Benton, 1990; Benton and Kimerling, 1982; Bosetti et al., 1995; Chantre and Kimerling, 1986; Chen and Milnes, 1980; Coffa et ol., 1997; Doolittle and Rohatgi, 1992; Evwaraye and Baliga, 1977; Kimerling et al., 1981b; Kolev et d., 1998~ Song ; et al., 1986; Su ct ul., 1990a,b; Tokuda and Usami, 1976; Yamasaki et al., 1979). Doolittle and Rohatgi (1992), suggested that this defect results from the combination of two defects at 0.205 eV and 0.249 eV. Our system allowed us to record 1.3 s long transients, similar to the system described in Doolittle and Rohatgi (1992) but, unlike the results there, our data shows no coexistence of two separate levels around 0.23 eV, even when the rate window was set to 930 ms. This value of 0.222 eV is also close to our earlier results (Citterio et al., 1996; Kolev et nl. 1 9 9 8 ~ )Defect . E3, consistent with the characteristics of a phosphorous-vacancy pair Ps-V, is also known as an E-center (Chantre et d . , I98 I ; Evwaraye and Baliga, 1977; Song et ul.. 1986; Tokuda and Usami, 1976). The measured energy position at 0.433 eV below the
-
102
PLAMEN V . KOLEV A N D M. JAMAL DEEN
conduction band is close to the value of 0.44 eV referenced in the literature (Asom et al., 1987; Barnes, 1979; Benton and Kimerling, 1982; Chantre and Kimerling, 1986; Chen and Milnes, 1980; Li and Li, 1994; Privitera et al., 1996; Song et al., 1986; Su et al., 1990a; Tokuda and Usami, 1976; Yamasaki et al., 1979). Although the accuracy in determination of the activation energy is better at high temperatures, we observed a much larger data spread in the range 0.415-0.45 1 eV, depending on the sample or biasing. This variation probably results from the different relative contributions of two defects with close energy levels-Ps-V pair at 0.44 eV (Asom et al., 1987; Benton and Kimerling, 1982; Chantre and Kimerling, 1986; Li and Li, 1994; Privitera et (11.. 1996; Song et al., 1986; Su et ul., 1990a,b; Tokuda and Usami, 1976), and single charged divacancy (+V-V) at 0.41 eV (Benton, 1990; Bosetti et ul., 1995; Evwaraye and Baliga, 1977; Kimerling, 1977; Kimerling et al., I981 b). Figure 53 shows the capture cross sections determined from the intercept with the vertical axis (Pals, 1974; Schroder, 1990). We include these values in order to complement the signature data of the traps. Appendix F gives the activation energies and capture cross sections (where available) of the defects in radiation damaged n-Si obtained by other researchers. The trap concentration in Fig. 53 was calculated using Eq. (54). Because the accuracy of the concentration measurement strongly depends on the channel depth 2a (Fig. 5 I), we verified the technology information with additional I-V and C-V measurements. Taking into account the total fluence of 4. I0l4 neutrons/cm2 (Citterio ef al., 1995) and possible defect annealing, it is obvious that each accelerated neutron creates more than one trap. The trap concentrations obtained demonstrate a high efficiency of the neutron radiation in creating electrically active centers, which is attributed to the low-temperature at which the samples were irradiated (Citterio et ul., 1995). In Figs. 54 and 55 are shown the CR-DLTS spectra and the Arrhenius plot of another, larger device from the same set -J 1 with 1 1,400 pm channel width and 5 pm channel length. Comparing Figs. 52 and 54, the independence of the signal from the device size is obvious. The second type of silicon JFETs that we use to demonstrate the new CRDLTS technique are commercially available devices 2N5459 and 2N5457 produced by Siliconix Inc.". These are discrete transistors encapsulated in TO-92 plastic packages that we attached with an epoxy compound to a ceramic 24-pin carrier in order to improve the thermal contact with the sample holder in our closed-cycle helium refrigerator. This was the only manipulation prior to the DLTS measurement -the devices were not subjected to any kind of defectgenerating procedure and they have not been used in an electronic circuit. We detected deep levels in only two of a set of several devices, both 2N5459. In " Silicwrix F E 7
Design Cutcilog 1982, Siliconix Inc.. Santa Clara, C A (1981).
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
12
103
-
10 -
8 -
Y
5
6 -
5 4 -
2 -
50
I so 200 Tcrnperature [ K ]
1 00
2.50
300
FIGURE54. CR-DLTS spectra of JFET J I - I 1400 x .5 pm with various time constant windows. Note the similarity with the spectra in Fig. 52
8
-
6
v
c:
#
4
-
2
40
70
100
130
I60
IlkT l e v - ' ]
FIGURE55. Arrhenius plots obtained from the same data file that was used to extract the spectra in Fig. 54.
104
PLAMEN V. KOLEV AND M. JAMAL DEEN
200
1 so
2
I
>“
I00
d
S0
0
SO
100
I so
200
250
300
Temperature [ K ]
F1cmw 56. CR-DLTS spectra of general purpose silicon JFETs 2N5459. These devices were tested “as received” without any intentional damages. Note the y-axis scale and the sensitivity of the measurement.
Fig. 56 the CR-DLTS spectra of these devices are shown displaying two deep levels with low concentrations. As transconductance of the samples is in the range of several milliamperes per volt (which contributes to total feedback gain), we were able to perform very sensitive measurements. The Arrhenius plots of the deep levels are shown in Fig. 57. Notice the overlapping of the data for defect El in both samples. The difference in the slope of the plots for E l is less than 1 MeV. As these devices are from the same distribution set and most likely have the same production history, the coincidence of the fitting lines is not surprising. However, the overlapping of the Arrhenius plot demonstrates the high precision of our DLTS system (Kolev et id., 199Xa) because the data is for two different devices measured at identical conditions. When the source for defect generation is not known. obtaining accurate activation energy and capture cross section is important, but this is only part of the process of trap identification, as outlined in Benton (1990). Without technological data, we limit our comments only to the demonstration of the new CR-DLTS technique as a potential tool for routine defect analysis and possible control in industrial environments. With a combination of current-voltage (I-V) and capacitance-voltage (C-V) ineasurements, and with the aid of the design catalog of Siliconix Inc., the doping level was estimated to be around 5 . 1015cm-3. Then, the trap concentrations were calculated using Eq. (54). One possible application of CR-DLTS is to study the defect distribution in the bulk of the channel, for example, by varying the filling pulse level in order
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
IkT
I05
leV-11
FIGL~RP, 57. Arrlieiiiiis plots o f the traps i n the CC-DLTS spectra on Fig. 56. Note the difference i i i the energy level and capture cross section of El in both samples.
iiiiiior
to selectively populate only fractions of the traps (Lang, 1974a). In Fig. 58, we denionstrate several CR-DLTS scans of sample 2N5459 # I with variation of the magnitude of the filling pulse relative to the pinch-off voltage V,, (about -5.2 V in this case). There is an obvious change in the peak magnitudes vs tilling pulse level, particularly around -3 V. Again, we emphasize more the aspects of possible applications of the new technique rather than the final result of the defect profiling.
2.
Grmiuiiiiini
JFETs
We demonstrate the new technique also with p-channel germanium JFETs TlXM I2 produced by Texas Instrument. As in the previous case, these samples were neither used in an electronic circuit nor damaged with radiation. In Fig. 59 are shown CR-DLTS spectra in one device. Unlike the spectra of the previous devices, the spectra in Fig. 59 show bipolar signals resulting from the measurement of two-sided p-n junctions. We obtained fairly good Arrhenius plots shown in Fig. 60 and from these, the energy levels were found to be 0.12 eV, 0.173 eV, and 0.283 eV above the valence band edge for H I , H2, and H3, respectively, and -0.268 eV for E l below the conduction band. The capture cross sections determined from the intercept with the y-axis were 7 . 4 . 10p15cm', 4 . IO-"cm?, 8 . 10-'3cm', and 7 . IOp'"cm', respectively. However, this data should be treated with caution because lhe results for the hole traps H1, H2 and H3 could be affected by the electron
106
PLAMEN L’. KOLEV AND M. JAMAL DEEN
- I00
c
Si J E T 2 N 5 4 5 9 # I at T = I6
111s
I
-80
-
2.
-60
u
00
5 -40 -20
0 I80 220 Temperature [ K ]
I40
260
FIGIIRE58. CR-DLTS scans with variable filling pulse level. The signal magnitude change with the filling pulse level can be used for defect profling
G e JFET T l X M I2 #3
-200
-1
YI
z [ins]
so
1.2 14.4 29 12
- I00
I
OIJ
5
-50
0
50 50
100
I50
200
250
T e in perat ure [ K ]
FIC~URE 59. CR-DLTS spectra of gerinanium p-channel JFET TIXM12 #3. The sign reversal is probably caused by emission from electron trap inside the gate material.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
107
Ficiu~ri 60. Arrhenius plots o f the gerninnium transistor in Fig. 59. Note the fairly good linearity and small data spread despite the low signal levcl shown in Fig. 59.
trap E l , or by field enhanced emission. This may explain the difference in our results with those obtained in high purity germanium by other researchers (see Blondeel et a/., ( 1 997); Evwardye rt 01, 1979; Haller et ul. ( 1979) and the References therein). Therefore, we limit our discussion only to illustration of the CR-DLTS in general purpose commercially available transistors. From the magnitude of the signal, it can be estimated that the sensitivity of the concentration measurement should be comparable to that of the silicon devices in the 1 0 ' l c m ~ range.
3. Cotiipurison of CR with CC ~ i n dCT-DLTS To explain the similarities and differences between constant-capacitance (CC-) and CR-DLTS, we need to review some specifics of the C-V curves of a JFET. From, we note that there are three capacitors connected in parallel: gate-source, gate-drain and gate-channel capacitors. This model is further complicated from the internal connection of the top and bottom gates. I n Fig. 61 are shown typical C-V curves of a monolithic silicon JFET. Let us consider the possible configurations for C-V measurements. If the p-n junction capacitance of the gate versus source or drain is measured, then we obtain curve similar to curve a). One can repeat this measurement with source and drain connected externally together, in order to eliminate the isolation of the two areas around and below the pinch-off voltage. Curve b) was obtained from this configuration. The source-drain symmetry of the
I08
PLAMEN V. KOLEV AND M. JAMAL DEEN
channel
X0
60
40 (b)
20
0
(a)
I
I
I
0.0
-0.5
-1.0
:
1
-1.5
Bias [ V l
FIGLJRE 61. Capacit~unce-voltage(C-V) CLII-vcs of JFET J I : a) gate to source 01- gate Lo drain capacitance: b) gate to sotirce and drain. which are connected in parallel. The aspect ratio is 11.400 vni 10 5 ~ r n .
device can be verified by multiplying the part of curve a) around and below the pinch-off voltage by a factor of two, and then comparing the result with the corresponding part of curve b). As seen in Fig. 6 I , the measured device is fairly symmetrical. In Sze (1983), the expression for the pinch-off voltage (5 1 ) was derived using the assumption of abrupt edges of the depleted regions extending into the channel from the top and bottom gate (see Fig. 51). When these regions merge, the contact between source and drain no longer exists, and this should lead to a sudden drop in the gate capacitance. However, Fig. 61 shows that this sharp drop in the capacitance extends from -0.8 to - 1.2 V, and this reflects a gradual merging of the ends of the depleted regions (Debye tails). In this region of the gate voltage, the channel is depleted from free current carriers. In Fig. 61, this region can be defined froin the bias point where curve a) and curve b) split to the bias point where curve a) multiplied by 2 deviates sharply from curve b). Above this region, the voltage variation corresponds to variation of the depletion width (and thus, capacitance) of the p-n junctions extending from source to drain along top and bottom gates. Below this “channel depletion” region, in the pinch-off region, there are just two separate capacitors o f the source and drain vs (mainly) the bottom gate. The relative contribution of the gate-channel capacitance can be estimated from the smooth step in the curve b) during the transition from the linear to the pinch-off region. From Fig. 61 it is clear that this capacitance is dominated from the capacitance of the gate-source and gate-drain p-n junctions and the associated edge capacitances.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
109
In constant-capacitance mode of operation, the volume of the depleted region is fixed by selecting the value of the reference capacitor. In order to compare this mode with CR-DLTS, one has to select an appropriate value of the reference capacitor, so the baseline in CC-DLTS (see Fig. 28) during the emission pulse equals that in the CR mode. This will ensure complete depletion of the channel and emission from all charged traps in the channel. However, in CR-DLTS, the baseline drives the transistor in the subthreshold region. Therefore, matching this requirement in CC-DLTS mode effectively eliminates the reaction of the traps inside the channel region-the channel will be completely depleted. As a result, the gate-channel capacitance will be independent of the gate voltage. As the gate-channel capacitance is connected in parallel with the gate-source and gate-drain capacitors, the observed voltage transient in CC-DLTS mode will compensate for the variations in these two capacitors. This means that the CC-DLTS signal will originate from the traps located inside the source and drain depleted regions. In contrast, i n CR-DLTS mode, the voltage transient compensates only the charge emitted from the traps localized inside the channel, because the charge trapped in the source and drain depletion regions cannot affect the pinch-off voltage and the channel conductance. Of course, one can select a value of the reference capacitor convenient to bias the transistor in the linear region (Fig. 61); in this way the traps inside the channel can be measured. However, the existence of a conductive layer in the middle of the channel during the emission pulse will reduce the total number of responding traps. Even if this biasing is matched in CR mode by reducing the value of the reference resistor, and Eq. (54) is appropriately changed, there will still be a difference in the area and volume tested by the two techniques. Thus, an accurate comparison of CR- with CC-DLTS is not possible in this case. Although the two techniques use very similar instrumentation, they rely on entirely different physical mechanisms for compensation of the charges emitted from the traps. Fortunately, both techniques are area independent and, for uniform defect distribution, should ideally produce identical voltage transients. In practice, there will be some difference in the magnitude because of edge capacitance effects- the effective defect concentration will appear different even for a completely uniform defect distribution. More significant differences will appear for small area devices. In CCDLTS, the differential capacitance vs voltage (dC/dV) plays a critical role i n the total gain of the feedback loop as an “internal gain” of the test structure, and it strongly affects SNR. Obviously, small u r w devices will have low dC/dV and, if possible at all, the CC-DLTS measurements will have a severely limited sensitivity. In CR-DLTS however, the parameter corresponding to the “internal gain” of the test structure is the transconductance of the transistor, which depends on the m p c t rotio W/L, and not on the active device
110
PLAMEN V. KOLEV AND M. JAMAL DEEN
area. Thus, unlike the constant-capacitance technique, the constant-resistance technique is completely area independent and allows for very sensitive DLTS measurement of deep-submicron devices. This is demonstrated by sensitive DLTS measurements made on 2 pm x 0.2 pin MOSFETs (Kolev and Deen, 1998). In Fig. 62 are shown several curves demonstrating the magnitude dependence of the DLTS magnitude on the filling pulse level. This is a classical method for obtaining the trap distribution in the depleted region by partially filling and monitoring the emission from only a fraction of the defects (Lang, 1974a; Akita et al., 1993; Deen, 1993b). As seen in Fig. 62, to the first order, the trap E3 is uniformly distributed (Lang, 1974a). For our illustration of the potential for defect profiling using CR-DLTS, more important is the precise overlapping and smooth nature of the curves. For nonuniform trap distribution, the analysis would be complicated by the internal, nonremovable interconnection between the top and bottom gates. Thus, for low-level filling pulses (in our case around -1 V), the traps in the middle of the channel will be first populated (Fig. 51). As the filling pulse levels go to zero, more traps will be charged and the signal magnitude will increase. We see this behavior in Fig. 61 down to zero bias, and slightly in the forward bias direction where the signal saturates. The major difference between CR and CC-DLTS is in the forward direction. In addition, the difference between the two modes of DLTS is more
e-
I
~
~
AAA
AVg CR J5
000
AVg CK J1
AVg CC JS
AAA
AVg CC J1
+++
ACg J1
v
i
C
u" U
->
A0 0
*M I
40 -
:
0.0
*A
0
4,
O
-03
channel depletion
-1
.o
-1.5
Bias [ V ]
FIGURE 62. Profiling curves of J1 ( I 1400 pin x 5 pin) and JS (400 pm x 7 pin). Circles denote CR-DLTS data, triangles denote CC-DLTS, and crosses show the data obtained from CT-DLTS after appropriate magnitude adjustment.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
111
pronounced for J 1 ( 1 1,400 pm x 5 ym’), a device with very large periphery as compared to that of JS (400 pm x 7 pm’). At about - 1.3 V, when the f i l l ing pulse magnitude and the DLTS signal are close to zero, the extrapolated curves cross the x-axis at different points. This corresponds to a difference in the pinch-off voltages of the devices. After appropriate scaling, the magnitude dependence of the standard capacitance-transient DLTS signal completely coincides with the curves obtained with CR- or CC-DLTS. In Fig. 63 are shown CC-DLTS spectra of device 3 1 . The main difference between the results in Figs. 63 and 52 is in the appearance of a partly resolved peak below 200 K. We attribute this peak to a local fluctuation of P,-Ci defect E4 located at 0.29 eV (Asom et d., 1987). In view of the difference in the two modes of operation, we believe that this peak is related to defects concentrated mainly in the source and drain regions. As the CR-DLTS signal originates entirely from the channel region, it may have reduced influence from this type of defect. The activation energy and capture cross sections of the traps are shown in Fig. 64 and are similar to the those obtained by CR-DLTS. Also, it was possible to perform sensitive CC-DLTS measurements using three other transistors in parallel, -53, 34, and 37. The spectra are shown in Fig. 65 and the Arrhenius plots in Fig. 66. Comparing with the results from measurement of a single device J I , the area independence of CC-DLTS mode of operation is demonstrated. The minor differences can be attributed to the different areato-periphery
t
-c
I
40
I20
200 Temperature [ K ]
280
FIGURE 63. CC-DLTS scans of JFET J I . Below 200 K a weak peak E4 appears, but it is dominated by E3. which i s tentatively assigned to a P,-C, complex at -0.29 eV. Note the wide range of rate windows used.
112
PLAMEN V. KOLEV AND M. JAMAL DEEN
40
100 I/kT [ e V - ' ]
70
130
I60
FI(;URE64. Arrhenius plots from CC-DLTS data. The device-measured is J I
40
I20
200 Temperature [ K ]
2x0
Firmiti; 65. CC-DLTS spectra obtained from measurement of three J E T S connected in parallel. Note the similarity with the spectra in Fig. 63.
The thermal dependence of the baseline voltage for one CR-DLTS and two CC-DLTS spectra is shown in Fig. 67. The darker line corresponds to constant-resistance mode. Around 200 K, this darker line displays a step that is related to E3-for a given pulse period and below certain temperature, the time constant of E3 is slow and the traps remain charged all the time.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
40
X0 I00 I k T IcV-l]
60
I20
113
I40
FIGURE 66. Arrhenius plots of the defects i n Fig. 65.
t
Bahelines of JFET J 1 W = I I000 ~ . l m1. = 7 uin
SO
I00
I so 200 Tcmpcraturc [ K ]
2.50
3011
FIGURE 67. Thermal dependence of the baseline voltage is very similar lo the thermal dependence of the pinch-off voltage. The darker line was recorded from a CR-DLTS run, CC-DLTS # I was recorded after adjustment at the room temperature, and CC-DLTS #2 was recorded alter adjustment at 50 K. the low-temperature end of the therinal scan.
This explains the similarity of the curve with thermally stimulated capacitance curves (Walker and Sah, 1973). The remaining two lines are recordings of the baseline voltage during operation in constant-capacitance mode. Line #1 was obtained after the reference capacitor was adjusted at room temperature so the baseline was the same as in CR mode. Note the different effect that carrier
I I4
PLAMEN V . KOLEV AND M. JAMAI, DEEN
freezing has on the CR and CC baseline curves at low temperatures. This effect is more pronounced in the C C curve #2, which was recorded after the adjustment was performed at S O K. One should note that C C curve # I actually follows the temperature variation of the Fermi level in the source and drain depletion regions, while the CR curve is related to the Fermi level variation in the channel region. Both C C curves have slope changes around 80 K that correspond to defect E l , but curve # I has the change associated with E3 at significantly lower temperatures. This may be related to the specific balance between carrier freezing and the shift of the Fermi level inside the source and drain depleted regions with the temperature. Figure 68 compares several CR-DLTS spectra of devices with different aspect ratios. Also included is a CC-DLTS spectrum of three of the devices connected in parallel. The fluctuations in the magnitude are insignificant and can be attributed to variation of the defect concentration. This suggestion is supported by the appearance of a defect E4, which we have assigned to P,-Ci complex (Asom rt a/., 1987). Notice the complete independence of the magnitude of the DLTS signal from the size of the device. This is also true for the constant-capacitance spectrum, which is influenced by the trap E4 in the same way as it appears in the spectrum of 57, one of the devices which was connected for the CC-DLTS measurement. The largest available device in the set J I displays the strongest signal for defect E3, and in the same spectrum, the magnitude of E2 appears almost at 60% as compared to the other CR
SO
I00
I so Ternperature [K]
200
Flcuw 68. Comparison of CR-DLTS spectra of transistors with different siLe and with a CC-DLTS spectrum. The small difference in the magnitude can be attributed to variations in the trap concentrations and to edge capacitance effects. CC-DLTS spectra were obtained from JFETs 53. 54, and 57 connected in parallel.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
1 15
spectra. This can be regarded as evidence for the independence of the DLTS signal magnitude from the device size or the mode chosen for the experiment, CR or CC. The advanced digital signal processing implemented in our system (Kolev rt d.,1998a) allows us to record 1.3 s. long transients with just 208 data points and with 10 ps-resolution at the beginning of the transient. In Fig. 69 are shown several recorded transients at temperatures where the time constant is roughly equal to 1/.5 of the duration of the emission pulse. In this way, we could record most of the decaying curve. Next, we display the traces on a semilogarithmic scale and this allows for immediate checking for nonexponential behavior. As we see in Fig. 69 all the traces deviate from pure exponents during the first 2.5% of the time. This is the reason for magnitude variation in Figs. 52, 54, 63, and 6.5. Less deviation is present in the transient of E l , and this corresponds to reduced variation of the peak magnitude. This nonexponential behavior is frequently attributed to field enhanced emission, distortion caused by large trap concentration, or presence of multiple traps. However, more plausible is the cause suggested in Rockett and Peaker (1981)-variation of the Debye tail both with temperature and bias. In our case this effect is enhanced by the overlapping of the Debye tails from both top and bottom gates. Also, the bias is fixed to relatively low values near the pinch-off voltage. The difference in the magnitude between CR and CC
Dclay time Lms]
FIGLJRE 69. CR and CC traces of JFETs J I and J5 recorded at appropriate peak temperatures. The deviation from pure exponential decay is the rcason for magnitude variation in Figs. 52, 54, 63, and 65. CT-DLTS denotes a capacitance transient trace after appropriate scaling and shifting i n order to be distinguished from thc CR-DLTS 1r;ice. CR-DLTS JS - low was recorded with sinall magnitude of thc tilling pulse referred to the basclinc and x a l e d until the tail coincides with the capacitance transient trace of the same transistor.
1 I6
PLAMEN V. KOLEV AND M . JAMAL DEEN
mode of operation is attributed to the edge capacitance. We find evidence to support this suggestion in the differences between the two modes for JI and J5 -device J5 with smaller periphery has the CC trace closer to CR trace as compared to the much larger device J I . For sensitive capacitance-transient DLTS measurements, the capacitance meter should be operated in its most sensitive range. This requires the quiescent sample capacitance to be balanced by a reference capacitor during the thermal scan (Doolittle and Rohatgi, 1992). Because we did not have a computer-controlled reference capacitor (Doolittle and Rohatgi, 1992), we were unable to perform sensitive capacitance-transient DLTS measurements. However, our reference capacitor for CC-DLTS measurements could be temporarily used for balancing the sample capacitance at a fixed temperature. This allowed us to record the capacitance transient CT DLTS J5 in Fig. 69. After appropriate scaling, it coincided completely with the other transients and this proves that the nonexponential behavior is not related to a particular DLTS mode of operation -they produce identical results. In order to verify the reason for nonexponential behavior, we also recorded a transient denoted as CR-DLTS JS-low (a. u.). It was recorded after a low-level filling pulse, just I80 mV above the base line maintained by the feedback circuit. With this low filling pulse magnitude only those traps localized near the edge of the depletion regions of the top and bottom gates (in the Debye tails) were populated. According to the bias dependence demonstrated in Rockett and Peaker ( 1 98 I), this should increase the nonexponential behavior. Next. we adjusted the tail of the recorded transient by scaling it to coincide with the tail of the capacitance transient, the uppermost trace in Fig. 69. The difference between the two curves during the first 200 ms gives strong support to the idea that the cause for nonexponential behavior (as suggested in Rockett and Peaker (1981) and Zhao et al (1987)) is charge capture in the Debye tail region. In Fig. 70 are shown CR- and CC-DLTS spectra of the germanium JFET. The area of transistor 2N54.59 was too small to perform the constant-capacitance measurement with the required sensitivity. Again, there is a remarkable overlapping of both spectra in Fig. 70, particularly below 120 K . The CR curve also demonstrates much better SNR when compared to the CC curve. Obviously, for small devices, CR-DLTS is superior to CC-DLTS. We have observed an interesting difference between CR- and CC-DLTS spectra at high temperatures when the gates of several monolithic JFETs were connected in parallel. In Figs. 7 1 and 72, CR- and CC-DLTS spectra are shown at high temperatures. Not only is the magnitude substantially different, but the signal sign is also reversed. As the devices were isolated on common, floating substrate, the minority carriers generated at high temperatures could accumulate on the external gate, and this affects the measurement in a similar way
DEVELOPMENT AND APPLICATIONS O F A NEW DLTS METHOD
117
I00
Ge JFET #3 z = 14.4 ins \
_ _ _ ~ ~~
" 30
I
60
,
90 I20 Tcm pcrat ure 1K 1
CK DLTS CC DLTS
,
I
I 50
I80
FIGURE 70. Comparison of CR-DLTS and CC-DLTS spectra of a germanium JFET. Note rhe increased SNR of the CR vs CC spcctrum.
I6
JFETs 33.34 and 57 bottom - CR-DLTS 54
-> 12 -Ew > 4
8
4
0
-0.3 -0.0
280
300 320 Temperature [K]
340
FIGURE7 1 . Effect of minority carrier generation on CR- and CC-DLTS spectra of several devices with gates connected in parallel. CC-DLTS magnitude difference depends on the s i x of thc external device connected to the meusured.
I18
PLAMEN V. KOLEV AND M. JAMAL DEEN 4 4
3
F
E 2
Y
til
> Q
I
-e----
---=:==
0 270
310 Temperature [K]
290
330
FIGLIRE 72. Same experiment as in Fig. 71 hut measuring only the largest available device I'rom the set. Compare the magnitudes with thesc in Fig. 71.
to the situation in CC-DLTS of a MOS-capacitor (Johnson et al., 1978). In this case, the ratio of the external gate area to that of the measured device was important. While this is obviously a system effect, it still clearly demonstrates the different physical nature of CR- and CC-DLTS methods despite the apparent similarity of their principle of operation. When the external gate was disconnected, the strong CC-DLTS signal disappeared. Arrhenius plots of these peaks produced almost the same activation energy of the E-center (in our notation E3), about 0.66 eV above the valence band.
D. Conclusions We have applied the new constant-resistance DLTS technique to both custommade and commercially available silicon and germanium JFETs. We have demonstrated that CR-DLTS is a reliable and very sensitive tool for investigation of electrically active point defects located in the channel of the JFET. The trap concentration was calculated without knowledge of the transconductance g,,, or the mobility p of the test device. The new technique was compared in detail to constant-capacitance DLTS. The CR-DLTS signal magnitude was demonstrated to be independent of the device size and it was shown that, for small size devices, CR-DLTS is more sensitive than CC-DLTS. Comparisons have also been made with the standard, capacitance-transient DLTS. The observed nonexponential behavior was attributed to the complex generationrecombination processes at the edge of the depletion regions in the Debye tail. In addition, the possibility for defect profiling using CR-DLTS was illustrated.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
I19
VIII. CONCISJSIONS A N D AREAS ITJK FUTURE RESEARCH A. Conclusiorzs
We introduced in this chapter a new approach to signal processing in digital DLTS systems. The problems of signal recovery from noise and efficient data storage are addressed separately from the transient signal analysis. As a result of this approach, an improved digital averaging scheme for DLTS signal recovery from noise and transient data storage has been proposed. We have shown that the combined action of two complementary digital averaging techniques can improve DLTS digital signal processing. Pseudo-logarithmic time averaging is efficient in reducing the number of processed data points and improving the SNR for the high frequency noise components and long delay times. We demonstrated that the normalized errors in the magnitude measurements introduced by this type of averaging remain below 1% for pure logarithmic, and below 0.1% for the case when the logarithmic averaging intervals are further divided into five or more equal parts. Continuous time averaging is well suited for improving the overall SNR, and for continuous display and processing of data and more efficient use of the computer resources. The described combination of hardware and software tools for the implementation of these techniques supplies continuously fresh data and does not require any synchronization with the main computer program. Compared to other digital DLTS systems, this new approach offers an improved short delay time resolution, improved SNR, and more efficient data storage. At the same time it offers real-time observation of essentially noisefree transients and, like analog systems, a real-time display of the DLTS scan. The real-time display means that the displayed transient and DLTS scan can be updated on the computer screen after each pulse even for a very large number of averaged pulses and the result can be predicted well before the acquisition process reaches this number. The proposed techniques allow one to combine the powerful transient analysis of the digital DLTS methods with the sensitivity and convenience of analog methods. The described averaging and data reduction techniques are intended primarily for DLTS data processing but the same principles can be useful for many other physical experiments involving transient data analysis. We also described setup configurations for CC- and CR-DLTS. Details of the novel feed-back circuit, which solves the most critical technical problems for implementation of CC-DLTS were also given. After slight modification, the same circuit makes CR-DLTS possible. We also discussed the most important technical parameters this feedback circuit should have in order to allow for fast and sensitive DLTS measurements. When these requirements are met, the speed and sensitivity of CC-DLTS are almost the same as in the
120
PLAMEN V. KOLEV AND M. JAMAL DEEN
conventional constant-voltage capacitance-transient DLTS. Also, guidelines for using the feedback circuit were presented in order to provide a practical guidance for running CC- and CR-DLTS experiments. The application of the feedback circuit was demonstrated with recorded traces and experimental results of interface trap density measurements. We presented a new variation of a DLTS technique convenient for measurement of submicron field-effect transistors, where standard capacitance DLTS cannot be used. Constant-resistance DLTS is similar to the conductance DLTS, but it is more sensitive and does not require simultaneous measurement of the transconductance or surface mobility for calculation of the trap concentrations. In addition, the DLTS signal is largely independent of transistor size, thus allowing measurements of very small-size transistors. The proposed technique is not restricted to metal-oxide-semiconductor field-effect transistors (MOSFETs), but can also be used to study other field-effect transistors. In another application, CR-DLTS was demonstrated with measurements of radiation-induced traps in buried channel MOSFETs, which are used as CCD output amplifiers. The unique structure of these devices offers extended opportunities for studying the space distribution of the radiation-induced defects. Still, most of the results are also valid for ordinary depletion-mode MOSFETs. In addition, we show a variation of the CR-DLTS technique using back-gate driving, a technique which is applicable for studying the channel-substrate p-n junction and the results are then compared with those obtained from constantcapacitance (CC-)DLTS measurements. Complementary measurement using front-gate and back-gate operation of CR-DLTS can help to resolve the ambiguities usually associated with DLTS measurements of symmetrical p-n diodes. The CR-DLTS have been successfully applied to study virgin and radiationdamaged junction field-effect transistors (JFETs). We have described results from three groups of devices: commercially available discrete silicon JFETs, virgin and exposed to high-level neutron radiation silicon JFETs custom-made by using a monolithic technology, and commercially available discrete germanium p-channel JFETs. The CR-DLTS was found to be a simple, very sensitive, and area independent technique that is well suited for measurement of a wide range of deep-level concentrations. Comparisons have been made with the CC-DLTS and standard capacitance DLTS. In addition, possibilities for defect profiling in the channel have been demonstrated.
B. Areas ,f.r Future Research I . Development of the System Hurdware Development. There are several areas for improvement. First, it is useful to replace the low-noise amplifier with a programmable-gain amplifier.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
121
This would allow for the program to select convenient gain according to the signal and to avoid overload and loss of data. Another, simpler improvement is to connect a second reference resistor and to alternate between the two resistors. Thus, when measuring transistors with high conductance during the tilling pulse, it would be possible to avoid overloading of the capacitance meter. As a result, the recovery time would be decreased several times and the fast portion of the transient would not be distorted. Other important improvements would be the possibility to control the values of the reference resistor and capacitor from the computer. This would offer opportunities for automated spatial profiling of the defects. I t would also be convenient to build a small, automated relay matrix i n order to change automatically the configuration from CC-DLTS to CR-DLTS or to change the measured device. A lixture to apply short tilling pulses is useful for implementation of alternative techniques for determination of the capture cross section as explained i n Sect. 11.
Sojtware Develoimimt. Hardware development can be efficient only when i t is complemented by corresponding software development. This is especially true when the goal is to make available gain adjustments during thermal scans, or setup, bias and test device changes between the scans. This includes routines for estimating the prevailing magnitude and for gain adjustments based o n these estimates. These changes would require a new format of recorded data files, which would allow for automated recognition of gain changes. Major improvements can be done in the transient data analysis. With superior raw data, all known digital techniques for transient analysis should work better. Many of them could be implemented to operate in real time and to display the trap parameters during the measurement. The references in Sect. 111 provide a good starting point.
2. Ap~diccrtioris The new CR-DLTS method can be tested with new types of FETs such as silicon-on-insulator (Sol), silicon-on-sapphire (SOS), 111-V MESFETs, or high-electron mobility transistors (HEMT). Hot-carrier degradation in subniicron MOSFETs can also be studied. By independent variation of the sourcesubstrate and drain-substrate biases, the spatial distribution of the interface and bulk traps can be investigated. Unlike the capacitance measurements, CRDLTS uses the channel conductance as a probe. It is also very tolerant to the changes i n the gain of the test structure and allows measurement of very small devices. Therefore, it makes it possible to reroute the CR-DLTS test signal to n single device in a large set of devices connected together, provided that the channel conductance of each device can be tested independently. Combined with other functional tests, this may allow us to identify the type of defect causing malfunction of a particular device out of many similar devices.
I22
PLAMEN V. KOLEV AND M. JAMAL DEEN
ACKNOWLEDGMENTS
P. V. Kolev expresses his deep gratitude to the following persons -B. Z. Antov, A. Y. Mladenov, and P. K. Vitanov (Institute of Microelectronics, Sofia, Bulgaria) and R. Attanasov (Faculty of Physics, University of Sofia, Bulgaria) who inspired, guided and supported his interest in the DLTS techniques during 1983-1990. The support received in 1992-1993 from P. Clauws and F. Cardon (Dept. of Solid-state Physics and Crystallography, Royal University of Gent, Belgium) is also greatly appreciated. M. J. Deen is deeply indebted to the students and researchers in his research group for their valuable contributions, comments, and suggestions in his research work on characterization and parameter extraction of semiconductor materials and devices. Both authors are pleased to acknowledge the fruitful collaboration of M. Citterio and J. Kierstead (Brookhaven National Laboratory, USA), T. Hardy and R. Murowinski (National Research Council, Victoria B.C.), and N. Alberding (Dept. of Physics, Simon Fraser University). We also express our gratitude for the comments and suggestions of E. Haller (University of California at Berkeley, LISA) which improved the final version of the manuscript. We also thank the members of our Integrated Devices and Circuits Research Group and the staff members of the School of Engineering Science at SFU-B. Woods and C. Cheng are thanked for their support, comments and assistance during the course of this research. This work was supported by: Crystar Inc.; the Science Council of British Columbia; Micronet; the Federal Center of Excellence in Microelectronics; the Natural Sciences and Engineering Research Council (NSERC) of Canada; Canadian Microelectronics Corporation; and Simon Fraser University. REFERENCES Abele, J . C.. Kremer, R. E., and Blakemore, J. S. (1987). Transient photoconductivity measurenients i n semi-insulating GaAs. 11. A digital approach, Jout-. A/)/?/,P/7y.s.,62: 2432. Akita, C., Fujirnoto, M., and Ito, K. ( I 993). Isothermal capacitance transient spectroscopy of grain-boundary interfacial states in Bi-doped SrTiOi ceramics, Jour. A””/. Phys., 74: 2669. Anand, S.. Subramanian. S., and Arora, B. M. (1992). Use of low-frequency capacitance in deep. level transient spectroscopy ineasurernents to reduce series resistance effects, Joirr-. A / ) / J ~Phys., 72: 3535. A.rada, K. and Sugano, T . ( 1982). Simple niicroconiputer-based apparatus for combined DLTSC-V measurement, Rev. P i . bi.rrrum., 53: 1001. Asom, M. T., Benton, J. L., Sauer, R.. and Kimerling, L. C. (1987). Interstitial defect reactions i n silicon, A p / . Phys. Lett.. 51: 256. Atanassov, R. D. ( 1983). Spectrum analyzer of exponentially decaying transients, A/Jp/. Phys. Lrrt., 43: 1361.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
I23
Austin, R. H., Beeson, K. W., Chan, S. S., Debrunner, P. G.. Downing. R., Eisenstein, L., Frauenfelder. H.. and Norlund. T. M. (1976). Transient iinalyxr with logarithmic time base. Rei*. sci. /ustru/f~.,58: 44s. Awadelkarim, 0. 0..Wenian, H., Svcnsson, B. G.. and Lindstroni. J . L. (1986). Deep-level transient spectroscopy and photoluminescence studies of clectron-irradiated Cmchralski silicon, Jorrr. Appl. Phys., 60: 1974. Balland. J . C., Zielinger, J. P.. Nogucl. C., and Tapiero, M. (1986a). Investigation of deep levels in high-resistivity hulk materials by photo-induced current transient spectroscopy: I . Review and analysis of sonie basic problems. Jorrr. Phys. / I . ; Appl. Plrw.. 19: 57. Balland, J . C., Zielinger, J. P., Tapiero, M., Gross, J . G., and Noguet, C. (1986b). Investigation of deep levels in high-resistivity hulk materials by photo-induced current transient spectroscopy: II. Evaluation of various signal processing methods, Jour. Phys. D.: Appl. Plzy.~., 19: 71. Barnes, C. E., ( 1979). Cama-induced trapping levels in Si with and without gold doping. Jour. Electron. Mur.. 8: 437. Henton. J . L., ( 1990). Characterization of defects in semiconductors by deep-level transient spectroscopy, Joio-. C y s r . Growth. 106: I 16. Benton, J . L. and Kinierling, L. C. ( 1982). Capacitance transienl spectroscopy of trace contaniSOC.,129: 2098. ination in silicon, Jour. Elrc~troc~/reru. l Anrr/y,si.s ,fiw rhr Ikvington, P. K. and Robinson, D. K . ( 1992). In IMtr Rt,dirc.tion ~ i t Error P/iy.siccil Scirnces, second edition, New York: McGraw-Hill. See alao Reid, C. E. and Passin, T. B. (1992). In Si,qnrrl Procv.v.\irrg in C. part II. New York: Wiley. Blondeel. A,, Clauws, P., and Vyncke, D. ( 1997). Optical deep level transient spectroscopy of minority carrier traps i n n-type high-purity germanium, ./our. A/>/>[,Phys., 81: 6767. Blood, P. and Orton, J . W. ( 1992). In Tlir Elrcrrictrl Chrrrcrc,teri~trriorr( I f Srniit~ondrrc~tor.stMrr,jority Curriers n r i d Electron Sttrtrs, N. H. March, Ed., New York: Academic Press. Borsuk. J . A. and Swanson. R. M. ( 1980). Current transient spectroscopy: A high-sensitivity DLTS system, IEEE Trms. Elec./nm Derl., ED-27: 22 17. Borsuk, J . A. and Swanson, R. M. ( I98 I ). Capture-cross-section determination by transicnt-current trap-tilling experiments, Jnirr. AppI. Phys., 52: 6704. I3osetti. M.. Croitoru. N., Furetta, C., Leroy. C., Pensotti, S.. Rancoita, P. G., Rattaggi. M., Redaelli. M.. Rimatti. M., and Scidman, A. ( 1995). DLTS measurements of energetic levels. generated in silicon detectors, Nu(././n.s/t-. U U L ~Meth. A , 361: 461. Breitcnstein. 0. ( 1982). A capacitance meter of high absolute sensitivity suitable for scanning DLTS application. P l q Brotherton, S. D. and Bicknell. J . (1982). Measurement o f minority carrier capture cross sections and application to gold and platinum i n silicon. , / c J u ~ . Appl. Pkys., 53: 1543. Hroniatowski. A., Blosse, A.. Srivastava, P. C., and Bourgoin, J . C. (1983). Transient capacitance nieasurcinents on resistive samples. Jour. Appl. Phys., 54: 2907. Brotherton, S. D. (1983). The width of the non-steady state transition region i n deep level i n purity measurements, Solid-Sttrte Elec~roir.,26: 987. Burd. M. R. and Braunstein. R. (1988). Deep levels in senii-insulating liquid encapsulated Czochralhki-grown GaAs, Jorrr. P h y Chtwi. Sol., 49: 73. Buehler, M. G. and Phillips, W. E. (1976). A study of the gold acceptor in a silicon p+-njunction and an n-type MOS capacitor by thermally slimulated current and capacitance measurements, Solid-Stutc Elrcrron., 19: 777. Carhallcs, J. C. and Lebailly, J . ( 1968). Trapping analysis in gallium arsenide, .Solid-Stcrrc C o w miin., 6: 167. Chappell, T. 1. and Ransom. C. M. (1984). Modifications to the Boonton 72BD capacitance meter for deep-level transient spectroscopy applications. Rev. Sci. Instrum., 55: 200.
124
PLAMEN V. KOLEV AND M. JAMAL DEEN
Chang. C. Y.. HSLI,W. C., Uang, C. M., Fang, Y. K.. and Liu, W. C. (1984). A simple and low-cost personal computer-based automatic deep-level transient spectroscopy system lor semiconductor devices analysis, f E E E Trtiiis. Insrriiiii. Mc~is.,IM-33: 259. Chantre. A. and Kimerling, L. C. (1986). Configurationally multistable defect i n silicon. App/. P//!..\. Lctt., 48: 1000. Chantre, A,. Vincent, G., and Bois, D. (1981). Deep-level optical spectroscopy in GaAs. Phys. Re\,., B23: 5335. Chen. X . Y., Deen, M. J., Yan, Z. X . , and Schroter. M. (1998a). Effects of emitter dimensions on low frequency noise in double polisilicon BJTs, Elwtroii. Lert., 34: 2 19. Chen. C. H..Deen, M. J., Yan, Z. X., Schroter, M., and Enz, C. (1998b). High frequency noise in MOSFETs I1 - experiments, Sdit/-St(irtJ Elecrroii., Vol 42; no. 1 I , pp. 2083- 2092 (Nov 1998). Chen. J. W. KO. R. J., Brzezinski, D. W., Forbes, I,.. and Dell’Oca, C. J. (1981). Bulk t r a p i n silicon-on-sapphire by conductance DLTS, IEEE Trms. Elrctrorr Deij., ED-28: 299. Chen, J. W. and Milnes, A. G. (1980). Energy levels in silicon, In Aniiiiti/ Revirvv of Mrrrcrit i / Scirwce. R. A. Huggins, R. H. Bube, and D. A. Vermilyea. Eds., Palo Alto. CA: Aiiiiutrl R(,iievt~.s,10: 157- 228. Chen, M. C.. Lang, D. V., Dautremont-Smith, W. C.. Sergent. A. M . , and Harbison, J . P. (1984). , 790. Eflccts of leakage current oil dcep level transient spectroscopy, Appl. Phys. L ~ t t .44: Chretien, O., Apetz. R., Vescan, I-., Souifi, A., and Liith, H. (1995). Thermal hole emission from Si/Si I -.,Ge,/Si quantum wells b,y deep-level transient spectroscopy, Jour. A/)/)/. Pliys., 78: 5439. Citterio, M., Kierstead, J., Rescia, S., Manfrecti, P. F., and Speziali. V. (1995). Low noise monolothic Si-JFETs ior operation in the 90-300 K range and in high radiation environments. Proc. . s y r i ~ ) o . s i r r r i i o i i Low Tern/). Elcctroirics m i d High Temp. Sii/Jer[-otidLrcrii,iry,C. L. Clayes, s. I . Raider. R . K. Kirshman, and W. D. Brown, Eds.. The Electrochemical Society Proceedings Sei-ies, PV 95-9: 4lX, Pennington. NJ. Citterio, M., Kierstead. J., Rescia, S.. and Radeka, V. (1996). Radiation effects on Si-JFET devices for front-end electronics, IEEE rims. Niicl. S i . , 43: 1576. Coli‘a, S., Privitera. V., Priolo. F., Libertino. S., and Mannino, G. (1997). Depth profile\ of vacancy-and interstitial-type defects in MeV implanted Si, Jour. AppL Phys., 81: 1639. Collet. M. G. (1975). An experimental method to analy/,e trapping centers in silicon at very low concentrations. Solirl-Store E/ectroti., 18: 1077. Couturier, G., Thabti, A,, and Barriere. A. S. (1989). The baseline problem in DLTS technique, Rei3. Phyv. App/iqfi6, 24: 243. Crowell, C. R. and Alipanahi, S. (198 I ) . Transient distortion and 17th order filtering in deep level transienl spectroscopy (D’ILTS), Solitl-Store Elocrroii., 24: 25. Dacey. G. C. and Ross, 1. M. (1953). Unipolar “Field-Efiect” transistor. Proc. IRE, 41: 970. Day. D. S . . Tsai. M. Y., Streetman, B. G., and Lang, D,V. (1979b). Deep-level transient specrroscopy: System effects and data analysis, Joirr. Appl. Phys., 50: 5093. Day. D. S.. Helix. M. J.. Hess, K . , and Streetman, B. G. (107%). Deep level transient spectroscopy for diodes with large leakage currents, Rela. Sci. Itisrrurn.. 50: 1571. Deen, M . J., (1993a). Low-frequency noise and excess currents due to trap-assisted tunneling in double barrier resonant tunneling diodes, 23-rd Eiiropem Solicl-Stcirr Device Re.serrrch Cord: (ESSDRC’93). Grenoble, France, 355. Decn, M. J. ( 1993b). Low-frequency noise as a characterization tool for InP- and GaAs-based double-barrier resonant tunnelling diodes, Mar. Sci. ciird Eng.. B20: 207. Deen, M. J . ( 1 9 9 3 ~ ) .Low frequency noise and excess currents due to trap-assisted tunneling i n double barrier resoilant tunneling diodes 23rd Euro/wt/n Solid-Sttitr Drvicv Re.seorch Coiifiretrce (ESSDERC ’93). Grenoble, France, 355.
DEVELOPMENT AND APPLICATIONS O F A NEW DLTS METHOD
125
Deen, M. J . ( 1993d). D. C. and low frequency noise characteristics of resonant tunneling diodes, Proc. Sytnp. on LOW Teniperritiirr Electronics titid High 7iwiperutirre Siiprn~orrc/uctivity P v 93-22: 191, S. Raider, C. Claeys. D. Foty. and T. Kawai, Eds., The Electrochemical Society Proceedings Series, Pennington, NJ. Deen, M. J.. Ilowski, J. I., and Yang, P. (199Sa). Low frequency noise in polysilicon-emitter bipolar junction transistors, JoLrr. App1. Phys., 77: 6278. Deen, M. J . . Ilowski, J. I., and Yang, P. (199%). The effect of emitter geometry and device processing on the low frequency noise of polysilicon emitter npn bipolar transistors, Proc. 13th /tit. Cot$ 011 Noise iti Phg.sicd Sgstrms t i r i d //f”iic~turrtions, p. 4.54, Bareikis and R. Katilius. Eds., Singapore: World Scientitic. Deen, M. J . and Quon, C. ( 1991). Charncterization of hot-carrier effects in short channel NMOS devices using low frequency noise measurenients. 7/11C‘otIf /n.sir/trtingFi1tn.s 0 1 1 Sf~trii-c~ot?r~ucror.s (INFOS 91), Liverpool, United Kingclorn, 2-5 April 1991, W. Eccleston and M. Uren, Eds., p. 295. United Kingdom, 1OP Publishing Ltd.. Deen. M. J . and Raychaudhuri, A. (1994). Charge pumping, low frequency noise and floating gate characteriLation techniques of Si02 gate insuliitors in MOSFETs, Proc. S y i p . oti Silicwti Nitride citid Silicoti Dioiide Thiri / t l . s d t i r i t l g Filtns, Thc Electrochemical Society Procecdings Series, PV 94-16: 375, V. J . Kapoor and W. D. Brown. Eds., Pennington, NJ. Dcen. M. J., Kouniyantsev, S. and Orchard-Web, J. (1998). Low (iequency noise in heavily doped polysilicon thin film resistors. Joirr. V t i c , . Sci. r i d Techti. B. vol B 16, no. 4. pp. 1881 - 1884 (July/August 1998) Dccn, M. J. and Yan, Z. X. (1990). A new method lor measuring the threshold voltage lor small geometry MOSFETs from subthreshold contluction. Solitl-Srtirr Elertr., 33: 503. Deen, M. J . and Zhu, Y. (1993). I/f noise in n-channel MOSFETs at high temperatures, AIP Cotif: Proc. 2x5 - Q~itititutii I / f Noist~ntirl Other Low Freqirrticy F1irctutrriott.s i t i Electronic Ilei~icr.~, p. 165, P. H. Handel and A. L. Chung, Eds., New York: AIP Press. DcJule, R. Y.. Hause, M. A,, Ruby. D. S., and Stillman, G . E. (1985). Constant capacitance DLTS circuit for measuring high purity semiconductors, Solicl-Stare Electr., 28: 639. Dinowski. K. and Pi6ro. Z. (1987). Noise propcrties of analog correlators with exponentially 58: 2 185. weighted average, Rev. Sci. /rr.~tr.ut~i., Doan, M.. Z. Buffet, and Deen, M. J . (1997). The effect5 of a.c. and d.c. reverse bias stress on the d.c. forward bias and low frequency noise characteristics of polysilicon emitter bipolar f1evice.s Dire t o Device junction transistors, Proc. of’the Sgriip. on /lie Drgrticltitioti of’E1et~tmtiic~ ~ ~ / J C ‘ ~ ( L ~ ;( O I St l w e / / U S CrJ~.St(i/itle ( I d ~ ~ O l ’ r . S . S - / t l ~ l / ~ t . e d The Electrochemical Society Proceedings Series. PV 94-1: 235. H. J . Qucssnier. J. g. K. E. Bean, T. J . Shaffner, and H. Tsuya, Etls., Pennington, NJ. Doolittle, W. A. and Rohatgi, A. ( 1992). A novcl computer based pseudo-logarithmic capacitance/conductance DLTS system designed for transicnt analysis. Rev. Sci. /ti.striuti., 63: 5733. Driver. M. C. and Wright. G. T. ( 1963). Thermal release of trapped space charge in solids. /’roc. Phg,\. .SOC.( f A J t l t k J t f ) . 81: 141. Dubecky, F. ( lYX9). Characterization of deep delccts in semi-insulating GaAs by capacitance and conductance DLTS with electrical and optical excitations, Mat. Sci. For~rt71,38-41: 1301. Eades, W. D. and Swanson, R. M . (1984a). Improvements in the determination of interface state density using deep level transient spectroscopy, ./our. App/. P1iy.s.. 56: 1744. Eades, W. D. and Swanson, R. M. (I984b). Determination of the capture cross section and degeneracy factor of Si-SiO? interface states, Appl. Ph Engstroin, 0. and Alm, A. (1983). Energy concepts o nductor interface traps, Joirr. App1. Phgs., 54: 5240. Evans. A. D. ( 1980). In De.yigtiing with Fie/tl-I?ffecrTrmsistors, New York: McGraw-Hill. Evwaraye. A. 0. and Baliga, B. J . ( 1977). The dominant recornbination cenkrs i n elcctronirradiated semiconductors devices, Joiir. /i/ectroc/ietfi.soc., 124: 9 13.
126
PLAMEN V. KOLEV AND M. JAMAL DEEN
Evwaraye, A. 0..Hall, R. N., and Soltys. T. J . (1979). DLTS measurements o f trapping deifects in high purity germanium, lEEE Trims. Nu(,/. Sci.. NS-26: 27 I . Farmer, J. W., Lamp, C. D., and Meese, J . M. (1982). Charge transient spectroscopy. AppI. Phys. Lett.. 41: 1063. Prugrcirnniiri,q for the XO.786, Fernandez, J. N. and Ashley, R. ( 1990). In A.s.senihly L.~rrijiii~i~qr p. 182. New York: McGraw-Hill. Fourches, N. ( 1991). Deep level transient spectroscopy base on conductance transients, A/J/J/. P h y . Lett., 58: 364. Gardner, R. G. (1987). Improved fast integrator for small ion currents (IfA-IinA), Rev. Sci. Iristrrrrri.. 63: 1540. Golio. J. M.. Trew, R. J., Maracas, G. N., and Lefkvre, H. (1984). A modeling technique for characterizing ioii-implanted material tising C-V and DLTS data, Solid-State Electron., 27: 367. Goto C., Yanagisawa. S.. Wada, 0.. and Takanashi, H. (1973). Determination ot deep-level energy and density profiles in inhoinogeneous semiconductors. Appl. Phys. Lett., 23: I SO. Giitz, W., Johnson, N. M., and Akasaki, 1. (1994). Deep level defects in n-type GaN. App/, Phys. Lett.. 65: 463. Gotr, W., Johnson, N. M., and Bour, D. P. (1996a). Deep levcl defects in Mg-doped. p-type GaN grown by metalorganic chemical vapor deposition, A p p / . Pkys. Lrtt.. 68: 3470. Gdtz. W . , Johnson, N. M., and Imler, W. (1996b). Activation energies of Si donors in GaN. Appl. P/iy.s. Loti., 68: 3 144. Gijtz, W., Johnson, N. M., and Street. R. A. ( 1 9 9 6 ~ ) .Activation of acceptors in Mg-doped GaN . Lett., 68: 667. grown by metalorganic chemical vapor deposition, A / J / J /Phyx Graff, K. ( 1995). In Mrral ltuprri,firsit7 Si/icoti-Drr%x I;rrhriccition, Hans-Joachim Queisser, Ed., Berlin: Springer Series i n Material Science. Grimmeins, H. G. (1977). Deep level impurities in semiconductors, In Aiiriutrl RPi>iewof’Mrrrr~ritrl Sciencr, R. A . Huggins, R. H. Eiuhe, and R. W. Roberts. Eds., Palo Alto, CA: Atzriucil Ker~iews. 7: 341. Groeaeneken, G., Maes, H. A , , Beltrin. N.. and De Keersmaecker, R. F. (1984). A reliable approach to charge-pumping measurements in MOS transistor, lEEE Trwis. E/rt.rroti. net,.. ED-31: 42. Hacke, P. and Okushi, H. (1997). Characterization of the dominant midgap levels in Si-doped GaN by optical-isothermal capacitance transient spectroscopy, Appl. Phys. Lett., 71: 524. Haddara, H., Elewa, M. T., and Cristoloveanu, S. (1993). Measurements and modeling of drain current DLTS in enhancement SO1 MOSFETs. Microelectrot7. Jour., 24: 647. Hall, R. N. ( 1952). Electron-hole recombination in germanium, Phys. RCIJ..87: 387. Haller, E. E., Li, P. P., Huhbard, G. S., and Hansen. W. L. (1979). Deep level transient spectroscopy of high purity germanium diodesldetectors, lEEE Trrrris. N u d . Sci.. NS-26: 265. Hamilton B.. Peakcr, A. R., and Wight, D. R. ( 1979). Deep-state-controlled minority-carrier lifetime i n n-type gallium phosphitle, ./our. A p p / . Phys., 50: 6373. Hanak. T. R., Ahrenkiel, R. K., Dunlavy, D. J., Bakry, A. M., and Timmons, M. L. (1990). A new method to analyze niultiexponential transients for deep-level transient apectroscopy, Jour. Appl. PIiXs., 67: 4126. Hardy, T.. Murowinski, R.. and Deen, M. J. (1998a). Charge transfcr efficiency in proton damaged CCD’s, lEEE ~ o ~ I .Ns/ K. / . .%;., NS-45(2): Hardy, T., Murowinski, R., and Dren, M. J. (1998h). The effect of proton radiation on the charge transfer efficiency in CCD’s, P roc. ESO Workshop - Clpricril Lktrctors jor Astroriotriy, J . W. Beltic and P. Amico, Eds., Kluwer ASSL Series, pp. 223-230. New York: Kluwer. Hascgawil, F. (1985). A new method (the three-point method) of determining transient time constants and its application to DLTS. J f i p n . Joirr. Appl. Pkvs., 24: 1356.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
127
Hawkins, I. D. and Peaker, A. R. (1986). Capacitance and conductance deep-level transient spectroscopy in field-effect transistors, A/J/J/.Phys. Lett., 48: 227. Henini, M., Tuck, B., and Paull, C. J. (1985). A microcomputer-based deep level transient spectroscopy (DLTS) system, Jo/rr. Phys. Ec Sci. /ri.strrrrn., 18: 926. Henry, P. M., Meese. J. M., Farmer, J. W.. and Lamp, C. D. (1985). Frequency-scanned deep/. 57: 628. level transient spectroscopy, Jour. A ~ / J Phy.~., Hcnry. C. H., Kukimoto, H., Miller. G. la., and Merritt, F. R. (1973). Photocapicitance studies of the oxygen donor in Gap. 11. Capture cross sections, Phy.s. Rev., 137: 2499. Heydenreich. J. and Breitenstein, 0. ( 1986). Characterization of defects in semiconductors by combined application of SEM (EBIC) and SDLTS. Jour. Mic.rosc.. 141: 129. H.jalmarson. H. P and Samara, G. A. ( 1 988). An improved deep level transient spectroscopy method, Joirr. App/. Phys., 63: 180 I. HolAein K., Pensl, G., Schulz, M., and Stolz, P. (1986). Fast computer-controlled deep level transient spectroscopy system for versatile applications in semiconductors, Rev, Sci. /ri.s/rurii., 57: 1373. Hurtes, C., Boulou. M., Mitonneau, A,, and Bois, D. (1978). Deep-level spectroscopy i n highresistivity materials, A p / ~ l Pkyx . Lm., 32: 82 I . Iheda, K. and Takaoka, H. ( 1982). Deep-level Fourier spectroscopy for determination of deeplevel parameters. Jcip. Joirr. Appl. Phys., 21: 462. Ikossi-Anastasiou, K. and Roenker. K. P. (19x7). Retinements in the method of moments for analysis of multiexponential capacitance transients in deep-level transient spectroscopy. Jorrr. A / J ~ / Phyx., . 61: 182. Istratov, A. A. and Vyvenko, 0. F. (199s). Deep centers in cadmium sulfide crystals: New method for comparing DLTS data found by different investigators, Sernicondirctor.s, 29: 340. lack. M. D.. Pack, R. C., and Henriksen, J. (1980). A computer-controlled deep-level transient spectroscopy system for semiconductor process control, IEEE Trciris. Electron Dev., ED-27: 2226. Jnggi, B. and Deen. M. J. (1995). Low temperature operations of silicon charge coupled devices for imaging applications, Pmc. of the Syr7ip. on Low Ewrperotrrre Elrctroriics r i r i d High E w prrcrture Sii/)ercorirluc.r;i~ir,v, S . I . Raider, R. Kirshinan, H. Hayakawa, and H. Ohta, Eds., The Electrochemical Society Proceedings Series. PV 88-9: 579, Pennington, NJ. Jaros, M. ( 1982). Deep Leivls in S~,r)iicoridicc,tor.s, A. Hilger, Ed.. Bristol, UK: The Institute of Physics. Jcrvis, T. R., Teter. W. M., Cole, T., and Dunlavy, D. (1982). Deep level transient spectroscopy using CAMAC components, R n , . Sci. /ri,srriwi., 53: I 160. Johnson. N. M. (1979). Energy-resolved DLTS nieasureiiient of interface states in MIS structures, A/)/)/. Phys. Lett., 34: 802. Johnson, N. M. ( 1982). Measurement of semiconductor-insulator interface states by constantcapacitance. deep-level transient spectroscopy. Jorrr. b2ic.. ScY. Techno/.. 21: 303. Johnson, N. M., Bartelink, D. I., Gold, R. B.. and Gibbons, J. F. (1979). Constant-capacitance DLTS measurement of defect density protiles i n semiconductors. Jnrrr. App/. Phys.. SO: 4824. Johnson, N. M., Bartelink, D. J., and Schulz, M. (1978). Transient capacitance measurements of electronic states at the Si-SiOl interface, P roc. hi t . Corzj: PIzys. o f ' ,502rrricl Its /rl/
[email protected]. p. 42 I , New York: Perganion Press. Johnson, N. M. and Herring, C. (1991). Migration of the H? complex and its rclation to H- i n n-type silicon. Phys. Rev. B, 43: 14297. Jones, B. K. (1994). Low-frequency noise spectroscopy, / Trms. Electrori L h , . , ED-41: 2 188. Kachwalla, Z . and Miller, D. J. (1987). Transient spectroscopy using the Hall effect, A /) /) /. /'/iys. Lett.. SO: 1438.
128
PLAMEN V. KOLEV AND M. JAMAL DEEN
Kandiah. K. ( 1994). Random telegraph signal currents and low-frequency noise in junction tield effect transistors, IEEE Trtrns. Elccrron Dev., ED-41: 2006. Karasyuk, V. A., Thewalt, M. L. W., An. S., and Lithtowlers, E. C. (1994). Intrinsic splitting of the acceptor ground state in silicon, Phys. Rev. Lett., 73: 2340. Katsube, T., Kakiinoto, K., and Ikoma, T. (1981). Temperature and energy dependences of capture cross sections at surface states in Si metal-oxide-semiconductor diodes measured by deep level transient spectroscopy, Jour. Appl. Phys., 52: 3504 3508. Kim. M . C., Song, K. H., and Park, S. J. (1993). Isothermal capacitance transient spectroscopy study on trap levels in polycrystalline SnOz ceramics, Jour. Mat. Kes., 8: 1368. Kim, H.. Blouke, M. M., and Heidtmann, D. L. (1990). Effects of transistor geometry on CCD output sensitivity, In Chcrrge-Coupled Deisict>.sarid Solid-State Opticcrl Sen.tors, M. Blouke, Ed., Proc. SPIE, 1242: 195. Kimerling, L. C. (1976). New developments in defect studies in semiconductors, IEEE Trcrm Nucl. Sci., NS-23: 1497. Kimerling. L. C. ( 1977). Defect states in electron bombarded silicon: Capacitance transient analyses, In Rrrdiation Effrcts in Semit~oiiiiuc.tor.v(Bristol, UK: The Institute of Physics, 1976). Institute of Physics Conf. Series, 31. p. 22 I . Kimerling, L. C., Benton, J. L., and Rubin. J. J . (1981a). Transition metal impurities i n silicon, I n Dcficts crnd Rtrdiariori Efects in Sernicorze/ui.tor.s (Bristol, UK: The Institute of Physics, 1980). Institute of Physics Conf. Series, 59: p. 217. Kimerling, L. C., Blood, P., and Gibson, W. M. (1981b). Defect states in proton-bombarded silicon at T<300 K, I n Defect.s and Rnditrrion Effei,ts in Seniiconductors (Bristol, UK: The Institute of Physics, Bristol, 1980), Institute of Physics Conf. Series, 59: p. 273. Kirchner, P. D., Schaff, W. J.. Maracas, G. N., Eastman, L. F., Chappell, T. I., and Ransom, S. M . ( 198 I ) . The analysis of exponential and nonexponential transients in deep-level transient spectroscopy, Jour. Appl. Phys., 52: 6462. Kirov. K. 1. and Radev, K. B. (1981). A simple charge-based DLTS technique, Pllgs. Stut. SO/., 63a, 71 I . Kirton, M. J . and Uren, M. J. (1986). Capture and emission kinetics of individual Si:SiOz interface states, Appl. Phys. Lett., 48: 1270. Kiyota, H., Okushi, H., and Okano, K. (1992). Isothermal capacitance transient spectroacopy measurements on polycrystalline diamond/liydrogenated amorphous silicon hcteroj unctions, Appl. PIiys. Lett., 61: 1808. Klausmann. E. (198 I ) . Si/SiOz properties investigated by the CC-DLTS method, Proc. Set. I n / . Con/: INFOS'8f Erltrngerz, p. 169, Springer Ser., 7: Heidelberg: Springer. Klausmann, E., Fahrner W. R., and Brauning. D. ( 1986). In In.s/cihilitie.s i n Silimn Deijiet,,s, Chap. 7, Amsterdam: North-Holland. Kohn, W. (1957). Shallow impurity states in silicon and germanium, .So/id-Stcrre Phgs., 5: 257. Kolev, P. ( 1992). An improved feedback for constant-capacitance voltage-transient measurements, Solici-Stcite Electron., 35. 387. Kolcv. P. and Deen, M. J . ( 1998). Constant-resistance deep-level transient spectroscopy i n aubmicron metal-oxide-seinicoiiductor field-effect transistors, Jour. Appl. P h y x , 83: 820. Kolev. P.. Deen, M. J., and Alberding, N. ( 1997a). Digital averaging and recording o f DLTS transient signals. Proc. of the S;>W/J. on Ditrgnostic Technic/ue.r,fix Srrnic,nnductor Matt,ricrI.s e n d I h i c e s . P. Rai-Choudhury, J . Benton, D. Schroder, and T. J. Shaffner, Eds., The Electrochemical Society Proceedings Series, PV 97-12: 377. Pennington, NJ. Kolev, P.. Dcen. M. J., and Alberding, N. (1998a). Averaging and recording of digital DLTS transient signals, Rev. %i. Insrrum., Vol 69, no. 6, pp. 2464-2474 (June 1998) Kolev, P., Deen, M. J., Kierstead, J., and Citterio. M. (l998b). Constant-resistance deep-level transient spectroscopy in Si and Ge junction field-effect transistors, IEEE Trcrns. Elecfro,~Drv. Vol. 46, no. I , pp. 204-213 (January 1999)
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
129
Kolev, P., Deen, M. J., Liu, H. C.. Li, J. M.. Buchanan, M., and Wasilevski. Z. R. ( 1997b). Asymmetry in the dark current low frequency noise behavior of quantum well infrared photodetectors (QWIPs), Pro(,. 14th hit. Con/: on Noi.so in Pl7y.ricd S w r m m t / / / f Fllrcmrriorls, p. 5 1 1 . C. Claeys and E. Simoen. Eds.. Singapore: World Scientific. . LISC of constant-resistaiice Kolev. P., Hardy, T., Deen, M. J., and Murowinski. R. ( 1 9 9 7 ~ ) The DLTS to study proton radiation damages in CCD output MOSFETs, Pro(,. of the S W I ~ J o. / i Diujiriostic Techriiqice.s,fi~rSeniicoriductor Mn/rriu/.vui7d Dcwices, P. Rai-Choudhury, J . Benlon. 1). Schroder, and T. J . Shaffner, Eds.. The Electrochemical Society Proceedings Series, PV 97-12: 389. Pennington, NJ. . use of constaiit-resistaiice Kolev, P., Hardy, T., Deen. M. J., nnd Murowinski, R. ( 1 9 9 8 ~ )The DLTS to study proton radiation damages in CCD output MOSFETs. Jour. Electrocl/etii. Soc. Vol. 145, no. 9, pp. 3258-3264 (September I99X) Komarov, B. A,, Korshunov, F. P., and Murin, L. I . (1994). Role of field eflects i n a determination of the concentration of thermal donors i n silicon by the DLTS method. Seniic.orrt/ucto,-, 28: 305. Kosai, K. ( 1982). External generation of gate delays in boxcar integrator -application to deep level transient spectroscopy, Rev. Sci. Instrrrm., 53: 210. Krciner, R. F., Arikan, M. C., Ahele, J. C., and Blakernore. J. S. (1987). Transient photoconductivity nieiisureinents in scini-insulating GaAs. I. An analog approach, Jour. App/. fhy.v., 62: 2424. Kuhn. M. (1970). A quasi-static technique for MOS C-V and surfxe state mmsurenients. SoliclSttrtr Elrctr.. 13: 873. Kumar. V. and lyer. S. B. ( 1983). Characterization o t surface states in MOS capacitors, Phys. Srtii. S o l . , 76a: 637. Kwan, W. S. and Deen, M. J. (1998). Hot-carrier effects on the scattering parameters of LDD MOSFETs. Joirr. k.Sci. rrritl Tech., 16: X55. Kwan, W. S.. Raychaudhuri, A. and Deen, M. J. (1996). Location and quantification of the interface detects in stressed MOSFETs using a coinbination of Hoating gate and transcOiidtictaiice techniques, and charge pumping methods, Can. Jour. Phys., 74: SI 67. Kwan, W. S., Raychaudhuri. A. Deen. M. J., and King, M. I . H. (1997). Hot-carriei- defect length propagation in LDD NMOSFETs, Proc. oj'Forrrth SW/J. un Silica,? Nitride urrd Silicon Dioxide Thin /mitlotirig Filn7.7, M. J . Deen. W. D. Brown, S. I . Raider, and K. B. Sundaram, Eds.. The Electrochemical Society Proceedings Series, PV98-10. 389. Pennington, NJ. Lagovski, J.. Ecielman, P.. and Moravski, A. ( 1992). Non-contact deep level transient spectroscopy (DLTS) baaed oil surface photovoltage, S m i c ~ o t i d .Sci. Techno/., 7: A 2 I I , Lang, D. V.. (1989). Defects and future semiconductor devices, Mur. Sci. F ' o r i ~ ~38: i , 13. Lang, D. V. (1974a). Deep level transient spectroscopy: A new method to charactcriLe traps in semiconductors, Jour. App/. P/7ys.. 45: 3023. Lang, D. V. (1974h). Fast capacitance transient apparatus: Application to ZnO and 0 centers in gap p-n junctions. Jour. Appl. P/iy.s., 45: 3014. Lang, D. V. ( 1979). Space-charge spectroscopy in semiconductors, I n Tupics it, Applied Physics: Tlrcrmo//y Stirnuluted Krlu-ution itr Solids, 37: p. 93, P. Briunlich, Ed., Berlin: Springer. Lang, D. V., Grimmeiss. H. G.. Meyer. E.. and Jaros.. M. (1980). Complex nature of gold-related deep levels in silicon, Pltys. Rev., B22: 3917. 1.angfeId. R. (1987). A new method of analysis of DLTS-spectra. A p p / . Phys.. A44: 107. Lefivre, H. and Schulz. M. (1977a). Douhle correlation technique (DDLTS) for the analysis , ~ . Truns. Elecwon Devices. ED-24: of deep level protiles in GaAs and G a A s o , ~ , P , ~lEEE 913.
LefPvre, H . and Schulz, M. (1977b). Double correlation technique (DDLTS) for the analysis of deep level profiles in semiconductors, Appl. Plys., 12: 45.
130
PLAMEN V. KOLEV AND M. JAMAL DEEN
Li, C. and Li. Z . (1994). Characterization of high fiuence neutron induced defect levels in high resistivity silicon detectors using a laser deep level transient spectroscopy (L-DLTS), Nircl. Insrnrtri. arid Meth. in P/I?~S. Rcs.. A 342: 137. Li. M. F. and Sah. C. T. (1982a). New techniques of capacitance-voltage measurements of semiconductor junctions, Solid-State Elecrrorl., 25: 95. Li, M. F. and Sah, C. T. (lY82bb). A new method for the determination of dopant and trap I.s. Dev.. ED-29: 306. concentration profiles in semiconductors, lEEE T ~ N ~Electron Li. X. M. and Deen, M. J. ( 1992). Determination of the interface state density in MOSFETs using the spatial profiling charge pumping tcchnique, Solid-State Electron., 35: 1059. Li. X . M. and Deen, M. J. (1990). A new charge pumping method for the determination of spatial interface state density distribution in MOSFETs, IEEE hit. Electrori DPV.Meetiri,g (IEDM’ 90) Techriical Digest. San Francisco, CA, 85. Losson, E., Dmowski, K., Lepley, B. (1996). Practical methods to improve DLTS data smoothing, Phys. Stcit. Sol. A , 156: 413. Ma, C. L. F., Deen, M. J., and Hardy, R. H. S. (1992). Excess currents due to trap-assisted t u n neling in double barrier resonant tunneling diodes, Curl. Jour. Phys., 70: 993. Ma, C. L. F., Deen. M. J., and Taroff, L. (1994). Dark current noise characteristics of separate absorption, grading, charge, and multiplication in lnP/InGaAs avalanche photodiodcs. 2 4 t h Eirropeun Solid-Strite Device Resecirch Cor2.f: (ESSDRC’ 94). Edinburg, Scotland, 727. Mangelsdorf, P. C., Jr. ( 1959). Ccinvenient plot for exponential functions with unknown asymptotes, Jour. Apj~l.Phyx, 30: 442. Martin, J. M. (1 995). Deep-level transient spectroscopy and electrical characterization o f ioniniplanted p-n junctions into undoped InP. Jour. Appl. Pkys., 78: 5325. Martin, P. A., Streetman, B. G. and Hess, K. (1981). Electric tield-enhanced emission from nonCoulombic traps i n semiconductors, Jour. Appl. Phys., 52: 7409. McLarty, P. K., Ioannou, D. E., and Colinge, J. P. (1988a). Bulk traps in ultrathin SlMOX MOSFETs by current DLTS, IEEE Electroti Dev. Left., EDL-10: 545. McLarty, P. K., loannou, D. E., and Hughes, H. L. (198Xb). Deep states in silicon-on-insulator substrates prepared by oxygen implantation using current deep level transient spectroscopy, Appl. Phys. Lett., 53: 87 1. Mego, T. J . (1986). Improved feedback charge method for quasistatic CV measurements i n semi-conductors. Rev. Sci. Instrirm.. 57: 2798. Menegheso, G., Haddab, Y., and Zanoni, E. (1996). Drain current DLTS analysis of recoverable and permanent degradation effects in AIGaAs/GaAs and AIGaAs/lnGaAs HEMT’s. Microelrc.rrotiic..sand Relinhility, 36: 1895. Milnes, A. G. ( 1973). In Deep Irnpurities iri Semiconductors. New York: Wiley-Interscience. Milnes, A. G. (1983). Impurity and defect levels (experimental) in gallium arsenide. In Ad1mrice.t irz Elec~trorricsu t i d Electron Plzvsic.~,61: 63, P. W. Hawkes, Ed.. Orlando. FL: Academic Press. Miller, G. L., ( 1972). A feedback method for invcstigating carrier distributions in semiconductors, IEEE Triins. Electroil Dev., ED-19: I 103. Miller. G. L.. Lang. D. V., and Kimerling, L. C. (1977). Capacitance transient spectroscopy. I n A m . Rev. Mat. Sci., Annual Reviews, Palo Alto, CA, 7: 377. Miller, G. L., Raniirez, J. V., and Robinson, D. A . H. (1975). A correlation method for seiniconductor transient signal measurements, Jour. Appl. P/I!Js.,46: 2638. Miller, M. D. and Patterson, D. R. (1977). Transient capacitance deep level spectrometry instrumentation, Rev. Sci. Iristrurn., 48: 237. Misrachi, S., Pcaker, A. R., and Hamilton. B. (1980). A high sensitivity bridge for thc measurement of deep states in semiconductors, ./our. Phys. E; Sc.i. Insrrunr., 13: 1055. Monernar. B. and Grimmeiss. H. G. (1982). Optical characterization of deep energy levels in semiconductors, Progr. CTsr. Growrh. Churuct., 5 : 47.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
131
Mooney, P. M. (1983). Photo-deep level transient spectroscopy: A technique to study deep levels in heavily compensated semiconductors. Jorrr. A/)/>/. Phys., 54:208. Morimoto, J.. Fudamoto. M.. Tahira, K., Kida, T., Kato. S..and Miyakawa. T. (1987). Spectral analysis of dccp-level transient spectroscopy (SADLTS), Jq). Joirr. Appl. Phys., 26: 1634. Morimoto, J., Fundamoto. M., Tashiro, S., Arai, M., Miyakawa, T., and Bube, R. H. (1988). Spectral analysis of deep-level transient spectroscopy (SADLTS) of deep centers in CdTe single crystals, J t r p Jour. A/>/>/.Phv.s., 27: 2256. Murowinski, R. G. and Deen, M. J . ( 1993). Low temperature characteristics of large array charge coupled devices, Proc,. of the S y i p . on /,ON fiwyirrciture E/rctrorric..v triid High Empertiturc. .Sul)(,r"(~"c/icc.ti~,j/~, C. Clayes, S. Raider, T. Foty. and T. Kawai, Eds.. The Electrochemical Society Proceedings Series, PV 93-22: 209. Pennington. NJ. Murowinski. R. G. and Deen, M. J. (1904). CCDs f o r the Lyman FUSE mission, /EEE/lEE Itit. nsfer efficiency in low-temMui-owinski, R. G., Deen, M. J . , and Hardy, perature charge coupled devices, Proc. of the Syrnp. on Low Trmprntrurr Elrctrouics tirid High 7i,inprnirrrrr S~rp~~rc~)iiduc/ii.irv. C. Clayes, S. Raider, R. Kirshman, and W. Brown, Eds., The 8lectrochemical Society Proceedings Series, PV 95-9: 299, Pennington, NJ. (1995). Murowinski, R. G., Linzhuang, G., and Dcen. M. J. (1993a). Effects of space radiation damage and temperature on CCD noise for the Lyman IWSE mission, / L E E f i r i n s . Nu(./. Sci., NS 40(3): 288. Murowinski, R. G., Linzhuang, G.. and Deen, M. J . (1993b). Temperature dependence of radiWorksliop o r 1 Clitrrge Couplet/ Ilrvicc~s, ation-induced trapping phenomena in CCDs. / Waterloo, Ontario. Murray. F., Carin. R.. and Bogdanski, P. ( 1986). Dctcrminalion of high-density interface state parameters in metal-insula(,~r-scniiconductorstructures by deep-level transient spectroscopy, ./our. Appl. Phy.r., 60: 3592. Ng, A,, Deen, M. J., and Ilovski. J . (1992). Determination of the trap energy levels and emitter area dependence of noise in poly-emitter bipolar ,junction transistors froin generation, 949. recombination noise spectra. C'ciri. Joui-. P k ~ s . 70: Np, A., Deen, M. J., Li, X. M., and Berolo, 0. (19911. Cryogenic d.c. and low frequency noise characteristics of AIAs/GaAs/AIAs resonant tunneling diodes, Proc. of the Synip. or1 Low Ewipertrtrtre Elecrroriic Dri O p r r d o u , D. Foty, N . Saks, S. Raider. and G. Oleszek, Eds., -14: 131. Penningrton, NJ. The Electrochemical Society Proceedings Series, Nicolian. E. H. and Brews, J. R. (1982). In MOS P niid Tklinology, New York; Wiley. Nolte, D. D. and Haller, E. E. (1987). Optimimtion of the energy resolution of deep level transient spectroscopy, Jour. Appl. Phy.s.. 62: 900. Noras. J. M. ( 1982). Thermal filling effects of constant capacitance transient spectroscopy, Phgs. Srcit. Sol., 69a: K209. Okushi, H. and Tokumaru, Y. ( 19x0). Isothermal capacitance transient spectroscopy. J t i p . ./our. Appl. Phys., 19: L335. Okushi, H. and Tokumaru, Y. (1981). A modulated DLTS method for large signal analysis (C'-DLTS), ./up. Joui-. AppL P/i.v.v., 20: LJS, Okuyama, M., Takakura, H., and Hamakawa, Y. ( 1983). Fourier-transformation analysis of deeplevel transient signals in semiconductors, Solirl-Sttrte E/rc/rotz.,26: 689. (jzder, S., Atilgan, I., and Katircioglu, B. (1996). New approach to bias scan DLTS for rapid evaluation of interface states i n MOS structures, Solirf-Strite Electron., 39: 1507. Pals, J. A. (1974). Properties of Au, Pt, Pd and Rh levels in silicon measured with a constant capacitance lechnique. So/irf-SfureE ~ f ~ T r o i f17: . , I 1 39. Pantelides, S. T. ( 1992). In / k p Ceriters iii Srrriiei)iidirc/or.s,Sokartes T. Pantelidcs, Ed., Second edition, Switzerland: Gordon and Breach Scicncc Publishers S. A.
132
PLAMEN V. KOLEV AND M. JAMAL DEEN
e Pearton, S. J., Corbett, J. W., and Stavola, M. (1992). In H y d r o p i in C ~ . m / / i nSernii.orrcliri~tor.\, Hans-Joachim Queisser, Ed., Springer Series in Material Science, Berlin: Springer. Petroff, P. M. and Lang. D. V. (1977). A new spectroscopic technique for imaging the spatial distribution of non-radiative defects in a scanning transmission electron microscope. AIJIJI. Phvs. LPII.,31: 60. Phillips. W. E. and Lowney, J. R. (1983). Analysis of nonexponential transient capacitance in . 54: 2786. (1983). silicon diodes heavily doped with platinum. Jorrr. A ~ / J / Phys., Pons. D. (1984). Accurate determination of the free carrier capture kinetics of deep traps by space-charge methods, Jour. AIJ/J/. Phys., 55: 3644. Privitern. V., Coffa, S., Priolo, F., Larsen, K. K.. and Mannino. G. ( I 996). Rooin-temperature migration and interaction of ion beam generated defects, Al~pl.Phys. Lett., 68: 3422. Rancour. D. P. ( 1995). A sensitive technique for detecting low concentrations of deep level t r a p : Current source deep-level transient spectroscopy, Jour. Appl. P h y . . 78: 543 I . Raychaudhuri, A,, Deen, M. J.. King, M. I. H., and Kwan, W. (1996a). Features and mechanisms of the saturating hot-carrier degradation in LDD MOSFETs, IEEE T,-nns. Elrc.tron D w . , ED-43: I 114. Raychaudhuri, A,, Deen, M. J., King, M. I. H., and Kwan. W. (l996b). A simple method to qualily a LDD structure against the early mode hot-carrier degradation. IEEE Trms. Electron D ~ w .ED-43: , 110. Raychaudhuri, A,, Kwan, W., Deen, M. J., and King. M. I . H. (1995). Propagation of defcct? i n hot-carrier degradation of LDD MOSFETs-from the early mode to the late mode. Pro(,. of 1995 lnt. S ~ ~ i i ~ ~Dei.. ~ i i Rex d . S y i p . (ESSDEHS’95), Charlottesville. VA, 78 I . Rhee, .I.K. and Bhattacharya. P. I(.(1982). Photoinduced current transient spectroscopy of senii. 53: 4247. insulating InP:Fe and InP:Cr, Jour. A / J ~ / Phys., Rockett, P. I . and Peaker, A. R. (1981). Temperature dependence of peak heights in deep-level transient spectroscopy, Electrori. Lett., 17: 838. Rockett. P. I . and Peaker. A. R. ( 1982). Calculation of deep state profilcs from transient capacitance data, Appl. Phys. Lert., 40: 957. Rohatgi, A,, Davis, J . R., Hopkius, R. H.. and McMullin, P. G. (1983). A study of grown-in impurities in silicon by dccp-level transient spectroscopy, So/id-.%re E/ectron., 26: 1039 Rohatgi. A. (199 I ). A review of selected techniques for characterizing radiation-induced defects in solar cells, Solnr Cells, 31: 379. Rohatgi, A., Weber. E. R . . and Kimerling, L. C. (1993). Opportunities in silicon photovoltaics and defect control in photovoltaic materials, J(Jlrt‘. Electron. Mot., 22: 65. Royer, J. P. ( 1987). In Hcrntfbook of’ Sofrwirre & Hrrrdri~irc./ntt~rfirc.ing.f;w IRM PC. p. 206. Englewood Cliffs, NJ: Prentice-Hall. Sah, C. T., Rosier, L. L., and Forbes, L. (1969). Direct observation ofthe multiplicity of impurity charge states in semiconductors from low-tempemture high-frequency-capacitance, Appl. Phyx Lert., 15: 316. Sah, C. T., Forbes. L., Rosier, L. L,., and Tasch. Jr., A. F. (1970). Thermal and optical emission and capture rates and cross sections of electrons and holes at imperfection centers in semiconductors from photo and dark junction current and capacitance experiments, Solid-Sttrre E/ec,rron., 13: 759. Sah, C. T . ( 1976). Bulk and interface imperfections in semiconductors. Solid-Stiite EIectrm.. 19: 975. Schott, J . T., DeAngelis. H. M., and Drevinsky, P. J. (1980). Capacitance transient spectra of processing- and radiation-induced defects in silicon solar cells, Jour. Electmrr. Mtrt., 9: 419. Schroder, D. K. ( 1990). In Sun?icoiic/irctor Mtrterirrl ernel Device CI7~rcrcterization.Chap. 7, New York: Wiley.
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
133
Schulz, M. and Johnson, N. M. ( 1977). Transient capacitance measurements of hole emission from interface states in MOS structures, Appl. Plrys. Lett., 31: 622. Scliulz, M. and Johnson, N. M. (1978). Evidence for rnultiphonon emission from interface states in MOS structures. Solid Sttitr Co/nr~., 25: 48 I ; errata: Solid Strife Comm., 26: 126i. Shaban, E. H. ( 1996). Non-exponential capacitance transient in DLTS measurements, SolicI-Srtrrc, E/Pcrro/7., 39: 32 I . Shapiro, F. R., Senttiria, S. D., and Adler. D. (1984). The use of linear predictive modeling for the analysis of transients from experiments on semiconductor defects, Jour. AppL Phy.s.. 55: 3453. Shiau. J. J., Fahrenbruch, A. L.. and Bube, R. H. ( 1987a). A method to improve the speed and wisitivity of constant capacitance voltage transient measurements Solid-Srtite Elem-on., 30: 513. Shiau, J. J . , Fahrenbruch, A. L., and Bube, R. H. (l987b). Determination of dopant and deep imperfection protiles in p-type CdTe by improved constant capacitance voltage transient measurements, Jorrr. Appl. Pl7y.s.. 61: 1340. Shockley, W. and Read, T. W. ( 1952). Statistics of recombinations of holes and electrons, Phy.s. I < ~ 87: ~ ~ 83s. ~.. Shockley, W. ( 1952). An unipolar “Field-Elfect” transistor. Proc.. IRE, 40: 1365. Soncini, G., Tripodi, G. B.. and Zen, M. ( 1960). Surface state churacterization at the oxide-silicon and nitroxide-silicon interfaces, Microelec,trorl. ./our. 21: 23. Song, L. W.. Benson, B. W.. and Watkins. G. D. (1986). New vacancy-related defects in n-type \ilicon, Phy.s. Rev., B 33: 1452. Stannard, J. F. (1978). Transient capacitancc in GaAs and InP MOS capacitors. ./our. Vtrc. Sci. Tech~zol.,15: I SO8. Stannard. J. F.. Day, H. M., Bark, M. L.. and Lee, S . H. (1981). Spectroscopic line fitting to IXTS data, Solid-Stcitr Elc~c.troir.,24: 1009, Stecle. J . M. ( 1986). Hasegawa’s three point method for determining transient time constant, . I ( ~ I I IJorir. . Appl. Phy.~. , 25: I 136- I 137. Stievenard, D. and Vuillauiiie, D. ( 1986). Profiling of defects using deep level transient speciroscopy, Jour. Appl. Pliys., 60: 973. Srievenard. D., Lmnoo, M., and Bourgoin, J. C. ( 1985). Transient capacitance spectroscopy i n heavily compensated semiconductors. Solitl-Sttire Elrc~lroir.,28: 485 -492. Stolt, L. and Bohlin, K. ( 1985). Deep-level transient spectroscopy measurements tising high Schottky barriers, SolitI-Srtrtr Elrc~rorr..28: I2 IS. Su. Z. and Farmer, J. W. (1990). Single-scan deep-level transient spectroscopy, ./our. Appl. Phys., 68: 4068. Su, Z., Husain, A,, and Fanner, J. W. ( 1990aj. Determination of oxygen in silicon by ratio of A center to E center, Jour. Appl. Phys., 67: 1903. Su. Z., Wald, P. G.. and Farmer, J. W. (1990b). Role of boron in multistable carbon-related defect in silicon, .loiir. Appl. I’hys., 67: 4249. Suno, K., Yoshino, J., and Miyakawa, T. ( 1997). Differential isothernial capacitance transient spectroscopy for the studies of clecp levels in semiconductors, Rev. Sri. frt,\tr/mi.. 68: 21 16. Svensson, B. G., Jagadish, C.. and Williams, J. S. (l993j. Generation o f point defects i n crystalline silicon by MeV heavy Ions: Dose rate and temperature dependence. P h y . ~Re\). Lptt., 71: 1860. Svensson. B. G., Jagadish, C., Hallen, A,, and Lalita, J. (19%). Point defects in MeV ion-imMcdr. Phy.v. R t x , planted silicon studied by deep-level transient apeciroscopy, N u d . /n.s/ru~~r. B 106: 183. Svensson, B. G . and Willander, M. ( 1987). Generation of divacancies in silicon irradiated by 2-MeV electrons: Depth and dose dependence, Joirr. Appl. Phys., 62: 2758.
134
PLAMEN V. KOLEV AND M. JAMAL DEEN
Sze, S. M. (1983). In Physics of Semiconductor device.^, Second Edition, New York: WileyInterscience. Tabib-Azar. M. ( 1989). Deep-lebel spectroscopy of Cr-doped GaAs using nondestructive acoustoelectric voltage measurements. Solid-State Electron.. 32: 149. Tabib-Azar, M., Park, N., and Das, P. ( 1987). Transient type-inversion of semi-insulating GaAs under strong illumination at 140 K detected by the nondestructive surface-acoustic-wave technique, Solirl-Statc, Electron., 30: 705. Thurzo, T. and Dubecky, F. (1985). On the role of the back contact in DLTS experiments with Schottky diodes, Phys. Stut. SO/., 8Ya: 693. Thurzo. T. and Lalinsky, T. (1982). On the relation between charge- and current-based DLTS, P/ry.s. Stcrt. Sol. ((1). 71: K83. Tin, C. C.. Teh, C. K., and Weichman, F. L. (1987). Photoinduced transient spectroscopy and photoluminescence studies of copper contaminated liquid-encapsulated Czochralski-grown semi-insulating GaAs. Jour. Appl. Phys., 62: 2329. Tokuda, Y., Shimizu, N., and Usami, A. (1979). Studies of neutron-produced defects in silicon by deep-level transient spectroscopy, Japm. Jour. Appl. Phy.~.,18: 309. Tokuda. Y. and Usami, A. (1976). Investigation of neutron-produced defects in silicon by transconductance measurements of junction field-effect transistors, Jour. Appl. Phys. 47: 4952. Tokuda, Y. and Usami, A. (1983 ). Current DLTS with a bipolar rectangular weighting function for neutron-irradiated p-type Si, Jcipoii. Jour. Appl. Phys.. 22: 37 I . Tomokage, H., Nikashima, H.. and Hashimoto. K. (1982). Note on the analysis of DLTS and , 67. C2-DLTS, Jup. Joirr. Appl. P h y . ~ .21: Tredwell, T. J. and Viswanathan, C. R. (1980). Determination of interface-state parameters in a MOS capacitor by DLTS, Solirl-Strite Electron., 23: I 171. Tsividis. Y. P. (1987). In O p e r ~ ~ i ocrnd n Modeling of /he MOS Tramistor, p. 223, New York: McCraw-Hill. Valcur, B. ( 1978). Analysis of tirne-dependent fluorescence experiments by the method of modulating functions with special attention to pulse fluoroinetry, Chem. PIrys., 30: XS. van der Ziel, A. (1986). I n Noi.w in Solid-Sture Devices mid Circxit.v, New York: Wiley. Vincent, G., Chantre, A,, and Boas, D. (1979). Electric field effect on the thermal emission of traps in semiconductor junctions, Joirr. Appl, Phys., 50: 5484. Vitanov, P. and Kolev. P. (1986) Method for studying the interface states of MOS transistors using transient capacitance technique, P roceetling.~oJ /lie Internntional School of Phy.siccrl P ro/il~wr,\in Microelectronics - ISP P ME'XS, p. 572. Y , K bov. Ed.. Singapore: World Scientific. Wagner, E. E.. Hiller. D.. and Mars, D. E. (1980). Fast digital apparatus for capacitance transient analysis, Rev. Sci. liistruni., 51: 120.5. Walker, J . W. and Sah, C. T. (1973). Properties of 1.0-MeV-electron-irradiated defect centers in silicon, Phys. Rev., B 7: 4587. Wang. A. C. and Sah, C. T. (1984). Determination of trapped charge emission rates fr-om nonexponential capacitance transients due to high trap densities in semiconductors. Jorcr. Appl. Phys.. 55: S65. Wang, A. C. and Sah, C. T. (1985). New method for complete electrical characterization of recombination properties of traps in semiconductors. Jour. Appl. Phys., 57: 4645. Wang. C. W., Wu. C. H., and Bone, J . L. (1993). A new technique to decompose closely spaced interface and bulk trap states using temperature dependant pulse-width deep-level transient spectroscopy method: An application to Pt/CdS photodetector, Jour. Appl. Plrys.. 73: 760. Wang, K . K., van der Ziel. A,, and Chenette, E. R. (1975). Neutron-induced noise in junction field-effect transistors. lEEE Trcins. Electron. Drv.. ED-22: 59 1.
DEVELOPMENT AND APPLICATIONS O F A NEW DLTS METHOD
135
Wang, K. L. ( 1977). Determination of processing-related interpace states and their correlation with device properties. In Semicnnd~rcforSilicou 1977, p. 404. H. R. Huff and F. Sirtl, Eds., Princeton, NJ: Electrochem. Soc: Wang, K. L. (1979). A Determination of interface state energy during the capture of electrons and holes using DLTS, IEEE Trwu. Elecrron Uev., ED-26: 819. Wang, K. L. ( 1980). MOS interface-state density nleasureinents using transient capacitance spectroscopy, IEEE T r m s . Electrori Dev., ED-27: 223 I . Wang, K. L. and Evwaraye, A. 0. ( 1976). Determination of interface and bulk-trap states of IGFFT!, using deep-level transient spectroscopy. Jour. Appl, Phys., 47: 4574. Weiss, S. and Kassing, R. (1988). Deep-level Fourier spectroscopy (DLTFS) - a new technique for the analysis of deep-level properties, So/id-St~!teElectron., 31: 1733. Wessels, B. W. (1976). Determination of deep levels i n Cu-doped GaP using transient-current spectroscopy, Jour. AppL Phvs., 47: 1131. Wilmshurst. T. H. ( 1990). In Signrrl Recovery ,from Noise iri Elec/ronic. In.~rrumenfr~tiorr, p. 29. Second Edition, Bristol, UK: Institute of Physics. Yamasaki, K., Yoshida, M., and Sugano, T. (1979). Deep-level transient spectroscopy of bulk traps and interface states in Si MOS diodes, Jmp. Jour. Appl. Phys., 18: 113. Yan, Z. X. and Deen, M. J. (199 I ) . A physically-based method for measuring the threshold . s , 35 I . voltage of MOSFET, IEE Proc. 011 G Circuit.\, De\&.s m d S ~ ~ s t e n ~138: Yan, Z. X. and Deen, M. J. (1989). A new method for measuring the threshold voltage for small Conf 011 Elec~trictrl & Computer Engirieering, geometry MOSFETs, Proc. of’ the C~zr~c~rlicin Montreal. 1034. Yoshida, H., Niu, H., and Kishino, S. ( I 993). Isothermal capacitance transient spectroscopy of electron and hole emissions from interface states i n metal-oxide-semiconductor transistors, .lour. Appl. Phys., 73: 4451. Yoshie, 0. and Kamihara, M. ( 1983~1).Photo-induced current transient spectroscopy in higliresistivity bulk material. I. Computer controlled multi-channel PICTS system with high resolution, Jnpriri. Jour. Appl. PI7 Yoshie, 0. and Kamihara, M. (I98 nduced current transient spectroscopy in highresistivity bulk Material. II. Influence of non-exponential transient on determination of deep /. 22: 629. trap parameters, Jtiprm. J m r . A ~ [ J Phgs.. Zhao, J. H., Lee. J . C., Fang, 2. 0.. Schlesinger, T . F., and Milnes, A. G . (1987). The effects of the nonabiupt depletion edge 011 deep-trap profiles determined by deep-level transient spectroscopy, Jour. App/. Phys., 61: 5303; see errata, p. 5489. Zhao, X., Deen, M. J., and Taroff, L. (1996). Low frequency noise i n separate absorption, grading charge and multiplication (SAGCM) avalanche photodiodes. Electron. Lett.. 32: 250. Zhu. Y . , Deen, M. J . , and Kleinpenning, T. M. (1992). A new I/f noise model for nietal-oxidesemiconductor tield-effect-transistorsin saturation and deep saturation, Jou,: Appl. Plrys., 72: s990. Zitti, E. Di, Bisio. G . M.. Fuochi. P. G., Passerini. B. V., and Zambelli, M. (1989). An analysis of complex spectra from deep level transient capacitance measurements, Jorrr. Appl. Ph 1199. Zylhersztejn, A. (1978). Trap depth and electron capture cross section determination by trap rcfilling experiments in Schottky diodes, AppI. Phys. Luff.,33: 200.
APPENDIX A:
MAGNITUDE
ERRORS
The following pages contain printout of a MathCAD 5.0 file which was used to transform the expressions for the magnitude errors. Below, first we set the
136
PLAMEN V. KOLEV AND M. JAMAL DEEN
values for example calculation. n is the group number (logarithmic index), m is the number of averaged intervals inside the group (linear index), A t is the sampling interval, D is the programmed delay, i is the sampling index, C is the magnitude at t = 0, ts,,,,,, tm,,,,,, and re,,,, are beginning, middle and end of the averaging interval correspondingly, Tc,,,, and t are initial guess values of the time constant. Then, we define the error function E r ( r ) in percents as a difference between the exact magnitude E x ( t ) and the averaged magnitude E a ( s ) at the midpoint of the averaging interval, and t,,, is the time constant at which this error occurs. IZ := 3
111
:= 10
A t := 0.01
i := 1..2" j := 0 C := 100 tnz,,,,, = 0.74
D := 0.0
r.y,,,,l,
rin,,,,, := At
:= A t . m . (2" - 1 )
. m . (2" - 1 ) + At . 2"-'
+D
+D
r := Tc,,,,
tc;,l,,r,l:= A t . m . (2" - 1 )
+ At . i + D
tefl.nr - tS,,,,,,
Tc,,,,,,= 0.739279
At
Ex(t):= C . exp
t.s,,,,,
= 0.7
ten,nz= 0.78
=8
EN(t)
:=
-At
. m . (2" - 1
)l.$
exp (-At
.:)
Ex(t) = 36.752069 E a ( r ) = 36.521876 E r ( t ) = 0.230193 d given - E r ( r ) = 0
:= find(t) rmax= 0.79835 E r ( t n l n x= ) 0.230895
t,,,
dt
2"
dt 2" [nz . (2"
dt
c . At . ,[ -
t2
. (2,
-
1
-
I)
1 ) +2("-1) . e x p
+ 2t1-11 [-"'. t
1 [ m .(21, - 1 ) + 2'"-"
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
C ~.~
At
(2”)
. m . (2” - 1 )
. m . (2” - 1 ) . exp
5’
1
137
.
...
X
1”
[(-l+exp($)).exp[Af.-
($)
exp
-
5
. At
[(-l+exp($))’.exp[Af.(?.]]
Af . 1.5-
1
-
($)) 2
(-1
+exp
exp
. (tn
+exp
[ -Pf ~
”
7
+I).
(2” - I )
m . (2” - 1 )]
1
138
PLAMEN V. KOLEV A N D M. JAMAL DEEN
-exp
. [rn . (2''
- 2-" . exp
[
~
-rA'
-
1)
+ 2"]
1
. t n . (2" - 1) . . . . [ m . (2" - 1 ) . (rn + 1 ) . (,'I
tn
I
- 11
-
At
r rn
r
(rn
+exp
+ 1 ) . (2" - 1) . ( m + 1 ) . (2" - 1 )
1 ...
m . (2" - 1 ) - 2-11. exp
[
~
-rAt
1
1
. rn . (2" - 1 )
1)
1
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
1
. [ t n . (2’’ - 1 ) - 11
... I]
1
r - C ’ At
- C . At
-
[nz . (,-‘I
-
I)
-
(2-”
+ I)]
139
140
PLAMEN V. KOLEV AND M. JAMAL DEEN
1
r,,,
:= find(t)
r,,,, = 0.79835 E r (r,,,,) = 0.230895
The error was calculated directly by setting the derivative obtained by MathCAD to zero (above), and by first finding the expression for the derivative, and then setting the obtained expression to zero (below). This was repeated for each transformation in order to verify its correctness. For the chosen parameters. obtaining the same numerical result (0.230895%) means that the simplified expression for the derivative is correct. Here we demonstrate the correctness of the last transformation, which gives the expression for the derivative shown in Section 111, Eq. 25.
B: TIMECONSTANT ERRORS APPENDIX This listing is similar to Appendix A, but here the time constant error is directly calculated without finding an exact expression for the derivative. The exact time constant is calculated numerically by setting the derivative to zero and then it is compared to the time constant found by midpoint approximation. This procedure is repeated for all possible values of the group index IZ and linear index m. group index is n := 5 with m := 10 averaging intervals inside containing
i := 1..2"
samples each
At := 0.01 ms
long with magnitude C := 100
The midpoint of the first delay time is f l,,,,,, := At
. m . (2" - I )
+ At . 2"-'
or
t I,,,,,, = 3.26 ms
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
I41
The midpoint of the second delay time is t2n,n7:= A t . m . (2""
- 1)
+ At
+
2"
or t2,,,m= 6.62 ms
The time constant calculated using midpoint approximation is
we define rumax:= Tc,,,, we set the initial guess value to r := Tc,,,,,. Using current time delay for each sample t ~ , . ,:= ~ ,A~t . m . (2" - I ) At . i we calculate the exact time constant:
+
d given --Tx(r) = 0 rx,,, := find(r) rxmax= 4.74829 ms dr Next, we find the error in percents
E r := 100.
rxrnax - rainax
or E r = 0.105173%
rx111ax
APPENDIX C: NOISESOURCES AND SIGNAL-TO-NOISE RATIOIN THE DLTS TRANSIENTS As mentioned in Sec. 11, the noise component in the DLTS transient is directly related to the ability of the system to separate deep levels with similar emission rates. Therefore, a short discussion on the noise sources and their contribution to the noise components of the recorded DLTS transients is presented here.
C.l. Noise Sources Thermal (or Johnson) noise is inherent to all electrically conductive devices with limited active conductivity. It can be shown that the reactance component does not introduce thermal noise. The thermal noise current average in frequency interval A f (van der Ziel, 1986) is defined as -
i;h = 4kTgA f
(C. 1)
142
PLAMEN V. KOLEV AND M. JAMAL DEEN
where g is the small signal active conductance component. Usually this noise is orders of magnitude below the noise associated with other sources and in most cases it can be neglected. Another noise source usually associated with the discrete nature of the current flowing through a potential barrier is the shot noise. It is considered to appear due to carriers crossing potential barriers independently and at random. For a p-n junction and Schottky barrier, this noise (van der Ziel, 1986) is defined as i:,? = 2ql(Af). (C.2) The deep levels themselves are another source of noise -generation-recombination (G-R) noise. The physical source of that noise is the fluctuation in the total number of charge carriers exited from or trapped back into the traps. In equilibrium, at any given temperature, the deep levels at a few kT below the Fermi level are mostly filled, and those at a few kT above the Fermi level are mostly empty. Therefore, the associated G-R noise is significant only when the Fermi level is close to the trap energy level. The noise current in the case of m centers with m time constants is given by van der Ziel (1986)
where A; and r; are the amplitude and the time constant associated with the i-th center. At low frequencies there is another source of noise, flicker noise, with a spectral density frequency dependence of about Ilf" and, with a! 1, it is called 1l.f noise. One possible model for this noise source involves G-R sources with continuous distribution of time constants. Two mechanisms are considered for generation of I l f noise: number-fluctuation, when the number of the free charge carriers vary, and mobility-fluctuation, when mobility of the charge carriers vary. Often the mobility-fluctuation is related to the surface mobility, which is i n general less than the bulk mobility and may fluctuate due to interaction of the free carriers with the surface, regarded as a large defect. This can explain why the I l f noise in enhancement mode MOSFETs is much larger as compared to this type of noise in depletion mode MOSFETs or in JFETs. As postulated by Hooge, the 1l.f noise current is (van der Ziel, 1986)
-
where ( r is ~ a constant (Hooge constant) of about 2 x which vary slowly with the temperature, N is the carriers number and I is the bias current.
DEVELOPMENT A N D APPLICATIONS OF A NEW DLTS METHOD
I43
Currently, it is accepted that a~ can be several orders of magnitude smaller for small devices. For nonuniform devices, instead of N , the effective number of carriers Net, is used. For practical purposes, the expression ((2.8) is modified by combining ( Y H and N,,,f to give a new constant K , and the powers of I and ,f may be set different from two and one, respectively, so that by substitution in Eq. (C.4) we get I f i and f a .
(2.2. Noise Sources in CT-, CC-, and CR-DLTS Transients As noted by Lang in his fundamental work ( 1 974a) a major advantage of the DLTS over previously used TSCAP and TSC methods is that it allows us to avoid or to reduce significantly the I l f noise component in the measured signal. However, for long transients the repetition rate can be as low as 0.6 Hz and, in some cases, the l / f noise component can be significant in the raw DLTS transient. The PSD technique employed in the capacitance meter strongly reduces the drift, offset and the 1l.f noise component of the high-gain I MHz amplifier that is part of the instrument (Wilmshurst, 1990), but there remain other, frequency independent noise sources. A white noise is present at both capacitance and conductance outputs of the instrument, even without a device connected to the test terminals, particularly at high sensitive ranges. Good noise analyses of CT-DLTS systems can be found in Borsuk and Swanson (1980); Breitenstein (1982); Misrachi et al. (1980); and Rancour (1995) and they will not be repeated here. In the case of a closed loop system such as CC-DLTS and CR-DLTS, quantitative noise analysis can be much more complicated, and it depends strongly on the properties of the measured device and less on the used instrumentation. The test device can be considered as a first stage in an amplifier consisting of the test device, the 1 MHz amplifier of the capacitance meter, and the feedback integrator. The output signal of this complex amplifier is a stimulus for the measured device, which is at the input of the complex amplifier. Therefore, as mentioned, the effective gain of the test device can significantly affect the total noise i n the loop. The baseline provided by the memory circuit is added to the output signal, and this can further complicate the analysis. Fortunately, a well-designed memory circuit produces a baseline with noise components well below 1 Hz and it may be considered noiseless. The bandwidth of the loop is limited at high frequencies from the integration time constant of the feedback integrator. As the recovery time of the capacitance meter is at best about 40 ~ s there , is no reason for the time constant of the integrator to be less than 8 ps. At low frequencies, the bandwidth is limited by the duration of the emission pulses. In the case of CC-DLTS, the effective gain is the differential capacitance of the test device dC/dV. Obviously, large area devices with large total capacitance will have larger effective gain and this will reduce the noise in the
1 44
PLAMEN V. KOLEV AND M. JAMAI. DEEN
system, because most of the needed total loop gain will be accomplished by the 1 MHz amplifier inside the capacitance meter, while the feedback integrator will need to contribute less gain. In contrast, small test devices with poor dC/dV will require much more gain from the feedback integrator to he added to the total gain. As there is no PSD noise rejection in this part of the loop, the total noise may increase dramatically, particularly the llf’ noise component. A similar situation exists for the case of CR-DLTS. Because the transistor is near to cut-off, there is essentially no channel current to contribute to the thermal, shot or I l f noise components. The G-R noise component is also not important, as a large G-R component will also generate a very large DLTS signal. However, the preceding discussion about the role of the effective gain of the test device for the redistribution of the gain in the loop requires a small modification. Here, the effective gain is actually the transconductance of the FET, therefore, the ratio of the channel width to the channel length will be important and the area of the device is not. Because the MOSFETs and the JFETs used for measurements reported in this chapter have substantially different channel width-to-length ratios, direct comparisons of the noise were not possible. To our observation, the differences in the noise could be attributed more to the gain of the FET than to the type of the transistor. In practice, the role of redistribution of the gain in the loop between the 1 MHz amplifier inside the capacitance meter and the feedback amplifier could be easily demonstrated by selecting a less sensitive range of the capacitance meter, or by varying the 1 MHz test signal level.
C.3. The Role of the Averaging Techniques for SNR Improvement Quantitative analyses of the SNR improvement resulting from the averaging technique are also complicated. The main reason is that the SNR improvement due to pseudo-logarithmic time averaging varies along the decaying transient and depends on the specific setup. As a rule of thumb, a noisy signal with a significant IIf noise component is better processed with higher digital time constants selected for continuous time averaging, and this require higher repetition rates of the filling pulses to keep acquisition time reasonably long. Still, it is possible to obtain 30-40 times SNR improvement by continuous time averaging and about 20-25 times SNR improvement (in the transient tail) by pseudo-logarithmic time averaging. Then the total SNR improvement can reach several hundred, but in this case there is also a need to account for the limited resolution of the digitized signal, which in turn can also be effectively enhanced by the averaging process (Wilmshurst, 1990), again with variation along the time axis.
I
I
.
I45
I
APPENDIX E: Coiitiriuerl
output ES - S&H sampling emitting pulses 3”q output FS - S&H sampling filling pulns
memc,ry
and
the diode
the v
re ferena capacit o m C11 ir
-.,
G I 2 In
rrL pulses fro1 I signal generat
IZate signal u x d to disconnect the
capacitance meter output during pulse transitions and t o reset the integrator
_Ihis . is the diagram of thc timing circuit for driving the feedback circuit for CC- an CR-DLTS. L2r 470
I48
PLAMEN V. KOLEV AND M. JAMAL DEEN
APPENDIX F: LISTING OF A TEMPLATE FOR A DLTS MEASUREMENT PROCRAM
#include cconio.h> #include ciostream.h> #include<dos.h>
....include some libraries. #include cgraphics.h> #include ”svgal6.h” #include < ieeeio.h> #define out61 7(chars) #define in6 170 #define ON 1 #define OFF 0 #define uch unsigned char #define P300 0x300 #define P304 0x304 #define P307 0x307 #define P308 0x308
ieeewt(“output 09; “chars”\n”) ieeewt(“enter 09\n”) /* Define some handy constants */ /* Define some handy constants */ //hardware communication ports/
. . .graphics definitions
void void
interrupt (‘koldisr)(.. .); interrupt far int-resp(. . .);
//save old interrupt //declare the new interrupt routine void setGEN(uch&dly, uch&linnmbr, uch&octv): //setup pseudo-logarithmic generator void setGAIN(uch&gn); //setup gain, if available void writedata(F1LE *f); //record time intervals void writetrace(F1LE *fl ,float Temp); //record trace magnitude void cleanup(int cod); int changesettings(int); void settemp(doub1e stemp I , double stemp2, float r a m p ) ; //two routines for temperature control float meastemp(void); extern “C” { //declaration of Assembler functions) unsigned Chtest(void); //extended character check //DMA initialization and control routines int runDMA(void); int stop(DMA)(void); void startDMA(void); void statusDMA(void); unsigned initDMA(void);
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
149
int huge ReadBuffer(unsigned Segmnt,unsigned Offst,uch Weight); //read and clean memory buffer void ClearBuffer( unsigned Segnm,unsigned Offs );
I extern unsigned BUFSEG,BUFLEN, BUFBEG,BUFEND, MODE; int huge cnt[400] [2 101; //magnitude array long tml2 1 01; //delay array
. . .more global declarations void main(int-argc, char "Yargv) { . . .local definitions
. . .check the name and open a data file
...
out6 17("FOROD 1 B4T4COZ 1 X"); setGEN(dely.lnmbr,octvs); setGAIN(gain); open graphics windows oldisr=getvect(O x 72); //save old HW interrupt vector setvect(Ox72,(void interrupt far (*) (. . .))int-resp); //set new interrupt vector bt=inportb(OxAl)&OxFB; //check if it is safe to initialize the DMA controller outportb(OxA1,bt); do {
. . .do measurements or observations until ESC is pressed
1 )while (chtst !=27); do ( bt=inportb(P308) & 1 ;]while (bt==O); init-err=stopDMA(); setvect(Ox72,(void interrupt far (*) (. . .))oldisr); bt=inportb(OxAl) I 0x04; outportb(0xA I , bt); closegraph(); //Return the system to text mode ieeewt("output 17;RAMP O\n"); temp=meastemp(); sprintf(ieeestr,"output 17;SETP %5.1f\n",temp); ieeewt(ieeestr); ieeewt("loca1 17\n"); ieeewt("loca1 18\11"); ieeewt("loca1 09\n");
I50
PLAMEN V. KOLEV AND M. JAMAL DEEN
if (ieeeclose()==- 1) cleanup(5); if (fclose(fp)!=O) printf(“Error closing text data file.\n”); */
I //h*******+**********************Y*****Y*****************~:**~:***
void interrupt int-resp(. . .) {
unsigned i; outportb(OxAO,Ox20); BUFO=O; for (i=O; i c m ; i++) {
cnt[k] [ i]=ReadBuffer( BUFSEG,BUFO,Wgt); BUFO+=2;
I count=initDMA(); startDMA(); alarm= 1 ; oldisr(. . .);
I //*~:************Y**********~:Y***Y***Y***Y*YY**Y***YY**Y**YY*****
void
setGEN(uch&dly, uch&linnmbr, uch&octv) {
int i j ; uch corn- byte,nmbr; com-byte=((dly- 1)<<4)+linnmbr-l; nmbr=(linnmbr* octv)- 1 ; outportb(P300, com- byte); outportb(P307, Ox 3A); outportb(P304, nmbr); outportb(P304, 0); m=(int)linnmbr* (int)octv; //m-I is the number of time data points BUFLEN=2* linnmbr* octv; for (i=O; ic(int)octv; i++) { for (j=O; j c(int)linnmbr; j++){ if ((i* linnmbr+j)tm) if (i>O) tm[i*linnmbr+j]=dly+linnmbr*(((long) 1 <
I I
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
1.51
t 1 =(int)(3*(m> >2)); 1 I); if (lg== I)scaleX=MaxX/(float)(tm[melse scaleX=MaxX/(float)loglO(tm[m- I]); scaleY=( MaxY-height-2.5)/4096.0;
I
//* * * * * * * * * * * 1: * * * * * * * * * * :ic * * * * * * :I: * * * * * * * * * * * * * *:ic
void
* * * * * * * * 1: * * * * * * * *
setGAIN(uch&gn)
{
outportb(P308, gn);
I . . . graphics routines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
float
.........................
meastemp(void)
1 float tmp; ieee wt( "ou tpu t 17 ; s DAT '?\n"); ieeewt("enter 17\"); ieeescnf("%f',&tmp); if (tmp< 10.0) ieeewt("output 17;rang 1 \"): else
1 if (tmpt25.0) ieeewt("output 17;rang 2\n"); else ieeewt("output 17;rang 3\n");
I return tmp;
I
......................................
void settemp(doub1e stemp 1, double stemp2,float rrate) {
char ieeestr[40]; ieeewt("output 17;RAMP O\n"); sprintf(ieeestr,"output 17;SETP %S. 1 1 f]n",stemp I ); ieeewt(ieeestr); delay(300); ieeewt("output 17;RAMP I \n"); sprintf(ieeestr,"output 17;RAMPR %S. I f\n",rrate); ieeewt(ieeestr); sprintf(ieeestr."output 17;SETP %S. 1 1 f\n",stemp2); ieeewt(ieeestr);
I
//* * * * * * * * * * * * * * * * * * * * * * 1: * * * * * * * * * * * * * * * * * * * * * * * * * * 1: * * 1: * * 1: * * * * */
152
PLAMEN V . KOLEV AND M. JAMAL DEEN
APPENDIX C: LISTING OF A TEMPLATE FOR A DLTS ANALYSIS PROGRAM
. . .include standard libraries # include cgraphics.h>
. . .declare graphics routines
void Write Trace(char “filename); void WriteScan(char *filename); void WriteArrenius(char “filename); unsigned Arrhenius(unsigned ch,int N); void readdata(F1LE *t]; readtrace(F1LE “f 1,double&Temp,int ko); void extern “C” unsigned Chtest(void); unsigned chtst; int huge cnt[400][2 101; //magnitude data array long tm[210]; //time data array float fg ai n, scinag [ 1 01[400] ; double T[400],temp,TI ,T2,Tdif,deltaT,scaleX,scaleY ,zoomk= 1 .O; //temperature array and other declarations double intgr1[400],dcnt[2 101; double I OOOKT[ 10][200],T2xtau[ 10][2001; //Arrhenius data points array ..other global declarations void main(int-argc, char** -argv) {
. . ..
local declarations
selecting and handling data file name strcat( filename.“.dat”); if ((fp=fopen(filename,“rt”))==NULL)[ //open and read data file printf(“Error opening %s file.\n”,filename); exit( 1);) readdata( fp); while (!feof(fp)) {
readtrace( fp,T[k], k); k+= I ;
I if (fclose(fp)!=O) printf(“Error closing text data file.\n”); . . .. graphics routines sush as.. ..
1 wrt=o;
DrawScan(t1);
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
I53
while (kbhi t() !=0) ( wrt= 1 ; chtst=Chtest(); if ((chtst>O x40)&&(chtst
case Ox4D00: case Ox4B00:
/I-> if ( t l t m - 1 0 ) I/
10)
case 0 x 9 : case 0 x FOO: case 0x5000:
break;
tl+=l; tl-=I;
break;
t 1 += 10;
break;
t 1 -= 10;
break;
//tab if (tl <m-10) //shift+tab if (t I > 1 0) //down arrow
. . . different actions on the chart depending on the key pressed
)while (chtst !=27); closegraph();
I //:ic********
/* Return the system to text inode“/
***** ***
* * * * * * * * * * 4: * * * * * * * * * * * * * 2:
. . .graphics routines
void
readdata(F1LE *f) {
int ij; char delys[4],octvss[43],lnmbrs[4],gainstr[S],Wgts[4],tms[8]; fscanf(f,”%s %s%s%s%s\ii”,&delys,&octvss,&lnmbrs,\ &gainstr,&Wgts); dely=atoi(delys); octvs=atoi(octvss); Inmbr=atoi(lnmbrs); gain=atoi(gainstr); fgain=( 10000.0/gain)/4096.0; Wgt=atoi(Wgts); m=lnmbr* octvs; l1in-l is the number of time data points for (i=O;itoctvs;i++) ( for (j=O;j tlnmbr;j++) if ((i *lnmbr+j)<m)( fscanf(f,”~s,”,tms); tm[i*lnmbr+j]=atol(tins); }
154
PLAMEN V. KOLEV A N D M . JAMAL DEEN
fscanf(f,”\”);
1 for (i=O;itoctvs;i++) [ for fj=O;j O) tmli *lnmbr+j]=dely+lnmbr*( ((long) 1 < < i)- 1)+((long) 1<
) cout<
1 exit( I ) ;
//
I .................................................................. readtrace(F1LE *f 1, double&Temp,int ko)
void
1 int char
ij;
cnts[6],Temps[6j,*endptr;
fscanf(f1, “%s\n”,&Temps); Temp=strtod(Temps,&endptr); for (i=O;i toctvs;i++) { for fj=O;j
I 1
.................................................................. void {....
DrawScan(int No)
//display spectrum for time(24’No)-time(No)
)
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
unsigned Arrhenius(unsigned ch,int N ) { char msg[80]; int ij; double tau,Xscale,Yscale; struct PTS pt,z;
tau=(double)(tm[N+lnmbr]-tm[N])/ ( 100000.01 *log((double)tm[N+lnmbr]/(double)tm[N]));
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
155
switch( ch) { case Ox3B00: /IF1 //select an Arrhenius data point IOOO-T[O][f[O]] = l/T[iipeak]; if (k2<1) k2=I; T2 x tau[ 01[f[O]]=(T[i- peak] *tau);f[O]+= 1 ;break; . . .inore data points } //display and store in the global aray the Arrhenius data points return ch;
I ................................................................ routines to writethe Arrhenius data points or the spectrumpoints to a file //~l:***wI****Y****+**********************~~**~:********************
void
boxcar(int N,double Xsc,double Ysc,struct PTS zr)
{
. . .calculate spectrum points, (called from the graphics routine)
I I . . .routines to write data to files APPENDIX H:
RADIKrION-INDUCED
E2 lev] [crn'l
0
0. I84 f 0.004 ( I .4 f 2.3) x 1 0 - ' 5 0.1742f0.0014
0.17 4.4 x 10-15 0. I8 10-14
0.16 3.9 x I 0 ~ ' J 0. I7 0.18 0 . I8
0. IS + 0. I8
*
0. I96 0.003 (1.7 f 2.9) x 10-I' 0.2355 f 0.0006 0.23 4.8 x lo-" 0.23 2 x lo-'" 0. I9 1.6 x I0-l' 0.23 0.6 x 1 0 - i h 0.28 0.20 0.20 + 0.2s
DEFECTS IN
E3 lev] rn [cm']
0.44s f 0.004 ( 1 . 8+ 2.4) 10-14 0.4706 f 0.0015 0.42 7.9 x 10-1' 0.39 4 x 10- 1.5 0.44 2.3 x 10-1.' 0.4 I 1.62 x 10-I' 0.41 f 0.43 0.46 0.39 + 0.46
SIIJCON
Reference
Yainasaki er al. (1979) Walker and Sah (1973) Wang el a / . (1975) Kimerling (1977) Tokuda rt
( I / . ( 1976)
Evwaraye and Baliga (1977) Kiinei-ling P / t i / . (198 I b) Barnes ( 1979) Chen and Milnes ( I 980)
156
PLAMEN V. KOLEV AND M. JAMAL DEEN
APPENDIX H: Continued
E2 [eV]
E3 [eV]
o ~cni']
0
--10-'h
-10-15
Reference
[tin?]
~
-10-1'
0.23
0. I8 10-14
0 . I7 0 17
0.17 0.17
0. I8
0. I64 + 0. I72 ( 1 . 8t 7 . 5 ) 10-15 0.17 0.16 0.01 ( I .4 0.2) x lo-'' 0.16
*
0.16
0. I665 & 0.0025 1.7 x 0. I608 0.0016 1.7 x lo-''
*
2 x 10-16 0.23 0.21 f 0.23 (2 f 4)4 x 0.23 0.23 0.24 + 0.26 1.5 x lo-'' f 2.7 x 0.23 0.202 f 0.258 7 x I0-"+2.2 x I 0 - l ' 0.25 z t 0.02 10-l~
0.23 0.23 0.225 f 0.005 2 x 10-16 0.22 I5 f 0.00I6 2 x 10-16
0.44 > 10- 16
0.44 0.44 1.2
10-l~
0.44 0.44
0.39 0.409 f 0.4 19 ( 1 . 1 f 1.6) x IO-" 0.44 0.40 f 0.02 4 x lo-" 0.44 0.41 0.425 0.005 4 10-l~ 0.4335 f 0.0016 4 x 10-15
+
LISTOF ACRONYMS AC ADC CCD
cc
CCVT CR CT c-v CVCT CTS DC
alternate current analog-to-digital converter charge-coupled devices constant capacitance constant-capacitance voltage transient constant resistance capacitance transient capacitance-voltage constant-voltage capacitance transient current transient spectroscopy direct current
Benton and Kinierliiig (1982) Song et t i / ( 1986) Chantre and Kimerling ( 1 986) A w n et crl (1987) su et trl ( 1990a) Su et trl. (1990b) Benton ( 1990) Doolittle and Rohatgi (1992) Li and LI ( 1959) Bosetti et a/ (1995) Privitera et ti/. ( 1996) Coffa et 01. (1997) Kolev et d.( I99Xc) Kolev et d.(1998b)
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
DDLTS DLTS DMA DRAM DUT EG&G EOC FET( s) FUSE G-R G-V HEMTs IBM I-v JFET(s) LDD LEDs MOS MS-DOS OA PAR PC PL PSD QTS RTS SADLTS SAW SCR S&H SNR
so1
sos SRH
sw
TAV TSCAP TSC
uv
ULSI VLSl
double deep level transient spectroscopy deep level transient spectroscopy direct memory access dynamic random access memory device under test company founded in 1946 by Harold Edgerton, Kenneth Gernieshausen, and Herbert Crier end-of-conversion field-effect transistor(s) Far-UV Spectrographic Explorer generation-recombination conductance-voltage high electron mobility transistors International Business Machines company current-voltage junction field-effect transistor(s) lightly-doped drain light emitting diodes metal-oxide-semiconductor Microsoft disk operational system operational amplifier Princeton Applied Research personal computer photoluminescence phase-sensitive detector charge transient spectroscopy random telegraph signal spectral analysis DLTS surface acoustic wave space charge region sample and hold signal to noise ratio silicon-on-insulator silicon-on-sapphire S hockley-Read-Hall switch transverse component of the surface acoustic wave thermally stimulated capacitance thermally stimulated current ultraviolet ultra large scale integration very large scale integration
I57
158
PLAMEN V. KOLEV AND M. JAMAL DEEN
LISTOF SYMBOLS implanted channel depth diode area total capacitance capacitance as a function of time integrator capacitor reference capacitor during the emission pulse in CC DLTS reference capacitor during the filling pulse in CC DLTS feedback capacitor capture rate coefficient for electrons capture rate coefficient for holes equilibrium capture rate coefficient for electrons equilibrium capture rate coefficient for holes steady-state capacitance per unit area oxide capacitance per unit area substitutional (s) carbon - interstitial (i) carbon defect capacitance transient differential capacitance versus voltage or the effective gain of a capacitive sample in a constant capacitance feedback loop interface trap density emission rate as a function of time emission rate coefficient for electrons emission rate coefficient for holes equilibrium emission rate coefficient for electrons equilibrium emission rate coefficient for holes conduction band energy level Fermi level energy band gap energy E , shallow level ionization energy activation energy deep-center ionization energy valence band energy level error function representing the difference between the exact magnitude of the exponentially decaying transient and the magnitude obtained by averaging as a function of the time constant r transconductance parallel conductance component reference conductance current transient in the moment t leakage current
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
159
Boltzman’s constant total number of averaging intervals inside any group index of m-th averaging interval in the rz-th group maximal group index acceptor concentration effective concentration of the donors and the acceptors prior to implantation effective density of electron states i n the conduction band donor concentration interface trap concentration concentration of the implanted impurities concentration of occupied deep centers as a function of time t total concentration of deep centers effective density of hole states in the valence band index of a group of averaging intervals concentration of electron traps interstitial oxygen - vacancy defect substitutional phosphorous - interstitial carbon defect substitutional phosphorous vacancy defect concentration of hole traps concentration of the free holes electronic charge charge of the ionized impurities oxide trapped charge charge transient integrator resistor feedback resistor normalized output of a boxcar function as a function of the time constant r delay time of the first boxcar channel delay time of the first boxcar channel n-th averaging time interval length of the ADC sampling interval sample interval in continuous time averaging end of the first of the averaging intervals m,, in a group containing M intervals middle of the first of the averaging intervals m,, begin of the first of the averaging intervals m,, absolute temperature aperture of a boxcar channel effective time constant of a boxcar channel oxide thickness
160
PLAMEN V. KOLEV A N D M. JAMAL DEEN
pulse period vacancy - vacancy defect built-in junction potential gate voltage gate voltage transient gate-substrate voltage of a depletion mode MOSFET voltage to achieve channel pinch-off in a depletion mode MOSFET flat band voltage pinch-off voltage pinch-off voltage at the beginning of the emission pulse pinch-off voltage in the moment f pinch-off voltage without substrate bias reverse voltage bias substrate bias threshold voltage threshold voltage transient in the moment t threshold voltage with all traps discharged and neutral threshold voltage with all traps charged depth of the implanted channel effective width of the gate controlled part of the implanted channel width-to-length ratio initial width of the space-charge region weighting function in the moment t body effect coefficient body effect coefficient of the implanted channel semiconductor dielectric permittivity oxide dielectric permittivity free carrier mobility in the bulk of semiconductor surface free carrier mobility Fermi potential in the substrate work function from the Fermi level in semiconductor to vacuum contact work function summing amplifier capture cross section for electrons capture cross section for holes time constant error in defining the time constant window time constant at which the error function Err(r) reaches a positive peak
DEVELOPMENT AND APPLICATIONS OF A NEW DLTS METHOD
time constant at which the error function Err(5) reaches a negative peak accurately calculated time constant window time constant window calculated using the Lang’s expression applied to the midpoints of the averaged intervals feedback time constant capture time constant for electrons capture time constant for holes input voltage average thermal velocities for electrons average thermal velocities for holes output voltage
161
This Page Intentionally Left Blank
ADVANCES IN IMAGING AND E I . I T I K O N PIIYSICS. VOL. I09
Complex Dyadic Multiresolution Analyses J.-M. LINA, P. TURCOTTE and B. GOULARD Network ,f.r Cotnpirting cinrl Mutherncitic,cil Modeling C i 1 7 d Centre rle Rec1ierche.c.Mrithdnzcitiqirc~.~, Ut7iv. rle Montrdrrl C.P. 6128 Succ. C m t r e - Ville. MotitrcW Qullwc H3C 3J7, Ccititrclcr
I . Introduction . . . . . . . . . . . . . . . . . . . . . . . 11. The Spline Example
163
. . . . . . . . . . . . . . . . . . . .
111. Multiresolution and Wavelet .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
164
.
167
.
IV. Daubechies’ Wavelets . . . . . . . . . . . . . . . . . . . . V. Symmetric Daubechies Wavelets
170
. . . . . . . . . . . . . . . .
VI. The Phase of SDW Scaling Function .
.
VII. The Mallat Algorithm with Complex Filters
.
. .
. .
. .
. .
. .
. .
. .
. .
. .
. .
. .
175 .
.
VIII. Restoration from the Phase . . . . . . . . . . . . . . . . . .
176 178 180
IX. linage Enhancement
. . . . . . . . . . . . . . . . . . . .
183
X. Complex Shrinkage
. . . . . . . . . . . . . . . . . . . .
187
XI. Conclusion . . . . . . . . . . . . . . . . . . . . . . .
193
. . . . . . . . . . . . . . . . . . . . . . .
196
Reading List . . . . . . . . . . . . . . . . . . . . . . .
197
References
ABSTRACT This chapter presents the construction and some properties of the symmetric Daubechies wavelets. This class of solutions is part of the set of complexvalued solutions of the Daubechies’ program: orthonormal bases of L 2 built from compactly supported wavelets with maximum regularity. We review some algorithms that take explicit advantage of the presence of phase in the complex representation of signals by the symmetric Daubechies wavelets. Applications for two-dimensional (2-D) signals are discussed.
I. INTRODUCTION Complex-valued fields are very common in physics not only because of the extensive use of Fourier representation of physical quantities that helps to solve problems such as wave propagation but also because phase can
164
J.-M. LINA, P. TURCOTTE and B. GOULARD
contain nontrivial information related to interactions and symmetries. Quantum mechanics, nonlinear optics, and solid state physics illustrate nicely the “role of the phase” in the modeling of physical processes. More recently, the Berry-Hannay phase exhibited nontrivial behavior in the cyclic evolution of quanta1 or semiclassical systems. However, i n comparison, signal processing makes little use of the phase: because the “role of the phase” is less prominent in the Fourier representation of real signals, most of the attention is focused on amplitude information. Nevertheless, many papers have stressed the “importance of the role of the phase” in various problems in signal processing: furthermore, phase is indeed present in many technologies and sometimes plays a fundamental role, as i n holography or quantum cryptography. The objective of this work is to discuss the “role of the phase” in the area of discrete wavelet representations of signals. It is worth noticing that the notion of phase was involved in the early development of the continuous wavelet framework, especially for tracking the local frequency of signals (the so-called r i d p m d .skeleton crlgorifhrn). Here we consider the complex extension of the famous Daubechies solutions. As mentioned in Mallat (1998) (used here as the main reference on wavelets), some of these complex solutions exhibit symmetries. This property first motivated our study of the complex Daubechies wavelet but from a practical point of view the question remains: Why should complex Daubechies wavelets be of interest? The next two sections discuss the role of the phase in the context of spline wavelet bases and provide us the opportunity to recall basic definitions of a multiresolution analysis used here. Sections IV and V present the Daubechies solutions and the symmetric Daubechies wavelets respectively. Section VI describes some properties of those solutions that are directly related to the phase of the wavelet. The algorithmic point of view of the complex dyadic wavelet analysis is presented in Sect. VII. The remaining sections describe some applications of the complex wavelet representation, dedicated mainly to image processing. Finally, the conclusion presents future avenues of research in the field of complex data modeling.
11. THESPLINE EXAMPLE
The “role of the phase” can be nicely illustrated with the well-known wavelet bases associated with the lineur-spline junction (Fig. 1). We take the opportunity of this introductory example to present some basic definitions used in this work. More details can be found in Daubechies (1992) and Mallat (1998). We start with the definition of an approximation space, say Vo, endowed with a Riesz basis ( q k , k E 271:
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
165
I .4
I.2
I 0.8 0.6 0.4 0.2 0 -0.2 -0.4 -3
-2
-1
0
I
2
3
Ficiiim I . The linear-spline function
where cpk(x) = cp(x - k ) . The cp is the scaling function normalized with unit integral. The set of functions c p ~is linearly independent and provides a complete and stable representation of Vo : A > 0 and finite B exist such that
This property (Riesz) will be more characterized in a sequel. A simple example of such a basis is given by the linear-spline function (Fig. l ) , s ( x ) = s u p ( 1 - 1x1, 0).
(3)
The integer translates of s ( x ) span an approximation space:
where CI, = f ( k ) because splines are interpolatory functions. In other words, any continuous function f can be linearly approximated through an expansion in this approximation space VO.Equation ( 2 ) can be characterized in the Fourier domain. Defining
where @ is the Fourier transform of the scaling function, Eq. (2) amounts to consider the inequalities 0 < u,(w) < 00 (see Mallat, p. 222, 1998). For instance. we have I 2 cosw - < n,(w) = (1 3 3 -
+
166
J.-M. LINA, P. TURCOTTE and B . GOULARD
for the linear spline equation (3) whose Fourier representation is
Looking for an orthonormal basis for Vo, the orthonormalizatiorz of' the Riesz basis is easily done in the Fourier domain because it is equivalent to find a basis constructed from a scaling function for which a,(w) = 1 (see Mallat, p. 225, 1998). Such a condition is not fulfilled with the spline equation (4), however, writing __
a,(w) = u ( w ) u ( w )
(8)
the new scaling function defined by
obviously satisfies a,(w) = 1 and defines an orthonormal basis of Vo. Various factorizations of 0,( w ) define different scaling functions cp through the phase of u ( w ) . In the spline example discussed here, a first factorization has been proposed by Stromberg ( 1982):
The associated scaling function is displayed in Fig. 2. Later, Battle ( 1 987) and Lemarik ( 1988) have considered another factorization
The phase ( ( w ) introduces an extra property into the new scaling function: cp(x) = cp(-x), that is, symmetry, as illustrated in Fig. 2. Both factorizations yield orthonormal sets (pk) but the solution of both Battle and LemariC leads to a symmetric scaling function. We notice that orthonormalization of the spline basis has ruined the original compactness of the support of the spline scaling function. In the next section, it will be shown that compactness of the support (i.e., strong locality in space), orthogonality of the basis, and symmetry are all met in the seminal Daubechies framework (j' we relax the implicit condition on the reality of the scaling functions. Indeed, all found solutions are complex-valued.
167
COMPLEX DYADIC MULTIRESOLUTION ANALYSES I .4
I
I .2
0.X
1
0.8 0.6 0.4 0.2 0 -0.2 -0.4
0.6 0.4
0.2 0 - 4 - 2 0
2
6
4
-0.2
-4 -3
-2 -I
0
1
2
3
4
FIGURE2 . The Stroniberg scaling function (left) and the Battle-LemariC scaling function (right) for the linear spline.
111. MULTIRESOLUTION AND WAVELET
A dyadic multiresolution is based on a scaling function finement equation
q~
that obeys a re-
where the ak's are complex-valued coefficients such that C N = ~ I . This functional equation relates the spanning function at different scales. For instance, the spline defined by Eq. ( 3 ) does satisfy such a relation: s(x) = &S(2X
-
I)
+ s(x) + 4S(2X + I ) .
(13)
It is worth rewriting Eq. (12) in the Fourier domain and defining the trigonometric polynomial
usually called filter associated with the scaling function. Equation ( 1 2) can be written as
Most of the properties of the scaling function can be computed from mo(w) (Mallat, 1998). For example, the filter associated with spline equation (3) is
and the solutions of Stromberg in Donoho and Johnson's work ( I 995) (see Eq. (10)) and Battle-LemariC in Downie and Silverman's report (1996) b'riven
168
J.-M. LINA, P. TURCOTTE and B. GOULARD
by Eq. ( I 1) are associated with the filters
respectively. The extra factors at the right of mo(w) ensure the orthogonality property n,(w) = 1 of the new bases. We notice that the two filters in Eq. (17) differ only by a phase: this is the origin of the symmetry exhibited by the Battle-LemariC solution. Indeed, the orthogonality condition can be recast in terms of the amplitude of mo(w):
Thus, it is clear that combining a property like symmetry with orthonormality is a matter of defining a specific phase of the filter. The scaling relationship expressed by Eq. ( 12) is related to the definition of the multiresolution analysis of L2(IR) based on cp proposed by Mallat (Mallat 1989) and Meyer (Meyer, 1992). A multiresolution amounts to considering a sequence of closed subspaces Vj c L2(IR),
such that f ( x )E
v, * .f’(2x-) E Vj,,.
The scaling equation (12) expresses the nested structure of the v’s. At any scale a, = 2 - J , elements of V, are of the form
where cp,(x) is the scaled function cp,(x) = l / f i c p ( x / a , ) . Consequently, the set of functions cp, k ( x ) = 2’”cp(2’~ - k ) (22) constitutes a basis of the approximation space “at scale ~ i = j 2-./” when k runs over the integer. Let us remark that if Vo is endowed with an orthonormal basis, then ( c p j , k , k E 77) is also an orthonormal basis for the approximation space 4. The last step towards wavelets is the following theorem (see Mallat, p. 236, 1998): a wavelet I/J E VI with vanishing integral exists such that (cp(.u k ) , +(x - k ) , k E Z) is an orthonormal basis of V I .
COMPLEX DYADIC MULTIKESOLUTION ANALYSES
169
Wavelet multiresolution therefore aims to decompose VI = Vo @ W Owhere W o is generated through the integer translates of a function $. Since $ E W 0 c V I , a sequence of complex-valued coefficients b k exists such that:
11 can be shown, using nzI(w)=e-””nzo(w+n), that the set ($(x - k ) , k E Z) is an orthonormal basis of W O .This result amounts to taking bk = (- 1 -k i n Eq. (23). The orthonormal decomposition of each dyadic approximation space, that is, V,+I = V, @ W i , defines the details spaces W.i generated by the set of orthonormal wavelets
when k runs over Z.Altogether, those functions generated by the translates of all dyadic scaling transforms of $ produce an orthonormal basis of L?(IR):
Example The Splirir W h e l r t s . The construction of the wavelet associated with the two scaling functions displayed in Fig. 2 is straightforward: using the previous expression for in1 ( w ) , we obtain
As illustrated in Fig. 3, the factorization O , ~ ( W = ) 1u2(w)Iz gives a symmetric wavelet: $(x) = $( 1 - x). Those two examples illustrate nicely the scope of the forthcoming sections: Provided we can select an appropriate phase in $, that is, in rno(w), a symmetric scaling function can be constructed on the miltiresolution spaces. The same phase analysis will be carried out in the context of Daubechies wavelets. Let us make a final comment about the regularity of the multiresolution bases. Let us recall that for orthonorrnal bases of the form of Eq. ( 3 ) , it is irnpo.s.vible that $ has exponential decay in both space and frequency. It will be more convenient to control the regularity through the polynomial content
170
J.-M. LINA, P. TURCOI‘TE and B. GOULARD 2 I
I .5
0.X
I
0.6 0.4 0.2 0 -0.2 -0.4 -0.6
0.5
0
-0.5 -I -1.5
-4-3
-2
-I
0
I
2
3
1
FIGURE3 . The Stroinberg (left) and Battle-LemariC (right) wavelets.
of the approximation spaces V, or, equivalently, the vanishing moments of the corresponding wavelet (see Mallat, p. 154, 1998):
I‘
x +(x)dx = 0, for 1 = 0, I , . . . , J
(28)
where J is some integer or, in an equivalent way, the vanishing of the successive derivatives of at w = 0. Using Eq. (24) and the definition of un~(o), the criteria of regularity can be written as
4
d‘
--mo(w),,=,
do‘
= 0, for I = 0, 1,
,J .
This is the Strang-Fix condition (see Mallat, p. 242, 1998).As seen in Eqs. (16) and (17), the previous examples illustrate this condition with J = 2.
IV. DAUBECHIES’ WAVELETS Daubechies scaling functions are subject to the following constraints:
1 ) Compactness of the support of cp: We require that cp (and consequently has a compact support inside the interval [ - J , J I ] for some integer J . This requirement amounts to considering a finite number of terms in the expansion equation (7): ak # 0 for k = - J , -J 1, . . . ,J , J I. 2) Orthogonality of the cp(x - k ) : Instead of working with rno(w) and solving Eq. (1 8), we define the polynomial
+
I))
+
+
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
171
where z is on the unit circle, IzI = 1 . The orthonormality condition equation ( 1 8) can be stated through the following equation
where the polynomial P ( z ) is defined as
3) Strang-Fix condition: To maximize the regularity of the functions generated by the scaling function p, we require the vanishing of the first J moments of the wavelet. In terms of the polynomial equation (30), the Strang-Fix condition reads as F(-I) = F ’ ( - I ) = F ” ( - 1 ) = . . . = P
I ( - ] )
= 0.
(33)
This last condition is satisfied with the polynomial defined by
where /I(:)
I +z 2
= -.
(3.5)
A solution of the full constraints defined by Daubechies is
Let us sketch the proof that P./(z)does satisfy Eq. (3 I ) : We first rewrite P J ( z ) in the form
172
J.-M. LINA. P. TURCOTTE and B. GOULARD
where H ( z ) = obtain
~
+
4
')'.
Using the equality
(2J ) ( 2 J25+ +l - j ) we 1
=
,
and finally Eq. (31) results trivially by taking the difference of the two last expressions:
The relevance of the polynomial PJ ( z ) in the construction of multiresolution bases relies on the particular relationship between the 2J roots of I > J ( z ) . Indeed, we can write the following factorization of p ~ ( z ) :
where X L = ~ , ~ . , . , , are J the J roots inside the unit circle, lxkl < I , and !N(xk) > !H(xk+,) (see Fig. 4). Writing F ( z ) as
2 I .5 I 0.5 0 -0.5 -1 -1.5
FIGURE4. The roots of p l z ( ; ) (left) and ploo(:) (right).
I73
COMPLEX DYADIC MULTIKESOLUTION ANALYSES
the polynomial p ( z ) is defined through some particular roots of ~ J ( z and ) it is easily found that for any subset R c ( I , 2, 3 , . . . , J ) all solutions of the form
factorize f J (z) = z F ( z ) F ( z ) . Therefore, for any integer J , the trigonometric polynomial given by
with any selection R of roots of ~ J ( z ) ,defines an admissible trigonometric polynomial and thus a compactly supported orthonormal dyadic wavelet basis (with all the material described in Sec. Ill). Let us notice that permuting K and R in Eq. (43) is equivalent to replacing cik by its complex conjugate (up to some trivial translation). So. except for the Haar case ( J = 0, at) = 0.5 and a l = 0.5), we obtain 2'-' solutions. In her seminal work (1992), Daubechies investigated a subset of this family: She selected R so that the coefficients in the trigonometric expansion rno = CkCikeiXoare real-valued. The usual Daubechies wavelets are thus real-valued functions.
) two roots Example J = 1 : The polynomial p ~ ( z has xr'. The unique solution is thus
XI= r = 2
-
and
and corresponds to the well-known DAUB4 wavelet (see Fig. 5). Let us note that taking Eq. (44) with r = 0 leads precisely to the filter associated with the linear spline, see Eq. ( 1 6). 1 .5
1
0.5 0
-0.5 -I -1.5 -7i
-2-1.5-1-0.50
0.5 I 1.5 2 2.5 3
FIGIIKE 5. Left: the scaling function cp lor J = I . Right: the Daubechies wavelet
lor J = I.
174
J.-M. LINA, P. TURCOTTE and B. GOULARD
Example J = 2: The polynomial p 2 ( z ) has four roots
and F;', X;'.
Two distinct cases are found: R = { 1 , 2 ) and R = { I ) .
The first case
corresponds to the DAUB6 solution displayed in Fig. 6. The second selection of roots gives
As anticipated by Lawton (see Daubechies, p. 253, 1992 and Lawton, 1993), this solution leads to symmetric but complex-valued scaling function and wavelet (see Fig. 7). Such complex solutions exist for any value of J but symmetry is only possible with J even. In the next section we will consider a particular family of Daubechies symmetric solutions. Let us however mention the existence of the solution displayed in Fig. 8 for which J = 6. Two things must be noted: The functions have a small imaginary part and a real part that bears a close resemblance to the Battle-LemariC orthogonal basis associated with the quadratic spline (see Daubechies, p. 150, 1992). Let us mention that, despite 1.4
2
I .2
I .s
I
1
0.8 0.6 0.4 0.2 0 -0.2
0.5 0
-0.5
-1 -1
-0\,.T A
-2 -1.5-1-0.5
0 0.5
I 1.5 2 2.5 3
.._,-2 2;
-1.5-1-0.5
0 0.5 I 1.5 2 2.5 3
FIGURE 6 . The real hcaling function and wavelet (DAUB6) for J = 2
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
175
i .5 I
0.5 0 -0.5
-1 -1.5
FIGLJKE 7. The complex-valued scaling function (left) and wavelet (right) for J = 2 (imagi n x y part indicated by it dashed line).
F I G ~ J R8.E A complex scaling function (left) and wavelet (right) for J = 6 (imaginary part indicated by a dashed line).
its complex value, the solution displayed in Fig. 8 has demonstrated nice performance in image compression (see Belzer et al., 1995).
V. SYMMETRIC DAUBECHIES WAVELETS The symmetry q ( x ) = cp( 1 - x) defines a subset of solutions corresponding to the following rule of selection for the roots in Eq. (43):
(49)
k€R+J-k+l@R.
We first notice that symmetry rules out the “reality” of the scaling function: All the symmetric Daubechies wavelets (SDW) are complex-valued and for any even value of J , 2 ( J - ’ ) ’ 2 distinct solutions exist. In the present work, we consider a particular family of solutions, the so-called SDW J Daubechies wavelets, which correspond to the following selection of roots: R = {1,3,5, . . . , 2 k f 1 , . . . , J
-
1)
(50)
176
J.-M. LINA. P. TURCOTTE and B. COULARD 0.8
.
.
.
: I
,
I
\
.
.
-0.4 -4-3-2-1
.
.
I
8
.
0
I
#
;
.
.
0.8
.
-0.x.
.
.
.
.
.
.
.
.
-4 -3 -2 - I
0
I
2
3
4
' '
I
:
.- . ,
.
,
1
)
I t .
2
.
.
3
4
5
5
9. Complex scaling function (left) and wavelet (right) for J = 4 (imaginary part indicated in dashed line). FiGuRb
I
I
0.X
0.X
0.6 0.4 0.2 0 -0.2 -0.4 -0.6 -4 -3 -2 -I
0.6 0.4 0.2 0 -0.2 -0.4 -0.6 -0.8 0
I
2
3
4
5
-I
-4 -3 -2 -I
L . . . . 0
I
2
3
4
5
FIGLIRE10. SDW4: scaling function (left) and wavelet (right) (imaginary part indicated in dahhed linc.)
that clearly satisfy Eq. (49). For J = 2, the solution (SDW2) is shown on Fig. 7. Figures 9 and 10 display the two complex solutions for J = 4 (SDW4 corresponds to Fig. 10).
VI. THEPHASE or: SDW SCALING FUNCTION All syinnietric Daubechies wavelets (SDW) share the usual properties of the standard real Daubechies bases. This statement relies on the fact that most of those properties, which can be found either in Daubechies (1992) or Mallal (1998), are based on the modulus of the filters rno(w). Writing q ( x ) = h ( x ) i g ( x ) , we investigate the relationship between the real functions h and g. Using the Fourier representation, we consider the ratio
+
COMPLEX DYADIC MULTIRESOLIJTION ANALYSES
177
ALPHA ( f o r J=2,4.6 and 8)
1s 10 5
0
-10
-1s
-6
-5
-4
-3
-2
-I
0
I
2
log( w )
Left:
~ ( o J ) ;log-log
xxle. The slope of the stralght lines is 2
This quantity is singular a t w = 2n and real-valued, because of the symmetry. On Fig. 1 I , we plot the numerical computation of v ( w ) in log-log scale for J = 2.4, 6 and 8. As seen on this figure, the complex scaling function is roughly approximated by the expression
This approximation is accurately verified on the frequency domain defined by the sampling rate of the analyzed signal. In other words, this identity is verified in the interval 10, n](with a sampling step renormalized to unity) when they are written in the Fourier representation. For J > 10, higher derivative terms in g(x) become nonnegligible. Let US note that h ( x ) is further endowed with interesting vanishing moments:
J ' d x h ( r ) = 1 and J ' d x k ( x ) (x
- ;)"I
= 0 for
112
= I , 2 and 3.
(53)
In the early days of the Daubechies wavelets, the introduction of vanishing moments for the scaling function led to the construction of the well-known co(flefs. The parameter a in Eq. (52) can be directly computed from the filters coefficients an by using the first nonvanishing momentum of cp(.r): writing (54) we have
178
J.-M. LINA. P. TURCOTTE and B. GOULARD M U (for J=2.4,h and 8)
5 . -5
.
-25
.
-30
FIGL~KF. 12. Left:
.s((o);
/
I
/
.
-6
, ,
-5
,
.
-4
,
.
I
, .
. I
-3
.
’ .
.’
I
/
.’ .
-2 -1 log(w)
.
.
0
I
2
log-log scale. The slope of the straight lines is J
+ 1.
Straightforward integrations by part lead to
For J = 2 and 4, (Y is respectively equal to -0.164 and -0.089. We observe that Figs. 7 and 10 are consistent with Eq. (52). Another interesting relationship can be stated between $ and cp. Writing $(x) = w ( x ) i7r(x), we look at the relationship between the real functions w and h (the real part of the scaling function cp). As seen on Fig. 12, it is found numerically that, at least for J up to 8, the ratio
+
is real and behaves as d+I.Such a relationship certainly does not occur in the real Daubechies cases. However, it is worth recalling that continuous wavelets are usually generated by taking successive derivatives of some scaling function as the Gaussian. The famous “mexican hat” wavelet is the second derivative of the Gaussian function. Here, we obtain compactly supported orthogonal complex wavelets whose real part is close to being the derivative of a smooth function, the real part of the corresponding complex scaling function.
VII. THEMALLAT ALGORITHM WITH
COMPLEX
FILTERS
The discrete multiresolution analysis of f consists of the computation of the coefficients of the expansion
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
179
where jo is a given scale (low resolution). In practice, the sum over j (the details at finest scales) is finite and ,f’ is projected onto some approximation space vjmSx : PV,,,,~“ f (XI =
C cjl,l,,x.
k~;,,,~,~,k
(59)
k
The coefficients in the expansion equations (46) and (47) are computed through the orthogonal projection of the field over the multiresolution basis:
Starting with Pv,,,,,, f , the wavelet coefficients are computed with the frst wavelet u‘econiposition algorithin W composed with the low-pass prqjection V, + V / - , and the high-pass projection V, + W / - l :
h
k
Conversely, any elements of 4-1and of W,-l can combine to give a unique vector in 4; this reconstruction (denoted by W - ’ ) is expressed by the inverse fast wavelet transform:
k
k
In most applications, the signal to be analyzed is real-valued: The complex wavelet representation provides a redundant description of the signal. Equation ( 5 2 ) helps in interpreting this redundancy because, using the Taylor expansion of a one-dimensional field, we can estimate the real and irnaginary parts of the coefficients c i as
Let us consider the estimate of the finest scale approximation of f , that is, f k . A crude approximation is simply
Pv,,,,,,,(f), given a sampled function
180
J.-M. LINA. P. TURCOTTE and B. GOULAKD
FiciLlat. 13. Projection onto V,,,,,,,: real part (left) and imaginary part (right) of P,,!,,],,,1. (The original image is the right iinagc displayed on Fig. 10.)
given by L ‘ ; , ~ , ~ , , x= 2pJ/2fk. Denoting by !)IJ c V, the set of all functions in V, with real-valued modes, this approximation is nothing but the orthogonal projection (denoted by P!,il,,,z,k) of ,f’ onto !Hj,,l,l,. This corresponds to “Mallat’s initial conditions” for fast wavelet transform. A more accurate estimate of PLJ,,,~,,~ ( f ) is obtained by using the operator
This projection gives a nontrivial imaginary part for the cl”’.“.As expected, it corresponds to the Laplacian of the estimated real part. This is illustrated nicely in the 2-D example displayed in Fig. 13.
VIII. RESTORATION FROM n j F : PHASE The issue discussed here is nieant to facilitate understanding of the redundancy of the complex wavelet representation of a real signal. In other words, we want to understand the “role of the phase of a complex wavelet coefficient d j , .”~ For simplicity of notation, the discussion is done for 1 -D signals but the simulation is presented in 2-D. Let us first define two projectors, P,Ii and P r . The projector P,,{ extracts the first-order approximation of the scaling coefficient of the expansion equation (59) at the finest resolution (!)1 denotes the real part):
COMPLEX DYADIC MULTIKESOLUTION ANALYSES
181
Let us now consider the wavelet expansion equation (58) of a given tield .f‘o and define the phase of the wavelet coefficients 0 j . k = Arg(d,,k). We observe that the new set of functions Qj,k(x) = e”’:$J,~(x) is also an orthonorinal basis of L2(R):This “local rotation” of the wavelet basis leads to a inultiwavelet basis adapted to the signal. Indeed, we define the istyhase spuce r by the set of all expansions
where the coefficients ri are now positive real numbers. The PI. is the orthogonal projector on this space that depends on the phase of the wavelet coefficients of the original field with which we start. Given an arbitrary wavelet expansion of the form (2) with d,,,k = w,,,k i u ; , ~ the , projection on the isophase space is defined by the closest point on r, that is,
+
with
We further observe that both P:li and f’l- project onto convex spaces (POCS). Considering an arbitrary point .70 in r, a well-known theorem states that the sequence of alternate projections shown in Fig. 14, that is,
converges and, in the present case, the limit point is the original real signal .f‘o from which we defined r. The 2-D generalization of this algorithm is straightforward using the usual cross-product of the I-D rnultiresolution basis. For the sake of illustration of the “phase reconstruction algorithm,” Fig. 15 displays the original picture f o , the initial point P,~{,fo(obtained by killing all the modulus of the ~ ,j,,,i,x - 4 wavelet coefficients of the four-level decomposition, that is, j , = with SDWJ = 2) and the POCS reconstructions f n = 1 o 0 and J ’ J J = ~ ( J ( ) ( ) . We first notice that the POCS gradually restores the details of the iinage from coarse to fine. As we notice in Eq. (68), the projector PI- “shrinks” the inodulus of the wavelet coefficients, even to 0. I t is worth recalling that shrinkage techniques
182
J.-M. LINA, P. TURCOTTE and B. GOULARD
F~GLJR 14. E Phase reconstruction by alternating projections on aftine spaces
rj,,,zlx and K,,,,,,
are nowadays an efficient tool for denoising. Phases thus encode the "coherent" structures of the signal and the POCS algorithm reconstructs the original image through the coherency of the encoded information. The restoration of the modulus of the wavelet coefficients is illustrated in Fig. 17: coefficients of the coarser level j = j o = j,,,,, - 4 and those of the finest scale j = j,,,, - 1, both for f n = l O O O . We can observe the resulting shrinkage of the wavelet coefficients that depends on the scale of the details. Let us further mention the significant speed-up of the POCS algorithm by using a relaxation parameter in the isophase projector. This is done by redefining PI- as follows
In place of Eq. (69), we now consider the new sequence of projections (see Fig. 16), .frl
= (P!iiPr(Ll) ) " ~ ! I t ( . j ' o )
(71)
where h,, is computed in order to minimize the quadratic error ll.fr, - f112. The iterative algorithm is obtained on the form
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
Fq
183
FIGURE 16. Phase reconstruction by alternating projections with relaxation parameter A. 200000 200000, 150000 I00000
A
.
50000 0
0
”
f ..t
50000 100000 150000 200000 (a)
60001
’
’
5000. 4000 -
3000 .
2000 1000
0
1000 2000 3000 4000 5000 6000 (h)
FIGURE 17. (a) coarse scale wavelet modulus of .f’~cno vs original wavclet modulus: (h) finest scale wavelet modulus of .flow vs original wavelet modulus.
with 6,, = d J l / r J where l, d,, is the distance between the two convex (i.e., the energy of the imaginary part killed in the projection P!,,),
In Fig. 17 we have displayed the amplitude of the reconstructed wavelet coefficient versus the original true value. In this example, all the modulus of the wavelet coefficients have been set to zero at the beginning of the POCS algorithm.
ENHANCEMENT IX. IMAGE The bidimensional multiresolution analysis is built from the product of two niultiresolution spaces V. The scaling function @(x, y ) = cp(x)cp(y) generates
184
J.-M. LINA, P. TURCOTTE and B. GOULARD
Vo and, complemented with three wavelets @(x, y) = @(x)@(y),q ’ ( x , y ) = @(x)cp(y), @(x, y ) = cp(x)cp(y), it spans VI . The functions are complexvalued. In particular, we have
and Eq. [ 5 8 ] or Eq. [59] now generalizes in two dimensions as
span the spaces V, and W > ,respectively, so that
In the sequel, we denote by W N the N-levels wavelet transform
where d = (do,d ’ , d 2 ) . Figure 13 shows an example of the projection Pv,,,,,,,I and Fig. 18 displays the modulus of the complex wavelet coefficients (and the c 111 ~ ~ , ~ in , ’the s upper left corner). The real and imaginary parts of the scaling function are
where G(x, y ) denotes the real smoothing kernel h ( x ) h ( y ) .On one hand, the real part of the 2-D scaling function is close (because a2 < < 1) to the smoothing kernel G(x, y ) while, on the other hand, the imaginary part is proportional
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
I85
I’ICiLJKE 18. Modulus of the coniplex wavclet coefficients (SDW4, N = 3 ) . The top leli sector is the matrix of coefficients ri:,,. The other sparse matrices are the wavelets coefticicnts when the scale j increases from .jo to ,j,,,L,x = j~ -1- 3 (3 directional wavelets).
to the Laplacian of G(x, y ) : W ( x , y ) is thus the “Marr wavelet” associated with @(x,y ) 2: G(x, y ) .
As the real and imaginary parts of the wavelet transform coefficients of some r e d image correspond to the convolution of the original field with the real part and the imaginary part of @ , , , , , , , l (x, y ) , respectively, we then have access to the (multiscaled) smoothed Laplacian of the image:
The simultaneous presence of a smoothing kernel and its Laplacian in the complex scaling function can be exploited to define some elementary operations on the wavelet coefficients. In other words, we use this information to synthesize a new image that corresponds to some prescribed operation. A typical example is de-noising; this application is among the most successful applications of wavelets. Here, we investigate the edge enhuncemenr that is
I86
J.-M. LINA, P. TURCOTTE and B. GOULARD
usually implemented through the sharpening operator
Starting with an expansion of the form given by Eq. (76),we synthetize a new image
defined by
t11.11
where the scale 1 runs from ,j,,,L,x to the coarse scale jo. hl are real-valued and 61 (:) are some functions of the complex wavelet coefficients. Our intention is to extract various kind of details at different dyadic scales of the image and to add them to the original with appropriate weights. Some particular choices of hl’s and 61’s are particularly worthy of mention. For example, let us consider:
We observe that hl = = 0 leads to the identity, that is, I = 1. Introduction of nonvanishing (but small) hl amounts to adding scale dependent details that are similar to the multiscale Laplacian. Figure 19 shows an example of such a processing with SDW4 and N = 3. In comparison with the original image, the local contrast has been improved significantly: Artefacts from the scanner acquisition are now apparent in the processed lady image. This “anomaly detection” is one of the most promising applications of multiscale representations. Let us mention that other efficient rnultiscale sharpening transformations have been proposed in the recent past. The main difference in the current work is the orthogonality property of the SDW transform and the use of the phase of a complex basis. We recall that the SDW bases are not derived from a representation that allows specific
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
187
FI(iIJKE 19. Image eiihaiicciiient: the teat image ;I( left. The I-ight image has heen synthctircd u4ng nonvanishing A's and r in Eqs. (XO) and (82)
enhancements. On the contrary, the Laplacian has been shown to be inherent to this particular orthogonal basis.
x.COMPLEX S H R I N K A G E Let us now consider an image corrupted with additive Gaussian noise (denoted by N) projected into the approximation space of highest resolution
We want to estimate a real signal
-
from the observed image I =Io+N. The wavelet shrinkage technique amounts to computing estimates from the wavelet representation of the observed signal. Let
I88
J.-M. LINA, P. TUKCOTTE and H . GOULAKD
We have to solve the following variational problem: Given a positive Lagrange parameter h, find an image I* that minimizes the functional
L ( I ) E E ( ? ,I ) + h S ( I ) .
(91)
Here E(?, I ) is the root-mean square (rms) error between the observed and test images:
whereas S ( I ) represents some constraint on the regularity of the optimal solution; in fact it regularizes the ill-posed problem of minimizing E alone. The choice of this constraint involves some LI priori knowledge about the true signal we aim to restore. The parameter h controls the trade-off between goodness of fit and CI priori smoothness. This latter property, given by S ( I ) , can be quantified by using a norm in some Besov space (see De Vore and Lucier, 1992). An important result i n wavelet theory is the definition of such a norm in the wavelet representation. Considering the Besov space (i.e., the space of functions “with (Y derivatives in L , ” ) , it can be shown that
cc 2
S(I)=
(z
/,,,.,x-~
ld),kl,k2Ip)“”’, s’ = (Y
2”“’
[=o ]=n
+ 1 - 2P
-
(931
defines a norm in this space. For the sake of simplicity, we consider here p = q = 1 and(-Y= 1; then
and the functional defined in Eq.(91) can be read as 2
11,,.n,-~
Equation (94) illustrates the efficiency of the wavelet representation for solving the preceding optimization problem because it “diagonalizes” the functional over each coefficient of the decomposition. As illustrated in Fig. 20, the
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
189
FIGURE 20. Solution of Argrnin R d , 2).
solution d* of our variational problem d* = ArgminF(d, d )
(97)
is clearly a point lying on the ray defined by the phase of 2.In other words, the phase is preserved and plays a role in this problem rather like a parameter. Let us rewrite F(d, d ) in terms of the amplitudes,
The solution of Argmin F ( r , 7) is obviously given by r* = ( r
-
i)+
(99)
where (x)+ is equal to x for x 2 0 and 0 elsewhere (see dashed line in Fig. 2 1 ). We thus obtain a soft-shrinkage of the wavelet coefficients’ amplitude with a threshold defined by k ( j ) . Denoting by 7this shrinkage operator of the wavelet
190
J.-M. LINA, P. TURCOTTE and B. GOULARD
FIGURE
21. Shrinkage function
s(x).
coefficients, the resulting estimate is given by
Let us emphasize that this approximation is obtained by modifying only the amplitude of the wavelet coefficients: The phases are preserved. This fact will be explicitly used in the Bayesian approach considered in the sequel. Let us consider the general case of a zero-mean complex Gaussian random variable ;rl with a normal distribution (see, for instance, Miller, 1974). Following the recent work of Picinbono (1996), we define the variance cr2 and the “relation” c by 2 def 0 = E(rpj), e f E ( ; r 1 2 ) . (101) The complex Gaussian distribution is then described by the density function ( 102)
with
Straightforward computation yields the following expression for Q:
191
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
where n = EelLYand
Let us consider the following expression for the wavelet coefficients computed from the observed image (we omit to write the indicesj, k ) :
using the polar representation of the wavelet coefficients, = pelt. First we obtain the following likelihood function:
2 = reiN and
do
Second, as we are interested in an estimator that preserves the phase of the wavelet coefficient, that is, an estimator of the amplitude only, we define a likelihood on the amplitude rj,k by
<
n , , k ( r I p ) = JC ( r , o = ~ j , k l p , = ~ , , k CT, , p, y, a = ~ , , k )
- re
(r-p)? o’r(H~k).
(108) Let us comment on the various “hidden” parameters (assumed to be known) in r(H,;,k). The angle 1/2y defines the principal axis of the distribution (for the noise) in the complex plane; the p parameter takes real values between 0 and 1 and measures the “circularity” of the complex noise component (see Picinbono, 1996): If the data are circular (i.e., the phases are meaningless), then p 0; however, real and imaginary parts are strongly correlated for p 1 (the limiting case p = 1 is obtained with the Haar wavelet for which all the wavelet coefficients lie on a line). The variance cr intuitively measures the noise level. In a fully Bayesian analysis, it would be possible to assume prior distributions on each of these parameters in Eq. (107). Here, however, we perform a partial Bayesian analysis by first using the wavelet coefficients at the finest scale to estimate r, p, and y and also using the fact that a! = = 0 j . k . Thus, the following estimators of the parameters arise naturally:
- -
<
Following the previous discussion, the prior distribution on p, which can be considered as a measure of smoothness, should depend on the Lagrange
I92
J.-M. LINA, P. TURCOTTE and B. GOULARD
parameter h in Eq. [91]. Here, we choose to model this smoothness by a Gamma (2, l / h ) density; that is
The resulting posterior density for p is thus found to be
The Bayes estimate can now be specified as follows (see for instance B. Vidakovic, 1999). Let us consider the signal contaminated by additive noise and expanded in some orthogonal discrete wavelet basis. The Bayes estimator with respect to the quadratic loss of the wavelet coefficient amplitude is equal to the mean of the posterior distribution: That is, for any pair of indices (j, k ) , the Bayesian estimator 6 j . k ( r ) of the amplitude of wavelet coefficient d,,k, given the observed amplitude r = r;.k is
Using the following probability integral (Abramowitz and Stegun, 1964):
where i"erfc(x) denotes the repeated integrals of the error function:
we obtain the following expression for the estimator
COMPLEX DYADIC MUI~TIRESOLUTIONANALYSES
I93
and
As seen on the graph of Fig. 22, S(x) gives a “supersmooth” version of a “softshrinkage” estimator. This function and the phase-dependent estimator given by Eq. ( 1 16) are our main results. In Fig. 22, we show the phase dependency of the threshold for various values of the “circularity.” As expected, we observe that the limiting case p % 1 does correspond to the usual real case for which a single phase (e.g., 0 or n)is present in the decomposition. Conversely, the threshold does not depend on the phase when the data are circular, that is, when p % 0. Note that approximating Six) by (x), (the dashed line on Fig. 21) simplifies the result in Eq. ( 1 16) and leads to the estimator
As expected, this result extends the initial estimate equation (99) with an explicit dependence on the phase. Images displayed in Fig. 23 illustrate the technique presented here. In the upper and lower images both columns contain the noisy observed images and the corresponding estimates, respectively. It should be mentioned that we have used h = m a - ’ in Eq. ( 1 17) ( n being the number of pixels of the image) in order to recover the universal threshold of Donoho and Johnstone (1995) when p = 1.
XI. CONCLUSION Ten years after the celebrated work of I. Daubechies ( 1 988) (finding the orthonormal bases of compactly supported wavelets with a maximum number
FIGURE22. Left: plots of O.Sr(O), for various values 0 1 p. (The threshold i s flat for p = 0 and peaky for p = I.) Right: A,,k(r).
194
J.-M. LINA, P. TURCOTTE and B. GOULARD
FIGUKI':23. Noisy and estimated images
of vanishing moments), the present paper describes a generalization of this problem, pointing out the existence of symmetric solutions. Indeed, such solutions are complex-valued and exhibit extra properties that have been used in various applications, mainly in image processing. Currently, wavelet-based processing has reached a midterm maturity and only a few methods have taken advantage of this new framework for signal description. All of these applications make use of statistical analysis of signals: zero-tree coding and data compression, de-noising, multiscale modeling, . . . . One of the most interesting and significant applications of this technique has been in the field of medical image processing, where it has helped interpret signals. In Fig. 24, we illustrate this with a mammogram in which details as small as microcalcifications have been amplified using the approach described in Sect. IX, where we use the complex nature of our solutions. Indeed, in Sect. VI, we explain the presence of a wavelet in the imaginary part of the scaling function: the
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
195
FIGURE 24. Image enhancemcnt: the test image (bottom) is a region on a digiiiied mammography image. The tipper image has been synthetizcd using S with SDW4 and N=3.
196
J.-M. LINA, P. TURCOTTE and B. GOULARD
sharpcning enhancement illustrated in Fig. 24 is based mainly on this result. The phase of the solutions has also been useful in the de-noising application presented in Sect. X. It is worth comparing our approach with the multiwavelet shrinkage method proposed by Downie and Silverman (1 996). The complex solutions cp(x) = h ( x ) ig(x), $(x) = w ( x ) iv(x), can be trivially recast as real-valued multiwavelet solutions where we consider vectors
+
+
and the matrix form of Eqs. (12) and (23). In this framework, a multivariate statistical analysis of the 2-D wavelet coefficients has been carried out (Downie and Silverman, 1996) and an “ellipsoidal” shrinkage has been proposed. This estimator amounts to preserving what we unambiguously identified as the phase of the complex wavelet coefficients. Moreover, we justified preserving the phases in Sect. X. Let us mention that such an “ellipsoidal” shrinkage technique has been used efficiently in the complex framework by Gagnon and Drissi Stnaili (1996). Their estimator corresponds to our result in Eq. ( 1 19). They have shown, with careful comparisons, that this de-noising technique based on complex wavelets out-performed the other standard methods for reducing speckle artefacts in SAR images.
ACKNOWLEDGMENTS This work is supported in part by A.N.S. Technologies Ltd. (Montreal, Canada) and by the Natural Sciences and Engineering Research Council (NSERC) of Canada.
REFERENCES Ahramovich, F., Sapatinas, T., and Silverman, B.W. (1996). Wavelet thresholding via a Bayesian approach. Tech. Report, Srhool of Matherncitics, Univ. of Bristol (UK). Ahramowitz, M., and Stegun, 1. (1 964). Hundbonk ($Muthmia/iccil Functions, NBS Applictl Math. Series. Battle, G . (1987). A block spin construction of ondelettes. Part I: Lemarit functions, Conun. M d l . Phys., 110: 601. BelLer, B., Lina, J-M., and Villasenor, J. (199.5). Efticient image coinpression using coniplex wavelets filters, IEEE Trrns. Signal Processing, 43: 10, 2425. Dauhechies, 1. (1 988). Orthonormal bases of compactly supported Wavelets, Cornn~un.oi? Pure and Appl. Mu/h., 41: 909. Dauhechies, I. ( 1992). Ten Lectures on Wtrve/e/s,S.I.A.M., Philadelphia.
COMPLEX DYADIC MULTIRESOLUTION ANALYSES
197
De Vore, R.. and Lucier. B.J. (1992). Fast wavelet techniques for near-optimal image processing, Proc. 19Y2 IEEE Militarv Commim. Co$, IEEE Communications Soc., NY. Donoho, D., and Johnstone, I. ( 1995). Wavelet shrinkage: Asyniptopia?, Jour. R. Stutist. SOC.B. 301 (and reference therein). Downie, T.R.. and Silverman, B.W. (1996). The discreste multiple wavelet transform and the thresholding methods, Tech. Report. School of Muthemutics, Univ. of Bristol (UK). Gagnon, L., and Drissi Smaili, F. (1996). Speckle noise reduction of airhone SAR images with symmetric Daubechies wavelets, SPIE-2759. Lawton, W. ( 1993). Applications of complex valued wavelet transforms to suhhand decomposition. lEEE Trans. Signul Proc., 41: 3566. LernariC, P.G. (1988). Une nouvelle base d'ondelettes de L2(R"). Jour. Lie Math. Piires etApp/., 67: 227. Mallat, S . (1989). Multiresolution approximation and wavelet orthonormal bases of L'iR), firms. Ainer. Muth. Soc., 315: p.69. Mallat, S. ( 1998). A Wuveld 7 h r qf'Sigrztrl Processing, Boston: Academic Press. Mcyer, Y. ( 1992). Wave/et.s cmd Operators, Advunced Muthemurics. Cambridge Univ. Press. Miller, K.S. ( 1974). Complc~xStochustic Proces.srs, Reading, MA: Addison-Wesley. Picinbono, B. ( 1996). Second-order complex random vectors and normal distributions, lEEE Trans. Sig. Proc., 44: 2637. Stromherg, J. (1982). A modified Franklin system and higher order spline system on 08" as in Honor of' A. Zvgmmtd, 11, Wadsworth unconditional bases for Hardy spaces, Cor~fi~rc.nc~e Math. Series, p. 475. Vidakovic, B. Stutistical Modeling by Wivelets. J. Wiley & Sons Ed., 1999.
READINGLIST Lina, J.M., and Mayrand, M. (1995). Complex Dauhechies wavelets. App. Comp. Hurmorric Anal., 2: 219. Lina, J.M. (1997). linage processing with complex Daubechies wavelets, / i f f . Jour. qfMuthernciricul Imuging uncl Vision, 7:2 1 I . Lina. J.M. (1996). Complex Dauhechies wavelets: filters design and applications, First ISSAC Cortf, Delaware. Lina, J.M., and Drouilly, P. (1996). The importance of the phase of' the symmetric Daubechies wavelets representation of signals, Proc. 1WlSP '96, Manchester (UK), p. 69.
This Page Intentionally Left Blank
.
ADVANCES IN IMAGING A N D ELECTRON PHYSICS V O L 10‘)
Lattice Vector Quantization for Wavelet-Based Image Coding MIKHAIL SHNAIDER Motorolo Austrcrlitrn Resetrrch Centre
ANDREW P. PAPLINSKI Schuo/ of’Cornpiirer Science wid S o j t ~ ~ u rEngineering. ? Monnsh Liniversity. Auslrdicr
I . Introduction . . . . . . . . . . . . . . . . . . . . . . .
200
I1 . Quantization of Wavelet Coefticients . . . . . . . . . . . . . . .
201
A . Fundamentals of the Quantization Process . . . . . . . . . . . .
203
B. Information Distribution across the Coefficient Matrix . . . . . . . .
204
C . Optimal Bit Allocation
.
.
.
111. Lattice Quantization Fundamentals
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
208
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
213
. . . . . . . . . . . . . . . . . B. Distortion Measure and Quantization Regions . . . . . . . . . . .
214
C . Optimal Quantizer for Wavelet Coefficients . . . . . . . . . . .
217
A . Codebook for Quantization
IV . Lattices
214
. . . . . . . . . . . . . . . . . . . . . . . .
218
A . Root Lattices . . . . . . . . . . . . . . . . . . . . .
210
B. Construction of the Root Lattices . . . . . . . . . . . . . . .
224
C . Laminated Lattices
226
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
V . Quantization Algorithms for Selected Lattices . . . . . . . . . . . .
220
A . The Closest Point of a Dual Lattice . . . . . . . . . . . . . .
231
B. Z,, Lattice
. . . . . . . . . . . . . . . . . . . . . .
C . D, Lattice and Its Dual . . . D . The Laminated A16 Lattice . . VI . Counting the Lattice Points . . . A . Estimation of the Number of the
232
. . . . . . . . . . . . . . .
232
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
233
Lattice Points within a Sphere . . . . .
233
234
B. The Number of Lattice Points on a Sphere . . . . . . . . . . . .
23.5
C . Relationship between Lattices and Codes . . . . . . . . . . . .
238
VII . Scaling Algorithm . . . . . . . . . . . . . . . . . . . . .
242
V111. Selecting a Lattice for Quantization . . . . . . . . . . . . . . .
244
IX . Entropy Coding of Lattice Vectors . . . . . . . . . . . . . . . .
250
X . Experimental Rcsults . . . . . . . . . . . . . . . . . . . .
2.54
199 Volume 101) ISBN 0-12-01.3751-1
ADVANCES IN IMAGING A N D fII.BCTRON PllYSlCS Copyright C) 1994 hy Acadcrnic I’res\ All righl\ OC ~epruduclimin any torni tc\crved . ISSN 1117h~Sh70/9L)$30 00
200
MIKHAIL SHNAIDER AND ANDREW P. PAPLLNSKI
.
XI. Conclusions Appendix A: Cartan Matrices of Some Root Systems References
. . . . . . . . .
.
.
.
.
.
. . .
.
.
.
258
.
.
.
260
.
.
.
261
I. INTRODUCTION A vast variety of approaches to deal with the task of image coding and compression have been developed. Each of the methods is characterized by parameters such as compression ratio, speed of coding, quality of decoded images, and simplicity of implementation. In practice it is extremely difficult to satisfy all of these criteria listed, so usually a trade-off between them is necessary. For instance, with an increase in the compression ratio the quality of a decoded image is in general degraded. Also, an increase in the speed of coding is typically achieved by simplifying the compression algorithm, which in turn results in a reduction in the quality of the decoded image. A practical image compression system is built from a number of components, the first being a transform unit followed by a quantization unit, and a bit allocation unit. We emphasize these three generic components of a compression system because our considerations will be primarily related to them. Although quantization is a very effective technique for data compression, direct application to raw image data has some drawbacks that are in general independent of the specific method of quantization employed. For example, a significant disadvantage of vector quantization (VQ) is the well-known “blocking” effect, which is especially visible at low and very low bit rates. This can be overcome by applying VQ in the frequency domain instead of in the space domain. The domain change is achieved by an initial mapping of the raw image data onto a frequency plane using one of the transforms, such as Fourier, Discrete Cosine (DCT), etc., and then quantizing the resulting transform coefficients. Among the techniques for space-frequency transformation developed in recent years, the wavelet transform has attracted remarkable attention. It has already found applications in a wide range of areas from new methods of solving partial-differential equations to an analysis of geological data. By projecting image data onto a set of independent subspaces the wavelet transform captures essential space-frequency characteristics of an image. An appropriate quantization of the projections obtained as a result of this process may lead to significant compression of the original image data, while introducing only minimal distortion. The combination of the wavelet transform and scalar or vector quantization forms a coding method that has become extremely attractive for image compression. The seminal work in this area of research was published by Mallat in
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
20 1
1989. The idea was further developed by Zettler rt d . ( 1990), Antonini et al. ( 1992), Lewis and Knowles ( I992), DeVore et d . ( I992), Shapiro ( 1993), and Wang rt d.(1996), etc. However, the main emphasis of this study, ltrttices
ond their ripplicntinn ,f;w yicrintizcrtion of wavelet coefficients, appears to have
been beyond the scope of most studies. Some results on the application of lattices have been published only recently. They include work by Barland er d . ( 1994), Barland rt 01. ( 1993), Sampson rt ~ i l (. I994), Chen et cil. ( I997), and Shnaider (1997). The focal point in this study is lattice vector quantizaticin based on the theory of niultidimensional lattices. In this work we present a methodical investigation of the main concepts in the lattice theory that emerge across a number of areas in mathematical and information sciences. In particular, we formally show that uniform lattices are highly suitable for quantization of the wavelet coefficients of real images. Moreover, based on existing studies, it is shown that these lattices constitute optimal quantizers for wavelet coefficients in vector spaces of corresponding dimensions. We discuss the basic properties of a variety of lattices, in particular, the root and laminated lattices. The problem of counting the number of available lattice points essential from the point of view of quantization is also addressed. A vast variety of lattices opens a possibility to select one lattice over another for a particular application. We present here a solution for lattice selection for quantization of wavelet coefficients of real images. A number of compression test results pertaining to the application of the theoretical aspects of image coding using multidimensional lattices are presented i n the experimental part of this chapter. I n the context of a practical image coding system we also present the details of the method for the transmission of a Huffinan codebook, which was originally proposed by the authors (Shnaider and Paplinski, 1997). The presentation is summarized with a conclusion.
We assume that the reader is familiar with the wavelet transform. From the body of literature devoted to this subject, works by Daubechies (1992) and Meyer (1993) can be recommended as a good starting point. A single step of the forward wavelet transform of a one-dimensional (I-D) signal consisting of 2 N samples generates two vectors of low- and high-frequency wavelet coefficients, each of size N . For two-dimensional (2-D) signals, such as images, the structure of the coefficient matrix can be complex even for a small number of expansion levels. In this section we briefly introduce some technicalities related to the 2-D wavelet transform. We observe first that in practice a 1-D wavelet transform is
202
M I K H A I L SHNAIDER AND ANDREW P. PAPLINSKI
performed by successive application of suitably selected high-pass and lowpass filters and an operation of subsampling by a factor 2. In order to obtain coefficients of the wavelet transform of an image, a I-D expansion is performed first on the image rows followed by an expansion by columns. The set of wavelet coefficients reflects the space-frequency content of an input image. Typical expansion steps for the discrete wavelet transform (DWT) of iniages are depicted in Fig. I , and resulting structure of the wavelet coefficients is shown in Fig. 2. The wavelet coefficient matrix is of the same size as the
Original Image FiciiiuE 1.
Typical expansions steps lor the wavelet transform of an image
Pitiuut: 2. A structure of coefficients of a wavelet transform of an image.
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
203
- (HL,
-
I.
24
2.1
(HH)",
H FIGIJRE
2.1
3 . A single expansion step of a 2-D DWT.
original image. The upper-diagonal quarter of the coefficient matrix is expanded further in successive steps. In detail, operations performed during a single expansion step are illustrated in Fig. 3. In this figure, the 2-D input signal (LL),,,- I represents a matrix of low-frequency coefficients calculated i n the previous step, for example, the matrix (LL)., of Fig. I . Each row of the matrix (LL),,z+l is decomposed using the lowpass ( L ) and high-pass ( H ) filters, and subsampled by a factor of 2. As a result, we obtain two submatrices, each having half the number of columns of the matrix (LL),,,+I.Next, each column of these two submatrices is decomposed using again the lowpass and higli-pass filters and the subsampling operation. Each of the four resulting matrices. namely, (LL),fz,(LH)f,,, (HL),,l, and (HH),,, has a quarter of the matrix (LL),,,+, size, as shown in Figs. I and 2, due to the dilution factor being equal to 2. The low-frequency matrix (LL),,, is then further expanded as already described here until the required number of expansion steps n is reached. A. Futidumentuls of the Quentizcition Process
Quantization of the wavelet coefficients is the next step in image coding after the wavelet expansion is performed on an image. Because quantization allows a certain amount of flexibility in bit allocation to the matrix of coefficients, one needs to select optimal parameters for a quantizer, such as the block size and the length of the codebook, in order to achieve a satisfactory image quality for a given bit rate. The problem of optimal quantization of wavelet coefficients of different frequencies has been extensively studied in recent years (Vetterli, 1984; Woods
204
MIKHAIL SHNAIDER A N D ANDREW P. PAPLINSKI
and O’Neil. 1986; Charavi and Tabatabai, 1988; Senoo and Girod, 1992; and Li and Zhang, 1994). The latest overview of some of the existing quantization algorithms was conducted by Cosman et al. (1996). In this survey, the authors point out that the “equal-slope” algorithm is one of the most popular. There exist a number of variations of the basic “equal-slope” algorithm. The goal of this algorithm is to determine the optimal quantizer for a given source. The idea is to initially collect the distortion information by encoding every subband with all candidate quantizers. Then, for each subband graphs are plotted of the quantizer vs the level of distortion that it introduces. The optimal quantizers are found on the parts of the curves with equal slopes corresponding to the required overall bit rate. However, in order to generate an optimal solution in most cases, this method requires an extensive amount of calculations due to the necessity of forward and inverse quantization for all candidate quantizers across the solution space. The term of “candidate/available quantizers” does not necessarily imply that the referred quantizers rely on different methods or algorithms. A simple modification of the parameter of the quantizer such as the size of a quantization block and the length of the codebook, distinguishes that quantizer from the similar quantizers from the same class. In addition to the studies mentioned in Cosman et al. (1996) some research has been conducted on the sensitivity of the human eye to different spatial frequencies. This research shows that our eyes are not equally perceptive across the frequency range (Campbell and Robson, 1968). This feature of human eyes is particularly useful in terms of wavelet expansion of images because each level of wavelet coefficients represents a certain spatial frequency of the image. We can take advantage of this information and encode the wavelet coefficients with a bit rate that produces minimal subjective distortions. The preceding two methods, namely the “equal-slope” method and the one based on human visual perception, can be used for bit allocation across the matrix of wavelet coefficients. Although both methods give acceptable results, it seems possible that incorporating them into a single algorithm could exploit the advantages of both. Such an algorithm will be presented later in this section. However, before being able to do so we need to estimate the distribution of the information among the blocks within a matrix of wavelet coefficients. This task is carried out in the next section. B. Information Distribution across the Coeficient Mutrix As already mentioned, the result of the wavelet transform of an image is a
matrix that consists of blocks of wavelet coefficients corresponding to different frequency components of the image. From the point of view of subsequent quantization it is desirable to know the influence of each block of coefficients on the overall reconstruction quality of images. Here we address and illustrate
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
205
FIGURE 4. Image “Lena”- the original image
FIGURE 5. Image “Coast”- the original image
this problem by encoding and decoding a number of test images (high resolution scans of two of them, “Lena” and “Coast,” are given in Figs. 4 and 5). For simulation purposes we used MATLAB ( 1 992) with the wavelet toolbox we described (Shnaider and Paplinski, 199%). The test images were expanded by means of the 5-level DWT using a set of biorthogonal low-pass and high-pass wavelet filters (Daubechies, 1992). Applying the steps as previously
206
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
( ~ < H I S(HH),
‘
*%.
\\.. (HL), ---___ - - - - _ _ _ - - - . ...
‘\%
~
(LAW,
(HH),
FIGURE 6 . Test expansion
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
207
25 20 -
p?
%
15 10
24
-
22 -
‘
p?
20 18 -
16 -
FIGURE 8. Test expansions of the “Lena” image: blocks of wavelet coefficients vs relative SNR.
We have consecutively set each submatrix of the wavelet coefficients to zeros and measured the quality of reconstruction. The results of the experiments for the test images “Lena” and “Coast” are given in Figs. 8 and 9. In these figures the relative SNR, which is defined as maximum SNR for the image minus the SNR with a submatrix set to 0, is given on the vertical axis and plotted against the matrix structures as in Fig. 6.
208
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
FICXJKE 9. Test expansions of the “Coast” image: blocks of wavelet coefficients vs relative SNR.
Examination of Figs. 8 and 9 shows that in general the overall performance depends less on diagonal orientation than on vertical or horizontal orientations within the same level. Furthermore, the effect of cancellation of the vertical or horizontal edges of a level is close to the effect of cancellation of the diagonal edges of the next level up. We can observe that most of the energy of the real images is typically concentrated in the low and medium frequency components (higher block indexes) while the high frequency components formed by very sharp edges are not as significant. In terms of wavelet coefficients this means that, following the general convention, the coefficients containing most of the image data are concentrated in the top left corner of the coefficient matrix. Therefore, it is desirable that the parameters of vector quantization for encoding the wavelet coefficients reflect this property. An uneven distribution of energy across a range of frequencies in images is taken into account in the algorithm of an optimal bit allocation for quantization of wavelet coefficients presented in what follows. This algorithm takes advantage of human perception of images and keeps the amount of calculations low due to the fact that it relays on an appropriate model of the probability distribution function of wavelet coefficients instead of on the coefficients directly. C. Optimal Bit Allocation
The process of quantization unavoidably introduces some distortion to the image being compressed. The appropriate shaping of the quantization noise across the matrix of wavelet coefficients can reduce undesirable distortion in the reconstructed images. In this section we discuss an algorithm for an optimal bit allocation across the matrix of wavelet coefficients. This algorithm
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
209
is based on modeling the probability distribution function (PDF) of the wavelet coefficients.
I . Distortion Function Following the test results presented in Section ILB, we can conjecture that the quality of image coding depends highly on the appropriate shaping of the bit rate across the coefficient matrix (Shnaider and Papliriski, 1994). Therefore, an optimal bit allocation routine that minimizes the overall distortion D for a given bit rate R is required. Let us specify the total distortion D for a given bit rate R as follows:
D ( R ) = DN(rN) +
N-l
aibi2F"'1 i= I
where D N ( t - N ) is the distortion produced by quantization of the lowest frequency block (texture), t-; is a bit rate for a block i, ( 1 5 i 5 N ) , c is a parameter which is set to 2 for the mean-squared error distortion measure. The remaining variables are specified in the following way:
with G ( n , ,2 ) being a precalculated value of the average mean-squared error of the optimal n, -dimensional quantizer (Gersho, 1979), p , being the probability distribution functions (PDF) of the ith block of the wavelet coefficient matrix, k, being the size of the ith block. And B,, a human perception factor (Campbell and Robson, 1968), is defined as
where q, is a relative position of the ith block in the frequency plane and D, is the variance of the wavelet coefficients. Finally, the parameter b, from Eq. ( I ) is specified as follows: h - '--(/:+I; (4 1 I where I f and 1;' are the levels of the transform in the horizontal and vertical directions, respectively. Equation (1) is a modified version of that given in Antonini et al. (1992). The modification takes into account wavelet packets as well as the traditional wavelets. One can see that the total distortion D(R) given by Eq. ( 1 ) depends on the probability distribution of the wavelet coefficients. The straightforward
210
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
approach is to compute the PDF for each block within the wavelet coefficient matrix. Alternatively, one can use a prior known function to model the required PDF. Although the model gives only an approximation of the real PDF, this approach is significantly faster which is important for a number of applications, such as real-time image compression. In the next section we will develop a PDF model of wavelet coefficients using their statistical properties.
2. A Stutisticul Model ?f WLzvelet Coeflcients Before we proceed to modeling the PDF of wavelet coefficients we need to reexamine the structure of the single stage 2-D wavelet expansion given in Fig. 3. As we have described it already, at each level of the 2-D wavelet expansion the input matrix is split in four matrices, each of which represents certain frequencies of the input matrix. For an input matrix ( L I ~ ) , , ~the - I output of the 2-D wavelet expansion is (LL),,,, (LH),,l, (ffL),,l,and ( H H ) , , , where L and H refer to application of the low-pass and high-pass filters, respectively. In such a way, while matrice5 (LH),,,, (HL),, and (HH),,, represent highfrequency content, such as edges, the matrix (LL),,, gives the low-frequency content of the input image, such as textures and shapes. Accordingly, we can classify the complete set of the wavelet coefficients into high-frequency and low-frequency coefficients.
High-Frequency Coeficients It has been shown in Antonini et (11. (1992) that for real images the high-frequency wavelet coefficients fluctuate around zero with a variance that depends on the density of discontinuities or edges in the input image. In order to approximate the probability distribution of the high-frequency wavelet coefficients with an appropriately selected model, we compare the graphs of distribution of wavelet coefficients presented in Fig. 10. The distribution of wavelet coefficients shown in Fig. 10 is based on a randomly selected block from the matrix of coefficients of the image “Lena.” A direct comparison shows that the Laplacian and generalized Gaussian (GGF with 6 = 0.7) functions give a good approximation of the PDF of the higherfrequency coefficients (Antonini et nl., 1992; Barland et al., 1994). The generalized Gaussian function (GGF) is given by
where
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
211
0.2s
0.2
0.1s
0.1
.
0.05
0
-
-30
I
-20
-10
0
10
.
.
30
20
0
F I G U R E10. Probability distributions functions (PDF) of wavelet coefficients and generali d Gaussian functions (GGF) for: '-" [he high-frequency wavelet coefficients, ''. . .'' the Generalized Gaussian with 6 = 0.7, "- -" the Laplacian. "- . -" the Gaussian.
Here, r(.)denotes the Gamma function, o is the standard deviation and 6 is a parameter. If is set to 1, the preceding equation becomes the Laplacian distribution. For 6 set to 2 it leads to the standard Gaussian distribution.
<
Low-Frequency Coejicients The statistics of low-frequency coefficients are similar to the statistics of the image data. As we do not restrict ourselves to any particular type of images, it is not possible to give a general model of the PDF valid for any image. However, most of the real images exhibit significant correlation in the space domain, which can be exploited by the familiar predictive coding (Gersho and Gray, 1992). Typically, the probability distribution of the errors of prediction is a bell-shaped function that can be approximated by the generalized Gaussian function (GGF) with 6 = 2, that is the standard Gaussian distribution with zero mean. Therefore, by using predictive coding it is possible to utilize a single function (GGF) for modeling the PDF of all wavelet coefficients. For the low-frequency coefficients the predictive coding error vectors are approximated by GGF with 6 = 2, whereas for the high-frequencies wavelet coefficients the GGF approximation with 6 = 0.7 is used (Shnaider and Papliriski, 199%). Formally, we can write PI =
{
GGF(6 = 0.7), GGF(6 = 2),
if 1 5 i 5 N if i = N
-
I;
(7)
212
MlKHAlL SHNAIDER AND ANDREW P. PAPLINSKI
where i is the index of the block, that is, the level of the transform within the matrix of wavelet coefficients. Given the preceding probability distribution model, function ( 1) describing the total distortion after quantization of the wavelet coefficients can be expressed in a homogeneous form as follows:
Now, this function needs to be minimized for a given total bit rate.
3 . Minimizatiori of the Distortion Function As shown in the previous section the initial problem of optimal distribution of the quantization bits across the matrix of wavelet coefficients narrows down to minimization of the total distortion for a given overall bit rate or, in other words, compression ratio. That is, we need to minimize the distortion function (8) subject to
c N
R=
h,r,.
I=
(9)
I
The first approach to minimization is to use the Lagrange multiplier technique. In this case the following derivatives with respect to r, must attain zero:
From this we obtain the following equation for r;:
Solving for r; yields 1
r; = - - log, c
h
___
cln(2)a;
or, equivalently, log2(ln(2)ai) - log, C
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
2 13
By substituting Eq. ( 1 2) into the constraint equation (9), we have
or
Now we can substitute Eq. (14) into Eq. (12) to obtain the following expression for the bit rate:
Finally, the bit rate for the ith block can be calculated as r, = BR
+ c1 log, -
ln(2)al N
n(ln(2)u1
)B’’l
i= 1
However, direct calculation of the derivative using Eq. ( 1 0) does not guarantee that the values of r; are within acceptable range. The alternative solution to the minimization problem specified in Eq. (8) is the utilization of one of the numerical optimization methods. We use the algorithm developed in Riskin’s work (1991), which gives the optimal bit allocation and allows control of the values of bit rates. Initially, r; = 0(1 5 i 5 N ) . Then, some additional bits A r are allocated at each iteration to the rl that attain the minimal overall distortion for all i ( I 5 i 5 N ) . The procedure continues until the desired bit rate (9) is reached. In our calculations A r has been set to 0.01 bpp, and the parameters in Eq. (3), and y , have been set to 1.05, and I , respectively. In conclusion, the algorithm for an optimal bit allocation for quantization of wavelet coefficients presented in this section enables the number of calculations to be kept low by taking advantage of the GGF model. The numerical minimization algorithm for the distortion function has been employed. 111. LATTICE QUANTIZATION FUNDAMENTALS
Examination of various vector quantization (VQ) techniques shows that a majority of them suffer from high computational cost that is associated with the
214
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
generation of an appropriate codebook. This implies that they require either a long time to encode images or complex hardware to facilitate the computation in an acceptable period of time. It is possible, however, based on the theory of n-dimensional lattices to develop a family of quantizers that are free from this drawback. The resulting lattice quuntizers offer computational simplicity for image coding and will be examined in detail in this section. We begin our discussion by considering the motivation for lattice quantization. Then, we give a formal definition of lattices followed by the description of two classes of lattices especially important in the context of quantization: root and laminated lattices. Successful application of a lattice is only possible if there exist enough lattice points available for quantization. Therefore, we also address the problem of counting the lattice points. First, an approximate solution to this problem is presented. Then, by using theta functions, we show how the exact number of lattice points available can be computed. A. Codehook for Quantization
Let us examine the process of quantization. The quantization scheme requires a knowledge of the codebook by both the encoder and the decoder prior to the process of quantization. In most cases, particularly in VQ, to satisfy this requirement the codebook is also sent through the communication channel, or included in the compressed file. As a result, the overall compression ratio may be significantly reduced, especially for large codebooks or high dimensionality of vectors in the case of VQ. Lattice quantization (LQ), to be discussed here, is free from this drawback because it does not require transmission of the complete codebook due to its regular structure and, as a result, it is possible to generate the codebook independently at both the encoder and decoder ends. In addition, LQ delivers a superior speed as compared with other types of quantization, such as learning vector quantization, due to the simplicity of generating the codebook. B. Distortion Measure and Quantizution Regions
Let us consider an input stream of continuous random vectors x = ( X I , x2, . . . , x N ) with the probability density function p ( x ) . Let us also consider an output stream y = ( y l , y2, . . . , y ~ related ) to an input stream x by the conditional probability density function qCy Ix). Assuming that the stream x is the input to a quantizer, then the corresponding values from the output stream y are obtained by a reference to the quantizer codebook.
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
2 15
According to Shannon ( I948), the amount of uncertainty R ( x , y ) of a value of x when we receive its quantized counterpart y from a transmission channel is given by R ( x , y ) = h ( x ) - h(xly) (17) where h ( x ) = - J p ( x ) log p ( x )dx is the differential entropy of a variable x . The value of R ( x , y ) is effectively the actual transmission rate. With this in mind, we can pose the problem of optimal quantization in terms of minimization of the amount of uncertainty, that is, R ( D ) = min(R(x,y)) 4tnlJ
(18)
with Q ~ being I a set specified by the conditional probability 40,Ix) so that
where d(., .) is a distance function (to be defined). The substitution of Eq. (17) in Eq. ( 1 8) leads to
As the entropy h ( x ) is independent of the conditional probability q @ , x ) and the entropy h ( x l y ) = Iz(x - y ly) 5 h(x - y ) , we can further modify Eq. (20) in the following way:
In information theory the last equation is known as the Shannorz lower bound (Berger, 1971 ; Sakrison, 1979; Gibson and Sayood, 1988). Now let us return to the process of quantization. We collect the input vectors x j = ( X I , X Z , . . . , XN) in blocks X = ( x I , x ~. ,. . , x l , ) and, for each input ..., y ~ ) block, the quantizer finds a corresponding output block Y = (y~,yy?-, . average bit rate per vector for consisting of vectorsy, = ( y l , y2, . . . , y ~ ) The a vector quantizer can be calculated as
216
MIKHAIL SHNAIDER A N D ANDREW P. PAPLINSKI
Assuming that the distribution of x is uniform within each quantization region V and that all regions are simple translations of each other, we have the following approximation for the average bit rate per vector (Gibson and Sayood, 1988): 1
h Q ( Y )25 - h ( X ) L
1 - -
L
log v.
If the input vectors x are independent we have that
1 h(x) = -h(X) L
and
I h(x - y ) = -h(X L
-
Y).
Furthermore, it can be shown (Berger, 1971; Sakrison, 1979) that for sufficiently high dimensionality of vectors the following equality is satisfied: h ( X - Y ) = log v.
(24)
Thus, we can conclude that the performance of an uniform vector yucrritizer asymptotically achieves the Shannon lower bound for large dimensions. An average distortion of a quantizer can be defined in the following way:
where for every input x with the PDF p ( x ) the quantizer produces an output y. The mth-power distortion d"')introduced by this operation is defined by
By varying m, different distortion measures can be obtained. Closely related to the distortion measure is an optimal shape of the quantization region. It is easy to see that for a uniform source the optimal shape of quantization region is a pyramid, if m is selected to be I , as depicted in Fig. 11. For m equal to 2, Eq. (26) gives rise to the well-known mean squared error (MSE) distortion measure, the optimal quantization region for a uniform source being a sphere (Conway and Sloane, 1982). Furthermore, the radius of the sphere is equal to 1/LD where D and L are the target distortion and dimensionality of quantization space, respectively. From the foregoing considerations we can conjecture that, assuming the MSE distortion measure, an optimal quantizer for an L-dimensional uniform source constitutes a set of uniformly distributed spheres with code vectors as centroids of the spheres.
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
\ \
\ / / / / /
\ \
\
/
FlGtJRr;. 1 1.
I\\\\\
2 17
/ / f l y
\
/
A pyramidal quantization region for nz = I
C. Optimal Quntitizer fiw Wwelet Coejjicients
In order to generalize the statistics of the wavelet coefficients it is necessary to approximate their probability distribution functions (PDF) by an appropriately selected model. As discussed in Section 11, for real images the PDF of wavelet coefficients can be well approximated by the generalized Gaussian functions (GGF) given by Eq. (5) with appropriately chosen exponents as follows:
Pi
=
GGF(( = 0.7),
if 1 I iI N - 1;
GGF(.$ = 2),
if i = N .
Here i is the index of the block of wavelet coefficients. If i is set to 0, this corresponds to the low-frequency block, located in the upper left corner of the matrix of coefficients. Assuming that the PDF of wavelet coefficients of an image can be well approximated by Gaussian-type functions and using the mean-squared error as a measure of the quality of this approximation, it can be shown that in order to achieve an optimal quantization of the wavelet coefficients the codevectors should be uniformly distributed on a hypersphere with the center at the origin (Sakrison, 1968). Consider a continuous random variable x with the Gaussian PDF p ( x ) . Without loss of generality, let us assume that p ( x ) can be well approximated by a staircase function p’(x). For some small EO and E I , we can write
218
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
I
’
: *
I
,’
: I
’
FIGURE
12. Regions of constant probabilities in a 2-D space.
Following the foregoing assumption, within each of the constant probability segments of our approximation function y ’ ( x ) we have uniformly distributed values of x. For example, i n a 2-D space the regions of constant probabilities form rings around the origin (Fig. 12). As shown in Section IILB, for optimal quantization of a uniformly distributed source we need a uniform vector quantizer with spherical quantization regions. This is true provided that the chosen distortion measure is the meansquared error. If the source has the Gaussian PDF, for example, wavelet coefficients, which can be approximated as in Eq. (28), then the spherical quantization regions must be placed so that their centroids located on a hypersphere centered at the origin of the source distribution. As will be shown, this result leads directly to the utilization of lattices for quantization of the wavelet coefficients.
IV. LATTICES A lattice L can be defined by an integral combination of linearly independent basis vectors of dimension n collected in a so-called “generator” matrix M (Gibson and Sayood 1988; Birkhoff, 1998; Conway and Sloane, 1993). Assuming that a vector [ E Z , , , the lattice L is generated by the product Me. The simplest lattice is the Z,, lattice, which consists of all the integral points in an II -dimensional space.
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
2 19
Now, let us consider an n-dimensional lattice L specified by the generator matrix M consisting of n basis vectors of the form:
Then a vector x = ( X I , . . . , x , , ) belonging ~ to this lattice can be expressed as a linear combination of basis vectors b,
The squared norm of a vector x can be defined as a function of lj
i
i
= < M T M l j = ljTAlj = J ’ ~ ( l j ) .
(31)
The matrix A = M T M is called a Gram matrix of the lattice L. The (i, j ) entry of this matrix is the inner product of the corresponding basis vectors ( b , ,b. ;). In Eq. (31) the function f l . ( . ) is known as a quciclruticfimz associated with lattice L. For each lattice L there exists a dual lattice L* defined as follows. If a vectorj of dimension n belongs to the dual lattice L*, then for any vector i from the subspace spanned by the lattice L the inner product ( i j ) is an integer, that is, L* = (i E R” : (i,j)E Z V i E L ] . (32) A dual lattice L* also may be defined as
(33) where Y, are the representatives of L in L* known as glue vectors. If Y, = 0, then the lattice L is integral because it is contained in its dual lattice L*. A. Koot Lattices
In this section we examine some concepts from Lie algebras, in particular root systems, and their utilization for generation of lattices. The material presented in this section is due mainly to Gibson and Sayood (1988), Conway and
220
MIKHAIL SHNAIDER A N D ANDREW P. PAPLINSKI
Sloane ( 1993), Humphreys ( 1972), Grove and Benson (1 972), Gilmore ( 1974), Coxeter (1973), and Shnaider (1997). As already shown, with each lattice we can associate a quadratic form equation (31). The elements in the Gram matrix A are the inner products of the basis vectors of a lattice a,, = ( b , ,b , ) . The Gram matrix of a root lattice is closely related to the so-called Cartan matrix of the root system corresponding to this lattice. 1 . Root Systems
Let us begin with the definition of an orthogonal transformation S E T ( R " ) where T(R")is the set of all orthogonal transformations over an n-dimensional real Euclidean space 9. Consider a reflection transformation in a 2-D space E ( R 2 )shown in Fig. 13. The reflection transformation S, converts every vector x from the space R 2 to its mirror image S,.x with respect to the line I . Thus, if r I1 and i E 1, we can write x = u$ ulr and, consequently, S,x = aoi - a l r . In general, for an n-dimensional space the transformation S, is called the reflection through a hyperplane I, or the reflection along r. Mathematically, such a transformation S, can be formulated as
+
FIGURE13. The reflection transformation S,.
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
22 1
Assuming that there exists a subgroup of transformations G 5 r ( R 2 )and S, E G, we can define the roots of G as f r . Formally, a subset R of the Euclidean space 3 ( R 1 *is) called a root systeni of this space if: 1. 2. 3. 4.
R is finite, spans R” and does not contain 0; for all r E R, -r E R; R is invariant over all reflections S, for all r E R; for all r l , r , ( i # j ) E R
In Lie algebras the number 2b.1, r ; ) (Ti ri ) 3
often has a separate notation, namely
There exist a number of root systems that satisfy the definition given in this section. They are denoted by upper case letters with a subscript referring to the dimensionality of the space they span, for example, A,l, B,,, C l l . The set of roots in the root system can be classified into two groups. For any vector t E X such that ( t .r ) # 0 a root is referred to as either negative or positive depending on the sign of the inner product of this vector t with the corresponding root: R: = [r E R : ( t ,r ) > 0) (positine)
(37)
Rt- = ( r E R : ( t ,r ) < 0) (negative).
(38)
and This classification can be geometrically interpreted as two subsets of the root system lying on opposite sides of the hyperplane perpendicular to the vector t . A subset A of R is called a base if:
I . A forms a basis for E; 2. each root (Y E R can be expressed as (II
kiri with ri E A
=
(39)
I
where k; are either all negative integers for negative roots or all positive integers for positive roots.
222
MlKHAlL SHNAIDER A N D ANDREW P. PAPLINSKI
The roots in the base A are called simple. Consider an example of a root system shown in Fig. 14. This system comprises eight roots given by
R = { & ( I , 01, f(0, I ) , (51,& I ) ) .
(40)
By choosing t = (cosp, s i n p ) with n/4 < p < n/2 the positive roots are
R t = {(1,0),(0, 11, ( 1 , 11, (-1,
1)).
(41)
Finally, the simple positive roots forming a base are A,f = { ( I , O ) , (-1,
1)).
(42)
Let 11s analyze the possible angles between the roots in a system. The condition (35) restricts these angles to a limited set of values. As we know that
we have the following expression for the Lie number (r;,r;):
I FIGURE14. Example of a root system
LATTICE VECTOR QUANTIZATION FOR WAVELET-RASED IMAGE CODING
223
and, consequently,
(451
( r , , r , ) ( r , , r ,= ) 4c0s28,/. The last equation also can be written as
Now with respect to condition ( 3 3 , which restricts the values of the Lie number (ri,r ; ) to integers, and considering also that 0 I cos2 8,; 5 1 , we can compile all possible angles @, between roots ri and r,;,as well as their relative lengths. The results are given in Table I. The two specific choices of roots { ( r i ,ri) = f4,( r j ,r i ) = & 1) and { (ri,r , ) =fI , (r.;,r ; )= f 4 ) when cos2 8,,/ = I , which also satisfy Eq. (45), are invalid due to a contradiction with the definition of roots. This contradiction can be seen as follows. Because cos2 Q i j = I the angles OL,are 0" or 180". As a result, we have two roots in the same or opposite directions and, by definition, these roots must be of the same length. This does not agree with the foregoing choices of roots. It can be shown (Humphreys, 1972) that for any two distinctive simple roots the following inequality must be satisfied:
( r i , r ; )5 0 Vri E A
and
i#j
(47)
which together with Eq. (43) limits the values of 8i; between the simple roots ri, r,(i # j ) to n / 2 I 8i.j < n.Furthermore, considering the results in Table I, the angles between simple roots are 8;; = (90", 120", 135", 150") V r ; , r ,E A
and
i# j.
(48)
This result, which is valid for any dimension n 2 2, will allow us to identify a subset of simple roots in a root system. TABLE I ANGLESBETWEEN THE ROUTSA N D THEIR RELATIVE LENGTHS
1
o", 180"
*2
It2
I
1 4
3W. 150"
13 1 1
f l
1
f3
3
45", 135"
12 i l
f l
I
1 2
2
i l
I undetermined
1
4 I
5 0
60", 120" 90"
i l 0
0
I
2
224
M I K H A I LSHNAIDER AND ANDREW P. PAPLINSKI
2. Curtun Mutrices Let us consider a set of simple roots (ri: ri E A ) in the root system R defined in the foregoing. The matrix of the Lie numbers [(ri,r j ) ]is called the Ccrrtun matrix of R. The Cartan matrix completely determines the corresponding root system R. In what follows we consider some important properties of Cartan matrices. From Eq. (36) we have (ri,ri) = 2, which is valid for all roots including simple positive ones. Thus the elements on the main diagonal of a Cartan matrix are always equal to 2. Table I shows that in the case of any roots the values of the Lie number, (r,,r , ) , are 0, 51, f2,f 3 . Considering conditions (47) and (48) we can narrow down the range of values for simple roots belonging to the base A to nonpositive values. Therefore, we have (r:, r 7 ) = (0, - 1, -2, -3) and, consequently, the entries of a Cartan matrix located off the main diagonal also take on only these values. It was mentioned in Section 1V.A.1 that there are a number of different root systems. As examples, the Cartan matrices for some of those root systems, namely, A,,, B,,, C,, D,,, Eg, E l , E X ,F4 and Gz. are given in the Appendix. In what follows we consider an example of construction of the root system D,,. For the construction of the remaining root systems the reader is referred to Gibson and Sayood (1988).
D,,(n
> 4) Root System.
Z = R"
and
The D,,root system is obtained by letting
R = (r E 2 : ( r , r )= 2 ) = ( f ( e ;fe,), i # j } .
Note that the dimensionality n of root system of type D is 4 or greater. This is due to the fact that in low-dimensional spaces some of the root systems are equivalent. The 0 2 root system, for example, can be represented by the product A , x A , and is shown in Fig. 15. The 3-D version of the D type of root system, namely, D3, is equivalent to A3.
B. Construction of the Root Luttices As already mentioned, the Gram matrix of a root lattice and the Cartan matrix of a root system are interrelated. In fact, for a root system the elements in the normalized Cartan matrix, divided by a factor of 2, equal the elements of the Gram matrix of the lattice corresponding to this system. Therefore, starting from the Cartan matrix of a root system we can generate a set of basis vectors of the root lattice. However, as shown in Section IV a lattice can be defined by a number of distinct sets of basis vectors. The necessary orientation of the lattice basis can be achieved by setting the direction of one of the basis vectors. As suggested by Gibson and Sayood (1988), this
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
225
F I G U R15. ~ D2 root system.
can be done by reorganizing the generator matrix M of a lattice to its upper triangular form as follows: ~
I
0 0
I ~ Z I h31
h22
0
b.32
0
. . . br,~ . . . hii? ...
= [bl b2 b3
. . . b,,].
(49)
. . . b,,,,
In Eq. (31) the Gram matrix A was defined as
Let us now consider the normalized Cartan matrix @ = (q$j : 0 5 i, j 5 n I ] . From Eqs. (49) and (50) the first element of the Cartan matrix can be expressed as: 411 = (bl3bl) (51) where bi is the ith basis vector in the generator matrix M . For the triangular generator matrix M as defined in Eq. (49) we can write that
h:, = ( b i , b i ) .
(52)
Thus, we have hll
=
6.
For the next basis vector we can write a set of two equations
(53)
226
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
Similarly, using Eqs. (53) and (54) it is possible to deduce the following results for the next basis vector b2:
Assuming that C,h=,x~,= 0 for all xk if N > h, the procedure of calculating the basis vectors can be generalized in the form of the following algorithm (Sayood et al. 1984): 1 . Set i = 1 and calculate b11 = &. 2. Set i = i I and calculate the off-diagonal elements
+
3 . Calculate the diagonal element
4. If i < n , go to 2. C. Laminated Lattices
In this section we examine another class of lattices known as laminated lattices. One of the classical mathematical problems is that of packing an n-dimensional container with identical spheres as densely as possible. This problem is completely solved for the 1- and 2-D spaces. In such spaces, the centers of the spheres must coincide with points belonging to the following lattices:
where A denotes a laminated lattice. The I-D laminated lattice A , consists of all even integral points and is equivalent to the lattices Z I ,A1 and AT. In a 2-D space the lattices A2, A, and A; are equivalent. If we draw unit radius circles around each point, then by placing exact copies of the resulting row of circles next to each other as closely
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
227
as possible we form the 2-D laminated lattice 122. This lattice, equivalent to the so-called hexagonal lattice A2, is shown in Fig. 16. For dimensions from 2 to 8 the densest sphere packings are known only among lattices. The lattices that form the basis for the densest sphere packings in the spaces of 2, . . . , 8 dimensions are the following (Conway and Sloane, 1993):
where A,,, D,, and E,, are the root lattices discussed in the previous sections. The laminated lattices are known to be the densest sphere packings in dimensions up to 8. This is also true up to the 29-dimensional space with the exception of 10- to 13-dimensional spaces where the so-called K-type lattices give better results. Half of the minimal distance p between two distinct lattice points (see Fig. 16) is called the packing radius. It can also be defined as the largest number p, such that spheres of radius p centered at the lattice points do not overlap. The packing radius of laminated lattices equals unity. The points on the plain farthest from the lattice points are called deep holes, and R is called the covering radius, which is equal to the distance from a lattice point to a deep hole. For the lattice in Fig. 16 the covering radius R = 2p/&. The covering radius is related to the covering problem, which is to find the least dense covering of the space R” by overlapping spheres. The covering problem is the dual of the packing problem defined at the
F I G L I R16. ~ The laminated latticc A ?
228
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
beginning of this section. The covering radius R is the smallest number p , such that spheres of radius p centered at the lattice points cover the whole R ” . The 2-D construction of sphere packing illustrated in Fig. 16 can be extended farther into the 3-D space by replacing the circles with 3-D spheres of the same radius as circles and stacking the obtained layers of the A2 laminated lattices as densely as possible in the third dimension. In this way we obtain the A3 lattice, which is equivalent to the A3 and 0 3 lattices. The A3 is known as the face-centered cubic lattice. Similarly, the A,, lattice can be recursively constructed from the A , , - , lattices. The A,, lattice is obtained by placing the A,?-I lattices as close as possible to each other. In such a way every A,, lattice includes a number of A,,-I lattices. This relationship is depicted in Fig. 17. It can be seen in this figure that for spaces of some dimensionalities the result of the foregoing construction is not unique. That is, more than one laminated lattice can be generated in such spaces. From the point of view of image compression the most interesting of the high-dimensional laminated lattices is the A 16 lattice with the generator matrix
- 4 2 2 2 2 0 2 0 0 0 0 2 0 0 0 2 0 0 2 0 1 MA16
=
-
1/2
0
2 0 0 0 0 2 0
2 0 0 0 0 0 2 0
2 0 0 0 0 0 0 2 0
2 0 0 0 0 0 0 0 2 0
2 0 0 0 0 0 0 0 0 2 0
2 0 0 0 0 0 0 0 0 0 2 0
1 1 1 1 0 1 0 1 1 0 0 1 0
0 1 1 1 1 0 1 0 1 1 0 0 1 0
0 0 1 1 1 1 0 1 0 1 1 0 0 1 0
-
0 0 0 1 1 1 1 0 1 0 1 1 0 0
I1 1 1 1 1 1 1 I ’ 1 1 1 1 1
1
1 1-
0
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING 229
... ...
... \
FIGURE17. Laminated lattices
V. QUANTIZATION AI.GORITHMS FOR SELECTED LATTICES Quantization can be viewed as a mapping of an input set onto a precalculated set known as a codebook. For each input sample, or batch of samples, the quantizer finds the corresponding samples, or batch of samples, in the codebook according to the minimal distance criterion and sends its index to the output stream. Let us assume that the codebook consists of points belonging to a certain lattice L,t. The dimensionality n of the lattice corresponds to the dimensionality of the input vectors of the quantizer. Assuming that the codebook of a quantizer is formed from the lattice points we need an algorithm to find the closest lattice point for every input vector. For each lattice such an algorithm is required to quantize the input with the codebook in which the codewords are the lattice points. In this section we examine some fast quantization algorithms (Conway and Sloane, 1982; Gibson and Sayood, 1988) for the lattices discussed in the previous sections. Before we proceed further with the presentation, let us introduce first a few “utility” functions used in quantization algorithms. 0
0
Let u(x) denotes the closest integer to x. In the case of a tie, when x is equidistant from both neighboring integers, ,u(x) is equal to the integer with the smaller absolute value. Let w ( x ) denotes the second closest integer to x distinct from ~ ( x ) .
230 0
0
M I K H A I LSHNAIDER AND ANDREW P. PAPLINSKI
If the input is an n-dimensional vector, that is, x = (XI, . . . , x,,), the functions v ( x ) and w ( x ) are applied to each vector component separately, that is,
Let us also define the round-off residue function, d ( . ) , as d ( x )= x - u(x).
0
Let us also define a coordinate index k in the following way:
and Jd(xk)I = Id(x;)I
0
implies
k I i.
In words, the coordinate xk has the largest absolute value d(xk) among all coordinates xi in a vector x. If it happens that a number of coordinates in x have the same maximum absolute value d(xk) then k is assigned to the index of the coordinate, among those with the maximum d(.), which is located on the leftmost position in the vector x. Finally, given that xk is known we define a function g(x) as
The function g(x) effectively equals u ( x ) with the coordinate v ( x k ) substituted with w(xk). The index k is defined as in the foregoing. The preceding functions are illustrated in Fig. 18.
v(x)
R(X)
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
If, for example, x = (3.6, -2.5, - 1.8), then the functions and g(.) return the following values: 1I(X)
?I(.),
23 1
w(.), d(.)
= (4, -2, -2),
w ( x ) = (3, -3, - I ) , d ( x ) = (-0.4, -0.5, 0.2), g(x) = (4, -3, -2).
A. The Closest Point of a Dual Lattice As mentioned in Sect, IV, for each lattice L there exists a dual lattice L* that can be specified in the following way: I- I
L* = u ( r i I =o
+L)
(59)
where ri are the glue vectors. Let us assume we have a quantization algorithm for the lattice L that assigns a pointy from the lattice L to a point x being quantized, that is, y = @(x), y
E
L.
Then, the quantization procedure for the corresponding dual lattice, L*, can be defined by the following procedure Conway and Sloane (1982): 0
Calculate all prospective dual vectors
+
y: = @(x - r ; ) r ; ,V 0 5 i 5 k 0
-
I.
(60)
Determine the glue vector, r.i for which the distance dist(x, y;)
0
attains minimum. Assign y * = @(x - r , )
+ r,
where y * E L*. In other words, y * is the one closest to the point x being quantized among all other y:.
232
M I K H A I LSHNAIDER AND ANDREW P. PAPLINSKI
Apart from the dual lattices this procedure can be used to find the closest points for any lattice that can be represented as a union of cosets of the form given by Eq. (59).
B. Z,, Lattice We know that the Z,, lattice consists of all integer points in an n-dimensional space. Therefore, for x E R" the pointy of the lattice Z,, closest to x is given , is, by ~ ( x ) that y =v(x). C. D,, Luttice arid Its D u d
The quantization algorithm for the D,, lattice follows directly from the definition of the corresponding root system. We know that the D,, root system is obtained by letting Z = R" and R = (r E Z : ( r ,r ) = 2) = ( f ( e ; fe l ) , i j ) . Therefore, the lattice D,, is a set of integer points in the n-dimensional space with an even sum of coordinates. For an arbitrary point x the closest point belonging to the lattice D,, is given by either v(x), or g(x), depending on which result has an even sum of coordinates, that is,
+
@(x)=
v(x)
if x j u ( x , ) is even otherwise (C,g ( x j ) is even
)
For example, for a four-dimensional (4-D) point x = (2, -3.4. 0.7,6.1), we find that ~ ( x = ) (2, -3, 1,6) and that g(x) = (2, -4, I , 6). As the sum of components of v ( x )is even while the sum of components of g(x) is odd, then the 0 4 lattice point that is the closest to x is given by the function 7@). In general, it can be observed that the function g(x) needs to be calculated only in the case when C ; v ( x ; )is odd. Otherwise, it is sufficient to calculate only 4 x ) . The dual lattice D,T can be defined as
where r, are the glue vectors specified as follows:
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
233
Alternatively, the lattice DZ can be defined in terms of the lattice Z,, in the following way:
with the glue vectors being
ro = (0”) and rl =
(1”)
As both definition equations (62) and (63) determine the same dual lattice D:, one can use any of those definitions in conjunction with Eq. (60) depending on a specific application. However, the latter definition of the lattice D: clearly results in a faster algorithm for obtaining the closest point because of the use of two cosets, instead of four as in Eq. (62). Apart from the smaller number of cosets used in Eq. (63) compared with Eq. (62), quantization of the lattice Z,, in an algorithm based on Eq. (63) is advantageous because it is, on average, faster than quantization of the lattice D,,. D. The Lainitiated
A16
Liittire
As already mentioned, this lattice can be constructed from the first-order ReedMuller code of length 16. The quantization procedure relies on this fact. Using the algorithm for the 0 1 6 lattice we calculate 32 lattice points taking the codcwords of the Reed-Muller code as the coset representatives. The goal point among the 32 lattice points obtained is the one for which the minimal distance from the quantized point is attained.
VI.
COUNTING THE
LATTICE POINTS
A problem that seems especially important in the context of lattice quantization is that of counting the lattice points located within a given distance from the origin. In terms of quantization, the number of lattice points corresponds to the size of the codebook used for quantization. Therefore, we need to know how many points of a lattice L,, are located on the surface of an n-dimensional sphere centered at the origin. We address first the problem of counting lattice points contained inside a sphere of a given radius.
234
MIKHAIL SHNAIDER A N D ANDREW P. PAPLINSKI
A . Estimation of the Number of the Lattice Points within a Sphere
In this section we estimate the number of lattice points contained inside a sphere of a given radius. We will consider first a problem of counting the points of a 2-D lattice 2 2 (Kriitzel, 1988). The lattice 2 2 is the set of all integral points in a 2-D space. We would like to estimate the number of lattice points C,, lying within the circle of the radius f i with m > 1 (Fig. 19):
With each lattice point P , = (B,, y , ) , we can associate a square S, which has the point P , located at its center. Formally, the square S , can be defined as
+
Now we can draw a circle of radius f i d / 2 centered at the origin. All squares S , that lie within this circle satisfy the following inequality:
B’+y2i
l
2
(A+$)
j o j o j o j o j
.
................ . .
................... .. .. ..
FIGURE
I
.
.
.
.
.
19. Counting the points of the lattice Z l
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
235
Therefore, the number of lattice points C,,, that are contained inside the circle of the radius fi can be estimated in the following way:
which is true for any m 2 1 (as has been assumed here). In order to obtain the lower bound of the C,,,, we draw a smaller inner circle defined by
for which the corresponding estimate of the number of lattice points inside this circle is
Hence -
or
I c,,,- 751121
<2nfi
Thus, we can state that the number of points C,,, of the lattice Z2 lying within a circle of radius fi can be closely approximated by the area of this circle. Furthermore, the error of the approximation is of the same order as the length of the circumference of this circle equal to 2 7 r f i . These considerations can be applied to any close curve, rather that just to a circle. Also, they can be applied not only to the 2-D space, but also to higher dimensions using corresponding versions of Eq. (71). It can also be shown (Kratzel, 1988) that inequality of equation (71) holds for any lattice. In conclusion we can state that the number of lattice points within an 17dimensional sphere can be closely approximated by the volume of the sphere divided by the volume of a fundamental region of the lattice. B. The Nunzber
of
Lrrttice Points on
CI
Sphew
In terms of quantization, the points on the surface of a sphere of the radius fi,their number being given by A,,,, will be used as codewords in quantizing Gaussian sources in general, or wavelet coefficients in particular (see Secs. 1II.B and C). In order to obtain the number of lattice points lying on the
236
MIKHAILSHNAIDER AND ANDREW P. PAPLINSKI
surface of a sphere, let us associate with each lattice L a theta series (Ebeling, 1994; Grosswald, 1985; Igusa, 1972; Rankin, 1977; Lang, 1976) as follows:
X€L
+ + +
where x E L , and IIx [I2 = x: x i . . . x: is a squared norm of x and z is a complex variable with Im(z) > 0. Letting 4 = en': we can rewrite Eq. (72) as
X € L
After the introduction of a new summation variable m = IIx I(*, we can express the theta series as 00
C A,,4"
OL(Z) =
(74)
r1r=O
where A,,, is the number of lattice points with the squared norm of m . In other words, assuming that the vector x is of dimension n , the summation coefficient A,,f gives the number of vectors in the lattice L located on the surface of the n-dimensional sphere of radius f i centered at the origin. The theta functions examined in this study originate from the Jacobi theta function given by
<
where is a complex variable that is permitted to assume any value. From the preceding equation a number of simpler theta functions can be generated. For instance, the functions @ ( z ) , @ 3 ( z ) and 8 4 ( z ) are given by the following equations:
= 2q'l3( 1
+ y* + q b + qI2 + q"' + . . .),
(75)
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
237
TABLE I 1 NUMBERSOF POINTS O N THE SlJRFACE OF A SPHERE OF A GIVENRADIUS FOR THE LATTICE 23 rn
0
I
2
3
4
5
6
7
8
9
10
II
12
13
14
IS
A,,
1
6
12
8
6
24
24
0
12
30
24
24
8
24
48
0
NLlMUEI<SOF POINTS ON
WE
TABLE I11 SURFACE OF A SPHERE OF A GIVENRAINS FOR THE LATTICEDq
tw
0
2
4
6
8
10
12
14
16
18
20
22
24
26
28
A,,,
I
24
24
96
24
144
96
192
24
312
144
288
96
336
192
and
The theta series of the lattice of integer points Z,, is
Now by substitution of Eq. (76) in Eq. (78), we can calculate the number of points of the lattice Z,, for a given norm m. This number is by construction equal to a coefficient A,,, of the variable 4 with the power m. For this particular lattice, A,,, corresponds to the number of ways of writing m as a sum of n squares. An example of the numbers of lattice points for the first several spheres of the lattice 2 3 calculated from the theta series O:(z) is given in Table 11. The next lattice of interest to us is Dl1.This lattice is formed by points having an even norm. Therefore, its theta series is as follows: 1
(%,, ( z ) = - (0; ( z ) 2
71
+ 0;- (-I.>)2
1 =2 (0;( z )
+ (-1; ( z ) ) .
(79)
Among the D-type lattices one of the most interesting is the 0 4 lattice. The theta series of the 0 4 lattice is specified in Table 111. The dual lattice DZ has the following theta series:
238
MIKHAIL SHNAIDER A N D ANDREW P. PAPLINSKI
The last lattice, which is also important in the context of wavelet-based image compression, is the laminated lattice 1216.The theta series of this lattice is
with the coefficients given in Table IV. C. Relationship between Lattices and Codes As already mentioned, the laminated lattice A 16 can be generated from the first-order Reed-Muller code. This is an example of the relationships between lattices and codes, which are examined in this section (MacWilliams and Sloane, 1977; Conway and Sloane, 1993; Leech and Sloane, 1971; Sloane, 1977). Let F2 denote a finite field with two elements: F2 = (0, l}. A binary linear code C over the field F 2 of length n is a subset of F ; . In other words, the code C is a set of binary vectors of length n. The Hamming distance between two vectors C I and c2 is equal to the number of positions where these two vectors differ, and is denoted by dist(c1, cz). For instance, if C I = 101 1 1 and c2 = 00101 then dist(c1, c 2 ) = 2. The Hamming weight of a vector is the number of nonzero components in it, and is denoted by wt(c). For the c l specified here we have wt(c1) = 4. It is easy to derive the following relation between the Hamming distance and Hamming weight: dist(c1, c2) = wt(c1 - c2).
A code C can also be characterized by its minimum distance between any two codewords C : d ( C ) = mindist(c1, cz), V
C I ,c2 E
C
and
CI # c?.
A code with a minimum distance d can correct r(d - 1)/21 errors, where [a1 denotes the usual ceiling function giving the greatest integer less than or equal to a. Typically, an [ n ,k , d ] binary linear code refers to a code with 2k codewords of length n that differ in at least d places. The first-order ReedMuller code related to the A16 laminated lattice is denoted by (Vetterli, 1984; DeVore et al., 1992; Barlaud et al., 1994). TABLE IV
NUMBERS OF
POINTS ON THE SURFACE OF A SPHERE OF A G I V E N RADIUS FOR 'THE
LATTICEI\ 16
In
0
2
4
6
8
10
12
14
16
A,,,
1
0
4320
61440
522720
221 1840
8960640
23224320
67154400
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
239
For an [ n ,k , d ] linear code C let A, be the number of codewords with the Hamming weight of i. Thus, Cr=,,A, = 2'. The numbers A , are called weixht distribution of C. Now, we can associate with the code C a homogeneous (containing terms that are all of the same degree) polynomial in the fol1owing way :
This polynomial is called the weight enumerutor of the code C. In weight enumerator the variable x effectively indicates the number of zeros in a codeword, whereas y indicates ones. With the preceding fundamentals in mind it is now possible to draw a parallel between the weight enumerators of codes and the theta series of lattices. Consider the coefficients A,,, of the theta series given by Eq. (74) and the coefficients A; of the weight enumerators in Eq. (82). In both cases they specify the number of points, or codewords, at a certain distance from the origin. Therefore, these polynomials contain the essential information about the distribution of vectors in the subspace under consideration. Extending the connection established here between lattices and codes, we can observe that there exist a number of methods of constructing sphere packings from codes. It can be shown that a sphere packing constructed from a code is a lattice packing if and only if the code is linear (MacWilliams and Sloane, 1977). Therefore, by choosing a linear code as a basis for any of the available constructions we obtain a lattice corresponding to this code. Three of the most popular construction methods, referred to as Construction A, B and C, are outlined in what follows. 1. Construction A
This construction yields the simplest way to associate a lattice with a code. Let C be a binary code. The centers x of spheres from the sphere packing corresponding to the code C in R" are those that are congruent (modulo 2) to codewords of C: h x m o d c r E C. (83)
In this way the centers of spheres are obtained by adding even numbers to the codewords of C and then dividing the result by The inultiplication with
a.
240
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
is simply a scaling operation and, as we shall see later, it does not alter enumeration of lattice points and is often omitted. As we have already mentioned, the code must be linear in order to obtain a corresponding lattice packing L ( C ) . Let us consider a codeword c = (cl, c2, . . . , c,,) in a linear code C. Following Eq. (83) the corresponding lattice points 1 = ( I 1 , 1 2 . . . . , l,,) E L ( C ) can be expressed as
L ( c ) = ( 1 1 ~ 1 2 , .. . , l , , ) with
I,
E
(c,
+ 2 Z ) A = -c,1 + &Z Jz
where L ( c ) E L ( C ) denotes a lattice point constructed from a codeword c E C . As c, is defined on F2 = (0, 11, we need to consider two cases, namely, c, = 0 and c, = 1. From Eq. (78) we can recall that Oz,, ( z ) = O , " ( z ) . Thus, S z ( z ) = O,(z), and we can deduce the form of 0~~ as follows:
Similarly, for O , , J Z + J Zwe ~ have
Now we can write
where wt(c) is the number of 1s and n - wt(c) is the number of 0s in c. Consequently,
In summary, if we assume that C is a linear code with weight enumerator W c ( x , y) given by Eq. (82), then the theta function of the corresponding lattice L ( C ) is given by Eq. (84). For example, using Construction A we can generate lattice E8 from the 18, 4, 41 extended Hamming code H 8 (Sloane, 1977) with weight enumerator
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
24 I
2. Construction B Consider an [n,k, 81 binary linear code C with codeword weights divisible by 4. The centers x of the corresponding sphere packing are points that satisfy the following properties:
I. 2.
421-(mod 2) E C;
4,JZk.x;. i= I
It can be shown (Conway and Sloane, 1993) that the lattice sphere packing L ( C ) obtained by the following Construction B has a theta series as follows: O L ( e ) ( Z )=
+
;Wc((-)->,(22). (4')2(22)) ;(31;(2,7).
For example, by applying Construction B the lattice E N can be generated from the repetition code [8, I , 81 consisting of codewords {{O)', { I ) ' ) . The procedures given by Constructions A and B can be simplified by introducing a coordinate array for each lattice point defined as follows (Conway and Sloane, 1993; Sloane, 1977). The number of columns in this array equals the dimensionality of the generated lattice. Each column is the binary rcpresentation of the corresponding coordinate of the point. For negative number the 2's complement notation is used. For example, the coordinate array of x = (3, 2, I , 0, -1, -2, -3) is 3 1 1 0 0
2 I 0 -1 0 1 0 1 0 0
0 0 0 0 0 0
-2 - 3 1 0 I I I 0 1 1 I I 1 1
Now, if the coefficient f i in Constructions A and B specified by Eqs. (83), (85) and (86) is omitted, we have the following simplified algorithms. Construction A is effectively reduced to finding the centers (points) that have the top rows of their coordinate arrays being codewords of code C . Construction B can be redefined as a method of finding the centers that have the top rows being codewords of code C and the second top rows with even weights. 3. Construction C This construction is based on a linear code C with codewords of length n . The corresponding lattice packing consists of points for which the 2"s rows ( i = 0, . . . , 1 2 ) of their coordinate arrays are i n C.
242
MlKHAlL SHNAIDER A N D ANDREW P. PAPLINSKI
In our discussion of lattice quantization we have covered definitions from the lattice theory (Sect. IV), types of lattices (Sects. 1V.A and C), construction of lattices (Sects. 1V.B and C). Fast quantization algorithms for lattices can be found in Conway and Sloane (1982, 1983). From the point of view of lattice quantization another problem remains open. We need a scaling algorithm that could be applied to wavelet coefficients as a preprocessing step before utilization of the fast quantization algorithms. Such a scaling algorithm is discussed in the next section. VII. SCALING ALGOKITHM Although the probability distribution function of the wavelet coefficients studied in Sect. 1I.B.2 has similar shapes for each block, for example, (LL), (LH), (HL), and (HH), some statistical parameters, such as minimum and maximum values, standard deviation, and so on, are expected to vary from block to block. Therefore, for the purpose of quantization each block of the wavelet coeffjcient matrix should be treated separately. According to the results presented in Sect. 1II.C an optimal quantization of wavelet coefficients can be achieved by uniform placement of the codevectors on the surfaces of concentric spheres with centers at the origin. It has been shown in the previous section that each lattice actually provides a set of points located on the surfaces of spheres centered at the origin. The number of such points for each sphere is defined by the theta series of Eq. (74). Assuming that each lattice point is a codeword, we need to know how many codewords are required for quantization of a given block, or alternatively, what the sire of'the codebook should be. One possible solution to the problem of determining the number of codewords was presented in Sect. I1.C in the form of an optimal bit allocation routine. The output of this routine is the number of bits per pixel or, in other words, the compression ratio for each block. Now, from the target compression ratio we derive the size of the goal codebook (the number of codewords required for quantization), N . By summing the coefficients of the theta series, the number of lattice points corresponding to that size can be obtained. Because the theta series gives only the numbers of lattice points (see Tables 11. Ill. and IV) lying on surfaces of the spheres around the origin, the size of' the codebook actually used is often an approximation of the one generated by the optimal bit allocation procedure. The available size of the codebook consisting of the lattice points is b'riven by N = EYLoA,.Equivalently, given the appropriately chosen radius f i of the largest sphere which accommodates vectors of the squared norm (or energy) 177, we obtain a set of codevectors belonging to the selected lattice. Now we require that the wavelet coefficients (collected into vectors, since we use vector quantization) with a certain norm E be scaled to the norm H I of the surface
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
243
of the outer sphere. E is selected so that a prescribed number of vectors, say 70-80%, have their norm not greater than E (Fig. 20). The scaling factor between the m and E shells is given by s=
E.
After quantization this scaling factor is included in the compressed data stream (Barlaud rt al., 1994). There are a number of possible ways of handling vectors with squared norm greater than E . The simplest method is to truncate them to the surface of the outer sphere E . However, it results in the introduction of some additional distortion of the high frequency edges in an image. The approach we have adopted and tested in this work is to set a separate scaling factor s, for each
where El is the squared norm of the i-th vector. Thus, the high frequency information remains preserved at the expense of a slight increase in the bit ratc, which depends on the value of the threshold E l . The waling factors s ,
P
0
,,’ E shcll
. o -0.
Wavelet coefficicnts
Fic;~irct;20. Laltice quanriLaiion scheme.
244
MIKHAIL SHNAIDER A N D ANDREW P. PAPLlNSKl
are to be transmitted together with the corresponding vectors. Thus, at the receiving end, dequantization is followed by scaling each vector by the factor l/s. for the vectors with norm not exceeding E , and the factor l/.s;, for the vectors with norm greater than E . Now, let us summarize the considerations presented in this section in the form of the following scaling algorithm to be used as a preprocessing step in lattice quantization of wavelet coefficients: 1 . Compute the squared norm of each vector of wavelet coefficients and de-
termine the norm E . 2. Determine the size of the codebook for a chosen lattice as well a s the corresponding norm in (see Sect. 1I.C). 3. Determine the scaling factor as in Eq. (87) and include s in the bit stream. 4. For each vector (norm E ; ) : (a) if E j 5 E , scale the vector by a factor of s, otherwise determine s; as in Eq. (88), and scale the vector by a factor of s,; (b) encode the vector with a fast quantization algorithm (see Sect. V); (c) if E , > E , include s, in the bit stream.
In this section, we have examined the problem of lattice quantization. As it has been shown, there exist a variety of lattices that can be used for this purpose. The final choice depends on a particular application and relies on the evaluation of a number of criteria, such as the fidelity of the system. the time of encoding, software or hardware implementation, and so on. This issue, namely the choice of a suitable lattice, is addressed in the following section.
VIII.
SELECTING A
LATTICE FOR QUANTIZATION
From Section IV we know that every lattice is determined by its generator matrix M and the points that comprise a lattice are specified by Eq. (30). We can choose these points as codevectors for quantization of an n-dimensional input sequence. Let w , denote an i-th codevector and V ; be a quantization region around it. As a result of quantization. every entry in the input sequence located within the region V ; is represented by the codeword w;.The distortion introduced during quantization can be measured as the mean squared error (MSE) between the input and output sequences of the quantizer. Here we use a dimensionless quantity D known as MSE per dimension given by
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
245
where x is the input sequence and p ( x ) is the probability distribution function of x . For many applications, an optimization of quantization means an appropriate selection of codevectors w , with the aim to minimize distortion. Thus, it is important to determine the minimum distortion D* subject to the i I N - 1) set of codevectors w , , (0 I D* = infD. W,
Solving this equation for each lattice will enable the choice of an appropriate lattice for quantization of the source x . It is evident that the distortion depends not only on the position of the codevectors in the quantiLation space but also on their quantity. Therefore, to allow a correct comparison of the performance of various lattices, the number of lattice points N used for quantization should be fixed for every lattice. We know that the quantization regions V, formed by a lattice are congruent Often, in order to fit N quantization regions in a to its fundamental region quantization space Q, quantization regions are scaled
v.
where q is the scaling factor. It was shown by Zador (1982) that for large N the following equality is satisfied:
where r is set to 2 for the MSE measure, and G p ( n , r ) is known as the dimensionless second moment of a polytope P. In the case of lattice quantization, a polytope P becomes a quantization region of a lattice. After setting r to 2, we have that the moment C,(rz, 2) depends only on n , which is the dimensionality of the input x. Assume that w , are selected so that the distortion D* is minimized. Thus, G v ( n ,2) can be calculated as
246
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
In the preceding equation SR,$p ( ~ ) " / ( " ~ ' )has d x been replaced by E L , S, p ( x ) r l / ( " + 2 ) dbecause ~, the quantization space is entirely covered with the quantization regions. The values of the second moment G y ( n ,2) can be tabulated for each lattice by letting x be uniformly distributed over the quantization space Q, that is p ( x ) = const. After setting p ( x ) to a constant, Eq. (93) becomes
Because all quantization regions V, are congruent to the fundamental region of a lattice we have
l,
dx = vol(V,) = vol(v)
(95)
v
where is the fundamental region of a lattice. Also, provided that wi are located in the centroids of the corresponding quantization regions, the distortions produced by quantization of the uniformly distributed source x within all quantization regions are equivalent. Finally, assuming that the centroid w of the fundamental region is located in the origin, Eq. (94) can be simplified as follows:
Table V, reproduced from Conway and Sloane (1993) shows the dimensionless second moment G v ( n , 2) for various popular lattices. Equation (92) can be now rewritten as
247
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
TABLE V DIMENSIONLESS SECOND MOMENT G p ( / l ,2) FOR SELECTED LATTICES. (Source: Conway and Sloane. 1993. Reprinted with permission.) 1
2
A5
4
D5
DT
E(,
E:
E7
E;
077647
,076922
,075786
,075625
074347
,074244
,07323I
,073 I 16
/I
4
3
X Ax
AI;
DX
1);
.(I7739I
,075972
,075914
.(I74735
Ex.E,* .07 16x2
24
12
16
KI?
A I ~AT6 ,
Ax,
,070100
,068299
,06577 1
According to the scaling algorithm presented in Sect. VII, every input vector x is multiplied by the scaling factor l/s before it is fed into the quantizer.
Thus, for the minimum distortion, D*, we have
Suppose that the lattice points cover the quantization space densely enough to assume that the PDF of the input is uniform within each quantization region. With this assumption, Eq. (98) can be simplified further as follows:
where p ( w , )is the probability of the source at the centroid of the i-quantization region. In the preceding derivation we have assumed that all quantization regions are congruent and the PDF within each of them is constant, thus, Eq. (95) is satisfied. Using Eqs. (95), we can rewrite Eq. (99) in the following form:
It is easy to see that the value of .s”vol(v) in the foregoing equation is, effectively, the normalized volume of the fundamental region of a lattice, and the scaling factor .I: corresponds to the scaling factor q in Eq. (91). An
248
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
appropriate selection of the factors will ensure that the normalized volumes of the fundamental regions of all lattices are equal to each other, and a collection of N lattice points covers completely the quantization space R. A connection between s and N also can be seen from the fact that s is given by the scaling equation (87) and m in Eq. (87) is chosen so that N = C’,!’=,A,. Here N denotes the number of points that belong to a lattice. For quantization, lattice points become codevectors. In (100) the codevectors are denoted by wi with i between 1 and N . As mentioned in Sect. VI the number of lattice points is given by the theta series equation (74). Therefore, Eq. (100) can be modified in the following way:
(101) with wlwk = J , A, being the coefficients of the theta series. Equation (101) gives a good approximation of the distortion measure provided that the following assumptions are fulfilled: 0
0
quantization codevectors are densely distributed in the quantization space; and the probability distribution within each quantization region is almost uniform.
In order to select an optimal lattice for quantization of a source, one should fix the value of N and then solve Eq. (100) for every lattice with p ( . ) set to the PDF of the source. The final selection of a lattice for quantization does not depend on the actual values of the distortion measure D* across the range of lattices, but rather on which lattice attains the smallest D*. In this context, it appears valuable to note that after fixing N for some n-dimensional quantization space R,the term N p 2 / ”i s ” v ~ I ( V ) ) ( ” +becomes ~ ) / ~ ~ constant: C = N-””
(Il+2)/11
(s”vol(V))
(102)
Thus, we have
1
(l1+2 1/11
D * ( N , R) = C G v ( n , 2) )( : ; (
2:C
p(.~Wk)n’(r’+2)
(103)
As our primary concern here is quantization of wavelet coefficients, the PDF that must be used in Eq. (103) is the PDF of wavelet coefficients. Recall from Sect. I1 that the PDF of wavelet coefficients can be modeled by the Gaussian-type function. Naturally, we can use this model in Eq. (103).
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
249
Consider the following example of selecting a 2-D lattice for quantization of the lowest frequency band of the wavelet coefficients. Assume that we have a bank of two lattices, namely lattices 2 2 and A ? . As shown in Eq. (7), for the lowest frequency band of the wavelet coefficient matrix (texture) the Generalized Gaussian function (GGF) is reduced to the Gaussian PDF. Assuming that the Differential Pulse Code Modulation (DPCM) predictor used in the lowest frequency band removes correlation between neighboring samples, we have ( 104)
given that the standard deviation (T = I . For the preceding Gaussian PDF, the distortion measure can be expressed in the following way:
where p ( . s f i ) is the probability of the source vector having the squared norm of .s2,j:
where j is the squared norm of x . For an estimation of the distortion measure of Eq. (105) it is required to select the scaling factor s . In Sect. VII, s was defined a s a scaling factor for mapping the outer sphere of the quantization space S2 onto the outer sphere composed of the lattice points to be used for quantization. After testing, it was found that a good choice of the outer sphere of the quantization space for wavelet coefficients is when it includes between 70% and 80% of wavelet coefficients. This corresponds to selecting the value of E in eq. (87) to be approximately (1.30)'. where (T is the standard deviation. Recall that (T was chosen to be 1 in Eq. (104). In order to calculate s specified in Eq. (87), it is required to select the number of lattice pointdcodevectors used for quantization; this number is denoted by N . As N = X Y ! o A , , , from N we can find m which is needed in Eq. (87). As already mentioned, A,, are coefficients of the theta series, which, for the lattices 2 2 and A2 used in this example, are given in Table VI. Let us set N = 45, for example. Thus, we have Z i L , A , = 45 for the lattice 22 and C&Aj = 43 RZ 45 for the lattice A?. Using this, the scaling factors for the lattices 2 2 and A2 become s = I . 3 / m and s = 1 . 3 / m , respectively.
250
MIKHAIL S H N A I D E R A N D ANDREW P. PAPLINSKI TABLE VI N U M H ~ ROF S POINTS OF I H E LAI TICES z? AND A? 0
I
Z
?
A? A ,
A
, I
2
I
I 6
4 0
3
4
5
4
0
4
6
6
0
7
6
8
0
0
8
0
12
9
4
4
0
6
I0
II
12
8
0
0
0
13
I4
15
16
0
8
0
0
4
6
12
0
0
6
Now, Eq. (105) can be tabulated for the test lattices. The results are
q, (45, Q) = 17.2136 C
and
D;? (= 45. Q j
C
= 15.7095.
In conclusion, by quantization of the wavelet coefficients with the A2 lattice one may expect to obtain approximately 10% improvement in the meansquared error comparing with quantization of the same source with the Z ? lattice. Unsurprisingly, this result coincides with the result obtained by Conway and Sloane (1993) for a uniformly distributed source. For a uniformly distributed source, the benefit of using A2 over Z ? is approximately 4%.
Ix.
ENTROPY CODING OF
LATTICE VECTORS
In an image compression system lattice quantization is typically followed by an entropy coder, which exploits an unevenness of the PDF of lattice vectors. Therefore, an entropy coder requires the knowledge of the probability of each codevector before encoding. In many systems, the codevectors together with their probabilities are included into the output bitstream. This results in undesirable increase in the bit rate. As the lattice quantizer has a regular structure, all codevectors can be generated by the decoder from the fundamental region of the lattice. What remains to be included into the bitstream is the probability for each vector. However, as we know that the probability distribution of the wavelet coefficients has a form of the generalized Gaussian function, the probability of a codevector w , can be approximated as
P ( w , ) = s”vol(v)p(w,j V I 5 i 5 N
(107)
where N is the number of lattice points used for quantization and p ( . ) is the generalized Gaussian function with the mean of 0 and the standard deviation equals the standard deviation of the wavelet coefficients. This estimation can be done independently at the encoder and decoder using the standard deviation of the coefficients, which can be included into the bitstream without noticable overhead.
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
25 1
Although the preceding approach to tabulate the probability of each codevector is correct it appears rather demanding t o enumerate all lattice points with their probabilities. It was shown in the previous sections that the generalized Gaussian function with an appropriately selected parameter ( closely resembles the PDF of the wavelet coefficients. is set to 2 for the lowest frequency band and ( = 0.7 for the remaining bands. When ( = 2 the n becomes the well-known Gaussian and, consequently, the regions of equal probabilities are located on spheres centered around the origin, which is 0 in the case of wavelet coefficients. If ( equals 0.7 the equiprobable surfaces become somehow distorted spheres. Obviously, the farther the value of ( departs from being 2 the more distant from a spherical shape the equiprobable surfaces become. For instance, when ( is I , that is the Laplacian PDF, the equiprobable surfaces are pyramids (Fischer, 1986). The number of lattice points on each equiprobable surface is given by the theta series specified in Eq. (74). Provided that we have a theta series for each lattice, instead of enumeration of the lattice points as in the foregoing, we can enumerate spheres and calculate the probability of any vector, say a representative, located on each sphere. The remaining vectors have the same probability as representatives corresponding to them. In order to accomplish the preceding enumeration we need a theta series for each lattice. Let us define an (-norm of an M-dimensional vector x as
<
Employing this definition a general form of the theta series can be written as
By setting ( to 2, we obtain the theta series discussed in Sect. V1.B. For
< = 0.7 a new set of theta series must be tabulated. The theta series for the I-D lattice Z I with the norm 0.7 is
Equation ( 1 10) corresponds to the theta series ( 3 j ( z ) introduced in Sect. V1.B. We will denote the new theta series by 6 ) 3 ( ~0.7). ,
252
M I K H A I LSHNAIDER AND ANDREW P. PAPLINSKI
Another important theta series with the norm from Sect. V1.B:
= 0.7 corresponds to O ~ ( Z )
The n-dimensional lattice Z,, has the following theta series derived from Eq. ( 1 10): Oz,,( z , 0.7) = (z, 0.7). ( 1 12) Similarly to spheres, more complex lattices can be obtained for the norm of 0.7 by utilization of the Construction A discussed in Sect. V1.C. These lattices include the following:
Now, by analogy to Eq. (84) we can write
By using the preceding theta series the numbers of points on equiprobable surfaces can be tabulated and, then, used in Eq. (107) to tabulate probabilities of the codevectors. This approach greatly simplifies the compilation of probabilities of the codevectors for entropy coding of the generalized Gaussian sources with 6 equal to 2 or 0.7. An alternative method for estimation of the probabilities was suggested in Shnaider and Paplinski (1997). Consider a random 2-D sample x = [ X I , x?), from a generalized Gaussian distribution with the standard deviation I and the mean of 0. Let y be defined as L
i= 1
The probability distribution function of y is given by /
2
\
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
253
Then, provided that X I and x2 are independent, the probability distribution G ( y ) can be written in the following form:
where S is the collection of random samples of size 2 located within the space bounded by y”6 and centered at (0, 0). The parameter h is assumed positive. Introducing a new variable c = h‘, we have for JJ 2 0:
The change to the spherical coordinates leads to
with p between 0 and y’‘€ and Cp between 0 and 277. Upon substitution of the boundry conditions, Eq. (1 17) becomes
Letting p = w ’ l t , then we have
From Eq. ( 1 19), the PDF of y is
254
M I K H A I LSHNAIDER A N D ANDREW P. PAPLINSKI
After substitution of a specified in Eq. (1 17), the last equation can be written in the following form:
Finally,
where B =
J ( ri - t2)1/S-‘dt.
The last part of Eq. (121), namely,
1
( c y ) 2 / c - ‘ exp(-cy), is the Gamr(2/0 ma distribution. In general, the PDF of y follows the Gamma distribution. ~
1
In other words, g ( y ) is proportional to - ( ~ y ) ~ ’ ~ - ’ exp(-cy) where y = T(r/O C:,,lxilc. When is set to 2, that is in the case of the Gaussian PDF, g(y) becomes the well-known chi-square distribution with Y degrees of freedom. In conclusion we can say that using the forementioned approach the probabilities of codevectors located on the equiprobable surfaces can be closely estimated. The close form solution for these probabilities depends on the dimensionality Y of the quantized vectors and the norm and is always in the form of a Gamma distribution.
e
e
X. EXPERIMENTAL RESULTS As discussed previously, the basic I-D wavelet transform can be performed as an expansion using a pair of filters, of which one is a lowpass and the other high-pass. The typical extension to the 2-D cases is based on the consecutive application of the algorithm to the columns and rows of the input matrix, as depicted in Figs. 1 , 2 and 3. The depth of the transfonns is only limited by the size of input images. In our experiments we have used the three-level wavelet expansion tree. Most of the information in real image data is concentrated in the low to medium frequency components. Therefore, we increase the dimension of the quantization veclors moving from low to high frequencies and, as a result, we obtain fine quantization at low frequencies and coarse quantization at higher frequencies. In the examples that follow, we use 1 x 1 blocks, or scalar quantization, for the wavelet coefficients of the level “3,” 2 x 2 blocks for level
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
255
“2,” and 4 x 4 blocks for level “ I . ” Thus, we consider three possible dimensions of the quantization vectors, namely, 1-D, 4-D and sixteen-dimensional ( 16-D) vectors. For simulation purposes we use three test images: “Lena” (Fig. 4), “Mandrill’’ (Fig. 21) and “Coast” (Fig. 5 ) , each of size 512 x 512 pixels. The test images have been expanded by means of the three level wavelet transform. This tree structure has been examined in a number of recent studies (Antonini et ~ i l . ,1992; Shnaider and Paplidski, 1995 a, b) and found to be well suited for image coding applications. The biorthogonal filters “(4, 4)” (Daubechies, 1992) used in the experiments are specified in Table VII and Eq. (122).
The matrix of wavelet coefficients consists of 7 blocks, as listed in Table VIII, each block representing certain spatial frequencies. The texture block denoted BIORTHOGONAI. II
0
TABLE VII Fii:rm (4,4) U S E D I N
SIMIJLATION
fl
&2
13
54
h,,
0.6029
0.2669
-0.0782
-0.0169
0.0267
h,
0.5575
0.2956
-0.0288
-0,0456
0
FICURI:2 I . Image “Mandrill”: original.
256
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
TABLE VIlI FOR IMAGES “LENA,”‘‘MANDRII.I.,” “COAST” Blt ALLOCATIONS 0.6 bpp
WITH THE
AVERAGE Bit RAI E OF
Image
3
2H
2v
2D
IH
IV
ID
“Lena”
3.66
I .77
0.93
0.57
0.67
0
0
“Mandrill”
2.95
0.98
1.55
0.69
0.12
0.74
0
“Coast”
4.07
2.04
1.75
0.25
0
0.38
0
by “3” in Table VIII and located in the upper left corner of the matrix of wavelet coefficients has been encoded with scalar quantization using the Z I lattice. The coefficient blocks of level 2 (2H, 2V, 2D) have been quantized with 4-D vectors using the 0 4 lattice. To quantize the highest frequency blocks of level 1 ( 1 H, 1 V, 1 D) we have used 16-D vectors coded using the A 16 lattice. Before quantization, approximately 75% of the wavelet coefficients within each block were rescaled with the same scaling factor s following the scaling algorithm presented in Sect. VII. The rest of the coefficients were scaled individually. The optimal size of the codebook for each quantizer has been determined using the routine developed in Section I1 with the primary bit per pixel parameter set to 0.6. The bit allocation for each block of the test images is given in Table VIII. It can be seen from Table VIII that the number of bits per pixel required for optimal quantization decreases when moving from low to high frequency blocks. This is consistent with our preliminary assumption that the energy of real image data is mostly concentrated in low and medium frequencies. Due to the nature of lattice quantization the bit rates obtained were rounded up to the nearest codebook length available for the corresponding lattice. For example, the number of bits per coefficient for the block “2H” of the image “Coast” is 2.04 (Table VIII). As each block is 2 x 2 coefficients, the number of bits per block is 8.16 with a corresponding codebook size of approximately 286. Now, from Table 111 we have that the nearest number of lattice points that makes up the codebook is oC ,!, A, = 3 13. Finally, the quantized coefficients were encoded with the Huffman coder. The resulting compressed images were of the following sizes: 10908 bytes for the “Lena” image, 18566 bytes for “Mandrill” and 14674 bytes for “Coast.” These correspond to compression ratios of 24.0: I , 14.1:1, and 17.9:1, respectively. The specified compression ratios are based on the size of the file containing the encoded image and all additional information required to decode it. The reconstructed images are shown in Figs. 22, 23, and 24. The fidelity of compression was measured using the peak signal-to-noise ratio (PSNR). The results obtained are collected in Table 1X. As might have been expected the PSNR for the image “Mandrill” is lower than for the other test images due to the high complexity of this image.
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
257
FIGURF, 22 Image “Lcna” encotled with thc A-type lattices.
FIGURE 23. linage “Mandrill” encoded with the A-type lattices
I n addition to the PSNR, in image compression another factor of general concern is the speed of coding. I t is possible to observe that lattice quantization is significantly faster than most of the existing quantization methods due to the use of the fast algorithms for calculating the closest lattice point (or codevector), and the elimination of the need to generate the codebook. For lattice quantization it is possible to further reduce the time for encoding
258
M I K H A ~ LSHNAIDER A N D ANDREW P. PAPLINSKI
FIGLIRF 24. image “Coast” encoded with the A-type lattice\. TABLE IX COMPRESSION RESUl-TS USING ‘THE “OPIIMAL” QUANIIZERS
LATTICE
Image
Compression Ratio
“Lena”
24.0: 1
32.06
“Mandrill”
14.1:l
23.82
“Coast”
17.9:I
34.23
PSNR(dB)
by choosing lattices with low complexity quantization algorithms. One of the most advantageous possibilities is to use the D-type lattices, which deliver high-speed encoding algorithms. The combination of the wavelet transform with the D-type lattices was examined in Shnaider and Paplinski (1997).
XI. CONCLUSIONS One of the most prominent directions in the area of coding of still images is based on the utilization of spatial correlation between neighboring pixels i n an image. Such correlation can be exploited by mapping the image data into a frequency plane using one of the transforms, the wavelet transform being the best option contender. The resulting set of transform coefficients represents the original image. The coefficients are subsequently quantized and coded to achieve the desired compression ratio. The original image is recovered by the
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
259
inverse procedure. The quality of an image compression system is assessed by the fidelity of the output images, the compression ratio, and the time required for encoding and decoding, as well as by a number of secondary criteria. Typically, it is not easy to satisfy all of the criteria for a high-quality image coding system, and therefore a trade-off between various requirements must be found with emphasis on the prime objectives, which are the compression ratio, quality of encoding, and the time for encoding and decoding. As has been shown here, an image coding system based on the wavelet transform and lattice quantization supported by appropriate bit allocation procedure and entropy coding gives excellent fidelity for a given compression ratio, maintains high speed in both encoding and decoding operations, and is not very demanding in terms of implementation costs. Although the combination of wavelet transform and different methods of quantization has been investigated for image compression in a number of recent studies, the introduction of lattice quantization in such an image compression system seems a rather novel direction in the research. I t has been gaining popularity following a number of recent developments in lattice theory. The study of lattice theory presented in this text is a summary of current developments in the context of the utilization of lattices for quantization. We have shown that the lattices constitute optimal or near-optimal quantizer systems for encoding the wavelet coefficients. We have also examined two of the most interesting types of lattices, namely, the root and laminated lattices. Another problem addressed in this work has been the number of lattice points available for quantization. We have presented methods for calculation of an approximate solution, as well as the exact solution to this problem. In order to minimize the distortion introduced by the procedure of quantizing the wavelet coefficients, we have developed an optimal bit allocation routine. This routine allows distribution of the available bits so that the fidelity of the compression system is maximized. By using a model of the probability distribution function of coefficients, instead of the actual PDF, we have significantly reduced the amount of computation required to obtain the solution of the distortion function, and as a result lowered the overall time of encoding. In the experimental part of this work we have tested the performance of our coding system based on the discrete wavelet transform and lattice quantization. In conclusion we can state that in this study we have both investigated the theoretical foundation for an image coding system based on the wavelet transform and lattice quantization and developed and tested such a system. Through the number of simulations this system has demonstrated its capacity to encode and decode gray-scale images at low to medium bit rates while maintaining excellent fidelity and a high coding speed. Optimization of the
260
MIKHAIL SHNAIDER AND ANDREW P. PAPLlNSKl
algorithm that has been developed has resulted in both a reduction of its complexity and, consecutively, simplicity of its implementation. We would also like to note that the high speed of coding makes the system that has been developed particularly advantageous in the context of real-time image compression.
APPENDIX A: CARTAN MATKICES OF SOMEROOTSYSTEMS The Cartan matrices for the root systems A,,, B , , C,,,D,,, Eg, E,, E x , F 4 and G2 are:
2 - 1 -1
-1
cJI
=
.
.
2
-1
0
I.
. .
.
.
.
. . . 0 - 1 2
. . . .
.
.
.
.
.
.
.
.
.
- 2 0 -1 = 0 0 - 0
.
.
2 - 1 0 . 2 - 1 . 0 - 1 2 - 1 .
.
. . . . . . .
.
- 2 - 1 0 . -1 2 - 1 . . . 0 . . . D,, = 0 . . . 0 . . . - 0 . . .
Eh
.
o . . . .
o . . . . . 2 - 1 2 - 1 0 . . , . 0 0
-1
0
2 - 1
.
.
.
. .
(123)
.
. .
0 0
.
( 1 24)
.
- 1 2 - 2 . 0 - 1 2
. . .
.
. . . . . .
. .
.
.
.
.
0 0 0
. .
.
.
. . . - 1 . . . . . .
. . .
2 - 1 -1
2 0 - 1 0 - 1
0 - 1 0 0 0 2 0 - 1 0 0 0 2 - 1 0 0 -1 -1 2 -1 0 0 0 - 1 2 - 1 0 0 0 - 1 2
0 -1 2 0
(125)
0 0 0
-1 0
2
(127)
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
. 2 0 -1 0 0 0
0 0 0 -1 0 2 - 1 0 0 - 1 2 - 1 0 0 - 1 2
0 - 1 0 2 0 - 1 0 2 -1 -1 -1 2 0 0 - 1
0 0
. o
0 0
0 0 0
0 0 0 0
- 2 0 - 1 0 0 0 0 2 0 - 1 0 0 -1 0 2 - 1 0 0 0 - 1 - 1 2 - 1 0 0 0 - 1 2 - 1 0 0 0 0 0 - 1 2 0 0 0 0 0 - 1 - 0 0 0 0 0 0 -
-;-: -: 0 0
-I
2
0 - 1
26 1
-I
0 0 0 0 0 1 2 1
0 0 0 0 0 0 1 2
':I 2
REFERENCES Antonini, M., Barlaud, M., Mathieu, P., and Dauhechics, 1. (1992). Image coding using wavelet transform, IEEE Truris. I n i ~ g eProcessing, I : 205 - 220. Barlaud, M., Sole, P., Gaidon, T., Antonini, M., and Mathieu, P. (1994). Pyramidal lattice vector quantization for rnultiscale image coding, IEEE Truns. Irritrgr Proc., 3: 367-381. Barlaud, M. Sole, P., Moureaux, J., Antonini. M., and Gauthier, P. (1993). Elliptical codehook for lattice vector quantization, In Proc. ICASSP, 5 : 590-593. Berger, T. ( I97 I ). Rute Distortion Tlieor:y; A Motheninticd Rusis for Dutu Coniprr.x.sion.Englewood Cliffs, NJ: Prentice-Hall, Inc. Birkhoff, G. (1948). Lrrrticr Throgi. Rev.Ed. Campbell, F. and Rohson, J . (1968). Application of Fourier analysis to the visibility of gratings, Jonr. Physiology, 197: 55 I - 566. Chafiivi, H . and Tabatahai, A. (1988). Suhhand coding of monochrome and colour images, IEEE Truns. Circuirs unrl Sy.stoin.s. 35: 207- 2 14. Chen. F., Gao. Z., and Villasenor, J. (1997). Lattice vector quantization of gcneralized gaussian sources, IEEE Truns. Infi~rniuti~v~ Throry, 43: 92- 103. Conway, J. and Sloane, N. (1982). Fast quantizing and decoding algorithms for lattice quantizers and codes, IEEE Trans. liforin. Tlieon~,28: 227-232. Conway, J. and Sloane, N. (1982). Voronoi regions of lattices, second moinents of polytopca, and quantization, IEEE Truns. Iizfiwm. Tlirop, 28: 2 I I -226.
262
MIKHAIL SHNAIDER AND ANDREW P. PAPLINSKI
Conway, J. and Sloane. N. (1983). A fast encoding method for lattice codes and quantirera. lEEE Trims. Inform. Theory, 29: 820- 824. Conway, J. and Sloane, N. (1993). Sphere Packings, Lattices c i n d Groups. 2nd Edition: New York: Springer-Verlag. Cosman, P., Gray. R., and Vetterli, M. (1996). Vector quantization of image subbands: A survey, Trcrns. linage Proc., 5: 202-225. , H. (1973). Kegitlrir PoIvtope.s. 3rd Edition, New York: Dover. Daubechies, I. ( 1992). Ten Lwtures on Wtrvelet.~.Society for Intlustrial and Applied Mathematics. h, B., and Lucier, B. (1992). Image compression through wavelet transform m s . Inform. T h e o y , 38: 719-746. . Lattiws trnd C(~tle.~. Berlin: Freidr. Vieweg & Sohn Verlagsgesellschaft inbH. Fischcr. T. R. (1986). A pyramid vector quantizer, IEEE T r m s . Injbrmotioii Theor:v. 32: 568-583. Gersho, A . ( 1979). Asymptotically optimal block quantization, IEEE Trtins. Ir!f?wni. T h e o y , 25: 373 -380. Gersho, A. and Gray, R. (1992). Vector Qucintizcifion and S i g i i ~ lCoinpre.s.sion. Boston: Kluwer Acadcniic Publishers. Gibson, J. and Sayood, K . ( 1988). Lattice quantization. Ach. Elec.nronic.smid E/ec,trori PhF.sics, 72: 259-330. Gilmore, R. ( 1974). Lie Groups. Lie Algrhrns, trrzrl Some of Their Applicntions. New York: John Wiley and Sons, Inc. .~ Berlin: Springier-Verlag. Grosswald, E. ( 1985). KepreseirtLitioiis of/ri/egrr.s u s S I O I ZofSyuure.s. ~ SEdition. . New York: SpringerGrove. L. and Benson, C. (1972). Finite, ReJI~,ctiortG ~ C J U2nd Verlag, Inc. Humphreys, J. ( 1972). lritrocluctiori to Lie A/,yehrci.s c i n d R~,/~rP.verrttitioiiT/7eon. New York: Springer-Verlag Inc. Igusa, J . (1972). Thetci F I U Z ~ ~ ~Berlin: ( J ~ ~Springler-Verlag. .S. KrBtiel, E. ( 1988). L ~ t t i wPoints. Berlin: VEB Dcutscher Verlag der Wissenschaften. Lang, S. ( 1976). Introdcdon to Modulcir Forms. Berlin: Springer-Verlag. Leech. J. and Sloane, N. (I97 I ). Sphere packings and error-correcting codes. Ctrruidiciri ./our. Mrith.. 23: 7 I 8 - 745. Lewis, A. and Knowles. G. ( 1992). Image compression using the 2-D wavelet transforin, I Trans. Imrige Proc~rssing,I: 244-250. . (1994). A study of vector transform coding of subband decomposed C'ircuit.s Sy.s. Vid(w TEchnology, 4: 383 -39 1 . Sloane, N. ( 1 977). The the or^ of Err-or-Corir,c.firf'~Codes. Austcrdinn: North-Holland. Mallat, S. ( 1989). Multifrequency channel decompositions of images and wavelet models, I Trtrns. ASSP, 37: 2091 -2110. MATLAB Ryfir-tvice Chritle. (1992). The Math Works, Inc. Meycr, Y. ( 1993). Wtivrlrfs: Algorithnis tint/ A p p I i u i t i o ) ainhridge University PI-caa. Rankin, R. ( 1977j. Motlrrkir Forms r r n d Functions. Cam Riskin, E. (1991). Optimal bit allocation via the generalized BFOS algorithm, IEEE Trims. / ) I fiwni. Theory. 37: 400-402. Sakrison. D. ( 1 968). A geometric treatincnt of the source encoding of a gaussian rmdoin variable. IEEE Trcins. Injiirm. T h r w y , 14: 48 I -486. Sakrison. D. ( I979 j. Image coding applications of vision models. In In7cigr Trm~.\rni.s.siorif i d nirprrs (K. Pratt, Ed.). pp. 2 1-71, Boston: Academic Press, Inc. Sampson, D., da Silva. E., and Ghanbari, M. (1994). Wavelet transform image coding using lattice vector quantization, Iilectroti. Lettencs, 30: 1477- 1478. Sayood, J. Gibson, K.. and Rost, M. (1984). An algorithm for uniform vector quantizer design, lEEE Trcrns. Inforin. Thcwry, 30: 80.5 - 8 14.
LATTICE VECTOR QUANTIZATION FOR WAVELET-BASED IMAGE CODING
263
Senoo, T. and Girod, B. (1992). Vector quantization for cntropy coding o f irnage subbands, IEEE 1ruii.s. Inicige Processing, 1: 526-532. Shannon. C. (1948). A mathematical theory of communication, Bell Svs. Tech., J,, 27, 379-423, 623-656. Shapiro, J . ( 1993). Embedded image coding using zerotrees of wavelet coefficients, lEEE Trcriis. .Sr,qna/ Processing, 41: 3445 -3462. Shnaider, M. ( 1997). A S t i i d j 011 (in Imrige Coding System Brrseti or1 the Wavelet Trunsform cmcl Ldtice Vector Quanriztrrion. PhD thesis, Monash University, Australia. Shnaider, M. and Papliriski, A. ( 1994). Wavelet transform for irnagc coding, Teclirzic~rdReport 94- I I , Monash University, Department of Robotics and Digital Technology. Shnaider, M. and Paplinski, A. ( 1 9 9 5 ) . Compression of fingerprint images using wavelet transform and vector quantization, In Proc.. of l n ~S. y p . Sig. Proc. Appl., ISSPA '96 (Gold Coast, Australia). Shnaider, M. and Paplinski, A. (l995h). A novel wavelet toolbox with optimal vector quantizer, In Pmc. Dig. Initrge Camp Tech. App/., DICIA '9.5 (Brisbane, Australia). Shnaider, M. and Paplinski, A. (1997). Image coding through D lattice quantization of wavelet coefficients, Cruphicul Models and Iniuge Processing, 59: 193-204. Sloane, N. ( 1977). Binary codes, lattices and sphere packings, In Conibirzatoricd S u n c~edingsof'The Sixth Rrirish Comhiriutoritil Conference. (P. Cameron, Ed.), pp. 1 17- 164, New York: Academic Press, Inc. Vetterli, M. ( 1 984). Multi-dimensional sub-band coding: some thcory and algorithms, Signrrl P racessing, 6: 97 - 1 12. Wang, X., Chan, E., Mandal, M., and Panchanathan, S . (1996). Wavelet-based image coding using nonlinear interpolative vector quantization, IEEE Trrm.v. Image Proc., 5: S 18-522. Woods, J. and O'Neil, S. (1986). Subband coding of images, IEEE Trans. Acoust. Speech Sigriul Processing. 34: 1278- 1288. Zador, P. ( 1982). Asymptotic quantization error of continuous signals and their quantization diinension, IEEE Tmii.s. Informarion Theoty, 28: 139- 149. Zettler, W., Huffman, J., and Linden, D. (1990). Application of corupactly supported wavelets , 150- 160. to image compression, In Proc,. sP/E Conf / m q e Pmc. A/gorithna, ~ d z n . pp.
This Page Intentionally Left Blank
ADVANCES IN IMAGING A N D H . I ~ C I K O NPHYSICS. VOI .. 109
Fuzzy Cellular Neural Networks and Their Applications to Image Processing TAO YANC
I . Introduction .
.
.
.
.
.
.
266
. . . . . . . . . . . . . . . . .
266
.
A . State o l the Art of CNN .
.
.
.
.
.
B. Structures of Conventional CNN . . .
.
.
.
.
.
.
.
.
.
.
.
269
.
.
215
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. . .
.
.
.
.
.
.
.
.
.
.
.
.
.
279
C . Clnssilication of FCNN . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
282
B . Principles of General FCNN
.
.
.
275
.
.
.
C. Notes . . . . . . . . . . . . . . . . . . . . . . . . I I . Fuzzy Cellular Neural Networks . . . . . . . . . . . . . . . . .
.
.
.
A . A Unilied CNN Structure
.
.
276
D . Different Structures of FCNN . . .
2x2
E . Differences between FCNN and FNN
287
111. Theory of Fuzzy Ccllular Neural Networks .
A . Elementary Theory
. . . . . . . . . . . .
290
. . . . . . . . . . . . . . . . . . .
290
B. Global Stability . . . . . . . . . . . . . . . . . . . . .
299
C . Local Stability . . . . . . . . . . . . . . . . . . . . .
310
D . Type-Il Delay-Type FCNN . . . . . . . . . . . . . . . . .
313
E . Type-I DFCNN . . . . . . . . . . . . . . . . . . . . .
320
F. Stability of Discrete-Time FCNN . . . . . . . . . . . . . . .
324
1V . FCNN as Computational Arrays
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
327
.
A . Basic Knowledge of Mathematical Morphology . . . . . . . . . .
327
B. Implementation of Morphological Operations . . . . . . . . . . .
329
C . MIN / MAX CNN . . . . . . . . . . . . . . . . . . . .
333
D . Face Image Processing Using Type-I FCNN .
.
.
.
.
.
.
.
.
.
.
356
.
.
.
.
.
.
.
.
.
.
360
. . . . . . . .
361
B . Embedding Fuzzy Inference into FCNN . . . . . . . . . . . . .
364
C . Application to Image Processing .
. . . . . . . . . . . . . .
366
. . . . . . . . . . . . . . . .
375
V . Embcd Linguistic Statements into FCNN
.
.
.
A . FCNN: Intcrkiaccs between Human Experts and CNN
VI . Learning Algorithms of FCNN .
A . Learning Stiucturing Elements . . . . . . . . . . . . . . . .
377
B. Advanced Learning Algorithms of Additive Discrete-Time FCNN .
.
382
C . Learning from Linguistic Inputs . . . . . . . . . . . . . . .
387
.
.
265 Volu, n c 10‘) ISBN 11- I ?-(I1475 I - ?
ADV.\NCES IN IMAGING A N D ELECTRON PHYSICS Copyrighi ‘i) 1999 hy Ac, tdcnuc l’rc\h All righi\ of reprr~Iuclion111 m y tom ~csel-ved ISSN 107h-Sh70/0‘1 ~ 3 0 . 0 1 1
266
TAO YANG
VII. Generic Algorithm for FCNN . . . . . . . . . . . . . . . . .
40 I
A. Genetic Algorithm for Optimizing FCNN . . . . . . . . . . . .
40 I
B. Application to Image Processing . . . . . . . . . . . . . . .
404
V111. Applications of Discrete-Time FCNN . . . . . . . . . . . . . . .
407
A. lmplernenting Nonlinear Fuzzy Operators for Image Processing . . . . .
407
B. Embedding Local Fuzzy Relation Equations . . . . . . . . . . .
413
IX. Conclusions and Future Work . . . . . . . . . . . . . . . . .
420
. . . . . . . . . . . . . . . . . . .
42 1
A. The CNN Universe
B. Implementing Fuzzy Spatial Dilemmas Using Type-I1 DTFCNN References
.
.
.
. . . . . . . . . . . . . . . . . . . . . . .
.
42 I
427
I. INTRODUCTION In this section we give an overview of the state of the art of celhdrir nei4rul netwot-k(s) (CNN). This overview will give readers who are not familiar with CNN all elementary concepts, expressions, and symbols. For experts of CNN, we will show why from the systematic point of view the structure of discrete-time CNN(DTCNN) is more important than other CNN structures that appeared during the same period, for example, delay-type CNN(DCNN), CNN with nonlinear synaptic laws(NCNN), and chaotic CNN(CCNN). We also show that CNN universal machine(CNNUM) is a secondary level concept that can not be mixed with the CNN universe in the first level (the elementary level), which consists of both conventional and fuzzy CNN(FCNN). The purpose of this section is not to survey all existing CNN literature but to show the existing CNN map from a systematic point of view. A. State
of the Art oj'CNN
Cellular neural networks are locally connected nonlinear networks. They originally stemmed from cellulcrr ciutonzrita(CA) (Goles and Martinez, 1994; Toffoli, 1987; Farmer, 1984; Perdang and Lejeune, 1993; Gutowitz, 1991) and tirtifificicilneurcil networks (ANN) (Hassoun, 1995; Braspenning et a / . , 1995; Sethi and Jain, 1991; Diederich, 1990; Morgan, 1990; Zhou, 1992; Kulkarni, 1994; Karayiannis and Venetsanopoulos, 1993). Local connectedness is the most significant property of CNN. Continuous dynamics distinguish CNN from CA. The local connectedness restricts the ability of CNN to solve many global problems that can not be decomposed into local components. However, the local property has its advantages, such as easy implementation using VLSI technology and efficiency for solving local problems. Another important
FUZZY CEI.,I,ULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
267
aspect of CNN is that they provide paradigms for studying emergent computation (Forrest, 1991) and relevant topics, for example, artificial life (AL) (Artificial Life Workshop, 1994; Langton, 1995; Thalmann and Thalmann, 19941, as well as some other models such as CA and the spin glass model (Mezard et d . , 1987). Cellular neural networks were first introduced i n two twin papers by Chua and Yang (1988). This decade viewed rapid growth in this field. So far, two books (Roska and Vandewalle, 1993; Chua, 1998), five conference proceedings (Proceedings CNNA, 1990, 1992, 1994, 1996, 1998) and many special issues in different international journals (Int. J . Circuit Theory and Appls. Sept.-Oct. 1992, May-Jun. 1995, Jan.-Feb. 1996; IEEE Trans. on Circuits and Systems-I, Fund. Theory and Appls. Mar. 1993; IEEE Trans. on Circuits and Systems-11, Ana. and Dig. Signal Processing, Mar. 1993) have been published. Since the invention of CNN there have been many branches under development. The main branches are motivated by engineering applications and biological science-related findings, especially those in the area of retinal research because the retina and CNN share many common properties, including layered structures and local connectedness (Hubel, 1988). Engineers and scientists keep trying to find different CNN structures that are easily implemented using different technologies, including VLSI, optical components, and quantum dot arrays (Toth et a/., 1996). So far, CNN has been used mainly in image processing because the two-dimensional (2D) array of cells is directly connected with digital images (usually 2D arrays of small units called pixels). As image processing is well-studied and is one of the most challenging areas of artificial intelligence and signal processing, the relation between it and CNN is 2-fold. First, many existing results from linear and nonlinear image processing can be directly mapped into CNN structures. Most of the early literature and much of the recent literature concentrate on this area (Osuna and Moschytz, 1996; Sziranyi, 1996; Destri and Marenzoni, 1996; Crounse and Chua, 1996; Cruz and Chua, 1995; Thiran et al., 1995; Slavora,1995; Park and Chae, 1995; Tanaka et d . , 1994; lkegami and Tanaka, 1994a; Shia et d., 1993; Crounse et ul., 1993; Paul rt ul., 1993b; Galias, 1993; Roska et nl., 1992; Zarandy et nl., 1996; Nemes et nl., 1996; Kozek and Roska, 1996; Roska et d . , 1995; Venetianer et nl., 1995; Chua and Roska, 1993; Perez-Munuzuri rt nl., 1993; Shi and Chua, 1992; Suzuki et d . , 1992; Rekeczky et al., 1995; Yang, 199Sa). Almost all of the results in this direction employ computational CNN structures that simply map existing image processing algorithms into weights of CNN. The new tendency in CNN research is to embed nonlinear synaptic laws into a CNN framework. This is motivated directly by nonlinear image processing techniques. Again, many of these results are focused on computational CNN (Receczky et nl., 1995b). On the other hand, CNN structures
268
TAO YANG
contribute to image processing with new computational arrays and learning algorithms. Although there exist some other techniques, such as optical computing array (Bey et al., 1993; Fruehauf and Lueder, 1990; Slot, 1992; Slot et ol., 1992; Fruehauf et ul., 1993; Hung et ctl., 1994) and quantum dot computing array (Toth et id., 1996), so far, the VLSI technique is the only available way to implement CNN (Varrientos et al., 1993; Halonen et al., 1991a; Varrientos et a/., 1990b; Faure and Mazare, 1990; Baktir and Tan, 1991; Huerdas and Rueda, 1990; Halonen and Vaananen, 1990; Lai and Leong, 1995, 1996; Pham et al., 1995b; Chua, 1992; Paasio et al. 1996a,b; Salerno et ul., 1995; Sargeni, 1994; Espejo et al., 1992, 1993, 1994b,d, 1996b; Huertas et al., 1992; Kacprzak and Slot, 1992; Ruedas and Huertas. 1992; Nossek et d . , 1992; Backtir and Tan, 1992; Rodriguez-Vazquez et a/., 1993; Kinget and Steyaert, 1994a,b,c,e; , Piovaccari and Stetti, 1994; Kowalski et a1.,1994; Sani Bang, et ~ l . 1994; et [/I., 1994; Sheu et al., 1994; Berger et al., 1994; Doan et al., 1994; Wen, 1994; Lai et al., 1995; Sheu et al, 1995 a,b; Perfetti, 1994; Anguita et a/., 1995; Cordarilli and Sargeni, 1995). Some VLSI implementations of CNN are designed for special tasks with fixed parameters (Varrientos et al., 1990a; Yang et id., 1990; Rodriquez-Vazquez et d . , 1990; Cruz and Chua, 1991; Kowalski et a/., 1994; Szolgay et d., 1994), while others use programmable chips in which parameters of CNN are adaptable (Betta et a/., 1993; Cardarilli, et ~ 1 . . 1992, 1993; Dalla Betta e f al., 1992; Anguita et al., 1991, 1996; Halonen et al. 1990, 1991b,c; Paasio et d., 1994a,b,c; Kinget and Steyaert, 1994d, 1995; Reljin et al., 1995; Salerno et al., 1996; Sargeni and Bonaiuto, 1994, 1995, 1996; Dominguez-Castro et al., 1994b; Chua et al., 1996a; Raffo rt ul., 1996; Espejo et al., 1996a; Cruz et al., 1994; Lim and Moschytz, 1994). Some softwares and hardware accelerating boards were also developed (Varrientos and Sanchez-Sinencio, 1992; Roska and Radvanyi, 1990; Roska and Bartfai, 1990; Roska etal., 1990, 1992; Krieg and Chua, 1990; Lee and de Gyvez, 1994 a,b; Kohn et al., 1992; Dominguez-Castro et id., 1994; de Gyvez, 1994). In VLSI-based implementation, both digital CNN and analog CNN were presented. Parallel input and output problems can be solved by using embedded optical sensors and optical interface in each cell (Espejo et ul., 1994a.c; Szolgay et al., 1994; Shev et al., 19953; Sullivan et d., 1996; Hung et nl.. 1996; Beccherelli, et al., 1994; Balsi et al., 1994). However, state-of-the-art VLSI CNN chips are no better than those programmable DSP chips that had been developed many years ago and are in wide use today. However, we can not overlook the potential power of future CNN chips. This is one of the main reasons why the study of CNN is becoming more and more attractive. In a word, the state-of-the-art CNN structures emphasize VLSI implementation and applications to image processing. There exists almost no work that considers the global CNN universe from a systematic point of view.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
269
B. Structures of Conventional CNN 1. Elementary CNN
Definition 1. The elementary processor in a CNN array is called a cell. Remark: A cell is the most elementary unit in a CNN array; it builds up the structure of a CNN. We denote a cell, respectively, by C,, C,, and C,,L in one-, two- and three-dimensional (ID, 2D, 3D) CNN arrays.
Definition 2. Let r E N be a positive integer, the r-neighborhood system, Nt ( i ) , N , ( i j ) and N,(ijk) of a central cell C,, C,,, C,,k in a 1-, 2-, and 3D CNN, respectively, be defined by A
N,(i) = (C, : ma x(dl (j - i ) ) 5 r ]
(1)
where dl(.) is a distance defined in 91, A
N,(ij) = {Ckl : max(d2(k - i , 1 - j ) )5 r]
(2)
where d2(., .) is a distance defined in !Ti2, and A
N,(ijk) = (C,,,
: max(d3(p
-
i, q - j, s - k)) 5 r ]
(3)
where d3(., ., .) is a distance defined in 913. Remark: A n r-neighborhood system is the core definition for describing the local connectedness of a CNN. This concept defines only the possible longest synaptic weight that may affect a central cell. It does not give connections between cells. Although we can also define the neighborhood system in !)it', n > 3, in view of future technologies, we find n I 3 is enough. We then define the dynamics of a cell in CNN. Here, only the 2D case is given. The 1D and 3D cases are similar.
Definition 3. An elementary CNN is an M x N array that consists of M rows and N columns cells and every cell Ci, is given by the following equations: 1 . State equation
270
TAO YANG
where x i j , uij and y;j denote the state variable, input, and output of cell Ci,, respectively. And A ( i , j ; k , 1) and B ( i , j ; k , I ) denote the feedbuck and feedforward synaptic weights between cells Cij and Ckl, respectively. The bias is l i j ( t )(also called threshold) of cell Cij, which may be static, timevarying, space-invariant, or space-varying; C > 0 and R., > 0 are the values of the capacitor and the resistor, respectively'; and yij is given by 2. Output equation
u,, is given by
3 . lnput equation %, = ,f,A=5/)
where E,, is the detected signal, for example, the intensity of light detected by an embedded optical sensor in cell C , / .In the elementary CNN, f , , ( . ) is used to normalize the detected signal to a proper range. 4. Initial condition x,,(O).
5. Boundary conditions. Remark: In a VLSI implementation of CNN, u,,, x,,, and y,, are three voltages and / , / ( t ) is a bias current. The block diagram of a cell based on electronic circuits is shown in Fig. 1 .
FIGURE1 . Block diagram of a cell based on electronic circuit for elementary C N N ; E ; j is the output of the embeded signal sensor; V,,t,. V,v,, and V , , are input voltage, state voltage, and (i,j : k, I ) are currents output voltage, respectively; l ; j is the bias current; I,,, (i, j ; k, I ) and l A y from feedforward path and feedback path, respectively; f ( x , , ) is the output nonlinearity.
I Of course, a cell is not necessary to be a first-order dynamic system; it can also be represented by a high-order ordinary differential equation or a functional differential equation.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
27 I
If a CNN is space-invariant, then Eq. (4) can be written as a 2D convolution form (Chua and Yang, 1988a)
where "*" denotes a 2D convolution. A is called feedback template and B is called feedforward template. 2. Diflerent CNN Structures Since the invention of CNN (Chua and Yang, 1988 a,b), different CNN structures have been proposed for different applications and from different biological models. In motion-related applications (Roska and Chua, 1993a), time delays are introduced into CNN structures and result in a kind of deluy-type CNN (DCNN) that is defined by (Roska and Chua, 1990b, 1993a):
where r E !)I+ is called time-deluy. The DCNN was proved to be essential in motion-related CNN-applications (Roska and Chua, 1993a). There are some theoretical results on the stability of DCNN in works by Finnocchiaro and Perfetti (1995), Gilli (1994), T. Yang et al., (1994), Huang et nl., (1994), Roska et a/., (1992f, 1993), Civalleri et d . , (1993), and Civalleri and Gilli (1992). As DCNN is governed by a set of functional differential equations (FDE), some complex phenomena, for example, chaos, were observed even when only a sinall number of cells were used (Civalleri and Gilli, 1994; Gilli, 1993); some of the results of predicting the chaotic sequence generated by chaotic DCNN are presented in Gilli ( 1 995). As only linear synaptic weights are not enough to deal with some image processing tasks where nonlinear properties are embedded, the CNN with nonlinear synuptic laws (NCNN) were introduced (Finocchiaro and Perfetti,
272
TAO YANG
1995). A general NCNN can be given by
wherex (N,-(ij ) ) and u ( N , ( i j ) ) denote state variables and inputs within N , . ( i j ) , respectively. In this case, synaptic laws are functions of time, state variables, and inputs within the neighborhood system. As CNN is a nonlinear dynamical array, some complex phenomena, for example chaos, were found even when a few cells were used (Tao, et ul., 1995; Zou and Nossek, 1991, 1992, 1993a,b,c; Zou et al., 1993; Arena et al., 1995a, 1996). The strange nonchaotic attractor is also observed in 2-cell quasiperiodically forced CNN. If an array of cells is used, hyperchaos also emerges (Arena et al., 1995b). Because the control of chaos is a very active field in view of its possible applications to spread spectrum communication (T. Yang and Chua, 199721, 1998a, b), secure communication (Yang, 1995b; Chua et al., 1996c; Yang and Chua, 1996a,b; Yang et al., 19978; Yang et ul., 1998d,e; Wu et al., 1996; Suykens et a/., 1998; Yang and Chua, 1997b,c; Yang et at., 39978; Yang et al., 1998c,h; Yang et al., 1997j, k) and measurement improvement, some control methods are also used to control the chaos generated by CNN (Galias and Nossek, 1994; Jankowski et al., 1996). There also exist some other kinds of CNN structures such as chuotic CNN (CCNN), where every cell is a chaotic dynamic system (Chua and Goras, 1995; Guzelis, 1993) that can be used to model some kinds of emergent behaviors and simulate some wave and pattern formation phenomena in an active medium (Perez-Munuzuri, et ul., 1995; Jankowski and Wanczuk, 1994; Chua and Goras, 1995; Chua et al., 1995). In CCNN arrays some nonlinear dynamic behaviors such as synchronization (Jankowski et al., 1994), cluttering, and cooperative phenomena were also found (Ogorzalek et al., 1994, 1996). The existing results of CCNN consist of two main branches. One branch studies how to use the elementary CNN to generate chaotic signals and relevant applications (Chua and Goras, 1995; Neines et al., 1995; Guzelis, 1993). The other branch studies how to use chaotic elements as elementary cells to model spatio-temporal chaotic processes (Perez-Munuzuri et al., 1995; Arena et ul., 1995b; Jankowski et al., 1994; Jankowski and Wanczuk, 1994; Ogorzalek et i l l . , 1996).. A multilnyer C N N (MCNN) uses more than one layer CNNs to perform a single task (Mizutani, 1994; Chua and Yang, 1988b; Harrer, 1993). Because the C N N universal machine (CNNUM) (Roska, 1994a,b; Roska and Chua, 1992d,e, 1993b; Crounse and Chua, 1996; Slot, 1994) is not an elementary CNN structure, but a platform for integrating the flow of CNN operations, we do not discuss it here; interested readers may read research
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
273
by the authors cited here. However, the CNNUM is an important tool for organizing different kinds of CNN structures to perform complicated tasks that a single CNN can not finish. The CNNUM sometimes can also be used to solve some global problems that are difficult to decompose. In fact, the CNNUM had been proved to be as universal as a Turing machine (Crounse and Chua, 1996). As it is only a platform for CNN operations, any kind of CNN should be included in the core of this platform, including DTCNN (Slot, 1994) and FCNN (T. Yang and L.B. Yang, 1997d,e). However, the state-ofthe-art CNNUM has only an elementary CNN core (Roska and Chua, I993b), so this platform needs further improvement.
3. Discrete-time CNN A discrete-time CNN (DTCNN) (Harrer and N o s e k , I992b) is defined by the following discrete dynamic equations: 1 . State equation A ( i , j ; k, I ) y u ( k )
x,,(k) = CAIEN,(I / )
+
B ( i , j ; k, I)uu
+I ,
CL/€N,(I./)
1 5 i S M , I 5 j l N , k = 1 . 2, . . . .
(10)
2. Output equation
3. Initial condition x,,(O), 1 5 i 5 M , I 5 j 5 N .
(12)
4. Input equation
5 . Boundary conditions
!N, y,,(k) E (0, I } and ii,, E (0, I ] are state, output, and input of cell C,,, respectively. Where I E !)I is the bias, k E Z is the discrete time, and F ( . ) is the nonlinear output characteristic of a cell, it should be implemented easily by using VLSI techniques. Similarly, we can rewrite Eq. (10) into the following form:
x,,(k) E
x;.j(k) = A
* ~ ; , ( k )+ B * u;;+ 1.
(13)
Observe that the structure of DTCNN is close to that of a CA. One advantage of DTCNN over the elementary CNN is that it has both binary input and output, which makes the connection between two different chips very easy. And the DTCNN is more robust than elementary CNN.
274
TAO YANG
Although DTCNN may be viewed as a discrete form of CNN, it is not necessary to think that CNN can d o everything DTCNN can do. For example, let us consider the following DTCNN defined by Yang et a / . ( 1 9 9 6 ~ ) :
I . State equation
2. Output equation
where @ is the exclusive OR(X0R).
3 . Cell nonlinearity
4. Initial condition
Observing that the nonlinearity of the forementioned cell is different from the original one in Harrer and Nossek (1992b), we use this nonlinearity to guarantee that the outputs of this DTCNN can be represented by 0- 1 logic. The output equation is also different from that in the original work by Harrer and Nossek (1992b) and needs a 2-bits local digital memory for storing previous outputs. As XOR is reversible, from Eq. (15) we have
+
+
which means that if we know the output y i j ( k 1) at k 1 and the output yi,(k) at k then we can find the output y i j ( k - 1) at k - 1. This kind of DTCNN is reversible, which has no corresponding CNN structure. Although it had been shown that a CNNUM program can model the behavior of any DTCNN, the invention of DTCNN played a very important role in CNN history. First, it introduced discrete-time dynamics into the CNN framework. It demonstrates the possibility that in the CNN universe, a cell may have different kinds of dynamics other than continuous dynamics. The invention of DTCNN reminds us that cell dynamics should evolve in the event space, which may include time uxis and any event sequence. Second, if we break the restriction of implementing consideration, DTCNN directly enlightens the concept of “structural dynamics” of the neighborhood system, which is a most important concept in a CNN universe. This concept will be discussed in Sect. 11. A.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
275
There are stability results of DTCNN in Harrer and Nossek ( 1992b); He and Ushida (1994). Some other theoretical results can be found in Shimizu et a / . ( 1 994); Perfetti (1993a,b); Fruehauf et ul. ( 1992); Galias ( 1992); Magnussen and Nossek (1994a). The learning algorithm based on DTCNN is presented in Harrer et ul., (1991a). And DTCNN can also be used in associative memories (Brucoli et ul. 1995a, 1996), image coding and decoding (Ikegami and Tanaka, 1994b, 1995), image thinning (Ho et d . , 1993; Ho and Mori, 1994), and other applications (Harrer et u /., 1994; Park et ul., 1994). Some DTCNN were designed to have continuous output (Coli et a/., 1995; C.M. Yang et al., 1994); this kind of DTCNN emphasized the discrete nature in time instead of both in time and inputloutput. There are also many results of hardware imple1996; Pham et a/., 1995a; H. Yang, 1994; mentation of DTCNN (Park et d., Hung et u/., 1994; Harrer et u / . , 1992; Harrer and Nossek, 1992a; Anguita et uf., 1993). Because DTCNN are nonlinear discrete-time dynamic systems, complex phenomena, for example, chaos, can also be easily found (Perfetti, 1995; C.M. Yang et uf., 1994). A multilayer DTCNN structure can be found in Harrer (1993).
C. Notes As CNNs can also Function as local connected learning networks, much information on CNN learning algorithms has been published to date (Nossek, 1996; Brucoli et ul., 1995b; Tetzlaff and Wolf, 1996a,b; Gunsel and Guzelis, 1995; He and Ushida, 1994; Nossek, 1994; Aizenberg and Aizenberg, 1994; Utschick and Nossek, 1994; Magnussen and Nossek, 1992, 1994b; Magnussen et [ I / . , 1994; Guzelis and Karamahmut, 1994; Schuler et d., 1992, 1994; Mizutani, 1994; Balsi, 1994; Hansen, 1992; Guzelis, 1992; Szolgay et ul., 1992; Puffer et d., 1995; Van Dam et a/., 1994; Aizenberg et a/., 1996; Roska, 1992a; K o c k et ul., 1993; Tzionas, 1996; Sziranyi and Csapodi, 1994; Zou et u/., 1990a,b; Harrer et ul., 1991b; Vandenberghe et id., 1990; Pelayo P t id., 1990). Because almost all these learning algorithms are similar to those used in ANN, we do not discuss them here. Interested readers are referred to the preceding references.
11. FUZZY CEILULAR NEURAL. NETWORKS
I n this section, we present the principles and structures of FCNN in the framework of a unified CNN structure. We first present the unified CNN structure
276
TAO YANG
from a systematic point of view, then FCNN is developed from this unified CNN structure by applying fuzzy set theory (Zadeh, 1965) into it. Fuzzy set theory provides an inference methodology that approximates human reasoning capabilities and can be applied to knowledge-based systems (Zadeh, 1965). It provides mathematical support to the capture of uncertainties associated with human cognitive processes, for example, thinking and reasoning. Also, it provides a mathematical methodology to model linguistic statements and knowledge. While fuzzy theory provides an inference mechanism under cognitive uncertainty, CNN structures offers advantages such as learning, adaptation, faulttolerance, and parallelism. However, it seems that a CNN cell, which is typically of a 3 x 3- or 5 x 5-cell neighborhood system, is a poor model of a real neuron (which typically has thousands of synapses (Taketoshi et a/., 1993; Joseph, 1993; Kuffler et nl., 1984; Poliakov, 1972)). There exist some uncertainties in CNN synaptic weights. On the other hand, the input information (e.g., images) may also bear fuzziness that stems from sensing, transmitting, and processing. The information flow propagating in a CNN is mostly like a fuzzy process if inputs and/or synaptic weights are fuzzy. The conventional CNN has a poor interface to the knowledge of human experts, which is represented by fuzzy IF-THEN rules, and the experience of human experts, which is described by linguistic statements. From these we know that it is very necessary to integrate fuzzy set theory with the CNN paradigm and give birth to a new structure calledfuzzy CNN(FCNN). The concepts of FCNN are reasonable extensions of CNN from classical set to fuzzy set. The principles of FCNN are based on uncertainties in human cognitive processes and in modeling neural systems. To provide the reader with an overview of the position of FCNN in the CNN universe, we first give a unified CNN structure and then discuss FCNN as an important generalization of the unified CNN structure. A. A Unified CNN Structure
I t is difficult to predict potential structures of CNN for different applications. Chua et al. (1995) presented a kind of generalization of CNN structure (which is a circuit-based unification). Although the concept of CNN is developed based on circuit theory, it is indeed a much wider concept from a systematic point of view because it reflects some aspects of life systems where some biological cells also share the nature of local connectedness. In this section, we unify CNN in a systematic framework. We present a unified CNN structure that includes all existing CNN structures as its subclasses and makes FCNN an important generalization, which embeds fuzzy dynamics and fuzzy information flows (linguistic flows) into the unified CNN structure.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
277
The most basic properties and principles of the unified CNN structure are local connectedness, dynamics, and the concept of cell. The local conrzectedness emphasizes not only easy implementation but also the inner nature of many biological, physical and social phenomena, which reflects a new tendency in artificial intelligence and artificial life where the new concept of decentrulization is proposed (Artificial Life Workshop, 1994). On the other hand, the local connectedness also provides us with a tool to imitate emergent behaviors of life systems (Forrest, 1991) besides CA. The dyiiarnics provide the ability of self-organization and learning and the basic condition for emergent computation. Without dynamics, CNN is only a computational array that does not diifer significantly from existing parallel image processing chips or parallel signal processing chips. The cell is the elementary unit of CNN, which is a trade-off between implementing complexity and function. A simple cell structure can support a bigger cell population and larger neighborhood systems but a relatively simple cell function. As the emergent computation is difficult to deal with, one may want sometimes to embed more controllable (programmable) functions into a cell such that the control of CNN behaviors becomes easier. But a more complicated cell decreases cell population and the size of neighborhood system or makes implementation very difficult. A cell can be any kind of individual. Local connectedness can be measured by a “distance.” The new concept we use to define the unified CNN structure is the dynamics, which consist of cell dynamics, neighborhood system dynamics, and synaptic weight dynamics. 1. Cell dynamics
where x ( N ,(u)) and u ( N ,(u))denote all states and inputs within the neighborhood system N , ( u ) .“u” is a generalized time, which may be an arbitrary event sequence, for example, discrete event sequences. The symbol ‘‘0” denotes a combining relation between injormarionJiow ( D x(., ., ., .)) and structurulJiow (S,(.)). Symbol
-”*0
denotes an arbitrary relationship from the right-hand side to the left-hand side. This relation may be defined by an ODE, a PDE, an FDE, discrete-time dynamics, linguistic dynamics, conceptual dynamics, functional dynamics, or any other dynamics.
’
“
This symbol comes from an ancient Chinese Philosopher Zhuang Zhou who believed that all things one could define came from one’s heart (brain). In this sense, this symbol means: “froin all things one can detinc.”
278
TAO YANG
2. Neighborhood system dynamics
-
3 . Synaptic weight dynamics S;(u)
0
Vs (cells within Ni(u))
(21)
where Dy(., ., ., .), VN(., ., ., .) and Vs(.)are three dynamic systems/or transformations. Remarks: 1 . Cell dynamics can be any dynamic system, function or functional, continuous or discrete-time, given a variable called sfute, a variable called input, and a variable which can be accessed by the other neighbor cells called ourput. 2. Neighborhood system dynamics consist of any fixed, time-varying, spacevarying, and movable organization rules. Even random connections can be used. The only elementary principle here is local cluster of a collection of cells with any possible manner and organization. 3. Synaptic weight dynamics only define the connection type within a neighborhood system. It may be a relation of matter transmission, chemical exchanges, or energy exchanges and any other physical or nonphysical relationships.
Definition 4. A unified CNN is defined by cell dynamics in Eq. ( 19), neighborhood dynamics in Eq. (20), and synaptic weight dynamics in Eq. (21). Given any central cell Ci and 0 < r < 00 the following neighborhood restriction should be satisfied: < dr(C;) (22) where d,(C,) denotes a kind of distance with respect to C,. Figure 2 shows the concept of this kind of unified CNN structure. In Fig. 2 the generalized time is labeled (1) and (2), however, it only denotes two “snapshots” of the dynamics. In a unified CNN model “u” may be continuous or discrete (or some cells are continuous while the other are discrete). In Fig. 2, every small circle denotes a cell and the black dot denotes cell C ; .The regions within the closed solid or dashed curves denote the neighborhood systems of C ; . The thin arrow curves denote the direction and form of information flow. Figure 2 shows that V N changes the shape of a neighborhood system and the number of cells within the neighborhood system at different moments. The Ds changes the directions of information flow and the connecting relationship between the central cell and its neighbor cells.
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
279
FIGURE 2. Unified CNN structure. In this figure only the neighborhood system dynamics (upper) and synaptic weight dynamics (lower) are shown.
B. Princi$es qf Generul FCNN
An FCNN structure should maintain two main features: 1) local connectedness between cells; and 2) simple cell structures and characteristics. A general FCNN is defined by the following definition.
Definition 5.
1. Cell dynamics
2. Neighborhood system dynamics
280
TAO YANG
Remark: This is a generalized case of the CNN structure presented in Sect. 11. A. In this definition, over a character means that the character may represent something related to fuzzy set, for example, fuzzy number, linguistic statement, and conceptual variable. However, this definition is too general to be implemented and applied to special applications. We use the following operational definition of FCNN to unify all existed FCNN structures. Here, only 2D cases are considered. Definitions are similar for ID and 3D cases.
"-"
Definition 6. A cell C;, in an M x N general FCNN is defined by: 1. State equation
-
conventional part
v
fuzzy part
2. Output equation
Remarks: 1. From Eqs. (26) and (27) we can see that this structure is a type-111 FCNN (for Definition see Sect. 11. C). It can be degenerated to type-I, -11 FCNNs and conventional CNN. The block diagram of an electronic circuit of a cell in the general FCNN is shown in Fig. 3. -
E,,
c)
1,,@CI:
j:R, '
(3
(3 f)1,~Wd) ...
Furry Operatioti F,
Fuzzy Operation FA
'1
Ix,,(i.j;k.l)
'4
... 1,&&ll
1'
(3 ::R,
f(X,,)
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
28 1
2 . The fuzzifier layer is embedded in the fuzzy part. The reason that we do not insert a separate fuzzifier layer is that we can choose simple membership functions p ~ , ( i , , ; k , ~ l (p. ) ,~( ; ,,j ; k , ~ ) ( . )p,,I, , (.), and P.~,, (.), which can be easily implemented with VLSI technologies. The fuzzy part consists of all nonlinear synaptic laws while the conventional part consists of all linear synaptic laws. However, we can not conclude that FCNN is only a kind of NCNN. Remember that the fuzzy number (i.e., a convex and normal fuzzy set on a real line) can propagate through an FCNN structure. For example, consider the following FDTCNN:
where the symbol ‘‘-” over a variable is used to denote that the variable is a fuzzy number. No conventional NCNN can implement this structure. 3. The membership function p i , J ( . i)n a conventional CNN is interpreted as an “output function.” Sometimes we would like to view it as a defuzzifier. For example, if we choose pLL,,,(.) as
PX,,( X I =
{
0, X I 0 1, x > o
then it defuzzifies all states into classical logic variables. 4. FA^^^^^,^ (., .) and F H ~ ~ , ~ ~ .) , ~can , , , be , ( simple ., operations on fuzzy sets, for example, union, intersection, algebraic product, algebraic sum, bounded suin and bounded difference. Also, they may be complicated operations, for .) example, similarity between two juzzv numbers. By the way, FAcIie,v,,,,Jl(., and F B ( . ~ , ~ ~ ,.)~may , , , ~also ( . , be any combination of the foregoing operations. They can use any number of entries in A , and B f and use any y k / and ukl in N r ( i , j ) . It is not necessary to keep the’tradition of the conventional CNN where every entry in A or B can multiply only a y k l or a uk/ because the relation between the (local) structure (synaptic weights) and the (local) information flow (inputs, state variables and outputs) is the only thing we are concerned about in an FCNN. ,,,]
5. The general frame given by Eqs. (26) and (27) can be easily generalized to delay-type FCNN (DFCNN) and discrete-time FCNN (DTFCNN).
282
TAO YANG
C. Classificationqf FCNN Fortunately, from 1974 on there has been much literature on fuzzy (artificial) neural networks (FNN) (Lee and Lee, 1974). According to the method presented by Buckley and Hayashi (1996), we can lump FNN into 3 types: 1 ) Type-1 FNN, which has real signal and fuzzy weight (Yamakawa, 1990; Yamakawa and Furukawa, 1992); 2) Type-I1 FNN, which has fuzzy signal and real weight (Jang and Sun, 1995; Shann and Fu, 1995); and 3 ) TypeI11 FNN, which has fuzzy signal and fuzzy weight (Ishibuchi, 1993; Hayashi et ol., 1993; Blanco et al., 1995a,b; Pedryez, 1991; Furukawa and Yamakawa, 1995). This classification can also be applied to FCNN. As FCNN is a generalization of CNN from classical set to fuzzy set, it is not strange to enable a generalization of almost all conclusions and applications of the conventional CNN. Corresponding to the classification presented in Sect. I. B. 2, the FCNN structures can also be classified in that way. With the ability to interpret linguistic statements, the fuzzy set theory embeds the ability to process linguistic inputs into FCNN. From this point of view, FCNN can model not only structures of neural systems, as the conventional CNN does, but also the behaviors and function of neural systems, namely, cognitive processes. From these statements one can see that the classification of FCNN can be done in both the CNN and the FNN direction. There exist some differences between FNN and FCNN. In an FCNN, the cell property is not necessary space-invariant, which means that FCNN has more possibilities than FNN. For example, we can define type-IV FCNN while there does not exist a corresponding type-lV FNN. A type-IV FCNN may contain all kinds of cells that belong to any type-I, -11, and -111 FCNN. This structure gives the type-IV FCNN the ability to process both signals (real numbers) within some neighborhood systems and more general information flow (e.g., conceptual or linguistic variables) within some other neighborhood systems. The type-I1 FCNN is the closest one to the conventional CNN because it has real weight. On the other hand, because type-I1 FCNN has the simplest structure for VLSI implementation, this is why most FCNN results are focused on this type. D. Different Structures of FCNN
Since it is impossible to study all kinds of FCNNs (as defined in Eqs. (26) and (27)) in a single section, we will focus on some simple cases in which only fuzzy logical OR ( q or MAX) and fuzzy logical AND (A or MIN) are integrated. On the other hand, MAX and MIN are the simplest fuzzy union and intersection operations that can be implemented by using VLSI technologies.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
283
The structure of FCNN is a tradeoff between VLSI implementation and general function. For the purpose of VLSI implementation, the FCNN proposed here integrates the fuzzifier, the defuzzifier, and the fuzzy inference engine into a planar structure. The nonlinear dynamics of the conventional CNN are kept in FCNN structure. In this section, we give structures of type-I and type-I1 FCNNs. Also, only 2D cases are given. 1. Type-IFCNN A cell
Ci, in an M x N type-I FCNN is defined by:
State equation
Output equation
Input equation u fJ = E l , , 1
5iI M, 1 I .j 5 N .
Constraint conditions
Parameter assumptions
Boundary conditions and initial conditions.
(33)
284
TAO YANG
In a type-I FCNN, thereexist fuzzy synaptic weightsA,(i, j ; k , / )a n d j f ( i ,j ; k , f ) . (In this section, we use the symbol ‘‘-” over a character to denote a fuzzy number.) The relation between a fuzzy feedback synaptic weight and an output is defined by the membership function ( j , j ; k ~, (/ y k / ) . The relation between a fuzzy feedforward synaptic weight and an input is defined by the membership function p B , ( i , j : k , ~ ) ( ~ ~The / ) . inputs and outputs are crisp variables in a type-I FCNN. Remark: Fuzzy synaptic weights introduce a set of nonlinear synaptic laws into a type-I FCNN. In general case, the concept “template,” which is very useful in conventional CNN (Chua and Yang, 1988a) is not suitable for describing fuzzy synaptic laws.
2 . Type-I1 FCNN A cell C,, in an M x N type-I1 FCNN is defined by:
State equation
Output equation
Input equation
Constraint conditions
Parameter assumptions
Boundary conditions and initial conditions.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
285
where pl,k/(.) and ~ . ~ i ; ( are . ) two membership functions. In a type-I1 FCNN, all synaptic weights are crisp. Inputs and outputs are supposed to be fuzzy. They are described by membership functions pLlkl(.)and ~ ~ i ; ( . Observe ) . that p , i , ; ( . ) corresponds to the output function in a conventional CNN. The preceding type-I1 FCNN is sometimes called multii>licutive type-I1 FCNN. Correspondingly, there exists an additive type-I1 FCNN whose state equation is given by:
A subclass of the additive type-I1 FCNN has been found to be a universal paradigm for implementing mathematical morphology operations (Yang and Yang, 1997d,e).
3. Simple Cuses of Type-I and Type-I1 FCNN In type-I and type-I1 FCNNs, F A ( . and ) F B ( . )denote two fuzzy local operators defined in N,.(i, j ) , which may be any fuzzy logical expression combined by fuzzy OR ‘‘G”and fuzzy AND “A”. For example, suppose F A ( . )denotes the following fuzzy logical expression in a 1 -neighborhood system:
where xi, denotes fuzzy variable (for example, the gray value of a pixel (i, j ) in an image). Then we have
286
TAO YANG
A simple and most commonly used type-I FCNN is given by
+ +
A A
CLA,,,,,,(r,j:k./)(Ykl) C U E N ,( i , j )
p H , ,,,,,,( i . j ; k , / ) ( u k / )
FB,,,,; , x ( ; , , ; ; k , / ) ( y k / )
C U E N , ( I .j )
\j
+
Cii eNrif.j)
1 (i(M,1
v
f
P E f ,,.i x ( i , j ; k , / ) ( u k / ) t
CLIE N ,( i .j )
(45)
5 , j ~ N .
If FCNN is space-invariant, then in view of the method used in conventional CNN (Chua and Yang, 1988a), Eq. (45) can be rewritten as the following 2D convolution form:
+ ;if r n i n O r n i n Y i j + 2.f m a x O r n a x Y i j + B f r n i n O r n i n u i j + B f m a x G n m x u i j > 15 i s M , 15 j i N (46) where * denotes a 2D convolution. The A, B are feedback and feed-forward templates, respectively, and Af ,,,in, A f B f !,,in, and B.j max are fuzzy feedback MIN template, fuzzy feedback MAX template, fuzzy feed forward MIN template, and fuzzy feedforward MAX template, respectively. O,,,,, denotes a 2D operation as shown in following example: maxOrnmllxYij
=
v (‘LiEN,
and
Omin
CLA,,,,,,(i,.j:k./)(Yx/)
(47 1
(Ji)
denotes a 2D operation as shown by: ifrnin6minYij
=
A
CLA,,,,,,,(i.j;k./)(Vk/).
(48)
(‘L/EN/u.;)
From the preceding one can see that fuzzy templates and 6 f are local fuzzy patterns that are fuzzified for the purpose of fitting a more loose relation between fuzzy templates and signal patterns. A simple and most commonly used type-I1 FCNN is given by
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
287
where A.f Af Inax. B,f ,,,in, and B f Illax are feedback MIN template, feedback MAX template, feedforward MIN 'template, and feedforward MAX template, respectively. The Olnaxdenotes a 2D operation as shown in the following example: ~ f r n a x0n1ax
Yij
=
\s/
~flljax(i,j ; k , f ) Y k /
(51)
C U E N ,(1. j )
and Omindenotes a 2D operation as shown in the following example: A f m i n ~ m l Y,,/ n =
/I
~j
min(i9 j ; k , [)YL/.
(52)
C u m , ci..;,
E. Diferences between FCNN und FNN The FCNN structure differs significantly from other FNN structures. In an FCNN, the fuzzifier layer and the defuzzifier layer, which always appear in a standard FNN structure, are embedded into a single layer. This planar structure is mostly suitable for 2D VLSI implementation because links between two chips are avoided. Thus an FCNN universal cell should be programmable for different membership functions and allow some basic programs of fuzzy operations (relational computation). The most significant characteristic of FCNN is the local connectedness of cells. Any FNN structure that is based on local connectedness will fall in the range of the concept of FCNN. To understand the differences between FCNN and FNN, we first show how we can get FCNN structures from the crossover of FNN and CNN. A typical FNN has three layers as shown in Fig. 4a. The first layer is used to give crisp
288
* TAO YANG
Glohal Connections Between Layers
B Input
- u u
88
memhership functions
I
000 0 v A
fuzLy operetlons
@ i-th defuziified function (a)
@ input function @ dynamics of cell
I
(h)
FIGURE4. The FCNN structure is a crossover of the conventional CNN and FCNN. (a) Typical FNN structure. (b) Typical conventional CNN structure. (c) The structure of FCNN. (d) The planar topological structure of a cell in FCNN for easy VLSI implementation.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
I
289
1.ocal Caoncclmns Inwlc thc h y c r
/-- b
function
inputs some fuzzy measurements. The nonlinearities in the neurons of this layer are some membership functions. The second layer is used to calculate the relationship between different fuzzy variables from the first layer. Fuzzy computations are embedded into the nonlinearities of this layer. The third layer is used to give some crisp forms of outputs. If the inputs are already fuzzy
290
TAO YANG
variables (e.g., the gray value of a pixel in a digital image), then the first layer can be eliminated. Also, if we need some fuzzy outputs (e.g., outputs that are used by a high-level A1 system), then the third layer can be eliminated. Figure 4b shows the typical structure of a signal layer conventional CNN. Observe that this single layer CNN contains an input function sublayer, a cell dynamics sublayer and output function sublayer. In a conventional CNN, the input function is usually a linear function and the output function is a piecewise linear function. Of course, we can use more complicated nonlinearities as input and output functions. Then we are in the situation of showing how to combine the structures in Figs. 4a and b into FCNN, which is shown in Fig. 4c. By using different membership functions as input functions, we embed the fuzzifier layer of FNN into the input function sublayer of FCNN. We combine the cell dynamics of CNN and the fuzzy operations of FNN into a fuzzy-crisp mixed dynamical sublayer that contributes FCNN cell dynamics. The fuzzy-crisp mixed dynamics make our FCNN structure solve both crisp and fuzzy problems. Finally, we choose the nonlinearities of the defuzzifier layer of FNN as the output functions of FCNN. The output function sublayer is equivalent to the defuzzifier layer of FNN. Of course, FCNN structures carry out the properties of local connectedness from CNN. Figure 4d shows that the sublayers in Fig. 4c actually can be fabricated onto a planar silicon chip. In this structure a region in each cell is used to fabricate a photodiode as an embedded sensor. Then the fuzzifier, the fuzzy inference engine, and defuzzifier are fabricated in different subregions within the cell.
111. THEORY OF Fuzzy CELLULAR NEURAL NETWORKS
In this section, we study the elementary theory of different kinds of FCNN. We always present the results for type-I1 FCNN first and then present the corresponding results of type-I FCNN.
A. Elem entury Theary
In this section, we study the dynamical range of type-I and -11 FCNN. The dynamical range is important to a physically implemented FCNN because we know the dynamical range and then we can choose the power supply and physical structures of FCNN. Also, the existence of an equilibrium point for type-I and -11 FCNN is needed to guarantee correct operation of FCNN.
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
291
I . Dynamicul Rmge of Type-I1FCNN To guarantee that FCNNs can be implemented by physical systems, we should study the dynamic range. In this section, we study the dynamic range of type-I1 FCNN in Eq. (49). First, we need the following definition.
Definition 7. Dissipative FCNN: Let E be a compact set in !IIMN; if all solutions of an FCNN finally fall into E and stay in E , then this FCNN is called a dissipative FCNN. Remark: Let
The dynainical range of a dissipative FCNN is E .
i
when(k, I ) = (i, j ) and A(i, j ; k, 1 ) 5 0 when(k, I ) = (i, j ) and A(;, j; k, I ) > 0, or, (k, 1 ) # ( i , j ) (53) then we have the following theorem. ''I
=
0, I,
Theorem 1. The type-11 FCNN in Eq. (49)is u dissiputive FCNN and all its solutions with any initial conditions x, ( 0 )jfinally fall into the jbllowing conipuct set:
,
whcre
292
TAO YANG
where x = cof (XI 1 ,
. . . ,X M N ) .
Proof ( 1 ) We construct a radially unbounded positive definite function VI =
. 1
c M
N
Differentiate V I along the solution of Eq. (49), and as lylJl I and I p u ( u I J ) I I 1, we have
1.I
J
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
293
The last inequality is satisfied when x E S2,. (2) We then construct MN radially unbounded Lyapunov functions with respect to the state variable x;, as follows:
where sgn(.) is the signum function. Along solution of Eq. (49), we calculate the Dini upper-right differential as
294
TAO YANG
the last inequality is satisfied when x E !&. Thus, the solution of Eq. (49) will fall in !NMN/C2z,and fall in S and stay in S . If x(0) E E, then x ( f ) E 3 , V t > 0. So, S is an w-invariant set, in W " N / S there exists no stable equilibrium point of the FCNN in Eq. (49). 0
Remark: This theorem gives the dynamical range of an FCNN; in practical circuit design, we can choose the correct parameters to guarantee that the FCNN can work in the typical voltage range of power supply in IC circuits. The following theorem guarantees that the FCNN in Eq. (49) has at least one equilibrium point.
Theorem 2. The FCNN in Eq. (49) has
nt
Consider the following vector operator:
where
least one equilibrium point.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
Q
z
295
I+ v max
+
R.,
i/
IBf m,tn(i.j ; k , 111 Q,
1)
IBj rn,x(i,j ; k , l)1
Ci/ E N , ( , . , 1
CIlEN,(I J )
Then the vector operator
v
+
maps the following set
S = {x(x,,( 5 Q , 1 5 i 5 M , 1 5 j 5 N }
. (65)
(66 1
into itself. As S is a convex compact set, from Brouwer’s fix-point theorem, we know that Q, : S H S has at least one fix-point x = x * . And x * is an 0 equilibrium point of FCNN in Eq. (49). 2. Dynamical Range of Type-I FCNN
Theorem 3. The type-I FCNN in Eq. (45) is a dissipative FCNN and any of its solution with any initial condition x,, (0)will full into the following compact set: !vN/QI !liMNp2 (67)
A
where
n
296
TAO YANG
where x = col(xl1,. . . , X M N ) .
Proof (1) We construct a radially unbounded positive definite Lyapunov function l
v ,= 2c
-
cc M
N
.
l=l
XI',
J=I
(70)
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
< 0.
297
(71)
The last inequality is satisfied when x E 2 ' I. (2) We then construct M N radially unbounded Lyapunov functions with respect to the state variable
where sgn(.) is the signum function. Along the solution of' Eq. (43, we calculate the Dini upper-right differential L1S
298
TAO YANG
the last inequality is satisfied when x E S22. So, the solution of Eq. (45) will and fall in E and stay in 8 . If x(0) E E,then x ( t ) E E,Vr > 0. fall in RMN/S22, So, 8 is an w-invariant set, in R M N / Ethere exists no stable equilibrium point of the FCNN in Eq. (45). The following theorem guarantees that the type-I FCNN has at least one equilibrium point.
Theorem 4. The FCNN in Eq. (45) has at least one equilibrium point. Proof
Letting the right-hand side of Eq. (45) be 0, then we have
Consider the following vector operator:
where
and x = col(xl I , . . . , . x M N ) ,
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
299
Let
Then the vector operator
4,
maps the following set;
into itself. Since S is a convex compact set, lrom Brouwer’s fix-point theorem, we know that 4, : S H S has at least one fix-point x = x*. And x* is an 0 equilibrium point of FCNN in Eq. (45). B. Clobul Stability
1 . Results ,for Type-ll FCNN As each pixel of input image can be viewed as a.fuz2.y singleton, we can choose pL,(.)as p.,,(x)= x, then the type-I1 FCNN in Eq. (49) can be rewritten as
Output equation of C ; ; is given by
Constraint conditions are given by
300
TAO YANG
Parameter assumptions are as follows:
In state equation (79), if no fuzzy logical relation exists between two cells C;,, and Ckl, then we say that the fuzzy connections between them are rinnexistent, or else we say that the fuzzy connections between them are existent. We only study the FCNN with flat fuzzy feedback MIN templates and flat fuzzy feedback MAX templates. A ,flat fuzzy feedback MIN template is defined as A , lnill(i,j ; k, I ) = a, VCkl E N r ( i , j ) and A f m l l , ( i j, ; k, 1 ) are existent
(83)
where a is a constant. A flat fuzzy feedback MAX template is defined as A , nlax(i, j ; k, I ) = j?, vch/ E N r ( i , , j ) and A f nlax(i,j ; k, 1 ) are existent
(84)
where j? is a constant. Then Eq. (79) can be rewritten as
where
a , .' J
-
=
{ {
:ndefined, {ndefined ,
if corresponding A , m,n(i.j ; k , I ) is existent. if corresponding A f m i l l ( i ,j ; k, I ) is nonexistent. (86) if corresponding A f m a x ( i j, ; k, I ) is existent. if corresponding A,,nax(i, j ; k , I ) is nonexistent.(87)
From parameter assumptions in Eq. (82), we have
We then have the following proposition:
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
Proposition 1. Suppose that x and x’ are two states of FCNN in Eq. then we have (1)
301
(as),
302
TAO YANC
then we have the following theorem.
Theorem 5. Suppose that the spectral radius of matrix R, [ A1, p(R, [ AI ) < 1 , then the type-lI FCNN in Eq. (85) has only one equilibrium point, and this equilibrium point is globally stable.
Proof The existence of equilibrium point of FCNN in Eq. (85) is guaranteed by Theorem 2. Now we only need to prove that the FCNN has less than two equilibrium points. Let the right-hand side of Eq. (85) be 0, we have
( 1 ) N ) and x ( ~ = ) c01(x;~),. . . , xzh) be two solutions Let x(') = col(xj", . . . , x M of Eq. (85), then we have
MN
The second inequality is in view of Proposition 1.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
303
We then rewrite Eq. (99) into a vector form as follows:
As p(R.,IAI) < I , we have
We then have x(I)
- (2)
-x
3
which yields that the FCNN in Eq. (85) has only one equilibrium point, x*. As x* is the only equilibrium point, following Eq. (85) we have
As p(R,IAI) < 1, ( E - R,IAI) is an M-matrix, where E is the unity matrix. So, there exists a group of positive constants, p, > 0, i = 1 , 2 , . . . , M N , such that MN
j = 1 , 2,..., M N . We construct the following Lyapunov function: 1
V(x) = C
c MN
,IjlX,
/=I
I ; .-
> 0.
304
TAO YANC
When x = x*,we have V(x) = 0. When Ix, - xgl + +m, we have V(x) -+ +m.
Along the solution of Eq. (103), we calculate the Dini upper-right differential of V(x) as D+ v (x ) I&.(
103)
(106)
< 0.
The second inequality is in view of Proposition 1. The second equality is in view of the parameter assumption in Eq. (88). The last inequality is satisfied 0 when x # x*. Similarly, we can have the following theorem.
Theorem 6. Suppose that the ,following matrix
is a Hurwitz niutrix, then the equilibrium point x = x* is globally stable. Here s;j
=
{
1 ,i=j 0 ,i#j.
(107)
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
Proof
305
Since
MNxMN
+ ( ( l - 6 ; / ) ( l " i ; l + Ia;,/I+
IBi,jI))MNxMN
is a Hurwitz matrix, we have that
+ + Ia;;I + lBiiI + Ilai;l + ~i;,
- ((1
- 6ij)(lLli.;I
MNxMN
IB;;I))MN~MN
is an M-matrix. Then, from properties of M-matrix, there exists a group of positive constants. p i > 0, i = I , 2 , . . . , M N , such that I
PJ
(-g
+ la.;.iI + lb;./I
+"././
- 6f/)(lcl;jI
f Ia;,;I
+
IBi/I)
< 0,
i= I
j = 1, 2 , .. . , M N .
(108)
We construct the following Lyapunov function: , MN
(109)
where sgn(.) is the signum function. When x = x*, we have V(x) = 0. When Ix, - XTI + +m, we have V(x) -+ +m. Along the solution of Eq. (103), we calculate the Dini upper-right differential of V(x) as D+V(X)lE,.
(103)
306
TAO YANG
MN
+ C pi(l
-
1
~ ~ J ) ( I ~+i IaijI j I + I B ~ ~ ~I ). f ( x . j-) .f(x7)1
i=I
< 0.
( 1 10)
To get the first inequality, observe that u,jj(f(xj)-
.f(x,*))sgn(x,; -
= a,j;If(x;)- .f(xf)l.
(111)
The second inequality is in view of Proposition 1. To get the third inequality, observe that -Ixj -.TI F -I,f(x;) - .f(xy)I. ( 1 12) The second equality is from the parameter assumption in Eq. (88). The last inequality is satisfied when x # x * . I?
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
307
2. Results ,f.r Type-I FCNN We recast the type-I FCNN i n Eq. (45) into
( 1 13)
We need the following proposition
Proposition 2. Assuine that for thrrt
then we have
two points
x(') and
x ( I ) there
exists ci k such
308
Proof
TAO YANG
1. Assume that there exists an h such that
We have
2. Assume that there exists an h such that
We have
then we have the following theorem.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
309
Theorem 7. Suppose that the spectral radius of matrix R, IAl, p ( R , [ AI) < 1, then the type-1 FCNN in Eq. (113) has only one equilibrium point, and this eyiiilibriim point is globally stable. Proof The existence of equilibrium point of FCNN in Eq. (1 13) is guaranteed by Theorem 4. Now we only need to prove that the FCNN has fewer than two equilibrium points. Let the right-hand side of Eq. ( I 13) be 0, we have
I j Let x(’) = col(xl( 1 ) , . . . , x (M N and ) x ( ~=) col(x,( 2 ), . . . , x (2) M N )be two solutions of Eq. ( I 13), then we have
MN
MN
j= I
310
TAO YANG
The second inequality is in view of Proposition 2. The rest of the proof is 0 similar to that of Theorem 5. Similarly to Theorem 6, we have the following theorem.
Theorem 8. Suppose that the following matrix
is a Hunvitz matrix, then the equilibrium point x = x* is globully stable. Here Jij
=
{I,
i=j 0, i # j
C. Local Stubility 1. Results for Tvpe-IIFCNN
In the FCNN as in Eq. ( 8 5 ) ,the state variable of each cell, x,(i = 1 , . . . , M N ) , can stay in three different intervals: (-m, --I], ( - I , 1) and 11, GO), which correspond to three different cell outputs: -1, x,, and 1 . So, the state space of the FCNN can be divided into 3 M N separated regions, D,(i = 1, . . . , 3MN). Each D, is an M N -dimensional hypercube. Suppose that x* is an equilibrium point of the FCNN in Eq. (85), and it is an inner point of Dk. Then there exists a neighborhood of x*, which is in the interior of Dk. Let
be the biggest hyperball in Dk. We can rewrite Eq. (85) in G as follows: d (Xi
-
x; )
dt MN
MN
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
31 1
then we have the following theorem.
Theorem 9. Suppose that the @lowing matrix
is u Hurwitz matri.x, then the equilibrium point x = x* is asymptotically stuble in the basin of cittraction G. Here
{ 1,
i=j
&; = 0, i # j
(131)
und
Proof
Since
is an Hurwitz matrix, we know that
is an M-matrix. Then, from properties of M-matrix, there exists a group of positive constants, pi > 0, i = 1, 2 , . . . , M N , such that
312
TAO YANG
We construct the following Lyapunov function: I
MN
where sgn(.) is the signum function. Along the solution of Eq. (1 30), we calculate the Dini upper-right differential of V ( x ) as D+ V(X) I EL,.( 130)
MN
r
MN
r
MN
.
MN
3 13
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS MN
MN
MN
+ CPi(1
- Ji,,)"(X,*)
+
(al.; IaijI
i= I
< 0.
I
+ I P ~ ~ IIxj) -.TI (135)
The second inequality is in view of Proposition 1. The last equality is from the parameter assumption in Eq. (88). The last inequality is satisfied when x # x* and x in hyperball G. So, G is the basin of attraction of the equilibrium point x*. 0
2. Resultsfor Type-I FCNN Similarly to Theorem 9, we can get the following theorem for type-I FCNN.
Theorem 10. Suppose thtrt the Jollowing matrix
+ ((I
- ~IJ~'(x~)(~aIJ~
+ a;":+ B I J ) ) M N x M N
(137)
is u Hurwitz matrix, then the equilibrium point x = x* is asymptotically stable in the btisiri o j attraction G. Here
arid
D. Type-I1 Delay-Type FCNN An M x N type-I1 DFCNN is described by the following state equation: C-
dxi,;( t ) dt
314
TAO YANG
We repack the state variables x,, into a vector x of size n = M N . Similarly, the input and output variables u,, and yf, are repacked into u and y using the same labeling order. The initial conditions for an DFCNN are given by
We assume that xo;;(t) is a continuous function. Then we recast the state equations (140) into the following functional differential equations (FDE):
Ck = C F ( t , x,)
x, E C, is defined as X,(O); = X ( t
+ el;,
0E
[-t,01.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
3 15
1 . Existence and Uniqueness o j Solutions
Proposition 3. Given the initial condition xo(t) = @ ( t ) , @ ( t )E then the DFCNN in Eq. (142)has
N
c,
unique coritinuous solutionfiw t
( 1 44) E
[0,00).
Proof we need to show that Eq. (142) has a unique solution. First we show that F ( t , x,) is globally Lipschitzian, that is, IF(t, $) - F ( t , @)I 5 Lly? - @I
for all
y ? , @ E C , and all t
(145)
for some constant L. If we define
then L qualifies as our Lipschitz constant. As input is continuous, F ( t , y?) is continuous with respect to r for all y?. The conclusion then follows from work by Driver (1977, pp 308-309). 0
Proposition 4. v t h e initial conditions are bounded by K > 0, theti all states x,, .f the type-I1 DFCNN in Eq. (140) ure bounded ,f.r all time in ubsolute value by the suni M = K+R,III+R,max
x ( l A r ( i , j ; k , l)l+lA(i, j ; k , l)l+lB(i, j ; k , 1 ) l )
316
TAO YANG
Proof It is sufficient to follow the Proof of Theorem 1 by Chua and Yang (1988b) to see that in this case it is also possible to recast the equations of the network in the same form as that of their Eq. (4a), as follows here: (148) where f ; ; depends only on y k / ( f ) and j ' k / ( t - r ) and g i j depends only on the inputs and the bias, and for both it is possible to compute an upper bound in the same way as was done by Chua and Yang (1988b). 0
2. Stcihility Results Given two points x,x* E DV""'", and the following function, C : !)IMN A such that for C(x - x * ) = (a(x; which is defined by A
n(x, - xT) = f ( x , )
-
H
!)IMN,
f(x,F)
A
Suppose x* is an equilibrium point and let w = ( w ; } M N ~=~ x Eq. (140) can be rewritten into
-
x*, then
Also, we study the stability of the type-I1 DFCNN with flat fuzzy templates. Similar to that in Proposition 1 , we have the following proposition.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
3 17
Proposition 5. Suppose x and x' are two solutions of type-11 DFCNN in Eq. (150), then we have (1)
- MN
- MN
Proof Let the right-hand side of Eq. (150) be zero, then we have the corresponding equilibrium equation as
318
TAO YANG
At the equilibrium point, we have
then we can recast Eq. ( 1 54) into
then we have
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
3 19
In view of la(x) - a(y)l 5 Ix - y l , and using a process similar to that in the proof of Theorem 5, we complete the proof. 0 Let H = (
~ ; ~ ) M N ~ satisfy M N
Then we have the following theorem.
Theorem 12. The origin of E4. (150) i s globally asymptotically stable i f H is a nonsingular M-matrix. Proof
We construct the following Lyapunov function:
where pi > 0, i = I , 2 , . . . , M N , are constants. Along solution of Eq. ( 1 50), we calculate the Dini upper-right differential of V(w(t)) as
3 20
TAO YANG
(160) In view of Io(wi)l 5 Iwi( and using a process similar to that in the proof of Theorem 5 and in view of parameter assumptions in Eq. (124), we have
The last inequality is satisfied when w # 0.
E. Type-I DFCNN An M x N type-I DFCNN is described by the following state equation:
32 1
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
Similarly we have the following two propositions:
Proposition 6. Given the initial condition
then the DFCNN in Eq. (162) has a unique continuous solution for t
E
[O,
00).
Proposition 7. If' the initial condition ure bounded by K > 0, then all strites x I Jofthe type-I DFCNN in Eq. (162)are boundedfor all time in absolute value by the sum M =K+R,III
+ 6R,
(164)
and the w-limit points o f x ,I ( t )are bounded in obsolute value by ( M - K). By repacking x = [ Eq. (162) as
X , ~ } M ~into N
a ID vector x = { X , } M N ~ Iwe , can rewrite
322
TAO YANG
Suppose x* is an equilibrium point and let w = ( Eq. (165) can be rewritten into
A w ; ] M N ~= ~
x - x*, then
Similar to that in Proposition 2, we have the following proposition.
Proposition 8. Assume thut f o r two points x i ' ) und x ( ~there ) exists u k such thut
then
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
323
Similarl.y, assume that there exists an 1 such that
thetz we have
Proof
It is the same as those in Theorems 7 and 1 I .
0
Letting H M = ( / Z , ~ ) M N ~ Msatisfy N
then we have the following theorem.
Theorem 14. The origin of Eq. (166) is globally asymptotically stable if H M is a rionsingulur M-matrix.
324
Proof
TAO YANG
Similar to that of Theorem 12.
0
F. Stability of Discrete-Time FCNN Discrete-time fuzzy cellular neural network (DTFCNN) is a very important branch of FCNN. The DTFCNN is governed by a set of difference equations. In this section, we present the structures of DTFCNN and provide some stability criteria for them. The dynamics of a cell C,, in an M x N DTFCNN is given by: 1. State equation
where t E N is the discrete-time; uij, xi,; and yi, are input, state, and output, respectively. 2. Output equation yij(f) = f(xij(t)). (179)
3. Parameter assumptions Al (i, j ; k, 1 ) = A1 (k, I ; i , j ) , AZ(i, j ; k, I ) = A2(k, I ; i , j ) .
,x:
(1 80)
Let x:, be an equilibrium point of the DTFCNN and letting e;,;(t)= xi,([) we can recast Eq. (178) into
325
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
Similar to Corollary 1 in Yang and Yang (1996), we need the following proposition.
Proposition 9. Let ( x , , ~ mid ] [x,,~)he two states o j DTFCNN in Ey. (178), then we have
(183)
Proof
Similar to that of Corollary 1 from Yang and Yang (1996).
Theorem 15. The equilibrium point qf the DTFCNN in Ey. (178), [x:,], asymptotically stable if
where L > 0 is a constant.
Proof
We define a Lyapunov function as M
N
0 is
326
TAO YANG
Taking the forward difference of V along the solution of Eq. (181) we get
In view of Proposition 9, we have
Observe that if
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
327
then A V is negative, which implies the asymptotic stability of the equilibrium point { x:~]. With the symmetric property of the neighborhood system and the parameter assumptions, we can recast Eq. (190) into
0 DTFCNN can be used in some typical applications of FCNN. Furthermore, DTFCNN can also be used to model some discrete-time and spatial-distributed phenomena such as highway traffic flow (Yang et al. 39983).
IV. FCNN
AS
COMPUTATIONAL ARRAYS
In this section, we present those FCNN structures that function as computational arrays. As the most commonly used interpretations of fuzzy AND and fuzzy OR are minimum and maximum calculations and as mathematical morphology (Serra, 1982, 1988; Heijmans, 1992; Haralick et al., 1987) is closely connected with fuzzy logic, FCNN is a paradigm for implementing morphological operators. The applications of FCNN as computational minmax networks are presented in this section. We show that FCNN can function as low-level computational structures just as conventional CNN does. The advantage of applying FCNN to image processing problems is that type-TI FCNN can implement max and min operations in a more natural and efficient way than the conventional CNN does. A. Basic Knowledge o f Muthematical Morphology
Mathematical morphology (Serra, 1982, 1988; Heijmans, 1992) is a theory that deals with processing and analysis of image, using operators and functionals based on topological and geometrical concepts. During the last decade, it has become a cornerstone of image processing problems. Morphological operations have been widely used for object recognition (Shih and Mitchell, 1988 a, b), edge detection (Lee et al., 1987), shape analysis (Pitas and Venetsanopovlos, 1990), thinning (Jang and Chin, 1990), image coding (Goutsias and Schonfeld, 1989; Maragos and Schafer, 1986), and smoothing (Jang and Chin, 1989). Four basic transformations in mathematical morphology include, dilation, erosion, opening, and closing. These basic transformations permit extraction
328
TAO YANG
of contours and skeletons, separation of close objects, and computation of geodesic distances, etc. (Serra, 1982, 1988; Heijmans, 1992). The basic idea of mathematical morphology is to probe an image with a structuring element and to quantify the manner in which the structuring element fits (or does not fit) within the image. In general, the structuring element has a simple shape and is very small compared to the image being investigated. We let f : X I-+ E and s : S H E be maps representing image and structuring element, respectively, where E is the range of gray values, X is a gray-scale image, and S is a weighted structuring element. Then the basic morphological operations of erosion and dilation for gray-scale images are given by Haralick er al. (1987) Grav-scale Erosion
+
for all z E S and x z Gray-scale Dilation
E
X.
for all z E S and x - z E X. With the definitions of gray-scale erosion and gray-scale dilation, gray-scale opening and gray-scale closing are given by Gray-scale Opening XOS = (X€3 S ) CB s. (194)
Gray-scale Closing XOS= (XCBS)~S.
(195)
For the purposes of implementing gray-scale morphological operations by FCNN, E is normalized within LO, 11. For example, let S be within a 3 x 3 square with its origin located at center as follows: hi h2 h3 S = h4 h5 hg (196) [ h , hg hg1
-
where hl hg denote gray values of corresponding entries of the structuring element. An entry in a structuring element is dejined if there exists an operation, is undefined if there exists no operation (or simply set an undefined entry as fco in erosion and as -m in dilation). A structuring element whose defined entries have the same gray value is called afZat structuring element.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
329
B. Iinplenierztation of Morphological Opercitions As type-I1 FCNN is a combination of min and max operations with parallel dynamics, it is very convenient to implement morphological operations in type-I1 FCNN. Another reason for using FCNN in mathematical morphology is that given a relatively small structuring element the morphological operations have strong local properties. And a big structuring element can sometimes be decomposed into a set of smaller size structuring elements. This makes possible the applications of FCNNs with 3 x 3- or 5 x 5-neighborhood to image processing problems where large structuring elements are needed. The FCNN is found to be a universal parallel array to implement morphological operations for processing both binary and gray-scale images (Yang and Yang, 1997 d, e). I n this section, we use different FCNN structures to implement the basic morphological operations. Although the results presented in this section are based on FCNN, the corresponding DTFCNN structures can also be used. 1 . Using Multiplicative Type-I1 FCNN
The following multiplicative FCNN' (Yang and Yang, 1996) is used to implement a morphological operator with a flat structuring element:
The parameters for implementing erosion with a flat structuring element are given by K,,= I , I = -11, B f .,,,i{X = 0, B f ",in = S' (198) where h is the height of the flat structuring element S , S' is given by substituting the defined entries in S by Is. The parameters for implementing dilation with a flat structuring element are given by K, = I , I = 11, B f = Sb, B + =0 (199) where Sh is given by substituting the defined entries in S,) by Is and S,) = (-x : x E S ] . An FCNN is callcd niu//i/~/ictr/ivr if i t has multiplicative fuzzy synaptic laws. An FCNN is called d d i / i v r if it has additive f'tiz7y synaptic laws.
330
TAO YANG
For example, letting S be that in Eq.( 196), then SL, is given by
As an example, Fig. S shows output results of the forementioned two FCNNs with the following zero-height flat-structuring element:
s'=
I: 1 :I 1
1
1 ,h=0.
The parameters for the erosion FCNN are given by
The output of this erosion FCNN is shown in Fig. Sb. The parameters for the dilation FCNN are given by
The output of this dilation FCNN is shown in Fig. Sc. As defined, a grayscale opening can be implemented by an erosion FCNN followed by a dilation FCNN. The result of FCNN-based flat opening is shown in Fig. 5d. A grayscale closing can be implemented by a dilation FCNN followed by an erosion FCNN. The result of FCNN-based flat closing is shown in Fig. 5e.
2. Using Additive Type-I1 FCNN The following additive FCNN is used to implement erosion and dilation with any structuring element besides flat ones (Yang and Yang, 1997 d, e; Yang et ul., 1996 d):
The additive FCNN for implementing erosion has the following parameters: R , = 1 , BfI,,;,x= undefined, B ,
= -S.
(20s)
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
33 1
(K)
t:lGuw 5. FCNN-based mathematical morphological operations with zero hcight 3 x 3 full scale Hat structuring element. (a) Original image. (b) Output of erosion FCNN. (c) Output of dilation FCNN. (d) Output of FCNN-based opening. ( e ) Output of FCNN-based closing.
We call the preceding FCNN an erosion FCNN. Image X is its input and its initial state is arbitrary. When we say a template is “undefined,” it means that lhe template is not used by the FCNN. The additive FCNN for implementing dilation is given by
R, = 1, B ,
= undefined, B f
= SD.
(206)
332
TAO YANG
(d)
(c)
FIGURE6. Implementation of gray-scale morphological operations using additive type-I1 FCNN. (a) Output of erosion FCNN. (b) Output of dilation FCNN. (c) Output of pray-scale opening. (d) Output of gray-scale closing.
We call this FCNN a dilation FCNN. Image X is its input and its initial state is arbitrary. Figure 6 shows examples of implementing basic morphological operations using additive type-I1 FCNN. The structuring element is given by
s=
[
0.02 0.1 0.02 0.1 0.2 0.1 0.02 0.1 0.02
1
The original image is the same as that in Fig. 5a. Figure 6a shows the output of the erosion FCNN. Figure 6b shows the output of the dilation FCNN.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
333
Figure 6c shows the output of the dilation FCNN. Thus, it is the opening of the original image. Figure 6d shows the output of erosion FCNN when the image in Fig. 6b is the input. Thus it is the closing of the original image. Shortly after its invention (Yang and Yang, 1996), many applications of FCNN-based matheinatical morphology were found in image processing. Some typical applications include gray-scale reconstruction (Yang and Yang, 1997e), Euclidean distance transformation (Yang and Yang, 1997d), fuzzy shrinking and expanding (Yang et al., 1998g), medial axis transformation (Yang et a/., 3998g), and edge detection under low SNR conditions (Yang and Yang, 39970. The FCNN-based mathematical morphological operators were also proved to be more robust and reliable than those based on conventional CNNs (Yang e t d . , 19980. A comprehensive survey on applications of FCNN to image processing can be found in a technical report by Yang et al., (1997i). C. MIN/MAX C N N
MINMAX CNN, which consists only of local MIN and MAX operations, is a special case of type-I1 FCNN by eliminating either the multiplication or addition operation between the weights and the inputs (respectively, outputs). The state equation of a discrete-time MINMIN CNN (MMCNN) is given by x ( i , j ) = kl min u ( i ~IENI
+ k , j + I ) + k2 max u(i + k , j + I ) , kl En/.
1 l i s M , l 5j < N .
(208)
Nj and N2 in Eq. (208) are two spheres of
injuence, henceforth called neighborhood patterns, which give the arguments of MIN and MAX operators, respectively. N I and N2 may be the same or different. Figure 7 shows three typical neighborhood patterns, which can also be respectively represented in the following forms (-1,
-1)
(1, -1)
(-1,O)
(1,O)
(-1,
1)
(1, 1 )
where (.,.) denotes the Cartesian coordinate (with respect to the center) of the artificial synapse, and the symbol “0” denotes that the weight (a PN junction) does not exist (for nonprogrammable chip) or is off (for programmable chip) at the indicated position. Because comparing operations are much easier to implement than arithmetic operations given the same accuracy level, and as local MIN and MAX
334
TAO YANG
(a)
(b)
(c)
FIGURE7. Some typical neighborhood patterns used in MMCNN.
operators are widely used in image processing based on gray-scale mathematical morphology operations, MIN/MAX CNN has a very high silicon area efficiency and yet performs many primary image preprocessing tasks. A schematic circuit implementation of the local MIN and MAX operations is shown in Fig. 8a and b, respectively. A possible CMOS current-mode MAX circuit is given in Fig. 8c (Baturone et al., 1997), which shows that much fewer transistors are needed. As the MMCNN chip is technically simple yet highly efficient, its functions should be specially designed for different image processing tasks. In this section we design some typical image processing tasks for MMCNN chips. There are two kinds of design methods. The first is an exact design method, which we apply for mathematical morphology operations, rank filters, and range operations. All of these operations use M I N M A X operators. The second
Max u(i+k,j+l) N?
Min u(i+k.j+l) Nl
(C)
FIGUIE8. Circuit implementation of the MIN and MAX operations. (a) The schematic circuit of the MAX operation. (b) The schematic circuit of the MIN operation. ( c ) A practical CMOS current-mode MAX circuit.
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
335
is an approximate design method, which we apply for Laplacian kernels, averaging operations, and orientation derivatives. The nature of these operations is totally different from the MIN/MAX operators. The additional optimization needed in the second class is implemented by a learning algorithm presented in Sect. IV. C. 6. By choosing kl = 1 and k2 = 0, an MMCNN becomes an erosion network, whose structuring element is given by Mi (the domain). Similarly, by choosing k l = 0 and k2 = I , an MMCNN becomes a dilation network, whose structuring element is decided by Nr (reflected with respect to origin). 1 . Des igriirig M M C N N ,for Approximciting Laplcician Operator
A 3 x 3 Laplacian operator is given by -1
-1
-I
The corresponding neighborhood patterns of the MIN and MAX operators in the corresponding MMCNN, henceforth called a ,fidl-.scale rzeighhorhod pcittrm, are given by
N , =N2 =
(-1. - 1 ) (0-1)
i
(],-I)
(-1,O) (-1,O) (I,0)
(-1, 1) (0,l) (I,])
}
.
(210)
We search around the kl - k2 plane for the minimum regions for the error between the output of the MMCNN and the Laplacian operator. Based on our knowledge, we know that we should search in regions where kl < 0 and k2 > 0. In Fig. 9 we show the searched results. The corresponding output for the Laplacian operator is shown in Fig. 9a. Figure 9c shows the error surface with kl E (-4, 0) and k2 E (0,4). This region is divided into 50 x SO small rectangular regions. The minimum value is found at the point ( k l , k 2 ) = (-0.72,0.72) with E = 505.946899. The corresponding output of the MMCNN is shown in Fig. 9b. Comparing the results in Figs. 9a and b we find that the latter is a "blurred" (lowpass filtered) version of the former. This can be confirmed from the rather big error E in this case. 2. Designing M M C N N for Approximating Averuging Operator A 3 x 3 averaging operator is given by 119 119 119
119 119 119
119 1/9 119
336
TAO YANG
(c)
FIGURE 9. The distribution of the erro between the output of the MMCNN and the Laplacian operator with different kl and kz values. All images in this figures are shown in negative exposure. (a) The output of the Laplacian operator. (b) The output of the MMCNN with minimum error. (c) The error distribution on the k~ - kZ parameter plane.
The corresponding neighborhood patterns of the MIN and MAX operators in the corresponding MMCNN are given by
Figure 10 shows the searched result. The corresponding output of the averaging operator is shown in Fig. 10a. Figure IOc shows the error surface with kl E (0, 1 ) and k2 E (0, 1 ). This region is divided into 30 x 30 small rectangular
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
337
(c)
F l ~ u n t i 10. Error distribution between the output of the MMCNN and the averaging operator with tliffcrcnt kl and kz values. (a) The output of the averaging operator. (b) The output o l the MMCNN with minimum error. (0) The error distribution on the kl -?, ! parameter plane.
regions. The minilnuin value is found at point ( k l , k z ) = (0.5, 0.5) with E = 13.7573. The corresponding output of the MMCNN is shown in Fig. lob. As we can expected from the very small E , the two images in Figs. 10a and b are almost the same. Of course, this result is image-dependent. For images having very few ‘‘smooth” segments, the error E will become larger. 3. Designing MMCNN,fiw Approxinzrrting Horizontul Derivative A horiLontal derivative with a 3 x 3 kernel is given by
{:i
i).
(2 13)
338
TAO YANG
The neighborhood patterns for the MIN and MAX operators in the corresponding MMCNN are given by
We then search the minimum point on the kl - k? plane. Figure 1 I shows the searched results. The corresponding output of horizontal derivative is shown in Fig. I la. Figure I Ic shows the error surface with kl E (-3,O)
(C)
FIGIJREI I . Error distribution between the output of the MMCNN and the horizontal derivative operator with different k l and k? values. All images in this ligure are shown i n negative exposure. (a ) The output of thc horizontal derivative operator. ( h ) The output of the MMCNN with minimum error. ( c ) The error distribution on the X I k~ parameter plane. ~
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
339
and k2 E (0, 3). This region is divided into 30 x 30 small rectangular regions. The minimum value is found at the point ( k l , k ? ) = (-1.5, 1.5) with E = 391.307098. The corresponding output of the MMCNN is shown in Fig. 1 Ib. Here we present only the derivative kernel for the horizontal derivative. The other MIN/MAX derivative kernels are given in Sect. IV. C. 4.
4. Designing MMCNN jii r ApproxiinLi ring Orietz ta tinrz De ri vci tives The orientation derivatives are collections of orientation-dependent high-pass filters. A combination of different kinds of orientation derivatives usually gives a much better edge detector than a single orientation-independent high-pass filter, such as the Laplacian operator. In this section we show the results of different orientation derivatives using different neighborhood patterns. The neighborhood patterns corresponding to one of the 8 compass directions are listed here: 1.
2.
4.
5.
340
TAO YANG
6.
Ni=
(-1, -1) (O,-l) ( I , -1)
i
(-1,O)
0 0
(-I,
1) 0 0 ],N*={ 0 0 (I. -I>
(1,O)
(-1.1) (0, I (I, I)
1
.
(220) There are many other neighborhood patterns, including combinations of these patterns. Some simulation results are shown in Fig. 12. Observe that the details of all of the results are different from each other even though the corresponding neighborhood patterns differ only slightly from each other. Although the orientation derivatives are standard image operations, to implement them in real time is not a trivial problem; MMCNN provides a possible real-time solution to this problem. 5 . Uiiiwrse of MMCNN Furzc~tionsin the kl
-
k2
Plane
In this section we show that the two tunable parameters kl and k2 can generate many different image processing functions by introducing competition and cooperation between the local MIN and the local MAX operations. The firnctional universe of MMCNN with full scale N , and N 2 is depicted in Fig. 13. We call the positive half of the horizontal axis the erosion axis and the positive half of the vertical axis the diliitioii axis. Observe that the functional universe of MMCNNs includes generalized erosion and dilation as two special cases. Moreover, the functional universe also includes many other tasks resulting from competition and cooperation between the MIN and MAX operations. To study the universe qualitatively, it suffices to consider only the upper half because the lower half represents only corresponding “negative” tasks. However, as the nonlinear output function of the neurons is not symmetric, the tasks irnplemented by MMCNNs in the lower half plane are very different from those corresponding to the negative of the upper half plane. These differences are illustrated in the examples provided in this section. We also labeled two lines indicating the central regions of competition and cooperation. Cooperation results in a lowpass filtering operation while competition results in a high-pass filtering operation. Perforinmice of H i g h - p a s s Filteririg via MMCNN. Figure I4 shows different results of high-pass filtering operations resulting from competitions between erosion and dilation. The parameters for Figs. 14b and e are the negative of those in Figs. 14c and d. As the nonlinearity of the output function is not symmetric, the output of the parameters in the lower half plane is not negatively symmetric to those from the upper plane. Observe that k l and k? can tune the high-pass performances in a very different manner. Figures 14a, b, and c show typical Laplacian performances. However, the result in Fig. 14d
(d) (e) ( f) FIGURE 12. The results of the orientation derivatives computed by the MMCNN; kl = -1.4 and kl = 1.4 for all cases. All images are shown in negative exposure. (a) The output as in Eq. (215). (b) The output as in Eq. (216). (c) The output as in Eq. (217). (d) The output as in Eq. (218). (e) The output as in Eq. (219). (f) The output as in Eq. (220).
342
TAO YANG
FIGLIRF 13. The universe of functions of the MMCNN in the k~
~
k? plane
looks more like image enhancement than high-pass filtering alone whereas the result in Fig. 14e is more similar to image segmentation. In the functional universe of MMCNNs the points kl = -1 and kl = I correspond to the so-called range operator, which computes the difference between the maximum value and the minimum value of the pixels within the neighborhood. The range operator responds to the boundaries between regions having different average brightness and is sometimes used as an edge-defining algorithm (Russ, 1992). Our MMCNN represents a generalization of the range operator by also scaling the effects of MIN and MAX using different weights. Perfbrtnatzce of Lowpnss Filtering via MMCNN. In this section we give the cooperating results of dilation and erosion. The simulation results are shown in Fig. IS. Although the effect of different kl and k 2 on the performance is somewhat similar to that corresponding to the radius of different Gaussian kernels, the difference is that the MMCNN has a very strong nonlinearity. For example, in Fig. 1Sa, although some region is very similar to the output of the Gaussian kernel with a big radius, however, by inspecting the position of the hair between the hat and the face, we can still find a black thin vertical line, which would not exist using a similar Gaussian kernel. Hence, we find that the MMCNN not only provides reasonable approximations of existing various image processing operations, but also presents other unique functions that can be exploited in some special applications.
6. Learning Algorithms Many image processing operations can not be implemented exactly using MMCNN because they are in nature very different from MIN/MAX operations.
w
P
w
(d) (el FIGLIRE 14. The results of high-pass performances resulting from the competition between erosion and dilation. All images are shown in negative exposure unless stated otherwise. (a) kl = -4 and k, = 4. (b) k1 = -1 and kz = 0.75. (c) kj = -1 and X.2 = 3 . (d) kl = 4 and k? = - 3 (shown in positive). (e) kl = 1 and k? = 0.75 (shown in positive).
344
TAO YANG
(d) ~ l ( i 1 l R E 15. The results of lowpass performances given by a11 MMCNN. (a) kj = 4 mid = 4. (b) kl = 2 and k2 = 2. ( c ) k1 = I and k2 = I . (d) kl = 0.2.5 antl kz = 0.75. ( e ) kl = 0.75 antl kz = 0.25.
They are convolutions whose implementation requires many multiplications. Unfortunately, current CMOS implementation of a multiplier requires at least 10 transistors, occupying 10 times more silicon area than an MMCNN implementation. This 10-fold reduction in silicon area is the motivation for developing MMCNNs for approximating the functions of classical image operators. Clearly, there is a tradeoff between functionality and implementability. We present a learning method to optimize an MMCNN for different image processing tasks. Some examples are also presented in this section. In this section we use the following input and output functions of MMCNN:
We then study how an MMCNN can learn the optimum parameters when only the example sets { ( u ( i , j ) , o ( i , j ) ) ) are available, where [u(i,j ) ) is the input image set, and { o ( i ,j ) } is the desired output image set. The learning process is driven by minimizing the square of the difference between the
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
345
desired output ( n ( i ,j ) ) and actual output ( y ( i , j ) }
E =
1 -
2
C(o(i, j ) - y ( i , j))’ (1.J)
for all the training samples to be learned. We call this function a cost,function. We then have
aE ilk1
-
aE d j l ( i , j ) &(i, j ) ay(i, j ) &(i, j ) akl
where
Then the &learning rule for updating kl is given by
where p is the learning rate. Similarly the 8-learning rule for updating k2 is given by
Hrmurks:
The learning algorithms provided in this section can only converge to local minima of the surface of error function E . However, if the training examples are smooth enough, we found in most cases that E only has a single global minimum. Thus, the choice of the initial condition is only to speed up the search process in the learning algorithm. 0
The size and statistical characteristics of the image samples should represent typical ones for the applications of the trained MMCNN. When the size of
346
TAO YANG
the image sample is too small, more than one image sample should be used because the training algorithm will be too sensitive if only a very small number of pixels are used in the training iterations. 0
0
p can not be chosen too big, or the training algorithm may become unstable. However, a very small p will slow down the convergence speed significantly.
More advanced learning algorithms should be developed for training neighborhood patterns. This is a kind of “structure” training problem, which is much more complicated than a parameter training problem. However, as image processing is a well-developed field, in most cases we can use our experience to choose neighborhood patterns efficiently.
Typicul Examples. I n this section we will train an MMCNN to perform three typical image processing operations: namely, averaging, calculating the Laplacian, and calculating the horizontal derivative. The averaging operator is a typical lowpass operation, whereas the Laplacian operator is a typical high-pass operation. While the preceding two operations are orientationindependent, the horizontal derivative is a typical orientation-dependent operation.
Averuging Operator. The output shown in Fig. 10a for the averaging operator in Eq. (21 1) is the desired output image. To train the MMCNN we choose p = lop5. The initial values for both k l and k2 are chosen equal to unity. The learning process is shown in Figs.16b and c with 1000 training iterations. Figure 16b shows the error E decreases monotonically and Fig. 16c shows the corresponding learning dynamics of kl and kz, which converges to kl = 0.48 1469 and k2 = 0.5 193549, respectively. The corresponding output of the MMCNN is shown in Fig. 16a. As we can expect from the very small error E , the two images in Figs.lOa and I6a are almost the same. This result is image-dependent. For images having very few “smooth” segments, the error E will become larger. Horizontul Derivutive. The output shown in Fig. I la of the horizontal derivative in Eq. (213) is the desired output image. To train the MMCNN we choose p = 4 x lops. The initial values are chosen equal to kl = - 1 and k2 = I . The learning process is shown in Figs.17b and c with 800 training iterations. Figure 17b shows the minimization of the error E and Fig. 17c shows the learning dynamics of kl and k 2 , which converges to kl = - 1.556571 and k2 = 1.533475, respectively. The corresponding output of the MMCNN is shown in Fig. 17a. Luplucian Operutor. The output shown in Fig. 9a of the Laplacian operator in Eq. (209) is the desired output image. To train the MMCNN we choose
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
347
(ill
l
i
I
.
.,
.
,
,
.
.
.
I I
,
,
,
,
,
,
,
,
,
,
I i I
I;~C;LJKI: 16. Thc learning process of an MMCNN for approximating the averaging operator. (a) The output of the MMCNN with learncd parametcrs. (13) The dynamics of the cost function E . (c) The convergence processes of kl (solid line) and k: (dashed line).
p = IO-’. The initial values are chosen to be kl = - 1 and k2 = I . The learning process is shown in Figs. 18b and c with 800 training iterations. Figure 18b shows the minimization of the error E . Figure I8c shows the learning dynamics o f kl and k l , which converges to kl = -0.6467125 and k l = 0.6626593, respectively. The corresponding output of the MMCNN is shown in Fig. 18a. Comparing the results in Figs.9a and 18a, we find that the latter is a “blurred” (lowpass filtered) version of the former. This can be predicted from the rather large error E at the equilibrium point found by the learning algorithm.
7. Esumples of’ Applications To demonstrate the capability of the MMCNN, we present here some real-life applications of the MMCNN to irnage processing. As the performance of any image processing structure is evaluated by humans and not by machines, we present here some simulation results for the readers to judge the performance of the MMCNN. Filteriiig out Smull Pcirtides. In Fig. 19a we show the image of a microscope view containing different sizes of particles. The task is to count only the big particles. Although humans can perform this task very easily, for a
w
P
00
iterations (b)
iterations
(c) FIGURE17. The learning process of an MMCNN for approximating the horizontal derivative. All images in this figure are shown in negative exposure. (a) The output of the MMCNN with learned parameters. (b) The dynamics of the cost function E . ( c ) The convergence processes of l i l (solid line) and kz (dashed line).
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
'A
349
,
FIGURE18. Training an MMCNN for approximating the Laplacian operator. All imagcs in this figure are shown in negative exposure. ( a ) The output of the MMCNN with learned parameters. (b) Tho dynamics of the cost function E . ( c ) The convergcncc processes of kl (solid line) ;ind k l (dashed line).
program to do it, it is first necessary to delete the small particles. This problem can be solved by applying four erosions on a 3 x 3 8-neighbor sphere of influence N , (the result is shown in Fig. 19b), followed by applying four dilations, as shown in Fig. 19c. Observe that the result is not very good because of the strong background at the upper-left corner in the original image. To overcome this problem we should eliminate first the big difference in contrast between different portions of background; this problem can be solved by applying an MMCNN with the following sphere of influence:
Nl
= N2 =
{
-1) (0, - 1 ) (1, -1)
(-1,
(-I,()) (O,O) (1,0)
(-1,
1) (0, I ) (1,1)
}
(227)
and with kl = 0.2, kz = 0.2, which is a scaled smoothing MMCNN. When we subtract the output of this MMCNN from the original image, we obtain the image shown in Fig. 19d. Observe that the background becomes much more homogeneous. The image in Fig. 19d is then processed by applying the erosion MMCNN twice (shown in Fig. 19e) and followed by applying the
I
(.
50 . a
100
I50
150.
200
200
B
250 50
100 150 200 250
n
'
. I 250 50
100 150 200 250
50
100 150 200 250
(a)
.m.
'
.. .
'... #
. . ' I
1
s .
*
.
250
250
FIGURE19. Sequence of operations to filter out small particles. All images in this figure are shown in negative exposures. (a) Original image. (b) After four erosions of (a). (c) After four dilations of (b). (d) Homogenizing the background of (a). (e) After two erosions of (d). (f) After two dilations of (e).
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
35 I
FIGLIIIE20. linage enhancement. (a) The output of NMCNN in Eq. (228). (b) Subtraction of (a) from the original image.
dilation MMCNN twice. The final result shown in Fig. 19f is significantly better than that of Fig. 19c. linage Enhuncement. One way to enhance an image is to locally increase the contrast at discontinuities by subtracting a high-pass version from the original image. For this example, the high-pass version is obtained by applying the following MMCNN:
Ni =N2 =
(-1, - 1 ) (0,- 1 )
i
(1,--1)
(-1,0)
(0,O)
I) (0, 1 )
(-1,
(1, 1 )
1
,kl
= -0.65,
k2
= 0.65.
(228) The high-pass output image is shown in Fig. 20a. This result is then subtracted from the original image as shown in Fig. 5a and the enhanced image is shown in Fig. 20b. Our next example shows improvement in the fine details from an almost homogeneous image that resulted from the sinall dynamic range of the imaging camera. The simulation results are shown in Fig. 21. Figure 21a shows the original image of a tissue sample. Since the camera mounted on the microscope has a very limited dynamic range, details related to the fiber orientations within the tissue samples are very difficult to distinguish. To make the fiber orientations visible, we use an MMCNN to enhance the original image. In the first step a “smoothing” MMCNN with full-scale neighborhood patterns N, and N2 and with synaptic weights kl = k2 = 0.5 is used to smooth the original image. The result is shown in Fig. 21b. The smoothed version is then subtracted from the original image to give the result shown i n Fig. 21c. Subtracting the result in Fig. 21c from the original image in Fig. 21a, we obtain the enhanced image in Fig. 21d where the fiber orientations are clearly visible.
352
TAO YANG
(C)
(d)
21. Image enhancement: enhance the details in the original imge. (a) The original image. (b) The smooth version of (a) using an MMCNN with k l = 0.5, k l = 0.5. (c) Substract (b) from (a) and amplify 20 times (shown in negative exposure). (d) Substract (c) from (a). FI(iLu<E
Texture Processing: Alloy Quality Control. Many images consist of regions with repeated variations in brightness called textures. Rank operations are often used to detect textures in images. One of the simplest (but least versatile) texture operators is to find the range of difference between the maximum and the minimum brightness values in a neighborhood. For a flat or uniform region, the range is small. Large values of the range correspond to a surface with a large roughness. The size of the neighborhood should be large enough to include both dark and light pixels, which generally means that it should be larger than the small uniform details in the images. However, current VLSI technology makes it impractical to choose a sphere of influence
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
353
larger than a 3 x 3 neighborhood. I n view of such practical considerations, an MMCNN can only process relatively fine textures. For images with big texture characteristics, we need to reduce the resolution before an MMCNN can process it. Consider a real-life application of an MMCNN for quality control of alloys. Thc results are shown in Fig. 22. Figure 22a shows an original image of the broken face of an alloy test sample. Observe that the upper left corner is much smoother than the lower right corner. The smooth portion is due to long-term fatigue. The rough portion is due to terminal tearing failures. If the alloy has no defect, then the boundary between these two kinds of textures should be regular. As fatigue failures are more likely to develop along defects in the alloy, any defect will make the boundaries irregular. However, with the naked eye it is very difficult to see the boundaries between the smooth portion and the rough portion in Fig. 22a. Let us use an MMCNN to segment the smooth texture and the rough texture. Figure 22b shows the range image provided by an MMCNN with full-scale 3 x 3 neighborhood patterns Nl and N? and with kl = - 1, kr = 1. In this image original feature brightness is gone and brightness represents only texture. However, as the 3 x 3 neighborhood is too small compared with the characteristic regions of the texture, the difference between textures is covered by noise. To overcome this problem we use the same MMCNN to process the output image in Fig. 22b for enhancing the difference as shown in Fig. 22c. We then smooth the image in Fig. 22c by an MMCNN with parameters kl = 0.25, k2 = 0.75. The result is shown in Fig. 22d. As this result favors the MAX effect, we process Fig. 22d by an MMCNN with parameters kl = 0.75, k2 = 0.25. The result is shown in Fig. 22e. The results in Figs.22d and e are then added to give the comprehensive result shown in Fig. 22f. The result in Fig. 22g is combined into a threshold version of Fig. 22f (with threshold equal to 0.85) and the original image. Figure 22h is the compensated image of Fig. 22g. Figure 22g shows the smooth portion of the broken end. This portion is formed by long-term fatigue. From Fig. 22h we can see that all small grains with a sinall bright center region are identified. This texture corresponds to the rougher portion of the original image. This kind of texture is formed by tearing failures. We can therefore conclude that a big defect exists in the lower left corner. This defect induced a fatigue failure along the horizontal direction from the lower left comer.
Intelligent Cumera. As an MMCNN chip has a very high pixel density, it can be fabricated in a very small size that fits into a camera. A smallscale MMCNN chip can be used to control the self-focusing system of an automatic camera for dynamic object tracing. The self-focusing system is an
354
TAO YANG
(b)
(11)
F I W R E 22. Texture detection. (a) The original image. (b) The range image of (a) provided by an MMCNN with kl = - 1 , k? = I , (c) The range image of (b) provided by an MMCNN with k l = - 1 , kz = I . (d) The smoothed image of (c) provided by an MMCNN with k~ = 0.25. k? = 0.75. (e) The smoothed image of (d) provided by an MMCNN with kl = 0.7.5. k? = 0.25. (f) The sum image of (d) and (e). (g) The combination of the threshold version of (f) and the original image in (a) reveals the locations of the smooth texture. (h) The compcnsation of (g) reveals the Icications of the rough texture.
intelligent feedback control system consisting of a sensor, an algorithm, and an actuator. The sensor is embedded into the MMCNN array. The actuator is driven by a step motor. The algorithm is a simple digital feedback controller. The whole system is depicted in Fig. 23. Figure 23a shows the block diagram of the entire self-focused camera. Whenever a captured image is off-focus,
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
355
(b)
FK;LIIE23. Applications of an MMCNN to a self-focused camera. (a) The block diagram of the entire optical and electronic system. (b) The varying neuron dcnsity on the MMCNN chip. Every box denotes it neuron.
the MMCNN gives a signal to the controller. The controller drives the lenses moving forward or backward. The feedback loop is set up when the MMCNN detects the adjusted image and gives a signal determined by the degree of offfocus. From linear optics we know that the more the camera is off-focus, the more blurred is the captured image. The off-focus effect can be approximately modeled as a lowpass filtering process with a different neighborhood size (X.P. Yang et a/., 1994). The goal of the feedback controller is to minimize the sum of the outputs of all neurons on the following MMCNN chip:
Nl =N2=
{
(-1, -1) ((3-1) (1, - 1 )
(-1,O) (-1,O) (1,O)
1) ((),I) (1,l)
(-1,
1
,kl = - l , k 2 =
I.
(229)
This MMCNN can calculate the sharpness of an image by finding the difference between the local minimum and the local maximum. As any lowpass filter will reduce the original local maximum and increase the original local minimum, the minimum of the output sum of the foregoing MMCNN corresponds to the “sharpest” image.
356
TAO YANG
The MMCNN chip used in the camera has a space-varying neuron density just as the human retina does. As shown in Fig. 23b, the center region of the MMCNN chip has the highest neuron density because when we shoot a picture, we usually put the objects we find interesting in the center region. On the other hand, it is more convenient to implement this structure using a VLSI chip because we can put the control and addition circuits in the margin regions. In this MMCNN chip, only 40 neurons are fabricated. The captured images of a controlling sequence are shown in Fig. 24. Figures 24a to f are captured images in steps 1 to 6. The output signal given by the MMCNN chip is shown in Fig. 24g. Let us follow the output of the MMCNN where the minimum is found at the sixth step. The seventh step gives an overshoot, which is suppressed at the eighth step. D. Fure Image Processing using Type-1 FCNN For the purpose of face recognition (Chellappa et al., 1995), facial expression animation (Yang et al., 19960 and synthesis (Sato et al., 1995), and compression of face image (Clarke, 1995; Torres and Kunt, 1996), we need to model face images. Two important steps in modeling of face images are face image segmentation and feature extraction. The structural features of a face image include eyebrows, eyes, nose, and mouth. For facial expression animation, the deformation of cheeks is of interest. Although Yang et al., ( 1 9960 used a 2-layer conventional CNN to animate facial expressions, they did not provide a method to locate the key-points, which specify the locations of eyes (eyebrows) and mouth. Of cause, we can locate eyebrows, eyes, nose, and mouth by using a series of templates that perform the corresponding digital image algorithms as surveyed in Chellappa et al., (1995). We show here how the following two simple type-I FCNNs can be used to solve this problem. One of them is given by (230)
2 . Output equation
In the preceding type-I FCNN, the template
i f,,,in
consists of fuzzy numbers.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
357
Control Step
(g)
24. Self-focused intelligcnt camera. (a) The captured image at step 1. (b) The capturcd iinage at step 2. (c) The captured image at step 3. (d) The captured image at step 4. (e) The capturcd image at step S. (f) Thc captured image at step 6 . (g) The output of the MMCNN chip. FIGURE
The other is given by the following state equation:
The output equation is the same as that in Eq. (231). Similarly, the template 81 ,111,, consists of fuzzy numbers. 1 . Locating Low Boutidmies of E~iehrows,Eyes. Nose, and Mouth
In this example, the templates given by
81
and Bfrndxin Eqs.(230) and (232) are
358
where of A.
TAO YANG
“-”
is an algebraic negative and
-
Bfmax=
[
‘‘2’denotes the fuzzy complement
A
-C
-c
A
B
A
(234)
-C
where the fuzzy number A is given by
+ l),
x 5 0.5 0.5 < x 5 1
max(0, - n u l), p c ( x ) = max(0, m ( x - 1 ) l ) ,
x 5 0.5 0.5 < x 5 1
=
{
k ( 0 , k(x - 1 )
the fuzzy number B is given by
the fuzzy number C is given by
+
{
+
(237)
‘
Observe that the fuzzy numbers A, B, and C are piecewise linear for the purposes of easy VLSI implementation. To show how the operator “&,in” works, we rewrite Eq. (230) into an explicit form as i l j
= -XI
pA(ui-l.j-l), 1 + pC(ui-l.j), 1 - p A ( u i - l , ; + l ) ~ + / ~ ~ ( u i . j - ~ ) , 1 ~ ~ ( u i . , ) ) 1 + ~ ~ ( u i , j + ~. (238) 1 - pA(Ui+l,j-l), 1 + /-k(ui+l.j), 1 FA(ui+l,j+l)
1
A(
j+ -
1
-
Also, to show how the operator an explicit form as i . .- . x . I. J + -
.IJ
)
1 3
-
v
-PC(~~-I.,)*
PA(ui-I.j-1)3
-(
-PC(U~.~-I 1
p A ( U i + I , 1-1
‘‘6,,,ax” works,
9
),
we rewrite Eq. (232) into P A ( ~ ~ - I , ~),+ I
~ n ~ ( ~ i , j ) ,- P C ( ~ ~ , ; + I -pC(ui+l, j ) ,
1,
pd(uif1.j+l )
)
.
(239)
The simulation results are shown in Fig. 25. Figure 25a shows the facial image of a Chinese girl of size 63 x 63 and 256 gray levels (normalized within [O, I I). This image is fed into the input of FCNN in Eq. (232), the output image is shown in Fig. 25b. Then the image in Fig. 25b is fed into the input of FCNN in Eq. (230), the output image is shown in Fig. 25c. From Fig. 25c one can
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
359
(c)
FIGURE 2.5. Locating structural features of facial iniage using typc-I FCNN. (a) A gray-scale facial image of a Chinese girl. (b) Output image of the FCNN in Eq. (232). Input is the image i n (a). (c) Output image of the FCNN in Eq. (230). Input is the image in (b).
see that the low boundaries of eyebrows, eyes, nose, and mouth are marked by black lines. In this simulation, we choose k = 2 and m = 4. However, different k s and ms should be chosen for different facial images.
2. Segmentation of Fuce
I n this example, we show thc segmentation of "flat" parts of a facial image that contains cheeks, chin, and forehead. The two type-I FCNNs in Eqs.(230)
360
TAO YANG
(b)
(a)
FIGLIRE 26. Segmentation o f a facial image. (a) Output image of the FCNN in Eq. (230). Input is the image in Fig. 25(a). (b) Output image of the FCNN in Eq. (232). Input is the image in (a).
and (232) are used. The templates
if,in
and
ifmax are given by
The fuzzy numbers A and B are the same as those in Eqs.(235) and (236). The simulation results are shown in Fig. 26. Figure 26a shows the output results of the FCNN in Eq. (230). The input image is that shown in Fig. 25a. Then the image in Fig. 26a is fed into the input of FCNN in Eq. (232), the output image is shown in Fig. 26b. From Fig. 26b one can see that all the flat regions of the face (e.g., forehead, cheeks, and chin) are marked by light regions. We choose k = 2.
v.
EMBED LINGUISTIC STATEMENTS INTO FCNN
In this section we present some inner properties of FCNN that function as an intelligent component in the CNN universe as an interpreter between highlevel knowledge expressions and low-level hardware structures. Comparing the results provided in Sect. IV, this section emphasizes the high-level ability of
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
36 I
FCNN structures. And high-level ability is the significant difference between conventional CNN and FCNN. A. FCNN: 1tzterfiice.s Beth>eenHicmurz Experts Lrnd CNN
In an artificial intelligent system, the motivation is to make a brain model as the core of the system. A conventional CNNUM cannot function ;IS this core even though it has already been proved to be a Turing machine (Crounse and Chua, 1995) (Turing machines cannot answer a simple question: “How are you feeling?”) and even though it can be used to explain many visual phenomena (Yiing et NI, 1996f; Csapodi and Roska, 1996; Werblin et [ I / . , 1994) (Seeing is not thinking). On the other hand, the FCNN structure can be used as an interface between a human expert and a conventional CNN. In this sense, the input of an FCNN is the knowledge of a human expert, which is described by linguistic statements, and the outputs are sets of “templates.” In other words, FCNN is used to translate linguistic or higher-level statements, which are expressed as fuzzy rules into CNN structures. 1 . Fuzzy Set Theory c r n d Fuzzy Properties of 1in~ige.s
In each phase of image processing, there exist many uncertainties (Yager and Zadeh, 1992), for example, additive and nonadditive noise in the sensing and transmission processes; the loss of information while 3D shapes or scenes are projected into 2D images; lack of the quantitative measurement of image quality and imprecision in computations; and ambiguity and vagueness in representations, definitions and interpretation of complex scenes. Fuzzy set theory (Zadeh, 1965) provides the mathematical strength to capture these uncertainties (Kandel, 1982; Marks, 1994). I t has found wide applications in image processing (Yager and Zadeh, 1992; Zadeh er d . , 1975; Pol and King, I98 1 ; Kandel, 1982; Marks. 1994; Peleg and Rosenfeld, 1989; Nakagawa and Rosenfeld, 1978) such as image modeling, preprocessing, segmentation, objecthegion recognition, and reasoning aspects of image processing problems. While fuzzy set theory provides an inference mechanism under cognitive uncertainty, the CNN (Chua and Yang, 1988a,b) offers advantages such as learning, adaptation, fault-tolerance, parallelism, and generalization. Although fuzzy logic is a natural mechanism for modeling cognitive uncertainty, it may involve an increase in the amount of computation required (compared with a system using digital logic). This can be readily offset by using FCNN, which has the potential for parallel computation with high flexibility. A fuzzy set A with its finite number of supports x,,i = I , . . . , t i , is defined as an ordered pair A ={ PA (.Ti ))I (242) 1
362
TAO YANG
or, in a union form, A=
Up r / x , ,i = I , . . . , n
(243)
I
where the membership function ~ A ( x ;in) the interval [O,l] denotes the degree to which an event x; may be a member of A. Here P A = 0 represents no membership and p~ = 1 represents full membership. This characteristic function can be viewed as a weighting coefficient that reflects the ambiguity in A . A f i z z y singleton is a fuzzy set that has only one supporting point. I n digital image this concept is very useful because a pixel can be viewed as a fuzzy singleton. The operations on fuzzy sets are extensions of those used for traditional sets. Some of the common operations include comparison, containment, intersection, union, and complement. Assuming U to be the universe of discourse, A E U and B E U , these operations are defined as follows: Cornptrrison: is A = B?
U.
(244)
< ,utj(x), Vx E U .
(245 )
A = B iff p ~ ( . x= ) p ~ ( x ) Vx ,
Contuinnient: is A
E
c B? A
cB
iff
PA(X)
Union: The union of two fuzzy sets A and B , A v B , is given by combining the membership functions of A and B. Although there have been several different union operations defined (Yager, 1979), the most common, and so far the simplest, union is defined as
Intersection: Like the union, the intersection of two fuzzy sets A and R , B , is given by combining the membership functions of A and B and is defined as PAM = m i n { ~ ~ ( xP)B, ( x > ) , Vx E u. (247)
A
A
Conzplement: The complement of the fuzzy set A , 2, is defined as
In addition to these operations, De Morgan’s law, the distributive laws, algebraic operation such as addition and multiplications, and the notion of convexity have fuzzy set equivalents (Zadeh, 1965).
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
363
Kernnrk: The traditional CNN can implement the following fuzzy operations: Comparison can be implemented by using a minus operation template and then checking whether the output is zero or not. Containment can be implemented by using a minus operation template and then checking the sign of the output. Complement can also be implemented by using a minus operation template. However, intersection and union cannot be implemented by using the traditional CNN, and they can be implemented by using FCNN. A gray-scale image possesses ambiguity with each pixel because of the possible multivalued levels of brightness. If the gray levels are scaled to lie in the region [O,l], we can regard the gray levels of a pixel as its degree of membership in the set of high-valued “bright” pixels-thus a gray image can be viewed as a fuzzy set. Regions, features, primitives, properties, and relations among pixels that are not crisply defined can similarly be viewed as fuzzy subsets of images (Zadeh et l i l . , 1975; Pol and King, 1981). With the concept of fuzzy set, a gray-scale image X of M x N pixels with gray levels belonging to [0,I ] can be considered as an array of fuzzy singletons, each with a value of membership function denoting the degree of brightness relative to some brightness level in [0,1 I. On the other hand, the fuzzy property of an image also comes from the uncertainty of the relationship between rent pixels. This is the basis for application of FCNN to image processing.
2. FCNN cis mi Interpreter Human experts usually use linguistic statements to evaluate and describe images. When we take a picture we are most likely to say, “got a little blurred” instead of “filtered by a lowpass filter”. Or we may say “there are some black dots!” instead of “there were impulsive noises at pixels (3,4), (44,94) and (123,321).” Rased on the fuzzy description of images, many fuzzy methods have been developed to deal with image processing problems. It is very difficult to embed linguistic if-then rule-based fuzzy image processing techniques into a conventional CNN chip. To overcome this problem, we need an interpreter between the human experts and the conventional CNN. From many previous works (Yang and Yang, 1996, 1997d,e,f; Yang et nl., 1996d; Yang et nl., 1996e, 1997i, 39988; Yang et a/., 1998. 1998). we find that FCNN can efficiently embed fuzzy rules into its structure. This capability of FCNN is found very useful in interpreting the accumulated linguistic knowledge from the field of fuzzy image processing. On the other hand, the learning ability of FCNN also provides us with the possibility of “teaching” FCNN using linguistic statements and making the design of CNN structures for special image processing tasks much easier. If we think that the only way a human expert communicates with a conventional CNN is through a template design or by collecting a huge body of data to train the CNN structure, we find that teaching an FCNN using our language
364
TAO YANG
may be more direct and easier than the method used so far. In this section we present only the method for embedding linguistic statements into FCNN structure. We will present learning algorithms of FCNN in Sect. VI. B. Embedding Fuzzy Ii$erence into FCNN In research by Yang et 01. (1998g), the FCNN structure for implementing the following fuzzy IF-THEN rule was presented. The rule is given by:
where K (( .i )L I E Ni ,j o( jx,k l , I 5 i 5 M , is an algebraic or fuzzy local operation detined in N,.(i, j ) . Here A;, 1 5 i 5 M is a fuzzy variable and B is a fuzzy variable and the consequent. The corresponding FCNN structure for implementing the above IF-THEN rule is given by 1. State equation
2. Output equation Yi; = P R ( X i j )
(250)
where / L A , ( . ) , i = 1, 2, . . . , M , and p ~ ( ( are . ) membership functions of A ; and B. respectively. In Yang and Yang ( 1997e) and Yang er LZI(19988) we presented the FCNN structure for embedding the fuzzy inference ruled by else action (FIRE) operntors, which are fuzzy operators for image processing (Russo, 1992; Russo and Ramponi, 1994b). The FIRE operators are based on fuzzy IF-THEN-ELSE architectures to perform many important image processing tasks, for example, image enhancement (Russo and Ramponi, 1994a, 1995) and edge detection (Russo, 1992; Russo and Ramponi, 1994b). First, we introduce FIRE operators briefly. Consider an L-level gray-scale is a pixel in the neighborhood image U . Suppose u;; is a pixel in U and of u,;, then we define xk) = lukl - u;;l as “gray-value difference.” We also
365
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
need the membership function of linguistic variable ZERO(ZE), K L f - (x), the membership function of linguistic variable WHITE(WH), / L W H (x), and that of linguistic variable BLACK(BL), p ~ l / ~ ( . x ) . In general, a FIRE operator consists of a group of N IF-THEN-rules and one ELSE-rule as
Rule 2: IF XI is A I I and . . . and .XM is A I M THEN y is BT . . . IF .rl is A N I and . . . and XM is AN^ THEN y is Br ELSE y is BE where M is the number of input variables. Ai.,, i = I , . . . , N ; J = I , . . . , M , is the fuzzy set corresponding to the j-th input variable in the i-th THEN-rule. BT is the common consequent set of the group of THEN-rules and BE is the consequent set of the ELSE-rule. Observe that every THEN-rule in Rule 2 has the same structure as in Rule 1. If BT is different for every THEN-rule, we can use N layers of FCNN in Eq. (249) to implement Rule 2. As all THEN-rules have the same consequent, we can use a simpler FCNN structure to implement Rule 2. Letting A; be the strength of the i-th THEN-rule in Rule 2, we have
where P A , , ( . ) is the membership function of A;,. Letting AT be the strength of the THEN-sub-rule in Rule 2, we have
v
AT=
A,
(252)
j=l. ...N
Letting AE be the strength of the ELSE-rule in Rule 2, we have
Al:’= 1
-AT.
(253)
Finally, the output y is given by a tradeoff between AT and h . by ~ using a proper defuzzifier. There is no unique way to perform defuzzification. And there are some considerations for choosing defuzzifiers. Several existing methods for defuzzification take into consideration the shape of the clipped fuzzy numbers (Bojadziev and Bojadziev, 1995). The complexity of computations and the possibility of VLSI implementation are also taken into account. According to the principles proposed in Yang et al., (1998g), the defuzzifier is
366
TAO YANG
implemented by the so-called output function in the conventional CNN (in our FCNN, it is called an output membership fuizction or defuzz~erfur7ction). The simplest defuzzifier is a threshold function when only binary output is needed. C. Applicution to Irnrige Processing 1 . Fuzzy lnfererzce Edge Detector
We then give the FCNN structure that can be used to embed the FIRE edge extractor (Russo and Ramponi, 1994b). A FIRE edge extractor is illustrated in Fig. 27. In Fig. 27, the membership functions of ZE, BL, and WH are chosen as a trapezoidal shape. To simplify the structure of FCNN, we only use 17. to defuzzify output. We use only one single-layer FCNN to implement all THEN-rules shown in Fig. 27. The state equation of this FCNN is given by
(254)
It should be noticed that a don’t cure pixel in Fig. 27 introduces no template relation. The equilibrium point of the state variable gives the strength of the THEN-rules, 1 ~The . output equation of the foregoing FCNN functions as a defuzzifier and is given by
367
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
p€p% I I
I I I I I
I
I THEN-rule 1
THEN-rule 3
THEN-rule 2
I
THEN-rule 4
I
FlouRE 27. Rule\ for FIRE edge extractor
where y i i denotes the classical truth value of the pixel to be an edge pixel and > 0 is a threshold. The simulation results are shown in Fig. 28. Figure 28a shows the state variables of FCNN in Eq. (254), which is the result of fuzzy inference. Figure 28b is the corresponding output, which is a thresholded (defiizzified) version of Fig. 28a.
h
2. Iiizpulsive Noise Rerimul via Fuzzy Itlference In this section, we consider the problem of impulsive noise removal. Median filters are usually used to remove impulsive noise (Arakawa, 1996; Yin ~t ul., 1996). Unfortunately, a median filter blurs fine structures of an image and causes edge jitter and streaking. To overcome this problem, an efficient method is one in which the median filter filters only those pixels where impulsive noise exists and keeps the other pixels unchanged (Arakawa, 1996). To do this, the first step is to identify impulsive noise. From our experience, an impulsive noise always seems to introduce a significant gray-value difference to its neighbors. This experience can be expressed by the following fuzzy
368
TAO YANG
(3)
(b)
FIOUKE 2X. FCNN-based fuzzy inference for edge detection. (a) Output of FCNN-based f u u y inference. ( b ) Result of thresholding the image in (a)
IF-THEN rule:
IF p ~ ( u ; , ; - l u ; , ) is big and p , ~ - ( u ; , ~ +u;,;) ~ is big and p ~ ( u j - 1 , ;- u;,;)is big and p ~ ( u ; + l , ,; u;,;) is big and p ~ ( u j - 1 , j - l - u ; ; ) is big and p ~ ( u ; + l , , , +~ ~ 4 , ~is) big and p~1.F(u;-l,;+l- u;,) is hig and p,:(u;+l,j-1 - u ; j ) is big, THEN u;; is an impulsive noise. I n the preceding fuzzy rule the membership function pb-(.) functions as a fuzzilier. We choose p,;(.) as the following piecewise linear function:
where k > 0 is a constant. If the input of an FCNN is normalized within 10,I ] and k = 1, then Eq. (256) can be rewritten as
The membership function of linguistic variable big, p b l g ( . ) . can be obtained by using the training method proposed in (Arakawa, 1996). We use the membership function p,,,,,,(.) to denote the degree of truth of the sentence “there
369
FlJZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
is an impulsive noise.” So, p,,,iae(.) = 1 denotes “there is (exactly) an impulsive noise;” p,,,ise(.)= 0 denotes “there isn’t an impulsive noise.” To facilitate VLSI complementation of FCNN, we usually choose phig(.) and pnoise(.) as piecewise linear functions. Then the following FCNN can be used to identify impulsive noises: I . State equation x. ’11- -xi,; ,
+
/I
phig(luk/ - u i j l )
(258)
ChctNi ( l . l ) / c , ,
where N I ( i , j ) / C l , denotes all cells within neighborhood system N I(i, j ) except for C l / . 2. Output equation (259)
~ i ,= j P.noise(Xij).
Generally, finding the corresponding template of an FCNN structure as in Eq. (249) is very difficult and unnecessary. Fortunately, as the relation between synaptic weights and inputs in Eq. (258) is not very complicated, the FCNN in Eq. (258) has the following space-varying nonlinear B f min template: BJ iiiiii(ij)0 llij
=[
pbig(llli-1.j-I phig(1ui.j-I @hig(llli+I.j-I
- uijl) -
phig(1ui-1.j
-
u i , / I ) / ~ h t g ( l ~ i - l , j +l uijl) phig(lui.j+I
ui.jI)
- l(i,,I)
phig(lui+l.j - u i j l )
phig(lui+l,j+l
- ui.jI) - Lli,jl)
1
.
(260) However, even in this simple case, use of a template seems to complicate the expression. The preceding FCNN identifies impulsive noise in a parallel way over the whole image. Here, p h l g ( ’ ) and pUnoite(.) are usually difficult to choose and depends on the statistical properties of impulsive noise. To overcome this problem, we can train these membership functions by a 3 x 3 FCNN. Some efficient learning algorithms for this kind of FCNN had been developed for from examples; these results will be presented in learning p h i g ( . ) and pnoise(.) Sec. VI. Simulation results are shown in Fig. 29. Figure 29a shows a facial image of size 63 x 63 and 256 gray levels. In Fig.29b, impulsive noise of mean value is added. This image is denoted by {d’). Figure 29c 2.56 and deviation shows the output result of a median filter. This image is denoted by (u:!). Observe that the fine structures of the image in Fig. 29a are totally destroyed because Fig. 2% has low resolution and many details have a characteristic width of 1 pixel. Figure 29d shows the output of the FCNN in Eqs. (258)
370
TAO YANG
(el
(d)
FICXIKH 29. Using FCNN to remove impulsive noise. (a) The facial image of a Chinese girl. (b) Impulsive noises are added. ( c ) Output of a median filter. (d) Output of FCNN shows the degree of impulsivc noise. (e) Output of FCNN-based median filter.
and (259). In this image, the gray value of each pixel denotes the degree of being an impulsive noise. A black pixel means that it is an impulsive noise (or, pnoise= 1). A white pixel means that it is not an impulsive noise (or, pnoise = 0). A gray pixel, which occurs at boundaries or edges, means that it is suspected of being an impulsive noise. This image is denoted by (p:,ise). In this simulation, we normalize gray values within interval [O,l], then phi,,(’) is given by phig
and
= x, 0 5 x 5 1
is given by
40 20 where y > s > 0 are two constants, p = m, s = 256
(261 1
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
37 1
We are then ready to give the output result of our FCNN-based median We have filter
(~7,~).
(263) The output result is shown in Fig. 29e. Observe that this result is much better than that of Fig. 2%. The median filter can be implemented by some conventional CNNs. So far, there exist three kinds of CNN-based median filters. The first one (Paul et nl., 1992, 1993a) needs n cells to sort tz samples. The second one (Shi, 1994) reduces the cell number to one and needs a neighborhood of odd number of cells. The third one (Rekeczky et al., 199Sa; Roska and Kek, 1995), which is supposed to be an improved version of the second one, needs a neighborhood with an even number of cells. In this section, some analysis of the second and the third CNN-based median filters is proposed. For simplicity, we consider only those CNNs with neighborhood size of 3 x 3 (i.e., N I ( i , j ) ) ; the analysis of other cases is similar. The median filter given in Shi (1994) has the following state equation:
where the function sgn(.) should be defined by -1,
x
1,
x>o
If the input set { U ~ I I CELNI l ( i , j ) ) is sorted into a nondecreasing order as: (u1, w ,. . . , u5,. . . , U X , ug), then the median value should be ug. Let I I + denote the number of elements in the set (uk Iuk = us, k > 5 ) and t z - denote the number of elements in the set (ukluk = U S , k < S), if n+ = n - then U S is the only equilibrium point of the cell in Eq.(264). In this case, we study the global stability of this equilibrium point. We have the following theorem.
Theorem 16. Given n+ = n - , then of attraction (-00,00). Proof
145
is asymptotically strible in the basin
We construct the following Lyapunov function:
372 V,,(t) > 0 for any x,,(t) # of Eq. (264), we have:
TAO YANG u5.
Differentiating V l j ( f )following the solutions
where
Obqerve that when x,, < U S ,a! > 0 and when x,, > ( x l / ( t ) - u 5 ) a ! < 0 for any x,/ # us. We then have
U S , a!
< 0. Hence, we have
The equality is satisfied only when x,, = u5. So, the median value is asymptotically stable in the basin of attraction (-00, 00). 0 However, if it+ # n - then the cell in Eq. (264) has no equilibrium point. In this case, this median filter has no stable output in common sense. In simulations we found that the output of the cell fluctuated around a certain value with a small deviation as time became sufficiently large. To describe this fact, we need the following definition (Yaiig, 1994):
DeJnition 8 Equilibrium point for the mean (m-equilibrium point): said to be an in-equilibrium point of Eq. (264) if lim E ( x ; ; ( t ) J= x:,
1-00
and lim E ( i , , ( r ) ) = 0. 1-00
Then we have the following theorem:
XI*/
is
(270)
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
E [ s g n ( d * ( t ) )= ]
11- -
373
n+
ri++n-+ 1
then us is an tn-equilibriiirripoint of Eq. (264) when n+ #
11-.
Proof
0
The fourth equality is in view of d * ( t ) E (-6, 6).
Remark: We do not argue that us is the only m-equilibrium point. In fact, there are infinite m-equilibrium points, xs,:, given different E[sgn(x,,( t ) x;,)). This is possible if noise exists. Figure 30 shows the effect of noise on m-equilibrium points. In this simulation, we let (111, M?, . . . , u s , . . . , ug, ug) =
FIGURI: 30. (a) m-equilibrium points will1 different dc biases in point with different unil'ormly distributed noise in . ~ , , ( f ) .
.t;j(f). (b)
In-equilibrium
374
TAO YANG
(0.4,0.5,0.5,0.5,0.5, 0.6, 0.7, 0.8, 0.9}, that is, us = 0.5, n+ = 0, t i - = 3. t ) denotes additive noise in xi,;(t).Figure 30a shows cases when u,, ( t ) = K u is a dc bias. Observe that the m-equilibrium points are changed by different dc biases. For purpose of comparison, Fig. 30b shows cases when II,, ( t ) unifonnly distributes between -Kv Kii. The change of m-equilibrium point is similar to that in Fig. 30a. As these results are independent of initial state, we find the CNN-based median filter in Eq. (264) is robust enough. Given a low level of additive noise, the m-equilibrium point of Eq. (264) is very close to the median value when n f # n - . In most cases, this CNN-based median filter can output a satisfied result. When this median filter is used to process a 256 gray-scale image, it can always output the correct median value because the offset of an m-equilibrium point from a real median value is 3 times less than the value corresponding to 1 bit (We normalized 256 gray levels in [ - I , I]). Rekeczky et al. (199%) proposed a CNN-based median filter as follows: ‘ti,, (
-
1. State equation: i i j ( t ) = -xij
+ f(x;,) +
C
sgn(xjj(t) - i l k / ) .
(274)
Chi EN1 ( i . illc,,
2. Output equation:4 . f ( x i , j ( t ) )=
1/2(Ixij(t)
+ 1I
-
Ix,j(t)
-
11).
(275)
Then Rekeczky et a / . (1995a) argued that “General rank order filters can be implemented simply by changing the bias value of the template (e.g., MIN filter: I = 8; MAX filter: I = -8)”. However, Yin etal. (1996) gives the following description of a median filter: “To compute the output of a median filter, an odd number of sample values are sorted, and the middle or median value is used as the filter output.’’ Since the CNN-based median filter in Eq. (274) uses only eight sample values in N l ( i , j ) , it is not a median filter and its result is very sensitive to initial conditions. For example, suppose these eight values are sorted into a nondecreasing order as ( U I ,. . . , 114, U S , . . . , U X } and 114 # u5. If x f j ( 0 ) < 114, then xfJ(o0) = 114 6 , 6 + O+. If x;,;(O) > us, then x,,;(oo) = 115 6, S ---f 0-. If x i , ( 0 ) is a random number such that 114 < x i j ( 0 ) < U S , then x i j ( o 0 ) = .x;,,(O) is also a random result. When ( U S - 114) is big, (e.g., there exists an edge in N I(i, j ) ) , this CNN-based median filter seems to output a random result when noise exists in x,,j(O). So, this CNN-based median filter is much worse than that in Eq. (264). Also, it cannot be a rank-order filter with different biases.
+
+
Rekeczky et d.(199Sa) and Roska and Kek (1993 did not give the explicit expression of the output function hut described it as (Rckcczky et nl.. 1995a p. 684) “a sigmoid-type piecewise linear function.”
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
375
6
6
e
I
2
2
0
0
2
2
1
4
6 80
6
80 0
" "
(C)
FGUKE31. (a) The difference of gray value between a median filter and the CNN in Eq. (274) with initial condition .r,,(O) = -1. (b) The difference of gray value between a median filter and the CNN in Eq. (274) with initial condition x , , ( O ) = 1 . (c) The difference of gray value between a median filter and the CNN i n Eq. (274) with initial condition x l , ( 0 ) = ul,.
To show this kind of randomness in a real image processing problem, we use the CNN-based median filter in Eq. (274) to process the gray-scale image shown in Fig. 25(a). Figure 31a shows the difference between the median filtering result and the output of CNN in Eq. (274) with initial condition x,, (0) = - I . Figure 3 I b shows the difference between the median filtering result and the output of CNN in Eq. (274) with initial condition x,;(O) = 1. Figure 3 I c shows the difference between the median filtering result and the output of CNN in Eq. (274) with initial condition x;;(0) = u;;. VI. LEARNING ALGORITHMS OF FCNN In this section we present the results that distinguish an FCNN from a computational array. It means that an FCNN can learn its weights from examples or from existing knowledge and the experience of human experts.
376
TAO YANG
There are many references on learning algorithms of different CNN structures (Harrer et al., 199 la,b; Brucoli et al., 199Sa,b; Nossek, 1994, 1996; Tetzlaff and Wolf, 1996a,b; Gunsel and Guzelis, 1995; He and Ushida, 1996; Aizenberg and Aizenberg, 1994; Utschick and Nossek, 1994; Tzionas, 1996; Magnussen and Nossek, 1992, I994b; Magnussen et al., 1994; Guzelis and Karamahmut, 1994; Schuler et al., 1992; 1994; Mizutani, 1994; Balsi, 1994; Hansen, 1992; Guzelis, 1992; Szolgay et al., 1992; Puffer et al., 1995; Van Dam et al., 1994; Aizenberg et ul., 1996; Roska, 1992; Kozek et al., 1993; Sziranyi and Csapodi, 1994; Zou et al., 1990a,b; Vandenberghe et al., 1990; Pelayo et al., 1990). Learning is one of the promising properties of CNN that distinguishes a CNN structure from a parallel computational array. On the other hand, FNN literature also provides us with many special learning algorithms on the high nonlinearity of FNN (Yamakawa and Furukawa, 1992; Ishibuchi, 1993; Blanco et al., 199Sa). Nourished by these two fields, the learning algorithms of FCNN were developed. One difference between the learning algorithms for FCNN and those for conventional CNN is that the learning algorithms of FCNN may have linguistic variables as their examples (input-output pairs). In some cases, when the experience of a human expert is easily obtained and the measuring data is difficult to obtain, this learning ability is very useful. In Sect. IV, we have shown that additive FCNN is a universal framework for implementing different kinds of mathematical morphology operators. Although as shown in Sect. IV mathematical morphology is very useful in signal processing, one key problem is the choice of structuring elements for different tasks. Normally, structuring elements are chosen by trial-and-error methods. Recently, some morphological (neural) networks with learning ability were presented (Davidson, 1992; Davidson and Hummer, 1993; Davidson and Ritter, 1990; Araujo and Ritter, 1992). But the structure of a morphology (neural) network is too complicated to be implemented with the state-of-the-art VLS1 techniques. On the other hand, we find that when FCNN are used as computational arrays, some of them (see examples in Sect 1V.B) are in fact morphological networks. Thus we can train FCNN with examples and find structuring elements from the training results. Because FCNN is a combination of two mature fields: fuzzy set theory and CNN, many regions are waiting to be explored. At the very beginning of our attempts to set up the framework of this brand new field, there were two basic motivations. One involved mathematical morphology (Serra, 1982, 1988; Heijmans, 1992), which is a very elegant framework for signal processing from the geometrical point of view. We found that both FCNN and mathematical morphological operators share two elementary features, local connectedness and max/min operations. The other motivation involved the necessity of developing an interface between human experts (users) and low-level conventional CNN structures.
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
377
An FCNN structure can be used as either a computational array or a learning array. In Sects. IV and V, FCNNs were used as computational arrays. However, the learning ability of FCNN is also a very important aspect because only when an FCNN can learn its parameters from both real number examples and linguistic statements can it actually perform as an “intelligent” interface between human experts and low-level CNNs (e.g., the conventional CNN).
A. Learning Structuring Elernents
I n this section, we present some learning algorithms for additive FCNNs. The learning algorithms are based on the DTFCNN structure. Although DTFCNN can be viewed as a corresponding concept of DTCNN (Harrer and Nossek, 1992b), it is not necessary to obey the tradition of standard DTCNN in which the output should be binary. The learning algorithms presented in this section are used to learn structuring elements from examples. In this view, these DTFCNN structures are mathematical morphology networks with learning abi I i ty . A general framework of type-I1 DTFCNN is given by
where F A ( . )and F H ( . )are two local fuzzy operations defined in N,.(i, , j ) and A , ( i , j ; k , I ) and B + ( i ,j ; k , I ) are fuzzy feedback synaptic weight and fuzzy feedforward synaptic weight, respectively. In this section, we study the learning algorithm of the following type-I1 DTFCNN: xij(k
+ 1 ) = F;k,EN,(;.;)(B,f(i,j ; k I ) , u ~ ) .
(277)
This DTFCNN is a kind of uncoupled DTCNN. It maps its input to its output by a single iteration. This computational structure is very useful in iinplementation of mathematical morphology operators (Heijmans, 1992). 1. Leurning Algorithm cf Additive Type-I1 DTFCNN
In Yang and Yang (1997d, e), we have shown that the following DTFCNN is very useful to implement gray-scale mathematical morphology transformations:
378
TAO YANG
State equation:
+
v
(BfIllax(i, j ; k ,[)
+w),
C U E N , ci.;,
(278 1
1 l i l M , 1 5j ( N .
As the operations between fuzzy feedforward synaptic weights and inputs are additions, the foregoing DTFCNN is called additive DTFCNN. The output equation is given by I
Y i j ( k ) = f ( x i j ( k ) ) = ,(lx;j(k)
+ 1I
-
l s i ( M , l ~ j ~ N .
Ixi,(k)
-
111, (279)
The parameters of DTFCNN in Eq. (278) for implementing erosion are given by Bymax= undefined, By ,,in = -S (280) where S is the structuring element. The parameters of DTFCNN in Eq. (278) for implementing dilation are given by R~,,,,, = S n , B f m i n= undefined (28 I ) where S D is given by S D = (-x : x
E
S).
In this section, we study how an additive DTFCNN can learn the structuring element when only a set of examples ((u;,, O,,;))is available. The quantity ( u ; ; ) is input set and (O;,} is output set. As structuring elements are embedded in the feedforward templates of the DTFCNN, the objective of training the network is to adjust weights so that a set of input produces the desired set of output. This is driven by minimizing the square of the difference between the desired output [O;,) and actual output ( y ; ; } ,for all the samples to be learned (283) It is well known that
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
379
Let us expand the first terms in the right-hand side of Eqs. (284) and (285) as
and expand the second terms in the right-hand side of Eqs. (284) and (285) as
Then let us expand the third term in the right-hand side of Eq. (284) as
Then we consider the so-called "smooth derivative" (Blanco ef ul., 1995a) of min(y, x). In the classical sense, max(y, x) is derivable into the open intervals y ix and y > x but the derivative is not defined at y = x, that is,
From Eqs. (284) and (291) we know that the DTFCNN will stop learning whcn y > x.This makes the learning process very slow. In the worst case, this
380
TAO YANG
can even make the learning process impossible. To overcome this problem, we notice that Eq. (291) only gives the crisp truth value of statement: “ y is less than x.” In this view, we can fuzzify Eq. (291) using different schemes to make the DTFCNN learning in a fuzzified way. One method can be found in Blanco er al. (199.54. However, this method can not be used here, we fuzzify our “smooth derivative” as
where y E [ - I , 11, x E [ - 1 , 11. Similarly, the third term in the right-hand side of Eq. (28.5) can be expanded as
and
Similarly, we use the following “smooth derivative” to guarantee the learning process of DTFCNN in Eq. (28.5)
aE We denote -~ by 6 ; j , therefore ayi, -
aE ax;.; - 6 . .@. . aw(i, j ; k , I ) ” aw(i, j ; k , I )
(297)
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
38 1
where w(i, j ; k , I ) denotes B f ,,,,,,(i, j ; k , I ) or B f m a x ( ij, ; k , 1 ) . Finally, the changes for the weights will be obtained from a 8-rule with expression
where w ( i , j ; k , I ) as that in the Eq. (312); p is a positive constant. 1. Exuniples
Here the learning algorithms of DTFCNN are used to learning structuring elements from examples. We use two examples to show the usefulness of the learning algorithms to structuring element learning. Let the structuring element S be 0.11 0.15 0.13 0.16 0.19 0.18 . (299) 0.12 0.17 0.14
1
Then we have
0.14 0.17 0.12 0.18 0.19 0.16 0.13 0.15 0.11 and
Bfmin = -S =
-0.1 1 -0.16 -0.12
!
-0.15 -0.19 -0.17
-0.13 -0.18 -0.14
I
.
(301 j
Then we use the dilation operator to generate 2000 samples {(u;;,Oi;)) as the training data to train a dilation DTFCNN. The learning process of
the /Iflnax template is shown in Fig. 32. Observe that the elements of Bfmax approached to correct values (see Eq. (300)). The initial condition for B,,,,,, template is 0 and ,LL = 1. As the B f m a xtemplate is of size 3 x 3, we only need a 3 x 3 DTFCNN to learn the structuring element. Next we use the erosion operator to generate 2000 samples [ (u;,, O;,)} as the training data to train an erosion DTFCNN. The learning process of the B , fmin template is shown in Fig. 33. Observe that the elements of Bfn,in approached to correct values (see Eq. (301)). The initial conditions for B f m i ntemplate is 0 and p = 1. These examples show that our DTFCNN learning algorithms work well.
382
TAO YANG
0.22 I
I 1
1'
0.2
Bfmax(2.2)
rJ
g 0.18
a
-6 0.16
!2
0.14 J,
.-u
5
0.12
w
0.1 0.08 n nr. I!.,,"
200 400 600 800 1000 1200 1400 1600 1800 2000 Number of Training Examples
FIG~JW, 32. Learning process of dilation DTFCNN. -0.06
I
I
-0.08 C .+
-0. I 2 m -2 -0.12
-
2 -0.14
u-+.---
L
0
$ -0.16 .-
-
Bfmin( 1.3) Bfiiiin( 3,3) Bfmin( I .2)
c
C
w -0.18 -0.2
-0.22
I 200 400 600 800 1000 1200 1400 1600 1800 2000 Number of Training Examples FIGURE
33. Learning process of erosion DTFCNN.
B. Advanced LeLirning Algorithm of Additive Discrete-Time FCNN The breakpoints of min and max operators pose a big problem regarding finding their derivatives. In practice there exist two kinds of methods to overcome this problem. The first one uses bounded-addition and multiplication to replace the min and max operators. Although this method bypasses the problem of derivatives, the trained network may be functionally very different from the original one. The second one involves developing a rigorous and systematic
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
383
theory for the differentiation of min and max functions by means of step function (Marks er ~ d . 1992), , functional analysis (Zhang er al., 1994), and some special functions (Zhang ef Nl., 1996). For the purpose of deriving the &learning law for DTFCNN, we have to cope with the partial differentiation of E with respect to lzf mln(i,j ; k , 1 ) and B, ,,,,,(i, j ; k , I); such a differentiation can not be given in a conventional sense. By Theorem 8 of Zhang et (11. (1996) we know that the following two expressions are satisfied almost everywhere in real field 91:
where a denotes a partial derivative in a conventional sense and 3 denotes the partial derivative presented in Zhang et ul. (1996). Let us expand the first term in the right-hand sides of Eqs. (302) and (303) as (304) and expand the second term in the right-hand sides of Eqs. (302) and (303) as (305) Then let us expand the third term in the right-hand side of Eq. (302) as ax; ;
384
TAO YANG A
A
-
where Y = B f . m i n ( i , j ; P , 9) + u p q and x = A ~ ~ l ~ N ~ ( i , , j ) , ( e , / ) #r (n i~n (,i 7, ~j ); ( B f k , 1 ) ~ e)./ Becauses min(.. .) and max(., .) are not differentiable functions in the conventional sense, we need to show that under certain conditions all min-max functions are continuously differentiable almost everywhere in the real number field !N. Fortunately, a rigorous theory on this problem was presented in Zhang et al. (1996). To make this section self-contained, we need the following definition and lemma:
+
Definition 9 (Definition I , p. 1141, Zhang et al., 1996). A function !I1 H !N on the real number field !I1 is defined as
flor
:
(307)
Proposition 10. (Corollary I , p. 1143, Zhmtzg et al., 1996). Suppose ci i s N real number and f (x), 121 (x) = ci';.,f (x), cind h 2 ( . x ) = a r \ f (x) are real iwriable functions. If they lire all differentiable at point x, then (308)
It follows from Proposition 10 that
(310) Similarly, the third term in the right-hand side of Eq. (303) can be expanded as
(31 1)
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
--
3E
aw(i, j ; k , 1 )
= 6,/4,-
;Ix, /
3w(i, J ; k , I )
385
(312)
k , l ) . Finally, the chanwhere w(i,j ; k , l ) denotes B +,,,,(i, j ; k , l ) or B f lllax(i,j; ges of weights will be obtained from a 6-rule with expression
(313) where w(i, j ; k, 1 ) is the same as that in Eq. (312); p is a positive constant. The following two theorems guarantee that the learning algorithm in Eq. ( 3 13) makes sense almost everywhere in !li and the learning result will be a local minimum of the cost function E .
Theorem 18. For the erosion DTFCNN in Eq. (280)and the dilution DTFCNN in Eq. ( 2 8 / ) ,arid the cost function in Eq. (283), the partial diflereritiuls in Eys. (306) und ( 3 1 I ) exist almost everywhere in !I{. Proof Because x;,;s in both erosion DTFCNN and dilation DTFCNN are i)-functions, that is, functions containing c and/or , i operations, it follows from Corollary 4 of Zhang et ul. (1996) that the partial differentials in 0 Eqs. (306) and (31 I ) exist almost everywhere in !N.
(c.
Theorem 19. The &rule given in Eq. (313)guarutitees the erosion DTFCNN in Eq. (280) mid the dilution DTFCNN in Eq. (281) will converge to a locul tnitzitnum c$ E in Eq. (283) with Probability I with incrensing iteration index. Proof Similar to the proof of Theorem 10 of Zhang et al. (1996), let us prove the theorem in two steps. First, using the similar process in the proof of Theorem 10 of Zhang et al. (1996), we immediately know that E in Eq. (283) is differentiable with respect to discrete time with Probability 1 . Then, as the second part of the proof, we show that E always decreases whenever it is differentiable. Suppose E is differentiable at time t , then
386
TAO YANG
I . Exumples In this section, the advanced learning algorithms of DTFCNN are used to learn structuring elements from examples. We use two examples to show the usefulness of the learning algorithms to structuring element learning. Letting the structuring element S be the same in Eq. (299), we then use the dilation ) training data to train a dilaoperator to generate 2000 samples ( ( u i , , 0 ; ; )as tion DTFCNN. The learning process of Bflrlaxtemplate is shown in Fig. 3421. Observe that elements of B j m a xapproach correct values (see Eq. (300)) within 300 iterations. The initial conditions for BfinaXtemplate is 0 and I-( = I . As the BfInaxtemplate is of size 3 x 3, we only need a 3 x 3 DTFCNN to learn the structuring element. Next we use the erosion operator to generate 2000 samples ( ( u ; , ; ,O;,;)] a s training data to train an erosion DTFCNN. The learning process of B,+illin template is shown in Fig. 34b. Observe that elements of Bfl,,iil approach correct values (see Eq. (301)) within 400 iterations. The initial conditions for B , illlil template is 0. We choose p = 1. Comparing the results in Fig. 34 and those in Figs. 32 and 33 we find that the learning time of the learning algorithms presented i n this section is much shorter than that presented in the previous section. 4 IJh
,
,
,
,
,
,
,
nsmhcr of ~r.iinmg~.x,unple\
niiinher 01 1c11~nm8e w n p I c \
1 4
(hi
.
,
FicnKE 34. Learning process of dilation DTFCNN and erosion DTFCNN by using advanced learning algorithms. (a) Training dilation DTFCNN. (b) Training erosion DTFCNN.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
387
To show that the learning algorithm can obtain correct learning results, we also show the learning results of different types of structuring elements. The next one is the so-called j k i t structuring element, which has all its elements on the same gray-scale value. We choose the flat structure element as
s=
i
0.2 0.2 0.2 0.2 0.2 0.2 0.2 0.2 0.2
I
The learning process of BJ,,,,, template is shown in Fig. 3%. Observe that approach correct values within 900 iterations. The initial entries of B f condition for B,,,,, template is 0. The learning process of Bfnlin template is shown in Fig. 3%. Observe that elements of B,,,,iIl approach correct values within 1400 iterations. The initial condition for B,min template is 0. Also for comparison, we present the results of the learning algorithms presented template and B f ,,,in in the previous section. The learning process of BJ template are shown in Figs. 35c, d, respectively. The results in Figs. 35b, d are somewhat misleading due to low printer resolution. After 1400 iterations, the learning emors in Fig. 35b are much smaller than those in Fig. 35d. We have performed extensive simulations using different templates and it cult to find cases in which the learning algorithms did not converge to the correct results (global minimum). As the surface of E is very complex, the choice of initial conditions and p is very important to get a global minimum. Zhang et al. (1996) proposed a method to get global minimum by randomly choosing many groups of initial conditions and then choosing the best one from these training results. However, the local minimum problem is still an open problem for almost all of the existing learning algorithms. C. Learning from L,inguistic Inputs
In this section, a learning algorithm for a type-lV DTFCNN is presented. Unlike the FCNNs we proposed before (Yang and Yang 1996, 1997d, e, f; Yang et id., 1996d, e, 1998g), this type-IV DTFCNN can process fuzzy number inputs and real number inputs. Its learning algorithm is based onfuzzy nurizher inputs. 1 . Structure of Type-IV IITFCNN
Generally, an FCNN can be used as a computational array or a learning array. As a computational array, with synaptic weights being predesigned and fixed, FCNN is a universal framework of mathematical morphology network (Yang and Yang, 3997d, e) and a paradigm of processing local linguistic statements (Yang ef al., 1996d3,19988). As a learning array, FCNN should organize its
0.5
II
I _I 1
0
w
02x,
I
I
.
,
,
,
,
,
,
,
,
200 41x1 600 K H I IlK)O I200 1400 I600 I800 2000 Nuniher ot Trainme Example, (hi
W W
FIGURE35. Learning process of dilation DTFCNN and erosion DTFCNN for flat structuring element. (a) Training dilation DTFCNN using advanced learning algorithm. (b) Training erosion DTFCNN using advanced learning algorithm. ( c ) Training dilation DTFCNN using the old learning algorithm of Sect. V1.A. (d) Training erosion DTFCNN using the old learning algorithm of Sect. V1.A.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
389
own knowledge by learning from examples that may be related to crisp numbers or fuzzy numbers. As the conventional CNN can only process numerical information from sensors (e.g., a camera), it can not learn from linguistic information. However, in a hybrid system, the knowledge represented by fuzzy if-then rules usually plays an important role in the high level of image processing and understanding. The learning ability of FCNN can bridge the gap between the linguistic knowledge and the low-level image processing ability of the conventional CNN (or, the conventional CNNUM). When we train a CNN, we need a set of examples that consists of a set of phenomena and a set of corresponding results. Therefore, we have to collect enough data. In some cases we feel that CNN is so limited that it can only learn knowledge from data that may be very expensive to acquire or wastes much time in collection. However, human experts have accumulated a huge body of knowledge and experiences that can not be expressed by data but by linguistic statements. If we can make a CNN-based hybrid system that is smart enough to “understand” and “learn” knowledge from a human expert, we may save both money and time. In this view, FCNN functions as the interface between the human expert and the low-level conventional CNN. In this section, we proposed a DTFCNN structure that can be trained by fuzzy number (i.e., a convex and normal fuzzy set on a real line (Kaufmann and Gupta, 1995)). This DTFCNN has a crisp structure, which allows fuzzy number information to flow through it. Therefore, the synaptic weights in this DTFCNN are crisp set while the inputs, states, and outputs are fuzzy numbers. This DTFCNN structure can process the knowledge of a human expert. This DTFCNN is a type-1V FCNN. In particular, we teach this DTFCNN how to remove impulsive noise in an image using linguistic variables. To remove impulsive noises in images, median filters are usually used (Arakawa, 1996; Mancuso et al., 1996; Pitas and Vanctsano-Poulos, 199 1 ). Although median filters have some edge-preserving capabilities, they distort the fine structures of images (thin lines in the image may disappear and the image becomes slightly blurred). One can use weighted median filters (Yli-Harja et al., 1991) or conditional median filters (Arakawa, 1996; Mancuso rt al., 1996) to improve performance. However, setting weights of a weighted median filter is very difficult, so we do not discuss this kind of median filter. A conditional median filter outputs the median value if an impulsive noise is identified and keeps the input value unchanged if no impulsive noise is identified. Identification of impulsive noise thus plays the most important role in a conditional median filter. In Arakawa (1996) and Mancuso et LZI. ( 1 996), the fuzzy rule-based methods are used to identify impulsive noise and have high performances. However, it is difficult to design the fuzzy rules and choose the membership functions. To overcome this problem, the DTFCNN
390
TAO YANG
learning algorithm is used to learn these fuzzy rules from those linguistic examples that are based on our experience. In this section we use the symbol ‘‘-” over a character to denote a fuzzy number. To reduce the computational complex, the LR-type fuzzy number (Dubois and Prade, 1980) is used. A fuzzy number X is said to be LR-type if and only if
where pi is the membership function of X. g L ( . ) and g ~ ( . are ) the reference functions for left and right references. The c denotes the mean values of X and u and b denote left and right references, respectively. If LZ and h are both zero then X is degraded to a crisp number. We define addition of two fuzzy numbers X and j as
and define multiplication of a real number k and a fuzzy number
X as
m ( y ) = m a x M x ) l y = &I.
(3 18)
For a monotonically increasing function f ( . ) , we define f ( X ) as
The h-level set X” of
X is defined by
2’ = {XI/.Lh(X) 3 h. h E
(0, I]).
(320)
So, X” is a closed interval denoted by
where the subscripts “L” and “CJ” denote the lower limit and the upper limit, respectively. We define addition of two intervals [ X L , X U ] and [ y ~y , ~ as] (Alefeld and Herzberger, 1983) [XL,
xu1
+ [ Y L . vul = 1x1, + Y L , X U + Yul
(322)
and define multiplication between a real number k and an interval [ x ~xu] , as (Alefeld and Herzberger, 1983) (323)
391
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
inax(., .) and min(., .) operations are defined by
For a monotonically increasing function f ( . ) we define
~ ( [ x L xu]) ,
as
(327) (328) (329) (330) (331) A cell C;, in an M x N DTFCNN used in this section is defined by state equation:
+
+
wherei,,(t+ I)isthestateofC,, atdiscrete-timet I andi,,(r 1)isafuzzy number. The F ( . ) denotes a fuzzy local operator defined in r-neighborhood N,.(i, j ) and iikl is the input of CAIand a fuzzy number. Because the preceding DTFCNN does not have feedback synaptic weight, its output equation is given by
where f (.) is a monotonically increasing nonlinear function given by
(334) The conventional DTCNN has an f ( . ) as a sgn(.) function (Harrer and Nossek, 1992b). However, when an DTFCNN is subjected to a learning process, a continuous first-order derivative of f(.)should be used. , f ( . ) defined in Eq. (334) satisfies this condition.
392
TAO YANG
Remarks: The DTFCNN structure in Eq. (332) is completely different from the structures we proposed before. In our previous FCNN structures the membership values were mapped to either crisp values or other membership values, which are real numbers, that is, only real numbers are propagated through these FCNN structures. The DTFCNN structure in Eq. (332) can map fuzzy numbers to crisp values or fuzzy numbers and allows fuzzy numbers to propagate through it. Although we demonstrated that the general FCNN structure is not a kind of conventional NCNN, we did not present examples of FCNN structures that cannot be included in the classical CNN with nonlinear synaptic laws. However, the DTFCNN in Eq. (332) is totally different from any kind of conventional NCNN because the fuzzy number can flow through this structure. This DTFCNN structure is very useful to classification problems where input patterns are fuzzy numbers. As the structure in Eq. (332) is very general, we would like to study the learning algorithm, using one of its simple forms as follows:
This type-IV DTFCNN shares the same mathematical form of a simple min/max FCNN we proposed before. However, as its inputs and states are totally different from those we proposed before, it is a new FCNN structure. 2. Leartiing Algorithm of Type-IV DTFCNN In this section we propose the learning algorithm of the type-IV DTFCNN for two-class classification problems. Assume that we have the following example set:
{(h,L O,/)} where
(G;i)
(336)
is a set of fuzzy numbers given by
O;, is a classification result given by
“1
1 = 0:
{
if C,, belongs to class 1 if C , , belongs to class 2
’
(338)
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
393
If we use the output of C;; to denote the classification result of C i j , we have I , if C;; belongs to class I (339) j i j ( f ) = 0, if C,, belongs to class 2
c
'
From the foregoing one can see that j ; j ( t ) degenerates to a crisp number. It is because the nonlinear output function .f(.> functions as a defuzzifier. An explicit expression of this kind of defuzzifier can be given by (340) where h E (0, 11. Then, given an h-level set of following cost function:
IIJ, our training
objective is to minimize the
(341) where
From this one can see that we should train the DTFCNN using different 12-level sets. An increase of the number of h-level sets improves the training results but also increases the training time. Therefore, there exists a tradeoff between the number of k-level sets and the perforinance of training results. To train the DTFCNN, we use the following learning rules to update two kinds of feedforward synaptic weights B I ( i , j ; p , q ) and B2(i, j ; p , q ) , respectively:
where t is the learning iteration, C,), E N,.(i, j ) , and a and /3 are learning rate and momentum rate, respectively. In the right-hand side of Eq. (343) the aE/i3B1(i? j ; p , q ) is given by
aj!7.
aE - i)E a.il;,; IJ ( i , j ; p , q ) iI-j;,j i~t;;;~ B I(i, j ; p , q )
P6U
DNVA 0V.L
394
TAO YANG
pul? (LPE)
(347)
As X I / is a fuzzy number, then we can train the DTFCNN using the h-level interval numbers as
aq,
(348) where
Then we consider the so-called "smooth derivative" (Blanco et LII., I995a) of max(u1, E). In the classical sense, max(uI, E) is derivable into the open intervals u I < E and L I I > E but the derivative is not defined at 111 = E, that is,
From Eq. (350) we know that the DTFCNN will stop learning when u 1 < E. This makes the learning process of DTFCNN very slow. In the worst case, this
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
395
can even make the learning process impossible. To overcome this problem, we notice that Eq. (350) only gives the crisp truth value of the statement: “ u , is greater than E.”In this sense, we can fuzzify Eq. (350) using different methods. One example can be found in Blanco et d., (1995a). However, the method used in Blanco et id., (199%) can not be used here, we fuzzify our “smooth derivative” as arnax(u1, E) a11 I
=i 1,
if u I > E (351)
Then we have
Similarly, in the right-hand side of Eq. (344) the aE/aB?(i, j ; p, 4 ) is given by
(353)
(354)
396
TAO YANG
Similarly, the “smooth derivative” of min (u?,y ) is given by (356) Then we have
3 . Application to Impulsive Noise Ident~fificution Impulsive noise in an image can be removed by using a nonlinear filter, such as a medium filter, rank-order filter, or mathematical morphology operator. However, almost all of the foregoing filtering methods blur fine structures of the parts of the image where impulsive noises do not exist. Thus a kind of expert knowledge-based method exists that can remove impulsive noise while keep regions without impulsive noise unchanged (Mancuso et al., 1996; Arakawa, 1996). The first step in this kind of method is to identify the location of impulsive noise based on linguistic statements of knowledge of impulsive noise. If we assume that an image is smooth enough, then impulsive noise will introduce a significant difference of gray value from its neighbors. Our visual system has the experience that if a pixel has a gray value that differs significantly from all its neighbors it should be impulsive noise. To make this experience understandable to a DTFCNN, we first translate it into a set of fuzzy if-then rules (Mancuso et d., 1996). Considering a 3 x 3 neighborhood (N I ( i , , j ) ) and use u;,; to denote the gray value of pixel (i, j ) in the image, we have If Iu;,,;-l - ui;l is big and l u ; , ; + ~- “;,I is big and If lu;-~,;- u;;l is big and Iu;+l,, - u,jl is big and If I U - ~ ~ ; - I - u;,l is bigand Iu;+l,;+l - u,;I is big and If lu;-l,,j+l - u;;l is bigand lul+l,,j-1 - u;,;I is big THEN u;,; is an impulsive noise. where “big” is a fuzzy number. As the characteristics of impulsive noise are changed from one image to another, human experts will have different
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
t
FiGLJRE
397
M em hers ti i p
36. Membership ltinctions of three fuzzy numbers: srrrtrll (S). mic/d/c~( M ) ,and hig (13).
qualitative statements for “big. The DTFCNN can do a trddeoff between the judgments of human experts by learning from different linguistic examples (knowledge from different human experts). To train the DTFCNN, we detine the fuzzy numbers big, middle, and smdl as shown in Fig. 36. From Fig. 36 one can see that membership functions of fuzzy numbers smull, middle, and big can be expressed by ”
From the preceding we know that we can use the following DTFCNN to identify impulsive noise:
where B,,, = {Bma,(k,/ ) ] 3 x 3 and &,in = {B,ill(k, / ) ] 3 x 3 are two feedforward templates. The quantity UT+n,J+I denotes the fuzzy number that is used to describe the uncertainty of lu;+~,,+/- ui,l. Since I”;, - u;jl = 0 is always true, Bnlill(O,0) and B,,,,(O, 0) are don’t mre entries. In this section, we let B,,,,,(O, 0) = 0 and B,,,,,(O, 0) = 0. As the DTFCNN in Eq. (361) is space-invariant, we only need a 3 x 3 DTFCNN to learn a 3 x 3 template. And as the training process only needs
398
TAO YANG CLASS I input
output
CLASS 2 input
output
input
output
input
output
input
output
input
output
input
output
0 s
OM HB
input
input
output
output
m a n y o f S , M a n d B mdon'tcare
(i ~mpu~sive J noise @)
not an impulsive noise
FIGURE 37. Illustrations of patterns of training examples in two classes. Class I denotes that an impulsive noise exists. Class 2 denotes that an impulsive noise does not exist.
knowledge from a human expert, we can generate the training examples as shown in Fig. 37. In Fig. 37, each input pattern denotes a possible configuration of iiT++r,j+,sin N l ( i , j ) . Two classes of examples are illustrated. There is only one pattern in class 1 that has output 1 (impulsive noise) while all 8 patterns in class 2 have outputs 0s (not an impulsive noise). We train the DTFCNN using examples choosing from class 1 and class 2 randomly. During the first 2000 examples, we chose 80% of examples from class 1 and the rest from class 2. This makes the learning process faster. After that, we chose only 8% of examples from class 1. This makes the learning process slower and smoother. Figure 38 shows the learning curves of & i n ( 1 , I ) and &,,( I , I ) with parameters: (Y = 0.5, p ( t ) = 0.5 x (0.999)'. The initial values of the entries of templates &,in and B,,,, are chosen randomly in interval (0, 1). As the ranges of inputs and outputs are in [0, 11, in the learning process we restrict the dynamical ranges of B,in(k, I ) and B,,,,(k, I) i n interval [-1, 11. From Fig. 38 one can see that Bmin(1,I ) approaches 1 while B,,,(l, 1 ) approaches 0. In this simulation, we use 3 level sets ( h = and 1 ) of each fuzzy number input pattern to train DTFCNN. After being trained by 40,000 examples, the DTFCNN learns the following templates:
i, i,
399
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS 1
0.9
- 0.8 Z 0.7
m
0.6
3
5 0.5
- 0.4 7
.z v
m
0.3 0.2 0. I
'
012
014
016 018 'I 112 114 Number of Training Examples
1.6
2
118 x
lo4
FIGURE. 38 Learning curve\ o f B,,,, (1, 1 ) and R,,,,, ( I . I )
B,,,,, =
0.996746 0.996926 0.994743 0.996385 0.000000 0.997409 0.997262 0.997369 0.996796
i
)
,
-0.000003 -0.000001 -0.000002 -0.000001 0.000000 -0.000002 -0.000002 -0.000002 -0.000003
(362)
Observe that every entry in B,,iIl is very close to 1 and every entry in B,,,,, is very close to 0 (one should notice that the central entries of both templates are don't care entries). We then use the templates in Eq. (362) to process a 63 x 63 gray-scale image of 256 gray levels, which contains impulsive noises of mean value 220 and deviation 35, shown in Fig. 39a. The image in Fig. 39a is used as u ; j , 1 5 i, j 5 63; u,, is normalized such that condition
(363) is satisfied. After training, synaptic weights of the DTFCNN can be fixed and the DTFCNN is also degenerated into a computational array whose inputs and outputs are crisp values. Thus the crisp form of the trained DTFCNN used in this simulation can be written as
(364)
400
TAO YANG
(c)
FIGURE 39. Computer siniulation results of impulsive noises identilied using the trained DTFCNN. (a) Image containing impulsive noises. (b) The output of the trained DTFCNN. ( c ) The threshold result of (b).
Figure 39b shows the output of the foregoing DTFCNN, from which one can see that every impulsive noise is identified except those in the first and the last rows and those in the first and last columns. It is because we used these cells as dumb cells (boundary cells) for 3 x 3 templates in our simulation. Figure 39c shows the threshold result of Fig. 39b, from which one can see that all impulsive noises are identified. We never use crisp examples to train the DTFCNN but it works well when it processes crisp Inputs.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
401
Because fuzzy numbers can be propagated through this DTFCNN structure, this DTFCNN can learn its templates from linguistic knowledge. In some cases where training examples are difficult to collect and the knowledge of a human expert is available, this structure is very useful. On the other hand, this structure can also be used as an interface between the conventional CNN and the human experts, designers, and users of a CNN-based hybrid image processing system. Therefore, this structure will extend the CNN concept from low-level image processing to high-level image processing and from structurebased image processing to knowledge-based image processing.
FCNN structures can be used effectively to solve fuzzy IF-THEN-ELSE rulesbased image processing problems. Given a set of local fuzzy rules, a systematic method was presented for selecting the corresponding FCNN structures in Sect. V. The membership functions of the linguistic variables used in the fuzzy rules should be chosen according to different rules. In a fuzzy IF-THENELSE rule the membership functions, whose choice is usually a very difficult and time-consuming process, play very important roles. In this section, a real coded genetic ulgorirhrn (GA) is used to optimize the membership functions of the chosen FCNN structure. The corresponding crossover and mutation operations are presented. The crossover operation consists of three schemes that are a tradeoff between the evolution of the best individual and that of the other population. The mutation operation consists of a local one and a global one. The local one makes the evolution search the local basin of the best individual while the global one makes the evolution search the global problem space to overcome the trap of a local optimization. Then the GA is used to optimize the membership functions for solving the edge extraction problem with ill-conditioned examples. A. Genetic Algorithm ,for Optimizing FCNN
GAS are optimization approaches motivated by creature evolution. They cumbine robustness with the ability to explore huge search space quickly. The basic knowledge of GA can be found in Davis ( 1 99 1 ) and Goldberg ( 1 989). The GA exploits the collective learning process within a population of individuals, and each individual represents a search point in the space of potential solutions to a given problem. The applications of GA to fuzzy logic (Hanebeck and Schmidt, 1996; Back and Kursawe, 1995; Lozano et al., 1995; Tryba et al., 1995) can be lumped roughly into two categories: 1) optimization of the membership functions of fuzzy sets; and 2) automatic learning of fuzzy rules.
402
TAO YANG
We use GA to optimize the membership functions of FCNN. The correct choice of the membership functions plays an important role in the design of FCNN. There are some applications (Hanebeck and Schmidt, 1996; Back and Kursawe, 1995; Lozano et al., 1995; Tryba et ul., 1995) that show that GA are capable of optimizing membership functions. The basic idea is to represent the complete set of membership functions by an individual and to evolve shapes of the membership functions. We use GA only to optimize the normalized trapezoidal membership functions, which can be represented by a 4-tuple ( a ( ' )d , 2 )d, 3 )d4)) , as follows (Bojadziev and Bojadziev, 199.5):
(36.5) (0,
otherwise
where A is a trapezoidal fuzzy variable. A typical GA starts with a randomly chosen population of individuals. Then this population undergoes evolution in the form of natural selection. In each generation, relatively good individuals are reproduced, providing offspring that replace the relatively bad individuals which are eliminated. An evaluation or fitness function is used to distinguish good and bad individuals. A typical GA consists of three basic operations: 1) evaluation of individual fitness; 2) formation of a gene pool, and 3) recombination using two basic genetic operators-crossover and mutation. The GA used in this section is shown as fol lows :
/*initiuli;.e*1 Generation t = 0; Initialize the gene pool = G P (0); while (not termination-condition) do generation t = t I ; select individual C ( t 1 ) = (c,}E G P ( t - 1); crossover c,, c, E C ( t - 1) and get C ( t ) ; evaluation and selection C ( t ) and get GP ( t ) ; mutation G P ( t ) ; end
+
I
+
Since in an FCNN-based FIRE edge extractor, only two fuzzy variable ZERO and WHITE are used, the k"' individual in the gene pool ck can be represented by
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
403
where (a,( 1 ) , czk( 2 ), (ik(3) , cik(4)) IS a 4-tuple for determining the trapezoidal membership function of ZERO and (hi”, h;”, h f ) ,hi4’) is that for WHITE. The initialization of the gene pool is given by the following process. Suppose is a pseudo-random number distributed in (0, I ) . It is easy to see that the only choice of a:’) of p ~ & ) is a : ’ ) = 0. Then aL4)is chosen by ’
(4)-
Uk
and
c,:.~)
-
5
(367)
2
and a:,”), respectively, are chosen by
(369) Observe that hi” for p ~ ~ (has x )a best choice of 6;’ given by t 6,( 1 ) 0.5 3
= I . Then 0;” is
+
(370)
and then by’ and b13),respectively, are chosen by
Crossover is given by the max-min-arithmetical algorithm presented in Lozano et al. (1995). Assuming that C I = (cl(l), c1(2), . . . , cl(8)) and C? = {c?(l),c2(2), . . . , ~ ( 8 )are ) two individuals to be crossed, then the four offsprings are given by
CT
= ( ~ y ( i ) l c y (= i)~
C ( iI)
+ (I
c; = (c;(i)lc;(i)= ( 1 - (~)q (i)
I , 2, . . . , 8)
(373)
+ a c l ( i ) ,i = 1,2, . . . , 8 )
(374)
-
( ~ ) ~ 2 ( i i) ,=
+ c?(i)),i = I , 2, . . . , 8) ci = ( c z (i )l c z (i )= min(c1 (i) + c ? ( i ) ) ,i = 1 , 2, . . . , 81 C;
= {c;(i)lc;(i)= max(cl(i)
(375) (376)
where a E (0, 1 ) is a constant. And then the best ones are selected. There are three crossover schemes used in our GA. The first one, which occurs with a probability pel, is the crossover between the best individual and the
404
TAO YANG
worst one. And then the best offspring is substituted for the worst one. The second one, which occurs with a probability p C 2 , is the crossover between the best individual and any of the subworst ones. And then the best offspring is substituted for the subworst one. The third one, which occurs with a probability pc.3,is the crossover between any two of the subbest ones. And then the two best offsprings are substituted for the two subbest offsprings. Mutations consist of a local mutation scheme and a global mutation scheme. The local mutation, which occurs with a probability p , ? ! ~is, given by the following process. Assuming that an element of an individual ck = (ck ( 1 ), . . . , , 8, is chosen for local mutation and that , ck(8)),ck(i),i = I , the domain of ck(i) is [d', d ' ] , then the result is a new individual ck = (ck(l), . . . , c : ( i ) , . . . , c k ( 8 ) ] ,where c ; ( i ) is given by
c:(i) =
{
c;(i) - c*(cE(i)- d), for 6 > 0.5 c i ( i ) ["(cf' - c : ( i ) ) , else
+
(377)
where {* is a pseudo-random number uniformly distributed in (0,l). The [d', d'7 for each element of individual ck is given by: (([0,0],[0,d3?,b'2),a'4)], 111, (lo, ~ ' 1 [b"), , P ) ] , [ P , b(411, [ I , 11)) The global mutation, which occurs with a probability pm2, is the same as initialization. Thus, the local mutation can be used to improve the existing individuals while the global mutation continuously added new types of individuals into the gene pool during the evolving process. To evaluate the performance of an individual ck, the output of FCNN in Eqs. (254) and (255), yi;, is compared with the ideal output (o;;)by using an error function (378) where
41, ( t )=
cr(t), for o,, = 0
(379)
is the evaluation weight. In our simulations, we let a ( t ) = 1 and B ( t ) = 0.5 0.5c. And we define the global fitness of the tth generation gene pool G P ( t ) ,E,,,,,as n E,in(t) = inin E k ( t ) (380)
+
k=l
where n is the number of population in the gene pool. B. Application to Image Processing
In this section, computer simulation results are provided. We use GA to choose p z ~ ( . and ) ~ w H ( . )Figure . 40(a) shows the original gray-scale image of size
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
3200 3000 1 2800 2600 E 2400 2200 1000 I800 I600 .
,
,
,
,
,
,
,
405
,
WE
\
63 x 63 with 256 gray levels. The image is normalized to [O,l]. Figure 40b shows a bad version of an edge detecting result. Observe that much noise exists in this result and the edge is almost indistinguishable. Then we use GA to learn p z , ~ ( . and ) p w (.) ~ from this ill-conditioned example. The parameters of the GA used in this simulation are chosen as:
population size: tz = 10. prohabili9 of croossover: p(I = 0.1, p ( 2 = 0.1, p'y = 0.2. Max-Min-Arithmetical crossover parumeter: CY = 0.618. prohuhility ( f rnutution pnlI = 0.2, I),,!?= 0.5. stop conditions: If El,,, trapped into a local minimum more that 45 generations.
406
TAO YANG
After 100 generations, we get the following individuals:
( a ‘ ’ ) C, I ( ~ ) a, ( 3 ) d4)) , = (0.000000, 0.00461 I , 0.007106, 0.043000) (b‘”,O”’, h‘3’, h(4))= (0.543300, 0.675663, 0.689058, 1 .000000).(381 )
The corresponding output is shown in Fig. 40c. Observe that the edge characteristics are enhanced while noise is suppressed. In this simulation, h in Eq. (255) is chosen as h = 0. One should notice that the contour of the entire face, which is ahnost diffused by noises in Fig. 40b, is perfectly recovered by the FCNN and GA as shown in Fig. 4Oc. As well, the contours of both eyes and the mouth are significantly recovered and filled into closed curves while those in the example in Fig. 40b remain broken lines. From the crisp or classical image processing point of view, the learned result in Fig. 40c is “terrible” because there exists a big error from the original example. However, from a human expert’s point of view (i.e., from our cognitive point of view), the result in Fig. 40c is much better than its original example. In the first view, this kind of improvement is unbelievable because our common sense is that any learning algorithm of an artificial neural network (ANN) is an approximation to its supervisor examples. The usual example that an ANN learning simulation can give is to use a perfect crisp algorithm to generate some input-output examples and then use an ANN to learn the known crisp algorithm from the input-output pairs. In this view, the trained ANN should not be better than the crisp algorithm. Why does the FCNN in the simulation perform better than its supervisor examples? In this case, improvement comes from the structure of the FCNN. When we come back to Sect. V.C. 5.3 and Fig. 27, we can find that the human expert’s intuition (or experience, knowledge) about the concept of “edge” has been embedded into the FCNN structure as shown in Eq. (254). While the learning ability of our brain is overemphasized these days, we should also remember that our brain has a unique structure that took nature millions of years to evolve. The structure of our brain is also a kind of knowledge. How can this kind of structural knowledge be made useful? The answer resides in learning. It we return to the example shown in Fig. 40, one can see that in the evolution of learning, the knowledge of edge detection embedded in an FCNN structure will gradually correct errors from the supervisor examples. Our simulation demonstrated that any distortion found i n the knowledge structure as shown in Eq. (254) or Fig. 27 gave a much worse result. From this example, one can also find that FCNN is a high-level CNN structure capable of embedding human experts’ knowledge in a very efficient way and performing some intelligent behaviors.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
407
Of course, if we can acquire the perfect examples, the conventional CNN may be a better choice to learn from these examples. In fact, an excellent example of DTCNN learning algorithm of edge detection had been proposed by Harrer et al. (1991a). The question that remains, however, is how the conventional CNN performs when only ill-conditioned examples are available. Without prior knowledge of the task embedded in its structure, the CNN will be puzzled and wander in the problem space and settle down to an arbitrary local minimum in the vicinity of its initial condition. Why not use both ill-conditioned examples and prior knowledge (usually represented by a set of rules or linguistic statements) to train our model‘? I f we can use this method, we can let our model use its “knowledge” to judge whether an example is good or bad. Then big weights are automatically assigned to good examples and small weights are automatically assigned to bad ones. Structural representation of knowledge plays a very important role i n human intelligence. We keep emphasizing that there are two motivations for inventing FCNN one is mathematical morphology and the other is to embed human knowledge into a CNN structure. This section provides another result motivated by the second of the two.
VIII. APPLICATIONS OF DISCRETE-TIME FCNN Although in most applications both continuous FCNN and DTFCNN can be used, DTFCNN has some unique applications that continuous FCNN can not perform. In this section we present some of these unique applications. A . Inipletnenting Nonlinear Fuz: y Operutors jor Image Processing
The fuzzy operators we discuss in this section are based on fuzzy IF-THENELSE rule bases. The parallel computation mechanics of DTFCNN are used to offset the computational complexity of fuzzy image processing problems. 1 . The Structure of DTFCNN
A DTFCNN is described by the following equations:
I . Input function sublayer (= fuzzifier layer)
408
TAO YANG
where Ekl(r) is the detected signal, for example, the output of a camera. The (.) is the membership funcquantity t denotes discrete-time iteration; C k / E'Ly~ N,(lJ) tion of the fuzzy variable embedded in cell Ci; and it is used by feedforward synaptic law. Finally, { ] denotes a set. 2. Cell dynamics sublayer (= fuzzy inference engine layer) (383)
where ck/:i;(i,)(.) denotes a fuzzy inference process in N r ( i j ) . Here fill(.) is the membership function of the fuzzy variable embedded in cell C i , and it is used by feedback synaptic law. The y k l ( t ) is output, which is given by: 3. Output function sublayer (= defuzzifier layer) (384)
where ck,:;(,Jl (.) denotes a defuzzifier function defined in N,.(ij).
2. Embedding Fuzzy IF-THEN-ELSE Rules into DTFCNN Fuzzy local image operations were developed as new image processing tools (Russo, 1992; Russo and Ramponi, 1994a, b, c, 1995) because there exist different kinds of uncertainties in image processing and image understanding. Some simple local fuzzy operators such as fuzzy shrinking and fuzzy expanding may be considered as types of mathematical morphological operations and can be readily implemented by type-I1 FCNNs (Yang and Yang, 1996; 1997d, e; Yang et nl., 1996d). Here, we use type-I1 DTFCNN to implement fuzzy IF-THEN rule-based image operators. One simple fuzzy operator is given by the following fuzzy rule:
Rule One IF ((Ekl)' is A ' ) AND ({Ek1l2is A 2 ) .. . AND ( ( E k l J Mis A''"), THEN (y,, is BI ( X ; C I , W ~11, I ELSE (yl, is B O ( X ; C OW .U ) ) for CJJE N , ( i j )
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
409
where Bo(x; CO, w g ) and B I ( x ;c1, W I ) are two triangularly shaped fuzzy sets, which are defined by (Bojadziev and Bojadziev, 1995) (385) A / ) , p = 1, 2, . . . , M are M fuzzy variables and y ~ is/ a crisp output of Rule One. Then we can use the following DTFCNN to implement Rule One.
I . Input sublayer for implementing
({EklJP
is A " ) , p = I , 2 , . . . M : (386)
2. Cell dynamics sublayer for implementing IF parts is
(387) By adopting correlation-product inference (Kosko, 1992), we get the third sublayer as follows. 3. Output sublayer for implementing THEN-ELSE part
For comparison, the preceding DTFCNN structure can be summarized into a form similar to that of a conventional CNN as follows: 1. State equation
Of course, one set of fuzzy IF-THEN rules may be too simple to solve a practical problem. We usually need more that one set of IF-THEN rules in fuzzy image operator. For example, Russo and Ramponi ( 1 995) use a fuzzy
410
TAO YANG
rule set that contains 32 IF-THEN rules and one ELSE rule. Usually we should consider the following fuzzy rule:
Rule Two IF ( { E u } " is A " ) AND ( { E k / ) 1 2is A ' * ) . . . AND ( { E ~ I } ' ~ ~ is A'"'' 1, THEN (y;,; is B I(x; C I , W I ) ) ,
IF ( { E L / } "is A " ) AND ( { E L / } 'is * A ' * ) . . . AND EL/}'"'^ i? THEN (y,, is B , ( x ; c , ,wf)),
AIM,),
, p = 1,2, . . . , N are N triangularly shaped fuzzy sets where B p ( x ; c pwIJ), as defined in Eq. (385). The { E k / } l J ' / ,p = 1 , 2 , .. . , N , q = 1 , 2 , . . . , M I , are fuzzy variables. This rule base consists of N IF-THEN rules and one ELSE rule. Similar to the implementation of Rule One, the IF part of the p-th IFTHEN rule can be implemented by the following DTFCNN:
Therefore, we need p layers of DTFCNN to implement p IF-parts. The ELSErule is implemented by the following DTFCNN:
Then the whole rule base is finished by a common output layer
(393)
41 1
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
In conclusion, to implement Rule Two, we need ( N a common output layer.
+ I)
layer DTFCNN and
3. Implementing Fuzzy Inference Shurpener We then show how a fuzzy IF-THEN-ELSE rule base for image processing can be embedded into a DTFCNN structure. Consider a basic fuzzy sharpener presented by Russo and Ramponi ( 1 994c). The rule shown in Fig. 4 I is applied to a 256-gray-level digital image. It should be noted that all inputs in the rules are gray-value differences between each pixel in the neighborhood system and the central pixel. This is the so-called “relative in the antecedents” approach (Russo and Ramponi, 1 9 9 4 ~ ) . The rulebase in Fig. 41 consists of two IF-THEN rules and one ELSE rule. We can express the rulebase by the following equivalent statement:
IF (E;-I,,;-~ - E;, is P ) AND (Ej-1.; - E;; is P ) AND (E;-I,,,.~-E;; is P ) AND (Ei,,-1 -E;,, is P ) AND ( E ; , K I- E ; , is P ) AND ( E + ~ ,-E;; ~ - ~ is P ) AND @ i + l . , i - ~ i , i is P ) AND (Ei+l,j+l - E,; is P ) THEN ( y i , ; is N ) , IF (E;-I,,-I -E;, is N ) AND (E,-l,,-E;; is N ) AND ( E ; - I , ~-E;j I is N ) AND (Ej,i-l - E i j is N ) AND ( E , , f i-Ei; ~ is N ) AND (E,+I.~-I - E i J is N ) AND ( E ; + I , ~ - E is ; ~N ) AND (E+l,,*~ -Eii is N ) THEN (yi,; is P ) , ELSE ( y i , is 2 )
To implement this fuzzy inference sharpener, letting E l , denote the grayvalue of pixel (i, j ) , then we have the following multilayer DTFCNN structure. The state equation of DTFCNN #I which is used to implement the 1st rule in Fig. 41 is given by
where p p ( . ) is the membership function of fuzzy set P as shown in Fig. 41. The state equation of DTFCNN #2, which is used to implement the 2nd rule in Fig. 41 is given by: Eij)
(395)
where P U N ( .is) the membership function of fuzzy set N as shown in Fig. 41.
412
TAO YANG
Definitions of fuzzy sets
CN 0 Wz cp gray-value difference
Rulebase for sharpener
The 2nd RULE
ELSE
ELSE
-El
RULE FIGURE 41. The rulebase lor fuzzy inference sharpener.
The state equation of DTFCNN #0, which is used to implement the ELSE rule in Fig. 41 is given by = rnin(1 - x x ,I , ( t - I ) , I -xf,(t x,,(t) 0
- 1)).
(396)
The foregoing three layers share a single output layer as
p=o
where c,, and w ~ , ,p = 0, 1,2, are centers and widths of triangularly shaped fuzzy variables Z , N , and P as shown in Fig. 41, respectively.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
4 13
Similarly to that in Russo and Ramponi (1994c), if we change the output layer of the foregoing multilayer DTFCNN into the following form: ?,lo)=
Yf(A$-
l)J;/(f-
I>,x,?,(t- I ) ) + 1
(398)
where I is a fixed bias for each cell, then we get a kind of fuzzy high-pass filter. The output of a DTFCNN-based fuzzy inference sharpener is shown in Fig. 42. Figure 42a shows the original image. Figure 42b shows the output of the DTFCNN-based fuzzy inference sharpener. The parameters are chosen as: y = 1, cz = 0, M?Z = 50, ~p = 255, wp = 300, C N = -255, and WN = 300. When we choose I = 128 and keep all other parameters unchanged, the output result of the DTFCNN-based fuzzy high-pass filter is shown in Fig. 42c. B. Enzhedding Locd Fuzzy Rellition Equntioizs FCNN is the only existing high-level CNN structure in CNN universe. Here, the phrase “high-level” means the ability to process conceptual variables, for example, linguistic variables. We present here a DTFCNN structure for embedding local fuzzy relation equations. Fuzzy relation equations were first recognized and studied by Sanchez ( 1976). Fuzzy relation equations play an important role in areas such as fuzzy system analysis, design of fuzzy controller, decision-making processes, and fuzzy pattern recognition. Fuzzy relation equations are associated with the concept of composition of binary fuzzy relations, which includes both set-relation composition and relationrelation composition. We only use max-min composition because it has been studied extensively and utilized in numerous applications. Embedding fuzzy relation equations into artificial neural networks (ANN) is not new-there are inany references (Blanco et al., 1995a,b; Hirota and Pedrycz, 1996; Nola et al., 1995; Pedrycz, 1991) on this topic. Thanks to these references, we can combine DTFCNN and fuzzy relational neuroconiputations very easily. 1 . Locul Fuzzy Relation Equation and I t s Irriplenzentatiorz
Let A i j be a fuzzy set in N , - ( i j ) and R , j ( N , . ( i j ) ,(4ij))be a binary fuzzy re, set ( & / } = (@lj, . . . , #:;}, then the set-relation lation in N , ( i j ) x ( 4 i j ]where
414
TAO YANG
(c)
FIGURE42. The sirnulation result of DTFCNN-based fuzzy inference sharpener and I’u~zy high-pas filter. (a) Original image. (b) The output of the DTFCNN-based fuzzy inference sharpener. ( c ) The output of the DTFCNN-based fuzzy high-pass filter.
composition of A;j and Rij, A;,oR;,, results in a fuzzy set in { y i l ) . Let us denote the resulting fuzzy set as B i j , then we have A I.J. o RI J. . - B I J ’
(399)
The preceding equation is a fuzzy relation equation. The membership function of Bij is given by:
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
4 I5
If we view R,, as a local fuzzy system, A,, as a local fuzzy input, and B,, as a fuzzy output, then Eq.(399) describes the characteristics of a fuzzy system via fuzzy input-output relation. From Eq. (400) we can see that an m-layer DTFCNN can implement the fuzzy relation equation in Eq. (399). The p-th layer DTFCNN is given by 1. Input equation
2. Cell dynamics
is the synaptic weight. 3. Output equation = x:(t .$’.(t) ‘J
-
I ).
2. An Exumple To show how fuzzy relation equations can be embedded into a multilayer DTFCNN we define the following fuzzy relation equation:
where U = { u ~1.42, , . . . , ux, u 9 ) is numbered according to Fig. 43 (lower part labeled “numbering order”); U can also be expressed by the following pattern:
and C is given by C = { c , ,cz}; R is given by
416
TAO YANG
255
I
S
2
6
9
1
3
8
4
numbering order Fi(iuRE
43. The fuzzy set R and the numbering order of cells in N l ( i j ) . CI
'0.5 0.5 0.5 0.5 1 1 1
I ,0.2
c2
0.75' 0.75 0.75 0.75 0.5 0.5 0.5 0.5 1 ,
The first-layer DTFCNN is used to implement the first column of R in Eq. (408), which is given by:
I . Input equation Uk/
= PB(Ek/ 1, Ckl E NI ( i j ) .
(409)
2. Cell dynamics
where B " ) ( i , j ; k , I ) is given by the following fuzzy set defined in Z 2 ( N l (ij)) grid: j-1 j j + l 0.5 1 { ~ ( " (ji;,k , I)]= i 1 0.2 (41 1 ) 0.5 1 0.5 which corresponds to the first column of K in Eq. (408).
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
4 I7
3. Output equation 0) (11 .y1.1. ’ ( t ) = ,rij ( t - I ) .
The second layer is used to implement the second column in Eq. (408), which is given by: 1. Input equation
2. Cell dynamics x,(;’(t) =
max
CIIENI ( I
/I
min[ukl, B(*’(i,j ; k , 1 ) ]
(414)
where B ( 2 1 ( ij,; k , I ) is given by the following fuzzy set defined in Z2(N1 ( i j ) ) grid j - I .j j + l 0.75 0.5 0.75 (415)
3 . Output equation (2 I
(21
y,, ( t ) = x , , ( t - 1 )
Figure 44 shows the simulation results. I n this simulation, we choose the fuzzy set B in Fig. 43 as x < 50
0, 1.50
1.
’
50 5 .r < 200 else.
(4 17)
Figure 44(a) shows the output of the first layer. The gray value of each pixel corresponds to a membership value. Figure 44(b) shows the output of the second layer. Also, the gray value of each pixel corresponds to a membership value. Observe that the forementioned two results are two kinds of image segmentations. In both simulations, wrap-up boundary conditions are used. Because fuzzy relation equations can be viewed as a description of fuzzy systems that have fuzzy input and fuzzy output, we find that DTFCNN functions as a parallel implementation of this kind of fuzzy system. The immediate applications of this kind of fuzzy relational DTFCNN structure are image processing and pattern recognition.
418
TAO YANG
FIGIJKE 44. Simulation results of DTFCNN-based local fuzzy relation equation. (a) The output of the first DTFCNN layer. (b) The output of the second DTFCNN layer.
3. Detecting Impulsive Noise As we have shown, the first step for removing impulse noise is to identify
its position. In this section we will show how a DTFCNN-based local fuzzy relation equation can perform this task. We choose the input equation as
and BIG is given by .Y - 100- r
5 -200
,
-200 < x 5 -100 -100 5 x < 100 .r - I00 100’ 100 _ ( x < 200 x 2 200. ,()()
(4 19)
The cell dynamics are given by
where B(i,,j;k , l ) is given by the following fuzzy set defined in ZTN1 ( i j ) )grid:
{B(i,j ; k , / ) l = i~ i
+1
(
j-1
0.5 0.5 0.5
j
j + l
0.5
0.5
0.5
)
(42 1 1
FUZZY CELL.ULAR NEURAL NETWORKS AND THEIR APPLICATIONS
(a)
4 I9
(b)
FIGLIRE 4.5. Sirnulalion resulis of DTFCNN-based local f u u y relation equation for impulse noise detection. (a) Image with impulse noise. (h) The output of the DTFCNN-based local fuzzy relation equation.
The simulation result is shown in Fig. 45. Figure 4% shows an image with impulse noise. Figure 4Sb shows the output of the DTFCNN. Observe that all impulse noise is found. 4. Fuzzy Orietztutioti Derivatives Fuzzy local relation systems can also function as high-pass filters. In this section we show DTFCNN-based fuzzy orientation derivatives. We choose the input equation as
and BIG is given by
(423)
The cell dynamics are given by
420
TAO YANG
For horizontal fuzzy orientation derivative B ( i , j ; k , 1 ) is given by the following fuzzy set defined in Z2(N1 ( i j ) ) grid j - I
j
;+I 0.4
(425 1 i+l
For diagonal fuzzy orientation derivative B ( i , j ; k , 1) is given by the following ~ grid fuzzy set defined in Z 2 ( N (ij))
( B ( i ,j ; k , I ) ] =
i i+1
(
j-1
0.4 0.4
j j + l 0.4 0.4 0.4 0.4
)
.
(426)
Simulation results are shown in Fig. 46. Figure 46a shows the result of horizontal fuzzy orientation derivative. Figure 46b shows the result of diagonal fuzzy orientation derivative.
IX. CONCLUSIONS AND FUTURE WORK
The invention of FCNN comes in part from AI. The idea of embedding fuzzy set theory into a CNN framework is partially motivated by the inner connection
(a)
(b)
FIGLIRE 46. Simulation results of DTFCNN-based fuzzy orientation derivatives. ( a ) Horizontal fuzzy orientation derivative. (b) Diagonal fuzzy orientation derivative.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
42 1
of mathematical morphology and fuzzy logic. Because the substantial basis of FCNN is FCA, which is a model for modeling and coping with complexity, the invention of FCNN can also be used to model the complexity from a high-level local activity. Another motivative comes from the demand to use the huge body of knowledge that comes from human experts. Encountering the complexity of the outer and inner world, a human individual uses a systematic but general description to model and handle the behaviors required for survival. If we only model the complexity itself, it is not very helpful to survive it. What fuzzy theory contributes to sciences is to model the survival strategy of the human individual directly. In FCNN, we model the systematic behavior of handling complexities due to local activity but not complexities themselves.
A. The C N N Uriiverse
From the standpoint of AI, the existing CNN universe is something like that shown in Fig. 47. In this figure, we do not include CNNUM because it is not a CNN class but a platform of CNN. The CNNUM emphasizes implementation and integration of different CNN structures. Although more than 400 papers in this field have been written since 1988, CNN is far from mature because we have found only the slightest bit of the CNN universe. The implementation of CNN using different techniques deserves further investigation because the simple structure of CNN provides us with the possibility of implementing it. Applications of CNN to signal processing, in particular, image processing, also need further study because it seems that CNN is a very promising candidate for the next generation of parallel image processing engines. On the other hand, the CNN paradigm can be used to animate many biologic, chemical and physical processes where dynamics are governed by local coupling of simple units. However, from Fig. 47 one can see that most parts of the high-level CNN are unknown. In fact, the only high-level CNN we know so far is FCNN. We can image that the high-level CNN should include some paradigms that can be used to model the dynamics of human society when nonphysical factors such as emotions, feelings, and intuitions are used as local couplings along with physical factors that include food, money, dwellings, and work opportunities. Although today we can not imagine how to embed nonphysical things into the structure of CNN, we know human society has used them efficiently to organize itself for thousands of years. We believe that the future CNN model should be something like those we have predicted in Sect. 1I.A. We are always very careful to avoid giving the reader an impression that CNN can do everything. On the contrary, we restrict the range of CNN to a of problems the can be decomposed into local components. Because the
422
TAO YANG
//
continuousCNN
\
/
~
\
-
\\ \
/ CNN UNIVERSE
FIGURE47. The map of the C N N universe
top-down process, that is, decomposing a global problem into local components, sometimes is very difficult, bottom-up processes, that is, using relatively simple elements and local rules to generate some global behaviors, are aiso employed. So far, almost all CNN applications to signal processing and biological modeling employ the top-down method. And some applications of CNN to pattern formation and spatiotemporal process modeling employ the bottom-up method. However, when the bottom-up method is used, the emergent behavior of CNN may be very difficult to interpret. In general, it is not the problem of CNN itself, but the elementary problem of emergent computation. We should come back to FCNN. Type-I1 FCNN is most studied and wellunderstood. In particular, we have presented an entire set of methods to exploit the world of some type-I1 FCNN that are used as computational CNN. However, when fuzzy numbers flow though type-I1 FCNN structures, the problem becomes very complicated. This makes FCNN totally different from
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
423
conventional CNN. Research in this field has just begun. Type-I and -111 FCNN provide both more possibility and flexibility in the modeling of local coupling processes. Sometimes, type-I FCNN is more complicated than type-I1 FCNN because its fuzzy structure may introduce more complexity. On the other hand, from the examples in Sect. 1V.D one can see that the potential application of type-I FCNN is beyond the range of linear signal processing. It is most possible to provide new methods to be used for nonlinear signal processing. So far, we know of very few type-I11 FCNN because they are too complicated to analyze. The results in Yang et 01. (19981) show that type-Ill FCNN can be used to model very complex systems where linguistic flows are used as state variables. Most of the world of type-I11 and even type-IV FCNN still awaits exploration. Although we emphasize FCNN applications to image processing, FCNN can also be used to model complex processes such as spatiotemporal chaos. As an example, we present a type-11 DTFCNN structure to implement fuzzy spatial Dilemmas. The fuzzy spatial Dilemma is a new concept we have generated from conventional spatial Dilemma (Nowak and May, 1993). B. Implenzenting Fuzzy Spatial Dilemmas Using Type-I1 DTFCNN The conventional spatial Dilemma (Nowak and May, 1993) is defined by a game played between two types of players: the defector (denoted by D) and the cooperator (denoted by C). The interaction between a cooperator and a defector is described by the following payoff matrix:
C
D
;(; :)
(427)
In this matrix, we show only the payoff of a player. If two cooperators interact, both receive 1 point. If a defector meet a cooperator, the defector receives 2 points and the cooperator receives 0 point. If two defectors interact both receive 0 point. The fuzzy generalization of the spatial Dilemmas is along two directions. The first one fuzzifies the payoff. As uncertainties exist in the payoff, we should describe the payoff by f u z i y variables such as “high” or “low.” The second one fuzzifies the property of a player. Because a player can be a very complex system such as an animal or even a human individual, we can not absolutely define which one is a defector or a cooperator. A better way to describe the property of a player is to assign a degree of being a defector (or being a cooperator). By using the fuzzy property of a player, we can say a player is low defection, middle defection or high defection.
424
TAO YANG
Here, we use two kinds of fuzzy descriptions to describe uncertainties in both the payoff and property of a player. We use membership function p ~ ( . ) to denote the degree of the player x being a defector. We use a fuzzy set P to denote the payoff. We use triangularly shaped fuzzy sets. We denote a triangularly shaped fuzzy set by A ( x ; c, w), which is given by:
The mathematical operations between two triangular-shaped fuzzy sets A ( x ; c,, w,) and B(x; C h , wh) are defined by (Bojadziev and Bojadziev, 1995)
where ci > 0 is a scalar. In this case, when player x plays with player y the payoff map T ( x , y) for player x is a 2D fuzzy set T : !)I x 91 H [0, I], which is defined by
We use p T ( x , y) to measure the degree of player x obtaining a “high” payoff. The payoff is also described by a fuzzy set. There are different fuzzy functions to combine the foregoing two kinds of fuzzinesses; here we use the following fuzzy function to denote the payoff P ( x , y) for x when it plays with y :
where S(x; cJ, w,) is the standard fuzzy set that corresponds to the payoff for a defector when it meets a cooperator. We then use a DTFCNN to model fuzzy spatial Dilemmas. The dynamics of this DTFCNN are given by: 1 . Fuzzy state equation
Notice that the state variable is a fuzzy set (usually a fuzzy number) instead of a signal (real number). This is a type-I1 FCNN. 2. Output equation Y,, ( t ) = f d
(X1/ ( t - 1 )>
(434)
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
where 1976)
,fd(.)
425
is a defuzzified function. Here, we choose f l l ( . ) as (Sanchez,
1: 1:
YPUiol) dY
fnG) =
(435 1 Pi(Y)dY
I f i is a triangularly shaped fuzzy set, then we have . f n ( i ( x ; c, WJ))= c
+ 3w I
(436)
At each iteration t , a center cell C;,i is replaced by a cell C ~ EI N , . ( i j ) whose output is the maximum in N , . ( i j ) . In the following simulations, a wrap-up boundary condition is used. Grayscale images with 256 gray levels are used to denote spatial patterns. The degree of whiteness denotes the membership value of a cell being a cooperator. Figure 48 shows cases when S(x; c,,, w,)= S(x;5 , 4 ) . Figure 48a shows the initial condition, which is of size 63 x 63 cells. Figure 48b shows the snapshot of the 20th iteration. Figure 48c shows the snapshot of the 150th iteration.
(d)
(CJ
( fJ
FIGURE48. Evolving process of fii7z.y spatial dilernmas with S ( x ; c ~ , w , ) = S ( x5.4). ; (a) Initial condition. (b) Output at / = 20. ( c ) Output at / = 150. ( d ) Output at / = 316. ( c ) Output at t = 317. ( f ) Output at t = 318.
426
TAO YANG
(c)
(d)
F l m m 49. Evolving process of fuzzy spatial dilemmas with S(.Y;c , , I V , ) = S(a; 10, 10). ( a ) Output at t = 455. (b) Output at / = 630. ( c ) Outptlt :It t = 845. (d) Output at f = 1000.
Finally, the pattern goes to a periodic 3 solution as shown in Fig. 48d, e and f of the 3 16th, 3 17th and 3 18th iterations, respectively. Figure 49 shows cases with S(x;c,,, w , ~= ) S(x; 10, 10).The initial condition is the same as that in Fig. 48a. Figures 49a, b, c and d show snapshots of the 4SSth, 630th, 845th and 1000th iterations, respectively. The evolution becomes chaotic. Because the payoff of cooperation becomes higher, the number of cells of high degree of cooperation increases. A very interesting phenomenon is that a large cluster of defectors can not exist for a long time because the high payoff of cooperation will soon change some of them into cooperators. This can not be observed in Fig. 48 where the payoff of cooperation is relatively low. Observe that the for mentioned type-I1 DTFCNN functions as a kind of fuzzy cellular automata (FCA) (Adamatzky, 1994).Although conventional cell-
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
427
ular automata (CA) are widely used in simulations of many natural phenomena such as fluid dynamics, diffusion, reaction-diffusion systems, populations, epidemics, etc., there are very few applications of FCA. The lack of a proper platform for studying FCA is the main reason for the lack of FCA applications. The DTFCNN is a very promising platform for FCA as shown in this section. REFERENCES Adainatiky, A. I . ( 1994). Hierarchy 0 1 l i m y cellular automata. Fuzy Sets cirrtl Sy.s~r~ri,s, 62: 167- 174. AiLenberg, N. N. and Aizenbcrg, I . N. (1994). CNN-like networks based on multi-v;ilued and universal binary neurons: learning and application to image processing. In Pro(.. Third IEEE lntrrrrtrtiorirrl Workshop O I I Ci~llrrl~rr Nc,irrtil Nrtworks trrrrl Their Applictrtioirs ( C N N A -W), pp, 1.53- 158. Ailenberg. N. N., Ai~enberg,I . N.. and Krivoshccv. G. A. (1996). CNN based on universal binary neurons: learning algorithm with error-corection and application to impulsive-noise filtering on gray-scale images. I n Proc. Forruth IEEE / r ~ ~ t ~ r r i ~ r t i Work.shop ~~ritr/ o r i Cc~llrrkirNlwrril N e f b i w k s r r r r d Thrir Aplicrrtioris, S e i ~ i l / (Sprliri, ~. Jurir 24-26. pp. 309 -3 14. Alcfeld. G. and Hcrzberger, J. ( 1983). Irltroduc~riorito / r i / r r i ~ rCoinl~rrttitiori.c. / New York: Academic Press. Anguita. M., Pelayo, F. J . , Fernandey. F. J.. and Prieto, A. (199.5). A low-power analog iinplenientation of cellular neural networks. In Frorti N t r r w d to Art~$ciri/ Nerrrtrl Cor?rprrtcitiorr. lritrnitrtioiirrl Workslzop on Arfifificitrl Ncro.trl Nctwwks. Proc.retlirig.s, pp. 736 - 743. Anguita. M.. Pelayo. F. J., Prieto, A,. and Ortega. J. ( 1993). Analog CMOS implenientation of a discrete time CNN with prograininable cloning templates. I Trriris. Circwit.\ crrrd S\.s/rin.> 11: Antilog iuid Di,qi/trl Si~qii~l Proce.s.siri,g.40(3):215-2 18. Anguita. M.. Pelayo, F. J.. Ros, E., Paloinar, D., and Prieto, A. (1996). VLSl irnpleinentationa of CNNa for image processing and vision tasks: single and multiple chip approach. In Proc. Fourtli IEEE ~iit~~riiirtioritil Workshop on Cellirliir Ntwrcrl Neniwrks m d Their Applic~itioiis, Srvillr, Sptrin, Jirnr 24-26, pp. 479-484. Anguita, M., Prieto. A., Pelayo, F. J.. Onega, J., CI t i / . (1991). CMOS implementation o f a cellular neural network with dynamically alterable cloning templates. I n Arr(fic,itrl Neurrrl Nrrworh. Ir~frriitrtioiitrl Work.sliop I W A N N 9 1 Procet4irig.s. pp. 260- 267. Arakawa, K. (1996). Median filter based on fuzry rules and its application t o image restoration. Fuzzy St,rs Nlltl Sy.stcII1.s. 7 7 3 - 13. Araujo. C. P.S. and Ritter. G. (1992). Morphological neural networks and iinagc algebra i n ar~ ~ i ~ dPruccssirrg //I. Proc. /he tificial perception systems. /rntr,qr Alg~brtr( i t i d M 0 ~ / J h I J / Olrirngr SPIE - Tlir Interr~otioi~ctl Sociery,fi)r Opricd Eri,qirrec~riiig.Snir Dirjio. C'A. USA. 1769(20- 2 2 ) : 128- 142. una, L., and Manpanaro, G . ( l995a). Chua's circuit can be generated ( I I I S . C'ircwits rrriil Sv.st~wi.~ 1: Fuiidriiiirntd Tliror? crrrtl App[it~trtiofi,s, 42(2):123-125. Ai-ena, P., Baglio. S.. Fortuna. L.. and Manganaro, G. (l995b). Hyperchaos from cellular neural networks. Elrc/rorlic.s Lrttrr,s. 31(4):2.50- 25 1 Arena, P., Baglio. S., Fortuna, L., and Manganaro, G. (1996). Generation of n-double scrolls via cellular neural networks. Iirtrrrrrrtioiirrl Jorrr. Circuit Tlieor:v arid Applictztioris, 24(3):24I -252. Artificial Lile Workshop. ( 1994). Ar/$c.itrl lip 111: Proc. Work.s/rop o i i Artjficial Ljfi.. Reading, MA.: Addison-Wesley.
428
TAO YANG
Back. T. and Kursawe, F. ( 1995). Evolirtionary Algorithrns fi)r FKZV Logic; A Briqf' O w r ~ ~ i r w . pp. 21 -28, River Edge. NJ: World Scientific. Baktir. 1. A. and Tan, M . A. (1993). Analog CMOS implementation of cellular neural networks. .~. m d Sysferns I/: Analog rnrd Digitul Signti1 Processinq. 40(3):200-206. IEEE T ~ N I ICircirifs Baktir, I. A. and Tan, M. A . (I99 1 ). Analog CMOS implementation of cellular neural networks. In Coml)ir/ercincl Injorni~itionSciences VI. Pro(.. I Y Y I In/rrncrtiontrl Syn?po.siun/,Oc~/.30- No\,. 2. pp. 825-834. Balsi. M. ( 1994). Hardware supervised learning for cellular and Hoplield neural networks. In World Congress on Neural Nencwk-Strn Diego. I YO4 Intemrrtionril Neirrnl Network Societ~, Annrrtrl Meeting, pp. III/45 I -4.56. F. (1994). Optoelectronic cellular neural Balsi. M . , Ciancaglioni. I.. Cimagalli. V.. and Gallu networks based on amorphous silicon thin f i l i i i technology. In Proc. Third lEEE Intrrnrr/iontrl Workslrop on Cellultir NeiirtiI Netnwrks trntl Tlirir Applicutions (CNNA-94). Dec. 18-21. pp. 399 - 403. Bang. S. H., Sheu, B. J.. and Wu. T. H.-Y. (1994). Paralleled hardware annealing of cellular neural nctworks for optimal solutions. In 1094 IEEE In/erncr/ionol Conferencr on Nrirrtrl Neti3.ork.s. IEEE World Congress oil C~~iirliiif~rfi~~~icil lnielli,~rnce,June 27- July 2, 4:2046-205 I . Baturone, I., Sanchez-Solano, S., Barriga. A., and Huertas, J. L. (1997). Implementation of CMOS fuzzy controllers as mixed-signal integrated circuits. IEEE Titnu. F r c y Systrni.s. 5( I ) : I - 19. Beccherelli, R.. de Cesare, G.. and Palma, F. (1994). Towards an hydrogenated amorphous silicon photo-transistor cellular neural network. In Proc. Third IEEE Intrrntrtiontil Work.slrop o i r Cdloltrr Neurcrl Ne/works trntl Their AlJp/ictrIion.s (CNNA-Y4), Dec. / 8 - 2 1 , pp. 351 - 362. Berger. T. W., Sheu, B. J . , and Tsai, R. H.-J. (1994). Analog VLSl implementation o f a now linear systems model of the hippocampal brain region. In Proc. Third IEEE Iirternt~tioirtrl Workshop on Cellirlrir Neiirtrl network.^ rnrtl Tlieir Applic,trtions (CNNA-94). Dec. 18- 2 I . p p . 47-51. Shi, B. E. (1994). Order statistic filtering with cellular neural networks. In Proc. Tliird IEEE Int. Workshop on Celli~lorNtwrtrl Ntmvork.s rnrtl 71ieir Applicritions. (CNNA-Y4). Dec. 18-21. pp. 441 -444. Betta. G. F. D., Grafti. S., Kovacs, Z. M., and Masetti, G . (19'93). CMOS implementation 0 1 an analogically prograrnmahle cellular neural network. IEEE Trcnn. Circnif,\ (rnd .Sy.s/cni.\ I / : Antr/og crntl Di,qittrl Signirl Proces.sing, 40(3):206-2 I S . Bey, Ir.. P.P., Yonce, D. J . , and Newcomb, R. W. ( 1993). Investigation of contrast enhancement by numerical methods for an optical cellular iictiI.;d network. I n Proc. -?tit//Mitlwr.st Synrlwxiinn on Circni/.s trntl S w t w w , pp. 582-583. Blanco. A,. Delgado, M . , and Requena. I . (1995a). Idenrificalion of fuzzy relational equations hy fuzzy neural networks. F'IK:J, Sc,t.s tnrd .Sy.stcnu, 71:2 IS -226. Blanco. A,, Delgado, M . , and Requena, 1. ( 199Sb). Improved fuzzy neural networks f o r solving relational cquations. F K : ~Sets untl Systmr.~,71:3 1 1 -322. Bojadziev, G . and Bo,jadiiev. M. (1995). F u x ~S ~ r s ,F n x y I,ogic,, App1icrrtioir.s. Singapore: World Scientific. Braspenning. P. J., Thui-jsman, F., and Wei.jters, A. J. M. M . (Eds.). (1995). Artjfic.ial Ne,irrtrl Nem>orks:Aii I i i t r o h c f i o i i to ANN Tlieorv orit/ Prtrctic,e. New York: Springer. Brucoli, M . , Carnimeo. L., and Grassi, G. ( 1995a). Discrete-time cellulai- ncural networks for associativc memories: a new design method via iterative learning and forgetting nlgorithnis. In 3Htlr Mithvst Synil~o.viinnoil Circuits crntl Sy,venr.s. Proceedings, pp. 542 - 545. Brucoli. M . , Carnimeo. L., and Grassi, G . ( 199%). Discrete-time cellular neural networks for nssociarive memories with learning and forgetting capabilities. IEEE Trtim. Cirt.rti/.s turd .Sy.sfew.\ I: Fnndtm~enrolTheor? rind Applicnrioiis, 42(7):396- 399.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
429
Brucoli, M., Carnimeo, L.. and Grassi, C. (1996). A global approach to the design of discretetime cellular neural networks for associative memories. Interrrcrtionol Jour. Circirir Theon) r i n d application.^, 24(4):489-5 10. Buckley, J. J. and Hayashi, Y. (1996). F ~ z Neural y Networks, Chapter 1 I , pp. 233-249, New York: McGraw-Hall. Cardarilli, G. C., Lojacono, R., Salerno, M., and Sargeni. F. (1993). VLSl implementation of a cellular neural network with programmable control operator. In Proc. 36th Midwest SyinI)o.siirrn on Circuits t i n d Svsterns, pp. 1089- 1092. Cardarilli, G. C.. Lojacono, R., Salerno, M., and Sargeni, F. (1992). A VLSl implementation of programinable cellular neural networks. In Artijicid Neural Networks. 2. Proc. I992 Interntitionril Conf2rence (ICANN-YZ), Sept. 1-7, 2: pp. 1491 1494. Cardarilli, G. C. and Sargeni. F. (1995). Very efficient VLSI implementation of CNN with discrete templates. E1ectronic.s Lcrters, 2Y( 14):1286- 1287. Chcllappa, R.,Wilson, C. L.. and Sirohey, S. ( 1995). Human and machine recognition of faces: a survey. Proc. IEEE, 83:705-740. Chua, L. 0. (1998). CNN: A V i s i o n of' Coinplexir)~.Singapore/River Edge, NJ: World Scientific. Chua. L. 0. (1992). CNN. 11. applications and VLSl circuit realizations. In Proc. .?5r/i Midwe.st S~~tnpo.siirm on Circuirs orid S?/.stenrs.AUK. 9 - 12, 1: 146- 149. Chua, L. 0. and Goras, L. ( 1995). Turing patterns i n cellular neural networks. Intrriirr~ionnl Jour. E1rctronic.s. 79(6):7 19-736. Chua. L. O., Hasler, M., Moschytz. G. S., and Neirynck, J. (1995). Autonomous cellular neural networks: a unilied paradigm for pattern formation and active wave propagation. IEEE T,aris. Circ,ui/s rind S.y.stetn.s I: Fundcit~ientrr/Tliroiy m d Applic~titions.42( 10):559- 577. Chua, L. 0. and Roska, T. (1993). The CNN paradigm. I tins. Circuits arid Systems I: Funrlt~mentd T h e o ~ ynntl App/iccrtioii.\, 40(3):I47 156. Chua, L. O., Roska, T., Kozek, T., and Zarandy, A. (1996a). CNN universal chips crank up the computing power. IEEE Circuits and Devices Magazine, 12(4):I 8 -28. Chua, L. 0. and Yang, L. ( I 98Xa). Cellular neural networks: Applications. IEEE Trrrns. C'irr.rtit.s nntl Systems, 35( 10):1273- 1290. Chua, L. 0. and Yang, L. (1988b). Cellular neural networks: Theory. IEEE Trtms. Circuits trnd Systein.~,35( 10):1257- 1272. Chua, L. O., Yang, T., Zhong. G. Q..and Wu. C. W. (1996b). Adaptive synchronization of Chua's oscillators. Int~~rnationirl Joirr. B$irctitiori trnd Clmos, 6( 1 ): 189- 201 . Chua. L. 0..Yang, T., Zhong, G. Q., and Wu, C. W. ( 1 9 9 6 ~ ) Synchronization . of Chua's circuits with time-varying channels and parameters. IEEE Ti-tni,~.Circuits tznd Systetns -1: F~rntkirnrnto/ Tlieoy crnd Applicurions, 43(10):862 868. Civalleri, P. P. and Gilli, M. (1992). Some atability properties of CNN's with delay. In CNNA '92 Proceedings. S~contiInrernatiotial Workshop on Celliilar Neurril Nr%vorks and Their App/icutions, pp. 94-99. Civalleri, P. P. and Gilli, M. (1994). Some dynamic phenomena in delayed cellular neural networks. Interntrtiontil Jour. Circuit T/ieory rind Applicntions, 22(2):77- 105. Civalleri, P. P., Gilli, M.. and Pandolf. L. (1993). On stability of cellular neural networks with delay. IEEE Truns. Ciriuit.s irrrd Systtwu I: F ~ r n t l t ~ n ~ ~ Tlleo~y w ~ i i / rind Applictitions, 40(3):157 165. Clarke, R. J. ( 1995). Digitrrl Conil~re.s.sior~ of Still Itntige wit1 Video. New York: Academic Press. Coli, M., Palazzari, P.. and Rughi. R. (1995). Design of dynamic evolution of discrete-time ow Articontinuous-output cellular neural networks. I n ICANN '95. Internritional COI!~'P~C.IICP ,ficirrl Neurul Networks. Neuroniines '95 Scicntifc Coqfr,riwce, Oct. 9- 13, 2:419-424. thods for image processing and pattern formation in Crounse, K. R . and Chua, L. 0. (199 cellular neural networks: a tutorial. Trrins. Circuits urid Systerns I: Funn'ctmentul Theory ~ r Applicrrtio~~.~, ~ d 42( 10):583-60 I. -
-
-
-
430
TAO YANG
Crounse. K . R. and Chua, L. 0. (1996). The CNN universal machine is as universal as a Turning machine. IEEE Truris. Circuits uritl Systems I: Fundamental T l t e o n ~wid A ~ ~ p l i c n t i o i ~ . ~ , 43(3):353- 355. Crounse, K. R., Roska. T., and Chua, L. 0. (1993). Image halftoning with cellular neural networks. IEEE Truns. Circuitv und Systems 11: Antilog find Digital Signcrl Processing, 40(4): 267 -283. Cruz, J . M. and Chua, L. 0. (1991). A CNN chip for connected component detection. IEEE Truus. Circuits rind Systems, 38(7):812-817. Cruz. J . M. and Chua, L. 0. (1995). Application o f cellular neural networks to model population dynamics. IEEE Truns. Circuits und Sy.s/eni.s I: F u ~ ~ d u m e n t 711eop r~l und App/ication.s, 42( 10):715-720. Cruz, J . M.. Chua. L. O., and Roska, T. (1994). A fast, complex and efficient test implementation of the CNN universal machine. In Proc. Third Intemntionul Workshop 011 Cellulur N c w d Nr~tworkscrud Their App1icution.s (CNNA-94). Dec. 18-21, pp. 61 -66. Csapodi, M. and Roska, T. (1996). Dynamic analogic CNN algorithms for a complex recognition task -a first step towards a bionic eyeglass. Internutional Jour. Circuit T h e o p and Applicutiori.y, 24( I ): 127 - 144. Dalla Betta, G. F., Graffi, S., Masetti, G . , and Kovacs, Z. M. (1992). Design of a CMOS analog programmahle cellular neural network. In CNNA '92 Proc.eedings. Set.oild lriterizatinnul W(~rk.sIiopon Cellulur Neurrrl Networks n r i d Their Applicutions, Oct. 14- 16, pp. 15 1 156. Davidson. J . L. ( 1992). Simulated annealing and morphology neural networks. Inluge Algebrrr turd Mor[~ho/ogica/Iiriage Proce.ssirig Ill, P roc. SPIE - The Internutioizcil Society f;w Opficul E~?gblec,ri~lg. SUII Diego, CA, USA, 1769(20-22): I 19- 127. Davidaon, J . L. and Hummer, F. (1993). Morphology neural networks: an introduction with applications. Circuits. Systems, a r i d Signul Proceu.sin~, 12(2):177- 2 10. Davidson, J . L. and Ritter, G . X. (1990). A theory of morphological neural networks. Digitcrl Opticul C o i n p h g 11. Proc. SPIE - The Interiiutionnl Society ,fi)r Opticul Engineerirzg. 1215(17- 19):378-388. Davis, L. ( 1991). Hundhook of' Genetic, A1gorithm.s. New York: Van Nostrand Reinhold. Destri, G. and Marenzoni, P. (1996). Cellular neural networks as a general massively parallel computational paradigm. lriternutioritrl Jour. Circuit Theoty mid Applit~utions,24( 3):397 407. Diederich. J . (Ed.). (1990). Arrijiciul Neurul network^\: Concept Leciruing. Los Alamitos. CA: IEEE Computer Society Press. Doan, M.- D., Glesner, M., Chakrabaty, R., Heidenreich, M., and others. (1994). Realisation of a digital cellular neural network for iinage processing. In Proc. Tlzird IEEE Itlrernrrriond Workshop on Cellulur Neurul Nehvorks tint! Tlzeir Applicutions (CNNA-94). Dec. 18- 2 I . pp. 85-90. Doniinguez-Castro, R., Espejo, S., Rodriguez-Vazqucz, A., and Carmona, R. (1994b). A CNN universal chip in CMOS technology. In Proc. Third IEEE Interi~arioncrlWorkshop 011 Cellulur Ntwral Nefn.orks and Their App1i'licwtion.s(CNNA-Y4). Dee. 18-21, pp. 91 -96. Dominguez-Castro, R.. Espejo, S., Rodriguez-Vazquez, A,, Garcia-Vargas, 1.. and others. ( 1994). Sirena: a simulation environment for CNNS. In Proc. Third IEEE lriterriutiori~il Workshop on Cellular NeuruI Nehvorks crnd Their App1ication.s (CNNA-94). Dec. 18-21, pp. 4 17 422. Drivel-, R. D. ( 1 977). Ordintip arid Delny Djflertwtial Equation.~.Berlin: Springer-Verlag. Duhois, D. and Prade, H. (1980). Fuzzy Sets and Systems: Tlleory arid Ap~~/iccltioil.s. New York: Academic Press. -
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
43 1
Espejo, S., Carmona. R., Dominguez-Castro, R., and Rodriguez-Varquez, A,, (199621). A CNN iniiversal chip in CMOS technology. lnterntirional Joiir. Circuit Theory crnd Applicrrtions. 24( l):93- 109. Espejo, S., Cannona, R., Dominguez-Castro, R.. and Rodriguez-Vazquez, A. ( I996h). A VLSIoriented continuous-time CNN model. Internntionul Jour. Circuit Theon rind App/icntions, 24(3):341-356. Espejo. S., Dorninguez-Castro, R., Carmona, R., and Rodriguez-VaLquez, A. ( 1 9 9 4 ) . A continuous-time cellular neural network chip for direction-selectable connected component detection with optical image acquisition. In Proc. Fourth lriternutiorial Cor!fererice on Mic.ro-rlei.troriic.s j b r Neurd Networks and Fiiz,-?' Systrws, Sepi. 2 6 - 2 8 , pp. 383-391. Espejo, S.. Dorninguez-Castro, R.. Carmona, R., and Rodriguez-Vazquez, A. ( 1994d). Cellular neural network chips with optical image acquisition. In 1994 IEEE Itrtrrncitiorrtrl Cotfi,rence on Neurcil Networks. IEEE World Congrexs on Conipiitationcil Intelligenc~e.June 27-Jrily 2. pp, I X77- 1882. Espejo, S., Dominguez-Castro, R., Rodriguez-Vazquez. A., and Carmona, R. ( I994e). Weightcontrol strategy for programmable CNN chips. In Proc. Third IEEE Irilerntitiotiul Work.shop on Cellrrlnr Neurul Netkrorks cind 7%eir App1iccition.s (CNNA-94). Dec. 18-21, pp. 405-410. Espejo, S., RodrigueL-Vazquez, A.. Dominguez-Castro, R., Huertas, J . L., and othcrs. ( 1994). Sinartpixel cellular neural networks in analog currcnt-mode CMOS technology. IEEE Jour. Solid-Stute Circuits, 29(8):895 -905. Espejo, S., Rodriguez-Vazquez, A., and Huertas. J. L. (1992). Design and testing issties in currentmode cellular neural networks. In CNNA '92 Proc~wdin,gs,Sei.ontl lntc~rnc~tiond Work.shop on Ci4lulur Neurcil Ncrworks atid Their Applictitions, Ocr. 14- 16, pp. 169- 174. Espejo, S., Rodriguez Vazquez, A., DominguezCastro, R., Linares, B.. and others. ( 1993). A model for VLSl implementation of CNN image processing chips using current-mode techniques. In (Proceedings)I 993 IEEE Inteniutioritil Syniposiuni 011 Circuit.s trnd Swterrrs, 2:970973. Farmer. D., Tofloli, T., and Wolfram. S. (Ed\). ( 1984). Cdlirlnr Automtrtti: Procwding.7 of un Ititrrdi.\ci~~liirtrr~ Work.r/rop.New York: North-Holland Physics Pub. Faure, B. and Mazare, G. (1990). A VLSl cellular array for the processing of back-propagation neural networks. In A1gorithnr.s rind P(irrrlle1 VLSl Architectures. Lectures ( i d 7irtorial.s Pre.senrod nt the Iriterrrcttionrzl Workshop, Jurw 10- 16, pp. I93 -202. Finocchiaro. M. and Perfetti. R. ( 1995). Relation between template spectrum and stability of cellular neural networks with delay. Elrctrorric,.s Lrtters, 31(23):2O24- 2026. Forrest, S. (Ed.). (199 I ) . Enrergerit Cornputotion. Cambridge, MA: MIT Preas. Fruehauf, N., Chua, L. 0..and Lueder. E. (1992). Convergence of reciprocal time-diacrete cellular neural networks with continuous nonlinearities. In CNNA '92 Proceetlirig.\. Second Interncrtiontil Workshop on Cellirltrr N e r d Nerkiwks trnd Their App1icnrioii.s. Oct. 14- I6 pp. 106- 1 1 I. Fruehauf, N., Lueder, E., and Bader, G . (1993). Fourier optical realization of cellular neural networks. IEEE Truns. Circuits and Sy.itm.s I/: Antilog cind Digittil Signal P roc.r.s.\ing:,40(3):156162. Fruehauf, N. and Lueder, E. (1990). Realization o f CNNs by optical parallel processing with spatial light valves. In 1990 IEEE Irireniritioncil Workshop on Cdlulur Ncurrrl Networks und Their Applicririons, CNNA-YO. Lkc. 16- 19. pp.281 -290. Furukawa, M. and Yamakawa, T. (1995). The design algorithms of membership functions for a fuzzy neuron. Fuzzy S P I S and Systems. 71329 343. Galias, Z. ( 1992). Designing discrete-time cellular neural networks for the evaluation of local Boolean functions. In CNNA '92 Proceerlirij+v. Second lnterndiond Workshop on C'ellirltir Neurul Networks and Their Applications, Oct. 14- 16, pp.23-28. -
432
TAO YANG
Galias, Z.(1993). Designing cellular neural networks for the evaluation of local Boolean functions. /EEE Trttns. Civczrits ( r i d Systeins /I: Antilog c i n d Digitti1 SiRntil Proces.sing. 40(4):267283. Galias, Z. and Nossek, J . A. (1994). Control of a real chaotic cellular neural network. In Proc,. Third IEEE Internntionnl Workshop on Cellirlrrr Nrwrtil Nemork.r cind Their Ap[J/ictitions (CNNA-94), p. 345. Gilli. M. ( 1993). Strange attractors in delayed cellular neural networks. 1EEE Tr(JIf.s. Circ~iri/.s (ilJt/ .~i'.stfwI.\1.' ~ U l ~ d ~ l t l T~/ l P~Ol n~ f fi l /l d A/J[)/iC~ltiOll.S, 40( 1 1 ):849- 853. Gilli. M. (1994). Stability of cellular neural networks and delayed cellular neural networks with nonpositive templates and nonmonotonic output functions. IEEE Trtins. Circuits itnd Sys/cin.s I: F~rritkinienttr1 T / z e o n nnd Applictitions, 41(8):5 I X-528. Gilli. M. ( 1995). A spectral approach for chaos prediction in delayed cellular neural networks. /n/ernu/ion~i/ Jorrr. B(furcnriorr and Clitios irr App/iet/ S'cience.s trnd E~~,yiiiet,riiig. 5(3):86Y - 875. Goldberg. D. E. ( 1989). Genetic Algorithtns in Srtrrt.h, Op/inrix//iotf nnd Mrrchinr Letiming. Reading: Addison-Wesley. D~nunzictr1Svs/cni.s. trnd N w r d Goles, E. and Martinez, S. (Eda.). ( 1994). Cellirliir Aritonttrt~~, Nvtworks. Boston: Kluwer Academic Puhlihhers. Goutsias, J . and Schonfeld, D. ( 1989). Image coding via morphological traiisformation: a general theory. In Pro(,. IEEE Cnnf: Cortipiter Vision crnd Pottern Reco,ynition, June Gunsel. B. and Curelis, C. ( 1995). Supervised learning of smoothing parameters in image restoration by regularization under cellular neural networks framework. In Proceedings. / n / c v n d w i r i / Conft,rence on hrirgr Proce.wing. pp. 470-473. Gutowitz. H. (Ed.). ( 199 I ). Cr/lii/cir Airtonrafti: Tlieorv tint1 Ekp~~rirnenr. Cambridge, MA: MIT Press. Guzelis, C. ( 1992). Supcrvised learning of the steady-state outputs in generalized cellular nelworks In CNNA '92 Procrer1irig.r. .Sec.onrl Intern~rtioncr/Workshop on Cellultir Nerrr-(11Nrrrwrk., tnrd Their App/ications. pp. 74-79. Guzelis. C. ( 1993). Chaotic cellular neural networks made of Chua's circuits. Jour. Circuits, S ~ . s / e ~ tint1 n s Coinpiiters. 3(2):603-6 12. GuLelis, C. and Karamahmut, S. ( 1994). Recurrent perceptron learning algorithm for completely stable cellular neural networks. I n /'roc,. Third /EEE /nfc~rncrtionrr/Workshop on Ce1lulur Neural Nerrwrk.~and Their App1icntion.s (CNNA-Y4), pp. I77 - 182. Halonen. K., Porra, V., Roska, T., and Chua, L. ( 1 9 9 1 ~ ) Programmable . analog VLSI CNN chip with locnl digital logic. In 1991 IEEE /n/ernutiono/ Synposiinn o i i Circ.nits t n i d Systeins. Juiw 11-14, pp. 1291-1294. Halonen, K., Pona. V., Roska, T.. and Chua, L. (1990). VLSI implementation o f a reconfigurable cellular neural network containing local logic (CNNL). I n 1990 1EEE Interndomi/ Works/zop 0 1 1 Crllultir N c w d N ~ t w o r k scrnd Their App/icritiorr.s, CNNA-YO, Dec. 16- 19 pp. 206-2 I S . Halonen. K., Radvany, A,. and Roska, T. (1991a). The control strategy of a dual (progrnmmahle analoglogical) cellular neural network chip. I n Proc. Second Interntrtionul Corlfi,r-c.rrce on Micr-oc~Iec~troriies ,for Neirrnl Ne/\twks, p. 25 I . Halonen, K. and Vaananen, J . (1990). The non-idealities of the IC-realization and the stability o f CNN-networks. In 1YYO IEEE Intrrntrlionol Work.\/rop on Celliilrrr Nercrcrl Netrwrks tin(/ Tlwir A/Jp/~cnf~oif,s, CNNA-YO, Ilet.. 16- 19. pp. 226-234. I-ialonen, K.. Vaananen. J . , Porra, V., Roska, T., rt ( I / . (1991b). VLSI-impleinentatior1 of a programmable dual computing cellular neural network processor. In Artificitrl Neiri-ti/ N c . t w ~ ~ k s . Proc. 1991 1imrnntionci1 Conference, pp. 1581 1584. Hanebeck, U . D. and Schmidt, G. K. (1996). Genetic optimization of f u ; / ~ networks. y FK:Y Sets ( i n d Sv.strnr.s, 79:59-68. -
FUZZY CELLULAR NEURAL NETWORKS A N D THEIR APPLICATIONS
433
Hansen, L. K. (1992). Boltzmann learning of parameters in cellular neural networks. In C N N A '92 Procretlirigs. Secorrd / n t ~ ~ i - ~ ~ ~ i t Workshop iorur/ 011 Crllulur N r r t r d Network.\ trritl Their A / ~ p / i c ~ h i i i pp.62 , s , - 67. I-laralick, R. M., Sternberg. S. R., and Zhuang, X. (1987). Image analysis using mathematical morphology. IEEE Trtrrrs. Ptrttern Antrl. MacVr. / n t d / . , PAM1-9:.532- 550. Harrer, H. ( 1993). Multiple layer discrete-time cellular neural networks using time-vai-iant tenitrrid Digitd Sigricrl P rocr.ssirrg, 40( 3): I9 I plates. lEEE Trms. Circuir.s rrnd Systern.~/ I : A~itrlo~q 199. Harrer. H. and Nossek, J . A. (1992a). New test results of a 4 by 4 discrete-time cellular neural network chip. In CNNA '92 Proc~rr.cliiicq.s.Second Iriti~rircrtioritrlWorkshop o r i Ce//cr/(rrNrurrrl Networks rrrid Their App/ictrtiori.v. Oct. 14- 16 pp. I63 - 168. Harrer. H. and Nossek, J . A. ( I 992b). Discrete-time cellular neural networks. Irr~rrucrtiorrdJour. C'ircrtit Theory and App/ic~trtiori.s.20(5):453-467. I-larrer, H., Nossek, and R. Stclzl, ( 1092). An analog implementation of discrete-time cellular neural networks. lEEE Trcnis. Neirrcrl Networks. 3(3):466-476. Harrer, H., Nossek. J. A., and ZOLI,F. (1991a). A learning algorithm for discrete-time cellular neural networks. I n Proc.. IJCNN'91, Sirr,qqxire, pp.717-722. Harrer, H.. Nossek, J. A., and ZOLI,F. (1991b). A learning algorithm for time-discrcte cellular neural networks. In I99 I IEEE It~trrriritiorinlJoirit Cor?fireric.r or1 Nerrrrrl NctktwLs, pp.717-722. Harrer, H.. Venetianer, P. L., Nossek, J . A,, Roska. T., and others. (1994). Some examples of preprocessing analog images with discrete-tinie cellular neural networks. In Proc. Third IEEE / ~ r t i ~ r ~ i ~ ~ Workshop t i t i ~ i ~ i / on Cellulur Nrrrr(r/ Nct\iwrk.s cirid Their App/iecrti(im ( C N N A -94). Dec. 18-21 pp.201 -206. Hnssoun, M. H. ( 1995). Fundmutwfo/,s of ArtiJicid Nrirr~il N e h i w k s . Cambridge, MA: MIT Press. Hayashi, Y., Buckley, J . J . , and Czogala. E. (1993). Fuzzy neural network with f u i z y signals nnd weights. Irrt. .lour. /ntrl/igerit Syst., 8:527-537. He, C. and Ushida, A. (1994). Iterative middle mapping learning algorithm for cellular neural networks. /E/CE Trriris. Furirlrrtirrnro/.s o f Electrorrics. Cor~imuriic.ntionsarid Conipirrr Scirrrws. 1<77-A(4):706-7IS. He, C. and Ushida, A. (1996). A non-uniform discrete-time cellular neural network and its of E/(~clroriic.s.C~irnin~~~iiccctio~z.~ nncl Cotnpuler slability analysis. /E/CE Trtrrr.~.Fr~riclrrnrc~~r~tr/.(~ .kirric.e, E79-A(2):252- 257. Hcijnians, H. J . A. M. ( 1992). MorphologicuI ltturgr. O / x ~ r ~ r t o rNew s . York: Academic Press. Hirota, K. and Pedrycz. W. ( I 996). Solving fu7zy relational equations through logical tiltering. F u z y Sets and Systenis, 8 1 3 5 -363. Ho. C.- Y., Chu, T.- K., and Mori, S. (1993). Discrete-time cellular neural network for thinning: ;I compound synthesis. In ( Proccw/irig.s) I993 I t liitrr-rrotionr~lSymposiririr or1 Circuits t7l?d Systunrs, 4:2379-2382. Ho. C.- Y.. Yti, D.- H.. and Mori, S . ( 1993). Synthesis of discrete-time cellular neural networks for binary image processing. IElCE Trrrri.~.Furrtirrrtrentcr/.s of Elrctrorric.~.Coinrrrrrtric~rrtioir.sc u i d C~iriiptterS(,imces. E76-A(.5): 735 - 74 I. Ho. C.- Y. and Mori. S . ( 1994). An optimized synthesis of ii discrete-time cellular neural network t i ~ i ~Circuit ~ ~ r / Theory orid A p p / i ~ ~ t ~ t i ~ i22(5):363-375. ri,s. tor parallel thinning. / ~ ~ t r r ~ i c ~Jocir. H~i~iiig. Xiang-Zhu. Yang, Tao, and Yang, Lin-Bao. (1994). On stability of the time-variant delayed cellular neural networks. In f'rot.. Third IEEE Iriteriicrtiontrl Workshop or1 Cellrihr N ~ r 4 r dNrtwmkv n r i d their Ap/~/ic.rrtiori.s(CNNA-(W), pp. 18-2 I . Huhcl, D. H. (1988). Eye, Brtrin, nrid Visiorc. New York: Scientific American Library.
434
TAO YANG
Huertas, J. L.. Rodrigucz-Vasquez, A,, and Espejo, S. (1992). Analog VLSI implementation of cellular neural networks. I n CNNA '92 Procwriiug.s. Second Internrrtiorrtrl Workshop on Celliiltrr Neurd Networks trnd Their Appkrrtions, Oct. 14- 16, pp. 141 - 150. Huertas. J . L. and Rueda. A. (1990). Testability issues in analog cellular neural networks. In 1990 IEEE Internutiontrl Workshop on Cellular Neurul Neworks trncl Their Applicrrtions, CNNA-90. Dec. 16-19. pp. 172-176. Hung, K.- S.. Curtis, K. M., and Orton, J. W. ( 1994). A programmable multi-function optoelectronic cellular neural network. I n IEE Co/loyuiriin on 'Integrtzted Inicigirig Sensors nntl Procexsing, ' Drc. 5, pp. 3/1 - 6. Hung. K.- S., Curtis, K. M., and Orton, J . W. (1996). Optoelectronic irnplenientation of a multifunction cellular neural network. IEEE Trtrns. Circuits cind Swtern.s 11: Anrrlog t i n d Digittrl S i g i ~ r lP ~ O C C J . S 43( S ~ 8):60l I Z ~ , - 608. Ikegmii, M., and Tanaka, M. (1994a). linage coding and decoding by discrete time cellular neural networks and its error evaluation. Trtins. IEICE, A , J77-A(7):954-964. Ikegarni, M., and Tanaka, M. (1994b). Moving image coding and decoding by DTCNN with 3-D templates. In Proc. Tliirtl IEEE Iriter~iiationtilWork.thop on Ccllular N r u r d Networks trnd tlwir Appliccrtions (CNNA-94). Dec. IX-21, pp. 475-479. Ikcgaini. M.. and Tanaka, M. (1995). Image coding and decoding by discrete-time cellular neural networks. Electronics czrrtl Cotnmiitiiccrrinii.s in Jupon, Pcrrr 3 (Fundunieritul Electronic- Science), 78(3):32-43. Ishibuchi, H. (1993). Neural networks that learn from f i i z ~ yif-then rules. IEEE Tunis. F i r x ~ Sy.stern.s, 1( 2): 85 - 97. Jang, B. K., and Chin. R. T. (1989). A multiscaling approach based on morphological filtering. IEEE Tmns. Pnttern. A n d . Machine Intell.. 11(7):694-700 Jang, B. K.. and Chin, R. T. (1990). Analysis of thinning algorithms using mathematical inorphology. IEEE Trcrns. Pattern. A n d . Mtrchine Intell., 12(6):541-55 I . Jang. J. R.. and Sun, C. T. (1995). Neuro-fuzzy inodeling and control. Proc. IEEE, 83(3):378406. Jankowski, S.. Londei, A,, Lozowski, A,, and Mazur, C. (1996). Synchronization and control in a cellular neural network of chaotic units by local pinnings. Intrrnutioriul Jour. Ciriw'r Tlirorv c / n d App/icotions, 24(3):275-281. Jankowski. S., Londei, A,, Mazur, C., and Lozowski, A. (1994). Synchronization phenomena in 2D chaotic CNN. In Proc. Third IEEE Internarional Workshop on Cellultir Neurcrl Nenvoi-ks m d Their App1ic.cition.v (CNNA-94), pp. 339- 344. Jankowski, S., and Wanczuk, R. (1994). CNN models of complex pattern formation in excitable media. In Proc. Tliircl IEEE International Workshop on Cellular Neural Networks and Their App1icution.r (CNNA-Y4). pp. 333-338. Joseph. R. ( I 993). The Nuked neuron: Evolution trrzd rlw LungiuiRe.s of'tlze Bocly and Bruin. New York: Plenum. Kacprzak, T., and Slot, K . (1992). Multiple-input OTA based circuit for cellular neural aetworkimplementation in VLSI CMOS technology. In CNNA '92 Proc,eetiing.s. Sec-ond lntenintionul Wor-ksliop on Cellular Neural Networks and Their Applications, 0 c . t . 14- 16. pp. 157- 162. Kandcl, A. ( 1982). Fuzzy TLc.hniyue.s in Pcittern Rec.ognitiorz. New York: Wiley. Kaufmann, A,, and Gupta, M. M. (1995). I n r r o d ~ / i o nt o Fuzzy Arithmetic. New York: Van Nostrand Reinhold. Karayiannis. N. B. and Venetsanopoulos, A. N. (1993). Art~ficitrlNeurul N e m w k s ; Lrcirning Algorithms, Perfi,rinunc~rE~dircrtion.und Applications. Boston: Kluwer Academic. Kinget, P. and Steyaert. M. (1994a). A programmable analogue CMOS chip for high speed 1994 Cu.storn Integrcitcd image processing based on cellular neural networks. In Proc. I Circuits Conference, May 1-4, pp. 570- 573.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLlCATlONS
435
Kinget, P. and Steyaert, M. (l994e). Evaluation o f CNN template robustness towards VLSl implementation. In Proc. Third I Interncrtioncil Workshop on Cc~llultrrNrirrrrl Nerw(irk.s m d Their App/ic(ilions (CNNA-94). / ) c ~ . / 8 - 2 1 , pp. 381 -386. Kinget, P. and Steyaert, M. (l994d). Impact of system specilications on analogue CMOS iinpleinentations of continuously prograinmahie cellular neural networks. I n 1994 IEEE I i ~ t ~ ~ r n ~ ~ t i o ~ i ~ i l C'oiiJerencr on Neurul Nc~hvork.v.IEEE World Congress 011 Coni~~utrrtioncrl / n t ~ ~ l l i ~ ~J~i mn' c e , 27-Ju/,v 2, 3:1949- 19.54. . hardware strategies lor cellular neural netKinget, P. arid Steyaert. M. ( 1 9 9 4 ~ ) Input/output works. In 1994 IEEE Interiirrtioiitrl Confereiwe O I I Neirrtil Nethork.~.IEEE World Coii,gre.s.son ~ ~ ~ i I l t ( ~ / /~i p n (J~~e~I ,I ~ 2( ,7 - ~~1 r / y2, 3:~i 899 - ~I 902. ~ ~ t ~ ~ t i Kinget, P. and Steyaert. M. ( l994h). Analogue CMOS VLSl implementation ol cellular neural networks with continuously progranimable templates. In I994 IEEE lnteriintioncil Svnipo.sium (111 Circxits (UM/ Sy.stein.s, M t i ~ 3 0 - J ~ n i c2, 6367 - 370. Kinget. P. and Stcyacrt, M. S. J . ( 1995). A programmable analog cellular neural network CMOS chip lor high speed image processing. IEEE Joirr. Solid-Sttrte Circuits, 30(3):23.5-243. Kohn. 1.. Smith, B., Harrer, H . , and Nossek, J . A. (1992). A flexible PC-controlled analog/digital i e h t set for CNNs. I n CNNA '92 Procee(1ing.s. Secorirl lntc~riiorioncilWorkshop o i i Cellirlrrr Neurrrl Nefivorks ( i t i d Their A p l ~ l i ~ ( r t i ~0c.t. i i . ~ .14- 16, pp. 22X -233. Kosko. B. ( 1992). Neural Nerwrks rrnd F u z y Sy.stein.\ ; A />yn(unicd Systeins Approwh to Machine Intolligonco. Englewood Cifls, NJ: Prentice-Hall. Kowalski, J.. Slot, K., and Kacprzak, T. ( 1994). A CMOS current-mode VLSI implernentation of cellular neural network for an iinagc objects area estimation. I n Pro(.. Third IEEE Interiitrtioncrl Work.shop 0 1 1 Cellulrrr Ncwrul NetMwks tint1 Their A p / ~ l i c ~ t i ~(CNNA-Y4), ii.~ l k . 18-21, p. 351. Koiek. T. and Roska. T. (1996). A double time-scale CNN lor solving two-diinensional NavierStokes equations. Internntiontrl ./our. Circxit Theoiy rnirl Applications, 1(24):49- 55. Koiek. T., Roska, T., and Chua, 1., 0. (1993). Genetic algorithm for CNN template learning. IEEE Tr(in.s. Circuits and Sy.r/rnts I: Funtkrnienttil Throi? mid Apldiccrtions, 40(6):392-402. Kricg, K. R. and Chua. L. 0. (1990). Hardware and algorithms for the functional evaluation of cellular neural networks and analog arrays. I n 1990 IEEE /nterntrrioml W~irk.slzopon Cellulnr Neur(i/ Nehvorks and Their App/i(~(i/ioif,$, CNNA-YO, Dec. 16- 19, pp. 169- 17 I . Kulller, S. W.. Nicholls, J. G., and Martin, A. R. (1984). /-~r(~n7 Ncuroii to Br(ri/i: A Cdlultir Appro(ich to the Function of' rhr Nervous Sy.s/e~n,(Second Edition). Sunderland, MA: Sinauer Associates. Kiilkarni, A. D. ( 1994). Artificitrl N e u r d Networks fi,r Irnn,qr Understcmding. New York: Van Nostrand Reinhold. Lai. K. K. and Leong, P. H. W. (1996). Implementation of time-multiplexed CNN building block cell. In Proc. Fifth Intorncitiontil Conjermce oti Microelectronics fi)r Nrurnl Networks (ind I.'rr:iy Systenis. MicroNeuro'Y6, Feb. 12- 14. pp. 80-85. Lai, K. K. and Leong, P. H. W. (1995). An arca efficient implementation of a cellular neural network. In Proceedings. IYYS. Second NCW Zerilnritl Iritrrntrtiorrcil Two-Strerim Coif(,reni.e on Artijkitrl N r u r ~ Networks l rrnd E.xperf S.v.sf(,in.s.Nail. 20-2-?, pp. 5 1 -54. Lai. K. K.. Leong. P. H . W., and Jahri, M. A. ( 1995). Analogue CMOS VLSl implementation of cellular neural networks. In Proc. Si.rth Austrtrlicui Coiifewnce U I I Neurul NetMwrks (ACNN'YS) h'eh. 6 - 8 , pp. 17-20. Langton. C. G. (1995). Art$cicil LVe; An Overview. Cambridge, MA: MIT Press. Lec, C.- C. and de Gyvez, J. P. (1994a). Single-layer CNN simulator. In 1994 Swiposiuni 011 Circuirs rind Systems, Mtrv 30 - June 2, 6:217-220. Lcc. C. C. and Pineda de Gyvez, J . (1994b). Time-inultiplexing CNN simulator. I n 1994 /EEE Intrrnritior~cilLYvinpo,siiini017 Circuits nnd Systeni.~.,Mtiy 30-Jurie 2, 6:407 -4 10.
436
TAO YANG
Lee, J . S. J., Haralick, R. M.. and Shapiro, L. G. (1987). Morphological edge detection. IEEE ./our. Rohotics and Automar.. 3(2). Lee. S. C. and Lee, E. T. (1974). Fuzzy sets and neural networks. Jour. Cybeniet., 4:83-103. Liin, D. and Moschyt7. G. S. ( 1994). A programmable, modular CNN cell. In proc. Third IEEE Internutionul Work.shop OII Cellulur New-irl Networks unrl Their App/icfrtions (CNNA- Y4), Doc. I K - 2 / , pp. 79-84. Lomno. M., Herrera. F.. and Verdegay. J. L. ( 1995). Generuting Fuz:y Rules from E.xtnnplrs U.sing Genetic A1goritlnn.s. I n Yager, R.R. and Zadeh, L.A. (Eds.). Fuzzy Logic and Soft Computing. pp. 21 -28, River Edge, NJ: World Scientific. Magnussen, H. and Nossek. J. A. (1992). Towards a learning algorithm for discrete-time cellular neural networks. In CNNA '92 Proceedings. Second lnternu/iorial Workshop 0 1 1 Cellirlur Neitrul Netkiwks und Their Applicutiom~pp. 80- 85. Magnussen, H. and Nossek. J. A. (l994a). A geometric approach to properties of the discretetinie cellular neural network. IEEE Truns. Circuirs und ,Jy.s/enis I : Fundmrientrrl T/icor-y tnid Applicutions, 41( I0):625 -634. Magnussen, H. and Nossek, J. A. ( 1994b). Global learning algorithms for discrete-time cellular neural networks. In Proc. Third IEEE Internutionul Workshop on Crllulur Neitrirl Networks untl their App/icutiorrs (CNNA-94).pp. I65 - 170. Magnussen, H.. Papoutsis, G., and Nossek, J. A. ( 1994). Continuation-based learning algorithm lor discrete-time cellular neural networks. In Proc. Third IEEE hiternntionrrl Worksliop oii C'ellulor Nwrul Net!.twk.s rind Their Applicutions (CNNA-94). pp. 17 1 - 176. Mancuso, M.. Luca, R. D.. Poluzzi, R., and Rizzotto, G. G. (1996). A fuzzy decision directed filter for impulsive noise reduction. Fuzzy Sets mid Sy.s/enz.s, 77: I I I - I 16. Maragos, P. A. and Schafcr, R. W. ( 1986). Morphological skeleton representation and coding of binary images. / E E E Truns. Acoir.rt. S p r c h S i p i d Proc?s.sing, 34(5):1228- 1244. Marks 11, R. J., Oh, S., Arabashahi, P., Caudell, T. P., Choi, J. J., and Song, B. G. (1992). Steepest descent adaption of rnin-inax fuzzy if-then rules. In Proc.. IJCNN, 3: 47 1-477, Beijing, China. Marks 11. R. J. (Ed.). (1994). Firzzy Logic, Technology und Applic-irtions. Piscataway, NJ: IEEE Press. Memrd, M., Parisi, G., and Virasoro, M. A. (Eds.). (1987). Spin G/as.s Tlleorey nnd Beyond. Singapore: World Scientitic. Mizutani. H. (1994). A new learning method for multilayered cellular neural networks. In /'roc. Third IEEE Inrernutionul Worhshop on Cellulur Neural Networks rind Their App1icution.s (CNNA-94).pp. 195-200. Los Alamitos. Morgan. N. (Ed.). ( 1990). Art$cirrl Nri~rulNetworks :E/c,ctronic,Iin~~lrmrrrrcrfiow.s. CA : IEEE Computer Society Press. Nakagawa, Y . and Rosenfeld, A. (1978). A note on the usc of local min and niax operations in digital picture processing. IEEE Trans., Syst., MNII.Cyhern.. SMC-8:632-635. Nemes, L. and Roska, T. (1995). A CNN model of oscillation and chaos in ant colonies: a case und Sy.yterns I: Fnndumentul T h e o y and Applicrrtiori.s,42( I0):74 I study. IEEE Trtrns. Circ~/it.s 745. Nemes, L. Toth, G., Roska. T. and Radvanyi, A. (1996). Analogic CNN algorithms for 3D interpolation-approxiniation and object rotation using controlled switched templates. !ntrrnutionul Jour. of Circuit Theory und Applicutions, 24(3):283 - 300. Nola. A. D., Pedrycz, W., and Sessa, S. (1995). Fuzzy relational structures: The state-of-art. F ' r i : q ~ Sets and Systems, 7 5 2 4 I - 262. Nossek, J. A. (1994). Design and learning with cellular neural networks. In Pro<,. Third 1 / 3 3 biternational Workshop on Cellulur Neurul Networks nnd Their Applications (CNNA-Y4), pp. 137- 146.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
437
Nossek, J . A. ( 1996). Design and learning with cellular neural networks. /ii/eriici/ioiiol Jorrr. Circuit T h e o n n r i d Appliccztioiis, 24( I ): 15 - 24. Nossek, J . A,, Seiler. G., Roska, T.. and Chua, L. 0. (1992). Cellular neural networks: theory and circuit design. / n t e n i c i / i o i d Joiir. Circuit Theon' ririd Al)plic~crtion.s.20(5):533-553. Nowak. M. A. and May, R. M. ( 1993). The spatial dilemmas 0 1 evolution. Interiiritioricil Joiir. Bifurctitioii r i n d Chrro.s, 3( I ):3S-78. Ogorzalek, M. J . , Dabrowski. A,, and Dabrowski, W. (1994). Hyperchaos, clustering and cooperative phenomena in CNN arrays compo5ed of chaotic circuits. In Proc,. Third / E E E ltiteriicitioncil Workshoip 011 Cellular Ntwrcil N e t ~ w k (si n d Their App1ictition.s (CNNA-9.11, pp. 3 15-320. Ogorzalek, M. J . , Galias, Z., Dabrowski. W., and Dabrowski. A. (1996). Spatio-temporal cooperative phenomena in CNN arrays composed o t chaotic circuits-siinulation experiments. /iiterriutionrrl Jour. Ciruiit Theory mid App/icutioii.s, 24(3):26I - 268. Ono, Taketoshi. et t i / . (Eds. ). ( 1993). Bruiri M ~ ~ c h r i i ~ i , s oi f~ i Perceptiori ,s ciritl Mrinorj, : Froin Ne~rroiito Behtwior. New York : Oxford University Press. Osuna, J . A. and Moschytz. G. S. (1996). On the separating capability of cellular ncural networks. /ritrriitrtioriri/ Joirriinl of Circuit Theoi:~uiid App/i'lic.titioiis,24(3):253- 259. paasio, A., Dawidziuk. A., Halonen, K., and Porra, V. ( 1 9 9 4 ~ ) .Digitally controllable weights lriterii~tiotiril Workshop o r i in current mode cellular neural networks. I n PIYJC. c., 1 8 - 2 / , p p . 73-71. Cc4liilar N e u r d Networks orid Their App1imtiori.s ( C p ririsio, , : A., Dawidziuk, A,. Halonen, K., and Porra, V. (1996a). Robust CMOS CNN iiiiplenientation with respect to manut'xturing inncuracies. I n Proc. Fourth / E E E Internritiorirrl Workshop o r 1 Ci.l/ulrrr Neitrd Nemvrks ririrl Their Applii~trtioris.Seville. S p n i i i . Jiriie 24-26. pp. 38 I - 386. p r i. d.'s ~ oA,, , Dawidziuk, A,. and Porra, V. (l996b). Pulse stream current mode CMOS CNN I
,'
chip. In Proc. Fiiurth / E E E /iitertiritioiid Work.thop o i i Cel/irltir Neiirril Networks ciiirl 7heir App/icu/ioiis, Selille. Spuin, Jriiie 24-26, pp. 457 -460. Paasio, A., Halonen, K.. and Porra, V. (l994a). CMOS implementation of associative meniory using cellular neural network having adjustable template coeflicients. In 1994 / t i ~ n dSyrnyosiiiin o i i Circuits wid S y ~ t t w i Mliv ~ , .V- Jirrie 2 , 6 4 8 7 - 490. Paasio, A,, Halonen, K., Porra. V.. and Dawidziuk, A . ( 1994b). Current mode cellular neural network with digitally ad,justable template coefficients. I n Pro(,. Fourth /nreriiritioiicrl Confi.ri.rrc~e o i i Microelrctroiiic~.s ,fi)r Neurirl Nc.tMwk.s wid Fiiz-,' Systeriw, pp. 268 -272, 26- 28. Park, S., and Chae, S. (1994). Stereopsis with cellular neural networks. Joirr. Korerii~/ri.sti/uf~ of Te/emritic.swid Electronics, 31B( 12):124- 131. (in Korean) Pnrk, S.,Lini, J., and Chae, S. (1996). Hardware iniplementation of discrete-time cellular neiiral networks using distributed arithmetic. Jour. Koretrii /,i.stitute of Telmcrtic,.s tiiitl Electronics. 33B(l):l53-160 (in Korean). Park, S., Min, S., and Chae, S. (1994). Stereo correspondence with discrete-time cellular neural networks. In / Y O 4 IEEE /nterizatioritil S~mpo.siirrizor? Circuits crjid Sysferri.s. Mrry 30- Juiir 2, 6:225- 228. Paul. S., Huper, K., and Nossek, J. A. (1992). Analog media filtering. In IY92 IEEE lnt. Car$ or1 Acou.stic.s, Sperch r i n d Sigrid Processing. P ~ L I S., I . Huper, K.. and Nossek, J . A. ( I903a). Mapping nonlinear lattice equations onto cellular neural networks. / E E E Trcois. Circuits triicl Sy.s/eiri.s-l. Pnul. S., Huper. K.. Nossek, J. A,, and Chua, L. 0. (1993h). Mapping nonlineur lattice equations onto cellular neural networks. /EEE Trcrris. Circuits rind Sy.stei~i,s1: Fuiirf~iiiirritnlTlieon c r n d Applicution.~,40(3):196- 203. Pedryc~,W. (1991). Neurocoinpiitations in relational systems. IEEE Trm.s. Pciftcrii, A n d . M o ihirir I r i ~ l / . ,13(3):289-297.
438
TAO YANG
Pelayo, I:. J., Prieto, A,, Pino, B., and Martin-Smith, P. (1990). Analog VLSI implementation of a neural network with competitive learning. In 1 9 9 0 IEEE lnterntitioiiril Workshop on Cellultir Neiir~ilNetworks tiriel Their App1iccitiori.s. CNNA-YO, pp. 197 -205. Peleg. S., and Rosenfeld, A. (1989). A niin-max medial axis transformation. IEEE Trcrm. Pattern, Anti/. Mticliine Intell.. PAMI-3: 189- 192. Perdang, J. M., and Lejeune. A. (Eds). (1993). Celliilur Aiitonlutn: Prospects in A.strophy.\ic~cil Applicutions. River Edge, NJ: World Scientific. Perez-Munuzuri. V.. Munuzuri. A. P., Gomez-Gesteira. M.. Perez-Villar, V.. and others. ( 1995). Nonlinear waves. patterns and spatio-temporal chaos in cellular neural networks. Phil, Trcin.s. Royal Soc.. SeritJs A (Phwicnl Sciences c i n d Engineering), 353( I70 I ): I0 I- I 13. Perez-Munuzuri. V.. Perez-Villar, V., and Chua, L. 0. (1993). Autowaves for image processing on a two-dimensional CNN array of excitable nonlinear circuits: flat and wrinkled labyrinths. IEEE Trcl17.s.Circuits tinel Sys1i>ins I: Fiin~ltiinrnttil Tlieor? rind Applicatiori.s, 40(3): I74 - 1 X I . Perfetti, R. ( 1993a). On the convergence of reciprocal discrete-time cellular neural networks. IEEE Trans. Circiiits m e 1 Sysfrrns 1: F~rridumentrilTheory anti Applicririons. 40(4):286 2x7. Perfetti. R. ( I993b). Relation between template spectrum and convergence of discrete-time cellular neural networks. Electronics Letters, 29(25):2208- 2209. Perfetti. R. (1994). On the Op-Amp based circuit design of cellular neural networks. httt~n~ntioritil Jour. Circuit the on^ e n i d Applic~titions,22(S):42S-430. Perfetti, R. (1995). Some properties of the attractors of discrete-time cellular neural networks. I I ~ ~ ( W I U ~ ~ O I IJoirr. NI Circuit T h e o n r i n d Applic.ution.s, 23(5):485-499. Pham, C.- K.. Ikegami. M., and Tanaka, M. (I995a). Discrete time cellular neural networks with two types of neuron circuits for image coding and their VLSl implementations. lElCE Trrirrs. Fiindcnnentci1.s o j Electmiiic.s, Cr~riirii~iniccitions und Computer Scierires, E78-A(8):978-988. Pham, C.- K., Kimura, T., Ikegami, M . , and Tanaka. M. (1995b). Pulse coded cellular neural network and it's hardware implementation. In I Y Y S IEEE Internritiontrl Cov~ferenct,o i i Neitrril N o v . 2 7 - D e c . 1. 4: 1590- 15Y4. Nehrorks Proc~eeding.~, Pineda de Gyvez. J. (1994). XCNN: a software package for color image processing. In Proc. Third IEEE Internntioncil Workshop on Cellulrir Nenrul N e m o r k . r cine1 Their App1icritioii.s ( C N N A - Y 4 ) . Drc. 18-21, pp. 219-224. Piovaccari. A. and Setti, G. (1994). A versatile CMOS building block for fully analogicallyprograminable VLSl cellular neural networks. In Proc. Tlzird IEEE Internc~rionuIWorkshop on Celliilrir Neural Networks cind Tlipir A p p l i c ~ i t i o m( C N N A - Y 4 ) , Drc. 18-21. p. 347. Pitas, I. and Vanetaanopoulos. A. N. (1991). Nonlinecir Digirril Fil/ers: Principles ( n i d Applictitioris. Dordrecht: Kluwer Academic Publisher. Pitas, 1. and Venetsanpoulos, A. N. ( 1990). Morphological shape decomposition. IEEE Tr(in.s. Puttern. Anal. Machine lntell., 12( I ):38-45. Pol, S. A,. and King, R. A,. (1981). Image enhancement using smoothing with fuzzy sets. I Trms., Syst., Mnri, Cylxrn., SMC-ll:494-501. Poliakov, G. 1.. (1972). Neurori Strnctiirc. of the Hrriiri. Cambridge, MA.: Harvard Llniversity Presb. Proceedings. ( 1990). 1 9 9 0 IEEE Intrrncitionul Wor-kshop on Celliil~rNeurtil Networks nntl Their Appliceitions, CNNA-YO: Proceedings. Piscataway, NJ: IEEE Service Center. Proceedings. ( 1992). C N N A '92: Proc. Second Internritioricil Workshop on Celliilrir NeiirriI Networks anti Their Applictiti(117.s. Piscataway, NJ: IEEE Service Center. Proceedings. ( 1994). Proc. Third IEEE Interncitionril Work.shop on Cellirltir Ncwrtil Nrtwork.\ riritl Their App1icLition.s ( C N N A - 9 4 ) . Piscataway, NJ: IEEE Service Center. Proceedings. ( 1996). Proc. Fourth lEEE Internntional Workshop oii Cellitlcir Nrrtrril N e n w r k . ~ (in(/ Their App1ic~tition.s( C N N A - 0 6 ) . Piscataway, NJ: IEEE Service Center. Proceedings. ( 1998). Proc. F f t h IEEE Interncitioricil Workshop on Crllulcir Neurul N o m w k s cind Their Applicotions (CNNA-YR). Piscataway, NJ: IEEE Service Center. -
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
439
Puffer, F., Tetzlaff. R.. and Wolf, D., (1995). A learning algorithm for cellular neural networks (CNN) solving nonlinear partial diffcrential equations. I n PIYJC.IYYS ZJRSI liiternritionr~l ystenis, rind E1ectrorlic.s. ISSSE '9.5, pp. SO I -504. Raffo, L., Sabataini. S. P.. and Risio, G. M. (1996). A prograininable VLSI architccturc based on multilayer CNN paradigms for real-time visual processing. Irrtrrnaliorial .lour. Circuit Theorv arid App/icotiorl.F, 24(3):357 - 367. Rekeczky. C.. Nishio, Y., Ushida, A., and Roska, T. ( 199%). CNN based adaptive smoothing and some novel types of nonlinear operators for grey-scale image processing. In Proc. 19YS /tltrrnLl/i(JllU/ $vn!po.siuni on Nonlinear Tl7eOr) ( / ! i d I t s Applications (NOLTA '95). Rekeczky, C . , Nishio, Y.. Ushida, A,, and Roska, T. (199%). CNN based adaptive smoothing and some novel types of nonlinear operators for grey-scale image processing. In 199.5 Intrrnr~tiona/Svmpo.viuin 011 Nonlinear Tlieoi?; rind I t s Applicutiom (NOLTA 'YS), p p . 683-688. Kekeczky, C., Ushida, A,, and Roska, T. ( 1 9 9 5 ~ ) .Rotation invariant detection of moving and standing objects using analogic cellular neural network algorithms based on ring-codes. IEICE arid Conrputrr Scirnces. E78-A( 10): Trrins. Fuiidcnnental.~of Electmnicx, Coinrr~ui~ic~rrtioii.s 1.116- 1330. Rel,jin, B., Serdar. T., Kostic, P., and Pavasovic. A. (1995). CMOS VLSI realization of voltagemode programmable analog ccllular neural network. In IYYS 20th lnternationul Conference on Microe1ec~troiiic.s.Proccwlings, Sept. 12- 14, 2:497 505. Rodriguez-Vazquex, A,. Espejo, S., Doininguez-Castro, R., Huertas. J. L., et rrl. ( 1993). Currentmode techniques for the implementation o i continuous- and discrete-time cellular neural networks. IEEE Trans. Circuits and Sy.stein.s I / : A n d o g rind Digitrrl Sigiid Processbig, 40(3): 132- 146. Rodriquez-Vazquez, A,, Dorninguez-Castro, R., and Huertas, J. L. ( 1990). Accurate design of analog CNN in CMOS digital technologies. I n IYY0 IEEE Oitc~rnationalWorksliop on Cc,//ulrir Neunrl Netv1wk.s rind Their Appli'licrition.s. CNNA-90, Dec. 16- I Y, pp. 273 - 280. Roska, T. ( 1992). Programming CNN: a hardware accelerator for simulation, learning, and realon Circ14it.srlrid Sv.s/ein.s,pp. 437-440. time applications. In P ~ o c .35th Mirfwest Syrrip~~.viuni Roska, T. (1994a). Analogic algorithins running on the CNN universal machine. In Pmc. Third Workshop on Cellular Neurctl Netktwks and Their App1iccitiorr.s (CNNA-Y#), IEEE Interni~fiorcr~l pp. 3-8. Roska, T. (1994b). The CNN universal machine-a summary of an analogic supercomputer chip architecture for very high speed visual processing. In 199.1 CERN Sclrool of Coniputing Proceedings (CEKN 95-01). I994 CEKN School of Cotnl)u/itrg Proceedings, pp. 295 - 297. Roska, T., Bartfai, G., Szolgay, P.. Sziranyi, T., and others. (1990). A hardware accelerator board for cellular neural networks: CNN-HAC. In I Y Y O IEEE Internutioricil Workshop o i i Cellirlrir Neunil Networks rind Their Appliccrtions, CNNA-YO, 1 k ~ 16. 19, pp. 160- 168. Roska. T.. Bartfai, G., Szolgay. P., S7il.anyi. T., and others. ( 1992a). A digital inultiprocessor l Circ.rrit hardware accelerator board for cellular neural networks: CNN-HAC. I i r t e i ~ i i t r t i o i i r ~four. Theory [ M I Applicutions, 20(5):589 - 599. Roska, T., and Bartfai. G. (1990). CNN-HAC: a digital multiprocessor hardware accelerator for general cellular neural networks. In Hirngtrrinn Acnd. Sci., Burkipst, Hungriry (Technical report). Roska, T., Boros, T., Radvanyi, A., Thiran, P., and others. ( 1992b). Detecting moving and standirig objects using cellular neural networks. Interntitionul Jour. Circuit TI~eoryrind Appliccitions. 2O(S):613 -628. Roska, T., and Chua. L. 0. (1990). Cellular neural networks with nonlincar and delay-type tcrnplate elements. In lYY0 IEEE Intrmutioncil Workshop on Cellulrir Neurtrl Nerworks und Their Applicritions. CNNA-YO. pp. 12- 25. -
440
TAO YANG
Roska, T.. and Chua, L. 0. ( 1 9 9 2 ~ ) The . CNN universal machine. I . the architecture. In CNNA '92 Proceedings. Second Interntrtioiicrl Workshop on Cellulrrr Neurul Nehvorks and Their Applications, pp. I 10. Roska. T., and Chua. L. 0. (1992d). The CNN universal machine. 11. programinability and applications. In CNNA '92 Pniceeditigs. Secorrd Iittemutionul Workshop on Cellulrrr Neurcrl Network.s cirrd Their Applications, pp. 18 I - 190. Roska, T., and Chua, L. 0. (1993a). Cellular neural networks with nonlinear and delay-type template elements and nonuniform grids. Iriternationnl Jour. Circuit Theor?: unci App/ictrtioii.s, 20(5):469-481. Roska, T., and Chua, L. 0. (1993b). The CNN universal machine: an analogic array computer. IEEE Truris. Circuits mid Sy.s/em.s 11: Ariulog trnd Digital Sigrid Processing, 40(3):I63 - 173. Roska, T.. Chua, L. O., Wolf, D., Kozek, T., and others. (1995). Simulating nonlinear waves and partial differential equations via CNN. I. basic techniques. IEEE Truns. Circuits mid Systetns 1: Furrdunieritul Theor?: und Applications, 42( 10):807- 8 IS. Roska, T., and Kek, L. (Eds.). (1995). Median-removes impulsive noise from a grey-scale image. p.34 of CSL-CNN software library: Templates and algorithms (version 6.4). Technical report, Computer and Automaton Institute (MTA SzTAKI) of The Hungarian Academy of Sciences, Budapest. Roska. T. and Radvanyi, A. (1990). CNND simulator. cellular neural network embedded i n a simple dual computing structure. user's guide version 3.01. In Hurignrinrr /\cud. Sci., Budupe.si, -
Hurrgtrn.
Roska. T., and Vandewalle, J. (Eds.). (1993). Cellulur Neirrul Networks. New York: Wiley. Roska, T., Wu, C. W., Balsi, M., and Chua, L. 0. (1992e). Stability and dynamics of delay-type general and cellular neural networks. IEEE T r m x Circuits mid Systems I: Fundunzrnrtrl Tlreoq atid Applicclrions, 3Y(6):487-490. Roska. T., Wu, C. W., and Chua, L. 0. (1993). Stability of cellular neural networks with dominant nonlinear and delay-type templates. IEEE Trcrns. Circuits niid Systems I: Funciurnerittrl Theon' tnzd Applicutions, 40(4):270- 272. Rueda, A,, and Huertas, J . L. (1992). Testability in analogue cellular neural networks. Iiitertroriotla/ JiJUr. Circuit Theory crnd A p p l i c ~ t i o t i ~20(5):583 , - 587. Russ, J. C. (1992). The Inirige Prucessing Hurrdhook. Boca Raton, FL.: CRC Press. Russo, F. (1992). A user-friendly research tool for image processing with fuzzy rules. In Proc. First IEEE In/. Conf o i i Frizzv Svstenl, Fuzzy-IEEE '92, pp. 56 I - 568. Russo. F.. and Ramponi, G., (1994a). Combined FIRE filter for image enhancement. In Pro(,. Third IEEE Confererice O I I Fiizy Syster1l.s. IEEE World Congress OII Coni~~iitcitioricrl Intelligenc~e. pp. 249-253. Russo. F., and Ramponi, G., (1994b). Edge extraction by FIRE operators. In Prnc. Third IEEL Conference oir Fuzzy Systenrs. IEEE World Congre.s.s on Cumpututioriul Intelligence, pp. 249-253. Russo, F., and Ramponi, G. ( 1 9 0 4 ~ ) .Nonlinear fuzzy operators for image processing. S i g i i d Processing, pp. 429 -440. Russo, F.. and Ramponi, G. (1995). A fuzzy operator for the enhancement of blurred and noisy images. IEEE Truris. Initrge P recessing, 4(8): I 169 - I 174. Salerno, M., Sargeni. F., and Bonaiuto, V. (1996). 6x6 DPCNN: a programmable mixed analoguedigital chip for cellular neural networks. In Proc. Fourth IEEE lnternutionnl Workshop on Cellular Neirrul Networks and Their Aplicutiun.s, Seville. Spain, Jiirie 24- 26, pp. 45 1 -456. Salerno, M., Sargeni, F., and Bonaiuto, V. (1995). DPCNN: a modular chip for large CNN arrays. In 1995 IEEE Sytnposium on Circuits arid Systems (Cut. No.95CH35771). I995 IEEE Symposium otz Circuits und Systeni.s. April 28-May 3, 1:4 17-420. Sanchez, E. (1976). Resolution of composite fuzzy relation equations. In$ Cotitl., 30:38-48.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
441
Snni, A., Graffi, S., Masetti, G . , and Setti, G. (1994). Design of CMOS cellular neural nelworks . IEEE Iiiternritioiuil Worksliop o i i Cellulnr operating at several supply voltages. In P r ~ c Third Nerrrul Networks uric1 Their App/icutions (CNNA-941, Dec. IH-21, pp. 363-368. Sargeni, F. ( 1994). Digitally programmable transconductance amplifier for CNN applications. Electronics Letters, 30( I I ):870-872. Sargeni. F., and Bonaiuto, V. ( 1994). High perfomlance digitally programmable CNN chip with discrete templates. In Pro(,. Third IEEE liiterricftioiid Work.shop o i i Cellular Neirrrrl Networh uric1 Their App1iccitioti.s (CNNA-Y4), Dec. 18-21. pp. 67-72. Sargeni, F., and Bonaiuto, V. (1995). A fully digitally programmable CNN chip. fEEE Trcins. Circuits orid ~ v s t c n iIf: . ~ Anei/og cfrrd f>igitci/ S i g r i d Processing, 42( 1 1 ):741 -745. Sargeni, F., and Bonaiuto, V. (1996). A 3*3 digitally prograrnmable CNN chip. Intrriicitioiiul JOLW.Circuit Theory curd Applications, 24(3):369- 379. Sato, T., Ushida, H., and Yaniaguchi. T. (1995). Retrieval system to generate facial expressions fiiteriiritiontil Coi!ference on Fuzzy Systcins, pp. 148')- 1494. using chaos. In Proc. 1995 I Schulcr, A. J . , Brabec, M., Schubel, D., and Nossek, J. A. (1994). Hardware-oriented learning ior cellular neural networks. In Proc. Tliirtl IEEE Intertirrtioriul Workshop 011 Cellulcir Neurcil Nehvorks mid Their Applictrtions (CNNA-94), pp. 183- 188. Schuler, A. J.. Nachbar, P., Nossek, J. A,, and Chua, L. 0. (1992). Learning state space irajectories in cellular neural networks. In CNNA '92 Proceediiigs. Second fnrerriationczl Work.rhop on Celblcir Nourol Nehr'orks nnd Their App1icution.s. pp. 68 -73. Serra. J. (1982). Irncigr Arrci1wi.s rlzcnioticol Morphology. New York: Academic. Serra, J. (Ed.). ( 1988). lnrtrge Aii el Meith~~i~icfticrd Morphology (vol. 2): T I i r o i ~Advtmces. New York: Academic. Scthi, I. K., and Jain. A. K . (Eds.). ( 1991). Artificier1 Neurul network.^ eind Strrtistical Patterii Recognition: Old mid New Coniwctioii.~.New York: North-Holland. Shnnn, J . J., and Fu, H. C. (1995). A f u z ~ yneural network for rule acquiring on fuizy control systems. Fuzzy Sets rriid Sy.stems, 71:345-357. Shcu, B. J., Bang, S. H., and Fang, Wai-Chi (1994). Analog VLSI design of cellular neural networks with annealing agility. In Proc. Third IEEE Int~~rtiutioiwlworks hey^ on Cellular Neirrcrl Networks ernel Their App/iccr/ions (CNNA-94). Dec. IK-21, pp. 387-392. Shcu, B. J., Bang, S . H., and Fang, Wai-Chi (199Sa). VLSI design of cellular neural networks Circuirs ruid Sy,~terri.s with annealing and optical input capabilities. In 199.5 IEEE Synzpo.siitm (Cot. No. 9SCH.35771), April 2K-Mtry 3, 199s IEEE LSyt~iposiirrno i i Circrrits oriel Sy.stei?i.s, 1:653-656. Sheu, B. J., Chang, R. C., Wu, T. H., and Bang, S. H . (1995b). VLSI-compatible cellular neural networks with optimal solution capability for optiinir.ation. In 1995 fEEE Svinposiuni (ni Circuits titid S j w o i i i s (Celt. No.95CH.35771). April 2 8 - May 3. 1995 IEEE Sytnpo.sium on Circuits and Systenis, 2: 1 165 - I 168. Shi, B. E., and Chua, L. 0. (1992). Resistive grid image filtering: input/output analysis via the CNN framework. IEEE Puns. Circuits rind Systrms I: Fuiiiiuinentnl Theory niid Applicci/ions. 39(7):531-548. Shi, B. E., Roska, T., and Chua, L. 0. (199.3). Design of linear cellular neural networks for motion sensitive filtering. IEEE Priiis. Circuits (2nd Systeins f I : Ancilog cuid Digitcil Signed Proce.vsiiig, 40(5):320- 33 I . Shih, F. Y., and Mitchell, 0. R. (1988a). Automated fast recognition and location of arbitary shaped objects by image morphology. In Proc,. IEEE Conj; Cornpurer Visrori rind Perttern Kwognitinn, Jun. 5-9, pp. 774-779. Shih, F. Y . , and Mitchell, 0. R. (1988b). Industrial parts recognition and inspection by image morphology. In Pmc. l9XK fEEE fnterncitionnl Car$ O I I Kohotics and Autoincctiori, Apr. 2 4 - 2 9 , 3: 1764- 1766.
442
TAO YANG
Shimizu. N., Cheng, G.- X., Ikegami, M., Nakamura. Y.. and others. ( 1994). Pipelining GaussSeidel method for analysis of discrete time cellular neural networks. l€/C€ Traus.Fitridrii~i~rit d s of' Electronic.\, Conimcrriications mid Coriiputer Sciences, E77-A(8):1396- 1403. Slavova, A. ( 1995). Cellular neural networks with nonlinear dynamics. Neurtrl, Ptimllel u i i d Scierit$c Coniprrtatioii, 3(3):369- 377. Slot. K. ( 1992). Optically realized feedback cellular neural networks. In CNNA 'Y2 Procrrdin,qs. Second Iiiteniatioriul Workshop on Cellulur Ncwral Nenvorks uncl Their Applicufims, Ocr. 1 4 - 1 6 . pp. 175-180. Slot, K. (1994). Large-neighborhood templates implementation in discrete-time CNN universal machine with a nearest-neighbor connection pattern. In Proc. Third I€€€ Interi~citioiitrl Workshup o r 1 Crllirlar Neitrul Netbvorks ~ i i t Their l Applicutions (CNNA-Y4).pp. 2 I3 - 2 18. Slot, K., Roska, T., and Chua, L. 0 . (1992). Optically realized feedforward-only cellular neural networks. Arcliiv jirr Elektronik untl liehrrtrrrguiig.stcc~iiiik,46(3): 158- 167. Special Issue on Cellular Neural Networks. (1996). Itit. Jour. Cirririf Tlieoiy mid A p p l x , 24:( 1 ). Special Issue on Cellular Neural Networks. ( I093a). I€€€ Truris. Circwirs (uid Sy.ctrms-1: Funclunrenttrl T h e o y rind Applicutions. 40:(3). Special Issue on Cellular Neural Networks. ( 1993b). I€€€Trans. Circuits uritl Systrm-11: Aiiulofi n r i d Digitrrl Sipirrl Processing. 40:(3). Special Issue on Cellular Neural Networks. ( 1095). Iiit. Jour. Circuit Theor)) mid App/.s.. 24:(3). Special Issue on Cellular Neural Networks. (1992). Int. J . Circuit Theor?:and Applx, 20:(5). Sullivan. G. O., Horan, P., Hegarty, J., Kakizaki, S., Kelly, B., and McCabe, E. (1906). A fully optically addressable connected component detector i n CMOS. In Proc. Fourth /EE€ Intcvutrtiorrul Workshop oii Cellulur Neural Networks unrl Their Ap/ictrtioi?s, Sevillc.. S p i i u , June. 2 4 - 2 6 , pp. 439-444. Suykens, J. A. K.. Yang, T., and Chua, L. 0. (1098). Impulsive synchronization of chaotic Lur'e systems by measurement feedback. Intemationul Jour. Btfifilrcufionurzd Chuos, S(6). pp. I37 1 1381. Suzuki. H., Matstunoto, T., and Chua. L. 0. (1992). A CNN handwritten character recognizer. Inferriatioiiul Jour. Circuit 7/ieoi:v and App/ic.nrioii.s. 20(5):601 -61 2. Sziranyi, T. (1996). Robustness of' cellular neural networks in image deblurring and texture segmentation. Iriterncitional Jour. Circuit T h e o y and Applicntioiis, 24(3):38 I - 396. SLiranyi, T., and Csapodi, M. (1994). Texture classilication by cellular neural network and genetic learning. I n Proc. 12th IAPR Iiiterntrtionul Conference 017 Puttem Recognition, pp. 381-383. S~olgay,P., Katona, A,, Eross, G., and Kiss, A. (1994). An experimental system for path tracking of a robot using a 16*16 connected component detector CNN chip with direct optical input. In Proc Third l E € € Irilernational Worksliop oii Cellular Neurd Networks a r i d Their App1icntion.s (CNNA-94). Dec. I N - 2 / , pp. 261 -266. Szolgay. P., Kispal, I., and Kozek, T. (1992). An experimental system for optical detection of layout errors of printed circuit boards using learned CNN templates. I n CNNA '92 Prot.eedirigs. Second Internutioncil Workshop on Cellulur Neurcil Netww-ks und Their AppIi,lic.citiori.\, pp. 203-209. Tanaka. M., Crounse, K. R., and Roska, T. (1994). Parallel analog image coding and decoding of Electronics, Cortimuiiiccitioii.r by using cellular neural networks. IEICE Truns. F~r~idarneritrrls arid Computer Scirnces, E77-A (8): 1387- 1395. Tao. L. H., Xi, Y. L.. Yun, W. B., and Ya. H. Z. (1995). A new type of chaotic attractor with cellular neural networks. In Proc. ISCAS'YS - lrrtenintional Syinpo.siimrr~oil Circuits uritl Systcwis, pp. 997 - IOOO. Tetzlaff, R.. and Wolf, D. (l996a). A learning algorithm for the dynamics of CNN with nonlinear templates-part I: Discrete-time case. In Procecdings ofthe Fourth I€€€lnteriiutioiiul W O ~ ~ . ~ / I O [ J or1 Cdlular Neurul Networks mid Their Aplic~cctioris.Seville, Spuin, June 24-26, pp. 46 I -466.
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
443
Tetilaff, R.. and Wolf, D. (1996h). A learning algorithm for the dynamics of CNN with nonlinear templates. part 11: Continuous-time case. I n Pro(.. Fourth IEEE Inrernrrtioncil Work.~IiopO N Cellular Neirrrrl Networks and Their Aplicutiow Seville. S p i n , June 24-26, pp. 467 -412. Thalmann. N . M., and Thalinann, D. (1994). ArtiJicicd Lrfe unrl Virtuul Reuli/y. New York: Wiley. Thiran, P., Crounse, K. R., Chua, L. O., and Hasler, M . (1995). Pattern formation properties of autonomous cellular ncural networks. IEEE Trtrns. Circuits rind Systems I: Fun~l~imentul Tliroiv trnd Applic(rtions, 42( IO):757-774. Jiir Morleling. Cambridge: Toffoli. T . ( 1987). Cellulrrr Autonintn Mrichines: A New Eni~ir~m~nent MIT Press (Series in Scientific Computation). Torres, L., and Kunt, M. ( 1996). Video Cotling: Tlie Src~ond Gc,nerci/ion Approtrch. Boston: Kluwcr Academic Publishers. Toth, G., Lent, C. S., Tougaw, P. D., Brazhnik, Y., c’t trl. (1996). Quantum cellular neural networks. Siipr1ortice.s mid Microstructure.s, 20(4):473 -478. Tryba, V. Heider, H., and Muhlenfeld, E. (1995). Auronratic Design qf’Fuz:y Systems liy Genetic AI,~prithm.s,pp. 21 -28. River Edge, NJ: World Scientitic. Tiionas, P. (1996). A cellular neural network learning the pseudorandom behaviour of a complex system. Internutiontrl .lour. electronic.^, 80(3):405-41 3 . Utxliick, W., and Nossek, J . A. (1994). Computational learning theory applied to discrete-time cellular neural networks. In Proc. Third IEEE lnterncrtionrrl Workshop on Cellnlur Neurcrl Networks and Their Applications (CNNA-Y4), pp. 159- 164. Van Dam, J. W. M., Krose, €3. J . A., and Groen, F. C. A. (1994). CNN: a neural architecture that learns multiple transformations of spatial representations. In ICANN ‘Y4. Proc. Inrr~rnrrrioncd L‘onjerence on Artijiciul Neurrrl Network.v, pp. 1420- 1423. Vandenberghe, L., Tan, S.. and Vandewalle, J. ( 1990). Cellular neural networks: dynamic properties and adaptive learning algorithm. In Neurcil Nerwork.s. EURASIP Workshop 1990 Prongs. Neural Networks. EURASIP Workshop 1990 Proceedings, pp. 141 - I SO. Varrientos. J. E., Ramirez-Angulo, J., and Sanchez-Sinencio, E. (1990b). Cellular neural network implementations: A current mode approach. In I Y Y O IEEE Inrrrncitional Workshop of1 Cc4lulrtr Nourrtl Networks mid Their App1icatioii.r. CNNA-YO, Dec. 16- 19, pp. 216-225. Varrientos, J. E., Ramirez-Angulo, J., and Sanchez-Sinencio, E. (1990a). A current-mode CMOS cellular neural network. In Proc. 33rd Midwest Symposiuni 017 Circuits rind Sy.steni.s, pp. 12- 14. Varrientos, J. E., and Sanchez-Sinencio. E. (1992). CELLSIM: a cellular neural network simulator for the personal computer. In Proc. 35th Midwest Syrnpo.siuni on Circuits on(/ Swstems, pp. 1384- 1387. Varrientos, J . E., Sanchez-Sinencio, E., and Ramirez-Angulo. J . (1993). A current-mode cellular ~. m i l Systems 11: Analog mid Digital neural network implementation. IEEE T r ~ r i . Circuit.s S i g r i d Proces.sing. 40(3): 147- 15.5. Venctianer, P. L., Werblin, F., Roska. T., and Chua, L. 0. (1995). Analogic CNN algorithms for sonie image compression and restoration tasks. IEEE Truns. Circ~uirsand Sy.strinu I: I~’uriclametiralT1ieor.v and Applicciriorzs, 42(5):278 - 284. Wen, K.- A,, Su, J.- Y., and Lu, C.- Y. (1994). VLSI design of digital cellular neural networks for image processing. Jour. visuul Communiccrtion arid ImnRe Representation, 5(2) Werhlin, F., Roska, T., and Chua, L. 0. (1994). The analogic cellular ncural network d Circuit Tlieory and Appliintion.r, 23(6):S41 -569. eye. I n t ~ ~ r n u t i n nJour. Wu, C. W. Xing, T. and Chua, L. 0. (1996). On adaptive synchronization and control of nonlinear dynamical systems. Internationnl .lour. R[frrrcution und Chaos. 6(3):4SS-47 1. Yager, R. (1979). A measurement informational discussion on fuzzy union and intersection. Int. Jorrr. Mrtn-Machine Studie.s, 1 1 : 189-200, 1979. Yagcr, R. R. and L. A. Zadeh (Ed ( 1992). An Introtlucriori to Fuz7Y Logic Applicrttion in lrrt~lligrrztSystems. Boston: Kluwer Academic Publishers.
444
TAO YANG
Yaniakawa, T. ( 1990). Pattern recognition hardware system employing a fuzzy neuron. I n Proc. Int. Car$ F w ; y Logic. pp. 934-938. Yamakawa T. and Furukawa, M. ( 1992). A design algorithm of membership function for a fuzzy neuron using example-based learning. In Pro(,. IEEE Inr. Con$ Fnzzy Sysr. (FUZZY-IEEE'92). pp. 943-948. Yang. C. M., Yang. T., and Zhang, K. Y. (1994). Chaos in the discrete time cellular neural networks. I n Proc. Third IEEE Internutioncrl Workshop on Celldur Neurul Nehtwrk.\ mid Their App/iccrtiorr.s (CNNA-94), pp. 297 - 302. Yang, H.-K., Yakout, M. A,. and El-Masry, E. I . (1994). Current-mode implementation of discrete-time cellular neural networks using the pulse width modulation technique. In Proc.. 37th M i d w w t Symposium on Circuirs arid Systems, Aug. 3 - 5, 1:4.57-460. Yang. L., Chua. L. 0. and Krieg. K. R. (1990). VLSl implementation ofcellular neural networks. I n I990 IEEE International Svinposiurn on Circuits cord Sysrerns, pp. 2425 -2427. Yang, T. ( 1994). Blind signal separation using cellular neurnl networks. Internorioritrl Jortr. Circuit Theory arid App/ic.utions, 22(5):399-408. Yang, T. and Chua, L. 0. (1998a). Applications of chaotic digital code-division multiple access(CDMA) to cable communication systems. Irltr~rnntiontrl Joirr. Rfurc.ntion m c l Clitros, 8(8):1657- 1669. Yang, T. and Chua, L. 0. ( I998b). Error performance for chaotic digital code-division multiple access(CDMA). /ntrrnutiorr~r/Jour. B$urcation trnd Chaos. 8( 10):2047- 2059. Yang. T. and Chua, L. 0. ( I 996a). Channel-independent chaotic secure communication. InternrrtionuI Jour. Rfurcatiori nrid Chaos, 6( 12Bj:26.53- 2660. , and Chua, L. 0. (199613). Secure communication via chaotic parameter modulation. Trum. Circuir.r nnd Sy.sr~'m.s-I: Fundamental Tlieorv c m d App/ic-arion.s. 43(9):8 I7 - 8 19. Yang, T. and Chua, L. 0. (1997a). Chaotic digital code-division multiple access (CDMAj systems. International Jour. Bifurcation arid Chaos, 7( 12):2789- 2805. Yang, T.. Chua, L. 0..and Crounse, K. R. ( 1 9 9 6 ~ ) .Application of discrete-time cellular neural networks to image copyright labeling. I n P roc. Fourth IEEE Intrriurtionul Work.shop on C'rllrtltrr N m r d Netviwrks (2nd Their Aplicariorrs (CNNA-96), pp. 19 -24. Yang. T., Wu, C. W., and Chua, L. 0. (1997g). Cryptography based on chaotic systems. IEEE T ~ / I I . Circuits v. c/11t1Sv.\ferll.S -I: F/ttIdCrmetitcl/ The()? U t I d Applicati0Il.S. 44(5):469 -472. Ymg, T.. Yang, C.-M.. and Yang, L.-B. (1998d). A detailed study of adaptive control of chaotic systems with unknown parameters. Dyncmrics und Conrrol, 8(3):2.5.5-267. Yang, T., Yang, C. M.. and Yang, L. B. (1998e). Break chaotic switching using generalized synchronization: Examples. lEEE Trrms. Circuits und S y s t r m - I: F~trrdumentc~lTheory u ~ d Applictrtioiis, 45( 10): 1062- 1067. Yang. T., Yang, C. M., and Yang, L. B. (1998f). The differences between cellular neural network based and f~izzycellular neural network based mathematical morphological operalions Irrti~rnutioncr/Jour. Circuir T h e o n und App/ictrtion.s, 26( 1 ): 13 - 2.5. Yang, T., and Yang, L. B. (1996). The global stability of f u u y cellular neural network. IEEE Trum. Circuirs rind System.s-I: Fundmwitcrl Theory urid Applicatiorr.v, 43( 10):880- 883. Yang. T., and Yang, L. B. (1997d). Application of f u z y cellular neural networks to Euclidean distance transformation, IEEE Truns. Circuits and Svstetrrs- 1: Fundanrentul Theoiy and Allplicutions, 44(3):242-246. Yang. T., and Yang, L. B. (1997e). Application of fuzzy cellular neural networks to moi-phological gray-scale reconstruction. Internationcil Jour. Circuir Theory rind A[J[dii~utions, 25(3):153- 165. Yang, T., and Yang, L. B. (1997f). Fuzzy cellular neural network: A new paradigm for image processing. Inrernationnl Jour. Circuit Theoiy trnd App1iccztion.i. 25(6):469-48 I .
FUZZY CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS
445
Xing, T., Yang, L. B., Wu, C. W., and Chua. L. 0. (1996d). Fuzzy cellular neural networks: Applications. In Proc. 4//7 IEEE 1171. Workshop on Cellulnr Neurrrl Networks nrid Tl7eirA~~plic.c~rioirs (CNNA ‘96).pp. 225 - 230. Yang, T., Yang, L. B.. Wu, C. W., and Chua, L. 0. (1996e). F u z ~ ycellular neural networks: ‘Theory. In Proc. 4/11 /LEE In/. Work.shopori Ct~llulrrrNeural Network.s eincl Their App1icutiori.s (CNNA‘96).pp. I8 1 - 186. Yiing, T., Yang, L. B., and Yang, C. M. (1997i). Fuzzy cellular neural network. Technical Report Memorandum No. UCB/ERL M97/6 I , Electronics Research Laboratory, College of Engineering, University of California, Berkeley, 3 Sept. 1997. pp. 1 - 196. Yang, T.. Yang, L. B. and Yang. C. M. (19988). Fuzzy cellular neural networks and their appli, 1):78- 8.5. cations. Cl7ine.w Jour. ElechUiiic.\ (Erigli.sh k r ~ s i o r i )7( Yung, T., Yang, L. B., and Yang, X. P. (1996f). Application of cellular neural network to facial cxpression animation and high-level image processing. Interntrtioriol Joirr. Circuit Tl7eorv rrritf Applicc/ti/lrl.s,24(3):425- 450. Yrung, T. ( 1995a). Application of cellular neural network to map recognition. Jorrr. Torigji Uniwr,c.ir\..23( l):107- 112. (In Chinese). Yang, T. (1995b). Recovery of digital signals from chaotic switching. Iiitrmnrioiinl Jour. Circuit Theory cud App1icrrtioii.s. 23(6):61 1-615. Yang, T.. and Chua, L. 0. (l997b). lrnpulsive control and synchronization of nonlinear dynanio ~ i ~ Bifirccitiori rl ond ical syslems and application to secure communication. l ~ r r ~ r ~ ~ t r / iJaw. C’htro.~,7(3):64S-664. Yang, T., and Chua, L. 0. ( 1 9 9 7 ~ ) .Impulsive stabilimtion for control and synchronization of chaotic systems: Theory and application to secure conimunication. l E E E Trclrrs. Circuits trnrl .Syvsterr~.s-I: F ‘ u ~ i ~ / nTheorv ~ ~ ~ urid t ~ ~Applicatioii.s, ~ / ~ ~ ~ 44( 10):976-988. Yang, T., Suykens, J.A.K., and Chua, L. 0. ( 1 9 9 8 ~ ) .Impulsive control of nonautonomous chaot~ .~Bifilrcrrtioii d rmcl Chrros. 8(7):1557ic systems using practical stabilization. l ~ ~ / r r ~ r u /,i/ o w 1564. Yang, T., Yang. C. -M., and Yang, L. -B. (I997g). Control of Riissler system to periodic motions using inipulsive control method. Physics Lt,t/er.s A, 232(5):356-361. Yang. T., Yang, C. M.. and Ymg, L.-B. (1998). Genetic optimization of fuzzy cellular neural Jour. Circuit Thron. networks -get knowledge from both learning and structures. lrr/errr~~/ioritd rirrrl App1iccirion.s. Subniitted. Yang. T., Yang, C. M.. and Yang, L.-B. (1998). Learning algorithm of fuzzy discrete-time cellular neural networks. IEEE Trcrii.suc/inrisoil Circuits crrrtl Sy,srerri.s-l: Furrc/rirnentnl the on^ errit/ Applictrtiorrs. Submitted. Yang, T., Yang, L.-B.. Yang, C.-M., Yang. X.-P., and Yang, H.-N. (199Xi). Linguistic flow in furzy discrete-time cellular neural networks and its stability. IEEE Trtr17.s. Circuirs ~ i i i r l .S\,.S~CI~I.S -I: ~ ‘ J / I I L ~ O I I I ~ I I ~The/)r\> (I/ C / I I ~ /AP/J/~CO/~OIIS, 45(9):869 - 878. Yanp, T., Yang, L. B., and Xing, C. M. (1998h). Theory of control of chaos using sampled data. physic.^ Letters A, 246(3 - 4):284 - 288. Yang, T., Yang, L.-B., and Yang, C.- M. ( 1997.1). Impulsive control of Lorenz system. Physicrr I), 100:18-24. Yang. T., Yang, L.- B.. and Yang. C.- M., (lW7k). Impulsive synchronization of Lorenz systems. Physics Lettrr.~A , 226(6):349- 354. Yanp, T., Yang, L.- B.. and Yang, G . (1994a). On unconditional stability of the general delayed cellular neural networks. I n Proc. Third IEEE Ii~terriutionnl Workshop OIZ Collrtltrr N c w w l Nomvrks trrid Their Applicrrtions (CNNA-94). pp. 9 - 14. Yang. X . P.. Yang, T., and Yang, L. B. (1994b). Extracting focused object from defocused hackground using cellular neural networks. In Pro(,. Third /EEE Iriterrin/ioricrl Workshop ori Crllultrr Nritrrrl Netawrks c i t d Their App1icdori.s (CNNA-94), pp. 45 I -455.
446
TAO YANG
Yin, L., Yang, R., and Neuvo. Y. (1996). Weighted median filters: A tutorial. IEEE Truns. Circuits Systenix - I / , 43( 3): 157 - 192. Yli-Harja, O., Astala, J . , and Neuvo, Y. (1991). Analysis of the properties of median and weighted median filters using threshold logic and stack filter representation. IEEE Truns. Signtrl Processing, 39: 395 -4 10. Zadeh. L. (1965). Fuzzy sets. lnfoi-ni. t r n d Coritrol, 8: 338-353. Zadeh. L. A,, Fu, K. S., Tanaka, K. and Shimura M. (Eds.). (1975). Fuzzy Sets trnd T/wir A/'p1icution.s to Cognitive und Deci.sion Processes. London: Academic. Zarandy, A,, Werblin, F., Roska, T., and Chua, L. 0 .(1996). Spatial logic algorithms using basic morphological analogic CNN operations. Intemntioriul Joitr. Circuit Theor? urrd A ~ i p l i ~ ~ u t i ~ ~ n s . 24(3):283-300. Zhang, X.. Hang, C. C., Tan, S., and Wang, P.Z. (1996). The min-max fiinction differentiation and training of fuzzy neural networks. IEEE Trciris. N e u r d Networks, 7(5): 1 139- I 150. Zhang, X. H., Hang. C. C., Tan, S.. and Wang, P.Z. (1994). The delta rule and learning for min-max neural networks. In Proc. IEEE-ICNN'Y4, 1: 38-43. Orlando, FL. Zhou, Yi-Tong ( 1992). Artijciul Neurul N e t w o k s ,for Compufer Vision. New York: SpringerVerlag. Zou. F.. Katerle, A,, and Nossek, J . A. (1993). Hoinoclinic and heteroclinic orbits of the threeTl,roi-\. rrntl cell cellular neural networks. IEEE Trcrri.~.Circuits rind Systems /: F~rnr/rrrnen~r~l App1iccition.s. 40( I I ): 843 -848. Zou, F., and Nossek, J. A. (1991). A chaotic attractor with cellular neural networks. lEEE Truris. Circuits crud Svstrms, 38(7):8 I I - 8 12. Zou F.. and Nossek, J . A. (1992). Double scroll and cellular neural networks. In I9Y2 IEEE Internrrtionul Symposium on Circuits rind Sys~e~ns. 1:320- 323. Zou, F., and Nossek, J. A. ( 1 9 9 3 ~ ) .An autonoinous chaotic cellular neural network and Chua's circuit. Jour. Circuits, Sv.sten7.s nrrd Cotnputer.~,3(2):59 I -601, Zou, F., and Nossek, J . A. (1993h). Bifurcation and chaos in cellular neural networks. lEEE Truns. Circuits trnd Systems I : Arndumenrul Tlieor?;trrid Applicntions. 40(3): 166- 173. Zou, F., and Nossek, J. A. (1993a). Hopf-like hifiircation in cellular neural networks. In (Proceediirp) I993 IEEE Infernnfioncrl Syn/iosiuni on Cirwits trnd Systems., 4: 239 I 2394. Zou F., Schwarz, S., and Nossek. J . A. (1990a). Cellular neural network dcsign using a learning algorithm. In 1990 IEEE Intertitrtioncrl Workshop on Ce/lulur Neural Networks r i n d Thrir A/J/J/iUitiOn.S, CNNA-90, pp. 73 - 8 I . Zou. F., Schwari, S., and Nossek, J . A. (l99Ub). Cellular neural network design using a learning s Tlieir algorithm. In IYYU lEEE Inrernufioncil Workshop on Cellultrr Neurcrl N e t ~ ~ o r kwid Applicntion.s, CNNA-90, pp. 73- 8 I . -
INDEX A
future work and conclusions, 421 -23 local connectedness, 266, 277 multilayer (MCNN), 272 nature of, 266-68 nonlinear synaptic laws (NCNN), 271-72 structures of conventional, 269-75 unified structure, 276-79 universal machine (CNNUM), 272-73 Chaotic CNN (CCNN), 272 Charge-coupled devices (CCDs) measurement of, 36-37 MOSFETs and, 83-84 Charge pumping, 35 Charge transfer efficiency, 36-37 Charge transient, 19 Conductance-voltage (G-V) method, 36 Constant capacitance deep-level transient spectroscopy (CC-DLTS) conclusions, 72 feedback circuit and setup for, 63-65 illustrations, 67-72 noise sources in, 143-44 Constant-capacitance voltage transient (CCVT), 29-30, 62 Constant resistance deep-level transient spectroscopy (CR-DLTS) conclusions, 72 feedback circuit and setup for, 65-67 illustrations, 67-72 noise sources in, 143-44 use of term, 74 Constant resistance deep-level transient spectroscopy (CR-DLTS), JFETs and applications for JFETs, 96-97
Acceptor, 7 Analog-to-digital converters (ADCs), 41,44, 47,48-49 Approximation space, 164-65 Arrhenius plot, 22 Artificial neural networks (ANNs), 266 Averaging process, 30-3 1, 144 continuous time, 52-55 pseudo-logarithmic, 43-52
Barnes-Wall lattice, 228 Berry-Hannay phase, 164 Binary linear code, 238 Blocking effect, 200 Boxcar function, 24-25, 32, 40, 42, 55-56
C Capacitance transient, 15 - 18 Capacitance transient deep-level transient spectroscopy (CT-DLTS), 68, 143 Capacitance-voltage (C-V) methods high-low frequency, 3.5-36 quasi-static, 35-36 Capture rates, 12, 14 Cartan matrices, 224, 260- 1 Cell, 269, 277-78 Cellular automata (CA), 266 Cellular neural networks (CNNs) See also Fuzzy cellular neural networks (FCNNs); MIN/MAX CNN (MMCNN) chaotic (CCNN), 272 delay-type (DCNN), 27 1 discrete-time (DTCNN), 273-75
447
448
INDEX
Constant resistance deep-level transient spectroscopy (CR-DLTS), JFETs and (cont.) comparison of CC-DLTS and CT-DLTS with, 107-18 conclusions, 1 18 germanium JFETs, results in, 105-7 silicon, radiation-induced defects in, 155-56 silicon JFETs, results in, 99- 105 theoretical background, 97 -98 Constant resistance deep-level transient spectroscopy (CR-DLTS), MOSFETs and experimental results, 77-83 theoretical background, 75 -77 Constant resistance deep-level transient spectroscopy (CR-DLTS), MOSFETs (depletion mode) and benefits of, 84 charge-coupled devices and reading, 83-84 conclusions, 95-96 experimental results and discussion, 87-95 theoretical background, 84 -87 Continuous time averaging, 52-55 Covering radius, 227-28 Current DLTS, 74 Current transient, 18- I9 Current transient spectroscopy (CTS), 27-28
D Daubechies wavelets, 170-5 symmetric, 175-76 Debye tail, 13, 16 Deep centers defined, 3, 8 effect of, on device performance, 10-1 1 role of, 9- 10 Deep holes, 227 Deep-level parameters, determining, 21 -24 Deep levels, 9
Deep-level transient spcctroscopy (DLTS) See nlsn Constant capacitance deep-level transient spectroscopy (CC-DLTS); Constant resistance deep-level transient spectroscopy (CR-DLTS) areas for future research, 120- I capacitance transient, 15- 18 charge transient, 19 classification scheme for, 34 conclusions, 119-20 conventional, 24-25 current transient, 18- 19 defects, impurities, and energy levels, 7- I I detection of emission of trapped charge, 15 -2 1 determination of deep-level parameters, 2 1 -24 field effect transistors and, 73-75 generation-recombination statistics, 11-15
magnitude errors, I35 -40 noise sources and signal-to-noise ratio, 141-44 other methods compared with, 35-38 recombination centers, 8-9 role of, 3-4 shallow impurities, 7-8 template for analysis program, 152-55 template for measurement program, 148-5 1 time constant errors, 140- I traps, electron and hole, 8-9. 1 I - 12 traps, interface, 10, 20 voltage transient, 19-21 Deep-level transient spectroscopy (DLTS), averaging and recording of digital analog signal processing methods, 40-41 applications, 55 -60
INDEX
Deep-level transient spectroscopy (DLTS), averaging and recording of digital (cont.) conclusions, 60-61 continuous time averaging, 52-55 digital signal processing methods, 41 -43 pseudo-logarithmic averaging, 43-52 technical overview, 39-43 Deep-level transient spectroscopy (DLTS), main stages in averaging, 30-3 1 detection (emission pulse), 27-30 digital methods, 33 excitation (filling pulse), 25-27 transient analysis, 3 1-34 Delay-type (DCNN), 27 I Differential Pulse Code Modulation (DPCM), 249 Direct memory access (DMA), 44, 48, 50 Discrete-time CNN (DTCNN), 273-75 Discrete-time fuzzy CNN (DTFCNN) advanced learning algorithms of additive, 382-87 applications of, 407-20 embedding f u u y IF-THEN-ELSE rules into, 408- 1 I embedding local fuzzy relation equations, 4 I3 -20 implementing fuzzy inference sharpener, 4 I 1 - I3 implementing fuzzy spatial dilemmas using Type-11, 423-27 lcarning algorithms of additive Type-11, 377-82 lcarning algorithms of Type-IV, 392-401 structure of, 407-8 structure of Type-IV, 387-92 Discrete wavelet transform (DWT), 202-3 Distortion function, 209- 10 minimization of. 212- 13 DLTS spectrum, 23, 40, 57-60
449
D, lattice, 232-33 Donor, 7 Double-correlation or DDLTS, 26, 62 Dual lattices. 2 19, 23 1 -32
E Electron trap, 8-9, 1 I - 12 Emission pulse, 27-30 Emission pulses memory, 63 Emission rates, 12. 14 Entropy coding of lattice vectors, 250-4 Equal slope algorithm, 204 Excitation (filling pulse), 25-27 Exponential correlator, 32, 40. 42
F Face image processing, Type-I FCNN and, 356-60 Factorizations, 166 Feedback and feedback template, 270, 27 I Feedback circuit, electrical circuits of. 146-47 Feedforward and feedforward template. 270, 271 Field effect transistors (FETs), DLTS techniques and, 73-75 Filling pulse, trap, 16, 22, 25-27 Filling pulses memory, 63 Filter, scaling iunction. I67 -68 FIRE edge extractor. 366-67 FIRE operators. 364-65 Flat-band voltage, 86 Flat structuring element, 328 Fourier domain, I65 -66 Fuzzy cellular neural networks ( FCN N s) See also Discrete-time fuzzy CNN (DTFCNN) applications of discrete-time, 407 - 20 applications to image processing. 366-75 cell, 277-78
INDEX
Fuzzy cellular neural networks (FCNNs) (con?.) classification of, 282 decentralization. 277 differences between FNN and, 287-90 different structures of, 282-87 dynamics, 277 future work for, 420-7 gcnetic algorithm applications to image processing, 404-7 genetic algorithms for optimizing, 401-4 local connectedness, 266. 277 principlcs and definitions of general, 279-81 unitied CNN structure, 276-79 Fuzzy cellular neural networks, computational arrays and implementation of morphological operations. 329-33 mathematical morphology, basic, 327-28 MIN/MAX CNN, 333-56 Fuzzy cellular neural networks, learning algorithms and of additive Type-11 discrete-time, 377 - 82 advanced, of additive discrete-time, 382-87 from linguistic inputs, 387-401 of Type-IV discrete-time, 392-401 Fuzzy cellular neural networks, theory of delay-type, 3 I3 -24 elementary theory, 290-9 global stability, 299-3 10 local stability, 3 10-3 stability of discrete-time, 324-27 Fuzzy cellular neural networks, Type-I delay-type, 320-4 description of, 283-84, 285-87 dynamical range of, 295-99 face image processing using, 356-60
global stability, 307- 10 local stability, 3 13 Fuzzy cellular neural networks, Type-Il additive, 330-3 delay-type, 3 13-20 description of, 284-87 dynamical range of, 291 -95 embedding fuzzy inferencc into, 364-66 fuzzy operations implemented by, 363 fuzzy set theory and properties of images, 36 1-63 global stability, 299-306 its interpreter, 363 -64 local stability, 3 10-3 MIN/MAX, 333-56 multiplicative, 329-30 Fuzzy set theory, 276, 361 -63 Fuzzy singleton, 362
G Generalized Gaussian function (GGF). 210-2, 217, 249, 251 Generation-recombination (G-R) centers, 1 1 - IS Genetic algorithm (GA) applications to image processing, 404 - 7 for optimizing FCNN, 401 -4 Glue vectors. 2 19 Gram matrix of lattices, 219 of root, 220 Gray-scale closing, 328 Gray-scale dilation, 328 Gray-scale erosion, 328 Gray-scale opening, 328
H Hall effect, 2 I , 34 Hamming distance, 238 Hamming weight, 238 Hole trap, 8-9, 11 - 12 Human visual perception, 204
lNDEX
I
45 I
Jacobi theta function, 236 Junction field-effect transistors. S ~ P Constant resistance deep-level transient spectroscopy (CR-DLTS), JFETs and
Cartan matrices, 224, 260- 1 codebook for, 2 14 conclusions, 2 3 - 6 0 counting lattice points, 233-42 distortion measure and quantization regions, 214- 16 entropy coding of vectors, 250-4 experimental results, 254-58 optimal quantizer for wavelet coefficients, 217- 18 quantization algorithms for selected lattices, 229-33 scaling algorithm, 242-44 selecting lattices for quantization, 244-SO Learning algorithms. See F u ~ z y cellular neural networks, learning algorithms of Lightly doped drain (LDD) devices, 84 Linear-spline function, 164-66 Local connectedness. 266. 277 Lock-in arnplitier, 32, 40, 42 Low-frequency noise measurements, 37
L
M
Lngrange multiplier technique, 2 12 Laminated lattices, 226-29. 233 Laplace transform, inverse, 33-34 Laplacian PDF, 25 1 Lattice points, counting, 233-42 Lattices construction of root, 224-26 defined. 218-19 description of root, 219-24 D,,, 232-33 dual, 219, 231-32 glue vectors, 219 Gram matrix of. 219 laminated, 226-29. 233 quadratic form, 2 I9 rclntionship between codes and, 238-42 Z,,, 232 Lattice vector quantization, 2 I3 See r d s o Wavelet coefficients, quantization of
Mallat algorithm with complex filters, 178-80 Mathematical morphology, basic, 327-28 Mean squared error (MSE), 216. 244-45 Memory circuit, 63 Metal-oxide-semiconductor FETs. See Constant resistance deep-level transient spectroscopy (CR-DLTS), MOSFETs and MIN/MAX CNN (MMCNN) applications, 3 4 7 4 6 for averaging operator, 335-37, 346 for horizontal derivative, 337-39, 346 for Laplacian operator, 335, 346-47 learning algorithms. 342-46 neighborhood patterns, 333 for orientation derivatives, 339-40 universal functions, 340-2
IF-THEN-ELSE rules, 364-66, 407, 408- I3 Image processing FCNN applications to, 366-75 genetic algorithm applications to, 404 - 7 Impedance spectroscopy, 36 Impurities, shallow, 7-8 Impurity characterization, importance of, 2-3 Interface traps, 10, 20 Ionization energy, 7 Isophase space, 18 I
J
452
INDEX
Multilayer CNN (MCNN), 272 Multiresolution analysis conclusions, 193-96 image enhancement, 183-87 Mallat algorithm with complex filters, 178-80 wavelet shrinkage technique, 183-87 Multiresolution wavelets, 167 -70
N Noise removal, fuzzy inference and impulsive. 367-7.5 Noise sources and signal-to-noise ratio, 141 -44 Nonlinear synaptic laws (NCNN), 27 1-72
0 Optimal bit allocation, 208- 13 Orthonormalization. 166. 168 Daubechies. I7 1
P Packing radius, 227 Phase, role of reconstruction 180-3 in signal processing, 163-64 in spline wavelet bases, 164-66 Phase-sensitive detector (PSD), 66, 72, 74-75 Photoluminescence (PL), 37 Pinch-off voltage, 86, 97-98 Pixels, 267 />-ti junction, 15-18, 72 Principle of detailed balance, I3 Probability distribution function (PDF), 209- 12, 217, 247, 248, 249, 251, 2.54-55 Project onto convex spaces (POCS), 181-83 Pseudo-logarithmic averaging demonstration, 50 - 52 electrical circuit of, 145 error analysis, 45-48
implementation, 48-50 theory, 43-45
Q Quantization. See Lattice vector quantization; Wavelet coefficients, quantization of Quantization algorithms for selected lattices, 229-33 Quasi-equilibrium, 14
R Random telegraph signal (RTS), 94 Rate window concept, 22-23, 99 Recombination centers, 8-9 Reed-Muller code, 228. 233, 238 Refinement equation, 167 Ridge and skeleton algorithm, 164 Riesz basis, 164-65, 166 Root lattices Cartan matrices, 224, 260- I construction of, 224-26 description of, 2 19-24 Gram matrix, 220 root systems, 220-3
S Scaling algorithm, 242-44 Scaling function Daubechies, 170 defined. 165, 166 multiresolution, 167 symmetric, 169 symmetric complex-valued, 174 Shannon lower bound, 215, 216 Shockley-Read-Hall (SRH) theory, 12- 1.5, 39, 76, 86, 98 Signal-to-noise ratio (SNR), 141 -44, 206. 207 Space-charge region (SCR). 13, IS- 18, 20 Spectral analysis DLTS (SADLTS), 41 Spline wavelet bases, 164-66 Strang-Fix condition, 170- 1 Symmetric Daubechies wavelets. 175-76
453
INDEX
Symmetric Daubechies wavelets, (cont.) phase of, and scaling function, 176-78 Symmetric scaling function, 169 Symmetry, 166
T Thermally stimulated capacitance (TSCAP), 36, 39 Thermally stimulated current (TSC), 36, 39 Theta functions, 236-38 Time-delay, 27 I Transient analysis, 3 1-34 Transient spectroscopy, use of term, 23 Traps filling pulse, 16, 22, 25-27 hole and electron, 8-9, 1 1 - 12 interface, 10, 20
U Uniform vector quantizer, 216
V Vanishing moments, 170 Vector quantization (VQ) See also Lattice vector quantization disadvantages of, 200 Voltage transient, 19-21
W Wavelet coefficients, quantization of, 201 -2 distortion function, 209- 10 distortion function, minimization of, 212-13 fundamentals of quantization process, 203-4 high-frequency coefficients, 21 0- I information distribution across coefficient matrix, 204-8 low-frequency coefficients, 211-12 optimal bit allocation, 208- 13 statistical model of. 212- 19 Wavelets Daubechies, 170-5 image enhancement, 183-87 Mallat algorithm with complex filters, 178-80 multiresolution, 167-70 shrinkage technique, 183-87 spline, 164-66 symmetric Daubechies, 175-76 Weight distribution of C, 239 Weight enumerator, 239 Weight function, 32
Z Zn lattice, 232
I S B N O-L2-014751-3