ADVANCES IN CHILD DEVELOPMENT AND BEHAVIOR
Volume 34
Contributors to This Volume Brian P. Ackerman
James A. Dixon
Rebecca S. Bigler
Katharine Graf Estes
Beverly A. Brehl
C. Donald Heth
Eleanor D. Brown
Elizabeth Kelley
Maggie Bruck
Jukka M. Leppa¨nen
Stephen J. Ceci
Lynn S. Liben
Carol L. Cheatham
Charles A. Nelson
John Colombo
Jenny R. Saffran
Edward H. Cornell
Cecilia Wainryb
ADVANCES IN CHILD DEVELOPMENT AND BEHAVIOR
edited by
Robert V. Kail Department of Psychological Sciences Purdue University West Lafayette, IN 47907, USA
Volume 34
AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY TOKYO Academic Press is an imprint of Elsevier
Academic Press is an imprint of Elsevier 84 Theobald’s Road, London WC1X 8RR, UK Radarweg 29, PO Box 211, 1000 AE Amsterdam, The Netherlands The Boulevard, Langford Lane, Kidlington, Oxford OX5 1GB, UK 30 Corporate Drive, Suite 400, Burlington, MA 01803, USA 525 B Street, Suite 1900, San Diego, CA 92101-4495, USA First edition 2006 Copyright ß 2006 Elsevier Inc. All rights reserved No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means electronic, mechanical, photocopying, recording or otherwise without the prior written permission of the publisher Permissions may be sought directly from Elsevier’s Science & Technology Rights Department in Oxford, UK: phone (+44) (0) 1865 843830; fax (+44) (0) 1865 853333; email:
[email protected]. Alternatively you can submit your request online by visiting the Elsevier web site at http://elsevier.com/locate/permissions, and selecting Obtaining permission to use Elsevier material Notice No responsibility is assumed by the publisher for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions or ideas contained in the material herein. Because of rapid advances in the medical sciences, in particular, independent verification of diagnoses and drug dosages should be made ISBN-13: 978-0-12-009734-0 ISBN-10: 0-12-009734-6 ISSN: 0065-2407 (Series)
For information on all Academic Press publications visit our website at books.elsevier.com
Printed and bound in The Netherlands 06 07 08 09 10 10 9 8 7 6 5 4 3 2 1
Contents Contributors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
ix
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
xi
Mapping Sound to Meaning: Connections Between Learning About Sounds and Learning About Words JENNY R. SAFFRAN AND KATHARINE GRAF ESTES I. Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . II. Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . III. Phonetic Specificity in Early Lexical Representations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . IV. Effects of Familiarity with the Sounds of Words on Word Learning . . . . . . . . . . . . . . . . . V. Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1 2 4 21 31 32
A Developmental Intergroup Theory of Social Stereotypes and Prejudice REBECCA S. BIGLER AND LYNN S. LIBEN I. Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . II. Definitions and Forms of Stereotyping and Prejudice . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . III. An Ontogenetic Approach to Stereotyping and Prejudice . . . . . . . . . . . . . . . . . . . . . . . . . . . . . IV. Core Qualities and Goals of Developmental Intergroup Theory . . . . . . . . . . . . . . . . . . . . . . V. Theoretical Foundations of Developmental Intergroup Theory . . . . . . . . . . . . . . . . . . . . . . . VI. Core Components of Developmental Intergroup Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . VII. Principles of the Formation and Maintenance of Social Stereotypes and Prejudice. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . VIII. Summary and Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
40 42 43 45 49 52 61 79 81
Income Poverty, Poverty Co-Factors, and the Adjustment of Children in Elementary School BRIAN P. ACKERMAN AND ELEANOR D. BROWN I. Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . II. Framing Poverty Research. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . III. Poverty Co-Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . IV. Dynamic Aspects of the Ecology of Disadvantage. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . V. Person-Centered Approaches . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . VI. Summary and Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
91 93 100 112 119 122 124
vi
Contents
I Thought She Knew That Would Hurt My Feelings: Developing Psychological Knowledge and Moral Thinking CECILIA WAINRYB AND BEVERLY A. BREHL I. Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . II. Moral Judgments about the World as Understood . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . III. Children’s Developing Understandings of Persons: a Thumbnail Sketch. . . . . . . . . . . . . . IV. Children’s Moral Judgments about the Behaviors of Persons as Understood . . . . . . . . . V. Conclusions and Future Challenges . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
131 133 138 144 160 164
Home Range and The Development of Children’s Way Finding EDWARD H. CORNELL AND C. DONALD HETH I. Definition of the Topics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . II. Distance and Dispersion of Travel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . III. The Ontogeny of Way Finding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . IV. Landmark and Place Recognition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . V. Memories of Routes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . VI. Bearing Knowledge in Way Finding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . VII. Strategy Development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . VIII. General Discussion. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
174 175 178 182 186 190 193 199 200
The Development and Neural Bases of Facial Emotion Recognition JUKKA M. LEPPA¨NEN AND CHARLES A. NELSON I. Behavioral Studies of Facial Expression Recognition. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . II. Neural Basis of Facial Expression Recognition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . III. Developmental Mechanism . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . IV. Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
209 221 233 236 237
Children’s Suggestibility: Characteristics and Mechanisms STEPHEN J. CECI AND MAGGIE BRUCK I. Definitional Issues . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . II. Interviewer Bias: the Central Characteristic of Suggestive Interviews . . . . . . . . . . . . . . . . . III. Mechanisms Underlying Children’s Suggestibility. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . IV. Summary: Child vs. Situational Variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
249 250 264 275 275
Contents
vii
The Emergence and Basis of Endogenous Attention in Infancy and Early Childhood JOHN I. II. III. IV. V. VI.
COLOMBO AND CAROL L. CHEATHAM Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Four Attentional Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A Model for Endogenous Attention and Some Historical Perspectives . . . . . . . . . . . . . . . Behavioral Development of Endogenous Attention . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Neural Bases of Endogenous Attention . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Emergence of Endogenous Attention: Summary and Implications . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
284 285 290 291 301 306 310
The Probabilistic Epigenesis of Knowledge JAMES A. DIXON AND ELIZABETH KELLEY I. Knowledge Acquisition: Foundational Issues . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . II. Probabilistic Epigenesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . III. Epigenesis of Knowledge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . IV. Epigenesis of Knowledge and Symbol Grounding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . V. Epigenesis and Detecting Structure in the Environment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . VI. Epigenetic Approaches to Language Acquisition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . VII. Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
324 326 329 340 347 352 353 359
Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Subject Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Contents of Previous Volumes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
365 381 387
This page intentionally left blank
Contributors
BRIAN P. ACKERMAN Department of Psychology, University of Delaware, Newark, Delaware 19716, USA REBECCA S. BIGLER Department of Psychology, University of Texas at Austin, Austin, Texas 78712, USA BEVERLY A. BREHL Department of Psychology, University of Utah, Salt Lake City, Utah 84112, USA ELEANOR D. BROWN Department of Psychology, University of Delaware, Newark, Delaware 19716, USA MAGGIE BRUCK Department of Psychiatry and Behavioral Science, Division of Child and Adolescent Psychiatry, Johns Hopkins Medical Institutions, Baltimore, Maryland 21287, USA STEPHEN J. CECI Department of Human Development, Cornell University, Ithaca, New York 14853, USA CAROL L. CHEATHAM The Schiefelbusch Institute for Life Span Studies, University of Kansas Lawrence, KS 66045 USA and Department of Dietetics and Nutrition, University of Kansas Medical Center, Kansas City, Kansas 66103, USA JOHN COLOMBO Department of Psychology and The Schiefelbusch Institute for Life Span Studies, University of Kansas Lawrence, Kansas 66045, USA EDWARD H. CORNELL Department of Psychology, University of Alberta, Edmonton, Alberta T6G 2E9, Canada JAMES A. DIXON Department of Psychology, University of Connecticut, Storrs, Connecticut 06269-1020, USA KATHARINE GRAF ESTES Department of Psychology & Waisman Center, University of Wisconsin – Madison, Madison, Wisconsin 53706, USA
x
Contributors
C. DONALD HETH Department of Psychology, University of Alberta, Edmonton, Alberta, T6G 2E9, Canada ELIZABETH KELLEY Department of Psychology, University of Connecticut, Storrs, Connecticut 06269-1020, USA JUKKA M. LEPPA¨NEN Human Information Processing Laboratory, Department of Psychology, University of Tampere, Finland LYNN S. LIBEN Department of Psychology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA CHARLES A. NELSON Harvard Medical School, Boston Children’s Hospital, Boston, Massachusetts 02115, USA JENNY R. SAFFRAN Department of Psychology & Waisman Center, University of Wisconsin – Madison, Madison, Wisconsin 53706, USA CECILIA WAINRYB Department of Psychology, University of Utah, Salt Lake City, Utah 84112, USA
Preface Advances in Child Development and Behavior is designed to provide scholarly technical articles and to provide a place for publication of scholarly speculation. In these critical reviews, recent advances in the field are summarized and integrated, complexities are exposed, and fresh viewpoints are offered. These reviews should be useful not only to the expert in the area but also to the general reader. No attempt is made to organize each volume around a particular theme or topic. Manuscripts are solicited from investigators conducting programmatic work on problems of current and significant interest. The editor often encourages the preparation of critical syntheses dealing intensively with topics of relatively narrow scope but of considerable potential interest to the scientific community. Contributors are encouraged to criticize, integrate, and stimulate, but always within a framework of high scholarship. Although appearance in the volumes is ordinarily by invitation, unsolicited manuscripts will be accepted for review. All papers—whether invited or submitted—receive careful editorial scrutiny. Invited papers are automatically accepted for publication in principle, but usually require revision before final acceptance. Submitted papers receive the same treatment except that they are not automatically accepted for publication even in principle, and may be rejected. I acknowledge with gratitude the aid of my home institution, Purdue University, which generously provided time and facilities for the preparation of this volume. Robert V. Kail
This page intentionally left blank
MAPPING SOUND TO MEANING: CONNECTIONS BETWEEN LEARNING ABOUT SOUNDS AND LEARNING ABOUT WORDS
Jenny R. Saffran and Katharine Graf Estes DEPARTMENT OF PSYCHOLOGY AND WAISMAN CENTER UNIVERSITY OF WISCONSIN-MADISON, MADISON, WISCONSIN 53706, USA
I. INTRODUCTION II. OVERVIEW III. PHONETIC SPECIFICITY IN EARLY LEXICAL REPRESENTATIONS
A. A RE-EXAMINATION OF PHONETIC DETAIL IN WORD LEARNING B. PHONETIC DETAIL IN WORD RECOGNITION IV. EFFECTS OF FAMILIARITY WITH THE SOUNDS OF WORDS ON WORD LEARNING
A. PHONOTACTIC PROBABILITY AND NEIGHBORHOOD DENSITY IN EARLY WORD LEARNING B. SEGMENTING WORDS AND MAPPING SOUND TO MEANING V. CONCLUSIONS ACKNOWLEDGMENTS REFERENCES
I. Introduction Human language is notorious for its multilayered structure. To acquire linguistic information at one level, such as how words pattern together to form grammatical structure, learners must already know something about the words themselves, such as their membership in grammatical categories (e.g., nouns vs. verbs), or other characteristics of their meanings and/or syntactic features. Conversely, knowing something about syntactic structure can help learners make informed guesses about what novel words might mean (e.g., Gilette et al., 1999). Knowledge about one level of language (e.g., syntax) helps to constrain the hypotheses put forth at other levels of language (e.g., word meanings), and vice versa. 1 Advances in Child Development and Behavior R Kail (Editor)
ß 2006 Elsevier B.V. All rights reserved.
2
Jenny R. Saffran and Katharine Graf Estes
Mutually constraining multiple levels of information act in an attempt to link sound to meaning in word learning as well. To figure out what words mean, learners must already have some idea of which sound sequences in their language correspond to words. However, relatively little research has focused on the process by which sound and meaning come together in word learning. A massive literature on how young children learn the meanings of words has focused primarily on the nature of the child’s hypotheses about possible meanings, and on the objects, actions, or concepts to be mapped to sounds (e.g., Bloom, 2000; Markman, 1990; Woodward & Markman, 1998); far less is known about how knowledge of sound structure contributes to word learning. There is also a burgeoning literature on how infants acquire the sound structure of their language, including the acquisition of individual native language sounds (phonemes), how those sounds regularly combine (phonology, phonotactics), and which sound sequences are segmented from the speech stream as words (e.g., Jusczyk, 1997). Infants’ prior learning about phonetic categories, sound sequence regularities, and word segmentation cues all likely contribute to the process of linking word forms with their meanings. Interestingly, however, the links between the early learning processes underlying the acquisition of native language sound patterns (which unfolds during the first year of life) and the mapping of those sounds to meaning (which emerges primarily during the second year of life), has only gained attention in the field since the beginning of the twenty-first century. Our goal in this chapter is to explore the potential links between the sounds of words and the word learning process. Initial learning about the sounds of a language may provide infants with a foundation for the subsequent association of sound sequences with meanings. To set the stage, we first briefly overview some relevant results in the field of infant speech processing (for thorough recent reviews, see Kuhl, 2004; Saffran, Werker, & Werner, 2006; Werker & Curtin, 2005), and preview how representations of speech might relate to lexical development. We then highlight empirical and theoretical developments in two areas where learning about the sounds of language may be relevant especially for word learning: The phonetic specificity of early lexical representations, and how familiarity with the sounds of words affects word learning, specifically through the influence of phonotactic probability and neighborhood density, and the process of word segmentation. By doing so, we hope to provide suggestions for future research that will explicitly link together two very fruitful, but all too often separate, domains of research: Infant speech perception and early word learning.
II. Overview Since the 1980s, a large body of work has traced the development of infants’ capacity to perceive phonetic contrasts (e.g., /p/ vs. /b/). Infants rapidly become
Mapping Sound to Meaning
3
tuned to those phonetic contrasts that are used in their native language. Language-specific tuning in vowel perception is already evident by 6 months (e.g., Kuhl et al., 1992). Consonant categories develop somewhat later, likely due to the relative prominence and clarity of vowels in infant-directed speech (e.g., Kuhl et al., 1997). At 6 – 8 months, infants can perceive both native language and non-native consonant contrasts that adults cannot discriminate. By 10 – 12 months, infants’ attention becomes focused on the acoustic dimensions that are important for their native language (e.g., Werker & Tees, 1984). Do infants use the fine phonetic distinctions acquired during the first year as the basis for subsequent word learning and word recognition? It seems likely that infants notice phonetic distinctions in new words, and then associate different meanings with different word forms. Data from a number of word learning experiments suggest that infants in fact do not have access to full phonetic detail in their lexical representations (e.g., Stager & Werker, 1997). However, the results of word-recognition experiments (as opposed to word learning experiments) appear to contradict this conclusion (Swingley & Aslin, 2002). Although attempts to clarify the phonetic specificity of early words have received substantial research attention, this debate has not been resolved (Ballem & Plunkett, 2005; Fennell & Werker, 2003a; Swingley, 2003; Swingley & Aslin, 2005; Werker & Curtin, 2005). In addition to advances in language-appropriate phoneme discrimination, infants also learn about how sounds are combined in their native language during their first year. By 9 months, but not at 6 months, infants can discriminate phoneme combinations in words that are legal in the native language from illegal combinations (Jusczyk et al., 1993), as well as common legal combinations vs. rare but legal combinations (Jusczyk, Luce, & Charles-Luce, 1994). Knowledge of probable vs. improbable phoneme combinations provides infants with a cue for segmenting words from fluent speech (Mattys & Jusczyk, 2001; Mattys et al., 1999). Learning about regularities in phoneme combinations may also contribute to word learning by influencing how readily new words are added to the lexicon. Words consisting of frequent sound sequences may be easier to associate with meanings because they contain phoneme combinations with which the infant is already familiar, allowing the infant to focus more attention on identifying the referent of the new word (its meaning). Alternatively, new words that are highly similar to familiar words may make associating meaning difficult because of the confusability of the word forms. This area of inquiry has only recently become the object of study in infant word learning experiments (Hollich, Jusczyk, & Luce, 2002; Swingley & Aslin, 2005). To associate a meaning with a word, infants must be able to isolate the word form from fluent speech. Because fluent speech does not contain reliable acoustic markers of word boundaries, infants must learn about the regular
4
Jenny R. Saffran and Katharine Graf Estes
sound patterns of their language and use this knowledge to find words in the speech stream. Between 7 and 11 months, infants develop the ability to take advantage of patterns of syllable-pair probabilities (e.g., Aslin, Saffran, & Newport, 1998; Saffran, Aslin, & Newport, 1996), as well as lexical stress (e.g., Johnson & Jusczyk, 2001; Jusczyk, Houston, & Newsome, 1999; Thiessen & Saffran, 2003), phonotactics (Mattys & Jusczyk, 2001), and allophonic cues (Jusczyk, Hohne, & Bauman, 1999) to isolate individual words. Given that infants have access to a variety of segmentation cues to find words in fluent speech during the second half of the first year, they likely segment many word forms before they begin to map meanings to those forms. However, the nature of the relation between these processes has received little attention. One possibility is that infants parse continuous speech into isolated sound sequences, which then serve as candidate words, ready to be associated with meanings. That is, perhaps the segmentation process feeds directly into the process of mapping sound to meaning. Alternatively, infants may require additional experience with isolated sequences or need to hear them in new contexts before they can be associated with meanings. Although only a few studies have investigated how infants map meaning to recently segmented words (Hollich, in press; Swingley, 2002), this area holds promise for understanding the mechanisms underlying the early stages of vocabulary acquisition. In the remainder of this chapter, we explore possible connections between infants’ knowledge about the sound structure of their language and how infants map these sounds to meaning. In the first section, we describe the debate about the phonetic specificity of early lexical representations, focusing on whether infants apply their fine-grained perceptual discrimination skills to word learning. In the second section, we address ways that familiarity with the sounds of words may affect infants’ ability to link word forms to meaning, focusing on the acquisition of phoneme pattern regularities, and how this knowledge might affect the addition of new words to the infant’s lexicon. Finally, within the second section we also ask how the process of word segmentation contributes to infants’ association of word forms with meanings.
III. Phonetic Specificity in Early Lexical Representations Studies of infant speech perception have amply demonstrated that even young infants possess prodigious speech discrimination skills. However, there has been considerable debate concerning the degree to which these sophisticated abilities are used in early word learning and word recognition. In a seminal study, Jusczyk and Aslin (1995) investigated the nature of infants’ emerging lexical representations by testing whether 7½-case-month-old infants would notice when pronunciations of familiarized words were altered slightly in the context
Mapping Sound to Meaning
5
of a word segmentation task. They proposed that if infants’ memory for the familiarized items was vague and lacking in detail, the infants should fail to notice a slight change in pronunciation. However, Jusczyk and Aslin found that infants listened longer to the altered pronunciations than to the familiarized words, indicating that they recalled sufficient detail in the original items to avoid confusing the similar sounding pronunciations. In contrast, Halle´ and de Boysson-Bardies (1996) found that 11½-montholds listened longer to lists of frequent words (in French, e.g., ‘‘bonjour’’) than to infrequent words, and that this preference carried over to phonetically similar mispronunciations of the familiar words (e.g., ‘‘ponjour’’). Halle´ and de Boysson-Bardies suggested that the difference between their finding and Jusczyk and Aslin’s (1995) results was that although both tasks could be performed successfully without knowledge of word meaning, 11-month-olds have started to associate meanings with words (unlike the 7-month-olds tested by Jusczyk and Aslin), and this changes how they process language. Halle´ and de Boysson-Bardies proposed that when listening for meaning, words are represented holistically rather than with fine-grained phonetic detail. On this view, the younger infants in Jusczyk and Aslin’s (1995) study maintained more phonetic detail in their emerging word forms because they are not yet mapping those forms to meanings (but see Swingley (2005a) and Vihman et al. (2004) for alternative findings and interpretations). Halle´ and de Boysson-Bardies’s findings support a long-standing claim from developmental phonology: The lexical representations of infants and young children are holistic, lacking in the detailed representations of phonetic segments that differentiate and organize words in the fully developed phonological system. Many authors have argued that young children’s lexical representations center on whole words (Ferguson, 1986; Menyuk, Menn, & Silber, 1986; Pisoni, Lively, & Logan, 1993; Walley, 1993; Waterson, 1971), or other broad units such as syllables (Jusczyk, 1993). This differs from mature lexical representations, which contain fine-grained segmental structure that enables word recognition to occur given only partial acoustic-phonetic information; lexical access can proceed prior to the end of the word (Allopenna, Magnuson, & Tanenhaus, 1998; Marslen-Wilson, 1987). The organization of the adult lexicon is affected by representations of phonetic segments, such that similar sounding words activate one another, and as a word unfolds, alternative lexical candidates are excluded (e.g., Marlsen-Wilson, 1987). Adults may even represent subphonemic information about the sounds of words (e.g., McMurray et al., 2003; Norris, McQueen, & Cutler, 2003). In contrast, children’s words are said to be represented as ‘‘unanalyzed wholes’’ (Walley, 1993, p. 293); little phonetic detail is required to differentiate them, and children need to hear the entire (or nearly the entire) word before identification. In her review of the role of vocabulary changes in phonological
6
Jenny R. Saffran and Katharine Graf Estes
development, Walley (1993) proposed that limitations on attentional and memory capacity are at the root of the underspecified lexical representations of infants and young children. Storing words as holistic units is proposed to be adaptive: Holistic representations would lend efficiency to word learning, and would be sufficient for lexical processing given a small vocabulary because there are few confusable entries. Evidence for holistic lexical representations comes from a variety of experimental and observational methods, and from children at many different stages of vocabulary development. In particular, studies of children’s attempts to learn similar sounding words rendered evidence for holistic representations at quite young ages (1 – 3 years of age). In an early demonstration, Schvachkin (1948/ 1973) presented 10- to 18-month-olds with phonetically similar labels for objects (e.g., ‘‘dak’’ and ‘‘gak’’) using a variety of phonetic contrasts. Schvachkin found that children could not consistently select the correct objects for similar sounding labels, although with age (the children were tested longitudinally for approximately 6 months), they seemed to gradually gain access to more of the consonant contrasts tested. Other researchers (Brown & Matthews, 1997; Edwards, 1974; Eilers & Oller, 1976; Garnica, 1973) have attempted similar minimal-pair learning experiments. These studies have indicated that 1½- to 3-year-olds have difficulty discriminating many phonetic contrasts in lexical tasks. However, Barton (1976, reviewed in Barton, 1980) found that when children were already familiar with the words tested, they were often able to discriminate a wide range of contrasts. Thus, the findings from young children’s word learning studies indicate that processing of phonetically similar words is vulnerable and highly variable depending on factors such as age, the familiarity of words tested, and the contrasts tested. Results from studies with older children also provide evidence for holistic lexical representations. Although production errors have been interpreted as support for immature phonological representations (e.g., Ferguson, 1986), many studies have used methods that reduce production demands to focus on underlying lexical representations. Gating tasks, in which participants are asked to identify familiar words with limited acoustic information (e.g., 100 ms of the word, followed by 150 ms, then 200 ms), indicate that children require more information to recognize words than adults (Elliott, Hammer, & Evan, 1987; Walley, 1988). In mispronunciation tasks, 4- and 5-year-olds are much less accurate than adults at identifying changes to familiar words presented within sentences (Cole, 1981, see also Walley & Metsala, 1990). Cole and Perfetti (1980) also argued that because children, unlike adults, are not better at identifying second syllable errors than first syllable errors, children must wait longer to identify a word than adults; younger listeners need to hear more of a word prior to identifying it. Word familiarity also affects how readily children recognize words. Walley and Metsala (1990) found that 5- and 8-year-olds were more
Mapping Sound to Meaning
7
likely to detect mispronunciations of words they had not learned recently (see also Garlock, Walley, & Metsala, 2001). Taken together, this evidence suggests that children’s lexical representations lack the detail of adult representations and the representations improve with experience with words. Charles-Luce and Luce (1990, 1995) took a different approach to investigate the notion of holistic representations. Using corpus analyses, they asked whether the words in the child’s existing lexicon are phonetically dissimilar enough to support holistic processing, or whether instead processing that lacks phonetic detail would lead to a great deal of word confusions. To do so, they compared the similarity neighborhoods of words in 5- and 7-year-olds’ and adult lexicons. Similarity neighborhood, or phonological neighborhood, refers to the number of words that differ from a given word by one phoneme deletion, substitution, or addition. Children’s productive vocabularies contained fewer neighbors than the adult lexicon. Similar results emerged from an analysis of words in infantdirected speech, which they used as a surrogate measure of infants’ receptive vocabulary. Charles-Luce and Luce concluded that children could maintain holistic representations without sacrificing word discrimination because the child lexicon contains sufficiently few confusable words (for opposing views, see Coady & Aslin, 2003; Dollaghan, 1994). They proposed that holistic representations may even be adaptive, as attention is not wasted on superfluous detail. The claim that children do not use phonetic detail in word learning and recognition seems to conflict with myriad findings demonstrating that infants represent speech in exquisite detail (reviewed in Kuhl, 2004; Saffran, Werker, & Werner, in press; Werker & Curtin, 2005). A reasonable supposition from the research exploring infants’ speech perception skills is that by the end of the first year of life, infants are well-equipped to apply their sophisticated discrimination abilities to word learning and recognition. Yet several researchers have argued that the phonetic representations and analytic processes used in infant speech perception tasks are not the same as the phonological representations used in word learning, recognition, and production (Jusczyk, 1992; Pisoni, Lively, & Logan, 1994; Studdert-Kennedy, 1986; Walley, 1993) and that phonetic perception is not adult-like until middle childhood, around age 6 or 7 (e.g., Nittrouer & Studdert-Kennedy, 1987). One explanation offered for this potential discrepancy is that infants process speech stimuli as meaningless sounds. The phonetic tuning that occurs in the first year relates to later learning only in that it sets the boundaries for the sounds that may be relevant in the native language. Later developing, mature speech processing emerges as the child begins to interpret discriminable elements as members of different categories (Ingram, 1989; also see the review in Walley, 1993). The representations used for word learning and recognition must be built. The hypothesized driving force behind the emerging specification of lexical representations is vocabulary development itself (Charles-Luce & Luce, 1990,
8
Jenny R. Saffran and Katharine Graf Estes
1995; Ingram, 1989; Jusczyk, 1992, 1993; Walley, 1993). For example, Walley (1993) links the start of the development of detailed representations to the vocabulary spurt at around 18 months, with continued development into middle childhood. According to a holistic representation account, as the lexicon expands, words become increasingly confusable. As more overlapping words are added to the lexicon, there is an increased pressure to precisely differentiate words; holistic representations can no longer adequately represent every word as an entity distinct from the rest of the child’s vocabulary (Brown & Matthews, 1997; Charles-Luce & Luce, 1990, 1995; Jusczyk, 1993; Metsala, 1997; Walley, 1993). The restructuring of lexical representations is proposed to proceed in a gradual matter, at a different pace for different sounds depending on the number of similar sounding words in the lexicon. Eventually, children begin to represent words as consisting of segmental units, allowing for incremental and unique identification of lexical items. Learning to read promotes further specification, enabling children to gain conscious access to phonemes (Walley, 1993).
A. A RE-EXAMINATION OF PHONETIC DETAIL IN WORD LEARNING
Janet Werker and her colleagues (Fennell & Werker, 2003a; Stager & Werker, 1997; Werker et al., 2002) approached the study of early lexical representations with a new method, one with minimal task demands designed to be more sensitive than the explicit judgment and object selection tasks used in previous research with young children. As has been shown in the area of infant cognition (e.g., Keen, 2003; Munakata et al., 1997), tasks differing in the demands they place on children (e.g., looking vs. reaching) may access different levels of knowledge representations. Tasks that require less overt or less complex responses may tap perceptions or knowledge that are not apparent in more challenging tasks which require stronger representations and greater coordination of knowledge representations and the means for expressing that knowledge. Thus, a task that requires children to watch objects on a monitor should allow them to express underlying knowledge more readily than a task that requires physically selecting objects from an array (e.g., Schvachkin, 1948/1973) or making meta-linguistic judgments (e.g., Cole, 1981). As Werker and Fennell (2004) explained, Werker’s research group originally expected that this simple habituation-based word learning task would tap infants’ previously acquired knowledge about phonetic categories. However, this was not the case. Although the results of their experiments at first appear to support the notion that early representations are holistic, Werker and colleagues have come to different conclusions about the nature of early lexical representations. In the word–object association task of Werker et al. (1998), the infant is first habituated to two novel word–object combinations played over a video
Mapping Sound to Meaning
9
monitor and speakers, one at a time. Following habituation (i.e., a decrease in looking time), the infant is tested with two types of test trials: ‘‘same’’ trials, in which the original word–object pairs are presented, and ‘‘switch’’ trials, in which the words and objects are presented in novel pairings (i.e., Object 1 appears with repetitions of Word 2). If infants learn the original word–object associations, the pairings in the switch trials should violate this newly learned expectation, leading to longer looks on switch trials than same trials. In fact, Werker et al. (1998) found that 14-month-olds, but not younger infants (8- to 12-month-olds), could learn phonetically dissimilar word–object pairings (‘‘lif ’’ and ‘‘neem’’). Fourteen months is similar to the age at which other researchers have been successful at teaching infants novel words with limited laboratorybased exposure (e.g., Ballem & Plunkett, 2005; Schafer & Plunkett, 1998; Woodward, Markman, & Fitzsimmons, 1994). The word–object association task provides a useful method for measuring infants’ ability to map words to their meanings while requiring minimal task demands. Clearly, the word–object association task does not incorporate the rich understanding of meaning commonly measured in older children and adults. Object identity serves as a highly simplified representation of meaning, yet this task does retain an essential quality of the process of word learning—the formation of an arbitrary association between a meaning representation and a sound representation. Stager and Werker (1997) used the word–object association task to investigate whether 14-month-olds could apply their sophisticated phonetic discrimination skills to associate sound and meaning. Infants attempted to form word–object associations for the novel words ‘‘bih’’ and ‘‘dih’’; these two syllables are a minimal pair, differing only in a single phonetic feature (here, the place of articulation of /b/ vs. /d/). The 14-month-olds looked equally long to same and switch trials, indicating that they failed to learn the labels. However, 14-montholds could perform the perceptual discrimination of ‘‘bih’’ from ‘‘dih’’ in an object-free task; their failure in the word–object association task is not because they could not discriminate the sounds. Instead, the infants’ difficulties appear to lie in mapping the labels onto the objects or the quality of the representations linked in the mapping. Infants’ difficulty in learning similar sounding word– object associations has been replicated with more phonotactically probable novel words (‘‘bin’’ and ‘‘din’’), as well as additional feature contrasts (i.e., ‘‘bin’’ vs. ‘‘pin’’, ‘‘din’’ vs. ‘‘pin’’) (Pater, Stager, & Werker, 2004). Stager and Werker (1997) attempted to simplify the task by habituating infants to a single word–object pair (‘‘bih’’ with Object 1) and testing whether infants would dishabituate when hearing the alternative word with the original object (‘‘dih’’ with Object 1). Fourteen-month-olds again showed no difference in looking time to the original vs. switched word–object pairs (see also Pater, Stager, & Werker, 2004). However, 8-month-olds dishabituated to the switch.
10
Jenny R. Saffran and Katharine Graf Estes
Stager and Werker suggested that for 8-month-olds, the task is one of simple sound discrimination that does not involve attempting to link the word with the object. In contrast, 14-month-old infants process words as potential sources of meaning whenever possible referents are available. The authors proposed that novice word learners, such as 14-month-olds, cannot attend to fine phonetic detail in new words because their computational resources are consumed by the attempt to map the sound sequence to a meaning. Werker and colleagues (2002) tested the prediction that older infants, who are more proficient word learners, should be able to attend to phonetic detail when forming new sound–meaning associations. They compared word–object association performance of 14-, 17-, and 20-month-old infants. At around 17 to 18 months, infants typically begin to add new words to their receptive and productive lexicons at a rapid rate (Bates, Dale, & Thal, 1995). Therefore, 17- and 20-month-olds should provide a good comparison for 14-month-olds, who know substantially fewer words. Although the 14-month-olds continued to show no difference in looking time on same and switch trials, the 17- and 20-month-olds showed significantly longer looking time during switch test trials, indicating that the older infants were able to learn the minimal pair labels. This pattern of results was subsequently bolstered by an ERP study of the detail in children’s lexical representations (Mills et al., 2004). Infants listened to lists of novel words (e.g., ‘‘keed ’’, ‘‘zav’’), known words (e.g., dog, book), and mispronunciations of familiar words that altered only the first consonant (e.g., ‘‘bog’’, ‘‘dook’’). The patterns of brain activity revealed that 14-month-olds treated the known words and mispronunciations similarly, but reacted differently to the novel words. However, 20-month-olds treated the mispronunciations like novel words. The authors concluded that vocabulary development is important for infants’ ease of access to phonetic detail. Based on the changes in learning from 14 to 20 months, Werker et al. (2002) proposed a period of development during which infants lack the capacity to attend to phonetic detail in word learning. After infants become capable of learning arbitrary word–object associations, but before they become proficient word learners, infants’ computational resources may be expended by the processing demands of attending to the link between a sound form and its referent. There are not sufficient resources remaining to attend to phonetic detail. Werker and colleagues (Fennell & Werker, 2003a; Werker & Fennell, 2004; Werker et al., 2002) proposed that the apparent discontinuity in young infants’ discrimination skills and later learning of phonetically similar words is due to this temporary lack of resources in unskilled word learners, not a qualitative change in phonetic representations. This explanation contrasts with the holistic representation account, which assumes discontinuity in the precise acoustic analysis performed in discrimination tasks vs. the global representations used in lexical tasks (e.g., Jusczyk, 1992; Walley, 1993).
Mapping Sound to Meaning
11
The resource limitation account is based on the assumption that when confronted with any difficult task, ‘‘something has to give’’ (Fennell & Werker, 2003a, p. 249). In word learning, phonetic detail is sacrificed in favor of attending to the sound–meaning association, which is more essential to the immediate task of word learning. Werker et al. (2002) investigated the relation between vocabulary size and the ability to learn phonetically similar words. They found a significant positive correlation between word–object association task performance and vocabulary size (productive and receptive) for 14-month-olds. Infants with larger switch vs. same trial looking time differences tended to have larger vocabularies. At 17 months, the correlation between receptive vocabulary and magnitude of looking time difference showed a trend towards significance; at 20 months, the correlations were no longer significant. In exploring the changing pattern of correlations, Werker et al. (2002) found threshold vocabulary sizes for successful learning of the similar sounding labels: Children with 25 or more words in their productive vocabulary or 200 or more words in their receptive vocabulary were more likely to exhibit learning in the laboratory task. Both the holistic representation and resource limitation accounts predict vocabulary size effects on learning of phonetically similar labels. However, the role ascribed to vocabulary development differs. According to a holistic representation account (e.g., Charles-Luce & Luce, 1990, 1995; Walley, 1993), the vocabulary size threshold may indicate the point at which infants’ lexicons become sufficiently crowded that they must reorganize and elaborate previously underspecified representations. However, Werker et al. (2002) proposed that without direct evidence of a causal effect of vocabulary development, ‘‘it is more prudent to assume the threshold is not absolute, but is merely an index of relative word learning ability’’ (p. 22). According to the resource limitation account, children who are better word learners, as indexed by larger vocabulary size, find learning links between sound and meaning less taxing. Therefore, they have more resources available to process phonetic detail than less skilled word learners. Werker and colleagues have sought evidence in support of the resource limitation account that is inconsistent with the notion of holistic lexical representations. This has proven difficult because the two proposals can often explain the same data (i.e., younger infants have more difficulty learning similar sounding words, the relation between word learning and vocabulary size). The resource limitation account predicts that if the word learning task is simplified, younger infants should be able to express their knowledge of phonetic contrasts. The holistic representation account holds that infants do not have access to phonetic details that have not yet emerged from vocabulary growth. In previous experiments, Werker and colleagues (Pater, Stager, & Werker, 2004; Werker et al., 2002) attempted to reduce the demands of the task (longer exposure, more
12
Jenny R. Saffran and Katharine Graf Estes
physically dissimilar objects, labels differing in more phonetic features), but were unable to improve the performance of 14-month-olds. However, Fennell (2004; personal communication) attempted to reduce computational demands by providing infants with prior experience with the novel objects used in the word– object association task. A group of infants received a novel (unnamed) toy to play with at home for six to eight weeks. Following the at-home exposure, infants were brought to the lab and habituated to the object associated with a label (e.g., ‘‘din’’). The infants were then tested to examine whether they would dishabituate when a phonetically similar novel label was presented with the object (e.g., ‘‘gin’’). A control group received no at-home experience with the toy; these infants were first exposed to the toys as pictures in the word–object association task. Only children in the toy exposure group dishabituated, indicating that they noticed the change in the label. The holistic representation hypothesis cannot readily explain the finding that extra experience with an object can facilitate infants’ access to phonetic detail. In summary, Werker and colleagues’ resource limitation account contends that the representations used for phonetic perception form the basis for word learning, but that infants’ ability to use phonetic detail is limited by their computational capacity. This explanation differs from proposals of authors such as Walley (1993) and Charles-Luce and Luce (1990, 1995), who argued that children begin with holistic lexical representations and build specified representations of words as a function of vocabulary development. What remains to be explained by the resource limitation hypothesis is a precise understanding of the capacity that is lacking in younger infants, and what changes between 14 and 17 months to enable infants to attend to the detail in new words. As we discuss later, Werker and Curtin (2005) have provided a new model for understanding the phonetic specificity in infant word learning that attempts to integrate infant word learning with studies of detail in infant word recognition. B. PHONETIC DETAIL IN WORD RECOGNITION
Evidence from studies of infant word recognition indicates that infants’ apparent lack of attention to phonetic detail in word learning tasks may not apply to online recognition of known words (Fernald, Swingley, & Pinto, 2001; Fernald et al., 1998; Swingley, 2003; Swingley & Aslin, 2000, 2002). In particular, studies probing children’s representations of newly taught words may underestimate children’s representational capacities. To test this hypothesis, Swingley and Aslin (2000) used a visual fixation paradigm designed to examine the time-course of young children’s word recognition. This procedure is based on the natural tendency to look towards an object when hearing its name. Children’s eye movements are monitored as they view two familiar objects (e.g., ball and shoe) and then hear a sentence containing a spoken target word
Mapping Sound to Meaning
13
(e.g., ‘‘Where’s the ball?’’). This task provides two gauges of word recognition: accuracy, or the duration of looking to the target relative to the distracter object, and latency, or how quickly the child switches gaze to the target object when initially focused on the distracter. Studies using this paradigm have revealed rapid changes in young children’s ability to recognize words: At 15 months, infants orient to a target picture just after hearing its entire label, and by 24 months, young children look to the appropriate picture before the completion of its label (Fernald et al., 1998). At 18 months, infants recognize words based on partial information (the first 300 ms of a word) as reliably as when hearing the entire word (Fernald, Swingley, & Pinto, 2001). Thus, before the end of the second year, children are able to process word forms incrementally, as is characteristic of adult word recognition. This finding contradicts the claim that children begin to recognize words from partial information only after developing a substantial vocabulary (e.g., Walley, 1993). Swingley and Aslin (2000) have further investigated the specificity of young children’s lexical representations by adding a mispronunciation detection component to the visual fixation task. The basis of the mispronunciation task is that if children’s lexical representations are holistic, their processing should not be disrupted by small changes in word forms. In contrast, if children’s representations contain fine-grained detail, it should be harder to recognize the mispronounced words. Swingley and Aslin tested 18- to 23-month-olds’ ability to recognize known words (based on parental report) when presented with correct pronunciations of the target words and mispronunciations. Children viewed two pictures on a large computer screen (e.g., baby and dog). Then, they heard a sentence containing either a correct pronunciation of the target word (‘‘Where’s the baby?’’) or a mispronunciation (‘‘Where’s the vaby?’’). The test included a mix of consonant and vowel mispronunciations, as well as word-initial and word-internal mispronunciations. Swingley and Aslin (2000) found that children were most accurate when hearing the correct pronunciations; children looked longer to the target objects after hearing a correct pronunciation rather than a mispronunciation. Looking accuracy was above chance for both pronunciations, indicating that children were still able to recognize the mispronounced words, albeit with more difficulty. Looking latency was also affected; children were slower to look to the target object after hearing a mispronunciation. These findings have been extended to children learning another language (Dutch) and mispronunciations using both common and rare sound substitutions (Swingley, 2003). The accuracy and latency results suggest that although mispronunciations still activate semantic knowledge of the known words, children’s lexical representations are sufficiently detailed that word recognition is hindered by slight changes in pronunciation.
14
Jenny R. Saffran and Katharine Graf Estes
Swingley and Aslin’s (2000) data could also support some versions of the holistic representation view, as many 18- to 23-month-olds have experienced a vocabulary spurt (Bates, Dale, & Thal, 1995). However, Swingley and Aslin (2002) provided further support for early detail in known words by testing 14- to 15-month-olds in the mispronunciation detection task. They found that well before the vocabulary spurt, infants’ word recognition was disrupted by mispronunciations of known words, both for close mispronunciations (e.g., ‘‘vaby’’ for baby) and more distant mispronunciations (e.g., ‘‘raby’’ for baby). Furthermore, according to parental report of receptive vocabulary, none of the infants knew any phonological neighbors for half of the words tested, and receptive vocabulary size was uncorrelated with the magnitude of the effect of mispronunciations on word recognition. These data indicate that crowding of vocabulary is not necessary for the development of fine-grained phonetic detail in lexical representations. However, the words children understand may differ considerably from the word forms with which they are familiar. Well before infants produce or understand many words, they are extracting and storing word forms as they segment words from fluent speech, or hear words spoken in isolation. Word forms that are familiar, but are not yet associated with meanings, may well act as neighbors to the target words. To address this issue, Swingley (2003) extended his analysis of phonological neighborhood effects by examining a corpus of infant-directed speech to identify frequently occurring word forms that could potentially act as neighbors to the test items. His analysis indicated that infants were unlikely to have previously stored phonological neighbors of the correctly pronounced and mispronounced word forms tested. In this regard, Swingley’s analysis yielded findings similar to those of Charles-Luce and Luce (1995)— infants were unlikely to know many similar sounding words. However, infants’ successful detection of mispronunciations is inconsistent with the contention that early lexical representations do not contain more detail than is necessary to discriminate word forms, and that phonological neighbors are necessary for the development of specified lexical representations (Charles-Luce & Luce, 1990; Jusczyk, 1993; Metsala, 1997). Swingley and Aslin (2002) suggested that their results support the notion of developmental continuity of speech representations, such that the phonetic categories infants learn in the first year provide the basis for word recognition. The apparently sophisticated word recognition skills of 14-month-olds reported by Swingley and Aslin (2002) raise the question of why infants of the same age in Werker and colleagues’ (Stager & Werker, 1997; Werker et al., 2002) task seem to be incapable of noticing alterations to words. The difference in performance may be due in part to the difference in the status of the words used. Swingley and Aslin tested infants’ recognition of known words and Stager and Werker tested knowledge of recently experienced words. Using a habituation
Mapping Sound to Meaning
15
task similar to the original Stager and Werker (1997) task, Fennell and Werker (2003a) found that infants could detect violations in word–object combinations for similar sounding known words, specifically ‘‘ball’’ and ‘‘doll.’’1 Fennell and Werker (2003b) also found that infants reported to understand the word ‘‘doll’’ and infants who did not explicitly have ‘‘doll’’ in their receptive vocabularies (but were probably familiar with it because of its frequency in child-directed speech) detected the mispronunciation of ‘‘doll’’ (‘‘goll’’) in a habituation task. Similarly, Swingley (2002; described in more detail later) found that when infants were familiarized with a word form such as ‘‘tiebie’’ several times before it was used to label a novel object, the infants did not treat a highly similar mispronunciation of the label (e.g., ‘‘kiebie’’) like a request for the labeled target object. Infants who were not first familiarized with ‘‘tiebie’’ failed to notice the change. Thus, familiarity with a word, even without knowing the word’s meaning, seems to facilitate attention to phonetic detail. Fennell and Werker (2003a,b) proposed that infants are able to detect detail in familiar words because the familiarity reduces the processing load. Infants no longer have to attend to the formation of a link between sound and meaning and can devote more attention to the details of the word form. Alternatively, Swingley (2003) indicated that infants engaged in word learning may initially perceive speech appropriately, but that information is not encoded robustly— the sounds of the new word are not represented in memory in a way that can support word recognition. Failure to store a word form appropriately is not a difficulty limited to novice word learners; adults sometimes encode words with errors as well. Swingley (2003) added that it is unclear why 14-month-olds fail to robustly encode a new word after hearing it up to 100 times, but pointed out that the notion that learning of new words occurs gradually, especially for young infants, is not surprising. Many characteristics of natural word learning contexts may promote better learning than laboratory tasks, such as exposure to multiple exemplars of the object, or distributed learning occasions rather than the concentrated exposure that occurs in experimental tasks. In addition, variability in teaching contexts can facilitate learning (e.g., Lively, Logan, & Pisoni, 1993). Perhaps variability in speakers or labeling contexts is critical in helping infants to learn which variations are important (e.g., phonemic change) and which do not matter and should be ignored (e.g., speaker change). Another important difference between the word learning studies (e.g., Stager & Werker, 1997) and word recognition studies (e.g., Swingley & Aslin, 2002) is methodological. In the word–object association task, failure to learn is expressed as failure to dishabituate on test trials presenting novel pairings of familiar words and objects. As Swingley and Aslin (2002, 2005) have suggested, infants’ 1 In Western Canadian English, the dialect of the participants, ‘‘doll’’ and ‘‘ball’’ are minimal pairs.
16
Jenny R. Saffran and Katharine Graf Estes
failure to dishabituate may not always be caused by a lack of learning, but may occur because the mismatching label is similar enough to the correct label to activate knowledge of the correct label and its referent. This possibility is analogous to infants looking to the picture of the baby after hearing ‘‘vaby’’ in the visual fixation procedure. Perhaps the visual fixation task is more sensitive because it is less susceptible to interference than the word–object association task. In the visual fixation task, infants can listen and seek the correct referent for a word, and have the opportunity to reject a referent that may be close, but not as good a match as the correct referent. Ballem and Plunkett (2005) used the visual fixation method to examine the detail in 14-month-olds’ representations of newly learned vs. familiar words (see also Bailey & Plunkett, 2002, for similar work with 18- to 24-month-olds). Ballem and Plunkett’s predictions were based on the consistent finding that 14-month-olds fail to notice detail in new words in the word–object association task (e.g., Stager & Werker, 1997), but notice detail in familiar words in both habituation and visual fixation tasks (Fennell & Werker, 2003; Swingley & Aslin, 2002). Thus, they predicted that infants would notice mispronunciations of known words (‘‘ball’’ and ‘‘cup’’) but not newly learned words (‘‘tuke’’ and ‘‘vope’’) taught during the experiment. Surprisingly, there was some evidence of phonetic detail in both the familiar and novel words: Infants showed systematic looking to the target objects given correct pronunciations of both familiar and new words. The infants did not look consistently to the target object following mispronunciations of either word type. However, infants’ responses to the known words differed from their responses to new words. Although the infants did look systematically to correct pronunciations and not incorrect pronunciations of the novel words, the looking patterns of the correct and incorrect pronunciations did not differ significantly. Ballem and Plunkett contended that this was because newly learned word representations remain fragile. The infants have sufficiently detailed representations of the new words for the mispronunciation to disrupt recognition, but the difference in recognition for correct vs. incorrect pronunciations is weaker than for familiar words. Ballem and Plunkett’s (2005) experiment suggests that although familiarity likely does play a role in infants’ word recognition (as it does for adults), high familiarity does not seem to be essential for infants’ representation of phonetic detail. The findings also indicate that the task demands of the visual fixation task may be crucial in revealing this detail, a proposal that merits further investigation. When 14-month-olds are required to react to a minor change in a highly familiarized word, as in habituation-based tasks, they do not show evidence of noticing phonetic detail. Although habituation-based tasks involve minimal demands on coordination and production, the testing conditions may not be ideal for expressing knowledge of phonetic detail—the objects and
Mapping Sound to Meaning
17
labels presented during testing are highly familiar, the only change is in their pairing. Thus, the conditions of same and switch test trials may differ sufficiently to spark an increase in interest, or can only do so for older infants who are highly sensitive to the change. In contrast, in visual fixation tasks, perhaps 14-month-olds are not easily misled by an altered pronunciation when required to look to a named object; the change may be noticed with little difficulty. This measure may be more revealing of their representations of fine-grained word form characteristics. As this review has shown, the findings from studies of phonetic detail in early lexical representations are complex. When words are familiar, novice word learners show attention to detail in both habituation-based (Fennell, 2004, personal communication; Fennell & Werker, 2003a,b) and visual fixation tasks (Swingley & Aslin, 2002). But when words are novel, infants seem not to notice detail in habituation tasks (Stager & Werker, 1997), but may notice some detail in visual fixation tasks (Ballem & Plunkett, 2005). Werker and Curtin (2005) presented a new framework for understanding the seemingly discrepant patterns in infant speech perception and early word learning and recognition. Processing Rich Information from Multidimensional Interactive Representations (PRIMIR) attempts to explain why infants use detailed phonetic information (e.g., McMurray & Aslin, 2005) and indexical information (such as speaker identity and affect; e.g., Houston & Jusczyk, 2000; Singh, Morgan, & White, 2004) in some tasks, but in other tasks, they seem to attend to higher level categorical and word-level properties (e.g., Jusczyk & Aslin, 1995). According to PRIMIR, infants’ difficulty processing the phonetic detail of new words comes from the demands of attending to the relevant information that makes words distinct. The features of word forms are appropriately represented based on a general perceptual analysis. However, when the words being examined overlap a great deal (as with ‘‘bih’’ and ‘‘dih’’), infants may not know what information is criterial for distinguishing these forms, making it very difficult to attend to the sound–meaning linkage. As children learn words, the overlap in sound characteristics of the word forms increases and categories of phonemes eventually emerge from the regularities in the overlapping words. In this way, PRIMIR places more emphasis on the role of vocabulary development in infants’ ability to access phonetic detail than the earlier resource limitation account by Werker and colleagues (e.g., Werker et al., 2002). Werker and Curtin proposed that the acquisition of a critical number of form–meaning associations is necessary for phonemic categories to develop, and that categories should emerge earliest from dense phonological neighborhoods (a proposal that is similar to those made by Walley, 1993 and Charles-Luce & Luce, 1990, 1995). After phonemic categories have emerged, children approach word learning with an idea of which sound distinctions are essential
18
Jenny R. Saffran and Katharine Graf Estes
to word form identity, leaving more resources remaining for associating forms with meanings. At this point, children can learn words readily and flexibly in a variety of tasks (see Werker & Curtin, 2005, for a more thorough explanation of this process). Thiessen (2005) has also addressed how infants develop the skills necessary to access phonetic detail in new words. He investigated the contribution of learning about the functional significance of phonetic distinctions. Thiessen pointed out that infants can often detect differences that they appear to not yet know how to use. For example, although 2-month-olds can discriminate between stressed and unstressed syllables (Turk, Jusczyk, & Gerken, 1995), infants do not use this cue to segment words until 8 months of age (Jusczyk, Houston, & Newsome, 1999). Thiessen and Saffran (2003) proposed that infants must learn where stress falls within words in their native language in order to use stress as a segmentation cue. Similarly, although 14-month-olds may be able to distinguish between phonetically similar word forms like ‘‘bih’’ and ‘‘dih,’’ they have to learn that the difference between ‘‘/b/’’ and ‘‘/d/’’ indicates a difference in word meaning to use this information in word learning tasks. Infants may need to learn the distributional contexts of speech sounds as they occur in different words to learn how to use the perceptual distinctions. Thiessen (2005) proposed that because older infants know more words than younger infants, they may have gathered more distributional information about how phonetic distinctions function in their language. Thiessen’s distributional account and PRIMIR (Werker & Curtin, 2005), both suggest that although infants may perceive phonetic distinctions appropriately, they do not always know to treat the distinctions. However, Thiessen’s perspective on the role of vocabulary development is somewhat different from the claim presented in PRIMIR, though the two are not necessarily mutually exclusive. Werker and Curtin emphasized the role of overlapping word forms and indicated that learning clusters of words in dense neighborhoods should promote the development of phonemic categories. Thiessen emphasized that the key is learning how phonemes pattern in different contexts. The distributional account specifically predicts that when infants have experience with phonemes in different lexical contexts, they should no longer confuse minimal pair words that differ on that phonemic contrast. To investigate this hypothesis, Thiessen used the word–object association task (e.g., Stager & Werker, 1997) to test whether experience with the phonemes /d/ and /t/, presented in different lexical contexts, would enable 15-month-olds to pick up on phonetic detail in a new word. Infants habituated to three word–object combinations: A ‘‘daw’’ object, a ‘‘dawbow’’ object, and a ‘‘tawgoo’’ object to provide lexical contexts for ‘‘/d/’’ and ‘‘/t/.’’ Infants’ looking time was then compared for trials in which the daw-object was presented with the label ‘‘daw’’ (same) and with the label ‘‘taw’’ (switch). Consistent with the
Mapping Sound to Meaning
19
distributional account, infants did not treat the labels interchangeably; they dishabituated when the daw-object was labeled with ‘‘taw.’’ To ensure the facilitation was not caused by reduced attentional capacity demands from hearing the ‘‘daw’’ in ‘‘dawgoo,’’ Thiessen presented another group of infants with the object labels ‘‘tawgoo’’ and ‘‘dawgoo,’’ in addition to ‘‘daw.’’ This set of word–object combinations provides less distributional information than ‘‘dawbow’’ and ‘‘tawgoo’’, as ‘‘/d/’’ and ‘‘/t/’’ are now in the same phonological context—‘‘awgoo’’. As predicted, infants now treated ‘‘taw’’ and ‘‘daw’’ as interchangeable. Thus, when 15-month-olds experienced speech sounds occurring in different lexical contexts (that is, associated with different objects and in different phoneme combinations), they noticed subtle phonetic differences in novel words. The manipulation of infants’ experiences with distributions of lexical contexts may provide a model of what occurs between 14 and 17 months to facilitate learning of phonetically similar words. Thiessen suggested that older infants have learned about the contexts in which phonetic distinctions occur and know which distinctions signal differences in meaning—they have learned which distinctions have functional significance. This process may occur at a different pace for different speech sounds depending on the order in which infants become familiar with clusters of words, a suggestion also proposed in other models (e.g., Walley, 1993; Werker & Curtin, 2005). Many accounts of the development of phonetic detail hypothesize an important role for vocabulary development. To test the roles of neighborhood density and other types of clusters (i.e., words beginning with the same phoneme) in word learning and phonological development, it will be important to examine how the distinctions that infants are sensitive to in word learning relate to the constellations of words in infants’ developing lexicons. It may be particularly important to consider how attention to phonetic detail is influenced by both distributional information from words stored with meanings (i.e., in receptive vocabulary) and without meanings (i.e., words that have been segmented but not yet paired with meanings), as these two types of experiences may not have equivalent effects on the acquisition of new words. What conclusions can we draw about the nature of early lexical representations from the study of phonetic specificity? First, children’s representations of words are more detailed than thought previously (reviewed in Walley, 1993). The use of familiar words and sensitive measures of learning and recognition have revealed sophisticated processing of spoken words in young children. Second, well-developed vocabularies and crowded phonological neighborhoods are not essential for forming fine-grained lexical representations. The evidence of detailed representations in novice word learners (Ballem & Plunkett, 2005; Swingley & Aslin, 2002) does not indicate, however, that vocabulary growth and increasingly dense phonological neighborhoods do not affect how
20
Jenny R. Saffran and Katharine Graf Estes
words are represented. Clearly, phonological development continues well past the second year and is shaped by the words children hear and learn (e.g., Beckman & Edwards, 2000; Edwards, Beckman, & Munson, 2004). Neighborhood density also affects language processing in later childhood and adulthood (Garlock, Walley, & Metsala, 2001; Vitevitch & Luce, 1998), and likely affects changes in lexical representations, particularly during periods of rapid vocabulary growth. In addition, experiments by Thiessen (2005) suggest that learning constellations of words that overlap in other ways (e.g., words beginning with /b/) may also affect the use of phonetic detail in word learning. The third conclusion is that the evidence supports developmental continuity in the representations of speech sounds. The phonetic distinctions infants learn in the first year provide a foundation for the representations of early words. This is good news for many researchers, as this conclusion helps to maintain the relevance of studies of infant speech perception. However, how infants transition from difficulty with the specificity of new words to ease in noticing detail remains mysterious. Based on comparisons of habituation-based (Stager & Werker, 1997) and visual fixation procedures (Ballem & Plunkett, 2005), the task in which learning is measured seems important. Additional demonstrations of the importance of how specificity is examined have come from Vihman et al. (2004) and Swingley’s (2005a) studies further examining the findings of Halle´ and de Boysson Bardies (1995). In contrast to the original results, Vihman et al. and Swingley have both shown that infants do detect alterations of familiar words presented in lists, particularly when one considers the effects of learning that occurs within the task and of making changes at different word positions. Other findings also suggest that vocabulary development (Werker et al., 2002; see also Edwards, Beckman, & Munson, 2004) and familiarity with word forms (Fennell & Werker, 2004) matter for infants’ ability to recognize alterations to words. However, the precise role that vocabulary development plays, and whether the influence of extant vocabulary is based on neighborhood density or some other type of vocabulary clustering, remains unclear. The effect of familiarity of sound sequences is also not well understood; it may not be as essential to noticing detail as was once suggested (Ballem & Plunkett, 2005). The final conclusion is that the study of phonetic specificity in early lexical representations has implications for understanding how the lexicon develops more generally. For example, the results of this body of work suggest that there may be changes in speed of processing and accuracy of word recognition depending on the amount of experience a child has had with a particular word. How rapidly lexical representations change from acting like new words to acting like well-known words may also change with development. This change may be part of what makes older infants better word learners—their new words may become integrated with the existing lexicon and become ‘‘familiar’’ more readily than for younger infants. Studies of phonetic specificity have generated
Mapping Sound to Meaning
21
sensitive measures of word learning and recognition that can be extended to the exploration of other connections between how infants learn about sound structure and the mapping of those sounds to meanings.
IV. Effects of Familiarity with the Sounds of Words on Word Learning In this section, we examine how infants’ increasing familiarity with the sounds of the words in their native language lays a foundation for word learning. Specifically, we ask how learning about patterns of sound combinations in words (phonotactic probability) and overlapping word forms (neighborhood density) affect infants’ ability to learn new words. We then discuss how infants’ ability to segment words from fluent speech influences how words are mapped to meaning. Although the processes by which familiarity with sounds of words affects early word learning are not yet well understood, it promises to be a fruitful and revealing area of study.
A. PHONOTACTIC PROBABILITY AND NEIGHBORHOOD DENSITY IN EARLY WORD LEARNING
Infants develop remarkable sensitivities to the patterns of sound combinations in their native language long before they understand many of the words they hear. The ability to track distributional information about sound sequences is available early in life (Chambers, Onishi, & Fisher, 2003; Saffran & Thiessen, 2003; Saffran, Aslin, & Newport, 1996) and is maintained through adulthood (Onishi, Chambers, & Fisher, 2002; Saffran et al., 1997). By 9 months of age, infants prefer to listen to words that fit the typical patterns of phoneme combinations of their native language (Jusczyk, Luce, & Charles-Luce, 1994). Regularities in the sound combinations of words affect the speed and accuracy of word recognition in older children and adults (Gathercole, Chambers, & Fisher, 1999; Onishi, Chambers, & Fisher, 2002; Vitevitch et al., 1999). Thus, characteristics of the sound structure of words affect processing throughout life. Although the development of semantic representations must be very influential in word learning, characteristics of phoneme combinations likely affect how new words are added to the lexicon as well. The connection between sound patterns and mapping to meaning has been investigated in only a handful of studies of early word learning. Research examining how learning about sound combinations affects word learning has primarily addressed two interrelated characteristics of sound structure: phonological neighborhood density and phonotactic probability. As described
22
Jenny R. Saffran and Katharine Graf Estes
previously, a word’s phonological neighborhood includes all of the words that differ from a given word by one phoneme deletion, substitution, or addition in any position in the word. Words that overlap with many other words are said to reside in dense neighborhoods; words with few neighbors reside in sparse neighborhoods. Phonotactic probability refers to the probability that a phoneme or phoneme combination occurs in a given position in words and syllables. Neighborhood density and phonotactic probability are highly correlated— words in high density neighborhoods tend to consist of high frequency phoneme combinations. However, neighborhood density and phonotactic probability have differential effects on lexical processing. Studies of adult word recognition have shown that nonwords consisting of high phonotactic probability sequences are repeated faster than nonwords with low probability sequences (Vitevitch & Luce, 1998; but see the debate in Lipinksi & Gupta, 2005 and Vitevitch & Luce, 2005). Children are able to repeat high probability nonwords with better accuracy than low probability items (e.g., Edwards, Beckman, & Munson, 2004; Gathercole, 1995) and show greater recall of lists of high probability nonwords (Gathercole et al., 1999). Even young children (2½ years of age) are sensitive to phoneme frequencies in nonword repetition (Coady & Aslin, 2004). The neighborhood density of real words seems to have an opposite effect; words from dense neighborhoods are recognized more slowly and less accurately than words from sparse neighborhoods (Vitevitch & Luce, 1998). Children also require more sound information to recognize words from dense neighborhoods than sparse neighborhoods, and are less accurate at repeating words from dense neighborhoods (Garlock, Walley, & Metsala, 2001; Metsala, 1997). These findings support the notion that the lexicon is organized such that similar sounding words compete for activation. Thus, high neighborhood density hinders rapid word recognition for items established in the lexicon, whereas high phonotactic probability facilitates the processing of nonwords that have not been stored previously. It is not yet clear how knowledge of phonotactic probabilities and the neighborhood density of the early lexicon affect the formation of new sound– meaning associations. In describing the effects of phonotactic probability on nonword repetition performance, Edwards, Beckman, and Munson (2004) explained that novel words containing common sound patterns may have the support of familiar words that can be ‘‘used by analogy’’ in the development of acoustic and articulatory representations (p. 433). Similarly, in word learning, children may be able to add familiar sounding, high phonotactic probability words to the lexicon more readily than words consisting of unusual sound sequences with low probability. High probability sequences consist of phoneme combinations that the infant has experienced frequently in the past; the sounds are connected by well-traveled pathways. These word forms may be easier to
Mapping Sound to Meaning
23
encode than low probability words, allowing the infant to focus on establishing the link between the form and its meaning. Higher probability sequences may also be easier to recall, making it more likely that the infant will remember the word form and its associated referent when recognition is required. Easily acquired and early-acquired words may tend to consist of high probability sound sequences. Alternatively, for new word forms that are similar to previously stored words, establishing the appropriate association between sound and meaning may be difficult. This is related to the effect of neighborhood density: If the infant has already established a dense phonological neighborhood, linking meaning to a new word in this neighborhood may be difficult because the word can be confused with several other words. High density could also impair recognition and comprehension of a new word, as attempting to recall the target word could activate similar sounding words. The representation of the target word may not be sufficiently well-developed to win the competition for activation. By this reasoning, in early vocabulary development, children may fill in the lexicon by acquiring words in sparse neighborhoods more readily than those in dense neighborhoods. In the literature investigating how knowledge of sound combinations might affect word learning, phonotactic probability and neighborhood density are sometimes considered independently (Hollich, Jusczyk, & Luce, 2002), and are sometimes correlated as in natural language (Storkel, 2001, 2004a). It is not clear whether there are independent effects of phonotactic probability and neighborhood density in early word learning, as there are for adults in some cases (i.e., words vs. nonwords). The existing developmental research indicates that prior knowledge of word forms and sound combinations affects how new words are added to the lexicon, but many of the details of this influence have yet to be tested. To examine whether infants and young children tend to acquire words in dense or sparse phonological neighborhoods, Storkel (2004a) analyzed the age of acquisition for nouns on the MacArthur Communicative Development Inventory, a parental report measure of vocabulary for infants (8 – 16 months, Words and Gestures version) and toddlers (16 – 30 months, Words and Sentences version). Early-acquired words tended to reside in high density neighborhoods, particularly for short and low frequency words. Later acquired words tended to come from sparse neighborhoods. The effects of neighborhood density on age of acquisition were reduced for high frequency words, perhaps because high exposure rates outweighed the effects of sparse neighborhoods. Storkel proposed that fitting a new word into an established neighborhood may strengthen its representation, making it learnable with fewer exposures compared to new words that are dissimilar from known words. Storkel’s
24
Jenny R. Saffran and Katharine Graf Estes
findings also suggest that specification of word forms must occur at a younger age than previously supposed (Charles-Luce & Luce, 1990, 1995). Coady and Aslin (2003) attempted to improve upon previous analyses of children’s vocabularies by using more representative speech samples, and by including both maternal input and children’s productive vocabulary in their neighborhood analysis. They also examined neighborhood density weighted by both neighborhood frequency and vocabulary size, to ensure that differences in density were not solely attributable to differences in the number of words in children’s vs. adults’ lexicons. They found that children’s vocabularies consisted of sparser neighborhoods than those of adults, a finding consistent with previous analyses. However, the neighborhoods were substantially denser than previous analyses reported (Charles-Luce & Luce, 1990, 1995); words in children’s productive vocabularies had an average of 6.5 neighbors at age 3½. Children also tended to know a greater proportion of shorter words than longer words, and shorter words overall are from higher density neighborhoods. In addition, when vocabulary size was controlled, children were found to have a greater proportion of confusable words than adults. Coady and Aslin (2003) concluded that children learn words with frequent sounds and sound combinations earlier than words with less frequent sounds. Their analyses also support the notion that children maintain detailed lexical representations early in life. If children did not represent phonetic detail in their early words, one would expect them to add dissimilar words to the vocabulary earlier than similar sounding words in order to maintain non-overlapping lexical entries. The findings from Storkel’s (2004a) and Coady and Aslin’s (2003) analyses lend support to the idea that characteristics of word forms play a role in how items are added to the lexicon and that children are not reluctant to incorporate overlapping word forms into their vocabularies. However, Coady and Aslin (2003) acknowledged limits to the conclusions that can be drawn from their analyses. The analyses contain a great deal of information, but they still represent vocabulary at one moment in time, rather than as a changing system. They cannot reveal how new items are added to the lexicon. This is also true of Storkel’s analysis of group age of acquisition trends. To test more directly the notion that children add new words to dense phonological neighborhoods early in word learning, it will be important to examine changes in the vocabulary constellations of individual children. Vocabulary analyses also cannot reveal how readily words with different degrees of overlap with known words are added to the lexicon when those new words are first encountered. Swingley and Aslin (2005) performed a set of experiments to examine the effects of phonological neighbors on 18-month-olds’ learning of new words. They proposed that the novel neighbor of a familiar word (e.g., ‘‘tog’’) may be sufficient to activate the representation of the known word (‘‘dog’’), although the exact match of the familiar word form would
Mapping Sound to Meaning
25
likely produce greater activation of it than the novel neighbor (see also Swingley & Aslin, 2002). However, knowledge of stored lexical items may interact with the output of phonological processing indicating that the novel neighbor is a new and different word form. Swingley and Aslin compared infants’ performance learning novel neighbors of known words (e.g., ‘‘tog’’) and novel non-neighbors (e.g., ‘‘meb’’) as object labels in a visual fixation paradigm. They investigated whether infants would tend to be open to adding new words to the lexicon, readily treating both the novel neighbors and the novel non-neighbors as labels for novel objects. Alternatively, infants might be conservative word learners, treating the novel neighbors as instances of the known words. In this case, infants should be reluctant to map the neighbor labels to objects. In the first experiment, Swingley and Aslin found that when infants were taught both a novel neighbor and a non-neighbor label and tested with both novel objects, they only looked consistently to the correct target when the non-neighbor was requested. However, when the neighbor object (the ‘‘tog’’) was presented with its associated familiar object (the ‘‘dog’’), infants looked appropriately to the target neighbor object. Thus, children appeared to learn something about the neighbor label (enough to know that it did not refer to a dog, for example), but learning was more robust for non-neighbor labels. In a second experiment, Swingley and Aslin taught one group of infants one novel neighbor object label and another group one novel non-neighbor label to reduce the demands of attending to multiple objects and labels. Again, only children taught non-neighbor labels looked consistently at the requested target object when the two novel objects were presented (the target and a second novel object that was introduced, but not labeled, during teaching of the target). Interestingly, when the labeled neighbor object was presented with its associated familiar object, infants failed to look consistently at the requested neighbor object (e.g., ‘‘tog’’) or the requested familiar object (e.g., ‘‘dog’’).2 Infants who were not taught neighbor labels readily recognized the familiar words. Thus, infants’ recognition of known words was disrupted after being taught a similar sounding object label. The findings of the second experiment indicate that infants learned something about the novel neighbor labels; however, the nature of what they learned is again unclear. Similar to the first experiment, infants seem to have learned enough to know that they should not treat the label (‘‘tog’’) like a similar sounding familiar word (‘‘dog’’), but not enough to identify the appropriate novel object the label was paired with. Swingley and Aslin proposed that the locus of infants’ difficulty learning neighbors of known words is in their ability to associate the word form with the meaning, and not in the perceptual analysis of the word form. They suggested 2 The second experiment was conducted with Dutch learning infants, using Dutch words and their novel neighbors. We carried over the same dog/tog example for simplicity.
26
Jenny R. Saffran and Katharine Graf Estes
that interference caused by similar sounding words makes it difficult for young children to learn words that are phonological neighbors. Hearing a novel neighbor word form, even if it is not an exact match to a known word form, may activate the known word’s meaning. This may disrupt the infant’s ability to associate a new meaning with the novel word. The interference explanation may have implications for understanding 14-month-olds’ difficulty learning two novel similar sounding words, as in Stager and Werker’s (1997) experiments. Activation of the two novel forms may cause problems for infants’ ability to attend to the form–meaning association for either word. In addition, during testing, hearing one word may activate the other word as well, making the detection of a mismatch difficult.3 Hollich and colleagues (2002) tested whether neighborhood density affects infants’ learning of new words by manipulating infants’ experience with the phonological neighbors of new words, rather than relying on infants’ nativelanguage experience. In this experiment, 17-month-olds were familiarized with repetitions of two novel word lists: A high-density list containing 12 neighbors of the novel word ‘‘tirb,’’ (differing only in the initial consonant) and a low density list consisting of three neighbors of the word ‘‘pawch’’ plus nine filler items. Using the visual fixation paradigm, the infants were then presented with two novel objects labeled with the words ‘‘tirb’’ and ‘‘pawch.’’ Critically, the infants had not heard these items during familiarization. Infants only showed evidence of learning the low density item, looking longer to the ‘‘pawch.’’ This suggests that infants learned the low density, and possibly less confusable, item more readily, a finding that seems consistent with Swingley and Aslin’s (2005) experiment but inconsistent with the neighborhood density analyses of children’s vocabularies (Coady & Aslin, 2003; Storkel, 2004a). However, Hollich, Jusczyk, and Luce (2002) also found that children in a control group with no previous exposure to either neighborhood had difficulty learning the labels. The authors performed a second experiment to directly test the idea that there is a ‘‘sweet spot’’ for word learning somewhere between starting fresh with a new word form and attempting to fit a new word into a crowded neighborhood. They found that exposing children to the high density word list only once, rather than six times as in the original experiment, facilitated learning of the novel label. This exposure provided the infants with enough familiarity with sound combinations similar to the novel label to facilitate learning without inducing competition and confusion. 3
The interference explanation does not as readily explain why infants fail in the one-object version of the word–object association task (used in Pater, Stager & Werker, 1997, Experiment 2). However, a related proposal is that the elevated activation of the habituated word form (e.g., ‘‘bih’’) may drown out the activation of the overlapping novel form (e.g., ‘‘dih’’) and contribute to infants’ failure to dishabituate.
Mapping Sound to Meaning
27
The experiments of Swingley and Aslin (2005) and Hollich, Jusczyk, and Luce (2002) demonstrated that infant learning about sound structure can affect how word forms are associated with objects. This broad conclusion is consistent with the findings of Coady and Aslin (2003) and Storkel’s (2004a) vocabulary analyses. The finding that word form characteristics matter is noteworthy, as many other factors have the potential to influence the mapping of sound to meaning, such as frequency of exposure, salience of the labeled referent, salience of the label in the input, and social learning cues. However, the analyses and experimental findings conflict. In Swingley and Aslin and Hollich and colleagues’ experiments, children have difficulty learning new words when they are potentially confusable with other familiar word forms. The vocabulary analyses indicate that children build the lexicon with words in neighborhoods that contain overlapping words. The difference may be attributable in part to the conditions of natural vocabulary learning, which may include repeated exposure in varied contexts and environmental and pragmatic support capable of overriding any confusion caused by similar sounding words. Perhaps facilitation of word learning in dense neighborhoods is difficult to demonstrate in experimental tasks because of the simplified word learning environment commonly used. As Coady and Aslin (2003) and Storkel (2004a) pointed out, the influence of neighborhood density may also be due in part to facilitative effects of phonotactic probability. Infants are very skilled at tracking distributional patterns and gathering knowledge of phonotactic probabilities during the first year of life. This learning is likely to transfer to the task of associating meanings with words. Hollich and colleagues provided some evidence for the role of phonotactic probabilities by showing that some familiarity with particular sound sequences facilitated word learning. However, we still know little about the role that natural phonotactic patterns play in infant word learning. Support for the influence of natural language phonotactic patterns on word learning comes from experiments with older children. Storkel (2001, 2003) found that for 3- to 6-year-old children, novel words consisting of common sound sequences were associated with meanings more readily than labels with rare sound sequences (but see also Storkel, 2004b). The type of distributional learning mechanism that tracks phonotactic probabilities is active throughout life (e.g., Chambers, Onishi, & Fisher, 2003; Onishi, Chambers, & Fisher, 2002). Such patterns would likely affect infant word learners as well. Highly probable phoneme sequences from the infant’s native language might be more readily encoded and remembered, facilitating links between sound and meaning. High probability sequences might also be more readily retrieved from the lexicon for recognition. This could be displayed in several ways: Infants may be more accurate at identifying items with high probability labels; infants may recognize items more rapidly; infants may retain the association between a high
28
Jenny R. Saffran and Katharine Graf Estes
probability word and its referent over a longer period of time than for a low probability word. These possibilities are yet to be explored. B. SEGMENTING WORDS AND MAPPING SOUND TO MEANING
Most words that infants hear are not presented in isolation (Woodward & Aslin, 1990; Brent & Siskind, 2001). Therefore, infants’ ability to associate meanings with words should be greatly facilitated by the ability to segment individual word forms from fluent speech. Infants learn a great deal about the sound structure of their language that helps them to find words in the speech stream. By the end of the first year, infants can take advantage of patterns of syllable co-occurrences (Saffran, Aslin, & Newport, 1996), rhythmic patterns (Johnson & Jusczyk, 2001; Jusczyk, Hohne, & Bauman, 1999; Thiessen & Saffran, 2003), and regularities in the phonetic variations and phonotactic patterns that occur at the beginnings and ends of words (Jusczyk, Hohne, & Bauman, 1999; Mattys et al., 1999). Presumably, after the infant has segmented a word form, it is available to be associated with a meaning. However, we do not yet understand the nature of the connection between word segmentation and the ability to link meaning to words, both of which are essential skills for lexical development. One feature of learning that might simplify this process is that some word forms may not require segmentation; that is, they may be presented to infants as words in isolation. However, although parents do present a minority of words this way—one estimate suggests 10% (Brent & Siskind, 2001)—parents more typically speak words within fluent utterances, even novel words that they are trying to teach their child (e.g., Woodward & Aslin, 1990). Some words are not spoken in isolation for grammatical or pragmatic reasons, for example parents are unlikely to present function words (‘‘the,’’ ‘‘among,’’ ‘‘over’’) as isolated words. A related problem is that when parents speak multisyllabic words in isolation, infants must somehow decide whether these utterances consist of a single multisyllabic word or multiple words. Infants need to make headway on the segmentation problem before word representations become available. After proto-words are segmented, these can play a key role in the segmentation of subsequent words (Bortfeld et al., 2005). All of these considerations strongly suggest some sort of link between the mechanisms underlying segmentation from fluent speech and the emerging lexicon. One way that the basic skills of segmenting and associating meanings with words may connect in vocabulary acquisition is that segmented sound sequences may be stored as potential words, waiting to be linked with meanings. Perhaps by the end of the first year, infants gather a rudimentary lexicon of segmented forms that have yet to be linked to referents. One of the factors preparing infants for building a productive and receptive lexicon may be the segmentation and
Mapping Sound to Meaning
29
storage of these word forms. Conversely, during the first year, infants likely gather concepts that are not yet associated with labels. Early word learning may be supported by the integration of previously gathered forms and conceptual representations. Thus, as infants begin to map meanings to words, they are not starting from scratch with each lexical item. There may also be developmental changes that affect how rapidly these newly segmented words are available to be mapped to meaning. Younger infants may require more experience with the sounds of words before they can link sounds to referents; they may rely more on previously gathered segmented forms for word learning than older infants do. Older infants may be able to segment a word form online and immediately associate a meaning with it. Also, for infants at any developmental level, newly segmented words may require additional experiences in new contexts before they are available to be associated with referents. One means of investigating the relation between segmentation and word learning is to start by understanding the nature of the representations that emerge from attempts to segment words. Despite the burgeoning literature on ‘‘word segmentation,’’ little work has assessed the claim that what infants segment is actually represented as potential native language words, as opposed to familiar sound sequences unrelated to the lexicon. Saffran (2001) and Saffran and Wilson (2003) examined the representations yielded by statistical segmentation mechanisms. Infants are highly sensitive to distributional information, and can use patterns of syllable co-occurrences to segment words from fluent speech in artificial languages (Aslin, Jusczyk, & Pisoni, 1998; Saffran, Aslin, & Newport, 1996). However, we know less about the representations that emerge from such processing. Do infants interpret the sound sequences they are segmenting from speech as actual words, or instead as sound sequences that are probable in the native language? To test whether the output of infants’ statistical segmentation yields wordlike sequences, Saffran (2001) first exposed infants to an artificial language in which the only cue to word boundaries was the high transitional probabilities within words vs. the low transitional probabilities across word boundaries. Infants were then tested using ‘‘words’’ and ‘‘partwords’’ (sequences crossing word boundaries) from the artificial language embedded within either English sentences or matched nonsense frames. Infants preferred to listen to words over part-words when they were embedded in English sentences, but not when the words and part-words were embedded in nonsense frames. The findings support the claim that infants segment word-like sequences that are ready to be integrated with native language knowledge. Curtin, Mintz, and Christiansen (2005), using the same paradigm, extended these results to show that infants’ representations of newly segmented words retain the stress patterns heard during exposure. Moreover, newly segmented sound sequences appear to participate in subsequent aspects of language learning. Saffran and Wilson (2003) provided
30
Jenny R. Saffran and Katharine Graf Estes
further support for the claim that ‘‘words’’ are the output of statistical segmentation by showing that infants can use the output of statistical segmentation in a grammar-learning task. Infants exposed to an artificial language were able to use syllable co-occurrences to segment words from the language, and then to discover a simple grammar that determined the legal orderings of those words. These findings support the hypothesis that statistical learning about syllable sequences yields representations that are word-like. Swingley (2005b) investigated how infants’ segmentation skills might relate to word learning by examining whether the sound sequences that are most likely to be segmented by infants actually correspond to real words. To do so, Swingley analyzed corpora of Dutch and English infant-directed speech. The analysis complements Saffran’s (2001) experiment by testing whether a statistical learning mechanism applied to natural speech input is likely to yield real words and not mis-segmentations. Swingley examined how infants might cluster sound sequences based on tracking the probability and frequency of syllable co-occurrences in infant-directed speech. The results suggest that if infants used such a mechanism, they would primarily extract real words. Swingley proposed that word segmentation provides infants with a ‘‘protolexicon’’ of word forms available to be mapped to meaning. The prior segmentation of a word form should make the mapping process easier because the infant no longer needs to figure out the sound form. Instead, the infant can concentrate on identifying the word’s meaning, and linking the sound and meaning representations. Thus, a stock of candidate word forms may then become early-learned vocabulary items. Swingley’s analysis illustrates the importance of considering what is likely to be stored in memory by infants in addition to what they are reported to actually understand. Segmented words encoded in memory may affect the association of sound and meaning, phonotactic probability and neighborhood density, and early syntax learning. Hollich (in press) performed an experimental test of whether prior segmentation of a word form affects mapping to meaning by familiarizing 23-montholds with a passage containing two target novel words. Children had the opportunity to segment the words from this speech stream before they were associated with novel objects. In this experiment, Hollich also examined whether children could generalize across speakers when mapping a segmented word to meaning. One of the novel words was presented by the same speaker during familiarization and labeling; the other novel word was presented by different speakers during familiarization and labeling. Hollich reported that children were better able to learn an object label that was a previously segmented word, demonstrating that prior experience with a word form rapidly facilitates the link between sound and meaning. However, there were limits on young children’s ability to apply prior learning about the word form. When the speaker changed between the familiarization and labeling phases of the experiment, prior
Mapping Sound to Meaning
31
experience did not facilitate word learning, even for the relatively sophisticated word learners tested. This is likely due to the difficult learning task presented; the children heard the two new labels only six times each. In follow-up experiments, Hollich found that children could learn the words given more repetition and variability during labeling. Hollich’s data demonstrate that 2-year-olds are still developing flexibility in lexical representations, particularly for newly presented words. When a learning task is difficult (i.e., few exposures), even practiced word learners may require prior experience with word forms to associate them with objects. Swingley (2002) also found that prior segmentation of word forms facilitates the development of robust sound–meaning associations. Dutch 18- and 19-month-olds watched an animated movie that included several presentations of a novel word form embedded in sentences with no referent present. Half of the children then heard the same novel word used as an object label in a visual fixation task: ‘‘This is a tiebie’’ (translated from Dutch). The other half of the children heard an unfamiliarized word used as a label. Children then viewed two objects on a screen, the labeled object and a second novel object. They were then asked, ‘‘Can you find the tiebie?’’ with a correct pronunciation of the label or a phonetically similar mispronunciation (e.g., ‘‘kiebie’’). Children who had the opportunity to segment the word prior to learning it as an object label showed a difference in looking behavior to the mispronunciation. Children who did not have previous experience with the word did not notice the mispronunciation. This experiment indicates that prior segmentation of a word form facilitates attention to phonetic detail in new words. The data from Hollich (in press) and Swingley’s (2002) studies demonstrate that segmentation of word forms affects how readily infants map sounds to meanings and attend to phonetic details in new words. Previous experience with a segmented word form can facilitate word learning even for skilled 2-year-old word learners. It is not yet clear how segmenting words and associating meanings with words are related for younger infants, who are likely still facing challenges in word segmentation. Younger infants may rely even more on prior word segmentation in order to learn new labels. Future experiments combining word segmentation tasks with word learning tasks hold promise for investigating how prior experience with word forms and gathering of a ‘‘proto-lexicon’’ may contribute to the rapid pace of vocabulary development during the second year of life.
V. Conclusions A large body of literature has been dedicated to discovering the nature of the perceptual tuning and native-language knowledge acquired before infants
32
Jenny R. Saffran and Katharine Graf Estes
begin to speak (reviewed in Aslin, Jusczyk, & Pisoni, 1998; Saffran, Werker, & Werner, 2006). A separate literature explores how infants and young children focus their attention on the appropriate meanings in learning new words (e.g., Bloom, 2000; Markman, 1990; Woodward & Markman, 1998). The connection between infants’ learning about the sounds of their language and mapping those sounds to meanings presents a relatively new area of research. In this chapter, we have reviewed potentially important connections between learning how language sounds and learning what language means: Phonetic specificity in early lexical development, and how familiarity with the sounds of words affects early word learning in areas of phonotactic probability and neighborhood density, and word segmentation. A general conclusion across the three areas of research reviewed here is that learning about the sound structure of the native language during the first year provides a critical foundation for later word learning. Furthermore, we suggest that the sound part of the sound– meaning mapping may be as important a determinant of learning as the meaning part. Although substantial additional research is needed, the sounds that make up the ‘‘wugs,’’ ‘‘blickets,’’ ‘‘tomas,’’ ‘‘daxes,’’ and ‘‘modis’’ used in decades of word learning research clearly do not come out of nowhere. The sounds of the words that infants are engaged in learning are intricately linked to extensive prior experience with the ambient language. By studying the acquisition of sounds and the acquisition of meanings in tandem, we may hope to reach a new level of insight into how infants accomplish the astonishing feat of learning words.
ACKNOWLEDGMENTS Preparation of this chapter was supported by grants to JRS from NICHD (R01HD37466) and NSF (BCS-9983630), and to KGE from NIDCD (F31 DC07277). We thank Chris Fennell, Dan Swingley, and Erik Thiessen for their helpful comments on a previous draft. We also thank Martha Alibali, Julia Evans, and Maryellen MacDonald for their valuable discussions. This chapter is based on the work submitted to the University of Wisconsin-Madison as part of KGE’s qualifying exams.
REFERENCES Allopenna, P. D., Magnuson, J. S., & Tanenhaus, M. K. (1998). Tracking the time course of spoken word recognition using eye movements: Evidence for continuous mapping models. Journal of Memory & Language, 38, 419 – 439. Aslin, R. N., Jusczyk, P., & Pisoni, D. B. (1998). Speech and auditory processing during infancy: Constraints on and precursors to language. In D. Kuhn & R. Siegler (Eds.),
Mapping Sound to Meaning
33
Handbook of Child Psychology: Vol. 2. Cognition, perception, and language (5th ed., pp. 147 – 198). New York: Wiley. Aslin, R. N., Saffran, J. R., & Newport, E. L. (1998). Computation of conditional probability statistics by 8-month-old infants. Psychological Science, 9, 321 – 324. Bailey, T. M., & Plunkett, K. D. (2002). Phonological specificity in early words. Cognitive Development, 11, 61 – 68. Ballem, K., & Plunkett, K. D. (2005). Phonological specificity in children at 1.2. Journal of Child Language, 32, 159 – 173. Barton, D. (1976). The role of perception in the acquisition of phonology. Palo Alto, CA: Stanford University dissertation. [Reproduced, Bloomington: Indiana University Linguistics Club, 1978.] Barton, D. (1980). Phonemic perception in children. In G. H. Yeni-Komshian, J. F. Kavanagh, & C. A. Ferguson (Eds.), Child phonology, Vol. 2: Perception (pp. 97 – 114). New York: Academic Press. Bates, E., Dale, P., & Thal, D. (1995). Individual differences and their implications for theories of language development. In P. Fletcher & B. MacWhinney (Eds.), Handbook of child language (pp. 97 – 151). Oxford: Basil Blackwell. Beckman, M. E., & Edwards, J. (2000). The ontogeny of phonological categories and the primacy of lexical learning in linguistic development. Child Development, 71, 240 – 249. Bloom, P. (2000). How children learn the meanings of words. Cambridge, MA: MIT Press. Bortfeld, H., Morgan, J. L., Golinkoff, R. M., & Rathbun, K. (2005). Mommy and me: Familiar names help launch babies into speech-stream segmentation. Psychological Science, 16, 298 – 304. Brent, M. R., & Siskind, J. M. (2001). The role of exposure to isolated words in early vocabulary development. Cognition, 81, B33 – B44. Brown, C., & Matthews, J. (1997). The role of feature geometry in the development of phonemic contrasts. In M. Young-Scholten (Ed.), Focus on phonological acquisition (pp. 67 – 112). Amsterdam: Benjamins. Chambers, K. E., Onishi, K. H., & Fisher, C. (2003). Infants learn phonotactic regularities from brief auditory experiences. Cognition, 87, B69 – B77. Charles-Luce, J., & Luce, P. A. (1990). Similarity neighbourhoods of words in young children’s lexicons. Journal of Child Language, 17, 205 – 215. Charles-Luce, J., & Luce, P. A. (1995). An examination of similarity neighbourhoods in young children’s receptive vocabularies. Journal of Child Language, 22, 727 – 735. Coady, J. A., & Aslin, R. N. (2003). Phonological neighbourhoods in the developing lexicon. Journal of Child Language, 30, 441 – 469. Coady, J. A., & Aslin, R. N. (2004). Young children’s sensitivity to probabilistic phonotactics in the developing lexicon. Journal of Experimental Child Psychology, 89, 183 – 213. Cole, R. (1981). Perception of fluent speech by children and adults. Annals of the New York Academy of Sciences, 379, 92 – 102. Cole, R., & Perfetti, C. A. (1980) Listening for mispronunciations in a children’s story: The use of context by children and adults. Journal of Verbal Learning and Verbal Behavior, 19, 297 – 315. Curtin, S., Mintz, T. H., & Christiansen, M. H. (2005). Stress changes the representational landscape: Evidence from word segmentation. Cognition, 96, 233 – 262. Dollaghan, C. A. (1994). Children’s phonological neighborhoods: Half empty or half full? Journal of Child Language, 21, 257 – 271. Eilers, R. E., & Oller, M. K. (1976). The role of speech discrimination in developmental sound substitutions. Journal of Child Language, 3, 319 – 329.
34
Jenny R. Saffran and Katharine Graf Estes
Edwards, M. L. (1974). Perception and production in child phonology: The testing of four hypotheses. Journal of Child Language, 1, 205 – 219. Edwards, J., Beckman, M. E., & Munson, B. (2004). The interaction between vocabulary size and phonotactic probability effects on children’s production accuracy and fluency in nonword repetition. Journal of Speech, Language, & Hearing Research, 47, 421 – 436. Elliott, L. L., Hammer, M. A., & Evan, K. E. (1987). Perception of gated, highly familiar spoken monosyllabic nouns by children, teenagers, and older adults. Perception & Psychophysics, 42, 150 – 157. Fennell, C. T. (2004). Infant attention to phonetic detail in word forms: Knowledge and familiarity effects. Unpublished doctoral dissertation. Fennell, C. T., & Werker, J. F. (2003a). Early word learners’ ability to access detail in well-known words. Language and Speech, 46, 245 – 264. Fennell, C. T., & Werker, J. F. (2003b). Infant attention to phonetic detail: Knowledge and familiarity effects. Paper presented at the 28th Annual Boston University Conference on Language Development, Boston. Ferguson, C. A. (1986). Discovering sound units and constructing sound systems: It’s child’s play. In J. S. Perkell & D. H. Klatt (Eds.), Invariance and variability in speech processes (pp. 36 – 51). Hillsdale, NJ: Erlbaum. Fernald, A., Swingley, D., & Pinto, J. P. (2001). When half a word is enough: Infants can recognize spoken words using partial phonetic information. Child Development, 72, 1003 – 1015. Fernald, A., Pinto, J. P., Swingley, D., Weinberg, A., & McRoberts, G. W. (1998). Rapid gains in speed of verbal processing by infants in the 2nd year. Psychological Science, 9, 228 – 231. Garlock, V. M., Walley, A. C., & Metsala, J. L. (2001). Age-of-acquisition, word frequency, and neighborhood density effects on spoken word recognition by children and adults. Journal of Memory & Language, 45, 468 – 492. Garnica, O. K. (1973). The development of phonetic speech perception. In T. E. Moore (Ed.), Cognitive development and the acquisition of language (pp. 215 – 222). New York: Academic Press. Gathercole, S. E. (1995). Is nonword repetition a test of phonological memory or long-term knowledge? It all depends on the nonwords. Memory & Cognition, 23, 83 – 94. Gathercole, S. E., Frankish, C. R., Pickering, S. J., & Peaker, S. (1999). Phonotactic influences on short-term memory. Journal of Experimental Psychology: Learning, Memory, and Cognition, 25, 84 – 95. Gillette, J., Gleitman, H., Gleitman, L., & Lederer, A. (1999). Human simulations of vocabulary learning. Cognition, 73, 135 – 176. Halle´, P. A., & de Boysson-Bardies, B. (1996). The format of representation of recognized words in infants’ early receptive lexicon. Infant Behavior and Development, 19, 463 – 481. Hollich, G. (in press). Combining techniques to reveal emergent effects in infants’ segmentation, word learning, and grammar. Brain and Language. Hollich, G., Jusczyk, P. W., & Luce, P. A. (2002). Lexical neighborhood effects in 17-month-old word learning. Paper presented at the 26th Annual Boston University Conference on Language Development, Boston. Houston, D. M., & Jusczyk, P. W. (2000). The role of talker-specific information in word segmentation by infants. Journal of Experimental Psychology: Human Perception & Performance, 26, 1570 – 1582.
Mapping Sound to Meaning
35
Ingram, D. (1989). First language acquisition: Method, description and explanation. Cambridge: Cambridge University Press. Johnson, E. K., & Jusczyk, P. W. (2001). Word segmentation by 8-month-olds: When speech cues count more than statistics. Journal of Memory and Language, 44, 1 – 20. Jusczyk, P. W. (1992). Developing phonological categories from the speech signal. In C.A. Ferguson, L. Menn, & C. Stoel-Gammon (Eds.) Phonological development: Models, research, implications (pp. 17 – 64). Timonium, MD: York Press. Jusczyk, P. W. (1993). From general to language-specific capacities: The WRAPSA model of how speech perception develops. Journal of Phonetics, 21, 3 – 28. Jusczyk, P. W. (1997). The discovery of spoken language. Cambridge, MA: MIT Press. Jusczyk, P. W., & Aslin, R. N. (1995). Infants’ detection of the sound patterns of words in fluent speech. Cognitive Psychology, 29, 1 – 23. Jusczyk, P. W., Friederici, A. D., Wessels, J. M., Svenkerud, V. Y., & Jusczyk, A. M. (1993). Infants’ sensitivity to the sound patterns of native language words. Journal of Memory and Language, 32, 402 – 420. Jusczyk, P. W., Hohne, E. A., & Bauman, A. (1999). Infant’s sensitivity to allophonic cues for word segmentation. Perception and Psychophysics, 61, 1465 – 1476. Jusczyk, P. W., Houston, D. M., & Newsome, M. (1999). The beginnings of word segmentation in English-learning infants. Cognitive Psychology, 39, 159 – 207. Jusczyk, P. W., Luce, P. A., & Charles-Luce, J. (1994). Infants’ sensitivity to phonotactic patterns in the native language. Journal of Memory and Language, 33, 630 – 645. Keen, R. (2003) Representation of objects and events: Why do infants look so smart and toddlers look so dumb? Current Directions in Psychological Science, 12, 79 – 83. Kuhl, P. K. (2004). Early language acquisition: Cracking the speech code. Nature Reviews Neuroscience, 5, 831 – 843. Kuhl, P. K., Williams, K. A., Lacerda, F., Stevens, K. N., & Lindblom, B. (1992). Linguistic experience alters phonetic perception in infants by 6 months of age. Science, 255, 606 – 608. Kuhl, P. K., Andruski, J. E., Chistovich, I. A., Chistovich, L. A., Kozhevnikova, E. V., Ryskina, V. L., Stolyarova, E. I., Sundberg, U., & Lacerda, F. (1997). Cross-language analysis of phonetic units in language addressed to infants. Science, 277, 684 – 686. Lipinski, J., & Gupta, P. (2005). Does neighborhood density influence repetition latency for nonwords? Separating the effects of density and duration. Journal of Memory and Language, 52, 171 – 192. Lively, S. E., Logan, J. S., & Pisoni, D. B. (1993). Training Japanese listeners to identify English /r/ and /l/: II. The role of phonetic environment and talker variability in learning new perceptual categories. Journal of the Acoustical Society of America, 94, 1242 – 1255. Markman, E. M. (1990). Constraints children place on word meanings. Cognitive Science, 14, 154 – 173. Marslen-Wilson, W. D. (1987). Functional parallelism in spoken word-recognition. Cognition, 25, 71 – 102. Mattys, S. L., & Jusczyk, P. W. (2001). Phonotactic cues for segmentation of fluent speech by infants. Cognition, 78, 91 – 121. Mattys, S. L., Jusczyk, P. W., Luce, P. A., & Morgan, J. L. (1999). Phonotactic and prosodic effects on word segmentation in infants. Cognitive Psychology, 38, 465 – 494. McMurray, B., & Aslin, R. N. (2005). Infants are sensitive to within-category variation in speech perception. Cognition, 95, B15 – B26.
36
Jenny R. Saffran and Katharine Graf Estes
McMurray, B., Tanenhaus, M., Aslin, R., & Spivey, M. (2003). Probabilistic constraint satisfaction at the lexical/phonetic interface: Evidence for gradient effects of withincategory VOT on lexical access. Journal of Psycholinguistic Research, 32, 77 – 97. Menyuk, P., Menn, L., & Silber, R. (1986). Early strategies for the perception and production of words and sounds. In P. Fletcher & M. Garman (Eds.), Language acquisition (2nd ed, pp. 198 – 222). Cambridge: Cambridge University Press. Metsala, J. L. (1997). An examination of word frequency and neighborhood density in the development of spoken-word recognition. Memory & Cognition, 25, 47 – 56. Mills, D. L., Prat, C., Zangl, R., Stager, C. L., Neville, H. J., & Werker, J. F. (2004). Language experience and the organization of brain activity to phonetically similar words: ERP evidence from 14- and 20-month-olds. Journal of Cognitive Neuroscience, 16, 1452 – 1464. Munakata, Y., McClelland, J. L., Johnson, M. H., & Siegler, R. S. (1997). Rethinking infant knowledge: Toward an adaptive process account of successes and failures in object permanence tasks. Psychological Review, 104, 686 – 713. Nittrouer, S., & Studdert-Kennedy, M. (1987). The role of coarticulatory effects in the perception of fricatives by children and adults. Journal of Speech and Hearing Research, 30, 319 – 329. Norris, D., McQueen, J. M., & Cutler, A. (2003). Perceptual learning in speech. Cognitive Psychology, 47, 204 – 238. Onishi, K. H., Chambers, K. E., & Fisher, C. (2002). Learning phonotactic constraints from brief auditory experience. Cognition, 83, B13 – B23. Pater, J., Stager, C. L., & Werker, J. F. (2004). The perceptual acquisition of phonological contrasts. Language, 80, 361 – 379. Pisoni, D. B., Lively, S. E., & Logan, J. S. (1994). Perceptual learning of nonnative speech contrasts: Implications for theories of speech perception. In J. C. Goodman & H. C. Nusbaum (Eds.), Development of speech perception: The transition from speech sounds to spoken words (pp. 121 – 166). Cambridge, MA: MIT Press. Saffran, J. R. (2001). Words in a sea of sounds: The output of infant statistical learning. Cognition, 81, 149 – 169. Saffran, J. R., Aslin, R. N., & Newport, E. L. (1996). Statistical learning by 8-month-old infants. Science, 274, 1926 – 1928. Saffran, J. R., Newport, E. L., Aslin, R. N., Tunick, R. A., & Barrueco, S. (1997). Incidental language learning: Listening (and learning) out of the corner of your ear. Psychological Science, 8, 101 – 105. Saffran, J. R., Werker, J. F., & Werner, L. A. (2006). The infant’s auditory world: Hearing, speech, and the beginnings of language. In R. Siegler & D. Kuhn (Eds.), Handbook of Child development (6th ed, pp. 58–108). New York: Wiley. Saffran, J. R., & Wilson, D. P. (2003). From syllables to syntax: Multilevel statistical learning by 12-month-old infants. Infancy, 4, 273 – 284. Schafer, G., & Plunkett, K. (1998). Rapid word learning by fifteen-month-olds under tightly controlled conditions. Child Development, 69, 309 – 320. Shvachkin, N. K. (1948/1973). The development of phonemic speech perception in early childhood. In D. I. Slobin (Ed.), Studies of child language development (pp. 91 – 127). New York: Holt, Rinehart, and Winston. Singh, L., Morgan, J. L., & White, K. S. (2004). Preference and processing: The role of speech affect in early spoken word recognition. Journal of Memory and Language, 51, 173 – 189. Stager, C. L., & Werker, J. F. (1997). Infants listen for more phonetic detail in speech perception than in word-learning tasks. Nature, 388, 381 – 382.
Mapping Sound to Meaning
37
Storkel, H. L. (2001). Learning new words: Phonotactic probability in language development. Journal of Speech, Language, and Hearing Research, 44, 1321 – 1337. Storkel, H. L. (2003). Learning new words II: Phonotactic probability in verb learning. Journal of Speech, Language, and Hearing Research, 46, 1312 – 1323. Storkel, H. L. (2004a). Do children acquire dense neighborhoods? An investigation of similarity neighborhoods in lexical acquisition. Applied Psycholinguistics, 25, 210 – 221. Storkel, H. L. (2004b). The emerging lexicon of children with phonological delays: Phonotactic constraints and probability in acquisition. Journal of Speech, Language, and Hearing Research, 47, 1194 – 1212. Studdert-Kennedy, M. (1986). Sources of variability in early speech development. In J. S. Perkell & D. H. Klatt (Eds.), Invariance and variability in speech processes (pp. 58 – 84). Hillsdale, NJ: Erlbaum. Swingley, D. (2002). On the phonological encoding of novel words by one-year-olds. Paper presented at the 27th Annual Boston University Conference on Language Development, Boston. Swingley, D. (2003). Phonetic detail in the developing lexicon. Language and Speech, 46, 265 – 294. Swingley, D. (2005a). Eleven-month-olds’ knowledge of how familiar words sound. Developmental Science, 8, 432 – 443. Swingley, D. (2005b). Statistical clustering and the contents of the infant vocabulary. Cognitive Psychology, 50, 86 – 132. Swingley, D., & Aslin, R. N. (2000). Spoken word recognition and lexical representation in very young children. Cognition, 76, 147 – 166. Swingley, D., & Aslin, R. N. (2002). Lexical neighborhoods and the word-form representations of 14-month-olds. Psychological Science, 13, 480 – 484. Swingley, D., & Aslin, R. N. (2005). Lexical competition in young children’s word learning. Unpublished manuscript. Thiessen, E. D. (2005). The role of distributional information in infants’ use of phonemic contrasts. Unpublished manuscript. Thiessen, E. D., & Saffran, J. R. (2003). When cues collide: Use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. Developmental Psychology, 39, 706 – 716. Turk, A. E., Jusczyk, P. W., & Gerken, L. (1995). Do English-learning infants use syllable weight to determine stress? Language and Speech, 38, 143 – 158. Vihman, M. M. (1996). Phonological development: The origins of language in the child. Malden, MA: Blackwell. Vihman, M. M., Nakai, S., DePaolis, R. A., & Halle´, P. A. (2004). The role of accentual pattern in early lexical representation. Journal of Memory and Language, 50, 336 – 353. Vitevitch, M. S., & Luce, P. A. (1998). When words compete: Levels of processing in perception of spoken words. Psychological Science, 9, 325 – 329. Vitevitch, M. S., & Luce, P. A. (2005). Increases in phonotactic probability facilitate spoken nonword repetition. Journal of Memory and Language, 52, 193 – 204. Vitevitch, M. S., Luce, P. A., Pisoni, D. B., & Auer, E. T. (1999). Phonotactics, neighborhood activation, and lexical access for spoken words. Brain and Language, 68, 306 – 311. Walley, A. C. (1988). Spoken word recognition by young children and adults. Cognitive Development, 3, 137 – 165. Walley, A. C. (1993). The role of vocabulary development in children’s spoken word recognition and segmentation ability. Developmental Review, 13, 286 – 350.
38
Jenny R. Saffran and Katharine Graf Estes
Walley, A. C., & Metsala, J. L. (1990). The growth of lexical constraints on spoken word recognition. Perception & Psychophysics, 47, 267 – 280. Waterson, N. (1971). Child phonology: A prosodic view. Journal of Linguistics, 7, 179 – 211. Werker, J. F., Cohen, L. B., Lloyd, V. L., Casasola, M., & Stager, C. L. (1998). Acquisition of word-object associations by 14-month-old infants. Developmental Psychology, 34, 1289 – 1309. Werker, J. F., & Curtin, S. (2005). PRIMIR: A developmental framework of infant speech processing. Language Learning and Development, 1, 197 – 234. Werker, J. F., & Fennell, C. T. (2004). From listening to sounds to listening to words: Early steps in word learning. In G. Hall & S. Waxman (Eds.), Weaving a lexicon (pp. 79–109). Cambridge, MA: MIT Press. Werker, J. F., Fennell, C. T., Corcoran, K. M., & Stager, C. L. (2002). Infants’ ability to learn phonetically similar words: Effects of age and vocabulary size. Infancy, 3, 1 – 30. Werker, J. F., & Tees, R. C. (1984). Cross-language speech perception: Evidence for perceptual reorganization during the first year of life. Infant Behavior and Development, 7, 49 – 63. Woodward, A. L., & Aslin, R. N. (1990). Segmentation cues in maternal speech to infants. Paper presented at the biennial meeting of the International Conference on Infant Studies, Montreal, Quebec. Woodward, A. L., & Markman, E. M. (1998). Early word learning. In D. Kuhn & R. S. Siegler (Eds.), Handbook of child psychology: Vol. 2. Cognition, Perception, and Language (pp. 371 – 420). New York: Wiley. Woodward, A. L., Markman, E. M., & Fitzsimmons, C. M. (1994). Rapid word learning in 13- and 18-month-olds. Developmental Psychology, 30, 553 – 566.
A DEVELOPMENTAL INTERGROUP THEORY OF SOCIAL STEREOTYPES AND PREJUDICE
Rebecca S. Bigler UNIVERSITY OF TEXAS AT AUSTIN, AUSTIN, TEXAS 78712, USA
Lynn S. Liben THE PENNSYLVANIA STATE UNIVERSITY, UNIVERSITY PARK, PENNSYLVANIA 16802, USA
I. INTRODUCTION
A. HISTORICAL ROOTS B. THE SCOPE AND SIGNIFICANCE OF THE PROBLEM C. PURPOSE AND OVERVIEW II. DEFINITIONS AND FORMS OF STEREOTYPING AND PREJUDICE III. AN ONTOGENETIC APPROACH TO STEREOTYPING AND PREJUDICE IV. CORE QUALITIES AND GOALS OF DEVELOPMENTAL INTERGROUP THEORY
A. B. C. D.
A DOMAIN GENERAL APPROACH A DEVELOPMENTAL APPROACH AN INTERACTIONIST APPROACH SUMMARY
V. THEORETICAL FOUNDATIONS OF DEVELOPMENTAL INTERGROUP THEORY
A. THE INTERGROUP PERSPECTIVE B. THE COGNITIVE-DEVELOPMENTAL PERSPECTIVE VI. CORE COMPONENTS OF DEVELOPMENTAL INTERGROUP THEORY
A. ESTABLISH THE PSYCHOLOGICAL SALIENCE [EPS] OF PERSON ATTRIBUTES B. CATEGORIZE ENCOUNTERED INDIVIDUALS [CEI] BY SALIENT DIMENSIONS C. DEVELOP STEREOTYPES AND PREJUDICES [DSP] CONCERNING SALIENT SOCIAL GROUPS D. APPLY STEREOTYPE FILTER [ASF] TO ENCOUNTERED INDIVIDUALS VII. PRINCIPLES OF THE FORMATION AND MAINTENANCE OF SOCIAL STEREOTYPES AND PREJUDICE
A. FACTORS AFFECTING THE ESTABLISHMENT OF THE PSYCHOLOGICAL SALIENCE [EPS] OF PERSON ATTRIBUTES B. FACTORS AFFECTING THE DEVELOPMENT OF STEREOTYPES AND PREJUDICES [DSP] C. INTERACTIONS AMONG FACTORS THAT SHAPE EPS AND DSP D. DEVELOPMENTAL AND INDIVIDUAL DIFFERENCE VARIABLES VIII. SUMMARY AND CONCLUSIONS REFERENCES
39 Advances in Child Development and Behavior R Kail (Editor)
ß 2006 Elsevier B.V. All rights reserved.
40
Rebecca S. Bigler and Lynn S. Liben
I. Introduction A. HISTORICAL ROOTS
The study of social stereotyping and prejudice has a long and venerable history in psychology, stemming from the 1920s and 1930s (Bogardus, 1925; Katz & Braly, 1933; LaPiere, 1936; Lippmann, 1922). It is a history that is often related to the quality of relations among social groups within and outside the United States (Duckitt, 1992; Samelson, 1978). World War II, for example, led to an interest in personality factors associated with racism and prejudice (Adorno et al., 1950). The civil rights movements of the 1920s and 1960s sparked research on racial attitudes (e.g., Bogardus, 1925; Koslin, Amarel, & Ames, 1969). Similarly, the feminist movement of the 1960s sparked a marked increase in research on gender stereotyping and discrimination (e.g., Bem, 1974; Weisstein, 1968). Importantly, the future will be characterized by a continued, and probably increased, need to understand the complex forces that shape social stereotyping and intergroup relations.
B. THE SCOPE AND SIGNIFICANCE OF THE PROBLEM
There are several reasons to expect that social stereotyping and prejudice will continue to be pressing problems within the U.S. and throughout the world. The population of the U.S. is becoming increasingly ethnically and racially diverse with minorities soon expected to comprise over half of the U.S. population. At the same time, cities, neighborhoods, and schools continue to be characterized by high, and often increasing, levels of racial and ethnic segregation (Lewis Mumford Center, 2001; Orfield, 1996, 2001). Battles over legal policies pertaining to social groups, including affirmative action, immigration, and civil rights for gays, lesbians, and transsexuals, will continue to occupy politicians and judges in the coming years. In addition, ethnic, racial, and other intergroup conflicts outside of the U.S. are likely to remain intense or even increase. In sum, complex social problems that are both directly and indirectly related to social stereotyping, prejudice, and discrimination are likely to continue to plague human society. From the perspective of societal functioning, it is therefore important for social science to continue to investigate the formation, function, and consequences of social stereotyping and prejudice. We, like others (Brewer, 1997), believe that social science can contribute in potentially important ways to reducing and eliminating intergroup bias, discrimination, and conflict in human societies.
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
41
Furthermore, from the perspective of basic inquiry into human cognitive and social processes, the domain is an important one to study. The formation and operation of group processes can provide tests of a wide range of theoretical positions and hypothetical mechanisms related to human development and behavior.
C. PURPOSE AND OVERVIEW
We propose a new theory of the formation of social stereotypes and prejudice among children: developmental intergroup theory (DIT). We identify mechanisms and rules that govern the processes by which children single out groups as targets of stereotyping and prejudice, and by which children learn both the characteristics (i.e., stereotypes) and affective responses (i.e., prejudices) that are associated with these groups in their culture. These rules are hypothesized to be relevant for the formation of stereotyping and prejudice for any human population. Thus, we argue that the qualities and propagation of stereotypes within given societies are open to a good deal of social control, and that children’s stereotyping and prejudice may be regulated by applying the tenets of our theory. In the second section of the chapter, we define the terms ‘‘stereotyping’’ and ‘‘prejudice’’ to specify the phenomena that our theory is designed to address. In the third section, we outline the importance of considering the ontogenetic emergence of stereotyping and prejudice. In particular, we argue that understanding the formation of stereotypes and prejudice among children is necessary if we are to explain the origins, operation, and maintenance of stereotypes and prejudice, both as they affect interactions among groups and influence the lives of individuals. In the fourth section, we describe the core characteristics and goals of our theory. In the fifth section, we highlight key components of intergroup and cognitive-developmental theories that provide foundations for our theoretical model. In the sixth section, we describe the mechanisms that we hypothesize account for the formation of stereotypes and prejudice among children as a group, emphasizing how our theory differs from other major theories of stereotyping and prejudice. In the seventh section, we review the environmental, developmental, and individual differences that contribute to variations in stereotyping and prejudice across children. In the eighth and concluding section, we summarize key points, and discuss briefly the implications of developmental intergroup theory both for research and social action to reduce stereotyping and prejudice.
42
Rebecca S. Bigler and Lynn S. Liben
II. Definitions and Forms of Stereotyping and Prejudice Although many and varied, most definitions of stereotype include a focus on beliefs about social groups, with many specifying, in addition, that such beliefs are widely shared (e.g., Brown, 1986) or are inaccurate (e.g., Milner, 1983). A definition of a stereotype that we find particularly useful holds that a stereotype is a ‘‘cognitive structure that contains the perceiver’s knowledge, beliefs, and expectancies about some human group’’ (Hamilton & Trolier, 1986, p. 133). Stereotypes might be targeted to any social group and, indeed, the basis and content of social stereotypes have fluctuated greatly across time and geographical region. So, for example, stereotypes of Irish, Italian, Chinese, and African Americans have been common throughout U.S. history, but the content of these stereotypes has changed in significant ways over time (e.g., Aarim-Herlot, 2003; Ignatiev, 1995). Social stereotyping is frequently, although not always, accompanied by prejudice, a more affectively laden facet of individuals’ thinking about groups. Ehrlich (1973) argued that most writers’ definitions shared the core idea that prejudice is ‘‘an unfavorable attitude toward others because of their membership in a particular group.’’ Many contemporary writers share this basic tenet (Aboud, 1988), although some add the additional claim that such feelings are irrational or unreasonable (e.g., Fishbein, 2002; Milner, 1983). Stereotyping and prejudice are often tightly interwoven. Groups associated with highly negative attributes (e.g., dumb, lazy) are likely to be regarded with prejudice. The distinction is useful, however, because individuals sometimes show prejudice toward a social group while simultaneously endorsing positive, or few, stereotypes concerning the group. For example, many individuals show biases against Asians, despite believing them to be intellectually gifted, hardworking, and polite (Tuan, 1998). Alternatively, individuals sometimes evidence positive affect (or a lack of prejudice) toward social groups, despite endorsing stereotypes of the groups (a particularly dramatic example is provided in a speech by White, 1998). Both stereotyping and prejudice can involve one or both of two underlying processes. The first is an automatic process, referred to as implicit attitudes, which involves unconscious stereotyping and prejudice toward groups. The second is a controlled process, referred to as explicit attitudes, which concerns conscious stereotyping and prejudice toward groups (see Greenwald & Banaji, 1995). Just as individuals’ stereotypes and prejudice are not always congruent, implicit and explicit forms of each construct (i.e., stereotype, prejudice) are not always congruent. So, for example, a given individual may hold negative (or biased) implicit attitudes (stereotypes or prejudices) toward some group, perhaps based on knowledge of the cultural stereotypes of that group, while simultaneously endorsing positive (or non-biased) explicit attitudes
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
43
(stereotypes or prejudices) about that same target group (Greenwald & Banaji, 1995). Devine (1989) has suggested that, among adults, implicit attitudes are generally well-established beliefs acquired during childhood, whereas explicit attitudes represent more recent, consciously constructed views of social groups. This theoretical position underscores the necessity of studying stereotyping and prejudice developmentally, even if one is ultimately interested in their operation in adulthood. We next outline several additional arguments for the importance of taking a developmental perspective to study stereotyping and prejudice.
III. An Ontogenetic Approach to Stereotyping and Prejudice Our theory focuses on mechanisms by which stereotypes and prejudices form during childhood. Attitudes toward many social groups form early in development. By the age of five, for example, children show evidence of prejudice and stereotyping based on race (Aboud, 1988), gender (Ruble & Martin, 1998), age (Seefeldt et al., 1977), physical ability status (Westervelt & Turnbill, 1980), body weight (Lerner & Gellert, 1969), and facial attractiveness (Langlois, 1995). The early emergence of stereotyping and prejudice has several important consequences. First, although stereotyping and prejudice have potentially serious consequences at all ages, the effects of bias may be especially serious during childhood, when core human characteristics, values, and aspirations such as self-esteem, social identity, and academic and vocational goals are being formed. Social stereotypes and prejudice affect children’s cognition and behavior in a wide variety of domains, including their ability to remember information (Averhart & Bigler, 1997; Bigler & Liben, 1990, 1993; Liben & Signorella, 1980; Martin & Halverson, 1983); occupational judgments and goals (Bigler, Averhart, & Liben, 2003; Liben, Bigler, & Krogh, 2002); academic selfefficacy and esteem (Eccles et al., 1993; Osborne, 1997); peer preferences (Martin & Fabes, 2001); and activity or object preferences (Coker, 1984; Serbin, Powlishta, & Gulko, 1993). The dangers of self-fulfilling prophecies (such that individuals begin to behave in ways that confirm others’ expectations of them, see Klein & Snyder, 2003; Weinstein, Gregory, & Strambler, 2004) and stereotype threat (such that performance is hindered because individuals worry that others’ views of their own group will be biased by stereotyped perceptions, see Steele, 1997) imply that it is vital to understand how and why children come to endorse stereotypic beliefs about social groups. Second, inhibiting the formation of bias may be much easier and more cost-effective than working to undo social stereotyping and prejudice after they have been formed. Attitudes toward social groups are often highly resistant to
44
Rebecca S. Bigler and Lynn S. Liben
change. Interventions designed to change children’s gender and racial attitudes are limited in effectiveness (see Banks, 1995; Bigler, 1999; Liben & Bigler, 1987, for reviews). Changing adults’ social stereotypes and prejudices has also proven to be considerably more difficult than was often initially expected (see Hewstone & Brown, 1986). Third, accounts of stereotyping and prejudice posit that the developmental history of individuals’ thinking about social groups affects the functioning of such attitudes during adulthood. Thus, in addition to shaping core human characteristics that arise during childhood, vestiges of individuals’ earliest stereotypes and prejudices may influence their behavior toward others through the life course. Specifically, as mentioned earlier, Devine (1989) has argued that adults’ implicit attitudes toward social groups (e.g., race and gender) may represent beliefs formed in childhood, and that these beliefs are deeper, more entrenched, and less available to consciousness than more recently acquired beliefs. So, for example, although individuals’ beliefs about race may evolve during adulthood, perhaps as a result of formal education or encounters with peers from diverse racial backgrounds, beliefs and affect acquired during childhood are now believed to persist. The latter may continue to influence individuals’ race-related judgments and behaviors, often without those individuals’ own awareness. Understanding the processes involved in the formation of social stereotypes and prejudice in childhood is, therefore, critical to understanding behavior across the life course. The purpose of this chapter is thus to offer a new theoretical perspective on the formation and functioning of social stereotypes in children that is integrative along three dimensions. First, we integrate research on diverse forms of stereotyping (e.g., gender, race, age), with the goal of providing a broad account of the factors that produce social stereotyping and prejudice. Second, we integrate social psychological and developmental views on social stereotyping and prejudice, drawing heavily from intergroup theories developed and tested by social psychologists, as well as from developmental accounts of specific forms of stereotyping. Third, within our developmental approach, we integrate social and cognitive domains, arguing that cognitive development and environmental factors interact to produce stereotyping and prejudice. The theory draws on programs of empirical research by Bigler and colleagues designed to study the emergence of social stereotyping and prejudice in children (e.g., Bigler, 1995; Bigler, Brown, & Markell, 2001; Bigler, Jones, & Lobliner, 1997; Brown & Bigler, 2002) and by Liben and colleagues designed to study the role of cognitive-developmental processes on the operation and maintenance of stereotyping and prejudice (e.g., Bigler & Liben, 1990, 1992; Liben & Bigler, 1987; Liben & Signorella, 1980, 1993; Signorella, Bigler, & Liben, 1993; Signorella & Liben, 1984).
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
45
IV. Core Qualities and Goals of Developmental Intergroup Theory The goal of developmental intergroup theory is to predict under what conditions, and to what extent, children develop stereotypes of, and prejudices toward, particular social groups. That is, the theory outlines the endogenous and exogenous conditions that promote or retard the development of specific stereotypes and prejudices among children. In doing so, our theory seeks to explain stereotyping and prejudice: (a) across multiple domains, (b) within a developmental framework, and (c) as a function of interactions between organismic and environmental factors. We discuss each of these characteristics in more detail below.
A. A DOMAIN GENERAL APPROACH
Developmental intergroup theory seeks to explain why children develop stereotypes and prejudice on the basis of some human characteristics (e.g., race, age, gender, and physical abilities) rather than on the basis of other human characteristics that are simultaneously available. Thus, we have developed a theoretical account of mechanisms of stereotyping and prejudice that is domain general (rather than domain specific). The only other theoretical orientation that approaches the problem of stereotyping and prejudice among children in this domain-general way is learning theory (Bandura, 1977; Skinner, 1969). Traditional and social learning theories offered a set of mechanisms such as classical conditioning, operant conditioning, and modeling that were used to explain the acquisition of all forms of stereotyping and prejudice. For reasons discussed in more detail later, such approaches were abandoned during the 1970s and 1980s in favor of other approaches. During those two decades, developmental research on social stereotyping and prejudice became highly compartmentalized within specific domains, a trend that continues. That is, with few exceptions (e.g., Katz, 1983; Katz & Kofkin, 1997), researchers typically address only a single category of social stereotyping in their theoretical and empirical work. Within their literature reviews, researchers rarely cite empirical work outside of their domain of interest. The tendency to read and work within a single domain has important consequences, including the tendency to propose domain-specific (rather than domain-general) theoretical accounts of stereotyping and prejudice. For example, Williams and Morland (1976) proposed that the innate fear of darkness drives young children to develop prejudice toward African Americans, and Maccoby (1990) argued that sex differences in play styles drive young children to develop gender-biased
46
Rebecca S. Bigler and Lynn S. Liben
peer preferences. Both explanations are obviously inapplicable to other forms of stereotyping and prejudice. The compartmentalization that characterizes the developmental literature on stereotyping and prejudice contrasts sharply with the literature on stereotyping and prejudice within social psychology. Many, perhaps even most, of the major theories that address social stereotyping and prejudice among adults do so quite broadly, proposing domain-general mechanisms such as categorization and illusory correlation (see Brewer, 1979, 1991; Fiske & Taylor, 1991; Hamilton & Trolier, 1986). The empirical work addressed in these mechanisms typically explores their operation within two or more domains (e.g., gender, race, age, and sexual orientation) simultaneously. Thus, in the service of formulating an integrative and parsimonious theory, our account draws from previous developmental work on stereotyping and prejudice that has been conducted to address individual domains (e.g., racial, gender, age- and attractiveness-based stereotyping). Our approach holds that the formation of stereotypes and prejudice along any particular attribute (e.g., race, gender, hair color)—as well as the failure to form stereotypes or prejudice along other attributes—can be explained by the operation of one or more of a discrete set of environmental and organismic variables. Broad approaches such as ours are sometimes criticized on the ground that they fail to recognize important differences across types of stereotyping and prejudice (e.g., racism, sexism, ageism). In this context, it is important to note explicitly that we are not arguing that different types of stereotyping and prejudice are interchangeable in all their causes, contents, or consequences. But to recognize differences is not tantamount to rejecting similarities. In fact, our own initial review of the literatures on multiple forms of prejudice was undertaken with the goal of identifying variations that might have important implications for theoretical accounts of attitude formation and change. Despite this goal, however, our review led us to identify a discrete set of causal factors that appeared to be operating in all forms of stereotyping and prejudice, even though each causal factor appeared to operate to different degrees across specific types of social stereotyping (e.g., racial, gender, sexual orientation), within some particular historic and cultural context. To illustrate, as we explain in greater detail subsequently, verbal labeling of social groups is hypothesized to play a causal role in creating stereotypes; when present, labeling invariably produces stereotyping, and it does so in a dosedependent manner. In contemporary U.S. culture, for example, labeling may be expected to play a relatively greater role in producing gender stereotypes than racial stereotypes because children typically encounter more frequent labeling of gender than race (e.g., children commonly hear classroom greetings such as ‘‘Good morning boys and girls,’’ but never the parallel ‘‘Good morning Whites and Latinos’’). Again, by way of example, segregation of social groups
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
47
is hypothesized to play a universal causal role in stereotyping, but in contemporary U.S. culture, it may be expected to play a relatively greater role in producing racial stereotypes than gender stereotypes because children typically experience higher levels of racial than gender segregation in their environment (e.g., neighborhoods and schools are far more likely to be segregated by race than by sex). We propose that this broad approach of identifying universal causal factors is valuable because it allows predictions about how changes (increases or decreases) in the hypothesized causal factors should affect stereotyping and prejudice. The tenets of developmental intergroup theory lead one to predict, for example, that an increase in routine racial labeling (e.g., ‘‘Good morning, Black and White children’’) would lead to an increase in children’s racial bias, and that an increase in gender segregation (e.g., routinely providing single-sex educational and transportation facilities) would lead to an increase in children’s gender bias.
B. A DEVELOPMENTAL APPROACH
The second characteristic of developmental intergroup theory is that it is a truly developmental account of social stereotyping and prejudice insofar as it considers how developmental constraints and advances in children’s cognitive skills affect their construction of social categories and their meanings. As noted earlier, learning theory has served as the theoretical foundation for much theorizing about children’s stereotyping by gender, race, age, and physical abilities (e.g., Williams & Morland, 1976). Although learning theory has the capacity to address social stereotyping in a broad manner (i.e., positing that the same mechanisms of reinforcement and punishment operate across multiple domains), it overlooks development in the sense that the proposed mechanisms involved in forming social stereotypes are assumed to remain the same across all ages. However, this assumption is untenable, given empirical evidence indicating that cognitive skills affect the formation, function, and revision of social stereotypes (e.g., Aboud, 1988; Ruble & Martin, 1998). A more detailed discussion of the ways in which our theory draws on developmental perspectives is presented in a later section of this chapter.
C. AN INTERACTIONIST APPROACH
The third characteristic of the theory is that it is deeply interactionist in the sense that it holds that any behavior is the joint outcome of both what the individual brings to the environment and what the environment offers to the individual. Our interpretation of ‘‘interaction’’ is not simply that both
48
Rebecca S. Bigler and Lynn S. Liben
individuals and environments contribute to outcomes in some additive sense, but rather that there are profound reconfigurations of the functional environment depending upon the individual’s own qualities. An emphasis on individual– environment interactions is evident in various theories and research paradigms in a wide range of behavioral science. Within social psychology, for example, much of the conceptualization of the way that individuals process or remember social situations involves individuals’ initial attitudes and category structures (Kurzban, Tooby, & Cosmides, 2001; Ruble & Stangor, 1986; Taylor et al., 1978). Similarly, within cognitive psychology, the way that individuals process, infer, and recall information is said to depend on their cognitive schemata. Illustrative is the classic work by Bransford and Franks (1971) who demonstrated that people’s recall of text differs dramatically depending on their overall beliefs about the situation (as established by the title of the text) or by Stevens and Coupe (1978) who showed that people judge directions between cities differently depending on their overriding schemata about the locations of the states in which the named cities are located. Within developmental psychology, theories differ significantly in the degree to which they are interactionist (see Overton, 1998; Sameroff & Chandler, 1978). Some theories are not interactionist at all, but instead posit that developmental outcomes are controlled entirely by individual factors or environmental factors (‘‘main effect’’ models). Others are interactionist, but only in a statistical sense. In these approaches, outcomes are thought to be influenced by both individual and environmental qualities, but the two are taken as contributing in independent, additive ways (‘‘statistical interactionist’’ models). Other theories are more fundamentally interactionist, holding that individual and environmental qualities interact so that each quality is affected by the other and these entwined processes then affect the outcomes (‘‘transactional’’ models). At the more specific level of a focus on developmental outcomes related to stereotyping and prejudice, several theorists have considered the interaction of organismic and environmental factors in explaining the origins and developmental path of social stereotyping. In particular, both we (e.g., Bigler, 1995; Bigler & Liben, 1992; Liben & Bigler, 1987, 2002; Liben & Signorella, 1980, 1993; Signorella, Bigler, & Liben, 1997; Signorella & Liben, 1984) and others (e.g., Aboud, 1988; Bem, 1983; Katz, 1983; Levy & Dweck, 1999), have argued that social stereotyping in children can be understood only by examining both the environments in which children are raised and the individual characteristics (e.g., skills, goals, needs) that children bring to interpreting and interacting with that environment. In our formulation of developmental intergroup theory, we have drawn from this earlier work as well as from both cognitive-developmental and socialdevelopmental literatures more generally to identify individual qualities that should be expected to interact with environmental qualities in the formation,
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
49
maintenance, and amelioration of stereotypes and prejudice. For example, and as explained at greater length later in this chapter, even children of the same chronological age vary in their ability to categorize, and these differences may be expected to affect the development of stereotypes in identical environmental conditions. Similarly, individual characteristics from the socio-emotional realm such as differences in self-esteem (Bigler, Jones, & Lobliner, 1997; Crocker & Schwartz, 1985) or differences with respect to which lay theories of groups and human characteristics children hold (Hirschfeld, 1996; Levy & Dweck, 1999; Levy, Stroessner, & Dweck, 1998) may be expected to interact with environmental conditions to produce stereotyping and prejudice. Furthermore, we argue that even an interactionist perspective—as that term is commonly interpreted—is incomplete. It is not simply that there are profiles of children’s characteristics that interact with profiles of particular environmental characteristics. It is also that the characteristics of individual children lead them to select certain types of intergroup environments in which to interact, and then to interpret their interactions in those environments differently, hence shaping their attitudes still further.
D. SUMMARY
In sum, developmental intergroup theory is a domain-general account of stereotyping and prejudice formation in the epistemological tradition of transactional theories of development. That is, the theory addresses both the environmental conditions necessary for stereotyping and prejudice to arise on the basis of some particular human characteristic and the ways in which the developing child’s cognitive progressions and personal proclivities modify the way that the environmental aliment is assimilated. The theory is also addressed to individual differences among children in their tendency to use particular social categories (e.g., gender, race) as the basis for stereotyping others. In the following section, we outline the more specific theoretical traditions in social and developmental psychology from which our theory draws.
V. Theoretical Foundations of Developmental Intergroup Theory A. THE INTERGROUP PERSPECTIVE
The major theoretical foundation for our account of stereotyping is the collection of perspectives on group relations that we refer to as ‘‘intergroup theory.’’ This umbrella term includes several formal theories such as Tajfel and Turner’s (1986) social identity theory and Turner’s (1987) self-categorization
50
Rebecca S. Bigler and Lynn S. Liben
theory, as well as less formal proposals stemming from empirical studies on intergroup relations (e.g., Brown, 1995; Sherif & Sherif, 1953). Work on intergroup theory was initially sparked by a study of the formation of intergroup hostility among children. In 1954, Muzafer Sherif and his colleagues conducted a classic field study at a boys’ summer camp in Oklahoma called Robbers Cave (Sherif et al., 1954/1961). In this study, 11-year-old boys were first assigned arbitrarily to two different groups, christened the ‘‘Eagles’’ and the ‘‘Rattlers’’ by the boys. After each group had formed and interacted in isolation from the other group, the two groups were brought together under conditions designed to maximize competition. Measures of peer preferences, trait ratings, and group evaluations revealed that the boys showed consistent biases favoring members of their own group. In the final week of the summer camp, the two groups were brought together under conditions designed to promote cooperation and unity, and as a consequence, intergroup hostility declined dramatically. Only a few intergroup studies since that initial early work have been conducted with children (e.g., Abrams, 1985; Billig & Tajfel, 1973; Tajfel & Billig, 1974), and even these have been focused on adolescents (rather than on younger children), and have been framed within a social psychological (rather than a developmental) perspective. It is ironic that 50 years after Sherif and colleagues undertook their widely read and praised work with 11-year-old children, so little is known about the formation of intergroup attitudes during childhood. Nevertheless, intergroup theory has remained popular in the socialpsychological literature (for reviews, see Brewer & Brown, 1998; Messick & Mackie, 1989): Here, we highlight a few seminal theoretical and empirical contributions. In 1971, Henri Tajfel and his colleagues proposed that the mere act of categorizing individuals into social groups was sufficient to produce intergroup prejudice and discrimination. This simple assignment of people into groups—in cases in which the social categories are entirely uninformative, irrelevant, or completely unfounded—has been referred to as the ‘‘minimal group’’ condition. Its consequences have been well documented within the social psychological literature (see Brewer, 1979; Hamilton & Trolier, 1986; Messick & Mackie, 1989). For example, mere social categorization has been shown to produce increased perception of between-group differences and within-group similarity (e.g., Doise, Deschamps, & Meyer, 1978), increased perception of outgroup homogeneity (e.g., Park & Rothbart, 1982; Quattrone & Jones, 1980), and increased intergroup bias, including both ingroup favoritism and outgroup discrimination (e.g., Allen & Wilder, 1975; Brewer & Silver, 1978). In addition, intergroup manipulations affect the processing of information related to group members. Individuals show better memory for negative behaviors performed by outgroup than by
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
51
ingroup members; they are also more likely to attribute negative behaviors performed by outgroup members to dispositions (rather than situations), whereas the reverse is true for ingroup members (see Hamilton & Trolier, 1986). Developmental and social psychological theorists have increasingly argued that intergroup processes are also important components of stereotyping and prejudice in children (Aboud, 1988; Fishbein, 2002; Katz, 1983; Martin & Halverson, 1981; Serbin, Powlishta, & Gulko, 1993). Some empirical work also indicates that intergroup processes are involved in children’s social stereotyping. Within the domain of race, for example, developmental work indicates that European American children perceive more similarity within races than between races (e.g., Doyle & Aboud, 1995), and that individual differences in perceived between-race similarity are related to prejudice (e.g., Aboud & Mitchell, 1977; Doyle & Aboud, 1995). Within the domain of gender, Powlishta (1995a,b, 2004) has shown that young children’s gender attitudes are especially likely to be marked by ingroup biases, rather than by adherence to cultural views about gender. That is, young children are more likely than older children to associate positive traits (e.g., brave, smart, neat) with their own gender than with the other gender, even when these traits are associated with the other gender by adults within their culture. In addition, children have been shown to have strong preferences for same-sex (i.e., ingroup) playmates and friends (Martin & Fabes, 2001). Thus, previous theoretical and empirical work suggests that intergroup processes are implicated in stereotyping and prejudice. Although humans may inherently be ready to develop stereotypes and intergroup bias (Fishbein, 2002), these processes are not automatic or inevitable. Rather, there are large individual differences in children’s attitudes toward particular social groups (Signorella, 1987). Some children endorse many more gender and racial stereotypes than do others (Aboud, 1988; Bigler & Liben, 1993), and some children prefer samegender and same-race peers much more strongly than others (Martin & Fabes, 2001). Finally, some outgroups are more often the targets of bias than are other outgroups. In sum, knowledge that one is a member of a group is often, but not always, associated with stereotyping and prejudice.
B. THE COGNITIVE-DEVELOPMENTAL PERSPECTIVE
As noted earlier, one limitation of intergroup theory is that it fails to address issues of development. Implicitly, the individual is conceived of as a static entity, assumed to respond to social categorization consistently across the life course. This approach is undoubtedly, at least in part, a consequence of the fact that intergroup researchers have focused nearly exclusively on adolescent and adult samples. When a broader age range is sampled, however, it becomes
52
Rebecca S. Bigler and Lynn S. Liben
more apparent that ontogenetic change is likely to affect processes relevant to stereotypes and prejudice. Given that cognitive processes are involved in the formation of stereotypes about social groups, cognitive-developmental change may be expected to affect stereotyping and prejudice. One perspective that we have found useful for suggesting relevant arenas of cognitive change is Piagetian theory (e.g., see Piaget, 1970). Although the theory is undoubtedly unduly conservative with respect to the age at which various cognitive skills are said to emerge and develop, two broad aspects of this theory are nevertheless applicable. One relevant aspect of this theoretical tradition is that it highlights ways in which children’s logical skills change with development. Although most commonly studied in relation to children’s educational achievements (e.g., progress in mathematics and science), changes in logical reasoning skills are also likely to shape children’s thinking about, and behavior toward, social groups. Among the cognitive progressions that may be relevant to the formation of stereotypes and prejudices are those associated with perspective-taking (Piaget & Inhelder, 1956), probability judgments (Piaget & Inhelder, 1975), and nominalism (Piaget, 1929). We believe that children’s developing classification abilities are particularly important for the formation of stereotypes and we discuss this link in detail in Section VII of this chapter. The second relevant aspect of Piagetian theory is its constructivist nature. When applied to social stereotyping and prejudice, this perspective implies that children’s social stereotypes and prejudices are unlikely to arise directly and automatically from those held by adults in the environment. Instead, children actively create cognitive schemata about social categories and then attach affective and cognitive components to those categories. Furthermore, after categories or schemata are in place, they are likely to be used as filters for newly encountered information. Thus, children may be expected to ignore, forget, or distort newly encountered information in accord with existing cognitive schemata. Constructive processes thereby serve to strengthen the existing beliefs even when counter-stereotypic information is encountered. This process should lead children to conserve rather than to revise their views of social groups, in part accounting for why efforts to intervene in children’s existing stereotypes and prejudices have been of limited success (e.g., see Liben & Bigler, 1987). Having highlighted the major relevant concepts drawn from intergroup theory and from cognitive-developmental theory, we now turn to a description of core components of developmental intergroup theory.
VI. Core Components of Developmental Intergroup Theory Developmental intergroup theory posits that social stereotyping and prejudice arise in children as the result of constructivist cognitive-developmental
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
53
processes which operate in environments that differentially foster the use of certain attributes as the basis of categorizing people into groups. The theory is summarized graphically in Figures 1 and 2. Figure 1 shows the major components relevant to the formation of stereotypes and prejudice. Figure 2 concerns the maintenance or modification of stereotypic views and prejudice. Note that the division into two figures is for ease of presentation; as evident from the components that are repeated across figures, in actuality, the processes are connected. In this section, we describe the four core component processes of developmental intergroup theory (identified by the double-bordered rectangles of the figures). The first three, shown in Figure 1, concern (a) the establishment of the psychological salience [EPS] of person attributes, (b) categorization of encountered individuals [CEI] along a salient dimension, and (c) the development of stereotypes and prejudices [DSP] concerning salient social groups. The fourth, shown in Figure 2, addresses (d) the application of a stereotype filter [ASF] when individuals are encountered. In the course of describing these four major processes, we mention briefly some of the factors hypothesized to affect these processes (see remaining constructs depicted in Figures 1 and 2), although a full discussion of these contributing factors is postponed until the next section of the chapter.
A. ESTABLISH THE PSYCHOLOGICAL SALIENCE [EPS] OF PERSON ATTRIBUTES
Virtually all approaches to social stereotyping rest on the foundation of categorization. However, there are almost endless bases on which humans might be parsed into groups. How and why do some bases for classification—and not others—come to be viewed by children as meaningful and thus be used to sort individuals? The first component of developmental intergroup theory depicted in Figure 1 (see EPS rectangle) is addressed to the process by which some person attributes become salient for categorization. Drawing from Piagetian theory, we assume that individuals are born with a predisposition to strive to understand their world, including the principles that might explain human behavior. Thus, we posit that children actively seek to determine which of the many possible bases for classification available for a given individual, or groups of individuals, are important. Characteristics in the environment thus interact with characteristics of the child’s cognitive system to render some, but not other, bases of categorization psychologically salient or meaningful. Our theory differs, therefore, from other major accounts of stereotyping and prejudice in several important ways. First, we argue that classification on
54
Self-esteem
PROPORTIONAL GROUP SIZE Drive to Classify
ESTABLISH PSYCHOLOGICAL SALIENCE OF PERSON ATTRIBUTES [EPS]
EXPLICIT LABELING AND USE
IMPLICIT USE
CATEGORIZE ENCOUNTERED INDIVIDUALS BY SALIENT DIMENSION [CEI]
Classification Skill
ESSENTIALISM
INGROUP BIAS
Internal Cognitive Motivation
Internal Affective Motivation
DEVELOP STEREOTYPES AND PREJUDICES CONCERNING SALIENT SOCIAL GROUPS [DSP]
EXPLICIT ATTRIBUTIONS Internalize Environmental Aliment
Fig. 1. The processes involved in the formation of social stereotypes and prejudice.
GROUP-ATTRIBUTE COVARIATION OR IMPLICIT ATTRIBUTIONS
Organize Environmental Aliment
STEREOTYPE ENCOUNTERED INDIVIDUALS entailing • assignment to categories • associating stereotypic attributes
Rebecca S. Bigler and Lynn S. Liben
PERCEPTUAL DISCRIMINABILITY
STRENGTHEN EXISTING STEREOTYPE
DEVELOP STEREOTYPES AND PREJUDICES CONCERNING SALIENT SOCIAL GROUPS [DSP]
STEREOTYPE ENCOUNTERED INDIVIDUALS [EI] entailing • assignment to categories • associating stereotypic attributes
FORGET OR DISTORT ENCOUNTERED INDIVDUAL TO MAKE CONSISTENT
IS ENCOUNTERED INDIVIDUAL STEREOTYPIC??
NO
APPLY STEREOTYPE FILTER TO ENCOUNTERED INDIVIDUALS? [ASF]
Multiple Classification
DIFFERENTIATE OR ELABORATE STEREOTYPE
YES
NO
PROCESS ENCOUNTERED INDIVIDUAL AS SUBTYPE
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
YES
Fig. 2. Processes involved in the maintenance or modification of social stereotypes and prejudice. 55
56
Rebecca S. Bigler and Lynn S. Liben
the basis of any one particular ‘‘privileged’’ dimension does not occur reflexively as a result of an innate predisposition, as is suggested in some evolutionary theories. For example, some evolutionary theorists have argued that attention to social categories is driven by motivational concerns related to survival and reproduction. In this view, specific perceptual cues (race, gender, attractiveness) are believed to activate goals related to survival or reproduction, goals that are, in turn, thought to shape cognitive processes such as prejudice (e.g., Neuberg et al., 2004; Schaller, Park, & Faulkner, 2003). In contrast, we do not believe that any particular human trait invariably and inevitably serves as a salient dimension of categorization as a result of evolved biological mechanisms promoting survival of individuals and the species. Instead, we argue that survival is likely to have been promoted by the evolution of a highly flexible cognitive system that—rather than providing ‘‘hardwired’’ biases for categorization—instead leads children to construct hypotheses about which bases of classification are appropriate within a given context. One implication of this view is that culture and children’s cognitive skills interact to shape the particular dimensions children come to deem to be significant. Cultural environments might be explicitly structured to make some classification schemes psychologically salient. Contemporary U.S. governmental agencies require, for example, that some qualities (but not others) be labeled on birth certificates, identification cards, and the like; Nazi German rulers required Jews to wear as many as three yellow stars so that their ‘‘race’’ was observable from every possible angle (Klein, 1957). Other cultures might create and highlight categories unintentionally. For example, although perhaps not intended to cause gender discrimination, American culture provides young children with many cues that distinguish people by sex (e.g., differentiating hair and clothing styles). To the degree that there are consistencies across cultures with respect to emphasizing some kinds of distinctions (as in commonly dividing groups by gender) coupled with some universal cognitive constraints (e.g., limits on young children’s flexibility in categorizing), there are likely to be some consistencies across cultures in the appearance of some forms of stereotyping and prejudice. At the same time, to the degree that environments differ with respect to which human dimensions are made salient, there are likely to be variations across cultures in which specific forms of stereotyping and prejudice appear. Second, we argue that classification on the basis of some particular dimension is not—as proposed in traditional and social learning theories (Allport, 1954; Bandura, 1977; Skinner, 1969)—largely the result of individuals’ histories of reinforcement or punishment for using this dimension, nor of observing and then imitating others’ expressions of stereotyping and prejudice on the basis of this dimension. Instead, we posit that children search for categories that are appropriate to, and useful within, their culture. The environment does, indeed, play a critical role in shaping children’s social stereotyping and prejudice, but its
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
57
role involves providing children with information about social groups that: (a) serves to make some bases for classification salient, and (b) provides the aliment, or the raw material, from which children construct the meaning of groups. So, for example, a father who tells his child to ‘‘Ask that lady if we are in the correct line’’ would not be considered by traditional or social learning theorists to be shaping his child’s gender stereotyping. The statement involves neither reward nor punishment, nor does it convey anything about the father’s gender attitudes (e.g., whether he holds more or less egalitarian views of gender). Nonetheless, we posit that this type of statement plays an important role in shaping children’s gender attitudes because it serves to make gender salient and leads children to devise hypotheses about the meaningfulness of gender. B. CATEGORIZE ENCOUNTERED INDIVIDUALS [CEI] BY SALIENT DIMENSIONS
As already noted, we share the assumption of many theorists that children are driven to categorize stimuli in an attempt to structure knowledge and reduce the complexity of operating within the world (Allport, 1954; Mervis & Rosch, 1981). Thus, we posit that children will classify encountered individuals into groups (as depicted in the CEI rectangle of Figure 1) using those dimensions that are psychologically salient. The degree and way in which the categorization process operates will be affected by the individual child’s classification skill, which undergoes age-related changes (see the ‘‘classification skill’’ rectangle associated with the CEI rectangle in Figure 1), and environmental experience (such as the number of encounters with exemplars). The mere act of categorization triggers processes involved in the construction of social stereotypes, discussed next. C. DEVELOP STEREOTYPES AND PREJUDICES [DSP] CONCERNING SALIENT SOCIAL GROUPS
The process of categorization is hypothesized to result in constructivist cognitive-developmental processes that serve to attach meaning to social groups in the form of beliefs (i.e., stereotypes) and affect (i.e., prejudice) (see DSP rectangle in Figure 1). Developmental intergroup theory outlines the factors that guide children’s acquisition of the content of their social stereotypes and the nature of their affective responses to social groups. We propose two broad classes of ways in which children attach meaning to groups: those that are internally driven (constructs shown by the top ovals feeding into the DSP rectangle of Figure 1) and those that are externally driven (those shown in the bottom ovals feeding into the DSP rectangle).
58
Rebecca S. Bigler and Lynn S. Liben
Here, we make broad remarks about these two types of processes. (A more detailed discussion of the four specific factors is provided later in Section VII.) Internally driven processes involve the self-generation, or construction, of links between social categories and (a) attributes (traits, behaviors, roles), and (b) affect (e.g., liking). With respect to social stereotyping, these processes are conceptualized as ones in which children go above and beyond veridical information available in the environment to inferentially construct information about the attributes associated with particular social categories. A child might claim, for example, that African Americans and European Americans have different blood types (see Hirschfeld, 1996) or, as the first author’s six-year-old daughter announced at her first sight of the (to her, disgusting looking) dish, ‘‘only boys eat oysters.’’ Obviously, such beliefs are not based on the accumulation of empirical evidence but instead reflect the child’s imposition of an internalized group schema onto the world (the origins of which are described subsequently). With respect to prejudice, the processes are conceptualized as those in which children actively generate more positive affective links to ingroups than outgroups. Studies of novel social groups demonstrate both processes: Children typically prefer their ingroup to their outgroup and come to believe that more positive traits characterize their ingroup than their outgroup, despite the fact that such beliefs are neither modeled by adults nor objectively true. The two processes are likely to be reciprocally reinforcing. Cognitive beliefs about the superiority of ingroups reinforce affective ingroup biases and vice versa. Thus, a developmental account of social stereotyping and prejudice must go beyond processes identified by traditional and social learning theories. Children’s cognitive processes are applied to what they encounter in the world, and the environments in which children reside (both macro-level and micro-level) are characterized by correlations between social categories and many attributes. Gender and race, for example, are correlated with occupational roles and activities. Children are much more likely to encounter male than female football players. Or, as another example, the occupation of President of the United States shows a perfect correlation with both gender and race (all Presidents have been white men), a high profile correlation that is likely to be detected by children. We posit that children’s attention to such correlations plays a role in shaping the content of stereotypes and, in turn, prejudice. So, although children typically show strong affective preferences for their ingroups, individuals who are members of groups that are strongly linked in the culture with negative attributes (e.g., African Americans, gays and lesbians) are more likely to develop non-biased or non-prejudiced affective responses to groups than are their peers. The veridical presence of correlations between social categories and some attributes has led some psychologists to claim that stereotypes are
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
59
accurate generalizations (see Lee, Jussim, & McCauley, 1995). We agree that the detection of group-attribute correlations plays a role in stereotyping. However, we reject as inadequate those approaches positing that stereotypes are merely the reflection of true category-attribute relations or positing that children acquire stereotypes only through environmental models (e.g., social learning theory). Both approaches fail to account for some important facts about social stereotyping among children. First, as noted earlier, children and adults typically develop stereotypic views and prejudices concerning groups that are meaningless and thus uncorrelated with any observable traits or behaviors. Children in Bigler’s studies, for example, claim that ‘‘red’’ children differ from ‘‘blue’’ children, even in the complete absence of (a) adult reinforcement of such views, (b) adult modeling of such views, or (c) veridical correlations between groups and traits (Bigler, Jones, & Lobliner, 1997; Bigler, Brown, & Markell, 2001; Brown & Bigler, 2002). Second, children’s social stereotypes frequently are unrelated to environmental messages. For example, young children’s racial and gender attitudes are uncorrelated with their parents’ attitudes (Fishbein, 2002; Tenenbaum & Leaper, 2002) or with their peers’ attitudes (Aboud & Doyle, 1996). In addition, children sometimes express rigid beliefs about gender that contradict cultural views (e.g., a number of children in our own studies profess that ‘‘only men should vacuum’’) but, at the same time, serve to associate their own gender with positive characteristics (see Powlishta, 2004) or desired activities (e.g., many young boys in our studies were attracted to vacuum cleaners). Interestingly, even studies that assess knowledge of cultural stereotypes show discontinuities between children’s beliefs and environmental models (Trautner et al., 2005). So, for example, when asked about which gender usually bakes at home, young children (5 – 7-year-olds) are more likely than older children (8 – 10-year-olds) to say ‘‘only women’’ rather than the more accurate ‘‘mostly women.’’ Thus, social stereotyping clearly involves some degree of construction on the part of individual children. Third, enormous numbers of potential category-to-attribute correlations are available in a typical child’s world. It seems unlikely that a child could calculate the correlational relation between (a) all possible social groups within an environment and (b) all possible traits, roles, and activities within that environment. We think it unlikely, for example, that children calculate the relation between individuals’ height and the likelihood of being a nurse, individuals’ hair length and the likelihood of being gentle, or individuals’ religion and the likelihood of using an ironing board. Yet, most children detect the correlation between each of these qualities and gender, and thus some statistical learning of group-to-attribute relations appears to occur. We argue, as do individuals who study infants’ and young children’s attention to statistical information in the service of language learning (Gomez & Maye, 2005;
60
Rebecca S. Bigler and Lynn S. Liben
Saffran, 2003), that some processes must narrow the scope of the problem so that children’s attention is directed toward only a subset of possible correlations. Developmental intergroup theory suggests such processes, outlining the factors that serve to make some (but not others) person attributes the basis for children’s social categorization (depicted as ovals contributing to establishment of the psychological salience [EPS] of person attributes in Figure 1, and described later).
D. APPLY STEREOTYPE FILTER [ASF] TO ENCOUNTERED INDIVIDUALS
The final core component of our theoretical approach to the development of stereotypes and prejudice concerns children’s responses to the continual flow of information relevant to groups and the characteristics of group members. Over time, children are continually exposed to new information that potentially: (a) extends, (b) reinforces, or (c) contradicts the content of their social stereotypes and prejudices. In the domain of gender, for example, children have new experiences that expose them to an expanding scope of occupations that may or may not be correlated with gender (e.g., manicurists and real-estate agents, respectively), they encounter people who confirm or contradict their gender stereotypes (e.g., female nurses and female fire fighters, respectively), and they interact with men and women who trigger positive and negative affective responses. What rules govern children’s responses to such information? How and when are social stereotypes strengthened or revised? As depicted in Figure 2, we posit that, once formed, children’s social stereotypes and prejudices show a strong tendency to be maintained. The primary cause of this tendency is schematic processing (Bem, 1981; Liben & Signorella, 1980; Martin & Halverson, 1981). As argued earlier, constructive processes lead children to have a strong tendency to remember information that conforms to their stereotypic expectations, and this information serves then to strengthen individuals’ stereotypic beliefs and prejudices (see the pathway in Figure 2 stemming from an affirmative answer to the decision filter that asks, ‘‘Is the encountered individual stereotypic?’’). In contrast, children have a strong tendency to forget or distort information that contradicts their stereotypic beliefs (see the pathway in Figure 2 stemming from a negative answer to the decision filter ‘‘Is the encountered individual stereotypic?’’) The schematic processing of such information is depicted in Figure 2 by the double-bordered rectangle labeled ‘‘apply stereotype filter to encountered individual’’ [ASF]. When this filter operates, it serves to alter the way that the stimulus persons are encoded or processed. The result is either to lead the child to produce stereotype-consistent representations, or to forget those encountered persons in their entirety.
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
61
Consistent with this proposal are, for example, findings that: (a) children show better overall memory for occupational depictions that are consistent with cultural gender stereotypes than those that contradict such stereotypes, and that (b) children distort the jobs or the biological sex of individuals who are depicted in counterstereotypic occupations (e.g., Liben, Bigler & Krogh, 2002; Liben & Signorella, 1980). Although counterstereotypic information is often misremembered or distorted, such information is occasionally remembered accurately. When it is accurately encoded, such information can serve to differentiate the original stereotype via the addition of ‘‘subtypes’’ (e.g., the differentiation of the schema for women into ‘‘housewives’’ and ‘‘career women’’). Explicit labeling of counterstereotypic models (e.g., ‘‘She is a female physician’’) has been shown to promote their retention (Hirschfeld, 1993; Liben & Signorella, 1993). Nonetheless, most researchers agree that—once stereotypes and prejudices are formed—exposure to individuals who are inconsistent with one’s personal beliefs and biases is rarely sufficient to undermine them (Banks, 1995; Liben & Bigler, 1987). Cognitive constraints, such as young children’s difficulty in understanding that individuals can belong simultaneously to two categories that are not traditionally linked, cause rigid schematic processing to be more common among children than adults. Thus, children’s social stereotypes and prejudices are especially difficult to revise. This cognitive constraint and other developmental and individual differences that contribute to filtering process are discussed in the next section.
VII. Principles of the Formation and Maintenance of Social Stereotypes and Prejudice In the previous section, we outlined the general process by which children are hypothesized to develop social stereotypes and prejudice. In this section, we review the factors posited to affect the core processes of stereotyping and prejudice and the individual and developmental differences believed to operate to affect these processes. A. FACTORS AFFECTING THE ESTABLISHMENT OF THE PSYCHOLOGICAL SALIENCE [EPS] OF PERSON ATTRIBUTES
Four factors are hypothesized to affect the establishment of the psychological salience [EPS] of person attributes. These are depicted as ovals that feed the EPS component of Figure 1 and include: (1) perceptual discriminability of social groups, (2) proportional group size, (3) explicit labeling and use of social groups, and (4) implicit use of social groups.
62
Rebecca S. Bigler and Lynn S. Liben
1. Perceptual Discriminability Developmental intergroup theory postulates that social groups that are marked by perceptually salient attributes become the target of intergroup bias among children much more readily than social groups that are not marked by perceptually salient attributes. This tenet stems, in large part, from cognitivedevelopmental theory. Piaget (1970) suggested that young children’s thinking is tied to perceptually salient dimensions of objects and people. Considerable research indicates that young children tend to focus on perceptually salient attributes in person perception tasks (see Livesley & Bromley, 1973). Consistent with this perspective, perceptually salient features such as race, gender, age, and attractiveness typically become the basis for social stereotyping among children, whereas non-perceptually salient features such as political affiliation typically do not. Children show, for example, little stereotyping and prejudice on the basis of most religious (e.g., Protestants, Jews) and national groups (e.g., Germans, Russians, Australians), even within cultures in which adults stereotype and show prejudices along these dimensions (Rutland, 1999; Stringer & Irwing, 1998). The probability of stereotyping and prejudice seems to increase, however, when observable features, such as attire, mark these groups (Kowalski & Lo, 2001). Why does perceptual salience facilitate the formation of intergroup bias? We hypothesize that perceptual salience is important for several reasons. First, perceptually discriminable categories are necessary for infants and young children to: (a) construct categories about which they develop beliefs, and (b) detect veridical co-variations between actual social categories and attributes, which become the bases of stereotypes and prejudices. As described earlier, social groups can (a) be the basis of self-generated beliefs about group-attribute links, or (b) differ in systematic ways, providing a ‘‘kernel of truth’’ to the formation of group stereotypes (see Lee, Jussim, & McCauley, 1995). However, neither of these is likely to occur unless children can identify an individual’s group membership from visual appearance. Adults, in contrast, can perform the mental operation of assigning attributes to hypothetical—even perceptually unidentifiable—social groups such as Democrats, Mormons, and Communists (e.g., Bialer, 1985). Second, children are likely to believe that individuals who share similar observable features (e.g., skin color) also share non-observable features, such as interests and desire (e.g., Hirshfield, 1996). As described earlier, this process is likely to be mediated by categorization. That is, the perceptual similarity of some individuals leads children to classify stimuli as belonging in the same category, and such categorization leads children to develop essentialist views of category members (see Gelman, 2003). Consistent with these predictions, perceptually salient markers of category membership seem to be necessary for the development of stereotypes
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
63
and prejudice in children. For example, Bigler (1995) assigned children to novel ‘‘color’’ social groups (blue vs. yellow) in their classrooms. Children in these groups were not, however, marked by a perceptually salient feature (i.e., they were not given different colored shirts to wear) but were simply said to be members of these color groups. Even in classrooms in which teachers used the groups to label and organize the class, children failed to develop ingroupbiased views of the groups. This outcome thus differs sharply from the ingroup biases that have been shown to occur in similar situations when children wore different colored shirts to mark their group membership (Bigler, Jones, & Lobliner, 1997; Bigler, Brown, & Markell, 2001). Finally, it is worth noting that children sometimes do produce stereotypic remarks about social groups that they themselves are unable to perceptually detect. For example, a child might comment (as did a seven-year-old participant in one of our studies) that ‘‘Cubans are lazy,’’ despite showing no ability to identify Cuban individuals from photographs. Or, a child might produce stereotypic and prejudiced remarks about gays and lesbians while showing very little knowledge about, or ability to detect, these groups. Consistent with our anecdotal example, we believe that it is possible for children to learn (usually via direct, explicit instruction) stereotypes of visually indistinguishable groups. We argue, however, that when stereotypes are learned primarily by this direct-teaching route, they are less complex and less deeply held than are those stereotypes that are developed via constructive processes that draw on children’s ability to make perceptual discriminations between groups. In short, we argue that—at the group level—stereotypic beliefs and prejudice will be less common for perceptually indiscriminable groups than discriminable groups, and that—at the individual level—the stereotypic beliefs and prejudices derived from direct, explicit teaching (discussed later) will be less highly articulated for perceptually indiscriminable groups than for discriminable groups. In sum, we propose that being able to discriminate perceptually between or among groups is a necessary (but, as we outline later, not sufficient) condition for the establishment of widely shared, extensive social stereotyping and prejudice among children. There is a change in the ability to discriminate among groups as a function of the perceiver’s age and experience, but, as a general rule, groups that can be readily distinguished by endogenous visible qualities (e.g., skin color, eye shape) or are routinely made discriminable by exogenous markers (e.g., hair, badges, clothing) are likely to become the basis of social stereotypes and prejudice. 2. Proportional Group Size A second factor hypothesized to have an impact on the psychological salience of social groups among children is proportional group size, that is, whether
64
Rebecca S. Bigler and Lynn S. Liben
the members of social groups comprise a majority, a minority, or roughly half of the individuals within particular settings (e.g., classrooms, neighborhoods, cities). Given equivalence with respect to other model factors, social categories are more salient when group sizes are unequal (e.g., race within most settings in the U.S.) than when group sizes are roughly equal (e.g., gender within many settings in the U.S.). Furthermore, when attention is drawn to minority– majority social groups, the proportionately smaller of the two groups is more distinctive than the proportionately larger group (Mullen, 1991; Mullen & Hu, 1989). The distinctiveness of proportionally smaller groups for child and adult observers has important consequences for the development of stereotypes and prejudice. Social groups that have minority status become the target of negative intergroup attitudes (i.e., stereotypes and prejudice) more often than social groups that have majority status. One reason for this effect is the formation of illusory correlations, which occur when individuals perceive a relation between a social group and some trait or role that does not actually exist, or does exist but to a lesser extent than is believed to be the case (Hamilton & Sherman, 1989; Johnston & Jacobs, 2003; McConnell, Sherman, & Hamilton, 1994). In addition, the salience of social categories differs for individuals depending on their own membership in groups of proportionally different sizes. Just as minority groups have more psychological salience than majority groups, those individuals who are members of proportionately smaller groups experience their group membership as more salient than do those who are members of proportionately larger sub-categories within groups (Brewer, 1979; Brewer & Brown, 1998). Individuals who have been the sole members of their gender or race within some setting readily attest to the increased salience of their category membership (e.g., Parker, 1997). Furthermore, many individuals report that the experience of being a member of a minority group is a negative one. Children who are members of outnumbered groups often appear unhappy with their group membership. Illustrative data come from studies in which minority groups were created experimentally by, for example, having only two or three children wear blue shirts while all others in their classroom wore yellow shirts (Brown & Bigler, 2002). When initial assignments to groups were made, children assigned to the minority (but not majority) groups protested. Furthermore, after several weeks, children who were members of the minority group made a significantly greater number of requests to be reassigned to the other color group than did members of the majority group. At the same time, children who were members of the minority group showed stronger ingroup peer preferences than children who were members of the majority group. In sum, we propose that social groups marked by substantially unequal distributions are more likely to become psychologically salient bases for categorization than are groups with numerically balanced sub-groups. In addition,
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
65
we propose that, in general, children who are members of proportionately smaller groups are more likely to view their social group as psychologically salient than are children who are members of proportionately larger social groups. 3. Explicit Labeling and Use A major tenet of developmental intergroup theory is that children develop stereotypes and prejudices on the basis of those characteristics that adults mark as important bases for social categorization. Conversely, children ignore those sources of human variation that adults fail to mark as important bases for social categorization. Specifically, developmental intergroup theory holds that social groups that are labeled and used explicitly by adults in the environment become the target of stereotyping and prejudice much more readily than social groups that are ignored by adults in the environment. It is important to note, however, that in our theory, we presume this process to be a constructive (rather than a social learning) one. That is, children do not merely imitate adults’ social stereotyping and prejudice. Instead we propose that they construct beliefs about groups based on adults’ cues, often quite subtle ones, which mark a particular basis for categorization (e.g., gender, race, height, sexual orientation) as important. The original source of our hypothesis concerning functional use of group divisions was Bem’s (1983) proposal that gender stereotyping among children was caused by adults acting in ways that invest gender with functional significance. She noted that adults label gender and use gender categories in functionally significant ways (e.g., asking children to line up by sex in school). She further noted that society made similar use of race during much of the country’s history, but had abandoned such practices in the 1960s, perhaps thereby reducing typical levels of racial prejudice in the country. Bem’s proposal cannot be tested experimentally, however, given that children are universally raised in environments that invest gender with functional significance. Thus, Bigler and colleagues developed a novel group paradigm to test whether investing a category with functional significance is sufficient to induce the formation of stereotyping and prejudice. In this research, Bigler, Jones, and Lobliner (1997) and Bigler, Brown, and Markell (2001) assigned children to novel color groups, and after several weeks, assessed children’s attitudes toward their ingroup and outgroup. The data from these studies confirmed that when adults used the novel (color) groups to label children and to organize the classroom, children developed biased beliefs about ingroups and outgroups. In the discussion thus far, we have not differentiated between the possible impact of two different kinds of adults’ behaviors: labeling groups vs. using groups in some functional way. In the studies conducted by Bigler et al., these behaviors co-occurred. That is, teachers both labeled the groups and used
66
Rebecca S. Bigler and Lynn S. Liben
these groups to organize the classroom environment. Subsequently, however, Gelman and her colleagues proposed that labeling alone (even without functional use) serves as an important cue that leads children to: (a) categorize others on the basis of some dimension, and more significantly, (b) endorse essentialist beliefs about groups (Gelman, Taylor, & Nguyen, 2004). As one test of this hypothesis, Gelman and Heyman (1999) exposed 5- and 7-year-olds to characters who were described either with noun labels (‘‘She is a carrot-eater’’) or with verb-predicate labels (‘‘She eats carrots whenever she can’’). Children who learned noun labels inferred that the given trait (e.g., carrot eating) was more stable over time and over contexts than children who learned verbpredicate labels. These results suggest that children treat noun labels about people as indicative of stable, non-obvious characteristics. Such thinking is thus likely to exacerbate social stereotyping. Using social categories in functionally significant ways may produce an additional effect beyond the effects of merely labeling. Although this possibility should be examined in future research, for practical purposes, the distinction may be relatively unimportant in the everyday world because when categories are labeled in the natural ecology, it is usually in the service of something beyond simply assigning labels. For example, teachers are unlikely to simply point to a group of boys to say ‘‘These are the boys in the class.’’ Instead, they would be likely to use labels for some functional purpose, as in ‘‘Will the boys please line up for lunch?’’ Thus, the two factors are likely to appear together in children’s environments. In sum, we propose that when authority figures distinguish among individuals via: (a) the use of labels, or via (b) some functional organization of the social environment (e.g., assigning desks or bulletin boards by group), the grouping criterion (e.g., gender, color, reading ability) is likely to become the basis of intergroup bias among children. This is true even when these categories are used by authority figures in a completely neutral manner. As long as the groups are perceptually discriminable, these acts would still be expected to be powerful even if social groups are not explicitly linked to particular roles or traits, and even if the authority figure offers no message about differential value (as when teachers simply ask boys and girls to line up separately without further comment about, say, which group has been better behaved and thus may be excused first for recess). 4. Implicit Use As explained previously, developmental intergroup theory postulates that children develop social stereotypes and prejudices on the basis of those groups that adults mark as important. Both mechanisms just discussed—verbal labeling and functional use of groups—provide explicit markers of adults’ judgments about groups distinction. We also posit that there are implicit mechanisms that
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
67
lead children to infer that adults view social categories as important. One particularly important implicit mechanism is segregation. When children see social groups that are segregated (i.e., physically separated) along some particular characteristic (e.g., race, gender), they are more likely to develop stereotypes and prejudice on the basis of that characteristic than when children see social groups that are integrated with respect to that particular characteristic. Segregation has long been believed to cause stereotyping and prejudice (Allport, 1954; Wilson, 1947), but the primary mechanism believed to be responsible for its influence has been lack of contact. Several theorists have argued that contact with social groups increases liking and thus, groups from which one is isolated (via segregation) are liked less than groups that are encountered regularly (Pettigrew, 1971; Zajonc, 1968). Although there is ample evidence that contact promotes liking (see Pettigrew & Tropp, 2000, for a meta-analytic review), we propose that relative lack of contact is unlikely to be the primary mechanism by which segregation affects social stereotyping and prejudice among children. We propose instead that in the process of attempting to identify which of the many possible features of people are important, children begin by noticing the characteristics along which humans are sorted into contexts. That is, they note perceptual similarities among those who live, work, and socialize together. They then infer that individuals operate within segregated environments because they differ in important and nonobvious ways. In other words, we hypothesize that segregation drives stereotyping and prejudice because it encourages children to construct beliefs about group differences. Evidence that is consistent with this hypothesis comes from two studies conducted by Bigler (2004). In the first, children (ages 6 – 11 years) were assigned to novel social groups within segregated (all ‘‘red’’) or integrated (mixed ‘‘red’’ and ‘‘blue’’) summer school classrooms. After several weeks, children in the segregated classes developed more ingroup-biased attitudes than those within integrated classrooms. However, even children from the integrated classes developed attitudes that showed ingroup bias. Results from the second study helped to explain why this ingroup bias had been so universal. Children (ages 7 – 13 years) were not themselves assigned to be members of the novel (red or blue) color groups but instead merely observed red and blue children who were seen to be interacting in either segregated or in integrated contexts. Participant children were then asked about the personalities, interests, and interpersonal motivations of the children they had observed. Participants were significantly more likely to conclude that the two types of observed children (red and blue) differed from one another and preferred to play with ‘‘their own kind’’ if participants had observed red and blue children interacting in segregated groups than if participants had observed red and blue children interacting in integrated groups.
68
Rebecca S. Bigler and Lynn S. Liben
Of course, children are exposed to a great deal of social segregation in their real environments. For example, schools, neighborhoods, and occupations all show high (and in many areas, growing) levels of racial and ethnic segregation (Lewis Mumford Center, 2001; Orfield, 2001). In addition, children typically view many examples of age- and gender-based segregation. At the same time, these patterns of segregation are probably only rarely discussed explicitly with children. That is, adults are typically unlikely to explain patterns of segregation directly. Although there are certainly exceptions (e.g., some African American parents may explain the existence of white ‘‘enclaves’’ in their hometown to their children; some feminists may explain the lack of females within certain occupations to their children), our impression (in need of empirical test) is that most parents and educators are reticent when talking to children about the causes of most forms of social segregation. In the absence of such explanations, we believe that children construct their own reasons for such segregation. In sum, we propose that social groups marked by segregation are likely to become the subject of intergroup bias among children, especially when such segregation goes unexplained by adults. In such circumstances, we believe that children come to believe that the social divisions they observe must have been caused by deep and meaningful differences between groups.
B. FACTORS AFFECTING THE DEVELOPMENT OF STEREOTYPES AND PREJUDICE [DSP]
Four factors are hypothesized to have an impact on the processes of stereotype and prejudice formation: (1) essentialism, (2) ingroup bias, (3) explicit attributions, and (4) group-attribute covariations or implicit attributions. These constructs are shown in the ovals feeding into the DSP component of Figure 1. 1. Essentialism Cognitive psychologists and anthropologists have suggested that some types of categories, particularly those found in the natural world, are structured by an ‘‘essentialist’’ mode of thinking (Gellman & Wellman, 1991; Medin, 1989). According to Gelman (2003), essentialism is the belief that members of a category share important, non-obvious qualities. Hirschfeld (1996) drew on research concerning children’s conceptions of race to suggest that humans have an innate, domain-specific tendency to classify human kinds in essentialist ways. Thus, he argued, children are likely to view humans that are members of the same category as sharing internal and non-obvious properties. Children’s tendency to engage in essentialist thought has important implications for stereotyping and prejudice. We argue that, after a particular human
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
69
attribute becomes the basis of categorization, children are likely to view the visible markers of group membership as denoting other non-obvious, inherent qualities. African Americans, for example, are understood by European American children as sharing some core character or qualities that determine important aspects of personality and behavior (rather than as sharing only a skin complexion). Differences between groups are presumed by the child to exist and thus, their content is constructed by the child. Furthermore, we speculate that if children also encounter evidence of group differences in the world (e.g., children see African Americans and European Americans differentially associated with traits and roles), they will interpret these differences as normative, natural, and inevitable rather than as being the result of social processes such as discrimination. 2. Ingroup Bias As reviewed earlier when discussing the theoretical foundations of our work, the mere act of categorizing individuals into social groups (‘‘minimal groups’’) is sufficient to produce intergroup prejudice and discrimination among adults (Tajfel & Turner, 1986). Intergroup processes do not operate in quite the same way among children, in part due to developmental constraints on children’s classification skills as explained in greater detail later in the chapter. Nonetheless, we believe that after children have come to place people in differing social categories, they typically show prejudice in viewing the group to which they belong (their ingroup) as superior to others’ groups (outgroups). In other words, children, like adults, show a reluctance, or even inability, to endorse the belief that groups are ‘‘separate but equal.’’ Instead, after they view a group as ‘‘separate’’ from one to which they themselves belong, they necessarily view the groups as ‘‘unequal.’’ As noted earlier, the tendency toward ingroup bias affects the development of stereotype content. When stereotype content is acquired via self-generative or constructive processes, children fabricate category-attribute links that favor their own group. Specifically, children claim that positive attributes are associated with their own group, whereas negative attributes are associated with the outgroup (Powlishta, 1995b). Illustrative is the example given earlier in which the first author’s daughter instantly attributed oyster-eating (viewed as a negative trait) to boys. When content is generated via the detection of category-toattribute links present in the environment, children will show greater attention to, and memory for, positive attributes linked to the ingroup and negative traits linked to the outgroup (Powlishta, 2004; Powlishta & Serbin, 1993). The causes of essentialism and ingroup bias among children are unknown. Some theorists have argued that both are the result of innate predispositions (e.g., Fishbein, 2002; Hirschfeld, 1996). Although biologically-based proclivities may play a role, we argue that both processes are also likely to be due, at least
70
Rebecca S. Bigler and Lynn S. Liben
in part, to learning. Specifically, we hypothesize that even by an early age, children acquire a good deal of implicit, or tacit, knowledge about categorization, including the rules that govern categorization and category-based inferences. It is likely that embedded in such knowledge is the understanding that: (a) social categories usually convey information about deep and nonobvious person characteristics and (b) people usually prefer those social groups to which they belong over other social groups. There are many mechanisms by which families and other socializing institutions (e.g., religions, political units) teach the lesson that people are generally expected to prefer those who are similar to oneself. For example, children may be discouraged from playing with children who are not like them; adolescents may be forbidden to date peers from another religion or nationality; comments may be made about the likelihood of friendships with new neighbors who are, or are not, members of one’s own ‘‘group.’’ Exposure to lessons like these is likely to teach children that individuals typically have greater affinity for those who share their group membership than for those who do not. Little work has examined children’s implicit understanding of the rules that govern social categorization and its consequences, or children’s meta-cognitive understanding of social categorization (see Heyman & Gelman, 2000, for an exception). There may well be individual differences among children in this domain that have important implications for stereotyping and prejudice. For example, some writers have argued that children who grow up on military bases that are characterized by diversity, and thus have contact with peers from many different racial, ethnic, economic, and religious backgrounds, are less likely than non-military children to endorse social stereotypes and show ingroup biases (Pollock & Van Reken, 1987). Future work on these topics is likely to advance our understanding of social stereotyping and prejudice. Nonetheless, children’s general tendency toward essentialism and ingroup bias effectively guarantees that once the categorization of humans into social groups has occurred, children infer meaning underlying those categories even when the culture offers no such stereotypes. 3. Explicit Attributions As suggested by many researchers (Allport, 1954; Bem, 1983; Williams & Morland, 1976), extensive patterns of linkages between social groups and various traits/roles within the environment are predicted to increase the likelihood that social stereotypes and prejudice will arise. Such links may be explicit or implicit. The explicit linking of groups to attributes involves labels and propositional statements. For example, a child might be told, ‘‘Women don’t like to fix cars’’ or ‘‘Old people are frail’’ or ‘‘Homosexuals are child molesters.’’ The direct teaching of the content of social stereotyping has, historically, been viewed as a primary means of stereotype formation
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
71
(Allport, 1954) and we agree that it plays a role. We suggest, however, that direct teaching is likely to play a less important role in contemporary times than it did in the past. One reason for this suggestion is that many adults no longer believe in the links between social groups and attributes, at least not consciously (Devine, 1989). As a result, they are unlikely to verbalize such beliefs to children. Covert or unconscious biases have taken the place of much explicit stereotyping. Thus, although most individuals believe themselves to be non-prejudiced and although their explicit comments are in keeping with this self-conception, they may nevertheless harbor stereotypes and prejudices about which they are unaware (McConahay, 1986; Swim et al., 1995). Under conditions in which individuals have insufficient cognitive resources to monitor their responses (e.g., in a situation requiring a quick response or in an intensely emotional situation), these implicit stereotypes and prejudices may shape individuals’ judgments, inferences, and nonverbal behaviors (Banaji & Greenwald, 1995; Devine, 1989). Consistent with the belief that direct teaching probably plays only a limited role is the fact that parents and teachers infrequently make direct statements about gender and racial stereotypes such as ‘‘Only girls can play with dolls’’ or ‘‘Black children are selfish,’’ even in contexts explicitly designed to be conducive to comments of this kind (Gelman, Taylor, & Nguyen, 2004). Of course, a lack of evidence of such comments may reflect demand characteristics to avoid expressing prejudice in observational studies. That is, parents and teachers may well censor such comments when they know they are being observed, but may make them when they think their behavior is going undetected. In addition, there are undoubtedly situations in which explicit comments are likely, for example, among members of supremacist organizations and some religious groups (e.g., Ezekiel, 1995), and among groups of same-age peers (Kowalski & Kanitkar, 2003). Regrettably, to our knowledge few studies of parental socialization have employed new, less reactive measures of stereotyping and prejudice such as the Modern Sexism scale (Swim et al., 1995) or the Modern Racism scale (McConahay, 1986), and thus, we know little about whether children’s attitudes and behaviors might reflect differences in less blatant forms of sexism or racism among parents. Although direct statements linking groups and attributes are less frequent than in the past, when they do occur, they are still likely to produce social stereotyping and prejudice. A dramatic demonstration of both the atypicality of direct statements in contemporary society as well as their power is evident from the classic documentary, ‘‘Eye of the Storm’’ (Peters, 1970). In the film, Jane Elliot informs her third grade class in Riceville, Iowa, that brown-eyed children are dumb, lazy, and mean, whereas blue-eyed children are smart, hard working, and friendly. Demonstrating contemporary discomfort with direct statements about social groups, her remarks strike audiences today as startlingly direct, even unconscionable. Demonstrating the power of such remarks, the
72
Rebecca S. Bigler and Lynn S. Liben
children in Elliot’s class developed stereotypes and prejudice. Such remarks are extremely powerful because they operate at two levels simultaneously. First, the labels themselves serve to mark the social groups as important, and second, they provide clear information about specific attributes that are associated with the labeled group. Although we argue that—at the group level—learning of explicit links now generally plays a relatively minor role in stereotype formation, we add two caveats. First, some members of oppressed social groups may use explicit attributions to counter the prevailing cultural stereotypes of their groups. Organizers of the civil rights movement made use of such statements (e.g., ‘‘Black is beautiful’’). Today, women who are active as political feminists might encourage their daughters to learn links between females and positive traits, or males and negative traits, perhaps as compensation for the perceived bias in wider cultural messages about women’s skills and worth. Supporting this notion is the observation that there are many cards, magnets, notebooks, and other similar products that provide explicit messages about gender (see Barreca, 1992). Thus, although some kinds of explicit comments have become socially unacceptable and hence rare in public situations, others remain acceptable and even publicized, and still others may be common in contexts hidden from the researcher’s purview. Second, we note that, although explicit attributions may be relatively rare within adults’ speech, they are probably common among children’s speech, especially in speech directed to their peers. That is, children may ‘‘teach’’ attributions that they have detected (a process described next) or invented, as in the popular children’s rhyme that is, perhaps, a mixture of both: ‘‘Girls go to college to get more knowledge, boys go to Jupiter to get more stupider.’’ They may also explicitly teach prejudice without reference to attributes. Indeed, several writers have noted that high levels of gender antagonism and frequent statements such as ‘‘I hate girls,’’ mark the early elementary-school years (e.g., Renold, 2001). Obviously, tolerance and encouragement of such talk are likely to fuel social stereotype formation and prejudice. 4. Group-Attribute Covariation or Implicit Attributions In addition to hearing others make explicit links between groups and attributes, children may learn about the importance of social groups and their specific attributes via implicit environmental messages. That is, children may be exposed to models in their environment (live and symbolic) that suggest that some human characteristics (e.g., gender, race) are linked to, or correlated with, specific attributes. For example, a child who is exposed to 10 auto mechanics, 9 of whom are male, is likely to infer a correlation between gender and fixing cars. Exposure to such information was posited by social learning and social-cognitive learning theories to affect children’s attitudes toward gender (Bussey & Bandura, 1992, 1999) and race (Williams & Morland, 1976).
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
73
We agree that the detection of such correlations is important because it forms the basis of a stereotype—and include this process in our model under ‘‘implicit attribute detection.’’ Unlike social learning theories, however, we further posit that children’s tendency to oversimplify and overextend categories (as discussed earlier) also leads to the tendency to generalize specific attributes to all members of some groups and none of the members of other groups, resulting, for example, in the belief that ‘‘only men can be auto mechanics.’’ In other words, although adult stereotypes are likely to function as probabilities concerning the likelihood that a member of some group will exhibit some trait or role (e.g., McCauly & Stitt, 1978), children’s social stereotypes are more likely to function as rules. Children’s endorsement of hard and fast rules is also likely to promote their rigid use of social group membership as the basis for making predictions about others. For example, knowing an individual is female leads young children—but not adults (who are better able to think in terms of probabilities)—to be certain that a given individual (a woman) is not an auto mechanic. However, researchers have not addressed the issue of the amount and nature of modeled information that is needed to form and revise stereotypic beliefs. We do not know, for example, what percent of fire fighters must be men to cause observers to develop the belief that fire fighters are men, or—once such a belief is formed—what percent of women fire fighters must be encountered to revise the prior belief. Developmental intergroup paradigms can be employed to answer these questions by manipulating the percentages and frequencies of the information presented to children. Another source of implicit information is the nonverbal behavior that adults (e.g., parents, teachers) direct toward members of social groups, or show in response to the presence of group members. For example, many European Americans become nervous or socially withdrawn in the presence of African Americans, in part as a result of automatic or implicit racial prejudice (e.g., Dovidio, Kawakami, & Gaertner, 2002). Importantly, these nonverbal behaviors are also likely to be unconscious (Chen & Bargh, 1997) and, as a consequence, adults are unlikely to explain their behavior to their children. The nonverbal communication of social biases is understudied in developmental psychology. In fact, we know of no research investigating whether parents respond to, or treat, members of different social groups differentially in the presence of children, and, in turn, whether children perceive the differential treatment or responses. For example, do parents make eye contact, smile, or shake hands with members of various social groups at differential rates while in the presence of their children, and if so, do children notice such differences? If children do perceive that adults treat individuals differentially as a function of social group membership, they may, as a consequence, develop social stereotypes and prejudices.
74
Rebecca S. Bigler and Lynn S. Liben
In sum, we propose that conditions in which social groups are distinguished by highly differentiated sets of roles and traits, especially when those differences go unexplained by adults, set the context for stereotyping and prejudice. Under such circumstances, children come to endorse specific group stereotypes and are likely to believe that deep and meaningful differences between groups cause the patterns of correlated attributes and roles they have detected.
C. INTERACTIONS AMONG FACTORS THAT SHAPE EPS AND DSP
Each of the factors that we have described has an important role in the process of developing social stereotypes and prejudice. Although each is important in its own right, any single factor, taken alone, is unlikely to lead to the development of social stereotypes and prejudice. Thus, a perceptually salient group that is not pointed out by adults in the environment (e.g., through verbal labeling), and is not linked to a network of attributes (e.g., through disproportionate appearance in either prestigious or low-level jobs), is highly unlikely to become the target of intergroup bias. Additionally, a social group distinction that is used in a functional manner, but is not perceptually salient and is not linked to a network of attributes, is also highly unlikely to become the basis for intergroup bias. Perceptual discriminability, in combination with any one of the other three factors (minority status, explicit labeling, implicit use), however, produces intergroup stereotyping and prejudice. For example, when perceptually discriminable groups are used in a functional manner by authority figures, children will associate negative traits and roles with the outgroup and positive traits and roles with the ingroup even in the complete absence of actual categoryattribute links. In other words, stereotypes are not always based on a ‘‘kernel of truth.’’ When additional factors are added to the mix, the likelihood of stereotyping and prejudice increases even further. The treatment of gender within the United States illustrates the way that many of the factors in our theory operate. ‘‘Male’’ and ‘‘female’’ are social categories that are characterized by: (a) perceptual discriminability (often exaggerated by various kinds of marking such as differential dress and hair styles), (b) explicit labeling (different words, forms of words, pronouns, names) and sorting (e.g., in bathrooms, basketball teams), and (c) implicit use or segregation by sex (e.g., the pervasiveness of same-sex friendships, segregation by sex in many occupational settings). Furthermore, both of the exogenous factors hypothesized to lead children to form stereotypes and prejudices are present in the case of gender. That is, there are: (a) category-to-attribute links (as in differential proportions of women
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
75
and men in day care centers and police stations), and (b) at least occasional explicit attributions (as in the poem ‘‘What are little girls made of? Sugar and spice, and everything nice. What are little boys made of? Snips and snails, and puppy dog tails’’). Given the confluence of so many factors, it is not surprising that gender stereotyping and prejudice are among the earliestemerging and most stable forms of social bias among children in contemporary U.S. culture (see Ruble & Martin, 1998). We argue that any social group treated in these ways would be the target of high levels of social stereotyping and prejudice among children.
D. DEVELOPMENTAL AND INDIVIDUAL DIFFERENCE VARIABLES
Thus far, we have focused largely on the exogenous factors that influence what children attend to in their environments that will lead them to see particular criteria as relevant for social grouping, and that will lead them to generate or learn the content of stereotypes, and show affective biases toward groups. In addition, we believe that endogenous factors—that is, factors within the individual child—affect the formation of stereotypes and prejudice. We focus on two such factors here, one a developmental variable drawn from the cognitive domain—classification skill—and one an individual difference variable drawn from the socioemotional domain—self-esteem. These are meant only as illustrations. Future work is needed to identify and test the role of other relevant endogenous factors. 1. Classification Skill Because categorization is central to the process of stereotype and prejudice formation, children’s ability to categorize others consistently along a particular dimension should be relevant to their tendency to develop social stereotypes and biases (see CEI component of Figure 1). At what age might it be possible for children to sort individuals into social categories? Several research methods are used to assess classification skill. Habituation procedures are commonly used with infants. In a typical study, infants are shown a series of pictures of members of one category (e.g., females) until their looking time to each picture decreases substantially. Next, a new member of that same category (e.g., a previously unseen female) and a new member of a different category (e.g., a male) are shown in sequence to the infants. If infants discriminate the perceptual features that separate one group from another, they should look longer at the stimulus from the new category (e.g., look longer at the male than female face because the latter would be relatively boring because of its shared membership with the habituated category). Habituation studies indicate that children develop an ability to discriminate individuals on
76
Rebecca S. Bigler and Lynn S. Liben
the basis of some groups (e.g., gender) between 3 and 7 months of age (Quinn et al., 2002). Although one might conclude from this work that infants are cognitively equipped to form categories based on social groups such as sex and race as a result of encountering individuals in the world, such a conclusion would be premature for several reasons. First, habituation studies are unable to distinguish between pre-existing categories and those that are created during familiarization trials. That is, the experimental experience might teach children to categorize on the basis of some dimension. For example, repeatedly presenting pictures of people wearing hats, followed by a pair in which one person wore a hat and one did not, might well elicit longer looking times to the hatless person. This result would demonstrate infants’ ability to discriminate between people with vs. without hats, but would not allow the conclusion that ‘‘hat wearers vs. non-hat wearers’’ was a pre-existing, welldeveloped, and meaningful social category to infants prior to their participation in the study. Second, the stimuli presented in habituation studies are typically selected to reduce complexity (e.g., employing same race and same age stimuli), and are thus unlike most experiences encountered outside the lab. In one of the only papers to demonstrate infants’ ability to detect gender from facial cues, Quinn et al. (2002) presented babies with 8 male vs. 8 female European American fashion models. Stimuli that are constrained in this way (eliminating naturally occurring variability) are likely to facilitate categorization or the learning of novel categories. Indeed, in a study by Wild et al. (2000) in which stimuli represented a broader range of masculinity and femininity (e.g., the females were of average attractiveness and did not wear makeup), the findings were noticeably different. Even at seven years of age, children performed just above chance when discriminating the sex of adult faces and were unable to determine the sex of children’s faces in the absence of hair and clothing cues. A second method used to study categorization in very young children is referred to as sequential touching, and involves recording the order in which children handle stimuli that belong to one of two classes (Sugarman, 1983). Research by Mandler, Fivush, and Reznick (1987) and others (e.g., Mandler, Bauer, & McDonough, 1991; Mandler & McDonough, 1998) using sequential touching tasks indicates that most children can consistently sort objects into two categories between 18 and 24 months of age. Little work has yet examined children’s social cognitions using a sequential touching paradigm, but one existing study suggests that stimuli that vary by gender may also elicit category-based responding between 18 and 30 months of age. Johnston et al. (2001) reported that infants show a sharp increase in gender categorization between 18 and 22 months of age. Future research should evaluate the
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
77
links between toddlers’ spontaneous categorization of encountered individuals along some attribute and social stereotyping and prejudice on the basis of that attribute. A third method used to examine classification skill is by asking children to sort stimuli into two or more groups, sequentially or simultaneously (Inhelder & Piaget, 1964). Most researchers agree that children can intentionally sort people into two non-overlapping categories on the basis of some observable attribute (e.g., gender) by the age of three (Thompson, 1975). Furthermore, verbal labeling (discussed earlier) promotes such classification. At this age, however, children are unable to immediately re-sort the same individuals along a second dimension (e.g., height or race). Thus our theory predicts that children should begin to show consistent classification of others on the basis of social categories and, as a result, to show significant levels of stereotyping and prejudices during the third year. We expect that children who are unable to classify people or objects consistently along a single dimension should show relatively little stereotyping and ingroup bias—regardless of the social environment. Thus, infants younger than 18 months should be unlikely to show stereotyping and prejudice. Nonetheless, classification skill probably need not be perfect to support stereotyping and prejudice. That is, social stereotyping is likely to occur among children whose classification of individuals is marked by occasional errors. We hypothesize, therefore, that the formation of social stereotyping and prejudice can conceivably begin quite early in development, probably around 18 months of age. The formation of social stereotypes at this age should, however, be confined to dimensions that are perceptually salient (e.g., race, gender, age). During the preschool years—after acquiring the ability to sort individuals into groups—children show various forms of rigidity to their classification of stimuli. They see the world in terms of simplistic either/or categories, with few shades of gray and little room for contradictions. So, for example, one child inquired, as each character in a movie appeared, whether the person was a ‘‘good guy’’ or a ‘‘bad guy.’’ This limitation facilitates prejudice by preventing the child from seeing individuals as having combinations of good and bad qualities. Furthermore, they center, or focus, on single dimensions of people, becoming ‘‘stuck’’ paying attention to one attribute such as gender or race (Bigler & Liben, 1992). Gradually, children acquire the ability to sort along two dimensions consecutively, and still later, they are able to sort along two dimensions simultaneously, as in categorizing objects into a 2 2 matrix (Inhelder & Piaget, 1964). The acquisition of multiple classification skill, in particular, is hypothesized to affect the process of stereotype and prejudice modification and maintenance (see the ASF component of Figure 2). Children who lack multiple classification skill have difficulty understanding that the same stimulus person can be a member
78
Rebecca S. Bigler and Lynn S. Liben
simultaneously of two groups that they do not normally conceptualize as overlapping (e.g., being both a ‘‘woman’’ and a ‘‘fire fighter’’). The processing of categorizing stereotype-consistent individuals (e.g., male fire fighters) does not, in contrast, challenge multiple classification skills because the two groups in question (e.g., ‘‘men’’ and ‘‘fire fighters’’) are—to a child who endorses gender stereotypes—entirely overlapping. Thus, children with less developed classification skills are hypothesized to show higher rates of distortion and forgetting of counterstereotypic information than are children with advanced classification skills, a hypothesized relation that has been supported empirically (Bigler & Liben, 1992, 1993). 2. Self-esteem Early theories of intergroup attitudes posited that attitudes toward ingroups are rooted in attitudes about the self, and thus positive self-views lead to the formation of positive ingroup attitudes, whereas negative self-views lead to the formation of negative ingroup attitudes (Ehrlich, 1973). Later theorists (e.g., Tajfel & Turner, 1986), in contrast, posited the inverse directional effect, namely that ingroup bias is a method for acquiring positive self-views. Thus they predicted that individuals with low self-esteem might raise their self-esteem by denigrating outgroup members. However, a comprehensive meta-analysis of the social psychological literature found no evidence that the motivation to enhance self-esteem underlies intergroup bias (Rubin & Hewstone, 1998). Furthermore, some empirical research on the links between self-esteem and intergroup bias suggests that higher (rather than lower) self-esteem is associated with ingroup bias among elementary-school-age children (Bigler, Jones, & Lobliner, 1997; Gagnon & Morasse, 1995) and adults (Abrams & Hogg, 1988; Gramzow & Gaertner, 2005). The relation between self-esteem and social stereotyping and prejudice may be especially pronounced among preschool and early elementary school-age children. Aboud (1988) has argued that young children generalize their highly favorable self-views to perceptually similar (but not dissimilar) others, leading to the formation of ingroup bias (e.g., ‘‘I’m smart, therefore, all white people are smart’’). Furthermore, as discussed earlier, young children lack sophisticated classification skills and thus may generalize their positive feelings about themselves to all other ingroup members in a simplistic and rigid fashion (Kohlberg, 1966). We expect, therefore, that variation in children’s self-esteem will be related to social stereotyping and prejudice. Finally, researchers have begun to examine individuals’ group, as well as their personal, self-esteem. Some theorists have argued that individuals’ assessments of their social group’s skills and competencies, referred to as collective selfesteem (Crocker & Luhtanen, 1990), may predict their social stereotyping and prejudice. Specifically, those individuals who show more positive assessment
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
79
of their own social group are predicted to exhibit greater stereotyping of, and prejudice toward, outgroups than those individuals with less positive assessment of their own social group (Crocker & Luhtanen, 1990). To our knowledge, no empirical work, however, has examined collective self-esteem or its relation to (a) personal self-esteem or (b) social stereotyping and prejudice among children, a topic worthy of future research. In short, many developmental and individual difference variables appear to be relevant for the development of social stereotypes and prejudice. So far, there is suggestive but not yet definitive work that is consistent with the hypothesized processes. Empirical research must thus go hand in hand with the building of conceptual models such as the one offered by developmental intergroup theory.
VIII. Summary and Conclusions Developmental intergroup theory specifies the mechanisms and rules that govern the processes by which children single out groups as targets of stereotyping and prejudice, and by which children learn and construct both the characteristics (i.e., stereotypes) and affective responses (i.e., prejudices) that are associated with these groups in their culture. Specifically, we argue that children have a drive to understand their world, and that this drive is manifested in their tendency to classify natural and non-natural stimuli into categories, and to search the environment for cues about which of the great number of potential bases for categorization are important. The first step in the process of stereotype and prejudice formation is, therefore, the establishment of the psychological salience of some particular set of dimensions. Four factors are hypothesized to affect the establishment of the psychological salience of person attributes: (1) perceptual discriminability of social groups, (2) proportional group size, (3) explicit labeling and use of social groups, and (4) implicit use of social groups. We argue that person characteristics that are perceptually discriminable are more likely than other characteristics to become the basis of stereotyping, but that perceptual discriminability alone is insufficient to trigger psychological salience. Thus, for example, young children’s ability to detect race or gender does not mean that these distinctions will inevitably become the bases of stereotypes and prejudice. Instead, for perceptually salient groups to become psychologically salient, one or more additional circumstances must hold, including being characterized by minority status, by adults’ use of different labels for different groups, by adults using group divisions functionally, or by segregation. After a particular characteristic that may be used to differentiate among individuals becomes salient, we propose that children who have the ability
80
Rebecca S. Bigler and Lynn S. Liben
to sort consistently will then categorize newly encountered individuals along this dimension. The act of categorization then triggers the process of social stereotyping and prejudice formation. Four factors are hypothesized to have an impact on the processes of forming stereotypes and prejudice. These include: (1) essentialism, (2) ingroup bias, (3) explicit attributions to social groups, and (4) group-attribute covariation. As noted throughout this chapter, there has been relatively little developmental work on many of the processes outlined here. Although findings from our own programs of research are consistent with the role of factors we have identified in the theory (e.g., the role of minority status, segregation, labeling and functional use of groups have all been shown to influence children’s evaluations and beliefs about social groups), far more extensive research is needed. In addition to testing the reliability and generalizability of past findings to other samples, other research laboratories, and other experimentally manipulated groups, future work must move these theoretical models into the laboratory of the real world. If the tenets of developmental intergroup theory are correct, there would be many implications for social, educational, and legal policies related to social groups. We noted, for example, ways in which race and gender are made psychologically salient (e.g., the use of labels; segregated conditions). Importantly, factors such as these are largely under societal control. That is, institutions and individuals can choose to routinely label and use some particular category within children’s environments or not. It is a violation of federal law, for example, for public school teachers to ask the children in their classrooms to line up at the door by race. In contrast, no federal or state law prohibits teachers from organizing their classrooms by sex. Should such laws be enacted? There can also be social controls on various forms of social segregation. Is it within individual children’s rights to affiliate only with samesex or same-race individuals? Is it acceptable for children and adolescents to exclude peers from their games, play, study groups, or other cliques on the basis of gender, race, age, or ethnicity? Finally, social institutions such as schools offer potential opportunities for intervention programs. What, if any, programs should be offered or required? Should curricula explicitly discuss social stereotyping and prejudice? Should children be taught negative information about people with whom they share some characteristic to reduce ingroup favoritism? Our hope is that developmental intergroup theory will ultimately prove valuable not only for understanding the development of social stereotypes and prejudices in children, but also for guiding social interventions that can ultimately prevent the development of stereotypes and prejudices in individuals and society.
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
81
REFERENCES Aarim-Herlot, N. (2003). Chinese immigrants, African Americans, and racial anxiety in the United States 1848 – 82. Urbana, IL: University of Illinois Press. Aboud, F. E. (1988). Children and prejudice. Oxford: Basil Blackwell. Aboud, F. E., & Doyle, A. B. (1996). Parental and peer influences on children’s racial attitudes. International Journal of Intercultural Relations, 20, 371 – 383. Aboud, F. E., & Mitchell, F. G. (1977). Ethnic role taking: the effects of preference and self-identification. International Journal of Psychology, 12, 1 – 17. Abrams, D. (1985). Focus of attention in minimal intergroup discrimination. British Journal of Social Psychology, 24, 65 – 74. Abrams, D., & Hogg, M. A. (1988). Comments on the motivational status of selfesteem and intergroup discrimination. European Journal of Social Psychology, 18, 317 – 334. Adorno, T. W., Frenkel-Brunswik, E., Levinson, D. J., & Sanford, R. N. (1950). The authoritarian personality. New York, NY: Harper & Row. Allen, V. L., & Wilder, D. A. (1975). Categorization, belief similarity, and group discrimination. Journal of Personality and Social Psychology, 32, 971 – 977. Allport, G. W. (1954). The nature of prejudice. Cambridge, MA: Addison-Wesley. Averhart, C. J., & Bigler, R. S. (1997). Shades of meaning: Skin tone, racial attitudes, and constructive memory in African American children. Journal of Experimental Child Psychology, 67, 363 – 388. Banaji, M. R., & Greenwald, A. G. (1995). Implicit gender stereotyping in judgments of fame. Journal of Personality and Social Psychology, 68, 181 – 198. Bandura, A. (1977). Social learning theory. Englewood Cliffs, N.J. Prentice-Hall. Banks, J. A. (1995). Multicultural education: Its effects on students’ racial and gender role attitudes. In J. A. Banks & C. M. Banks (Eds.), Handbook of research on multicultural education (pp. 617 – 627). New York: Macmillan. Barreca, R. (1992). They used to call me Snow White, but I drifted: Women’s strategic use of humor. New York: Penguin. Bem, S. L. (1974). The measurement of psychological androgyny. Journal of Clinical and Consulting Psychology, 42, 155 – 162. Bem, S. L. (1981). Gender schema theory: A cognitive account of sex typing. Psychological Review, 88, 354 – 364. Bem, S. L. (1983). Gender schema theory and its implications for child development: Raising gender-aschematic children in a gender-schematic society. Signs, 8, 598 – 616. Bialer, S. (1985). The psychology of U.S.-Soviet relations. Political Psychology, 6, 19 – 28. Bigler, R. S. (1995). The role of classification skill in moderating environmental effects on children’s gender stereotyping: A study of the functional use of gender in the classroom. Child Development, 66, 1072 – 1087. Bigler, R. S. (1999). The use of multicultural curricula and materials to counter racism in children. Journal of Social Issues, 55, 687 – 705. Bigler, R. S. (2004). The role of segregation in the formation of children’s intergroup attitudes. In S. Levy (Chair), Integrating developmental and social psychological research on prejudice processes. Symposium conducted at the 5th annual meeting of the Society for Personality and Social Psychology, Austin, TX.
82
Rebecca S. Bigler and Lynn S. Liben
Bigler, R. S., Averhart, C. J., & Liben, L. S. (2003). Race and the work force: Occupational status, aspirations, and stereotyping among African American children. Developmental Psychology, 39, 572 – 580. Bigler, R. S., Brown, C. S., & Markell, M. (2001). When groups are not created equal: Effects of group status on the formation of intergroup attitudes in children. Child Development, 72, 1151 – 1162. Bigler, R. S., Jones, L. C., & Lobliner, D. B. (1997). Social categorization and the formation of intergroup attitudes in children. Child Development, 68, 530 – 543. Bigler, R. S., & Liben, L. S. (1990). The role of attitudes and intervention in genderschematic processing. Child Development, 61, 1440 – 1452. Bigler, R. S., & Liben, L. S. (1992). Cognitive mechanism in children’s gender stereotyping: Theoretical and educational implications of a cognitive-based intervention. Child Development, 63, 1351 – 1363. Bigler, R. S., & Liben, L. S. (1993). A cognitive-developmental approach to racial stereotyping and reconstructive memory in Euro-American children. Child Development, 64, 1507 – 1518. Billig, M., & Tajfel, H. (1973). Social categorization and similarity in intergroup behavior. European Journal of Social Psychology, 3, 27 – 52. Bogardus, E. S. (1925). Measuring social distances. Journal of Applied Sociology, 9, 299 – 308. Bransford, J. D., & Franks, J. J. (1971). The abstraction of linguistic ideas. Cognitive Psychology, 2, 331 – 350. Brewer, M. B. (1979). In-group bias in the minimal intergroup situation: A cognitivemotivational analysis. Psychological Bulletin, 86, 307 – 324. Brewer, M. B. (1991). The social self: On being the same and different at the same time. Personality and Social Psychology Bulletin, 17, 475 – 482. Brewer, M. B. (1997). The social psychology of intergroup relations: Can research inform practice? Journal of Social Issues, 53, 197 – 211. Brewer, M. B., & Brown, R. J. (1998). Intergroup relations. In D. T. Gilbert, S. T. Fiske, & G. Lindsey (Eds.), The handbook of social psychology: Vol. 2. (4th ed., pp. 554 – 594). New York: McGraw-Hill. Brewer, M. B., & Silver, M. (1978). Ingroup bias as a function of task characteristics. European Journal of Social Psychology, 8, 393 – 400. Brown, C. S., & Bigler, R. S. (2002). Effects of minority status in the classroom on children’s intergroup attitudes. Journal of Experimental Child Psychology, 83, 77 – 110. Brown, R. J. (1986). Social psychology (2nd ed.) New York: Free Press. Brown, R. J. (1995) Prejudice: Its social psychology. Oxford: Blackwell. Bussey, K., & Bandura, A. (1992). Self-regulatory mechanisms governing gender development. Child Development, 63, 1236 – 1250. Bussey, K., & Bandura, A. (1999). Social cognitive theory of gender development and differentiation. Psychological Review, 106, 696 – 713. Chen, M., & Bargh, J. A. (1997). Nonconscious behavioral confirmation processes: The self-fulfilling consequences of automatic stereotype activation. Journal of Experimental Social Psychology, 33, 541 – 560. Coker, D. R. (1984). The relationships among gender concepts and cognitive maturity in preschool children. Sex Roles, 10, 19 – 31. Crocker, J., & Luhtanen, R. K. (1990). Collective self-esteem and ingroup bias. Journal of Personality and Social Psychology, 58, 60 – 67.
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
83
Crocker, J., & Schwartz, I. (1985). Prejudice and ingroup favoritism in a minimal intergroup situation: Effects of self-esteem. Personality and Social Psychology Bulletin, 11, 379 – 386. Devine, P. (1989). Stereotypes and prejudice: Their automatic and controlled components. Journal of Personality and Social Psychology, 56, 5 – 18. Doise, W., Deschamps, J.-C., & Meyer, G. (1978). The accentuation of intracategory similarities. In H. Tajfel (Ed.), Differentiation between social groups: Studies in the social psychology of intergroup relations (pp. 159 – 170). London: Academic Press. Dovidio, J. F., Kawakami, K., & Gaertner, S. L. (2002). Implicit and explicit prejudice and interracial interaction. Journal of Personality and Social Psychology, 82, 62 – 69. Doyle, A. B., & Aboud, F. E. (1995). A longitudinal study of White children’s racial prejudice as a social-cognitive development. Merrill-Palmer Quarterly, 41, 209 – 228. Duckitt, J. (1992). Psychology and prejudice: A historical analysis and integrative framework. American Psychologist, 47, 1182 – 1193. Eccles, J. S., Wigfield, A., Harold, R. D., & Blumenfeld, P. (1993). Age and gender differences in children’s self- and task perceptions during elementary school. Child Development, 64, 830 – 847. Ehrlich, H. J. (1973). The social psychology of prejudice. New York: Wiley. Ezekiel, R. S. (1995). The racist mind: Portraits of American Neo-Nazis and Klansmen. New York: Viking. Fishbein, H. D. (2002). Peer prejudice and discrimination (2nd ed.). Mahwah, NJ: Lawrence Erlbaum. Fiske, S. T., & Taylor, S. E. (1991). Social cognition (2nd ed.). New York: McGraw-Hill. Gagnon, A., & Morasse, I. (1995, March). Self-esteem and intergroup discrimination in second grade school children. Paper presented at the biennial meeting of the Society for Research in Child Development, Indianapolis, IN. Gelman, S. A. (2003). The essential child. New York: Oxford University Press. Gelman, S. A., & Heyman, G. D. (1999). Carrot-eaters and creature-believers: The effects of lexicalization on children’s inferences about social categories. Psychological Science, 10, 487 – 493. Gelman, S. A., Taylor, M. G., & Nguyen, S. P. (2004). Mother-child conversations about gender. Monographs of the Society for Research in Child Development, 69 (1, Serial No. 275). Gelman, S. A., & Wellman, H. M. (1991). Insides and essences: Early understandings of the nonobvious. Cognition, 38, 213 – 244. Go´mez, R. L., & Maye, J. (2005). The developmental trajectory of nonadjacent dependency learning. Infancy, 7, 183 – 206. Gramzow, R. H., & Gaertner, L. (2005). Self-esteem and favoritism toward novel ingroups: The self as an evaluative base. Journal of Personality and Social Psychology, 88, 801 – 815. Greenwald, A. G., & Banaji, M. R. (1995). Implicit social cognition: Attitudes, selfesteem, and stereotypes. Psychological Review, 102, 4 – 27. Hamilton, D. L., & Sherman, S. J. (1989) Illusory correlations: Implications for stereotype theory and research. In D. Bar-Tal, C. F. Graumann, A. W. Kruglanski, (Eds.), Stereotyping and prejudice: Changing conceptions (pp. 59 – 82). New York: Springer Verlag.
84
Rebecca S. Bigler and Lynn S. Liben
Hamilton, D. L., & Trolier, T. K. (1986). Stereotypes and stereotyping: An overview of the cognitive approach. In J. F. Dovidio & S. L. Gaertner (Eds.), Prejudice, discrimination, and racism (pp. 127 – 163). Orlando, FL: Academic Press. Hewstone, M., & Brown, R. (1986). Contact is not enough: An intergroup perspective on the contact hypothesis. In M. Hewstone & R. Brown (Eds.), Contact and conflict in intergroup encounters (pp. 1 – 44). Oxford: Blackwell. Heyman, G. D., & Gelman, S. A. (2000). Preschool children’s use of novel predicates to make inductive inferences about people. Cognitive Development, 15, 263 – 280. Hirschfeld, L. A. (1993). Discovering social difference: The role of appearance in the development of racial awareness. Cognitive Psychology, 25, 317 – 350. Hirschfeld, L. A. (1996). Race in the making: Cognition, culture, and the child’s construction of human kinds. Cambridge, MA: MIT Press. Ignatiev, N. (1995). How the Irish became white. New York: Routledge. Inhelder, B., & Piaget, J. (1964). The early growth of logic in the child. London: Routledge & Kegan Paul Ltd. Johnston, K. E., Bittinger, K., Smith, A., & Madole, K. L. (2001). Developmental changes in infants’ and toddlers’ attention to gender categories. Merrill-Palmer Quarterly, 47, 563 – 584. Johnston, K. E., & Jacobs, J. (2003). Children’s illusory correlations: The role of attentional bias in group impression formation. Journal of Cognition and Development, 4, 129 – 160. Katz, P. A. (1983). Developmental foundations of gender and racial attitudes. In R. L. Leahy (Ed.), The child’s construction of social inequality (pp. 41 – 78). New York: Academic Press. Katz, D., & Braly, K. (1933). Racial stereotypes of one hundred college students. Journal of Abnormal and Social Psychology, 28, 280 – 290. Katz, P. A., & Kofkin, J. A. (1997). Race, gender, and young children. In S. S. Luthar, J. A. Burack, D. Cicchetti, & J. Weisz (Eds.), Developmental psychopathology: Perspectives on adjustment, risk, and disorder (pp. 51 – 74). New York: Cambridge University Press. Klein, G. W. (1957). All but my life: A memoir. New York, NY: Hill & Wang. Klein, O., & Snyder, M. (2003). Stereotypes and behavioral confirmation: From interpersonal to intergroup perspectives. In M. P. Zanna (Ed.), Advances in experimental social psychology: Vol. 35. (pp. 153 – 234). San Diego, CA: Academic Press. Kohlberg, L. (1966). A cognitive-developmental analysis of children’s sex-role concepts and attitudes. In E. E. Maccoby (Ed.), The development of sex differences (pp. 82 – 173). Stanford, CA: Stanford University Press. Koslin, S., Amarel, M., & Ames, N. (1969). A distance measure of racial attitudes in primary grade children: An exploratory study. Psychology in Schools, 6, 382 – 385. Kowalski, K., & Kanitkar, K. (2003, April). Ethnicity and gender in the kindergarten classroom: A naturalistic study. Poster presented that the biennial meeting of the Society for Research in Child Development, Tampa, FL. Kowalski, K., & Lo, Y.-F. (2001). The influence of perceptual features, ethnic labels, and sociocultural information on the development of ethnic/racial bias in young children. Journal of Cross-Cultural Psychology, 32, 444 – 455. Kurzban, R., Tooby, J., & Cosmides, L. (2001). Can race be erased? Coalitional computation and social categorization. Proceedings of the National Academy of Sciences, 98 (26), 15387 – 15392.
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
85
Langlois, J. H. (1995). The origins and functions of appearance-based stereotypes: Theoretical and applied implications. In R. Eder (Ed.), Craniofacial anomalies: Psychological perspectives (pp. 22 – 47). New York: Springer-Verlag. LaPiere, R. T. (1936). Type-rationalizations of group antipathy. Social Forces, 15, 232 – 237. Lee, Y. T., Jussim, L. J., & McCauley, C. R. (1995). Stereotype accuracy. Washington DC: American Psychological Association. Lerner, R. M., & Gellert, E. (1969). Body build identification, preference, and aversion in children. Developmental Psychology, 1, 456 – 462. Levy, S. R., & Dweck, C. S. (1999). The impact of children’s static vs. dynamic conceptions of people on stereotype formation. Child Development, 70, 1163 – 1180. Levy, S. R., Stroessner, S. J., & Dweck, C.S. (1998). Stereotype formation and endorsement: The role of implicit theories. Journal of Personality and Social Psychology, 74, 1421 – 1436. Lewis Mumford Center. (2001). Ethnic diversity grows, neighborhood integration lags behind. Albany, New York: SUNY. Liben, L. S., & Bigler, R. S. (1987). Reformulating children’s gender schemata. In L. S. Liben & M. L. Signorella (Eds.), Children’s gender schemata. New Directions for Child Development, No. 38 (pp. 89 – 105). San Francisco: Jossey-Bass. Liben, L. S., & Bigler, R. S. (2002). The developmental course of gender differentiation: Conceptualizing, measuring, and evaluating constructs and pathways. Monographs of the Society for Research in Child Development, 67 (2, Serial No. 269). Liben, L. S., Bigler, R. S., & Krogh, H. R. (2001). Pink and blue collar jobs: Children’s judgments of job status and job aspirations in relation to sex of worker. Journal of Experimental Child Psychology, 79, 346 – 363. Liben, L. S., Bigler, R. S., & Krogh, H. R. (2002). Language at work: Children’s gendered interpretations of occupational titles. Child Development, 73, 810 – 828. Liben, L. S., & Signorella, M. L. (1980). Gender-related schemata and constructive memory in children. Child Development, 51, 11 – 18. Liben, L. S., & Signorella, M. L. (1993). Gender-schematic processing in children: The role of initial interpretations of stimuli. Developmental Psychology, 29, 141 – 149. Lippmann, W. (1922). Public opinion. New York: Harcourt, Brace. Livesley, W. J., & Bromley, D. B. (1973). Person perception in childhood and adolescence. New York: Wiley. Maccoby, E. E. (1990). Gender and relationships: A developmental account. American Psychologist. 45, 513 – 520. Mandler, J. M., Bauer, P. J., & McDonough, L. (1991). Separating the sheep from the goats: Differentiating global categories. Cognitive Psychology, 23, 263 – 298. Mandler, J. M., Fivush, R., & Reznick, J. S. (1987). The development of contextual categories. Cognitive Development, 2, 339 – 354. Mandler, J. M., & McDonough, L. (1998). On developing a knowledge base in infancy. Developmental Psychology, 34, 1274 – 1288. Martin, C. L., & Fabes, R. A. (2001). The stability and consequences of same-sex peer interactions. Developmental Psychology, 37, 431 – 446. Martin, C. L., & Halverson, C. F. (1981). A schematic processing model of sex typing and stereotyping in children. Child Development, 52, 1119 – 1134. Martin, C. L., & Halverson, C. F. (1983). The effects of sex-typing schemas on young children’s memory. Child Development, 54, 563 – 574.
86
Rebecca S. Bigler and Lynn S. Liben
McCauley, C., & Stitt, C. L. (1978). An individual and quantitative measure of stereotypes. Journal of Personality and Social Psychology, 36, 929 – 940. McConahay, J. B. (1986). Modern racism, ambivalence, and the modern racism scale. In J. Dovidio & S. Gaertner (Eds.), Prejudice, discrimination, and racism (pp. 91 – 125). New York: Academic Press. McConnell, A. R., Sherman, S. J., & Hamilton, D. L. (1994). Illusory correlations in the perceptions of groups: An extension of the distinctiveness-based account. Journal of Personality and Social Psychology, 67, 414 – 429. Medin, D. (1989). Concepts and conceptual structure. American Psychologist, 44, 1469 – 1481. Mervis, C., & Rosch, E. (1981). Categorization of natural objects. Annual Review of Psychology, 32, 89 – 115. Messick, D. M., & Mackie, D. M. (1989). Intergroup relations. Annual Review of Psychology, 40, 45 – 81. Milner, D. (1983). Children and race. London: Sage. Mullen, B. (1991). Group composition, salience, and cognitive representation: The phenomenology of being in a group. Journal of Experimental Social Psychology, 27, 297 – 323. Mullen, B., & Hu, L. (1989). Perceptions of ingroup and outgroup variability: A meta-analytic integration. Basic and Applied Social Psychology, 10, 233 – 252. Neuberg, S. L., Kenrick, D. T., Maner, J. K., & Schaller, M. (2004). From evolved motives to everyday mentation: Evolution, goals, and cognition. In J. Forgas & K. Williams (Eds.), Social motivation: Conscious and unconscious processes (pp. 133 – 152). New York: Cambridge University Press. Orfield, G. (1996). Dismantling desegregation: The quiet reversal of Brown vs. Board of Education. New York, NY: The New Press. Orfield, G. (2001). Schools more separate: Consequences of a decade of resegregation. Cambridge, MA: The Civil Rights Project Harvard University. Osborne, J. W. (1997). Race and academic disidentification. Journal of Educational Psychology, 89, 728 – 735. Overton, W. F. (1998). Develomental psychology: Philosophy, concepts, and methodology. In W. Damon (Series Ed.) & R. M. Lerner (Vol. Ed.), Handbook of child psychology, Vol. I: Theoretical models of human development (5th ed., pp. 107 – 188). New York: Wiley. Park, B., & Rothbart, M. (1982). Perception of out-group homogeneity and levels of social categorization: Memory for subordinate attributes of in-group and out-group members. Journal of Personality and Social Psychology, 42, 1051 – 1068. Parker, G. M. (1997). Trespassing: My sojourn in the halls of privilege. New York: Houghton Mifflin. Peters, W. (Producer). (1970). Eye of the storm [motion picture]. New York, NY: ABC News. Pettigrew, T. F. (1971). Racially separate or together? New York: McGraw-Hill. Pettigrew, T. F., & Tropp, L. R. (2000). Does intergroup contact reduce prejudice?: Recent meta-analytic findings. In S. Oskamp (Ed.), Reducing prejudice and discrimination (pp. 93 – 114). Mahwah, NJ: Lawrence Erlbaum. Piaget, J. (1929) The child’s conception of the world. London: Routledge. Piaget, J. (1970). Piaget’s theory. In P. H. Mussen (Ed.), Carmichael’s manual of child psychology (pp. 703 – 732). New York: Wiley. Piaget, J., & Inhelder, B. (1956). The child’s conception of space. New York: Norton.
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
87
Piaget, J., & Inhelder, B. (1975). The origin of the idea of chance in children. New York: Norton. Pollock, D. C., & Van Reken, R. E. (1987). Third culture kids. Yarmouth, ME: Intercultural Press. Powlishta, K. K. (1995a). Intergroup processes in childhood: Social categorization and sex role development. Developmental Psychology, 31, 781 – 788. Powlishta, K. K. (1995b). Gender bias in children’s perceptions of personality traits. Sex Roles, 32, 17 – 28. Powlishta, K. K. (2004). Gender as a social category: Intergroup processes and genderrole development. In M. Bennett & F. Sani (Eds.), The development of the social self (pp. 103 – 133). East Sussex, England: Psychology Press. Powlishta, K. K., & Serbin, L. A. (1993, March). Ingroup favoritism and the acquisition of gender stereotypes. Poster presented at the biennial Meeting of the Society for Research in Child Development, New Orleans, LA. Quattrone, G. A., & Jones, E. E. (1980). The perception of variability within ingroups and outgroups: Implications for the law of small numbers. Journal of Personality and Social Psychology, 38, 141 – 152. Quinn, P. C., Yahr, J., Kuhn, A., Slater, A. M., & Pascalis, O. (2002). Representation of the gender of human faces by infants: A preference for female. Perception, 31, 1109 – 1121. Renold, E. (2001). Learning the ‘‘hard’’ way: Boys, hegemonic masculinity, and the negotiation of learner identities in the primary school. British Journal of Sociology of Education, 22, 369 – 385. Rubin, M., & Hewstone, M. (1998). Social identity theory’s self-esteem hypothesis: A review and some suggestions for clarification. Personality and Social Psychology Review, 2, 40 – 62. Ruble, D. N., & Martin, C. L. (1998). Gender development. In W. Damon (Series Ed.) & N. Eisenberg (Vol. Ed.), Handbook of child psychology: Vol. 3. Personality and social development (pp. 933 – 1016). New York: Wiley. Ruble, D. N., & Stangor, C. (1986). Stalking the elusive schema: Insights from developmental and social psychological analysis of gender schemas. Social Cognition, 4, 227 – 261. Rutland, A. (1999). The development of national prejudice, in-group favouritism and self-stereotypes in British children. British Journal of Social Psychology, 38, 55 – 70. Saffran, J. R. (2003). Statistical language learning: Mechanisms and constraints. Current Directions in Psychological Science, 12, 110 – 114. Samelson, F. (1978). From ‘‘Race Psychology’’ to ‘‘Studies in Prejudice’’: Some observations of the thematic reversals in social psychology. Journal of the History of the Behavioral Sciences, 14, 265 – 278. Sameroff, A. J., & Chandler, M. J. (1978). Reproductive risk and the continuum of caretaking casualty. In F. D. Horowitz (Ed.), Review of child development research (pp. 187 – 244). Chicago: University of Chicago Press. Schaller, M., Park, J., & Faulkner, J. (2003). Prehistoric dangers and contemporary prejudices. European Review of Social Psychology, 14, 105 – 137. Seefeldt, C., Jantz, R. K., Galper, A., & Serlock, K. (1977). Using pictures to explore children’s attitudes toward the elderly. The Gerontologist, 17, 506 – 512. Serbin, L. A., Powlishta, K. K., & Gulko, L. (1993). The development of sex typing in middle childhood. Monographs of the Society for Research in Child Development, 58 (2, Serial No. 232).
88
Rebecca S. Bigler and Lynn S. Liben
Sherif, M., Harvey, O. J., White, B. J., Hood, W. R., & Sherif, C. W. (1954/1961). Intergroup conflict and cooperation: The Robbers Cave experiment. Norman, OK: University Book Exchange. Sherif, M., & Sherif, C. W. (1953). Groups in harmony and tension: An integration of studies on intergroup relations. New York: Octagon. Signorella, M. L. (1987). Gender schemata: Individual differences and context effects. In L. S. Liben & M. L. Signorella (Eds.), Children’s gender schemata: New Directions for Child Development, No. 38 (pp. 23 – 37). San Francisco: Jossey-Bass. Signorella, M. L., Bigler, R. S., & Liben, L. S. (1993). Developmental differences in children’s gender schemata: A meta-analytic review. Developmental Review, 13, 147 – 183. Signorella, M. L., Bigler, R. S., & Liben, L. S. (1997). A meta-analysis of children’s memories for own-sex and other-sex information. Journal of Applied Developmental Psychology, 18, 429 – 445. Signorella, M. L., & Liben, L. S. (1984). Recall and reconstruction of genderrelated pictures: Effects of attitude, task difficulty, and age. Child Development, 55, 393 – 405. Skinner, B. F. (1969). Contingencies of reinforcement. New York: Appleton-CenturyCrofts. Steele, C. M. (1997). A threat in the air: How stereotypes shape intellectual identity and performance. American Psychologist, 52, 797 – 811. Stevens, A., & Coupe, P. (1978). Distortions in judged spatial relations. Cognitive Psychology, 10, 422 – 437. Stringer, M., & Irwing, I. (1989). Intergroup relationship rules in Northern Ireland: The effect of denominational information on children’s ratings. Journal of Social and Personal Relationships, 15, 421 – 430. Sugarman, S. (1983). Children’s early thought: Developments in classification. New York: Cambridge. Swim, J. K., Aikin, K. J., Hall, W. S., & Hunter, B. A. (1995). Sexism and racism: Old-fashioned and modern prejudices. Journal of Personality and Social Psychology, 68, 199 – 214. Tajfel, H., & Billig, M. (1974). Familiarization and categorization in intergroup behavior. Journal of Experimental Social Psychology, 10, 159 – 170. Tajfel, H., & Turner, J. C. (1986). The social identity theory of intergroup behaviour. In S. Worchel & W. G. Austin (Eds.), Psychology of intergroup relations (pp. 7 – 24). Chicago: Nelson. Tajfel, H., Flament, C., Billig, M., & Bundy, R. (1971). Social categorization and intergroup behaviour. European Journal of Social Psychology, 1, 149 – 178. Taylor, S. E., Fiske, S. T., Etcoff, N. L., & Ruderman, A. J. (1978). Categorical and contextual bases of person memory and stereotyping. Journal of Personality and Social Psychology, 36, 778 – 793. Tenenbaum, H. R., & Leaper, C. (2002). Are parents’ gender schemas related to their children’s gender-related cognitions?: A meta-analysis. Developmental Psychology, 38, 615 – 630. Thompson, S. K. (1975). Gender labels and early sex role development. Child Development, 46, 339 – 347. Trautner, H. M., Ruble, D. N., Cyphers, L., Kirsten, B., Behrendt, R., & Hartmann, P. (2005). Rigidity and flexibility of gender stereotypes in childhood: Developmental or differential? Infant and Child Development, 14, 365 – 381.
A Developmental Intergroup Theory of Social Stereotypes and Prejudice
89
Tuan, M. (1998). Unraveling the ‘‘model minority’’ stereotype: Listening to Asian American youth. Journal of Asian American Studies, 1, 198 – 201. Turner, J. C. (1987). Rediscovering the social group: A self-categorization theory. Oxford: Blackwell. Weisstein, N. (1968). Kinder, kuche, kirche as scientific law: Psychology constructs the female. Boston, MA: New England Free Press. Weinstein, R. S., Gregory, A., & Strambler, M. J. (2004). Intractable self-fulfilling prophecies: Fifty years after Brown v. Board of Education. American Psychologist, 59, 511 – 520. Westervelt, V. D., & Turnbill, A. P. (1980). Children’s attitudes toward physically handicapped peers and intervention approaches for attitude change. Physical Therapy, 60, 896 – 901. White, R. (1998). Remarks to the Wisconsin Legislature, April 1, 1998. http:// my.execpc.com/dross/aw/regwhite.html, retrieved June 24, 2005. Wild, H. A., Barrett, S. E., Spence, M. J., O’Toole, A. J., Cheng, Y. D., & Brooke, J. (2000). Recognition and sex categorization of adults’ and children’s faces: Examining performance in the absence of sex-stereotyped cues. Journal of Experimental Child Psychology, 77, 269 – 291. Williams, J. E., & Morland, J. K. (1976). Race, color, and the young child. Chapel Hill, NC: University of North Carolina Press. Wilson, R. M. (1947). The reduction of intergroup tension. New York: Social Science Research Council. Zajonc, R. B. (1968). Attitudinal effects of mere exposure. Journal of Personality and Social Psychology, 9, 1 – 27.
This page intentionally left blank
INCOME POVERTY, POVERTY CO-FACTORS, AND THE ADJUSTMENT OF CHILDREN IN ELEMENTARY SCHOOL
Brian P. Ackerman and Eleanor D. Brown DEPARTMENT OF PSYCHOLOGY, UNIVERSITY OF DELAWARE, NEWARK, DE 19716, USA
I. INTRODUCTION II. FRAMING POVERTY RESEARCH
A. CONCEPTUAL ISSUES B. RESEARCH PROGRAM III. POVERTY CO-FACTORS
A. CONTEXTUAL CO-FACTORS B. REPRESENTATION C. THEORETICAL ADVANCES IV. DYNAMIC ASPECTS OF THE ECOLOGY OF DISADVANTAGE
A. FAMILY INSTABILITY B. PERSISTENCE OF ENVIRONMENTAL ADVERSITY V. PERSON-CENTERED APPROACHES
A. CONTINUITY AND CHANGE IN EXTERNALIZING PROBLEMS B. RESILIENCE C. CO-OCCURRING PROBLEMS VI. SUMMARY AND CONCLUSIONS REFERENCES
I. Introduction Developmental research about growing up poor in America has established several facts. One is that poverty poses profound risks for child and adolescent adjustment. Economic disadvantage is associated with a wide range of adjustment problems in school, for instance, including low academic achievement and high levels of grade retention and school dropout (Duncan & Brooks-Gunn, 1997a; McLoyd, 1998), problematic peer relations (Dodge, Pettit, & Bates, 1994; Kupersmidt, Burchinal, & Patterson, 1995), and high levels of both externalizing
91 Advances in Child Development and Behavior R Kail (Editor)
ß 2006 Elsevier B.V. All rights reserved.
92
Brian P. Ackerman and Eleanor D. Brown
and internalizing problem behaviors (Duncan & Brooks-Gunn, 1997a; Lahey, Moffitt, & Caspi, 2003). Another fact is that child outcomes are diverse. Despite the risk, most children from economically disadvantaged families seem to follow normative developmental trajectories and are relatively unproblematic both in school and at home. The third fact is that risk exposure also varies widely for poor children. Apart from impoverished economic resources, for instance, disadvantaged families differ greatly in personal, parental, and family circumstances. Although poverty is associated with residential moves, instability in marital and co-habitation relations, parental anti-social conduct and psychiatric morbidity, and other kinds of adversity, negative life events of these sorts occur infrequently. Thus, most disadvantaged children grow up in relatively stable and secure environments. This diversity in child outcomes and family circumstances constitutes a challenge for developmental researchers. The challenge is to construct an articulated description of the ecology of disadvantage over time in relation to child adjustment. Such a description is necessary for understanding proximal mechanisms linking economic resources and other family factors to child outcomes, and for eliminating third variable explanations of these linkages. The third variable problem has plagued poverty research historically (Duncan & Brooks-Gunn, 1997b; Mayer, 1997) primarily because researchers have failed to distinguish theoretically and empirically between impoverished economic resources and co-varying aspects of the ecology of disadvantaged families (i.e., poverty co-factors). Such a description also acknowledges the fundamentally dynamic nature of key poverty variables, as reflected in parent employment, household residence, and parent figures that often vary substantially and repeatedly over short periods of time. Capturing such dynamic processes requires longitudinal designs at the least. Most poverty research, in contrast, reflects a static snapshot perspective of family adversity, as embodied in cross-sectional designs that preclude examination of developmental processes and of the potentially powerful and independent impact of family changes on child functioning. Such a snapshot approach ignores critical developmental questions about the effects of variations in the timing and duration of risk exposure on child outcomes. Finally, an articulated description allows determination of the combinations of factors that distinguish children with serious and persistent adjustment difficulties. Individual factors usually do poorly in this regard (Liaw & Brooks-Gunn, 1994; Sameroff, Seifer, & Bartko, 1997). In this chapter, we describe research that attempts to meet this challenge, focusing primarily on our own longitudinal research program and on familylevel processes. We restrict our discussion to 4- to 11-year-old children, and we focus on school adjustment in conceptualizing child outcomes
Poverty Co-factors and the Adjustment of Children in Elementary School
93
because school is a principal context for development in middle childhood. A school focus also allows us to have different informants about predictor variables (e.g., the mother) and child outcomes (the teacher). In the next section, we provide a theoretical frame for conceptualizing diversity in family circumstances and child outcomes. The following sections apply the frame in developing empirical distinctions between income poverty and correlated family variables in relation to child adjustment, the relations between dynamic aspects of the poverty ecology and change in child behavior, and developmental effects associated with differences in the duration and timing of risk exposure. These sections are variable-centered. The fourth section takes a more personcentered approach in describing factors that distinguish groups of disadvantaged children exhibiting serious adjustment difficulties.
II. Framing Poverty Research It probably is fair to say that a social address perspective (see Bronfenbrenner, 1986) dominated poverty research by developmental psychologists through the 1980s. The perspective reduces to the assumption that a poverty address is a sufficient basis for predicting and understanding adjustment problems of children and adolescents. Researchers paid little attention to the diverse circumstances of poor families, the diverse outcomes of poor children, or the potentially diverse mechanisms linking chronic economic impoverishment and child adjustment. Good evidence is the special edition of the American Psychologist in 1989 devoted to children and their development. The only treatment of economic hardship in this edition is McLoyd’s seminal article advancing a conceptual model of economic loss and family functioning. The model, however, is based primarily on Elder’s research on the Great Depression (Elder, 1974, 1979) and on acute loss, not chronic impoverishment. The only treatment of chronic poverty in this edition is Haskins’ review and critique of early childhood educational interventions (i.e., Head Start). Such interventions arguably presuppose a main effect and social address model of poverty effects. Contrast the poverty research in this 1989 special edition with research just a few years later in the special poverty issue of Child Development in 1994, with the book-length multi-study examination of the consequences of growing up poor by Duncan and Brooks-Gunn (1997a), with McLoyd’s seminal article on socioeconomic disadvantage and child development in another special developmental edition of the American Psychologist in 1998, and with the important review of poverty effects and welfare reform by Duncan and Brooks-Gunn (2000). These later treatments focus explicitly on the ecology
94
Brian P. Ackerman and Eleanor D. Brown
of disadvantage from multivariate and developmental perspectives. The contrast cues significant conceptual advances in many areas. Four are particularly important for understanding trends in poverty research on children and in framing the research strategies and findings of our own longitudinal research program.
A. CONCEPTUAL ISSUES
1. Definition of Poverty The core issue of poverty research from a developmental perspective concerns the effects of social position and stratification on developmental trajectories and outcomes. The traditional measure of social position is socioeconomic status (SES), which has five levels and represents an aggregate of three variables: family income, the nature of parental employment, and parent education. This measure has proven to be unsatisfactory to developmental researchers for a variety of reasons (see Bornstein & Bradley, 2003), including the dramatically unequal weights of the three variables in predicting child outcomes, and the differential stability of the variables across short time periods. Family income may vary dynamically over short periods for many families, for instance, but not parent education. Most important for our purpose is that SES is a confounded measure of poverty per se, which concerns family income. A more acceptable measure for many researchers is the family income-toneeds ratio. The ratio reflects total family income from both employment and non-employment sources, including welfare, disability payments, and child support divided by a federal standard representing the income necessary to meet the minimal needs of families of various sizes. The threshold for a family to be living in poverty is 1.0, a convention for deep poverty is 0.5 (i.e., the ‘‘far’’ poor), and the term disadvantaged describes families with ratios 2.0. The standard for 2004 for a family of five, for instance, was $18,725, which means that families of four with total yearly incomes of $20,597 and $9362, respectively, would have income-to-needs ratios of 1.1 and 0.5. Our research program began with the children in Head Start Programs (aged four and five), which are federally funded preschool programs for children from economically disadvantaged families. We first collected income reports about employment (only) when the children reached first grade, and we collected income reports about both employment and non-employment sources in the third and fifth grade assessments. Our typical family at each assessment consisted of three children and two adults (i.e., including grandparents and co-habiting partners). If we estimate non-employment income for the first grade assessment, the total income-to-needs ratios ranged from 1.02 to 1.10 across the three grade-school assessments and averaged about 1.05. The average median
Poverty Co-factors and the Adjustment of Children in Elementary School
95
was 0.90. In addition, well over 95% of our families had ratios 2.0 in every assessment and the few that exceeded this threshold for disadvantage in one assessment usually were below the threshold in other assessments. It is also noteworthy that despite this stability in the estimates for the entire sample across time, there was extraordinary instability in yearly income estimates for individual families from assessment to assessment. Between the first and third grade assessments, for instance, and again between the third and fifth grade assessments, about 20% of our families gained a mean of about $20,000 in income estimates, and about 20% lost the same amount. Such income volatility is unlikely in samples of more advantaged families. The volatility argues for the usefulness of dynamic characterizations of the poverty ecology and of research strategies restricting samples to disadvantaged families. We focus on this point next. 2. Deficiency Models An historical problem of most poverty research has been its implicit endorsement of deficiency models. Coll et al. (1996) provide a definitive description of deficiency models and their problems for understanding children growing up poor and minority. The core feature is that a deficiency model implicitly defines economically disadvantaged children as maladjusted. Other features are that such models usually reflect a social address/main effect perspective, and often confound minority status and poverty status in characterizing those problems. The confound reflects the striking disproportion in the number of minority children who are poor in this country. Deficiency models are fueled in part by a research strategy that attempts to understand the risk by analyzing for SES (or income) effects for economically heterogeneous samples of families. The usual result is a strong inverse relation between child maladjustment and social class. Such a strategy, however, ignores both the social diversity among poor families and the diversity of the social and academic outcomes for poor children. The co-variation and collinearity of resource, poverty, and of family disruption, for instance, make it difficult to estimate the independent contributions of income and family circumstances to child adjustment. A solution we adopted in designing our research program was to restrict the sample to disadvantaged families and children. We have no middle income ‘‘controls’’ for income risk. Given the substantial reduction in co-variation among risk factors, a strong advantage is that we are able to relate the withingroup variation among disadvantaged families in social circumstances to child functioning in school. Some of the disadvantages are that (a) many of our measures of family and individual functioning are normed on middle-income families of European American descent, (b) the sample restriction tends to restrict the range on process and outcome measures, and (c) our sample is dominated by African American families (70 – 74% in various analyses), given the
96
Brian P. Ackerman and Eleanor D. Brown
setting in Delaware, a relatively prosperous mid-Atlantic state. Our sample also is too small to analyze consistently for ethnicity. In the analyses we have conducted, however, we have not found significant main effects or interactions. One other salient aspect of deficiency models is a focus on negative outcomes. Coll et al. (1996) and many other researchers (Gutman, Sameroff, & Eccles, 2002) recently have called for a complementary focus on positive outcomes for children at-risk, termed resiliency. The labels, definitions, and operationalizations of factors that promote resiliency remain inconsistent and confusing (see Luthar, 2003), and are beyond the scope of this chapter. Though we have isolated some variables that seem to promote adaptive functioning for our disadvantaged children (Ackerman et al., 1999), our main efforts at conceptualizing resiliency reflect our attempts to understand child desistance in problem behaviors and self-righting after a bad start in school (Ackerman, Brown, & Izard, 2003). Such attempts require person-centered research, which we describe later in the chapter. 3. The Third Variable Problem The third variable problem refers to the possibility that a relation between a particular poverty variable and child outcomes may be explained by other variables that relate to both. Divorce and single parent status, for example, are more common for poor families than for more advantaged families (Seccombe, 2000; White & Rogers, 2000), and relations between marital status and child adjustment problems are legion. The interrelations among the variables suggest the possibility that marital status could explain the relations between economic disadvantage and child adjustment problems, or the reverse. Historically, the third variable problem has centrally concerned interpretation of income effects. The issue is that the relations between income poverty per se and child problems could and often do reflect other prominent variables that correlate with income (Mayer, 1997), which Duncan and Brooks-Gunn (1997b) labeled as poverty co-factors. One solution then is to take the multivariate approach adopted by Duncan and Brooks-Gunn (1997a) in pulling together studies in which it was possible to simultaneously co-vary income, marital status, and maternal education in predicting child outcomes. The results provided interesting evidence both for and against omitted variable bias in interpreting income effects. The evidence for bias is that family income correlated with child problem behaviors at a zero-order level, but usually not in the context of controls for marital status and maternal education. The evidence against bias, in contrast, included direct income effects on markers of child intellectual ability and academic achievement even in the context of these co-factors. These direct effects also tend to be stronger for preschoolers and adolescents than for elementary school children. The implications for our
Poverty Co-factors and the Adjustment of Children in Elementary School
97
research program concern the need to consider poverty co-factors in interpreting effects for income poverty, and vice versa, and the potential selectivity of effects to child outcomes (Duncan & Brooks-Gunn, 2000). The third variable problem reflects more than simply alternative environmental risks. Indeed, the problem most generally concerns the enduring debate about social causation and social selection in interpreting poverty effects (Caspi, 2004; Costello et al., 2003). The social causation model focuses on the effects of environmental adversity in degrading family and child functioning. The social selection model, in contrast, focuses on the role of parent and child variables in selecting into adverse situations. The core third variable problem for poverty researchers concerns the role of unmeasured person variables in explaining relations between economic disadvantage and family and child functioning. These variables may be rooted in genetic characteristics, prenatal and perinatal problems, early socialization processes, and other factors. The question is, to what extent do person characteristics invite or select for poverty situations? Longitudinal research is helpful in teasing apart model claims to the extent that analyzing change over time allows an estimation of the effects of environmental circumstances independently of prior adaptation and person variables. The most important point raised by the third variable problem, however, is the need for poverty researchers to conceptualize child and parent effects in interpreting relations for environmental adversity. Costello et al. (2003) represent an excellent example of the debate and of the problems posed by third variable explanations of poverty effects. The core issue was whether common associations between poverty and childhood behavior problems and psychopathology reflect the harmful effects of environmental adversity and stress (i.e., social causation) or perhaps unmeasured familial liabilities that pull for chronic economic disadvantage (‘‘downward drift’’) and environmental adversity (i.e., social selection). An appropriate test would be to determine whether relief from poverty is associated with reduced levels of problem behaviors and psychiatric symptoms. Such a finding would seem to favor the social causation perspective. The obvious interpretive problem for non-experimental studies, however, is that upward economic mobility typically is not random and may reflect positive characteristics of parents that also promote social adjustment by children. Costello et al. minimized the problem by examining the relation between subsequent child behavior and an essentially random environmental event that was independent of parent and person variables. The event was a casino opening on an Indian reservation that moved 14% of study families out of poverty, with 53% remaining poor. The results were that the conduct problems of ex-poor children fell significantly after the casino opened relative to the always-poor children, and to the levels of the never-poor children.
98
Brian P. Ackerman and Eleanor D. Brown
4. Mediators and Moderators of Effects A multivariate approach to understanding the effects of growing up poor encourages examination of the interrelations among variables, and the search for mediators and moderators of effects. A mediator describes how an effect occurs, or the processes by which one variable may influence another. A moderator is a variable that affects the strength of the relations between two variables. This search is critical for understanding the proximal mechanisms that translate more distal variables like family income into child outcomes, and the conditions under which an environmental factor is risky or not (i.e., interactions rather than main effects). If we are to advance beyond a deficiency model, this search is critical for describing processes underlying outcomes, and not just outcomes per se. A leading edge of poverty research in the last few years, then, has concerned this kind of focus on the ‘‘how’’ of effects. The selectivity of the effects for income poverty and some poverty co-factors (Duncan & Brooks-Gunn, 1997b, 2000) provides an excellent example of a mediational model. Why is it that income poverty usually relates robustly to child problem behaviors at a zero-order level, but not in the context of family variables? Proponents of family stress models argue persuasively that income effects may be mediated by a variety of family factors, including marital conflict, maternal mood, and harsh and inconsistent parenting most proximally and powerfully (Conger et al., 2002; Mistry et al., 2002). In contrast, these particular family factors do not seem to reduce the relation between income poverty and markers of children’s intellectual ability and academic achievement; that is, direct effects for income poverty remain in the context of these factors. Instead, what seems to mediate income effects to cognitive outcomes is the quality of the home learning environment, primarily in preschool (Britto, Fuligni, & Brooks-Gunn, 2002; Duncan & Brooks-Gunn, 2000; Hart & Risley, 1995), which varies somewhat independently of these other family factors. Mediational models focusing on family processes fit well for families suffering from both acute and chronic economic stress, and both European American and African American families. In this specific regard, the demographic factors of social status and race/ethnicity do not seem to interact with, or moderate, relations between income, family variables, and child outcomes. Nonetheless, interactions are common for many poverty and status variables, which urges caution in interpreting main effects. Some empirical findings, for instance, suggest harsh parenting relates differently to problem behaviors for boys and girls, for black and white children, and for disadvantaged and more advantaged children (Deater-Deckard & Dodge, 1997; McLoyd & Smith, 2002), depending on factors like the sex and emotional warmth of the parent. Similarly, changes in residential partners in disadvantaged families relate more strongly to the problem behaviors of boys than girls (Ackerman et al., 2001, 2002), perhaps due to the instability of male parent figures.
Poverty Co-factors and the Adjustment of Children in Elementary School
99
B. RESEARCH PROGRAM
These conceptual issues frame and motivate our research program. We close this section with a description of the program. In the following sections, we describe and interpret many of our findings. Our research program examines the longitudinal relations between economic disadvantage, family and parent variables, and the social and academic development of children in middle childhood. The program is part of a larger project, co-directed by Carroll Izard, that includes additional measures related to children’s emotional development. We do not discuss this emotional development research further because it does not treat poverty issues except for the use of family income as a risk marker. Mothers were the primary sources of information about family and parent functioning, and parent, direct observation, and especially teachers were the sources of information about child functioning. The observations included standardized measures of intellectual competence, including the Peabody Picture Vocabulary Test-Revised (Dunn & Dunn, 1981) in the preschool assessment and the Vocabulary and Pattern subtests of the Stanford–Binet Intelligence scale (4th ed; Thorndike, Hagen, & Sattler, 1986) in the first grade assessment, and a measure of child behavioral undercontrol culled from videotapes of the assessment sessions on tasks tapping motor control, compliance, and attention. Teachers’ reports about school adjustment included the Teacher Report Form of the Child Behavior Checklist (Achenbach, 1991), which has standardized subscales representing externalizing behavior (i.e., aggressive and delinquent behaviors) and internalizing behavior (withdrawn behavior, anxious/depressed behavior, and somatic complaints), and the academic competence subscale of the Social Skills Rating System (Gresham & Elliott, 1990), which also is standardized. We also obtained self-reports from the children about child and family functioning, but most of these started to become useful and interesting only in the fifth grade assessment. We recruited the sample from all the Head Start Centers in northern Delaware, across a range of urban, suburban, and semi-rural settings. In this respect, our sample differs from the typical poverty-based sample drawn from neighborhoods featuring concentrated poverty in large cities. We assessed family, parent, and child functioning in preschool (mean age 5 years, 0 months), and every two years thereafter, in grades 1, 3, 5, 7, and 9. Published research concerns the first four assessments through the fifth grade, and we restrict our discussion to this research. The effective sample size for publication purposes was about 196 in the preschool assessment and 129 in the fifth grade assessment. Attrition is significant, but seems to be random to the extent that none of our many comparisons of participants who left, or stayed, or returned has yielded significant interpretable results. The effective samples at each assessment have
100
Brian P. Ackerman and Eleanor D. Brown
consisted of approximately equal numbers of girls and boys and dramatically unequal numbers of African American children (70 – 74%). Our direct assessments of child functioning took place primarily in the school settings. Parent participation took place in school, community, and laboratory settings.
III. Poverty Co-factors A core challenge in our research program has been to create an articulated and rich view of the environmental risks children encounter with some frequency in economically disadvantaged families. We have focused on proximal aspects of the family environment, such as residence and maternal relationship changes, parental anti-social behavior (i.e., police contacts and substance abuse), psychiatric morbidity, and maternal education. We also have coded and examined many other aspects of family circumstances, including family size, maternal age, birth order, experiences with non-maternal caregivers, welfare status, employment status, and other negative life events (e.g., maternal physical morbidity, employment instability). For our sample, these variables either do not relate significantly to any aspect of child adjustment in school, or the weak relations are explained by other variables. In the subsections that follow, we describe our empirical findings for these proximal contextual variables, explain issues in representing environmental adversity, and discuss recent ideas about conceptualizing contextual influences. We chose this focus for at least five good reasons. First, these aspects co-vary with income poverty. The co-variation defines the term ‘‘poverty co-factors.’’ Second, the co-factors frame family processes that we also examine, including maternal emotionality and parent–child interactions. Third, the effects of more distal variables such as characteristics of the residential community usually are mediated partially or entirely through more proximal family variables (Leventhal & Brooks-Gunn, 2002). Fourth, the proximal focus increases the likelihood that children may actually have experienced ‘‘risky’’ circumstances in their daily lives. This point addresses the common criticism that family or community experiences are termed ‘‘risky’’ without knowing whether the child had the experience. Finally, a focus on these contextual co-factors is empirically and theoretically progressive. The empirical issue is simply that developmental poverty researchers have mostly ignored contextual factors other than marital status and maternal education, and marital status per se has not been framed in a way that is appropriate for the poverty ecology, as we argue later. Thus, it has not been clear whether these contextual aspects of environmental adversity affect child adjustment, or how they relate to income poverty per se or to process variables such as harsh parenting and maternal mood. Similarly, one additional
Poverty Co-factors and the Adjustment of Children in Elementary School
101
advantage of focusing on contextual co-factors is that interpretation of the direction of the effects to child outcomes is relatively straightforward. Contextual co-factors are reasonably independent of the child in this regard, unlike the bi-directional effects associated with process variables. High levels of externalizing behaviors by 6- or 8-year old children in school, for instance, seem likely to have a powerful influence on parenting practices, but perhaps not on residential moves or maternal terminations of intimate partner relationships. Theoretical issues concern the role of contextual co-factors as third variables in explaining income relations to child outcomes, and whether the relations for contextual co-factors are best explained by extant family stress models (Conger et al., 2002) or require some other unexamined mechanism. Most important, an articulated description of family adversity over time is required for addressing the social causation vs. social selection debate in framing the reproduction of poverty. Sameroff and his colleagues (Gutman, Sameroff, & Eccles, 2002; Sameroff, Gutman, & Peck, 2003) argue, for example, that the relative influence of environmental adversity and endogeneous person (i.e., child) characteristics on child outcomes varies strongly for disadvantaged vs. advantaged families, with person variables relatively unimportant for poor children. The claim is debatable, but joining the debate, at the least, requires a robust description of environmental adversity over time.
A. CONTEXTUAL CO-FACTORS
Our research has focused on two central issues, which we discuss in this subsection. These issues concern the ecological relevance of contextual co-factors and effects that distinguish co-factors from other poverty variables. The effects include selective effects for income and contextual co-factors, direct effects vs. indirect effects, and concurrent effects. We conclude this subsection with a brief discussion of remaining issues. 1. Ecological Relevance For poverty researchers, a good reason to restrict a sample to disadvantaged families is the opportunity to examine variables that have special relevance for the ecology of disadvantage. In our sample, the negative life events that constitute our contextual co-factors happen infrequently but often enough to constitute substantial risk for destabilizing family functioning and hence for child adaptation. From demographic questionnaires and life events surveys, for instance, we found in one study that the percentages of families showing one or more residential moves were 30, 49, and 47 for the two-year intervals ending in the first, third, and fifth grade assessments, respectively, with an average of about 40% of the caregivers reporting more than one move
102
Brian P. Ackerman and Eleanor D. Brown
(Ackerman, Brown, & Izard, 2004a). The percentages of mothers reporting one or more termination of an intimate relationship with a residential partner, respectively were 41, 31, and 44, with an average of about 15% reporting two terminations. Similar percentages occurred for adult police contacts indexing anti-social behavior. Adult substance abuse and psychiatric morbidity were less frequent, with the percentages about half of those for other contextual co-factors. We also found that these kinds of events seemed to occur episodically and mostly independently. The inter-assessment correlations for each kind of event, for instance, usually were 50.20, as were the inter-event correlations for each assessment. These statistics suggest the importance of describing risk factors in a way suited to the ecology of disadvantage. We focused on instability in maternal intimate relationships in two studies (Ackerman et al., 2001, 2002) specifically, because the characterization of the issue in the literature did not seem to adequately describe the experiences of disadvantaged families. In particular, the dominant frame for understanding the family structure in general (e.g., for middle income families) has concerned marital status, with single parenthood defined as risky for children. The problem for children in single parent families, the argument goes, is the absence of qualities that a biological father or a stepfather brings to a family, including economic resources most centrally, but also paternal supervision of children, an adult male role model, and the like. The core problems with this cultural frame from a poverty perspective are that (a) impoverished fathers do not bring robust economic resources to a family, (b) ‘‘single’’ mothers often are not single, given that cohabitation is frequent, if not normative, (c) family cohabitation relations are ‘‘fragile’’ and change frequently, which means that family structure may change frequently over the course of several years. and, (d) the presence of a biological father in a maritally intact family may not be normative for disadvantaged families, which calls into question the entire conceptual frame. The percentage of such ‘‘intact’’ families was 19.2 for our sample in the preschool assessment, for instance, and averaged 12.3 in the succeeding assessments. Step-parents filled the void to some extent (7.9% in preschool and an average of 17.0% thereafter). These problems suggest an alternative model that (a) separates cohabitation from single parent status in redefining family structure and that (b) focuses on variations in relationship instability as a risk variable. We define relationship instability as the number of terminations of maternal intimate relationships with a residential male partner from the child’s birth. The variation was considerable: about 14% of our sample of mothers experienced a termination of at least one relationship with a residential male in each of the three elementary school assessments, for instance, while another 11% experienced no instability at all. The importance of this variation as risky is documented in other studies as well showing associations between relationship transitions and child outcomes
Poverty Co-factors and the Adjustment of Children in Elementary School
103
(Dunn, et al., 1998; Kurdek, Fine, & Sinclair, 1995). Serial maternal relationships pose risk for children in a variety of ways, including exposure to family conflict (Forgatch & DeGarmo, 1999), residential and financial instability (Anderson et al., 1999), and coercive and harsh discipline practices (Patterson, 1999). Ackerman et al. (2001) tested this alternative model in examining the effects of both the redefined family structure variable and relationship instability in predicting the externalizing behaviors of first graders in school. Both variables showed significant unique effects in the context of variables representing family earned income and parent maladjustment. Concurrent co-habitation was associated with the worst outcomes for the children, and especially for boys, and problem behaviors varied strongly and independently with the number of relationship terminations over time. We argued that the variation for gender reflected the ambiguity of father presence for co-habiting male partners in supervising and providing a role model for young boys. Ackerman et al. (2002) extended these findings in examining the association between chronic relationship instability and a variety of third grade outcomes, including externalizing behavior, internalizing behavior, and academic competence. Table I shows the results for the problem behaviors, with the number of terminations 4 categorized as level 4. The results showed significant differences among the levels of relationship instability for both kinds of problem behaviors, but not for academic competence. The Tukey tests showed that level 0 differed from levels 3 and 4 for externalizing behavior and level 0 differed from level 3 for internalizing behavior. In addition, about a fourth of the families at level 0 were single mothers who never married or cohabited. The externalizing scores of the children of these mothers did not differ from those of the children of maritally intact families. We make more of this study later, but for now the results show the importance of describing risk variables in a way that is appropriate for the ecology of disadvantage. In particular, the environmental
TABLE I Numbers of Families, Means, and Standard Deviations for Each Level of Chronic Relationship Instability for Externalizing Behavior and Internalizing Behavior Externalizing behavior
Internalizing behavior
Level
n
M
SD
n
M
SD
0 1 2 3 4
47 35 26 15 16
9.82 16.00 13.58 25.33 25.88
10.44 16.29 14.02 19.37 20.66
47 35 26 15 16
5.55 7.91 6.76 11.86 9.44
5.36 8.02 7.04 9.36 7.72
104
Brian P. Ackerman and Eleanor D. Brown
press works against stable marriages for disadvantaged families, and true single parent status may be adaptive rather than a problem for the children in such families (Elder et al., 1995), given the scarcity of marriageable males and good employment opportunities, and the fragility of partner relationships. 2. Distinctive Effects The results of Ackerman et al. (2002) also are distinctive in several other ways, concerning selectivity of the effects for child outcomes for income-to-needs ratios and contextual co-factors, and the direct and concurrent nature of the effects for the co-factors. These kinds of effects replicate throughout our research program, regardless of the question at issue. Our selectivity hypothesis is based on the evidence marshaled by Duncan and Brooks-Gunn (1997a, 2000) in examining the third variable problem for understanding the relations between income poverty and child outcomes. Given that the ecology of disadvantage is risky in many ways for children, we have taken this problem seriously in all of our work. Accordingly, we have consistently co-varied representations of family income, child variables (i.e., hyperactivity/inattention and intellectual ability), and multiple contextual factors in attempting to predict specific indices of child adjustment in elementary school. The logic requires that we examine zero-order correlations between predictors and child outcomes, of course, but also that we focus on unique effects (i.e., squared ‘‘semi-partial’’ or ‘‘part’’ correlations in regressions) in multivariate analyses in interpreting predictive relations. Our findings in several studies fit well with the hypothesis about selective effects for income and poverty co-factors. Income-to-needs ratios, for instance, occasionally correlate weakly but significantly with children’s externalizing behavior in school, but significant relations disappear in the context of contextual co-factors (Ackerman, Brown, & Izard, 2004a). In contrast, incometo-needs ratios consistently and uniquely predict academic competence in the context of other variables. Child verbal ability and maternal education are other unique predictors of academic competence that typically share variance with income ratios. Among the contextual co-factors, maternal relationship instability and parent–police contacts are the most reliable and powerful predictors of child externalizing behavior in the context of income ratios and child variables, but relate weakly and inconsistently to academic competence. Representations of child behavioral hyperactivity/inattention tend to relate uniquely but weakly to both externalizing behavior and academic outcomes. However, this hypothesis is not without issue. An important article by Costello et al. (2003) discussed earlier, for instance, poses a strong challenge in showing relations between income increases for disadvantaged families and reductions in child externalizing behaviors. This study examines the effects of new sources of casino revenue on family and child functioning on an
Poverty Co-factors and the Adjustment of Children in Elementary School
105
American Indian reservation, with the beneficial effects on child behavior attributed primarily to increased parental supervision. Our perspective is that these kinds of findings fit well with the family stress model described next, and that significant effects that remain between income and child externalizing behavior can be explained by unexamined third variables (i.e., co-factors). Family stress models (Conger et al., 2002; Mistry et al., 2002) provide a theoretical frame for our consistent findings of both direct and concurrent effects for contextual co-factors and child problem behaviors. The typical findings of the models are that proximal process variables centering on harsh and disrupted parenting practices mediate relations between child problem behaviors and more distal variables, such as income stress, marital discord, and maternal distress (i.e., depressive symptomatology and lack of efficacy). Negative parenting practices in this sense represent the mechanism communicating family stress to the child. Accordingly, the models predict that our direct and unique effects relating contextual co-factors and child externalizing behavior should be dramatically reduced in the context of a harsh parenting variable. In all of our recent studies, we have indeed found consistent and unique relations between a measure of harsh parenting and child externalizing behavior in school (Ackerman et al., 2002; Ackerman, Brown, & Izard, 2003, 2004a,b), which is evidence for the validity of the measure. We have even found that harsh parenting functions as a mediator for certain variables in specific circumstances (Ackerman et al., 2002). In general, however, our typical findings show unique and direct relations between child externalizing behavior and both maternal relationship instability and parent–police contacts, with harsh parenting controlled. These direct effects make sense in that the contextual variables threaten family stability, disrupt child relations with parent figures and child agendas, and expose children to partner conflict and poor role models. The theoretical importance concerns the need to augment family stress models in conceptualizing relations at a family level between the ecology of disadvantage and child adjustment. In contrast, our findings are quite consistent with another aspect of family stress models, which concerns the focus on and privileging of concurrent effects between income stress, family functioning, and child behavior. The argument of stress models is that income stress degrades family and personal functioning concurrently, with the assumption that more normative family patterns will resume if perceived income loss or problems are alleviated, though this assumption is tested rarely. The model applies to acute or chronic income stress, except that chronic income stress is unlikely to change, by definition. A test requires a longitudinal design with growth curve modeling or a change model assessing the residual relation between recent or concurrent stressors and child behavior at Time 2 with Time 1 behavior controlled. In the case of our chronically disadvantaged sample, stressors other than income often change
106
Brian P. Ackerman and Eleanor D. Brown
substantially over time and seem to relate concurrently to child outcomes. Income changes also occur, but they rarely lift a family out of disadvantaged status. We argued earlier that contextual risk factors seem to occur episodically in that the frequency of occurrence on average in our sample was fairly low and was independent across assessment and between factors. Ackerman et al. (2004a) have shown, in addition, that even aggregate representations of risk factors (e.g., multiple risk indexes) correlate weakly across assessments. These findings argue for strong concurrent relations between contextual risks and child problem behaviors in school for disadvantaged families. Ackerman, Brown, and Izard (2004a) provided several kinds of evidence for this argument. First, the average concurrent correlation between multiple risk indexes of family adversity and child externalizing at a zero-order level across the first grade, third grade, and fifth grade assessments was 0.35 (N ¼ 117), p 5 0.01, whereas the average forward correlation across grade was 0.06. The indexes represent recent contextual risks since the previous assessment. Second, and most critically, the risk indexes showed unique concurrent effects in each assessment with controls for child externalizing behavior in the previous assessment, that is; the indexes explain behavioral change across time. Table II shows these effects, with sR2 (part correlation coefficient squared) representing the unique predictive relation, and child behavior in the previous assessment labeled prior behavior. Third, adding the risk index for the previous assessment to a regression model (i.e., third grade) had no effect on the relation between the index for recent events and current externalizing behavior (in fifth grade).
TABLE II Summary of the Effects in the Regressions for Externalizing Behavior for the First-Grade, Third-Grade, and Fifth-Grade Assessments First-grade Variable Controls Cognitive ability Maternal education Family Prior behavior Income-to-needs ratio Contextual risk index Gender Harsh parenting *p 5 0.05. **p 5 0.01.
0.10 0.09 0.42 0.13 0.30 0.17 —
sR2
Third-grade
Fifth-grade
sR2
0.01 0.01
0.28 0.08
0.07** 0.01
0.06 0.11
0.17** 0.02 0.08** 0.03 —
0.32 0.11 0.17 0.01 0.20
0.09** 0.01 0.03* 0.00 0.04**
0.44 0.02 0.17 0.06 0.18
sR2
0.00 0.01 0.16** 0.01 0.03* 0.00 0.03*
Poverty Co-factors and the Adjustment of Children in Elementary School
107
Ackerman et al. (2002) provided a particular interesting illustration of concurrent effects in separating variance explained by past relationship instability (to first grade) and recent relationship instability (first to third grade). These two variables interacted in predicting third grade externalizing behavior. Further analyses indicated that past instability continued to predict third grade externalizing behavior for children from recently stable families, mostly due to continuity in problem behaviors from first grade. Most important, current instability was associated with a dramatic difference in externalizing behavior in third grade for those children from previously stable families in first grade. The mean externalizing scores were 8.11 (SD ¼ 10.44, n ¼ 47) for currently stable families and 25.50 (SD ¼ 17.17, n ¼ 12) for currently unstable families. Clearly, a recent termination of a partner relation related to concurrent child adjustment in school. 3. Remaining Issues We have provided evidence for selective effects for family income and contextual co-factors in several studies. This evidence has extended our empirical and theoretical bases for understanding the relations between the ecology of disadvantage and child adjustment in school. Several salient issues remain in interpreting these effects. First, we do not have a firm handle on child internalizing behavior. The most reliable unique predictor is parent psychiatric morbidity, with income-to-needs ratios showing consistent zero-order correlations but inconsistent unique effects. Second, we do not have a robust model for explaining the effects or for understanding the mechanisms of effects. Stress models are succinct and powerful in these regards, and we have devoted much thought to characterizing our contextual variables as stressors. No doubt they are, to some extent at least. The difficulties are that individual variables seem to work in different ways and seem to require different mechanisms, and stress reactivity does not seem to exhaust the ways in which they may relate to children. The termination of a maternal relation with a male partner undoubtedly is stressful for all family members, for instance, but it also represents loss for a male child that does not seem to reduce conceptually to stress. Third, we have focused on contextual co-factors involving aspects of family instability, and it is likely that we are missing critical aspects of family circumstances that relate uniquely to child functioning. A good example concerns chronically stressful aspects of the physical environment, like noise, housing disrepair and dilapidation, environmental degradation (e.g., pollution), and adequacy of resources (e.g., number of rooms). Evans (2003, 2004) represents these and other aspects as imposing ‘‘allostatic load’’ on family members, which concerns the physiological demands on the body imposed by the mobilization of resources to meet environmental circumstances that are chronically
108
Brian P. Ackerman and Eleanor D. Brown
challenging and changing. These demands are adaptive and appropriate in meeting acute environmental stressors, but wear on the body in chronic situations (Marmot, 2004; Wilkinson, 2005). Emerging evidence shows that allostatic load relates directly to the physical health of family members in both the short term and long term and to behavioral functioning. Fourth, the implicit assumption of our poverty research is that profound environmental adversity has powerful effects on child adjustment that are independent of child effects and social selection effects. To this extent, we started from the perspective of a social causation model, like many other researchers. Sameroff, Gutman, and Peck (2003) argue, for example, that the contributions of environmental adversity overwhelm and minimize child effects for disadvantaged children, with child factors much more powerful for more advantaged children. The problems with our assumption are that (a) we consistently find unique effects for child factors such as intelligence and hyperactivity/inattention even in the context of robust representations of environmental adversity, (b) our designs are correlational, and (c) we cannot fully eliminate third variable/social selection explanations of our environmental effects, such as genetic explanations. We should say, however, that change models in which we control for prior child behavior in explaining current behavior go some way towards this end. B. REPRESENTATION
One of the reasons Sameroff and his colleagues (1997) have been forceful advocates for a social causation hypothesis about the academic and behavioral difficulties of disadvantaged children is that they helped advance a powerful way to represent environmental adversity. An empirical problem for determining effects for income poverty and poverty co-factors is that many aspects of environmental adversity tend to interrelate and to share variance in predicting child outcomes. A result is that individual variables usually have weak unique effects, at best. Thus, the variance explained by a block of such variables in this regard often exceeds the total variance associated with adding effects for individual factors. A second empirical problem is that such multivariate approaches lack power for small-to-moderate-sized samples. The theoretical problems seem just as troublesome. First, single variables do poorly in explaining serious adjustment problems (Rutter, 1975a; Seifer et al., 1992), which argues for a multivariate approach. Second, a good case can be made that children experience their environments as a whole, rather than variable by variable. A good example is the repeated finding that the relation between strict/harsh discipline and child externalizing behavior often varies with the emotional context of the discipline and the emotional climate of the family (McLoyd & Smith, 2002). More generally, adverse processes that cut
Poverty Co-factors and the Adjustment of Children in Elementary School
109
across family variables have the most powerful effects on child functioning (Cummings, Davies, & Campbell, 2000). An adequate representation of children’s experiences, then, seems to require a representation of environmental risk as a whole. Third, Sameroff and his colleagues (1997) argue that the amount of risk in the environment is more important for child functioning than the quality of the risk. Risk variables, for instance, may potentiate each other and have synergistic effects that do not reduce to and cannot be observed in qualitative effects for individual variables. A powerful solution to these problems has been to represent environmental adversity with a multiple risk index (Burchinal et al., 2000; Evans, 2003; Sameroff, Seifer, & Bartko, 1997). Such an index converts qualitative risk represented by individual variables to a quantitative representation of adversity by adding the number of risk factors a child potentially experiences, that is; the index cumulates adversity. The threshold for inclusion of a risk indicator is presence of risk on a categorical variable (e.g., presence/absence of parent police contacts, substance abuse, or single parent status, etc.) and scores above some predetermined cut-off on a continuous variable (i.e., the mean or one SD above the mean). The effects are powerful. In a seminal study, Rutter and his colleagues (1975a,b), found that a family adversity index of two or more factors was associated with a two- to fourfold increase in child behavior problems. In the Rochester Longitudinal Study, Seifer et al. (1992) found that a multiple risk index representing 10 aspects of family adversity explained between 25 and 50% of the variance in children’s social and intellectual competence. Many recent studies have found similar results (Deater-Deckard et al., 1998; Gerard & Buehler, 2004a,b). The disadvantages of multiple risk indexes are also well known. One concerns our previous discussion about selective effects for income poverty and contextual co-factors. These qualitative effects may be obscured in an index focusing on the quantitative relation between amount of ecological risk and child outcomes. The second problem is that all variables are weighted equally in predicting child outcomes, which is inconsistent with many findings of the differential importance of specific aspects of family functioning. The third problem is that a single index obscures possible interrelations among variables, including mediator and moderator effects. Consequently, multiple risk indexes are useful for predicting adverse child outcomes, but not particularly useful for understanding the mechanisms underlying the outcomes. We have extended the index approach in several ways in light of these and other problems. One way has been to separate potential effects for income poverty, process variables such as harsh parenting and maternal mood, and index representations of contextual co-factors. In Ackerman et al. (1999), we found that an index of 11 contextual co-factors predicted child (total) problem behavior scores independently of child variables and proximal emotion
110
Brian P. Ackerman and Eleanor D. Brown
variables, though sR2 (0.07) was considerably less for the single index than for the block score representing the additive factors (0.13). We also found that serious levels of problem behaviors (i.e., in the clinical range) vary with the amount of risk, to some threshold at least, rather than with individual variables. Table III illustrates the finding. Families with one risk factor did not have any children with clinical levels of problem behaviors. In contrast, the percentages of such children ranged from 15 to 23 for families with 2–4 risk factors, and from 32 to 56 for families with five or more risk factors. Finally, we found that the risk index interacted with variables representing parent negative emotionality. The relation between the index and child problem behaviors was strongest for lower levels of negative emotionality. This finding illustrates the importance of separating process and contextual variables and of examining potential moderator effects. The second method has been to examine cumulative representations at various time points in relation to changes in child problem behaviors in school. Almost all studies using multiple risk indexes to represent environmental adversity have examined relations to child behavior at one point in time, or used an index at Time 1 to predict behavior at Time 2, without regard for the concurrent relations for an index at Time 2. The ironic aspect of these kinds of designs is that a multiple risk index is especially well-suited to represent multiple aspects of environmental adversity sampled over multiple times because power considerations make it difficult to repeat the iterations of additive factors at Times 1, 2, and 3 for most samples. Ackerman, Brown, and Izard (2004a) tested these relations to child externalizing behavior with an index representing six contextual co-factors at each elementary school assessment. Table IV shows that the externalizing means TABLE III Numbers of Risk Factors and Families, Mean Total Problem Scores, Standard Deviations, and Percentages of Children in the Clinical Range Clinical Risk factors 0 1 2 3 4 5 6 7þ Sample
N
Mean
SD
%
8 14 35 26 21 28 16 7 155
21.1 16.4 36.1 30.2 37.4 48.2 59.4 54.7 38.1
22.9 16.9 24.3 23.1 30.2 27.7 30.6 27.9 27.9
0 0 22.9 15.4 19.0 32.1 56.3 42.9 23.9
111
Poverty Co-factors and the Adjustment of Children in Elementary School
TABLE IV Numbers of Risk Factors and Families, Externalizing Means, and Standard Deviations for the Longitudinal Assessments First-grade
Third-grade
Fifth-grade
Risk factors
N
M
SD
N
M
SD
N
M
SD
0 1 2 3 4 5þ
16 30 26 22 11 12
8.06 11.44 9.92 17.73 21.64 23.83
8.98 11.32 10.88 14.08 16.57 10.47
26 34 27 22 8 0
8.73 11.88 14.04 28.05 28.63
10.63 12.83 16.01 20.16 17.15
17 29 30 15 17 9
8.59 13.96 16.44 20.60 19.88 32.13
10.42 11.38 12.74 14.72 14.10 16.92
varied with index level (number of risk factors) up to level 3 for each grade. The patterns show that the cumulative amount of concurrent risk to some threshold predicts levels of children’s externalizing behavior throughout elementary school. Most importantly, however, the unique effects for each concurrent index remained significant and robust when we co-varied both prior behavior from the previous assessment, the cumulative index for that assessment, and controls for child factors, income, and harsh parenting. This co-variation uniquely related change in family circumstances represented by the risk index to changes in child behavior in school. We also tried other ways of testing such relations at an individual variable level, but the cost in subject-to-variable ratios tended to be unacceptable. Despite the limitations, cumulative indexes seem especially useful in describing the relations over time between environments containing multiple risks and child outcomes. Another method has been to try and find a middle ground between quantitative and qualitative approaches to describing poverty risks. We have found that some variables typically included in multiple risk indexes simply do not predict for our disadvantaged sample, including family size, adult employment, and the like. Other variables, in contrast, tend to show selective relations to externalizing behavior or internalizing behavior in school or academic competence, as we have shown. Capturing these selective relations may require some approach that has both quantitative and qualitative features. Our solution in Ackerman et al. (1999) was to construct separate multiple risk indexes representing variables that seemed to cluster together theoretically. The resulting indexes described family structure, family instability, and parent maladjustment. We found no effects for the family structure index for any outcome, a significant relation between the instability index and first grade anxious/depressed behaviors, and a significant relation between the parent maladjustment index and first grade aggressive behaviors. These findings are a good illustration of the selectivity and unequal weight problems described previously.
112
Brian P. Ackerman and Eleanor D. Brown
C. THEORETICAL ADVANCES
Our attempts to provide a robust and articulated description of risks associated with the ecology of disadvantage can be extended in several ways. One way is that we need to pay more attention to neighborhood variables and to peer processes. We have underdescribed both, mainly because our children have been too young through most of our project to experience community life directly or to participate substantially in a delinquent peer group. One consequence is that we have not successfully met Bronfenbrenner’s (1986) challenge to describe this ecology at multiple levels, including the exosystem level. We have focused on individual attributes, family-level and school-level processes, and on the relations between the two microsystems. Deater-Deckard et al. (1998), Greenberg et al. (1999) and a few others have done better in this regard. A second way is probably the most theoretically progressive for current conceptions of environmental effects. Cook et al. (2002) and Sameroff, Peck, and Eccles (2004) make a compelling argument that the most powerful and lasting environmental effects are those that occur across developmental contexts in which children and adolescents participate and across theoretically defined and distinct sets of variables. Risks associated with family, school, peer, and neighborhood variables tend to have additive and independent effects on child outcomes, and the developmental effects may differ.
IV. Dynamic Aspects of the Ecology of Disadvantage A goal of our research on contextual co-factors at the family level described in the last section has been to generate a robust description of environmental risks for disadvantaged children. We assumed that the risk was communicated primarily through risk experiences. But it seems likely that part of the risk has to do with change in experiences and the unpredictability of family life. Such changes may disrupt child and family agendas at the very least. In this section, we discuss dynamic aspects of the ecology of disadvantaged families. What we mean by dynamic aspects concerns significant changes in family circumstances over short periods of time. We expect change in circumstances to relate to change in child behavior in school. Developmental researchers have only just begun to conceptualize the effects of change on the adjustment of disadvantaged children, despite robust statistics showing that factors such as residential stability, partner stability, and employment/income stability vary with income levels and social class (Seccombe, 2000; White & Rogers, 2000). Instead a focus on continuity has dominated developmental research in at least three ways. First, much research historically can be characterized as taking a snapshot of disadvantaged families at a single point in time, which is then used to understand child adjustment problems both currently
Poverty Co-factors and the Adjustment of Children in Elementary School
113
and over time. This perspective is adequate for describing some aspects of adjustment problems for disadvantaged children, which typically are more stable than for other children and increase over time relative to other children. The achievement gap in school, for instance, describes both the relative poor academic starts of disadvantaged children at school entry and the widening gap with increasing grade. Second, Sameroff, Seifer, and Bartko (1997) and others (Duncan & BrooksGunn, 2000) argue for substantial continuity in environmental adversity over long periods of time for disadvantaged children, and that such environmental continuity may exceed continuity in person variables (e.g., IQ) and perhaps do better in explaining long-term outcomes. This perspective is salutary in refocusing attention on environmental variables (i.e., social causation) rather than person variables (i.e., social selection) in understanding disadvantage. Third, McLoyd (1998), Duncan and Brooks-Gunn (2000), and others argue that poverty that persists over time does more damage to children than acute disadvantage at one time. Like multiple risk indexes, this persistence idea is a way of capturing the ‘‘dose’’ of environmental adversity. Several ideas in our previous sections, however, suggest the need for conceptualizing dynamic aspects of the ecology of disadvantage. We described evidence for co-habitation as an emerging way of life for many disadvantaged families, for instance, and that such partner arrangements are ‘‘fragile’’ and pose a substantial threat to family stability. We also found that contextual co-factor experiences were episodic and had strong concurrent effects on child behavioral adjustment, which argues for discontinuity and change in child adjustment difficulties. In this regard, we discussed change models as a way of exploring such effects. Similarly, our data suggest that total family income varied substantially across assessment periods for many of our families, due in part to the unstable nature of employment for workers without a high school diploma or much in the way of marketable skills. Finally, the argument for selective effects for income poverty and poverty co-factors in relation to child outcomes suggests outcomes pull differentially for continuity and change. Academic outcomes may show more continuity than conduct problems because they are more dependent on early and incremental skill acquisition. Increases in the achievement gap to this extent need not be accompanied necessarily by increases in a conduct gap. We explore these themes in this section by describing our research on family instability and persistence of environmental adversity. A. FAMILY INSTABILITY
Family instability from our perspective describes aspects of the family ecology that tend to vary over short periods of time for economically disadvantaged
114
Brian P. Ackerman and Eleanor D. Brown
families, that seem to represent and disrupt core aspects of family functioning, and which may affect child agendas and adjustment. We have operationalized family instability as an aggregate of several contextual co-factors. In both Ackerman et al. (1999) and Ackerman, Brown, and Izard (2003) studies, the grade school aggregate included residence and maternal relationship changes over the two-year assessment interval and other negative life events. We represented these factors with a multiple risk index ranging from 0 to 3. Ackerman et al. (1999) also employed an index representing preschool instability that included indicators like separations from parents (e.g., foster care) and child illnesses. We motivated the choice of variables both theoretically and empirically. The theoretical motivation was that all the variables represent a substantial change in family circumstances that seems likely to destabilize family functioning. The empirical motivation is that we found evidence in the literature linking the variables to child outcomes. For example, Eckenrode et al. (1995) found relations between residential mobility and child academic performance, Stoneman et al. (1999) found relations between residential moves and sibling relations, and Adam and Chase-Lansdale (2002) found relations between residential moves, separations from parent figures, and adolescent adjustment. In Ackerman, Brown, and Izard (2003), we also examined income-to-needs ratios associated with behavioral change, both to control for potential third variable problems, and because of emerging evidence relating change in income and poverty status to behavioral change for children from disadvantaged families (Costello et al., 2003; Dearing, McCartney, & Taylor, 2001; Macmillan, McMorris, & Kruttschnitt, 2004; Pagani et al., 1999). Our findings for the family instability aggregates have consistently related change in family circumstances to change in child problem behaviors in school. In Ackerman et al. (1999), family instability uniquely predicted problem behaviors at home and in school in both preschool and first grade children, in the context of other family variables. In addition, the first grade effects occurred in the context of controls for preschool behavior. In Ackerman et al. (2003), one of our goals was to determine the relations between change in family circumstances and change in behavioral adjustment from first grade to third grade. We contrasted a group of children that differed in showing persistently high levels of externalizing behavior across grade, a group that improved substantially from first grade, a group that started well in first grade but showed high levels of externalizing behaviors in third grade, and a control group of children who were relatively unproblematic in both grades. We found that family instability distinguished the groups at each grade, and that change in family instability across grade was associated uniquely with change in behavior. We constructed the instability change score by subtracting first grade instability from third grade instability. The improver group in particular showed
Poverty Co-factors and the Adjustment of Children in Elementary School
115
high levels in family instability in first grade and low levels in third grade, and the new problem group showed the reverse pattern. At least three general issues remain, however. First, it would be foolish to understate the continuity in both child adjustment problems and in environmental adversity for disadvantaged families. Some aspects of adversity seem dynamic and should be conceptualized, but many aspects impose continuing risk, and even positive changes toward more favorable states (i.e., stable circumstances) often are short-lived and constrained. Employment and income gains, for instance, rarely propel families beyond the threshold defining disadvantage, and families that maintain a constant residence still tend to reside in impoverished neighborhoods. Second, mechanisms and moderators of instability effects remain to be determined. Concerning mediators, Forman and Davies (2003) argue that family instability undermines children’s emotional security and trust. Other possibilities include the effects of unpredictability on children’s feelings of efficacy and control, and the effects of having to adjust and readjust repeatedly to new circumstances, including new parent figures, new friends, a new school, and the like. Perhaps these acute stressors contribute to the ‘‘allostatic’’ load children experience (Evans, 2003, 2004) and to their ability to self-regulate. Concerning moderators, Ackerman et al. (1999) found evidence for an interaction of child temperamental adaptability and family instability, such that the least adaptable children were most affected. This finding points out the need to test the universality of the effects of family instability and of dynamic aspects of family adversity on child adjustment. Third, along the same lines, the effects are not likely to be the same for all child outcomes. In several recent studies, for instance, we have found substantial changes in levels of problem behaviors in school for fairly substantial numbers of children that relate to changes in family circumstances. The same is not true for academic competence. Stability across grade is much stronger for academic competence than for problem behaviors, and our ability to predict the changes that do occur in academic competence is much weaker. We explain these patterns by arguing that the influence of past adjustment on current success is greater for academic competence than for problem behaviors. Thus, poor reading skills in first and second grade are likely to constrain both reading and other sorts of skill acquisitions in later grades. The theoretical point is that dynamic aspects of environmental adversity for disadvantaged children seem likely to relate differently to different child outcomes. B. PERSISTENCE OF ENVIRONMENTAL ADVERSITY
The persistence issue for developmental researchers concerns the relation between the duration or chronicity of adversity over time and child and
116
Brian P. Ackerman and Eleanor D. Brown
adolescent outcomes. The strong assumption for poverty researchers has been that poor outcomes vary with the dose of poverty, with dose defined as the number of consecutive years a family has been economically disadvantaged. The word ‘‘consecutive’’ is critical in this definition because it distinguishes persistent from intermittent poverty, which then forms a contrast condition for testing persistence effects. We emphasize the word ‘‘assumption’’ here because few studies have examined persistence effects and the evidence is weak and qualified by design and third variable problems, and strong evidence has emerged recently for powerful effects of intermittency and environmental volatility over time (Ackerman, Brown, & Izard, 2004a; Dearing, McCartney, & Taylor, 2001; McLoyd & Smith, 2002; Pagani et al., 1999; Yeung, Linver, & Brooks-Gunn, 2002), some of which we developed in the last subsection. Nonetheless, most empirical and theoretical assessments of poverty effects stress the importance of persistence (Bolger et al., 1995; Duncan & Brooks-Gunn, 2000; McLeod & Nonnemaker, 2000; McLoyd, 1998; Schoon et al., 2002). The persistence issue is interesting theoretically because it taps the core developmental and poverty themes. One theme we have belabored is the need to examine development and change over time with longitudinal research. A focus on persistence in this sense is the antidote to a snapshot perspective of poverty effects. A second theme involves the weight placed on the timing of environmental risk, which seems unavoidably connected with concepts of duration and chronicity of the experiences. Much developmental theory, for instance, emphasizes the primacy of early experiences both for cognitive growth and socio-emotional interpersonal bonds (i.e., attachment). Family stress models, in contrast, seem to argue for concurrent effects (Conger et al., 2002), as do most socialization and social learning models (Reid, Patterson, & Snyder, 2002). The argument is that child adjustment reflects current parent–child relationships and processes, and these in turn reflect current family circumstances (Campbell et al., 1996; Lewis, Feiring, & Rosenthal, 2000). Although both perspectives highlight timing, neither privileges residual effects for persistence over time. A third theme concerns the selectivity hypothesis about the unique effects of income poverty for cognitive outcomes and of poverty co-factors for problem behavior outcomes. This hypothesis reconciles the two timing ideas in the previous paragraph. Poverty persistence may matter primarily for cognitive outcomes and primarily for preschool experiences because intellectual ability/ competence (a) is mostly a product of preschool experiences (i.e., early experience) and (b) is diminished by impoverished home learning environments. Intellectual ability and academic achievement in middle childhood and adolescence, then, are strongly constrained by ability at school entry. Current problem behaviors, in contrast, are less constrained by past adaptation and are more sensitive to current circumstances.
Poverty Co-factors and the Adjustment of Children in Elementary School
117
Despite the theoretical cachet of the persistence issue, it is not surprising that the evidentiary base is so weak for risk research. The longitudinal requirements are onerous, but so is the empirical complexity of the issue. Given the requirement to do multiple assessments, questions quickly arise about how many assessments over how many years do justice to a test of persistence, and what is a reasonable assessment gap, what is the impact of timing, and, most critically, must the effects vary linearly with duration (i.e., dose)? Suppose, for instance, disadvantaged children especially do poorly in school when experiencing maternal relationship instability in the first- to third-grade years and in the third- to fifth-grade years, but there is no additional cost associated with instability in the preschool to first-grade years. We could, of course, work this issue in many ways given the great potential variation in the number and spacing of assessments. We made several passes at persistence issues in Ackerman et al. (1999) and Ackerman, Brown, and Izard (2004a), and most recently and systematically in Ackerman, Brown, and Izard (2004b). In the latter study, we examined persistence effects over the three two-year assessment intervals from preschool to first grade, first grade to third grade, and third grade to fifth grade. Fifth grade externalizing behavior, internalizing behavior, and academic competence represented the child outcomes. Income-to-needs ratios and several contextual co-factors served as our predictor variables, which we motivated with the selectivity hypothesis. We represented these predictors with interval indexes, ranging mostly from 0 to 3, which reflected the number of intervals in which a family experienced risk. We defined experience of risk as an income-to-needs ratio below the poverty line, presence or absence of a categorical contextual variable (e.g., parent–police contact), and scores above the mean for continuous contextual variables (e.g., harsh parenting). The controls were preschool assessments of child intelligence, maternal education, and problem behaviors. Three intervals (or more) pose a problem in defining persistence, as we explained previously, which we solved by distinguishing between maximum persistence and minimum persistence. Evidence for maximum persistence was a linear dose-response effect with a child outcome worse for three intervals than for two, and worse for two intervals than for one, and so on. Evidence for minimum persistence was a finding that an outcome was worse for two consecutive intervals than for one interval, and the like. The contrast conditions were intermittent intervals, which varies the consecutive criterion while controlling for dose, and single interval doses in preschool and fifth grade, which vary timing of dose (early or concurrent) with the outcome. The results should be cautionary for advocates of persistence effects. Table V shows the strongest findings for interval effects for problem behaviors, and the means suggest effects for minimum persistence at best (i.e., two intervals), not maximum persistence (three intervals). There were no linear
118
Brian P. Ackerman and Eleanor D. Brown
TABLE V Numbers of Intervals and Families, Adjusted Means, and Standard Errors for Externalizing and Internalizing Behavior in Fifth-Grade Externalizing behavior Relationship instability
Internalizing behavior Income-toneeds
Police contact
Psychiatric morbidity
Intervals
N
M
SE
N
M
SE
N
M
SE
N
M
SE
0 1 2 3
33 37 24 16
11.54 10.67 22.02 23.84
1.83 1.71 2.08 2.83
36 41 24 9
11.82 12.88 21.10 20.52
1.93 1.68 2.17 3.26
25 24 24 37
4.55 4.74 8.26 8.21
1.16 1.12 1.14 0.93
73 24 13
4.87 8.54 12.72
0.66 1.18 1.55
dose-outcome relations. Even this evidence for minimum persistence in the table is difficult to interpret because the interval effect does not distinguish between consecutive and intermittent intervals or between single doses that occur recently (i.e., fifth-grade assessment) or in the past. We teased out these distinctions with planned comparisons. We found some weak evidence for persistence in that levels of problem behaviors were greater for two consecutive intervals of risk than two intermittent intervals of risk (i.e., first- and fifth-grade assessment) only for police contacts for externalizing behavior (the second set of columns in the table) and for psychiatric morbidity for internalizing behavior (the fourth set of columns). We found strong evidence in support of concurrency for problem behaviors in that behavior levels for two intervals exceeded those for a single recent interval only when one of the two intervals represented recent fifth-grade experiences. These findings for both two consecutive intervals and intermittent intervals also constitute strong evidence that dose matters, given that it is recent. In sum, we can argue fairly convincingly for the importance of concurrency and for the importance of the extended dose of environmental adversity over time for the problem behaviors of disadvantaged children in school, but not for persistence effects. We also found two kinds of additional evidence for the selectivity hypothesis. One kind is that the interval scores for the contextual co-factors primarily concerned problem behaviors but not academic competence in fifth grade, whereas the income-to-needs ratio predicted academic competence. A second kind is that the timing of the effects seems to distinguish income poverty and contextual co-factors. The effects for contextual co-factors, for instance, required concurrency of risk experiences in fifth grade in the context of the controls. In contrast, any interval effects for the income-to-needs ratio disappeared with the co-variation of maternal education (measured in the preschool assessment) and preschool cognitive ability. This finding implies that
Poverty Co-factors and the Adjustment of Children in Elementary School
119
preschool learning environments have privileged importance. Our study started with four- and five-year olds, so we have no direct evidence of that importance and no evidence about persistence effects in the preschool years. Nonetheless, our findings overall seem consistent with the claims of McLoyd (1998) and Duncan and Brooks-Gunn (2000) and others about the importance of persistence of income poverty in the preschool years for later academic competence. The claims arguably should not extend to problem behaviors in terms of either duration or timing of other aspects of environmental adversity.
V. Person-Centered Approaches Most developmental research on disadvantaged children has been variablecentered, meaning that the typical design has variables representing family income and other aspects of family circumstances predicting continuous variation in child outcomes. This kind of approach is limited in several related ways, including the failure of variables to distinguish individuals with clinical levels of adjustment problems (Rutter, Giller, & Hagell, 1998), the conceptual argument that these individual children are qualitatively different from children showing normal ups and downs, and non-linear effects in predictive equations (Deater-Deckard & Dodge, 1997). The non-linearity often reflects the differential weight in predictive equations of the 10 or 20% worst behaved children in the outcome distribution. An alternative person-centered approach starts with groups of individuals that are distinguished empirically through a variety of clustering procedures or distinguished theoretically through pre-defined criteria. Our sample size is not large enough for most clustering techniques, so we use pre-defined criteria in the research described in this section. Person-centered approaches are especially useful in examining developmental continuity and discontinuity in academic and behavioral problems for highly problematic children, the factors that distinguish children who seem surprisingly competent in highly adverse circumstances (i.e., resilient children), and children who show co-occurring problems in different outcome domains. In the rest of this section, we briefly describe our research on all three topics. All the three are associated with economic disadvantage.
A. CONTINUITY AND CHANGE IN EXTERNALIZING PROBLEMS
One of the most significant advances in recent delinquency research has been on developmental models that distinguish between children and adolescents who show high levels of problem behaviors that start early and persist across late childhood and adolescence or that start in late childhood or adolescence and
120
Brian P. Ackerman and Eleanor D. Brown
are limited to that period (Moffitt et al., 1996, 2002; Patterson et al., 1998). The so-called ‘‘early starters’’ are distinctive in that they typically come from economically disadvantaged families, have parents with anti-social tendencies, and show low verbal IQ (Lahey et al., 1995; Moffitt, 1993a) and perhaps other kinds of neuropsychiatric deficiencies focusing on impulse control (Moffitt, 1993b). These markers argue for a social selection view of the roots of serious and lasting behavioral maladjustment. Later starters (i.e., ‘‘adolescent-limited’’) turn out in recent studies to be a more heterogeneous and motley crew. Despite the heuristic value of the ‘‘early starter’’ concept with its focus on continuity of behavior, it turns out to be surprisingly difficult to predict which disadvantaged children with poor starts in elementary school will continue to be problematic throughout the school grades (Bennett et al., 1998). The challenge is to figure out why the problems of some children persist and others desist, and yet others seem to start anew for children who started well. Consistent with our themes in this chapter, the focal and novel challenge in sum is to understand behavioral diversity and change. In Ackerman, Brown, and Izard (2003), we isolated children with levels of externalizing behaviors in the clinical range that began in first grade and persisted or improved in third grade, or that newly began in third grade. The definition of change for the latter two groups of children was a difference of a standard deviation in standardized externalizing scores (T-scores). A fourth contrast group had relatively low levels in both grades. Our findings for the persistent group of children were consistent with those of other studies. These children showed relatively high behavioral impulsivity, low verbal ability, and experienced relatively high levels of parent maladjustment, family conflict, and harsh parenting. The important novel finding was the association of family instability with behavioral change. The improver group was able to self-right when the family environment stabilized; the new problem group also showed low verbal ability but only began to show conduct problems when the family environment became unstable. Even the persistent group showed relatively high levels of family instability in preschool, though not in first grade. The findings are interesting theoretically in arguing for environmental influences on both continuity and discontinuity in serious levels of problem behaviors. B. RESILIENCE
Coll et al. (1996) argue that a hallmark of deficiency models is a focus on negative outcomes. A complementary focus should concern resilience, which describes behavioral competence in adverse environmental circumstances (Masten, 2001; Masten et al., 1999). Our concern with understanding child diversity has motivated an approach to this issue in several studies, and it
Poverty Co-factors and the Adjustment of Children in Elementary School
121
remains prominent in our plans for future research. Certainly, the focus on improvement (i.e., desistance of problem behaviors) in Ackerman, Brown, and Izard (2003) represents one pass at resilience, at least to the extent that it concerns self-righting. In Ackerman et al. (2002), we tried to understand what distinguished children from families with chronically high levels of partner turnover who showed high or low externalizing scores. What we found was that relatively high verbal ability and relatively benign parenting seemed to protect the competent children from chronic instability. In Ackerman et al. (1999), we constructed an index of positive family variables and found that the index seemed to dilute the risk associated with multiple risk indexes. That is, the positive index seemed to promote competent functioning of high-risk children. Our problem in all of these studies has been that it is difficult to distinguish theoretically or empirically between positive attributes of the environment or child and simply the absence of risk. The children who do relatively well on outcome measures typically show relatively high verbal ability for the sample and experience relatively low environmental adversity.
C. CO-OCCURRING PROBLEMS
An emerging focus for us concerns children showing co-occurring externalizing and academic problems in school, or co-occurring externalizing and internalizing problems. We discuss the former because co-occurrence taps robust developmental issues. In particular, the challenge is to explain dramatic increases with grade in the numbers of children showing high levels of both externalizing and academic problems from a person-centered perspective, and increases in the correlations in these outcome measures from first grade to fifth grade from a variable-centered perspective. The challenge has been posed by other researchers as well (Hinshaw, 1992a,b; Maguin & Loeber, 1996), citing well-documented findings that most delinquent and disadvantaged adolescents suffer from academic failure, but poor behavior and poor academic achievement (i.e., reading) are relatively independent for disadvantaged children at school entry. What accounts for convergence of these problem areas in school? Brown, Ackerman, and Izard (2005) explored developmental convergence from both variable-centered and person-centered perspectives. From the former, the results were clear in showing substantial and significant increases in the correlations of externalizing behavior and academic competence from first grade (r ¼ 0.09) to fifth grade ( r ¼ 0.46). From a person-centered view, the critical theoretical issues concerned the mechanisms underlying convergence across time. One hypothesis is that co-occurrence reflects common or correlated third variables (Angold & Costello, 2001; Hinshaw, 1992b), such as child verbal
122
Brian P. Ackerman and Eleanor D. Brown
ability or hyperactivity/inattention. Another is that co-occurrence reflects concurrent risks drawn from specific predictors of each outcome measure. This hypothesis reflects our larger selectivity hypothesis about differential effects for income poverty and contextual co-factors. We isolated four groups of children at each grade showing the combinations of high/low externalizing scores and low/high academic competence scores, with the cut-offs for problematic children being similar z-scores representing approximately the worst 10% of the distributions. The combinations of firstand fifth-grade groups formed a matrix of 16 cells. We found that the number of children showing co-occurring problems increased from 14 to 32 across grade, with 8 of the children showing co-occurring problems at both grades. Hyperactivity/inattention did not distinguish the problem groups in fifth grade, but other variables did distinguish the groups in accordance with the selectivity hypothesis. Child verbal ability and income-to-needs ratios distinguished the co-occurring and high academic problem (only) groups, for instance, while relationship transitions and adult police contacts distinguished the co-occurring and high externalizing problem (only) groups. Most interesting was a planned comparison of children who migrated from the single problem groups in first grade to the co-occurring problem group in fifth grade vs. those who stayed in the single problem groups in both grades. Consistent with the selectivity hypothesis, the migrators from the high externalizing group in first grade selectively showed low verbal ability, while the migrators from the low academic group selectively experienced more relationship transitions between grades. The results argue that a fairly large percentage of disadvantaged children (32 of 117 or 27%) are at double jeopardy for profound academic and social problems in middle school and beyond based on both endogenous variables and specific kinds of environmental adversity.
VI. Summary and Conclusions Since 1990, there have been great advances in how developmental researchers construct poverty. These advances are important because they may help inform social policy at many levels and help frame how American culture constructs poverty for children, both symbolically and in the opportunities children and families get to escape from poverty. Historically, developmental perspectives have embodied social address and main effects models, snapshot views of poverty effects at single points in time, and a rather narrow focus on income as the symbolic marker of the ecology of disadvantage. More recent views, in contrast, emphasize the diverse circumstances of disadvantaged families and diverse outcomes of disadvantaged children, the multiple sources of risk and the multiple determinants of poor outcomes for these children,
Poverty Co-factors and the Adjustment of Children in Elementary School
123
dynamic aspects of that ecology, and change as well as continuity in outcome trajectories. The advances also consist of more powerful frames for understanding the ecology of disadvantage and the risk it poses for child outcomes. Most developmental researchers still tend to frame causal variables ultimately in terms of the dichotomy between social causation and social selection views, with a primary emphasis on the former. In part, this framing has reflected limitations of sample size and design, because the theoretical and empirical power of reciprocal selection models is clear (Kim et al., 2003). The conceptual advances that prompt such models include widespread acknowledgement of third variable problems in interpreting effects, of the clear need for multivariate approaches, and the need to pursue mechanisms and moderators of the relations between causal candidates and child outcomes. In the context of these advances, one of the core goals of our research program has been to construct robust representations of environmental adversity for disadvantaged families. Most of our research focuses on contextual co-factors at a family level (e.g., maternal relationship instability), which either have not been described by many researchers or have been described in a way that does not fit the ecology of disadvantage (e.g., marital status). We found that income poverty, key contextual co-factors, and endogenous child attributes tend to show independent and selective associations with child academic competence and externalizing behavior, and that co-factor effects tend to be direct rather than mediated by harsh parenting, tend to have effects that are episodic and concurrent, and are easily- and well-represented by multiple risk indexes that bear powerful relations to child problem behaviors. A second core goal has been to better understand the developmental construction of poor outcomes for disadvantaged children, which requires consideration of dynamic aspects of the ecology and the potential importance of the timing of risk experiences. We found that family instability and change in environmental circumstances predict increases in problem behaviors, and that dose of adversity seems to matter for some variables if it is recent, and not for other variables. Through person-centered research, we also are beginning to understand some factors that seem to underlie the convergence of adjustment problems over grade in school. Many of our co-factor findings and many of our developmental findings seem both complex and double-edged. One edge is that they encourage a certain pessimism in showing how environmental adversity progressively constructs poor outcomes for disadvantaged children in school. Overall, for instance, we saw more problems and more multi-dimensional problems in fifth grade than in first grade, and the impact of environmental change was mostly negative. The other edge, however, is more positive in reflecting the possibility of discontinuity in child adjustment problems associated with positive changes in family
124
Brian P. Ackerman and Eleanor D. Brown
circumstances. Findings for minimal persistence and for the strength of recent and concurrent effects argue that the possibility of self-righting and emergent competence in school is robust through the fifth grade even for the most problematic disadvantaged children.
REFERENCES Achenbach, T. M. (1991). Manual for the Child Behavior Checklist 4-18 and 1991 profile. Burlington, VT: University of Vermont Department of Psychiatry. Ackerman, B. P., Brown, E. D., & Izard, C. E. (2003). Continuity and change in levels of externalizing behavior in school of children from economically disadvantaged families. Child Development, 74, 694 – 709. Ackerman, B. P., Brown, E. D., & Izard, C. E. (2004a). The relations between contextual risk, earned income, and the school adjustment of children from economically disadvantaged families. Developmental Psychology, 40, 204 – 216. Ackerman, B. P., Brown, E. D., & Izard, C. E. (2004b). The relations between persistent poverty and contextual risk and children’s behavior in elementary school. Developmental Psychology, 40, 367 – 377. Ackerman, B. P., Brown, E. D., Schoff, D’Eramo, K., & Izard, C. E. (2002). Maternal relationship instability and the school behavior of children from economically disadvantaged families. Developmental Psychology, 38, 694 – 704. Ackerman, B. P., D’Eramo, K. S., Umylny, L., Schultz, D., & Izard, C. (2001). Family structure and the externalizing behavior of children from economically disadvantaged families. Journal of Family Psychology, 15, 288 – 300. Ackerman, B. P., Izard, C. E., Schoff, K., Youngstrom, E. A., & Kogos, J. (1999). Contextual risk, caregiver emotionality, and the problem behaviors of six- and sevenyear-old children from economically disadvantaged families. Child Development, 70, 1415 – 1427. Ackerman, B. P., Kogos, J., Youngstrom, E., Schoff, K., & Izard, C. (1999). Family instability and the problems behaviors of children from economically disadvantaged families. Developmental Psychology, 34, 258 – 268. Ackerman, B. P., Schoff, K., Levinson, K., Youngstrom, E., & Izard, C. E. (1999). The relations between cluster indexes of risk and promotion and the problem behaviors of 6- and 7-year-old children from economically disadvantaged families. Developmental Psychology, 35, 1355 – 1366. Adam, E. K., & Chase-Lansdale, P. L. (2002). Home sweet home(s): Parental separations, residential moves, and adjustment problems in low-income adolescent girls. Developmental Psychology, 38, 792 – 805. Anderson, E. R., Greene, S. M., Hetherington, E. M., & Clingempeel, W. G. (1999). The dynamics of parental remarriage: Adolescent, parent, and sibling influences. In E. M. Hetherington (Ed.), Coping with divorce, single parenting, and remarriage (pp. 295 – 319). Mahwah, NJ: Erlbaum. Angold, A., & Costello, E. J. (2001). The epidemiology of disorders of conduct: nosological issues and comorbidity. In J. Hill & B. Maughan (Eds.), Conduct disorders in childhood and adolescence (pp. 126 – 168). New York: Cambridge University Press.
Poverty Co-factors and the Adjustment of Children in Elementary School
125
Bennett, K. J., Lipman, E. L., Racine, Y., & Offord, D. R. (1998). Annotation: Do measures of externalizing behavior in normal populations predict later outcome?: Implications for targeted interventions to prevent conduct disorder. Journal of Child Psychology and Psychiatry, 39, 1059 – 1070. Bolger, K. E., Patterson, C. J., Thompson, W. W., & Kupersmidt, J. B. (1995). Psychosocial adjustment among children experiencing persistent and intermittent family economic hardship. Child Development, 66, 1107 – 1129. Bornstein, M. H., & Bradley, R. H. (2003), Socioeconomic status, parenting, and child development. Mahwah, NJ: Erlbaum. Britto, P. R., Fuligni, A. S., & Brooks-Gunn, J. (2002). Reading, rhymes, and routines: American parents and their young children. In N. Halfon, K. T. McLearn, & M. A. Schuster (Eds.), Child rearing in America (pp. 117 – 145). New York: Cambridge University Press. Bronfenbrenner, U. (1986). Ecology of the family as a context for human development: Research perspectives. Developmental Psychology, 22, 723 – 742. Brown, E. D., Ackerman, B. P., & Izard, C. E. (2005). Developmental convergence of externalizing behavior and academic difficulty in elementary school for economically disadvantaged children. Unpublished manuscript. Burchinal, M. R., Roberts, J. E., Hooper, S., & Zeisel, S. A. (2000). Cumulative risk and early cognitive development: A comparison of statistical risk models. Developmental Psychology, 36, 793 – 807. Campbell, S. B., Pierce, E. W., Moore, G., Marakovitz, S., & Newby, K. (1996). Boys’ externalizing problems at elementary school age: Pathways from early behavior problems, maternal control, and family stress. Development and Psychopathology, 8, 701 – 719. Caspi, A. (2004). Life-course development: The interplay of social selection and social causation within and across generations. In P. L. Chase-Lansdale, K. Kiernan, & R. J. Friedman (Eds.), Human development across lives and generation (pp. 8 – 27). Cambridge, UK: Cambridge University Press. Coll, C. G., Crnic, K., Lamberty, G., Waski, B. H., Jenkins, R., Garcia, H. V., & McAdoo, H. P. (1996). An integrative model for the study of developmental competencies in minority children. Child Development, 67, 1891 – 1914. Conger, R. D., Wallace, L. E., Sun, Y., Simons, R. L., McLoyd, V. C., & Brody, G. H. (2002). Economic pressure in African American families: A replication and extension of the family stress model. Developmental Psychology, 38, 179 – 193. Cook, T. D., Herman, M. R., Phillips, M., & Settersten, Jr., R. A. (2002). Some ways in which neighborhoods, nuclear families, friendship groups, and schools jointly affect changes in early adolescent development. Child Development, 73, 1283 – 1309. Costello, E. J., Compton, S. N., Keeler, G., & Angold, A. (2003). Relationships between poverty and psychopathology: A natural experiment. Journal of the American Medical Association, 290, 2023 – 2029. Cummings, E. M., Davies, P. T., & Campbell, S. B. (2000). Developmental psychopathology and family process. New York: The Guilford Press. Dearing, E., McCartney, K., & Taylor, B. A. (2001). Change in family income-to-needs matters more for children with less. Child Development, 72, 1779 – 1793. Deater-Deckard, K., & Dodge, K. A. (1997). Externalizing behavior problems and discipline revisited: Nonlinear effects and variation by culture, context, and gender. Psychological Inquiry, 8, 161 – 175.
126
Brian P. Ackerman and Eleanor D. Brown
Deater-Deckard, K., Dodge, K. A., Bates, J. E., & Pettit, G. S. (1998). Multiple risk factors in the development of externalizing behavior problems: Group and individual differences. Development and Psychopathology, 10, 469 – 493. Dodge, K. A., Pettit, G. S., & Bates, J. E. (1994). Socialization mediators of the relation between socioeconomic status and child conduct problems. Child Development, 65, 649 – 665. Duncan, G. J., & Brooks-Gunn, J. (1997a). Consequences of growing up poor. New York: Sage. Duncan, G. J., & Brooks-Gunn, J. (1997b). Income effects across the life span: Integration and interpretation. In G. J. Duncan & J. Brooks-Gunn (Eds.), Consequences of growing up poor (pp. 596 – 610). New York: Sage. Duncan, G. J., & Brooks-Gunn, J. (2000). Family poverty, welfare reform, and child development. Child Development, 71, 188 – 196. Dunn, L. M., & Dunn, L. M. (1981). Peabody Picture Vocabulary Test-Revised. Circle Pines, MN: American Guidance Service. Dunn, J., Deather-Deckard, K., Pickering, K., & O’Connor, T. G. (1998). Children’s adjustment and prosocial behavior in step-, single-parent, and non-stepfamily settings: Findings from a community sample. Journal of Child Psychology and Psychiatry, 39, 1083 – 1095. Eckenrode, J., Rowe, E., Laird, M., & Braithwaite, J. (1995). Mobility as a mediator of the effects of child maltreatment on academic performance. Child Development, 66, 1130 – 1142. Elder, G. J. (1974). Children of the Great Depression. Chicago: University of Chicago Press. Elder, G. J. (1979). Historical change in life patterns and personality. In P. Baltes & O. Brim (Eds.), Life span development and behavior (Vol. 2, pp. 117 – 159). New York: Academic Press. Elder, G. J., Eccles, J. S., Ardelt, M., & Lord, S. (1995). Inner-city parents under economic pressure: Perspectives on the strategies of parenting. Journal of Marriage and the Family, 57, 771 – 784. Evans, G. W. (2003). A multimethodological analysis of cumulative risk and allostatic load among rural children. Developmental Psychology, 39, 924 – 933. Evans, G. W. (2004). The environment of childhood poverty. American Psychologist, 59, 77 – 92. Forgatch, M. S., & DeGarmo, D. S. (1999). Two faces of Janus: Cohesion and conflict. In M. J. Cox & J. Brooks-Gunn (Eds.), Conflict and cohesion in families (pp. 167 – 184). Mahwah, NJ: Erlbaum. Forman, E. M., & Cavies, P. T. (2003). Family instability and young adolescent maladjustment: The mediating effects of parental quality and adolescent appraisals of family security. Journal of Clinical Child and Adolescent Psychology, 12, 94 – 105. Gerard, J. M., & Buehler, C. (2004a). Cumulative environmental risk and youth maladjustment: The role of youth attributes. Child Development, 75, 1832 – 1849. Gerard, J. M., & Buehler, C. (2004b). Cumulative environmental risk and youth problem behavior. Journal of Marriage and the Family, 66, 702 – 720. Greenberg, M. T., Lengua, L. J., Coie, J. D., & Pinderhughes, E. E. (1999). Predicting developmental outcomes at school entry using a multiple-risk model: Four American communities. Developmental Psychology, 35, 403 – 417. Gresham, F. M., & Elliott, S. N. (1990). Social skills rating system. Circle Pines, MN: American Guidance Service.
Poverty Co-factors and the Adjustment of Children in Elementary School
127
Gutman, L. M., Sameroff, A. J., & Eccles, J. S. (2002). The academic achievement of African-American students during early adolescence: An examination of multiple risk, promotive, and protective factors. American Journal of Community Psychology, 39, 367 – 399. Hart, B., & Risley, T. R. (1995). Meaningful differences in the everyday experiences of young American children. Baltimore: Paul H. Brookes. Haskins, R. (1989). Beyond metaphor: The efficacy of early childhood education. American Psychologist, 44, 274 – 282. Hinshaw, S. P. (1992a). Academic underachievement, attention deficits, and aggression: Comorbidity and implications for intervention. Journal of Consulting and Clinical Psychology, 60, 893 – 903. Hinshaw, S. P. (1992b). Externalizing behavior problems and academic underachievement in childhood and adolescence: Causal relationships and underlying mechanisms. Psychological Bulletin, 111, 127 – 155. Kim, K. J., Conger, R. D., Elder, G. H., & Lorenz, F. O. (2003). Reciprocal influences between stressful life events and adolescent internalizing and externalizing problems. Child Development, 74, 127 – 143. Kupersmidt, J. B., Burchinal, M. R., & Patterson, C. J. (1995). Developmental patterns of childhood peer relations as predictors of externalizing behavior problems. Development and Psychopathology, 7, 825 – 843. Kurdek, L. A., Fine, M. A., & Sinclair, R. J. (1995). School adjustment in sixth graders: Parenting transitions, family climate, and peer norm effects. Child Development, 66, 430 – 445. Lahey, B. B., Moffitt, T. E., & Caspi, A. (2003). Causes of conduct disorder and juvenile delinquency. New York: The Guilford Press. Lahey, B. B., Loeber, R., Hart, E. L., Frick, P. J., Applegate, B., Zhang, Q., et al. (1995). Four-year longitudinal study of conduct disorder in boys: Patterns and predictors of persistence. Journal of Abnormal Psychology, 104, 83 – 93. Leventhal, T., & Brooks-Gunn, J. (2000). The neighborhoods they live in: The effects of neighborhood residence on child and adolescent outcomes. Psychological Bulletin, 126, 309 – 337. Lewis, M. D., Feiring, C., & Rosenthal, S. (2000). Attachment over time. Child Development, 71, 707 – 720. Liaw, F., & Brooks-Gunn, J. (1994). Cumulative familial risks and low-birthweight children’s cognitive and behavioral development. Journal of Clinical Child Psychology, 223, 360 – 372. Luthar, S. S. (2003). Resilience and vulnerability: Adaptation in the context of childhood adversities. Cambridge, UK: Cambridge University Press. Macmillan, R., McMorris, B. J., & Kruttschnitt, C. (2004). Linked lives: Stability and change in maternal circumstances and trajectories of antisocial behavior in children. Child Development, 75, 205 – 220. Maguin, E., & Loeber, R. (1996). Academic performance and delinquency. In M. Tonry (Ed.), Crime and justice (Vol. 20, pp. 145 – 264). Chicago: University of Chicago Press. Marmot, M. (2004). The status syndrome. NY: Henry Holt and Company. Masten, A. S. (2001). Ordinary magic: Resilience processes in development. American Psychologist, 56, 227 – 238. Masten, A. S., Hubbard, J. J., Gest, S. D., Tellegen, A., Garmezy, N., & Ramirez, M. (1999). Competence in the context of adversity: Pathways to resilience and maladaptation from childhood to late adolescence. Development and Psychopathology, 11, 143 – 169.
128
Brian P. Ackerman and Eleanor D. Brown
Mayer, S. (1997). What money can’t buy: The effect of parental income on children’s outcomes. Cambridge MA: Harvard University Press. McLeod, J. D., & Nonnemaker, J. M. (2000). Poverty and child emotional and behavioral problems: Racial/ethnic differences in processes and effects. Journal of Health and Social Behavior, 41, 137 – 161. McLoyd, V. C. (1989). Socialization and development in a changing economy: The effects of paternal job and income loss on children. American Psychologist, 44, 293 – 302. McLoyd, V. C. (1998). Socioeconomic disadvantage and child development. American Psychologist, 53, 185 – 204. McLoyd, V. C., & Smith, J. (2002). Physical discipline and behavior problems in African American, European American and Hispanic children: Emotional support as a moderator. Journal of Marriage and the Family, 64, 40 – 53. Mistry, R. S., Vandewater, E. A., Huston, A. C., & McLoyd, V. C. (2002). Economic well-being and children’s social adjustment: the role of family process in an ethnically diverse low-income sample. Child Development, 73, 935 – 951. Moffitt, T. E. (1993a). ‘‘Life-course-persistent’’ and ‘‘adolescent-limited’’ antisocial behavior: A developmental taxonomy. Psychological Review, 100, 674 – 701. Moffitt, T. E. (1993b). The neuropsychology of conduct disorder. Developmental and Psychopathology, 5, 135 – 151. Moffitt, T. E., Caspi, A., Dickson, N., Silva, P., & Stanton, W. (1996). Childhood-onset vs. adolescent-onset antisocial conduct problems in males: Natural history from ages 3 to 18 years. Development and Psychopathology, 8, 399 – 424. Moffitt, T. E., Caspi, A., Harrington, H., & Milne, B. J. (2002). Males on the life-coursepersistent and adolescent-limited antisocial pathways: Follow-up at age 26 years. Development and Psychopathology, 14, 179 – 207. Pagani, L., Boulerice, B., Vitaro, F., & Tremblay, R. E. (1999). Effects of poverty on academic failure and delinquency in boys: A change and process model approach. Journal of Child Psychology and Psychiatry, 40, 1209 – 1219. Patterson, G. R. (1999). A proposal relating a theory of delinquency to societal rates of juvenile crime: Putting Humpty Dumpty together again. In M. J. Cox & J. BrooksGunn (Eds.), Conflict and cohesion in families (pp. 11 – 35). Mahwah, NJ: Erlbaum. Patterson, G. R., Forgatch, M. S., Yoerger, K. L., & Stoolmiller, M. (1998). Variables that initiate and maintain an early-onset trajectory for juvenile offending. Development and Psychopathology, 10, 531 – 547. Reid, J. B., Patterson, G. R., & Snyder, J. (2002). Antisocial behavior in children and adolescents. Washington, DC: American Psychological Association. Rutter, M., Cox, A., Tupling, C., Berger, M., & Yule, W. (1975a). Attainment and adjustment in two geographical areas: 1. The prevalence of psychiatric disorder. British Journal of Psychiatry, 126, 520 – 533. Rutter, M., Giller, H., & Hagell, A. (1998). Antisocial behavior by young people. Cambridge, UK: Cambridge University Press. Rutter, M., Yule, B., Quinton, D., Rowlands, O., Yule, W., & Berger, W. (1975b). Attainment and adjustment in two geographical areas: 3. Some factors accounting for area differences. British Journal of Psychiatry, 126, 520 – 533. Sameroff, A. J., Gutman, L. M., & Peck, S. C. (2003). Adaptation among youth facing multiple risks: Prospective research findings. In S. S. Luthar (Ed.), Resilience and vulnerability: Adaptation in the context of childhood adversities (pp. 364 – 391). New York: Cambridge University Press.
Poverty Co-factors and the Adjustment of Children in Elementary School
129
Sameroff, A. J., Peck, S. C., & Eccles, J. S. (2004). Changing ecological determinants of conduct problems from early adolescence to early adulthood. Development and Psychopathology, 16, 873 – 896. Sameroff, A. J., Seifer, R., & Bartko, T. (1997). Environmental perspectives on adaptation during childhood and adolescence. In S. S. Luthar, J. A. Burack, D. Cicchetti, & J. R. Weisz (Eds.), Developmental psychopathology: Perspectives on adjustment, risk, and disorder (pp. 507 – 526). Cambridge, UK: Cambridge University Press. Schoon, I., Bynner, J., Joshi, H., Parsons, S., Wiggins, R. S., & Sacker, A. (2002). The influence of context, timing, and duration of risk experiences for the passage from childhood to midadulthood. Child Development, 73, 1486 – 1504. Seifer, R., Sameroff, A. J., Baldwin, C. P., & Baldwin, A. (1992). Child and family factors that ameliorate risk between 4 and 13 years of age. Journal of the American Academy of Child and Adolescent Psychiatry, 31, 893 – 903. Seccombe, K. (2000). Families formed outside of marriage. Journal of Marriage and the Family, 62, 1094 – 1113. Stoneman, Z., Brody, G. H., Churchill, S. L., & Winn, L. L. (1999). Effects of residential instability on Head Start children and their relationships with older siblings: Influences of child emotionality and conflict between family caregivers. Child Development, 70, 1246 – 1262. Thorndike, R. L., Hagen, E. P., & Sattler, J. M. (1986). Stanford-Binet Intelligence Scale: Guide for administering and scoring the Fourth Edition. Chicago: Riverside. White, L., & Rogers, S. J. (2000). Economic circumstances and family outcomes: A review of the 1990s. Journal of Marriage and the Family, 62, 1035 – 1051. Wilkinson, R. (2005). The impact of inequality. NY: The New Press. Yeung, W. J., Linver, M. R., & Brooks-Gunn, J. (2002). How money matters for young children’s development: Parental investment and family processes. Child Development, 73, 1861 – 1879.
This page intentionally left blank
I THOUGHT SHE KNEW THAT WOULD HURT MY FEELINGS: DEVELOPING PSYCHOLOGICAL KNOWLEDGE AND MORAL THINKING
Cecilia Wainryb and Beverly A. Brehl DEPARTMENT OF PSYCHOLOGY, UNIVERSITY OF UTAH, SALT LAKE CITY, UTAH 84112, USA
I. INTRODUCTION II. MORAL JUDGMENTS ABOUT THE WORLD AS UNDERSTOOD III. CHILDREN’S DEVELOPING UNDERSTANDINGS OF PERSONS: A THUMBNAIL SKETCH
A. WHAT DO CHILDREN KNOW ABOUT PERSONS’ BELIEFS, DESIRES, AND INTENTIONS? B. AND WHAT DO THEY KNOW ABOUT PERSONS’ EMOTIONS? IV. CHILDREN’S MORAL JUDGMENTS ABOUT THE BEHAVIORS OF PERSONS AS UNDERSTOOD
A. BELIEFS, FALSE BELIEFS, WRONG BELIEFS, AND MORAL JUDGMENTS B. MENTAL STATES, TRANSPARENCY, AND MORAL JUDGMENTS C. PASSIVE MINDS, ACTIVE MINDS, AND MORAL JUDGMENTS V. CONCLUSIONS AND FUTURE CHALLENGES REFERENCES
I. Introduction Scientific interest in children’s psychological and moral understandings is not new. Nearly 80 years ago, Piaget considered children’s understandings of thoughts and dreams (1929/1960) and moral rules (1932/1965). His proposition that young children’s peculiar moral judgments were related to their limited understanding of intentionality, too, presciently foreshadowed the idea that psychological understandings are implicated in moral thinking. In the 1970s and early 1980s, following Piaget’s observations, the relation between children’s moral judgments and their understandings of intentionality and person perception was investigated further (Berndt & Berndt, 1975; Darley, Klosson, & Zanna, 1978; Darley & Zanna, 1982; Karniol, 1978; Keasey, 1977; Nelson-Le Gall, 1985; Shultz, Wright, & Schleifer, 1986). Since that time, 131 Advances in Child Development and Behavior R Kail (Editor)
ß 2006 Elsevier B.V. All rights reserved.
132
Cecilia Wainryb and Beverly A. Brehl
however, research on children’s psychological and moral concepts has proceeded along independent lines, with two considerable bodies of empirical evidence coexisting side by side. In this chapter, we argue that the relation between the two realms of development merits close attention and present a framework for thinking about this relation. Our interest in these issues is also not unique. In the past several years, theories-of-mind researchers have begun considering the role of children’s understandings of obligation, including moral obligation, in the interconnected network of mental state concepts. For some, this concern was part of their increasingly broader interest in the connections between children’s psychological understandings and their social development and behavior (Astington, 2003; Dunn, 1999; Dunn, Cutting, & Demetriou, 2000). Others recognized that a psychological sense of obligation might operate, not unlike desire, to motivate human behavior (Harris & Nunez, 1996; Kalish & Shiverick, 2004; Lagattuta, 2005; Leslie, 2000). From our perspective as moral development researchers, the interest in the relation between moral and psychological concepts lies elsewhere. Other than those who take a strict behaviorist stance, few would disagree with the proposition that a moral attitude hinges, at its very basis, on a consideration of the agent’s mental states. To pronounce a situation as implicating moral concerns, one must at the very least bear in mind the agent’s beliefs and reasons for acting. This is why, we argue, the unwitting dissociation between investigations of children’s developing psychological and moral concepts worked to constrain the study of children’s moral thinking. To fully understand children’s moral lives—to capture the ways in which children bring their moral concepts to bear on their actual social interactions and conflicts—it is important to understand how children’s developing psychological concepts become implicated in their moral judgments. The set of propositions we put forth in this chapter stem in part from a wellestablished tradition in moral development research that has held that starting at a young age, children develop moral concepts that bear on the welfare and fair treatment of others (Turiel, 1983a, 1998). Over the last three decades, researchers working from what has become known as the ‘‘domain-specific’’ tradition have radically altered the way we understand children’s morality by demonstrating that by the age of 3 or 4 children already possess moral concepts—ways of thinking about welfare and justice—that are not contingent on non-moral considerations, but prescriptive and generalizable. Even at this young age, children judge it to be wrong and unacceptable to hurt or mistreat others, not merely because of the potential for ensuing punishment, but because of their concerns with fairness and the well-being of persons. The results of more than 100 studies have indicated that children hold their moral concerns with welfare and justice to be applicable regardless of rules or
Developing Psychological Knowledge and Moral Thinking
133
custom (for comprehensive reviews of this research, see Helwig & Turiel, 2003; Smetana, 2006; Turiel, 1998). One way to understand the mounting evidence emerging from this research tradition is to say that, insofar as children understand an event as implicating an agent who intentionally mistreats or inflicts harm upon an unwilling victim, they bring to bear their moral concepts both on straightforward instances of physical and psychological harm and unfairness and on multifaceted situations that entail overlapping concerns with morality, social conventions, and personal choice. But how do children come to understand that an event involves intentional harm? This is not only a complex question, but also a potentially dangerous one, as it can introduce within the study of moral development a fair amount of subjectivity and, with it, the dreaded specter of relativism. In attempting to consider this question, we have relied on two additional long-standing research traditions in developmental psychology (Piaget, 1952; Piaget & Inhelder, 1969) and social psychology (Asch, 1952)—both of which emphasize the active and interpretive stance of people vis-a´-vis reality. In the process, we have concluded that children’s developing psychological understandings become implicated in their moral thinking in ways that have far-reaching implications. This is our conclusion, however. We shall start at the beginning. Against the backdrop of these three broad theoretical and empirical bodies of knowledge, we begin this chapter by outlining a framework for understanding why and how children’s psychological understandings are implicated in their moral thinking. To do so, we discuss, first, the role of interpretation in moral thinking, putting forth the proposition that children make moral judgments about the world as they understand it, or construe it, to be. Next we argue for the view that children’s understandings of persons—persons’ beliefs, desires, intentions, and emotions—are a central component of how children construe the world, and present evidence of the multiple ways in which children’s developing psychological concepts inform, and differently constrain, their moral thinking. We conclude by speculating broadly about the nature of the relation between psychological knowledge and moral reasoning, and discuss the implications that this relation may have for future research in moral development.
II. Moral Judgments About the World as Understood To understand the role of interpretation in moral thinking, it is necessary to situate this argument within the larger framework of moral development research. Long before the notions of subjective construal and interpretation entered the scholarly discourse by way of social constructionist and postmodernist writings (e.g., Gergen, 1991; Shweder, 1999), social and developmental
134
Cecilia Wainryb and Beverly A. Brehl
psychologists had provided compelling evidence that people construe their own understandings of reality. At least 50 years before Gergen or Shweder first spoke about subjectivism, Gestalt psychologists had amassed evidence suggesting that ‘‘objects of judgment’’ (whatever individuals make judgments about) are not fixed and do not merely reside in or coincide with the events themselves. Objects of judgment, they showed us, are cognitively created and transformed as individuals interpret unfolding events (Asch, 1952; Duncker, 1939). Piaget, too, beat postmodern thinkers to the realization that people’s construals of reality are distorted, or informed, by their own understandings and positions, thereby giving rise to a long-standing research tradition that views children, and people in general, as actively construing, rather than passively registering, reality (Piaget, 1952; Piaget & Inhelder, 1969). The proposition that people construe their own understanding of reality and the related realization that people behave and respond primarily in accord with their own interpretations of their experiences, rather to the experiences themselves, spurred interest among psychologists working in fields as diverse as aggression (Dodge, 1986, 2003), depression (Beck, 1967; Dalgleish et al., 2003; Hammen & Zupan, 1984); peer relations (Asher & Wheeler, 1985; Crick & Ladd, 1990; Prinstein, Cheah, & Guyer, 2005), everyday problem solving (Berg, Meegan, & Deviney, 1998), and emotional development (Lewis, 2001; Weiner & Graham, 1984). This was not the case in the field of moral development. Although Asch (1952; see also Duncker, 1939) understood, and pointed out, that beliefs and interpretations of reality play a fundamental role in moral decisions, the idea of interpretation has encountered serious resistance within the field of moral development. It is impossible to determine with any certainty why interpretation has been largely ignored in research on moral thinking. We speculate that this may have been due, at least in part, to the ever present concern, among moral developmentalists, that attending to the subjective ways in which individuals interpret moral conflicts would inevitably lead to moral relativism. It was Kohlberg (1971) who first introduced, into the study of moral development, the injunction against confounding matters of value (‘‘ought’’) with matters of fact (‘‘is’’). In so doing, Kohlberg successfully steered the field of moral psychology away from the relativistic definitions of morality put forth by the then prevalent behaviorist psychology; hence, the importance of this distinction cannot be overstated. It is equally important to note, however, that allowing for the possibility that children subjectively construe moral situations does not necessarily lead to moral relativism. This was not clear in Kohlberg’s stage theory of moral development. Kohlberg (1969, 1971) viewed moral development as an increasing process of differentiation, by which moral understandings come to be distinguished from prudential and conventional understandings. In his view, moral thinking at the early stages is contingent on
Developing Psychological Knowledge and Moral Thinking
135
aspects of reality (e.g., interpersonal expectations, existing laws) because ‘‘is’’ and ‘‘ought’’ are not yet distinguished, and it is only with development that matters of value become extricated from matters of fact, such that moral judgments become ‘‘entirely independent of factual assumptions’’ (Kohlberg, 1971, p. 292) and universal. Kohlberg’s emphasis on the increasing differentiation between matters of value and fact worked to obscure the possibility that the ways in which people think about and understand a situation—the ways in which they construe the relevant facts—may play a role in moral thinking after moral concepts have become prescriptive (Wainryb, 2004). A view of development that does not assume that moral concepts stem from nonmoral concepts renders this possibility less problematic. In the constructivist and interactional view of moral development put forth by Turiel (1983a, 1998), for example, children construct prescriptive moral concepts, not out of nonmoral concepts, but out of their social interactions. That is, children’s understandings and interpretations of the features of those interactions (e.g., their construal of the consequences of an attack on someone or an insult) constitute the basis out of which children develop prescriptive moral concepts. This is not the same as saying that children decide what is right or wrong merely on the basis of how things are. Rather, our argument has been that children’s beliefs and interpretations about the way things are serve as the background against which they make their moral judgments (Wainryb, 1991, 1993, 2000, 2004; Wainryb & Turiel, 1993). In other words, when children (and adults) bring their moral concepts to bear on specific social events and interactions, they do so with regards not to the ‘‘real’’ events and interactions as they ‘‘truly happened,’’ but to their interpretations or construals of those events and interactions. People, we have argued, make judgments about the world as they understand it to be (Wainryb, 2004). This proposition makes room for subjectivity in moral judgments, but associates such subjectivity not with relativism at the level of moral concepts, but with relativism at the level of understandings of reality—a distinction aptly described by Asch (1952) as the difference between moral relativism and ‘‘relational determination of meanings’’ (see also Turiel, Killen, & Helwig, 1987). That persons make judgments in relation to their own (sometimes mistaken, sometimes biased; see Wainryb, 2000) interpretations or understandings of the relevant facts is plainly evident in the nature of the controversies surrounding complex social issues, such as abortion, pornography, or capital punishment. The variability in opinions about these matters, as expressed in public discourses as well as in legal opinions, clearly revolves around ambiguities in basic assumptions about the features of those acts. For example, the definition of life and the determination of its beginning are ambiguous concepts that are difficult to specify. The consequences of pornography are also in dispute, with some believing that it leads to violent and criminal behavior and others disputing such
136
Cecilia Wainryb and Beverly A. Brehl
a causal connection. The deterring potential of capital punishment has been an equally disputed subject. The ambivalence concerning these assumptions exists not only in the thinking of experts in the relevant disciplines but also in the thinking of the layperson, and research with adolescents and adults has shown that their positive or negative evaluations of those issues were systematically associated with their differing beliefs about or construals of the relevant ‘‘facts’’ (Turiel, Hildebrandt, & Wainryb, 1991). This phenomenon—that people make moral judgments in relation to their specific construals of the relevant facts—is not restricted to larger societal issues or to the reasoning of adolescents and adults. Like adolescents and adults, children continuously appraise the features of the social contexts in which they participate, and make judgments that vary systematically in accord with their own construals and interpretations of those contexts. Children, even young children, may differently construe ordinary events in their social lives, such as conflicts over turn-taking, teasing, name-calling, or exclusion. Children may differently construe the relevant facts (e.g., whether Frank had his turn at the swing already; whether tossing Lisa’s hat around the room was part of the game), they might differentially attend to various aspects of the situation, and they may also differently construe what others knew (e.g., I thought she knew that would hurt my feelings), believed (e.g., did she really believe that calling her son ‘‘lazy’’ would motivate him to work harder?), or intended to do (e.g., did Dylan want to hurt Carl’s feelings by ignoring him, or was he merely preoccupied with other issues?). Although the possibility that children make moral judgments in relation to their specific interpretation of situations was considered many years ago in the context of empirical findings (e.g., Berndt & Berndt, 1975; Sedlak, 1979), this proposition did not receive systematic consideration until much later. Starting in the early 1990s, we began systematically documenting aspects of the interpretive process that goes into making moral judgments. While recognizing that a child’s construal of a moral situation is complex and involves multiple considerations (e.g., What was the agent’s intention? What did the agent think would happen? What did the victim feel?), we began the exploration of these issues with a simple heuristic. In an effort to demonstrate that children make moral judgments in reference to whatever they understand or believe to be true about a given event, we experimentally manipulated what they believed to be factually true about some relevant aspect of the event, and assessed whether their moral judgments varied accordingly. For example, in one study (Wainryb, 1991) participants (ages 11 through 21) were asked to discuss corporal punishment. Their understandings of the actual consequences of corporal punishment were not uniform; some believed that corporal punishment functions in ways that actually help young children to learn and remember, whereas others believed that corporal punishment does not have such positive (and has some negative) consequences.
Developing Psychological Knowledge and Moral Thinking
137
Their moral judgments (assessed separately) were, unsurprisingly, systematically associated with what they held to be true, such that participants who construed corporal punishment as having positive consequences judged it more positively than those who viewed it as lacking such consequences. Participants were then asked to entertain, hypothetically, the possibility of new information that proved their original factual belief to be mistaken and the opposite factual belief to be true. The main finding of this study was that, under an experimental manipulation in which participants agreed as a matter of fact to a different construal of the actual consequences of corporal punishment, the majority of participants changed their moral judgments in the direction expected. Similar results were obtained for children’s reasoning about the morality of practices such as dismissing older job applicants or segregating certain groups of children (Wainryb, 1991), and for adolescents’ and young adults’ reasoning about abortion, pornography (Smetana, 1981; Turiel, Hildebrandt, & Wainryb, 1991), and capital punishment (Wainryb & Laupa, 1994)—all practices whose consequences, like those of corporal punishment, were also differently construed by different individuals. In each case, the evidence indicated that individuals make moral judgments in reference to their specific factual beliefs, and that differences in moral judgments are systematically associated with those beliefs. In each case, also, individuals who made different moral judgments nevertheless endorsed the same moral concepts (e.g., regardless of how participants judged corporal punishment, all stated that it is morally wrong to intentionally inflict harm on others). As a whole, this body of findings bears directly on the role of construal and interpretation in moral thinking, demonstrating that throughout development moral concepts are applied against a background of factual understandings and beliefs about relevant aspects of reality. The proposition that individuals rely on whatever they believe to be true to inform and guide their moral decisions neither suggests nor requires that such beliefs be accurate. In the view of some (e.g., Bandura, 1991), in fact, people merely commit themselves to factual beliefs and construals of reality that cast a positive light on what they know to be immoral choices. In our view, such an extreme proposition is not warranted (Wainryb, 2000). Whereas it is possible that people (including children) engage at times in self-serving, deliberate, and even hypocritical rationalizations, it cannot be merely presumed that divergent construals of reality arise and function solely in that role. Attentional, perceptual, cognitive, and emotional factors might all result in divergent, or even mistaken, construals of reality that cannot be said to arise for the purpose of allowing disengagement from inner conflict (Malle, 2004; Ross, 1990; Ross & Nisbett, 1991). One source of divergence in construals of reality—one that is of most interest to us, as it has systematic effects on children’s moral judgments—is developmental in nature. From an early age, children develop their own understandings
138
Cecilia Wainryb and Beverly A. Brehl
of reality. Children, it has been suggested, are intuitive scientists, constructing concepts in the biological (e.g., Slaughter, Jaakkola, & Carey, 1999), societal (e.g., Kalish, 1998; Turiel, 1983a), and psychological (e.g., Wellman, 1990) realms, and children’s understandings of these various aspects of reality exhibit systematic age-related changes. Because children bring their moral concepts to bear on specific situations as they understand, or construe, them to be, it is often the case that their understandings of biological aspects of reality (e.g., the source of illness) and societal matters (e.g., the agreed-upon convention) may become implicated in their construals of moral situations. It is hard to think of moral situations in which children’s psychological understandings are not directly implicated. Children’s psychological knowledge—their understandings of what people think, feel, and want, and why people do what they do—are inevitably central to how children make sense of other people’s actions in moral situations. In turn, the unique features of the still-developing psychological understandings of young children limit in significant ways how these young children construe, and ultimately evaluate, moral situations. Thus we turn in the next section to discuss the development of children’s psychological understandings, along with the specific ways in which these developing understandings become implicated in their moral thinking.
III. Children’s Developing Understandings of Persons: A Thumbnail Sketch Children’s developing psychological understandings, we have argued, may be implicated in their construals and evaluations of moral conflict situations. What do children know about persons and their mental lives? As consumers of theories-of-mind findings, we approach the construct of theories of mind in a ‘‘neutral and inclusive’’ way (Davies & Stone, 2003, p. 305), taking it to refer to the ability to predict and explain behavior in terms of internal mental states. We do not take a stance in regards to how this ability comes about, or whether it should be thought of in terms of possessing a substantive theory about the psychological world or in terms of a capacity to identify with others and simulate their mental lives (for summaries of these competing propositions, see Astington & Gopnik, 1991a; Goldman, 2002; Harris, 1991; Hobson, 1991; Wellman, 1990). In spite of disagreements on the mechanisms by which psychological understandings develop, theories-of-mind researchers agree that the mentalistic stance (Dennet, 1978) implied in this ability is a core feature of everyday thinking, and also agree upon the general sequence by which children’s understandings of mental states change. Broadly speaking, children’s understandings of mental functioning are said to undergo a general shift from an understanding of the mind as ‘‘passive’’ to
Developing Psychological Knowledge and Moral Thinking
139
‘‘active’’ (Pillow, 1988), or from a ‘‘copy’’ to a ‘‘constructive’’ theory of mind (Chandler & Lalonde, 1996). Children progress from viewing the mind as passive in relation to the external world, to understanding that persons actively interpret their experience based on existing beliefs and expectations, and that mental states are organized such that they inform one another. Young children are unaware that information and experience are actively construed and subjectively organized, and instead think of the mind as a storehouse of information and experiences. Older children come to recognize that subjective experience and mental activities transform information in ways such that persons may arrive at different interpretations of or emotional reactions to the same situation. This general shift in children’s understandings of mental functioning is evident in each element of the conceptual network of mental states.
A. WHAT DO CHILDREN KNOW ABOUT PERSONS’ BELIEFS, DESIRES, AND INTENTIONS?
The most widely researched aspect of children’s network of mental state concepts has been their understandings of belief. As documented by an impressive body of evidence (e.g., Chandler & Lalonde, 1996; Fabricius & ImbensBailey, 2000; Mitchell, 2003; Wellman, Cross, & Watson, 2001), prior to the age of 4 or 5, children do not understand that beliefs are representations of reality or that different people may, therefore, come up with different representations of, or beliefs about, the same reality. Rather, these young children appear to rely on the assumption that perception is the sole basis for belief and that there is a one-to-one mapping between what one sees or hears and what one knows (Pillow & Mash, 1999). Research has shown that 3-year-olds do not understand that someone can have a belief that differs from the actual state of affairs (Wimmer & Perner, 1983), or a perception of something that differs from that of another person’s (Flavell et al., 1981). Three-year-olds also find it difficult to appreciate that someone might not know something (Leslie, 2000), and find it difficult to report even their own previous mistaken beliefs (Harris & Leevers, 2000). Beginning at around the age of 4 – 5 years, children first develop the understanding that even persons who have access to different information might end up with different (although, in children’s view, still mistaken or false) beliefs and only later, beginning somewhere around 7 – 8 years of age, do they begin to understand that the mind acts upon information, by selecting, transforming, and organizing perceptual experiences. At this time, children first realize that persons might form different beliefs even if they have equal access to all relevant information. At this age, also, children begin to recognize that mental events
140
Cecilia Wainryb and Beverly A. Brehl
are linked to one another, such that prior thoughts and emotions can inform current beliefs and interpretations of experience (Fabricius & Schwanenflugel, 1994; Lagattuta & Wellman, 2001; Pillow & Henrichon, 1996). A similar shift from a copy to a representational view of the mind characterizes children’s developing understandings of desire. Research has shown that already at age 2, children can conceive of people as agents whose actions are directed to achieving goals and can predict a person’s action on the basis of her or his desire (e.g., Wellman & Woolley, 1990), and by age 3, they can also understand the emotional consequences of simple fulfilled and unfulfilled desires (e.g., that people are happy when their desires are fulfilled and unhappy when their desires remain unfulfilled; see Hadwin & Perner, 1991; Yuill, 1984). Nevertheless, these early achievements do not yet constitute a full understanding of the concept of desire because such young children do not understand yet that desires are subjective relations to reality. Research has indeed shown that 3-year-olds tend to think of desirability as an objective property of situations rather than a relation between a person and a situation (Perner, 1991). Evidence to this ‘‘objective desirability’’ is young children’s responses to situations in which ‘‘wicked desires’’ (e.g., the desire to hit someone), rather than neutral desires (e.g., the desire to win a prize), are fulfilled. For example, when asked to say how an actor behaving on his or her desire to harm another child might feel, 3-year-olds, unlike their older peers, judged that such an actor would feel sad (Yuill et al., 1996). Similarly, when given information about rules (what is allowed or forbidden) and about a person’s preferences (what someone likes), children under 5 tend to predict that people will want to behave, and will be happier behaving, according to rules, whereas older children recognize that people will want to and will be happier satisfying their preferences (Kalish & Shiverick, 2004). Children’s understandings of intentions, too, change in ways similar to their understandings of beliefs and desires. Piaget’s (1932/1965) early observations, that young children tend to focus on consequences while overlooking intentions, were followed by research suggesting that when information about intentions is not confounded with information about consequences, even 5- and 6-year-olds judge intentional acts to be more wrong than accidental acts (Berndt & Berndt 1975; Darley, Klosson, & Zanna, 1978; Karniol, 1978; Keasey, 1977; NelsonLe Gall, 1985; Shultz, Wright, & Schleifer, 1986), and when not asked to weigh intentions against consequences, even 3-year-old children can distinguish between a deliberate and an accidental breach (Harris & Nunez, 1996; Nunez & Harris, 1998; Siegal & Peterson, 1998). Nonetheless, researchers continued to acknowledge that young children’s early understandings of intentions and motives are not as complete as those of older children (Jones & Nelson-Le Gall, 1995; Karniol, 1978; Nelson-Le Gall, 1985). Decades later, theories-of-mind
Developing Psychological Knowledge and Moral Thinking
141
researchers articulated the development of children’s understandings of intention differently but they, too, pointed to the limits of young children’s understandings. Whereas early work spoke about young children’s relative tendency to overlook information about intentions in favor of information about consequences, more current thinking (Astington & Gopnik, 1991b; Moses, 1993; Wellman, 1990) has been that young children do not yet understand that intentions are mental representations distinct from action and outcomes. Empirical evidence in support of this proposition abounds. For example, when presented with pictures of a child who is either performing an action (e.g., playing on the swings) or getting ready to perform an action (e.g., running towards a swing set), the majority of 3-year-olds cannot yet identify the latter picture as representing prior intention. Their tendency to pick instead the picture depicting the act in progress suggests that they confuse intention and action (Astington, 2001). Other research has shown that children between the ages of 3 and 4 inaccurately report their own intentions in ways that match the actual outcomes of their actions (Phillips, Baron-Cohen, & Rutter, 1998; Schult, 2002) and, in studies using deviant causal chains, 3- and 4-yearolds equate intentions with outcomes thereby confusing fulfilled desire with unfulfilled intention (Schult, 2002). Consistent with these confusions, too, young children are likely to assume that if something happens to be the case, it is because someone intended it to be so, even when provided information to the contrary (Kalish, 2005).
B. AND WHAT DO THEY KNOW ABOUT PERSONS’ EMOTIONS?
Whereas often investigated separately from their understandings of beliefs, desires, and intentions, children’s understandings of emotions constitute an integral part of the network of mental state concepts that, we propose, children bring to bear on their construals of moral situations. The role of emotion understanding in moral thinking is unquestioned. Moral development researchers have recognized early on that the development of moral concepts is, at least in part, related to the perception of the distress experienced by victims (Arsenio, Gold, & Adams, 2006; Arsenio & Lover, 1995; Turiel, 1983a). Theories-of-mind researchers have also been quick to point out that children’s moral intuitions are inextricably bound up with their understanding of emotion (Dunn, 2000; Harris, 1989). As it turns out, children seem able to infer basic emotions from both situations and facial expressions very early on. Even infants recognize different facial expressions displayed by others and, by the end of their first year, display discriminating behavioral responses to the emotional expressions of others (Harris, 1989). By the age of 2, children understand that they can affect another
142
Cecilia Wainryb and Beverly A. Brehl
person’s emotions, as demonstrated by their deliberate efforts to do so through teasing, hurting, comforting, and joking (Dunn, 1991, 1999), and by the age of 3, they recognize that people with different past experiences may have different emotional reactions to the same event (Lagattuta & Wellman, 2001). These early understandings suggest that even before they recognize that other people may have beliefs and intentions different from their own, young children can already grasp that others have some sort of mental life separate from their own (Dunn, 1991, 1999). This, along with children’s early realization that they can influence the emotions of others, is of tremendous relevance to moral development inasmuch as it suggests that a young child can understand that a person’s suffering may be caused and alleviated by the actions of another person. Nonetheless, young children’s understandings of emotion are still limited, as children tend to conflate the inner experience of emotion with external indicators of emotion (Arsenio, Gold, & Adams, 2006; Gnepp, 1989; Gnepp & Klayman, 1992). Young children’s limited understanding of emotion underlies their difficulty recognizing that people can hide their emotions. Indeed, before the age of 5 or 6, children do not understand that people can, for example, feign joy while experiencing distress or disappointment, and do not seem to be aware of the potentially misleading effect this dissemblance can have on others (Harris, 1989). Similarly, young children also tend to assume that a person’s emotion can be explained in terms of outcomes or circumstances in the world. Therefore, when asked to predict what someone else may be feeling, or to explain someone’s unexpected emotional expression, young children tend to rely on the valence of outcomes and on situational cues, rather than on information about what a person may be thinking or wanting. To return to an example we discussed previously, 3-year-olds expect that a child whose desire to hit a peer was fulfilled would feel sad rather than pleased—that is, they attribute emotions based on the objective desirability of certain outcomes, rather than on the relation between outcomes and subjective desires (Yuill et al., 1996). Similarly, before the age of 5, children tend not to incorporate information about a person’s intentions when making attributions about a person’s guilt but, instead, make guilt attributions based largely on the outcomes of a person’s actions (Denham & Kochankoff, 2002; Lagattuta, 2005; Nunner-Winkler & Sodian, 1988). Finally, because of their relative inattention to internal experience and over-reliance on outward expression, children aged 5 and younger have difficulties attributing or explaining emotions when the situation is equivocal (i.e., when there is not one emotion typically associated with an event; e.g., Gnepp & Klayman, 1992) or when a person’s outward emotional expression is inconsistent with other situational cues (Harris, 1989; Lagattuta & Wellman, 2001). Another manifestation of young children’s inadequate understanding of the internal experience of emotion is their limited understanding of mixed
Developing Psychological Knowledge and Moral Thinking
143
emotions (Harris, 1989; Harter & Whitesell, 1989; Nunner-Winkler & Sodian, 1988; Stein & Trabasso, 1989). In part because they conflate mental states with their external manifestations, and perhaps also because they have difficulty focusing on more than one aspect of a situation at a time, children younger than 6 cannot conceive of two emotions being experienced simultaneously. Beginning at around 6 – 8 years, children can describe two consecutive emotions resulting from the same event (e.g., ‘‘Getting my training wheels taken off my bike made me feel happy, but when I fell down I felt sad’’), but 6-year-olds consistently display a bias towards reporting ‘‘good’’ feelings whenever mixed feelings are possible. Soon after, children can describe situations that cause two simultaneous emotions, but begin by combining two feelings of similar valence, and not until the age of 11 or so do children recognize that a single situation can elicit simultaneous feelings of opposite valence. In all, children’s understandings of emotion can be thought of as undergoing a shift not unlike that exhibited by their understandings of belief, desire, and intention. In line with their ‘‘copy’’ theory of mind, up to about the age of 4 or 5, children have difficulty distinguishing between the inner experience of emotion and external indices of emotion, and focus on the outward expressions of emotion (facial expressions and behavior) and on their knowledge of typical event–emotion associations (e.g., getting a shot at the doctor is associated with negative emotion). At around the age of 5 they begin to link desire to emotion, and by the age of 7 – 8, as they move beyond their ‘‘copy’’ theory of mental life, they begin to recognize that a person’s emotional expression may belie the actual affective state which is, in turn, related to a person’s subjective beliefs about and appraisals of his or her experience. It is important to note here, for it is relevant to the arguments we develop in the next section, that in spite of the significant developments in children’s understandings of emotion, even older children and adults are poor at distinguishing between real and displayed emotion (Denham & Kochanoff, 2002; Harris, 1989). Indeed, none of the developments described in this and the previous section results in true ‘‘mind reading.’’ And yet, even though children may not become more accurate at reading another’s mind in the sense of being able to tell, accurately, what another person feels—or, for that matter, what another person thinks, intends, or wants—they do become more capable of acknowledging the complexity of mental states and the interrelatedness of mental life. It is this more complex understanding of psychological experience— rather than an ability to know precisely what other people feel or think—that is likely to render their construals of social and interpersonal conflicts substantially different from those of younger children. In a very real sense, a child with a more complex understanding of mental life is, in fact, evaluating a very different situation from that of a child with a more rudimentary understanding of the mind. This, in a nutshell, is the topic of the next section.
144
Cecilia Wainryb and Beverly A. Brehl
IV. Children’s Moral Judgments about the Behaviors of Persons as Understood Recall that our original starting point was that, in construing moral situations, children must account for the mental states (e.g., beliefs, intentions, and so on) of those involved. We have now provided a thumbnail sketch of what children, at different ages, actually understand about mental life. Now it remains to be examined how their developing psychological knowledge becomes implicated in their moral thinking. In our research examining the manifestations of psychological knowledge in children’s moral reasoning, we at first focused exclusively on the role played by children’s understandings of belief, and only later did we consider the role played by children’s understandings of other mental states. This strategy is only to be understood in heuristic terms rather than as implying that children’s understandings of beliefs have any sort of privileged status vis-a`-vis their moral thinking. For it is children’s understanding of the person as a whole—of the moral agent as an intentional agent—that informs how they interpret the agent’s actions and construe the moral situation. In this program of research, we have employed two distinct methods, each of which allowed us to examine different aspects of this relation. One method involved presenting participants with hypothetical stories depicting characters who were engaged in harmful or unfair behavior and whose beliefs—beliefs upon which the characters themselves presumably grounded their behavior— were explicitly given and experimentally manipulated. For example, in a series of studies (Shaw & Wainryb, 1999; Wainryb, 1993; Wainryb & Ford, 1998; Wainryb, Shaw, & Maianu, 1998) participants between the preschool and college years were asked to evaluate characters who engaged in harmful behaviors on the basis of beliefs different from their own (e.g., a teacher who puts down her students because she holds the belief that the way to teach children is to put them down when they make mistakes). Using this method we obtained important information about whether and how children of different ages account (or do not account) for other people’s different beliefs in evaluating their harmful behaviors. The intrinsic constraint of this approach is that it does not make it possible to ascertain whether children actually consider other people’s mental states when information about mental states is not provided explicitly in the stimuli. For the purposes of understanding whether and how children account for mental states in their own construals of interpersonal conflicts, in subsequent research we withheld explicit information about the other person’s mental states. In one such study (Brehl & Wainryb, 2005), for example, we used hypothetical scenarios depicting conflicts between peers whose mental states were left ambiguous, and asked participants to explain why actors may have behaved the way they did. Additional information relevant to these questions was gleaned
Developing Psychological Knowledge and Moral Thinking
145
from a study (Wainryb, Brehl, & Matwin, 2005) in which participants were asked to provide narrative accounts about their own interpersonal conflicts with their peers. In all, this program of research has provided evidence of how children’s stance in relation to the mind as passive or active, informs, in significant ways, their construal of moral situations.
A. BELIEFS, FALSE BELIEFS, WRONG BELIEFS, AND MORAL JUDGMENTS
In a first series of studies, we focused on whether children took account of other people’s different beliefs, and whether doing so impacted the ways in which they construed and evaluated moral situations. Recall that earlier in this chapter we described research (e.g., Turiel, Hildebrandt, & Wainryb, 1991; Wainryb, 1991) suggesting, first, that in making moral decisions children apply moral concepts to their own—sometimes mistaken or biased—construals or interpretations of the relevant facts and, second, that children’s divergent construals are consistently associated with their divergent moral judgments. In the research we describe in this section we asked, in a sense, whether children have a grasp of this process, that is, whether they recognize that people who make seemingly immoral decisions may be proceeding from beliefs different from their own, and whether they are differentially forgiving of people who act on the basis of different types of mistaken or wrong beliefs. The findings from this first series of studies were central for understanding the ways in which developments in children’s stance toward mental life differently inform their construals and evaluations of moral transgressions. This research was also valuable for understanding when that does not happen. The general strategy we used in these studies was to tell participants (who, in various studies, were between the ages of 3 and 21 years) about characters who engaged in seemingly harmful behaviors because of factual beliefs different from the participants’ beliefs; for the purposes of comparison, characters were also depicted as engaging in the same or similar behaviors because of moral beliefs different from the participants’ beliefs (the characters’ factual and moral beliefs were given explicitly in the stimuli, and baseline assessments were used to ascertain that participants thought that the depicted behaviors were harmful and that the underlying factual and moral beliefs were untrue/wrong). For example, in one study with 3-, 5-, and 7-year-olds (Wainryb & Ford, 1998), we asked children to consider ‘‘a teacher who tells one of his students that his picture is dumb’’ because he held one of two beliefs, namely, that ‘‘it will teach the child to do a better job next time’’ (a factual belief with which participants disagreed) or that ‘‘it is alright to be mean to children’’ (a moral belief with which participants disagreed). Another example was of ‘‘a teacher who gave girls
146
Cecilia Wainryb and Beverly A. Brehl
more snack food than boys’’ because she believed either that ‘‘girls need more food than boys’’ or that ‘‘it is alright to be nicer to girls and not as nice to boys.’’ Unsurprisingly, 3-year-old children did not understand that the characters depicted in the stimuli had beliefs different from their own even when they were explicitly and repeatedly told that this was the case, and thus uniformly evaluated the characters’ behaviors in terms of what they themselves thought to be true and right. By comparison, 5- and 7-year-olds had little difficulty attributing to characters’ factual and moral beliefs different from their own. Still, not all 5- and 7-year-olds considered the characters’ actions to be acceptable even when based on the characters’ different factual beliefs; nearly half judged the characters’ behaviors in accord with their own factual beliefs. Furthermore, even when they understood that characters were acting on the basis of different moral beliefs, the majority of 5- and 7-year-olds judged behaviors based on non-normative moral beliefs to be wrong. Starting at about the age of 8 – 9, children attended to the characters’ beliefs when judging the seemingly harmful practices those characters engaged in, but accounted differently for moral and factual beliefs. In general, they made negative judgments of practices that were based on moral beliefs different from their own, but were more accepting of the same practices if they were based on factual beliefs different from their own. For example, in one study (Wainryb, 1993) children and adolescents (ages 11 – 21) negatively judged the actions of people in a (hypothetical) culture who beat their children with sticks because they believe that it is alright to hurt children (a moral belief with which participants disagreed), but judged less negatively the same actions by people in a different culture who believe that children who misbehave are possessed by evil spirits that can only be exorcized with such beatings (a factual belief with which they disagreed). Their reasoning was largely that, unlike people who merely held to a different belief about what is ‘‘right,’’ people who had a different factual understanding of what causes misbehavior were well-intentioned in beating their children. A number of participants speculated, further, that children who are beaten by their parents under such a different factual understanding of reality might themselves experience the beating as less hostile or more helpful due to their sharing of those beliefs (see also Shaw & Wainryb, 1999; Wainryb, Shaw, & Maianu, 1998). Similar findings were obtained with scenarios in which parents prevented girls from attending school or inflicted harm on their children as the children reached puberty (Shaw & Wainryb, 1999; Wainryb et al., 1998), as well as for teachers who prevented some children, but not others, from playing certain games, or divvied up resources unequally among boys and girls (Wainryb & Ford, 1998). When taken as a whole, these findings serve to illustrate how children’s developing understandings of belief function—in this context—to inform and constrain, in systematic ways, children’s moral judgments. Three-year-olds,
Developing Psychological Knowledge and Moral Thinking
147
who understand neither the representational nature of beliefs nor the possibility that others may act on the basis of different beliefs, make moral judgments based on their own beliefs of what is true and right. In all likelihood, in fact, such young children do not view even themselves as having based their judgments on their own beliefs. Rather, they probably proceed as though their judgments refer to the world as it is. Five-year-olds, in contrast, understand that persons can form and act on the basis of different beliefs. However, their copy theory of mind is such that they tend to assume that different beliefs are mistaken or false and indicative of ignorance or misinformation. It is not surprising, therefore, that even as 5-year-olds understand that other people’s actions may be grounded on different beliefs, they often disallow the legitimacy of these ‘‘false’’ beliefs and make moral judgments based on their own ‘‘true’’ beliefs. Somewhere between the ages of 7 and 9, informed by a more mature, representational, understanding of belief, children begin to grasp that whatever people believe to be true (whether accurate or not) is bound to inform their moral decisions. Their more forgiving judgments of people who proceeded on the basis of different factual beliefs suggests that by middle childhood children have begun to understand the unavoidably interpretive nature of human knowledge and its implacable effects on behavior. Their reasoning also suggests something else of significance. Recall that participants reasoned that people who acted on different factual understandings of, say, spirits and misbehavior, were in fact intent on helping, rather than hurting, their children. Thus it appears that, in addition to grasping the connection between people’s beliefs and behaviors, by the age of 7 – 9, children have also begun to contemplate the connection among different mental states (e.g., beliefs and intentions) and to use one to infer the other—something to which we shall return later. And yet, it would be a mistake to think that this is all there is to these data. For, as you may recall, in stark contrast to their increasingly accepting and forgiving judgments of those acting on different factual beliefs, children (even older children and adolescents) judged harshly those who engaged in the same behaviors, but did so on the basis of different moral beliefs. The implications of these findings therefore are, in our view, two-pronged. Prior to the age of 5, children’s primitive psychological understandings significantly constrain their construals and judgments of moral situations. Older children’s more mature understanding of the mind renders them capable of recognizing that other people may behave according to different beliefs. Nevertheless, it cannot be merely assumed that, from that point on, children will positively judge persons who behave in harmful or unfair ways based on any type of different belief. Indeed, these findings strongly suggest that children’s thinking about diversity of belief is constrained in domain-specific ways. Whether children’s views about belief diversity are domain-specific or not is a matter of some controversy. Our own research with participants between
148
Cecilia Wainryb and Beverly A. Brehl
the ages of 5 and 9 (Wainryb et al., 2004) and 9 and 22 (Wainryb et al., 2001) has shown that, in regards to a wide range of disagreements, 5-year-olds, as might befit those adhering to a copy conception of belief, thought that there is always only one right belief and, for the most part, judged that it was unacceptable for people to hold beliefs different from their own. Starting at the age of 7, children began to distinguish consistently between diversity in moral beliefs and diversity in other realms of belief. With regard to moral diversity, they thought that some beliefs are right and some are wrong, and stated unabashedly that it is unacceptable for people to hold the wrong moral beliefs. Even college students, who according to some accounts of epistemological development (e.g., Kuhn, Cheney, & Weinstock, 2000), might have been expected to be more uniformly tolerant of diversity of belief, declared variation in moral beliefs to be both undesirable and unacceptable. At the same time, most children 7 and older also thought that diversity in realms of belief other than morality was acceptable. This is not to say that they thought that all knowledge is relative and that there are no true or right answers; on the contrary, their conception of knowledge was decidedly domain-specific. In their view some realms of belief (e.g., taste, religion) are subjective and relative whereas other realms (e.g., perceptible and verifiable facts) support a single unequivocal right answer. However, they judged that wrong beliefs in nonmoral realms—unlike wrong beliefs in the moral realm—should be tolerated (Wainryb et al., 2001, 2004). This pattern of findings suggests that, when faced with persons who engage in seemingly harmful or unjust behavior because they are acting on different beliefs, children 5 and older might not only take account of those beliefs, but also attend in systematic ways to the types of beliefs in question. The finding that children do not view moral beliefs different from their own as acceptable explains why they are not likely to think it acceptable for persons to engage in harmful behaviors on the basis of different moral beliefs. The finding that children are tolerant of other (non-moral) beliefs with which they disagree does not, however, imply that they would necessarily think it acceptable for persons to engage in harmful behaviors because they are proceeding from those different beliefs. Evidence from one of our studies (Shaw & Wainryb, 1999) indirectly suggests that college students draw distinctions, in their judgments, between factual misconstruals that are merely mistaken and those that are disingenuous and self-serving. In all, then, knowing that a child is capable of understanding belief as a mental representation is not enough to predict how that child might construe and evaluate a situation. This, in turn, suggests that the relation between children’s psychological knowledge and moral thinking is not as straightforward as it might otherwise seem. We shall return to this point towards the end of this chapter.
Developing Psychological Knowledge and Moral Thinking
149
B. MENTAL STATES, TRANSPARENCY, AND MORAL JUDGMENTS
Even as our findings indicate that, by middle childhood, children have a fairly sophisticated understanding of the bearing of factual beliefs on moral decisions and make discriminating judgments about other persons’ decisions and behaviors, their discerning judgments must be considered within the particular conditions specified in our research. Recall that in the aforementioned studies, children were always told explicitly what the characters in the stories believed to be factually true (e.g., ‘‘those people believe that children who misbehave are possessed by evil spirits that can only be exorcised with beatings’’) or morally right (e.g., ‘‘those people believe that it is alright to hurt children’’); moreover, the beliefs were systematically crafted and parsed out so that they would represent prototypical instances of factual or moral beliefs. Outside the confines of the laboratory, in the context of real-life interactions and conflicts, the mental states of others are not transparent or immediately accessible, and thus children (and adults) must make inferences about what others are thinking and feeling in constructing an understanding of these situations. It therefore bears asking whether, when information about other people’s beliefs, desires, and intentions is not explicitly given, children infer such information and take account of it in their moral judgments. Evidence from social psychological research with adults suggests that this might not necessarily be the case. In the absence of explicit information about what other people think, it has been argued, adults tend to underestimate the extent to which other persons proceed from factual beliefs different from their own and, instead, judge the actions of others as being reflective of personal values and traits (e.g., Pronin, Gilovich, & Ross, 2004; Ross, 2001; Ross & Ward, 1996). To address this question, we turned next to examining children’s own construals of hypothetical (Brehl & Wainryb, 2005) and real (Wainryb, Brehl, & Matwin, 2005) interpersonal conflicts. In the Brehl and Wainryb (2005) study, 112 boys and girls in three age groups (mean ages were 4½, 7, and 10), and a comparison group of adults, were presented with hypothetical scenarios in which one child hurt the feelings of a peer through excluding this peer from a group, making unequal distribution of desired goods, or saying something mean. In each scenario, the only information provided was whatever would be immediately observable to a third party witness; the mental states of the actor were left ambiguous. For example, one story depicted a group of children sitting four at a table working on a project. The teacher tells them that as soon as everyone at a table has completed the assignment, all four children are allowed to go outside to play. Jake, the main character in this story, asks only two of his peers at the table to go outside and play with him. In another story, the teacher gives Jayna, the
150
Cecilia Wainryb and Beverly A. Brehl
main character, a bag of candy, and asks her to go around the room and give some candy to each of the students. Jayna gives each child three pieces of candy, but gives one child only one piece of candy. In each case, the ‘‘victim’’ is depicted as looking sad and stating that his or her feelings were hurt. After hearing each story, participants were asked to explain the main character’s behavior. Analyses of the data thus obtained indicated that when construing situations entailing interpersonal conflict, unfairness, and hurt feelings, even young children appreciated that people’s behavior is informed by what they think and feel. Indeed, even though situational (e.g., ‘‘maybe he ran out of candy’’) and dispositional (e.g., ‘‘he’s always mean’’) explanations were present, they accounted for a small percentage of all explanations (18 and 6%, respectively). In all, children as young as 4½ spontaneously attributed to actors a variety of mental states (see also Bartsch & Wellman, 1989). More specifically, the large majority (82%) of participants of all ages offered explanations that included at least one reference to the actors’ desires and preferences (e.g., ‘‘Jake doesn’t like that kid and wanted to play with his other friends,’’ ‘‘Jayna wanted to have more candy for herself ’’); about half the children (49 and 44%, respectively) included in their explanations at least one reference to intentions (e.g., ‘‘Jake was trying to make it clear that the other kid wasn’t welcome,’’ ‘‘that kid must’ve done something mean to Jayna before, and so Jayna was trying to pay her back’’) and to beliefs (e.g., ‘‘Jake thought that this kid wouldn’t want to play with them,’’ ‘‘Jayna thought that this kid didn’t like candy so much’’), and some (22%) also included in their explanations at least one reference to emotions (‘‘Jake didn’t let that kid play because he was angry at that kid for some reason,’’ ‘‘I think Jayna was having a bad morning, like maybe she got in a fight with her mom, and so she was letting out all her bad feelings on that girl’’). In all, desires accounted for 37% of all explanations offered; references to intentions and beliefs accounted for an additional 14 and 10%, the attribution of emotions for 5%, with the remainder of explanations consisting of conflations between various mental states and the actors’ actions or their consequences. Although the attribution of beliefs made up only a small proportion of all explanations in this study, this should not be taken as meaning that children (and adults) generally failed to acknowledge that the actors may have understood or construed the situation differently. In everyday discourse, beliefs are often understood to be both implied and efficiently communicated through the assertion of desires, preferences, and intentions. For example, a child who explains that Jayna gave another child less candy because ‘‘she didn’t want her running around and around and not sitting still’’ implies that Jayna believed that candy would lead that one child to become hyperactive or ill-behaved. Thus, it may be that children do not find it necessary to speak explicitly about belief except when a belief is not made evident through the communication of a desire, preference, or intention (see also Bartsch & Wellman, 1989).
Developing Psychological Knowledge and Moral Thinking
151
Although in our past research (e.g., Shaw & Wainryb, 1999; Wainryb, 1991, 1993; Wainryb & Ford, 1998, Wainryb, Shaw, & Maianu, 1998) we had referred to divergent factual beliefs as markers of differences in interpretation, an examination of children’s spontaneous explanations of harmful and unfair behavior revealed that (as noted earlier, see page 147) children often used mental states other than belief to indicate differences in interpretation. In many cases, for example, explanations referring to the actor’s intentions also implied a difference in interpretation on the part of the actor (e.g., ‘‘maybe Jayna was trying to make sure that kid didn’t get sick by not giving her too much candy’’). When re-examining the data without equating the attribution of interpretive differences with the attribution of differences in factual belief, we found that 36% of children gave explanations that implicated interpretive differences. In general, the content of these explanations suggested that, in the view of participants, the actor had construed the situation in such a way that his or her behavior did not entail harm (e.g., ‘‘maybe Jake didn’t ask the other kid to play because he thought that kid didn’t like that game’’). Interestingly, as evidenced in children’s narrative accounts of their own interpersonal conflicts with others (Wainryb, Brehl, & Matwin, 2005), this pattern of findings is not confined to children’s explanations of the behaviors of hypothetical others. In this study, 112 boys and girls in four age groups (mean ages were 5, 7, 10, and 16) were asked to provide narrative accounts of an instance in which they had been harmed by a peer (‘‘victim’’) and one in which they had inflicted harm on a peer (‘‘perpetrator’’), along with moral evaluations of their own and the other child’s actions. The narratives were analyzed in terms of the semantic elements they included (e.g., the proportion of references to the perpetrator’s harmful behavior, the victim’s response, the perpetrator’s and victim’s emotions) as well as in terms of various measures of narrative structure (e.g., topic maintenance, event sequencing, fluency). Many aspects of the semantic and structural analyses are not relevant to our present purposes and are discussed in detail elsewhere (Wainryb et al., 2005). Of interest to our present purposes is whether children bring their psychological knowledge to bear on their construals of their own interactions, and whether (and how) their psychological construals are related to their moral thinking. We note here that the narratives were elicited using broad, non-directive probes (e.g., ‘‘Tell me about a time when you said or did something that hurt another kid; tell me everything you can remember about that time’’). Therefore, any references to mental states were entirely spontaneous. The findings of this study, not unlike those of the Brehl and Wainryb (2005) study, indicated that when children set out to make sense of interpersonal conflicts in which they had been directly involved, either as victims or as perpetrators, they frequently referred to their own mental states and those of the child with whom they had a conflict. Although children’s narratives included
152
Cecilia Wainryb and Beverly A. Brehl
references to the perpetrator’s harmful behavior (‘‘I pushed him down,’’ ‘‘they kept on calling me a baby’’), the victim’s behavioral response (‘‘he went and told the recess monitor on us,’’ ‘‘they ditched me, so I called them and pretended that I’d been attacked to make them feel bad’’), and the resolution of the episode (‘‘she got over it and now we’re friends and everything,’’ ‘‘he didn’t even say he was sorry’’), on the average, about half of all references that made up the narratives related to mental states. Overall, 90% of narratives included at least one reference to mental states. In terms of the types of mental states to which children referred, 62% of narratives included references to the perpetrator’s intentions, 74% included references to emotions, and 75% included references to other mental states, such as desire and belief. In depicting their own intentions and those of others, children largely referred to perpetrators as intending to pursue goals unrelated to the harm inflicted on the victim, with the resulting harm being mainly incidental (e.g., ‘‘Well, I didn’t play with her because I kind of wanted to play with another friend because I wanted to make new friends so I could have lots of friends’’); 33% of narratives that included references to intentions referred to the pursuit of such goals. Children also referred to acting with the intention of securing retribution for a past injury (13%; e.g., ‘‘she started spreading rumors about me, so I put some gross smelling stuff in her locker’’) and to acting without the intention to cause harm (20%; e.g., ‘‘it was just a really bad joke, I didn’t mean to get her that upset’’). When depicting the emotions of victims, children spoke largely of themselves and others as feeling sad or generally ‘‘bad,’’ and to a lesser extent as feeling angry. Perpetrators were depicted as feeling guilty and angry (so few children described either themselves or others as having positive emotions as perpetrators that these references were not included in the analyses). Children also referred to desires (e.g., ‘‘she really wanted to play animals’’; ‘‘I didn’t want to give it back to her’’), beliefs about the way things were (e.g., ‘‘he had thought we were just joking around’’; ‘‘I noticed he had his jacket and his keys and all the stuff he usually has when he just takes off from school, and I thought he was just gonna take off without me’’), and beliefs about the way things should have been (e.g., ‘‘I think that if she’s going to say that, she should come to me and ask me first’’; ‘‘we’ve been friends for so long, and friends just shouldn’t do that to each other’’). Although these findings cannot be directly compared to those obtained in the Brehl and Wainryb (2005) study with hypothetical scenarios, it is nevertheless worthy of notice that, in each case, the large majority of children included in their explanations at least one reference to desires, and about half included references to intentions and beliefs. The single striking difference between the two studies was in regards to the frequency of emotion attributions. Whereas, in all, about 30% of children included a reference to emotion in explaining hypothetical situations of wrongdoing, a full 74% did so when describing
Developing Psychological Knowledge and Moral Thinking
153
situations involving actual (their own or their peers’) wrongdoing. This difference, however, is misleading, as the larger figure (74%) refers to the emotions attributed to both the perpetrator and the victim. When only the emotions attributed to the wrongdoer are considered, the figure (26%) is much closer to that obtained with hypothetical transgressions. In all, then, mental state attributions seem remarkably similar whether they were provided in regard to hypothetical or real conflicts. It is also worth mentioning that even though children in the Wainryb, Brehl, and Matwin (2005) study were generally more likely to refer to their own mental states (82% of narratives included such references) than to the mental states of the child with whom they had had a conflict (73%), both their own and the other child’s mental states were present in the large majority of all narrative construals. When children had been the perpetrators, however, they referred most frequently to their own intentions (74% of narratives included such references) and the victim’s emotions (73%). In contrast, when children had been the victims, their most frequent references were to their own emotions (67%) and their own beliefs and desires (60%). Regardless of which perspective they themselves had filled, references to the perpetrator’s emotions were infrequent (only 26% of narratives included such references). In spite of these differences, the extent to which mental state references were central to children’s construals of their own interactions is noteworthy, as is the extent to which they attended not only to their own thoughts, feelings, and intentions, but also to those of the other child. Recall, now, that we started out asking two questions. First we asked whether children refer to mental states in their construals of moral situations when such information is not provided. In accordance with our findings, the answer to this question is straightforward. Yes, children do refer to mental states in their construals, and they do so quite often. Indeed, in this regard our findings run contrary to the expectation (e.g., Pronin, Gilovich, & Ross, 2004; Ross, 2001; Ross & Ward, 1996) that, in the absence of explicit information about alternative construals of a situation, children might rely largely on dispositional or situational explanations. It is possible that our open-ended methods, which do not call for ratings of specific dimensions (e.g., the actor’s undesirable traits), serve to facilitate such consideration of differences in belief, construal, and intention. In any case, children’s construals of their own transgressions and those of others—real or hypothetical—were rich in mental state information. Our second question referred to the role that the developmental constraints on young children’s understandings of the mind play in their construals and evaluations of moral situations. If before the age of 7 children view the mind as passive, one would expect significant age-related differences in children’s abilities to construct psychological explanations of harmful behavior. Thus, it is to these differences that we now turn.
154
Cecilia Wainryb and Beverly A. Brehl
C. PASSIVE MINDS, ACTIVE MINDS, AND MORAL JUDGMENTS
Based on the aforementioned findings, children of all ages clearly referred to psychological information in their attempts to understand (and explain) interpersonal conflicts. Nevertheless, there were also significant age differences in the prevalence of psychological explanations and in the types of mental states to which participants of different ages referred. In the Brehl and Wainryb (2005) study, in which children explained the harmful behaviors of actors in hypothetical transgressions, only about half (57%) of 4-year-olds’ explanations included references to mental states (i.e., beliefs, desires, emotions, and intentions), as compared to 75% of explanations given by 7- and 10-year-olds and adults. Similar age differences were observed in the Wainryb, Brehl, and Matwin (2005) study, where children described situations involving moral transgressions in which they themselves had participated. The finding that younger children referred less frequently than did older children to mental states is not surprising in light of young children’s view of the mind as passive. Given that young children do not recognize that the mind is involved in actively interpreting experience and assume, instead, that there is a one-to-one match between internal mental life and the external world, it makes sense that they would include fewer mental states in their construals of situations entailing harm. Indeed, speaking about mental states from the perspective of the mind as passive is akin to replicating a description of the scenario ‘‘as it was.’’ As would be expected, age differences were found not only in regards to the overall number of references to mental states, but also in the types of mental states (i.e., belief, desire, emotion, and intention) to which children referred. For example, age differences were found in the frequency with which children referred to beliefs and intentions—differences which were in keeping with what is known about the development of children’s understandings of mental states. In the Brehl and Wainryb (2005) study, only 12% of 4-year-olds referred at least once to intentions, as compared to 42–59% of 7- to 10-year-olds and 81% of adults. Similarly, only 16% of 4-year-olds referred at least once to the actor’s beliefs, as compared to 42–50% of 7- to 10-year-olds, and 67% of adults. In the Wainryb et al. (2005) study, 68% of 5-year-olds, as compared to 95% of 10- to 16-year-olds, referred at least once to intentions. Similarly, only 18% of 5-yearolds, but 75% of 10- to 16-year-olds included references to their own or the other party’s beliefs about what the situation was like and how it should have been (all differences were statistically significant unless noted). Age differences were also observed in the Brehl and Wainryb (2005) study in regards to explanations which involved attributing to actors a different interpretation of the situation. Only a minority of 4-year-olds (7%) and 7-year-olds (25%), as compared to 40% of 10-year-olds and the large majority of adults (71%), provided such explanations.
Developing Psychological Knowledge and Moral Thinking
155
In addition to the differences in the number and type of mental state explanations offered by younger and older children, young children offered two types of explanations that were infrequent or nonexistent among the older children. These explanations, one involving quasi-interpretations and the other conflations between mental states and outcomes, serve to illustrate further the ways in which young children’s limited understandings of the mind shape their construals of harmful behavior. The following explanation, spontaneously offered by a 7-year-old in response to why Jake may have excluded a peer from play, serves to illustrate what we have called ‘‘quasi-interpretive’’ explanation: ‘‘Jake didn’t ask that kid to play with him because that boy had a cold and couldn’t go outside.’’ In one respect, this sort of explanation resembles the ‘‘interpretive’’ explanation inasmuch as it, too, attributes to Jake, the actor, a different interpretation of the situation (i.e., Jake did not think that he was harming his peer, he had a different understanding of what happened). Unlike the ‘‘genuine’’ interpretive explanations, however, this explanation makes reference to features not present in the original situation (i.e., Jake did not differently construe the same information; he was privy to additional information, namely, that the victim ‘‘had a cold’’). This sort of quasi-interpretive explanation, which we think is more consistent with an understanding of false belief than with the more advanced understanding of interpretation, was offered by 32% of 7-year-olds (only a small minority of 4-year-olds and no older children or adults offered such an explanation). The other type of explanation that betrays the limited psychological understandings of young children involved a lack of differentiation between, on the one hand, indices of mental activity (desires, beliefs, intentions, and emotions), and on the other hand, behavior and its consequences. An example of such an explanation is the statement by a 4-year-old who, when asked to explain why Jake may have excluded a peer from play, reasoned that Jake ‘‘left that kid out and hurt his feelings because he was trying for him to have hurt feelings.’’ In conflating the actor’s intention with the actor’s behavior and the consequences of such behavior, this child’s explanation amounted to saying, essentially, that the actor did x because he was trying to do x. Additional examples of this type of reasoning were ‘‘Jayna gave that kid only one candy because she wanted her to only have one candy,’’ and ‘‘Jayna was mean because she was thinking to be mean.’’ Each of these examples features a failure to distinguish between mental states and behavior (for strikingly similar instances of young children’s conflated explanations, see research on 4-year-olds’ comprehension of verbs referring to mental activity; Miscione et al., 1978; Wellman & Johnson, 1979). This type of reasoning is in stark contrast with the sorts of explanations offered by older children who, while holding that the behavior was not accidental nevertheless differentiated between the reason for acting and the action itself or its end result (e.g., ‘‘Jake just wanted to play
156
Cecilia Wainryb and Beverly A. Brehl
with his friends, and he didn’t really know that kid’’; ‘‘that kid did something mean to her before, and so Jayna was trying to pay her back’’). Analyses indicate that 4-year-olds were more likely (28%) to provide explanations conflating mental states with outcomes than were older children (7%); no adults provided such explanations. What do these differences between younger and older children’s psychological explanations amount to as far as their moral thinking is concerned? We have thus far shown that young children tend to refer less frequently, in their construals and explanations of moral situations, to mental states—in particular, they include fewer references to intentions and beliefs. They also consider less frequently the possibility that actors may have behaved on the basis of alternative interpretations, and conflate mental states and outcomes thereby concluding that actors had intended whatever outcome came about. We suggest that these features of young children’s construals may be responsible for young children’s documented tendency to over-attribute intentionality (Astington, 2001; Kalish, in press; Piaget, 1932/1965) and make categorical moral judgments (Helwig & Turiel, 2003; Shaw & Wainryb, 2006; Smetana, 2006). How, you might ask, can the finding that young children refer less frequently to intentions be taken to underlie children’s tendency to over-attribute intentionality? We suggest that the reason young children tend to neglect referring to an actor’s intentions is that they conflate intentions and actions/ outcomes. In other words, if young children equate intention with actions/ outcomes, it stands to reason that they would not discuss intention separately from action and would make infrequent explicit references to an actor’s intention. Therefore, implied in young children’s tendency to not consider intentions when they make sense of moral situations (or to do so infrequently) is their propensity to over-attribute intentionality, that is, their propensity to assume that whatever outcomes are in place have, indeed, been intended. In contrast, older children, who have developed an understanding of intention as a mental state distinct from action, should be more likely to refer to an actor’s intention as separate from her or his behavior. The implication of this tendency goes beyond the greater or lesser frequency with which older and younger children refer to intentions in construing moral situations. Older children, we suggest, are less likely to over-attribute intentionality: They are, indeed, more likely to recognize that the outcome of someone’s actions is distinct from whatever that person may have intended. This difference, between younger children’s inability and older children’s ability to recognize the distinction between intentions and actions/outcomes, translates into differences in younger and older children’s moral judgments of what are, ostensibly, the same acts. Our research provided direct evidence to this effect. In the Wainryb, Brehl, and Matwin (2005) study, for example, younger children made categorically negative judgments of their own and the other child’s
Developing Psychological Knowledge and Moral Thinking
157
transgressions (e.g., ‘‘It was just not okay to leave him out because it made him feel not so good’’). By contrast, older children and adolescents gave more mixed evaluations of similar situations and, importantly, their justifications demonstrated their ability to distinguish between intentional behavior (that is, behavior that is not accidental) and the intention behind such behavior. Thus, although older children acknowledged both that the behaviors discussed were not accidental and that the outcome of such behaviors was that harm had been inflicted on someone, they also recognized that the intention behind those behaviors had not been to inflict harm (e.g., ‘‘It was kind of okay and also kind of not okay, because I wasn’t trying to hurt her feelings when I didn’t invite her to come over, I was really just trying to get to know another group of people, but still she did get hurt’’). This complex and less categorical moral judgment stands in stark contrast to the younger child’s judgment that ‘‘It was just not okay to leave him out because it made him feel not so good.’’ Younger children’s limited ability to understand cases in which the outcome of the act was in conflict with a stated intention (as when a harmful outcome was incidental to an actor’s pursuit of a neutral goal, for example), explains their categorically negative moral judgments. It would be a mistake, of course, to assume that young children’s categorical moral judgments are solely related to their limited understanding of intention. In their construals, young children also referred less frequently than their older peers to the characters’ beliefs—a feature that can also be explained in relation to their nonrepresentational conceptions of the mind (why refer to beliefs, if they are nothing but copies of the way things are?). We suggest that their limited understanding of belief, too, contributes to their construing moral situations as though they implicated only a ‘‘surface’’ or behavioral dimension. Furthermore, as shown in our research concerning young children’s thinking about other persons’ different beliefs, their limited understanding of belief was directly associated with their tendency to make unforgiving or intolerant moral judgments of the actions of others (Wainryb & Ford, 1998; Wainryb et al., 2004). Children’s limited understandings of emotions, especially mixed emotions and emotions experienced in equivocal contexts, also have substantial consequences for young children’s construals and judgments of moral situations. Whereas the findings we have reviewed thus far indicate that children of all ages attend to emotions, especially those of the victim, in construing moral situations, they were less informative in regards to how age differences in emotion understanding may impact moral construals and moral judgments. Other research, however, has shown that young children, who find it difficult to acknowledge that the phenomenological experience of emotion may differ from its outward expression, are likely to misunderstand the emotional experience of both victims and wrongdoers.
158
Cecilia Wainryb and Beverly A. Brehl
Young children’s limited understanding of the experience of wrongdoers has been amply documented in terms of the ‘‘happy victimizer’’ phenomenon (e.g., Arsenio & Lover, 1995; Nunner-Winkler & Sodian, 1988). Young children also experience difficulties deciphering the emotions—and hence experiences— of victims, especially when victims respond in ways that are not transparent or straightforward. For example, in a study by Shaw and Wainryb (2006), children in five age groups (5, 7, 10, 13, and 16 years) were presented with hypothetical situations in which victims either openly resisted, complied with, or covertly subverted unfair demands made by another child (e.g., a demand to surrender personal property or to complete someone else’s chores). Consistent with the notion that young children conflate mental states with external cues (Gnepp, 1989; Gnepp & Klayman, 1992; Harris, 1989; Yuill et al., 1996), 5-year-olds thought that victims who complied (more so than other victims) felt ‘‘good’’ for having acted pro-socially (e.g., for ‘‘sharing’’ her markers, or ‘‘helping’’ with someone’s chores). Older children, understanding that internal mental states are not necessarily equivalent to external indicators, thought that victims who complied may have felt afraid and victims who resisted or subverted may have felt good for standing up for themselves. Children’s understandings of emotion were also found to be involved in their construals and evaluations of the victim’s response. Children between the ages of 7 and 16 judged a victim’s resistance more positively than compliance and subversion, referring to the selfaffirming consequences of such a response for the victim. Meanwhile, 5-yearolds judged compliance more positively than resistance and subversion, referring to the pro-social nature of compliance and to the unfairness of open or covert resistance. In all, these data, along with evidence about young children’s misattributions of the emotions of victimizers lend support to the proposition that young children’s limited understandings of emotions, too, are implicated in their miscontruals of moral situations and their categorical judgments. Finally, we suggest that young children’s non-interpretive understanding of the mind, as evidenced in their limited understanding of the representational nature of mental states and their ensuing tendency to conflate internal mental activity with actions and outcomes, is also implicated in the fractured nature of their understandings of moral conflict. Findings from the Wainryb, Brehl, and Matwin (2005) study relating to the structural features of children’s narratives about their own experiences with interpersonal conflict help illustrate this argument. Recall that children participating in this study provided narrative accounts of conflict situations in which they had been directly involved. All narratives were rated on various indicators of coherence, such as topic maintenance, event sequencing, completion, and fluency and, in each respect, young children’s narratives, especially those of 5-year-olds, were found to be less coherent than those of older children. Additionally, on a measure of overall coherence, only a minority of the narratives of younger children (30%), as
Developing Psychological Knowledge and Moral Thinking
159
compared to 66% of those of older children, were deemed coherent. Narrative coherence is generally thought to reflect the integration (or lack thereof) of different aspects of an experience (McAdams, 1988; Polkinghorne, 1988). We speculate that the ability to understand the psychological dimension of human behavior—the ability to consider people’s desires, beliefs, intentions, and emotions that guide behavior—is what glues together fragmented behavioral moments into a coherent experience (Bruner, 1986, 2002). Thus their limited psychological understandings contribute to the fractured quality of young children’s construals of moral experiences—a quality that becomes manifested not only in their judgments (e.g., Shaw & Wainryb, 2006), but also in the behavioral strategies they employ for resolving conflicts (e.g., Dunn & Herrera, 1997; Shantz & Hartup, 1992). Taken together, the studies we have conducted since the 1990s go to show that children’s developing psychological understandings play a central role in their construals and judgments of situations of moral conflict. We have provided evidence that children use information about an actor’s mental states to evaluate both the actor and his/her behavior (Shaw & Wainryb, 1999; Wainryb, 1993; Wainryb & Ford, 1998; Wainryb, Shaw, & Maianu, 1998; Wainryb et al., 2001, 2004), and that even when such information is not given, children spontaneously consider what actors (and victims) may have felt, thought, or wanted, as they attempt to make sense of moral conflicts. Importantly, our research has also shown that the specific characteristics of young children’s still developing psychological knowledge shape in significant and predictable ways their moral thinking. Before the age of 5, children have limited understandings of mental states as such and thus are unable to account for mental state information even when such information is explicitly provided (Wainryb & Ford, 1998). Beginning at age 5, children recognize the diversity of mental life but ascribe such diversity to mistakes in perception or logic, thus continuing to use their own assumptions about the situation as the default from which to make sense of other people’s behavior. It is therefore not surprising that at 5, children were much less likely than older children to spontaneously attribute mental states to actors and victims, be it themselves or others, and, even when such information was explicitly provided, were less likely to consider what it actually means that others may have differently understood moral situations. Indeed, for as long as children subscribe to a passive ‘‘copy’’ theory of the mind, psychological information does not appear to add much of importance to their understanding of moral conflicts beyond what is immediately observable (i.e., outward expressions of emotion, behaviors, and situational cues). Beginning around the age of 7, as they shift into viewing mental life as active and constructive, children’s construals and evaluations of moral situations reflect a deeper understanding of and concern with the intentional nature of moral agents.
160
Cecilia Wainryb and Beverly A. Brehl
V. Conclusions and Future Challenges Our research program is not alone in its attempt to document the relations between children’s psychological and moral understandings. The interest in the connection between the two fields has grown over the years (Astington, 2004; Baird & Astington, 2004; Chandler, Sokol, & Wainryb, 2000; Dunn, Cutting, & Demetriou, 2000; Kalish, 2005; Lagattuta, 2005; Leslie, 2000; Nunez & Harris, 1998; Peterson & Siegal, 2002; Sokol & Chandler, 2004; Yuill et al., 1996), and considerable efforts in theories-of-mind research have been devoted to examining the ways in which children, with age, increasingly draw on both psychological and moral concepts in interpreting, explaining, and predicting human behavior. As examples, research on the ‘‘happy victimizer’’ phenomenon (e.g., Arsenio & Lover, 1995; Arsenio, Gold, & Adams, 2006; Nunner-Winkler & Sodian, 1988), to which we referred earlier in this chapter, has asked about the ways in which children attribute emotion to story characters that break moral rules. Research by Lagattuta (2005) extended those findings by asking how children make emotion attributions in situations in which there is conflict between a person’s desire and a prohibitive rule. Together, this research has shown that, with age, children increasingly come to recognize that emotional satisfaction is shaped by fulfillment of both desire and rules and obligations. Kalish and colleagues (e.g., Kalish & Shiverick, 2004) have similarly shown that, with age, children become more able to coordinate both moral and psychological information when making predictions about how people want, and are likely, to behave. Other research has investigated the relation between children’s interpretive stance and their ability to identify situations of rule violation and rule conformity (e.g., Cummins, 1996; Harris & Nunez, 1996; Nunez & Harris, 1998) and assign blame and responsibility in situations of moral transgression (e.g., Baird & Astington, 2004; Sokol & Chandler, 2004). Even as evidence accumulates showing that children bring to bear moral concepts along with psychological understandings when explaining or predicting people’s actions and emotions, the relation between children’s psychological and moral concepts is not yet well understood. Although some (e.g., Nunez & Harris, 1998) have argued for the view that the closely interwoven nature of the two realms of development results from children transposing their psychological understandings to the moral domain and vice versa, others (e.g., Sokol & Chandler, 2004) have rejected such domain-specific and modularist propositions (see also Astington, 2004). In spite of their differences, both the domain-specific and domain-general propositions share, implicitly, a recognition that the nature of the relation between children’s moral and psychological concepts cannot be fully understood except in reference to the underlying developmental processes.
Developing Psychological Knowledge and Moral Thinking
161
We have put forth a framework for explaining this relation. Our starting point was the process through which children develop moral concepts and make moral judgments. Next, we proceeded to explain why and how, within this developmental process, children’s psychological concepts come to bear on their moral thinking. Finally, we provided a detailed analysis of how children’s developing psychological concepts differently impact their ability to make sense of and judge moral situations at different ages. The latter point, concerning the varying constraints that younger and older children’s psychological understanding place on their moral thinking, merits further discussion. In particular, we have described at some length the precise constraints in young children’s moral thinking as related to their limited psychological understandings, but we have said comparatively little about the implications, for moral thinking, of having a mature theory of mind. Attending to this question will, in turn, illuminate further the nature of the relation between psychological and moral concepts throughout development. As we have alluded to earlier in this chapter, in referring to developments in children’s psychological understandings we do not mean that children become increasingly more accurate mind readers. Like theories-of-mind researchers, we refer to children’s growing understanding of the complex nature of people’s psychological experiences. Children become increasingly more able to understand that people may have beliefs and desires different from their own and may differently interpret aspects of reality, that people’s outward expressions of emotion may be misrepresentative and deceiving, and that people often intend to do things but fail, and do things not as intended. This increasing ability renders older children capable of appreciating subtleties in moral situations that younger children unavoidably miss. It is not that younger children are not concerned with fairness and the welfare of others—on the contrary, their categorical moral judgments suggest that their moral concerns are firm and unwavering—it is that they do not find injustice and pain in the same places as do their older peers. And yet, our argument that young children are bound to apply their moral concepts to incomplete, unelaborated, fractured, and disjointed construals of reality funneled by their limited psychological understandings does not imply that the reverse is true for older children. Older children too are unavoidably bound to misread other people’s intentions, beliefs, and emotions, sometimes for self-serving reasons and sometimes not. Research, as well as everyday experience, makes it abundantly clear that other people’s mental states never become transparent (some might say that even our own mental states never become entirely transparent), and even adults display systematic biases when explaining why a person may have acted a certain way (e.g., Malle, 2004; Ross, 1990, 2001; Ross & Ward, 1996). Therefore, even older children who have developed a more mature theory of mind are bound, at least some of the times,
162
Cecilia Wainryb and Beverly A. Brehl
to apply moral concepts to misconstruals of reality. An advanced or mature theory-of-mind cannot ensure otherwise. Relatedly, the fact that older children have a more advanced theory of mind does not imply that they will necessarily make more fair or caring moral judgments or display more positive behavior toward others. As is widely acknowledged, psychological knowledge can be used for good as well as for evil. Research has shown that children’s psychological understandings are related to various indicators of social competence (e.g., Astington, 2003; Lalonde & Chandler, 1995; Peterson & Siegal, 2002; Slaughter, Dennis, & Pritchard, 2002). For example, a greater understanding of emotion contributes to more positive play among 4-year-olds (Denham & Kochanoff, 2002) and moral sensibility among school-aged children (Dunn, 2000), and conversely, aggressive children demonstrate deficits in emotion understanding (e.g., Denham & Kochanoff, 2002; Dodge, 2003). However, there is also evidence that an advanced theory-of-mind does not guarantee social competence or pro-social behavior (Repacholi et al., 2003; Sutton, 2003). Conduct-disordered children, for example, have been depicted as having ‘‘intact but skewed theories-ofmind—perhaps a theory of nasty minds’’ (Happe´ & Frith, 1996, p. 395). And it is not only conduct-disordered children, or bullies, or psychopaths, who use their psychological knowledge for anti-social purposes. Developmental research suggests that most children use their psychological knowledge to comfort others and also use the same knowledge to provoke, deceive, and hurt others (Dunn, 1991, 1999; Harris, 1989). Indeed, it is not hard to see how a representational understanding of the mind can be used both to respond forgivingly to someone who acted on mistaken information and to pull the wool over someone else’s eyes. Along with Davies and Stone (2003), who described a theory of mind as ‘‘a collection of neutral tools that can be used for good or ill’’ (p. 339), we conclude that a mature theory of mind does not ensure a certain type of moral thinking but rather, as we have argued throughout, enables certain kinds of moral construals. Moreover, as we have suggested in an earlier section, even this statement must be qualified. For, even after having developed an understanding that people actively interpret reality and thus come up with divergent beliefs about reality, children do not merely judge that all alternative beliefs and interpretations are equally legitimate. Older children and adolescents, we have shown, are not accepting of persons who behave in harmful or unfair ways based on any type of different belief or interpretation. Altogether, our argument thus far is twofold. On the one hand, we claim that young children’s limited understandings of the mind severely constrain their construals of moral situations in ways that predict that their moral judgments will be, understandably, less attuned to the complexities and ambiguities related to the various psychological perspectives of those involved, and more
Developing Psychological Knowledge and Moral Thinking
163
negative. On the other hand, we also claim that knowing that an older child is capable of understanding that people are guided by their own interpretations of reality is not sufficient to predict how that child might construe and evaluate moral situations. This twofold argument might make it seem as though we are revitalizing Kohlberg’s ‘‘necessary but not sufficient hypothesis’’ (1969, 1971). Although this hypothesis has not received adequate empirical support (for further discussion of these issues, see Turiel, 1983b), it is important that we consider it here in relation to our arguments, for the ‘‘necessary but not sufficient’’ proposition is often readily and uncritically accepted and has, in fact, been alluded to specifically in relation to the connection between moral and psychological knowledge (see Astington, 2003). In keeping with the commonly held view of mental structure as encompassing the mind as a whole, Kohlberg (1969, 1971; see also Colby & Kohlberg, 1987) hypothesized that development through a sequence of moral judgment stages is partially dependent upon development in cognitive stages. Specifically, he maintained that the prior emergence of certain stages of cognitive development is a necessary prerequisite for, but does not guarantee, the emergence of certain moral judgment stages. Although our proposition is similar to Kohlberg’s inasmuch as we claim that the emergence of some types of psychological understandings is necessary for, but do not guarantee, certain types of construals and moral evaluations, our view of the nature of moral development is fundamentally at odds with the main assumptions underlying the ‘‘necessary but not sufficient hypothesis’’ as formulated by Kohlberg (1969, 1971). Our view of moral development is grounded on a constructivist and interactional view (Turiel, 1983a, 1998), according to which children construct a realm, or domain, of moral understandings that is not dependent upon nonmoral structures. Children’s moral concepts stem, not from cognitive structures or social-cognitive structures, but from their own experiences bearing on matters of welfare, justice, and rights. Children not only observe but are often involved, sometimes as targets and sometimes as perpetrators, in interpersonal conflicts involving physical or psychological aggression, social exclusion, or unfairness. It is through their abstractions from and reflection on features of those experiences (e.g., the consequences that such actions have for themselves or others) that moral development proceeds. Thus, we argue, conceptual changes in children’s moral understandings do not depend on conceptual changes in children’s psychological understandings. It is the application of moral concepts, and not their development, that is informed by children’s specific psychological understandings. And yet, the ways in which children make sense of the actions and interactions of those involved in situations of moral conflict—their construals of those people’s beliefs, intentions, and emotions—constitute the context within which conceptual change in moral understandings happens. Therefore, even as
164
Cecilia Wainryb and Beverly A. Brehl
their development does not depend upon each other, moral and psychological knowledge are inextricably intertwined in children’s experiences. The tight interconnection of children’s psychological concepts and moral thinking underscores the importance of considering the development of psychological knowledge when examining children’s moral reasoning, as what may seem to be age differences in moral thinking may in fact reflect age differences in children’s understandings of persons and their actions. This, in turn, represents a particular challenge for those interested in understanding moral development as manifested in children’s actual interpersonal interactions because, though it may be feasible to ensure that participants of all ages have a shared understanding of the hypothetical situations about which they make moral judgments, this task becomes much more complicated outside the research lab. Thus, we propose that a fundamental challenge of researchers intent on charting children’s moral development is to distinguish between the complexity of moral concepts and the complexity of the background of psychological understandings against which such concepts are applied.
REFERENCES Arsenio, W. F., Gold, J., & Adams, E. (2006). Children’s conceptions and displays of moral emotions. In M. Killen & J. G. Smetana (Eds.), Handbook of moral development (pp. 581 – 610). Mahwah, NJ: Erlbaum. Arsenio, W. F., & Lover, A. (1995). Children’s conceptions of sociomoral affect: Happy victimizers, mixed emotions and other expectancies. In M. Killen & D. Hart (Eds.), Morality in everyday life: Developmental perspectives (pp. 87 – 128). New York: Cambridge University Press. Asch, S. (1952). Social psychology. Englewood Cliffs, NJ: Prentice-Hall. Asher, S. R., & Wheeler, V. A. (1985). Children’s loneliness: A comparison of rejected and neglected peer status. Journal of Consulting and Clinical Psychology, 53, 500 – 505. Astington, J. W. (2001). The paradox of intention: Assessing children’s metarepresentational understanding. In B. F. Malle & L. J. Moses (Eds.), Intentions and intentionality: Foundations of social cognition (pp. 85 – 103). Cambridge, MA: MIT Press. Astington, J. W. (2003). Sometimes necessary, never sufficient: False-belief understanding and social competence. In B. Repacholi & V. Slaughter (Eds.), Individual differences in theory of mind: Implications for typical and atypical development (pp. 13 – 38). New York: Psychology Press. Astington, J. W. (2004). Bridging the gap between theory of mind and moral reasoning. In J. A. Baird & B. W. Sokol (Eds.), Connections between theory of mind and sociomoral development (pp. 63 – 72). San Francisco: Jossey-Bass. Astington, J. W., & Gopnik, A. (1991a). Theoretical explanations of children’s understanding of the mind. British Journal of Developmental Psychology, 9, 7 – 31. Astington, J. W., & Gopnik, A. (1991b). Developing understanding of desire and intention. In A. Whiten (Ed.), Natural theories of mind: Evolution, development and simulation of everyday mindreading (pp. 39 – 50). Cambridge, MA: Basil Blackwell.
Developing Psychological Knowledge and Moral Thinking
165
Baird, J. A., & Astington, J. W. (2004). The role of mental state understanding in the development of moral cognition and moral action. New directions for Child and Adolescent Development, 103, 37 – 49. Bandura, A. (1991). Social cognitive theory of moral thought and action. In W. M. Kurtines & J. L. Gewirtz (Eds.), Handbook of moral behavior and development: Theory, research, and applications (pp. 45 – 103). Hillsdale, NJ: Erlbaum. Bartsch, K., & Wellman, H. M. (1989). Young children’s attribution of action to beliefs and desires. Child Development, 60, 946 – 964. Beck, A. T. (1967). Depression: Clinical, experimental, and theoretical aspects. New York: Hoeber. Berg, C. A., Meegan, S. P., & Deviney, F. P. (1998). A social-contextual model of coping with everyday problems across the lifespan. International Journal of Behavioral Development, 22, 239 – 261. Berndt, T. J., & Berndt, E. G. (1975). Children’s use of motives and intentionality in person perception and moral judgment. Child Development, 46, 904 – 912. Brehl, B., & Wainryb, C. (2005). Children’s explanations of harmful behavior: Naive realism and interpretation. Unpublished manuscript, University of Utah. Bruner, J. (1986). Actual minds: Possible worlds. Cambridge, MA: Harvard University Press. Bruner, J. (2002). Making stories: Law, literature, life. New York: Farrar, Straus and Giroux. Chandler, M. J., & Lalonde, C. E. (1996). Shifting to an interpretive theory of mind: 5- to 7-year-olds’ changing conceptions of mental life. In A. J. Sameroff & M. M. Haith (Eds.), Five to seven year shift: The age of reason and responsibility (pp. 111 – 139). Chicago: University of Chicago Press. Chandler, M. J., Sokol, B. W., & Wainryb, C. (2000). Beliefs about truth and beliefs about rightness. Child Development, 71, 91 – 97. Colby, A., & Kohlberg, L. (1987). The measurement of moral judgment, Vol. 1: Theoretical foundations and research validation; Vol. 2: Standard issue scoring manual. New York: Cambridge University Press. Crick, N. R., & Ladd, G. W. (1990). Children’s perceptions of the outcomes of aggressive strategies. Developmental Psychology, 26, 612 – 620. Cummins, D. (1996). Evidence of deontic reasoning in 3- and 4-year-old children. Memory & Cognition, 24, 823 – 829. Dalgleish, T., Taghavi, R., Neshat-Doost, H., Moradi, A., Canterbury, R., & Yule, W. (2003). Patterns of processing bias for emotional information across clinical disorder: A comparison of attention, memory, and prospective cognition in children and adolescents with depression, generalized anxiety and posttraumatic stress disorder. Journal of Clinical Child & Adolescent Psychology, 32, 10 – 21. Darley, J. M., Klosson, E. C., & Zanna, M. P. (1978). Intentions and their contexts in the moral judgments of children and adults. Child Development, 49, 66 – 74. Darley, J. M., & Zanna, M. P. (1982). Making moral judgments. American Scientist, 70, 515 – 521. Davies, M., & Stone, T. (2003). Synthesis: Psychological understanding and social skills. In B. Repacholi & V. Slaughter (Eds.), Individual differences in theory of mind: Implications for typical and atypical development (pp. 305 – 352). New York: Psychology Press. Denham, S. A., & Kochanoff, A. (2002). ‘‘Why is she crying?’’ Children’s understanding of emotion from preschool to preadolescence. In L. F. Barrett & P. Salovey (Eds.),
166
Cecilia Wainryb and Beverly A. Brehl
The wisdom in feeling: Psychological processes in emotional intelligence (pp. 239 – 270). New York: Guilford Press. Dennett, D. (1978). Brainstorms: Philosophical essays on mind and psychology. Montgomery, VT: Bradford Books. Dodge, K. A. (1986). A social information processing model of social competence in children. In M. Perlmutter (Ed.), The Minnesota Symposium on Child Psychology (Vol. 18, pp. 77 – 125). Hillsdale, NJ: Erlbaum. Dodge, K. A. (2003). Do social information-processing patterns mediate aggressive behavior? In B. B. Lahey & T. E. Moffitt (Eds.), Causes of conduct disorder and juvenile delinquency (pp. 254 – 274). New York: Guilford Press. Duncker, K. (1939). Ethical relativity? Mind, 48, 39 – 53. Dunn, J. (1991). Understanding others: evidence from naturalistic studies of children. In A. Whiten (Ed.), Natural theories of mind: Evolution, development and simulation of everyday mindreading (pp. 51 – 61). Cambridge, MA: Basil Blackwell. Dunn, J. (1999). Making sense of the social world: Mindreading, emotion, and relationships. In P. D. Zelazo, J. W. Astington, & D. R. Olson (Eds.), Developing theories of intention: Social understanding and self-control (pp. 229 – 242). Mahwah, NJ: Erlbaum. Dunn, J. (2000). Mind-reading, emotion understanding, and relationships. International Journal of Behavioral Development, 24, 142 – 144. Dunn, J., Cutting, A. L., & Demetriou, H., (2000). Moral sensibility, understanding others, and children’s friendship interactions in the preschool period. British Journal of Developmental Psychology, 18, 159 – 177. Dunn, J., & Herrera, C. (1997). Conflict resolution with friends, siblings, and mothers: A developmental perspective. Aggressive Behavior, 23, 343 – 357. Fabricius, W. V., & Imbens-Bailey, A. (2000). False beliefs about false beliefs. In P. Mitchell & K. J. Riggs (Eds.), Children’s reasoning and the mind (pp. 267–280). Hove, UK: Psychology Press. Fabricius, W. V., & Schwanenflugel, P. J. (1994). The older child’s theory of mind. In A. Demitriou & A. Efklides (Eds.), Intelligence, mind, and reasoning: Structure and development (pp. 111 – 132). New York: Elsevier. Flavell, J. H., Everett, B. A., Croft, K., & Flavell, E. R. (1981). Young children’s knowledge about visual perception: Further evidence for the Level 1-Level 2 distinction. Developmental Psychology, 17, 99 – 103. Gergen, K. J. (1991). Emerging challenges for theory and psychology. Theory & Psychology, 1, 13–35. Gnepp, J. (1989). Children’s use of personal information to understand other people’s feelings. In C. Saarni & P. L. Harris (Eds.), Children’s understanding of emotion (pp. 151 – 180). New York: Cambridge University Press. Gnepp, J., & Klayman, J. (1992). Recognition of uncertainty in emotional inferences: Reasoning about emotionally equivocal situations. Developmental Psychology, 28, 145 – 158. Goldman, A. I. (2002). Simulation theory and mental concepts. In J. Dokic & J. Proust (Eds.), Simulation and knowledge of action (pp. 1 – 19). Amsterdam, Netherlands: John Benjamins. Hadwin, J., & Perner, J. (1991). Pleased and surprised: Children’s cognitive theory of emotion. British Journal of Developmental Psychology, 9, 215 – 234. Hammen, C., & Zupan, B. A. (1984). Self-schemas, depression, social problem solving, and social competence in preadolescents. Cognitive Therapy and Research, 9, 685 – 702.
Developing Psychological Knowledge and Moral Thinking
167
Happe´, F. G. E., & Frith, U. (1996). Theory of mind and social impairment in children with conduct disorder. British Journal of Developmental Psychology, 14, 385 – 398. Harris, P. L. (1989). Children and emotion. Oxford, UK: Blackwell. Harris, P. L. (1991). The work of the imagination. In A. Whiten (Ed.), Natural theories of mind: Evolution, development and simulation of everyday mindreading (pp. 283 – 304). Cambridge, MA: Blackwell. Harris, P. L., & Leevers, H. J. (2000). Reasoning from false premises. In P. Mitchell & K. J. Riggs (Eds.), Children’s reasoning and the mind (pp. 67 – 86). East Sussex, UK: Psychology Press. Harris, P. L., & Nunez, M. (1996). Understanding of permission rules by preschool children. Child Development, 67, 1572 – 1591. Harter, S., & Whitesell, N. R. (1989). Developmental changes in children’s understanding of single, multiple, and blended emotion concepts. In C. Saarni & P. L. Harris (Eds.), Children’s understanding of emotion (pp. 81 – 116). New York: Cambridge University Press. Helwig, C. C., & Turiel, E. (2003). Children’s social and moral reasoning. In P.K. Smith & C. H. Hart (Eds.), Blackwell handbook of childhood social development (pp. 475 – 490). Oxford, UK: Blackwell. Hobson, R. P. (1991). Against the theory of ‘‘theory of mind’’. British Journal of Developmental Psychology, 9, 33 – 51. Jones, E. F., & Nelson-Le Gall, S. A. (1995). The influence of personal effort cues on children’s judgments of morality and disposition. Merrill-Palmer Quarterly, 41, 53 – 69. Kalish, C. W. (1998). Natural and artifactual kinds: Are children realists or relativists about categories? Developmental Psychology, 34, 376 – 391. Kalish, C. W. (2005). Becoming status conscious: Children’s appreciation of social reality. Philosophical Explorations, 8, 245 – 263. Kalish, C. W., & Shiverick, S. M. (2004). Children’s reasoning about norms and traits as motives for behavior. Cognitive Development, 19, 401 – 416. Karniol, R. (1978). Children’s use of intention cues in evaluating behavior. Psychological Bulletin, 85, 76 – 85. Keasey, C. B. (1977). Children’s developing awareness and usage of intentionality and motives. Nebraska Symposium on Motivation, 25, 219 – 260. Kohlberg, L. (1969). Stage and sequence: The cognitive-developmental approach to socialization. In D. Goslin (Ed.), Handbook of socialization theory and research (pp. 347 – 480). Chicago: Rand McNally. Kohlberg, L. (1971). From Is to Ought: How to commit the naturalistic fallacy and get away with it in the study of moral development. In T. Mischel (Ed.), Cognitive development and epistemology (pp. 151 – 235). New York: Academic Press. Kuhn, D., Cheney, R., & Weinstock, M. (2000). The development of epistemological understanding. Cognitive Development, 15, 309 – 328. Lagattuta, K. H. (2005). When you shouldn’t do what you want to do: Young children’s understanding of desires, rules, and emotions. Child Development, 76, 713 – 733. Lagattuta, K. H., & Wellman, H. M. (2001). Thinking about the past: Early knowledge about links between prior experience, thinking, and emotion. Child Development, 72, 82 – 102. Lalonde, C. E., & Chandler, M. J. (1995). False belief understanding goes to school: On the social-emotional consequences of coming early or late to a first theory of mind. Cognition and Emotion, 9, 167 – 185.
168
Cecilia Wainryb and Beverly A. Brehl
Leslie, A. M. (2000). Theory of mind as a mechanism of selective attention. In M. Gazzaniga (Ed.), The new cognitive neurosciences (pp. 1235 – 1247). Cambridge, MA: MIT Press. Lewis, M. D. (2001). Personal pathways in the development of appraisal: A complex systems/stage theory perspective. In K. R. Scherer, A. Schorr, & T. Johnstone (Eds.), Appraisal processes in emotion: theory, methods, research (pp. 205 – 220). London: Oxford University Press. Malle, B. F. (2004). How the mind explains behavior: Folk explanations, meaning, and social interaction. Cambridge, MA: MIT Press. McAdams, D. (1988). Biography, narrative, and lives. Journal of Personality, 56, 1 – 17. Miscione, J. L., Marvin, R. S., O’Brien, R. G., & Greenberg, M. T. (1978). A developmental study of preschool children’s understanding of the words ‘‘know’’ and ‘‘guess’’. Child Development, 49, 1107 – 1113. Mitchell, P. (2003). Acquiring a theory of mind. In A. Slater & G. Bremner (Eds.), An introduction to developmental psychology (pp. 237 – 257). Malden, MA: Blackwell. Moses, L. J. (1993). Young children’s understanding of belief constraints on intention. Cognitive Development, 8, 1 – 25. Nelson-Le Gall, S. A. (1985). Outcome matching and outcome foreseeability: Effects on attribution of intentionality and moral judgments. Developmental Psychology, 21, 332 – 337. Nunez, M., & Harris, P. L. (1998). Psychological and deontic concepts: Separate domains or intimate connection? Mind & Language, 13, 153 – 170. Nunner-Winkler, G., & Sodian, B. (1988). Children’s understanding of moral emotions. Child Development, 59, 1323 – 1338. Perner, J. (1991). On representing that: The asymmetry between belief and desire in children’s theory of mind. In D. Frye & C. Moore (Eds.), Children’s theories of mind: Mental states and social understanding (pp. 139 – 155). Hillsdale, NJ: Erlbaum. Peterson, C. C., & Siegal, M. (2002). Mind-reading and moral awareness in popular and rejected preschoolers. British Journal of Developmental Psychology, 20, 205 – 224. Phillips, W., Baron-Cohen, S., & Rutter, M. (1998). Understanding intention in normal development and in autism. British Journal of Developmental Psychology, 16, 337–348. Piaget, J. (1952). The origins of intelligence in children. New York: International Universities Press. Piaget, J. (1960). The child’s conception of the world. Oxford, UK: Littlefield Adams. (Original work published 1929). Piaget, J. (1965). The moral judgment of the child. New York: Free Press. (Original work published 1932). Piaget, J., & Inhelder, B. (1969). The psychology of the child. New York: Basic Books. Pillow, B. H. (1988). The development of children’s beliefs about the mental world. Merrill-Palmer Quarterly, 34, 1 – 32. Pillow, B. H., & Henrichon, A. J. (1996). There’s more to the picture than meets the eye: Young children’s difficulty understanding biased interpretation. Child Development, 67, 803 – 819. Pillow, B. H., & Mash, C. (1999). Young children’s understanding of interpretation, expectation, and direct perception as sources of false belief. British Journal of Developmental Psychology, 17, 263 – 276. Polkinghorne, D. (1988). Narrative knowing and the human sciences. Albany, NY: State University of New York Press.
Developing Psychological Knowledge and Moral Thinking
169
Prinstein, M. J., Cheah, C. S., & Guyer, A. E. (2005). Peer victimization, cue interpretation, and internalizing symptoms: Preliminary concurrent and longitudinal findings for children and adolescents. Journal of Clinical Child and Adolescent Psychology, 34, 11 – 24. Pronin, E., Gilovich, T., & Ross, L. (2004). Objectivity in the eye of the beholder: Divergent perceptions of bias in self versus others. Psychological Review, 111, 781 – 799. Repacholi, B., Slaughter, V., Pritchard, M., & Gibbs, V. (2003). Theory of mind, Machiavellianism, and social functioning in childhood. In B. Repacholi & V. Slaughter (Eds.), Individual differences in theory of mind: Implications for typical and atypical development (pp. 67 – 97). New York: Psychology Press. Ross, L. (1990). Recognizing the role of construal processes. In I. Rock (Ed.), The legacy of Solomon Asch: Essays in cognition and social psychology (pp. 77 – 96). Hillsdale, NJ: Erlbaum. Ross, L. D. (2001). Getting down to fundamentals: Lay dispositionism and the attributions of psychologists. Psychological Inquiry, 12, 37 – 40. Ross, L. D., & Nisbett, R. M. (1991). The person and the situation: Perspectives on social psychology. Philadelphia: Temple University Press. Ross, L. D., & Ward, A. (1996). Naive realism in everyday life: Implications for social conflict and disagreement. In E. S. Reed, E. Turiel & T. Brown (Eds.), Values and knowledge (pp. 103 – 136). Mahwah, NJ: Erlbaum. Schult, C. (2002). Children’s understanding of the distinction between intentions and desires. Child Development, 73, 1727 – 1747. Sedlak, A. J. (1979). Developmental differences in understanding plans and evaluating actors. Child Development, 50, 536 – 560. Shantz, C. U., & Hartup, W. W. (1992). Conflict and development: An introduction. In C. U. Shantz & W. W. Hartup (Eds.), Conflict in child and adolescent development (pp. 1 – 14). New York: Cambridge University Press. Shaw, L., & Wainryb, C. (1999). The outsider’s perspective: Young adults’ judgments of social practices of other cultures. British Journal of Developmental Psychology, 17, 451 – 471. Shaw, L., & Wainryb, C. (2006). When victims don’t cry: Children’s understandings of victimization, compliance, and subversion. Child Development, 77, 1050 – 1062. Shultz, T. R., Wright, K., & Schleifer, M. (1986). Assignment of moral responsibility and punishment. Child Development, 57, 177 – 184. Shweder, R. A. (1999). Culture and development in our postcultural age. In A. S. Masten (Ed.), Cultural processes in child development (pp. 137 – 148). Mahwah, NJ: Erlbaum. Siegal, M., & Peterson, C. C. (1998). Preschoolers’ understanding of lies and innocent and negligent mistakes. Developmental Psychology, 34, 332 – 341. Slaughter, V., Dennis, M. J., & Pritchard, M. (2002). Theory of mind and peer acceptance in preschool children. British Journal of Developmental Psychology, 20, 545 – 564. Slaughter, V., Jaakkola, K., & Carey, S. (1999). Constructing a coherent theory: Children’s biological understanding of life and death. In M. Siegal & C. Peterson (Eds.) Children’s understanding of biology and health (pp. 71 – 98). Cambridge: Cambridge University Press. Smetana, J. G. (1981). Reasoning in the personal and moral domains: Adolescent and young adult women’s decision-making about abortion. Journal of Applied Developmental Psychology, 2, 211 – 226.
170
Cecilia Wainryb and Beverly A. Brehl
Smetana, J. G. (2006). Social domain theory: Consistencies and variations in children’s moral and social judgments. In M. Killen & J. G. Smetana (Eds.), Handbook of moral development (pp. 69 – 91). Mahwah, NJ: Erlbaum. Sokol, B. W., & Chandler, M. J. (2004). A bridge too far: On the relations between moral and secular reasoning. In J. Carpendale & U. Muller (Eds.), Social interaction and the development of knowledge (pp. 155 – 174). Mahwah, NJ: Erlbaum. Stein, N. L., & Trabasso, T. (1989). Children’s understanding of changing emotional states. In C. Saarni & P. Harris (Eds.) Children’s understanding of emotion (pp. 50 – 77). New York: Cambridge University Press. Sutton, J. (2003). Tom goes to school: Social cognition and social-values in bullying. In B. Repacholi & V. Slaughter (Eds.), Individual differences in theory of mind: Implications for typical and atypical development (pp. 99 – 120). New York: Psychology Press. Turiel, E. (1983a). The development of social knowledge: Morality and convention. New York: Cambridge University Press. Turiel, E. (1983b). Domains and categories in social-cognitive development. In W. Overton (Ed.), The relationship between social and cognitive development (pp. 53 – 89). Hillsdale, NJ: Erlbaum. Turiel, E. (1998). The development of morality. In W. Damon (Ed.), Handbook of child psychology: Vol. 3. N. Eisenberg (Ed.), Social, emotional, and personality development (5th ed., pp. 863 – 932). New York: Wiley. Turiel, E., Hildebrandt, C., & Wainryb, C. (1991). Judging social issues: Difficulties, inconsistencies, and consistencies. Monographs of the Society for Research in Child Development, 56 (2, Serial No. 224). Turiel, E., Killen, M., & Helwig, C. C. (1987). Morality: Its structure, functions, and vagaries. In J. Kagan & S. Lamb (Eds.). The emergence of morality in young children (pp. 155 – 243). Chicago: University of Chicago Press. Wainryb, C. (1991). Understanding differences in moral judgments: The role of informational assumptions. Child Development, 62, 840 – 851. Wainryb, C. (1993). The application of moral judgments to other cultures: Relativism and universality. Child Development, 64, 924 – 933. Wainryb, C. (2000). Values and truths: The making and judging of moral decisions. In M. Laupa (Ed.), Rights and wrongs: How children evaluate the world. New Directions for Child Development (Vol. 89, pp. 33 – 46). San Francisco: Jossey-Bass. Wainryb, C. (2004). Is and Ought: Moral judgments about the world as understood. In J. Baird & B. Sokol (Eds.), New Directions for Child and Adolescent Development: Vol. 103. Mind, morals, and action: The Interface between Children’s Theories of Mind and Socio-moral Development (pp. 3 – 17). San Francisco: Jossey-Bass. Wainryb, C., Brehl, B., & Matwin, S. (2005). Being hurt and hurting others: Children’s narrative accounts and moral judgments of their own interpersonal conflicts. Monographs of the Society for Research in Child Development, 7 (3, Serial No 28). Wainryb, C., & Ford, S. (1998). Young children’s evaluations of acts based on beliefs different from their own. Merrill-Palmer Quarterly, 44, 484 – 503. Wainryb, C., & Laupa, M. (1994). Moral and informational components in social decisionmaking: Young adults’ reasoning about capital punishment. Poster presented at the annual meeting of the Jean Piaget Society, Chicago. Wainryb, C., Shaw, L., Langley, M., Cottam, K., & Lewis, R. (2004). Children’s thinking about diversity of belief in the early school years: Judgments of relativism, tolerance, and disagreeing persons. Child Development, 75, 687 – 703.
Developing Psychological Knowledge and Moral Thinking
171
Wainryb, C., Shaw, L., Laupa, M., & Smith, K. R. (2001). Children’s, adolescents’, and young adults’ thinking about different types of disagreements. Developmental Psychology, 37, 373 – 386. Wainryb, C., Shaw, L., & Maianu, C. (1998). Tolerance and intolerance: Children and adolescents’ judgments of dissenting beliefs, speech, persons, and conduct. Child Development, 69, 1541 – 1555. Wainryb, C., & Turiel, E. (1993). Conceptual and informational features in moral decision making. Educational Psychologist, 28, 205 – 218. Weiner, B., & Graham, S. (1984). An attributional approach to emotional development. In C. E. Izard, J. Kagan, & B. Zajonc (Eds.), Emotions, cognitions, and behavior (pp. 167 – 191). New York: Cambridge University Press. Wellman, H. M. (1990). The child’s theory of mind. Cambridge, MA: MIT Press. Wellman, H. M., & Johnson, C. N. (1979). Understanding mental processes: A developmental study of ‘‘remember’’ and ‘‘forget’’. Child Development, 50, 79 – 88. Wellman, H. M., & Woolley, J. D. (1990). From simple desire to ordinary beliefs: The early development of everyday psychology. Cognition, 25, 245 – 275. Wellman, H. M., Cross, D., & Watson, D. (2001). Meta-analysis of theory-of-mind development: The truth about false belief. Child Development, 72, 655 – 684. Wimmer, H., & Perner, J. (1983). Beliefs about beliefs: Representation and constraining function of wrong beliefs in young children’s understanding of deception. Cognition, 13, 103 – 128. Yuill, N. (1984). Young children’s coordination of motive and outcome in judgments of satisfaction and morality. British Journal of Developmental Psychology, 2, 73 – 81. Yuill, N., Perner, J., Pearson, A., Peerbhoy, D., & van den Ende, J. (1996). Children’s changing understanding of wicked desires: From objective to subjective and moral. British Journal of Developmental Psychology, 14, 457 – 475.
This page intentionally left blank
HOME RANGE AND THE DEVELOPMENT OF CHILDREN’S WAY FINDING
Edward H. Cornell and C. Donald Heth DEPARTMENT OF PSYCHOLOGY, UNIVERSITY OF ALBERTA, EDMONTON, ALBERTA T6G 2E9, CANADA
I. DEFINITION OF THE TOPICS II. DISTANCE AND DISPERSION OF TRAVEL III. THE ONTOGENY OF WAY FINDING
A. ONSET AND MOTIVATION OF SKILLS B. EVIDENCE OF EARLY PROCESSES C. SUMMARY IV. LANDMARK AND PLACE RECOGNITION
A. DEVELOPMENTS IN RECOGNITION PROCESSES B. SUMMARY V. MEMORIES OF ROUTES
A. THE ACQUISITION AND REPRESENTATION OF SERIAL ORDER B. ACTION NODES AND SEGMENTS OF ROUTES C. SUMMARY VI. BEARING KNOWLEDGE IN WAY FINDING
A. POINTING OUTDOORS B. SUMMARY VII. STRATEGY DEVELOPMENT
A. B. C. D. E.
SELECTIVE ATTENTION TO LANDMARKS VERBAL MEDIATION TRAINING PROSPECTIVE STRATEGIES STRATEGIES WHEN LOST SUMMARY
VIII. GENERAL DISCUSSION REFERENCES
Twenty years ago, we began a research and training partnership with a division of the Royal Canadian Mounted Police. The RCMP are responsible for the search of missing children in rural towns and some wilderness parks. 173 Advances in Child Development and Behavior R Kail (Editor)
ß 2006 Elsevier B.V. All rights reserved.
174
Edward H. Cornell and C. Donald Heth
They wanted to improve their methods and align them with what was known about child behavior. We soon received a call regarding a 9-year-old boy who had wandered away from a campground. The search had progressed for a week and some clues had been found leading toward a swamp. The constable in charge of the operations asked ‘‘How far can 9-year-olds go?’’ About a month later, we investigated an incident involving a 3-year-old boy who walked away from his back porch and was later found playing in a farm implement sales yard surrounded by shiny new tractors. The constable who located the boy reported that the boy did not want to go home. His mother had not considered searching at the tractor yard because it was too far, but she was not surprised in hindsight: ‘‘Every time we go by that place he wants to stop. His uncle took him for a tractor ride when he was a baby, and ever since then, tractors have been his favorite toys. But—how did he get there?’’ We approached these two questions as issues for research in child development. In this chapter, we review answers from studies that we have completed as well as from the multidisciplinary effort to understand the development of largescale spatial cognition. Much of this effort has been heavily influenced by philosophical questions such as the innate comprehension of location and the progression of cognitive development that leads to abstract and systematic representation of space (Newcombe & Huttenlocher, 2000). In contrast, our police partners needed to know behavioral tendencies of lost children, or at least how representation influences way finding decisions. Hence, we have gathered research on age-related abilities for orienting and using bearings during travel, monitoring self-movement, selecting landmarks and recognizing places, learning routes, and developing way finding strategies.
I. Definition of the Topics The study of home range addresses the question of how far a child can go. A child’s home range refers to the outdoor territory that surrounds his or her home and provides a context for independent travel, play, and exploration (Anderson & Tindal, 1972; Stea, 1970). Home range expands as children discover sites from established territory or attempt to reach destinations that they have heard about or experienced from different means of approach. In modern Western societies, the home range of newly walking infants may be restricted to a porch or a fenced yard. In an East African society, the home range of older toddlers may include trips to a water source, sometimes over 100 m distance along a fixed route through the community (Munroe & Munroe, 1971). In both societies, 3-year-old toddlers may be permitted brief unsupervised travel away from their immediate home area, such as voluntary visits to neighboring
Children’s Way Finding
175
friends or play sites within calling range of parents. By 5 – 7 years of age, many children have accompanied peers or older playmates to sites that are beyond the views from their home. Home range thus represents the local geographic competency of a child. This competency is based on personal knowledge of the neighborhood and requires way finding, or perceptual and cognitive processes for directing travel. The study of way finding addresses the question of how children get to places. The term has only recently appeared in academic writings (Lynch, 1960), but is reminiscent of historical accounts of wilderness pathfinders (Gowans, 1989; Parkman, 1872/1920). Behavioral geographers point out that a comparable term, human navigation, is most frequently used to refer to formal procedures for locating position and plotting a course for ships and aircraft, whereas human way finding usually refers to the process of selecting paths from an environmental context (Bovy & Stern, 1990; Golledge, 1999). The process of way finding is not solely a matter of reading natural cues, because adults of all cultures refer to maps or verbal or written descriptions for devising routes and making choices at intersections, especially in unfamiliar territory (Kitchin & Freundschuh, 2000; Stea, Blaut, & Stephens, 1996). Because of the sociocultural representation of routes and places, much of the study of human way finding differs from the study of animal navigation. Although animals may have mental representations of their environments and behavioral algorithms for foraging and homing (Gallistel, 1993), they do not use external aids such as cartographic maps, compasses or odometers. Similarly, children do not typically use external aids. Yet, as early as three years of age, they may attempt independent travel outdoors, using routes they have been shown to neighborhood play sites, exploring while keeping familiar places in sight, and returning home when called. More demanding way finding problems typically occur with the expansion of home range into unknown territory during early and middle childhood (ages 3 to 12; Matthews, 1992; Moore & Young, 1978).
II. Distance and Dispersion of Travel Researchers from several disciplines have described the extent of children’s excursions in rural and urban locales in a variety of cultures (Biel & Torell, 1982; Hart, 1979; Matthews, 1987; Munroe, Munroe, & Bresler, 1985; Spencer & Darvizeh, 1983; Tindal, 1971; Whiting & Edwards, 1988). Most of these descriptions are based on children’s self reports, identification of sites from aerial photographs, and sketch maps. However, our police partners required information in a format that could address the requirements of directing a search operation. They were familiar with summaries of lost person behavior
176
Edward H. Cornell and C. Donald Heth
(Syrotuck, 1979) and requested that we first establish the crow’s-flight distance between the child’s home and the farthest destination they travel to independently. Studies of home range indicate that the maximum crow’s-flight distance is a good index of the child’s way finding competence, although sites for activities are typically not the same distance in all directions (Matthews, 1987). Because the maximum crow’s-flight distance represents the child’s farthest destination, the measure usually reflects recent attempts to visit a new place. However, because of the layout of paths, distractions and barriers in their neighborhood, children’s travel to their destinations is longer than that estimated by a straight line (see Figure 1). Nevertheless, the measure of the crow’sflight distance to the child’s farthest destination has been found to reflect the ease of travel within different environments and parental restraints on travel (Hart, 1979).
Fig. 1. The crow’s flight distance is measured along a straight line between the child’s home (H) and intended destination (ID). The child’s actual path is depicted as an irregular dashed line and the dispersion of travel can be indexed as the angle of the segment that minimally includes the actual path. Here, the child’s path is on both sides of the crow’s-flight line, so the total angle of dispersion consists of a small and large angle (Cornell & Hill, 2006).
Children’s Way Finding
177
Our police partners were familiar with the measure of the crow’s-flight distance because records of lost person incidents include two sites, the point last seen and the point found. By plotting on maps and connecting the point last seen and the point found with a crow’s-flight line, Syrotuck (1979) was able to summarize the extent of travel by people lost in wilderness parks. Records typically include little evidence of the actual paths of the lost person and reconstructions of events by the lost person are unreliable, so Syrotuck recommended using the median crow’s-flight distance traveled as a radius for a circular area to contain possible paths. The circle could be centered on the point last seen when a report of a lost person was received. Search managers could direct initial operations to the circular area if they judged that the circumstances of travel of the lost person were similar to those characterizing the summarized incidents. To use this procedure, urban and suburban police needed summary data for children traveling from their homes. Our response was to introduce the research issue to suburban parents and ask to accompany their children on a walk. We asked individual children to take us on an adventure by leading us to the farthest place they had ever traveled to alone (Cornell & Heth, 1996). We followed from behind, using a surveyor’s wheel (1 rotation ¼ 1 m) and a digital watch to keep track of distance and time. These measurements and notes taken during the trip were used to draw the child’s route on 1:5000 survey maps. The children made all the decisions about routes and could rest or walk home at any time. We followed them everywhere, including shortcuts through shopping malls, across snow-filled vacant lots, and once through an ongoing soccer game. Children interrupted their walks to throw stones, to stand on a fire hydrant to survey the upcoming path, or to dash off to kick a pile of leaves. We first noted that many young children had selected more remote sites than their parents were aware that they had visited. After summarizing the data, we also noted that our observations included longer trips than those learned about from interviews. For example, we recorded that 6-year-old children led the way to sites that were on average 769 m crow’s-flight distance from their homes, demonstrating 3 – 4 times more distant travel than reported in studies asking children of the same age to name the places they could travel to alone. Methodological differences may not be the only source of these different results. There are significant cultural and cohort differences between the children who have participated in studies of home range (Moore & Young, 1978). Nevertheless, children consider ‘‘showing the way’’ to be an accomplishment and will eagerly lead peers and adults to favorite places. While reconstructing their route to these places, children can point to and name landmarks that they recognize. The scenes and objects surrounding the path cue a variety of memories in a familiar spatiotemporal order. The trip provides an effective mnemonic, a match between the context of encoding and the context of retrieval
178
Edward H. Cornell and C. Donald Heth
of memories. Similarly, behavioral observations of taxi drivers indicate that their environmental knowledge prompted while route finding on city streets is 25% better than their verbal descriptions in the laboratory (Chase, 1983). The implication for the determination of home range is that the child’s experience must ultimately be assessed in natural contexts, including meaningful goals and familiar skills (Gauvain, 1993). We were able to observe how police search managers used data regarding children’s home range during search operations (Heth & Cornell, 2006a). Although they used the median crow’s-flight distance traveled to draw a circular area for search, they usually did not search all portions of the area equally. They initiated an investigation to determine the intended destination or recent favorite sites of the missing child. They would then segment the circular area to prioritize search at these sites. We extracted data from our observations to help search managers delineate a prioritized segment when the intended destination had been learned. The segment was based on the median dispersion of travel that we had witnessed when children led us to their suburban destinations. We plotted their actual paths, which included both what children set out as their established route and their wandering on the way to their chosen destination (see Figure 1). We then drew lines to bracket minimally all of their path on either side of the crow’sflight line that connected their home with their destination. If the child did not show extensive travel lateral to the crow’s-flight line, the brackets delineated a segment that resembled a wedge. An individual child’s dispersion can be expressed by the angle of the segment and typical dispersion for an age group within a circular area for search can be indicated by descriptive statistics. Interestingly, the median size of the segment increases from 138 to 2168 between 7 and 10 years of age, indicating that older children show more dispersion in their distant travel. A segment of greater than 1808 indicates that the child has gone beyond the expanse between their home and their intended destination. Travel as dispersed as this challenges the argument that development in middle childhood is associated with efficient route choices, or least distance solutions. Children between 7 and 10 are exploring and wandering. In the section that follows, we consider the cognitive processes and representational capacities used to address the requirements of such way finding adventures.
III. The Ontogeny of Way Finding A. ONSET AND MOTIVATION OF SKILLS
It is difficult to establish a developmental milestone for the start of independent way finding but 8- to 11-month-old infants are able to find locations in
Children’s Way Finding
179
a large room if they have had more than 6 weeks of crawling or walking experience (Clearfield, 2004). Most infants walk unaided between 9 and 17 months after full-term birth and by 20 months, the average toddler will have tried running, walking sideways, and walking backwards (Bayley, 1969; Knobloch & Pasamanick, 1974). Despite these early abilities, analyses of gait indicate that mature patterns of walking may not develop until 3 – 5 years of age (Rose-Jacobs, 1983; Sutherland et al., 1980). The distance that children can walk is restricted by their endurance and body size; for example, biomechanical analyses indicate that the frequency of stepping by 1- to 5-year-old children is limited by their stature and musculature (Grieve & Gear, 1966). Nevertheless, by 18 months children seldom fall when walking and parents are often taken aback at their toddler’s lack of hesitancy to leave their side after spotting an intriguing event. The development of way finding skills seems to be a result of these intrinsically motivated adventures. We can see the early process of joining familiar paths in the activities of the missing 3-year-old boy who was located in a tractor yard. At our request, the day following the incident, the boy led his mother on the same paths that he had used to find his way to the tractor yard. He showed that he had followed a sidewalk that his friends had used to go to a school ground, made a line-of-site shortcut under a torn fence, played on some school swings, made another line-of-sight shortcut to a sidewalk that he had previously walked with his mother, followed a route they had often taken to a convenience store, and crossed the street where he had previously seen the tractors. He had traveled 610 m as the crow flies, which places him beyond the 75th percentile of Canadian children of the same age group who lead researchers to their most distant destinations (Cornell & Heth, 1996). The boy stated that he did not mean to go to the tractors when he went to the swings, but his reaction when found indicated that he enjoyed the outcome of his way finding. Observations such as these remind us of how children’s curiosity and interests drive their competence. Erratic and extravagant acts of exploration often lead to way finding skills (Cornell et al., 2001).
B. EVIDENCE OF EARLY PROCESSES
Studies of newly walking infants indicate the earliest processes used to direct travel. These processes are part of way finding methods used throughout the lifespan. For example, 14-month-old infants know the way to turn at the first choice point in a room maze after watching their parent turn there (Heth & Cornell, 1980). Under the conditions of these studies, the infant could not simply extend a body posture after visually pointing toward the parent’s turn; infants were not released by an assistant until they oriented straight ahead. We believe that processes of observational learning allowed the infant to know
180
Edward H. Cornell and C. Donald Heth
the open path; the movement of the parent in relation to environmental cues is encoded visually and translated into a directed action. 1. Allocentric frame of reference Infants perceive and remember spatial relations between objects and events to know how to turn. In the case of observational learning, the room included posters and windows so that the infant’s mother could be seen to walk near a particular wall (Heth & Cornell, 1980). The association of her movement with visible features along the wall would indicate that the infant is using an allocentric frame of reference. In an allocentric frame of reference, the position of objects other than oneself are encoded in relation to each other (Hart & Moore, 1973; geocentric may be used to indicate Earth as the other-than-self frame of reference). After a portion of the wall has been identified as a landmark close to the place of action, the infant can move straight toward the landmark. We need not assume that the infant has an internal representation of the configuration of the maze during this locomotion; travel toward a remembered wall feature would expose the opening to the path, which could then be the target for a similar direct approach. As long as the way finder discovers opportunities while looking for a sequence of immediate cues, way finding may not require reference to a map-like representation. 2. Egocentric frame of reference In one situation where infants viewed the layout of a short maze from a 458 elevation, the room was circular and devoid of distinctive features (Rieser et al., 1982). When held aloft centered in front of the maze, the open floor could be seen in relation to the midline of the infant’s body. Because events in space are perceived differently by sense organs on either side of the midline, the position of the open floor is specified by an egocentric frame of reference (Howard & Templeton, 1966). In an egocentric frame of reference, the positions of objects are encoded in relation to self. Because there were no landmarks to approach, the infants’ correct choices indicated they had encoded the location of the opening in relation to an egocentric framework. 3. Memories of movements The infant’s performance in one-choice mazes also indicates an early ability to keep track of one’s own movements (Heth & Cornell, 1980). The history of movement from a place can be used to infer one’s position relative to events or landmarks that were perceived at the onset of the movement. The process is called dead reckoning (Gallistel, 1993; Cornell & Heth, 2004). The direction and distance of movements can be registered internally through patterns of efferent, kinesthetic, vestibular, and proprioceptive sensations; moreover,
Children’s Way Finding
181
infants and adults seem particularly sensitive to changes in external visual events accompanying movement (Loomis et al., 1999; Schmuckler & Tsang-Tong, 2000). As an example, consider the infant’s ability to choose a path that has been seen from a different perspective than at the choice point (Rieser et al., 1982). At the outset of this problem, 25-month-old infants were taken to the left or right side of the central starting point for a short maze. They were then lifted to see their mother seated within the layout of barriers. The open path to the mother was either near the infant or across the maze. The infant was then moved by the researcher to the central starting point. During movement, the visual flow of textures specify the direction of travel (Gibson, 1979). Once released, the infants who had seen the maze opening close to their original position could approach it by reversing the pattern of movement imposed by the researcher. For the infants who had seen the opening across from their original position, the pattern of movement imposed by the researcher would be consistent with an approach that could be continued once released midway. In this interpretation, we need not assume that infants regulate travel in reference to a map-like representation of the layout of the maze. Instead, we suggest that the 25-month-old infants could register the location of the opening relative to self when viewing it and see visual cues accompanying their movement. This ability seems to be within their repertoire; after displacements, infants as young as 6 months of age can visually anticipate the location of a stable target (Tyler & McKenzie, 1990).
C. SUMMARY
Our interpretations suggest that early processes of human way finding are based on the perception of movement. In particular, we favor Gibson’s (1979) analysis of spatial information available in visual flow to indicate how infants can register goals and direct their locomotion at choice points. Systematic changes in the perspective of things accompany the movement of self in a surround of objects and surfaces; these events provide information for orientation within an egocentric frame of reference. The regularities in visual flow also reveal features of the environment whose relations to one another do not change with changes in the viewer’s perspective; this invariant structure provides information for bearings within an allocentric frame of reference. Although several sense systems provide internal cues for biomechanical motion, we do not see strong evidence that ambulatory infants can accurately monitor their movements when deprived of vision (Cornell & Heth, 2004; Liben, 1988). For adults as well, the primary information specifying movement may be visual flow. The demonstration of observational learning as an early solution to spatial choice deserves some emphasis. There is accumulating evidence about how
182
Edward H. Cornell and C. Donald Heth
social activities introduce spatial concepts to infants. Their mother in particular may elicit attention to the location and direction of modeled behavior. A mother’s location in a room is a primary landmark, frequently updated and associated with neighboring environmental features (Presson & Somerville, 1985). By watching the direction of her head and gaze, infants make inferences about where important things are and how objects and space are partitioned by gesture and linguistic conventions (Baldwin, 1995; Butterworth & Jarrett, 1991). Although no evidence indicates that parents are better landmarks and guides than other adults or animate events, the early attention to caregivers is a natural introduction to the importance of others for demonstrating specific routes, teaching strategies for way finding and using the linguistic and symbolic conventions that are sociocultural representations of space.
IV. Landmark and Place Recognition A complete description of the information perceived in the immediate environment would not be a sufficient account of way finding. Information must be remembered because the features of large-scale environments that we travel through cannot be perceived from a single vantage point. Experiences along paths and at places can be retained and organized to constitute a route or history of movement. In addition, theories of cognitive mapping and theories of ecological perception both propose that memories of events are integrated so that we apprehend the relative directions, distances and layout of landmarks and paths (Heft, 1996; Kitchin & Blades, 2002). In this portion of our discussion, we first consider that the processing of memories may occasionally allow a simple method of repeating travel without the organizational properties of route and survey representations: Children can approach places that appear to be familiar (Cornell, Heth, & Alberts, 1994). The analysis suggests how recognition processes are sufficient to repeat a route or reverse a route, but may be inadequate for off-route way finding. In addition, mechanisms of recognition are necessary to retrieve associations of landmarks and places with actions such as turning or continuing. Studies of the development of home range indicate that young children independently repeat routes they have walked when accompanied by adults and peers (Hart, 1979; Matthews, 1992). If some landmarks and paths along the way are only partially familiar, processes of way finding may be interleaved with processes of route repetition. Rather than automatically turning at an intersection, the child might have to judge the familiarity of alternative paths. We also know that young children are easily distracted when traveling or playing outdoors at their everyday destinations. They may be drawn to interesting sights, become engrossed in group activities, or seek escapade. More cautious
Children’s Way Finding
183
young children will explore new places that allow a view of home or customary paths (Cornell et al., 2001). If a path is tried that is beyond what is known, the child can return by turning around and monitoring the recent familiarity of landmarks as they are encountered again. Hence, recognition processes are fundamental for way finding that occurs when partially familiar routes are repeated and newly traveled routes are reversed (Cornell, Heth, & Alberts, 1994). These early processes are practiced throughout subsequent development. Approaching familiarity is typically automatic when adults repeat oft-traveled routes (Chase & Chi, 1981; Hasher & Zachs, 1979).
A. DEVELOPMENTS IN RECOGNITION PROCESSES
A developmental study of recognition of landmarks in photographs suggests implications for way finding (Kirasic, Siegel, & Allen, 1980). The photographs were real-world scenes containing distinctive features such as bridges or fountains. These features could readily serve as landmarks to identify a place. Compared to 10- and 22-year-old participants, 6-year-old children were less accurate and slower at identifying photographs of places they had studied. In particular, the youngest children were slow to discriminate between the original scenes and foils that involved either substituting a new landmark or substituting a new environmental context for a previously seen landmark. The results suggest that with the complexity of natural scenes, young children have difficulty differentiating novel and familiar cues. Two field studies of way finding elaborate this finding. The first involved children’s ability to detect that they are traveling off a previously traveled route. Visual recognition can inform us that we are off route in two ways. One is an accumulating absence of familiar or expected cues. The other is noticing something en route that we are sure that we have never seen before. These modes of recognition involve an analysis of the heterogeneity of geographical space (Goodchild, 2001). Schoolyards, storefronts, vacant lots, and other distinct features populate the home range of suburban children, forming landmarks and regions of spatial uniqueness. Hence, the development of skills for scanning, discrimination, and anticipatory recall of landmarks is important for differentiating places as familiar or novel. Features within a geographic layout are also spatially correlated, or occur in patches as the result of a convergence of natural forces or arranged zoning. This means that objects that are closer together are more similar than objects that are farther apart; hence, when travel occurs over long distances, features at the beginning of the route are more likely to be different than the ones at the destination. Movement through geographic space also exposes repeated patterns such as the association of greenery with water and the dissociation of trees with asphalt. Hence, the development of perceptual
184
Edward H. Cornell and C. Donald Heth
and categorization processes that respond to spatial correlations of features is important for recognizing the context of travel. Cornell, Heth, and Alberts (1994) found that, in general, judgments of familiarity correctly decreased as children and adults were led astray for increasing distances from a novel route they had recently traveled. The 8-yearold children were less accurate than 12- and 25-year-olds. A signal detection analysis of recognition processes indicated that 8-year-olds were less likely to judge places off route as novel. In particular, 8-year-olds were more likely to judge that they were on the original route when recognition judgments were made at test sites off route that were close to and facing the intersection of the original route. This bias to accept familiarity does not seem to be inappropriate when approaching the original route after an off-route excursion. However, the 8-year-olds more often did not know the way to turn at that intersection. Cornell, Heth, and Alberts (1994) developed a theoretical interpretation that suggests that there are difficult discriminations of familiarity of paths at the choice point that would have allowed the participants to return along the original route. Cues alongside the off-route approach are familiar because they have been seen from the perspective of initiating the off-route excursion. Cues across the intersection are recently familiar because they have been seen in the background while approaching the intersection from off route; moreover, they may be historically familiar because the child looked down the side path at them when first traveling the original route. These considerations indicate that off-route way finding by approaching familiar cues may require more than accurate recognition processes. Choices between familiar paths may require accurate temporal coding of the memories of actions and landmarks. Children could differentiate similar impressions of visual familiarity if they could remember the serial order of events; views on the original route have been seen earlier in travel than views off route. In addition, Cornell, Heth, and Alberts (1994) found that 8-year-old children more often did not know the way to go even when they correctly judged that they were off the original route. This could occur because the youngest children were relying on cues that were close to the paths they were walking. When these were unfamiliar, they could not approach a more distant cue that was familiar. As we shall see, the temporal order of cues and the cues that specify the general heading of travel are part of first knowledge that occurs when 10-year-olds find their way along novel routes (Cornell, Heth, & Skoczylas, 1999). A second field study indicates that young children have difficulty encoding the spatial relations between a landmark and its environmental context (Heth, Cornell, & Alberts, 1997). The 8- and 12-year-old children were escorted on their first walk across a university campus. Along the way, they were instructed to pay attention to designated landmarks at four intersections: ‘‘Look at that brown sand box, the one with the white letters. You should try to remember that
Children’s Way Finding
185
brown box to help find the way back.’’ Some of these landmarks were then surreptitiously moved, either rotated in place or translated across the intersection before the return trip. During the return, 8-year-old children were more likely than 12-year-old children to judge that they were off the original route when they were at the intersections with changed landmarks. The younger children were also less likely to point to the correct direction to return at those intersections. When asked later, both age groups reliably detected that something was different about the changed landmarks, but the 12-year-olds were more likely to identify that the landmarks had been moved. There were no age differences and high recognition and pointing accuracy at intersections on the original route where landmarks had remained unchanged. Children were asked to describe what they saw that made them point to a path to return (Heth, Cornell, & Alberts, 1997). When stopped at intersections off route, 8-year-olds were more likely than 12-year-olds to incorrectly name a new landmark as familiar. In contrast, at these same intersections off route, 12-yearolds were more likely than 8-year-olds to correctly name a new landmark as unfamiliar. When stopped at intersections on route, children of both ages said they used the landmarks that had been pointed out to them during the original walk and both age groups reported using 5 – 9 additional landmarks as well. However, the 12-year-old children reported using the most landmarks and were more likely to name landmarks that were peripheral to the ones that had been pointed out. In this study, landmarks were classified as peripheral when they were outside of a photograph with a 358 visual field centered on the landmark that had been pointed out for remembering. The tendency of the older children to name landmarks that are more peripheral suggests that one of the general developments in the ability to recognize places is efficient perceptual search (Allen & Ondracek, 1995). When adults study scenes, as fixated objects are more quickly identified and localized, more flanking objects can be fixated (Rayner & Pollatsek, 1992). Processing accompanying successive fixations may serve to link landmarks to other objects or borders in the immediate surround (Blades, 1989; Golledge, 1995; Presson & Montello, 1988). Rapid execution of these processes may have freed 12-year-old children to register objects and spatial relations that were beyond the immediate neighbors of a designated landmark. In other words, efficient perceptual search provides children with opportunities to direct their attention and see places in greater area. Efficient perceptual search would affect route learning as well. Younger children typically know less than older children and adults about the sequence of events along a newly acquired route (Siegel, Kirasic, & Kail, 1978). Gaps in route knowledge could occur, for example, if a younger child had been preoccupied with an object next to the path, but the spatial extent of his or her attention to peripheral landmarks was less than that of older children.
186
Edward H. Cornell and C. Donald Heth
The younger child may not have time to register a distant landmark that could be seen from different places along the route. Landmarks in the skyline would be particularly important for anchoring route events in a large-scale frame of reference. B. SUMMARY
Our review suggests that children’s landmark and place recognition abilities at 12 years of age are not reliably different than those of adults. Nevertheless, some basic developments in visual recognition processes are relevant to way finding. The improvements before 12 years of age include increased place recognition accuracy, with an important shift from accepting objects off route as familiar to correctly identifying them as novel. The efficiency of visually processing scenes also increases with age (Day, 1975). More rapid encoding of a centrally fixated landmark may allow older children to move their focus to more of the objects that are sensed in peripheral vision. With a short eye movement, these objects can be identified and seen to be neighbors in the visual field. The extent of scanning may underlie the ability to see that places overlap along a route and are situated in a larger frame of reference. There were indications that these basic visual recognition abilities are supplemented by other abilities to learn and remember spatiotemporal events during travel. For example, after stepping off route and eventually recognizing that an encountered place was novel, some children attempt to retrace their steps. Retracing is not a simple matter at the intersection of their off-route path with the original path. The children could not simply approach what was familiar because features of all the intersecting paths would have been seen before. In this situation, off-route way finders could choose between alternative paths by approaching landmarks that were temporally encoded as earlier in their travels or by remembering the direction they chose when they turned off path. In the next section, we consider how a recognized place is situated in a series of memories of travel. We also examine associations between recognized places and actions such as continuing or turning. Following a well-known distinction made by Siegel and White (1975), we are moving our discussion from processes of recognition of landmarks to processes of route learning.
V. Memories of Routes To the extent that environments are heterogeneous, such as neighborhoods with mixed housing and commercial development, children can use landmarks, places, and vistas to distinguish a course of travel. The process of directing travel in accord with a progression of environmental events is known as piloting
Children’s Way Finding
187
(Gallistel, 1993). Piloting typically occurs episodically when way finders notice the identity, distance, and bearing of certain landmarks as they are encountered. Children show several developments in the ability to selectively attend to and remember route events that are precursors to this common form of way finding. During their first experience along a route, children may differentially attend to salient events, such as colorful or animate objects (Allen et al., 1979; Cornell et al., 2001). With repeated experience along the route, objects are increasingly distinguished from one another and those that were noticed on initial trips may accumulate familiarity (Acredolo, Pick, & Olsen, 1975). The differentiation of features along a route provides details that can help way finders determine whether they are on or off intended paths (Gibson, 1969). In addition, in most models of serial learning, improvements in the quality of representation of route events would facilitate the associations between those events (Brown, 1997). At the same time, as particular landmarks become familiar at places along the route, way finders need not examine them in detail and may explore and examine new objects from that viewpoint. As with our description of visual scanning, sequential attention provides spatial and temporal contiguity of the processing of route events. In addition, when children are accompanied on their repeated excursions to distant destinations, they reveal response learning: ‘‘I know that we turn up here somewhere’’ (Cornell, Heth, & Skoczylas, 1999). Similar to the anticipation of landmarks, the recollection of actions can also occur episodically and involve the qualities of the event (stopping, running, going uphill), distance and bearing; these memories are the phenomenological data people use when navigating by dead reckoning (Cornell & Heth, 2004; Sholl, 1996). In sum, theories of route learning have emphasized the child’s differentiation and serial ordering of environmental events (e.g., Siegel & White, 1975) or have emphasized the child’s representation of actions (Blades, 1997; Piaget, Inhelder, & Szeminska, 1960). Because these events naturally co-occur during travel, external events such as landmarks may serve to prime, organize, and confirm internal events such as the action of turning. Similarly, actions can produce expectations of events along the route (Cornell, Heth, & Skoczylas, 1999; Cornell, Sorenson, & Mio, 2003). In the next sections, we illustrate how the effects revealed during children’s route learning are compatible with theories of associative learning.
A. THE ACQUISITION AND REPRESENTATION OF SERIAL ORDER
In context-based models of associative learning, landmarks and actions would be linked to one another along a time line or serial representation of occurrence
188
Edward H. Cornell and C. Donald Heth
(Brown, 1997). The time of occurrence appears to be on an ordinal scale. For example, after taking a 2.4 km walk through a neighborhood for the first time, 10-year-old children and 24-year-old adults either repeated or reversed the route and were stopped before intersections came into view. Even though they had not been told they were to be tested, at six sites along the route, children and adults were able to discriminate pictures of the next intersection from a picture of an intersection they had passed by recently or a picture of an intersection that was farther up the path (Cornell, Heth, & Skoczylas, 1999). The serial ordering of route events by children and adults is also evident in scene sequencing tasks and verbal recall of trips (Cousins, Siegel, & Maxwell, 1983; Golledge et al., 1985; Torell, 1990). By 9 – 10 years of age, children who have walked a new neighborhood route 2 to 4 times seldom err when they arrange photographs of scenes along the route in a sequence (Golledge et al., 1992; Torell, 1990). Moreover, the assessment of route memories yields an effect that is considered to be fundamental to human serial processing: Events that occur near the termini of a complex urban route are more likely to be remembered than those in the middle (Cornell, Heth, & Broda, 1989; Cornell, Heth, & Rowat, 1992; Golledge et al., 1985, 1992). For example, 8- and 12-yearold children were asked to point to the way to proceed at intersections while returning along a 1 km route around a university core. From the most recent intersection at the end of the route to the intersection near the origin of the route, the pattern of correct choices was a U-shaped serial position curve (Cornell et al., 1996). The study of the serial nature of route learning provides a good example of translation of basic research into practice. Cornell et al. (1996) mathematically described the serial position curve to estimate children’s likely errors when reversing a new route. The probabilities of children’s errors at intersections during route reversal were then integrated with police procedures to create an algorithm for prioritizing areas for search. The priorities produced by this algorithm were different than those selected by a novice police search manager. The algorithm added unique emphases to areas close to the route and midway along the route reversal. When the algorithm was used in simulated searches, the prioritized areas were consistent with where children had previously wandered from the university core.
B. ACTION NODES AND SEGMENTS OF ROUTES
Patterns of serial memories cannot be adequately explained by chaining successive pairs of sequential events (Brown, 1997). Models of route learning that link landmark–action and action–landmark associations are strippeddown accounts of how people remember travel. During their very first
Children’s Way Finding
189
experience on a route, children may uniquely note special places, differentiate the character of areas, register the patterns of their movements, and observe bearings to distant landmarks. Golledge (1978) suggested that certain places serve as anchor points or organizational nodes for representing routes. In particular, there are both spatial and temporal labels for the termini of routes (e.g., home–destination, beginning– end, origin–goal), indicating positional distinctiveness for primacy and recency effects in children’s memories of travel (Cornell et al., 1996; Golledge et al., 1985; Neath & Crowder, 1990). In addition, early in route learning, 9- to 11-year-old children note places where way finding decisions or turns are made (Doherty, 1984; Golledge et al., 1985). Intersections are action nodes in representations of routes. As well, intersections are usually open areas that show how paths meet and opportunities to check views of places where paths are headed. These visual prospects are noticed by children as young as 7 years; after viewing a slide presentation simulating a walk through a commercial district, they were asked to select photographs ‘‘. . . that would most help them to remember where they were.’’ They chose scenes in the vicinity of choice points (Allen et al., 1979). Because a choice point involves environmental features that are associated with a potential or real change of heading, the choice point establishes a place early in serial learning where a route may be segmented. Segmentation, like other forms of chunking, provides a means of organizing information so that a smaller number of superordinate representations can embed a larger number of route events, some of which may be forgotten (Carr & Schlisser, 1969). Segmentation was apparent in a detailed case study when an 11-year-old boy showed clustered retrieval of memories of environmental features. He concentrated his recall on landmarks and path cues in the vicinity of choice points (Golledge et al., 1985). Moreover, the boy’s attempts to sketch his route revealed hierarchical knowledge, a tendency to compose first a skeletal map of road sections where way finding actions were necessary. Roads that connected these sections were second to be drawn, followed by landmarks that appeared next to the route. Intersections are not the only delineators of route segments. By 10 years of age, children can also discern features that characterize different areas traversed by paths (Allen, 1981). For example, when reconstructing a sequence of photographs of a route, children partition the route into segments bordered by a wooded park, a university campus, and a residential neighborhood. The ends for these segments were photos depicting environmental transitions. For example, children placed photos of the campus in a row and photos of the residential neighborhood constituted another row; children selected photos showing a major street bounding these two areas as the last scene of the campus row and the first scene of the residential row. Segmentation by environmental
190
Edward H. Cornell and C. Donald Heth
features indicates that children are beginning to form place schemas or general expectancies based on world knowledge, such as the expectancies that farmland is flat, that parks have trees and swings, and that malls have food courts. In the outdoors, people often form place schemas when they notice the correlation of geographic features, such as the correlation of a watershed with a downward slope. People use place schemas to check that travel is appropriate in an unfamiliar area (Cornell, Heth, & Skoczylas, 1999). For example, playmates may realize that they have misinterpreted the directions to a friend’s suburban house if they encounter high-rise construction. C. SUMMARY
By middle childhood, children’s route learning is more than a linear series of associations between landmarks and actions. Landmarks do not have equal status; early in their learning, 9- to 11-year-old children remember those perceived at choice points. The selective memory for choice points suggests that children of these ages appreciate the advantages of staying on route. The 9- to 11-year-old children also organize their memories of routes in a hierarchy, with landmarks, bearings, and actions embedded within segments. Route segments may be delimited by choice points or by the commonalities of the territory they pass through. The segments themselves constitute a smaller number of schematic memories embedded within a larger spatiotemporal framework, defined by the beginning and end of the trip.
VI. Bearing Knowledge in Way Finding Since the demonstrations of detour and shortcutting behaviors by animals (Tolman, 1948), psychologists have been particularly intrigued with the notion that our mental representation of our movements while on the ground is organized to reflect a survey of the territory as if seen from above. Knowledge of bearings between self and landmarks, and knowledge of bearings between landmarks are primitives of survey representations (Golledge, 1995). Because well-organized survey representations include formal (typically Euclidean) properties, they have special status or are an ultimate development in stage theories of spatial cognition (Hart & Moore, 1973; Piaget & Inhelder, 1967; Siegel & White, 1975). However, early in development, children are seeing bearing and distance relations of landmarks during ground level exploration of their home range. Even without comprehensive survey knowledge, children may use perspectives of a familiar landmark in the skyline to guide way finding. Awareness of self-to-object bearings and object-to-object bearings is roughly indicated by 1- to 2-year-olds’ ability to point and look where someone is
Children’s Way Finding
191
pointing (Butterworth & Grover, 1999). By 18 months of age, infants are accurate within 608 in their first turn toward a visual target that their mother is looking at. Under some conditions, infants’ turning indicates that they can extrapolate their mother’s line of gaze to one of two targets separated by 608 in the visual field behind them (Butterworth & Grover, 1999). Because the reference target is out of view, this latter performance indicates that 18-month infants are beginning to represent and parse the spatial surround. Studies of route reversal performance suggested that older children are registering bearings during their first exposure to novel territory (Cornell, Heth, & Broda, 1989; Cornell, Heth, & Rowat, 1992). These were studies on a university campus that called for 6- and 12-year-old children to return along a guided route. When they were not informed that they would be leading the way back, both children and adults typically failed to reverse the newly walked route exactly (Cornell, Heth, & Rowat, 1992). They made at least one incorrect path choice at intersections and had to find their way back to the initial route or its origin. Interestingly, when children as young as 6 years of age were wandering off path, the majority correctly headed toward the expanse that contained the origin of the route (Cornell, Heth, & Broda, 1989; Cornell, Heth, & Rowat, 1992). Some of this correctly oriented way finding could be the result of monitoring environmental features on the incorrect path such as the orientation of shadows (Cornell, Heth, & Skoczylas, 1999). By 12 years of age, children were able to direct their off-route way finding in reference to a distant feature such as the position of the sun, a line of trees along the skyline, or a tall building (Cornell, Heth, & Broda, 1989). Finally, some children may be able to maintain a short course by dead reckoning, monitoring their movements, and correcting any deviations that occur in reference to remembered bearings (Rieser, 1999; Rieser, Garing, & Young, 1994).
A. POINTING OUTDOORS
In the context of way finding outdoors, children as young as 8 years of age demonstrate bearing knowledge by pointing to reference sites along a complex route with intermediate accuracy (Anooshian & Owens, 1979). In this demonstration, children were stopped at the end of four route segments and were instructed to study landmarks from these sites during two initial walks around an apartment complex. During a third walk, they were again stopped at the segment end points and asked to point a telescope as if they could view the other sites, which were not visible. The discrepancy between the bearings indicated by the children’s pointing and the actual bearings to the target sites was a 488 mean absolute error. When 5-year-old children are assessed when pointing with their finger, the mean absolute error of bearing estimates within
192
Edward H. Cornell and C. Donald Heth
a rectilinear building is as low as 208 (Lehnung et al., 2001). Mean absolute error by adults in similar tasks in large-scale environments ranges from 20–548 and chance performance is 908 (Cornell, Sorenson, & Mio, 2003; Golledge et al., 1993; Greidanus, 2003; Montello & Pick, 1993; Silverman et al., 2000). The accuracy of young children’s pointing suggests that, in familiar environments, they know bearings to the extent that they could be used to construct Euclidean mental representations (Conning & Byrne, 1984). Children also learn bearings without instructions to do so. For example, children as young as 6 years of age were able to infer a bearing to a single reference point, the origin of a 1-km walk across a university campus (Heth, Cornell, & Flood, 2002). They had not been told to keep track of their movements, yet had registered information during their first trip that allowed them to point from the end point of the walk with a mean absolute error of 548. The extent of the children’s pointing error was significantly less than chance performance and was similar to the 528 dispersion of their return path when they attempted to retrace the walk. The adults’ mean absolute pointing error was 308, significantly less than the children’s under the same conditions of incidental learning and testing. The angular extent of an adult’s return path was 418, close to the minimum dispersion required to circumvent obstacles on the way to the origin of the walk. Young children show more accurate knowledge of bearings when they are in familiar environments. Children of 3 – 4 years of age point more directly to nonvisible targets in their own homes (238 mean absolute error) than when pointing at non-visible targets in an area around their home where they frequently went on walks with their parents (458 mean absolute error; Conning & Byrne, 1984). Interestingly, when these young children pointed to targets from neighborhood sites that were relatively less familiar, their inaccuracy tended to be biased in the direction of routes they used to walk to the targets. Adults show a similar bias (Chase, 1986; Heth, Cornell, & Flood, 2002). The implication for both age groups is that mental representations of large-scale spaces are influenced by the course of travel through those environments.
B. SUMMARY
Developmental studies indicate that bearings are perceived and registered by young children as they travel outdoors and that development consists of increasing accuracy, especially in the ability to estimate bearings after initial experience in an environment. Knowledge of bearings seems particularly useful when children are expanding their home range into territory where there are no familiar landmarks near paths. Six-year-old children who showed more accurate estimates of bearings to the origin of their walk showed less spread
Children’s Way Finding
193
of wandering when they had stepped off route during their attempted return (Heth, Cornell, & Flood, 2002). Way finders can avoid excessive lateral movement if they ‘‘steer a course,’’ or make adjustments to return to an imagined bearing to their goal after deviations through travel corridors (Jonsson, 2002). The evidence that very young children can use bearings and that people register bearings incidentally is compatible with ecological theories of orientation as the result of perceptual processes rather than deliberate mnemonics and calculations (Gibson, 1979; Heft, 1996; Rieser, 1999; Sholl, 1996). Ecological theory suggests that children’s early examples of survey perspectives and inferences concerning bearings are derived from sensorimotor experience (Pick, 1988; Rieser, 1983). As the infant is picked up by the parent, or as a child climbs the playground ladder, objects and places the child was viewing at a horizontal elevation undergo continuous perspective transitions until surveyed from above. As children walk through their neighborhood, they can see that the position and form of distant objects do not change as much as those near their paths. By 12 years of age, children select these relatively stable objects to monitor their bearings during travel.
VII. Strategy Development Our review of home range activities indicated that efficiency is not always a priority for children—they sometimes choose to lollygag. Nevertheless, the consequences of errant travel provide strong motivation for learning way finding skills. One of the oldest studies of children’s fears reported that the ‘‘dread of getting lost is common’’ in school children and adults alike (Hall, 1897). Infants and toddlers may not realize that the world consists of expansive and complicated spaces. Although they can be apprehensive about separation from their parents, when attracted, they will readily follow an animal into the forest or strike out on little expeditions with no concern for the return trip (Hill, 1999). Parents typically admonish such impulsiveness and even a brief incident of disorientation reinforces for the child the importance of ‘‘paying attention’’ during travel. When children are not instructed what to attend to, they experiment. For example, when leading a researcher to the farthest place from home he had ever ventured to alone, one 6-year-old boy volunteered that he carefully attended downward: ‘‘I just know how to get there by looking at the ground. All I need to look at is the ground’’ (Cornell et al., 2001, p. 223). The boy’s statement and subsequent explanation forecast several facets of the development of way finding strategies. Many strategies begin with selfdiscovery. The boy noted particular features of the path—cracks in the sidewalk, crumbling curbs—as cues for upcoming turns and as reassurance that he was on the correct route. His selectivity was a prospective strategy, limiting his
194
Edward H. Cornell and C. Donald Heth
attention systematically to establish a small series of unique events to guide route reconstruction. Although prospective, like the bread crumbs used by Hansel and Gretel, his strategy was vulnerable; a downward pattern of scanning would not help to register landmarks that could be used if he stepped off path during his return. Reactions to errant travel could precipitate a new, more advanced strategy (Siegler, 1996; Cornell et al., 2001). The boy could, as we shall see, begin looking for cues on the horizon during his route learning.
A. SELECTIVE ATTENTION TO LANDMARKS
Modern urban parents may not demonstrate way finding strategies or tell their children much about the layout of their community because they have reservations about safety (Cornell et al., 2001; Torell & Biel, 1985; Valentine, 1997; Woolley, Dunn, & Spencer, 2000). As a result, children’s exploration and expansion of home range may provide raw lessons about characteristics of environments and the requirements of self-directed travel. For example, 12-year-old children quickly learn to attend to pertinent landmarks; after leading a walk to one of their far special places in their neighborhood, they increasingly named objects near choice points as useful to find their way back (Cornell et al., 2001). An analysis of the suburban landscape indicated how children select landmarks from heterogeneous views (Cornell et al., 2001). Videotapes of children’s neighborhood walks were randomly sampled and objects that appeared in randomly sampled frames were counted in categories as permanent, distant, unique, or near intersections. While leading their walks, 8- and 12-year-old children said they used unique objects as landmarks, such as a house with a red door or a yard with an alpine rock garden. Both age groups named unique objects as landmarks with four times greater frequency than the baseline count of their occurrence in the videotape samples of the walks. The 8-year-olds named objects at intersections and distant objects (visible in the skyline to be at least two blocks off their route) as landmarks with a frequency corresponding to their baseline counts, whereas 12-year-olds referred to them significantly more often. The baseline count of permanent objects indicated that 8-year-old children showed inappropriate selective attention; they named transient events as landmarks more frequently than the events occurred in the videotape samples. The 8-year-olds named bumblebees, litter dancing as it was blown down the street, dogs barking, and hot rods as things that could help them find their way, but did not associate these events with more permanent site cues. In contrast, 12-year-old children named permanent objects as good landmarks with a frequency equivalent to their high baseline count.
Children’s Way Finding
195
Recordings taken during the walks indicated that characteristics of landmarks were only one of a variety of lessons learned that would allow for efficient way finding. One 11-year-old girl had observed the routes that buses used in her neighborhood, another showed how street numbers could tell you how many blocks there were left before reaching a destination, and an 8-year-old boy correctly noticed that a street exited in the opposite direction of its entrance (Cornell et al., 2001). These reports suggest some of the multiple sources of information that adults say they observe and use when solving problems of orientation and way finding (Cornell, Sorenson, & Mio, 2003). The pattern of reports is consistent with models of executive selection of processes to use readily interpretable information, to monitor progress, and to react to outcomes of problem solving attempts.
B. VERBAL MEDIATION
By 10 years of age, children show a variety of verbal elaborations when they encounter landmarks. Not only are they reading street signs, addresses, and advertising, they invent names for unique configurations of landscaping and buildings and comment on the activities and purposes associated with sites (Torell, 1990). Torell recorded several examples during neighborhood walks with 10-year-olds: A girl named a hill ‘‘the Rocky Mountains;’’ a boy referred to a patterned light-and-dark surface of a commercial square as a ‘‘chess board;’’ after observing bank offices along the square, a girl speculated about ‘‘competition in the banking business.’’ It is well-documented that children’s naming or verbal encoding helps to establish the distinctiveness of perceptual cues and allows for rehearsal, organization, and elaboration of associations for later retrieval (Gibson, 1969; Schneider & Pressley, 1989). Yet, there is little direct evidence that children are intentionally using verbal mediators as mnemonics for way finding. Their verbal associations may be covert and automatic, a natural consequence of encountering the heterogeneity in the environment. Nonetheless, verbal processing requires mental effort over and above activities (such as approaching familiar landmarks) that are sufficient for way finding. Traditional definitions of strategic behavior accept that strategies such as verbalization achieve cognitive purpose, such as comprehension and memorizing, and are potentially conscious and controllable operations (Bjorklund & Harnishfeger, 1990). Older children may notice the correspondence between commenting about events they see and their ability to remember routes. For example, verbalization of landmarks increased as 10-year-old children were asked to reconstruct a neighborhood walk over three trips and was associated with increasing spatial accuracy of their sketch maps (Torell, 1990).
196
Edward H. Cornell and C. Donald Heth
C. TRAINING PROSPECTIVE STRATEGIES
Recommendations for way finding and staying oriented abound. Scouting manuals, wilderness guides, folklore, natural history books, and field notes of anthropologists have described children’s tracking games, exercises for map and compass reading, mnemonics for routes and landmarks, and strategies for monitoring travel in relation to geographic frameworks (McVey, 1989). Although the methods are based on adults’ intuitions of way finding processes and pedagogy, very few have been assessed. This is unfortunate because observations of children responding to instructions for strategic way finding reveal the development of cognitive abilities in a natural problem domain. These observations also yield information about the ways that parents can instruct their children so that they do not become lost. By comparing two attempts to teach 6-year-old children, we illustrate the look-back strategy (Cornell, Heth, & Rowat, 1992; Heth, Cornell, & Flood, 2002). Observations of hunter–gatherer cultures indicate that novices are often instructed to look back when experienced travelers show them an important site such as an intersection or water hole (Gould, 1969; Nelson, 1969). Trail breakers and explorers also use this mnemonic (Gatty, 1958). The strategy anticipates that objects and layout are often not recognized from different perspectives; when returning along a route, way finders may make a wrong choice because they had not seen a unique side of a landmark or the angle of confluence of a branching path. In an early attempt to measure the effectiveness of the look-back strategy, Cornell, Heth, and Rowat (1992) gave instructions to 6-, 12-, and 22-year-old participants in way finding research. Participants were accompanied during their first walk on a university campus and were stopped on 11 occasions to be told, ‘‘We have walked far enough so that it might be a good idea to turn around and look where we came from.’’ Analyses of distance traveled and choice point errors indicated that the 12- and 22-year-old participants who received the look-back instructions were more likely to stay on route during the return than cohorts who had been uninstructed. Six-year-old children did not benefit from the instructions and research indicated that, when looking back, they might not select environmental cues that are unique or permanent landmarks. In a subsequent attempt, Heth, Cornell, and Flood (2002) only stopped participants to receive look-back instructions at three choice points along a novel campus walk. However, Heth, Cornell, and Flood (2002) had analyzed which choice points were likely to be difficult, called for anticipatory recall of the position of landmarks and paths before turning to look back and made explicit the advantages of the participant’s choice of landmarks for directing the return. These elaborations effectively reduced 6-year-olds’ errors at these intersections as well as the distance that they traveled off route when they led the return.
Children’s Way Finding
197
The different results in the two studies fit with Flavell’s (1970) description of the development of strategic thinking. With appropriate instructions, children as young as 6 years can execute prospective strategies for attending to and memorizing environmental features; their way finding performance improves (Cornell, Heth, & Broda, 1989; Heth, Cornell, & Flood, 2002). However, younger children may not spontaneously produce strategies from their repertoire that effectively anticipate the requirements of route reversal (Cornell, Heth, & Rowat, 1992). The key for ensuring that young children produce attentive strategies may be an emphasis to anticipate what is needed to direct future travel. The successful instructions by (Heth, Cornell, & Flood, 2002) reinforced for 6-year-olds that landmark choices were good when the landmarks were permanent and their appearances were known at locations during the return.
D. STRATEGIES WHEN LOST
The condition of being lost is difficult to describe, but usually begins when a way finder is disoriented (Cornell & Hill, 2006). Disorientation may include a failure to identify the present location but more typically involves a failure to know how movement may be directed to a desired destination: the original paths, the origin of travel, places with people, or home (Hill, 1999; Montello, 1998). In addition to an inability to find the way, lost is a psychological state that involves reflection on negative consequences. The resultant anguish and physiological arousal can seriously interfere with problem solving (Hill, 1999). Although when alone many people move impulsively during their initial reactions to being lost, most settle down and attempt a more effective response (Hill, 1999). As a volunteer for a search and rescue team, Hill has interviewed lost children after they have been found, during the period when they realize they are safe and before their multiple reconstructions of their ordeal are affected by adult feedback. Hill was able to identify four strategies that children used when lost, although he has not gathered enough data to establish the frequency of use of strategies for children of different ages in different environments. In trail running, children realize that they are disoriented, then hasten down the nearest trail or follow a least effort course. In some instances, the strategy is directed toward quickly finding out what is in the direction indicated by the travel corridor. Minimizing effort while seeking new information is a rational strategy: Trails are thought to go somewhere, other people may be on the trail, and sights along the trail may help to reestablish bearings. However, in their anxiety to achieve these ends, or because of fearinduced arousal, children often run to exhaustion. Even though a trail is fading or taking them farther into rugged territory, an anxious child is unlikely to reverse direction (Hill, 1999).
198
Edward H. Cornell and C. Donald Heth
The other strategies recorded to be used by lost children are classified as direction sampling, view enhancing, and staying put (Cornell & Heth, 1999; Hill, 1999; Syrotuck, 1979). Direction sampling and view enhancing are brief excursions to obtain more information about the surrounding terrain. These strategies involve procedures to return to an anchor point once a bearing or view has been sampled and not found to be informative. Direction sampling involves systematically trying routes that are seen to head off in different directions, whereas view enhancing may involve a singular off-route goal such as a visible peak. As an example of direction sampling, in one urban incident two 9-year-old girls used a playground knoll as an anchor point to search for familiar cues. One girl monitored from the knoll while the other progressed down a street to stand on her toes and look for their school. When she did not see it, she returned to the playground while her friend was still in view. The girls took turns going out from the knoll, moving clockwise around the knoll to search adjacent streets (Cornell & Heth, 1999). The view-enhancing strategy is illustrated by a rural incident: A 13-year-old boy reported that he interrupted his walking to climb a tree to scan for any house (Hill, 1999). Staying put is considered strategic when children who do so report that their way finding attempts might have led them farther from home. To be strategic, the reports should indicate metacognition, that the child has assessed that he or she lacks enough knowledge or skill to solve the problem independently. Cornell and Heth (1999) tell of an incident involving a 9-year-old girl who had wandered for a week in snow-covered wilderness. The pilot of a search and rescue aircraft spotted her posed on a rocky outcrop overlooking the icy surface of a lake. When recovered, she reported that she had selected the site because she could be seen and she could see a large area where somebody might come looking for her. In this case, staying put included a strategy to take the perspective of a searcher.
E. SUMMARY
The strategies created by children in response to way finding problems reveal cognitive development in natural settings. Both the design and subsequent modification of their attempted strategies are attuned to the resources, constraints, and outcomes they encounter during their adventures outdoors. Some strategies seem to be spontaneous. For example, memories of travel are encoded when children selectively attend to and comment on unique and salient objects they encounter. The attempts to train more prospective strategies indicate that 6- to 8-year-old children may not remember that, to return along a new route, landmarks must be permanent and localized with respect to actions along the paths. Children of 10 to 12 years are attempting strategies to
Children’s Way Finding
199
guide way finding off route, including selective attention to distant landmarks and registration of sites as anchors for excursions.
VIII. General Discussion The cross-cultural study of home range suggests a fundamental human development: Children between the ages of 3 and 12 are extending the spatial extent of their activities independently. Measures of home range have been ecologically valid and the norms for distance and dispersion of travel have been adopted into search and rescue methodologies (Colwell, 2005; Heth & Cornell, 2006b). Quick reunions of lost children with their parents have terminated extreme emotional duress, reduced the risk of criminal predation, and saved children from exposure and hypothermia (Heth & Cornell, 1998; Hill, 1997; LaValla, Stoffel, & Jones, 1995). In addition, the study of home range has from its onset noted sociocultural and environmental contexts of travel. In all countries where observations have occurred, youngsters are characterized as willing or eager to explore their nearby world. Patterns of activity affirm that home range is a geographic competency and that children learn the characteristics that distinguish large scale natural environments. For example, geographic and built heterogeneity provides the distinctive cues that become location indicators and landmarks. Intersections along routes are unique discontinuities, providing delimiters for route segments and opportunities for scanning the surroundings. As recognition processes improve, children can not only pilot according to the familiarity of landmarks, but they can also examine more distal objects and discover characteristic patterns of places. Geomorphic processes provide spatial correlation of natural features and children can also direct their activities based on zoned correspondences such as parks and pedestrian paths. The most impactful lesson accompanying the development of home range may be the vast scale of geographic space. Cognitive capacity, parental restrictions, and the limits of personal knowledge are often breached as children venture alone into new territory. As the distance traveled independently from home increases during childhood, so does the dispersion of travel. Walks that are more distant require a longer duration of memories and increases in area could mean as much as a quadratic expansion in the number of environmental features experienced. Naming, temporal ordering, segmentation of routes, and hierarchical organization of areas and landmarks are ways that children manage this cognitive load. As children move beyond the sight of their home, they use bearings along the horizon to direct their travel and organize a representation of the vastness. As one 6-year-old way finder explained when he pointed to his distant destination, ‘‘I know its there, under the sky.’’
200
Edward H. Cornell and C. Donald Heth
Toddlers begin exploring close to home and select direct routes, some without turns or loss of views of familiar sites. The adventures of older children include elements of risk, happenstance, and wonder. Hart (1979) provides fascinating evidence that independent children often go out of their way to take ‘‘shortcuts’’ that are frequently longer and more hazardous than the original routes that they know. Because the territory is unknown, planning before such adventures is often incomplete. The child will rely on strategies for way finding rather than specific route knowledge. The challenge is to size up new situations and react successfully. In this sense, adventure fosters adjusting to unanticipated and changing circumstances (Rogoff, Gauvain, & Gardner, 1987). When disoriented, the child may shift to activities that provide information about places and select among strategies to achieve an intermediate goal such as regaining orientation. It is important to emphasize that these processes of way finding have typically been first attempted as processes of route learning. While traveling on a familiar route, if a portion is forgotten, the child is way finding. The novelty of discovery and the realization of independent achievement motivate children to go beyond where they already know. In the context of local geography, they come across new play sites and meet different peers. When returning home, children can appreciate that errors in route reversal occur where they were distracted and where interesting but transient landmarks no longer appear. Children learn that finding their way back home is easier when they prospectively encode permanent objects in relation to paths, characteristics of places, distant landmarks, actions, and the order of events during outbound travel. Strategy development also involves adaptations to react to novel information, such as when children turn from off route after recognizing that they have never seen a feature of the path. These examples illustrate how the expansion of home range leads to world knowledge and the development of efficient cognitive abilities.
ACKNOWLEDGEMENTS Our research is supported by grants from the Natural Sciences and Engineering Research Council and the Search and Rescue Secretariat of Canada.
REFERENCES Acredolo, L. P., Pick, H. L., & Olsen, M. G. (1975). Environmental differentiation and familiarity as determinants of children’s memory for spatial location. Developmental Psychology, 11, 495 – 501.
Children’s Way Finding
201
Allen, G. L. (1981). A developmental perspective on the effects of ‘subdividing’ macrospatial experience. Journal of Experimental Psychology: Human Learning and Memory, 7, 120 – 132. Allen, G. L., & Ondracek, P. J. (1995). Age-sensitive cognitive abilities related to children’s acquisition of spatial knowledge. Developmental Psychology, 31, 934 – 945. Allen, G. L., Kirasic, K. C., Siegel, A. W., & Herman, J. F. (1979). Developmental issues in cognitive mapping: The selection and utilization of environmental landmarks. Child Development, 50, 1062 – 1070. Anderson, J., & Tindal, M. A. (1972). The concept of home range: New data for the study of territorial behavior. In W. Mitchell (Ed.), Environmental design: Research and practice (pp. 1 – 7). Los Angeles: University of California Press. Anooshian, L. J., & Owens, C. B. (March, 1979). Children’s acquisition of spatial location information in an unfamiliar environment. Paper presented at the biennial meeting of the Society for Research in Child Development, San Francisco, CA. Baldwin, D. A. (1995). Understanding the link between joint attention and language. In C. Moore & P. Dunham (Eds.), Joint attention: Its origins and role in development (pp. 131 – 158). Hillsdale, NJ: Lawrence Erlbaum Associates. Bayley, N. (1969). Bayley scales of infant development. New York: The Psychological Corporation. Biel, A., & Torell, G. (1982). Experience as a determinant of children’s neighborhood knowledge. Go¨teborg Psychological Reports, 12, No. 9. Bjorklund, D. F., & Harnishfeger, K. K. (1990). Children’s strategies: Their definition and origins. In D. Bjorklund (Ed.), Children’s strategies: Contemporary views of cognitive development (pp. 309 – 323). Hillsdale, NJ: Lawrence Erlbaum Associates. Blades, M. (1989). Children’s ability to learn about the environment from direct experience and from spatial representations. Children’s Environments Quarterly, 6, 4 – 14. Blades, M. (1997). Research paradigms and methodologies for investigating children’s way finding. In N. Foreman & R. Gillet (Eds.), A handbook of spatial research paradigms and methodologies. Vol. 1: Spatial cognition in the child and adult (pp.103 – 129). East Sussex, UK: Psychology Press. Bovy, P. H. L., & Stern, E. (1990). Route choice: Way finding in transport networks. Dordrecht, Netherlands: Kluwer Academic. Brown, G. D. A. (1997). Formal models of memory for serial order: A review. In M. A. Conway (Ed.), Cognitive models of memory (pp. 47 – 77). Cambridge, MA: MIT Press. Butterworth, G., & Grover, L. (1999). The origins of referential communication in human infancy. In P. Lloyd & C. Fernyhough (Eds.), Lev Vygotsky: Critical assessments: Vol. II. Thought and language (pp. 3 – 22). New York: Routledge. Butterworth, G., & Jarrett, N. (1991). What minds have in common is space: Spatial mechanisms serving joint visual attention in infancy. British Journal of Developmental Psychology, 9, 55 – 72. Carr, S., & Schlisser, D. (1969). The city as a trip. Environment and Behavior, 1, 7 – 35. Chase, W. G. (1983). Spatial representations of taxi drivers. In D. Rogers & J. Sloboda (Eds.), Acquisition of symbolic skills (pp. 391 – 405). New York: Plenum. Chase, W. G. (1986). Visual information processing. In K. Boff, L. Kaufman & J. Thomas (Eds.), Handbook of perception and human performance. Vol. II: Cognitive processes and performance (pp. 1 – 71). New York: John Wiley. Chase, W. G., & Chi, M. T. H. (1981). Cognitive skill: Implications for spatial skill in large-scale environments. In J. Harvey (Ed.), Cognition, social behavior, and the environment (pp. 111 – 136). Hillsdale, NJ: Lawrence Erlbaum Associates.
202
Edward H. Cornell and C. Donald Heth
Clearfield, M. W. (2004). The role of crawling and walking experience in infant spatial memory. Journal of Experimental Child Psychology, 89, 214 – 241. Colwell, M. (2005). Incident Commander (Version 4.0) [Computer software]. Vancouver, British Columbia: SAR Technology. Conning, A. M., & Byrne, R. W. (1984). Pointing to preschool children’s spatial competence: A study in natural settings. Journal of Environmental Psychology, 4, 165 – 175. Cornell, E. H., & Heth, C. D. (1996). Distance traveled during urban and suburban walks led by 3- to 12-year-olds: Tables for search managers. Response! The Journal of the National Association for Search and Rescue, 15, 6 – 9. Cornell, E. H., & Heth, C. D. (1999). Report of a missing child. In K. Hill (Ed.), Lost person behavior (pp. 31 – 44). Ottawa, Ontario: The National Search and Rescue Secretariat. Cornell, E. H., & Heth, C. D. (2004). Memories of travel: Dead reckoning within the cognitive map. In G. Allen (Ed.), Human spatial memory: Remembering where (pp. 191 – 215). Mahwah, NJ: Erlbaum. Cornell, E. H., & Hill, K. A. (2006). The problem of lost children. In C. Spencer & M. Blades (Eds.), Children and their environments: Learning, using, and designing spaces. (pp. 24 – 40). Cambridge, UK: Cambridge University Press. Cornell, E. H., Heth, C. D., & Rowat, W. L. (1992). Way finding by children and adults: Response to instructions to use look-back and retrace strategies. Developmental Psychology, 28, 328 – 336. Cornell, E. H., Heth, C. D., & Skoczylas, M. J. (1999). The nature and use of route expectancies following incidental learning. Journal of Environmental Psychology, 19, 209 – 229. Cornell, E. H., Heth, C. D., & Alberts, D. M. (1994). Place recognition and way finding by children and adults. Memory & Cognition, 22, 633 – 643. Cornell, E. H., Heth, C. D., & Broda, L. S. (1989). Children’s way finding: Response to instructions to use environmental landmarks. Developmental Psychology, 25, 755 – 764. Cornell, E. H., Sorenson, A., & Mio, T. (2003). Human sense of direction and way finding. Annals of the American Association of Geographers, 93, 402 – 428. Cornell, E. H., Heth, C. D., Kneubuhler, Y., & Sehgal, S. (1996). Serial position effects in children’s route reversal errors: Implications for police search operations. Applied Cognitive Psychology, 10, 301 – 326. Cornell, E. H., Hadley, D. C., Sterling, T. M., Chan, M. A., & Boechler, P. (2001). Adventure as a stimulus for cognitive development. Journal of Environmental Psychology, 21, 219 – 231. Cousins, J. H., Siegel, A. W., & Maxwell, S. E. (1983). Way finding and cognitive mapping in large-scale environments: A test of a developmental model. Journal of Experimental Child Psychology, 35, 1 – 20. Day, M. C. (1975). Developmental trends in visual scanning. In H. Reese (Ed.), Advances in child development and behavior (Vol. 10, pp. 153 – 193). New York: Academic Press. Doherty, S. E. (1984). Developmental differences in cue recognition at spatial decision points. Unpublished doctoral dissertation, Department of Geography, University of California, Santa Barbara. Flavell, J. H. (1970). Developmental studies of mediated memory. In H. Reese & L. Lipsitt (Eds.), Advances in child development and child behavior (Vol. 5, pp. 181 – 211). New York: Academic Press. Gallistel, C. R. (1993). The organization of learning. Cambridge, MA: MIT Press. Gatty, H. (1958). Nature is your guide. London: Collins.
Children’s Way Finding
203
Gauvain, M. (1993). The development of spatial thinking in everyday activity. Developmental Review, 13, 92 – 121. Gibson, E. J. (1969). Principles of perceptual learning and development. New York: Appleton Century Crofts. Gibson, J. J. (1979). The ecological approach to visual perception. Boston: Houghton Mifflin. Golledge, R. G. (1978). Learning about urban environments. In T. Carlstein, D. Parkes, & N. Thrift (Eds.), Timing space and spacing time. Vol. I. Making sense of time (pp. 76 – 98). London: Edward Arnold. Golledge, R. G. (1995). Primitives of spatial knowledge. In T. Nyerges, D. Mark, R. Laurini, & M. Egenhofer (Eds.), Cognitive aspects of human–computer interaction for geographic information systems (pp. 29 – 44). Dordrecht: Kluwer Academic. Golledge, R. G. (1999). Human way finding and cognitive maps. In R. Golledge (Ed.), Way finding behavior: Cognitive mapping and other spatial processes (pp. 5 – 45). Baltimore: John Hopkins Press. Golledge, R. G., Gale, N., Pellegrino, J. W., & Doherty, S. (1992). Spatial knowledge acquisition by children: Route learning and relational distances. Annals of the Association of American Geographers, 82, 223 – 244. Golledge, R. G., Ruggles, A. J., Pellegrino, J. W., & Gale, N. D. (1993). Integrating route knowledge in an unfamiliar neighborhood: Along and across route experiments. Journal of Environmental Psychology, 13, 293 – 307. Golledge, R. G., Smith, T. R., Pellegrino, J. W., Doherty, S., & Marshal, S. P. (1985). A conceptual model and empirical analysis of children’s acquisition of spatial knowledge. Journal of Environmental Psychology, 5, 125 – 152. Goodchild, M. F. (2001). A geographer looks at spatial information theory. In D. Montello (Ed.), Spatial information theory: Foundations of geographic information science (pp. 1 – 13). Berlin: Springer-Verlag. Gould, R. A. (1969). Yiwara. Foragers of the Australian desert. New York: Scribner. Gowans, F. R. (1989). A fur trade history of Yellowstone Park: Notes, documents, maps. Orem, UT: Mountain Grizzly Publications. Greidanus, E. (2003). The representation of dead reckoning during a suburban walk. Unpublished Honor’s thesis, Department of Psychology, University of Alberta, Edmonton, Alberta, Canada. Grieve, D. W., & Gear, R. J. (1966). The relationships between length of stride, step frequency, time of swing and speed of walking for children and adults. Ergonomics, 5, 379 – 399. Hall, G. S. (1897). A study of fears. American Journal of Psychology, 8, 147 – 163. Hart, R. A. (1979). Children’s experience of place. New York: Irvington. Hart, R. A., & Moore, G. T. (1973). The development of spatial cognition: A review. In R. Downs & D. Stea (Eds.), Image and environment (pp. 246 – 288). Chicago: Aldine. Hasher, L., & Zachs, R. T. (1979). Automatic and effortful processes in memory. Journal of Experimental Psychology: General, 108, 356 – 388. Heft, H. (1996). The ecological approach to navigation: A Gibsonian perspective. In J. Portugali (Ed.), The construction of cognitive maps (pp. 105 – 132). London: Kluwer Academic. Heth, C. D., & Cornell, E. H. (1980). Three experiences affecting spatial discrimination learning by ambulatory children. Journal of Experimental Child Psychology, 30, 246 – 264. Heth, C. D., & Cornell, E. H. (1998). Characteristics of travel by persons lost in Albertan wilderness areas. Journal of Environmental Psychology, 18, 223 – 235.
204
Edward H. Cornell and C. Donald Heth
Heth, C. D., & Cornell, E. H. (2006a). A Geographic Information System for managing search for lost persons. In G. Allen (Ed.), Applied spatial cognition: From research to cognitive technology. Mahwah, NJ: Lawrence Erlbaum Associates. Heth, C. D., & Cornell, E. H. (2006b). First response incident command (Version ß3) [Computer software]. Edmonton, Alberta: Expert Spatial Systems. Heth, C. D., Cornell, E. H., & Alberts, D. M. (1997). Differential use of landmarks by 8- and 12-year-old children during route reversal navigation. Journal of Environmental Psychology, 17, 199 – 213. Heth, C. D., Cornell, E. H., & Flood, T. L. (2002). Self-ratings of sense of direction and route reversal performance. Applied Cognitive Psychology, 16, 309 – 324. Hill, K. A. (1997). Managing the lost person incident. Chantilly, VA: National Association for Search & Rescue. Hill, K. A. (Ed.) (1999). The psychology of lost. In K. Hill (Ed.), Lost person behavior (pp. 1 – 15). Ottawa, Ontario: National Search and Rescue Secretariat. Howard, I. P., & Templeton, W. B. (1966). Human spatial orientation. New York: Wiley. Jonsson, E. (2002). Inner navigation: Why we get lost and how we find our way. New York: Scribner. Kirasic, K. C., Siegel, A. W., & Allen, G. L. (1980). Developmental changes in recognition-in-context memory. Child Development, 51, 302 – 305. Kitchin, R., & Freundschuh, S. (2000). Cognitive mapping: Past, present and future (p. 1). New York: Routledge. Kitchin, R., & Blades, M. (2002). The cognition of geographic space. New York: Tauris. Knobloch, H., & Pasamanick, B. (Eds.) (1974). Gesell and Amatruda’s developmental diagnosis (3rd ed.). New York: Harper & Row. LaValla, P., Stoffel, R., & Jones, A. (1995). Search is an emergency: A text for managing search operations. Olympia, WA: Emergency Response Institute. Lehnung, M., Haaland, V., Pohl, J., & Leplow, B. (2001). Compass versus finger pointing tasks: The influence of different methods of assessment on age-related orientation performance in children. Journal of Environmental Psychology, 21, 283 – 289. Liben, L. S. (1988). Conceptual issues in the development of spatial cognition. In J. Stiles-Davies, M. Kritchevsky, & U. Bellugi (Eds.), Spatial cognition: Brain bases and development (pp. 167 – 194). Hillsdale, NJ: Erlbaum Associates. Loomis, J. M., Klatzky, R. L., Golledge, R. G., & Philbeck, J. W. (1999). Human navigation by path integration. In R. Golledge (Ed.), Wayfinding behavior: Cognitive mapping and other spatial processes (pp. 125 – 151). Baltimore: Johns Hopkins. Lynch, K. (1960). The image of the city. Cambridge, MA: MIT Press. Matthews, M. H. (1987). Gender, home range and environmental cognition. Transactions of the Institute of British Geographers (New Series), 12, 43 – 56. Matthews, M. H. (1992). Making sense of place: Children’s understanding of large scale environments. Savage, MD: Barnes & Noble. McVey, V. (1989). The Sierra Club way finding book. San Francisco: Little Brown & Company. Montello, D. R. (1998, September). What it means to be lost. Proceedings of the Search and Rescue Secretariat of Canada (SARSCENE), Banff, Alberta, Canada. Montello, D. R., & Pick, H. L. (1993). Integrating knowledge of vertically aligned large-scale spaces. Environment and Behavior, 25, 457 – 484. Moore, R., & Young, D. (1978). Childhood outdoors: Toward a social ecology of the landscape. In I. Altman & J. Wohlwill (Eds.), Human behavior and the environment: Vol. 3. Children and the environment (pp. 83 – 130). New York: Plenum Press.
Children’s Way Finding
205
Munroe, R. L., & Munroe, R. H. (1971). Effect of environmental experience on spatial ability in an East African society. Journal of Social Psychology, 83, 15 – 22. Munroe, R. L., Munroe, R. H., & Bresler, A. (1985). Precursors of spatial ability: A longitudinal study among the Logoli of Kenya. Journal of Social Psychology, 125, 23 – 33. Neath, I., & Crowder, R. G. (1990). Schedules of presentation and temporal distinctiveness in human memory. Journal of Experimental Psychology: Learning, Memory & Cognition, 16, 316 – 327. Nelson, R. K. (1969). Hunters of the northern ice. Chicago: University of Chicago Press. Newcombe, N. S., & Huttenlocher, J. (2000). Making space: The development of spatial representation and reasoning. Cambridge, MA: MIT Press. Parkman, F. (1872/1920). The Oregon trail: Sketches of prairie and Rocky-mountain life. Boston, MA. Little, Brown, and Company. Piaget, J., & Inhelder, B. (1967). The child’s conception of space. New York: W.W. Norton. Piaget, J., Inhelder, B., & Szeminska, A. (1960). The child’s conception of geometry. New York: Basic Books. Pick, H. L. (1988). Perceptual aspects of spatial cognitive development. In J. Stiles-Davis, M. Kritchevsky, & U. Bellugi (Eds.), Spatial cognition: Brain bases and development (pp. 145 – 156). Hillsdale, NJ: Lawrence Erlbaum Associates. Presson, C. C., & Montello, D. R. (1988). Points of reference in spatial cognition: Stalking the elusive landmark. British Journal of Developmental Psychology, 6, 378 – 381. Presson, C. C., & Somerville, S. C. (1985). Beyond egocentrism: A new look at the beginnings of spatial representation. In H. Wellman (Ed.), Children’s searching: The development of search skill and spatial representation (pp. 1 – 26). Hillsdale, NJ: Lawrence Erlbaum Associates. Rayner, K., & Pollatsek, A. (1992). Eye movements and scene perception. Canadian Journal of Psychology, 46, 342 – 376. Rieser, J. J. (1983). The generation and early development of spatial inference. In H. Pick & L. Acredolo (Eds.), Spatial orientation: Theory, research, and application (pp. 39 – 71). New York: Plenum Press. Rieser, J. J. (1999). Dynamic spatial orientation and the coupling of representation and action. In R. Golledge (Ed.), Way finding behavior: Cognitive mapping and other spatial processes (pp. 168 – 190). Baltimore, MD: Johns Hopkins University Press. Rieser, J. J., Doxsey, P. A., McCarrell, N. S., & Brooks, P. H. (1982). Wayfinding and toddlers’ use of information from an aerial view of a maze. Developmental Psychology, 18, 714 – 720. Rieser, J. J., Garing, A. E., & Young, M. F. (1994). Imagery, action, and young children’s spatial orientation: It’s not being there that counts, it’s what one has in mind. Child Development, 65, 1262 – 1278. Rogoff, B., Gauvain, M., & Gardner, W. (1987). The development of children’s skills in adjusting plans to circumstances. In S. Friedman, E. Scholnick, & R. Cocking (Eds.), Blueprints for thinking: The role of planning in cognitive development (pp. 303 – 320). New York: Cambridge University Press. Rose-Jacobs, R. (1983). Development of gait at slow, free, and fast spurts in 3- and 5-year-old children. Physical Therapy, 63, 1251 – 1259. Schmuckler, M. A., & Tsang-Tong, H. Y. (2000).The role of visual and body movement information in infant search. Developmental Psychology, 36, 499 – 510. Schneider, W., & Pressley, M. (1989). Memory development between 2 and 20. New York: Springer-Verlag.
206
Edward H. Cornell and C. Donald Heth
Sholl, J. (1996). From visual information to cognitive maps. In J. Portugali (Ed.), The construction of cognitive maps (pp. 157 – 186). Dordrecht, The Netherlands: Kluwer Academic. Siegel, A. W., & White, S. H. (1975). The development of spatial representations of large-scale environments. In H. W. Reese (Ed.), Advances in child development and behavior (Vol. 10, pp. 9 – 55). New York: Academic Press. Siegel, A. W., Kirasic, K. C., & Kail, R. V. (1978). Stalking the elusive cognitive map: The development of children’s representations of geographic space. In I. Altman & J. Wohlwill (Eds.), Human behavior and environment (Vol. 3, pp. 223 – 258). New York: Plenum. Siegler, R. S. (1996). Emerging minds: The process of change in children’s thinking. New York: Oxford University Press. Silverman, I., Choi, J., Mackewn, A., Fisher, M., Moro, J., & Olshansky, E. (2000). Evolved mechanisms underlying way finding: Further studies on the hunter–gatherer theory of spatial sex differences. Evolution and Human Behavior, 21, 201 – 213. Spencer, C., & Darvizeh, Z. (1983). Young children’s place-descriptions, maps, and routefinding: A comparison of nursery school children in Iran and Britain. International Journal of Early Childhood, 15, 26 – 31. Stea, D. (1970). Home range and the use of space. In L. Pastalan & D. Carson (Eds.), The spatial behavior of older persons (pp. 139 – 147). Ann Arbor, MI: University of Michigan. Stea, D., Blaut, J. M., & Stephens, J. (1996). Cognitive mapping and culture. In J. Portugali (Ed.), The construction of cognitive maps (pp. 345 – 360). London: Kluwer Academic. Sutherland, D. H., Olshen, R., Cooper, L., & Woo, S.Y. (1980). The development of mature gait. Journal of Bone and Joint Surgery, 62, 336 – 353. Syrotuck, W. G. (1979). Analysis of lost person behavior: An aid to search planning. Westmoreland, NY: Arner Publications. Tindal, M. A. (1971). The home range of Black elementary school children: An exploratory study in the measurement and comparison of home range. Unpublished master’s thesis, Clark University, Worcester, MA. Tolman, E. C. (1948). Cognitive maps in rats and men. Psychological Review, 55, 189 – 208. Torell, G. (1990). The acquisition and development of environmental cognition in children. Go¨teborg Psychological Reports, 20, 1 – 7. Torell, G., & Biel, A. (1985). Parents’ influence on children’s cognitive maps and on activity ranges in residential neighborhoods. In T. Ga¨rling & J. Valsiner (Eds.), Children in environments. New York: Plenum Press. Tyler, D., & McKenzie, B. E. (1990). Spatial updating and training effects in the first year of life. Journal of Experimental Child Psychology, 50, 445 – 461. Valentine, G. (1997). ‘Oh yes I can.’ ‘Oh no you can’t’: Children and parents’ understandings of kids’ competence to negotiate public space safely. Antipode, 29, 65 – 89. Whiting, B. B., & Edwards, C. P. (1988). Children of different worlds: The formation of social behavior. Cambridge, MA: Harvard University Press. Woolley, H., Dunn, J., Spencer, C.P., Short, T., & Rowley, G. (2000). Children describe their experiences of the city centre: A qualitative study of the fears and concerns which may limit their full participation. Landscape Research, 24, 287 – 301.
THE DEVELOPMENT AND NEURAL BASES OF FACIAL EMOTION RECOGNITION
Jukka M. Leppa¨nen HUMAN INFORMATION PROCESSING LABORATORY, DEPARTMENT OF PSYCHOLOGY, UNIVERSITY OF TAMPERE, FINLAND
Charles A. Nelson HARVARD MEDICAL SCHOOL, BOSTON CHILDREN’S HOSPITAL, BOSTON, MA 02215, USA
I. BEHAVIORAL STUDIES OF FACIAL EXPRESSION RECOGNITION
A. FACIAL EXPRESSION PROCESSING BY ADULTS B. DISCRIMINATION AND CATEGORIZATION OF FACIAL EXPRESSIONS BY INFANTS C. SPONTANEOUS REACTIONS TO FACIAL EXPRESSIONS IN INFANTS D. THE ROLE OF EXPERIENCE IN THE DEVELOPMENT OF FACIAL EXPRESSION PROCESSING E. SUMMARY II. NEURAL BASIS OF FACIAL EXPRESSION RECOGNITION
A. NEURAL BASIS OF ADULTS’ PROCESSING OF FACIAL EMOTIONS B. DEVELOPMENT OF NEURAL SYSTEMS IMPLICATED IN FACIAL EXPRESSION PROCESSING C. SUMMARY III. DEVELOPMENTAL MECHANISM IV. CONCLUSIONS REFERENCES
Certain emotional reactions such as happiness, anger, fear, and sadness are linked to specific facial expressions that are similarly produced across different cultures (Ekman, 1999). Facial expressions have traditionally been viewed as unintentional displays or ‘‘read-outputs’’ of emotional reactions but they can also be conceptualized as communicatory signals that evolved for rapid transmission of emotional information between conspecifics (Blair, 2003). Caregivers’ facial expressions may, for example, convey specific information about novel objects or situations to a developing child. This is demonstrated by findings that infants will crawl over a visual cliff to approach a novel toy if their mother’s face 207 Advances in Child Development and Behavior R Kail (Editor)
ß 2006 Elsevier B.V. All rights reserved.
208
Jukka M. Leppa¨nen and Charles A. Nelson
displays a happy expression, whereas they avoid the cliff if their mother poses a fearful facial expression (Sorce et al., 1985). Given the importance of the information conveyed by facial expressions, it seems reasonable to assume that humans are comparatively adept at perceiving and recognizing those facial expressive cues that denote specific emotional information. This assumption is borne out by findings that certain facial expressions are universally recognized (Elfenbein & Ambady, 2002), and by variety of findings showing that facial expressions are recognized and responded to very efficiently and automatically (O¨hman, 2002). The goal of the present chapter is to review the development of the ability to recognize facial expressions. Findings of universal recognition of facial expressions led many to assume that these skills have a strong biological basis. It has been argued, for example, that vigilance for facial expressions co-evolved with the evolution of communicative facial expressions (O¨hman, 2002). This evolutionary process might have resulted in the creation of specialized neural mechanisms that subserve facial expression processing (Nelson, 1987). It is, however, unclear how specific the innate preparedness for facial expression processing is and what role, if any, experience plays in the development of this skill. One possibility is that the innately specified component involves a domainspecific neural system that is dedicated to the processing of facial expressions and that matures largely independently of experience. From this perspective, the basic organization of the facial expression processing system is laid down in the early stages of ontogeny and there may be little change in this system thereafter. An alternative for this nativist view is that, rather than a domain-specific neural system, the innate preparedness for recognition of facial expressions involves a general perceptual-cognitive system that is relevant but not specialized for facial expression processing (although it may become specialized for this function over time, see Karmiloff-Smith, 1998). From this viewpoint, the development of adult-like organization and function of facial expression processing is critically dependent on experience and it is likely to emerge at relatively late stages of postnatal development. The goal of the present chapter is to discuss and evaluate these alternative views in light of the existing literature on the development and neural basis of facial expression processing. We first focus on behavioral investigations of facial expression processing in adults and developing children. The goal of this section is to describe how adults process facial expressions and also to discuss the developmental origins of different aspects of adult facial expression processing (i.e., to what extent they are present at the early stages of development and what is the role of experience in their development). After reviewing the behavioral studies, we turn our attention to neuropsychological and neuroimaging-type studies. Here, we first summarize what is currently known about the neural basis of facial expression processing in adults. We then review studies examining the
Facial Emotion Recognition
209
development of these neural circuits with the aim of providing some insight into the initial organization and function of these systems as well as how their response properties change over the course of postnatal development. As we shall see, the findings reviewed here do not conform to the extreme view that the systems underlying facial expression processing are entirely innately specified. Instead, various sources of evidence suggest that experience plays an important role in the development of these systems. One possible developmental mechanism that incorporates the influences of biological preparedness as well as experience on the development of the ability to recognize facial expressions is discussed in the end of this chapter.
I. Behavioral Studies of Facial Expression Recognition A. FACIAL EXPRESSION PROCESSING BY ADULTS
Facial expressions are produced by facial muscle movements that result in specific patterns of changes in the visual appearance of the face. The expression of joy, for example, involves contraction of two facial muscles: zygomaticus major, which pulls lip corners up and backwards and produces a smile on to the face; and orbicularis oculi, which orbits the eye and causes several appearance changes around the eye region, including narrowing of the eye opening and crow’s-feet wrinkles in the eye corner (Ekman & Friesen, 1978). All facial expressions involve, then, characteristic types of changes in individual facial features (e.g., mouth), facial configuration (i.e., the spatial relations of key facial features such as eyes, mouth, and nose), and a characteristic type of facial motion when all these changes unfold over time. The perceiver’s task is to discriminate those facial cues that are critical for the recognition of different expressions and to categorize individual instances of a particular facial expression into a common class. 1. Categorization of Facial Expressions Following the early findings of universal recognition of a limited set of facial expressions, researchers have attempted to examine in more detail how adults perceive and recognize facial expressions. An important discovery of these investigations has been that, although facial expression varies continuously from a neutral face to different prototypical expressions or from one expression to another, perceptual representations of these expressions are categorical. That is, adults tend to classify facial expressions as one expression or another but they are relatively insensitive to intermediate expressions (Etcoff & Magee, 1992; Young et al., 1997). Such categorical representations may, in fact, serve the purposes of adaptive behavior because discrimination of one emotional expression from
210
Jukka M. Leppa¨nen and Charles A. Nelson
another may be of crucial importance with respect to behavior, whereas detection of small variations within a particular emotional category is probably not as useful. Categorical perception effects have been studied by using computergenerated ‘‘morphs’’ of facial expressions that represent regular steps of a transition from one prototypical facial expression to another (Figure 1a). When adult observers are asked to identify the emotion expressed in each of these morphs, expressions are consistently identified as one prototypical expression or another and there is a clear category boundary at which the identification changes abruptly (Figure 1b). In addition, discrimination of the differences between facial expressions is better for pairs of facial expressions taken from different sides of the category boundary as compared to pairs of facial expressions taken within a particular emotion category, even when the physical differences are kept identical in the two conditions (Etcoff & Magee, 1992; Young et al., 1997; see also Campanella et al., 2002a,b). This shows that categorical perception of facial expressions reduces one’s ability to detect small and perhaps meaningless variations within a particular expression category. It is important to add, however, that adult observers are not ‘‘blind’’ to physical differences between expressions that have been assigned to the same category. Adults, for example, typically rate morphed expressions that are closest to the prototypical facial expression as better examples of that particular expression than morphs near category boundaries (de Gelder, Teunisse, & Benson, 1997). Close to the prototype morphs are generally the most efficiently recognized (Young et al., 1997). Overall, these findings suggest that facial expressions are likely to be represented as prototypes, which (a) function like ‘‘perceptual magnets’’ that pull similar-looking facial expressions together and, hence, create perceptual categories and (b) allow within-category discriminations (i.e., comparisons between the idealized prototype and individual examples). In this sense, categorical representations of facial expressions are very similar to categorical representation of other types of information (e.g., phonemes, see Young et al., 1997 for discussion). What information do adults use to recognize different facial expressions? Several findings show that, although individual facial features can have some emotional signal value, configural information is more important in the recognition of facial expressions (i.e., spatial relations of key facial features). First, presenting faces upside-down (i.e., face inversion) significantly impairs recognition of facial expressions (McKelvie, 1995; White, 1999). The categorical perception effects summarized previously also disappear if faces are inverted (de Gelder, Teunisse, & Benson, 1997). Because face inversion is known to be more detrimental to the processing of facial configurations than to the processing of facial features, face inversion effects are usually taken as evidence for configural coding. Second, the emotional meaning of isolated facial expressive features (e.g., smiling mouth) is significantly altered when these features are
a)
c)
Percent responses
100
happy sad
50
Facial Emotion Recognition
b)
0 10% 20% 30% 40% 50% 60% 70% 80% 90% Percent morph
211
Fig. 1. (a) Examples of computer-generated morphs of facial expressions that show small 10% steps of a transformation of facial expression from happy to sad. (b) Percentages of ‘‘happy’’ and ‘‘sad’’ responses to happy–sad morphs from an experiment in which 18 adults were asked to categorize each face as ‘‘happy’’ or ‘‘sad.’’ (c) Examples of simple eyebrow–mouth configurations and complete schematic faces, which achieved lowest (top) and highest (bottom) ratings of negative affect in a study by Lundqvist, Esteves, & O¨hman (2004). Modified from Lundqvist et al. (2004) with permission from Psychology Press. ß Taylor & Francis.
212
Jukka M. Leppa¨nen and Charles A. Nelson
presented in an odd configural context (e.g., as an undivided part of a face with fearful eyes, see Calder et al., 2000; White, 2000). The novel configuration formed by a smiling mouth and fearful eyes may override the impact of individual facial parts, again suggesting that configural information is more important than partbased information in facial expression processing. The third source of evidence for this comes from a study showing that simple, schematic eyebrow–mouth configurations (Figure 1c) can convey emotional impression that is not simply an addition to the impression conveyed by their constituent parts and highly similar to the impression conveyed by complete schematic faces including all key facial elements (Figure 1c, Lundqvist, Esteves, & O¨hman, 2004). Schematic faces lack information that is present in real faces and therefore, they may not provide a complete picture of the processes involved in the recognition of real facial expressions. However, schematic faces may do a better job than real faces in approximating stored facial expression prototypes because these expressions contain the important low-level features of facial expression but they are free from interindividual ‘‘noise’’ present in pictures of real faces (O¨hman, 2002). 2. Automatic Responses to Facial Expressions Not only do adults categorize facial expressions into different classes but they also respond to some of these expressions in a very efficient and automatic fashion. Two separate lines of research illustrate this phenomenon. First, electromyographic recordings of facial muscle activity have shown that passive observation of happy and angry faces evoke emotion-specific facial muscle activity in the perceiver’s own face (Dimberg, 1990; Surakka & Hietanen, 1998). These facial muscle reactions occur within 400 ms from stimulus onset (Dimberg & Thunberg, 1998), they are difficult to restrain voluntarily (Dimberg, Thunberg, & Grunedal, 2002), and can be evoked by subliminally presented (i.e., not consciously recognized) happy and angry faces (Dimberg, Thunberg, & Elmehed, 2000). It has been suggested that the rapid facial responses reflect automatically controlled emotional responses to facial expressions (Dimberg, Thunberg, & Grunedal, 2002). A second source of evidence for automatic decoding of facial expressions comes from studies demonstrating that adults are especially quick at detecting the presence of certain negative facial expressions in their visual field (e.g., angry face). This has been demonstrated by studies employing a face-in-the-crowd paradigm, in which the perceiver’s task is to detect a discrepant (target) face that is embedded in a matrix of several distractor faces. Results consistently show that the target is detected more quickly when it displays an angry expression as compared to happy expressions (Fox et al., 2000; Hansen & Hansen, 1988; O¨hman, Lundqvist, & Esteves, 2001; Tipples, Atkinson, & Young, 2002). Interestingly, increasing the crowd size (i.e., the number of distractors), which typically slows down controlled or effortful target search, has comparatively
Facial Emotion Recognition
213
little effect on search times for negative facial expressions (Eastwood, Smilek, & Merikle, 2001). This provides further support for the hypothesis that the detection of facial expressions may occur relatively automatically. Together, these studies suggest that some facial expressions may entail automatic responses involving emotional response components as well as rapid attention orienting. It is important to note, however, that these reactions are not likely to be based on a detailed analysis of the stimulus and, thus, they do not necessarily entail categorization of facial expressions (as discussed previously). Furthermore, as we discuss later in this chapter, rapid emotional responses to facial expressions and categorization of facial expressions may be mediated by partially separable neural mechanisms. 3. Development of Facial Expression Processing The studies just reviewed indicate (a) adults discriminate between different categories of facial expressions and (b) some facial expressions elicit automatic responses in adult observers. What is the developmental origin of these abilities? One possibility is that the organization of the system underlying these effects is innately specified, and there are few changes in these systems during development. One could envisage, for example, that the categorical perception effects described previously arise from ‘‘natural prototypes’’ that represent the basic elements of a limited set of facial expressions and function like perceptual magnets to assign the observed expression as exemplifying the nearest prototype (see de Gelder, Teunisse, & Benson,1997, for a discussion of such a model). An alternative possibility is that the innate preparedness involves more general perceptual mechanisms and that facial expression prototypes are acquired via the same mechanisms that make possible prototypical representation of other types of stimuli. In this view, the acquisition of representations of different facial expressions relies heavily on experience and it may be expected to occur within the same time frame as the acquisition of other types of object representations (e.g., different geometric forms or animal species). These two views differ, then, in their predictions regarding (a) whether adult-like processing of facial expressions is present in the early stages of development (i.e., in infants), (b) whether there are changes in the organization of systems underlying facial expression processing over time, and (c) whether experience plays an important role in development. Studies that have either directly or indirectly addressed these questions are reviewed in the next section. B. DISCRIMINATION AND CATEGORIZATION OF FACIAL EXPRESSIONS BY INFANTS
The processing of basic facial expressions, most commonly involving happy, surprised, fearful, sad, and angry facial expressions, has been examined in
214
Jukka M. Leppa¨nen and Charles A. Nelson
infants ranging in age from a few hours to several months. These studies have focused either on the infant’s ability to discriminate facial expressions (i.e., perceive the difference between two or more stimuli) or on the ability to categorize facial expressions (classify discriminable instances of a particular stimulus into a common class). Detailed reviews of the existing findings have been published (de Haan & Nelson, 1998; Nelson, 1987; Nelson & de Haan, 1997; Walker-Andrews, 1997; Walker-Andrews & Dickson, 1997) and, therefore, we limit our discussion here to a brief summary of the findings that are most germane to the present discussion.1 A widely used method to study infants’ discriminatory abilities is a habituation paradigm in which the infant is first presented with a particular stimulus until habituation is obtained (i.e., looking time is decreased below a certain criterion time). After that, the stimulus or some aspect of it is changed. Discrimination of the stimulus change is inferred from longer looking times to the novel stimulus as compared to the one to which the infant had been habituated. By using this type of a paradigm, Field et al. (1982) tested whether 36-hour-old newborns could discriminate between happy, surprised, and sad facial expressions posed by a live model. Infants were habituated to one of these expressions after which the model changed the expression on her face. Looking times increased each time the expression was changed, suggesting that the newborn infants were sensitive to the expression changes. Furthermore, infants tended to fixate the most distinctive feature of each expression; specifically, the eye region of surprised expressions and mouth region of happy or sad expressions. Other studies examining the discrimination of facial expressions have usually been conducted in older, 3 – 7-month-old infants, and the findings from these studies generally show that, at these ages, infants can discriminate most expression contrasts (for a review, see de Haan & Nelson, 1998). Three-month-olds also discriminate even low-intensity smiles from neutral faces, and possibly also variations in the intensity of smiles (Kuchuk, Vibbert, & Bornstein, 1983). Together, these studies suggest that the ability to discriminate facial expression emerges remarkably early. These findings are often taken as support for the hypothesis of biological preparedness for discrimination of facial expressions. Caution must be exercised in accepting this conclusion, however. First, findings of newborn facial expression discrimination can be criticized on methodological grounds (e.g., because live faces were used, there was little control over the precise emotions produced; moreover, observers were not blinded to the emotions being produced; for elaboration, see de Haan & Nelson, 1998). 1
There are several studies on the development of the processing of multimodal emotional expressions (see Walker-Andrews, 1997 for a review). However, because these studies involved both vocal and facial information, they are not considered here.
Facial Emotion Recognition
215
Second, even though it is clear from these studies that infants respond differentially to different facial expressions, it is less clear whether this is based on detection of facial expression change per se or whether infants responded to some other concomitant feature change (e.g., open vs. closed mouth). Newborns especially may base their discrimination on salient stimulus features because of the limits of newborn vision. Newborns are sensitive to a much lower and more restricted range of spatial frequencies than adults (Banks & Salapatek, 1983) and, during the first two months of postnatal life, infants show little attention to the internal features of the face (Bushnell, 1979; Maurer & Salapatek, 1976). This makes it unlikely that newborns have access to all those cues that adults use to recognize facial expressions (Figure 2). Third, and perhaps most importantly, the observation that infants can discriminate facial expressions does not necessarily entail preparedness for this particular skill. Despite the limitations of infant vision, infants show a range of rudimentary visual abilities at birth. These include an ability to discriminate and process patterns and forms (i.e., geometric shapes, stimulus orientation, angles), combinations of features, stimulus compounds, and spatial relations between stimulus features (Slater, 1993).
Fig. 2. Picture of a face filtered according to the contrast sensitivity function of the adult (up left), 1-month-old (up right), 2-month-old (bottom left), and 3-month-old infant (bottom right). Stimuli should be viewed from a distance of 15 cm. Reprinted from Nelson, 1987.
216
Jukka M. Leppa¨nen and Charles A. Nelson
Given this, it may be more parsimonious to attribute the early ability to discriminate facial expressions to such general visual skills rather than a mechanism specialized for processing facial expressions per se. Categorization of facial expressions has been examined by using a multiple exemplar habituation–recovery paradigm in which the infant is first habituated to faces of several individuals all posing the same facial expression. In the test phase, the infant is shown two novel individuals, one posing the now-familiar expression and another posing a novel expression. Successful categorization requires that the infant is able to recognize the similarity of the expression in the habituation faces and that on one of the test faces and can discriminate these from the novel test expression. Studies using this technique have shown that 7-month-old infants categorize happy and surprised facial expressions (Caron, Caron, & Myers, 1982; Ludemann & Nelson, 1988; Nelson & Dolgin, 1985; Nelson, Morse, & Leavitt, 1979). Caron, Caron, and Myers (1982), for example, showed that 7-month-old infants categorized happy faces posed by six female models and discriminated this expression from surprise. Similarly, they categorized surprised expressions posed by six female models and discriminated them from happy. Younger, 4-month-old infants were not able to categorize either happy or surprised expressions, and 5½-month-olds could categorize happy but not surprised expressions. Other studies have extended these findings by showing that 5 – 7-month-olds can categorize discriminable exemplars of happy and surprised faces that vary not only by the model posing the expression but also by the intensity of the expression (Bornstein & Arterberry, 2003; Ludemann & Nelson, 1988). There is also evidence that 7-month-old infants perceive happy facial expressions categorically. Specifically, they failed to discriminate between two stimuli that belonged to the same expression category whereas they showed good discrimination between the stimuli when they were taken from different emotion categories (Kotsoni, de Haan, & Johnson 2001). Recall that, in adults, recognition of facial expressions is based on configural information. An interesting question is whether infants also base their responses on configural information, or whether they perhaps categorize expressions on the basis of certain salient features common to different exemplars of a particular expression (e.g., visible teeth in happy expressions). Caron, Caron, and Myers (1985) addressed this issue by habituating 4 – 9½-month-old infants to eight models posing either toothy angry, nontoothy angry, or nontoothy happy expressions, and then testing them with the familiar expression and a toothy happy expression. The rationale behind this design was that if infants categorize expressions on the basis of affect-relevant information, they should show a novelty preference only when there is a change in emotional expression; specifically, a change from toothy or nontoothy angry to toothy happy. In contrast, if infants categorize expressions on the basis of isolated features that are not correlated with emotion (e.g., visible teeth), they should show novelty preference
Facial Emotion Recognition
217
when there is a change in this feature (i.e., a change from nontoothy happy or angry expression to toothy happy expression). The results obtained at all age levels almost uniformly supported this latter possibility, suggesting that young infants cannot extract emotion-relevant configural information from static pictures of facial expressions. It is noteworthy, however, that these results were obtained in a task context in which a single facial feature was made especially salient (i.e., all habituation stimuli were toothy and the test stimulus was nontoothy). Given this, it is perhaps not surprising that infants’ attention was directed to a single salient feature. Furthermore, other studies have shown that, although infants may categorize faces on the basis of single salient isolated features, they can also code emotionrelevant configural information from facial expressions. Kestenbaum and Nelson (1990), for example, showed that 7-month-old infants categorized toothy happy expressions and discriminated them from toothy fearful as well as toothy angry expressions. A particularly interesting result of this study was that the categorization was evident when the faces were shown in an upright orientation, but not when the faces were inverted. Similarly as in the adult literature, the inversion effect can be interpreted as an evidence in favor of orientation-specific (and familiar) configural coding of facial expressions. It seems, then, that infants categorize some facial expressions at the age of 5 – 7 months. At about the same time, infants also show an ability to categorize a range of other visual stimuli including such complex stimuli as pictures of cats and dogs (see de Haan & Nelson, 1998; Nelson & Snyder, 2005 for discussion). Because the timetable for categorizing both faces and non-face objects is so similar, it would appear that a common process underlies both abilities. Of course, it is also possible that behavioral assays lack sensitivity in examining the underlying processes, and that a more sensitive physiological measure may reveal that categorizing faces and non-face objects do in fact depend on different processes. One striking feature of the developmental literature concerns the rapid development of the categorization of happy faces as compared to other facial expressions. With few exceptions (Serrano, Iglesias, & Loeches, 1992; Serrano, Iglesias, & Loeches, 1995), studies with 5 – 7-month-old infants have failed to show evidence of categorization of angry or fearful facial expressions (Caron, Caron, & Myers, 1985; Ludemann & Nelson, 1988; Nelson & Dolgin, 1985; Nelson, Morse, & Leavitt, 1979; Phillips et al., 1990). It is possible that, whereas the categorical representation of happy faces is well-established by the age of 5 – 7 months, representations of discrete negative expressions require a longer period of time to develop. Indeed, several studies have charted age-related improvement during the preschool and school years in tasks sensitive to facial expression discrimination and/or categorization (Boyatzis, Chazan, & Ting, 1993; Denham & Couchoud, 1990; Vicari et al., 2000;
218
Jukka M. Leppa¨nen and Charles A. Nelson
Widen & Russell, 2003). There is also evidence that discrimination of angry and sad faces is not yet adult-like in 9 – 10-year-old children (de Gelder, Teunisse, &, Benson, 1997). Even in adults, representations of negative facial expressions are ‘‘less robust’’ (cf. Tong & Nakayama, 1999) than representations of happy faces, as shown by faster and more accurate recognition of happy facial expressions as well as lower recognition thresholds for happy facial expressions (Esteves & O¨hman, 1993; Hess, Blairy, & Kleck, 1997; Leppa¨nen & Hietanen, 2004). There are possibly many different explanations for this difference. However, one plausible mechanism may be that because happy faces are encountered more often than other facial expressions in our normal social environment (Bond & Siddle, 1996; Whalen, 1998), more robust representations for these expressions are formed. In the developmental literature, such evidence can be found in naturalistic observations of infant–caregiver dyads, where it has been consistently reported that infants are disproportionately exposed to positive facial expressions (see Nelson, 1987; Nelson & de Haan, 1996).
C. SPONTANEOUS REACTIONS TO FACIAL EXPRESSIONS IN INFANTS
Some studies have focused on infants’ spontaneous behavioral or physiological reactions to specific facial expressions. These studies are relevant in the present context because spontaneous responding to facial expressions entails some sort of perceptual discrimination of facial expressions. These studies may also shed light on the issue whether facial expressions may act as innate ‘‘releasing stimuli’’ for certain emotion-related behaviors (cf. Sackett, 1966). Researchers have focused on various aspects of infants’ behavioral or physiological reactions to facial expressions but perhaps most commonly on the presence of specific facial expressive movements on the face (expressions are usually interpreted as signs of emotional reactions). Some studies have demonstrated emotion-specific facial reactions in 2 – 4-month-old infants (e.g., Haviland & Lelwica, 1987; Montague & Walker-Andrews, 2001), but these studies have employed either live or videotaped models expressing emotions both facially and vocally. Studies focusing on emotion-related reactions to facial expressions alone have shown that neonates react with mouth opening and eye-widening when they see a live adult model posing surprised expressions; lip widening when seeing a happy face; and tightened mouth, protruding lips as well as furrowed brows when seeing a sad face (Field et al., 1982). These reactions were, however, not interpreted as reflecting emotional reactions to facial stimuli but rather as reflecting an early ability to integrate visual and proprioceptive information and imitate others’ facial movements in infants (see Meltzoff & Moore, 1997, for further discussion of this phenomenon).
Facial Emotion Recognition
219
Facial expressions have been shown to affect other emotion-related behaviors as well. Balaban (1995) reported a study in which 5-month-old infants were exposed to 95 – 105 dB bursts of white noise while viewing pictures of happy, neutral, or angry faces. Presentation of sudden bursts of white noise typically evokes a startle reflex with an eyeblink component. Balaban’s study showed that the magnitude of this eyeblink was modulated by the facial expressions; specifically, compared to the neutral-expression condition, viewing angry faces augmented and viewing happy faces reduced the blink magnitude. This result is highly similar to findings reported in the adult literature, which consistently show that positive pictures reduce and negative pictures potentiate the startle reflex (Lang, Bradley, & Cuthbert, 1990).2 Finally, there is evidence that infants spontaneously prefer fearful facial expressions. When exposed to happy–fearful face pairs, 7-month-olds consistently prefer or look longer at the fearful face (Nelson & Dolgin, 1985; see also Nelson & de Haan, 1996). The origins of this effect are not clear, but it may possibly reflect (a) the novelty of fearful faces in infants’ environment, (b) some low-level physical characteristic of fearful faces, or (c) the greater signal value of fearful faces as they may signal potential danger (Nelson & Dolgin, 1985).
D. THE ROLE OF EXPERIENCE IN THE DEVELOPMENT OF FACIAL EXPRESSION PROCESSING
Studies of infants of depressed mothers (Diego et al., 2004), children reared in institutionalized settings (Wismer Fries & Pollak, 2004; Parker, Nelson & BEIP Core Group, 2005), and children of neglectful parents (Pollak et al., 2000) have provided valuable insights into the role of normal social experiences in the development of facial expression recognition. These rearing environments are characterized by a lack of species-typical social experiences (e.g., neglectful parenting). Children reared in these types of environments are less accurate at discriminating facial expressions and they see fewer distinctions between different emotional expressions than children reared in normal social environments (Pollak et al., 2000).
2 The logic behind this finding is that negatively valenced stimuli, be they facial expressions or scenes depicting negative affect (e.g., mutilation), activate a circuit that involves the amygdala, a structure known to be involved in processing negative emotion. Because of connections between the amygdala and the brain stem nuclei that control eye blink startle generally, it is believed that negatively valenced stimuli lead to an augmentation of the startle response. Less is known about the circuitry involved in processing positively valenced stimuli, although the literature is consistent in reporting reductions in startle responses to such stimuli (for discussion, see Grillon et al., 1999).
220
Jukka M. Leppa¨nen and Charles A. Nelson
Similar disruption of the normal development of perceptual representations of facial expressions due to social deprivation may be occurring in certain neurodevelopmental disorders such as autism. A relatively long history of research has suggested that individuals with autism show abnormalities in processing of social information including faces and facial expressions (Marcus & Nelson, 2001). In a study by Teunisse and de Gelder (2001), a group of 16- to 25-year-old individuals with autism and a group of typically developing controls were asked to do both within-category as well as between-category discriminations of stimuli from happy–sad, angry–sad, and angry–fearful expression continua. The category boundaries were determined on the basis of an identification task that was conducted subsequently to the discrimination task. The most important finding of the study was that, unlike controls, individuals with autism did not show the typical pattern of better discrimination of betweencategory as compared to within-category facial expressions. This suggests that autistic adolescents and young adults do not process facial expressions categorically. Although the authors of the study interpreted these results as reflecting abnormal development of the innate mechanisms underlying facial expression processing, they may actually arise from the fact that autistic individuals have less experience with facial expression than typically developing individuals (cf. Grelotti, Gauthier, & Schultz, 2002). Infants with autism, for example, attend less to the faces of others as compared to typically developing infants (Osterling, Dawson, & Munson, 2002). Research has also been conducted in children whose rearing environments are characterized by excessive amounts of certain types of parental behaviors (i.e., abusive environments, Camras, Grow, & Ribordy, 1983; Pollak et al., 2000). Abusive parents show several species-atypical forms of behaviors towards their children, including heightened levels of negative emotional expressions, as well as high rates of direct verbal and physical aggression (Pollak & Sinha, 2002). Consequently, abused children’s affective environment involves an atypical amount of social threat. A series of studies by Pollak and colleagues (Pollak & Kistler, 2002; Pollak et al., 2000, 2001) has shown that physical abuse in early childhood has profound effects on the development of facial emotion recognition. School-aged children with a history of being physically abused by their parents have a specific bias in the recognition of angry faces. That is, as compared to normal controls, abused children show a response bias for anger, which means that they have a low threshold to respond ‘‘angry’’ when they are uncertain about the nature of the expressions presented to them (Pollak et al., 2000). They also allocate a disproportionate amount of processing resources (as inferred by the amplitude of the event-related potential (ERP)) to angry facial expressions (Pollak et al., 1997, 2001). Particularly germane to the present discussion are two studies showing that abused children show a perceptual bias in the processing of angry faces, which
Facial Emotion Recognition
221
causes them to classify a broader range of facial expressions as perceptually similar to angry faces and also recognize anger on the basis of partial sensory cues. In one study, Pollak and Kistler (2002) found that abused children show an altered perceptual category boundary for angry faces, so that they categorized a wider range of morphed facial expressions as angry than controls. In another study, Pollak and Sinha (2002) presented normal and abused children image sequences in which an unstructured image gradually evolved into an organized angry, happy, fearful, or sad face. Children were asked to identify the expression at regular steps during the image evolution. There were no differences in the recognition of fearful and happy facial expressions between the two groups, but abused children recognized angry expression at earlier stages of the image sequence (i.e., on the basis of less sensory input) than controls. In contrast, abused children required more information to recognize sadness than controls. Together, these findings provide strong evidence that the categorical representations of facial expressions can be significantly modified by early socioemotional experiences. E. SUMMARY
Our discussion has so far dealt with the categorization and automatic response to facial expressions in adults and children. As these two processes are likely to be partially separable (automatic responding does not necessarily entail categorization), their developmental time course may also differ. The literature reviewed thus far indicates that human infants discriminate a range of visual stimuli soon after birth, including facial expressions. By the age of 5 – 7 months, infants categorize happy and surprised facial expressions, whereas categorization of other expressions continues to develop throughout the preschool and school years. Categorization of happy and surprised facial expressions emerges at the same time as the categorization of other visual objects in infancy, which suggests that facial expression processing is not, in this sense, special (see de Haan & Nelson, 1998 for discussion). Finally, the literature shows that the development of facial expression processing is experience-dependent: Social deprivation and species-atypical parenting disrupts the normal development of facial expression processing. Collectively these findings have important implications for the discussion about the developmental origins of the ability to recognize facial expressions. We return to them in later sections of this chapter.
II. Neural Basis of Facial Expression Recognition Neuropsychological studies of patients with focal brain lesions and electrophysiological as well as functional imaging (functional magnetic resonance
222
Jukka M. Leppa¨nen and Charles A. Nelson
imaging (fMRI), positron emission tomography (PET)) studies of healthy adults have provided valuable information about the brain mechanisms of facial expression processing (for reviews, see Adolphs, 2002; Haxby, Hoffman, & Gobbini, 2002). The existing data suggest that the recognition of facial expressions is subserved by a distributed network of brain structures involving occipitotemporal cortical areas, which are assumed to play a major role in forming perceptual representations of different facial attributes, and non-visual subcortical and cortical structures (amygdala, orbitofrontal cortex, basal ganglia, insula, and right somatosensory cortex) which, in turn, are important in representing the emotional signal value of the expressions. In the following sections, we focus our discussion on structures that are directly involved in the perceptual processing of facial expressions (occipitotemporal cortex, the amygdala).3 It appears that the processing of facial expressions is mediated by two parallel routes; specifically a ‘‘cortical’’ route and a ‘‘subcortical’’ route (Adolphs, 2002). The cortical route may play an important role in the construction of a detailed representation of facial expressions required for stimulus categorization, whereas the subcortical route is usually thought to underlie some aspects of automatic response to certain salient expressive cues.
A. NEURAL BASIS OF ADULTS’ PROCESSING OF FACIAL EMOTIONS
1. Cortical Routes to the Perception of Facial Expressions Cortical areas implicated in facial expression processing involve occipital regions important for visual information in general (V1/V2) and also occipitotemporal higher-level visual areas that respond selectively to faces (fusiform gyrus, superior temporal sulcus; see Figure 3). These areas are thought to play an important role in the construction of structural representations of those facial attributes that are important for the recognition of facial identity and facial expressions (Adolphs, 2002; Haxby, Hoffman, & Gobbini, 2002). Ventral occipitotemporal regions involving the lateral fusiform gyrus and adjacent inferior temporal and occipital regions have been shown to be especially important for the analysis of those invariant aspects of faces that are critical for recognizing facial identity (Haxby, Hoffman, & Gobbini, 2002). 3 Other structures have been implicated in representing subjective emotional reactions occurring during the observation of other facial expressions. Recent theorizing suggest that such emotional reactions elicited by viewing others’ facial expressions may play an essential role in attributing emotional states to others (Adolphs, 2002; Goldman & Sripada, 2005; Gallese, Keysers, & Rizzolatti, 2004; Sprengelmeyer et al., 1997). There are several possible computational mechanisms as to how the perceiver reproduces the sender’s emotional state, and how subjective emotional reactions are used to attribute emotional states to other people (see Goldman & Sripada, 2005).
Facial Emotion Recognition
223
Fig. 3. Cortical areas that respond selectively to faces. The two upper rows show the lateral and ventral surfaces of the occipitotemporal cortex. In the next two rows, the cortical surface has been inflated to show the cortex in the sulci and flattened to show the entire cortical surface of the left and right hemispheres. To view this figure in color, see Haxby et al. (2000). Reprinted from Haxby et al. 2000 with permission from Elsevier.
This is supported by several findings. First, single-cell recordings in macaque monkeys have identified neurons in inferior temporal cortex that fire selectively to the sight of faces and, in particular, to facial identity (Desimone, 1991; Hasselmo, Rolls, & Baylis, 1989). Second, lesions to ventral occipitotemporal and temporal cortices lead to a relatively specific impairment in the recognition of familiar faces, while the ability to recognize other visual objects is spared (see Farah, 1996). Third, functional imaging studies have shown face-specific activation in the fusiform gyrus in healthy adults (Kanwisher, McDermott, & Chun, 1997). Face-specific brain activity has also been found by recording ERPs or magnetoelectric potentials from the scalp surface or by recording
224
Jukka M. Leppa¨nen and Charles A. Nelson
directly from the surface of cortex in patients undergoing brain surgery (Allison et al., 1999; Bentin et al., 1996; Halgren et al., 2000). Whereas the fusiform cortex is most strongly associated with facial identity processing, other areas of the occipitotemporal cortex (i.e., superior temporal sulcus) may play a major role in processing the changeable aspects of faces, such as eye movements, lip movements, and facial expressions (Allison, Puce, & McCarthy, 2000; Haxby, Hoffman, & Gobbini, 2002). Hasselmo, Rolls, and Baylis (1989), for example, showed that neurons responsive to facial gestures in monkey temporal cortex were found primarily in the superior temporal sulcus. Consistent with this, functional imaging studies in humans have shown that active attention to emotional expression strengthens activation of the superior temporal sulcus (Narumoto et al., 2001; Winston, O’Doherty, & Dolan, 2003).4 Information is fed forward from the previously mentioned occipitotemporal cortical regions to the amygdala, which serves to link the expression with its motivational significance. The amygdala is a collection of cell nuclei located in anterior and medial parts of the temporal lobe (Figure 4). Studies with rodents, monkeys, and humans have consistently linked the amygdala to processing of motivationally significant stimuli. Briefly, the amygdala is critically involved in learning about negative and positive stimuli in rodents (LeDoux, 2000; Davies & Whalen, 2000). Lesion to the amygdala results in abnormal responding to potentially dangerous objects or situations (e.g., rubber snakes) in monkeys (Amaral, 2003). In humans, lesions to the amygdala are associated with selective impairments in the recognition of certain negative facial expressions such as fear (Adolphs et al., 1994), evaluation of the approachability and trustworthiness of unfamiliar persons (Adolphs, Tranel, & Damasio, 1998), and processing of others’ direction of gaze (Young et al., 1995). These findings have been complemented by functional imaging studies showing that the amygdala is activated by watching fearful faces (Morris et al., 1996), eye contact (Kawashima et al., 1999), and also by viewing faces of racial out-group members (Hart et al., 2000). 4
It is currently unresolved whether the occipitotemporal areas should be viewed as cortical modules dedicated to the processing of facial information. The fusiform gyrus, for example, has been thought of as an innately specified ‘‘face module’’ that is dedicated to the detection of face geometry (Farah et al., 2000; Kanwisher, 2000). Two objections have been presented against this view. First, distinct categories of objects evoke widely distributed and overlapping activity in occipitotemporal cortex, which makes it unlikely that processing of faces is strictly localized (Haxby et al., 2001; Ishai et al., 1999). Second, the view that the fusiform regions are dedicated to face processing per se is weakened by findings showing that this area is activated by images of birds in bird experts, images of cars in car experts, and images of artificial objects (‘‘greebles’’) in perceivers who have gained expertise in processing these stimuli (Gauthier et al., 1999, 2000). This suggests that rather than face detection, the fusiform cortex is specialized for fine-grained visual processing and it is recruited whenever the perceiver has learned to discriminate separate individuals within a generic class of objects (e.g., separate facial identities, Tarr & Gauthier, 2000).
Facial Emotion Recognition
225
Fig. 4. Line drawing of the monkey brain that illustrates the cortical pathway of visual input to the amygdala and feedback projections of the amygdala to different cortical visual areas. Different subnuclei of the amygdala are shown in the inset at left. Reproduced from Amaral et al., 1992.
Although both lesion and functional imaging studies in humans suggest a special role of the amygdala in processing facial expressions of fear, the results are not entirely consistent and some functional imaging findings suggest that the amygdala may play a broader role in processing several negative and also positive facial expressions. A study by Yang et al. (2002), which involved a relatively large sample size, a large number of carefully selected facial expressions, and conservative data analytic techniques, showed reliable amygdala activation for happy, sad, angry, and fearful faces (each compared to neutral faces). There were no differences between different emotional expressions in the strength of the amygdala activation (see Winston, O’Doherty, & Dolan, 2003 for similar findings). In adult models of facial expression processing (Adolphs, 2002), the role of the amygdala is to (a) modulate perceptual processing of facial expressions in occipitotemporal cortex via feedback connections to these regions, (b) participate in the retrieval of conceptual knowledge associated with the facial expression, and (c) trigger components of an emotional response to facial expressions (Adolphs, 2002). The possibility that the amygdala plays an important role in modulating the perception of facial expressions has gained support in recent research. First, the amygdala sends ‘‘feedback’’ projections to several areas of the ventral visual processing systems, including those areas that have been implicated in face processing (Amaral, Behniea, & Kelly, 2003, Figure 4). Second, a well-replicated finding shows enhanced activity in occipitotemporal visual cortices for emotional facial expressions as compared to neutral faces (e.g., Morris et al., 1998). Third, one study has showed that the enhanced responses to facial expressions in occipitotemporal cortical regions are
226
Jukka M. Leppa¨nen and Charles A. Nelson
not found in patients with amygdala lesions (Vuilleumier et al., 2004), which provides evidence that the amygdala is necessary for this phenomenon to occur. The amygdala may also affect cortical sensory processing indirectly by modulating the perceiver’s vigilance level. Whalen (1998) suggested that one of the key functions of the amygdala is to modulate an organism’s moment-tomoment vigilance level, and to increase an organism’s capacity to gather further information about stimuli whose predictive value is, at least partially, uncertain or ambiguous (e.g., direct gaze of another person, fearful facial expressions). These stimuli are biologically relevant, but their exact meaning is ambiguous and context-dependent (i.e., a direct gaze may indicate potential threat or attraction; a fearful face communicates the presence of threat but the source of the threat is unclear). The amygdala may mediate changes in organisms’ vigilance level via its connections to basal forebrain structures that release acetylcholine onto cortical sensory systems and, thereby, increase their excitability and processing capacity (Whalen, 1998). 2. Subcortical Route to the Perception of Facial Expressions Besides the cortical–amygdala route just described, the amygdala can be activated by ‘‘direct’’ inputs from the thalamus. Studies with rodents first suggested that conditioned fear responses are mediated by a subcortical thalamus– amygdala pathway that bypasses cortical sensory areas (LeDoux, 2000). Interestingly, studies have pointed to the existence of corresponding subcortical emotion processing systems in humans as well. For example, a patient with unilateral damage to striate cortex could identify dynamic happy, sad, angry, and fearful facial expressions above the chance level even when these expressions were presented to his blind hemifield (de Gelder et al., 1999). Functional imaging results have also shown that very briefly presented angry and fearful facial expressions, which were not consciously recognized by participants because of a backward masking procedure, activated the amygdala (Morris et al., 1998; Whalen et al., 1998). These responses may be mediated by a subcortical retinal– collicular–pulvinar pathway to the amygdala. Consistent with this proposal, some studies have shown that the activation of the superior colliculus as well as posterior parts of the thalamus correlated positively with amygdala activation when participants were presented with masked facial expressions (Morris, Ohman, & Dolan, 1999; Morris et al., 2001). The subcortical pathway to the amygdala may constitute a fast-operating system that subserves rudimentary affective discriminations and perhaps mediates some of the automatic responses to facial expressions described earlier in this chapter (O¨hman, 2002). A study by Vuilleumier et al. (2003) showed that the subcortical route is especially sensitive to coarse, low spatial-frequency information. Specifically, the study showed that the key structures of the putative subcortical route (i.e., amygdala, pulvinar thalamus, superior colliculus) were activated by fearful faces that were either
Facial Emotion Recognition
227
naturalistic (broad spatial frequency) or filtered so that only low spatial frequencies were spared (see Figure 2). In contrast, the fusiform gyrus, which constitutes one of the major cortical face-related areas, was more strongly activated by intact or by high spatial frequency faces (i.e., faces sparing fine detail). The authors of this study suggested that the subcortical route to amygdala is mainly activated by coarse, low spatial-frequency cues from the magnocellular visual pathways, whereas the parallel, cortical route responds to higher spatial-frequency inputs (8 – 16 cycles/face) and fine detail from parvocellular visual pathways.
B. DEVELOPMENT OF NEURAL SYSTEMS IMPLICATED IN FACIAL EXPRESSION PROCESSING
Thus, the adult brain includes neural systems that are specialized for the perceptual processing and emotional responding to different facial attributes. From a developmental point of view, it is important to ask how this functional specialization of certain brain regions emerges. One position on this question is that the adult-like segregation of the processing of different types of visual inputs in the brain is established by innate mechanisms, and that experience has only a limited effect on the basic organization of these systems. Alternatively, the cortical and subcortical regions implicated in facial expression processing may be relatively unspecialized at the start of ontogenesis (i.e., they are not inherently specialized or selectively responsive to any particular type of visual input). However, they may become increasingly ‘‘specialized’’ for the processing of different facial attributes because they are repeatedly used to this task in species-typical environment (cf. Karmiloff-Smith, 1998). In this view, then, experience and interaction with the social environment eventually leads to the establishment of adult-like functional specialization in certain brain regions. In the following section, we discuss these views in light of the existing literature on the anatomical and functional development of the cortical and subcortical facerelated structures. 1. Cortical Structures Studies with non-human primates suggest that those inferior temporal cortical areas that show a high density of face-selective neurons in adult monkeys play a limited role in early visual processing and also that these systems undergo a relatively protracted period of postnatal development (cytoarchitectonic areas TE and TEO, Figure 4). First, the metabolic activity of area TE is not adult-like in 6-month-old monkeys, which are comparable to 2-year-old children (Hagger et al., 1988). Second, surgical ablation of area TE causes only a transient deficit in certain visual functions in infant monkeys, whereas the equivalent lesion
228
Jukka M. Leppa¨nen and Charles A. Nelson
in adults causes a long-lasting deficit (Bachevalier & Mishkin, 1994; Bachevalier et al., 1990). Third, there are connections from these areas to the amygdala, perirhinal cortex, and parahippocampal cortex in infants that are not found in adults (Webster, Ungerleider, & Bachevalier, 1991). Cortical connections between primary visual areas and superior temporal sulcus also differ in infant and adult monkeys (Kennedy, Bullier, & Dehay, 1989). However, direct recordings of single-cell activity from the surface of the cortex showed that certain neurons in inferior temporal cortex show adult-like stimulus selectivity already in 6-week-old infants (e.g., preferential firing to a view of a face image, see Rodman, Skelly, & Gross,1991; Rodman, O’Scalaidhe, & Gross, 1993). It is noteworthy, however, that the response properties of stimulus-selective cells in infants were not completely adult-like. For example, neurons in inferior temporal cortex in infants showed generally lower response magnitude, longer response latencies, and were more susceptible to the effects of anesthesia as compared to neurons in corresponding regions in adult monkeys. Also, the role that single face-specific cells play in face processing is not clear. For example, groups or ensembles of face-specific neurons rather than individual neurons alone may be important in representing facial information (Gross, 1992). Whether infants show different patterns of activity at the level of groups of neurons than adult monkeys has not yet been investigated. Nevertheless, these studies on monkeys indicate that temporal neocortical regions undergo notable anatomical and functional changes during postnatal development. Electrophysiological studies in human infants converge on much the same conclusion. In these studies, measures of ERPs have been used to examine the selectivity of certain brain responses to faces as compared to other objects at various stages of postnatal development. ERPs are oscillations in the electrical activity of the brain that occur in response to a discrete event. A discrete stimulus typically invokes a series of ERP components that can be recorded from the surface of the scalp. The latency, amplitude, and scalp topography of the ERP components provide information about the timing of a particular mental event, the amount of neural activity involved in a particular cognitive process, and in some situations, the neural generators of the observed activity. Investigations of face-related ERPs in 3 – 12-month-old infants have generally focused on two components of the ERP: N290 (i.e., a negative deflection with the mean latency of 290 ms) and P400 (i.e., a positive deflection with the mean latency of 400 ms). These components are largest in the posterior regions of the head and based on studies performed with infants, children, and adults, they have been interpreted as developmental precursors of face-specific ERP components in adults (i.e., N170, which is generally considered an electrophysiological marker of structural encoding of faces in occipitotemporal cortical face areas). One of the first studies addressing the possibility of face-specific ERPs in infants showed that the latency of the P400 component at occipital
Facial Emotion Recognition
229
recording sites was shorter for faces as compared to pictures of objects in 6-month-old infants (de Haan & Nelson, 1999). Later studies showed that in both 3- and 6-month-old infants, the N290 differentiated between human and monkey faces, thus showing some specificity to those faces that a developing infant most often encounters in his/her environment (de Haan, Humphreys, & Johnson, 2002; Halit, de Haan, & Johnson, 2003). Yet, the face-sensitive ERPs do not show adult-like sensitivity to upright human faces at these ages. Specifically, whereas the adult face component (N170) differed in amplitude to upright human faces as compared to inverted human faces, the putative equivalent of this component in 3- and 6-month-old infants (N290) did not differentiate between upright and inverted faces (de Haan, Humphreys, & Johnson, 2002; Halit, de Haan, & Johnson, 2003). However, adult-like specificity to upright human faces was found in older, 12-month-old infants (Halit, de Haan, & Johnson, 2003). Together, these finding suggests that the 3rd through the 12th postnatal months are critical for the development of the cortical systems underlying face perception. The results of the first wave of ERP studies with human infants generally conform to the models of face processing that emphasize experience-driven ‘‘specialization’’ of visual cortical systems for the processing of upright human faces (see de Haan, de Haan, & Johnson, 2003 for a review). It has been proposed, for example, that the cortical system implicated in face processing in adults is initially a domain-general, non-specific visual pattern recognition system, but as a function of visual experience, this system becomes tuned to those faces that are most frequent in a species-typical environment (de Haan, Humphreys, & Johnson, 2002; Nelson, 2001). This tuning or neuronal commitment is reflected by the results that the cortical generators of the face-specific electrophysiological activity are initially activated by a broader range of stimuli than they are at later stages of development (de Haan, Humphreys, & Johnson, 2002; Halit, de Haan, & Johnson, 2003). The infant may also initially perform a range of visual discriminations successfully (e.g., discriminate different monkey identities) but some of these abilities are lost and the range of visual discrimination is narrowed because the perceptual systems become more tuned to those discriminations that are relevant to the successful discrimination of human faces (Nelson, 2001; Pascalis, de Haan, & Nelson, 2002). Although the ERP studies in infants have provided valuable information about how the processing of human faces becomes separated from the processing of other visual objects and faces of other species, these studies have not investigated whether the development of facial expression processing follows a similar pattern of experience-driven specialization. Assuming that the functional specialization described here reflects a general characteristic of the development of higher-level visual systems in occipitotemporal cortex, one could argue that just as higher-level visual cortices tune to human faces over time, they may tune
230
Jukka M. Leppa¨nen and Charles A. Nelson
to those configurations of facial features that are important for the recognition of species-typical facial expressions. Support for experience-driven development of cortical facial expression processing systems comes from a study showing abnormal patterns of ERPs to facial expressions in children suffering from early social deprivation because of institutional rearing (Parker, Nelson & BEIP Core Group, 2005). Results showing that children and adults with autism show abnormal brain responses to facial expressions can also be interpreted to be consistent with this view (recall that infants with autism attend less to the faces of others as compared to typically developing infants and thus they can be expected to have less experience with facial expressions). Whereas typically developing 3 – 4-year-old children showed higher amplitude of early and late ERP components to fearful as compared to neutral faces, chronological or mental-age matched children with autism failed to show this difference (Dawson et al., 2004). An fMRI study on children and adolescents (8 – 23-year-olds) with autism, in which children were asked to either match faces by emotion or assign a label to facial expressions, also showed that individuals with autism show activation in brain regions typically activated in such tasks (i.e., fusiform gyrus, amygdala) but the activation in these structures was either reduced (fusiform gyrus) or showed atypical pattern of modulation by task (amygdala, Wang et al., 2004). These findings corroborate the results of studies in adults with autism (Critchley et al., 2000). 2. Amygdala Whereas occipitotemporal cortical face-related regions become functional only some time after birth, the amygdala appears to be relatively mature already at birth. Studies with infant monkeys have shown that afferent connections from higher-order sensory cortices to the amygdala and efferent connections from the amygdala back to the sensory and other cortical regions are established soon after birth (Amaral & Bennett, 2000; Nelson et al., 2002). In contrast, there are transient projections from inferior temporal area TEO to the amygdala in infant monkeys that are not found in adults (Webster, Ungerleider, & Bachevalier, 1991). This suggests that cortical input to the amygdala is more rudimentary in infancy than later in life (Machado & Bachevalier, 2003). The hypothesis that the amygdala becomes functional early in development is supported by the findings that experimental lesions to the amygdala in infant monkeys impairs visual recognition memory (Bachevalier & Mishkin, 1994) and the processing of social and non-social fear (Amaral, 2003; Bauman et al., 2004). Whether the amygdala participates in emotional processing in human infants as well is not known. Indirect evidence for this comes, however, from the previously cited study by Balaban (1995), which showed that the magnitude of the eyeblink component of the human startle reflex was modulated by facial expressions. Although the mechanisms underlying these modulatory effects by
Facial Emotion Recognition
231
facial expression are not entirely clear (Balaban, 1995), it is plausible that they are mediated by amygdala circuitry. Startle modulation by negatively and positively valenced emotional foreground cues is well-documented in animals as well as in humans (Lang, Bradley, & Cuthbert, 1990; Lang, Davis, & O¨hman, 2000). Furthermore, results from animal (see Lang, Davis, & O¨hman, 2000) as well as human lesion and neuroimaging studies (Angrilli et al., 1996; Funayama et al., 2001; Pissiota et al., 2003) suggest that emotional modulation of the startle reflex (especially the fear-potentiated startle) is mediated by the amygdala and its connections to brain systems responsible for the startle reflex. Viewed in this light, the results by Balaban suggest that the amygdala participates in the recognition of facial expressions in early infancy. The role of the amygdala in early facial expression processing could be directly tested by using modern functional imaging technology such as fMRI to assess whether the amygdala shows increased activation (i.e., blood oxygen levels) during passive observation of facial expressions. It is, however, not presently feasible to use these technologies with infants because of the stringent requirements that they place on the participant (Nelson & Bloom, 1997). However, functional imaging (fMRI) technologies have been successfully used with older, 9- to 17-year-old children. The first results from these studies show that, consistent with the adult data, the amygdala is more strongly activated during viewing of fearful facial expressions than during viewing of visual control stimuli (Baird et al., 1999; Kilgore, Oki, & Yurgelun-Todd, 2001; Monk et al., 2003). Stronger amygdala activation has also been observed for happy as compared to neutral faces (Yang et al., 2003). These studies clearly demonstrate that the amygdala participates in facial emotion processing prior to adulthood. In addition, some interesting age differences in the amygdala response to facial expressions have been found. Thomas et al. (2001), for example, found that adults and 11-year-olds showed greater amygdala activation to fearful and neutral faces relative to a control condition (i.e., fixation). However, the pattern of responses to the two expression differed between the age groups; specifically, adults showed the typical pattern of greater amygdala response to fear as compared to neutral faces, whereas children showed the reverse (i.e., greater amygdala response to neutral than to fearful faces). This finding suggests developmental changes in the amygdala response to facial expressions, and that the amygdala circuitry may respond to a wider range of facial expression stimuli in children than in adults. It is not known whether the amygdala circuitry is selectively responsive to facial expressions in the first place or whether it learns to respond to these cues. Results from studies with non-human primates, which have examined behavioral consequences of early amygdala damage, suggest that the amygdala is not specialized or dedicated to support social behavior per se (Amaral, 2003). Rather, it can be thought of as a system that participates in the evaluation of
232
Jukka M. Leppa¨nen and Charles A. Nelson
potential dangers (Amaral, 2003; O¨hman, 2002), or the biological relevance of the stimulus more generally (e.g., Whalen, 1998). It is also thought that, rather than innately specified, selective responding to specific classes of stimuli in the amygdala is acquired through experience (e.g., through associative learning, LeDoux, 2000; Mineka & O¨hman, 2002; Davis & Whalen, 2000). Some adult studies suggest, however, that associative learning (i.e., connecting certain stimuli with specific emotional outcomes) occurs more readily for those stimuli that have posed recurrent threats in evolutionary history (e.g., pictures of certain facial expressions or animals such as snakes; Mineka & O¨hman, 2002). In this sense, the amygdala may be ‘‘prepared’’ for responding to certain facial expressions, making the infant more predisposed to learn about them than about other stimuli. It may also be that, because of this preparedness, rudimentary representations of certain facial expressions are acquired very quickly and with a limited amount of exposure. If certain expressive cues are marked as emotionally significant by the amygdala in the early stages of development, this may increase infants’ vigilance to these cues, enhance their perceptual processing and, in this way boost the development of cortical representations of these stimuli (cf., Grelotti, Gauthier, & Schultz, 2002). There are, indeed, reasons to believe that the amygdala is critical for the acquisition of knowledge about facial expressions. Congenital or acquired damage to the amygdala during development is, for example, associated with worse recognition of certain facial expressions than damage sustained in adulthood (Adolphs, Russell, & Tranel, 1999; Adolphs, Tranel, & Damasio, 2001; Hamann & Adolphs, 1999; Hamann et al., 1996; but see Schmolck & Squire, 2001). Abnormal functioning of the amygdala in autism may also explain why the development of normal representations of faces and facial expressions is disrupted in these disorders (Grelotti, Gauthier, & Schultz, 2002).
C. SUMMARY
Research with adults has identified two routes for processing facial expressions: (a) a cortical route involving occipitotemporal cortical regions and the amygdala and (b) a subcortical retinal–collicular–pulvinar pathway to the amygdala. The former presumably plays an important role in the categorization of facial expressions and its developmental time course is likely to be protracted and dependent on experience. The subcortical route, in turn, may mediate rudimentary discrimination of facial expressions on the basis of low spatialfrequency information. This system matures earlier than the cortical system. In addition, it may possibly show some sort of innate selectivity to certain facial expressions.
Facial Emotion Recognition
233
III. Developmental Mechanism We began this chapter by outlining two alternative views for the development of the ability to recognize facial expressions. One possibility, which was suggested by the data on the universal recognition of a limited set of facial expressions, emphasizes innate specification of the organization of the systems underlying facial expression processing. The literature reviewed here teaches us, however, that at least an extreme version of this nativist position, which places little emphasis on the influence of experience on development, is probably wrong. The main argument for this conclusion comes from the growing number of studies showing that the development of the ability to recognize facial expressions can be disrupted by early social deprivation. These studies have made it clear that the fundamental representations of facial expressions can be different in children reared in species-atypical environments. Electrophysiological studies with human infants also suggest that, in the early stages of development, the cortical face-related systems are less specialized or attuned to facial information than the corresponding mature systems. These findings suggest that at least some components of the system underlying facial expression processing are modified and sculpted by experience. The findings reviewed in this chapter also cast doubt on the assumption that the development of facial expression perception builds on perceptual mechanisms that are ‘‘dedicated’’ to this particular function. The development of categorization of facial expressions, for example, follows a similar time course as the development of categorization of other visual objects, suggesting a common underlying mechanism. Consistent with this, categorical representations of facial expressions in adult perceptual systems resemble categorical representation of other (learned) stimulus classes (i.e., the information may be represented in the form of acquired prototypes). It is, thus, not necessary to assume a special mechanism to explain these findings. It seems, then, that the existing data are better captured by a model that incorporates both innate and experiential contributions to the development of facial expression processing. One possibility is that the innately specified elements of this ability involve a general perceptual mechanism that allows the infant to discriminate between different facial expressions as well as a mechanism that allows the infant to associate these cues to emotional significance and to prioritize these stimuli over others. These systems are to a limited extent prepared for processing facial expressions (i.e., facial expressions may be more readily marked as emotionally significant by these systems than other cues). Importantly, however, the acquisition of representations of distinct facial expressions as well as the development of adult-like organization of the facial expression processing system relies heavily on experiential input.
234
Jukka M. Leppa¨nen and Charles A. Nelson
Specifically, at the earliest stages of development, discrimination and responding to facial expressions may be largely mediated by subcortical emotion-related brain circuits critically involving the amygdala. The data reviewed thus far showed that the amygdala is relatively mature at birth (Amaral & Bennett, 2000) and evidence suggests that coarse (low-frequency) visual information reaches the amygdala via subcortical structures alone (Vuilleumier et al., 2003). This subcortical circuitry might allow infants to ‘‘mark’’ certain salient facial cues as emotionally significant. It is also tempting to suggest that the subcortical amygdala circuitry is to a limited extent ‘‘prepared’’ for processing facial expressions because these have been linked to positive or negative outcomes in evolutionary history (cf. Mineka & O¨hman, 2002). Maturation of cortical regions that are important in facial expression processing, in turn, allows the infant to acquire more detailed representations of different facial expressions and also provides increasingly detailed input to the amygdala (Machado & Bachevalier, 2003). Based on prior research (e.g., de Haan et al., 2001; Morton & Johnson, 1991), cortical systems come on line at the age of 6 – 8 weeks. It can be assumed that from this age onwards, infants start to form categorical representations of different types of facial expressions (similarly as of other stimuli, see de Haan et al., 2001; Kuhl, 2004; Quinn, 2002; Sherman, 1985). Infants may, for example, compute some type of summary representations (i.e., prototypes) of different facial expressions that they encounter in their natural environment. Although there is no direct evidence for the assumption that facial expressions are represented as prototypes in infants, the possibility that infants form such prototypes is suggested by findings from studies using other stimuli. Threemonth-old infants can, for example, form prototypes (averages) of different facial identities (de Haan et al., 2001). These mechanisms may underlie the acquisition of adult-like categorical representations of facial expressions that allow infants to compare newly encountered instances of different facial expressions to stored prototypes of these expressions. Although this account emphasizes that learning about facial expressions is based on the same mechanisms as acquisition of knowledge about other types of stimulus representations, it also acknowledges the possibility that the acquisition of knowledge about facial expressions is to some extent ‘‘special.’’ First, we already mentioned that emotional associative learning mechanisms might show some innate stimulus selectivity. This may make the whole system slightly biased towards facial expressions. Second, because facial expressions are marked as emotionally significant at the early stages of development, infants probably prioritize these stimuli over other stimuli. This, in turn, may ‘‘boost’’ the development of cortical representations for these stimuli (cf. Grelotti, Gauthier, & Schultz, 2002). As noted previously, there are direct re-entrant connections from the amygdala to sensory cortical areas via which the amygdala may modulate cortical processing (Amaral, Behniea, & Kelly, 2003). Other pathways also exist
Facial Emotion Recognition
235
that may be important in increasing the excitability of cortical sensory neurons (Whalen, 1998). Fluctuations in vigilance level and excitability of cortical neurons are likely to be very important factors in the early development of higher-level cortical visual areas. Single-cell recordings from monkeys, for example, show that the response properties of inferior temporal neurons are state-dependent in infant monkeys (Rodman, Skelly, & Gross, 1991; Rodman, O’Scalaidhe, & Gross, 1993). It may even be the case that the development of cortical representations of facial expressions is critically dependent on the modulatory effects from the amygdala. This may explain why the recognition of fearful faces is impaired in patients with congenital or early-sustained damage to the amygdala but not in patients who sustained the damage in adulthood (Adolphs, Russell, & Tranel, 1999; Adolphs, Tranel, & Damasio, 2001; Hamann & Adolphs, 1999; Haman et al., 1996). Abnormal functioning of the amygdala may also explain why the development of normal representations of facial expressions is disrupted in specific neurodevelopmental disorders (e.g., in autism, cf. Grelotti, Gauthier, & Schultz, 2002). The systems underlying facial expression processing are, thus, not initially dedicated to the processing of facial expressions. However, with exposure to species-typical facial expressions, these systems become more sensitive to those facial feature configurations that represent particular emotions. This view of development is consistent with a neuroconstructive perspective (Karmiloff-Smith, 1998) according to which higher-level cortical systems are initially not specialized for the processing of any particular class of stimuli (i.e., domain-general), but they have the potential to become specialized for processing of those stimuli to which they are repeatedly exposed (i.e., domainspecific). This experience-driven ‘‘specialization’’ can occur in several different domains of perceptual development. From this perspective, disruption of the development of facial expression systems may arise if (a) the mechanisms that are relevant for the processing of facial expressions are dysfunctional (e.g., due to the amygdala dysfunction, facial expressions are not linked with emotional significance and, hence, the child is not motivated to learn about these expressions) or (b) the child is not exposed to a normal range of species-typical facial expressions. The view of the development of facial expression processing presented here is quite similar to Pollak’s (2002) suggestion that the development of facial expression processing builds on a general perceptual-cognitive mechanism that allows the infant to parse sensory information into separate units and to track the regularity, predictiveness, and temporal synchrony of emotion-related information. Pollak also emphasizes that, with experience, this mechanism becomes tuned to specific combinations of signals. He did not, however, suggest that the general mechanisms are initially biased towards learning from facial expressions per se.
236
Jukka M. Leppa¨nen and Charles A. Nelson
The present view is also very similar to models that have been proposed to account for how the processing of facial identity-related information becomes segregated from the processing of other visual objects. Specifically, these models emphasize that initial biases in the infant visual system arising either from an innate preference for face-like patterns (Morton & Johnson, 1991) or from general constraints of infant vision (Cassia, Turati, & Simion, 2004; Turati et al., 2002) function to bias infants’ attention towards faces. With exposure to faces, initially domain-general cortical systems become ‘‘specialized’’ for the processing of species-typical faces (de Haan, Humphreys, & Johnson, 2002; Morton & Johnson, 1991; Nelson, 2001). Some models have also emphasized the importance of amygdala-mediated modulation of attention and cortical systems in the development of cortical face representations (Grelotti, Gauthier, & Schultz, 2002).
IV. Conclusions Following the early findings of universal recognition of certain expressions, several studies have been conducted that provide an increasingly detailed picture of the cognitive, neural, and developmental mechanisms underlying facial emotion recognition. Based on the review of these data, we proposed that a neuroconstructive framework, which emphasizes the influences of both innate preparedness as well as experiential input on brain development (KarmiloffSmith, 1998), might prove useful for viewing the development of the ability to recognize facial expressions. The innate components of this ability may involve broadly tuned perceptual and emotional learning mechanisms that require experience in order to develop normally and establish the adult-like ‘‘specificity’’ to facial expressions. This perspective might prove useful not only in accounting for the existing data but also in generating interesting directions for future research in areas that have been comparatively unexplored so far. First, future studies might examine whether there is a sensitive period in development when the neural circuits underlying facial expressions are especially powerfully affected by experience. Second, we suggested that the neural circuits underlying facial expression processing and, in particular, the subcortical amygdala circuitry may be prepared for this function in such a way that facial expressions are more readily linked to emotional outcomes than other types of visual cues. These assumptions have not been directly tested yet, however. Third, the effects of experience on the development of the ability to recognize facial expressions should be investigated not only by studying children reared in species-atypical environments but also by studying typical development. For example, over the course of postnatal development, facial expression recognition might become
Facial Emotion Recognition
237
increasingly specific to those expressions that are most frequently encountered in one’s normal ecology (e.g., facial expressions from one’s own species or one’s own race). With the new techniques that allow investigation of the brain basis of different cognitive function in developing populations (e.g., highdensity ERP recordings), these and related questions are now more approachable than they were before.
Author Notes This chapter was written during the time the first author was a postdoctoral fellow at the Institute of Child Development, University of Minnesota, and Charles A. Nelson was on the faculty at the University of Minnesota. The writing of this chapter was made possible, in part, by postdoctoral fellowship grants from the Academy of Finland and Finnish Cultural Foundation for J.M.L. and grants from the NIH (NS32976) to C.A.N.
REFERENCES Adolphs, R. (2002). Recognizing emotion from facial expressions: psychological and neurological mechanisms. Behavioral & Cognitive Neuroscience Reviews, 1, 21 – 62. Adolphs, R., Russell, J. A., & Tranel, D. (1999). A role for the human amygdala in recognizing emotional arousal from unpleasant stimuli. Psychological Science, 10, 167 – 171. Adolphs, R., Tranel, D., & Damasio, A. R. (1998). The human amygdala in social judgment. Nature, 393, 470 – 474. Adolphs, R., Tranel, D., & Damasio, H. (2001). Emotion recognition from faces and prosody following temporal lobectomy. Neuropsychology, 15, 396 – 404. Adolphs, R., Tranel, D., Damasio, H., & Damasio, A. R. (1994). Impaired recognition of emotion in facial expressions following bilateral damage to the human amygdala. Nature, 372, 669 – 672. Allison, T., Puce, A., & McCarthy, G. (2000). Social perception from visual cues: role of the STS region. Trends in Cognitive Sciences, 4, 267 – 278. Allison, T., Puce, A., Spencer, D. D., & McCarthy, G. (1999). Electrophysiological studies of human face perception. I: Potentials generated in occipitotemporal cortex by face and non-face stimuli. Cerebral Cortex, 9, 415 – 430. Amaral, D. G. (2003). The amygdala, social behavior, and danger detection. Annals in New York Academy of Sciences, 1000, 337 – 347. Amaral, D. G., Behniea H., & Kelly J. L. (2003). Topographic organization of projections from the amygdala to the visual cortex in the macaque monkey. Neuroscience, 118, 1099 – 1120. Amaral, D. G., & Bennett, J. (2000). Development of amygdalo-cortical connection in the macaque monkey. Society for Neuroscience Abstracts, 26, 17 – 26. Amaral, D. G., Price, J. L., Pitka¨nen, A., & Carnmichael, S. T. (1992). Anatomical organization of the primate amygdaloid complex. In J. P. Aggleton (Ed.), The
238
Jukka M. Leppa¨nen and Charles A. Nelson
amygdala: neurobiological aspects of emotion, memory, and mental dysfunction (pp. 1 – 66). New York: Wiley-Liss. Angrilli, A., Mauri, A., Palomba, D., Flor, H., Birbaumer, N., Sartori, G., & di Paola, F. (1996). Startle reflex and emotion modulation impairment after a right amygdala lesion. Brain, 119, 1991 – 2000. Bachevalier, J., Brickson, M., Hagger, C., & Mishkin, M. (1990). Age and sex differences in the effects of selective temporal lobe lesion on the formation of visual discrimination habits in rhesus monkeys (Macaca mulatta). Behavioral Neuroscience, 104, 885 – 899. Bachevalier, J., & Mishkin, M. (1994). Effects of selective neonatal temporal lobe lesions on visual recognition memory in rhesus monkeys. Journal of Neuroscience, 14, 2128 – 2139. Baird, A. A., Gruber, S. A., Fein, D. A., Maas, L. C., Steingard, R. J., Renshaw, P. F., Cohen, B. M., & Yurgelun-Todd, D. A. (1999). Functional magnetic resonance imaging of facial affect recognition in children and adolescents. Journal of American Academy of Child & Adolescent Psychiatry, 38, 195 – 199. Balaban, M. T. (1995). Affective influences on startle in five-month-old infants: reactions to facial expressions of emotions. Child Development, 66, 28 – 36. Banks, M. S., & Salapatek, P. (1983). Infant visual perception. In M. M. Haith & J. J. Campos (Eds.), P. H. Mussen (Series Ed.), Handbook of child psychology: Vol. 2. Infancy and developmental psychobiology (pp. 435 – 571). New York: Wiley. Bauman, M. D., Lavenex, P., Mason, A., Capitanio, J. P., & Amaral, D. G. (2004). The development of social behavior following neonatal amygdala lesions in rhesus monkeys. Journal of Cognitive Neuroscience, 16, 1388 – 1411. Bentin, S., Allison, T., Puce, A., Perez, E., & McCarthy, G. (1996). Electrophysiological studies of face perception in humans. Journal of Cognitive Neuroscience, 8, 551 – 565. Blair, R. J. R. (2003). Facial expressions, their communicatory functions and neurocognitive substrates. Philosophical Transactions of the Royal Society of London: B., 358, 561 – 572. Bond, N. W., & Siddle, D. A. T. (1996). The preparedness account of social phobia: some data and alternative explanations. In R. M. Rapee (Ed.), Current controversies in the anxiety disorders (pp. 291 – 316). London: Guilford Press. Bornstein, M. H., & Arterberry, M. E. (2003). Recognition, discrimination and categorization of smiling by 5-month-old infants. Developmental Science, 6, 585 – 599. Boyatzis, C. J., Chazan, E., & Ting, C. Z. (1993). Preschool children’s decoding of facial emotions. Journal of Genetic Psychology, 154, 375 – 382. Bushnell, I. W. R. (1979). Modification of the externality effect in young infants. Journal of Experimental Child Psychology, 28, 211 – 229. Campanella, S., Gaspard, C., Debatisse, D., Bruyer, R., Crommelinck, M., & Guerit, J.-M. (2002a). Discrimination of emotional facial expressions in a visual oddball task: an ERP study. Biological Psychology, 59, 171 – 186. Campanella, S., Quinet, P., Bruyer, R., Crommelinck, M., & Guerit, J.-M. (2002b). Categorical perception of happiness and fear facial expressions: an ERP study. Journal of Cognitive Neuroscience, 14, 210 – 227. Camras, L. A., Grow, J. G., & Ribordy, S. (1983). Recognition of emotional expression by abused children. Journal of Clinical Child Psychology, 12, 325 – 328. Caron, R. F., Caron, A. J., & Myers, R. S. (1982). Abstraction of invariant face expressions in infancy. Child Development, 53, 1008 – 1015.
Facial Emotion Recognition
239
Caron, R. F., Caron, A. J., & Myers, R. S. (1985). Do infants see emotional expressions in static faces? Child Development, 56, 1552 – 1560. Cassia, V. M., Turati, C., & Simion, F. (2004). Can a nonspecific bias toward top-heavy patterns explain newborns’ face preference? Psychological Science, 15, 379 – 383. Critchley, H. D., Daly, E. M., Bullmore, E. T., Williams, S. C., Van Amelsvoort, T., Robertson, D. M., Rowe, A., Phillips, M., McAlonan, G., Howlin, P., & Murphy, D. G. (2000). The functional neuroanatomy of social behaviour: changes in cerebral blood flow when people with autistic disorder process facial expressions. Brain, 123, 2203 – 2212. Dawis, M., & Whalen, P. J. (2000). The amygdala: vigilance and emotion. Molecular Psychiatry, 6, 13 – 34. Dawson, G., Webb, S. J., Carver, L., Panagiotides, H., & McPartland, J. (2004). Young children with autism show atypical brain responses to fearful versus neutral facial expressions of emotion. Developmental Science, 7, 340 – 359. de Gelder, B., Teunisse, J.-P., & Benson, P. J. (1997). Categorical perception of facial expressions: categories and their internal structure. Cognition & Emotion, 11, 1 – 23. de Gelder, B., Vroomen, J., Pourtois, G., & Weiskrantz, L. (1999). Non-conscious recognition of affect in the absence of striate cortex. Neuroreport, 10, 3759 – 3763. de Haan, M., Humphreys, K., & Johnson, M. H. (2002). Developing a brain specialized for face perception: a converging methods approach. Developmental Psychobiology, 40, 200 – 212. de Haan, M., Johnson, M. H., & Halit, H. (2003). Development of face-sensitive eventrelated potentials during infancy: a review. International Journal of Psychophysiology, 51, 45 – 58. de Haan, M., Johnson, M. H., Maurer, D., & Perrett D. I. (2001). Recognition of individual faces and average face prototypes by 1- and 3-month-old infants. Cognitive Development, 16, 659 – 678. de Haan, M., & Nelson, C.A. (1998). Discrimination and categorization of facial expressions of emotion during infancy. In A. M. Slater (Ed.), Perceptual development: visual, auditory, and language perception in infancy (pp. 287 – 309). London: University College London Press. de Haan, M., & Nelson, C. A. (1999). Brain activity differentiates face and object processing in 6-month-old infants. Developmental Psychology, 35, 1113 – 1121. de Haan, M., Pascalis, O., & Johnson, M. H. (2002). Specialization of neural mechanisms underlying face recognition in human infants. Journal of Cognitive Neuroscience, 14, 199 – 209. Denham, S. A., & Couchoud, E. A. (1990). Young preschoolers’ understanding of emotions. Child Study Journal, 20, 171 – 192. Desimone, R. (1991). Face-selective cells in the temporal cortex of monkeys. Journal of Cognitive Neuroscience, 3, 1 – 8 Diego, M. A., Field, T., Jones, N. A., Hernandez-Reif, M., Cullen, C., Schanberg, S., & Kuhn, C. (2004). EEG responses to mock facial expressions by infants of depressed mothers. Infant Behavior & Development, 27, 150 – 162. Dimberg, U. (1990). Facial electromyography and emotional reactions. Psychophysiology, 27, 481 – 494. Dimberg, U., & Thunberg, M. (1998). Rapid facial reactions to emotional facial expressions. Scandinavian Journal of Psychology, 39, 39 – 45. Dimberg, U., Thunberg, M., & Elmehed, K. (2000). Unconscious facial reactions to emotional facial expressions. Psychological Science, 11, 86 – 89.
240
Jukka M. Leppa¨nen and Charles A. Nelson
Dimberg, U., Thunberg, M., & Grunedal, S. (2002). Facial reactions to emotional stimuli: Automatically controlled emotional responses. Cognition & Emotion, 16, 449 – 472. Eastwood, J. D., Smilek, D., & Merikle, P. (2001). Differential attentional guidance by unattended faces expressing positive and negative emotion. Perception & Psychophysics, 63, 1004 – 1013. Ekman, P. (1999). Facial expressions. In T. Dalgleish & M. J. Power (Eds.), Handbook of cognition and emotion (pp. 301 – 320). Chichester: John Wiley & Sons. Ekman, P., & Friesen, W. V. (1978). Facial action coding system. Palo Alto: Consulting Psychologists Press. Elfenbein, H. A., & Ambady, N. (2002). On the universality and cultural specificity of emotion recognition: a meta-analysis. Psychological Bulletin, 128, 203 – 235. Esteves, F., & O¨hman, A. (1993). Masking the face: recognition of emotional facial expressions as a function of the parameters of backward masking. Scandinavian Journal of Psychology, 34, 1 – 18. Etcoff, N. L., & Magee, J. J. (1992). Categorical perception of facial expression. Cognition, 44, 227 – 240. Farah, M. J. (1996). Is face recognition ‘‘special’’? Evidence from neuropsychology. Behavioural Brain Research, 76, 181 – 189. Farah, M. J., Rabinowitz, C., Quinn, G. E., & Liu, G. T. (2000). Early commitment of neural substrates for face recognition. Cognitive Neuropsychology, 17, 117 – 123. Field, T. M., Woodson, R., Greenberg, R., & Cohen, D., (1982). Discrimination and imitation of facial expression by neonates. Science, 218, 179 – 181. Fox, E., Lester, V., Russo, R., Bowles, R. J., Pichler, A., & Dutton K. (2000). Facial expressions of emotions: are angry faces detected more efficiently? Cognition & Emotion, 14, 61 – 92. Funayama, E. S., Grillon, C., Davis, M., & Phelps, E. A. (2001). A double dissociation in the affective modulation of startle in humans: effects of unilateral temporal lobectomy. Journal of Cognitive Neuroscience, 13, 721 – 729. Gallese, V., Keysers, C., & Rizzolatti, G. (2004). A unifying view of the basis of social cognition. Trends in Cognitive Sciences, 8, 396 – 403. Gauthier, I., Skudkarski, P., Gore, J. C., & Anderson, A. W. (2000). Expertise for cars and birds recruits brain areas involved in face recognition. Nature Neuroscience, 3, 191 – 197. Gauthier, I., Tarr, M. J., Anderson, A. W., Skudlarski, P., & Gore, J. C. (1999). Activation of the middle fusiform ‘‘face area’’ increases with expertise in recognizing novel objects. Nature Neuroscience, 2, 568 – 573. Goldman, A. I., & Sripada, C. S. (2005). Simulationist models of face-based emotion recognition. Cognition, 94, 193 – 213. Grelotti, D. J., Gauthier, I., & Schultz, R. T. (2002). Social interest and the development of cortical face specialization: what autism teaches us about face processing. Developmental Psychobiology, 40, 213 – 225. Grillon, C., Dierker, L., Snidman, N., Donzella, B., Dikel, T., Arriaga, R. I., Nelson, C. A., Kagan, J., & Merikangas, K. R. (1999). Startle potentiation by threat of aversive stimuli and darkness in children and adolescents: a multi-site study. International Journal of Psychophysiology, 32, 63 – 73. Gross, C. G. (1992). Representation of visual stimuli in inferior temporal cortex. Philosophical Transactions of the Royal Society of London: B, 29, 3 – 10. Hagger, C., Bachevalier, J., Macko, K. A., Kennedy, C., Sokoloff, L., & Mishkin, M. (1988). Functional maturation of inferior temporal cortex in infant rhesus monkeys. Neuroscience Abstracts, 14, 2.
Facial Emotion Recognition
241
Halgren, E., Raij, T., Marinkovic, K., Jousma¨ki, V., & Hari, R. (2000). Cognitive response profile of the human fusiform face area as determined by MEG. Cerebral Cortex, 10, 69 – 81. Halit, H., de Haan, M., & Johnson, M. H. (2003). Cortical specialisation for face processing: face-sensitive event-related potential components in 3- and 12-month-old infants. Neuroimage, 19, 1180 – 1193. Hamann, S. B., & Adolphs, R. (1999). Normal recognition of emotional similarity between facial expressions following bilateral amygdala damage. Neuropsychologia, 37, 1135 – 1141. Hamann, S. B., Stefanacci, L., Squire, L. R., Adolphs, R., Tranel, D., Damasio H., & Damasio, A. (1996). Recognizing facial emotion. Nature, 379, 497. Hansen, C., & Hansen, R. (1988). Finding the face in the crowd: an anger superiority effect. Journal of Personality & Social Psychology, 54, 917 – 924. Hart, A. J., Whalen, P. J., Shin, L. M., McInerney, S. C., Fischer, H., & Rauch, S. L. (2000). Differential response in the human amygdala to racial outgroup vs ingroup face stimuli. Neuroreport, 11, 2351 – 2355. Hasselmo, M. E., Rolls, E. T., & Baylis, G. C. (1989). The role of expression and identity in the face-selective responses of neurons in the temporal visual cortex of the monkey. Behavioural Brain Research, 32, 203 – 218. Haviland, J. M., & Lelwica, M. (1987) The induced affect response: 10-week-old infants’ responses to three emotion expressions. Developmental Psychology, 23, 97 – 104. Haxby, J. V., Hoffman, E. A., & Gobbini, M. I. (2000). The distributed human neural system for face perception. Trends in Cognitive Sciences, 4, 223 – 233. Haxby, J. V., Hoffman, E. A., & Gobbini, M. I. (2002). Human neural systems for face recognition and social communication. Biological Psychiatry, 51, 59 – 67. Haxby, J. V., Gobbini, M. I., Furey, M. L., Ishai, A., Schouten, J. L., & Pietrini, P. (2001). Distributed and overlapping representations of faces and objects in ventral temporal cortex. Science, 293, 2425 – 2430. Hess, U., Blairy, S., & Kleck, R. E. (1997). The intensity of emotional facial expressions and decoding accuracy. Journal of Nonverbal Behavior, 21, 241 – 257. Ishai, A., Ungerleider, L. G., Martin, A., Schouten, J. L., & Haxby, J. V. (1999). Distributed representation of objects in the human ventral visual pathway. Proceedings in the National Academy of Sciences of the United States of America, 96, 9379 – 9384. Kanwisher, N. G. (2000). Domain specificity in face perception. Nature Neuroscience, 3, 759 – 763. Kanwisher, N. G., McDermott, J., & Chun, M. M. (1997). The fusiform face area: a module in human extrastriate cortex specialized for face perception. Journal of Neuroscience, 17, 4302 – 4311. Karmiloff-Smith, A. (1998). Development itself is the key to understanding developmental disorders. Trends in Cognitive Sciences, 2, 389 – 398. Kawashima, R., Sugiura, M., Kato, T., Nakamura, A., Hatano, K., Ito, K., Fukuda, H., Kojima, S., & Nakamura, K. (1999). The human amygdala plays an important role in gaze monitoring. A PET study. Brain, 122, 779 – 783. Kennedy, H., Bullier, J., & Dehay, C. (1989). Transient projection from the superior temporal sulcus to area 17 in the newborn macaque monkey. Proceedings in the National Academy of Sciences of the United States of America, 86, 8093 – 8097.
242
Jukka M. Leppa¨nen and Charles A. Nelson
Kestenbaum, R., & Nelson, C. A. (1990). The recognition and categorization of upright and inverted emotional expressions by 7-month-old infants. Infant Behavior & Development, 13, 497 – 511. Kilgore, W. D. S., Oki, M., & Yurgelun-Todd, D. A. (2001). Sex-specific developmental changes in amygdala responses to affective faces. Neuroreport, 12, 427 – 433. Kotsoni, E., de Haan, M., & Johnson, M. H. (2001). Categorical perception of facial expressions by 7-month-old infants. Perception, 30, 1115 – 1125. Kuchuk, A., Vibbert, M., & Bornstein, M. H. (1983). The perception of smiling and its experiential correlates in three-month-old infants. Child Development, 57, 1054 – 1061. Kuhl, P. K. (2004). Early language acquisition: cracking the speech code. Nature Reviews Neuroscience, 8, 831 – 843. Lang, P. J., Bradley, M. M., & Cuthbert, B. N. (1990). Emotion, attention, and the startle reflex. Psychological Review, 97, 377 – 398. Lang, P. J., Davis, M., & O¨hman, A. (2000). Fear and anxiety: animal models and human cognitive psychophysiology. Journal of Affective Disorders, 61, 137 – 159. LeDoux, J. E. (2000). Emotion circuits in the brain. Annual Review of Neuroscience, 23, 155 – 184. Leppa¨nen, J. M., & Hietanen, J. K. (2004). Emotionally positive facial expressions are processed faster than negative facial expressions, but why? Psychological Research, 69, 22 – 29. Ludemann, P., & Nelson, C. A. (1988). The categorical representation of facial expressions by 7-month-old infants. Developmental Psychology, 24, 492 – 501. Lundqvist, D., Esteves, F., & O¨hman, A. (2004). The face of wrath: the role of features and configurations in conveying social threat. Cognition & Emotion, 18, 161 – 182. Machado, C. J., & Bachevalier, J. (2003). Non-human primate models of childhood psychopathology: the promise and the limitations. Journal of Child Psychology & Psychiatry & Allied Disciplines, 44, 64 – 87. Marcus, D. J., & Nelson, C. A. (2001). Neural bases and development of face processing in autism. CNS Spectrums, 6, 36 – 59. Maurer, D., & Salapatek, P. (1976). Developmental changes in the scanning of faces by young infants. Child Development, 47, 523 – 527. McKelvie, S. J. (1995). Emotional expression in upside-down faces: evidence for configurational and componential processing. British Journal of Social Psychology, 34, 325 – 334. Meltzoff, A. N., & Moore, M. K. (1997). Explaining facial imitation: a theoretical model. Early Development & Parenting, 6, 179 – 192. Mineka, S., & O¨hman, A. (2002). Phobias and preparedness: the selective, automatic, and encapsulated nature of fear. Biological Psychiatry, 52, 927 – 937. Monk, C. S., McClure, E. B., Nelson, E. E., Zarahn, E., Bilder, R. M., Leibenluft, E., Charney, D. S., Ernst, M., & Pine, D. S. (2003). Adolescent immaturity in attentionrelated brain engagement to emotional facial expressions. Neuroimage, 20, 420 – 428. Montague, D. P. F., & Walker-Andrews, A. S. (2001). Peekaboo: a new look at infants’ perception of emotion expressions. Developmental Psychology, 37, 826 – 838. Morris, J. S., de Gelder, B., Weiskrantz, L., & Dolan, R. J. (2001). Differential extrageniculostriate and amygdala responses to presentation of emotional faces in a cortically blind field. Brain, 124, 1241 – 1252. Morris, J. S., Friston, K. J., Buechel, C., Frith, C. D., Young, A. W., Calder, A. J., & Dolan, R. J. (1998). A neuromodulatory role for the human amygdala in processing emotional facial expressions. Brain, 121, 47 – 57.
Facial Emotion Recognition
243
Morris, J. S., Frith, C. D., Perrett, D. I., Rowland, D., Young, A. W., Calder, A. J., & Dolan, R. J. (1996). A differential neural response in the human amygdala to fearful and happy facial expressions. Nature, 383, 812 – 815. Morris, J. S., Ohman, A., & Dolan, R. J. (1999). A subcortical pathway to the right amygdala mediating ‘‘unseen’’ fear. Proceedings in the National Academy of Sciences of the United States of America, 96, 1680 – 1685. Morton, J., & Johnson, M. H. (1991). CONSPEC and CONLERN: a two-process theory of infant face recognition. Psychological Review, 98, 164 – 181. Narumoto, J., Okada, T., Sadato, N., Fukui, K., & Yonekura, Y. (2001). Attention to emotion modulates fMRI activity in human right superior temporal sulcus. Cognitive Brain Research, 12, 225 – 231 Nelson, C. A. (1987). The recognition of facial expressions in the first two years of life: mechanisms of development. Child Development, 58, 889 – 909. Nelson, C. A. (2001). The development and neural bases of face recognition. Infant & Child Development, 10, 3 – 18. Nelson, C. A., & Bloom, F. E. (1997). Child development and neuroscience. Child Development, 68, 970 – 987. Nelson, C. A., Bloom, F. E., Cameron, J., Amaral, D., Dahl, R., & Pine, D. (2002). An integrative, multidisciplinary approach to the study of brain-behavior relations in the context of typical and atypical development. Development & Psychopathology, 14, 499 – 520. Nelson, C. A., & de Haan, M. (1996). Neural correlates of infants’ visual responsiveness to facial expressions of emotion. Developmental Psychobiology, 29, 577 – 595. Nelson, C. A., & de Haan, M. (1997). A neurobehavioral approach to the recognition of facial expressions in infancy. In J. A. Russell & Ferna´ndez-Dols (Eds.), The psychology of facial expression (pp. 176 – 204). Cambridge, MA: Cambridge University Press. Nelson, C. A., & Dolgin, K. (1985). The generalized discrimination of facial expressions by 7-month-old infants. Child Development, 56, 58 – 61. Nelson, C. A., Morse, P. A., & Leavitt, L. A. (1979). Recognition of facial expressions by 7-month-old infants. Child Development, 50, 1239 – 1242. Nelson, C.A., & Snyder, K. (2005). The segregation of face and object processing in development: a model system of categorization? In L. Gershkoff-Stowe & D. Rakison (Eds.), Building object categories in developmental time: The 32nd Carnegie Cognition Symposium ( pp. 1 – 36). Mahwah, NJ: Lawrence Erlbaum. O¨hman, A. (2002). Automaticity and the amygdala: nonconscious responses to emotional faces. Current Directions in Psychological Science, 11, 62 – 66. O¨hman, A., Lundqvist, D., & Esteves, F. (2001). The face in the crowd revisited: a threat advantage with schematic stimuli. Journal of Personality & Social Psychology, 80, 381 – 396. Osterling, J. A., Dawson, G., & Munson, J. A. (2002). Early recognition of 1-year-old infants with autism spectrum disorder versus mental retardation. Development & Psychopathology, 14, 239 – 251. Parker, S. W., Nelson, C. A., & BEIP Core Group. (2005). The impact of early institutional rearing on the ability to discriminate facial expressions of emotion: an event-related potential study. Child Development, 76, 54 – 72. Pascalis, O., de Haan, M., & Nelson, C. A. (2002). Is face processing species-specific during the first year of life? Science, 296, 1321 – 1323. Phillips, R. D., Wagner, S. H., Fells, C. A., & Lynch, M. (1990). Do infants recognize emotion in facial expressions? Categorical and ‘‘metaphorical’’ evidence. Infant Behavior & Development, 13, 71 – 81.
244
Jukka M. Leppa¨nen and Charles A. Nelson
Pissiota, A., Frans, O., Michelgard, A., Appel, L., Langstrom, B., Flaten, M. A., & Fredrikson, M. (2003). Amygdala and anterior cingulate cortex activation during affective startle modulation: a PET study of fear. European Journal of Neuroscience, 18, 1325 – 1331. Pollak, S. D. (2002). Experience-dependent affective learning and risk for psychopathology in children. Annals in New York Academy of Sciences, 1008, 102 – 111. Pollak, S. D., Cicchetti, D., Hornung, K., & Reed, A. (2000). Recognizing emotion in faces: developmental effects of child abuse and neglect. Developmental Psychology, 36, 679 – 688. Pollak, S. D., Cicchetti, D., Klorman, R., & Brumaghim, J. T. (1997). Cognitive brain event-related potentials and emotion processing in maltreated children. Child Development, 68, 773 – 787. Pollak, S. D., & Kistler, D. J. (2002). Early experience is associated with the development of categorical representations for facial expressions of emotions. Proceedings of the National Academy of Sciences of the United States of America, 99, 9072 – 9076. Pollak, S. D., Klorman, R., Thatcher, J. E., & Cicchetti, D. (2001). P3b reflects maltreated children’s reactions to facial displays of emotion. Psychophysiology, 38, 267 – 274. Pollak, S. D., & Sinha, P., (2002). Effects of early experience on children’s recognition of facial displays of emotion. Developmental Psychology, 38, 784 – 791. Quinn, P. C. (2002). Category representation in young infants. Current Directions in Psychological Science, 11, 66 – 70. Rodman, H. R., O’Scalaidhe, S. P., & Gross, C. G. (1993). Response properties of neurons in temporal cortical visual areas of infant monkeys. Journal of Neurophysiology, 70, 1115 – 1136. Rodman, H. R., Skelly, J. P., & Gross, C. G. (1991). Stimulus selectivity and state dependence of activity in inferior temporal cortex of infant monkeys. Proceedings of the National Academy of Sciences of the United States of America, 88, 7572 – 7575. Sackett, G. P. (1966). Monkeys reared in isolation with pictures as visual input: Evidence for an innate releasing mechanism. Science, 154, 1468 – 1473. Schmolck, H., & Squire, L. R. (2001). Impaired perception of facial emotions following bilateral damage to the anterior temporal lobe. Neuropsychology, 15, 30 – 38. Serrano, J. M., Iglesias, J., & Loeches, A. (1992). Visual discrimination and recognition of facial expressions of anger, fear, and surprise in 4- to 6-month-old infants. Developmental Psychobiology, 25, 411 – 425. Serrano, J. M., Iglesias, J., & Loeches, A. (1995). Infants’ responses to adult static facial expressions. Infant Behavior & Development, 18, 477 – 482. Sherman, T. (1985). Categorization skills in infants. Child Development, 56, 1561 – 1573. Slater, A. M. (1993). Visual perceptual abilities at birth: implications for face perception. In B. de Boysson-Bardies & S. de Schonen (Eds.), Developmental neurocognition: speech and face processing in the first year of life. NATO ASI series D: Behavioural and social sciences (Vol. 69, pp. 125 – 134). New York: Kluwer Academic/Plenum Publishers. Sorce, J. F., Emde, R. N., Campos, J. J., & Klinnert, M. D. (1985). Maternal emotional signaling: its effect on the visual cliff behavior of 1-year-olds. Developmental Psychology, 21, 195 – 200. Sprengelmeyer, R., Young, A. W., Sprenglemeyer, A., Calder, A. J., Rowland, D., Perrett, D. I., Ho¨mberg, V., & Lange, H. (1997). Recognition of facial expressions:
Facial Emotion Recognition
245
selective impairment of specific emotions in Huntington’s disease. Cognitive Neuropsychology, 14, 839 – 879. Surakka, V., & Hietanen, J. K. (1998). Facial and emotional reactions to Duchenne and non-Duchenne smiles. International Journal of Psychophysiology, 29, 23 – 33. Tarr, M. J., & Gauthier, I. (2000). FFA: a flexible fusiform area for subordinate-level visual processing automatized by expertise. Nature Neuroscience, 3, 764 – 769. Teunisse, J. P., & de Gelder, B. (2001). Impaired categorical perception of facial expressions in high-functioning adolescents with autism. Child Neuropsychology, 7, 1 – 14. Thomas, K. M., Drevets, W. C., Whalen, P. J., Eccard, C. H., Dahl, R. E., Ryan, N. D., & Casey, B. J. (2001). Amygdala response to facial expressions. Biological Psychiatry, 49, 309 – 316. Tipples, J., Atkinson, A. P., & Young, A. W. (2002). The eyebrow frown: a salient social signal. Emotion, 2, 288 – 296. Tong, F., & Nakayama, K. (1999). Robust representations for faces: evidence from visual search. Journal of Experimental Psychology: Human Perception and Performance, 25, 1016 – 1035. Turati, C., Simion, F., Milani, I., & Umilta, C. (2002). Newborns’ preference for faces: what is crucial? Developmental Psychology, 38, 875 – 882. Vicari, S., Reilly, J. S., Pasqualetti, P., Vizzotto, A., & Caltagirone, C. (2000). Recognition of facial expressions of emotions in school-age children: the intersection of perceptual and semantic categories. Acta Paediatrica, 89, 836 – 845. Vuilleumier, P., Armony, J. L., Driver, J., & Dolan, R. J. (2003). Distinct spatial frequency sensitivities for processing faces and emotional expressions. Nature Neuroscience, 6, 624 – 631. Vuilleumier, P., Richardson, M. P., Armony, J. L., Driver, J., & Dolan, R. J. (2004). Distant influences of amygdala lesion on visual cortical activation during emotional face processing. Nature Neuroscience, 7, 1271 – 1278. Walker-Andrews, A. S. (1997). Infants’ perception of expressive behaviors: differentiation of multimodal information. Psychological Bulletin, 121, 437 – 456. Walker-Andrews, A. S., & Dickson, L. R. (1997). Infants’ understanding of affect. In S. Hala (Ed.), The development of social cognition. Studies in developmental psychology. (pp. 161 – 186). England: Psychology Press. Wang, A. T., Dapretto, M., Hariri, A. R., Sigman, M., & Bookheimer, S. Y. (2004). Neural correlates of facial affect processing in children and adolescents with autism spectrum disorder. Journal of American Academy of Child & Adolescent Psychiatry, 43, 481 – 490. Webster, M. J., Ungerleider, L. G., & Bachevalier, J. (1991). Connections of inferior temporal areas TE and TEO with medial temporal-lobe structures in infant and adult monkeys. Journal of Neuroscience, 11, 1095 – 1116. Whalen, P. J. (1998). Fear, vigilance, and ambiguity: initial neuroimaging studies of the human amygdala. Current Directions in Psychological Science, 7, 177 – 188. Whalen, P. J., Rauch, S. L., Etcoff, N. L., McInerney, S. C., Lee, M. B., & Jenike, M. A. (1998). Masked presentations of emotional facial expressions modulate amygdala activity without explicit knowledge. Journal of Neuroscience, 18, 411 – 418. White, M. (1999). Representation of facial expressions of emotion. American Journal of Psychology, 112, 371 – 381. White, M. (2000). Parts and wholes in expression recognition. Cognition & Emotion, 14, 39 – 60.
246
Jukka M. Leppa¨nen and Charles A. Nelson
Widen, S., & Russell, J. A. (2003). A closer look at preschoolers’ freely produced labels for facial expressions. Developmental Psychology, 39, 114 – 128. Winston, J. S., O’Doherty, J., & Dolan, R. J. (2003). Common and distinct neural responses during direct and incidental processing of multiple facial emotions. Neuroimage, 20, 84 – 97. Wismer Fries, A. B., & Pollak, S. D. (2004). Emotion understanding in postinstitutionalized Eastern European children. Development & Psychopathology, 16, 355 – 369. Yang, T. T., Eliez, S., Blasey, C., White, C. D., Reid, A. J., Gotlib, I. H., & Reiss, A. L. (2002). Amygdalar activation associated with positive and negative facial expressions. Neuroreport, 13, 1737 – 1741. Yang, T. T., Menon, V., Reid, A. J., Gotlib, I. H., & Reiss, A. L. (2003). Amygdalar activation associated with happy facial expressions in adolescents: a 3-T Functional MRI study. Journal of American Academy of Child & Adolescent Psychiatry, 42, 979 – 985. Young, A. W., Aggleton, J. P., Hellawell, D. J., Johnson, M., Broks, P., & Hanley, J. R. (1995). Face processing impairments after amygdalotomy. Brain, 118, 15 – 24. Young, A. W., Rowland, D., Calder, A. J., Etcoff, N. L., Seth, A., & Perrett, D. I. (1997). Facial expression megamix: tests of dimensional and category accounts for emotion recognition. Cognition, 63, 271 – 313.
CHILDREN’S SUGGESTIBILITY: CHARACTERISTICS AND MECHANISMS
Stephen J. Ceci DEPARTMENT OF HUMAN DEVELOPMENT, CORNELL UNIVERSITY, ITHACA, NY 14853, USA
Maggie Bruck DEPARTMENT OF PSYCHIATRY AND BEHAVIORAL SCIENCES, JOHNS HOPKINS UNIVERSITY, BALTIMORE, MARYLAND 21287, USA
I. DEFINITIONAL ISSUES II. INTERVIEWER BIAS: THE CENTRAL CHARACTERISTIC OF SUGGESTIVE INTERVIEWS
A. SUGGESTIVE INTERVIEWING TECHNIQUES B. SUMMARY AND QUALIFICATIONS III. MECHANISMS UNDERLYING CHILDREN’S SUGGESTIBILITY
A. COGNITIVE PREDICTORS OF SUGGESTIBILITY B. PSYCHOSOCIAL PREDICTORS OF SUGGESTIBILITY C. DISCUSSION OF PSYCHOLOGICAL MECHANISMS UNDERLYING CHILDREN’S SUGGESTIBILITY IV. SUMMARY: CHILD VS. SITUATIONAL VARIABLES
REFERENCES
Since the 1980s, research on the topic of child witnesses has been one of the fastest-growing areas in all of developmental psychology. This is mainly because it calls for the expertise of researchers across all areas of development. Peruse developmental journals and you will find articles dealing with child witnesses that are written by researchers from the subfields of cognitive, neurobiological, perceptual, linguistic, and social development. The sheer expansiveness of the topic makes it difficult to synthesize. In this Advances chapter we review major findings and trends in one sub-area of the child witness literature, the suggestibility of the child witness’ report. As will be seen, even this sub-area involves criss-crossing between studies concerning cognitive, social, and biological developments because changes in each of these domains has an implication for the study of developmental differences in suggestibility. In this chapter, we attempt to move the 247 Advances in Child Development and Behavior R Kail (Editor)
ß 2006 Elsevier B.V. All rights. reserved.
248
Stephen J. Ceci and Maggie Bruck
discussion of suggestibility away from its traditional focus on leading questions, to a broader concept called ‘‘interviewer bias’’ and its affiliated manifestations. In tackling this goal, we review a representative sample of the large corpus of studies dealing with the various elements of suggestive interviewing. Following this review, we describe the emerging literature on cognitive and social predictors of suggestibility, focusing on those studies that highlight major avenues of findings. First, however, we discuss the applied and theoretical factors that have spurred the enormous increase of studies on children’s suggestibility. There are several related factors that account for the large increase of this field of study. In the United States alone, over a million children are interviewed each year by law enforcement, mental health, and social service professionals. Children are interviewed primarily because they are either witnesses or victims of criminal events, or because they are involved in civil proceedings regarding family placement (e.g., custody cases), or because of pending abuse or neglect investigations. Clearly, it behooves researchers and professionals to understand the psychological factors that influence the reliability of these children’s statements. This research has had a large impact on the treatment of children in the forensic arena (for example, the development of structured interview protocols). It also had an impact on the testimony of experts who are often called to court to provide scientifically based testimony about the reliability of specific interviewing techniques used with child witnesses, and the findings have been invoked in many legal contexts, (e.g., California v. Raymond Buckey et al. (1990); Commonwealth of Massachusetts v Cheryl Amirault LeFave (1997); New Jersey v. Michaels (1994); North Carolina v. Robert Fulton Kelly Jr. (1995); State v. Fijnje, 11th Judicial Circuit Court, Dade Country, Florida (1995); Lillie and Reed v Newcastle City Council & Ors (2002); Commonwealth of Pennsylvania v. Gerald J. Delbridge (2004)). Notwithstanding its legal relevance, however, child witness research is grounded in developmental theory and in some cases, has produced innovative developmental paradigms, theories, and frameworks. Below is a partial list of the theoretical strands that run through much of the work on this topic. . Weaker encodings and impoverished representations are more easily
overwritten by suggestions than stronger, more elaborate ones (Brainerd et al., 1990; Bruck & Ceci, 1999; Pezdek & Roe, 1995). . Young children’s limited ability to monitor and regulate their memory, possibly the result of immature maturation of the frontal cortices, leads to source misattributions in which they confuse suggestions with experiences (Ceci & Bruck, 1995; Poole & Lindsay, 1995, 2001).
Children’s Suggestibility
249
. Limited symbolic representational ability leads to confusions when children
are interviewed with props, drawings, and dolls (Bruck, Melnyk, & Ceci, 2000; Deloache & Smith, 1999). . Social influences, such as the eagerness to please adult authority figures, can mask and sometimes even change event memory (Ceci, Ross, & Toglia, 1987; Poole & White, 1991). . Limited appreciation of pragmatics/conversational principles can result in young children giving ambiguous or misleading statements to interviewers (Battin & Ceci, 2003). . Encoding, storage, and retrieval of episodic information is influenced by knowledge, beliefs, and goals (e.g., Ornstein et al., 1998; Paris & Lindauer, 1977; Schacter, Norman, & Koutstall, 1996). Thus, although this area of research can have profound practical consequences, it is often a source of basic research as well. In this chapter, we summarize the major findings in the field of children’s suggestibility focusing on the parameters of the phenomenon, the potential mechanisms that underlie suggestibility, as well as some qualifications about the generalizability of the research. We begin by providing the definition of ‘‘suggestibility.’’
I. Definitional Issues In what was to become a widely accepted definition, Gudjonsson (1986) defined suggestibility as ‘‘the extent to which individuals come to accept and subsequently incorporate post-event information into their memory recollections’’ (p. 195). This definition is explicitly cognitive, focusing on memory incorporation of suggestions. Ceci and Bruck (1993) proposed a broader view of suggestibility that refers to ‘‘the degree to which encoding, storage, retrieval, and reporting of events can be influenced by a range of social and psychological factors’’ (p. 404). This latter definition permits the possibility of conscious acquiescence to social demands and lying, as well as the alteration of underlying memory that was the hallmark of the 1986 definition. Moreover, the 1993 definition expanded the realm of suggestibility to include factors occurring not only post-event, but also before and during an event, such as when a child is exposed to a stereotype prior to an event or when children’s generalized knowledge influences how they fill in missing gaps. Despite the broadening of the definition, until the end of the 1980s, most developmental studies of suggestibility focused on children’s answers to misleading questions or to children’s incorporation of misleading suggestions into their reports. This focus on leading questions and misinformation did not, however, capture the many elements of interviews that were conducted in a
250
Stephen J. Ceci and Maggie Bruck
number of legal cases involving child witnesses and that raised concern about their potential to taint the child’s reports. To deal with the discrepancy between the structure and content of interviews in actual cases and the explicit intent of definitions of suggestibility, we have broadened our model again. Currently, we use the term ‘‘suggestive interview’’ to refer to conditions that produce suggestibility, and we have hypothesized that the concept of ‘‘interviewer bias’’ is central to the characterization of suggestive interviews. As will be seen, this new model goes far beyond the asking of leading questions, to capture the behaviors of interviewers who have strong expectations and shape the interview to confirm them by ignoring contradictory evidence and selectively reinforcing consistent evidence. Thus, interviewer bias is a broadly inclusive concept that covers not only traditional forms of suggestion (leading questions, pressurized interrogations, repeated questions, misleading postevent information) but also various factors that have been shown to damage report accuracy in the absence of traditional suggestions (e.g., failure to test alternative hypotheses, selective reinforcement of responses, posing hypotheticals, inducing imagery, stereotypes etc.).
II. Interviewer Bias: The Central Characteristic of Suggestive Interviews Interviewer bias refers to interviewers who hold a priori beliefs about the occurrence of certain events and, consequently, shape the interview to elicit confirmatory statements from the interviewee (Ceci & Bruck, 1995). An interviewer may be a parent, police officer, child protective service worker or any adult who is attempting to gather information from a child. One of the hallmarks of ‘‘interviewer bias’’ is the single-minded attempt to gather only the confirmatory evidence and to avoid asking questions that may lead to contradictory evidence; a hallmark is the failure to pursue alternate explanations that are inconsistent with a ‘‘pet’’ hypothesis. When provided with inconsistent or bizarre evidence from the child, biased interviewers either ignore it or else interpret it within the framework of their initial hypothesis. This belief is transmitted to the child via a range of suggestive techniques that are associated with the elicitation of false reports, as we illustrate subsequently. Consequently, the child may come to inaccurately report what the interviewer believes rather than the child’s own experience. The following studies provide examples of how ‘‘interviewer bias’’ is studied in the laboratory. Thompson, Clarke-Stewart, and Lepore (1997) conducted a study in which children viewed a staged event that could be construed as either abusive or innocent. Some children interacted with a confederate named ‘‘Chester’’ as he cleaned some dolls in a playroom. Other children interacted with Chester as he
Children’s Suggestibility
251
handled the dolls roughly and in a mildly abusive manner. The children were questioned about this event. The interviewer was either (1) ‘‘accusatory’’ (suggesting that the janitor had been inappropriately playing with the dolls instead of working), (2) ‘‘exculpatory’’ (suggesting that the janitor was just cleaning them and not playing), or (3) ‘‘neutral’’ and non-suggestive. In the first two types of interviews, the questions changed from mildly to strongly suggestive as the interview progressed. Following the first interview, children were asked to tell what they had witnessed and then asked questions about the event. Immediately after the interview and two weeks later, children were asked by their parents to recount what the janitor had done, in the absence of the interviewer. When questioned by a neutral interviewer, or by an interviewer whose interpretation was consistent with the activity viewed by the child, children’s accounts were both factually correct and consistent with the janitor’s script. However, when the interviewer was biased in a direction that contradicted the activity viewed by the child, children’s stories quickly conformed to the suggestions or beliefs of the interviewer. In addition, children’s answers to interpretive questions (e.g., ‘‘Was he doing his job or just being bad?’’) were consistent with the interviewer’s point of view, as opposed to what actually happened. When asked neutral questions by their parents, the children’s answers remained consistent with the interviewers’ biases, indicating the long-term damage wrought by suggestive interviewers. Bruck et al. (1999) showed how interviewer bias can quickly develop in natural interviewing situations, and how it not only taints the responses of child interviewees but also the reports of the adult interviewers. In this study, a special event, a surprise birthday party, was staged for preschool children in their school. In groups of three and with the guidance of research assistant, the children surprised a research assistant, played games, ate food, and watched magic tricks. Another group of children did not attend the birthday party but in groups of two, they simply colored a picture with research assistants. These children were told that it was the assistant’s birthday. Interviewers (who were recruited from university graduate degree programs in social work or counseling and who had training and experience in interviewing children) were asked to question four children about what had happened when special visitors came to the school. The interviewers were not told about the events but were simply told to find out from each child what had happened. The first three children that each interviewer questioned attended the birthday party and the fourth child attended the coloring event. Bruck et al. (1999) found that the fourth children (those who attended the coloring event and were interviewed last) produced twice as many errors as the children who attended the birthday party; 60% of the children who only colored made false claims that involved a birthday party. This result suggests that the interviewers had built up a bias that all the children had attended
252
Stephen J. Ceci and Maggie Bruck
a birthday party. By the time they interviewed the child, they structured their interviews to elicit claims consistent with this hypothesis. Thus if interviewers believe that all the children they are interviewing have experienced a certain event, then many of the children will come to make such claims even though they were non-participants (or non-victims). Another important finding was that even when children denied attending a birthday party, 84% of their interviewers later reported that all the children they interviewed had attended a birthday party. These interviewers did not consider the possibility (either at the time of their interview with the children or at a later time when they were asked about what happened) that the fourth child did not participate in the special event. These data suggest that regardless of what children actually say, biased interviewers inaccurately report the child’s claims, making them consistent with their own hypotheses. These and similar other studies provide evidence that interviewers’ beliefs about an event can influence their judgments as well as their style of questioning. This, in turn, can affect the accuracy of children’s testimony. These data highlight the dangers of having only one hypothesis about the event in question–– especially when this hypothesis is incorrect.
A. SUGGESTIVE INTERVIEWING TECHNIQUES
What are the mechanisms through which interviewer bias operates? Research indicates that interviewer bias influences the entire architecture of interviews and it is revealed through a number of different component features that we term ‘‘suggestive techniques.’’ As we describe in the next sections, the results of the scientific literature indicate that the use of these suggestive techniques, especially in the hands of biased interviewers, and especially when used in combination, can bring children to make convincing claims about events they have never experienced. We have not attempted to provide a comprehensive review of this very large literature; rather we summarize studies that illustrate the influence of some of the most commonly studied suggestive techniques. 1. Types of Questions To confirm their suspicions, biased interviewers may avoid asking children ‘‘open ended’’ questions such as, ‘‘What happened?’’ but instead quickly resort to a barrage of very specific questions, which require the child to provide one-word answers (e.g., ‘‘yes’’ or ‘‘no’’). Sometimes the questions are leading (for example, asking the child ‘‘Where did your teacher touch you?’’ is leading if the child never previously mentioned touching by the teacher), and the questions are often repeated until the child provides a desired response.
Children’s Suggestibility
253
Although the strategy of using specific questions, leading questions, and of repeating questions ensures that the child will provide information, it is also problematic because children’s answers to these types of questions are often inaccurate. For example, Peterson and Bell (1996) interviewed children after they had been treated in an emergency room for a traumatic injury. They were first asked free recall questions (‘‘Tell me what happened’’). Then to obtain additional information, children were asked more specific questions (e.g., ‘‘Where did you hurt yourself?’’ or ‘‘Did you hurt your knee?’’). Peterson and Bell found that children were most likely to accurately provide important details during free recall. Across all age groups, errors increased when children were asked more specific questions. The percentage of errors elicited by free recall and specific questions was 9 and 45%, respectively. Specific questions include yes/no questions (‘‘Did the lady have a dog?’’) and forced choice questions (e.g., ‘‘Was it the man or the woman?’’). Use of these questions is risky because children rarely reply ‘‘I don’t know’’ even when explicitly told this is an option (Peterson & Grant, 2001) and even when the question is nonsensical and incomprehensible (Hughes & Grieve, 1980; Waterman, Blades, & Spencer, 2000). One of the reasons that children so willingly provide answers to specific yes/no or to forced choice questions, even though they may not know the answer or understand the question, is that young children are cooperative conversational partners: They perceive their adult interviewer as truthful, and not deceptive. To comply with a respected adult, children sometimes attempt to make their answers consistent with what they see as the intent of the questioner rather than consistent with their knowledge of the event (see Ceci & Bruck, 1993, for a review). Because of this compliant, cooperative characteristic, and because of young children’s poor performance on specific questions, it is particularly important in interviews to avoid these types of specific and forced-choice questions until after the child has first provided an exhaustive free recall. 2. Repeated Interviews and Questions In formal investigations, children are often interviewed on many different occasions. There are numerous concerns about the influence of these repeated interviews on children’s reports, especially when conducted by biased interviewers. As shown subsequently, the results of several studies indicate that repeated questioning and interviewing in suggestive interviews increases the number of false allegations. For example, preschool children were interviewed on five different occasions about two true events and two false events (Bruck, Ceci, & Hembrooke, 2002). The two true events involved the child helping a visitor to the school who had tripped and hurt her ankle, and a recent incident where the child was punished by the teacher or the parent. The two false events involved helping a lady find her monkey and witnessing a man steal food from their school. In the first interview,
254
Stephen J. Ceci and Maggie Bruck
children were simply asked if each event had ever happened. If they said yes, they were asked to describe the event. During the next three interviews, the children were suggestively interviewed (e.g., they were asked repeated leading questions, they were praised for responses, they were asked to try to think about what might have happened; they were told that their friends had already disclosed and now it was their turn). In the fifth interview, a new interviewer questioned each child about each event in a nonsuggestive manner. Across the five interviews, all children consistently and accurately assented to the true event about helping a lady who fell. However, children were at first reluctant to talk about the true punishment event; many of the children denied that the punishment had occurred. With repeated suggestive interviews, increasing numbers of children agreed that the punishment had occurred. Similar patterns of disclosure occurred for the two false events; that is, children initially correctly denied the false events but with repeated suggestive interviews, they began to assent to these events. By the third interview, most children had assented to all the true and false events. This pattern continued to the end of the study (see also Scullin, Kanaya, & Ceci, 2002). One of the rationales for re-interviewing children is that it provides them additional opportunities to report important information that was forgotten or simply not reported in earlier interviews. Thus, it is assumed that when children provide new details in subsequent interviews, these new reports are accurate memories that were not remembered in previous interviews. Another rationale is to allow children to rehearse, so that their memories will not fade. However, the results of studies dispute the claim that new details provided by children in subsequent interviews are accurate. One set of studies consistently shows that reports which emerge in a child’s first interview with a neutral interviewer are the most accurate. When children are later interviewed about the same event and report new details not mentioned in a previous interview, the newer details are less accurate than those repeated from the first interview (Peterson, Moores, & White, 2001; Pipe et al., 1999; Salmon & Pipe, 2000). In some studies, the inaccuracy rates of new inserted details in neutrally conducted interviews rise to a level of 50% (Peterson, Moores, & White, 2001; Salmon & Pipe, 2000). Similar results are obtained when children are suggestively questioned about an actual event (Bruck, Ceci, & Hembrooke, 2002; Scullin, Kanaya, & Ceci, 2002). Thus, insertion of new but inaccurate details can be a natural memory phenomenon, it can be due to prior suggestions that become incorporated into later reports, but it can also be due to the demand characteristics of the interview. When interviewers urge children to tell them anything (that is consistent with the bias of the interviewer), these requests for additional information will sometimes result in false reports that are supplied by the child to comply with their perception of the interviewer’s wishes. Although these studies show the detrimental effects of repeated interviews, there is an important qualification to this conclusion: In a number of studies,
Children’s Suggestibility
255
children who are provided with misinformation across multiple interviews are no more likely to incorporate this information into a later report than children who only receive one suggestive interview. The causal factor has to do with the timing of the suggestive interviews: If the first suggestive interview occurs soon after an event and the second interview occurs close to the final interview, then misinformation effects are maximized (Melnyk & Bruck, 2004). However, two caveats are in order. First, as shown in some studies, a single suggestion can damage a child’s report accuracy. Second, several suggestive interviews may be needed to elicit additional false details and embellishments (see Bruck, Ceci, & Hembrooke, 2002, in which repeated suggestive interviews were associated with the addition of false details particularly for false events). Just as there are risks associated with repeated interviews, there are also risks associated with repeating questions within the same interview. Biased interviewers sometimes repeatedly ask the same question until the child provides a response consistent with their hypothesis. Poole and White (1991) found that repeating the same question within an interview, especially a yes/no question, often results in young children changing their original answer (see Cassel, Roebers, & Bjorklund, 1996, for similar effects when children are asked repeated leading questions). Furthermore, when children are asked the same question on numerous occasions, they sound increasingly confident about their statements even if such statements are false. Apparently, when interviews contain a preponderance of (mis)leading questions, children initially resist the suggested response, but with repeated misleading questions (that differ in content), their resistance dissipates. Garven et al. (1998) found that preschoolers provided increasingly inaccurate responses to misleading statements and questions as the suggestive interview proceeded. In this study, children were suggestively interviewed for 5 to 10 min about a stranger who came to read their class a story. As a result of the suggestive devices used by Garven and her colleagues, children falsely claimed that the visitor said a bad word, that he threw a crayon, that he broke a toy, that he stole a pen, that he tore a book, and that he bumped the teacher. Importantly, the children came to make more false claims as the interview progressed: That is, within a 5 to 10 min interview, children made more false claims in the second half than in the first half of the interview. Not simply repeating questions but repeating questions about a specific theme (e.g., the visitor doing bad things) may compromise the reliability of children’s reports. 3. Coerced Confabulation In some actual cases a child witness is asked repeated questions until the child provides answers even if the child had initially protested that she did not know the answer (i.e., the child was required to give an answer). This practice raises the following question. If children were forced to knowingly provide an inaccurate
256
Stephen J. Ceci and Maggie Bruck
answer to a question, would they correct this inaccuracy at the next interview? To illustrate, if a child first denied abuse until he was coaxed to provide abuse-related information, would the child later recant the abuse-consistent disclosure? According to two studies, children do not later recant an inaccurate answer. Zaragoza and her colleagues (Ackil & Zaragoza, 1998; Zaragoza et al., 2001) found that when children are forced to provide an incorrect answer (confabulation), later they not only continue to provide the same incorrect answer but they actually come to believe in the validity of the wrong answer. For example, Zaragoza et al. (2001) had children visit a laboratory and play computer games; during this time a handyman came into the room and fixed some broken things. Immediately after, the child was asked a number of questions about the event and was told to provide an answer no matter what. Children were asked questions about things that really happened as well as misleading questions about things that did not happen. For example, children were asked how the handyman had broken the videotape (he had not even touched the videotape). Although most children correctly claimed that he had not broken the videotape, they were told to answer anyway. Sometimes it took several rounds of coaching to get the children to provide an incorrect answer. Two weeks later, these children were brought back to the laboratory and informed by a new, neutral interviewer that the original experimenter had made some mistakes and had asked them questions about things that never happened. Children were asked to report only what they actually saw. In this interview, children at all ages (6 – 10 years) reported that the confabulated false items had actually occurred (claiming that they saw it): 60% of the 6-year-old children now reported in the neutral interviewer that they saw the handyman break the videotape. This study shows that no matter how resistant to misleading questions children may be in an earlier interview, with sufficient pressure not only will they come to report this false information in a later interview, but also they will report that it actually happened. Using a milder version of the forced confabulation interview, similar dramatic suggestibility effects have been reported by Finnila et al. (2003) and by Bruck et al. (in press). In summary, if children’s allegations are elicited by specific leading questions that have been repeated within the same interview or across interviews, there is a high risk that the children’s statements will be unreliable. Conversely, children’s answers to open-ended questions asked prior to any suggestive interviewing have a high probability of being accurate. 4. Emotional Atmospherics Interviewers can use verbal and nonverbal cues to communicate their bias. These cues can set the emotional tone of the interview. Research shows that children are quick to notice the emotional tones in an interview and that they
Children’s Suggestibility
257
act accordingly. For example, in some studies when an accusatory tone is set by the examiner, (e.g., ‘‘It isn’t good to let people kiss you in the bathtub,’’ or ‘‘Don’t be afraid to tell,’’ or ‘‘You’ll feel better once you’ve told me’’), children are likely to fabricate reports of past events even in cases when they have no memory of any event occurring. In some cases, these fabrications are sexual in nature. In one such study, children played with an unfamiliar research assistant for five minutes while seated across a table from him. Four years later, researchers asked these same children to recall the original experience (Goodman et al., 2002). The researchers created ‘‘an atmosphere of accusation,’’ telling the children that they were to be questioned about an important event and saying things like, ‘‘Are you afraid to tell? You’ll feel better once you’ve told.’’ Although few children had any memory of the original event from four years earlier, five out of the 15 children incorrectly agreed with the interviewer’s suggestive question that they had been hugged or kissed by the confederate, two of the fifteen agreed that they had their picture taken in the bathroom, and one child agreed that she or he had been given a bath. Thus, children may give inaccurate responses to misleading questions about events for which they have no memory when the interviewer creates an emotional tone of accusation. 5. Rewards and Punishments Rewards and punishments shape the emotional tone of an interview and provide another means for the interviewer to express her bias. The use of rewards and punishments in interviews with children can be beneficial by motivating children to tell the truth. However, there may also be negative consequences; children may learn that if they produce stories consistent with an interviewer’s beliefs, the interviewer will reward them. A study conducted by Garven, Wood, and Malpass (2000) illustrates how the use of rewards and punishments in an interview can quickly shape the child’s behavior and have long lasting consequences. Children between the ages of 5 to 7 attended a special story time led by a visitor called Paco. During this 20-minute visit, Paco read the children a story, handed out treats, and placed a sticker on the child’s back. One week after the visit, the children were asked mundane questions (‘‘Did Paco break a toy?’’) and fantastic questions (‘‘Did Paco take you somewhere in an helicopter?’’). In the neutral-no reinforcement condition, children were simply asked a list of 16 questions and provided no feedback after each question. In the reinforcement condition, children were asked the same 16 questions, but they were provided with feedback after each question, as illustrated by the following example: Interviewer: Did Paco take you somewhere on a helicopter? Child: No (Note. This is an accurate denial) Interviewer: You’re not doing good. Did Paco take you to a farm?
258
Stephen J. Ceci and Maggie Bruck
Child: Yes. (Note this is an incorrect assent) Interviewer: Great. You’re doing excellent now. (The next question is asked) The reinforcement had large negative effects on children’s accuracy. Children in the reinforcement condition inaccurately assented to 35% of the misleading mundane questions and to 52% of the misleading fantastic questions. The rates for the non-reinforcement group were 13 and 15%, respectively. In a second interview, a week later, conducted by a new interviewer, all children were asked the same questions, without any reinforcement. The same high error rates continued for the reinforcement children. When children were challenged and asked in the second interview, ‘‘Did you see that or just hear about that?’’ children in the reinforcement group stated that they had personally observed 25% of the misleading mundane events and 30% of the misleading fantastic events. Children in the non-reinforcement group only made these claims 4% of the time. These findings show that reinforcing statements can quickly shape children to provide inaccurate responses no matter how bizarre the question. Furthermore, the inaccurate responses persist upon a second questioning with a number of children claiming that they actually observed the suggested but false event. 6. Stereotype Induction Suggestions do not always take the form of explicit (mis)leading questions such as, ‘‘Your Dad was mad, right?’’ One suggestive interviewing technique involves the induction of negative stereotypes by telling a child that the suspect ‘‘does bad things.’’ As the following study by Lepore and Sesco (1994) shows, some children incorporate this negative information into their reports. In this study 4- to 6-year-old children played games with a man called Dale, who also asked the child to help him take off his sweater. Later, an interviewer asked the child to tell her everything that happened with Dale. For half the children, the interviewer maintained a neutral stance whenever they recalled an action. For the remaining children, the interviewer re-interpreted each of the child’s responses in an incriminating way by stating, ‘‘He wasn’t supposed to do or say that. That was bad. What else did he do?’’ Thus, in this condition, the bias that Dale had misbehaved was induced. At the conclusion of these incriminating procedures, children heard three misleading statements about things that had not happened. (‘‘Didn’t he take off some of your clothes, too?’’ ‘‘Other kids have told me that he kissed them, didn’t he do that to you?’’ and, ‘‘He touched you and he wasn’t supposed to do that, was he?’’) All children were then asked a series of direct questions, requiring ‘‘yes’’ or ‘‘no’’ answers, about what had happened with Dale. Children in the incriminating condition gave many more inaccurate responses to the direct yes or no questions than children in the neutral condition.
Children’s Suggestibility
259
Interestingly, one-third of the children in the incriminating condition embellished their responses to these questions, and the embellished responses were always in the direction of the incriminating suggestions. The question that elicited the most frequent embellishments was: ‘‘Did Dale ever touch other kids at the school?’’ Embellishments to this question included information about who Dale touched (e.g., ‘‘He touched Jason, he touched Tori, and he touched Molly.’’), where he touched them (e.g., ‘‘He touched them on their legs.’’), how he touched them (e.g., ‘‘. . . and some he kissed . . . on the lips’’), and how he took their clothes off (‘‘Yes, my shoes and my socks and my pants. But not my shirt.’’). When they were re-interviewed one week later, children in the incriminating condition continued to answer the yes/no questions inaccurately and they continued to embellish their answers. Similar findings with other designs have also revealed powerful negative effects of stereotypes (e.g., Leichtman & Ceci, 1995). 7. Peer Conformity Pressure and Rumors The effect of telling children that their friends have ‘‘already told’’ is a much less investigated area. Certainly, the common wisdom is that a child will go along with a peer group; but will a child provide an inaccurate response just so he or she can be one of the crowd? The most recent and most relevant studies in the literature suggest that the answer is yes (Principe & Ceci, 2002; Principe et al., 2006). In the Principe and Ceci (2002) study, preschoolers in groups ranging in size from 6 to 8 took part in a contrived ‘‘dig’’ with a fictitious archaeologist named Dr. Diggs. Children used plastic hammers to dig pretend artifacts (e.g., dinosaur bones, gold coins). Dr. Diggs also showed the children two special artifacts––a map to a buried treasure and a rock with a secret message. The children were warned not to touch these because they could be ruined. All children in the study participated/viewed these core events. However, one-third of the children also saw Dr. Diggs ruin the two special artifacts (heretofore referred to as the target activities) and get upset about their loss. A second third of the children were the classmates of those in the first group, but did not witness the extra target activities. The remaining children were neither the classmates of those who witnessed the target activities, nor did they witness the target activities themselves, and thus served to provide a baseline against which to assess the effects of peer contact. Following the dig, children were interviewed in either a neutral or suggestive manner on three occasions. The suggestive interviews for children who did not view Dr. Diggs ruining the special artifacts provided children with this information. Children in the neutral interview were not given information about the two target events. In a later interview, children were asked questions about the ‘‘dig.’’ Children who had not reported the two absent special activities were prompted to tell more by telling them that their friends had already told.
260
Stephen J. Ceci and Maggie Bruck
Children who were classmates of those who saw Dr. Diggs ruining the two artifacts were more likely to claim that they had viewed the target activities (i.e., they incorporated the misinformation from the previous interview) than children who did not view the special activities and were in another classroom. These data suggest that there was contamination from classroom interactions; children who had not experienced the target events learned of them from their classmates and thus were more likely to assent to false events. Finally, telling children that their friends had told increased their false assent rate. Principe et al. (2006) conducted another study in which they found that children who overheard another child talking were as likely to falsely claim to have seen the event in question (a rabbit that escaped from a magician) as were peers who actually saw it escape. Moreover, in this study, the effect of suggestive questioning did not notably increase their false reports; they were as likely to report falsely if they overheard peers talking about the rabbit, regardless of whether interviewers employed suggestive questions. 8. Combining Suggestive Techniques For ease of exposition, we attempted to discretely categorize a number of suggestive interviewing techniques. However, in actual ‘‘suggestive’’ interviews with child witnesses, these elements rarely occur in a vacuum (see Bruck, Ceci, & Principe, 2006; Ceci & Bruck, 1995 for examples) but co-occur or combine with a variety of suggestive interviewing techniques. Thus, children may be asked repeated suggestive questions within one interview and across interviews. In one interview, the interviewer may use selective reinforcement, stereotype induction, and peer pressure, for example. The scientific literature demonstrates that as interviews become more suggestive, the number of false allegations increases (e.g., Finnila et al., 2003; Leichtman & Ceci, 1995; Zaragoza et al., 2001). One reason for this is that as the number of techniques increases, the bias of the interviewer becomes clearer to the child who then provides the desired answer. 9. The Effects of Suggestion on the Credibility of Children’s Reports It is one thing to demonstrate that children can be induced to make errors and include false perceptual details in their reports, but it is another matter to show that such faulty reports are convincing to an observer, especially a highly trained one. In a series of studies, Ceci and colleagues (Ceci et al., 1994a,b; Leichtman & Ceci, 1995) have shown videotapes to experts of children’s reports that emerged as a consequence of repeated suggestive interviews. In some cases, the experts also saw videotapes of children who resisted suggestions and denied that anything had happened. These experts were asked to decide which of the events reported by the children actually transpired and then to rate the overall credibility of each child. Experts who conduct research on the reliability of children’s reports, who provide therapy to children suspected of having been
Children’s Suggestibility
261
abused, and who carry out law enforcement interviews with children, generally failed to detect which of the children’s claims were accurate and which were not, despite being confident in their judgments. Some professionals state that they can detect suggestion because the children simply parrot the words of their investigators. However, there is no support for this assertion. First, children’s false reports are not simply repetitions or monosyllabic responses to leading questions. Under some conditions, their answers go well beyond the suggestion and incorporate additional details and emotions. For example, in the Bruck, Ceci, and Hembrooke study (2002), children’s false reports contained the prior suggestion that they had seen a thief take food from their day care but also non-suggested details such as chasing, hitting, and shooting the thief (also see Bruck et al., 1995; Ceci et al., 1994b). Second, linguistic markers do not consistently differentiate true from false narratives that emerge from repeated suggestive interviews. In the Bruck et al. study (2002) where children were repeatedly and suggestively interviewed about true and false events, the children’s narratives of the false events actually contained more embellishments (including descriptive and emotional terms) and details than their narratives of the true events. Also the false narratives had more spontaneous statements than the true narratives. Although details in false stories were typically realistic, as suggestive interviews continued, children added fantastic claims to their stories.
B. SUMMARY AND QUALIFICATIONS
The literature reviewed in this section has focused on the suggestive influences that can undermine accurate reporting in children. It is relevant to understanding children’s reports that emerge in concert with suggestive techniques and when there has been no prior spontaneous report of the event prior to the use of suggestive techniques. That is, the literature reviewed here does not call into doubt the accuracy of a spontaneous statement a child makes before any suggestions were made. A large body of research shows that children as young as age 2 or 3 are capable of providing accurate, detailed, and useful information about actual events, some of which are traumatic. These studies are characterized by a neutral tone of the interviewer, a limited use of misleading questions (for the most part, if suggestions are used, they are limited to a single occasion), and the absence of any motive for the child to make a false report. When such conditions are present, much, but not all, of what children report can be quite accurate (e.g. Bahrick et al., 1998; Goodman, et al., 1991; Lamb et al., 2003; McCauley & Fisher, 1995; Peterson, 1999; Quas et al., 1999; Saywitz, 1987). Therefore, if a child’s statements are made in the absence of any previous suggestive interviewing and in the absence of any motivation on the part of the
262
Stephen J. Ceci and Maggie Bruck
child or adults to make incriminating statements, then the risk that the statement is inaccurate is quite low. If however, the child initially denies any wrong-doing when first asked about an event, but later as a result of suggestive interviewing practices comes to make allegations, the statements may be unreliable. Errors that result from suggestive techniques involve not only peripheral details, but also central events that involve children’s own bodies. In laboratory studies, children’s false reports can be tinged with sexual connotations. Young children have made false claims about ‘‘silly events’’ that involved body contact (e.g., Did the nurse lick your knee? Did she blow in your ear?), and these false claims persisted in repeated interviewing over a three-month period (Ornstein, Gordon, & Larus, 1992). Young children falsely reported that a man put something ‘‘yuckie’’ in their mouth (Poole & Lindsay, 1995, 2001), and falsely alleged that their pediatrician had inserted a finger or a stick into their genitals (Bruck et al., 1995), or that some man touched their friends, kissed their friends on the lips, and removed some of the children’s clothes (Lepore & Sesco, 1994). A significant number of preschool children falsely reported that someone touched their private parts, kissed them, and hugged them (Goodman et al., 1990, 1991; Bruck, Melnyk, & Ceci, 2000). In addition, when suggestively interviewed, some children will make false allegations about nonsexual events that could have serious legal consequences were they to occur. For example, preschoolers claimed to have seen a thief in their day care (Bruck, Ceci, & Hembrooke, 2002). At times, suggestive interviewing techniques result in false beliefs. Children who incorporate the suggestions of their interviewers come to truly believe that they were victims. The ‘‘mix’’ of suggestive interviewing techniques in conjunction with the degree of interviewer bias can account for variations in suggestibility estimates across studies. If an (biased) interviewer uses more than one suggestive technique, there is a greater chance for taint than if he uses just one technique. Suggestive interviewing affects the perceived credibility of children’s statements. The major reason for this lack of accurate discrimination between true and false reports is perhaps due to the fact that suggestive techniques breathe authenticity into the resulting false reports. When false reports emerge as a result of suggestive interviews, these are not simple repetitions or monosyllabic responses to leading questions. Under some conditions, these suggested reports become spontaneous and elaborate, going beyond the suggestions provided by their interviewers. There are no valid scientific tests to determine which aspects of a report or which reports are accurate accounts of the past. There is no scientific ‘‘Pinocchio Test’’ that indicates that the child’s metaphorical nose is growing longer when her statement is inaccurate. Although much of the literature pays lip service to the concept that suggestibility exists at all ages, including in adulthood, the dominant view is that
Children’s Suggestibility
263
preschool children are disproportionately suggestible, and that there should be less concern about the tainting effects of suggestive interviews with older schoolaged children. The focus on younger children reflects the disproportionate number of studies of preschool children at the end of the Twentieth century. This practice was directly motivated by forensic concerns of the day; in a number of high-profile criminal cases, preschool children made horrific claims about sexual abuse. Although the case facts showed that these children had been subjected to highly suggestive interviews, at that time no relevant body of scientific literature indicated the risk of such interviews in producing false allegations about a range of salient events. When researchers began to fill this empirical void, most of the studies focused on preschoolers, and few examined age-related differences. Those that did include age comparisons usually found that the older children rarely fell sway to suggestion, leading to the conclusion that only preschoolers are suggestible (e.g., Ceci, Ross, & Toglia, 1987). However, this conclusion is discrepant with the findings of another body of literature showing that many of the suggestive techniques used in the child studies also produce tainted reports or false memories in adults (e.g., see Loftus, 2003). By inference, one might assume that children in middle childhood must also be quite suggestible, given that both younger and older groups are. Subsequent evidence supports this view: Susceptibility to suggestion is highly common in middle childhood, and under some conditions developmental differences in suggestibility are small or even nonexistent. For example, Finnila¨ et al. (2003) staged an event (a version of Garven et al.’s (2000) Paco visit described earlier) for 4- to 5-year-olds and 7- to 8-year-olds. One week later, half the children were given a low-pressure interview that contained some misleading questions with abuse themes (e.g., ‘‘He took your clothes off, didn’t he?’’). The other children received a high-pressure interview; they were told that their friends had answered the leading questions affirmatively, they were praised for assenting to the misleading questions, and when they did not assent, the question was repeated. In both conditions, there were no significant age differences in the percentage of misleading questions answered affirmatively (and fully 68% were assented to in the high-pressure condition) (see also Bruck et al., in press; Zaragoza et al., 2001). In addition to this work, other studies, some of which we reviewed, have also found that under some conditions, older children are more suggestible than younger children (e.g., Finnila¨ et al., 2003; Zaragoza et al., 2001) (see also Bruck et al., in press; Zaragoza et al., 2001). However, these are exceptions to the rule, and generally younger children are significantly more suggestible. The literature on developmental trends in children’s suggestibility serves as a cornerstone for the research summarized in the next section, namely the mechanisms that underlie children’s suggestibility.
264
Stephen J. Ceci and Maggie Bruck
III. Mechanisms Underlying Children’s Suggestibility The search for mechanisms underlying children’s suggestibility is an enterprise that attempts to integrate developmental differences in basic cognitive skills, neurobiological maturation, and social behaviors with both developmental and individual differences in suggestibility. A general strategy is to link agerelated reductions in suggestibility with social and/or cognitive development. That is, although age is usually highly correlated with levels of suggestibility, studies on mechanisms of suggestibility attempt to uncover those skills for which age acts as a proxy, such as memory, theory of mind, and/or social compliance. Although age accounts for a great deal of variation in children’s suggestibility, it does not account for all of the variation. Even among children within a given age group, there are marked differences in suggestibility. Consequently, research on the cognitive and social predictors of suggestibility (individual differences research) dovetails with the research on the mechanisms underlying suggestibility. Although the same mechanism involved in producing developmental differences may also account for individual differences within age groups, very few studies have employed quantitative procedures that disentangle individual and developmental effects (e.g., path analyses), so in our review of the mechanisms underlying suggestibility we switch back and forth between both types of studies. In the following sections, we summarize the literature on those factors that have received the most attention in the literature. First we focus on cognitive factors and then on social factors.
A. COGNITIVE PREDICTORS OF SUGGESTIBILITY
From a developmental perspective, one would predict that factors such as language, memory, and semantic knowledge among others would underlie changes in children’s suggestibility. Simply put, as children mature they accumulate increasingly rich reservoirs of semantic knowledge, memory strategies, and insights into the inner workings of their memories and their minds. In most, but not all, situations, these developments should help shield children from the deleterious effects of suggestions (Ceci & Bruck, 1995). One might also predict that children who excel on a combination of cognitive factors (for example as measured by IQ tests) might be most resistant to suggestion. The following review addresses some of these more common predictions. 1. Intelligence For nearly hundred years researchers have been interested in the relation between suggestibility and intelligence. The earlier studies (e.g., Cohn & Dieffenbacher, 1911; Hurlock, 1930; Otis, 1924) were more likely than later
Children’s Suggestibility
265
studies to find correlations between suggestibility and intelligence (Bruck, Ceci, & Melnyk, 1997). These conflicting findings could be due in part to betweenstudy differences in the range of the children’s intellectual capacities. No relations are more likely to occur in studies that include children with skills within the average range (e.g., Burgwyn-Bailes et al., 2001). However, researchers who included children whose intelligence scores ranged from below average to above average were more likely to report that higher IQs were related to lower levels of suggestibility (Chae & Ceci, 2005; McFarlane, Powell, & Dudgeon, 2002). This implies a threshold effect, in which the significant relation between intelligence and suggestibility is observed when children with very low intelligence are examined, whereas intelligence may not be a significant predictor of suggestibility for children within the normal range. In a detailed review of this literature, Bruck and Melnyk (2004) concluded that the strongest demonstrations of IQ-suggestibility relations occur when children with mental retardation are included in studies. 2. Language Skills Language skills are predicted to be a mechanism accounting for suggestibility effects because suggestions are verbal by nature. Children with poorer language skills may be more suggestible because they do not understand the suggestions and simply assent to questions they do not understand. This hypothesis is supported in studies of preschool children when the comprehensive language measures (productive and receptive skills) serve as the predictor variable (Clarke-Stewart, Malloy, & Allhusen, 2004; Roebers & Schneider, 2005). However, in most cases, when only vocabulary tests serve as the measure of language, there are no consistent relations with suggestibility (see Bruck & Melnyk, 2005). 3. Semantic knowledge Although vocabulary tests are not a reliable predictor of children’s suggestibility, other aspects of word knowledge may be important. Specifically, the degree to which a child falls sway to suggestion may reflect their general understanding of the concepts used in the suggestion itself. If children do not have a strong concept or a good representation of a concept, perhaps suggestions containing that concept may not interfere with the memory of an actual event. Because older children have more developed conceptual representations, this leads to the counter-intuitive prediction that under some circumstances, older children may be more suggestible than younger children. In a multi-step study, Ceci (2005) first mapped out the density of children’s conceptual spaces (how knowledge was represented) for common objects (e.g., lemon, lobster, eagle). Next, children were read stories that contained some of
266
Stephen J. Ceci and Maggie Bruck
these objects. Later, they were given misleading suggestions, substituting an object from the same category for the one that had appeared in the story (e.g., lemon for an orange, egg for cheese). As predicted, the way a child represented an item in memory influenced its susceptibility to suggestion. Older children better understood relations between citrus items and between dairy items, and consequently they were more suggestible than younger children. In other words, if children viewed a lemon tree in a story and were misinformed later that it is an orange tree, older children fell prey to this suggestion more than younger children (see Connally & Price, in press, for similar findings). Although developmental memory researchers have long known about the role of semantic knowledge in recallability (Chi & Ceci, 1987; Hagen, Jongeward, & Kail, 1975), this new line of research shows that semantic knowledge also plays a role in suggestibility. Sometimes the presence of semantic knowledge actually renders older children more suggestible than younger ones, contrary to the usual developmental finding.
4. Theory of Mind Theory of mind refers to a cognitive capacity to know that others may have different feelings, intentions, beliefs, and perceptual knowledge than oneself. This skill develops rapidly in preschool children so that by the age of 5, most children come to understand that two people can hold conflicting beliefs about the world (Astington, 1993). The developmental increase in the theory of mind and the developmental decrease in suggestibility within the same age range have motivated a number of investigations into their association in preschoolers. The basic premise is that after children come to appreciate that another person can hold a belief that is different from one’s own, then one can more easily reject it (as in misinformation). Children who cannot simultaneously entertain conflicting information may replace the original information with the suggested misinformation. To test this possibility, Welch-Ross, Diecidue, and Miller (1997) investigated 3- and 5-year-olds’ performance on conflicting mental representation tasks in relation to their suggestibility for a short story read to them by an experimenter. To assess children’s representational abilities, a series of appearance-reality tasks (e.g., asking whether a white piece of paper placed under a colored filter was really white or really colored) and pretend-reality tasks (e.g., asking whether objects such as a spoon, a cup, a silk flower, and a ceramic orange were real or pretend) were conducted. To assess suggestibility, children answered specific and misleading questions about the story. As predicted, after controlling for age and general memory ability, children who were more successful on the mental representation tasks were less suggestible about story details than those who were not as successful at the tasks.
Children’s Suggestibility
267
Although the results of this one study are promising, the generalizability of the findings is limited. First, as reviewed by Bruck and Melnyk (2004), other studies have not reported a significant relation between theory of mind and suggestibility. Second, some investigators have found a negative relation between theory of mind and suggestibility (Templeton & Wilcox, 2000; Welch-Ross, 1997) leading to the hypothesis that once one understands that another person can hold another belief different from one’s own, one can come to accept the discrepant information (knowing that it is wrong) for the sake of social niceties. The fact that theory of mind has been used to predict both increases and decreases in children’s suggestibility limits its theoretical power. 5. Memory A number of studies have examined the relation between children’s ‘‘memory’’ and their suggestibility. As we discuss in this section, there is not one simple relation but it varies depending upon the ‘‘construct’’ of memory or the type of memory tested in each study. A common finding is that children with weak memory for an event are more susceptible to suggestions about that event. For example, Pezdek and Roe (1995) showed that 4- and 10-year-olds with stronger memories were more likely to resist erroneous suggestions than were those with weaker memories, with strength of memory manipulated by means of frequency of presentation of target items. That is, children were less likely to be misled if they had viewed an item two times rather than only once. Likewise, Endres, Poggenpohl, and Erben (1999) found that better memory due to repeated reading of the stimulus story reduced incorrect answers to suggestive questions in 4- and 10-year-old children. One interpretation of these findings is that tight integration of the semantic and perceptual components of memories make it more difficult to add extraneous (or suggested) components from external sources to the representation of an event in memory. In contrast, if the original memory traces for an event have begun to disintegrate and are retained only in a weak unelaborated form, an erroneous suggestion may more easily coexist with the original trace and, due to its recency, may be more readily retrieved than the relatively weaker original features (Ceci & Bruck, 1993). In most studies that have examined the relation between memory and suggestibility, a correlation is built-in because the measures used to assess children’s memory came from the same information about which they were questioned to measure suggestibility. This means that the event details, presentation rates, delays, and instructions were similar, thus possibly inflating the relation between memory and suggestibility. If suggestibility is associated with a general ‘‘event memory’’ ability rather than with event memory for one specific event, then the former needs to be assessed independently. Bruck and Melnyk (2004) identified five studies containing 78 different correlations between suggestibility for one
268
Stephen J. Ceci and Maggie Bruck
event and event memory for another event. Only 4 of the 78 correlations were in the predicted direction, leading to the conclusion that children’s suggestibility is not related to event memory competence per se. In another set of studies, researchers have attempted to link children’s suggestibility with more general measures of memory (e.g., serial recall, digit span, visual memory). There is very little support for a reliable relation between memory skills, as measured on traditional standardized tests, and suggestibility (Bruck & Melnyk, 2004; Chae & Ceci, 2005). In contrast to the general absence of significant effects in these correlational studies, when specific types of memory are linked to suggestibility, the results are more promising. This is evidenced in studies relating children’s source memory to their suggestibility. 6. Source Monitoring Source monitoring is the ability to identify the origin of one’s own knowledge, beliefs, and memories (Johnson, Hashtroudi, & Lindsay, 1993). For example, children and adults make source errors by confusing whether they actually saw an event or heard about it; sometimes source errors involve confusing fantasy (imagining) with reality. Although these errors occur at all ages (e.g., Belli et al., 1994; Zaragoza & Lane, 1994) such errors decrease with age (e.g., Ackil & Zaragoza, 1998; Lindsay, Johnson, & Kwon, 1991). In addition, preschoolers lack skill at distinguishing between two or more sources of input into their memories (Gopnik & Graf, 1988; Wimmer, Hogrefe, & Perner, 1988). Based upon these findings, one would predict that children with good sourcemonitoring skills might resist suggestion in that they can easily separate events that were experienced from those that were only heard about. Leichtman et al. (2000) conducted a series of experiments in which several diverse forms of source-monitoring and suggestibility tasks were performed. For example, in a source-monitoring task, 4-year-olds were shown a small set of six drawers, each of which held an object. Children learned the contents of the drawers by seeing what was inside them or by being told what was inside them, or by being given clues about their contents. Children were later given a sourcemonitoring task where they were asked to recall how they learned the name of the object in each drawer. These same children were also given a suggestibility task; over a period of 5 weeks, they were suggestively questioned about true and false events. Children who provided many incorrect assents to false events on the suggestibility task also performed poorly on the source-monitoring task. Further analyses indicated that this significant relation was not due to a general memory factor. In another study (Giles, Gopnik, & Heyman, 2002), 3- to 5-year-old children were simultaneously presented with a brief story in two different modalities, as a silent video vignette and a spoken narrative. Each modality presented unique
Children’s Suggestibility
269
information about a story, but the information in the two versions was mutually compatible. Children were then asked questions about the sources of story details and they were also asked leading questions about story details. Correct answers on the source-monitoring questions were highly correlated with the ability to resist suggestion after controlling for age. In addition, children who were asked source-monitoring questions prior to leading questions were less suggestible than were those who were asked the leading questions first. Giles, Gopnik, and Heyman (2002) concluded that encouraging children to think about the sources of their knowledge improved resistance to suggestion for several reasons. First, source-monitoring task solidified the original memory trace. Second, source monitoring also encourages the use of stringent decision criteria that might help reduce suggestibility. Finally, the source-monitoring questions might make children aware of multiple sources of knowledge about an event and that this information might be helpful in resolving conflicting mental representations (such as the case when there are misleading questions). The relation between source monitoring and suggestibility is also shown directly in training studies. For example, Thierry and Spence (2002) showed preschoolers a scientist conducting various experiments. Children saw half of the experiments in a live demonstration and they saw half of the experiments on a video. Several days later, half of the children received source-monitoring training in which they were explicitly taught to pay attention to the source of their experiences (did I see the puppet do that in a video or did I see the puppet do it on a real stage). Later all children were asked leading and misleading questions about the science demonstrations. Source-monitoring training led to higher rates of resistance to misleading questions about the source of the event. In summary, in both correlational and training studies, the ability to distinguish the sources of one’s memories predicted children’s suggestibility even when the source memory test was independent of the events that were the focus of the suggestibility test. 7. Summary The literature reviewed in this section illustrates some of the difficulties and inconsistencies in this field of research. Much of the inconsistency is between studies in which variables are experimentally manipulated vs. studies that correlate test performance with suggestibility. To a large degree, it is more likely to obtain significant relations with the former types of studies. These studies also show the importance of specifying the type of skill that serves as the predictor. For example, although vocabulary knowledge did not correlate with suggestibility, conceptual knowledge of the terms did; although most measures of memory did not correlate with suggestibility, source memory did.
270
Stephen J. Ceci and Maggie Bruck
Finally it is important to remember that this is not an inclusive review as we have omitted a number of factors because the few existing studies do not indicate reliable relations with suggestibility (see Bruck & Melnyk, 2004 for a review). Our review does indicate, however, that there are some plausible candidates such as source monitoring, semantic knowledge, and language abilities that deserve further inspection.
B. PSYCHOSOCIAL PREDICTORS OF SUGGESTIBILITY
In addition to the cognitive predictors of suggestibility that have been studied, researchers have examined the influence of many social and emotional influences. Much of this work is built on the assumption that some suggestibility effects reflect compliance, acquiescence or approval seeking. Children are thought to be more suggestible because they are more willing to defer to the requests of adults. Further it is posited that with development, children become less dependant on adults’ approval, they become more independent in terms of expressing their own thoughts and therefore, are more likely to resist suggestions from adults. In some studies researchers have examined the relation between temperament/personality types to suggestibility. Drawing from the basic concepts of suggestibility involving compliance, acquiescence, and approval, researchers have tested hypotheses that characteristics of inhibition (withdrawal, shyness), poor self-esteem, or poor attachment, place children at risk for suggestibility. We now review the major areas of research in these areas. 1. Temperament The interpersonal nature of interview conditions raises the possibility that the temperamental characteristics, sometimes referred to as behavioral styles or personality characteristics, may affect a child’s reports in an interview. What follows is a review of the existing studies that have explored the effects of interrelated aspects of temperament, such as shyness, emotionality, and adaptability, on suggestibility. Kagan (as cited in Schacter, Kagan, & Leichtman, 1995) found that inhibited children were reluctant to oppose an adult request and even acted in ways that they knew to be wrong because of excessive anxiety about disapproval or punishment from adults. For instance, when asked to implement some punishable acts (e.g., throwing a ball at the examiner’s face), they were more obedient and less likely to ask why an act should be carried out, compared with uninhibited children. Therefore, Kagan speculated that inhibited children might be more likely to acquiesce to misleading suggestions in interrogative contexts, because they would try harder to satisfy the request of authority than
Children’s Suggestibility
271
uninhibited children. A related expectation is that shyness and emotionality (i.e., the vigor with which negative affect is expressed) may be associated with heightened suggestibility, because inhibited children are typically shy and easily aroused. Difficult or slow-to-warm-up children (e.g., shy, emotionally intense) may feel more uncomfortable in the new interview situations and thus in order to end an interview, they may be more willing to simply agree to suggested information in the presence of an unfamiliar interviewer, compared with easygoing children (e.g., outgoing, even-tempered, adaptable). Endres, Poggenpohl, and Erben (1999) reported findings supporting the hypothesized association of shyness with suggestibility: 4- to 7-year-olds rated as shy by their mothers were more suggestible on misleading yes/no questions than non-shy children. Yet, the majority of other studies (Burgwyn-Bailes et al., 2001; Chae & Ceci, 2005; Crossman, 2001; Geddie, Fradin, & Beer, 2000; Young et al., 2003) failed to find any significant relation between shyness rated either by teachers or by parents and suggestibility among preschool-aged through early adolescent children. It seems plausible that children who are even-tempered would perform better in interview situations. However, empirical investigations (Burgwyn-Bailes et al., 2001; Geddie et al., 2000) have shown no significant effect of children’s emotional intensity on suggestibility. Chae and Ceci (2005) found that although emotional children were more likely to comply with the interviewer’s suggestions than those who were not rated as emotional, emotionality did not make a significant independent contribution to the variance in suggestibility after controlling for age. Hence, to date, children’s suggestibility apparently is not uniquely related to emotional intensity. Adaptability has also been predicted to affect children’s suggestibility, because children who adapt well to interview contexts may feel more comfortable and thus be able to resist an interviewer’s suggestions more than less adaptable children. However, in the work of Burgwyn-Bailes et al. (2001), children’s adaptability was not related to suggestibility (see Geddie, 2000, for similar findings). The general failure to find relations between temperament measures and children’s suggestibility also holds for measures such as shyness, acquiescence, and compliance. 2. Self-Perceptions Self-perceptions of one’s competence and social acceptance represent another individual difference variable thought to predict suggestibility. Children with higher levels of self-confidence and social acceptance may feel empowered in interview settings, they may be certain about the accuracy of their own memory, and thus they resist social pressure to agree with an interviewer when the interviewer is thought to be wrong. In contrast, children who perceive that they lack competence and are not socially accepted may be particularly sensitive
272
Stephen J. Ceci and Maggie Bruck
to pressure and may be likely to succumb to an interviewer’s requests for inaccurate information. Some investigations provided empirical support for the hypothesis that higher levels of self-perceptions would be associated with decreased suggestibility. Howie and Dowd (1996) showed that 7- to 10-year-olds who were rated by their teachers as higher in self-esteem displayed greater resistance, less yielding, and fewer ‘‘don’t know’’ responses to misleading questions about an experienced event than those rated as lower in self-esteem. Vrij and Bush (2000) also reported that self-confidence was negatively correlated with suggestibility for a short video among both 5–6- and 10–11-year-olds. These significant relations are also found in other studies that have employed different aspects of self-perceptions. Mazzoni (1998), for example, examined the influence of self-efficacy reflecting the confidence in one’s own memory for various topics (i.e., whether the child believed herself to remember better, equally well, or worse than his/her parents and peers) on suggestibility. Selfefficacy had an inverse impact on 9-year-olds’ suggestibility but was not related to 6-year-olds’ suggestibility. Similarly, Davis and Bottoms (2002) explored the effects of 6- and 7-year-olds’ perceived interview-related self-efficacy (i.e., the degree to which the child felt he/she could resist an interviewer’s suggestions) on suggestibility and found that self-efficacy was negatively correlated with suggestibility to misleading questions for older, but not for younger children. These findings imply that self-perceptions may play a more important role in suggestibility for school-aged children who may have a more developed sense of self than for preschool-aged children (see Bruck & Melnyk, 2004, for other confirming evidence). 3. Attachment Style Following the lead of Goodman and her colleagues (1991), a number of researchers have examined the association between maternal romantic attachment style and their children’s suggestibility about a stressful event, such as a painful medical procedure. Previous work in this domain has shown that there are individual differences in adult attachment styles that are similar to the mother–infant attachment patterns described by Ainsworth (see Shaver & Fraley, 2000, for a review). These independent dimensions are called anxiety (fear of abandonment in close relationships) and avoidance (discomfort with close relationships). Individuals low on both of these scales are classified as secure. The original premise for studying the relation between mothers’ attachment styles and their children’s memory (for stressful events) was that anxious mothers might transmit fear (about the medical procedure) to their children or that avoidant mothers might be less comforting or willing to discuss the stressful events with their children. This in turn might affect children’s
Children’s Suggestibility
273
encoding as well as retrieval of the target events. In contrast, mothers with supportive attachment styles (to romantic partners) might promote processing styles that are conducive to accurate recall. It is also assumed that mothers’ romantic attachment style is correlated with mother–child attachment style. In other words, mothers’ romantic attachment may exert both direct and indirect influences on children’s recollection accuracy. In one study, for example, Goodman et al. (1997) found that 3- to 10-yearold children’s suggestibility for a painful genital medical procedure was correlated with their parents’ self-reported romantic attachment styles. Parents who rated themselves as more similar to an avoidant attachment style had more suggestible children than parents who rated themselves as less similar to an avoidant attachment style. Also, parents’ self-rated similarity to the anxious–ambivalent dimension (i.e., hypervigilance to attachment figures and attachment-related issues) was positively associated with children’s omission errors to both specific and misleading questions (i.e., incorrect rejection of accurate information). In contrast, parents’ ratings of their similarity to a secure attachment style were negatively correlated with the proportion of omission errors to misleading questions (see also Quas et al., 1999). As mentioned previously, maternal attachment style may be a proxy measure for child attachment style, which is the primary predictor of suggestibility. Quas et al. (1999) suggested that insecure children might be particularly susceptible to demand characteristics of the interview (i.e., social pressure to please and agree with an interviewer) because they might be emotionally needy and thus more likely to assent to misleading suggestions the interviewer provided. To date, only one study has examined the relation between mother–child attachment style and suggestibility. In a longitudinal study, Clarke-Stewart, Malloy, and Allhusen (2004) assessed mother–child attachment at 15 months assessed using the Strange Situation and children’s suggestibility at 5 years for a play event that had occurred almost a year earlier. Children who lacked a close and secure attachment relationship with their mothers at 15 displayed increased suggestibility at years. If maternal attachment style is an indirect measure of children’s attachment and interaction style, then one should find maternal attachment style–child suggestibility relationships in laboratory situations where the mother is not present. For example, in one study (Bottoms, Quas, & Davis, in press), 6- and 7-year-olds engaged in a structured play event in a laboratory. The children were then suggestively questioned by a supportive interviewer, who built rapport with each child, used a warm and friendly voice, gazed and smiled at the child often, and assumed a relaxed body position, or by a neutral nonengaging interviewer. Bottoms et al. hypothesized that children of insecurely attached mothers would also demonstrate some of these same characteristics
274
Stephen J. Ceci and Maggie Bruck
and thus might be more apprehensive and experience more stress during the interview than children of securely attached mothers. Consequently, they predicted that children of insecure mothers might have the most difficulty contradicting the interviewer and resisting suggestions. As a result, supportive interviewers would be most beneficial for these children and this in turn would result in decreases in suggestibility. This is the pattern of results obtained in that study.
C. DISCUSSION OF PSYCHOLOGICAL MECHANISMS UNDERLYING CHILDREN’S SUGGESTIBILITY
In this section, we have reviewed some but not all studies that examine the cognitive and social mechanisms underlying children’s suggestibility. As should be clear from our selective but representative review, no evidence provides unqualified support for a specific causal factor or even for a combination of causal factors. Significant findings are very difficult to obtain in this field of research, and even when these are obtained they can often be in the unpredicted direction or be inconsistent with others in the field. There are many reasons for this state of affairs. We review the more prominent concerns. First, the studies in this area are based upon the assumption that suggestibility is a trait; in other words it is stable across time and across situations. However, this may not be the case; suggestibility might be determined by a variety of external factors that characterize the interviewing situation as well as internal psychological reactions to the interview (the child’s interest, motivation, attention at the time of the interview is given). If this is the case, then the measurement of suggestibility is not reliable across time; consequently, correlations between suggestibility and social or cognitive factors will also be low. Second, even if suggestibility is trait-like, causal mechanisms may change as a function of age. That is the primary mechanisms underlying preschool children’s suggestibility may not be the same as those underlying elementary school children’s suggestibility. There were a few hints in our review that this might occur. For example, correlations between self-perceptions and suggestibility were not significant for preschoolers but they were for older children. Another factor that is related to age is the finding that under some conditions older children are more suggestible than younger children (e.g., Ceci, Papierno, & Kulkofsky, in press), leading to hypotheses that better skills may be related to higher rates of suggestibility in one age group but not in another. Third, to date, most of the studies have examined the relation of one variable at a time on suggestibility. This is clearly too simple a view. Cognitive and social
Children’s Suggestibility
275
factors probably interact and have different patterns of action as a function of age and of interviewing conditions. This is most clearly seen by the results of the Goodman and Quas (1997) study in which attachment was only correlated with suggestibility for older children who were interviewed in a supportive setting. In summary, suggestibility is a very complex construct and may not fit simple models that contain several predictors. Rather, it will probably be the case that a child’s suggestibility at a given time is a product of demographic, biological, cognitive, and social variables.
IV. Summary: Child vs. Situational Variables Since the 1980s, developmental psychology has produced a wealth of knowledge about the external factors inherent to interviewing conditions that can result in suggestibility in children at a variety of ages. We are still, however, at the beginning stages of understanding the mechanisms that account for these effects. Certainly at the present time, at all ages, situational factors are more potent predictors of children’s suggestibility than are cognitive or social factors. The situational factors are not complex in nature; primarily, they include the degree of interview bias which determines the intensity of the suggestiveness of the interview. In terms of ‘‘child’’ factors, despite the vast amount of research to find proxy measures, age is still the most powerful determinant (across a large range of studies) of levels of suggestibility. However, age may also interact with situational factors; the more suggestive the interview contexts (e.g., coerced confabulations by highly biased interviewers), the smaller the developmental differences.
REFERENCES Ackil, J. K., & Zaragoza, M. (1998). The memorial consequences of forced confabulation: Age differences in susceptibility to false memories. Developmental Psychology, 34, 1358 – 1372. Bahrick, L., Parker, J. F., Fivush, R., & Levitt, M. (1998). The effects of stress on young children’s memory for a natural disaster. Journal of Experimental Psychology: Applied, 4, 308 – 331. Battin, D. B., & Ceci, S. J. (2003, Spring). Essay: Children as witnesses: What we hear them say may not be what they mean. Court Review, 3 – 4. Belli, R. F., Lindsay, D. S., Gales, M. S., & McCarthy, T. T. (1994). Memory impairment and source misattribution in postevent misinformation experiments with short retention intervals. Memory & Cognition, 22, 40 – 54. Bottoms, B., Quas, J., & Davis, S. L. (in press). The influence of interviewer-provided social support on children’s suggestibility, memory, and disclosures. In M.-E. Pipe, M. Lamb, Y. Orbach, & A. C. Cedarborg (Eds.), Child sexual abuse: Disclosure, delay and denial. Mahwah, NJ: Erlbaum.
276
Stephen J. Ceci and Maggie Bruck
Brainerd, C. J., Reyna, V. F., Howe, M. L., & Kevershan, J. (1990). The last shall be first: How memory strength affect children’s retrieval. Psychological Science, 1, 247 – 252. Bruck, M., & Ceci, S. J. (1997). The suggestibility of young children. Current Directions in Psychological Science, 6, 75 – 79. Bruck, M., & Ceci, S. J. (1999). The suggestibility of children’s memory. Annual Reviews of Psychology, 50, 419 – 439. Bruck, M., & Ceci, S. J. (2004). Forensic developmental psychology: Unveiling four common misconceptions. Current Directions in Psychological Science, 12, 229 – 233. Bruck, M., Ceci, S. J., & Francoeur, E. (1999). The accuracy of mothers’ memories of conversations with their preschool children. Journal of Experimental Psychology: Applied, 5, 1 – 18. Bruck, M., Ceci, S. J., Francoeur, E., & Barr, R. (1995). ‘‘I hardly cried when I got my shot!’’: Influencing children’s reports about a visit to their pediatrician. Child Development, 66, 193 208. Bruck, M., Ceci, S. J., Francoeur, E., & Renick, A. (1995). Anatomically detailed dolls do not facilitate preschoolers’ reports of a pediatric examination involving genital touching. Journal of Experimental Psychology: Applied, 1, 95 – 109. Bruck, M., Ceci, S. J., & Hembrooke, H. (2002). Nature of true and false narratives. Developmental Review, 22, 520 – 554. Bruck, M., Ceci, S. J., Melnyk, L., & Finkelberg, D. (1999). The effect of interviewer bias on the accuracy of children’s reports and interviewer’s reports. Paper presented at the Biennial Meeting of the Society for Research in Child Development. Albuquerque, NM. Bruck, M., Ceci, S. J., & Principe, G. (2006). The child and the law. In K. A. Renninger and I. E. Sigel (Vol. Eds.) Child psychology in practice, Volume 5. In W. Damon and R. Lerner (Gen. Eds.), Handbook of child psychology, 6th edition. New York: Wiley. Bruck, M., London, K., Landa, R., & Goodman, J. (in press). Autobiographical memory and suggestibility in children with autistic spectrum disorder. Developmental Psychopathology. Bruck, M., & Melnyk, L. (2004). Individual differences in children’s suggestibility: A review and synthesis. Applied Cognitive Psychology, 18, 947 – 996. Bruck, M., Melnyk, L., & Ceci, S. J. (1997). External and internal sources of variation in the creation of false reports in children. Learning and Individual Differences, 9, 289 – 316. Bruck, M., Melnyk, L., & Ceci, S. J. (2000). Draw it again Sam: The effect of drawing on children’s suggestibility and source monitoring ability. Journal of Experimental Child Psychology, 77, 169 – 196. Burgwyn-Bailes, E., Baker-Ward, L., Gordon, B. N., & Ornstein, P. A. (2001). Children’ memory for emergency medical treatment after one year: The impact of individual difference variables on recall and suggestibility. Applied Cognitive Psychology, 15, 525 – 548. California v. Raymond Buckey et al. (1990). Los Angeles County Sup. Ct. #A750900. Cassel, W., Roebers, C. M., & Bjorklund, D. F. (1996). Developmental patterns of eyewitness responses to repeated and increasingly suggestive questions. Journal of Experimental Child Psychology, 61, 116 – 133. Ceci, S. J. (2005, May 27). James McKean Cattell Award Address. American Psychological Society Annual Meeting. Los Angeles, CA. Ceci, S. J., & Bruck, M. (1993). The suggestibility of children’s recollections: An historical review and synthesis. Psychological Bulletin, 113, 403 – 439.
Children’s Suggestibility
277
Ceci, S. J., & Bruck, M., (1995). Jeopardy in the courtroom: A scientific analysis of children’s testimony. Washington, D.C.: APA Books. Ceci, S. J., Crotteau-Huffman, M., Smith, E., & Loftus, E. W. (1994a). Repeatedly thinking about non-events. Consciousness & cognition, 3, 388 – 407. Ceci, S. J., Loftus, E. F., Leichtman, M., & Bruck, M. (1994b). The possible role of source misattributions in the creation of false beliefs among preschoolers. International Journal of Clinical & Experimental Hypnosis, 42, 304 – 320. Ceci, S. J., Papierno, P. B., & Kulkofsky, S. C. (in press). Representational constraints on children’s suggestibility. Psychological Science. Ceci, S. J., Ross, D., & Toglia, M. (1987). Suggestibility of children’s memory: Psycholegal implications. Journal of Experimental Psychology: General, 116, 38 – 49. Chae, Y. J., & Ceci, S. J. (2005). Individual differences in children’s recall and suggestibility. Journal of Applied Cognitive Psychology, 19, 383 – 407. Chi, M. T. H., & Ceci, S. J. (1987). Content knowledge: Its representation and restructuring in memory development. In H. W. Reese & L. Lipsett (Eds.), Advances in Child Development and Behavior (Vol. 20, pp. 91 – 146). New York: Academic Press. Clarke-Stewart, K. A., Malloy, L. C., & Allhusen, V. D. (2004). Verbal ability, selfcontrol, and close relationships with parents protect children against misleading suggestions. Applied Cognitive Psychology, 18, 1037 – 1058. Cohn, J., & Dieffenbacher, J. (1911). Untersuchungen uber Geschlechts und Altersunterschiede bei Schulern (pp. 219 – 224). Summary given in Otis, 1924. Commonwealth v. Amirault, 424 Mass. 618 (1997) 3, 52n., 91, 92, 98. Commonwealth of Massachusetts v. Cheryl Amirault LeFave. Commonwealth of Pennsylvania v. Gerald J. Delbridge (2004). No. 2027 of 1998. Crossman, A. M. (2001). Predicting suggestibility: The role of individual differences and socialization. Unpublished doctoral dissertation, Cornell University, Ithaca, NY. Davis, S. L., & Bottoms, B. L. (2002). Effects of social support on children’s eyewitness reports: A test of the underlying mechanism. Law and Human Behavior, 26, 185 – 215. Deloache, J. S., & Smith, C. M. (1999). Early symbolic representation. In I. Siegal (Ed.), Theoretical perspectives in the concept of representation (pp. 61– 86). Hillsdale, NJ: Erlbaum. Endres, J., Poggenpohl, C., & Erben, C. (1999). Repetitions, warnings and video: Cognitive and motivational components in preschool children’s suggestibility. Legal and Criminological Psychology, 4, 129 – 146. Finnila¨, K., Mahlberga, N., Santtilaa, P., Sandnabbaa, K., & Niemib, P. (2003). Validity of a test of children’s suggestibility for predicting responses to two interview situations differing in their degree of suggestiveness. Journal of Experimental Child Psychology, 85, 32 – 49. Garven, S, Wood, J. M., & Malpass, R. S. (2000). Allegations of wrongdoing: The effects of reinforcement on children’s mundane and fantastic claims. Journal of Applied Psychology, 85, 38 – 49. Garven, S., Wood, J. M., Shaw, J. S., & Malpass, R. (1998). More than suggestion: Consequences of the interviewing techniques from the McMartin preschool case. Journal of Applied Psychology, 83, 347 – 359. Geddie, L., Fradin, S., & Beer, J. (2000). Child characteristics which impact accuracy of recall and suggestibility in preschoolers: Is age the best predictor? Child Abuse and Neglect, 24, 223 – 235. Giles, J. W., Gopnik, A., & Heyman, G. D. (2002). Source monitoring reduces the suggestibility of preschool children. Psychological Science, 13, 288 – 291.
278
Stephen J. Ceci and Maggie Bruck
Goodman, G. S., Batterman-Faunce, J., Schaaf, J., & Kenney, R. (2002). Nearly 4 years after an event: Children’s eye witness memory and adult’s perceptions of children’s accuracy. Child Abuse & Neglect, 26, 849 – 884. Goodman, G. S., Bottoms, B. L., Schwartz-Kenney, B., & Rudy, L. (1991). Children’s memory for a stressful event. Journal of Narrative and Life History, 1, 69 – 99. Goodman, G. S., Hirschman, J. E., Hepps, D., & Rudy, L. (1991). Children’s memory for stressful events. Merrill-Palmer Quarterly, 37, 109 157. Goodman, G. S., & Quas, J. A. (1997). Trauma and memory: Individual differences in children’s recounting of a stressful experience. In N. Stein, P. A. Ornstein, C. J. B. Tversky, & C. J. Brainerd (Eds.), Memory for everyday and emotional events (pp. 267294). Mahwah, NJ: Erlbaum. Goodman, G. S., Quas, J. A., Batterman-Faunce, J. M., Riddlesberger, M. M., & Kahn, J. (1997). Children’s reactions to and memory for a stressful event: influences of eye, anatomical dolls, knowledge, and parental attachement. Applied Developmental Science, 1, 54 – 75. Goodman, G. S., Rudy, L., Bottoms, B., & Aman, C. (1990). Children’s concerns and memory: Issues of ecological validity in the study of children’s eyewitness testimony. In R. Fivush & J. Hudson (Eds.), Knowing and remembering in young children (pp. 249 – 284). New York: Cambridge University Press. Gopnik, A., & Graf, P. (1988). Knowing how you know: Young children’s ability to identify and remember the sources of their beliefs. Child Development, 59, 1366 – 1371. Gudjonsson, G. H. (1986). The relationship between interrogative suggestibility and acquiescence: Empirical findings and theoretical implications. Personality and Individual Differences, 7, 195 – 199. Hagen, J. W., Jongeward, R. H., & Kail, R. V. (1975). Cognitive perspectives on the development of memory. In H. W. Reese (Ed.), Advances in child development and behavior (Vol. 10, pp. 57 – 101). New York: Academic Press. Howie, P. M., & Dowd, H. J. (1996). Self-esteem and the perceived obligation to respond: Effects on children’s testimony. Legal and Criminological Psychology, 1, 197 – 209. Hughes, M., & Grieve, R. (1980). On asking children bizarre questions. First Language, 1, 149 – 160. Hurlock, E. B. (1930). The suggestibility of children. Pedagogical Seminary and Journal of Genetic Psychology, 37, 59 – 75. Johnson, M. K., Hashtroudi, S., & Lindsay, D. S. (1993). Source monitoring. Psychological Bulletin, 114, 3 – 28. Lamb, M. E., Sternberg, K. J., Orbach, Y., Esplin, P. W., Stewart, H., & Mitchell, S. (2003). Age differences in young children’s responses to open-ended invitations in the course of forensic interviews. Journal of Consulting and Clinical Psychology, 71, 926 – 934. Leichtman, M. D., & Ceci, S. J. (1995). The effects of stereotypes and suggestions on preschoolers’ reports. Developmental Psychology, 31, 568 – 578. Leichtman, M. D., Morse, M. B., Dixon, A., & Spiegel, R. (2000). Source monitoring and suggestibility: An individual differences approach. In K. P. Roberts & M. Blades (Eds.), Children’s source monitoring (pp. 257 – 287). Mahwah, NJ: Erlbaum. Lepore, S. J., & Sesco, B. (1994). Distorting children’s reports and interpretations of events through suggestion. Applied Psychology, 79, 108 – 120. Lillie and Reed v. Newcastle City Council & Ors EWHC 1600 (QB) (2002). Lindsay, D. S., & Johnson, M. K. (1987). Reality monitoring and suggestibility. In S. J. Ceci, M. P. Toglia, & D. F. Ross (Eds.), Children’s eyewitness memory (pp. 92 – 121). New York: Springer Verlag.
Children’s Suggestibility
279
Lindsay, D. S., Johnson, M. K., & Kwon, P. (1991). Developmental changes in memory source monitoring. Journal of Experimental Child Psychology, 52, 297 – 318. Loftus, E. F. (2003). Make believe memories. American Psychologist, 58, 867 – 873. Mazzoni, G. (1998). Memory suggestibility and metacognition in child eyewitness testimony: The roles of source monitoring and self-efficacy. European Journal of Psychology of Education, 13, 43 – 60. McCauley, M. R., & Fisher, R. P. (1995). Facilitating children’s eyewitness recall with the revised Cognitive Interview. Journal of Applied Psychology, 80, 510 – 516. McFarlane, F., Powell, M. B., & Dudgeon, P. (2002). An examination of the degree to which IQ, memory performance, socio-economic status and gender predict young children’s suggestibility. Legal and Criminological Psychology, 7, 227 – 239. Melnyk, L., & Bruck, M. (2004). Timing moderates the effects of repeated suggestive interviewing on children’s eyewitness memory. Applied Cognitive Psychology, 18, 613 – 631. New Jersey v. Michaels, 625 A.2d 579 aff’d 642 A.2d 1372 (1994). North Carolina v. Robert Fulton Kelly Jr., 456 S.E.2d 861 (1995). Ornstein, P. A., Gordon, B. N., & Larus, D. (1992). Children’s memory for a personally experienced event: Implications for testimony. Applied Cognitive Psychology, 6, 49 – 60. Ornstein, P. A., Merritt, K. A., Baker-Ward, L., Furtado, E., Gordon, B. N., & Principe, G. F. (1998). Children’s knowledge, expectation, and long-term retention. Applied Cognitive Psychology, 12, 387 – 405. Otis, M. (1924). A study of suggestibility in children. Archives of Psychology, No. 70. Paris, S. G., & Lindauer, B. K. (1977). Constructive aspects of children’s comprehension and memory. In R. V. Kail, Jr. & J. W. Hagen (Eds.), Perspectives on the development of memory and cognition (pp. 35 – 60). Hillsdale, NJ: Lawrence Earlbaum. Peterson, C. (1999). Children’ memory for medical emergences: 2 years later. Developmental Psychology, 35, 1493 – 1506. Peterson, C., & Bell, M. (1996). Children’s memory for traumatic injury. Child Development, 67, 3045 3070. Peterson, C., & Grant, M. (2001). Forced-choice: Are forensic interviewers asking the right questions? Canadian Journal of Behavioural Science, 33, 118 – 127. Peterson, C., Moores, L., & White, G. (2001). Recounting the same events again and again: Children’s consistency across multiple interviews. Applied Cognitive Psychology, 15, 353 – 371. Pezdek, K., & Roe, C. (1995). The effect of memory trace strength on suggestibility. Journal of Experimental Child Psychology, 60, 116 – 128. Pipe, M. E., Gee, S., Wilson, J. G., & Egerton, J. M. ( 1999). Children’s recall 1 or 2 years after an event. Developmental Psychology, 35, 781 – 789. Poole, D. A., & Lindsay, D. S. (1995). Interviewing preschoolers: Effects of nonsuggestive techniques, parental coaching and leading questions on reports of nonexperienced events. Journal of Experimental Child Psychology, 60, 129 – 154. Poole, D. A., & Lindsay, D. S. (2001). Children’s eyewitness reports after exposure to misinformation from parents. Journal of Experimental Psychology: Applied, 7, 27 – 50. Poole, D. A., & White, L. (1991). Effects of question repetition on the eyewitness testimony of children and adults. Developmental Psychology, 27, 975 – 986. Principe, G. F., & Ceci, S. J. (2002). ‘‘I saw it with my own ears’’: The effects of peer conversations on preschoolers’ reports of nonexperienced events. Journal of Experimental Child Psychology, 83, 1 – 25.
280
Stephen J. Ceci and Maggie Bruck
Principe, G. F., Kanaya, T., Ceci, S. J., & Singh, M. (2005). Hearing is believing: Rumor transmission in preschoolers’ reports. Psychological Science, 17, 243 – 248. Principe, G. F., Kanaya, T., Ceci, S. J., & Singh, M. (in press). Believing is seeing: How rumors can engender false memories in preschoolers. Psychological Science. Quas, J. A., Goodman, G. S., Bidrose, S., Pipe, M. E., Craw, S., & Ablin, D. S. (1999). Emotion and memory: Children’s long-term remembering, forgetting, and suggestibility. Journal of Experimental Child Psychology, 72, 235 – 270. Roebers, C. M., & Schneider, W. (2005). Individual differences in young children’s suggestibility: Relations to event memory, language abilities, working memory, and executive functioning. Cognitive Development, 20, 427 – 447. Salmon K., & Pipe, M. E. (2000). Recalling an event one year later: The impact of props, drawing and a prior interview. Applied Cognitive Psychology, 14, 99 – 120. Saywitz, K. J. (1987). Children’s testimony: Age-related patterns of memory errors. In S. J. Ceci, M. P. Toglia, & D. F. Ross (Eds.), Children’s eyewitness memory (pp. 36 – 52). New York: Springer Verlag. Schacter, D. L., Curran, T., Gallucio, L., Milberg, W., & Bates, J. (1996). False recognition and the right frontal lobe. Neuropsychologia, 34, 793 – 808. Schacter, D. L., Kagan, J., & Leichtman, M. D. (1995). True and false memories in children and adults: A cognitive neuroscience perspective. Psychology, Public Policy, and Law, 1, 411 – 428. Scullin, M. H. (2002). Measurement of individual differences in children’s suggestibility across situations. Journal of Experimental Psychology: Applied, 8, 233 – 241. Shaver, P. R., & Fraley, R. C. (2000). Attachment theory and caregiving. Psychological Inquiry, 11(2), 109 – 114. State v. Fijnje, 11th Judicial Circuit Court, Dade Country, Florida, 84 – 19728 (1995). Templeton, L. M., & Wilcox, S. A. (2000). A tale of two representations: The misinformation effect and children’s developing theory of mind. Child Development, 71, 402 – 416. Thierry, K., & Spence, M (2002). Source-monitoring training facilitates preschoolers’ eyewitness memory performance. Developmental Psychology, 38, 2, 428 – 437. Thompson, W. C., Clarke-Stewart, K. A., & Lepore, S (1997). What did the janitor do? Suggestive interviewing and the accuracy of children’s accounts. Law & Human Behavior, 21, 405 – 426. Vrij, A., & Bush, N. (2000). Differences in suggestibility between 5-6 and 10-11 year olds: The relationship with self confidence. Psychology, Crime, and Law, 6, 127 – 138. Waterman, A. H., Blades, M., & Spencer, C. (2000). Do children try to answer nonsensical questions? British Journal of Developmental Psychology, 18, 211 – 225. Welch-Ross, M. K. (1995). Developmental changes in preschoolers’ ability to distinguish memories of performed, pretended, and imagined actions. Cognitive Development, 10, 421 – 441. Welch-Ross, M. K. (1997, April). Children’s understanding of the mind: Implications for suggestibility. Poster presented at the biennial meeting of the Society for Research in Child Development, Washington, DC. Welch-Ross, M. K., Diecidue, K., & Miller, S. A. (1997). Young children’s understanding of conflicting mental representation predicts suggestibility. Developmental Psychology, 33, 43 – 53. Wimmer, H., Hogrefe, G. J., & Perner, J. (1988). Children’s understanding of informational access as source of knowledge. Child Development, 59, 386 – 396.
Children’s Suggestibility
281
Young, K., Powell, M. B., & Dudgeon, P. (2003). Individual differences in children’s suggestibility: a comparison between intellectually disabled and mainstream samples. Personality and Individual Differences, 35, 31 – 49. Zaragoza, M. S., & Lane, S. M. (1994). Source misattributions and the suggestibility of eyewitness memory. Journal of Experimental Psychology: Learning, Memory, and Cognition, 20, 934 – 946. Zaragoza, M., Payment, K., Kichler, J., Stines, L., & Drivdahl, S. (2001). Forced confabulation and false memory in child witnesses. Paper presented at the 2001 biennial meeting of the Society for Research in Child Development. Minneapolis, MN.
This page intentionally left blank
THE EMERGENCE AND BASIS OF ENDOGENOUS ATTENTION IN INFANCY AND EARLY CHILDHOOD
John Colombo THE DEPARTMENT OF PSYCHOLOGY, UNIVERSITY OF KANSAS, LAWRENCE, KS 66045, USA THE SCHIEFELBUSCH INSTITUTE FOR LIFE SPAN STUDIES, UNIVERSITY OF KANSAS, LAWRENCE, KS 66045, USA
Carol L. Cheatham THE SCHIEFELBUSCH INSTITUTE FOR LIFE SPAN STUDIES, UNIVERSITY OF KANSAS, LAWRENCE, KS 66045, USA THE DEPARTMENT OF DIETETICS AND NUTRITION, UNIVERSITY OF KANSAS, MEDICAL CENTER, 3901 RAINBOW BOULEVARD, KANSAS CITY, KS 66103, USA
I. INTRODUCTION II. FOUR ATTENTIONAL FUNCTIONS
A. B. C. D.
AROUSAL/ALERTNESS VISUOSPATIAL ORIENTING OBJECT RECOGNITION ENDOGENOUS ATTENTION
III. A MODEL FOR ENDOGENOUS ATTENTION AND SOME HISTORICAL PERSPECTIVES IV. BEHAVIORAL DEVELOPMENT OF ENDOGENOUS ATTENTION
A. NONLINEAR CHANGES IN THE DURATION OF LOOKING B. CHANGES IN PREDICTIVE CORRELATIONS BETWEEN ATTENTION AND COGNITIVE OUTCOMES C. NONLINEAR CHANGES IN LOOK LATENCIES D. THE QUALITY OF ATTENTION IN INFANCY AND TODDLERHOOD E. MEMORY CHANGES COINCIDENT WITH THE EMERGENCE OF ENDOGENOUS ATTENTION F. SUMMARY V. NEURAL BASES OF ENDOGENOUS ATTENTION
A. B. C. D.
CHANGES IN THE FRONTAL LOBES DEVELOPMENT OF MEDIAL TEMPORAL LOBE STRUCTURES DEVELOPMENT OF THE FRONTO-PARIETAL PATHWAY SUMMARY
283 Advances in Child Development and Behavior R Kail (Editor)
ß 2006 Elsevier B.V. All rights reserved.
284
John Colombo and Carol L. Cheatham
VI. THE EMERGENCE OF ENDOGENOUS ATTENTION: SUMMARY AND IMPLICATIONS
A. ENDOGENOUS ATTENTION AND THE DEVELOPMENT OF HIGHER COGNITIVE FUNCTION B. CONCLUSIONS ACKNOWLEDGMENTS REFERENCES
I. Introduction Attention is a familiar and ubiquitous psychological construct that is widely alluded to in various scientific, clinical, and colloquial domains. Nevertheless, attention also remains as one of the least well-understood cognitive functions. This paradox is aptly delineated in William James’ The Principles of Psychology (1890), where James simultaneously states that ‘‘everyone knows what attention is,’’ but fails to provide the reader with a comprehensive definition or description of the phenomenon. Other scientists have attempted to define attention since James’ exposition at the end of the nineteenth century with varying amounts of success. We find it likely that the reason for the continued difficulty in defining attention is that the construct of attention is actually composed of many different neural and behavioral processes (e.g., see Colombo, 2001; Parasuraman & Davies, 1984). Interestingly, James identified this property of the phenomenon correctly (albeit implicitly) when, among his taxonomy of cognitive functions, he included the heading ‘‘Varieties of Attention.’’ Indeed, this phrase graced the title of an influential volume (Parasuraman & Davies, 1984) that appeared a century after the publication of Principles. The processes that represent the construct of attention are often said to share a common theme of ‘‘selection.’’ Given that cognitive systems are likely finite and limited in capacity, organisms will not process all of the available inputs that the environment affords; that is, some inputs are selected at the expense of others. The organism’s access to unselected inputs is a point of controversy; the metaphors of ‘‘filter’’ or ‘‘spotlight’’ have been proposed to characterize such access (e.g., Broadbent, 1957; Sperling, 1960; Treisman & Gelade, 1980). However, the process of selection is complex, and depends on a number of subsidiary processes, such as detection, localization, and probably some form of recognition. Advances in the cognitive neuroscience of attention1 during the 1 A clear limitation within the field is the fact that most of what is known about attention is derived from the study of visual attention. This is no doubt attributable to the fact that attention in the visual realm is the easiest to study, given that such attention requires an overt behavior (e.g., observing responses; Cohen, 1969).
Endogenous Attention
285
latter part of the twentieth century (cf. Posner, 2004) have elucidated the neural pathways by which these processes occur, and have lent support to the notion of the existence of many ‘‘varieties’’ of attention. Indeed, the nature of how inputs are selected, and the levels at which such selection takes place, can vary markedly. In addition, selection may be accomplished by different, seemingly independent neural substrates (Schiller, 1998; Rafal & Robertson, 1995; Webster & Ungerleider, 1998). Most important for the readership of this series is the fact that these systems have distinct and dissociable developmental courses (Colombo, 2002; Richards, 1998), and thus the nature, character, and function of attention during development will be determined by the interaction of different systems at different levels of maturity (Colombo, 2001).
II. Four Attentional Functions We have found it helpful to conceptualize the construct of attention and its development in terms of four different systems (Colombo, 2001, 2002). In adults, these systems undoubtedly work together seamlessly in providing the individual with the impression of continuous consciousness and singular process. However, the systems appear to have different developmental courses and predominate at different times during the first two years of life (Colombo, 2001). In what follows, we briefly review these attentional systems.
A. AROUSAL/ALERTNESS
The most basic aspects of attention involve the states of alertness or arousal. These functions serve to set background parameters within which other cognitive components operate. Arousal has long been considered to be central to the efficacy of cognitive function (e.g., Yerkes & Dodson, 1908), and we see the fundamental state of readiness to accept input as a basic setting variable against which other cognitive functions operate. It is also worth noting that the interaction of readiness and higher-order functions is bidirectional. Research on the neurochemistry of brainstem systems informs our understanding of these basic characteristics, which might be considered to be fundamental precursors to attention. Four different ascending brainstem systems appear to modulate these functions, and each pathway is characterized by a predominant neurotransmitter. A noradrenergic system originates in the locus coeruleus (Dahlstrom & Fuxe, 1964), and neurons from this brainstem region project globally to nearly every
286
John Colombo and Carol L. Cheatham
other part of the brain (Aston-Jones, Foote, & Bloom, 1984; Foote, Bloom, & Aston-Jones, 1983). Output from this noradrenergic system modulates neuronal responses involving other neurotransmitter systems (e.g., glutamate and GABA), effectively preparing neurons for input and increasing the signal sensitivity. Behaviorally, this system has been the one most closely linked to attentional processes, and is thought to be integral to anticipatory readiness or alertness for stimulus input (Aston-Jones et al., 1994; Rajkowski, Kubiak, & Aston-Jones, 1994; Usher et al., 1999). A cholinergic pathway also arises from the brainstem, mostly from the pontine tegmentum. This system has been implicated generally in cortical activation and aspects of the sleep–wake cycle (e.g., Datta & Siwek, 1997; Douglas et al., 2004; Moruzzi & Magoun, 1949), in producing saccadic eye movements (e.g., Glimcher, 1999), as well as in motor behavior and nociception. The pontine tegmentum has been hypothesized to be involved specifically in attentional and learning processes by Steckler et al. (1994), while the broadly distributed cholinergic system that arises from this structure has been implicated in cognitive performance during sustained attention tasks (Robbins et al., 1989; Sarter, Givens, & Bruno, 2001; Sarter, 1994; Warburton, 1977; see also Porges, 1976, 1992). It is important to note that aspects of this system have been hypothesized to be involved in the synchronization and desynchronization of neural processes (e.g., Routtenberg, 1966), which is widely suspected to be the neural basis for improved cognitive function and learning during ‘‘attentive states’’ from research in neural networks (Muresan, 2004; Tiesinga et al., 2004) as well as neuroscience (Fell et al., 2003; Freiwald & Kanwisher, 2004; Niebur, Hsiao, & Johnson, 2002; Ward, 2003). The role of the other two neurotransmitter-based brainstem systems in attention is less clear. A dopaminergic system arises from the nucleus accumbens and has been broadly characterized as being integral to the activation of behavior (Brown & Robbins, 1991; Koob, 1992; Robbins et al., 1989). It has been linked to various motivational and regulatory aspects of cognitive performance, such as hedonic (e.g., Young, Moran, & Joseph, 2005) or emotional (e.g., Salgado et al., 2005) processes. Finally, the action of the serotonergic pathway is the least well understood (Robbins, 1998), although one working hypothesis is that it mediates aspects of behavioral inhibition (Gray, 1982; Harrison, Everitt, & Robbins, 1997a,b, 1999). These pathways have several common limbic and cortical targets, including frontal areas (Arnsten & Robbins, 2002; Robbins & Everitt, 1995). It is also clear that in the mature organism, the communication between these ascending pathways and higher neural areas is bidirectional, thus allowing these pathways to mediate higher-order structures, as well as to be mediated by them.
287
Endogenous Attention
B. VISUOSPATIAL ORIENTING
Visuospatial orienting is a second function within the construct of visual attention, and is generally considered to involve monitoring of the visual field, detecting change, and orienting the organism to spatial loci in which events or objects of interest are present. The structure and function of this system corresponds in many ways to Posner’s ‘‘posterior attentional system’’ (e.g., Posner & Petersen, 1990), although many other authors have characterized these functions as being mediated by a dorsal pathway (see Figure 1), which leads from the lateral geniculate nucleus of the thalamus to the occipital cortex and involves both lower-order brainstem structures such as the superior colliculus and higher cortical areas in the parietal lobe (e.g., Schiller, 1998; Webster & Ungerleider, 1998). The dorsal pathway is thought to govern basic functions involved in orienting to a stimulus in the visual field; the dorsal pathway monitors the visual field (most likely responding to low spatial frequency ‘‘blobs’’ or ‘‘patches’’: Driver & Baylis, 1998; Egly, Driver, & Rafal, 1994, He & Nakayama, 1995), disengages attention from its current focus, shifts attention to the new locus or stimulus, and engages it there (Posner, 1980; Posner & Cohen, 1984; Posner et al., 1985;
P
BG SN
V1
MT V2 V4 LGN SC BS
Fig. 1. Schematic for the dorsal visual attentional pathway. Notes: LGN ¼ lateral geniculate nucleus, SC ¼ superior colliculus, BS ¼ brainstem, BG ¼ basal ganglia, SN ¼ substantia nigra, P ¼ parietal lobe, MT ¼ area MT at temporal/parietal/occipital junction, V1, V2, V4 ¼ primary visual areas.
288
John Colombo and Carol L. Cheatham
Posner & Petersen, 1990). In many reports, the dorsal system is referred to as the ‘‘where’’ system in visual attention. C. OBJECT RECOGNITION
The determination of where an object or event is in the visual field is important for orienting sensory receptors, but somewhat simultaneous with the localization of an object or event is analysis of the properties and perhaps the identity of the object. Thus, the third attentional function may be conceptualized as a system for object recognition. This is generally thought to be accomplished by the ventral pathway (see Figure 2), which also arises from the geniculostriate visual stream, but branches from the occipital lobe to visual areas along the temporal lobe that appear to reflect the internal representation of progressively complex visual stimuli (e.g., Schiller, 1998; Webster & Ungerleider, 1998). Given these characteristics, the function of the ventral pathway has been linked closely to object perception and recognition. As such, the inferotemporal and fusiform cortices, which reside under the temporal cortex, have been conceptualized as the logical culmination of the ventral pathway (e.g., Brincat & Connor, 2004); these two areas appear
BG SN
V1 V2 V4
LGN
T SC BS
Fig. 2. Schematic for the ventral visual attentional pathway. Notes: LGN ¼ lateral geniculate nucleus, SC ¼ superior colliculus, BS ¼ brainstem, BG ¼ basal ganglia, SN ¼ substantia nigra, T ¼ temporal lobe visual areas, V1, V2, V4 ¼ primary visual areas.
Endogenous Attention
289
to mediate perception and recognition of complex visual objects or stimuli. Indeed, the fusiform cortex has long been hypothesized as the ‘‘face-processing area’’ (see Rhodes et al., 2004; Rossion et al., 2000). More recent neuroimaging studies suggest that the area around the fusiform gyrus may have a more general function in recognition (Cohen et al., 2004; Fuster et al., 2005; Goh et al., 2004; Koutstaal et al., 2001; Pins et al., 2004; Rogers et al., 2005), perhaps representing the locus at which highly practiced specific visual features or objects (e.g., faces, emotions, animals, letters, etc.) are encoded. As a complement to the dorsal pathway’s characterization as the where system, the ventral pathway is often characterized as the what system.
D. ENDOGENOUS ATTENTION
The primary focus of this chapter, however, is the fourth function, which we characterize as endogenous attention. The construct of endogenous attention is widely employed (Ashkenazi & Marks, 2004; Barrett, Bradshaw, & Rose, 2003; Beer & Roder, 2004, 2005; Couyoumdjian, Di Nocera, & Ferlazzo, 2003; Danziger, 2002; Dosher, Liu, Blair, & Lu, 2004; Driver & Spence, 1994; Eimer et al., 2005; Goldsmith & Yeari, 2003; Han, Wan, & Humphreys, 2005; Lee, 1999; Lupianez & Milliken, 1999; Pascual-Leone, 2004; Schmidt, 2000) but is not often well specified. We have elsewhere (Colombo, 2001, 2002) implied that endogenous attention may be characterized as the process through which the allocation of attention is controlled to stimuli, objects, or events as a function of events that are internal to the organism. By ‘‘controlled,’’ we mean that attention is directed or inhibited on a volitional basis. The operation of the dorsal and ventral pathways biases visual attention to be drawn from an attended stimulus at midline to a stimulus that appears in the peripheral visual field. In some cases, however, the organism may inhibit the movement of receptors toward that peripheral distractor, and maintain focus on the central, focal stimulus. We propose that such a sequence typically reflects the function of endogenous attention. For example, consider an adult engaged in the completion of a complex task, or a toddler engaged in a simple problem-solving sequence. In each case, the individual may be observed to maintain focus in the face of distraction. Those processes that ‘‘hold’’ attention on task, generally in the service of the attainment of a goal, would be characterized as endogenous.2 2
The maintenance of attention to the central target may occur as a result of the failure to detect the peripheral stimulus, or to simple, reflexive processes that involve the perceptual or sensory salience of the midline stimulus or object due to arousal or tropistic mechanisms. In such cases, maintenance of the central focus would reflect lower-order cognitive processes that we would not characterize as ‘‘endogenous.’’
290
John Colombo and Carol L. Cheatham
We use the term ‘‘endogenous’’ here because attention appears to be directed from within the organism without any obvious exogenous referent. A scientist who observes the organism behaving under such circumstances would be given the distinct impression that attention is being allocated on a purposeful, proactive, and voluntary basis. Indeed, the conceptualization of voluntary behavior within the realm of cognition has resisted satisfactory or comprehensive analysis. In this chapter, we propose an initial step in addressing this issue; we have no delusions that this will be a definitive analysis, but we do hope that it represents a generative step in a better understanding of the origins and structure of higher-order cognitive processes from a developmental view.
III. A Model for Endogenous Attention and Some Historical Perspectives Our aim in this chapter is to present a speculative overview of endogenous attention from a developmental cognitive neuroscience perspective. Our thesis is fairly simple. We propose that endogenous attention emerges in a rudimentary form at some point late in the first year of life. Endogenous attention then shows considerable improvement through the middle of the second year, such that during toddlerhood it becomes a predominant cognitive process that forms the basis for a number of higher-order functions—particularly those that are commonly referred to as ‘‘executive’’ functions. We further propose that the basis of endogenous attention lies with the integration of various memory systems with pathways that mediate more basic attentional processes of arousal, orienting, engagement, and object recognition; this integration is largely attributable to the maturation of circuitry in the frontal areas of the brain. Along with a brief review of the development of some of the CNS structures that presumably serve as underlying substrate for this function, we hope to explicate the ways in which endogenous attention may serve as the basis for many higher-order cognitive and socioemotional functions. The distinguishing characteristic of endogenous attention is the notion that it is driven from within the organism; that is, endogenous attention appears to be volitional in nature. Our proposal that endogenous attention is a qualitatively distinct form of attention is not new. Indeed, the distinction between voluntary and involuntary types of attention appears in James (1890), but it recurs as a theme in the psychological literatures on attention through much of the early twentieth century (e.g., Atkinson, 1943; Pillsbury, 1917; Sokolov, 1992). For example, Baldwin (1895) made particular note of the existence of voluntary attention in young children. Vygotsky (1928) also recognized the distinction
Endogenous Attention
291
between involuntary and voluntary attention; he suggested that language acquisition was responsible for improvements in both memory and voluntary attention in development. Leontiev (1932) made a number of proposals that will be echoed in the sections that follow; he put forth a model of voluntary attention in which the replacement of ‘‘external stimuli’’ by ‘‘internal stimuli’’ constituted a key development in the emergence of voluntary attention, and recognized the relation between the emergence of voluntary control of attention and the decline of distractibility. Wright and Vlietstra (1975) reviewed the literature on the development of attention. They posited the existence of a qualitative shift in the nature of attention seen in early development to that seen later in childhood; this shift represents a change from attention being largely drawn to or captured by exogenous influences to attention being driven in a more proactive, purposeful manner by endogenous influences. Wright and Vlietstra (1975) noted that the emergence of endogenous attention is a critical step in the development of the child as an active seeker of information. Finally, although Ruff and Rothbart (1996) used a different terminology in their careful analysis of the development of attention, they also recognized a qualitative shift in the nature of attention across development. For them, orienting-based behaviors that were present during the first year were contrasted with more mature forms of attention seen later on during the toddler period; the latter form of attention they described closely parallels the description of endogenous attention that we have put forth here.
IV. Behavioral Development of Endogenous Attention To this point, we have discussed endogenous attention in the abstract sense. The purpose of this section is to discuss the evidence for its emergence and its developmental course in infancy and early childhood. We begin by reviewing data on the nonlinear developmental course of some widely used dependent measures; we hypothesize that changes seen in these variables reflect changes in underlying attentional process toward the end of the first year, and that such changes reflect the emergence of endogenous attention.
A. NONLINEAR CHANGES IN THE DURATION OF LOOKING
The traditional methods for studying attention in very early development typically involve measures of what might be termed visual fixation, visual regard, or more colloquially, looking. Our own research on the development of visual habituation during the 1980s and 1990s suggested that the duration of looking
292
John Colombo and Carol L. Cheatham
was an important index of cognitive process, from the standpoint of both developmental and individual differences. Among the most consistent findings from our laboratory and those of other scientists was the observation that the length of looking decreased substantially across the middle of the first year (e.g., Colombo & Mitchell, 1990). Some data suggested that this decline in look duration leveled off after about 6 or 7 months of age, although the topography of this developmental course may be reasonably expected to vary with the complexity of objects and the situation in which looking is measured (e.g., Ruff, 1986a). However, we were surprised to find that, in one large-sample longitudinal study (Saxon, Frick, & Colombo, 1997), look duration actually significantly increased between 6 and 8 months of age (see also Ruff, 1986a, Study 2). This spurred us to compile the available data on the development of look duration across the first years of postnatal life (Colombo, Mitchell, & Harlan, 1999; see Colombo, 2002). This analysis suggested that look duration followed a complex developmental course; a decline from 3 months that leveled off at approximately 6 months, and then began to increase toward the end of the first year (see Figure 3). Although a subsequent longitudinal study in our own 1.5
Standardized Look Duration
1
0.5
0
-0.5
-1
-1.5 16
34
52
70
88
106
124
142
160
Age (weeks) Fig. 3. The nonlinear course of infant look duration from birth to 3 years of age. Note the change in direction with development in look duration toward the end of the first year. Data are drawn from part of the database originally described Colombo, Harlan, & Mitchell (1999); in this analysis, we used an effects-coding regression scheme to control for differences in mean level across studies. Dots represent standardized means calculated for each age point; the best-fitting regression line for the entire period shown appears in gray.
Endogenous Attention
293
laboratory (Colombo et al., 2004) did not show significant increases up to 9 months, this U-shaped function was confirmed in a careful cross-sectional study of infants from 6 to 12 months of age with a number of different stimulus types by Courage, Reynolds, and Richards (2004). In addition, a number of other studies seeking to measure attention at ages beyond one year have shown increases in look duration (or components of attention based on look duration) with age (Ruff & Capozzoli, 2003; Ruff, Capozzoli, & Weissberg, 1998; Ruff & Lawson, 1990; see also Lawson & Ruff, 2001; Ruff & Rothbart, 1996). The process mediating the latter portion of this curvilinear function is undetermined, but we suspect that the existence of this change belies the emergence of endogenous attentional function. The early decline in looking has long been attributed to improvements in the rapidity with which infants encode visual stimuli (Colombo & Mitchell, 1990; Colombo et al., 1991). However, that explanation does not fit with the unexpected rise in look duration toward the end of the first year; if we maintain the use of the same underlying construct used during early infancy to explain what happens as infants approach or go beyond the first year of life, we would have to hold that they actually get slower in their processing. This is quite unlikely, and therefore, some different underlying cognitive process must be reflected by look duration toward the end of the first year and into the second. This point had been made by Ruff (1986a) much earlier, on the basis of evidence from very different types of attentional paradigms. In her research program, Ruff pioneered the assessment of attention of toddlers and preschoolers in naturalistic situations, where objects were presented during free play periods, and where manipulation and examination behaviors were coded carefully. Her findings suggested sophisticated changes in attention and in attention-action linkages that emerged toward the end of the first year (Ruff, 1984; Ruff et al., 1992) as stable individual characteristics that persisted into the preschool period (Ruff & Dubiner, 1987; Ruff et al., 1990).
B. CHANGES IN PREDICTIVE CORRELATIONS BETWEEN ATTENTION AND COGNITIVE OUTCOMES
Another point in favor of this hypothesis is based on the fact that these early look duration measures are modestly correlated with later cognitive performance, but the direction of the correlation between look duration and later outcome is different, depending on the age at which look duration is measured. Early on (e.g., during the first year), the valence of the correlation has been reported to be negative, with shorter looking being related to more optimal cognitive outcomes (see Colombo, 1993; McCall & Carriger, 1993; Tamis-LeMonda & Bornstein, 1989). This is generally explained in terms of
294
John Colombo and Carol L. Cheatham
the efficiency or rapidity of stimulus processing during early development; young babies who are more efficient (shorter look durations, more rapid habituators) tend to show more optimal cognitive function (IQ, language) later. At ages beyond one year, however, the valence of the correlation is typically reversed, and tends to be positive. Here, longer look durations predict better performance later on (Lawson & Ruff, 2004a; Ruff et al., 1990; see also Lawson & Ruff, 2004b), or increases in look duration with age are related to more positive factors (e.g., Colombo et al., 2004). A more parsimonious explanation for this reversal in valence is to propose a change in the process(es) underlying look duration at earlier vs. later ages in infancy. The later, positive correlation may be best addressed by hypothesizing that look duration at the older ages reflects the ability to voluntarily sustain or maintain attention on an object, either in response to the object’s properties or to some short-term goal being served by attention being held in this way. An explanation of this sort would be highly consistent with the construct of endogenous attention.
C. NONLINEAR CHANGES IN LOOK LATENCIES
A similar curvilinear function exists in studies that employ paradigms that measure ocular or visual latencies to a target presented in the visual periphery. Such latencies are measured in two types of paradigms: gap/overlap paradigms and distractibility paradigms. In the gap/overlap paradigm, the infant’s attention is drawn to a central target, and a visual target is illuminated in the periphery (Atkinson et al., 1992; Blaga & Colombo, in press; Frick, Colombo, & Saxon, 1999; Richards, 1985, 1987). This procedure is typically used with very young infants (e.g., 3 – 7 months of age), and the latency to move fixation to the peripheral target is interpreted as a measure of the infants’ facility to disengage attention from the central target. That is, faster latencies are considered positively (e.g., Frick, Colombo, & Saxon, 1999), and longer latencies are associated with less optimal predictions for developmental outcome (e.g., Ruff, 1986b). In a similar paradigm used with older infants and toddlers, the latency is generally regarded with the opposite valence. In this paradigm, toddlers are presented with one or more objects or a problem-solving task, and a salient multimodal stimulus is presented in the periphery. Here, the latency to look to the stimulus is interpreted as a measure of the toddler’s distractibility. That is, long latencies are often regarded more positively. It is clear from a number of studies using the gap/overlap paradigm, in which latency is interpreted as disengagement (Atkinson et al., 1992; Blaga & Colombo, in press; Frick, Colombo, & Saxon, 1999; Richards, 1985, 1997, 1989), that latencies show a precipitous decrease across early infancy. It appears that this decrease occurs chiefly between 3 and 4 months of age, given that
295
Endogenous Attention
7000 6000 Latency (ms)
5000 4000 3000 2000 1000 0 0
2
4
6
8
10
12
14
16
18
Age (months) Fig. 4. Visual latencies across the first two years, culled from a number of independent studies conducted across several laboratories (Atkinson et al., 1992; Blaga & Colombo, in press; Frick, Colombo, & Saxon, 1999; Lansink & Richards, 1997; Oakes & Tellinghuisen, 1994; Richards, 1985, 1997, 1989; Ruff, 1986a). Note the latencies toward the end of the first year. Stimuli vary across ages and studies, but the basic procedure and coding of latency is consistent. Latencies were taken from means reported in the studies and then subject to an effects-coding regression scheme to derive a normative or summary developmental function that controls for differences in mean latency across studies.
latencies do not change appreciably between 4 and 7 months (Frick, Colombo, & Saxon, 1999; Ruff, 1986a), although Richards (1997) and Lansink and Richards (1997) have shown that latencies decrease from 2 to 9 months of age. However, Oakes and Tellinghuisen (1994) showed that similarly measured ocular latencies actually increase from 7 to 10 months of age. Whereas it is true that the focal or midline stimuli are less complex for the gap/overlap paradigms (two-dimensional visual stimuli) than are the stimuli for the distractibility paradigms (objects and multimodal stimuli), and the complexity of the distractors vary between paradigms and even, studies, these data nevertheless suggest nonlinear development (see Figure 4). Indeed, Ruff and Capozzoli (2003) examined the data and proposed a curvilinear developmental course for distractibility. Lansink and Richards’ (1997) physiological data support this conclusion, in that heart-rate-defined distraction latencies were shortest at 9 months, relative to both 6- and 12-month-olds; it is important to note that this curvilinear function was obtained under conditions where the stimuli were held constant across ages. Once again, a nonlinear change in the direction of developmental course for an important measure of attention is seen. Interestingly, the course of this change is quite similar to that seen for look duration, in that it occurs toward
296
John Colombo and Carol L. Cheatham
the latter half of the first year and into the second year. Furthermore, like the nonlinear course for look duration, the developmental course of distraction latency is difficult to interpret in terms of the same underlying construct across the entire period of infancy. Instead, we would again suggest that visual latencies reflect different processes at these different points. In early infancy, it seems likely that visual latencies reflect the ability to disengage attention, which is an important function of the dorsal stream and Posner’s posterior attentional network (Posner & Petersen, 1990). Later on, however, visual latencies reflect processes such as the inhibition of shifting attention to a distractor, presumably in the service of holding attention to a focal object, stimulus, or event. Once again, this later interpretation is highly consistent with the construct we have proposed as reflecting endogenous attention.
D. THE QUALITY OF ATTENTION IN INFANCY AND TODDLERHOOD
In the last two sections we have described a set of behavioral measures that are thought to reflect attentional function. Attention, of course, is presumably directly related to the quality of the ongoing cognitive process, but the degree to which behavioral measures reflect this process has long been suspect. A great deal of effort has been expended in attempts to determine whether an infant or child is truly engaged in substantial cognitive activity when looking at a stimulus, object, or event; these efforts have yielded considerable literature on the quality of attention during infancy and childhood. 1. Heart-Rate Defined Phases of Attention The most widely known characterization of differential quality of attention in early infancy comes from Richards’ research program, in which phases of attention are identified or delineated through the simultaneous measurement of looking and cardiac activity (Casey & Richards, 1988; Richards & Casey, 1991). During a typical period of visual inspection, the young infant’s heart rate decelerates in a very robust manner. Richards (e.g., Richards, 1997) has shown that the phase of looking characterized by sustained deceleration likely reflects the period during which attention is truly engaged and related to information processing. Richards has termed this decelerative phase as the sustained attention phase. During looking, sustained attention is preceded by a phase called orienting (sometimes including an initial acceleration, but more generally reflecting the latency to decelerate) and is followed by a phase called attention termination (a period during which the infant is still looking, but where the infant’s heart rate has returned to baseline or pre-stimulus levels). Whereas sustained attention presumably reflects higher-quality attention, attention termination has been linked to the disengagement of attention.
Endogenous Attention
297
Very young infants are less distractible during the heart-rate defined period of sustained attention (Lansink & Richards, 1997; Richards & Turner, 2001), and infants better remember stimuli that are presented during periods of sustained attention than stimuli presented during other heart-rate defined periods. In addition, infants showing higher levels of sustained attention perform better in paradigms that require the maintenance of attention (Colombo & Richman, 2002). Moreover, deficits in stimulus recognition performance during early infancy have been significantly linked to longer periods of attention termination (Colombo et al., 2001), rather than lower levels of sustained attention. Furthermore, the developmental course of sustained attention follows a nonlinear course; data from the Colombo et al. (2004) longitudinal database are shown in Figure 5. Although the percentage of attention termination declines linearly from early infancy, the percentage of sustained attention follows an inverted-U-shaped function, increasing from 3 to 6 months of age, and thereafter, decreasing. Once again, a change is seen as the infant enters the second half of the first year that is difficult to explain in terms of the invocation of a single process. 2. Focused and Casual Attention The endeavor to differentiate the quality of attention in older infants has been pursued extensively. Through facial expressions and body movement, developmental scientists have sought to differentiate a more engaged state of focused attention from a less engaged state of casual attention. Focused attention is characterized by a concentrated, intent facial expression accompanied by a stilling of extraneous body movements and a centering of the body around the item of interest; casual attention involves regard for the item without any evidence of interest (simple looking: e.g., Oakes & Tellinghuisen, 1994; Ruff, 1986a; Ruff & Lawson, 1990; Ruff & Capozzoli, 2003). Focused attention has been observed as early as 6½ months of age (Oakes, Kannass, & Shaddy, 2002), although it has generally proven to be a more appropriate measure for infants later in the first year and into the second and third years. Across this time period, the amount of focused attention increases (Ruff & Capozzoli, 2003; Ruff, Capozzoli, & Weissberg, 1998; Ruff & Lawson, 1990). Not surprisingly, when it is possible to measure both behaviorally defined and psychophysiologically defined states of attention, the occurrence of focused attention corresponds substantially to physiologically defined sustained attention phases (Lansink & Richards, 1997). However, empirical assessments of attention suggested that the HR-defined attentional phase of sustained attention (e.g., Casey & Richards, 1988) may not be isomorphic with behavioral measures of focused attention (Oakes & Tellinghuisen, 1994), and future research will have to determine whether this is merely an issue of measurement error or an issue that deserves theoretical analysis.
Actual SA Modeled SA
Time (msec)
10000 8000 6000 4000 2000
0.58 Proportion of looking
12000
0.56 0.54 0.52 0.50 Actual SA Modeled SA
0.48 0.46
0 3
4
5
6
7
Age (months)
8
9
3
4
5
6
7
8
9
Age (months)
Fig. 5. The developmental course of sustained attention across the first year, from the Colombo et al. (2004) longitudinal database. The graph at the left shows the amount (in real time) of sustained attention per look, which drops to asymptotic levels after 6 or 7 months of age. The graph at the right shows the proportion of sustained attention per look, and this shows a curvilinear relationship, with a peak at 6 months. Clearly, both measures show a transition point toward end of the first year.
Endogenous Attention
299
As with the construct of sustained attention, it can be shown that infants are engaged in more active processing of information about the target stimuli during focused attention (Oakes, Madole, & Cohen, 1991; Oakes & Tellinghuisen, 1994; Ruff, 1986a; Ruff et al., 1992). It is worth noting that, particularly in studies with older infants, latencies to turn to a distractor are longer when the distractor onset is coincident with a period of focused attention to an initial stimulus, relative to latencies when the distractor occurs during a period of casual attention (Oakes & Tellinghuisen, 1994; Oakes, Tellinghuisen, & Tjebkes, 2000; Ruff & Capozzoli, 2003; Ruff, Capozzoli, & Saltarelli, 1996; Tellinghuisen & Oakes, 1997; Tellinghuisen, Oakes, & Tjebkes, 1999). These data are generally interpreted as evidence of the allocation of attention to the target stimulus and inhibition of attention to competing stimuli. These findings are generally gleaned from object examining tasks, in which one observes while a child is looking at, manipulating, or playing with a toy. Examining behaviors coincide with the definition of focused attention with the addition of two characteristics: fingering of the object and rotating the object in the hands. It is during periods of examining that processing of the object’s details occurs (Cheatham, Bauer, & Georgieff, 2006; Oakes & Tellinghuisen, 1994; Ruff, 1984, 1986a; Ruff et al., 1992). Moreover, infants examine the object more during early periods of the session when the object is more novel, as opposed to later periods, after the object has become familiar through examining or extended exposure (Cheatham, Bauer, & Georgieff, 2006; Courage, Howe, & Squires, 2004; Oakes & Tellinghuisen, 1994; Ruff, 1986a; Ruff et al., 1992), and infants examine complex objects longer than simple objects (Cheatham, Bauer, & Georgieff, 2006; Oakes & Tellinghuisen, 1994). In paradigms where the infants are presented with a second novel object after familiarization with the initial object, they reorganize their attention to the second novel object and the length of examining returns to initial levels. When the familiar object is returned to the child, examining is not evidenced (Cheatham, Bauer, & Georgieff, 2006; Oakes & Tellinghuisen, 1994). Furthermore, latencies to look to a distractor have been shown to be dependent on the infants’ familiarity with the primary stimulus (Oakes, Kannass, & Shaddy, 2002; Oakes & Tellinghuisen, 1994). To illustrate, in a study of 6½-month-olds and 9- to 10-month-olds, Oakes, Kannass, and Shaddy (2002) found that novelty of the target object differentially predicted the tendency for the 9- and 10month-olds to turn to a distractor. These older infants were more distractible when exploring a familiar toy, relative to when they were examining a novel toy; most importantly, this was not the case for the 6½-month-olds. This study suggests that attention is differentially allocated in infants at the end of the first year, in part based on the memory of the target object. In summary, studies of the quality of attention suggest significant advances in the allocation of attention, especially toward the end of the first year.
300
John Colombo and Carol L. Cheatham
This is in keeping with our hypothesis that endogenous attentional functions emerge at that time. The Oakes, Kannass, and Shaddy (2002) study is important, because the allocation of attention based on the integration of memory has a salient role in our model of endogenous attention. If this model is viable, then we should also see significant changes in memory toward the end of the first year. We now turn to a review of these data. E. MEMORY CHANGES COINCIDENT WITH THE EMERGENCE OF ENDOGENOUS ATTENTION
If our model of endogenous attention is to be taken seriously, changes seen in attention allocation occurring toward the end of the first year should coincide with the emergence and integration of memory functions during this same time period. Indeed, memory researchers report significant changes between 9 and 10 months of age in infants’ ability to recall sequences of actions. There is a considerable extant literature on infants’ manual search and on their propensity to make the A-not-B error (Diamond, 1985, 1988, 1991, 2002), and the age trends in this literature clearly indicate the latter half of the first year is a watershed point in the ability of memory and inhibition to guide behavioral action. Mnemonic abilities of preverbal infants have also been assessed using imitation paradigms in which the infants are shown some sequence of actions and then, either immediately or after a delay, are encouraged to imitate (for review see Bauer, 2005). Only 50% of 9-month-olds readily recall single actions for 24 hours (Meltzoff, 1988), and they all exhibit difficulty with longer delays (Bauer et al., 2001) unless exposed to the events several times (Bauer et al., 2001; Carver & Bauer, 1999) after which the 50% success rate holds. In addition, 9-month-olds do not evidence ordered recall of target actions without numerous exposures. Only one month later, 10-month-olds exhibit memory at a higher percentage, in both target actions and ordered recall, with or without re-exposure, and after longer delays (Carver & Bauer, 1999). Clearly, important developments in mnemonic abilities occur during the second half of the first year of life. F. SUMMARY
Taken together, these studies lead us to propose that the development of mnemonic processes contribute substantially to endogenous attention. Indeed, the relation between memory and attention is a reciprocal one: Focused, sustained allocation of attention to a stimulus allows for the establishment of an enduring memory trace, and memory for that stimulus can serve as a basis
Endogenous Attention
301
for the distribution of attentional resources at some later point. The coincident development of both the behavioral indices of endogenous attention and various aspects of memory toward the end of the first year of life further bolsters the idea that the integration of these two systems contributes in a substantial manner to the higher-order skills that emerge in the toddler and preschool periods. In our model, endogenous attention arises from frontally mediated integration of the lower-order attentional systems described in previous systems and various memory processes. The relation between working memory and what we have called endogenous attention here has been a topic of investigation with adults. It is worth noting that a proposal similar to the one we are putting forth here has been delineated by Engle and his colleagues (Kane & Engle, 2002; see also Engle, Kane, & Tuholski, 1999). In their model, they posit frontal mediation of the integration of working memory and attention in the service of higher-order executive cognitive processes, but they do not address the developmental components. Our proposal complements theirs in many ways, as we will argue that endogenous attention is, in fact, the emergent form of a number of higher-order functions, including those typically included under the heading of executive functions. In support of Engle’s model, working memory has been found to predict both attentional and the higher-order ‘‘executive’’ functions with striking consistency (Bleckley et al., 2003; Engle, 2002; Kane & Engle, 2003; Kane et al., 2001). It is worth noting that myelination and functional activity of frontal pathways have been directly linked to improvements in working memory during childhood (Klingberg, Forssberg, & Westerberg, 2002; Nagy, Westerberg, & Klingberg, 2004). In addition, the linkage between working memory and a broad set of psychological domains, including inhibitory control, psychophysiology, individual differences in emotional regulation, and vocabulary has been reported by Wolfe and Bell (2004). We now turn to a discussion of the neural developments thought to support the developments in endogenous attention and mnemonic processes at the end of the first year of life.
V. Neural Bases of Endogenous Attention A fundamental point in our model is that endogenous attention emerges as the result of the integration of attention and various forms of memory during late infancy and toddlerhood. Although the constructs of memory, attention, and inhibition are typically considered to be independent and distinct functions, in fact the development of these functions is supported by common neural circuitry in the frontal lobe (Casey, Giedd, & Thomas, 2000). Indeed, the prefrontal lobe is widely regarded as one of the last cortical regions to
302
John Colombo and Carol L. Cheatham
mature (Fuster, 1997; Casey et al., 2000). Marked changes in the frontal areas of the brain occur from infancy through adolescence, and it is also the case that cognitive processes change at an unsurpassed rate during this time; this association has suggested to many scientists before us that the frontal areas are a likely place to look for the basis of cognitive advancement. What remains for us to do in this proposal is to identify specific changes during the transition from infancy to toddlerhood that could support the emergence of endogenous functions. Thus, in the sections that follow, we present a brief survey of the nature of the structure of the frontal lobes, as well as the emergence of particular structures and pathways that could serve to integrate the lower-order attentional systems with memory. A. CHANGES IN THE FRONTAL LOBES
1. The Nature of Frontal Area Function The development of frontal structures has long been posited to drive much of the development of complex cognitive function in infancy and early childhood (e.g., Johnson, 2001), and the frontal lobes have long been implicated in many classic tasks of attention (e.g., Goldman, Shapiro, & Nelson, 2004). In the last decade, frontal development in infancy and early childhood has most prominently been assessed in terms of the A-not-B error (Bell, 1998; Bell & Adams, 1999; Bell & Fox, 1997; Diamond, 1988, 1991, 2002; Diamond & Doar, 1989; Espy et al., 1999). The specific process or processes thought to be mediated by the development of frontal areas are many and varied; these include (for example) the development of competent inhibition (e.g., Diamond, 1988; Stroganova et al., 2003), the temporal sequencing of behavior (e.g., Fuster, 2002; Kolb, 1984), or various aspects of language processing (e.g., Caramazza & Zurif, 1976; Hagoort et al., 1999; Noppeney, Phillips, & Price, 2004). Although particular functions have been associated with some specific frontal areas (e.g., Barbas, 2004), some investigators have proposed that frontal function might be conceptualized a little more broadly (e.g., Luria, 1974; Jackson, 1958); in this view, the frontal areas may be characterized as coordinating and facilitating the cross-communication of a number of lower-order subsystems (Desimone & Duncan, 1995; Miller & Cohen, 2001; Thompson-Schill, Bedny, & Goldberg, 2005). This view is supported by the nature of circuitry in the frontal cortex (e.g., Barbas, 2000), which, for example, supports bidirectional communication with the rest of the brain. 2. Structural and Functional Changes in the Frontal Lobe Frontal volume increases from 5 months of age to 18 years of age in a sigmoidal manner; there is a steady increase from 5 months to 8 years of age,
Endogenous Attention
303
and this gives way to a very rapid increase in volume between 8 and 18 years of age (Kanemura et al., 2003). Changes in the frontal cortex occurring during late infancy and toddlerhood have also been characterized as being significant and qualitatively distinct within the general trend of maturation. Indeed, analysis of the fibroarchitecture of the cortex in infants across the first year shows a growth in axonal fibers and a decrease in the number of neurons; these curves cross between 6 and 8 months of age (Tsekhmistrenko et al., 2004). Between 7 and 12 months of age, there are also major changes in the prefrontal cortex that have been consistently linked to memory processing, storage, and retrieval (for review see Moscovitch & Winocur, 2002). These developments are seen in the pyramidal neurons of cortical layer III, the layer through which numerous association feedback loops pass. Three critical changes are worth reviewing. First, the dendritic field morphology changes. Terminal segments lengthen, and dendritic branches increase in number and size until 12 months postnatal age, at which point the level of dendritic complexity is the same as that seen in 27-year-olds’ prefrontal cortex (Koenderink, Uylings, & Mrzljak, 1994). Second, the number of synapses in cortical layers II and III increases through 15 months of age; development of these layers is specific to this period, given that synaptic increases in layers below these are not observed after 3 months of age (Huttenlocher & Dabholkar, 1997). Third, the somas of the pyramidal neurons show a substantial increase in size from 7½ to 12 months of age (Koenderink et al., 1994). These anatomical changes are accompanied by concomitant functional change. For example, utilization of glucose by the prefrontal cortex reaches adult levels at 12 months of age (Chugani, Phelps, & Mazziotta, 1987). The alterations in cell body, synapses, and the dendritic tree in Layer III during the end of the first year of life are consistent with the notion that important changes are occurring during the end of the first year and the early part of the second year in terms of the connections between the frontal areas and other cortical areas. Perhaps not coincidentally, changes in electroencephalographs (EEGs) show a similar trend within this period. Frontal sites appear to be specifically activated between 9 and 12 months of age (Schmidt, Trainor, & Santesso, 2003). At 8 months, EEG markers derived from frontal sources significantly predict the quality and accuracy of attentional responses on tasks that required voluntary control (Stroganova, Orekhova, & Posikera, 1997). Subsequently, Orekhova, Stroganova, and Posikera (1999) reported a distinct spike of slow-wave (i.e., reflecting synchronous activity) EEG activity from 7 to 12 months of age; this activity predicted performance on tasks that reflected voluntary control of attention. Furthermore, frontal activity reliably differentiated infants who succeeded at such tasks from those who were unable to do so. Finally, EEG asymmetries that correspond to
304
John Colombo and Carol L. Cheatham
enduring individual differences in inhibition are present by 9 months of age (Fox et al., 2001).
B. DEVELOPMENT OF MEDIAL TEMPORAL LOBE STRUCTURES
Mnemonic functions that emerge during the end of the first year of life (e.g., ordered recall) are generally thought to be subserved by the structures of the medial temporal lobe (e.g., hippocampus, parahippocampal areas, perirhinal cortex, and entorhinal cortex) and the frontal lobe. The hippocampus is functional in a rudimentary form at birth, but substantial developments in the first year of life are evident in neuron size and shape (Zaidel, 1999), dentate connectivity (for review, see Bauer, 2004; Nelson, 1995, 1997; Seress, 2001), and neocortical connectivity (for review see Bauer, 2004). Zaidel (1999) showed that subfields of the hippocampus exhibit increases in neuronal size between 24 weeks gestation and 9 months of age. These increases were accompanied by a decrease in density, which is a hallmark of neuronal maturation; Zaidel posited that this reflects the generation of axons. Neuronal maturation in these structures undoubtedly continues past the oldest age studied (9 months). Indeed, Seress (2001) concluded from a review of the available data that the neurons of the hippocampal system may not reach full maturation until the fifth year of life (see also Tsekhmistrenko & Vasil’eva, 2001). Nevertheless, important developments occur at the boundary of the first and second postnatal years. The subfield of the hippocampus with the most protracted development is the dentate gyrus (for a review, see Bauer, 2004; Nelson, 1995, 1997; Seress, 2001), where the number of synapses increases beginning at 8 months and reaches adult levels at 12 – 15 months of age. With this development, the trisynaptic loop of the hippocampus (Mori et al., 2004) is complete and able to code abstract relations within the incoming cortical information. Information from the multimodal processing cortical areas of the temporal, parietal, prefrontal, and cingulate areas enters the hippocampal system at the parahippocampal region (entorhinal, perirhinal, and parahippocampal cortices), which sends the information into the hippocampus for processing. The processed information is sent back to the entorhinal and perirhinal cortices, from where it is returned to the cortical areas of origination (Eichenbaum & Cohen, 2001). Co-occurring developments in the medial temporal lobe and the frontal areas as described previously in this section allow for an emergent episodic memory system. Theoretically, the prefrontal areas can hold information ‘‘online’’ while processing of relational information occurs in the medial temporal lobe (Baddeley, 2000) comprising one stage in the encoding of a long-lasting memory trace. Empirical evidence for this ‘‘episodic buffer’’ has been reported
Endogenous Attention
305
(Jensen & Lisman, 2005). During tasks thought to involve this process, the inferior parietal lobe is activated (LaBar et al., 1999) implying that the frontoparietal feedback loop is active. Its purpose would be twofold: to interpret incoming information from the prefrontal lobe regarding where to allocate attention (Petrides & Pandya, 2002) and to transmit information back to the prefrontal lobe to support the maintenance of its activation (Otten, Henson, & Rugg, 2002). Thus, attention and memory work together in the encoding of a mnemonic trace. Embedded in the description of the emergent episodic memory system is the implication that the prefrontal lobes capitalize on memory stores for use in the allocation of attention to novel stimuli in the environment or even, to preferred stimuli. To do this, the prefrontal lobes must also play a role in retrieval of memories. Research utilizing magnetic resonance imaging (MRI) with humans (e.g., Brewer et al., 1998; Henson, Shallice, & Dolan, 1999; for review see Fletcher & Henson, 2001) indicates a central role for the frontal areas in memory retrieval.
C. DEVELOPMENT OF THE FRONTO-PARIETAL PATHWAY
The endogenous control of visual attention is likely mediated by communication between frontal areas and portions of the dorsal pathway that provide information about the location of objects or events in the visual field. Indeed, there is a fronto-parietal feedback loop that comprises an aspect of the superior longitudinal fasciculus. In this loop, the dorsolateral prefrontal cortex communicates with the inferior parietal region and adjacent occipitoparietal association areas. A pathway such as this one could guide the deployment of attention to the location of relevant information in the environment, as informed by memory functions relayed from the frontal cortex (Petrides & Pandya, 2002). Studies of the development of this pathway in humans are few, due in part to the scarcity of samples. However, thus far it has been shown that the frontoparietal pathway is present at birth, but is unmyelinated. Myelination occurs rapidly in the first 5 months of life and then, continues at a more gradual pace (Okuda, 1994), and proceeding from inferior to superior and posterior to anterior: The axons of the occipital and parietal lobes myelinate sooner than those of the frontal lobes (for review see Sowell, Thompson, & Toga, 2004). Given the protracted development of the frontal areas, it seems reasonable to posit that this feedback loop is not fully functional until such time as the prefrontal cortex is maturationally functional, between 12 and 15 months of age. Supporting this notion are data from two studies conducted by Csibra, Tucker, and Johnson (1998, 2001), who examined saccade-related ERP components in
306
John Colombo and Carol L. Cheatham
4- and 6-month-olds; although frontal activity was observed during saccades with both age groups, infants did not show pre-saccadic components typically attributed to planning processes (see, however, Richards, 2001, 2005). Once the frontal-parietal pathway matures, it can be differentially activated by endogenous attentional processes relative to those elicited by exogenous stimuli or events. The dorsolateral prefrontal area and the inferior parietal area are active as measured by MRI during a task requiring voluntary control of attention, but not when the task only requires a response to features of the stimuli (Rosen et al., 1999). Rosen and her colleagues concluded that memory functions, as subserved by the dorsolateral prefrontal cortex, are only necessary when endogenous allocation of attention is required. We find it reasonable to posit that the information used to direct voluntary attention to a particular stimulus is, in part, mnemonic in nature, and is ‘‘gathered’’ by the dorsolateral prefrontal area from its connections with the prefrontal association areas.
D. SUMMARY
In sum, the converging evidence suggests that between 6 and 15 months of age, changes in the frontal lobes, medial temporal lobes, and the fronto-parietal pathway drive the development of a system capable of the voluntary focusing of attention to and the processing, storing, and retrieving of episodic mnemonic information that has been coded with abstract and concrete relational information. Even though the system will not be fully developed until much later in life, the behavioral evidence indicates that once the system reaches peak synaptic development (between 12 and 15 months of age) it is functionally mature. In the months prior to this (6 – 12 months of age), we would expect that there would be wide individual variation in episodic memory and endogenous attention.
VI. The Emergence of Endogenous Attention: Summary and Implications We have argued here that the integration of memory and attention systems from 6 to 15 months of age, as supported by the maturation of frontal circuitry, gives rise to the construct of endogenous attention. In this final section of the chapter, we argue that the emergence of endogenous attention also serves as a precursor for, or as the basis of, a number of other important cognitive changes seen during the latter parts of infancy, as well as for a number of skills that develop later in childhood.
Endogenous Attention
307
A. ENDOGENOUS ATTENTION AND THE DEVELOPMENT OF HIGHER COGNITIVE FUNCTION
1. Means-Ends/Goal-Directed Behavior The ability to direct attention in the service of the maintenance of a goal in working memory would obviously seem to provide the basis for the development of means-ends or goal-directed behavior. Indeed, the literature that proliferated during the 1980s and 1990s on the A-not-B problem indicated clear improvements in performance from 8 through 12 months (e.g., Diamond, 1985), exactly the period during which we proposed the emergence of endogenous attention occurs. Similar improvements are seen on other goal-directed tasks during this period (Moore & Meltzoff, 1999; Willatts, 1999); many of the tasks included in these studies involve a series of responses (means) that must be executed in sequence to attain the goal (end), and several studies have investigated the specific relation between the distribution of attention and the attainment of success on the problem presented to the infant (e.g., Goldfield, 1983). It seems likely that the ability to solve such tasks would logically and naturally serve as a basis for more flexible and sensitive cognitive approaches to problem solving that emerge in the second year (Clohessy, Posner, & Rothbart, 2001). 2. Categorization A current debate in the literature on infant cognition concerns the nature of categorization in early life (Oakes & Madole, 2000). This debate centers on whether infants’ ability to form categories of visual stimuli is based on the simple abstraction of the perceptual qualities of a stimulus set, or whether infants’ categories are based on a truly conceptual framework (Rakison, 2005). Interestingly, members of the opposing camps tend to address the questions with infants of relatively different ages, with the perceptual-basis group studying relatively young infants (usually 3 – 7 months; e.g., Quinn, 2004) and the conceptual-basis group studying older infants (9 – 21 months; e.g., Casasola, 2005; Gopnik & Meltzoff, 1987). Studies that bridge these two age ranges often describe important changes in what exactly is processed or included in the category (e.g., Casasola, 2005; Quinn et al., 2003). Although sophisticated developmental models have been explored (Oakes & Madole, 2003), earlier findings in this area pointed to an important change in the nature of categorization abilities in infants after the end of the first year. Younger infants were hypothesized to extract basic level perceptual features during categorization, but older infants were hypothesized to process such features in a more relational manner (Younger, Hollich, & Furrer, 2004; Shultz & Cohen, 2004; see also Casasola, 2005). This shift occurs within the same period that we are proposing for the emergence of endogenous attention, and is consistent with the integration of
308
John Colombo and Carol L. Cheatham
attention and memory that we propose for the basis of endogenous functions. Category-based information that is acquired under such an integrated system would allow for the entry of such information into a broader semantic network. As such, one might predict that this later type of categorization would be directly related to the kinds of means–ends tasks discussed previously (see Gopnik & Meltzoff, 1987; Mandler, 2004), and that this type of categorization would relate to various aspects of early language acquisition (see Booth & Waxman, 2002; Gopnik & Meltzoff, 1984; Waxman & Braun, 2005). 3. Strategies, Executive Function, and Metacognitive Skills A final point to make with respect to the long-term implications of endogenous attention for purely cognitive development per se is the notion that when attention is driven by working memory, it will appear to be goal-directed. Furthermore, to the degree that attention, working memory, and longerterm stores are integrated with behavioral responses, those responses will look increasingly intentional, purposeful, and (in the language of the cognitive developmentalists), strategic. In this vein, we find it reasonable to suggest that the emergence of endogenous attention represents the first developmental step toward the development of metacognitive skills. Carlson, Moses, and Breton (2002) have examined various functions that we would count among the fundamental components of endogenous attention (e.g., inhibitory control and working memory) in preschoolers, and have found them to be significantly related to executive function measures and performance on false-belief tasks. 4. Social Interaction and Language It also seems likely that the cognitive flexibility provided by the ability to integrate attention and memory will contribute to the infant’s ability to coordinate social interchanges (Parrinello & Ruff, 1988); such early interchanges are known to be relevant to language acquisition (e.g., Henderson et al., 2002; Mundy, Card, & Fox, 2000). For example, although it is known that the ability of caregiver and infant to simultaneously share an attentional focus (i.e., joint attention) contributes to lexical development (e.g., Smith, Adamson, & Bakeman, 1988), the manner in which infant and caregiver achieve joint attention is variable. A small sample but provocative longitudinal study of free-play toy-based interaction from our own laboratory (Saxon et al., 2000) indicated that mothers show a developmental course of ‘‘backing off’’ in their interactions with their infants from 6 to 8 months of age (see also relevant findings in Adamson & Bakeman, 1984; Bakeman & Adamson, 1984). Although mothers took the initiative in providing and introducing objects in earlier interactive episodes, later episodes were characterized more by mothers waiting for cues from their infants as to when to introduce materials.
Endogenous Attention
309
Moreover, dyads who followed this normative course showed markedly better cognitive outcomes and language at 4 years of age. 5. Regulation of Emotion Thus far, we have laid out the case for the role of endogenous attention in basic cognitive skills and development. However, it is worth noting that there is also a case to be made that the same frontal-attentional complexes that govern endogenous attention also contribute to emotional regulation (Harman & Fox, 1997). A detailed and scholarly exposition of this aspect of the model is well beyond the scope of this chapter, but an elaboration of this aspect of endogenous attention would provide a theoretical link to constructs such as temperament and personality. Indeed, if one defines personality/temperament at least in part as individual differences in regulative processes (e.g., see Derryberry & Reed, 1998, 2001, 2002; Rothbart & Rueda, 2005) then it is not a particularly tenuous leap to assume that the developmental changes that subsume endogenous attentional functions would also contribute to these domains of behavior and individual differences.
B. CONCLUSIONS
Much effort has been expended upon the study of the abilities of infants immediately after birth. However, the speculative model we have put forth in the preceding pages suggests that a careful investigation of the changes seen at the end of infancy may be particularly productive in elucidating the nature of many complex skills, and in the prediction of a number of normal and atypical outcomes. This focus, combined with the expected improvements in our methods for understanding brain-behavior relations, will make for an important second wave of infancy research in the twenty-first century. In this chapter, we have attempted to characterize attention as a multidimensional construct, and we have attempted to develop more fully a case for the particular and specific dimension of this attention that we have called endogenous attention. We have delineated a theoretical framework for what we think endogenous attention might be (i.e., the integration of lower-order attentional functions and memory), and have dovetailed this framework with historical concepts as well as extant models from the adult literature. We have tried to collate evidence for its emergence in the behavioral realm during late infancy, and in doing so, have identified its earliest manifestations as no earlier than the end of the first year. Furthermore, we have posited specific neural pathways that might mediate its development. Finally, we have tried to explicate the possible implications of this type of attention for later development in the cognitive, linguistic, and emotional realms.
310
John Colombo and Carol L. Cheatham
The framework makes some fairly strong predictions. For example, if attention deficit disorder is truly a disorder of endogenous attention, one should look for predictive precursors of the condition during the second year of life, but not before. For another, the model clearly suggests that the emergence of endogenous attention will be related to some form of integrative frontal functioning, perhaps the measure of frontal coherence that can be derived from most electroencephalograph and evoked potential measures. Finally, the model suggests that individual differences in endogenous attention will be related to a number of important higher-order cognitive functions later in life. We hope that our proposal here is convincing enough to allow for tests of these predictions in the near future.
ACKNOWLEDGMENTS We acknowledge many colleagues for conversations and perspectives that have contributed to the ideas put forth in this chapter, but we are particularly grateful to Dr. Kathleen Kannass for her earlier contributions to the development of the concept of endogenous attention while working in our laboratory at the University of Kansas, and to Dr. Jon Templin for his assistance with analyses used to generate the generic developmental functions shown in Figures 3 and 4. The preparation of this chapter was supported by NSF grant 0318072 and NIH grant R01 HD35903 to the first author, and by the University of Kansas Center for the Behavioral Neuroscience of Communicative Disorders (DC005803) and the University of Kansas Mental Retardation/Developmental Disabilities Research Center.
REFERENCES Adamson, L., & Bakeman, R. (1984). Mothers’ communicative acts: Changes during infancy. Infant Behavior and Development, 7, 467 – 478. Arnsten, A. F. T., & Robbins, T. W. (2002). Neurochemical modulation of prefrontal cortical function in humans and animals. In D. T. Stuss & R. T. Knight (Eds.), Principles of frontal lobe function (pp. 51 – 84). London: Oxford University Press. Ashkenazi, A., & Marks, L. E. (2004). Effect of endogenous attention on detection of weak gustatory and olfactory flavors. Perception & Psychophysics, 66, 596 – 608. Aston-Jones, G., Foote, S. L., & Bloom, F. E. (1984). Anatomy and physiology of locus coeruleus neurons: Functional implications. In M. G. Ziegler & C. R. Lake (Eds.), Norepinephrine (Vol. 2, pp. 92 – 116). Baltimore, MD: Williams & Wilkins. Aston-Jones, G., Rajkowski, J., Kubiak, P., & Alexinsky, T. (1994). Locus coeruleus neurons in monkey are selectively activated by attended cues in a vigilance task. Journal of Neuroscience, 14, 4467 – 4480.
Endogenous Attention
311
Atkinson, T. G. (1943). Psychological factors in visual tests and correctives, Optometric Weekly, 34, 127 – 128. Atkinson, J., Hood, B., Wattam-Bell, J., & Braddick, O. (1992). Changes in infants’ ability to switch visual attention in the first three months of life. Perception, 21, 643–653. Baddeley, A. (2000). The episodic buffer: A new component of working memory? Trends in Cognitive Sciences, 4, 417 – 423. Bakeman, R., & Adamson, L. (1984). Coordinating attention to people and objects in mother-infant and peer-infant interactions. Child Development, 55, 1278 – 1289. Baldwin, J. M. (1895). The origin of attention. In Mental development in the child and the race: Method and processes (pp. 451 – 475). New York, NY: Macmillan Publishing. Barbas, H. (2000). Connections underlying the synthesis of cognition, memory and emotion in primate prefrontal cortices. Brain Research Bulletin, 52, 319 – 330. Barbas, H. (2004). Dead tissue, living ideas: Facts and theory from neuroanatomy. Cortex, 40, 205 – 206. Barrett, D. J. K., Bradshaw, M. F., & Rose, D. (2003). Endogenous shifts of covert attention operate within multiple coordinate frames: Evidence from a feature-priming task. Perception, 32, 41 – 52. Bauer, P. J. (2004). Getting explicit memory off the ground: Steps toward construction of a neuro-developmental account of changes in the first two years of life. Developmental Review, 24, 347 – 373. Bauer, P. J. (2005). New developments in the study of infant memory. In D. M. Teti (Ed.), Handbook of research methods in developmental psychology (pp. 467 – 488). Oxford, United Kingdom: Blackwell. Bauer, P. J., Wiebe, S. A., Waters, J. M., & Bangston, S. K. (2001). Reexposure breeds recall: Effects of experience on 9-month-olds’ ordered recall. Journal of Experimental Child Psychology, 80, 174 – 200. Beer, A. L., & Roder, B. (2004). Unimodal and crossmodal effects of endogenous attention to visual and auditory motion. Cognitive, Affective and Behavioral Neuroscience, 4, 230 – 240. Beer, A. L., & Roder, B. (2005). Attending to visual or auditory motion affects perception within and across modalities: An event-related potential study. European Journal of Neuroscience, 21, 1116 – 1130. Bell, M. A. (1998). Frontal lobe function during infancy: Implications for the development of cognition and attention. In J. E. Richards (Ed), Cognitive neuroscience of attention: Developmental perspectives (pp. 287 – 316). Mahwah, NJ: Lawrence Erlbaum Associates. Bell, M. A., & Adams, S. E. (1999). Comparable performance on looking and reaching versions of the A-not-B task at 8 months of age. Infant Behavior and Development, 22, 221 – 235. Bell, M. A., & Fox, N.A. (1997). Individual differences in object permanence performance at 8 months: Locomotor experience and brain electrical activity. Developmental Psychobiology, 31, 287 – 297. Blaga, O. M., & Colombo, J. (2006). Visual processing and infant ocular latencies in the overlap paradigm. Developmental Psychology, in press. Bleckley, M. K., Durso, F. T., Crutchfield, J. M., Engle, R. W., & Khanna, M. M. (2003). Individual differences in working memory capacity predict visual attention allocation. Psychonomic Bulletin and Review, 10, 884 – 889. Booth, A. E., & Waxman, S. R. (2002). Object names and object functions serve as cues to categories for infants. Developmental Psychology, 38, 948 – 957.
312
John Colombo and Carol L. Cheatham
Brewer, J., Zhao, Z., Desmond, J. E., Glover, G. H., & Gabrieli, J. D. E. (1998). Making memories: Brain activity that predicts how well visual experience will be remembered. Science, 281, 1185 – 1187. Brincat, S. L., & Connor, C. E. (2004). Underlying principles of visual shape selectivity in posterior inferotemporal cortex. Nature Neuroscience, 7, 880 – 886. Broadbent, D. E. (1957). A mechanical model for human attention and immediate memory. Psychological Review, 64, 205 – 215. Brown, V. J., & Robbins, T. W. (1991). Simple and choice reaction time performance following unilateral striatal dopamine depletion. Journal of Neuroscience, 9, 983 – 989. Caramazza, A., & Zurif, E. B. (1976). Dissociation of algorithmic and heuristic processing in language comprehension: Evidence from aphasia. Brain and Language, 3, 572 – 582. Carlson, S. M., Moses, L. J., & Breton, C. (2002). How specific is the relation between executive function and theory of mind? Contributions of inhibitory control and working memory. Infant and Child Development, 11, 73 – 92. Carver, L. J., & Bauer, P. J. (1999). When the event is more than the sum of its parts: 9-month-olds’ long-term ordered recall. Memory, 7, 147 – 174. Casasola, M. (2005). When less is more: how infants learn to form an abstract categorical representation of support. Child Development, 76, 279 – 290. Casey, B. J., Giedd, J. N., & Thomas, K. M. (2000). Structural and functional brain development and its relation to cognitive development. Biological Psychology, 54, 241 – 257 Casey, B. J., & Richards, J. E. (1988). Sustained visual attention in young infants measured with an adapted version of the visual preference paradigm. Child Development, 59, 1514 – 1521. Cheatham, C. L., Bauer, P. J., & Georgieff, M. K. (2006). Predicting individual differences in recall by infants born preterm and fullterm. Infancy, 10, 17 – 42. Chugani, H. T., Phelps, M. E., & Mazziotta, J. C. (1987). Positron emission tomography study of human brain functional development. Annals of Neurology, 22, 487 – 497. Clohessy, A. B., Posner, M. I., & Rothbart, M. K. (2001). Development of the functional visual field. Acta Psychologica, 106, 51 – 68. Cohen, L. B. (1969). Observing responses, visual preferences, and habituation to visual stimuli in infants. Journal of Experimental Child Psychology, 7, 419–433. Cohen, L. B. (1976). Habituation of infant visual attention. In T. J. Tighe & R. N. Leaton (Eds.), Habituation: Perspectives from child development, animal behavior, and neurophysiology. (pp. 207 – 238). Oxford, England: Erlbaum. Cohen, L. B., Henry, C., Dehaene, S., Martinaud, O., Lehericy, S., Lemer, C. et al. (2004). The pathophysiology of letter-by-letter reading. Neuropsychologia, 42, 1768 – 1780. Colombo, J. (1993). Infant cognition: Predicting later intellectual functioning. Newbury Park, CA: Sage Publications, Inc. Colombo, J. (2001). The development of visual attention in infancy. Annual Review of Psychology, 52, 337 – 367. Colombo, J. (2002). Infant attention grows up: The emergence of a developmental cognitive neuroscience perspective. Current Directions in Psychological Science, 11, 196 – 199. Colombo, J., Harlan, J. E., & Mitchell, D. W. (1999). The development of look duration in infancy: Evidence for a triphasic course. Presented at the Biennial Meeting of the Society for Research in Child Development, Albuquerque, NM.
Endogenous Attention
313
Colombo, J., Kannass, K. N., Shaddy, D. J., Kundurthi, S., Anderson, C. J., Blaga, O. M., & Carlson, S. E. (2004). Maternal DHA and the development of attention in infancy and toddlerhood. Child Development, 75, 1254 – 1267. Colombo, J., & Mitchell, D. W. (1990). Individual and developmental differences in infant visual attention: Fixation time and information processing. In J. Colombo & J. W. Fagen (Eds.), Individual differences in infancy: Reliability, stability, and prediction (pp. 193–227). Hillsdale, NJ: Lawrence Erlbaum. Colombo, J., Mitchell, D. W., Coldren, J. T., & Freeseman, L. J. (1991). Individual differences in infant attention: Are short lookers faster processors or feature processors? Child Development, 62, 1247–1257. Colombo, J., & Richman, W. A. (2002). Infant timekeeping: Anticipation and attention in 4-month-olds. Psychological Science, 13, 475 – 479. Colombo, J., Richman, W. A., Shaddy, D. J., Greenhoot, A. F., & Maikranz, J (2001). HR-Defined phases of attention, look duration, and infant performance in the pairedcomparison paradigm. Child Development, 72, 1605 – 1616. Colombo, J., Shaddy, D. J., Richman, W. A., Maikranz, J. M., & Blaga, O. M. (2004). The developmental course of infant attention and preschool cognitive outcome. Infancy, 4, 1 – 38. Courage, M. L., Howe, M. L., & Squires, S. E. (2004). Individual differences in 3.5-month-olds’ visual attention: What do they predict at 1 year? Infant Behavior and Development, 27, 19 – 30. Courage, M., Reynolds, G.D., & Richards, J. E. (2004). Developmental and individual differences in infants’ attention as a function of stimulus characteristics. Presented at the Biennial Meeting of the International Society for Infancy Studies, Chicago, IL. Couyoumdjian, A., Di Nocera, F., & Ferlazzo, F. (2003). Functional representation of 3D space in endogenous attention shifts. Quarterly Journal of Experimental Psychology: Human Experimental Psychology, 56A, 155 – 183. Csibra, G., Tucker, L. A., & Johnson, M. H. (1998). Neural correlates of saccade planning in infants: A high-density ERP study. International Journal of Psychophysiology, 29, 201 – 215. Csibra, G., Tucker, L. A., & Johnson, M. H. (2001). Differential frontal cortex activation before anticipatory and reactive saccades in infants. Infancy, 2, 159 – 174. Dahlstrom, A. B., & Fuxe, K. (1964). Evidence for the existence of monoaminecontaining neurons in the cell bodies of brainstem neurons. Acta Physiologica Scandinavia Supplement, 62 (232), 5 – 55. Danziger, S. (2002). The effects of endogenous attention and stimulus onsets on encoding target location. Quarterly Journal of Experimental Psychology: Human Experimental Psychology, 55A, 987 – 1006. Datta, S., & Siwek, D. F. (1997). Excitation of the brain stem pedunculopontine tegmentum cholinergic cells induces wakefulness and REM sleep. Journal of Neurophysiology, 77, 2975 – 2988. Derryberry, D., & Reed, M. A. (1998). Anxiety and attentional focusing: Trait, state and hemispheric influences. Personality and Individual Differences, 25, 745 – 761. Derryberry, D., & Reed, M. A. (2001). A multidisciplinary perspective on attentional control. In B. S. Gibson, & C. L. Folk (Eds.) Attraction, distraction and action: Multiple perspectives on attentional capture (pp. 325 – 347). New York, NY, US: Elsevier Science. Derryberry, D., & Reed, M. A. (2002). Anxiety-related attentional biases and their regulation by attentional control. Journal of Abnormal Psychology, 111, 225 – 236.
314
John Colombo and Carol L. Cheatham
Desimone, R., & Duncan, J. (1995). Neural mechanisms of selective visual attention. Annual Review of Neuroscience, 18, 193 – 222. Diamond, A. (1985). Development of the ability to use recall to guide action, as indicated by infants’ performance on AB. Child Development, 56, 868 – 883. Diamond, A. (1988). Abilities and neural mechanisms underlying AB performance. Child Development, 59, 523 – 527. Diamond, A. (1991). Frontal lobe involvement in cognitive changes during the first year of life. In A. C Petersen & K. R. Gibson (Eds.), Brain maturation and cognitive development: Comparative and cross-cultural perspectives (pp. 127 – 180). Hawthorne, NY: Aldine de Gruyter. Diamond, A. (2002). Normal development of prefrontal cortex from birth to young adulthood: Cognitive functions, anatomy, and biochemistry. In R. T. Knight & D. T. Stuss (Eds.), Principles of frontal lobe function (pp. 466 – 503). London: Oxford University Press. Diamond, A., & Doar, B. (1989). The performance of human infants on a measure of frontal cortex function, the delayed response task. Developmental Psychobiology, 22, 271 – 294. Dosher, B. A., Liu, S. H., Blair, N., & Lu, Z. L. (2004). The spatial window of the perceptual template and endogenous attention. Vision Research, 44, 1257 – 1271. Douglas, C. L., DeMarco, G. J., Baghdoyan, H. A., & Lydic, R. (2004). Pontine and basal forebrain cholinergic interaction: Implications for sleep and breathing. Respiratory Physiology and Neurobiology, 143, 251 – 262. Driver, J., & Baylis, G. C. (1998). Attention and visual object segmentation. In R. Parasuraman (Ed.), The attentive brain (pp. 299 – 326). Cambridge, MA: MIT Press. Driver, J., & Spence, C. J. (1994). Spatial synergies between auditory and visual attention. In M. Moscovitch & C. Umilta (Eds.), Attention and performance 15: Conscious and nonconscious information processing (pp. 311 – 331). Cambridge, MA: MIT Press. Egly, R., Driver, J., & Rafal R. (1994). Shifting attention between objects and locations: evidence from normal and parietal lesion subjects. Journal of Experimental Psychology: General, 123, 161 – 177. Eichenbaum, H., & Cohen, N. J. (2001). From conditioning to conscious recollection: Memory systems of the brain. New York: Oxford University Press. Eimer, M., Forster, B., Van Velzen, J., & Prabhu, G. (2005). Covert manual response preparation triggers attentional shifts: ERP evidence for the premotor theory of attention. Neuropsychologia, 43, 957 – 966. Engle, R. W. (2002). Working memory capacity as executive attention. Current Directions in Psychological Science, 11, 19 – 23. Engle, R. W., Kane, M. J., & Tuholski, S. W. (1999). Individual differences in working memory capacity and what they tell us about controlled attention, general fluid intelligence, and functions of the prefrontal cortex. In P. Shah & A. Miyake (Eds.), Models of working memory: Mechanisms of active maintenance and executive control. (pp. 102 – 134). New York, NY: Cambridge University Press. Espy, K. A., Kaufmann, P. M., McDiarmid, M. D., & Glisky, M. L. (1999). Executive functioning in preschool children: Performance on A-not-B and other delayed response format tasks. Brain and Cognition, 41, 178 – 199. Fell, J., Fernandez, G., Klaver, P., Elger, C. E., & Fries, P. (2003). Is synchronized neuronal gamma activity relevant for selective attention? Brain Research Reviews, 42, 265 – 272.
Endogenous Attention
315
Fletcher, P. C., & Henson, R. N. (2001). Frontal lobes and human memory: Insights from functional neuroimaging. Brain, 124, 849 – 881. Foote, S. L., Bloom, F. E., & Aston-Jones, G. (1983). Nucleus locus ceruleus: New evidence of anatomical and physiological specificity. Physiological Reviews, 63, 844 – 914. Fox, N. A., Henderson, H. A., Rubin, K. H., Calkins, S. D., & Schmidt, L. A. (2001). Continuity and discontinuity of behavioral inhibition and exuberance: Psychophysiological and behavioral influences across the first four years of life. Child Development, 72, 1 – 21. Freiwald, W. A., & Kanwisher, N. G. (2004). Visual selective attention: insights from brain imaging and neurophysiology. In M. S. Gazzaniga (Ed.), The cognitive neurosciences (3rd ed., pp. 575 – 588). Cambridge, MA: MIT Press. Frick, J. E., Colombo, J., & Saxon, T. F. (1999). Individual and developmental differences in disengagement of fixation in early infancy. Child Development, 70, 537 – 548. Fuster, J. M. (1997). The prefrontal cortex - anatomy, physiology, and neuropsychology of the frontal lobe. Philadelphia: Lippincott-Raven. Fuster, J. M. (2002). Frontal lobe and cognitive development. Journal of Neurocytology, 31, 373 – 385. Fuster, J. M., Garoff, R. J., Slotnick, S. D., & Schacter, D.L. (2005). The neural origins of specific and general memory: The role of the fusiform cortex. Neuropsychologia, 43, 847 – 859. Glimcher, P. W. (1999). Eye movements. In M. J. Zigmond, F. E. Bloom, S. C. Landis, J. L. Roberts, & L. R. Squire (Eds.), Fundamental neuroscience (pp. 993 – 1010). San Diego, CA: Academic Press. Goh, J. O.-S., Siong, S.-C., Park, D., Gutchess, A., Hebrank, A., & Chee, M. W.-L. (2004). Cortical areas involved in object, background, and object-background processing revealed with functional magnetic resonance adaptation. Journal of Neuroscience, 24, 10223 – 10228. Goldfield, E. C. (1983). The development of control over complementary systems during the second year. Infant Behavior and Development, 6, 257 – 262. Goldman, D. Z., Shapiro, E. G., & Nelson, C. A. (2004). Measurement of vigilance in 2-year-old children. Developmental Neuropsychology, 25, 227 – 250. Goldsmith, M., & Yeari, M. (2003). Modulation of object-based attention by spatial focus under endogenous and exogenous orienting. Journal of Experimental Psychology: Human Perception and Performance, 29, 897 – 918. Gopnik, A., & Meltzoff, A. N. (1984). Semantic and cognitive development in 15- to 21month-old children. Journal of Child Language, 11, 495 – 513. Gopnik, A., & Meltzoff, A. N. (1987). The development of categorization in the second year and its relation to other cognitive and linguistic developments. Child Development, 58, 1523 – 1531. Hagoort, P., Indefrey, P., Brown, C., Herzog, H., Steinmetz, H., & Seitz, R. J. (1999). The neural circuitry involved in the reading of German words and pseudowords: A PET study. Journal of Cognitive Neuroscience, 11, 383 – 398. Han, S., Wan, X., & Humphreys, G. W. (2005). Shifts of spatial attention in perceived 3-D space. Quarterly Journal of Experimental Psychology A: Human Experimental Psychology, 58A, 753 – 764. Harman, C., & Fox, N. A. (1997). Frontal and attentional mechanisms regulating distress experience and expression during infancy. In G. R. Lyon & N. A. Krasnegor (Eds.), Development of the prefrontal cortex: Evolution, neurobiology, and behavior (pp. 191 – 208). Baltimore, MD: Brookes.
316
John Colombo and Carol L. Cheatham
Harrison, A. A., Everitt, B. J., & Robbins, T. W. (1997a). Central 5-HT depletion enhances impulsive responding without affecting the accuracy of attentional performance: interactions with dopaminergic mechanisms. Psychopharmachology, 133, 329 – 342. Harrison, A. A., Everitt, B. J., & Robbins, T. W. (1997b). Doubly dissociable effects of selective median raphe lesions on performance on the five-choice serial reaction time test of attention in rats. Behavioural Brain Research, 89, 135 – 149. Harrison, A. A., Everitt, B. J., & Robbins, T. W. (1999). Central serotonin depletion impairs both the acquistion and performance of a symmetrically reinforced go/no-go conditional visual discrimination. Behavioural Brain Research, 100, 99 – 112. He, Z. J., & Nakayama, K. (1995). Visual attention to surfaces in 3D space. Proceedings of the National Academy of Science (USA), 92, 11155 – 11159. Henderson, L. M., Yoder, P. J., Yale, M. E., & McDuffie, A. (2002). Getting the point: Electrophysiological correlates of protodeclarative pointing. International Journal of Developmental Neuroscience, 20, 449 – 458. Henson, R. N., Shallice, T., & Dolan, R. J. (1999). Right prefrontal cortex and episodic memory retrieval: A functional MRI test of the monitoring hypothesis. Brain, 122, 1367 – 1381. Huttenlocher, P. R., & Dabholkar, A. S. (1997). Regional differences in synaptogenesis in human cerebral cortex. Journal of Comparative Neurology, 387, 167 – 178. Jackson, J. H. (1958). Selected writings of John Hughlings Jackson. Oxford, England: Basic Books. James, W. (1890). The principles of psychology. New York: Dover Publications. Jensen, O., & Lisman, J. E. (2005). Hippocampal sequence-encoding driven by a cortical multi-item working memory buffer. Trends in Neurosciences, 28, 67 – 72. Johnson, M. H. (2001). Functional brain development during infancy. In A. Fogel & G. Bremner (Eds.), Blackwell handbook of infant development (pp. 169 – 190). Malden, MA: Blackwell. Kane, M. J., & Engle, R. W. (2002). The role of prefrontal cortex in working-memory capacity, executive attention, and general fluid intelligence: An individual-differences perspective. Psychonomic Bulletin and Review, 9, 637 – 671. Kane, M. J., & Engle, R. W. (2003). Working-memory capacity and the control of attention: The contributions of goal neglect, response competition, and task set to Stroop interference. Journal of Experimental Psychology: General, 132, 47 – 70. Kane, M. J., Bleckley, M. K., Conway, A. R. A., & Engle, R.W. (2001). A controlledattention view of working-memory capacity. Journal of Experimental Psychology: General, 130, 169 – 183. Kanemura, H., Aihara, M., Aoki, S., Araki, T., & Nakazawa, S. (2003). Development of the prefrontal lobe in infants and children: A three-dimensional magnetic resonance volumetric study. Brain Development, 25, 195 – 199. Klingberg, T., Forssberg, H., & Westerberg, H. (2002). Increased brain activity in frontal and parietal cortex underlies the development of visuospatial working memory capacity during childhood. Journal of Cognitive Neuroscience, 14, 1 – 10. Koenderink, M. J., Uylings, H. B., & Mrzljak, L. (1994). Postnatal maturation of the layer iii pyramidal neurons in the human prefrontal cortex: A quantitative golgi analysis. Brain Research, 653, 173 – 182. Kolb, B. (1984). Functions of the frontal cortex of the rat: A comparative review. Brain Research Reviews, 8, 65 – 98.
Endogenous Attention
317
Koutstaal, W., Wagner, A. D., Rotte, M., Maril, A., Buckner, R. L., & Schacter, D. L. (2001). Perceptual specificity in visual object priming: Functional magnetic resonance imaging evidence for a laterality difference in fusiform cortex. Neuropsychologia, 39, 184 – 199. LaBar, K. S., Gitelman, D. R., Parrish, T. B., & Mesulam, M. (1999). Neuroanatomic overlap of working memory and spatial attention networks: A functional MRI comparison within subjects. Neuroimage, 10, 695 – 704. Lansink, J. M., & Richards, J. E. (1997). Heart rate and behavioral measures of attention in six-, nine-, and twelve-month-old infants during object exploration. Child Development, 68, 610 – 620. Lawson, K. R., & Ruff, H. A. (2001). Focused attention: Assessing a fundamental cognitive process in infancy. In P. S. Zeskind & L. T. Singer (Eds.), Biobehavioral assessment of the infant (pp. 293 – 311). New York, NY: Guilford Press. Lawson, K. R., & Ruff, H. A. (2004a). Early focused attention predicts outcome for children born prematurely. Journal of Developmental and Behavioral Pediatrics, 25, 399 – 406. Lawson, K. R., & Ruff, H. A. (2004b). Early attention and negative emotionality predict later cognitive and behavioural function. International Journal of Behavioral Development, 28, 157 – 165. Lee, D. (1999). Effects of exogenous and endogenous attention on visually guided hand movements. Cognitive Brain Research, 8, 143 – 156. Leontiev, A. N. (1932). Studies in the cultural development of the child: III. The development of voluntary attention in the child. Journal of Genetic Psychology, 40, 52 – 83. Lupianez, J., & Milliken, B. (1999). Inhibition of return and the attentional set for integrating versus differentiating information. Journal of General Psychology, 126, 392 – 418. Luria, A. R. (1974). The working brain. New York: Basic Books. Mandler, J. M. (2004). Thought before language. Trends in Cognitive Science, 8, 508 – 513. McCall, R. B., & Carriger, M. S. (1993). A meta-analysis of infant habituation and recognition memory performance as predictors of later IQ. Child Development, 64, 57 – 79. Meltzoff, A. N. (1988). Infant imitation and memory: Nine-month-olds in immediate and deferred tests. Child Development, 59, 217 – 225. Miller, E. K., & Cohen, J. D. (2001). An integrative theory of prefrontal cortex function. Annual Review of Neuroscience, 24, 167 – 202. Mori, M., Abegg, M. H., Gahwiler, B. H., & Gerber, U. (2004). A frequency-dependent switch from inhibition to excitation in a hippocampal unitary circuit. Nature, 431, 453 – 456. Moore, M. K., & Meltzoff, A. N. (1999). New findings on object permanence: A developmental difference between two types of occlusion. British Journal of Developmental Psychology, 17, 623 – 644. Moruzzi, G., & Magoun, H. W. (1949). Brain stem reticular formation and activation of the EEG. Electroencephalography and Clinical Neurophysiology, 1, 455 – 473. Moscovitch, M., & Winocur, G. (2002). The frontal cortex and working with memory. In D. T. Stuss & R. T. Knight (Eds.), Principles of frontal lobe function (pp. 188 – 209). London: Oxford University Press. Mundy, P., Card, J., & Fox, N. A. (2000). EEG correlates of the development of infant joint attention skills. Developmental Psychobiology, 36, 325 – 338. Muresan, R. C. (2004). The coherence theory: Simple attentional modulation effects. Neurocomputing, 58–60, 949 – 955.
318
John Colombo and Carol L. Cheatham
Nagy, Z., Westerberg, H., & Klingberg, T. (2004). Maturation of white matter is associated with the development of cognitive functions during childhood. Journal of Cognitive Neuroscience, 16, 1227 – 1233. Niebur, E., Hsiao, S. S., & Johnson, K. O. (2002). Synchrony: A neuronal mechanism for attentional selection? Current Opinion in Neurobiology, 12, 190 – 194. Nelson, C. A. (1995). The ontogeny of human memory: A cognitive neuroscience perspective. Developmental Psychology, 31, 723 – 738. Nelson, C. A. (1997). The neurobiological basis of early memory development. In N. Cowan (Ed.), The development of memory in childhood (pp. 41 – 82). Hove, England: Psychology Press. Noppeney, U., Phillips, J., & Price, C. (2004). The neural areas that control the retrieval and selection of semantics. Neuropsychologia, 42, 1269 – 1280. Oakes, L. M., Kannass, K. N., & Shaddy, D. J. (2002). Developmental changes in endogenous control of attention: The role of target familiarity on infants’ distraction latency. Child Development, 73, 1644 – 1655. Oakes, L. M., & Madole, K. L. (2000). The future of infant categorization research: A process-oriented approach. Child Development, 71, 119 – 126. Oakes, L. M., & Madole, K. L. (2003). Principles of developmental changes in infants’ category formation. In L. M. Oakes & D. H. Rakison (Eds.), Early category and concept development: Making sense of the blooming, buzzing confusion (pp. 132 – 158). London: Oxford University Press. Oakes, L. M., Madole, K. L., & Cohen, L. B. (1991). Infants’ object examining: Habituation and categorization. Cognitive Development, 6, 377 – 392. Oakes, L. M., & Tellinghuisen, D. J. (1994). Examining in infancy: Does it reflect active processing? Developmental Psychology, 30, 748 – 756. Oakes, L. M., Tellinghuisen, D. J., & Tjebkes, T. L. (2000). Competition for infants’ attention: The interactive influence of attentional state and stimulus characteristics. Infancy, 1, 347 – 361. Okuda, Y. (1994). [Brain development during the first year of life: Quantitative assessment with ADC imaging]. Nippon Igaku Hoshasen Gakkai Zasshi – Nippon Acta Radiologica, 54, 1245 – 1251. Orekhova, E. V., Stroganova, T. A., & Posikera, I. N. (1999). Theta synchronization during sustained anticipatory attention in infants over the second half of the first year of life. International Journal of Psychophysiology, 32, 151 – 172. Otten, L. J., Henson, R. N., & Rugg, M. D. (2002). State-related and item-related neural correlates of successful memory encoding. Nature Neuroscience, 5, 1339 – 1344. Parasuraman, R., & Davies, D. R. (1984). (Eds.). Varieties of attention. San Diego: Academic Press. Parrinello, R. M., & Ruff, H. A. (1988). The influence of adult intervention on infants’ level of attention. Child Development, 59, 1125 – 1135. Pascual-Leone, J. (2004). Hidden operators of mental attention applying on LTM give the illusion of a separate working memory. Behavioral and Brain Sciences, 27, 709 – 711. Petrides, M., & Pandya, D. N. (2002). Comparative cytoarchitectonic analysis of the human and the macaque ventrolateral prefrontal cortex and corticocortical connection patterns in the monkey. European Journal of Neuroscience, 16, 291 – 310. Pillsbury, W. B. (1917). Attention and interest. Psychological Bulletin, 14, 169 – 170. Pins, D., Meyer, M. E., Foucher, J., Humphreys, G., & Boucart, M. (2004). Neural correlates of implicit object identification. Neuropsychologia, 42, 1247 – 1259. Posner, M. I. (1980). Orienting of attention. Quarterly Journal of Experimental Psychology, 32, 3 – 25.
Endogenous Attention
319
Posner, M. I. (2004). (Ed.). The cognitive neuroscience of attention. New York: Guilford. Posner, M. I., & Cohen, Y. (1984). Components of visual orienting. In H. Bouma & D. G. Bowhuis (Eds.), Attention and Performance X (pp. 531 – 556). Hillsdale, NJ: Erlbaum. Posner, M. I., & Petersen, S. (1990). The attention system of the human brain. Annual Review of Neuroscience, 13, 25 – 42. Posner, M. I., Rafal, R. D., Choate, L. S., & Vaughan, J. (1985). Inhibition of return: neural basis and function. Cognitive Neuropsychology, 2, 211 – 228. Quinn, P. C. (2004). Development of subordinate-level categorization in 3- to 7-monthold infants. Child Development, 75, 886 – 899. Quinn, P. C., Adams, A., Kennedy, E., Shettler, L., & Wasnik, A. (2003). Development of an abstract category representation for the spatial relation between in 6- to 10-monthold infants. Developmental Psychology, 39, 151 – 163. Rafal, R., & Robertson, L. C. (1995). The neurology of visual attention. In M. Gazzaniga (Ed.), The cognitive neurosciences (pp. 625 – 648). Cambridge, MA: MIT Press. Rajkowski, J., Kubiak, P., & Aston-Jones, G. (1994). Locus coeruleus activity in monkey: phasic and tonic changes are associated with altered vigilance. Brain Research Bulletin, 35, 607 – 616. Rakison, D. H. (2005). The perceptual to conceptual shift in infancy and early childhood: a surface or deep distinction? In D. H. Rakison & L. Gershkoff-Stowe (Eds.), Building object categories in developmental time (pp. 131 – 158). Mahwah, NJ: Lawrence Erlbaum Associates. Rhodes, G., Byatt, G., Michie, P. T., & Puce, A. (2004). Is the fusiform face area specialized for faces, individuation, or expert individuation? Journal of Cognitive Neuroscience, 16, 189 – 203. Richards, J. E. (1985). The development of sustained visual attention in infants from 14 to 26 weeks of age. Psychophysiology, 22, 409 – 416. Richards, J. E. (1987). Infant visual sustained attention and respiratory sinus arrhythmia. Child Development, 58, 488 – 496. Richards, J. E. (1989). Development and stability in visual sustained attention in 14, 20, and 26 week old infants. Psychophysiology, 26, 422 – 430. Richards, J. E. (1997). Effects of attention on infants’ preference for briefly exposed visual stimuli in the paired-comparison recognition-memory paradigm. Developmental Psychology, 33, 22 – 31. Richards, J. E. (1998). (Ed.), Cognitive neuroscience of attention: A developmental perspective. Mahwah, NJ: Erlbaum. Richards, J. E. (2001). Cortical indexes of saccade planning following covert orienting in 20-week-old infants. Infancy, 2, 135 – 157. Richards, J. E. (2005). Localizing cortical sources of event-related potentials in infants’ covert orienting. Developmental Science, 8, 255 – 278. Richards, J. E., & Casey, B. J. (1991). Heart rate variability during attention phases in young infants. Psychophysiology, 28, 43 – 53. Richards, J. E., & Turner, E. D. (2001). Extended visual fixation and distractibility in children from six to twenty-four months of age. Child Development, 72, 963 – 972. Robbins, T. W. (1998). Arousal and attention: psychopharmacological and neuropsychological studies in experimental animals. In R. Parasuraman (Ed.) The attentive brain. (pp. 189 – 220). Cambridge, MA: MIT Press. Robbins, T. W., & Everitt, B. J. (1995). Arousal systems and attention. In M. Gazzaniga (Ed.), The cognitive neurosciences (pp. 703 – 720). Cambridge, MA: MIT Press.
320
John Colombo and Carol L. Cheatham
Robbins, T. W., Everitt, B. J., Marston, H. M., Wilkinson, J., Jones, G. H., & Page K. J. (1989). Comparative effects of ibotenic acid- and quisqualic acid-induced lesions of the substantia nigra nominata on attentional function in the rat: further implications for the role of the cholinergic neurons of the nucleus basalis in cognitive processes. Behavioral and Brain Research, 35, 221 – 240. Rogers, T. T., Hocking, J., Mechelli, A., Patterson, K., & Price, C. (2005). Fusiform activation to animals is driven by the process, not the stimulus. Journal of Cognitive Neurosciences, 17, 434 – 445. Rosen, A. C., Rao, S. M., Caffarra, P., Scaglioni, A., Bobholz, J. A., & Woodley, S. J. (1999). Neural basis of endogenous and exogenous spatial orienting. A functional MRI study. Journal of Cognitive Neuroscience, 11, 135 – 152. Rossion, B., Dricot, L., Devolder, A., Bodart, J-M., Crommelinck, M., de-Gelder, B. et al. (2000). Hemispheric asymmetries for whole-based and part-based face processing in the human fusiform gyrus. Journal of Cognitive Neuroscience, 12, 793 – 802. Rothbart, M. K., & Rueda, M. R. (2005). The development of effortful control. In E. Awh, S. W. Keele, & U. Mayr (Eds.), Developing individuality in the human brain: A tribute to Michael I. Posner (pp. 167 – 188). Washington, DC: American Psychological Association. Routtenberg, A. (1966). Neural mechanisms of sleep: changing view of reticular formation function. Psychological Review, 73, 481 – 499. Ruff, H. A. (1984). Infants’ manipulative exploration of objects: Effects of age and object characteristics. Developmental Psychology, 20, 9 – 20. Ruff, H. A. (1986a). Components of attention during infants’ manipulative exploration. Child Development, 57, 105 – 114. Ruff, H. A. (1986b). Attention and organization of behavior in high-risk infants. Journal of Developmental and Behavioral Pediatrics, 7, 298 – 301. Ruff, H. A., & Capozzoli, M. C. (2003). Development of attention and distractibility in the first 4 years of life. Developmental Psychology, 39, 877 – 890. Ruff, H. A., Capozzoli, M. C., & Saltarelli, L. M. (1996). Focused visual attention and distractibility in 10-month-old infants. Infant Behavior and Development, 19, 281 – 293. Ruff, H. A., Capozzoli, M. C., & Weissberg, R. (1998). Age, individuality, and context as factors in sustained visual attention during the preschool years. Developmental Psychology, 34, 454 – 464. Ruff, H. A., & Dubiner, K. (1987). Stability of individual differences in infants’ manipulation and exploration of objects. Perceptual and Motor Skills, 64, 1095 – 1101. Ruff, H. A., & Lawson, K. R. (1990) Development of sustained, focused attention in young children during free play. Developmental Psychology, 26, 85 – 93. Ruff, H. A., Lawson K. R., Parinello, R., & Weissberg, R. (1990). Long-term stability of individual differences in sustained attention in the early years. Child Development, 61, 60 – 75. Ruff, H. A., & Rothbart, M. K. (1996). Attention in early development: Themes and variations. London: Oxford University Press. Ruff, H. A., Saltarelli, L. M., Capozzoli, M., & Dubiner, K. (1992). The differentiation of activity in infants’ exploration of objects. Developmental Psychology, 28, 851 – 861. Salgado, P. P., Delaveau, P., Blin, O., & Nieoullon, A. (2005). Dopaminergic contribution to the regulation of emotional perception. Clinical Neuropharmacology, 28, 228 – 237.
Endogenous Attention
321
Sarter, M. (1994). Neuronal mechanisms of the attentional dysfunctions in senile dementia and schizophrenia: Two sides of the same coin? Psychopharmacology, 114, 539 – 550. Sarter, M., Givens, B., & Bruno, J. P. (2001). The cognitive neuroscience of sustained attention: Where top-down meets bottom-up. Brain Research Reviews, 35, 146 – 160. Saxon, T. F., Frick, J. E., & Colombo, J. (1997). Individual differences in infant visual fixation and maternal interactional styles. Merrill-Palmer Quarterly of Behavioral Development, 43, 48 – 66. Saxon, T. F., Colombo, J., Robinson, E. L., & Frick, J. E. (2000). Dyadic interaction profiles in infancy and preschool intelligence. Journal of School Psychology, 38, 9 – 25. Schiller, P. H. (1998). The neural control of visually guided eye movements. In J. E. Richards (Ed.), Cognitive neuroscience of attention: A developmental perspective (pp. 3–50). Mahwah, NJ: Erlbaum. Schmidt, W. C. (2000). Endogenous attention and illusory line motion reexamined. Journal of Experimental Psychology: Human Perception and Performance, 26, 980 – 996. Schmidt, L. A., Trainor, L. J., & Santesso, D. L. (2003). Development of frontal electroencephalogram (EEG) and heart rate (ECG) responses to affective musical stimuli during the first 12 months of post-natal life. Brain and Cognition, 52, 27 – 32. Seress, L. (2001). Morphological changes of the human hippocampal formation from midgestation to early childhood. In C. A. Nelson & M. Luciana (Eds.), Handbook of developmental cognitive neuroscience (pp. 45 – 58). Cambridge, Massachusetts: MIT Press. Shultz, T. R., & Cohen, L. B. (2004). Modeling age differences in infant category learning. Infancy, 5, 153 – 171. Smith, C., Adamson, L, & Bakeman, R. (1988). Interactional predictors of early language. First Language, 8, 143 – 156. Sokolov, Y. N. (1992). The neurophysiological mechanisms of consciousness. Journal of Russian and East European Psychology, 30, 6 – 12. Sowell, E. R., Thompson, P. M., & Toga, A. W. (2004). Mapping changes in the human cortex throughout the span of life. Neuroscientist, 10, 372 – 392. Sperling, G. (1960). The information available in brief visual presentation. Psychological Monographs, 74, 1 – 29. Steckler, T., Inglis, W., Winn, P., & Sahgal, A. (1994). The pedunculopontine tegmental nucleus: A role in cognitive processes? Brain Research Reviews, 19, 298 – 318. Stroganova, T. A., Orekhova, E. V., & Posikera, I. N. (1997). The mechanisms of attention control in infants: EEG analysis. Human Physiology, 23, 394 – 399. Stroganova, T. A., Posikera, I. N., & Pushina, N. P, & Orekhova, E. V. (2003). Lateralization of motor functions in early human ontogeny. Human Physiology, 29, 48 – 58. Tamis-LeMonda, C. S., & Bornstein, M. H. (1989). Habituation and maternal encouragement of attention in infancy as predictors of toddler language, play, and representational competence. Child Development, 60, 738 – 751. Tellinghuisen, D. J., & Oakes, L. M. (1997). Distractibility in infancy: The effects of distractor characteristics and type of attention. Journal of Experimental Child Psychology, 64, 232 – 254. Tellinghuisen, D. J., Oakes, L. M., & Tjebkes, T. L. (1999). The influence of attentional state and stimulus characteristics on infant distractibility. Cognitive Development, 14, 199 – 213.
322
John Colombo and Carol L. Cheatham
Thompson-Schill, S. L., Bedny, M., & Goldberg, R. F. (2005). The frontal lobes and the regulation of mental activity. Current Opinion in Neurobiology, 15, 219 – 224. Tiesinga, P. H. E., Fellous, J.-M., Salinas, E., Jose, J. V., & Sejnowski, T. J. (2004). Synchronization as a mechanism for attentional gain modulation. Neurocomputing, 58 – 60, 641 – 646. Treisman, A. M., & Gelade, G. (1980). A feature integration theory of attention. Cognitive Psychology, 12, 97 – 136. Tsekhmistrenko, T. A., & Vasil’eva, V. A. (2001). Structural transformations of the associative cortex as the morphological base of the development of human cognitive functions from birth to 20 years of age. Human Physiology, 27, 544 – 550. Tsekhmistrenko, T. A., Vasil’eva, V. A., Shumeiko, N. S., & Vologirov, A. S. (2004). Quantitative changes in the fibroarchitectonics of the human cortex from birth to the age of 12 years. Neuroscience and Behavioral Physiology, 34, 983 – 988. Usher, M., Cohen, J. D., Servan-Schreiber, D., Rajkowski, J., & Aston-Jones, G. (1999). The role of locus coeruleus in the regulation of cognitive performance. Science, 283, 549 – 554. Vygotsky, L. S. (1928). Pedology of the school child. Oxford, England: Second Moscow State University Press. Ward, L. M. (2003). Synchronous neural oscillations and cognitive processes. Trends in Cognitive Sciences, 7, 553 – 559. Waxman, S. R., & Braun, I. (2005). Consistent (but not variable) names as invitations to form object categories: New evidence from 12-month-old infants. Cognition, 95, B59 – B68. Webster, M. J., & Ungerleider, L. G. (1998). Neuroanatomy of visual attention. In R. Parasuraman (Ed.), The attentive brain (pp. 19 – 34). Cambridge, MA: MIT Press. Willatts, P. (1999). Development of means-end behavior in young infants: Pulling a support to retrieve a distant object. Developmental Psychology, 35, 651 – 667. Wolfe, C. D., & Bell, M. A. (2004). Working memory and inhibitory control in early childhood: contributions from physiology, temperament, and language. Developmental Psychobiology, 44, 68 – 83. Wright, J. C., & Vlietstra, A. G. (1975). The development of selective attention: from perceptual exploration to logical search. Advances in Child Development and Behavior, 10, 195 – 239. Yerkes, R. M., & Dodson, J. D. (1908). The relation of strength of stimulus to rapidity of habit formation. Journal of Comparative Neurology and Psychology, 18, 459 – 482. Young, A. M. J., Moran, P. M., & Joseph, M. H. (2005). The role of dopamine in conditioning and latent inhibition: What, when, where and how? Neuroscience and Biobehavioral Reviews, 29, 963 – 976. Younger, B. A., Hollich, G., & Furrer, S. D. (2004). An emerging consensus: Younger and Cohen revisited. Infancy, 5, 209 – 216. Zaidel, D. W. (1999). Quantitative morphology of human hippocampus early neuron development. Anatomical Record, 254, 87 – 91.
THE PROBABILISTIC EPIGENESIS OF KNOWLEDGE
James A. Dixon and Elizabeth Kelley DEPARTMENT OF PSYCHOLOGY, UNIVERSITY OF CONNECTICUT, STORRS, CT 06269–1020, USA
I. KNOWLEDGE ACQUISITION: FOUNDATIONAL ISSUES
A. LOGICAL PROBLEMS IN KNOWLEDGE ACQUISITION B. INNATE KNOWLEDGE AND CONSTRAINTS II. PROBABILISTIC EPIGENESIS
A. PRINCIPLES OF PROBABILISTIC EPIGENESIS B. EPIGENETIC ACCOUNTS OF NOVEL BEHAVIOR III. EPIGENESIS OF KNOWLEDGE
A. B. C. D. E. F.
DEVELOPING NEW REPRESENTATIONS OF THE GEAR DOMAIN DISCOVERING FIGURE-EIGHT FIRST DISCOVERING LEFT-RIGHT GENERALIZING LEFT-RIGHT FIGURE-EIGHT TO LEFT-RIGHT: SUMMARY AND IMPLICATIONS REVISITING THE LOGICAL CONUNDRUMS
IV. EPIGENESIS OF KNOWLEDGE AND SYMBOL GROUNDING
A. SYMBOL GROUNDING: ACTIONS VS. INSTRUCTIONS B. SYMBOL GROUNDING: ACTIONS VS. PERCEPTUAL CUES V. EPIGENESIS AND DETECTING STRUCTURE IN THE ENVIRONMENT VI. EPIGENETIC APPROACHES TO LANGUAGE ACQUISITION
A. EMERGENCE OF PHONOLOGY B. SHAPE BIAS AND THE LEXICON VII. CONCLUSIONS
A. CORE MODELS B. AN ILLUSTRATION OF EPIGENETIC PRINCIPLES IN KNOWLEDGE ACQUISITION C. INTEGRATING DISPARATE DEVELOPMENTAL EFFECTS D. RELATION TO OTHER SYSTEMS APPROACHES E. RESEARCH STRATEGIES FOR EPIGENETIC AND SYSTEMS APPROACHES REFERENCES
Children know a great deal about the world. From an early age, children appear to have a sophisticated understanding of their physical, linguistic, and social surroundings. Explaining how children acquire knowledge in these various domains has been a fundamental challenge for developmental science. 323 Advances in Child Development and Behavior R Kail (Editor)
ß 2006 Elsevier B.V. All rights reserved.
324
James A. Dixon and Elizabeth Kelley
Knowledge acquisition, as we discuss subsequently, is a surprisingly difficult logical problem and one that encapsulates many of the core issues in psychology and cognitive science. For example, a theory of knowledge acquisition is necessarily intertwined with issues such as the nature of representation and meaning, as well as learnability and ontogenesis. Therefore, constructing an adequate theory of knowledge acquisition has deep implications for our understanding of cognition and development. In this chapter, we outline an approach that takes knowledge acquisition as an instance of a striking, but general, developmental phenomenon, the emergence of new structure. Drawing on Gottlieb’s theory of probabilistic epigenesis (Gottlieb, 1992, 1997, 2002), we show that knowledge, like other structures, emerges from the interaction among levels in a hierarchical system. This approach circumvents many of the logical problems of knowledge acquisition and provides a rational framework for integrating seemingly disparate developmental effects. We first discuss two conundrums that have made knowledge acquisition so hard to explain. These fundamental logical problems highlight the very broad and general nature of the issue, as well as its difficulty. Next, we review the theory of probabilistic epigenesis, and provide examples in support of its major principles. We then show, using data from our laboratory and others, how probabilistic epigenesis extends to knowledge acquisition. Finally, we discuss the implications of adopting an epigenetic approach to knowledge acquisition.
I. Knowledge Acquisition: Foundational Issues For centuries, knowledge acquisition has been a core issue for philosophers and scientists concerned with epistemology (e.g., Bickhard, 2001; Chomsky, 1988; Hume, 1739/1978; Locke, 1690/1993; Piaget, 1970; Pollock & Cruz, 1999). Even a cursory examination of the extensive work in this area reveals the profound and central nature of the problem. How we theorize about knowledge acquisition determines to a large extent our approaches to representation, learning, and the development of the mind. Philosophers have plumbed the depths of this issue, identifying a host of interrelated conceptual puzzles. Although the relevance of some of these conceptual issues remains open to debate, psychological theories of knowledge acquisition have traditionally grappled with two very difficult conundrums. A. LOGICAL PROBLEMS IN KNOWLEDGE ACQUISITION
The first conundrum concerns the possibility that knowledge acquisition is akin to hypothesis testing; the child proposes a hypothesis about a relation in the
The Probabilistic Epigenesis of Knowledge
325
environment, collects evidence relevant to that hypothesis, and modifies the hypothesis in light of contrary evidence. A major difficulty with this account is that it is not clear how the child could generate initial hypotheses, unless he or she already knew quite a bit about the world (Fodor, 1975, 1981; Laurence & Margolis, 2002). Where would hypotheses come from, if the child did not have prior knowledge? One might argue that the child generates hypotheses completely at random, but this still requires that the hypotheses get their content from somewhere. For example, on seeing one object disappear behind the occluding edge of another object, a child might generate the relation ‘‘merge,’’ the first object has become part of the second object. The problem, of course, is that the generated hypothesis contains many relations itself, relations that define what it means to ‘‘merge.’’ Therefore, in order to generate a hypothesis, the child must already know the relations contained within the hypothesis. Furthermore, even if we were to stipulate that hypotheses could be randomly generated somehow, the number of potential hypotheses would be unbounded, literally infinite, which would make the probability of discovering the correct hypothesis tend towards zero. This first conundrum, therefore, appears to undermine the possibility that hypothesis testing plays a foundational role in knowledge acquisition. The second conundrum revolves around the possibility that relations are induced from regularities in the environment (Hume, 1739/1978). The idea here is that an associative system detects covariation in the world and these associations eventually become knowledge. A serious difficulty with this account is that the environment contains an infinite number of features or dimensions. The child, having no knowledge, cannot know which features will be relevant and which will be uninformative (see Keil et al., 1998). Because the child cannot know, a priori, which features to encode, the probability of detecting the covariation among the relevant features would be infinitesimally small. In this way, the second conundrum casts doubt on the possibility that relations induced from patterns of covariation are foundational for knowledge acquisition.
B. INNATE KNOWLEDGE AND CONSTRAINTS
Given these logical difficulties, some theorists have proposed that the foundations of knowledge are innate (e.g., Chomsky, 1975; Feigenson, Dehaene, & Spelke, 2004; Pinker, 1994). The proposition has been that humans are somehow endowed with knowledge about a core set of relations. This small core of knowledge makes further knowledge acquisition possible. For example, Spelke (1990) proposed that children are born with a number of principles about objects: Objects move as wholes, distinct objects move separately from one another, objects maintain their size and shape as they move, and can only affect
326
James A. Dixon and Elizabeth Kelley
movement on other objects if they come into direct contact. This innate knowledge allows infants to learn more about objects and their environment. According to Spelke, the environment regulates the development of knowledge beyond these principles, but this core knowledge is essential for knowledge acquisition to advance. Similarly, Chomsky proposed that the child is born with a module containing knowledge of the deep structure of grammar (Chomsky, 1966, 1988). Exposure to language sets parameters for the specific instantiation of grammar appropriate to one’s native language. Alternatively, other theorists have suggested that the problem of knowledge acquisition is innately constrained (e.g., Karmiloff-Smith, 1992; Leslie, Friedman, & German, 2004; Newport, 1991). Innate constraints, according to this position, substantially narrow the number of potential relations that a child might propose or encode. For example, an innate bias that focuses attention on whole objects, rather than on parts of objects, might greatly simplify the task of learning object labels (Markman, 1990). Gopnik et al. (2004) proposed that children have a specialized system for learning causal relations. This system solves the difficult problems of inferring causation, in part, by constraining the possible relations between patterns of covariation and causal inferences. Acquiring knowledge about causality is made possible by constraining the problem; these constraints are built into the system itself. Karmiloff-Smith (1992) argued for a more general set of innate constraints. She proposed that children are born with a set of innate attentional biases. These biases cause the child to encode certain aspects of the world, thereby, allowing learning to occur. An important feature of these proposals is that they license the development of knowledge via other structures that emerge without any explanation. These structures (i.e., innate knowledge or constraints) must arise somehow; they are not present in the zygote, but identifying them as innate places these structures outside the explanatory realm of developmental theory. This results in a division between structures that emerge through developmental processes and those that do not; the latter structures being somehow predetermined or given by biology. Accepting this conceptualization seems to imply that a primary objective of developmental research is to identify those structures that are developmental and those that are predetermined biologically.
II. Probabilistic Epigenesis We present an alternative view of knowledge acquisition in which all structures are the products of epigenesis. In the epigenetic approach, development occurs as the result of interactions among and within levels in a
The Probabilistic Epigenesis of Knowledge
327
Environment Behavior Neural Activity Genetic Activity Development Fig. 1. An example of a hierarchical, layered developmental system, adapted from Gottlieb (1992). Development proceeds from left to right, as shown by the arrow at the bottom. The diagonal arrows show that each level affects adjacent levels and is affected by adjacent levels. The current state of each level also affects its next state, as indicated by the arrowhead at the end of each level.
hierarchical system. Figure 1 shows an example of such a system, adapted from Gottlieb (1992). In this example, four levels interact over time: environment, behavior, neural activity, and genetic activity. These levels are illustrative; epigenetic analysis of a developmental phenomenon requires identifying the relevant levels within the system, levels which may have little overlap with those in Gottlieb’s idealized example. That said, the diagram highlights major principles of the epigenetic approach. These principles are about the general structure of developmental systems, rather than about the content of the levels. A. PRINCIPLES OF PROBABILISTIC EPIGENESIS
First, levels that are adjacent within the system directly and reciprocally affect one another. For example, in the diagram, genetic activity affects neural activity and the resulting neural activity affects genetic activity. Non-adjacent levels also affect one another, but these effects are mediated by the intervening levels. For instance, Figure 1 shows that the environment affects neural activity through its effects on behavior. Second, new structures within the system emerge from the interactions among existing structures. Existing structures were the products of epigenesis at an earlier point in time. In this way, structures in the system are both products of epigenesis and potential agents of epigenesis. Note that this redefines the developmental problem in an important way. Development change is always the result of structures that already exist; these structures are developmentally prior, but they are not predetermined. On this view, all developmental analysis, from embryology to peer relations, involves examining a slice of the developmental stream. The stream always contains structures that are the agents of epigenesis (and were its products earlier). Third, time is central to the epigenetic account. Interactions at different levels will occur on different time scales (Van Orden, Moreno, & Holden, 2003;
328
James A. Dixon and Elizabeth Kelley
Wagman & Miller, 2003). For example, the genetic activity involved in creating neurotransmitters occurs very rapidly relative to the behaviors that involve those neurons. Development involves integrating effects over a wide range of time scales. Finally, the diagram shows that all levels have both concurrent and lagged effects on other levels in the system. For example, the environment at this moment in time affects one’s current behavior, a concurrent effect. However, one’s prior environments will also affect current behavior; these are lagged effects. Note that lagged effects cascade through the system over time. We note that age effects in the model are driven by the local interactions among levels. Chronological time does not have causal status, rather the system is self-regulating in terms of the timing of developmental changes (see Karmiloff-Smith, 1997; Sisk & Foster, 2004). Thus, although age-related change is often very useful descriptively, it is ultimately a phenomenon to be explained. Models such as this one are motivated, in part, by long established findings from developmental biology. For example, the principle of equipotentiality states that cells have the ability to become part of any system in the body (Dreisch, 1929; Gottlieb, 2001). The implication here is that a particular cell’s function and structure is not predetermined by an internal plan, rather it is the product of interactions across the system. These models are also motivated by subsequent work in developmental neuroscience. For example, sensory stimulation affects the expression of immediate-early genes, which in turn create a cascade of subsequent effects on other genes and on other levels in the system (see Johnston & Edwards, 2002). This process results in changes at the neural level and ultimately at the behavioral level. The development of new structures in the system, from cells to complex behaviors, is not predetermined by genetics; instead new structures apparently emerge through interactions among levels in the system (Wahlsten, 1999). The epigenetic approach also resonates with other developmental systems approaches (e.g., Clark, 1997; Fischer & Rose, 1994; Ford & Lerner, 1992; Goldfield, 1995; Oyama, 1985; Thelen & Smith, 1994; van Geert, 1998). Although a review of these approaches and their relation to probabilistic epigenesis is beyond the scope of this chapter, we discuss connections to specific systems approaches in the final section.
B. EPIGENETIC ACCOUNTS OF NOVEL BEHAVIOR
Consider, as an example of epigenetic processes, the development of an instinctive behavior in Peking ducklings (see Gottlieb, 1997, for a review). Peking ducklings exhibit a striking behavioral phenomenon: They approach
The Probabilistic Epigenesis of Knowledge
329
when their species-appropriate maternal call is presented. This behavior occurs when the ducklings are newborns ( 24 hours after hatching). Ducklings that have never heard any maternal vocalizations previously (and who have never been in any danger) respond to the maternal call by approaching the sound (i.e., an electronic speaker), just as maternally reared ducklings do. The behavior is novel in its structure; ducklings do not approach a sound prior to hearing the maternal call, nor have they seen other ducklings perform the behavior. In sum, the ducklings’ approach behavior constitutes a new structure and seems to reflect knowledge, at least in the sense of making an appropriate response to an auditory signal. One might suggest, given that the ducklings are so young and have not been exposed to maternal calls previously, that this behavior was innate; encoded in, and constructed from, a genetic blueprint. However, the work of Gottlieb and his colleagues has shown that the ducklings’ approach behavior develops through epigenetic processes (e.g., Gottlieb, 1998). For example, in a series of elegant experiments, Gottlieb and his colleagues showed that exposure to self-generated vocalizations, while still in the egg, is a necessary component of this developmental system. Ducklings that were devocalized in the egg prior to making their first vocal sounds did not exhibit the approach behavior after hatching. Similarly, Miller (1994) showed that manipulating the social context in which (fully vocal) ducklings were reared (i.e., in isolation vs. with conspecifics) affected the ducklings’ responsiveness to a second type of maternal call, the alarm call. The alarm call results in ducklings exhibiting freezing behavior—they immediately stop moving and hold completely still. Ducklings reared in isolation do not respond to alarm calls that have repetition rates like those observed in the wild, but do respond to faster rates. Social interactions affect the tuning of the ducklings’ perception of the alarm call. These results illustrate that the ducklings’ behavior in response to maternal calls is the result of complex interactions among levels of the developing system; vocals, auditory, and social levels are all implicated in its ontogenesis. Note that this account also explicitly includes genetic effects, but they impact the system through their effects on adjacent levels, not by creating new structures, novel behavior in the current example, independently.
III. Epigenesis of Knowledge Epigenetic accounts hold considerable promise for explaining the development of new structures including, we argue, knowledge. New knowledge, presumably, enters the cognitive system prenatally and continues to develop through adulthood. Clearly, a complete theory of knowledge acquisition will explain development across this broad age range. We suggest that, because epigenetic principles can explain the development of new structures, they will
330
James A. Dixon and Elizabeth Kelley
undergird such a theory. In this section, we illustrate how epigenetic principles can be applied to knowledge acquisition, using our research on the development of understanding gear systems in children (ages 8 and 12 years) and young adults as an example. Our goal here is to show that an epigenetic approach to knowledge acquisition can circumvent the logical problems outlined previously. The broader goal of research within this framework is to explain how structures that emerge in prenatal development and infancy, as well as later in life, interact to produce knowledge.
A. DEVELOPING NEW REPRESENTATIONS OF THE GEAR DOMAIN
In an initial study (Dixon & Bangert, 2002), we asked participants to predict the final turning direction of one gear, the target gear, given the turning direction of the driving gear, a gear that provided the force to the system. (See Figure 2 for examples.) Participants could solve the gear problems any way they wished. We encouraged participants to think aloud, and we videotaped and coded their solution strategies. The gear problems were presented in the context of a computerized train race. The participant’s train was racing against a train controlled by the computer. The gear systems were presented as fueling stations; correctly predicting the turning direction of the target gear resulted in catching the fuel, thus making one’s train go faster. Participants completed two separate sessions. They solved 16 gear problems in the first session and 32 in the second. Participants could take as much time as they wished to solve each problem. The computer provided feedback about whether the target gear turned in the predicted direction, however, participants did not observe the motion of the other gears. Over the course of the study, participants’ understanding of the gear system changed radically. These changes form a sequence, both conceptually and empirically (Lehrer & Schauble, 1998; Schwartz & Black, 1996). The lowest level change involved discovering the simple physics of the system: Each gear turns around its center axis and the teeth of one gear push on the teeth of the next gear in the chain. Many participants, after generating incorrect hypotheses about the physics of the gears (e.g., all the gears turn the same way) or guessing at the answer, discovered that the gears turn and push one another. Because the hand movements that result from tracing the force (i.e., the turning and pushing) across the gear system resemble a figure eight, we refer to this as the FigureEight strategy. In a second change, participants discovered that the gears form an alternating series: adjacent gears turn in opposite directions. Understanding this relation allowed participants to simply classify each gear as turning ‘‘left’’ or ‘‘right’’ (i.e., counter-clockwise or clockwise), rather than computing
331
The Probabilistic Epigenesis of Knowledge
Small, One Pathway, NoExtraneous ?
?
?W ?
W
H
?
?
? ?
G
H
?
X
? ?
?
W
H
?
Large, One Pathway, No Extraneous
?
?
?
M
G
?
?
? X
?
Small, Two Pathways, No Extraneous
?
H
?
W
G
?
Jams!!
?
?
Q
?
?
?
?
Jams!!
Jams!!
?
Q
?
?
W
?
?
Jams!!
? H ?
?
G
M
?
Large, Two Pathways, NoExtraneous
?
? Jams!!
Small, Two Pathways, Extraneous
Fig. 2. Examples of different types of gear system problems. The driving gear which provided the system has a single-headed arrow on its face. The task was to predict the movement of the target gear in order to obtain the ‘‘fuel,’’ a pile of black coal. Participants clicked one of the two ramps below the shelf that held the coal to indicate the turning direction of target gear, or the button labeled ‘‘Jams!!!’’ which indicated that the target gear would not turn. (Gear systems with two pathways can present opposing forces at the target gear and, therefore, not turn.) Three different dimensions are illustrated by the examples: size (small, large), number of pathways (one, two), and extraneous gear (absent, present).
332
James A. Dixon and Elizabeth Kelley
the turning and pushing of each individual gear. We call this the Left-Right strategy. Finally, some participants discovered that the parity of the number of gears (i.e., odd or even) indicated whether the target gear would turn in the same direction as the driving gear. If the number of gears was odd, the target and driving gear turned in the same direction. If the number of gears was even, the target geared turned in the opposite direction. We refer to this as the CountingParity strategy. How could participants discover these relations, given such sparse feedback (i.e., whether the predicted turning direction was correct) and no opportunity to observe the movement of the gears themselves? This mystery is compounded by the fact that both the Figure-Eight and Left-Right strategies produce correct answers (when performed appropriately), so it seems unlikely that error can be driving changes beyond the first level in the sequence. Indeed, we have shown empirically that error does not drive changes beyond Figure-Eight (see Dixon & Bangert, 2002). The explanation of these spontaneous discoveries lies in the epigenetic principles outlined previously. We propose that the environment, perception-action system, working, and long-term memory systems form a hierarchical, layered system, as shown in Figure 3. The gear-system task provides a highly structured environment. The perception-action system detects affordances and produces actions in the environment (Gibson, 1986; Turvey, Carello, & Kim, 1990). Working memory receives information from the perception-action system, both about the affordances themselves and the person’s actions (Barsalou et al., 2003). Working memory contributes to the perception-action system by providing information about recent affordances and actions. Long-term memory receives information from working memory and contributes information about the history of the system back to working memory (Goldinger, 1998). In the next section, we address the discovery of the Figure-Eight and LeftRight strategies as a function of this system. Because the Counting-Parity
Environment Perception/Action Working Memory Long-term Memory
Development Fig. 3. The environment, perception-action, working and long-term memory are shown in a hierarchical, layered developmental system. Diagonal arrows show the reciprocal causal effects between adjacent levels. Each level also affects its own future state.
The Probabilistic Epigenesis of Knowledge
333
strategy was primarily discovered by the older students and predicting its discovery requires a quite detailed analysis of the types of gear problems, we do not discuss it further here. Full details are available in Dixon and Bangert (2004).
B. DISCOVERING FIGURE-EIGHT FIRST
We consider how participants discover the Figure-Eight strategy, the lowest correct strategy in the sequence. The perception-action system detects affordances in the environment (e.g., gears turn) and receives information from memory (e.g., items that look the same, act the same). For some children, this process results in the Figure-Eight strategy, but for others it generates an incorrect interpretation of the gear system (e.g., all the blue gears turn one way). When the perception-action system generates a strategy that results in errors, it modifies the interpretation of the environment by responding to different affordances (Adolph, 2002) and/or integrating different information from memory. This process is an important component of theory revision which has been investigated in some detail (e.g., Kuhn et al., 1995). Consistent with this account, Dixon and Bangert (2002) showed that the probability of discovering the Figure-Eight strategy increased as a function of recently using an incorrect strategy, but decreased as a function of recently guessing at the answer. (We considered the previous 5 trials as ‘‘recent’’ for these analyses). Both incorrect strategies and guessing produced chance performance, but only engaging the perception-action system (i.e., proposing an incorrect strategy) increased the likelihood of discovering the turning and pushing interpretation (i.e., the Figure-Eight strategy). These effects did not depend on grade level. The ability of the perception-action system to modify behavior in response to error allows the system to arrive at a low-level strategy, Figure-Eight, for solving the problem correctly. Consistent with the epigenetic principles described previously, this simple strategy, a product of epigenesis, then becomes an agent of epigenesis. That is, repeated use of the Figure-Eight strategy interacts with other structures in the system to extract a new relation about the gear system.
C. DISCOVERING LEFT-RIGHT
Recall that the Figure-Eight strategy involves tracing the turning motion of each gear and the pushing of interconnected teeth. These local computations on each gear and gear-to-gear coupling are based on the ‘‘turning’’ and ‘‘pushing’’ affordances. However, when this strategy is employed across a set of gears,
334
James A. Dixon and Elizabeth Kelley
a higher-order relation emerges: The gears form an alternating series. The participant’s tracing motion alternates direction as he or she goes from gear to gear. In this way, the Figure-Eight strategy, the result of tracing the force in this specific environment, creates new information. Through further interactions that involve both working and long-term memory, participants extract this information and ultimately use the alternation relation as a new representation of the gear system. Each use of the Figure-Eight strategy activates working memory and creates an episodic trace in long-term memory (Goldinger, 1998; Hintzman, 1986). Episodic traces contain a wide variety of encoded features, many of which may be ultimately irrelevant to the problem (e.g., the spatial arrangement of the gears). These long-term memory traces also contain information about the participants’ motions. Therefore, if the Figure-Eight strategy was performed correctly, then the alternation relation will be part of the episodic trace. (Participants sometimes make errors as they trace the force through the system. This results in actions that violate the alternation relation). In this way, episodic traces that include the alternation relation create the potential for a new representation. When episodic traces are activated, they contribute back to the current representation in working memory; features in the current working memory representation that match those in episodic memory become more strongly activated. Therefore, when a participant uses the Figure-Eight strategy, stored information about the participant’s history with the Figure-Eight strategy contributes to activation in working memory. To the degree that the participant has used Figure-Eight accurately, stored information about the alternation relation contributes activation to the representation in working memory. Repeated and successive use of the strategy increasingly activates the relational information, as the representation does not have time to decay in working memory (Barrouillet, Bernardin, & Camos, 2004). In this way, the episodic traces form the potential for a new representation and the activation that results from repeated, successive use of the strategy catalyzes the traces into a representation. Dixon and Bangert (2002) showed that, consistent with this account, discovery of the Left-Right strategy was predicted by the interaction between two factors, the proportion of correct Figure-Eight uses on all previous trials and the proportion of Figure-Eight use on recent trials. The proportion of correct Figure-Eight uses across the participant’s entire history indexes the degree to which episodic traces of the Figure-Eight strategy contain the alternation relation. The proportion of Figure-Eight use on recent trials, computed across the 5 previous trials, indexes the degree to which the FigureEight information is activated in working memory. As can be seen in Figure 4, the probability of discovering the Left-Right strategy, literally its first use, rose
335
The Probabilistic Epigenesis of Knowledge
Probability of Discovering LR on Focal Trial
.10 .08 .06 F8 Accuracy .04
1.0
.02
.8 .6
.00 0.2
0.4
0.6
0.8
.4 .2 .0
1
Proportion of F8 Use on Recent Trials Fig. 4. Model predictions from the event history analysis of the discovery of the Left-Right strategy. The probability of discovering Left-Right is shown as a function of the proportion of Figure-Eight (F8) use of recent trials (horizontal axis) with a separate curve for level of FigureEight accuracy. The estimated probability is for a single focal trial; these probabilities cumulate additively across trials. Therefore, changes in these seemingly small values have large effects over the course of the experiment.
dramatically when both these factors increased. That is, we can predict the discovery of the Left-Right strategy from measures of how the Figure-Eight strategy has affected the contents of long-term memory and the degree to which it has activated working memory. In this way, the Figure-Eight strategy acts as an integral part of the knowledge acquisition system; it has become an agent of further epigenesis. The account presented here suggests that the Figure-Eight strategy, in interaction with the environment, creates information about the alternation relation. Through further interactions with working and long-term memory this information forms a representation. This account allowed us to predict, in real time, the discovery of a new problem representation—the gears form an alternating series. The effects of these predictors did not depend on grade. This account also makes predictions about differences in the generalizability of the alternation strategy (i.e., Left-Right) after its initial discovery. These additional predictions provide converging evidence about the underlying processes and illustrate an important epigenetic principle.
D. GENERALIZING LEFT-RIGHT
If each time the Figure-Eight strategy is used correctly it lays down in longterm memory an instance containing the alternation relation, as well as other
336
James A. Dixon and Elizabeth Kelley
features, then the number of those instances should affect the quality of the resulting representation. As the number of correct Figure-Eight instances increases, use of Figure-Eight will more clearly activate their common features (e.g., alternation) relative to their idiosyncratic features (Hintzman, 1986). For example, assume that a child first solved a small gear-system problem with the Figure-Eight strategy. The next use of the Figure-Eight strategy would activate that episodic trace, including the alternation relation and other features of that previous problem, such as having a small number of gears. Assume that the child then solved a second problem with Figure-Eight, but this problem had a large number of gears. Both these problem-solving experiences create episodic traces. When the child uses the Figure-Eight strategy for a third time, both stored traces (from the first and second problems) will be activated. Note that both instances share the alternation relation, but have conflicting values for the size feature (i.e., small vs. large). Therefore, the joint activation of these two instances will highlight alternation, but the small and large values for size will, in effect, cancel each other out. In this way, a greater number of correct Figure-Eight uses prior to discovering Left-Right should yield an increasingly decontextualized or abstract representation of the alternation relation. That is, the shared alternation feature should emerge in sharp relief, but other features, because they take multiple values, should average into the background. Abstract, decontextualized representations generalize more easily to different types of problems. Therefore, the greater the number of correct Figure-Eight uses prior to the participant’s discovery of Left-Right, the greater the probability of their generalizing the Left-Right strategy to different problem types. To test this hypothesis, we examined the probability of generalizing the LeftRight strategy. For these analyses, we considered each gear system as belonging to one of eight types, defined along three dichotomous dimensions: size (small, large), number of pathways (one, two), and extraneous gear (present, absent). (An extraneous gear is turned by the system but does not contribute to the outcome, as illustrated in the bottom panel of Figure 2). Each participant who discovered the Left-Right strategy must, of course, have first used it on some particular type of problem (e.g., a small, one-pathway problem, with an extraneous gear). On encountering a different type of problem (e.g., a large, onepathway problem, with an extraneous gear), he or she may have applied the Left-Right strategy, a generalization of the strategy, or fallen back on another strategy, such as Figure-Eight. Therefore, each time a participant encountered a problem that was of a different type (i.e., different on at least one dimension), relative to the problem on which he or she had discovered Left-Right, he or she was ‘‘at risk’’ for generalization (Trudeau & Dixon, 2004). We modeled the conditional probability of generalizing the Left-Right strategy, given that the participant was currently at risk for generalization, as a function of the number of correct Figure-Eight strategy uses prior to their
337
The Probabilistic Epigenesis of Knowledge
Probability of Generalizing Left-Right
0.8
0.6
0.4
0.2
0 1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
Correct Figure-Eight uses prior to Discovering Left-Right Fig. 5. Model predictions from the event history analysis of the generalization of the Left-Right strategy. The probability of generalizing Left-Right is shown as a function of the number of correct Figure-Eight uses prior to discovering Left-Right.
discovery of Left Right (i.e., first use of Left-Right). We also included as control variables the number of Left-Right uses (prior to the current trial), the proportion of correct Figure-Eight use prior to discovering Left-Right, and the proportion of Figure-Eight use on the 5 trials preceding discovery of Left-Right. (The latter two predictors were used previously to predict discovery of Left-Right). Consistent with the account described previously, the number of correct Figure-Eight instances prior to discovering Left-Right predicted the probability of generalizing the Left-Right strategy post-discovery. Figure 5 shows the results as model predictions. Participants who had correctly used Figure-Eight a greater number of times prior to discovery were more likely to generalize the Left-Right strategy to different types of problems.
E. FIGURE-EIGHT TO LEFT-RIGHT: SUMMARY AND IMPLICATIONS
The analyses we discussed here are admittedly complex. Therefore, we summarize the major findings here and then discuss how the transition from Figure-Eight to Left-Right illustrates many of the epigenetic principles we presented in the introduction.
338
James A. Dixon and Elizabeth Kelley
First, we note the interaction between the perception-action system and the task environment creates the new relational information. The perception-action system specifies the turning of the individual gears and pushing at their junctions. It does not contain the alternation information in some hidden form. Participants are tracing the force through the system from the driving gear to the target gear. Given this particular task environment in which the gears are in a single plane and connected directly to one another, tracing the force results in alternating motions. However, if two gears were, for example, at right angles to each other, tracing the force through the system would not produce alternation; movement in one plane would be orthogonal to movement in the other. Similarly, the alternation information is not in the display (i.e., the task environment); the gears do not actually move. Alternation is quite literally created by correctly tracing the force across the gears (i.e., performing the Figure-Eight strategy). We showed that discovery of the alternation relation (as evidenced by using the Left-Right strategy) was predicted by two aspects of Figure-Eight strategy use: accuracy over all previous trials and successive use on recent trials. Accuracy of the Figure-Eight strategy over all previous trials indexed the degree to which episodic memory contained the alternation relation. Successive use of the Figure-Eight strategy on recent trials indexed the degree to which FigureEight information was activated in working memory. The interaction of these two factors predicted discovery of the alternation relation. In addition, we showed that the generalization of the Left-Right strategy across different problem types was predicted by the number of correct FigureEight uses in episodic memory, prior to discovering Left-Right. As the corpus of exemplars that contain the alternation relation becomes more diverse, activation of those exemplars results in a representation of alternation that is dissociated from the idiosyncratic features of the particular problems, and therefore, generalizes more easily. The four epigenetic principles presented previously are illustrated by these results. First, adjacent levels in the system directly and reciprocally affect one another. For example, the environment directly affects the perception-action system and vice versa. Non-adjacent levels also affect one another, but these effects are mediated; that is, they run through whatever levels are intervening. For example, long-term memory affects perception-action through its contributions to working memory. Second, the new structure, the representation of the alternation relation, emerges as the result of interactions among existing structures. Alternation information is created by the interaction between the perception-action system and the environment; further interactions between the perception-action, working, and long-term memory levels result in a representation of the relation. The existing structures, which serve as the current agents of epigenesis, were
The Probabilistic Epigenesis of Knowledge
339
themselves products of epigenesis earlier. The Figure-Eight strategy illustrates this most clearly. Figure-Eight develops as a result of interactions between the environment and perception-action system, and then goes on to play a crucial role in the acquisition of the alternation relation. Third, the interactions among the levels occur on different time scales. For example, the perception-action system and the environment interact moment-tomoment to produce motions that trace the force through the system. Detecting the alternation relation requires integrating across those moments; working memory, through its interaction with the perception-action system, encodes the pattern of motions at that higher time scale—across the gears in the system. The encoded relation emerges as a representation through further integration across a broader temporal span; the individual episodic traces of the Figure-Eight strategy which contain the alternation relation must be repeatedly activated. Fourth, the levels in the system have both concurrent and lagged effects. For example, a child’s current use of the Figure-Eight strategy activates working memory and, therefore, has a concurrent effect on the probability of discovering alternation. On subsequent trials, that same use of the Figure-Eight strategy affects the probability of discovering alternation through its episodic trace in long-term memory, a lagged effect. However, even after discovery of the alternation relation, the Figure-Eight strategy has lasting, lagged effects. That is, use of the Figure-Eight strategy affected the quality of the resulting representation of alternation, as indexed by its generalizability. This raises an important point embedded in the epigenetic conception of development: Small differences in the interactions among levels early in development can have longterm and non-obvious effects on later behavior and development. For instance, making a mistake while tracing the force through the system alters one’s ability to apply the alternation relation to different types of problems.
F. REVISITING THE LOGICAL CONUNDRUMS
Now that we have sketched an epigenetic account of knowledge acquisition in the gear-system task, we briefly discuss how epigenetic accounts in general avoid the logical conundrums presented previously, and then illustrate these points using the gear system results as an example. Recall that both logical problems we presented concern how knowledge acquisition, considered either as hypothesis testing or the detection of covariation, could get off the ground. That is, without some prior knowledge it would be impossible to generate hypotheses or detect covariation. Knowledge acquisition, both conundrums hold, cannot start from zero. Epigenetic accounts posit that this is a miscasting of the developmental problem. The question is not how to create structure from nothing, but rather
340
James A. Dixon and Elizabeth Kelley
how to create new structure from a set of existing structures—structures that were the products of previous epigenesis. We note explicitly that it is epigenesis ‘‘all the way down’’; epigenetic processes do not turn on at some point in time after a set of predetermined events has occurred (e.g., implantation in the uterine wall, completion of sensory systems, birth). Therefore, the question to be addressed is how can knowledge arise from existing structures. Recasting the question this way is not just sleight of hand that avoids theorizing about difficult issues. Rather, it directs attention to the relevant causal processes. To see more fully how an epigenetic account can circumvent the logical problems outlined previously, consider the discovery of alternation. Recall that we located the source of new relational information in the interaction between perception-action and the environment. The new relation is not generated as a hypothesis. That is, it is not transferred from some other problem context or assembled from a set of known relations (Dixon & Tuccillo, 2001). Nor is it detected in the environment. This particular external environment does not contain the alternation relation, only the target gear moves. Instead, alternation is generated through one’s own actions; the action pattern itself contains the alternation information. Because people, and probably most other self-locomoting organisms, encode their own action patterns (Classen et al., 1998), the problem of selecting which features to encode is greatly ameliorated, if not completely eliminated. In this way, the epigenetic account offered here avoids the logical problems of knowledge acquisition. Theorizing about the structures that play a central role in knowledge acquisition leads to new and testable predictions. In our account, action plays a crucial role; it literally creates new relations. Put another way, action provides the essential link between representations and the environment; the symbols on which cognition operates are the developmental products of action. This hypothesis has a long history in development (e.g., Baldwin, 1895; Piaget, 1954) and has recently reemerged as an important potential account of representation (Barsalou, 1999; Bickhard, 2001; Glenberg, 1997). The connection between action and representation has profound implications for understanding the basis of representational thought, often called the symbol grounding problem (Harnad, 1990). In the current context, the hypothesis that repeated actions create a representation of a new relation makes a testable set of predictions. We next outline the symbol grounding problem and two experiments addressing the hypothesis that representations are grounded in action.
IV. Epigenesis of Knowledge and Symbol Grounding Locating new relational information in the interaction between perceptionaction and the environment provides a way of grounding representations.
The Probabilistic Epigenesis of Knowledge
341
A widely recognized problem for theories of cognition is that they operate on arbitrary, content-free symbols (Glenberg & Robertson, 2000; Harnad, 1990; Lakoff & Johnson, 1980). These representations or symbols must ultimately have meaning for the cognizer, but how representations might carry meaning is a very tricky issue. For example, semantic networks represent meaning through connections among nodes within the system. However, neither the nodes themselves nor the connections have any content or meaning. They are only symbols in a formal system. For the formal system to be a model of cognition, the symbols must be connected to the objects and relations they represent (Glenberg, 1997; Glenberg & Robertson, 2000). We propose that representations of relations are grounded at their genesis; that is, as they are first formed. New relations enter the cognitive system through the interaction between perception-action and the environment. Representations formed in this manner are grounded, because they are constructed from action in the environment. Action has been suggested as a prime candidate for creating meaning, because it involves the relation between affordances in the environment (body-scaled information) and movement in response to those affordances (Gallese & Lakoff, 2005; Glenberg, 1997; Shannon, 1988). In this way, actions are hypothesized to embody affordances in the environment, thereby, creating a crucial, meaningful link between the person and the exogenous world.
A. SYMBOL GROUNDING: ACTIONS VS. INSTRUCTIONS
An interesting prediction from this account of symbol grounding is that representations that are created through action (or more properly through the interaction between the environment and perception-action) should be uniquely meaningful. As we described previously, these representations are constructed from the summation of multiple action patterns stored in episodic memory. Very early in life, this may be the only way or the primary way to construct new representations of relations. However, for older children and adults, it is also possible to construct relations by concatenating component relations transferred from other domains. This is, of course, what we assume happens when a set of relations, such as the spatial layout of an unfamiliar scene (Langston, Kramer, & Glenberg, 1998), is described verbally. For example, consider the drawing of three connected balance beams in Figure 6. The fulcrum of each balance beam is shown as a triangle. Each balance beam is connected to the next one in the series by a flexible joint, the ovals in the drawing. The arrow shows that the right side of the first beam will tilt down. To help a participant figure out how this system functions, allowing them to predict what will happen to the final beam in the
342
James A. Dixon and Elizabeth Kelley
? Fig. 6. An example of the balance-beam task used in Dixon and Dohn (2003). Three balance beams are connected by two flexible joints, shown as ovals. The arrow indicates that the right arm of the left-most beam is pushed down. Participants were asked to predict whether the right arm of the final beam in the series would go up or down.
system (i.e., will it go up or down), we might describe the alternation relation as follows: The way we’d like you to solve this problem is to classify the end of each balance beam as going either up or down. For example, in this problem the first balance beam is a ‘down beam’ because its end goes down. So the second balance beam will be a ‘up beam’, and the third one will be an ‘down beam’. So, to solve the problem you classify each balance beam as an ‘up’ or ‘down’ beam.
This simple verbal description of the alternation relation allows participants to transfer relations they already understand, such as ‘‘up’’ and ‘‘down,’’ and assemble them into a new structure, alternation. Although the assembled components must ultimately be grounded, each component relation also has its own history (i.e., the set of situations to which it has been previously applied). Accessing the meaning of the new relation involves activating these histories. In this way, a new relation constructed by concatenating already known relations runs into the usual problems associated with transferring relations across domains (Dixon & Tuccillo, 2001; Ross & Kilbane, 1997). Features from the previous problem contexts interfere with accessing the relation and mapping it to the target domain. By contrast, new relations constructed through actions are directly grounded—the actions are the meaning. Therefore, relations constructed from actions should be more abstract, in the sense of being easily and robustly transferable. To test this prediction, Dixon and Dohn (2003) asked college students to solve 10 balance-beam problems similar to the one shown in Figure 6. Half the participants received direct instruction on an alternation strategy which we call ‘‘Up-Down.’’ These participants were read the short, instructional paragraph presented previously, and asked to use the Up-Down strategy to solve all 10 problems. The remaining participants were asked to solve the balance-beam problems using any method they wished. We recruited college students for this experiment, rather than children, to ensure that our participants were reasonably expert at constructing good representations from verbal instructions. Based on pilot work, we anticipated that many of the non-instructed participants would trace the force through the system, using a strategy we call Force-Tracing.
The Probabilistic Epigenesis of Knowledge
343
Force-Tracing is analogous to the Figure-Eight strategy, except that participants make an alternating, sinusoidal motion, rather than a circular one. Next, we gave participants a supposedly unrelated task that shared a deep structural relation (i.e., alternation), solving gear-system problems in the context of a train race. The central question was whether participants who had constructed alternation through their own actions (the non-instructed condition) would transfer more quickly than participants who had constructed the alternation relation through our instructions. We anticipated that participants who were not instructed on the Up-Down strategy would initially use ForceTracing during the balance-beam task, and thereby construct the alternation relation through the patterns of their own actions. The resulting representation of alternation should be highly abstract. In contrast, participants who had been instructed on the Up-Down strategy should have constructed alternation by transferring the component relations specified in the verbal instructions about the strategy. First, we note that the manipulation of the balance-beam task worked as intended. In the non-instructed condition, 83% of participants used ForceTracing on the first trial, 17% used Up-Down. By trial 10, use of Force-Tracing decreased to 50%, use of Up-Down increased to 38%. In the instructed condition, nearly all participants complied with the instruction to use the UpDown strategy (a single participant used a different strategy). Accuracy in both conditions was about 100%. A central prediction was that participants in the non-instructed condition would transfer alternation to the gear-system task (i.e., as evidenced by using the Left-Right strategy) more quickly than the instructed group, despite the fact that the instructed group had just used alternation on all trials of the previous task. Figure 7 shows the proportion of participants who had discovered Left-Right (used it at least once) across the gear trials. As can be seen in the figure, on early trials, the non-instructed group was more likely to use the Left-Right strategy than their instructed counterparts. The median discovery lifetime (i.e., trial by which 50% of the sample had discovered Left-Right) for the non-instructed group was 6.5 for the non-instructed group and 11 for the instructed group. The non-instructed group transferred more quickly than the instructed group; the instructed group transferred alternation, but did so later. However, the difference in the quality of the representation of alternation should have persistent effects. That is, we should also see differences in how easy it is to employ the Left-Right strategy after the first use, depending on how one initially constructed the alternation representation (i.e., through action vs. instruction). To test this hypothesis, we examined the distribution of strategies on the 10 trials immediately following participants’ first use of Left-Right on the gear task. Figure 8 shows the distribution of strategy use on the 10 trials immediately following the first use of Left-Right. The non-instructed group
344 James A. Dixon and Elizabeth Kelley
Fig 7. The cumulative proportion of participants who had discovered Left-Right by trial, with separate curves for the instructed and non-instructed experimental conditions. The cumulative proportion is the number of participants who had discovered Left-Right on trials up to and including the current trial divided by the total number of participants in that condition.
The Probabilistic Epigenesis of Knowledge
345
Fig. 8. The proportion of participants using each type of strategy on the 10 trials immediately following discovery of Left-Right. Results for the non-instructed condition are in the top panel, the instructed condition is in the bottom panel. Figure-Eight is abbrevated F8, Left-Right is LR, and Counting Parity is CP.
346
James A. Dixon and Elizabeth Kelley
continued to use the Left-Right strategy, averaging around 90% across the 10 trials. Very rarely did these participants fall back on the Figure-Eight strategy. The instructed group used the Left-Right strategy much less often over these 10 post-discovery trials, often falling back on the lower Figure-Eight strategy. Consistent with the hypothesis that actions create more meaningful, abstract representations, non-instructed participants discovered Left-Right earlier and used it more robustly.
B. SYMBOL GROUNDING: ACTIONS VS. PERCEPTUAL CUES
One concern with this experiment was that participants in the instructed condition did not actually discover alternation, the Up-Down strategy, themselves. The experience of discovery might have made alternation particularly salient and, therefore, drive the observed effects. To address this possibility, we conducted a second experiment in which the two balance-beam conditions would both lead to discovery of alternation, but through different means. We created two types of balance-beam displays for this experiment. In the alternating fulcrum condition, participants saw balance beams that had alternating light and dark fulcrums. In the random-fulcrum condition, participants also saw balance beams with light and dark fulcrums, but the saturation did not systematically alternate (e.g., two light fulcrums might be adjacent). Figure 9 shows an example of each type problem. We predicted that participants in the alternating-fulcrum condition would tend to quickly discover alternation through the salient pattern in the display (i.e., the alternating fulcrums), rather than through their own actions. Participants in the random-fulcrum condition would be less likely to discover alternation through the display, as the fulcrums do not regularly alternate. Rather, these participants should be more likely to discover alternation through their own actions via the Force-Tracing strategy. As in the first
Alternating Fulcrums
? Random Fulcrums
? Fig. 9. Examples of the balance-beam systems presented in the alternating- and randomfulcrum conditions in Dixon and Dohn (2003), experiment 2.
The Probabilistic Epigenesis of Knowledge
347
experiment, participants solved the gear system problems after completing the balance-beam task. Consistent with our expectations, 75% of participants in the alternatingfulcrum condition used the Up-Down strategy on the first trial of the balancebeam task, 85% used it at least once. In the random-fulcrum condition, 50% used Force-Tracing on the first trial; remaining participants used the Up-Down strategy. The majority, 82%, of participants in the random-fulcrum condition used Up-Down at least once. The results for the gear task replicate those of the first experiment. Participants in the random-fulcrum condition had a median discovery lifetime (trial by which 50% of participants had discovered) of 5, compared to 8 for the alternating fulcrum condition. Similarly, participants in the random-fulcrum fell back to the Figure-Eight strategy less often than their counterparts in the alternating-fulcrum condition. Figure 10 shows the distribution of strategies across the 10 trials post-discovery of Left-Right. Consistent with the idea that action creates directly meaningful representations, participants who used the Force-Tracing strategy, a strategy that embodies the alternation information, discovered the Left-Right strategy more quickly and used the Left-Right strategy more consistently after discovery than their counterparts who had received instruction (experiment 1) or discovered alternation through a property of the display (experiment 2). The instructed participants in experiment 1 had not only heard instructions about alternation, but had also successfully executed the strategy on the balance-beam trials. Similarly, participants in the alternating-fulcrum condition had, on average, used the alternation strategy, Up-Down, on many more trials. Therefore, we might reasonably have expected participants in these conditions to have a more abstract representation of alternation (by virtue of the fact that they had used Up-Down successfully on more trials) and, thus, transfer more easily to the gear task. Indeed, most accounts of abstraction and transfer would predict just that. However, our results suggest that representations created through action transfer more easily and are used more consistently than those constructed through other means. These findings are consistent with the hypothesis that the interaction between the perception-action system and the environment provides the necessary grounding for representations at the time of their initial genesis.
V. Epigenesis and Detecting Structure in the Environment Although most behavioral situations involve physical action to some degree, it seems unlikely that actions are the only source of information about new relations. Indeed, there is substantial evidence that children acquire relational information through perceptual modalities without any accompanying actions
348
James A. Dixon and Elizabeth Kelley
Fig. 10. The proportion of participants using each type of strategy on the 10 trials immediately following discovery of Left-Right. Results for the random-fulcrum condition are in the top panel, the alternating-fulcrum condition is in the bottom panel.
The Probabilistic Epigenesis of Knowledge
349
(e.g., Casasola, 2005; Cohen, Chaput, & Cashon, 2002). For example, children readily acquire the structure available in an auditory stream (Creel, Newport, & Aslin, 2004), a learning context that does not appear to involve action. A major issue for understanding how structure in the environment could be detected by perceptual systems concerns the process of tuning such systems to the relevant dimensions (Gibson & Walker, 1984). Epigenetic principles apply here as well. Perceptual systems, like other aspects of the developing system, are both products and agents of epigenesis. That is, perceptual systems undergo development, and these developments alter the flow of information detected in the environment. Research on the effects of comparison in knowledge acquisition demonstrates how repeated interactions between the perceptual system and other levels of the developmental system change the perceptual organization of the environment, thereby affecting knowledge acquisition. Building on her well-known work on structural alignment in analogy (Gentner, 1983; Ratterman & Gentner, 1998), Gentner and her colleagues have shown in a number of studies that comparison also invokes structural alignment, and has incremental effects on perceptual tuning (Gentner & Medina, 1998; Namy & Gentner, 2002). In one such study, Kotovsky and Gentner (1996) first asked preschoolers to make comparisons based on available perceptual relations. For instance, they presented a display that showed three rectangles increasing in size from left to right and asked participants to select which of two displays was more similar. Both alternative displays contained three circles. In one, the circles increased in size, and, therefore, shared the relational property ‘‘increasing size.’’ The other display did not share the relational property. (See the upper panel of Figure 11.) Under these conditions, even quite young children (i.e., 4 years old) selected the alternative that shared the relational property about 70% of the time. However, when the relation was at a higher level, as in the lower panel of Figure 11, their performance dropped to chance. Note that in this case, objects in the alternative display change in saturation rather than size; hence a crossdimension comparison is necessary. Therefore, selecting the relational alternative (i.e., the display with increasing saturation) requires detecting the higher-order relation ‘‘increasing,’’ given the two lower-order relations ‘‘increasing size’’ and ‘‘increasing saturation.’’ Put differently, the ‘‘increasing’’ relation must be disembedded from the specific properties (e.g., size and saturation). Kotovsky and Gentner showed that concentrated experience making withindimension comparisons (e.g., increasing size increasing size) facilitated performance on cross-dimension comparisons (e.g., increasing size increasing saturation). Repeated and successive comparisons involving the lower-order relations allowed 4-year-olds to detect the higher-order relation, reliably make the relational choice on cross-dimension problems.
350
James A. Dixon and Elizabeth Kelley
Within-Dimension Comparison
Cross-Dimension Comparison
Fig. 11. Examples of within-dimension (upper panel) and cross-dimension (lower panel) comparison tasks adapted from Kotovsky and Gentner (1996). Participants were asked to select which of the two alternative displays (upper row of each panel) ‘‘goes best’’ with the standard (lower row of each panel).
An epigenetic account of this phenomenon (and one that is quite in line with that proposed by Kotovsky and Gentner) involves the interaction among various levels in the developing system. Consider how a system that contains perception, working, and long-term memory might self-organize to focus attention on the ‘‘increasing size’’ relation. The perceptual system interacts with the environment to yield representations of each triad in working memory. Initially, children’s representations contain a wide variety of relations, some of which may be quite idiosyncratic. For example, one child might focus on the spaces between the items, while another may focus on the degree to which the overall shape resembles a favorite animal. Among these many relations are the components of the relational structure of interest, ‘‘increasing size.’’ At the lowest relational level of this structure, the leftmost square in the display is smaller than the middle square (Left 5 Middle), and the middle square is smaller than the rightmost square (Middle 5 Right). At the next level, relations between adjacent squares are alike (i.e., ‘‘smaller than’’ ¼ ‘‘smaller than’’). Among the two alternative displays, one set of circles shares this relational structure.
The Probabilistic Epigenesis of Knowledge
351
Comparing these representations to one another involves aligning relational structures. This favors the component relations of the ‘‘increasing size’’ structure (e.g., Left 5 Middle), because they are common to both displays and part of a system of relations. Therefore, making a comparison activates relations within the ‘‘increasing size’’ structure. These relations become part of the representation in long-term memory. Consequently, the next presentation of the display activates those stored instances, which contributes to the organization of the working memory representation; components of the ‘‘increasing size’’ relational structure become increasingly activated in working memory. In this way, repeated comparisons serve to organize the perception of the individual displays relative to one another such that the ‘‘increasing size’’ relation becomes prominent. A similar set of events would, of course, occur for the within-dimension comparisons on ‘‘increasing saturation’’ trials. Cross-dimension comparisons that occur after repeated and successive withindimension comparisons involve aligning these now well-organized representations. This allows children to detect the commonality between the ‘‘increasing size’’ and ‘‘increasing saturation’’ relation structures. Gentner and her colleagues have shown in a number of studies that this effect depends, in part, on participants aligning the representations. In this account, the environment, perception, working and long-term memory interact to detect a new relation in the environment. The perceptual system becomes tuned to the lower-level relations through repeated alignment (i.e., comparisons). This allows the higherorder relation to be detected when a cross-dimension comparison occurs. Repeatedly engaging the system in the simple within-dimension comparison task changes its functioning, thereby allowing a new relation to be extracted. This example illustrates that the perceptual system can, through interactions with other levels of the system, become tuned to relations in the environment in a way that ultimately facilitates knowledge acquisition, the extraction of a higher-order relation. Acquiring knowledge about the physical world is a profoundly important developmental task. However, it is not the only epistemological challenge children face. For example, children also acquire a sophisticated understanding of their own mentality, social relations, and language. Language acquisition has been particularly fertile ground for theorizing about knowledge acquisition, in part, because the rich nature of its structure is relatively easy to observe (e.g., the complex syntax of a sentence) and formal theories of some of its structures have been advanced (e.g., Chomsky, 1975; Jackendoff, 2002). Because language has been such an important arena for research on knowledge acquisition, we briefly review two examples of epigenetic approaches to language development.
352
James A. Dixon and Elizabeth Kelley
VI. Epigenetic Approaches to Language Acquisition A number of researchers have proposed emergentist or dynamic systems approaches to language development (Bates & Goodman, 1999; Evans, 2001; Locke, 1999; MacWhinney, 2004; Smith, 2001; Tucker & Hirsh-Pasek, 1993). The essence of these emergent/dynamic systems approaches is epigenetic; language development occurs through the constant interaction of many levels within the developing system including physical structures in the brain, auditory perception, and linguistic input, as well as aspects of cognitive, affective, and social development. In many of these approaches, aspects of language that were traditionally viewed as very separate—phonology, semantics, syntax, and pragmatics—are seen by theorists as continuously interacting with one another to drive changes in language (e.g., Jackendoff, 2002). The system is selforganizing; its structures emerge from interactions within the system. Each moment in the system is both the product of all the moments that have occurred before, and predicates the system’s future moments. As Smith (2001) stated, ‘‘Each event creates the context for, and thus constrains, what can happen next’’ (p. 125).
A. EMERGENCE OF PHONOLOGY
A central issue in language acquisition is how language learners might come to understand and produce spoken language. This requires a representation that appropriately connects the relevant speech sounds, speech acts, and meaning of the word. Plaut and Kello (1999) proposed that phonological representations arise from the interactions among acoustic, articulatory, and semantic levels within the developing system. They presented a connectionist model in which representations of acoustics, articulations, and semantics jointly produced a hidden layer of phonological representation. Their simulation also included a sophisticated conception of exogenous influences. Specifically, they trained the system with four different types of experience: babbling comprehension, invitation, and intentional naming. For example, babbling involved the articulatory level providing input to the acoustic level. The resulting interaction between these levels (iteratively adjusting the weights such that the acoustic level better predicted the actual outcome of articulation) allowed the model to learn the complex relation between acoustics and articulation. Training on comprehension involved input at the acoustic level that is predicted by the semantic level, again with weights adjusting during the training phase. Plaut and Kello showed that a representation of the phonology emerged from the interactions among these levels. Their simulation also replicated the
The Probabilistic Epigenesis of Knowledge
353
general pattern of errors that children make as they engage in comprehension and imitation. From an epigenetic perspective, this model illustrates how a detailed theory of relevant levels in the system can produce new knowledge about language, representations of word sounds in the current example.
B. SHAPE BIAS AND THE LEXICON
Another important question for language development concerns how children might learn new word meanings. Labeling an object for a child is, from a logical perspective, uninformative; whether the new label refers to the type of object, the object’s color, its shape, or some other property, is not specified (Quine, 1960). A large body of work suggests that children have a set of biases which help overcome this logical dilemma (e.g., Gershkoff-Stowe & Smith, 2004; Markman, Wasow, & Hansen, 2003). For example, children tend to extend new labels to objects with a similar shape, hence the ‘‘shape bias.’’ Work by Gershkoff-Stowe and Smith (2004) has shown that the shape bias emerges over the course of lexical development (see also Smith, 2001). In a longitudinal study, Gershkoff-Stowe and Smith showed that lexical development, more specifically noun learning, is tied to the development of the shape bias over time. The number of nouns and the shape bias appear to develop together. Gershkoff-Stowe and Smith suggested that these two factors are mutually and continuously affecting one another; the organizational basis of the categories (i.e., the shape bias) is strongly affected by the members of the categories and vice versa. Interestingly, they noted that children’s first 25 nouns tended to be shape nouns (i.e., refer to items similar in shape). This is consistent with the idea that the shape bias emerges as a higher-order property of the category members. In the epigenetic terms we have been using here, the properties of the categorical structure, such as the shape bias, are levels in the developmental system, as is the lexicon. These levels reciprocally affect one another.
VII. Conclusions Epigenetic approaches to development have proven very useful in explaining the emergence of new structure in a variety of domains, such as early instinctive behavior (e.g., Gottlieb, 1997), memory (Brennan, Hancock, & Keverne, 1992), and neurogenesis (Nordeen & Nordeen, 1990). Understanding development requires theorizing about interactions among levels in a reciprocally causal system. Developmental explanations of new structure, therefore, require specifying the appropriate levels and how those levels interact.
354
James A. Dixon and Elizabeth Kelley
A. CORE MODELS
Adopting an epigenetic approach requires that we reconsider our core models of development (Gottlieb, 2002). Specifically, adopting an epigenetic perspective directly contradicts a widely held, implicit model of development—biology delivers a set of physical structures which then facilitate and/or constrain psychological development. This seemingly reasonable conception sets us firmly on the wrong road, because it tacitly assumes that: (a) some structures are biologically based and others are not; (b) biological structures arise from the genetic plan; and (c) non-biological structures arise from experience with the environment. Epigenetic approaches reject all three of these assumptions, replacing them with: (a) all structures (mental and physical) are systems based, a system that always includes biology; (b) all structures arise from the interaction of levels in the hierarchical system; and (c) there is no moment in developmental time at which biological influences stop and experiential influences begin (and vice versa). In this way, considering development from an epigenetic perspective changes the developmental problem fundamentally. New structures arise from interactions among existing structures; they do not arise from nothing or from predetermined structures. The question, therefore, becomes characterizing the existing structures and the processes (i.e., interactions among structures) that drive change, rather than attempting to identify which structures are predetermined (or innate or biological) and which are developmental (or learned or experiential).
B. AN ILLUSTRATION OF EPIGENETIC PRINCIPLES IN KNOWLEDGE ACQUISITION
Our research on changes in understanding the gear-system task illustrates how the epigenetic approach may be applied to knowledge acquisition. We proposed that the environment, perception-action system, working and long-term memory form a hierarchical, layered system. When a child first encounters the gear-system environment, the perception-action system organizes a response based on the perceived affordances. The perception-action system is responsive to error and capable of generating alternative interpretations of the environment. Repeatedly generating an incorrect strategy causes the perception-action system to change the response, attending to other affordances and/or reintegrating current ones. Consistent with this hypothesis, we were able to predict (if and when) participants would discover an appropriate interpretation of the gear-system task (i.e., the Figure-Eight strategy), based on their incorrect strategy use. This finding squares with many
The Probabilistic Epigenesis of Knowledge
355
current conceptions of knowledge acquisition as hypothesis testing or theory revision. More interestingly from an epigenetic perspective, after the Figure-Eight strategy enters the system, it becomes an agent of further developmental change. The actions that comprise the Figure-Eight strategy generate new relational information about the gear system—the gears form an alternating series. Further interactions with working and long-term memory extract that relational information, creating a new representation of the problem, that is, new knowledge. We showed that discovery of the alternation relation (if and when a participant first used the Left-Right strategy) was jointly predicted by recent use of the Figure-Eight strategy and its long-term accuracy, measures of working memory activation, and the contents of episodic memory, respectively. We also showed that this account of the emergence of the alternation relation from the hierarchical system made further predictions about the quality of the resulting representation. Because the alternation relation is constructed by activating the stored Figure-Eight exemplars, it follows that having more exemplars will create a representation that better reflects the common features of the exemplars (e.g., alternation), as opposed to the idiosyncratic features of particular problems. The analysis showed that the number of exemplars containing the alternation relation (i.e., number of correct Figure-Eight uses) prior to discovering the Left-Right strategy predicted the generalization by the LeftRight strategy to new problems. In this way, the epigenetic account predicts both the discovery of the new relation and its generalization post-discovery. Because epigenetic accounts are committed to the proposition that structures are the products of epigenesis, they must ultimately explain the processes that give rise to those structures. In the case of knowledge acquisition, an epigenetic theory will explain how new representations emerge from other structures. Such a theory will necessarily have implications for core problems in cognitive science, such as representation, learnability, and symbol grounding, because these issues concern how new relations could be acquired by the system. We have taken some first steps towards exploring the implications of our account of knowledge acquisition for one such core issue, symbol grounding. Because we propose that new relations are created by the system through its own actions, actions should serve to ground the resulting representations. That is, we proposed that actions provide the meaningful link between representations and the exogenous world. In two experiments, we showed that representations which arise through actions are more easily transferred, and more robustly employed, than representations which arise through direct instruction or cueing from the display. Although our results suggest that actions serve to ground representations, new representations may enter the system through other pathways. For example, a considerable body of work with infants suggests that new relations may be encoded by the perceptual system in the absence of relevant actions.
356
James A. Dixon and Elizabeth Kelley
Regardless of whether action is the sole pathway for symbol grounding, epigenetic approaches will require an explanation of how new structures (i.e., symbols) emerge. Therefore, epigenetic approaches promise to make important contributions to cognitive science quite generally.
C. INTEGRATING DISPARATE DEVELOPMENTAL EFFECTS
Epigenetic approaches also allow us to naturally integrate many disparate influences within developmental science. Consider, for example, how individual differences in genetics might have effects on discovering the alternation relation and generalizing it to new problems. Evidence indicates that allelic variation in a gene that helps regulate cholinergic neurotransmission has concurrent effects on visuospatial attention (Parasuraman et al., 2005). This is not, of course, a gene for attention, but it does affect attention through a causal chain across levels of the system. Attention, in turn, should affect one’s ability to reliably trace the force through the system, which affects the probability of discovering alternation and also has lagged effects on its generalizability. In this way, relatively small differences at the most endogenous level of the system may cascade across other levels, and ultimately have lasting effects on representation. At the other end of the system, the most exogenous level, peer interactions might also plausibly affect the probability of discovering and generalizing alternation. Peer relations, such as being rejected or neglected by one’s social group, affect self-regulation (Baumeister et al., 2005). Self-regulation, the ability to continuously monitor one’s own actions, should in turn affect problem solving with the Figure-Eight strategy, because accurately performing the Figure-Eight strategy requires keeping track of one’s actions across the gears. Accurately performing the Figure-Eight strategy has effects on discovering and generalizing alternation. Our point here is not to argue for the effects of neurotransmitter genes or peer relations, although these are potentially interesting hypotheses. Instead, we hope to emphasize that epigenetic approaches allow one to rationally integrate effects through known or hypothesized causal pathways. Epigenetic approaches also release the field from debates concerning the proportion of variance attributable to genes vs. environments. Effects from various levels, including genes and environment, become fused and inseparable over time. Both contribute to an interactive and reciprocally causal system. Therefore, although one may investigate how local differences in the activity of a level (i.e., across a defined time period) affect other levels in the system, attributing a meaningful proportion of variance to the level globally is a logical impossibility. The problem is that the levels affect one another, so variance in one is driven by variance in the other, and vice versa. The fundamental issue
The Probabilistic Epigenesis of Knowledge
357
concerns a mismatch between the underlying assumptions about how development works, specifically that genes and environment have independent effects on developmental outcomes, and actual developmental processes. Therefore, the issue is not ameliorated by having tighter measures, more dense sampling, or longitudinal and multivariate designs.
D. RELATION TO OTHER SYSTEMS APPROACHES
Epigenetic approaches are broadly compatible with other accounts of development as a self-organizing system. For example, Turvey and Fitzpatrick (1993) contrasted the ‘‘genetic program’’ view of morphogenesis, in which new structures are created according to a genetic plan, with an ‘‘execution-driven’’ account. In an ‘‘execution-driven’’ account of development, new structures emerge from the dynamic interactions among levels in the system. These interactions are not goal-directed or instructed, rather they reflect the chemical and physical properties of their constituent parts. New structures can arise from very simple initial conditions, given the appropriate dynamics. Turvey and Fitzpatrick suggested that these principles should obtain regardless of scale; the principles extend from morphogenesis, through the development of perceptionaction, cognition, and social systems. Applying a systems approach at a relatively microgenetic level, Thelen et al. (2001) created a computational model that successfully demonstrated many of the major results surrounding the A-not-B error, a mystifying error that occurs in a simple search task. Children, approximately 7 – 12 months of age, are shown a desirable object which is then hidden, in full view of the child, at location ‘‘A.’’ The child reaches for and recovers the object from ‘‘A.’’ Typically, after a few such trials, the object is hidden at a new location ‘‘B.’’ Despite the fact that the child watched the object being hidden at ‘‘B,’’ he or she still searches at ‘‘A.’’ Thelen et al.’s model included the environment, perception-action system, and memory. These levels all contributed to a planning field, which is roughly analogous to working memory. The planning field had both temporal and spatial dynamics; activation in the planning field takes time to reach its peak and decays over time, and proximal locations in the field were mutually excitatory (i.e., cooperativity). Thelen et al. showed that the model produced the A-not-B error when the cooperativity of the planning field was low; that is, the field was unable to sustain its own activation. When cooperativity was high, the model solved the search task appropriately, as slightly older children do. The model showed that interactions among levels in the system create a representation of the task and that changes in how those interactions take place (i.e., cooperativity of the planning field) created new reaching behavior. (See also Spencer & Schoner, 2003.)
358
James A. Dixon and Elizabeth Kelley
A review of system approaches to development is beyond the scope of this chapter. Therefore, we offer these examples to show the strong commonalities between epigenetic and system approaches: self-organization, multiple interacting levels, and integration over time scales, to name just a few.
E. RESEARCH STRATEGIES FOR EPIGENETIC AND SYSTEMS APPROACHES
A problem for systems theories has been that, although many researchers might agree with system principles, aligning one’s research strategies with those principles has been a significant challenge. The classic experimental, quasiexperimental, and correlational designs offer very little leverage on developmental systems. In their usual instantiations, these types of designs, and their concomitant statistical models, have difficulty with dependent measures that change continuously over time. They are even less adequate for investigating the discrete onset of new structures in time. An additional shortcoming of these models for research on developmental systems is their inability to handle predictors that change value over time. Research paradigms in developmental science must evolve to handle the complex causal structures of current theories. Computational models have filled this void to some degree. These models offer either existence proofs, showing that a particular architecture can learn a structure, or ordering proofs, showing that an architecture replicates the developmental ordering observed empirically (i.e., in children) as it learns the task (for discussions, see Cohen & Chaput, 2002; Dixon, 2005). These models may be particularly important for investigating the potential effects of unobserved variables, such as the cooperativity property in the work of Thelen et al. (2001) That said, computational models are one step removed from the data; they model the qualitative result, but not changes in real time. Nor do they employ measured predictors that change in real time. We suggest that statistical models originally developed for longitudinal data (e.g., Singer & Willett, 2003) coupled with microgenetic designs can provide researchers with enormously powerful tools for investigating developmental systems. These mature statistical methods allow for the prediction of change in continuous measures over time and for predicting the onset of discrete changes, such as a new strategy. These methods also handle time-varying predictors; one can predict what the system will do next based on its previous states. Microgenetic designs, because they densely sample the states of the system (Kuhn, 1995; Siegler & Crowley, 1991), may be especially useful for understanding how change in the system emerges as a function of its own actions. Accepting epigenetic principles has profound implications for developmental theory, in general, and for knowledge acquisition, in particular. One major
The Probabilistic Epigenesis of Knowledge
359
benefit of adopting an epigenetic model is that it brings us closer to the processes under investigation. That is, the system we are studying is much more like a self-organizing, multi-layered system, than an additive or multiplicative, linear one. Embracing this complexity can yield important insights into development.
REFERENCES Adolph, K. E. (2002). Learning to keep balance. In R. Kail (Ed.), Advances in child development and behavior. (Vol. 30, pp. 1 – 40). San Diego CA: Academic Press. Baldwin, J. M. (1895). Mental development in the child and the race: Methods and processes. New York: Macmillan. Barrouillet, P., Bernardin, S., & Camos, V. (2004). Time constraints and resource sharing in adults’ working memory spans. Journal of Experimental Psychology: General, 133, 83 – 100. Barsalou, L. W. (1999). Perceptual symbol systems. Behavioral and Brain Sciences, 22, 577 – 660. Barsalou, L. W., Simmons, W. K., Barbey, A. K., & Wilson, C. D. (2003). Grounding conceptual knowledge in modality-specific systems. Trends in Cognitive Science, 7, 84 – 91. Bates, E., & Goodman, J. C. (1999). On the emergence of grammar from the lexicon. In B. MacWhinney (Ed.), The emergence of language (pp. 29 – 70). Mahwah, NJ: Erlbaum. Baumeister, R. F., DeWall, C. N., Ciarocco, N. J., & Twenge, J. M. (2005). Social exclusion impairs self-regulation. Journal of Personality and Social Psychology, 88, 589 – 604. Bickhard, M. H. (2001). Why children don’t have to solve the frame problems: Cognitive representations are not encodings. Developmental Review, 21, 224 – 262. Brennan, P. A., Hancock, D., & Keverne, E. B. (1992). The expression of the immediateearly genes c-fos, egr-1, and c-jun in the accessory olfactory bulb during the formation of an olfactory memory in mice. Neuroscience, 49, 277 – 284. Casasola, M. (2005). When less is more: How infants learn to form an abstract categorical representation of support. Child Development, 76, 279 – 290. Chomsky, N. (1966). Cartesian linguistics. New York: Harper & Row. Chomsky, N. (1975). Reflections on language. New York: Pantheon. Chomsky, N. (1988). Language and problems of knowledge: The Managua Lectures. Cambridge, MA: MIT Press. Clark, A. (1997). Being there: Putting brain, body and world together again. Cambridge, MA: MIT Press. Classen, J., Liepert, J., Wise, S. P., Hallett, M., & Cohen, L. G. (1998). Rapid plasticity of human cortical movement representation induced by practice. Journal of Neurophysiology, 79, 1567 – 1573. Cohen, L. B., & Chaput, H. H. (2002). Connectionist models of infant perceptual and cognitive development: Comment. Developmental Science, 5, 173 – 175. Cohen, L. B., Chaput, H. H., & Cashon, C. H. (2002). A constructivist model of infant cognition. Cognitive Development, 17, 1323 – 1343.
360
James A. Dixon and Elizabeth Kelley
Creel, S., Newport, E. L., & Aslin, R. N. (2004). Distant melodies: Statistical learning of nonadjacent dependencies in tone sequences. Journal of Experimental Psychology: Learning, Memory, and Cognition, 30, 1119 – 1130. Dixon, J. A. (2005). Strong tests of developmental ordering hypotheses: Integrating evidence from the second moment. Child Development, 76, 1 – 23. Dixon, J. A., & Bangert, A. S. (2002). The prehistory of discovery: Precursors of representational change in solving gear-system problems. Developmental Psychology, 38, 918 – 933. Dixon, J. A., & Bangert, A. S. (2004). On the spontaneous discovery of a mathematical relation during problem solving. Cognitive Science, 28, 433 – 449. Dixon, J. A., & Dohn, M. C. (2003). Redescription disembeds relations: Evidence from relational transfer and use in problem solving. Memory & Cognition, 31, 1082 – 1093. Dixon, J. A., & Tuccillo, F. (2001). Generating initial models for reasoning. Journal of Experimental Child Psychology, 78, 178 – 212. Dreisch, H. (1929). The science and philosophy of the organism. London: Black. Evans, J. L. (2001). An emergent account of language impairments in children with SLI: Implications for assessment and intervention. Journal of Communication Disorders, 34, 39 – 54. Feigenson, L., Dehaene, S., & Spelke, E. S. (2004). Core systems of number. Trends in Cognitive Sciences, 8, 307 – 314. Fischer, K. W., & Rose, S. P. (1994). Dynamic development of coordination of components in brain and behavior: A framework for theory and research. In G. Dawson & K. W. Fischer (Eds.), Human behavior and the developing brain (pp. 3 – 66). New York: Guilford. Fodor, J. A. (1975). The language of thought. Cambridge, MA: Harvard University Press. Fodor, J. A. (1981). The present status of the innateness controversy. In J. Fodor (Ed.) Representations: Philosophical essays on the foundations of cognitive science (pp. 257 – 316). Cambridge, MA: MIT press. Ford, D. H., & Lerner, R. M. (1992). Developmental systems theory: An integrative approach. Newbury Park, CA: Sage. Gallese, V., & Lakoff, G. (2005). The brain’s concepts: The role of the sensory-motor system in conceptual knowledge. Cognitive Neuropsychology, 22, 455 – 479. Gentner, D. (1983). Structure-mapping: A theoretical framework for analogy. Cognitive Science, 7, 155 – 170. Gentner, D., & Medina, J. (1998). Similarity and the development of rules. Cognition, 65, 263 – 287. Gershkoff-Stowe, L., & Smith, L. B. (2004). Shape and the first hundred nouns. Child Development, 75, 1098 – 1114. Gibson, J. J. (1986). The ecological approach to visual perception. Hillsdale, NJ: Erlbaum. Gibson, E. J., & Walker, A. S. (1984). Development of knowledge of visual-tactual affordances of substance. Child Development, 55, 453 – 460. Glenberg, A. M. (1997). What memory is for. Behavioral & Brain Sciences, 20, 1 – 55. Glenberg, A. M., & Robertson, D. A. (2000). Symbol grounding and meaning: A comparison of high-dimensional and embodied theories of meaning. Journal of Memory and Language, 43, 379 – 401. Goldfield, E. C. (1995). Emergent forms: Origins and early development of human action and perception. New York: Oxford University Press. Goldinger, S. D. (1998). Echoes of echoes? An episodic theory of lexical access. Psychological Review, 105, 251 – 279.
The Probabilistic Epigenesis of Knowledge
361
Gopnik, A., Glymour, C., Sobel, D. M., Schulz, L. E., Kushnir, T., & Danks, D. (2004). A theory of causal learning in children: Causal maps and Bayes nets. Psychological Review, 111, 3 – 32. Gottlieb, G. (1992). Individual development and evolution: The genesis of novel behavior. New York: Oxford University Press. Gottlieb, G. (1997). Synthesizing nature-nurture: Prenatal roots of instinctive behavior. Mahwah, NJ: Erlbaum. Gottlieb, G. (1998). Normally occurring environmental and behavioral influences on gene activity: From central dogma to probabilistic epigenesis. Psychological Review, 105, 792 – 802. Gottlieb, G. (2001). The relevance of development-psychological metatheory to developmental neuropsychology. Developmental Neuropsychology, 19, 1 – 9. Gottlieb, G. (2002). On the epigenetic evolution of species-specific behavior: The developmental manifold concept. Cognitive Development, 17, 1287 – 1300. Harnad, S. (1990). The symbol grounding problem. Physica D, 42, 335 – 346. Hintzman, D. L. (1986). ‘‘Schema abstraction’’ in a multiple-trace memory model. Psychological Review, 93, 411 – 428. Hume, D. (1978). A treatise on human nature. Oxford: Oxford University Press. (Original work published 1739.) Jackendoff, R. (2002). Foundations of language: Brain, meaning, grammar, evolution. Oxford: Oxford University Press. Johnston, T. D., & Edwards, L. (2002). Genes, interactions, and the development of behavior. Psychological Review, 109, 26 – 34. Karmiloff-Smith, A. (1992). Beyond modularity: A developmental perspective on cognitive science. Cambridge, MA: MIT Press. Karmiloff-Smith, A. (1997). Promissory notes, genetic clocks, and epigenetic outcomes. Behavioral and Brain Sciences, 20, 355 – 360. Keil, F. C., Smith, W. C., Simons, D. J., & Levin, D. T. (1998). Two dogmas of conceptual empiricism: Implications for hybrid models of the structure of knowledge. Cognition, 65, 103 – 135. Kotovsky, L., & Gentner, D. (1996). Comparison and categorization in the development of relational similarity. Child Development, 67, 2797 – 2822. Kuhn, D. (1995). Microgenetic study of change: What has it told us? Psychological Science, 6, 133 – 139. Kuhn, D., Garcia-Mila, M., Zohar, A., & Andersen, C. (1995). Strategies of knowledge acquisition. Monographs of the Society for Research in Child Development, 60 (4, Serial No. 245). Lakoff, G., & Johson, M. (1980). Metaphors we live by. Chicago: University of Chicago Press. Langston, W., Kramer, D. C., & Glenberg, A. M. (1998). The representation of space in mental models derived from text. Memory & Cognition, 26, 247 – 262. Laurence, S., & Margolis E. (2002). Radical concept nativism. Cognition, 86, 25 – 55. Lehrer, R., & Schauble, L. (1998). Reasoning about structure and function: Children’s conceptions of gears. Journal of Research in Science Teaching, 35, 3 – 25. Leslie, A. M., Friedman, O., & German, T. P. (2004). Core mechanisms in ‘theory of mind’. Trends in Cognitive Sciences, 8, 529 – 533. Locke, J. (1993). An essay concerning human understanding (Abridged and edited by J. W. Yolton). London: Dent. (Original work published in 1690.)
362
James A. Dixon and Elizabeth Kelley
Locke, J. L. (1999). Towards a biological science of language development. In M. Barrett (Ed.), The development of language (pp. 373 – 395). East Sussex, UK: Psychology Press. MacWhinney, B. (2004). A multiple process solution to the logical problem of language acquisition. Journal of Child Language, 31, 883 – 914. Markman, E. M. (1990). Constraints children place on word meanings. Cognitive Science, 14, 57 – 77. Markman, E. M., Wasow, J. L., & Hansen, M. B. (2003). Use of the mutual exclusivity assumption by young word learners. Cognitive Psychology, 47, 241 – 275. Miller, D. B. (1994). Social context affects the ontogeny of instinctive behavior. Animal Behaviour, 48, 627 – 634. Namy, L. L., & Gentner, D. (2002). Making a silk purse out of two sow’s ears: Young children’s use of comparison in category learning, Journal of Experimental Psychology: General, 131, 5 – 15. Newport, E. L. (1991). Contrasting concepts of the critical period for language. In S. Carey & R. Gelman (Eds.), The epigenesis of mind: Essays on biology and cognition (pp. 111 – 130). Hillsdale, NJ: Lawrence Erlbaum Associates. Nordeen, E. J., & Nordeen, K. W. (1990). Neurogenesis and sensitive periods in avian song learning. Trends in Neurosciences, 13, 31 – 36. Oyama, S. (1985). The ontogeny of information: Developmental systems and evolution. Cambridge: Cambridge University Press. Parasuraman, R., Greenwood, P. M., Kumar, R., & Fossella, J. (2005). Beyond heritability: Neurotransmitter genes differentially modulate visuospatial attention and working memory. Psychological Science, 16, 200 – 207. Piaget, J. (1954). The construction of reality in the child. New York: Basic. Piaget, J. (1970). Genetic epistemology. New York: Columbia. Pinker, S. (1994). The language instinct. New York: William Morrow. Plaut, D. C., & Kello, C. T. (1999). The emergence of phonology from the interplay of speech comprehension and production: A distributed connectionist approach. In B. MacWhinney (Ed.), The emergence of language (pp. 381 – 415). Mahwah, NJ: Erlbaum. Pollock, J., & Cruz, J. (1999). Contemporary theories of knowledge (2nd ed.). Lanham, MD: Rowman & Littlefield. Quine, W. V. O. (1960). Word and object. Cambridge, MA: MIT Press. Rattermann, M. J., & Gentner, D. (1998). More evidence for a relational shift in the development of analogy: Children’s performance on a causal-mapping task. Cognitive Development, 13, 453 – 478. Ross, B. H., & Kilbane, M. C. (1997). Effects of principle explanation and superficial similarity on analogical mapping in problem solving. Journal of Experimental Psychology: Learning, Memory, and Cognition, 23, 427 – 440. Schwartz, D. L., & Black, J. B. (1996). Shuttling between depictive models and abstract rules: Induction and fallback. Cognitive Science, 20, 457 – 497. Shannon, B. (1988). Semantic representation of memory: A critique. Psychological Bulletin, 104, 70 – 83. Singer, J. D., & Willett, J. B. (2003). Applied longitudinal data analysis. New York: Oxford University Press. Siegler, R. S., & Crowley, K. (1991). The microgenetic method: A direct means for studying cognitive development. American Psychologist, 46, 606 – 620. Sisk, C. L., & Foster, D. L. (2004). The neural basis of puberty and adolescence. Nature Neuroscience, 7, 1040 – 1047.
The Probabilistic Epigenesis of Knowledge
363
Smith, L. B. (2001). How domain-general processes may create domain-specific biases. In M. Bowerman & S. C. Levinson (Eds.), Language acquisition and conceptual development (pp. 101 – 131). New York, NY: Cambridge University. Spelke, E. S. (1990). Principles of object perception. Cognitive Science, 14, 29 – 56. Spencer, J. P., & Schoner, G. (2003). Bridging the representational gap in the dynamic systems approach to development. Developmental Science, 6, 392 – 412. Thelen, E., Scho¨ner, G., Scheier, C., & Smith, L. B. (2001). The dynamics of embodiment: A field theory of infant perseverative reaching. Behavioral and Brain Sciences, 24, 1 – 86. Thelen, E., & Smith, L. B. (1994). A dynamic systems approach to the development of cognition and action. Cambridge, MA: MIT Press. Trudeau, J. J., & Dixon, J. A. (2004). Beyond discovery: Repeated actions predict transfer of relational knowledge. Paper presented at the annual meeting of the Psychonomic Society, Minneapolis, MN. Tucker, M., & Hirsh-Pasek, K. (1993). Systems and language: Implications for acquisition. In L. Smith & E. Thelen (Eds.), A dynamic systems approach to development (pp. 359 – 384). Cambridge, MA: MIT Press. Turvey, M. T., Carello, C., & Kim, N.-G. (1990). Links between active perception and the control of action. In H. Haken & M. Stadler (Eds.), Synergetics of cognition (pp. 269 – 295). New York: Springer-Verlag. Turvey, M. T., & Fitzpatrick, P. (1993). Commentary: Development of perceptionaction systems and general principles of pattern formation. Child Development, 64, 1175 – 1190. Van Orden, G. C., Moreno, M. A., & Holden, J. G. (2003). A proper metaphysics for cognitive performance. Nonlinear Dynamics, Psychology, and Life Sciences, 7, 49 – 60. Van Geert, P. (1998). A dynamic systems model of basic developmental mechanisms: Piaget, Vygotsky, and beyond. Psychological Review, 105, 634 – 677. Wagman, J. B., & Miller, D. B. (2003). Nested reciprocities: The organism-environment system in perception-action and development. Developmental Psychobiology, 42, 317 – 334. Wahlsten, D. (1999). Single-gene influences on brain and behavior. Annual Review of Psychology, 50, 599 – 624.
This page intentionally left blank
Author Index
A Aarim-Herlot, N., 42 Aboud, F. E., 42, 43, 47, 48, 51, 59, 78 Abrams, D., 50, 78 Achenbach, T. M., 99 Ackerman, B. P., 96, 98, 102–107, 109–111, 114–117, 121 Ackil, J. K., 256, 268 Acredolo, L. P., 187 Adam, E. K., 114 Adams, E., 141, 142, 160 Adams, S. E., 302 Adamson, L., 308 Adolph, K. E., 333 Adolphs, R., 222, 224, 225, 232, 235 Adorno, T. W., 40 Alberts, D. M., 182–185 Allen, G. L., 183, 185, 187, 189 Allen, V. L., 50 Allhusen, V. D., 265, 273 Allison, T., 224 Allopenna, P. D., 5 Allport, G. W., 56, 57, 67, 71 Amaral, D. G., 224, 225, 230–232, 234 Amarel, M., 40 Ambady, N., 208 Ames, N., 40 Anderson, E. R., 103 Anderson, J., 174 Angold, A., 121 Angrilli, A., 231 Anooshian, L. J., 191 Arnsten, A. F. T., 286 Arsenio, W. F., 141, 142, 158, 160 Arterberry, M. E., 216 Asch, S., 133–135 Asher, S. R., 134
Ashkenazi, A., 289 Aslin, R. N., 3–5, 7, 12–17, 19, 22, 24–29, 32, 349 Astington, J. W., 132, 138, 141, 156, 160, 162 Aston-Jones, G., 286 Atkinson, A. P., 212 Atkinson, T. G., 290, 294, 295 Averhart, C. A., 43 Averhart, C. J., 43 B Bachevalier, J., 228, 230, 234 Baddeley, A., 304 Bahrick, L., 261 Bailey, T. M., 16 Baird, A. A., 231 Baird, J. A., 160 Bakeman, R., 308 Balaban, M. T., 219, 230, 231 Baldwin, D. A., 182 Baldwin, J. M., 290, 340 Ballem, K., 3, 9, 16, 17, 19, 20 Banaji, M. R., 42, 43, 71 Bandura, A., 45, 56, 72, 137 Bangert, A. S., 330, 332–334 Banks, J. A., 44, 61 Banks, M. S., 215 Barbas, H., 302 Bargh, J. A., 73 Baron-Cohen, S., 141 Barreca, R., 72 Barrett, D. J. K., 289 Barrouillet, P., 334 Barsalou, L. W., 332, 340 Bartko, T., 92, 109, 113 Barton, D., 6 365
366
Author Index
Bartsch, K., 150 Bates, E., 10, 14, 352 Bates, J. E., 91 Battin, D. B., 249 Bauer, P. J., 76, 299, 300, 304 Bauman, M. D., 230 Baumeister, R. F., 356 Bayley, N., 179 Baylis, G. C., 223, 224, 287 Beck, A. T., 134 Beckman, M. E., 20 Bedny, M., 302 Beer, A. L., 289 Beer, J., 271 Behniea, H., 225, 234 BEIP, 219, 230 Bell, M., 253 Bell, M. A., 301, 302 Belli, R. F., 268 Bem, S. L., 40, 48, 60, 65, 70 Bennett, J., 230, 234 Bennett, K. J., 120 Benson, P. J., 210, 213, 218 Bentin, S., 224 Berg, C. A., 134 Bernardin, S., 334 Berndt, E. J., 131, 136, 140 Berndt, T. J., 131, 136, 140 Bialer, S., 62 Bickhard, M. H., 324, 340 Biel, A., 175, 194 Bigler, R. S., 43, 44, 48, 49, 51, 52, 59, 61, 63–65, 77, 78 Billig, M., 50 Bjorklund, D. F., 195, 255 Black, J. B., 330 Blades, M., 182, 185, 187 Blades, N., 253 Blaga, O. M., 294, 295 Blair, R. J. R., 207 Blairy, S., 218 Blaut, J. M., 175 Bleckley, M. K., 301 Bloom, F. E., 231, 286 Bloom, P., 2, 32
Bogardus, E. S., 40 Bolger, K. E., 116 Bond, N. W., 218 Booth, A. E., 308 Bornstein, M. H., 94, 214, 216, 293 Bortfeld, H., 28 Bottoms, B., 273 Bovy, P. H. L., 175 Boyatzis, C. J., 217 Bradley, M. M., 219, 231 Bradley, R. H., 94 Bradshaw, M. F., 289 Brainerd, C. J., 248 Braly, K., 40 Bransford, J. D., 48 Braun, I., 308 Brehl, B., 144, 145, 149, 151, 152, 154 Brennan, P. A., 353 Brent, M. R., 28 Bresler, A., 175 Breton, C., 308 Brewer, J., 305 Brewer, M. B., 40, 46, 50, 64 Brincat, S. L., 288 Britto, P. R., 98 Broadbent, D. E., 284 Broda, L. S., 188, 191, 197 Bromley, D. B., 62 Bronfenbrenner, U., 93, 112 Brooks-Gunn, J., 91–93, 96–98, 100, 104, 113, 116, 119 Brown, C., 6, 8 Brown, C. S., 44 Brown, E. D., 96, 102, 104, 105, 114, 117, 121 Brown, G. D. A., 187, 188 Brown, R., 42, 44 Brown, R. J., 50, 64 Brown, R. S., 59 Brown, V. J., 286 Bruck, M., 248–251, 253–256, 260–265, 267, 268, 270, 272 Bruner, J., 159 Buehler, C., 109 Bullier, J., 228
Author Index
Burchinal, M. R., 91, 109 Burgwyn-Bailes, E., 265, 271 Bush, N., 272 Bushnell, I. W. R., 215 Bussey, K., 72 Butterworth, G., 182, 191 Byrne, R. W., 192 C Calder, A. J., 212 Camos, V., 334 Campanella, S., 210 Campbell, S. B., 109, 116 Camras, L. A., 220 Capozzoli, M. C., 293, 295, 297, 299 Caramazza, A., 302 Card, J., 308 Carello, C., 332 Carey, S., 138 Carlson, S. M., 308 Caron, A. J., 216, 217 Caron, R. F., 216, 217 Carr, S., 189 Carriger, M. S., 293 Carver, L. J., 300 Casasola, M., 307, 349 Casey, B. J., 296, 297, 301, 302 Cashon, C. H., 349 Caspi, A., 92, 97 Cassel, W., 255 Cassia, V. M., 236 Ceci, S. J., 248–250, 253–255, 259–268, 271, 274 Chae, Y. J., 265, 268, 271 Chambers, K. E., 21, 27 Chandler, M. J., 48, 139, 160, 162 Chaput, H. H., 349, 358 Charles-Luce, J., 3, 7, 8, 11, 12, 14, 17, 21, 23, 24 Chase, W. G., 178, 183, 192 Chase-Lansdale, P. L., 114 Chazan, E., 217 Cheah, C. S., 134 Cheatham, C. L., 299
367
Chen, M., 73 Cheney, R., 148 Chi, M. T. H., 183, 266 Chomsky, N., 324–326, 351 Christiansen, M. H., 29 Chugani, H. T., 303 Chun, M. M., 223 Clark, A., 328 Clarke-Stewart, K. A., 250, 265, 273 Classen, J., 340 Clearfield, M. W., 179 Clohessy, A. B., 307 Coady, J. A., 7, 22, 24, 26, 27 Cohen, J. D., 302 Cohen, L. B., 284, 289, 299, 307, 349, 358 Cohen, N. J., 304 Cohen, Y., 287 Cohn, J., 264 Coker, D. R., 43 Colby, A., 163 Cole, R., 6, 8 Coll, C. G., 95, 96, 120 Colombo, J., 284, 285, 289, 292–295, 297, 283 Colwell, M., 199 Conger, R. D., 98, 101, 105, 116 Conning, A. M., 192 Connor, C. E., 288 Cook, T. D., 112 Cornell, E. H., 176–185, 187–199 Cosmides, L., 48 Costello, E. J., 97, 104, 114, 121 Couchoud, E. A., 217 Coupe, P., 48 Courage, M., 293 Courage, M. L., 299 Cousins, J. H., 188 Couyoumdjian, A., 289 Creel, S., 349 Crick, N. R., 134 Critchley, H. D., 230 Crocker, J., 49, 79 Cross, D., 139 Crossman, A. M., 271
368
Author Index
Crowder, R. G., 189 Crowley, K., 358 Cruz, J., 324 Csibra, G., 305 Cummings, E. M., 109 Cummins, D., 160 Curtin, S., 2, 3, 7, 12, 17–19, 29 Cuthbert, B. N., 219, 231 Cutler, A., 5 Cutting, A. L., 132 D Dabholkar, A. S., 303 Dahlstrom, A. B., 285 Dale, P., 10 Dalgleish, T., 134 Damasio, A. R., 224 Damasio, H., 232, 235 Danziger, S., 289 Darley, J. M., 131 Darvizeh, Z., 175 Datta, S., 286 Davies, D. R., 284 Davies, M., 138, 162, 231 Davies, P. T., 109 Davis, S. L., 272, 273, 275 Dawson, G., 220, 230 Day, M. C., 186 de Boysson-Bardies, B., 5, 20 de Gelder, B., 210, 213, 218, 220, 226 de Haan, M., 214, 216–219, 221, 229, 234, 236 Dearing, E., 114, 116 Deater-Deckard, K., 98, 109, 112, 119 DeGarmo, D. S., 103 Dehaene, S., 325 Dehay, C., 228 DeLoache, J. S., 249 Demetriou, H., 132 Denham, S. A., 142, 143, 162, 217 Dennett, D., 138 Dennis, M. J., 162 Derryberry, D., 309 Deschamps, J.-C., 50
Desimone, R., 223, 302 Devine, P., 43, 44, 71 Deviney, F. P., 134 Di Nocera, F., 289 Diamond, A., 300, 302, 307 Dickson, L. R., 214 Diecidue, K., 266 Dieffenbacher, J., 264 Diego, M. A., 219 Dimberg, U., 212 Dixon, J. A., 330, 332–334, 336, 340, 342, 346, 358 Doar, B., 302 Dodge, K. A., 91, 94, 119, 134, 162 Dodson, J. D., 285 Doherty, S. E., 189 Dohn, M. C., 342, 346 Doise, W., 50 Dolan, R. J., 224–226, 305 Dolgin, K., 216, 217, 219 Dollaghan, C. A., 7 Dosher, B. A., 289 Douglas, C. L., 286 Dovidio, J. F., 73 Dowd, H. J., 272 Doyle, A. B., 51, 59 Dreisch, H., 328 Driver, J., 287, 289 Dubiner, K., 293 Duckitt, J., 40 Dudgeon, P., 265 Duncan, G. J., 91–93, 96–98, 104, 113, 116, 119 Duncan, J., 302 Duncker, K., 134 Dunn, J., 103, 132, 141, 142, 159, 160, 162 Dunn, L. M., 99 Dweck, C. S., 48, 49 E Eastwood, J. D., 213 Eccles, J. S., 43, 96, 101 Eckenrode, J., 114
369
Author Index
Edwards, C. P., 175 Edwards, J., 20, 22 Edwards, L., 328 Edwards, M. L., 6 Egly, R., 287 Ehrlich, H. J., 42, 78 Eichenbaum, H., 304 Eilers, R. E., 6 Eimer, M., 289 Ekman, P., 207, 209 Elder, G. J., 93, 104 Elfenbein, H. A., 208 Elliott, L. L., 6 Elliott, S. N., 99 Elmehed, K., 212 Endres, J., 267, 271 Engle, R. W., 301 Erben, C., 267, 271 Espy, K. A., 302 Esteves, F., 211, 212, 218 Etcoff, N. L., 209, 210 Evan, K. E., 6 Evans, G. W., 107, 109, 115 Evans, J. L., 352 Everitt, B. J., 286 Ezekiel, R. S., 71 F Fabes, R. A., 43, 51 Fabricius, W. V., 139, 140 Farah, M. J., 223, 224 Faulkner, J., 56 Feigenson, L., 325 Feiring, C., 116 Fell, J., 286 Fennell, C. T., 3, 8, 10, 11, 12, 15–17, 20 Ferguson, C. A., 5, 6 Ferlazzo, F., 289 Fernald, A., 12, 13 Field, T. M., 214, 218 Fine, M. A., 103 Finnila¨, K., 256, 260, 263 Fischer, K. W., 328
Fishbein, H. D., 42, 51, 59, 69 Fisher, C., 21, 27 Fisher, R. P., 261 Fiske, S. T., 46 Fitzpatrick, P., 357 Fitzsimmons, C. M., 9 Fivush, R., 76 Flavell, J. H., 139, 197 Fletcher, P. C., 305 Flood, T. L., 192, 193, 196, 197 Fodor, J. A., 325 Foote, S. L., 286 Ford, D. H., 328 Ford, S., 144–146, 151, 157, 159 Forgatch, M. S., 103 Forman, E. M., 115 Forssberg, H., 301 Foster, D. L., 328 Fox, E., 212 Fox, N. A., 302, 304, 308, 309 Fradin, S., 271 Fraley, R. C., 272 Franks, J. J., 48 Freiwald, W. A., 286 Freundschuh, S., 175 Frick, J. E., 292, 294, 295 Friedman, O., 326 Friesen, W. V., 209 Frith, U., 162 Fuligni, A. S., 98 Funayama, E. S., 231 Furrer, S. D., 307 Fuster, J. M., 289, 302 Fuxe, K., 285 G Gaertner, L., 78 Gaertner, S. L., 73 Gagnon, A., 78 Gallese, V., 222, 341 Gallistel, C. R., 175, 180, 187 Gardner, W., 200 Garing, A. E., 191 Garlock, V. M., 7, 20, 22
370
Author Index
Garnica, O. K., 6 Garven, S., 255, 257, 263 Gathercole, S. E., 21, 22 Gatty, H., 196 Gauthier, I., 220, 224, 232, 234–236 Gauvain, M., 178, 200 Gear, R. J., 179 Geddie, L., 271 Gelade, G., 284 Gellert, E., 43 Gelman, S. A., 62, 66, 68, 70, 71 Gentner, D., 349, 350 Georgieff, M. K., 299 Gerard, J. M., 109 Gergen, K. J., 133 Gerken, L., 18 German, T. P., 326 Gershkoff-Stowe, L., 353 Gibson, E. J., 187, 195, 349 Gibson, J. J., 181, 193, 332 Giedd, J. N., 301 Giles, J. W., 268, 269 Giller, H., 119 Gillette, J., 1 Gilovich, T., 149, 153 Glenberg, A. M., 340, 341 Glimcher, P. W., 286 Gnepp, J., 142, 158 Gobbini, M. I., 222–224 Goh, J. O.-S., 289 Gold, J., 141, 142, 160 Goldberg, R. F., 302 Goldfield, E. C., 307, 328 Goldinger, S. D., 332, 334 Goldman, A. I., 138, 222 Goldman, D. Z., 302 Goldsmith, M., 289 Golinkoff, R. M., 28 Golledge, R. G., 175, 185, 188–190, 192 Gomez, R. L., 59 Goodchild, M. F., 183 Goodman, G. S., 257, 261, 262, 273 Goodman, J. C., 352 Gopnik, A., 138, 141, 268, 269, 307, 308, 326
Gordon, B. N., 262 Gottlieb, G., 324, 327–329, 353, 354 Gould, R. A., 196 Gowans, F. R., 175 Graf, P., 268 Gramzow, R. H., 78 Grant, M., 253 Greenberg, M. T., 112 Greenwald, A. G., 42, 43, 71 Gregory, A., 43 Greidanus, E., 192 Grelotti, D. J., 220, 232, 234–236 Gresham, F. M., 99 Grieve, D. W., 179 Grieve, R., 253 Gross, C. G., 228, 235 Grover, L., 191 Grow, J. G., 220 Grunedal, S., 212 Gudjonsson, G. H., 249 Gulko, L., 43, 51 Gupta, P., 22 Gutman, L. M., 96, 101 Guyer, A. E., 134 H Hadwin, J., 140 Hagell, A., 119 Hagen, E. P., 99 Hagen, J. W., 266 Hagger, C., 227 Hagoort, P., 302 Halgren, E., 224 Halit, H., 229 Hall, G. S., 193 Halle, P. A., 5, 20 Halverson, C. F., 43, 51, 60 Hamann, S. B., 232, 235 Hamilton, D. L., 42, 46, 50, 51, 64 Hammen, C., 134 Hammer, M. A., 6 Han, S., 289 Hancock, D., 353 Hansen, C., 212
371
Author Index
Hansen, M. B., 353 Hansen, R., 212 Happe, F. G. E., 162 Harlan, J. E., 292 Harman, C., 309 Harnad, S., 340, 341 Harnishfeger, K. K., 195 Harris, P. L., 132, 138–143, 158, 160, 162 Hart, A. J., 224 Hart, B., 98 Hart, R. A., 175, 176, 180, 182, 190, 200 Harter, S., 143 Hartup, W. W., 159 Hasher, L., 183 Hashtroudi, S., 268 Haskins, R., 93 Hasselmo, M. E., 223, 224 Haviland, J. M., 218 Haxby, J. V., 222–224 He, Z. J., 287 Heft, H., 182, 193 Helwig, C. C., 133, 135, 156 Hembrooke, H., 253–255, 261, 262 Henderson, L. M., 308 Henrichon, A. J., 140 Henson, R. N., 305 Herrera, C., 159 Hess, U., 218 Heth, C. D., 177–185, 187, 188, 190–193, 196–199 Hewstone, M., 44 Heyman, G. D., 66, 70, 268, 269 Hietanen, J. K., 212, 218 Hildebrandt, C., 136 Hill, K. A., 176, 193, 197–199 Hinshaw, S. P., 121 Hintzman, D. L., 334, 336 Hirschfeld, L. A., 49, 58, 61, 68, 69 Hirsh-Pasek, K., 352 Hobson, R. P., 138 Hoffman, E. A., 222–224 Hogg, M. A., 78 Hogrefe, G. J., 268 Hohne, E. A., 4 Holden, J. G., 327
Hollich, G., 3, 4, 23, 26, 27, 30, 31, 307 Houston, D. M., 4, 17 Howard, I. P., 180 Howe, M. L., 299 Howie, P. M., 272 Hsiao, S. S., 286 Hu, L., 64 Hughes, M., 253 Hume, D., 324, 325 Humphreys, G. W., 289 Humphreys, K., 229 Hurlock, E. B., 264 Huttenlocher, J., 174 Huttenlocher, P. R., 303 I Iglesias, J., 217 Ignatiev, N., 42 Imbens-Bailey, A., 139 Ingram, D., 7, 8 Inhelder, B., 52, 77, 133, 134, 187, 190 Irwing, I., 62 Ishai, A., 224 Izard, C. E., 96, 102, 104, 105, 114, 117, 121 J Jaakkola, K., 138 Jackendoff, R., 351, 352 Jackson, J. H., 382 Jacobs, J., 64 James, W., 290 Jarrett, N., 182 Jensen, O., 305 Johnson, C. N., 155 Johnson, E. K., 4, 28 Johnson, K. O., 286 Johnson, M. H., 216, 229, 234, 236, 302, 305 Johnson, M. K., 268 Johnston, K. E., 64, 76 Johnston, T. D., 328 Jones, A., 199
372
Author Index
Jones, E. E., 50 Jones, E. F., 140 Jones, L. C., 44 Jongeward, R. H., 266 Jonsson, E., 193 Joseph, M. H., 286 Jusczyk, P. W., 2–5, 7, 8, 10, 14, 17, 18, 21, 28, 32 Jussim, L. J., 59, 62 K Kagan, J., 270 Kail, R. V., 185, 266 Kalish, C. W., 132, 138, 140, 141, 156, 160 Kanaya, T., 254 Kane, M. J., 301 Kanemura, H., 303 Kanitkar, K., 71 Kannass, K. N., 297, 299, 300 Kanwisher, N. G., 223, 224, 286 Karmiloff-Smith, A., 208, 227, 235, 236, 326, 328 Karniol, R., 131, 140 Kastenbaum, R., 217 Katz, D., 40 Katz, P. A., 45, 48, 51 Kawakami, K., 73 Kawashima, R., 224 Keasey, C. B., 131, 140 Keen, R., 8 Keil, F. C., 325 Kello, C. T., 352 Kelly, J. L., 225, 234 Kennedy, H., 228 Keverne, E. B., 353 Keysers, C., 222 Kilbane, M. C., 342 Kilgore, W. D. S., 231 Killen, M., 135 Kim, K. J., 123 Kim, N.-G., 332 Kirasic, K. C., 183, 185 Kistler, D. J., 220, 221
Kitchin, R., 175 Klayman, J., 142, 158 Kleck, R. E., 218 Klein, G. W., 56 Klein, O., 43 Klingberg, T., 301 Klosson, E. C., 131 Knobloch, H., 179 Kochanoff, A., 142, 143, 162 Koenderink, M. J., 303 Kofkin, J. A., 45 Kohlberg, L., 78, 134, 135, 163 Kolb, B., 302 Koslin, S., 40 Kotovsky, L., 349, 350 Kotsoni, E., 216 Koutstaal, W., 289 Kowalski, K., 62, 71 Kramer, D. C., 341 Krogh, H. R., 43 Kubiak, P., 286 Kuchuk, A., 214 Kuhl, P. K., 2, 3, 7, 234 Kuhn, D., 148, 333, 358 Kulkofsky, S. C., 274 Kupersmidt, J. B., 91 Kurdek, L. A., 103 Kurzban, R., 48 Kuttschnitt, C., 114 Kwon, P., 268 L LaBar, K. S., 305 Ladd, G. W., 134 Lagattuta, K. H., 132, 140, 142, 160 Lahey, B. B., 92, 120 Lakoff, G., 341 Lalonde, C. E., 139, 162 Lamb, M. E., 261 Lane, S. M., 268 Lang, P. J., 219, 231 Langlois, J. H., 43 Langston, W., 341 Lansink, J. M., 295, 297
373
Author Index
LaPiere, R. T., 40 Larus, D., 262 Laupa, M., 137 Laurence, S., 325 LaValla, P., 199 Lawson, K. R., 293, 294, 297 Leaper, C., 59 Leavitt, L. A., 216, 217 LeDoux, J. E., 224, 226, 232 Lee, D., 289 Lee, Y. T., 59, 62 Leevers, H. J., 139 Lehnung, M., 192 Lehrer, R., 330 Leichtman, M. D., 259, 260, 268, 270 Lelwica, M., 218 Leontiev, A. N., 291 Lepore, S., 250 Lepore, S. J., 258, 262 Leppa¨nen, J. M., 218 Lerner, R. M., 43, 328 Leslie, A. M., 132, 139, 160, 326 Leventhal, T., 100 Levy, S. R., 48, 49 Lewis Mumford Center, 40, 68 Lewis, M. D., 116, 134 Liaw, F., 92 Liben, L. S., 43, 44, 48, 51, 52, 60, 61, 77, 78, 181 Lindauer, B. K., 249 Lindsay, D. S., 248, 262, 268 Linver, M. R., 116 Lipinski, J., 22 Lippmann, W., 40 Lisman, J. E., 305 Lively, S. E., 5, 7, 15 Livesley, W. J., 62 Lobliner, D. B., 44 Locke, J., 324 Loeber, R., 121 Loeches, A., 217 Loftus, E. F., 263 Logan, J. S., 5, 7, 15 Loomis, J. M., 181 Lover, A., 141, 158, 160
Luce, P. A., 3, 7, 8, 11, 12, 14, 17, 20, 21–24, 26 Ludemann, P., 216, 217 Luhtanen, R. K., 78, 79 Lundqvist, D., 211, 212 Lupianez, J., 289 Luria, A. R., 302 Luthar, S. S., 96 Lynch, K., 175 M Maccoby, E. E., 45 Machado, C. J., 230, 234 Mackie, D. M., 50 Macmillan, R., 114 MacWhinney, B., 352 Madole, K. L., 299, 307 Magee, J. J., 209, 210 Magnuson, J. S., 5 Magoun, H. W., 286 Maguin, E., 121 Maianu, C., 144, 151, 159 Malle, B. F., 137, 161 Malloy, L. C., 265, 273 Malpass, R. S., 257 Mandler, J. M., 76, 308 Marcus, D. J., 217 Margolis, E., 325 Markell, M., 44 Markman, E. M., 2, 9, 32, 326, 353 Marks, L. E., 289 Marmot, M., 108 Marslen-Wilson, W. D., 5 Martin, C. L., 43, 47, 51, 60, 71, 75 Mash, C., 139 Masten, A. S., 120 Matthews, J., 6, 8 Matthews, M. H., 175, 176, 182 Mattys, S. L., 3, 4, 28 Matwin, S., 145 Maurer, D., 215 Maxwell, S. E., 188 Maye. J., 59
374
Mayer, S., 92, 96 Mazziotta, J. C., 303 Mazzoni, G., 272 McAdams, D., 159 McCall, R. B., 293 McCarthy, G., 224 McCartney, K., 114 McCauley, C., 73 McCauley, C. R., 59, 62 McCauley, M. R., 261 McConahay, J. B., 71 McConnell, A. R., 64 McDermott, J., 223 McDonough, L., 76 McFarlane, F., 265 McKelvie, S. J., 210 McKenzie, B. E., 181 McLeod, J. D., 116 McLoyd, V. C., 91, 98, 108, 113, 116, 119 McMorris, B. J., 114 McMurray, B., 5, 17 McQueen, J. M., 5 McVey, V., 196 Medin, D., 68 Medina, J., 349 Meegan, S. P., 134 Melnyk, L., 255, 262, 265 Meltzoff, A. N., 218, 300, 307, 308 Menn, L., 5 Menyuk, P., 5 Merikle, P., 213 Mervis, C., 57 Messick, D. M., 50 Metsala, J. L., 6–8, 14, 20, 22 Meyer, G., 50 Miller, D. B., 328, 329 Miller, E. K., 302 Miller, S. A., 266 Milliken, B., 289 Mills, D. L., 10 Milner, D., 42 Mineka, S., 232, 234 Mintz, T. H., 29
Author Index
Mio, T., 187, 192, 195 Miscione, J. L., 155 Mishkin, M., 228, 230 Mistry, R. S., 98, 105 Mitchell, D. W., 292, 293 Mitchell, F. G., 51 Mitchell, P., 139 Moffitt, T. E., 92, 120 Monk, C. S., 231 Montague, D. P. F., 218 Montello, D. R., 185, 192, 197 Moore, G. T., 180, 190 Moore, M. K., 218, 307 Moore, R., 175, 177 Moran, P. M., 286 Morasse, I., 78 Moreno, M. A., 327 Morgan, J. L., 17, 28 Mori, M., 304 Morland, J. K., 45, 47, 70, 72 Morris, J. S., 224–226 Morse, P. A., 216, 217 Morton, J., 234, 236 Moruzzi, G., 286 Moscovitch, M., 303 Moses, L. J., 141, 308 Mrzljak, L., 303 Mullen, B., 64 Munakata, Y., 8 Mundy, P., 308 Munroe, R. H., 174, 175 Munroe, R. L., 174, 175 Munson, B., 20 Munson, J. A., 220 Muresan, R. C., 286 Myers, R. S., 216, 217
N Nagy, Z., 301 Nakayama, K., 218, 287 Namy, L. L., 349 Narumoto, J., 224 Neath, I., 189
375
Author Index
Nelson, C. A., 208, 214–221, 229–231, 236, 302, 304 Nelson, R. K., 196 Nelson-Le Gall, S. A., 131, 140 Neuberg, S. L., 64 Newcombe, N. S., 174 Newport, E. L., 4, 21, 28, 326, 349 Newsome, M., 4 Nguyen, S. P., 66 Niebur, E., 286 Nisbett, R. M., 137 Nittrouer, S., 7 Nonnemaker, J. M., 116 Noppeney, U., 302 Nordeen, E. J., 353 Nordeen, K. W., 353 Norris, D., 5 Nunez, M., 132, 140, 160 Nunner-Winkler, G., 142, 143, 158, 160 O O’Scalaidhe, S. P., 228, 235 O’Doherty, J., 224, 225 Oakes, L. M., 295, 297, 299, 300, 307 O¨hman, A., 208, 211, 212, 218, 226, 231, 232, 234 Oki, M., 231 Okuda, Y., 305 Oller, M. K., 6 Olsen, M. G., 187 Ondracek, P. J., 185 Onishi, K. H., 21, 27 Orekhova, E. V., 303 Orfield, G., 40, 68 Ornstein, P. A., 249, 262 Osborne, J. W., 43 Osterling, J. A., 220 Otis, M., 264 Otten, L. J., 305 Overton, W. F., 48 Owens, C. B., 191 Oyama, S., 328
P Pagani, L., 114, 116 Pandya, D. N., 305 Papierno, P. B., 274 Parasuraman, R., 284 Paris, S. G., 249 Park, B., 50 Park, J., 56 Parker, G. M., 64 Parker, S. W., 219, 230 Parkman, F., 175 Parrinello, R. M., 308 Pasamanick, B., 179 Pascalis, O., 229 Pascual-Leone, J., 289 Pater, J., 9, 11, 26 Patterson, C. J., 91 Patterson, G. R., 103, 116, 120 Peck, S. C., 101, 112 Perfetti, C. A., 6 Perner, J., 139, 140, 268 Peters, W., 71 Petersen, S., 287, 288, 296 Peterson, C., 253, 254, 261 Peterson, C. C., 140, 160, 162 Petrides, M., 305 Pettigrew, T. F., 67 Pettit, G. S., 91 Pezdek, K., 248, 267 Phelps, M. E., 303 Phillips, J., 302 Phillips, R. D., 217 Phillips, W., 141 Piaget, J., 52, 62, 77, 133, 134, 140, 156, 187, 324, 340 Pick, H. L., 187, 192, 193 Pillow, B. H., 139, 140 Pillsbury, W. B., 290 Pinker, S., 325 Pins, D., 289 Pinto, J. P., 12 Pipe, M. E., 254 Pisoni, D. B., 5, 7, 15, 29, 32 Pissiota, A., 231
376
Author Index
Plaut, D. C., 352 Plunkett, K. D., 3, 9, 16, 17, 19, 20 Poggenpohl, C., 267, 271 Polkinghorne, D., 159 Pollak, S. D., 219–221 Pollatsek, A., 185 Pollock, D. C., 70 Pollock, J., 324 Poole, D. A., 248, 249, 255, 262 Posikera, I. N., 303 Posner, M. I., 285, 287, 288, 296, 307 Powell, M. B., 265 Powlishta, K. K., 43, 51, 59, 69 Pressley, M., 195 Presson, C. C., 182 Price, C., 302 Principe, G. F., 259, 260 Prinstein, M. J., 134 Pritchard, M., 162 Pronin, E., 149, 153 Puce, A., 224 Q Quas, J., 273, 275 Quattrone, G. A., 50 Quine, W. V. O., 353 Quinn, P. C., 75, 76, 234, 307 R Rafal, R., 285, 287 Rajkowski, J., 286 Rakison, D. H., 307 Rathbun, K., 28 Rayner, K., 185 Reed, M. A., 309 Reid, J. B., 116 Renold, E., 72 Repacholi, B., 162 Reynolds, G. D., 293 Reznick, J. S., 76 Rhodes, G., 289 Ribordy, S., 220 Richards, J. E., 285, 293–297, 306
Richman, W. A., 297 Rieser, J. J., 180, 181, 191, 193 Risley, T. R., 98 Rizzolatti, G., 222 Robbins, T. W., 286 Robertson, D. A., 341 Robertson, L. C., 285 Roder, B., 289 Rodman, H. R., 228, 235 Roe, C., 248 Roebers, C. M., 255, 265 Rogers, S. J., 96, 112 Rogers, T. T., 289 Rogoff, B., 200 Rolls, E. T., 223, 224 Rosch, E., 57 Rose, D., 289 Rose, S. P., 328 Rose-Jacobs, R., 179 Rosen, A. C., 306 Rosenthal, S., 116 Ross, B. H., 342 Ross, D., 249, 263 Ross, L. D., 137, 149, 153, 161 Rossion, B., 289 Rothbart, M., 50 Rothbart, M. K., 291, 293, 307, 309 Routtenberg, A., 286 Rowat, W. L., 188, 191, 196, 197 Ruble, D. N., 43, 47, 48, 75 Ruff, H. A., 291–295, 297, 299, 308 Rugg, M. D., 305 Russell, J. A., 218, 232, 235 Rutland, A., 62 Rutter, M., 108, 119, 141 S Saffran, J. R., 2, 4, 7, 18, 21, 28–31, 59 Salapatek, P., 215 Salgado, P. P., 286 Salmon, K., 254 Saltarelli, L. M., 299 Samelson, F., 40
Author Index
Sameroff, A. J., 48, 92, 96, 101, 108, 109, 112, 113 Santesso, D. L., 303 Sattler, J. M., 99 Saxon, T. F., 292, 294, 295, 308 Saywitz, K. J., 261 Schacter, D. L., 249, 270 Schafer, G., 9 Schaller, M., 56 Schauble, L., 330 Schiller, P. H., 285 Schleifer, M., 131, 140 Schlisser, D., 189 Schmidt, L. A., 303 Schmidt, W. C., 289 Schmolck, H., 232 Schmuckler, M. A., 181 Schneider, W., 195, 265 Schoner, G., 357 Schoon, I., 116 Schult, C., 141 Schultz, R. T., 220, 232, 234–236 Schwanenflugel, P. J., 140 Schwartz, D. L., 330 Schwartz, I., 49 Scullin, M., 254 Seccombe, K., 96, 112 Sedlak, A. J., 136 Seefeldt, C., 43 Seifer, R., 92, 108, 109, 113 Serbin, L. A., 43, 51, 69 Seress, L., 304 Serrano, J. M., 217 Sesco, B., 258, 262 Shaddy, D. J., 297, 299, 300 Shallice, T., 305 Shannon, B., 341 Shantz, C. U., 159 Shapiro, E. G., 302 Shaver, P. R., 272 Shaw, L., 144, 146, 148, 151, 156, 158, 159 Sherif, C. W., 50 Sherif, M., 50 Sherman, S. J., 64
377
Sherman, T., 234 Shiverick, S. M., 132, 140, 160 Sholl, J., 187, 193 Shultz, T. R., 131, 140 Shvachkin, N. K., 6, 8 Shweder, R. A., 133 Siddle, D. A. T., 218 Siegal, M., 140, 160, 162 Siegel, A. W., 183, 185–188, 190 Siegler, R. S., 194, 358 Signorella, M. L., 43, 44, 48, 51, 60, 61 Silber, R., 5 Silver, M., 50 Silverman, I., 192 Simion, F., 236 Sinclair, R. J., 103 Singer, J. D., 358 Singh, L., 17 Sinha, P., 220, 221 Sisk, C. L., 328 Siskind, J. M., 28 Siwek, D. F., 286 Skelly, J. P., 228, 235 Skinner, B. F., 45, 56 Skoczylas, M. J., 184, 187, 188, 190, 191 Slater, A. M., 215 Slaughter, V., 138, 162 Smetana, J. G., 133, 137, 156 Smilek, D., 213 Smith, C., 308 Smith, C. M., 249 Smith, J., 98, 108 Smith, L. B., 328, 352, 353 Snyder, J., 116 Snyder, K., 217 Snyder, M., 43 Sodian, B., 142, 143, 158, 160 Sokol, B. W., 160, Sokolov, Y. N., 290 Somerville, S. C., 182 Sorce, J. F., 208 Sorenson, A., 187, 192, 195 Sowell, E. R., 305 Spelke, E. S., 325 Spence, C. J., 289
378
Author Index
Spence, M., 269 Spencer, C., 175, 253 Spencer, J. P., 357 Sperling, G., 284 Sprengelmeyer, R., 222 Squire, L. R., 232 Squires, S. E., 299 Sripada, C. S., 222 Stager, C. L., 3, 8, 9, 11, 14–18, 20, 26 Stangor, C., 48 Stea, D., 174, 175 Steckler, T., 286 Steele, C. M., 43 Stein, N. L., 143 Stephens, J., 175 Stern, E., 175 Stevens, A., 48 Stitt, C., 73 Stoffel, R., 199 Stone, T., 138, 162 Stoneman, Z., 114 Storkel, H. L., 23, 24, 26, 27 Strambler, M. J., 43 Stringer, M., 62 Stroessner, S. J., 49 Stroganova, T. A., 302, 303 Studdert-Kennedy, M., 7 Sugarman, S., 76 Surakka, V., 212 Sutherland, D. H., 179 Sutton, J., 162 Swim, J. K., 71 Swingley, D., 3–5, 12–17, 19, 20, 24–27, 30, 31 Syrotuck, W. G., 176, 177, 198 Szeminska, A., 187 T Tajfel, H., 49, 50, 69, 78 Tamis-LeMonda, C. S., 293 Tanenhaus, M. K., 5 Tarr, M. J., 224 Taylor, B. A., 114 Taylor, M. G., 66
Taylor, S. E., 46, 48 Tees, R. C., 3 Tellinghuisen, D. J., 295, 297, 299 Templeton, L. M., 267 Templeton, W. B., 180 Tenenbaum, H. R., 59 Teunisse, J.-P., 210, 213, 218, 220 Thal, D., 10 Thelen, E., 328, 357, 358 Thierry, K., 269 Thiessen, E. D., 4, 18–21, 28 Thomas, K. M., 231, 301 Thompson, P. M., 305 Thompson, S. K., 77 Thompson, W. C., 250 Thompson-Schill, S. L., 302 Thorndike, R. L., 99 Thunberg, M., 212 Tiesinga, P. H. E., 286 Tindal, M. A., 174, 175 Ting, C. Z., 217 Tipples, J., 212 Tjebkes, T. L., 299 Toga, A. W., 305 Toglia, M., 249, 263 Tolman, E. C., 190 Tong, F., 218 Tooby, J., 48 Torell, G., 175, 188, 194, 195 Trabasso, T., 143 Trainor, L. J., 303 Tranel, D., 224, 232, 235 Trautner, H. M., 59 Treisman, A. M., 284 Trolier, T. K., 42, 46, 50, 51 Tropp, L. R., 67 Trudeau, J. J., 336 Tsang-Tong, H. Y., 181 Tsekhmistrenko, T. A., 303, 304 Tuan, M., 42 Tuccillo, F., 340, 342 Tucker, L. A., 305 Tucker, M., 352 Tuholski, S. W., 301 Turati, C., 236
Author Index
Turiel, E., 132, 133, 135–138, 141, 145, 156, 163 Turk, A. E., 18 Turnbill, A. P., 43 Turner, E. D., 297 Turner, J. C., 49, 69, 78 Turvey, M. T., 332, 357 Tyler, D., 181 U Ungerleider, L. G., 228, 230, 285, 287, 288 Usher, M., 286 Uylings, H. B., 303 V Valentine, G., 194 Van Geert, P., 328 Van Orden, G. C., 327 Van Reken, R. E., 70 Vasil’eva, V. A., 304 Vibbert, M., 214 Vicari, S., 217 Vihman, M. M., 5, 20 Vitevitch, M. S., 20–22 Vlietstra, A. G., 291 Vrij, A., 272 Vuilleumier, P., 226, 234 Vygotsky, L. S., 290 W Wagman, J. B., 328 Wahlsten, D., 328 Wainryb, C., 135–137, 144–146, 148, 149, 151–154, 156, 157, 159, 160 Walker, A. S., 349 Walker-Andrews, A. S., 214, 218 Walley, A. C., 5–8, 10–13, 17, 19, 20, 22 Wan, X., 289 Wang, A. T., 230 Ward, A., 149, 153, 161 Ward, L. M., 286
379
Wasow, J. L., 353 Waterman, A. H., 253 Waterson, N., 5 Watson, D., 139 Waxman, S. R., 308 Webster, M. J., 228, 230, 285, 287, 288 Weiner, B., 134 Weinstein, R. S., 43 Weinstock, M., 148 Weissberg, R., 293, 297 Weisstein, N., 40 Welch-Ross, M. K., 266, 267 Wellman, H. M., 68, 138, 139–142, 150, 155 Werker, J. F., 2, 3, 7–12, 14–20, 26, 31 Werner, L. A., 2, 7, 31 Westerberg, H., 301 Westerlund, A., 217 Westervelt, V. D., 43 Whalen, P. J., 218, 226, 232, 235 Wheeler, V. A., 134 White, G., 254 White, K. S., 17 White, L., 96, 112, 249, 255 White, M., 210, 212 White, R., 42 White, S. H., 186, 187, 190 Whitesell, N. R., 143 Whiting, B. B., 175 Widen, S., 218 Wilcox, S. A., 267 Wild, H. A., 76 Wilder, D. A., 50 Wilkinson, R., 108 Willatts, P., 307 Willett, J. B., 358 Williams, J. E., 45, 47, 70, 72 Wilson, D. P., 29 Wilson, R. M., 67 Wimmer, H., 139, 268 Winocur, G., 303 Winston, J. S., 224, 225 Wismer Fries, A. B., 219 Wolfe, C. D., 301 Wood, J. M., 257
380
Author Index
Woodward, A. L., 2, 9, 28, 32 Woolley, H., 194 Woolley, J. D., 140 Wright, J. C., 291 Wright, K., 131, 140 Y Yang, T. T., 225, 231 Yeari, M., 289 Yerkes, R. M., 285 Yeung, W. J., 116 Young, A. M. J., 286 Young, A. W., 209, 210, 212, 224 Young, D., 175, 177
Young, K., 271 Young, M. F., 191 Younger, B. A., 307 Yuill, N., 140, 142, 158, 160 Yurgelun-Todd, D. A., 231 Z Zachs, R. T., 183 Zaidel, D. W., 304 Zajonc, R. B., 67 Zanna, M. P., 131 Zaragoza, M., 256, 260, 263, 268, Zupan, B. A., 134 Zurif, E. B., 302
Subject Index
C
A active minds, 154 allocentric frame, 180 always-poor, 97 amygdala, 219, 222, 224–8, 230–2, 234–6 arousal/alertness, 285 attachment style, 272 attention, 284–296, 301 allocation, 299–300 disengagement, 294 functions, 283–90 integration, 290, 300–1, 307 involuntary, 290 termination, 296–7 voluntary control, 291, 303, 306 autism, 220, 230, 235
child problem behavior, 96, 105, 110 child witness, 247–8, 250, 255 research, 247–9, 252, 260 coerced confabulation interview, 255–6, 275 cognition, 284, 289–90, 307 cognitive function, higher-order, 290 conduct-disordered children, 162 contextual co-factors, 101–8, 117–18, 122–3 contradictory evidence, 250 copy theory of mental life, the, 143 cortical structures, 222, 227–30, 234 counting-parity strategy, 332 credibility, 260, 262 D
B bearings, 174, 181, 189, 190–3 object-to-object, 190 self-to-object, 190 behavior, 132, 138, 141, 144, 146, 149–52, 156–159 behavioral development, 291 early childhood, 291, 302 infancy, 291, 293–4, 296–7, 301–3, 309 belief function, 146 beliefs, attribution of, 150 beliefs, 132–63 brain activity, face-specific, 223 brain, development, 302 brain, neural systems, 227 brainstem, neurochemistry, 285
deficiency models, 95, 120 desire, 132–3, 139–43, 149–50, 152–4, 159–60 developmental psychology, 133, 138 belief, 139, 145 one-to-one mapping, 139 perception, 131, 139, 159 disadvantaged families/children, 92, 95, 100–4, 106, 112–15, 120, 122–24 distractibility, 291, 294–5 distraction latencies, 295–6 downward drift, 97 E EEG see electroencephalograph egocentric frame, 180–1 electroencephalograph (EEG), 303, 310 381
382
Subject Index
electrophysiological studies, 228, 233 emergentist/dynamic systems approach, 352 lexicon, 353 phonology, 352 shape-bias, 353 emotional context/climate, 108 emotional reaction, 207, 218, 222 emotional tone, 256, 257 emotions, 133, 140–3, 150–5, 157–3, 309 mental state concepts, 132, 139, 141 moral development researchers, 131, 141 moral intuitions, 141 endogenous attention, 289–90, 306, 310 environmental adversity, 97, 101, 108–10, 113, 115, 118–19, 121 environmental risks, 97, 100, 112, 116 environmental variable, 113 epigenetic approach, 327–8, 330, 352–3 behavioral phenomenon, 328 hierarchical system, 324, 327, 354–5 episodic information, 249 ERP see event-related potential event-related potential (ERP), 10, 220–30 ex-poor, 97 externalizing behavior, 99, 101, 103–7, 110–11, 117–21, 123 F facial expression processing, infants, 208–9, 212–18, 220–2, 227, 230–1, 233–6 development mechanism, 233–6 discriminatory ability, 214–16 neural basis, 208–9, 220–2 social experiences, impact, 219 spontaneous reaction, 218–19 facial expression recognition, 219, 221, 236 automatic responses, 212–13, 226
cortical/subcortical route, 222, 226–7 prototypes, 210, 212–13, 234 facial muscle activity, 212, 218 facial reactions, emotion-specific, 218 familiarization, 299 family instability, 107, 111, 113–15, 123 family stress models, 98, 105, 114 figure-eight strategy, 330, 333–9, 343, 346–7, 354–6 perception–action system, 332–3, 338–41, 347, 354, 357 fMRI see functional magnetic resonance imaging focused/casual attention, 297, 299–300 form–meaning associations, 17 functional imaging studies, 223–6 functional magnetic resonance imaging (fMRI), 222, 230–1 G goal-directed behavior, 307 H habituation, 14–15, 17 habituation–recovery paradigm, 216 happy victimizer, 158, 160 harmful behavior, 144, 148, 153–5 higher cognitive function, 307 higher-order skills, 301 hypothetical transgressions, 153–4 I ignorance/misinformation, 147 infant cognition, 307 infant speech perception, 20 inhibition, 220 intellectual ability, 96, 98, 104, 116 intentionality and person perception, 131 intentions, 133, 139, 140–2, 144, 151, 153–4, 156, 159
383
Subject Index
internalizing behavior, 99, 103, 111, 107, 117–18 interpersonal conflicts, 143, 145, 149, 151, 154, 163 interpretation, 133–6, 140, 145, 151, 155–6, 162–3 interviewer bias, 248, 250–2, 262 K knowledge, 323–6, 329–30, 335, 339–40, 349, 351, 353–5, 358 L landmarks, 182–6, 189, 190, 192, 194–9 spatial relations, 184 language, 1, 308, 351 acquisition, 291 left-right strategy, 334–37, 343–46 lexical development, 308 lexical representations, 5, 19 evidence, 6 holistic nature, 5 phonetic specificity, 4–21 research, 7–12 look/visual latencies, 294 looking, 291–3 M MacArthur Communicative Development Inventory, 23 magnetic resonance imaging (MRI), 306 mapping, sound to meaning, 1 maternal attachment, 273 maternal education, 96, 100, 104, 106, 118 means-ends, 307 mediational model, 98 mediator/moderator, 98 memory, 248, 291, 300–1, 304 mental functioning, 138 mental life, 142–5, 154, 159 mental state attributions, 153
metacognitive skills, 308 mind, 143, 157–8 mind reading, 143 mnemonic abilities, 300 mnemonic functions, 304 moral conflict, 138 moral decisions, 145, 149 morality, 134, 137 moral judgments, 133–8, 145–8, 154 objects, 134 moral reasoning, 144 moral relativism, 134–5 moral situations, 147, 153, 156, 159 moral transgression, 160 morphs, 210–11 schematic faces, 212 movements, memories, 180 dead reckoning, 180, 191 MRI see magnetic resonance imaging multiple risk index, 106, 109, 111, 113–14, 121, 123 developmental effects, 112 multivariate approach, 96, 98 myelination, 305 N natural interviewing situations, 251 natural memory phenomenon, 254 necessary but not sufficient, hypothesis, 163 negative judgments, 156 neighborhood density, phonological, 19, 21–8 never-poor, 97 O object, 299 object recognition, 288 obligation, 132 P parent psychiatric morbidity, 107 passive minds, 154
384
Subject Index
Peabody Picture Vocabulary Test-Revised, 99 peer conformity pressure, 259 person variable, 113 person-centered approaches, 119–22 person, behavior, 144 PET see positron emission tomography phoneme combinations, 3, 22 phoneme discrimination, 3 phonetic detail, 5, 8, 10, 12, 16, 19 phonetic discrimination skills, 9 discontinuity, 10 phonetically similar labels, 11, 19 phonological development, 19 phonotactic patterns, 27 phonotactic probability, 21–8 physical abuse, 220–1 piloting, 187–8 context-based models, 187 intersections, 189, 191 route learning, 187–8 serial order, 187–8 place recognition, 182–6 positron emission tomography (PET), 222 poverty, 94–5, 99, 116 poverty co-factors, 96, 100–12, 116 multivariate approach, 108 poverty research, 93–100 Great Depression, 93 poverty variable, 92 pragmatics/conversational principles, 249 primary stimulus, 299 PRIMIR see Processing Rich Information from Multidimensional Interactive Representations probabilistic epigenesis, 323–4, 326 problematic children, 119 Processing Rich Information from Multidimensional Interactive Representations (PRIMIR), 17 psychological experience, 143
psychological knowledge/moral thinking, 131, 138 psychological understanding, 147 Q question, types, 252 R readiness, 285 reality, 137 real-life interaction/conflict, 149 relationship instability, 102–3, 107 repeated interviews and questions, 253–4 representational ability, 249 resilience, 120 resource limitation account, 11–12 response, 255 reward/punishment, 257 Rochester Longitudinal Study, 109 S selection, 284 selectivity hypothesis, 104, 116–18, 122 self-perceptions, 271 SES see socioeconomic status social causation model, 97 social influences, 249 social interaction, 308 social skills rating system, 99 socioeconomic status (SES), 94–5 socioemotional function, 290 sound structure, 2 infant learning, 27 sound–meaning associations, 10, 17, 31 speech, fluent, 3–4 segmentation cues, 4 speech perception skills, 7 speech sounds, lexical contexts, 19 Stager and Werker task, 14–15 standardized externalizing scores (T-scores), 120 Stanford–Binet Intelligence scale, 99 stereotype questions, 258
385
Subject Index
stimulus processing, 294 stimulus selectivity, 234 strategy development, 193–9 direction sampling, 198 disorientation, 197 look-back strategy, 196 staying put, 198 trail running, 197 verbal encoding, 195 view enhancing, 198 subjectivism, 134–5 matters of fact, 134 matters of value, 134 suggestibility, 247, 249 levels, 272, 275 measurement, 274 memory, alteration, 249 misleading questions/suggestions, 249 suggestibility, cognitive predictors, 264–9 intelligence, 265 language skills, 265 memory, 267–8 semantic knowledge, 265–6 source monitoring, 268 suggestive interviewing, 248, 250, 252–3 suggestive techniques, 252, 260 susceptibility to suggestion, 263 sustained attention, 296 symbol grounding, 341, 346 alternating-fulcrum, 346 random fulcrum, 346 T temperament, 270 temperamental adaptability, 115 theories of mind, 138 theories-of-mind research(ers), 138, 141, 160–1 theory of moral development, (Kohlberg), 134 third variable problem, 96–7, 104 transitional probabilities, 29 transparency/moral judgments, 149
U up-down strategy, 342–3 force-tracing, 342–3, 347 V ventral visual attentional pathway, 288 vigilance level, organisms, 226, 235 visual fixation paradigm, 12, 16–17, 26, 31 visual stimuli, categorization, 217 visuospatial orienting, 287 vocabulary, 19, 24 development, 7–8 size, 11 voluntary attention, 290–1 vowels, 3 W way finding, 175, 178–83, 196, 200 crow’s-flight distance, 176–8 home range, 174, 176, 193–4, 199–200 intended destination, 176 navigation, 175 outdoors, 191 skills, 179 word forms, 24, 30 overlapping, 18, 21 word learners, 17, 25 word learning/recognition studies, 3, 12–21 word–object association/pair, 8–11, 15, 27 words familiar/novel, 15–16 familiarization, 30 new, phonetic detail, 18 segmentation, 28–31 statistical segmentation, 30 understanding, 17 working memory, 301 wrongdoers, 158
This page intentionally left blank
Contents of Previous Volumes Volume 1 Responses of Infants and Children to Complex and Novel Stimulation Gordon N. Cantor Word Associations and Children’s Verbal Behavior David S. Palermo Change in the Stature and Body Weight of North American Boys during the Last 80 Years Howard V. Meredith Discrimination Learning Set in Children Hayne W. Reese Learning in the First Year of Life Lewis P. Lipsitt Some Methodological Contributions from a Functional Analysis of Child Development Sidney W. Bijou and Donald M. Baer The Hypothesis of Stimulus Interaction and an Explanation of Stimulus Compounding Charles C. Spiker The Development of ‘‘Overconstancy’’ in Space Perception Joachim F. Wohlwill Miniature Experiments in the Discrimination Learning of Retardates Betty J. House and David Zeaman AUTHOR INDEX—SUBJECT INDEX
Evidence for a Hierarchical Arrangement of Learning Processes Sheldon H. White Selected Anatomic Variables Analyzed for Interage Relationships of the Size-Size, Size-Gain, and Gain-Gain Varieties Howard V. Meredith AUTHOR INDEX—SUBJECT INDEX
Volume 3 Infant Sucking Behavior and Its Modification Herbert Kaye The Study of Brain Electrical Activity in Infants Robert J. Ellingson Selective Auditory Attention in Children Eleanor E. Maccoby Stimulus Definition and Choice Michael D. Zeiler Experimental Analysis of Inferential Behavior in Children Tracy S. Kendler and Howard H. Kendler Perceptual Integration in Children Herbert L. Pick, Jr., Anne D. Pick, and Robert E. Klein Component Process Latencies in Reaction Times of Children and Adults Raymond H. Hohle AUTHOR INDEX—SUBJECT INDEX
Volume 2 The Paired-Associates Method in the Study of Conflict Alfred Castaneda Transfer of Stimulus Pretraining to Motor Paired-Associate and Discrimination Learning Tasks Joan H. Cantor The Role of the Distance Receptors in the Development of Social Responsiveness Richard H. Walters and Ross D. Parke Social Reinforcement of Children’s Behavior Harold W. Stevenson Delayed Reinforcement Effects Glenn Terrell A Developmental Approach to Learning and Cognition Eugene S. Gollin
Volume 4 Developmental Studies of Figurative Perception David Elkind The Relations of Short-Term Memory to Development and Intelligence John M. Belmont and Earl C. Butterfield Learning, Developmental Research, and Individual Differences Frances Degen Horowitz Psychophysiological Studies in Newborn Infants S. J. Hutt, H. G. Lenard, and H. F. R. Prechtl Development of the Sensory Analyzers during Infancy Yvonne Brackbill and Hiram E. Fitzgerald
387
388
Contents of Previous Volumes
The Problem of Imitation Justin Aronfreed AUTHOR INDEX—SUBJECT INDEX
Volume 5 The Development of Human Fetal Activity and Its Relation to Postnatal Behavior Tryphena Humphrey Arousal Systems and Infant Heart Rate Responses Frances K. Graham and Jan C. Jackson Specific and Diversive Exploration Corinne Hutt Developmental Studies of Mediated Memory John H. Flavell Development and Choice Behavior in Probabilistic and Problem-Solving Tasks L. R. Goulet and Kathryn S. Goodwin AUTHOR INDEX—SUBJECT INDEX
Volume 6 Incentives and Learning in Children Sam L. Witryol Habituation in the Human Infant Wendell E. Jeffrey and Leslie B. Cohen Application of Hull-Spence Theory to the Discrimination Learning of Children Charles C. Spiker Growth in Body Size: A Compendium of Findings on Contemporary Children Living in Different Parts of the World Howard V. Meredith Imitation and Language Development James A. Sherman Conditional Responding as a Paradigm for Observational, Imitative Learning and Vicarious-Reinforcement Jacob L. Gewirtz AUTHOR INDEX—SUBJECT INDEX
Volume 7 Superstitious Behavior in Children: An Experimental Analysis Michael D. Zeiler Learning Strategies in Children from Different Socioeconomic Levels Jean L. Bresnahan and Martin M. Shapiro
Time and Change in the Development of the Individual and Society Klaus F. Riegel The Nature and Development of Early Number Concepts Rochel Gelman Learning and Adaptation in Infancy: A Comparison of Models Arnold J. Sameroff AUTHOR INDEX—SUBJECT INDEX
Volume 8 Elaboration and Learning in Childhood and Adolescence William D. Rohwer, Jr. Exploratory Behavior and Human Development Jum C. Nunnally and L. Charles Lemond Operant Conditioning of Infant Behavior: A Review Robert C. Hulsebus Birth Order and Parental Experience in Monkeys and Man G. Mitchell and L. Schroers Fear of the Stranger: A Critical Examination Harriet L. Rheingold and Carol O. Eckerman Applications of Hull–Spence Theory to the Transfer of Discrimination Learning in Children Charles C. Spiker and Joan H. Cantor AUTHOR INDEX—SUBJECT INDEX
Volume 9 Children’s Discrimination Learning Based on Identity or Difference Betty J. House, Ann L. Brown, and Marcia S. Scott Two Aspects of Experience in Ontogeny: Development and Learning Hans G. Furth The Effects of Contextual Changes and Degree of Component Mastery on Transfer of Training Joseph C. Campione and Ann L. Brown Psychophysiological Functioning, Arousal, Attention, and Learning during the First Year of Life Richard Hirschman and Edward S. Katkin Self-Reinforcement Processes in Children John C. Masters and Janice R. Mokros AUTHOR INDEX—SUBJECT INDEX
Contents of Previous Volumes
Volume 10 Current Trends in Developmental Psychology Boyd R. McCandless and Mary Fulcher Geis The Development of Spatial Representations of Large-Scale Environments Alexander W. Siegel and Sheldon H. White Cognitive Perspectives on the Development of Memory John W. Hagen, Robert H. Jongeward, Jr., and Robert V. Kail, Jr. The Development of Memory: Knowing, Knowing About Knowing, and Knowing How to Know Ann L. Brown Developmental Trends in Visual Scanning Mary Carol Day The Development of Selective Attention: From Perceptual Exploration to Logical Search John C. Wright and Alice G. Vlietstra AUTHOR INDEX—SUBJECT INDEX
Volume 11 The Hyperactive Child: Characteristics, Treatment, and Evaluation of Research Design Gladys B. Baxley and Judith M. LeBlanc Peripheral and Neurochemical Parallels of Psychopathology: A Psychophysiological Model Relating Autonomic Imbalance to Hyperactivity, Psychopathy, and Autism Stephen W. Porges Constructing Cognitive Operations Linguistically Harry Beilin Operant Acquisition of Social Behaviors in Infancy: Basic Problems and Constraints W. Stuart Millar Mother-Infant Interaction and Its Study Jacob L. Gewirtz and Elizabeth F. Boyd Symposium on Implications of Life-Span Developmental Psychology for Child Development: Introductory Remarks Paul B. Baltes Theory and Method in Life-Span Developmental Psychology: Implications for Child Development Aletha Huston-Stein and Paul B. Baltes The Development of Memory: Life-Span Perspectives Hayne W. Reese Cognitive Changes during the Adult Years: Implications for Developmental Theory and Research Nancy W. Denney and John C. Wright
389
Social Cognition and Life-Span Approaches to the Study of Child Development Michael J. Chandler Life-Span Development of the Theory of Oneself: Implications for Child Development Orville G. Brim, Jr. Implication of Life-Span Developmental Psychology for Childhood Education Leo Montada and Sigrun-Heide Filipp AUTHOR INDEX—SUBJECT INDEX
Volume 12 Research between 1960 and 1970 on the Standing Height of Young Children in Different Parts of the World Howard V. Meredith The Representation of Children’s Knowledge David Klahr and Robert S. Siegler Chromatic Vision in Infancy Marc H. Bornstein Developmental Memory Theories: Baldwin and Piaget Bruce M. Ross and Stephen M. Kerst Child Discipline and the Pursuit of Self: An Historical Interpretation Howard Gadlin Development of Time Concepts in Children William J. Friedman AUTHOR INDEX—SUBJECT INDEX
Volume 13 Coding of Spatial and Temporal Information in Episodic Memory Daniel B. Berch A Developmental Model of Human Learning Barry Gholson and Harry Beilin The Development of Discrimination Learning: A Levels-of-Functioning Explanation Tracy S. Kendler The Kendler Levels-of-Functioning Theory: Comments and an Alternative Schema Charles C. Spiker and Joan H. Cantor Commentary on Kendler’s Paper: An Alternative Perspective Barry Gholson and Therese Schuepfer Reply to Commentaries Tracy S. Kendler
390
Contents of Previous Volumes
On the Development of Speech Perception: Mechanisms and Analogies Peter D. Eimas and Vivien C. Tartter The Economics of Infancy: A Review of Conjugate Reinforcement Carolyn Kent Rovee-Collier and Marcy J. Gekoski Human Facial Expressions in Response to Taste and Smell Stimulation Jacob E. Steiner AUTHOR INDEX—SUBJECT INDEX
Volume 14 Development of Visual Memory in Infants John S. Werner and Marion Perlmutter Sibship-Constellation Effects on Psychosocial Development, Creativity, and Health Mazie Earle Wagner, Herman J. P. Schubert, and Daniel S. P. Schubert The Development of Understanding of the Spatial Terms Front and Back Lauren Julius Harris and Ellen A. Strommen The Organization and Control of Infant Sucking C. K. Crook Neurological Plasticity, Recovery from Brain Insult, and Child Development Ian St. James-Roberts AUTHOR INDEX—SUBJECT INDEX
Volume 15 Visual Development in Ontogenesis: Some Reevaluations Ju¨ri Allik and Jaan Valsiner Binocular Vision in Infants: A Review and a Theoretical Framework Richard N. Aslin and Susan T. Dumais Validating Theories of Intelligence Earl C. Butterfield, Dennis Siladi, and John M. Belmont Cognitive Differentiation and Developmental Learning William Fowler Children’s Clinical Syndromes and Generalized Expectations of Control Fred Rothbaum AUTHOR INDEX—SUBJECT INDEX
Volume 16 The History of the Boyd R. McCandless Young Scientist Awards: The First Recipients David S. Palermo Social Bases of Language Development: A Reassessment Elizabeth Bates, Inge Bretherton, Marjorie Beeghly-Smith, and Sandra McNew Perceptual Anisotropies in Infancy: Ontogenetic Origins and Implications of Inequalities in Spatial Vision Marc H. Bornstein Concept Development Martha J. Farah and Stephen M. Kosslyn Production and Perception of Facial Expressions in Infancy and Early Childhood Tiffany M. Field and Tedra A. Walden Individual Differences in Infant Sociability: Their Origins and Implications for Cognitive Development Michael E. Lamb The Development of Numerical Understandings Robert S. Siegler and Mitchell Robinson AUTHOR INDEX—SUBJECT INDEX
Volume 17 The Development of Problem-Solving Strategies Deanna Kuhn and Erin Phelps Information Processing and Cognitive Development Robert Kail and Jeffrey Bisanz Research between 1950 and 1980 on Urban–Rural Differences in Body Size and Growth Rate of Children and Youths Howard V. Meredith Word Meaning Acquisition in Young Children: A Review of Theory and Research Pamela Blewitt Language Play and Language Acquisition Stan A. Kuczaj II The Child Study Movement: Early Growth and Development of the Symbolized Child Alexander W. Siegel and Sheldon H. White AUTHOR INDEX—SUBJECT INDEX
Contents of Previous Volumes
Volume 18 The Development of Verbal Communicative Skills in Children Constance R. Schmidt and Scott G. Paris Auditory Feedback and Speech Development Gerald M. Siegel, Herbert L. Pick, Jr., and Sharon R. Garber Body Size of Infants and Children around the World in Relation to Socioeconomic Status Howard V. Meredith Human Sexual Dimorphism: Its Cost and Benefit James L. Mosley and Eileen A. Stan Symposium on Research Programs: Rational Alternatives to Kuhn’s Analysis of Scientific Progress—Introductory Remarks Hayne W. Reese, Chairman World Views and Their Influence on Psychological Theory and Research: Kuhn-Lakatos-Laudan Willis F. Overton The History of the Psychology of Learning as a Rational Process: Lakatos versus Kuhn Peter Barker and Barry Gholson Functionalist and Structuralist Research Programs in Developmental Psychology: Incommensurability or Synthesis? Harry Beilin In Defense of Kuhn: A Discussion of His Detractors David S. Palermo Comments on Beilin’s Epistemology and Palermo’s Defense of Kuhn Willis F. Overton From Kuhn to Lakatos to Laudan Peter Barker and Barry Gholson Overton’s and Palermo’s Relativism: One Step Forward, Two Steps Back Harry Beilin
391
Effects of the Knowledge Base on Children’s Memory Strategies Peter A. Ornstein and Mary J. Naus Effects of Sibling Spacing on Intelligence, Interfamilial Relations, Psychosocial Characteristics, and Mental and Physical Health Mazie Earle Wagner, Herman J. P. Schubert, and Daniel S. P. Schubet Infant Visual Preferences: A Review and New Theoretical Treatment Martin S. Banks and Arthur P. Ginsburg AUTHOR INDEX—SUBJECT INDEX
Volume 20 Variation in Body Stockiness among and within Ethnic Groups at Ages from Birth to Adulthood Howard V. Meredith The Development of Conditional Reasoning: An Iffy Proposition David P. O’Brien Content Knowledge: Its Role, Representation, and Restructuring in Memory Development Michelene T. H. Chi and Stephen J. Ceci Descriptions: A Model of Nonstrategic Memory Development Brian P. Ackerman Reactivation of Infant Memory: Implications for Cognitive Development Carolyn Rovee-Collier and Harlene Hayne Gender Segregation in Childhood Eleanor E. Maccoby and Carol Nagy Jacklin Piaget, Attentional Capacity, and the Functional Implications of Formal Structure Michael Chapman INDEX
AUTHOR INDEX—SUBJECT INDEX
Volume 21 Volume 19 Response to Novelty: Continuity versus Discontinuity in the Developmental Course of Intelligence Cynthia A. Berg and Robert J. Sternberg Metaphoric Competence in Cognitive and Language Development Marc Marschark and Lynn Nall The Concept of Dimensions in Developmental Research Stuart I. Offenbach and Francine C. Blumberg
Social Development in Infancy: A 25-Year Perspective Ross D. Parke On the Uses of the Concept of Normality in Developmental Biology and Psychology Eugene S. Gollin, Gary Stahl, and Elyse Morgan Cognitive Psychology: Mentalistic or Behavioristic? Charles C. Spiker Some Current Issues in Children’s Selective Attention Betty J. House
392
Contents of Previous Volumes
Children’s Learning Revisited: The Contemporary Scope of the Modified Spence Discrimination Theory Joan H. Cantor and Charles C. Spiker Discrimination Learning Set in Children Hayne W. Reese A Developmental Analysis of Rule-Following Henry C. Riegler and Donald M. Baer Psychological Linguistics: Implications for a Theory of Initial Development and a Method for Research Sidney W. Bijou Psychic Conflict and Moral Development Gordon N. Cantor and David A. Parton Knowledge and the Child’s Developing Theory of the World David S. Palermo Childhood Events Recalled by Children and Adults David B. Pillemer and Sheldon H. White INDEX
Volume 22 The Development of Representation in Young Children Judy S. DeLoache Children’s Understanding of Mental Phenomena David Estes, Henry M. Wellman, and Jacqueline D. Woolley Social Influences on Children’s Cognition: State of the Art and Future Directions Margarita Azmitia and Marion Perlmutter Understanding Maps as Symbols: The Development of Map Concepts Lynn S. Liben and Roger M. Downs The Development of Spatial Perspective Taking Nora Newcombe Developmental Studies of Alertness and Encoding Effects of Stimulus Repetition Daniel W. Smothergill and Alan G. Kraut Imitation in Infancy: A Critical Review Claire L. Poulson, Leila Regina de Paula Nunes, and Steven F. Warren AUTHOR INDEX—SUBJECT INDEX
Volume 23 The Structure of Developmental Theory Willis F. Overton
Questions a Satisfying Developmental Theory Would Answer: The Scope of a Complete Explanation of Development Phenomena Frank B. Murray The Development of World Views: Toward Future Synthesis? Ellin Kofsky Scholnick Metaphor, Recursive Systems, and Paradox in Science and Developmental Theory Willis F. Overton Children’s Iconic Realism: Object versus Property Realism Harry Beilin and Elise G. Pearlman The Role of Cognition in Understanding Gender Effects Carol Lynn Martin Development of Processing Speed in Childhood and Adolescence Robert Kail Contextualism and Developmental Psychology Hayne W. Reese Horizontality of Water Level: A Neo-Piagetian Developmental Review Juan Pascual-Leone and Sergio Morra AUTHOR INDEX—SUBJECT INDEX
Volume 24 Music and Speech Processing in the First Year of Life Sandra E. Trehub, Laurel J. Trainor, and Anna M. Unyk Effects of Feeding Method on Infant Temperament John Worobey The Development of Reading Linda S. Siegel Learning to Read: A Theoretical Synthesis John P. Rack, Charles Hulme, and Margaret J. Snowling Does Reading Make You Smarter? Literacy and the Development of Verbal Intelligence Keith E. Stanovich Sex-of-Sibling Effects: Part I. Gender Role, Intelligence, Achievement, and Creativity Mazie Earle Wagner, Herman J. P. Schubert, and Daniel S. P. Schubert The Concept of Same Linda B. Smith Planning as Developmental Process Jacquelyn Baker-Sennett, Eugene Matusov, and Barbara Rogoff AUTHOR INDEX—SUBJECT INDEX
Contents of Previous Volumes
393
Volume 25
Volume 27
In Memoriam: Charles C. Spiker (1925–1993) Lewis P. Lipsitt Developmental Differences in Associative Memory: Strategy Use, Mental Effort, and Knowledge Access Interactions Daniel W. Kee A Unifying Framework for the Development of Children’s Activity Memory Hilary Horn Ratner and Mary Ann Foley Strategy Utilization Deficiencies in Children: When, Where, and Why Patricia H. Miller and Wendy L. Seier The Development of Children’s Ability to Use Spatial Representations Mark Blades and Christopher Spencer Fostering Metacognitive Development Linda Baker The HOME Inventory: Review and Reflections Robert H. Bradley Social Reasoning and the Varieties of Social Experiences in Cultural Contexts Elliot Turiel and Cecilia Wainryb Mechanisms in the Explanation of Developmental Change Harry Beilin
From Form to Meaning: A Role for Structural Alignment in the Acquisition of Language Cynthia Fisher The Role of Essentialism in Children’s Concepts Susan A. Gelman Infants’ Use of Prior Experiences with Objects in Object Segregation: Implications for Object Recognition in Infancy Amy Needham and Avani Modi Perseveration and Problem Solving in Infancy Andre´a Aguiar and Rene´e Baillargeon Temperament and Attachment: One Construct or Two? Sarah C. Mangelsdorf and Cynthia A. Frosch The Foundation of Piaget’s Theories: Mental and Physical Action Harry Beilin and Gary Fireman
AUTHOR INDEX—SUBJECT INDEX
Volume 26 Preparing to Read: The Foundations of Literacy Ellen Bialystok The Role of Schemata in Children’s Memory Denise Davidson The Interaction of Knowledge, Aptitude, and Strategies in Children’s Memory Performance David F. Bjorklund and Wolfgang Schneider Analogical Reasoning and Cognitive Development Usha Goswami Sex-of-Sibling Effects: A Review Part II. Personality and Mental and Physical Health Mazie Earle Wagner, Herman J. P. Schubert, and Daniel S. P. Schubert Input and Learning Processes in First Language Acquisition Ernst L. Moerk AUTHOR INDEX—SUBJECT INDEX
AUTHOR INDEX—SUBJECT INDEX
Volume 28 Variability in Children’s Reasoning Karl S. Rosengren and Gregory S. Braswell Fuzzy-Trace Theory: Dual Processes in Memory, Reasoning, and Cognitive Neuroscience C. J. Brainerd and V. F. Reyna Relational Frame Theory: A Post-Skinnerian Account of Human Language and Cognition Yvonne Barnes-Holmes, Steven C. Hayes, Dermot Barnes-Holmes, and Bryan Roche The Continuity of Depression across the Adolescent Transition Shelli Avenevoli and Laurence Steinberg The Time of Our Lives: Self-Continuity in Native and Non-Native Youth Michael J. Chandler AUTHOR INDEX—SUBJECT INDEX
Volume 29 The Search for What is Fundamental in the Development of Working Memory Nelson Cowan, J. Scott Saults, and Emily M. Elliott Culture, Autonomy, and Personal Jurisdiction in Adolescent–Parent Relationships Judith G. Smetana
394
Contents of Previous Volumes
Maternal Responsiveness and Early Language Acquisition Catherine S. Tamis-Lemonda and Marc H. Bornstein Schooling as Cultural Process: Working Together and Guidance by Children from Schools Differing in Collaborative Practices Eugene Matusov, Nancy Bell, and Barbara Rogoff Beyond Prototypes: Asymmetries in Infant Categorization and What They Teach Us about the Mechanisms Guiding Early Knowledge Acquisition Paul C. Quinn Peer Relations in the Transition to Adolescence Carollee Howes and Julie Wargo Aikins AUTHOR INDEX—SUBJECT INDEX
Volume 30 Learning to Keep Balance Karen Adolph Sexual Selection and Human Life History David C. Geary Developments in Early Recall Memory: Normative Trends and Individual Differences Patricia J. Bauer, Melissa M. Burch, and Erica E. Kleinknecht Intersensory Redundancy Guides Early Perceptual and Cognitive Development Lorraine E. Bahrick and Robert Lickliter Children’s Emotion-Related Regulation Nancy Eisenberg and Amanda Sheffield Morris Maternal Sensitivity and Attachment in Atypical Groups L. Beckwith, A. Rozga, and M. Sigman Influences of Friends and Friendships: Myths, Truths, and Research Recommendations Thomas J. Berndt and Lonna M. Murphy AUTHOR INDEX—SUBJECT INDEX
Volume 31 Beyond Point And Shoot: Children’s Developing Understanding of Photographs as Spatial and Expressive Representations Lynn S. Liben Probing the Adaptive Significance of Children’s Behavior and Relationships in the School Context: A Child by Environment Perspective Gary W. Ladd
The Role of Letter Names in the Acquisition of Literacy Rebecca Treiman and Brett Kessler Early Understandings of Emotion, Morality, and Self: Developing a Working Model Ross A. Thompson, Deborah J. Laible and Lenna L. Ontai Working Memory in Infancy Kevin A. Pelphrey and J. Steven Reznick The Development of a Differentiated Sense of the Past and the Future William J. Friedman The Development of Cognitive Flexibility and Language Abilities Gedeon O. Dea´k A Bio-Social-Cognitive Approach to Understanding and Promoting the Outcomes of Children with Medical and Physical Disorders Daphne Blunt Bugental and David A. Beaulieu Expanding Our View of Context: The Bio-ecological Environment and Development Theodore D. Wachs Pathways to Early Literacy: The Complex Interplay of Child, Family, and Sociocultural Factors Megan M. McClelland, Maureen Kessenich, and Frederick J. Morrison AUTHOR INDEX—SUBJECT INDEX
Volume 32 From the Innocent to the Intelligent Eye: The Early Development of Pictorial Competence Georgene L. Troseth, Sophia L. Pierroutsakos, and Judy S. DeLoache Bringing Culture into Relief: Cultural Contributions to the Development of Children’s Planning Skills Mary Gauvain A Dual-Process Model of Adolescent Development: Implications for Decision Making, Reasoning, and Identity Paul A. Klaczynski The High Price of Affluence Suniya S. Luthar and Chris C. Sexton Attentional Inertia in Children’s Extended Looking at Television John E. Richards and Daniel R. Anderson Understanding Classroom Competence: The Role of Social-Motivational and Self-Processes Kathryn R. Wentzel
Contents of Previous Volumes Continuities and Discontinuities in Infants’ Representation of Objects and Events Rachel E. Keen and Neil E. Berthier The Mechanisms of Early Categorization and Induction: Smart or Dumb Infants? David H. Rakison and Erin R. Hahn AUTHOR INDEX—SUBJECT INDEX
Volume 33 A Computational Model of Conscious and Unconscious Strategy Discovery Robert Siegler and Roberto Araya Out-of-School Settings as a Developmental Context for Children and Youth Deborah Lowe Vandell, Kim M. Pierce and Kimberly Dadisman Mechanisms of Change in the Development of Mathematical Reasoning Martha W. Alibali
395
A Social Identity Approach to Ethnic Differences in Family Relationships during Adolescence Andrew J. Fuligni and Lisa Flook What Develops in Language Development? LouAnn Gerken The Role of Children’s Competence Experiences in the Socialization Process: A Dynamic Process Framework for the Academic Arena Eva M. Pomerantz, Qian Wang and Florrie Ng The Infant Origins of Intentional Understanding Amanda L. Woodward Analyzing Comorbidity Bruce F. Pennington, Erik Willcutt and Soo Hyun Rhee Number Words and Number Concepts: The Interplay of Verbal and Nonverbal Quantification in Early Childhood Kelly S. Mix, Catherine M. Sandhofer and Arthur J. Baroody AUTHOR INDEX—SUBJECT INDEX
This page intentionally left blank