Biochemistry, Sixth Edition - PDF Free Download

• • • eremy M. Berg W. H. Freem an and Company • New York SIXTH I--r-c • • Publisher: Sara Tenney Senior Acqui...

Author: Jeremy M. Berg | John L. Tymoczko | Lubert Stryer

814 downloads 3974 Views 321MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

•

•

•

eremy M. Berg

W. H. Freem an and Company • New York

SIXTH

I--r-c

• •

Publisher: Sara Tenney Senior Acquisitions Editor: Kate Ahr Marketing Managers: Sarah Martin, John Britch Senior Developmental Editor: Susan Moran Media Editor: Alysia Baker Supplements Editors: Nick Tymoczko, Deena Goldman Photo Editor: Bianca Moscatelli Design Manager: Diana Blume Text Designer: Patrice Sheridan Senior Project Editor: Georgia Lee Hadler Manuscript Editor: Patricia Zimmerman Illustrations: Jeremy Berg with Network Graphics Senior Illustration Coordinator: Bill Page Production Coordinator: Susan Wein Composition: Techbooks Printing and Binding: RR Donnelley

Library of Congress Cataloging-in-Publication Data Berg, Jeremy Mark. Biochemistry / Jeremy M. Berg, John L. Tymoczko, Lubert Stryer. . 6th ed. p. cm. Includes bibliographical references and index. ISBN 0-7167-H724-5 hardcover 1. Biochemistry. 1. Tymoczko, John L., II. Stryer, Lubert III. Title. QP514.2.S662006 572 dc22 2005052751 ISBN: 0-7167-8724-5 EAN: 9780716787242 ©2007, 2002 by W. H. Freeman and Company; © 1975, 1981, 1988, 1995 by Lubert Stryer All rights reserved Printed in the United States of America First printing W. H. Freeman and Company 41 Madison Avenue New York, NY 10010 Houndmills, Basingstoke RG21 6XS, England www.whfreeman.com

To our teachers and our students

About the Authors

JEREMY M. BERG received his B.S. and M .S. degrees in C hemi stry from Stanford (where he did research with Keith Hodgson and Lubert Stryer) and his Ph.D . in chemistry from Harvard with Richard Holm. He then completed a postdoctoral fellow ship with Carl Pabo in Biophysics at Johns H opkins University School of Medicine. He was an Assistant Professor in the Department of Chemistry at Johns Hopkins from 1986 to 1990. He then moved to Johns Hopkins University School of M edi cine as Professo r and Director of the Department of Biophysics and Biophysical Chemistry, where he remained until 2003 . In 2003, he became the Director of the National Institute of General Medical Sciences at the National Institutes of Health. He is recipient of the American C hemical Society Award in Pure C hemistry (1994 ), the Eli Lilly Award for Fundamental Research in Biological Chemistry (1995), the Maryland Outstanding Young Scientist of the Year (1995), and the Harrison Howe Award (1997 ). While at Johns Hopkins, he received the W. Barry Wood Teaching Award (selected by medical students as award recipient), the Graduate Student Teaching Award, and the Professor's Teaching Award for the Preclini cal Sciences. He is coauthor, with Stephen Lippard, of the textbook Principles of Bioinorganic Chemistry.

JOHN L. TYMOCZKO is Towsley Professor of Biology at Carleton College, where he has taught since 1976. He currently teaches Biochemistry, Biochemistry Laboratory, Oncogenes and the Molecular Biology of Cancer, and Exercise Biochemistry and coteaches an introductory course, Energy Flow in Biological Systems. Professor Tymoczko received his B.A. from the University of Chicago in 1970 and his Ph.D. in Biochemistry from the University of Chicago with Shutsung Liao at the Ben May Institute for Cancer Research. He then had a postdoctoral position with Hewson Swift of the Department of Biology at the University of Chicago. The focus of his research has been on steroid receptors , ribonucleoprotein particles, and proteolytic processing enzymes.

LUBERT STRYER is Winzer Professor of Cell Biology, Emeritus, in the School of Medicine and Professor of Neurobiology, Emeritus, at Stanford University, where he has been on the faculty since 1976. He received his M .D. from Harvard Medical School. Professor Stryer has received many awards for his research on the interplay of light and life, including the Eli Lilly Award for Fundamental Research in Biological Chemistry and the Distinguished Inventors Award of the Intellectual Property Owners' Association. He was elected to the National Academy of Sciences in 1984. He currently chairs the Scientific Advisory Boards of two biotechnology companies Affymax, Inc ., and Senomyx, Inc. and serves on the Board of the McKnight Endowment Fund for Neuroscience. The publication of his first edition of Biochemistry in 1975 transformed the teaching of biochemistry.

PREFACE he more we learn, the m ore we discover connection s threading through our biochemical world . In wri tin g th e sixth edi t ion, we have made every effort to present these connections in a way that will help first -time students of biochemistry understand the subj ect and h ow very relevan t it is to their lives.

Emphasis on Physiological Relevance Biochemistry is returning to its roots to renew the study of its role in physiology, with the tools of m olecular biology and the information gained from gene sequencing in hand. In the sixth edition, we emphasize that an understanding of biochemical pathways is the underpinning for an understanding of physiological systems. Biochemical pathways make m ore sense to students when they understand how these pathways relate to the physiology of familiar acti vities such as di gesti on, respiration, and exercise. In this edition, particularly in the chapters on m etabolism, we have taken several steps to ensure that students have a view of the bigger picture: • Discussions of m etabolic regulation emphasize the everyday conditions that determine regul ation: exercise versus rest; fed versus fasting. • New pathway-integration figures show how multiple pathways work together under a specific condition, such as during a fast. • More physiologically relevant examples have been added throughout the book. This physiological perspective is also evident in th e new chapter on drug development. The use of a foreign compound to inhibit a specific enzyme som etimes has surprising physiological consequences that reveal new physiological principles.

FAT CELL

BLOOD

FASTING or D IAB ETES

-

Glycerol

Fatty acids , . .

Triacylglycerol

G~erol

LIVER CELL

I

Fatty aCI.d S

Glycerol

Glucose

•

_+-

Fatty acids Fatty

acids

~ Ace tyl CcA

1

Ketone

bodies

HEART·MUSCLE CELL RENAL· CORTEX CELL BRAIN CELL DURING STARVATION

Ketone bodies,=

= Active pathways:

.0

Acetyl CoA

1. Fatty acid oxidation, Chapter 22 2. Formation of ketone bod ies, Cha pter 22 3. Gluconeogenesis, Chapter 16

4. Ketone bodies --7 acetyl CcA. Chapter 22 5. Cit ric acid cycle, Chapter 17 6. Oxidative phosphorylation, Chapter 18

Figure 22.21 Pathway Integration: Liver supplies ketone bodies to the peripheral t issues. During fa sting or in untreat ed diabet ics. th e liver converts fatty aci ds into keto ne bodies. whi ch are a fuel source fo r a number of ti ssues. Ket one bodies are t he predo m inant fue l during starvati on.

v

vi

Prefa ce

A Molecular Evolutionary Perspective Evolutionary perspectives greatly enabl e and enhance the study of biochemistry. As Theodosius Dobzhansky noted, "nothing in biology makes sense except in the light of evolution ." In the course of evolution, mutations altered many protei ns and biochemical motifs so that they perform different functions while maintaining their core biochemical elements. By exa mining related protein s, we hi ghli ght essenti al chemi cal features as well as the specialization necessary for particular functions. The tracks of evolution are clear from the analysis of gene and protein sequences. As seq uence analysis becomes more important, the field of biochemistry is shifting from a science performed alm ost entirel y in the laboratory to one that may also be explored through computers , by using information gathered from genomics and proteo mi cs. Thi s shi ft is manifest in th e current edi tion and can be seen most clearl y in C hapter 6, "E xploring E volution and Bioinformatics," which develops the co nceptual basis for comparing protein and nucl eic ac id sequ ences. Protein co mpari so ns are a frequent source of insight throughout the book, especiall y for illuminatin g relations between stru cture and fun ction .

• How drugs relate to other topics in the bookkineti cs, enzyme inhibitors, membrane receptors, metaboli c regulation, lipid synth esis, and signal transduction • How the body's defen ses respond to foreign compounds, especiall y the defenses provided by the biochemical pathways of xenobiotic metabolism • The importance of admini stration, distribution, metaboli sm , excretion (ADME), and toxicology in drug development • How the drug-development process works from target identification through clinical trials • H ow the concepts and tools of genomics are used in the development of drugs

New Clinical Applications W e have added a number of new exa mples from medical science to the already abundant selection of such examples (indicated by the icon above) (For a full list see p. x.) N ew topics include: • Diseases of protein misfolding (C hapter 2) • Human gene therapy (Chapter 5) • Aggregan and osteoarthritis (Chapter 11 )

New Chapters: Hemoglobin and Drug Development

• T he use of erythropoieti n (EPO) to tTeat anemia and its abuse by athletes (Chapter 11 )

Two new chapters illustrate the relation between stru cture and fun cti on by using a classic exampl e and a contemporary one.

• The use of monoclonal antibodies to target epidermal-growth -factor receptors in the treatment of colon and breast can cers (Chapter] 4)

Chapter 7: Hemoglobin: Portrait of a Protein in Action.

• Role of exercise in building defenses against superoxide radicals (Chapter 18)

This classic example, used to convey the relation between stru cture and function, returns in an expanded treatment. New insights include: • Oxygen transport during rest and d uring exercise • Th e physiology of oxygen and CO 2 transport • The mol ecular basis of sickle-cell anemia and thalassemia • Balancing the production of a and 13 chains • Newly discovered globins

• Diseases of altered ubiquination (Parkinson's disease, Angelman syndrome) (Chapter 23) • The use of the proteasome inhibitor bortezomib to treat mu ltiple myeloma (Chapter 23) • Adenosine d eaminase and severe combined immune deficiency (Chapter 25) • Much enhanced discussion of gout (Chapter 25) • Folic acid and spina bifid a (Chapter 25) • Type II diabetes (C hapter 27)

Knowledge of biochemical pathways is key to the development of new drugs such as Lipitor, Viagra, and Vioxx . In this new chapter, plentifu l case studies illustrate: Chapter 15: Drug Development.

,

• Tumor suppressor genes and p 53 (C hapter 2H) • C hemotherapy targeting DNA repair pathways (Chapter 28)

Preface

• Diseases of defective RNA spli cing, including thalassemias and retinitis pigmentosa (Chapter 29)

• The structure and function of the EGF receptor (Chapter 14)

• Innate immunity (Chapter 33)

• The structure of the ATP-ADP translocase (Chapter 18)

• •

VII

• Role of glycogen synthase kinase in glycogen regulation (Chapter 21)

Recent Advances

• The role of perlipin A in fatty acid mobilization (Chapter 22)

The sixth edition has been thoroughly updated throughout, including new discussions of the following recent advan ces:

• The newly revised structure of fatty acid synthase (Chapter 22)

• The nucleation condensation model of protein folding (Chapter 2)

• Prokaryotic and eukaryotic replication initiation (Chapter 28)

• Using MALDI-TOF mass spectrometry to identify components oflarge protein compl exes (Chapter 3)

• DNA polymerase components (Chapter 2S)

• Update on the human genome project (Chapter 5)

• The trombone model of DNA elongation (Chapter 28)

• Comparative genomics (Chapter 5)

• Promoter structure in eukaryotes (Chapter 29)

• Gene disruption by RNA interference (Chapter 5)

• Transcription initiation in eukaryotes (Chapter 29)

• Using BLAST searches (Chapter 6) • Lipid rafts (Chapter 12 )

• The carboxy-terminal domain (CTD ) of RNA polymerase (Chapter 29)

• Mechanisms of action of several types of membrane channels and pumps such as the acetylcholine receptor (Chapter 13)

• The rol e of SNARE proteins in protein targeting (Chapter 30)

• Aquaporin (Chapter 13)

• The structure of taste receptors for detecting sweetness (Chapter 32)

• The insulin receptor pathway (Chapter 14)

Insulin receptor In sulin

PIP,

PDKl (PIP-dependent kinase) Phosphoinositid e 3-kinase

IRS- I

Akt

AlP

ADP

Activated Akt

Figure 14.20 Insulin signaling. The binding of insulin to its receptor leads to a seri es of phosphorylations, resulting in the activati on of the kinase Aktl. Activated Akt1 diffuses thro ughout the cell to continue the Signal-transduction pathway.

vii i

Preface

• Living Figures for most m olecular structures now appear on the Web site in J m ol to allow stud ents to rotate 3- D molecul es and view alternative renderings online.

Visualizing Molecular Structure As in the fifth edition, all molecular structures have been selected and rendered by one of us , Jeremy Berg. T he sixth edition includes new tools to help students read and und erstand molecular stru ctures:

End-of-Chapter Problems

• A molecular model " primer" explains the different types of protein models and examines their strengths and weaknesses (appendices to Chapters 1 and 2).

In addition to general probl ems, the end-of-chapter problems include three categories to fo ster the develop ment of specific skills. • Mechanism problems ask students to suggest or elaborate a chemical mechanism .

• Figure legends direct students explicitly to the key features of a model.

• Data interpretation problems ask questions about a set of data provided in tabulated or graphic form. These problems give students a sense of how scientific conclusions are reached .

• A greater variety of types of molecular structures are represented , in cluding clearer renderings of membrane proteins. • For most molecular models, the name of the file from the Protein Data Bank is given at the end of the figure legend . This file name (also known as a PDB ID) allows the reader easy access to the file used in generating the structure from the Protein Data Bank W eb site (http ://www.rcsb. org/pdb/ ). At thi s site, a variety of tools for visualizing and analyzing the stru cture are availabl e.

(A)

• Chapter integration problems require students to use information from several chapters to reach a solution. These problems reinforce a student's awaren ess of the interconnectedness of the different aspects of biochemi stry. Brief solutions to these problems are presented at the end of the book; expanded solutions are available in the accompanying Student Companion.

(8)

Heme

Iron atom

~ Figure 2.48 Three-dimensional structure of myoglobin.

{Al A ribbon diagram shows

that the protein consists largely of a. hel ices. {Bl A space-filling model in the same orientation shows how ti ghtly packed the fol ded protein is. Notice that the heme group is nestled into a crevice in t he compact protein with on ly an edge exposed. One hel ix is blue to allow comparison of the two structural dep ictions. [Drawn from lA6N.pdb.]

Molecular Evolution This ico n signals the start of many discussions that hi ghlight protein commonalities or other molecular evolutionary insights that provide a framework to help students organize information.

Why this set of 20 amino acids' (p. 33) Is the genetic code universal? (p . 126)

Increasing sophistication of glycogen phosphorylase regulation (p. 604)

Many exons encode protein domains (p . ] 28)

The ", -amylase family (p. 606)

Fetal hemoglobins (p. 192)

A recurring motif in the activation of carboxyl groups (p. 623)

.A dditional globins (p. 197)

Prokaryotic co unterparts of the ubiquitin path way and the

proteasome (p. 655)

Catalytic triads in hydrolytic enzymes (p . 24~)

A famil y of pyridoxal -dependent enzymes (p . 660)

iVlajor classes of peptide-cleaving enzymes (p. 251 )

Evolution of the urea cycle (p. 664)

Zinc-based active sites in carbonic anhydrases (p. 258) A co mmo n catalytic core in lype II restric tion enzymes (p. 266 )

Aspartate aminotran sferase, prototype of PLP -dependent enzymes

P-Ioop NTPase domains (p . 270)

(p . 687)

A commo n catalyti c co re in protei n kinases (p. 288 )

Feedback inhibition (p . 698)

Why might human blood types differ? (p . 3] 5)

Recurring steps in purine rin g synthesis (p . 7 15)

Archaeal membranes (p. 331)

lZibonucleotide redu ctases (p . 720)

P-type ATPases (p. 354 and p. 3:;8)

Increase in urate levels during primate evol ution (p. 726)

ATP -binding cassette domains (p . 35~) Using sequence co mparison s to understand Na

The P -Ioop NTPase domain in nitrogenase (p. 6R 2)

t

and Ca

2

The cytochrome P450 superfamily (p . 752) I

channels (p . 366)

D N A polymerases (p. 794)

Small G proteins (p . 398)

Helicases (p. 798)

Evolution of metabolic pathways (p . 429 )

Thymine and fidelity of geneti c message (p.

Why is glucose a prominent fuel ? (p. 43:;)

Evolutionary relationship of recombinases and topoisomerases

A common binding site in dehydroge nases (p. 448)

~09)

(p .814)

T he maj or facilitator (MF) superfamily of transporters (p . 457)

Evolution of spliceosome-catal yzed splicing (p . 850)

Isozymic forms of lactate dehydrogenase (p. 469)

C lasses of aminoacyl-tRNA synthetases (p. 865)

Evolutionary relationship of glycolysis and gluconeogenesis

(p. 469)

Composition of t he primordal ribosom e (p . R69) Homologous G proteins (p. 877)

Decarboxylati on of ,, -ketoglutarate and pyruvate (p. 48:;)

A family of protein s with common li gand binding domains (p. 899)

Evolution of succinyl eoA synthetase (p . 487) Evolutionary history of the citric acid cycle (p. 495)

Independent evolution of DNA -binding sites of regulatory proteins (p. 900)

Endosymbiotic origins of mitochondria (p . 504)

CpG islands (p. 907)

Conservation of cytochrom e c stru cture (p. 52 0)

Iron response elements (p. 9 16)

Common features of ATP synthase and G protei ns (p. 527)

The odorant rece pto r fa mily (p. 923)

Related uncoupling proteins (p. 533)

Photoreceptor evolution (p. 936)

Evolution of chloroplasts (p. 543)

The immunoglobulin fold (p . 952)

Evolutionary origins of photosynthesis (p . 560)

Relationship of actin to hexokinase and prokaryotic proteins (p. 9Xb)

Evolu tion of the C 4 pathway (p . 576)

Tubulins in P-loop NTPase family (p. 990)

•

•

IX

Clinical Applications

..

"

T his icon signals the start of a clinical application m the text. Additi onal, bri efer cl ini cal correlati ons appear in the text as appropriate.

Diseases of protein misfolding (p. 53)

Diseases of altered ubiquination (p. 653)

Disease due to fa il ure of protein modification (p . 57 )

Protein degradatio n and the inflammatory response (p. 664 )

Antigen detection with ELi CiA (p. 87)

I n herited defects of the urea cycle (hyperammonemia) (p . 664)

Vasopressin deficiency (p. YO )

inborn errors of amino acid degrauation (p. 672 )

Human gene th erapy (p. 158)

H igh homocysteine levels and vascu lar disease (p. 693)

Sickl e-cell anemia (p. 19 5)

Inherited disorders of porphyrin metabolism (p . 704)

Thalassemia (p. 1Y6)

An ticancer drugs t hat block the synthesis of t hymidy late (p . 722)

A ction of penicillin (p . 232) Protease in hibitors (p. 253)

Adenosine deaminase and Severe combined immunodeficiency (SClD) (p. 725)

Carboni c anhydrase and osteopetrosis (p . 254)

Cout (p. 726)

Use uf i!:)Ozym es to diagno~e ti ss ue clamage (p. 28 3 )

Lesch- Nyhan syndrome (p. 726)

Emphysema (p. 292)

Folic acid and spi na bifida (p. 727)

Thromboses prevention (p . 295)

Respiratory d ist ress synd rome and Tay-Sacl1S disease (p . 738)

H emophi lia (p. 297)

D iagnostic use of blood cholesterol levels (p. 74 5)

Regu lation of blood clotting (p . 297)

Hypercholesteremia and atherosclerosis (p . 747)

A ggregan and osteoarthr itis (p. 313)

C linical management of cholesterol levels (p. 748)

J:llood groups (p. 315 )

Rickets and vitam in D (p. 754)

Use and abuse of erythropoietin (EPO) (p. 316)

ProlongeJ starvation (p . 772)

I d isease (p . 318 )

Dia betes (p. 773)

Selectins and the inflammatory response (p. 321)

Regulat ing boJ y weigh t (p . 774)

Influenza virus (p. 321)

M etabolic effects of et hano l (p. 777)

Clini cal uses oflipusumes (p. 335)

A n tibiotics that target DNA gyrase (p . 792)

Aspirin and ibuprofen (p. 339)

H untington d isease (p. 805)

Digitalis and congestive heart fa ilure (p . 357)

D efective repair of D NA and cancer (p . 810)

Mul tidrug resistan ce (p . 358)

D etection of carcinogens (Ames test) (p. 81 1)

Signal -transduction pathways and cancer (p. 400 )

A n tibiotic in hibitors of transcri ption (p. 831)

Monoclonal antibod ies as anticancer Jrugs (p . 401)

Burki tt lymphoma and B-ceilleukemia (p . 83Y)

P rotei n kinase in hibitors as anticancer drugs (p . 40 1)

D iseases of d efective R NA splicing (p . 847)

Cholera and whooping cough (p . 401 )

A ntibiotics that inhibit protein synthesis (p . 884)

Vitam in deficiencies (p . 423)

Diphther ia (p. 885)

L actose in tolerance (p. 451)

Ricin, a leth al pro tein synthesis inhibitor (p . 885)

Galactose toxicity (p. 45 1)

Anabolic steroids (p . 910)

Cancer and glycolysis (p. 457)

SERMs and breast cancer (p . 910)

Phosphatase defi ciency and lactic acidos is (p. 4n )

Color blIndness (p. 916)

Heriberi and poi soning by mercury and arsenic (p. 494 )

Use of ca psaicin in pain management (p . 941)

M itochondrial diseases (p . 534)

Innate immunity (p . 946)

H emolytic anemia (p. 586)

Immune -system s uppressants (p . 9,)9)

Glucose 6-phosphate dehydrogenase deficiency (p . 587)

M H C and transplantation rejec ti on (p . %8)

G lycoge n-storage d iseases (p. 61 1)

AID S vaccine (p. 969)

Carnitine d eficiency (p. 624)

A utoimmune diseases (p. Y7 1)

Zellweger sy nJrome (p. 630)

Immune system an d cancer (p. 971)

Diabetic ketosis (p. 6.B)

Myos ia and deahles. (p. 984)

Use of fatty ac id synthase inhibitors as Jrugs (p . 640)

Kinesia anti nervous system di sorders (p. 9R9 )

Effects of aspirin on signal ing pathways (p. 644)

' Iaxo l (p . 990)

x

Tools and Techniques The sixth ed ition of Biochemistry offers three chapters that present the tools and techniques of biochemistry: "Exploring Proteins and Proteomes" (C hapter 3), "Exploring Genes and Genomes" (Chapter 4), and "Exploring Evolution and Bioinformatics" (Chapter 6). Additional experimental tech niques are presented throughout the book, as appropriate.

Exploring Proteins and Proteomes (Chapter 3) Protein purification (p . 67) Oifferential centrifu gation (p. 67 ) Salting out (p . 68) Dial ysis (p. 69) Gel-filtration chro matograph y (p. 69 ) Ion -exchange chromatography (p. 69) Affinity chromatography (p. 70) High -pressure liquid chromatography (p . 71) Gel electrophoresis (p. 71) Isoelectri c focusing (p. 73) Two-dim ensional electrophoresis (p. 74) Qualitati ve and quantitative evaluation of protein purificati on (p . 74 ) Ultracentrifugation (p . 76) Edman degradation (p . 78) Protein sequencin g (p . 78) Production of polyclonal antibodies (p. 84) Producti on of monoclonal antibodies (p. 85) Enz yme-linked immunosorbet assay (ELISA) (p. 87) Western blotting (p . 88) Fluorescence microscopy (p . 89) Green fluo rescent protein as a marker (p . 89) [mrnunoelectron rnicroscopy (p . 89) Automated solid-p hase peptide synthesis (p . 90) MALDI -T O r mass spectrom etry (p. 93) Proteomic anal ysis by ma~~ ~pectrometry (p. 94) X-ray crystallography (p. 96) Nuclear magneti c resonance spectroscopy (p . ~8 ) NOESY spectrosco py (p . 99 )

Exploring Proteins (other chapters) Kasis of fluorescence in green fluorescent protei n (p . 58) Time-resolved crystallography (p. 21.1) Using fluorescence spectroscopy to analyze enzy me substrate in · teractions (p. 213) Using irreversible inhibitor. to map the active site (p. 228) Enzyme studies with catalytic antibodies (p . 232)

Exploring Genes and Genomes (Chapter 5) Restriction -enzyme analys is (pp. 135- 137) Southern and Northern blotting techniyues (p. 137) San ger dideoxy method of DNA seq uencing (p . 138) Solid -phase synthesis of nucleic acids (p . 139) Polymerase chain reaction (PC R) (p. 140) , Reco mbinant DNA technology (pp . 142- 159)

UNA cloning in bacteria (pp . 143 147) Mutagenesis techniques (p . 14 7) Examining expression levels (gene chips) (p . 151) Creating cDNA libraries (p . 152 ) Introducing genes into eukaryotes (p . 154) Transgenic animals (p . 155 ) Gene disruption (p . 155) Gene disruption by RNA in terference (p. 157) Tumor-inducing plasmid s (p. 157)

Exploring Genes (other chapters) Density -gradi ent equi li brium ~~dimentation (p. 113) footprinting technique for isolati ng and characterizing promoter sites (p . R24) Chromatin immunoprecipitation (C h II') (p . 906)

Exploring Evolution and Bioinformatics (Chapter 6) Sequence-comparison methods (p. 166) Sequence-align ment methods (p . 166) Estimatin g th e statisti ca l significance of alignments (by shuffling) (p . 168) Substitution matrices (p . 168 ) Performing a BLAST databa.e search (p. 171 ) Sequence templates (p. 174) I letectin g repeated motifs (p . 174) Mapping secondary structures throu gh RN A sequence comparisons (p. 176) Construction of evolu tionary trees (p. 177) Combinatorial chemistry (p . 178)

Other Techniques Sequencing of carbohydrates by using MALOI-T O F mass spectroscopy (p . 3 19) Use of liposomes to investi gate membrane permeability (p. 334) Use of hydropath y plots to locate transmembrane helices (p. 340) Fluorescence recovery after photobleachin g (FRAP) for measurin g lateral diffu sion in membranes (p . .142) Patch· clamp technique for measur ing channel activity (p. 363) Measurement of redox potential (p . 506) Functional magnetic resonance imaging (fM RI ) (p. 926) Animated Techniques: Animated explanations of experimental techniques used for exploring genes and proteins are availabl e at www.whfreeman .com/ stryer •

XI

Living Figures T hi s ico n identifies molecular structures that are avail ab le in rotatable Jmol form at on the companion W eb site: www. whfreeman .com / stryer. Structu re dictates funclion: a prote in ~u rround ing DNA Figure 2.1

Conformatianal change in lactoferrin Figure 2.3

Nonspecific and cognate DNA w ith in Eco RV end o nuclease Fig ure 9.41

Ferritin, a large ly a - hel ical pro tein Figure 2.33

A conserved structu ra l core in type II restriction enzy m es Figure 9.44

A fa tty-acid b inding protei n rich in

i3 sheets

Figu re 2.40

Loops on an antibody prote in surface Figure 2.42

T he core domain af N M P kinases Figure 9.47

An a -heli cal coiled coil Figu re 2.43

Con fo rmat ional changes in adeny late ki nase Figure 9.51

Heptad repeals in a coi led coi l protein Figu re 2.44

Three protein s containing P -Ioop NTPase domains Figu re 9.52

Three-dimen sio nal structu re of myoglobin Figure 2.48

A TCa.e Figure 10.6

Distribution of amino acids in myoglobin Figure 2.49

The active site af ATCase Figure 10.8

" l nsiJt:: out" ami no acid distri bution in perin Figure 2.50

Protein kinase A boun d to an inh ih itor Figure lO.l8

The helix -turn - helix motif Figure 2.51

Conformation s of chymotrypsinogen (red ) and chymotrypsin (bl ue) Figure 10.22

Protein domain s of the cell surface protein C ])4 Figu re 2.52 Quatern ary structure of the era protein of bac teriophage "Figure 2.53 The

Q' 2f3 2

tetramer of human hemoglobin Figure 2.54

Alternative conformation s of a peptide sequence Figure 2.60 C hemical rear rangemen t in green fl uorescent protein (GFP) Figu re 2.68 Repeating motifs in calmodulin Figure 3.25 Immu noglob ulin G antibody Figure 3.27 A n tigen{lysazyme)-a ntibody in teractions Figure 3.28

Interaction of tr ypsin w ith its inhibitor Figure 10.24 A fi bri!logen molecule Figure 10.27 The calcium -b indi ng region of prothrombin Figure 10.32 Oligosaccharides attached to erythropoietin Figure 11 .21 Structure of a C -type car bo hyd rate -binding do m ain from an a ni ma l lecti n Figure 11.26 Bacteriorhodopsin Figure 12.18 Bacterial porin (fro m W lOdopseudurnunas blastica) Figure 12.20

H2 synthase· ' to the membrane

Watson -Crick model of double- helical DNA Figure 4.11

Attachmen t of prostaglandi n Figure 12.23

RNA polymerase Figure 4.24

H ydropho bic channel of pros taglan din H 2 synt hase Figure 12.24

A protein with no natu ral counterpart Figure 5.22 Ribonuclease from cows and human beings Figu re 6.1

C alcium pu mp Figure 13.3 C onfo rmational changes in thl:: calciu m pu mp Figure 13.4

Angiogen in Figure 6.2

ABC transporter Figure 13.8

I-Tum an hem oglobin (cr chain ), human myoglobin, and lupine leg hemoglobin Figure 6.14

A lactose permease with a bound lactose analog Figure 13.11 The po ta ssium cha nnel Figure 13.17 A voltage -gated potassi um ch annel Figure 13.22 The acetylcholi ne receptor Figure 13.27 Aq uapori n Figure 13.33 7T M receptor Figure 14.4

Act in and the large frag m ent of heat shock protein 70 (H sp-70 ) Figure 6.15 Chymotrypsin and su btilisin Figure 6.18 Myoglobi n Figure 7.1 Quaternary structural c hanges in hem oglob in o n ox ygen b ind in g Figure 7.10 ~1ode of binding of2,3-BPG ta human deoxy hcmoglobin Figure 7.16

[lemoglobin S Figure 7.25 ~tablizing

free a -hemoglobin Figure 7.27

A heterotri meric G protei n Figure 14.6

Ad en ylate cycl ase activatio n Figure 14.7 El' hand Figure 14.15 C alm odu lin bi nd in g a helices Fig ure 14.16 Insul in Figu re 14.17

A co mplex of lhe enzyme cytochrome P450 and its substrate camphor Figure 8.5

Activati o n of t he insulin recepto r by phosph o rylatio n Figure 14.19

Lysozyme w ith severa l componen ts o f the active site Figu re 8.7

SH 2 d om ai n Figure 14.22

Chymotrypsin Figure 9.6

Epider ma l growth facto r Figure 14.25

Trypsin and chymotrypsin Figure 9.12

EGF receptor dimeriza tion Figure 14.27

Carboxypeptidase II Figure 9.15

The unactivated EG F recep tor Figure 14.28

Three classes o[ pro leases and their act ive sites Figure 9.l7

G rb-2 , an ad apt or protein Figure 14.29

H IV protease and its binding pocket Figure 9.19

Src Figu re 14.32

H IV protease, a dimeric aspartyl protease Figure 9.21 Human carbon ic an hydrase Ir and its zinc s ite Figure 9.22 )' -Carbon ic anhydrase Figure 9.31 Eco RV embracing a cognate DNA m olecu le Figu re 9.38

Ind uced fi t in hexokinase Figure 16.3

H ydrogen bonding in teractions between Eco RV endonuclease and its DNA substrate Figu re 9.39

,

Adenylate kinase and guanylate kinase Figu re 9.46

.. XII

T riose phosphate iso merase Figure 16.4

G lyceralde hyd e , -phos phate d ehydrogenase Figu re 16.6 )!AD+ -bi ndi ng regio n in dehyd rogen ases Figure 16.12 P hosphofructoki nase Fig ure 16.15 Biotin -binding domain of py ru vate carboxyl ase Figure 16.24

Domain structure of phosphofructokinase 2 Figure 16.29

A self- splicing intran Fi gu re 29.38

Conformational changes in citrate synthase on binding oxaloacetate Figure 17.10 Binding of citrate to the iron- sulfur complex of aconitase Figure 17.12 Succinyl CoA synthetase Figure 17.14 Q-cytochrome c oxidoreductase (cytochrome be,) Figure 18.11 Cyt ochrome c oxidase Figure 18.13 Conservation of the three-dimensional structure of cytochrome c Figure 18.21 ATP synthase Figure 18.25 ATP -ADP translocase Fig ure 18.38 Bacterial photosynthetic reaction center Figure 19.9 Photosystem II Figure 19.13 Photosystem J Figure 19.19 Ferredoxin Figure 19.21 Ferredoxin-NADl'+ reductase Figure 19.22 A light-harvestin g complex Figure 19.30 Rubisco Figure 20.3 Thioredoxin Figure 20.15 Glycogen phosphorylase Figure 21.6 Phosphorylase a and phosphorylase b Figure 21.9 Active site of methyl malonyl CoA mutase Figure 22.17 Ubiquitin Figure 23.2 Tetraubiquitin Figure 23.4 20S proteasome Figure 23.5 Proteasome evolution Figure 23.8 Biosynthesis of thiamine Figure 23.9 Aspartate aminotransferase Figure 23.12 H omologous enzymes Figure 23.19 Fe protein Figure 24.2 MoFe protein Figure 24.3 A DNA methylase bound to a target Figure 24.12 Tryptophan synthetase Figure 24.16 3-Phosphoglycerate dehydrogenase Figure 24.18 Regulatory domain formed by two subunits of 3-phosphoglycerate dehydrogenase Figure 24.20 G lutathione peroxidase Figure 24.26 Carbamoyl phosphate synthetase Figure 25.3 C hannel in carbamoyl phosphate synthetase Figure 25.4 Ribonucleotide reductase R2 subunit Figure 25.10 Structure of propeller domain Figure 26.19 B -ferm and A -form DNA Figure 28.3 Z -DNA Figure 28.8 Topoisomerase T Figure 28.11 Topoisomerase II Figure 28.13 DNA polymerase Figure 28.15 Conformational change in DNA polymerase on binding of a dNTP Figure 28.19 H elicase Figure 28.23 Conserved residues among helicases Figure 28.25 A sliding DNA clamp Figure 28.26 DNA-repair enzyme AlkA Figure 28.43 ere recombinase and topoisomerase I Figure 28.50 RNA polymerase Figure 29.1 RNA- DNA hybrid separation by a str ucture within RNA polymerase Figure 29.9 Co mpl ex formed by TATA-box-binding protein and DNA Figure 29.21 Assembly of the initiation complex Figure 29.22

tRNA Figure 30.4 Helix stacking in tRNA Figure 30.5 Active site of threonyl-tRNA synth etase Figure 30.7 Editing site in threonyl-tRNA synthetase Figure 30.8 Threonyl-tRNA synthe tase complex Figure 30.10 Classes of aminoacyl -tRNA synthetases Figure 30.12 The ribosome at high resolution Figure 30.13 Ribosomal RNA folding pattern Figure 30.14 Transfer RNA binding sites Figure 30.18 Elongation factor Tu Figure 30.23 lac repressor- DNA interactions Figure 31.2. H elix-turn-helix motif Figure 31.3 DNA recognition through 13 strands Figure 31.4 H omeodomain structure Figure 31.5

nasie -leucine zipper Figure 31.6 Zinc-finger domains Figure 31.7 The lac repressor Figure 31 .11 A dimer of CAP bound to DNA Figure 31.16 Nucleosome core particle Figure 31.20 H omologous his tones Figure 31.21 GAL4 binding sites Figure 31.23 Two nuclear hormone receptor domains Figure 31.26 T.igand binding to nuclear hormone receptor Figure 31.27 Estrogen receptor- tamoxifen complex Figure 31.29 Hi stone acetyl transferase Figure 31.30 A bromodomain Figure 31.31 Ferritin Figure 31.36 Aconitase Figure 31.39 Ankyrin repeat Figure 32.35 PAMP-recognition unit of the Toll-like receptor Figure 33.3 Immunoglobulin G Figure 33.5 Immunoglobulin fold Figure 33.12 Variable domains of the Land H chains Figure 33.13 Complex between an F ' " fragment of an antibody and its larget, phosphorylcholine Figure 33.14 Antibodies against lysozyme Figure 33.15 Antibody-lysozyme interactions Figure 33.16 C lass I M He protein Figure 33.26 C lass 1 MHC peptide-binding site Figure 33.27 T-eell receptor Figure 33.29 T-eell recep lor- Class I MHC complex Figure 33.30 The coreceptor CD8 Figure 33.31 C lass II MHC protein Figure 33.36 Coreceptor CD4 Figure 33.37 Polymorphism in class I MHC proteins Figure 33.40 HIV receptor Figure 33.42 Myosin structure at high resolution Figure 34.4 Myosin light chains Figure 34.5 "Nfyosin two -stranded coiled coil Figure 34.6 H ead domain ofkinesin at high resolution Figure 34.7 Dynein head-domain model Figure 34.8 Lever-arm motion Figure 34.9 Neck linker Figure 34.11 Actin Figure 34.15 Actin and hexokinase Figure 34.16 Tubulin Figure 34.22 FlageLlin Figure 34.26 Flagellar motor components Figure 34.28

xii i

Media and Supplements Companion Web site at www.whfreeman.com/stryer For students

• Living Figures. Every textbook illustration of a protein structure can also be viewed online in interactive 3- D using Jmol. Students can zoom and rolate the "li ve" structures to get a better understandin g of th eir three -dimensional nature and can experiment with different display styles (space-filling, ball -and- sti ck, ribbon, backbone) by means of a user -friend ly interface. • In teractive structure -based tutorials in Jmol show how structure helps explain experimental data (such as the effect of mutations, sequence variation among homologs, the effects of chemical modification, and the results of spectroscopic experiments ). The tutorials were written by N eil D. Clarke, Johns lIopkins University School of Medicine. • C oncept-based tutorials help students build intuitive understanding of some of the more difficult concepts covered in the text. The tutorials were written by Neil D. Clarke, Johns J-Jopkins University School of Medicine . • Animated techniques help students grasp experimental techniques used for explor ing gen es and proteins .

A&M U ni versity at Galveston , offers more than 1500 questions in ed itable Word format.

Instructor's Resource CD-ROM [0-7167-4590-9] The CD includes all the instructor's resources from the Web site.

Overhead Transparencies [0-7167-6049-5] 200 full- color illustrations from the textbook, optimized for classroom projection

Student Companion [0-7167-7067-9] Richard 1. G umport, College of Medicine at Urbana C hampaign , University of Illinois Frank H . D eis, Rutgers University Nancy Counts Gerber, San Francisco State University Expanded solutions to text problems provided by Roger E . Koeppe 11 , Uni versity of Arkansas

• Self-assessment tool. Students can test their understanding by taking an online multiple -choice quiz provided for each chapter, as well as a general chemistry review.

For each chapter of th e textbook, the Student Companion includes :

• G lossary of key term s. • Web links. This resource connects students with the world of biochem istry beyond the classroom.

• Self-Assessment Problems, including multiplechoice, short-answer, matching questions , and challenge problems, and their answers

For Instructors

• Expanded Solution s to text end-of-chapter problems

• Chapter Learning Objectives and Summary

All of the above pl us: • All illustrati ons and tables from the textbook, including structures from the Glossary of Compounds, in jpeg and Power Point formats, opti mi zed for classroom projection . • Lecture-ready Personal Response System ("clicker" ) q uestions. More than 100 questions for cl assroom use that will work seam lessl y with any personal response system, including i-clicker, the new radiofreq uency classroom response system being offered by W . H . Freeman and Company (www.iclicker.com). • Assessment Bank, by H arvey Nikkel of G rand Valley State U ni versity and Susan Knock of Texas ,

• XIV

Lecture Notebook [0-7167-7157-8] For students who find that they are too busy copyi ng fig ures, equations, and diagrams to follow the lecture, the Notebook is an indispensable classroom companion, with: • Illustrations and tables in the order in which they appear in the textbook, with plenty of room to take notes • Three- hole punched, perforated pages so that students can reorgan ize th e Notebook in any order necessary to follow lectures, and can insert instructor handouts

Acknowledgments Thanks go first and foremost to our students . Not a word was written or an illu stration constructed without the knowledge that bright, engaged students would immediately detect vagueness and ambiguity. We also thank our colleagues who supported, advised, instructed , and simply bore with us during this arduo us task . We are also grateful to our colleagues throughout th e world who patiently answered our questions and shared their insights into recent developments . W e thank Susan J. Baserga and Erica A. Champion of the Yal e University School of Medicine for their outstandi ng contributions in the revision of C hapter 2\). Al an Mellors of the University of G uelph , Emeritu s, d eserves our thanks for reading every chapter of page proof to chec k for accuracy. W e also especiall y thank those who served as reviewers for this new ed ition . Their thoughtful comments, suggestio ns, and encouragement have been of immense help to us in maintaining the excellence of the preceding editi o ns. These reviewers are: Steven Ackerman

BansiJhar Da tta

Anne Grave

University uf Massachusetts

Kent S tate University

Louisiana State University

John S. Anderson

Frank H . D eis

Stuart Haring

University of Minnesota

Rutgers University

Kenneth Balazovich

Zachariah Dhanarajan Plorida A & M Universi ty Preeti Dhar

University of Iowa Carver College of Medicine

University uf Michigan

Susan J. Baserga

Edward D . Harris Texas A&M University

S tate University of N ew York , New Paltz

Newton Hilliard, J r.

Huangen Ding

Gerwald J ogl

Donald C. Beitz rowa S tate Universi ty Peggy R . Borum

L ouisiana S tate Universi ty

Brown Universi ty

Joseph Eich berg

Konstantin V. Kandrar

University of Houston

Boston University School of M edicine

University of Florida

Thomas Ell enberger

A. Wali Karzai

Amanda J. Bradley

H arvard M edical School

Stony Brook University

University of British Co lumbia

Susan C. Evans

Phillip E. Klebba

Randy Brewton

Ohio University

University of Oklahoma

University of Tennessee

Ray Fall

Aileen F Knowles

Scott D. Briggs

University of Co lorado

San Diego State University

Purdue University

Wilson A. Francisco

John Koontz

Martin L. Brock

Arizona State University

University of Tennessee

Eastern Kentucky Universi ty

Terrence G. Frey

Robert Kranz

Roger Brownsey

San Diego S tate University

Washington University

University of British Columbia

K . Christopher Garcia

Min - Hao Kuo

Michael F Bruist

Stanf ord University School of Medicine

Michigan State University

University of the Sciences in Philadelphia

Ronald K. Gary

John D . Brunstein

Alexandros Georgaki las

Patrick D. Larkin A & M University, Corpus Christi Sylvia Lee-Huang

University of British Columbia

East Carolina Universi ty

Mauricio Bustos

Burt Goldberg

New York University School of Medicine

University of Maryland , Baltimore County

New York Universi ty

G len B. L egge

Michael R. G reen

University of Houston

Vince Li Cata

Howard University College of Medicine

University of Massachusetts Medical School

Larry D . Crouch

E. M. Gregory

Robley J. Light

University of Nebraska Medical Center

Virginia Tech

Florida State University

David L. Daleke

C harles B. Grissom

Xuedong Liu

Indiana Universi ty

University of Utah

University of C%mc/o, Boulder

Ya le University School of M edicine

Gai l S. Begley Northeastern University

W. Malcolm Byrnes

Universi ty of Nevada, Las Vegas

Eastern New Mexico University

'/exas

Louisiana S tate University

xv

Andy LiWang Texas A&M University Timothy M . Logan Florida State Universi ty M ichael A. Massiah Oklahoma S tate University Douglas D . M cAbee California S tate University, L ong Beach .I ames M cAfee Pittsburg S tate University Megan M. McEvoy University of Arizona Bryant W . Mil es Texas A&M University Patricia M. Moroney Louisiana State University Mike Mass in g University of Mississippi Michael P. Myers California S tate University, Long Beach Ha rry Noll er Un iversity of California , Santa Cruz Mary Kay O rgill University of Missouri O li ver E. Owen Retired clinical investigator, administrator, and academician

David C. Pendergrass Kansas University

Cynthia B. Peterson Universi ty of Tennessee Philip A. Rea University of Pennsylvania Douglas D . Root University of North Texas Robert Rosenberg Iloward University Richard L. Sabina M edical Co llege of Wisconsin Robert Sanders Universi ty of Kansas Jamie L. SehJ essman United States Naval Academy Todd P. Sil verstein Wil/amette University Melanie A. Simpson Unive·rsity oJ Nebras ka, Lincoln Kerry S. Smith Clemson Universi ty Deborah A. Spikes Stony l3rook University Takita Felder Sumler Winthrop Universi ty David C. Teller University oJ Washington

Marc E. Tischler University of Arizona Lian g Tong Co lumbia University Michael U hler University of Michigan Ronald Vale University of California, San Francisco Katherine \Vall University of Toledo Malcom Walford Rutgers University Joachim Weber Texas Tech University Tan A. Wllson The Scripps R esearch institute MareS. Wold University of Iowa Carver College of Medicine C harles Yocum University of Mi chigan RobertZand University oJ Michigan Brent M . Z nosko Saint Louis University

Working with our colleagues atW. H . Freeman and Company has been a wonderful experience. We would especially like to acknowledge the efforts of the following people. Our d evelopmental editor, Susan Moran, has contributed immensely to the success of this project . O ur project editor, Georgia Lee H adler, managed the fl ow of the project from final manuscript to final product with admirable efficiency. The careful manuscri pt editor, Patricia Z immerman, enhanced the text's literary consistency and clarity. Design manager Diana Blume produced a design and layout that are organizationally clear and esthetically pleasing. Our photo editor, Bianca Moscatelli, tenaciously tracked down new inl ages. nill Page, the illustration coordinator, ably oversaw the rendering of new illustrations, and Susan Wein, the production manager, astutely handled all the difficulties of scheduling, composition, and manufacturing. M edia editor Alysia Baker and assistant editors N ick Ty moczko and Deena Goldman were invaluable in their management of the media and supplements program. W e would also like to thank Timothy Driscoll for his work in converting our living figures into Jmo\. O ur acquisitions editor, Kate Abr, was an outstanding director of the proj ect. Her enthusiasm, encouragement, patience, and good humor kept us going when we were tired , frustrated, and discou raged. Marketing mavens John Britch and Sarah Martin oversaw the introduction of this edition to the academi c world. We also thank the sales people at W . H . Freeman and Compan y for their excellent su ggestion s and view of the market. W e thank Elizabeth Widdicombe, President of W . H . Freeman and Company, for never losing faith in us . Finally, the project would not have been possible without the unfailing support of our families specially our wives, Wendie Berg and Alison Unger. Their patience, encouragem ent, and enthusiasm have made this endeavor possible. W e also thank our children, Alex, Corey, and Monica Berg and Janina and Nicholas Tymoczko, for their forbearance and good humor and for constantly providing us a perspective on what is truly important in life . •

xVI

Brief Contents Part I THE MOLECULAR DESIGN OF LIFE

Contents Preface

v

1

Biochemistry: An Evolving Science 1

2

Protein Composition and Structure 25

3

Exploring Proteins and Proteomes 65

4

DNA, RNA, and the Flow of Genetic Information 107

Chapter 1 Biochemistry: An Evolving Science 1

5

Exploring Genes and Genomes 134

1.1 Biochemical Unity Underlies Biological Diversity 1

6

Exploring Evolution and Bioinformatics 164

1.2 DNA Illustrates the Interplay Between

7

Hemoglobin: Portrait of a Protein in Action 183

Form and Function

8

Enzymes: Basic Concepts and Kinetics 205

DNA Is Constructed from Four Building Blocks

4

9

Catalytic Strategies 241

Two Single Strands of DNA Combine to form a Double Helix

S

DNA Structure Explains Heredity and the Storage of Information

S

10 Regulatory Strategies 275 11

Carbohydrates 303

12 Lipids and Cell Membranes 326 13 Membrane Channels and Pumps 351 14 Signal-Transduction Pathways 381 Part II TRANSDUCING AND STORING ENERGY 15 Metabolism: Basic Concepts and Design 409

Part I THE MOLECULAR DESIGN OF LIFE

4

1.3 Concepts from Chemistry Explain the Properties of Biological Molecules

6

The Double H el ix Can Form from Its Component Strands

6

Covalent and Noncovalent Bonds Are Important for the Structure and Stabi lity of Biological Molecules

7

The Double Helix Is an Expression of the Rules of C hemistry

10

18 Oxidative Phosphorylation 502

The Laws of Thermodynamics Govern the Behavior of Biochemical Systems

11

19 The Light Reactions of Photosynthesis 541

Heat Is Released in the Formation of the Double H elix

12

20 The Calvin Cycle and the Pentose Phosphate Pathway 565

Acid Base Reactions Are Central in Many Biochemical Reactions

14

Acid- Base Reactions Can Disrupt the Double Helix

15

Buffers Regulate pH in Organisms and in the Laboratory

16

16 Glycolysis and Gluconeogenesis 433 17 The Citric Acid Cycle 475

21 Glycogen Metabolism 592 22 Fatty Acid Metabolism 617 23 Protein Tumover and Amino Acid Catabolism 649

1.4 The Genomic Revolution Is Transforming Biochemistry and Medicine

Part III SYNTHESIZING THE MOLECULES OF LIFE 24 The Biosynthesis of Amino Acids 679 25 Nucleotide Biosynthesis 709 26 The Biosynthesis of Membrane Lipids and Steroids 732 27 The Integration of Metabolism 760 28 DNA Replication, Repair, and Recombination 783

17

The Sequencing of the Human Genome rsa Landmark in Human History

18

Genome Sequences Encode Proteins and Patterns of Expression

18

Individuality Depends on the Interplay Between Genes and Environment

20

APPENDIX: Visualing Molecular Structures I: Small Molecules

22

Chapter 2 Protein Composition and Structure

25

29 RNA Synthesis and Processing 821 30 Protein Synthesis 857 31

The Control of Gene Expression 892

•

Part IV RESPONDING TO ENVIRONMENTAL CHANGES

2.1 Proteins Are Built from a Repertoire of 20 Amino Acids

27

32 Sensory Systems 921

2.2 Primary Structure: Amino Acids Are Linked by Peptide Bonds to Form Polypeptide Chains

34

33 The Immune System 945 34 Molecular Motors 977 35 Drug Development 1001

Proteins Have Unique Amino Acid Sequences Specified by Genes

36

• • •

xv "

I

Cont ents

Polypeptide C hains Are Flexible Yet Conformationall y Restricted

37

2.3 Secondary Structure: Polypeptide Chains Can Fold into Regular Structures Such As the Alpha Helix, the Beta Sheet, and Turns and Loops 40 The Alpha Helix Is a Coiled Structure Stabilized by Intrachain H ydrogen Bonds Beta Sheets Are Stabilized by Hydrogen Bonding Between Pol ypeptide Strands Polypeptide Chai ns Can Change Direction by Making Reverse Turns and Loops Fibrous Proteins Provide Structural Support for Cells and Tissues

40 42 44 45

2.4 Tertiary Structure: Water-Soluble Proteins Fold into Compact Structures with Nonpolar Cores 46 2.5 Quaternary Structure: Polypeptide Chains Can Assemble into Multisubunit Structures

49

Proteins Must Be Released from the Cell to lie Purified

67

Proteins Can Be Purified According to Solubility, Size, Charge, and Binding Activity

68

Proteins Can De Separated by Gel Electrophoresis and Displayed

71

A Protein Purification Scheme Can Be Quantitatively Evaluated

74

Ultracentrifugation Is Valuable for Separating Diomolecules and Determining Their Masses

76

3.2 Amino Acid Sequences Can Be Determined by Automated Edman Degradation

78

Proteins Can Be Specificall y Cleaved into Small Pep tides to Facilitate Analysis

80

Amino Acid Sequences Are Sources of Many Kinds ofI nsight

82

Recombinant DNA Technology Has Revolutionized Protein Sequencing

83

3.3 Immunology Provides Important Techniques with Which to Investigate Proteins

84

Antibodies to Specific Proteins Can Be Generated

84

Monoclonal Antibodies with Virtually Any Desired Specificity em Be Readily Prepared

85

Proteins Can Be Detected and Quantitated by Using an Enzyme· Linked [mmunosorbent Assay

R7

Western Blottin g Permits the Detection of Proteins Separated by Gel Electrophoresis

88

Fluorescent Markers Make Possible the Visualization of Proteins in the Cell

89

50

3.4 Peptides Can Be Synthesized by Automated Solid-Phase Methods

90

Amino Acids Have Different Propensities for forming Alpha Helices, Beta Sheets, and Beta Turns

52

Protein Misfolding and Aggregation Are Associated with Some Neurological Diseases

3.5 Mass Spectrometry Provides Powerful Tools for Protein Characterization and Identification

93

5.1

2.6 The Amino Acid Sequence of a Protein Determines Its Three-Dimensional Structure

Protein Folding Is a Highly Cooperati ve Process Proteins Fold by Progressive Stabilization of lntermediates Rather Than by Random Search Prediction of Three-Dimensional Structure from Seqllel1ce Remail1s a Great C hallenge

55

S5 57

Protein Modification and Cleavage Confer New C apabilities APPEND1X : Visualizing Molecular Structures II: Proteins 61

Chapter 3 Exploring Proteins and Proteomes The Proteome Is the Functional Representation of the Genome

3.1 The Purification of Proteins Is an Essential First Step in Understanding Their Function The Assay: How Do We Recognize the Protein That We Are Looking For)

The Mass of a Protein Can Be Precisely Determined by Mass Spectrometry

93

Individual Protein Components of Large Protein Complexes 94 C an FIe rdentified by MALDI · TOF Mass Spectrometry

3.6 Three-Dimensional Protein Structure Can Be Determined by X-ray Crystallography and NMR Spectroscopy

96

X -ray C rystallography Reveals Three · Dimensional Structure in Atomic Detail

96

N uelear Magnetic Resonance Spectroscopy Can Reveal the titructures of Proteins in Solution

98

Chapter 4 DNA, RNA, and the Flow of Genetic Information

107

65 66

4.1 A Nucleic Acid Consists of Four Kinds of

66 67

Bases Linked to a Sugar-Phosphate Backbone RNA and DNA Differ in the Su gar Component and One of the Bases

108 108

Contents x i x Nucleotides Are the Monomeric Units of Nucleic Acids

109

4.2 A Pair of Nucleic Acid Chains with omplementary Sequences Can Form a Double-Helical Structure 111

PCR Is a Powerful Technique in Medical Diagnostics , Forensics, and Studies of Molecular Evolution

5.2 Recombinant DNA Technology Has Revolutionized All Aspects of Biology

The Double Helix Is Stabilized by H ydrogen Bonds and Hydrophobic Interactions

1 11

The [)ouble Helix Facilitates the Accurate Transmission of Hereditary Information

11:1

Restriction Enzymes and DNA Ligase Are Key Tools in Forming Recombinant DNA Molecules Plasmids and Lambda Phage Are Choice Vectors for DNA C loning in Bacteria

The Double H elix Can He Reversibly Melted

114

Bacterial and Yeast Artificial C hromosumes

Some DNA Molecules Are Circular and Supercoiled

115

Specific Genes Can lie C loned from Digests of Genom ic Dl\'A Protein s with New Functions C an Be C reated Through Directed C hanges in [)NA

Single-Stranded Nucleic Acids C an Adopt Elaburate Structu res

4.3 DNA Is Replicated by Polymerases That Take Instructions from Templates

116

117

[)NA Polymerase Catal yzes Phosphodiester-Rond Formation

117

The Genes of Some Viruses Are Made of RNA

118

4.4 Gene Expression Is the Transformation of DNA Information into Functional Molecules

119

Several Kinds of RNA Play Key Roles in Gene Expression

119

All Cellular RNA Is Synthesized by RN A Polymerases

120

RNA Polymerases Take Instructiuns from DNA Templates 121 Transcription Begins Near Promoter Sites and Ends at Terminator Sites Transfer RNA Is the Adaptor Molecules in Protein Synthesis

4.5 Amino Acids Are Encoded by Groups of Three Bases Starting from a Fixed Point Major Features of the Genetic Code

122 123

124 125

Messenger RNA Contains Start and Stop Signals for Protein Synthesis

126

The Genetic Code Is Nearly Universal

126

4.6 Most Eukaryotic Genes Are Mosaics of Introns and Exons

127

RNA Processing Generates Mature RNA

128

Many Exons Encode Protein Domains

128

Chapter 5 Exploring Genes and Genomes

134

5.1 The Exploration of Genes Relies on Key Tools 135 Restriction Enzymes Split DNA into Specifi c Fragments

135

Restriction Fragments Can Be Separated by Gel Electrophoresis and Visualized

136

DNA C an Be Sequenced by Controlled Termination of Replication

138

DNA Pro bes and Genes Can Fle Synthesized by Automated Solid -Phase M ethods

139

Selected DNA Sequences C an Be G reatly Amplified by the Polymerase C hain Reaction

140

5.3 Complete Genomes Have Been Sequenced and Analyzed '1'he Genomes of Organisms Ranging from Bacteria to Multicellular Eukaryotes Have Been Sequenced T he Seyuencing uf the I-Iuman Genome Has lieen Finished Comparative Genomics Has Become a Powerful Research Tool Gene-Expression Levels Can He Comprehensively Studied

5.4 Eukaryotic Genes Can Be Manipulated with Considerable Precision

14 1

142 142 143 145 146 14 7

149 149 150 I 51 lSI

152

Complementary DNA Prepared frum mR NA Can Be Expressed in Host Cells

152

New Genes Inserted into Euka ryotic Cells Can Be Efficiently Expressed

154

Transgenic Animals H arbor and Express Genes That Were Introduced into Their Germ Lines Gene Disruption Provi des C lues to Cene Function RNA Interference Provides an Additional 1001 for Disrupting Gene Expression Tumor-Ind ucing Plasm ids C an Be Used to Introduce New Genes into Plant Cell s

155 155 157 157

Human Gene Therapy Holds Great Promise for M edicine

158

Chapter 6 Exploring Evolution and Bioinformatics

164

6.1 Homologs Are Descended from a Common Ancestor

165

6.2 Statistical Analysis of Sequence Alignments Can Detect Homology The Statistical Significance of Alignments Can Be Estimated by Shuffling Di stant Evolutionary Relationships C an Be Detected Through the Use of Substitution Matrices Databases Can Be Searched to Identify Homologous Sequences

166 168 168 171

xx

Contents

6.3 Examination of Three-Dimensional Structure Enhances Our Understanding of Evolutionary Relationships 172 Tertiary Structure 1s More Conserved Than Pri mary Structure Knowledge of Three- Dimensional Structures Can Aiu in the Evaluation of Sequence Alignments Repeated Motifs C an Be Detected by Aligning Sequences with Them selves Convergent Evol ution Jllustrates Common Solutions to Biochemical C hallenges

1 73 174

178

Ancient DNA C an Sometimes [Je Amplified and Sequenced

178

Molecular Evolution Can Be Examined Experimentally

178

Human Hemoglobin Is an Assembly of Four M yoglobin -like Subunits

7.2 Hemoglobin Binds Oxygen Cooperatively Oxygen Binding Markedly C hanges the Quaternary Structure of H emoglobin Hemoglobin Cooperativity Can Be Potentiall y Explai ned by Several Models Stru ctural C hanges at the Heme Croups Are Transm itted to the Uti3 t-<>Al 2 Interface 2,3-Bisphosphoglycerate in Red Cells Is Crucial in D etermining the Oxyge n Affinity of H emoglobin

183 184 185 186

187

196

The Accumulation of Free Alpha-Hemoglobin C hains Is Preventeu

197

199

Chapter 8 Enzymes: Basic Concepts and Kinetics

205

8.1 Enzymes Are Powerful and Highly Specific Catalysts

206

Many Enzymes Require Cofactors fur Activity

207

Enzymes May Tran sform Energy from One Form into Another

207

8.2 Free Energy Is a Useful Thermodynamic Function for Understanding Enzymes

208

The Free-Energy C hange Provides Jn fo rmation About t he Spontaneity but Not the Rate of a Reaction

208

T he Standard Free-Energy C hange of a Reaction Js Related to the Equilibrium Constant

208

Enzymes Alter O nl y the Reaction Rate and Not the Reaction Equilibrium

210

8.1 Enzymes Accelerate Reactions by Facilitating the Formation of the Transition State

211

188

The Formation of an Enzyme- Substrate Complex Js the First Step in Enzymatic C atalysis

213

189

T he Active Sites of Enzymes Have Some Common Features

21 4

190

The Bind ing Energy Between Enzyme and Substrate Is Important for Catalysis

215

190

8.4 The Michaelis-Menten Equation Describes the Kinetic Properties of Many Enzymes

216

7.3 Hydrogen Ions and Carbon Dioxide Promote t he Release of Oxygen: The Bohr Effect 192

o

Thalassemi" Is Caused by an Imbalanced Production of Hemoglo bin Chain s

17"

177

The Structure of M yoglobin Prevents the Release of Reactive Oxygen Species

195

APPENOlX: Binding M odels Can Be Formulated in Q uantitative Terms : The Hill Plot and the Concerted Model

6.4 Evolutionary Trees Can Be Constructed on the Basis of Sequence Information

7.1 Myoglobin and Hemoglobin Bind Oxygen at Iron Atoms in Heme

Sickle-Cell Anemia Results from the A ggregation of Mutated Deoxyhemogl obin Molecul es

Additional Globin s Are Encoded in the [-I uman Genome 197

176

Chapter 7 Hemoglobin: Portrait of a Protein in Action

194

174

Comparison of RNA Sequences Can Be a Source of Insight into RNA Seconda ry Structures

6.5 Modern Techniques Make the Experimental Exploration of Evolution Possible

7.4 Mutations in Genes Encoding Hemoglobin Subunits Can Result in Disease

Kinetics Is the Study of Reaction Rates

216

The Steauy -State Assumption Facilit"tes a Descrip tion of Enzyme Kinetics

217

KM and Vmax Values Can Be D etermined by Several Means

220

KM and Vm . . Values Are Important Enzyme

o , Body tissue

Blood capil!ary

C haracteristics

220

k",IKM Is a Measure of Catalytic Efficiency Most Biochemical Reaction s Include Multiple Substrates Allosteric Enzymes Do Not Obey Michaelis- Menten Kinetics

221

223 224

Contents x X i

8.5 Enzymes Can Be Inhibited by Specific Molecules Reversible Inhi bitors Are Kinetically Dist;nguishable Irreversible Inhibitors Can Be Used to Map the Active Si te Transition-State Analogs Are Potent Inhibitors of Enzy mes Catalytic Antibodies Demonstrate the Importance of Selective Binding of the Transition State to Enzy mat ic Activity Penici llin Irreversibly Inacti vates a Key Enzyme in Bacterial Cell -\ \1al l Synth esis APPENDIX: Enzymes Are C lassified on the Basis of the Types of Reactions That They Catal yze

Chapter 9 CatalytiC Strategies

225 226 228 231

232 232

236

Type H Restriction E nzymes H ave a Catalytic Core in Common and Are Probabl y Related b y Horizontal Gene Transfer

266

9.4 Nucleoside Monophosphate Kinases Catalyze Phosphoryl-Group Transfer Without Promoting Hydrolysis

267

NMP Kinases Are a Fam ily of Enzymes Containi ng P-Loop Structures

267

Magnesium or Manganese Complexes of Nucleoside

Triphosphates Are the True Substrates for E ssentially A ll t-:TP -Dependent Enzymes

26 8

ATP Binding Induces Large Conformational Changes

269

P -L oop NTPase Domains Are Present in a Range of Important Proteins

270

241 Chapter 10 Regulatory Strategies

A few Basic Catalytic Principles Are Used by Many E nzymes

242

9.1 Proteases Facilitate a Fundamentally Difficult Reaction

243

Chymotrypsin Possess a Highl y Reacti ve Serine Residue Chymotrypsi n Action Proceed in Two Steps Linked by a Covalentl y Bound Intermediate Serine Is Part of a Catalytic Triad That Also Includes Histidine and Aspartate Catalytic Triads A re Found in Other H ydroly tic Enzymes The Catal yt ic Triad Has Been Dissected b y SiteDirected Mutagenesis

243 244 245

248 250

275

10.1 Aspartate Transcarbamoylase Is Allosterically Inhibited by the End Product of Its Pathway 276 All osterically Regul ated Enzymes Do Not follow Michaelis- Menten Kinetics

277

ATCase Consists of Separable Catalytic and Regulatory Subuni ts

27 7

Allosteric Interac tion s in ATCase Are M ediated by Large C hanges in Quaternary Structure

278

Allosteric [{egulators Mod ulate the T-to- R Equilibrium

281

10.2 Isozymes Provide a Means of Regulation Specific to Distinct Tissues and Developmental Stages

283 283

Cysteine. Aspartyl. a nd Metalloproteases Are O ther Major C lasses of Peptide-Cleaving Enzymes

2 51

Protease Inhibitors Are Impor tan t Drugs

253

10.3 Covalent Modification Is a Means of Regulating Enzyme Activity

254

Phosphorylation Is a Highl y Effective !V!eans of Regulating the Activities of Target Proteins

2H4

Carbonic Anhydrase Contains a Bound Zinc Ion Essential for Catalyti c Activity

255

Cyclic AMP Activates Protein Kinase A by Altering the Quaternary Structure

2X7

Catalysis Entails Zin c Activation of a Water Molecule

256

ATP and the Target Protein Bind to a Deep Cleft in the Catalytic Su bunit of Protein Kinase A

288

9.2 Carbonic Anhydrases Make a Fast Reaction Faster

A Proton Shuttle facilitates Rapid Regeneration of the Active f orm of the Enzyme Convergent Evolution Has Generated Z inc- Based Active Sites in Different Carbonic Anhydrases

257 258

9.3 Restriction Enzymes Perform Highly Specific DNA-Cleavage Reactions 259 Cleavage Is by fn -Line Displacement of 3' -Oxygen from Phosphorus by IVlagnesium -Activated Water Restriction Enzymes Require Magnesium for Catal ytic Activity The Compl ete Catalytic Aparatus Is Assembled O nl y Within Complexes of Cognate DNA M olecules. Ensuring Specificity

260 262

263

10.4 Many Enzymes Are Activated by Specific Proteolytic Cleavage

288

Chymotrypsinogen Is Activated by pecific C leavage of a Single Peptide Bond

289

Proteolytic A ctiva tion of C hymotrypsinogen Leads to the Formation of a Substrate-Binding Site

290

The Generation o f Trypsin from Trypsinogen L eads to the Activation of Other Zymogens

291

Some Proteolytic Enzymes Have Specif ic Inhibitors

291

Blood Clotting Is Accomplished by a Cascade of Zymogen Activations

293

Fibrinogen Is Converted by Thrombin into a Fibrin Clot

29 3

• •

x X II

Contents

Prothrombin h Readied for Activation by a Vitamin K-D ependenl Modification

295

Hemophilia Revealed an F.arly Step in C lotting

296

The Clolling Process Must Be Regulated

296

Chapter 11 Carbohydrates

Fatty Acids Vary in C hain Length and Degree of

303

11.1 Monosaccharides Are Aldehydes or Ketones with Multiple Hydroxyl Groups 304 Pentoses and Hexoses Cyclize to Form Furanose and Pyranose Rings

306

Pyra nose and Furanose Rings C an Assum e Different Conformati ons

:lOX

Monosaccharides Are Joined to Alcohols and Amines Through Glycosidic Bonds Phosphoryl ated Sugars Are Key Intermediates in Energy Generation and Biosyntheses

11.2 Complex Carbohydrates Are Formed by the Linkage of Monosaccharides

309 310

310

Sucrose. Lactose. and Maitose Are the Common Di saccharides

310

Glycogen and Starch Are Mobil izable Stores of Glucose Cellulose. the Major Structural Pol ymer of Plants. Consists of Linear C hains of Gl ucose Units Glycosaminoglycans Are Anionic Polysaccharide Chains Made of Repeating Disaccharide Units Specific Enzymes Are Responsible for Oligosaccharide Assembly

11.3 Carbohydrates Can Be Attached to Proteins to Form Glycoproteins Carbohydrates C an Be Linked to Proteins Through Asparagine (N-Linked) or Through Serine or Threonine to -Linked) Residues

331 331 331 332

12.3 Phospholipids and Glycolipids Readily Form 333 Bimolecular Sheets in Aqueous Media Lipid Vesicles Can Be Formed from Phospholipids Lipid Bilayers Are Highly Impermeable to fons and Most Polar Molecules

12.4 Proteins Carry Out Most Membrane Processes

334 335

336

312

Proteins interact with M embranes in a Variety of Ways

336

312

Some Proteins Associate with Membranes Through Covalently Attached Hydrophobic Croups

340

314

Transmembrane Helices Can De Accuratel y Predicted from Amino Acid Sequences

340

316

316

3 18

Oligosaccharides Can Be "Sequenced "

319

320

Lectins Promote [nteraction Between Cells

320

Innuenza Virus Binds to Sialic Acid Residues

32 1

12.1 Fatty Acids Are Key Constituents of Lipids Fatty Acid Names Are Based on Their Parent Hydrocarbons

329

336

Errors in Gl ycosylation Can Result in Pathological Conditions

Many Common Features Underlie the Diversity of Biological Membranes

Phospholipids Are the Major Class of Membrane Lipids M embrane Lipids Can Include Carboh ydrate Moieties C holesterol [s a Lipid Based on a Steroid Nucleus Archaei Membranes Are Built from Ether Lipids with Branched C hains A M embrane Lipid Is an Amphipathic Molecule Containing a H ydrophilic and a H ydrophobic Moiety

329

31 1

317

Chapter 12 Lipids and Cell Membranes

12.2 There Are Three Common Types of Membrane Lipids

32X

Proteins Associate with the Lipid Bilayer in a Variety of Ways

Protein G lycosylation Takes Place in the Lumen of the Endoplasmic Reticulum and in the Golgi Complex

11.4 Lectins Are Specific Carbohydrate-Binding Proteins

U nsaturation

326 327

12.5 Lipids and Many Membrane Proteins Diffuse 342 Rapidly in the Plane of the Membrane The Fluid Mosaic Model Allows Lateral Movement but Not J{otation Through the Membran e

343

Membrane Fluidity Is Controlled by Fatty Acid Composition and C holesterol Content

343

All Hiological Membranes Are Asymmetric

345

12.6 Eukaryotic Cells Contain Compartments Bounded by Internal Membranes

345

Chapter 13 Membrane Channels and Pumps

351

The Expression of Transporters Largely Defines the Metabolic Activities of a Gi ven Cell Type

13.1 The Transport of Molecules Across a Membrane May Be Active or Passive

352

352

327

M an y Molecules Require Protein Transporters to Cross Membranes

352

327

Free Energy Stored in Concentration Gradients C an lie Quantified

353

••

•

. Contents x X III

13.2 Two Families of Membrane Proteins Use ATP Hydrolysis to Pump Ions and Molecules Across Membranes 354 P-Type ATPases Couple Phosphorylation and Con formational Changes to Pump Calcium Ions Across Membranes Digitalis Specifically Inhibits the Na I K + Pump by Blocking Its Dephosphorylation

354

Multidrug Resistance Highlights a Family of Memhrane Pumps with ATP -Binding Cassette Domains

385

3,8

Cyclic AMP Stimulates the Phosphorylation of Many Target Proteins by A ctivating Protein Kinase A

386

G Proteins Spontaneously Reset Themsel yes Through GTP Hydrolysis

387

Some 7TM Receptors Activate the Phosphoinositide Cascade

388

Calcium Ion Is a Widely U sed Second Messenger

389

Calcium Ion Often Activates the Regulatory Protein Calmodulin

390

362

363

The Structure of a P o tassium Ion Channel I s an Archetype [or Many lon-Channel Structu res

364 365 367 36X

A C hannel Can Be Inactivated by Occlusion of the Pore: The Ball -and-Chain Model

369

The Acetylcholine Receptor Is an Archetype for Ligand-Gated Ion C hannels

370

Action Potentials In tegrate the Activities of Several Ion Channels Working in Concert

383

358

Patch -Clamp Conductance Meas urements Reveal the Activities of Single Channels

Voltage Gating Requires Substantial Conformational Changes in Specific Ion -Channel Domains

14.1 Heterotrimeric G Proteins Transmit Signals and Reset Themselves

Activated G Proteins Tran smit Signal s by Bindin g to Other Proteins

362

The Structure of th e Potassi um Ion C hannel Explains Its Rapid Rate of Transport

382

384

Action P otentials Are Mediated by Transient Changes in Na + and K+ Permeability

T'he Stru cture of th e Potassium Ion C hannel Reveals the Basis of Ion Specificity

Signal- Transd uction Depends on Molecular Circuits

35 7

13.3 Lactose Permease Is an Archetype of Secondary Transporters That Use One Concentration Gradient to Power the Formation of Another 360 13.4 Specific Channels Can Rapidly Transport Ions Across Membranes

381

Ligand Binding to 7 TM Receptors Leads to the A ctivation of Heterotrimeric G Proteins

p- Type ATPases Are Evolutionarily Conserved and Playa Wide R ange of Roles

Chapter 14 Signal -Transduction Pathways

372

14.2 Insulin Signaling: Phosphorylation Cascades Are Central to Many Signal-Transduction Processes

391

The Insulin Receptor I s a Dimer That C loses Around a Bound Insulin Molecule

392

Insulin Binding Results in the Cross-Phosphorylation and A ctivation of the Insulin Receptor

39 2

The A ctivated Insulin Receptor Kinase Initiates a Kinase C ascade

393

In sulin Signaling Is T erminated by the Action of Phosphatases

395

14.3 EGF Signaling: Signal-Transduction Pathways Are Poised to Respond 395 EGF Binding ResulLs in the Dimerization of the EGF Receptor

396

The EGF Receptor Undergoes Phosphor ylatio n of Its C arhoxyl -Terminal Tail

397

EGF Signaling Leads to the Activation of Ras, a Small G Protein

397

Activated Ras Initiates a Protein Kinase C ascade

398

EGF Signaling Is Terminated by Protein Phos phatases and the Intrinsic GTPase A ctivity of Ras

13.5 Gap Junctions Allow Ions and,small Molecules to Flow Between Communicating Cells

373

13.6 Specific Channels Increase the Permeability of Some Membranes to Water 374

398

14.4 Many Elements Recur with Variation in Different Signal-Transduction Pathways

399

14.5 Defects in Signal-Transduction Pathways Can Lead to Cancer and Other Diseases

400

Monoclonal Antibodies Can Be Used to Inhibit Signal-Transduction Pathways Activated in Tumors

401

Protein Kinase Inhibitors Can Be Effective Anti cancer Drugs

401

C holera and Whooping Cough Are Due to Altered G -Protein A ctivity

401

xxi v

Contents

Part II TRANSDUCING AND STORING ENERGY Chapter 15 Metabolism: Basic Concepts and Design

409

15.1 Metabolism is Composed of Many Coupled, interconnecting Reactions 410 Metaboli sm Consists of Energy -Yielding and Energy Requiring Reactions 41 0 A Thermod ynamicall y Unfavorable Reaction Can Be Driven by a Favorable Reaction 411 15.2 ATP is the Universal Currency of Free Energy in Biological Systems ATP Hydroly sis [, Exergonic ATP Hydrolysis Drives Metabolism by Shifting the Equilibrium of Coupl ed Reaction s The High Phosphoryl Potential of ATP Results from Structural Differences Between ATP and Its Hydrolysis Products Phosphoryl-Tran sfer Potential Is an Important Form of Cellular Energy Transformation

15.3 The Oxidation of Carbon Fuels Is an important Source of Cellular Energy Compounds with High Phosphoryl- Transfer Potential Can Couple Carbon Oxidation to ATP Synthesis Ion G radients Across M embranes Provide an Important Form of Cel lul ar Energy T hat Can Be Cou pled to ATP Sy nthe,is Energy from Foodstuffs [s Extracted in Three Stages

15.4 Metabolic Pathways Contain Many Recurring Motifs Activated Carriers Exemplify the Modular Design and Economy of Metabolism Many Activated Carriers Are Derived from Vitamins Key Reactions Are Reiterated Throughout Metaboli sm Metabolic Prace"es Are Re!,>ulated in Three Principal Ways Aspects of Metabolism May Have Evolved from an RNA World

412 412 413

4'15 41 6

417 418

418 41 9

420 420 423 425 428 429

Chapter 16 Glycolysis and Gluconeogenesis 433 Glucose Is Generated from Dietary Carbohydrates

434

Glucose Is an Important Fuel for Most Organisms

435

16.1 Glycolysis is an Energy-Conversion Pathway in Many Organisms 435 Hexokinase Traps G lucose in the Cell and lIegins G lycolysis Fructose 1,6-bi sphosphate Is Generated from G lucose 6-phosphate ,

435 437

The Six -Carbon Sugar Is Cleaved inlo Two ThreeCarbon Fragments 438 Mechanism : Triose Pho,phate homerase Salvages a Three-Carbon Fragment 439 The Oxidation of an A ldehyde to an Acid Powers the Formation of a Compound with High PhosphorylTransfer Potential 440 Mechanism : PhosphOlylation Is Coupl ed to the Oxidation of G lyceraldehyde 3-phosphate by a Th ioester lntennediate 442 ATP Is Formed by Phosphoryl Transfer from 1,3 -Bisphosphoglyccrate 443 Additional ATP b Generated with the Formation of P yruvate 444 Two ATP Molecules Are Formed in the Conversion of Glucose into P yruvate 440 r--:AD + Is Regenerated from the Metabolism of Pyru vate 446 Fermentations Provide Usable Energy in the Absence of Oxygen 448 The Binding Site for l\AD+ Is Similar in Many Dehydrogenases 448 Fructose and Galactose Are Converled in to Glycolytic Intermediates 449 Many Adults Are Intolerant of Milk Beca use They Are Deficient in Lactase 451 Galactose Is Highly Toxic If the Transferase Is Missing 451

16.2 The Glycolytic Pathway Is Tightly Controlled G lycolysis in Muscle Is Regulated lO Meet the Need forATP The Regulation of Glycolysis in the Liver Reflects the Biochemical Versatility of the Liver A Family of Transporters Enable, Glucose to Enter and Leave Animal Cells Cancer .nJ Exercise Training Affect Glycolysis in a Similar Fashion

16.3 Glucose Can Be Synthesized from Noncarbohydrate Precursors

452 452 45 5 456 457

458

G luconeogenesis Is Not a Reversal of Glycolysi s

460

The Conve"ion of Pyruvate into Phosphoenolpyruvate Begins with the Formation of Ox ala acetate

461

Oxaloacetate Is Shuttl ed into the Cytoplasm and C onverted into Phosphoenolpyruvale

462

The Conversion of Fructose 1,6-bisphosphate into Fructose 6-phosphate and Orthophosphate Is an I rreversible Step

463

The Generation of Free Glucose Is an Important Control Point

463

Six High -Transfer -Potential Phosphoryl Groups Are Spent in Synthesizing Glucose from Pyruvate

464

16.4 Gluconeogenesis and Glycolysis Are ReCiprocally Regulated

464

Energy Charge Determines Whether Glycolysis or Gluconeogenesis Will Be Most Active

465

Contents x x v

The Balance Between Glycolysis and Gluconeogenesis in the Liver 1s Sensitive to Blood -Glucose Concentration Substrate Cycles Amplify Metabolic Signals and Produce Heat Lactate and Alanine Formed by Contracting Muscles Are Used by Other Organs Glycolysis and G luconeogenesis Are Evolutionarily Intertwined

Chapter 17 The Citric Acid Cycle T he Citric Acid Cycle Harvests High -Energy Electrons

17.1 Pyruvate Dehydrogenase Links Glycolysis to the Citric Acid Cycle

467 468 469

475 476

477 478

Flexible L inkages Allow Lipoamide to Move Between Different Active Sites

480

Citrate Synthase Forms C itrate from Oxaloacetate and Acetyl Coenzyme A

482 4R2

Mechanism : The M echani sm of C itrate Synthase Prevents Undesirable Reactions

48 2

Citrate Is Isomerized into Isocitrate

484

[socitrate 1s Ox idi zed and D ecarboxylated to a-Ketoglutarate

4R4

Succinyl Coenz yme A 1s Formed by the Oxidative Decarboxy lation of a-Ketoglutarate

4R5

A Compound with High Phosphoryl- Transfer Potential Is Generated fro m Succinyl Coenzyme A

485

'vlechanism: Succinyl Coenzyme A Synthetase Transforms Types of Biochemical E nergy 486 Oxaloacetate Is Regenerated by the Oxidation of Succinate

487

The Citric Acid Cycle Produces High-Transfer-Potential Electrons, G TP, and CO 2 488

17.3 Entry to the Citric Acid Cycle and Metabolism Through It Are Controlled

490

The Pyruvate Dehydrogenase Complex 1s Regu lated Allosterically and by Reversible Phosphor ylation

490

The C itric Acid Cycle Is Controlled at Several Points

492

17.4 The Citric Acid Cycle Is a Source of Biosynthetic Precursors The Citric Acid Cycle Must Be Capable of Deing Rapidly Replenished

495

Chapter 18 Oxidative Phosphorylation

502

46 6

Mechanism: The Synthesis of Acetyl Coenzyme A from Pyruvate Requires Three Enzymes and Five Coenzymes

17.2 The Citric Acid Cycle Oxidizes Two-Carbon Units

17.S The Glyoxylate Cycle Enables Plants and Bacteria to Grow on Acetate

493 493

The Disruption of Pyruvate Metabolism Is the Cause of Beriberi and Poison ing by M ercury and Arsenic

494

The Ci tric Acid Cycle May Have Evolved fro m Preexisting Pathways

495

Oxidative Phosph orylation Couples the Oxidation of Carbon Fuels to ATP Synthesis with a Proton Gradient

502

18.1 Oxidative Phosphorylation in Eukaryotes Takes

Place in Mitochondria

503

Mitochondria Are Bounded by a Double Membrane

SOl

Mitochondria Are the Result of an Endosymbiotic Event

504

18.2 Oxidative Phosphorylation Depends on Electron Transfer The Eleclron -Transfer Potential of an Electron Is M easured As Redox Potential

506 506

A 1.14 -Volt Potential Difference Between NAD H and Molecul ar Oxygen Drives Electron Transport Through the C hain and Favors t he Formation of a Proton Gradient 508

18.3 The Respiratory Chain Consists of Four Complexes: Three Proton Pumps and a Physical Link to the Citric Acid Cycle 509 The High-Potential Electrons ofNADH Enter the Respiratory C hain at NAD H -Q Oxidoreductase

510

U biquinol Is the Entry Point for Electron s from FADH , of Flavoproteins

51 2

Electrons Flow from Ubiquinol to Cytochrome c Through Q-Cytochrome' c Ox idoreductase

51 2

The Q Cycle F unnels Electrons from a Two-Electron Carrier to a One-Electron Carrier and Pumps Proteins Cytochrome c Oxidase Catalyzes the Reduction of Molecular Oxygen to Water Toxic Derivatives of M olecular Oxygen Such As Superoxide Radical Are Scavenged b y Protecti ve Enzymes Electrons Can Be Transferred Between Groups That Are Not in Contact The Conformation of Cy tochrome c Has Remained Essentially Con stant for M ore Than a Billion Years

18.4 A Proton Gradient Powers the Synthesis of ATP ATP Synthase Is Composed of a Proton -Conducting Unit and a Catalytic Unit Proton Fl ow Through ATP Synthase Leads to the Release of Tightly Bound ATP: The Binding-C hange M ech anism

513 514 517 519 520

520 522

523

x x V •I

Contents

Rotational Catalysis Is the World 's Smallest Molecular Motor

524

Proton Flow Around the c Ring Powers ATP Synthesis

525

ATP Synthase and G Proteins H ave Severa l Common Features

527

18.5 Many Shuttles Allow Movement Across the Mitochondrial Membranes 527 Electrons from Cytoplasmic NADH Enter Mitochondria by Shuttles The Entry of ADP into Mitochondria Is Coupled to the Exit of ATP by ATP-ADP Translocase Mitochondrial Transporters for Metabolites Have a Common Tripartite Structure

18.6 The Regulation of Cellular Respiration Is Governed Primarily by the Need for ATP The Complete Oxidation of Glucose Yields About 30 Molecules of ATP The Rate of Oxidative Phosphorylation Is Determined by the Need for ATP Regulated Uncouplin g Leads to the Generation of Heat Oxidative Phosphorylation Can Be Inhibited at Many Stages Mitochondrial Diseases Are Being Discovered Mitochondria Playa Key Role in Apoptosis Power Transmission by Proton Gradients Is a Central Motif of Bioenergetics

Chapter 19 The Light Reactions of Photosynthes is Photosynthesis Converts Light Energy into C hemical Energy

19.1 Photosynthesis Takes Place in Chloroplasts The Primary Events of Photosynthesis Take Place in Thylakoid Membranes Chloroplasts Arose from an Endosymbiotic Event 19.2 Light Absorption by Chlorophyll Induces Electron Transfer A Special Pair of Chlorophylls Initiate Charge Separation Cyclic Electron Flow Reduces the Cytochrome of the Reaction Center

19.3 Two Photosystems Generate a Proton Gradient and NADPH in Oxygenic Photosynthesis Photosystem II Transfers Electrons from W ater to Plastoquinone and Generates a Proton Gradient Cytoch rome bfLink s Photosystem II to Photosystem I Photosystem 1 Uses Light Energ y to Generate Reduced Ferredoxin , a Powerful Reductant Ferredoxin-NADP + Red uctase Converts NADP+ into NADPH

527 529 530

530 531 5]2 53 2 533 534 535 535

541

19.4 A Proton Gradient Across the Thylakoid Membrane Drives ATP Synthesis The ATP Synthase of C hloroplasts C losely Resembles Those of Mitochondria and Prokaryotes Cycli c Electron F low Through Photosystem [ Leads to the P roduction of ATP Instead of NADP H The Absorption of Eight Photons Yields One 0 ." Two NADPH , and Three ATP Molecules

19.5 Accessory Pigments Funnel Energy into Reaction Centers Resonance Energy Transfer Allows Energy to Move from the Site of Tnitial Absorbance to the Reaction Center Light -Harvesting Complexes Contain Additional Chlorophylls and Carotinoids The Components of Photosynthesis Are Highly O rgan ized Many Herbicides Inhibit the Light Reactions of Photosynthesis

553 554 5SS 556

557

557 55H 559 560

19.6 The Ability to Convert Light into Chemical Energy Is Ancient

560

Chapter 20 The Calvin Cycle and the Pentose Phosphate Pathway

565

20.1 The Calvin Cycle Synthesizes Hexoses from 566 Carbon Dioxide and Water C arbon Dioxide Keacts with Kibulose 1,5- bisphosphate to Form Two Molecules of 3-Phosphoglycerate

567

542

542 543 543

544 ,45 547

548 548 551 551 552

Large

subunit

Rubisco Activity Depends on Magnesium and Carbamate Rubisco Also Catalyzes a Wasteful Oxygenase Keaction : Catalytic Imperfection Hexose Phosphates Are M ade from Phosphoglycerate, and Ribu lose 1,5-bisphosphate Is Regenerated Three ATP and Two NADPH Molecules Are Used to Dring Carbon Dioxide to the Level of a H exose Starch and Sucrose Are the Major Carbohydrate Stores in Plants

568 569 570 572 573

•

•

Contents x x v II

20.2 The Activity of the Calvin Cycle Depends on Environmental Conditions Ru bisco Is Acti vated by Light· Driven C hanges in Proton and Magnesium Ion Concentrations Thioredoxi n Plays a Key Role in Regulating the Calvin Cycle The C, Pathway of Tropical Plants Accelerates Photosynthesis by Concentrating Carbon Dioxide Crassulacean Acid Metabolism Permits Growth in Arid Ecosystems

574 574 574 575 577

577 579 581

The Rate of the Pentose Phosphate Pathway Is Controlled by the Level ofNADP + The Flow of Glucose 6· phosphate Depends on the Need for NADPH , Ribose 5·phosphate, and ATP Through the Looking Glass : The Calvin Cycle and the Pentose Phosphate Pathway Are Mirror [mages

Chapter 21 Glycogen Metabolism Glycogen M etabolism Is the Regulated Release and Storage of Glucose

Phosphorylase Kinase Is Activated hy Phosphorylation and Calcium Ions

598 598 600 600

21.3 Epinephrine and Glucagon Signal the Need

for Glycogen Breakdown G Proteins Transmit the Signal for the Initiation of G lycogen Breakdown G lycogen Breakdown Must Be Rapidl y Turned Off When Necessary The Regulation ofGlycngen Phosphorylase Became More Sophisticated As the Enzyme Evolved

601 60 1

603 604

604

583

UDp · G lucose Is an Activated Form of Glucose

604

583

Glycogen Synthase Catalyzes the Transfer of Glucose from U DP · Glucose to a Growing C hain

605

A Branching Enzyme Forms ('<-1,6 Linkages

606

Glycogen :Synthase 1s the Key Regulatory Enzyme in Glycogen Synthesis

607

Glycogen Is an Efficient Storage Form of G lucose

607

583 585

586

587

592 593

21.1 Glycogen Breakdown Requires the Interplay of Several Enzymes 593 Phosphorylase Catalyzes the Phosphorolytic Cleavage of G lycogen to Rel ease G lucose I . phosphate A Debranching Enzyme Also Is Needed for the Breakdown of G lycogen

Muscle Phosphorylase Is Regulated by the Intracellular Energy Charge Liver Phosphoryl ase Produces Glucose for Use by Other Tissues

Different Pathways

20.5 Glucose 6-phosphate Dehydrogenase Plays a Key Role in Protection Against Reactive Oxygen Species 586 A Defi ciency of Glucose 6. phosphate Dehydrogena se Causes a Drug· Induced Hemolytic Anemia A Deficiency of Glucose 6·phosphate Dehydrogenase Confers an Evolutionary Advantage in Some Circumstances

Interactions and Reversible Phosphorylation

21.4 Glycogen Is Synthesized and Degraded by

20.4 The Metabolism of Glucose 6-phosphate by

the Pentose Phosphate Pathway Is Coordinated with Glycolysis

596

21.2 Phosphorylase Is Regulated by Allosteric

20.3 The Pentose Phosphate Pathway Generates NADPH and Synthesizes Five-Carbon Sugars 577 Two Molecules of N ADPH Are Generated in the C.onversion of G lucose 6. phosphate into Ribu lose 5· phosphate The Pentose Phosphate Pathway and Glycolysis Are Linked by Transketolase and Transaldolase Mechanism : Transketolase and Transaldolase Stabilize Carbanioni c Interm ediates by Different Mechanism s

Mechanism: Pyridoxal Phosphate Participates in the Phosphorolytic Cleavage of Glycogen

594 59 4

Phosphoglucomutase Converts Glucose 1 · phosphate into Glucose 6· phosphate

S9S

The Liver Contains G lucose 6· phosphatase, a Hydrolytic Enzyme Absent from Muscle

596

21.5 Glycogen Breakdown and Synthesis Are

Reciprocally Regulated

607

Protei n Phosphatase 1 Reverses t he Regulatory Effects of Kinases on G lycogen M etaholi sm

60R

Insulin Stimulates G lycogen Synth esis by Acti vatin g Glycogen Synthase Kinase

610

Glycogen M etaboli sm in the Liver Regulates the l:l1ood·G lucose Level A Biochemical Understanding of G lycogen Storage Diseases Is Possibl e

Chapter 22 Fatty Acid Metabolism Fatty Acid Degradation and Synthesis Mirror Each Other in Their Chemical Reaction s

610 611

617 6 1R

• • •

XXVIII

Contents

22.1 Triacylglycerols Are Highly Concentrated Energy Stores Di etary Lipids Are Digested by Pancreatic Lipases D ietar y Lipids Are Transported in C hylomicrons

22.5 Acetyl CoA Carboxylase Plays a Key Role 619 619 620

22.2 The Utilization of Fatty Acids as Fuel Requires Th ree Stages of Processing Triacylglycerols Are Hydrolyzed by HormoneStimulated Lipases

621

in Controlling Fatty Acid Metabolism

640

Acetyl CoA Carboxylase Is Regulated by Conditions in the Cell

640

Acetyl CoA Carboxylase Is Regulated by a Variety of Hormones

64 1

22.6 The Elongation and Unsaturation of

62 1

Fatty Ac ids Are Accomplished by Accessory Enzyme Systems

642

622

Carnitine Carries Long-Chain Acti vated Fatty Acids into the Mitochondrial M atrix

M embrand -Bound Enzymes G enerate Unsaturated Fatty Acids

642

62:1

Acetyl CoA, N ADH, and FADH2 Are Generated in Each Round of Fatty Acid O xidation

Ei cosanoid Hormones Are D eri ved from Polyun saturated Fatty Acids

641

62 4

Fatty Acids Are Linked to Coenzyme A Before They Are Ox idized

The Complete O xidation of Palmitate Yields 106 lvlolecules of ATP

22.1 Unsaturated and Odd -Chain Fatty Acids Requi re Add itional Steps for Degradation An [som erase and a Reductase Are Required for the Oxidation of Unsaturated Fatty Acids

625

626

626

O dd -C hain Fatty Acids Yield Propionyl eoA in the Final Thiolysis Step

6T

Vitamin BI1 Contains a Corrin Ring and a Cobalt Ion

628

Mechani sm : M ethylmalonyl CoA Mutase Calalyzes a Rearrangement to Form Succinyl CoA Fatly Acids Are Al so O xidized in Peroxisomes Ketone Bodies Are Fonned from Acetyl CoA When Fat Breakdown Predominates Ketone Bodies Are a Major Fuel in Some Tissues Animal s Cannot Convert Fatty Acids into Glucose

-,

629 610 631

632 634

The Formation of Malonyl CoA [s the Committed Step in Fatty Acid Synthesis

634

635

Intermediates in Fatty Acid Synthesis Are Attached to an A cyl Carrier Protein

Fatty Acid Synthesis Consists of a Series of Condensation , Reduction, Dehydration , and Reduction Reactions Fatty Acids Are Synthesized by a Multifunctional Enzyme Complex in Animal s The Synthesis of Palmitate Requires 8 Mol ecul es of Acetyl CoA, 14 Molecules of NADPH, and 7 Molecules of ATP

649

21.1 Proteins Are Degraded t o Amino Acids

650

The Digestion of Dietary Protein s Begins in th e Stomach and Is Completed in the Intestine

65 0

Cellular Proteins Are Degraded at Different Rates

651

21.2 Protein Turnover Is Tightly Regulated

615

636

651

The Proteasome Di gests the Ubiquitin -Tagged Proteins

653

Protein Degradation Can Be Used to Regulate Biological Function

654

The U biquitin Pathway and the Proteasome Have Prokaryotic Counterparts

655

21.1 The First Step in Amino Acid Degradation Alpha-Amino G roups Are Converted into Ammonium Ions by the Oxidative Deamination of G lutamate

656 656

Mechanism : Pyridoxal Phosphate Forms Schiff-Base

rntermediates in Aminotransferases

657

Aspartate Aminotransferase Is an Archetypal Pyridoxal -Dependent Transaminase

659

Pyridoxal Phosphate Enzymes Catalyze a Wide Arra y of Reactions Serine aod Threonine Can Be Directly Deaminated

637

651

Ubiquitin Tags Protei ns for D estruction

Is the Removal of Nitrogen

22.4 Fatty Acids Are Synthesized and Degraded by Different Pathways

Chapter 23 Protein Turnover and Amino Acid Catabolism

Peripheral Tissues T ransport Nitrogen to the Liver

659 660 660

21.4 Ammonium Ion Is Converted into Urea in 638

Citrate Carries Acetyl Groups from Mitochondria to the Cytoplasm for Fatty Acid Synthesis Several Sources Supply N ADPH for Fatty Acid Synthesis

639

Fatty Acid Synthase Inhibitors May Be Useful Drugs

640

638

Most Terrestrial Vertebrates

661

The Urea C ycle llegins with the Formation of Carbamoyl Phosphate

661

The Urea Cycle 1s Linked to Gluconeogenesis

661

Urea-Cycle Enzymes Are Evolutionarily Related to Enzymes in Other Metabolic Pathways

664

•

Co ntents x x I X

Inherited Defects of the Urea Cycle Cause Hyperammonemia and Can Lead to Brain Damage

664

Glutamate Is the Precursor of Glutamine, Proline, and Arginine

6RR

666

3-Phosphoglycerate Is the Precursor of Serine, Cysteine, and G lycine

688

Tetrahydrofolate Carries Activated One -Carbon Units at Several Oxidation Levels

689

S-Adenosylmethionine Is the Major Donor of Methyl Groups

691

666

Cysteine Is Synthesized from Serine and Homocysteine

693

668

H igh H omocysteine Levels Correlate with Vascular Disease

693

668

Shikimate and Chorismate Are Intermediates in the Riosynthesis of Aromatic Amino Acids

693

Succinyl Coenzyme A Is a Point of Entry for Several t\onpolar Amino Acids

668

Tryptophan Synthase Illustrates Substrate Channeling in Enzymatic Catalysis

696

Methionine Degradation Requires the Formation of a Key Methyl Donor, S-Adenosylmethionine

669

24.3 Feedback Inhibition Regulates Amino Acid Biosynthesis

697

670

Branched Pathways Require Sophisticated Regulation

697

671

An Enzyme Cascade Modulates the Activity of G lutamine Synthetase

699

Urea Is Not the Only Means of D isposing of Excess l\itrogen

23.5 Carbon Atoms of Degraded Amino Acids Emerge As Major Metabolic Intermediates Pyruvate Is an Entry Point into Metabolism for a Number of Amino Acids Oxaloacetate Is an Entry Point into Metabolism for Aspartate and Asparagine Alpha-Ketoglutarate Is an Entry Point into Metabolism fo r Five-Carbon Amino Acids

The Branched- C hain Amino Acids Yield Acetyl CoA, Acetoacetate, or Propionyl CoA Oxygenases Are Required for the Degradation of Aromatic Amino Acids

23.6 Inborn Errors of Metabolism Can Disrupt Amino Acid Degradation

666

672

Part III SYNTHESIZING THE MOLECULES

OF LIFE Chapter 24 The Biosynthesis of Amino Acids Amino Acid Synthesis Requires Solutions to Three Key Biochemical Problems

24.1 Nitrogen Fixation: Microorganisms Use ATP and a Powerful Reductant to Reduce Atmospheric Nitrogen to Ammonia The Iron- Molybdenum Cofactor of Nitrogenase Binds and Reduces Atmospheric Nitrogen Ammonium Ion Is Assimilated into an Amino Acid Through Glutamate and Glutamine

679

A Common Step Determines the Chirality of All Amino Acids The Formation of Asparagine from Aspartate Requires an Adenylated Intermediate

700

Glutathione, a Gamma-Glutamyl Peptide, Serves As a Sulfhydryl Buffer and an Antioxidant

701

Nitric Oxide, a Short-Lived Signal Molecule, Is rormed from Arginine

702

Porphyrins Are Synthesized from Glycine and Succinyl Coenzyme A

702

Porphyrins Accumulate in Some Inherited Disorders of Porphyrin Metabolism

704

680

680 681 683

24.2 Amino Acids Are Made from Intermediates of the Citric Acid Cycle and Other Major Pathways 685 Human Beings Can Synthesize Some Amino Acids but Must Obtain Others from the Diet Aspartate, Alanine, and G lutamate Are formed by the Addition of an Amino Group to an a -Keto.cid

24.4 Amino Acids Are Precursors of Many Biomolecules

685 686 686 6X7

Chapter 2S Nucleotide Biosynthesis

709

Nucleotides Can Be Synthesized by de Novo or Salvage Pathways

710

25.1 In de Novo Synthesis, the Pyrimidine Ring Is Assembled from Bicarbonate, Aspartate, and Glutamine

710

Bicarbonate and Other Oxygenated Carbon Compounds Are Activated by Phosphorylation 711 The Side Chain of Glutamine Can Be Hydrolyzed to Generate Ammonia

711

Intermediates Can M ove Between Active Sites by C hanneling

7 11

Orotate Acquires a Ribose Ring from PRPP to Form a Pyrimidine Nucleotide and Is Converted into Uridylate

712

N ucleotide Mono- , Di- , and Triphosphates Are Interconvertible

713

CTP Is Formed by Amination of UTP

713

xxX

Contents

25.2 Purine Bases Can Be Synthesized de Novo or Recycled by Salvage Pathways Salvage Pathways Economize Intracellular Energy Expenditures T he Purine Ring System [s Assembled on Ribose Phosphate

714 714 714

The Purine Ring Is Assembled hy Successive Steps of Activation by Phosphorylation Followed by Displacement 715 AMP and GMP Are Formed from IMP

25.3 Deoxyribonucleotides Are Synthesized by the Reduction of Ribonucleotides Through a Radical Mechanism Mechanism : A Tyrosyl Radical Is C ritical to the Action of Ribonucleotide Reductase

717

718 7 18

Thymidylate Is Formed by the Methylation of Deoxyuridylate

720

Dihydrofolate Reductase Catalyzes the Regeneration of Tetra hydro folate, a One-Carbon Carri er

721

Several Valuable Anticancer f)rugs Block the Synthesis ofThymidylate

25.4 Key Steps in Nucleotide Biosynthesis Are Regulated by Feedback Inhibition

722

Respiratory Distress Syndrome and Tay-Sachs Disease Resul t from the Di sruption of Lipid Metabolism

738

26.2 Cholesterol Is Synthesized from Acetyl Coenzyme A in Three Stages

The Synthesis of Purine N ucleotides Is Controll ed by Feedback Inhibition at Several Sites

723

739

T he Synthesis of M evalonate, Which Is Activated As Isopenten yl Pyrophosphate, In itiates the Synthesis of Cholesterol

739

Squalene (C 3 oJ Is Synthesized from Six Molecules of [sopentenyl Pyrophosphate (C, )

740

Sq ualene Cyclizes to Form Cholesterol

741

26.3 The Complex Regulation of Cholesterol Biosynthesis Takes Place at Several Levels

742

Lipoproteins Transport C holesterol and Triacylglycerols Throughout the Organism

742

SREBP

ER

SCAP

DNA-binding domain Cytoplasm

723 723

25.5 Disruptions in Nucleotide Metabolism Can Cause Pathological Conditions

7:lR

...-.

Pyrimidin e Biosynthesis Is Regulated hy Aspartate Transcarbamoylase

The Synthesis of Deoxyribonucleotides Is Con trolled by the Regulation of Ribonucleotide Red uctase

Sphingolipids Confer Diversity on Lipid Structure and F un ction

Lumen

724

The mood Levels of Certain Lipoproteins Can Serve Diagnostic Purposes

745

725

Low -D ensity Lipoproteins P laya Centrol Role in Cholesterol Metabolism

745

The L DL Receptor Is a Transmembrane Protein Having Five Differe nt Functional Regions

746 747

The Loss of Adenosine Deaminase Activity Results in Severe Combined Immunodeficiency

725

Gout Is Induced by High Serum Levels of Urate

726

The Absence of the LDL Receptor Leads to Hypercholesteremia and Atherosclerosis

Lesch Nyhan Syndrome Is a Dramatic Consequence of Mutations in a Salvage-Pathway Enzyme

726

The Clinical Management of C holesterol Levels Can Be Understood at a Biochemical Level

748

Folic Acid Deficiency Prom otes Birth Defects Such As Spina Bifida

727

26.4 Important Derivatives of Cholesterol Include Bile Salts and Steroid Hormones

748

Letters Identify the Steroid Rings and Num bers Identify the Carbon Atoms

750

Steroids Are H ydroxylated by Cytochrome P4 50 Monooxygenases That Use NADPH and O 2

750

The Cytochro me P45 0 System Is Widespread and Performs a Protective Function

751

Pregnenolone, A Precursor for Many Other Steroids, Is Formed from C holesterol by C leavage of Its Side C hain

752 752 751

Chapter 26 The Biosynthesis of Membrane Lipids and Steroids 732 26.1 Phosphatidate Is a Common Intermed iate in the SynthesiS of Phospholipids and Triacylglycerols

733

The Syn thesis of Phospholipids Requires an Activated Intermediate

734

Sphingoli pids Are Synthesized from Ceramide

736

Progesterone and Corticosteroids Are Synthesized from Pregnenolone

738

Androgens and Estrogens Are Synthesized from Pregnenolone

Gangliosides A re Carbohydrate-Rich Sphingolipids That Contain Acidic Sugars

Contents

Vitamin D Is Derived from Cholesterol by the Ring·Splitting Activity of Light

Chapter 27 The Integration of Metabolism 27.1 Metabolism Consists of Highly Interconnected Pathways

754

•

XXXI

Topoisomerases Prepare the Double H elix for Unwinding

790

Type I Topoisomerases Relax Supercoil ed Structures

790

Type II Topoisomerases Can In troduce Negative Supercoils Through Coupl ing to ATP Hydrolysis

791

760 761

Recurring Motifs Are Common in Metabolic Regulation

761

Major Metabolic Pathways Have Specific Control Sites

28.3 DNA Replication Proceeds by the Polymerization of Deoxyribonucleoside Triphosphates Along a Template

793

DNA Polymerases Require a Template and a Primer

793

762

All DNA Polymerases Have Structural Features in Common

793

765

Two Bound Metal Ions Participate in the Polymerase Reaction

794

The Specificity of Replication Is Dictated by Complementarity of Shape Between Bases

794

An RNA Primer Synthesized by Primase Enables DNA Sy nthesis to Begin

795

One Strand of DNA Is Made Continuously, Whereas the Other Strand Is Synthesized in Fragments

796

772

DNA Ligase Joins Ends of DNA in Duplex Regions

796

Metabolic Derangements in Diabetes Result from Relative Insulin Insufficiency and Glucagon Excess

773

The Separation of DNA Strands Requires Specific I-Ielicases and ATP Hydrolysis

797

Caloric Homeostasis Is a Means of Regulating Body Weight

774

28.4 DNA Replication Is Highly Coordinated

798

Glucose 6·phosphate. Pyruvate, and Acetyl CoA Are Key Junctions in Metabolism

27.2 Each Organ Has a Unique Metabolic Profile 27.3 Food Intake and Starvation Induce Metabolic Changes Metabolic Adaptations in Prolonged Starvation Minimize Protein Degradation

27.4 Fuel Choice During Exercise Is Determined by the Intensity and Duration of Activity 27.5 Ethanol Alters Energy Metabolism in the Liver Ethanol Metabolism Leads to an Excess ofNADH Excess Ethanol Consumption Disrupts Vitamin Metabolism

Chapter 28 DNA Replication, Repa ir, and Recombination 28.1 DNA Can Assume a Variety of Structural Forms The A-DNA Double Helix Is Shorter and Wider Than the More Common B-DNA Double Helix

766

770

DNA Replication Requires Highly Processive Polymerases

798

The Leading and Lagging Strands Are Synthesized in a Coordinated Fashion

799

777

DNA Replication in Escherichia coli Begins at a Unique Site

801

778

DNA Synthesis in Eukaryotes Is Initiated at Multiple Sites

802

Telomeres Are Unique Structures at the Ends of Linear Chromosomes

803

Telomeres Are Replicated by Telomerase, a Specialized Polymerase That Carries Its Own RN A Template

804

775 777

783 784 784

28.5 Many Types of DNA Damage Can Be Repaired

804

Errors Can Occur in DNA Replication

804

Some Genetic Diseases Are Caused by the Expansion of Repeats of Three Nucleotides

80S

The Major and Minor Grooves Are Lined by SequenceSpecific Hydrogen ·Bonding Groups

785

Studies of Single Crystals of DNA Revealed Local Variations in DNA Structure

Bases Can Be Damaged by Oxidizing Agents, Alkylating Agents, and Light 805

7R6

DNA Damage Can Be Detected and Repaired by a Variety of Systems

807

The Presence of Thymine In stead of Uracil in DNA Permits the Repair of Deaminated Cy tosine

809

788

Many Cancers Are Caused by the Defective Repair of DNA

810

789

Many Potential Carcinogens Can Be Detected by Their Mutagenic Action on Bacteria

811

Z· DNA Is a Left-Handed Double H elix in Whi ch Backbone Phosphates Zigzag

28.2 Double-Stranded DNA Can Wrap Around Itself to Form Supercoiled Structures The Linking N umber of DNA , a Topological Property, Determines the Degree of Supercoiling

787

x x xii

Contents

28.6 DNA Recombination Plays Important Roles in Replication, Repair, and Other Processes RecA Can Initiate Recombination by Promoting Strand lnvasion Some l:I.ecombination Reactions Proceed Through Holliday-Junction Intermediates Some Recombinases Are Evolutionarily Related to Topoisomerases

Chapter 29 RNA SyntheSiS and Processing RNA Synthesis Comprises Three Stages: Initiation, Elongation, and Termination

29.1 RNA Polymerase Catalyzes,Transcription RNA Polymerase Binds to Promoter Sites on the DNA Template to Initiate Transcription

The Product of RNA Polymerase II, the Pre-m RNA Transcript, Acquires a 5' Cap and a 3' Poly(A) Tail

840

RNA Editing Changes the Proteins Encoded by mRNA

842

812

Sequences at the Ends of Introns Specify Spl ice Sites in mRNA Precursors

843

813

Splicing Consists of Two Sequential Transesterification Reactions

R43

814

Small Nuclear RNAs in Spliceosomes Catalyze the Splicing of mRNA Precursors

844

Transcription and Processing of mRNA Are Coupled

846

Mutations That Affect Pre-mRNA Splicing Cause Disease

847

Most Human Pre-mRNAs Can Be Spliced in Alternative Ways to Yield Different Proteins

847

29.4 The Discovery of CatalytiC RNA Was Revealing in Regard to Both Mechanism and Evolution

848

812

821 822

823 824

Sigma Subunits of RNA Polymerase Recognize Promoter Sites

825

RNA Polymerase Must Unwind the Template Double Helix for Transcription to Take Place

827

Chapter 30 Protein Synthesis

857

RNA Chains Are Formed de Novo and Grow in the 5' -to-3' Direction

827

Elongation Takes Place at Transcription Bubbles That Move Along the DNA Template

828

30.1 Protein SyntheSis Requires the Translation of Nucleotide Sequences into Amino Acid Sequences

858

829

The Synthesis of Long Proteins Requires a Low Error Frequency

858

Transfer RNA Molecules Have a Common Design

859

The Activated Amino Acid and the Anticodon of tRNA Are at Opposite Ends of the L-Shaped Molecule

861

30.2 Aminoacyl-Transfer RNA Synthetases Read the Genetic Code

862

Amino Acids Are First Activated by Adenylation

862

Aminoacyl -tRNA Synthetases Have Highly Discriminating Amino Acid Activation Sites

863

Proofreading by Aminoacyl -tRNA Synthetases Increases the Fidelity of Protein Synthesis

864

Synthetases Recognize Various Features of Transfer RNA M olecules

865

Aminoacyl-tRNA Synthetases Can Be Divided into Two C lasses

865

Sequences Within the Newly Transcribed RNA Signal Termination The rho Protein Helps to Terminate the Transcription of Some Genes

810

Some Antibiotics lnhibit Transcription

831

Precursors of Transfer and Ribosomal RNA Are Cleaved and Chemically Modified After Transcription in Prokaryotes

29.2 Transcription in Eukaryotes Is Highly Regulated Three Types of RNA Polymerase Synthesize RNA in Eukaryotic Cells Three Common Elements Can Be Found in the RNA Polymerase II Promoter Region The TFIID Protein Complex Initiates the Assembly of the Active Transcription Complex

832

833 834 836 836

Multiple Transcription Factors Tnteract with Eukaryotic Promoters

838

Enhancer Sequences Can Stimulate Transcription at Start Sites Thousands of Bases Away

838

29.3 The Transcription Products of All Three Eukaryotic Polymerases Are Processed

839

RNA Polymerase 1 Produces Three Ribosomal RNAs

839

RNA Polymerase TIl Produces Transfer RNA

840

30.3 A Ribosome Is a Ribonucleoprotein Parti cle (70S) Made of a Small (305) and a Large (50S) Subunit

866

Ribosomal RNAs (5S , 16S, and 23S rRNA ) Play a Central Role in Protein Synthesis

867

Proteins Are Synthesized in the Amino-to-Carboxyl Direction

869

Contents

Messenger RNA [s Translated in the 5' · to-3 ' Direction

869

Polypeptide

The ·tart Signal Is Usually AUG Preceded by Several Rases That Pair with 16S rR NA

870

Bacterial Protein Synthesis [s Initiated by Formylmethionyl Transfer RNA

87 1

Ri bosomes H ave Three tRNA · Bindin g Sites That Bridge the 30S and 50S Subunits The Growing Polypeptide C hain [ s Tran sferred Bel ween tRNAs on Peptide-Bond Formation

87 1 872

Only the Cooon- Anticodon In teractions Determine the Amino Acid That Is Incorporated

873

Some Transfer RNA Molecules Recognize M ore Than One Codon Hecause of Wobble in Base-Pairing

874

30.4 Protein Factors Play Key Roles in Protein SynthesiS Formylmethionyl -tR NA r Is Pl aced in the P Site of the l{i bosome in the Formation of the 70S Initi ati on Compl ex Elongation Factors Deliver Amj noacyl· tRNA to the Rihnsome The Formation of a Peptide Rond Is Followed by the GTP · Driven Translocation of tR NAs and m RNA Protein Synthesis Is Terminated by Rel ease Factors That Read Stop Codons

30.5 Eukaryotic Protein Synthesis Differs from Prokaryotic Protein SyntheSiS Primarily in Translation Initiation 30.6 Ribosomes Bound to the Endoplasmic Reticulum Manufacture Secretory and Membrane Proteins Signal Sequences Mark Proteins for Translation Across the Endoplasmic Reticulum Membrane Transport Vesicles Carry Cargo Proteins to Their Final Desti nation

876

876 876 877 878

879

880 HH{)

XX2

... X X X III

30.7 A Variety of Antibiotics and Toxins Can Inhibit Protein Synthesis

884

D iphtheri a Toxin Illocks Protein Synthesis in Eukaryotes by Inhibiting· f"ran slocation

885

Ricin [ s an N-Glycosidase That Inhibits Protein Synthesis

885

Chapter 31 The Control of Gene Expression

892

31.1 Many DNA-Binding Proteins Recognize Specific DNA Sequences

893

T he Helix-Turn-Helix Motif Is Common to Many Prokaryotic DNA - Ilinding Proteins

894

A Range ofDNA -llinding Structures Are Employed by Eukaryotic DNA-Binding Proteins

895

31.2 Prokaryotic DNA-Binding Proteins Bind Specifically to Regulatory Sites in Operons

896

An Operon Consists of Regulatory Elements and Protein -Encoding Genes Th e lac Repressor Protein in th e Absence uf Lactose Binds to the Operator and Blocks Transcription

89 7

Ligand Bindin g Can Induce Structural C hanges in l{egulatory Proteins

898

897

The Operon [s a Common Regulatory Unit in Prokaryotes 899 Transcription Can fle Stimula ted by Pro teins That Contact RNA Polymerase 900

31.3 The Greater Complexity of Eukaryotic Genomes Requires Elaborate Mechanisms for Gene Regulation

901

Multipl e Transcription Fa ctors Interact with Eukaryotic Regulatory Sites

902

Eukaryotic Transcription Factors Are M odular

902

Activation Domain s Interact with Other Proteins

902

N ucleosomes Are Compl exes of DN A and Hi stones

901

Eukaryotic DNA Is Wrapped Around Histones to Form N ucleosomes

904

The Control of Gene Expression Can Require C hromatin Remodelin g

905

Enhancers Can Stimul ate Tran scription in Specific Cell T ypes

906

The M ethylation of DNA Can Alter Patterns of Gene Expression

907

Steroids and Related H ydrophobic Mol ecu les Pass Through M embranes and Uind to DNA -Binding Receptors

907

N uclear Hormone Receptors Regulate Transcription by Recruiting Cmctivators to the Transcription Complex

909

Steroid· Hormone Receptors Are Targets for Drugs

9 1()

•

x x XIV

Contents

C hromatin Stru cture Is Modulated Through Covalent Modifications of Histone Tail s Histone Deacetylases Contribute to Transcriptional Repression

31.4 Gene Expression Can Be Controlled at Posttranscriptional Levels

910

912

913

Attenuation Is a Prokaryotic M echanism for Regu lating Transcription Through the M odulati on of Nascent RNA Secondary Structure

913

Genes Associated with Tron Metabolism Are Tran slationally Reg ulated in Animals

9 14

Part IV RESPONDING TO ENVIRONMENTAL CHANGES Chapter 32 Sensory Systems 32.1 A Wide Variety of Organic Compounds Are Detected by Olfaction Olfaction Is Mediated by an Enorm ous Family of Seven · Transmembrane- Heli x Receptors Odorants Are Decoded hy a C:ombinatori al M echanism

921

922

Sequencing of the Human Genome Led to the Discovery of a Large Family of 7 TM llitter Receptors A H eterod imeric 7TM Receptor Responds to Sweet Com pounds Umami . the T aste of Glutamate and Aspartate, Is M ediated by a H eterod imeric Receptor Related to the Sweet Receptor Salty Tastes Are Detected Primarily by the P assage ofS()(lium Tons Through C:ha.nnels Sour Tastes Ari se from the Effects of H yd rogen Ions (Acids) on Channels

32.3 Photoreceptor Molecules in the Eye Detect Visible Light Rhodopsin , a Speciali zed 7TM Receptor, Absorbs Visible Light

924

926 927 929

930 930

931

931 932

Light Abso rption Induces a Specific Isom erization of llound 11-cis - Retinal

933

L ight - Jnduced L owerin g of the C:alcium Level Coordinates R ecovery

934

Color Vision Is M ediated by Three Cone Receptors That A re H omologs of Rhodopsin

32.4 Hearing Depends on the Speedy Detection of Mechanical Stimuli

936

937

H ai r Cells Use a Connected Bundle of Stereocilia to D etect Tiny M otions

937

Mechanosen sory C hannels H ave Been Identified in Drosophilio and Vertebrates

939

32.5 Touch Includes the Sensing of Pressure, Temperature, and Other Factors

939

Studies of Capsaicin Reveal a Receptor for Sen si ng High Temperatures and Other Painful Stimuli

940

More Sen sory Systems Remain to be Studied

941

Chapter 33 The Immune System

945

Tnn ate Immunity Is an Evolution arily Ancient Defense System

946

Th e Adaptive Tm mune System Responds h y Using the Principles of Evolution

947

923

F unctional Magnetic Resonance Imagin g Reveals Regions of the Brain P rocessing Sensory Information

32.2 Taste Is a Combination of Senses That Function by Different Mechanisms

Rearrangements in the Genes for the G reen and Red Pigments L ead to "Color Blindness"

93S

33.1 Antibodies Possess Distinct Antigen-Binding 949 and Effector Units 33.2 The Immunoglobulin Fold Consists of a Beta-Sandwich Framework with Hypervariable Loops

952

33.3 Antibodies Bind Specific Molecules Through Their Hypervariable Loops

953

X -ray Analyses Have Revealed H ow Antibodies Bind Antigen s

933

Large Antigens Rind Antihodies with Numerous Interactions

954

33.4 Diversity Is Generated by Gene Rearrangements

956

J (Joining) Genes and D (Diversity) Genes Increase Antibody Diversity

956

More Than 10 8 Antibodies Can Be Formed b y Combinatorial A ssociation and Somatic Mutation

958

The O ligomerization of Antibodies Expressed on th e Surfaces of Immature B Cells Triggers Antibody Secretion

958

Different C lasses of Antibodies Are Formed by the Hopping of V H Genes

960

33.5 Major-Histocompatibility-Complex Proteins Present Peptide Antigens on Cell Surfaces for Recognition by T-Cell Receptors 961 Peptides Presented by MHC Protein s Occupy a Deep Groove Flanked hy Alpha H eli ces

962

Contents x x x v T-Cell Receptors Are Antibody-like Proteins Containing Variable and Constant Regions

963

CDXon Cytotoxic T Cells Acts in Concert with T -Cell Receptors

964

Hel per T Cell s Stimulate Cells That Display Foreign Peptides Bound to Class II MHC Proteins

965

Helper T Cells Rely on the T -Cell Receptor and C D4 to Recognize Foreign Pep tides on Antigen -Presenting Cells

M HC Proteins Are Highly Diverse Human Immunodeficiency Viruses Subvert the Immune System by Destroying Helper T Cells

33.6 Immune Responses Against Self-Antigens Are Suppressed

966 968

Chapter 35 Drug Development

1001

35.1 The Development of Drugs Presents Huge Challenges

1002

Drug Candidates Must l.\e Potent Modulators of Their Targets

1002

Drugs Must H ave Suitable Properties to Reach Their Targets

1003

Toxicity Can Limit Drug Effectiveness

1008

969

970

T Cells Are Subjected to Positi ve and Negative Selection in the Thymus

070

Autoimmune Diseases Results from the Generation of Immune Responses Against Self-Antigens

971

The Immune System Plays a Role in Cancer Prevention

97 1

35.2 Drug Candidates Can Be Discovered by Serendipity, Screening, or Design

Chapter 34 Molecular Motors

977

34.1 Most Molecular-Motor Proteins Are Members of the P-Loop NTPase Superfamily A Motor Protein Consists of an ATrase Core and an Extended Structure

978

978

ATP Binding and Hydrolysis Induce Changes in the Conformation and Binding Affinity of Motor

Proteins

Serendipitous Observations Can Drive Drug Development

1009

Screening Libraries of Compounds Can Yield Drugs or Drug Leads

1011

Drugs Can Be Designed on the Basis of ThreeDimensional StTuctural Information About

Their Targets 980

Muscle Is a Complex of Myosin and Actin Actin Is a Polar, Self-Assembling, Dynamic Polymer Motions of Single Motor Proteins Can Be Directly Observed Phosphate Release Triggers the Myosin Power Stroke The Length of the Lever Arm Determines Motor Velocity

982

082

1017

985

Animal M odels Can Be Developed to Test the Validity of Potential Drug Targets

1018

Potential Targets Can Be Identified in the Genomes of Pathogens

1018

Genetic Differences InOuence Individual Responses to Drugs

1019

9R6 987 988 989 980

Kinesin Motion Is Highly Processive

991 993

Proton fl ow Drives Bacterial Flagellar Rotation

993 994

Bacterial Chemotaxis Depends on Reversal of the Direction of F lagellar Rotation

995

Bacteria Swim by Rotating Their Flagella

35.4 The Development of Drugs Proceeds Through Several Stages

Microtubules Are Hollow Cylindrical Polymers

34.4 A Rotary Motor Drives Bacterial Motion

1017

Potential Targets Can Be Identified in the Human Proteome

34.3 Kinesin and Dynein Move Along M icrotubules

1014

35.3 The Analysis of Genomes Holds Great Promise for Drug Discovery

34.2 Myosins Move Along Actin Filaments

1009

1020

C linical Trials Are Time Consuming and Expensive

1020

The Evolution of Drug Resistance Can Limit the Utility of Drugs for Infectious Agents and Cancer

1021

Appendices

A1

Glossary of Compounds

B1

Answers to Problems

C1

Index

D1

Chapter

1

Biochemistry: An Evolving Science

Disease and the genome. Studies of the human genome are revealing disea se o rigins and oth er bi ochemi cal myst eri es. Human chro moso mes. left. contain the DNA molecules tha t constitute the human genome. The staining pattern serves to q31.2

identify specifi c regions of a chromosome. On t he ri ght is a diagram of human chro mosome 7. with band q31.2 indicated by an arrow. A gene in thi s reg ion encodes a protein that. when ma lfuncti o ning. causes cysti c fibrosis. [(Left) Alfred Pa sieka / Peter Arn old.]

iochemistry is the study of the chemistry oflife processes. Since the dis covery that biological molecules such as urea could be synthesized from nonliving components in 1828, scientists have explored the chemistry oflife with great intensity. T hrough these investigations, many of the most funda mental mysteries of how living things function at a biochemical level have now been solved. However, much remains to be investigated. As is often the case, each discovery raises at least as many new questions as it answers. Furthermore, we are now in an age of unprecedented opportunity for the application of our tremendous knowledge of biochemistry to problems in medicine, dentistry, agriculture, forens ics, anthropology, environmental sciences, and many other fields. We begin our journey into biochemistry with one of the most startling discoveries of the past century: namely, the great unity of all living things at the biochemical level.

1.1

Outline 1.1

Biochemica l Unity Underlies Biological Diversity

1.2 DNA Illustrates the Interplay Between Form and Function 1.3 Concepts from Chemistry Explain the Properties of Biological Molecules 1.4 The Genomic Revolution Is Transforming Biochemistry and Medicine

Biochemical Unity Underlies Biological Diversity

The biological world is magnificently diverse. The animal kingdom is rich with species ranging from nearl y microscopic insects to elephants and whales. The plant kingdom includes species as small and relatively simple as algae and as large and complex as giant sequoias. Unlike animals that mll st eat to survive, plants have the remarkable abi li ty to use sunlight to 1

2 CHAPTER 1 Biochemi stry: An Evolving Science

CH 2 0H HO

C

OH

HO OH

H

CH2 0H Glycerol

Glucose

Sulfa/abus acidica/darius

con vert carbon dioxide in the air into living tissues. This diversity extends further when we descend into the microscopic world . Single-celled organ isms such as protozoa, yeast, and bacteria are present with great diversity in water, in soil, and on or within larger organisms. Some organi sms can survive and even thrive in seemingl y hostile environments such as h ot springs and glaciers. T he microscope revealed a key unify ing feature that underli es thi s d iversity. L arge organisms are built up of cells, resembling, to som e extent, single-celled microscopic organ is ms. T he construction of animals, plants, and microorganism s from cells suggested that these d iverse organisms mi ght have more in common than is apparent from their outward appearance. With the development of bi ochemi stry, this suggestion has been tremendously supported and expanded. At the biochemi cal level, all organ isms h ave m an y common features (Figure 1.1). As m ent ioned ea rlier, biochem istry is the study of th e ch emistry of life p rocesses. T hese processes entail the interpl ay of two different cl asses of m olecules: large molecules su ch as proteins and nucleic acids, referred to as biological macromolecules, and low -m olecular-weight m olecules such as glu cose and glycerol, referred to as metabolites, that are chemi call y tran sformed in biolog ical processes. M embers of both these cl asses of molecul es are common, with m inor variations , to all li ving things. For exam p le, deoxyribonucleic acid (DNA ) stores genetic inform ati on in all cellular organisms. Proteins, th e m acrom olecul es that are key participants in most bi ological p rocesses, are built from a set of 20 building blocks that are the same in all

Arabidapsis thaJiana

Homo sapiens

Figure 1.1 Biological diversity and similarity. The shape o f a key molecule in gene regulatio n (th e TATA-box-bind ing protein) is similar in three ve ry differen t o rganisms that are separated from one another by billions of years of evolution. [(Left) Dr. T. J. Beveri dge/V;suals Un limited; (middle) Holt Studios/Photo Researchers; (right) Time Life Pi ctu res / Getty Images.]

·-
'" E .'"c:

."

•E

u

::l

c:

-

.s::;

'"""

"0

0 0

UJ

.::;;

-'"
t

1

1

~

u

~

"

~

I

I

4.5

4.0

8 E 0 '"~ ''"c:-

.;:

~

.J:

u .-c.",

u

I

I

I

3.5

3.0

2.5

Oxygen atmosphere

u'"

1tI~ ::;;0

forming I

I

I

2.0

1.5

1.0

'"c: "" .'"::l '"'"0co .Cl ~

3 1.1 Unity and Diversity

..0

c: E

'"::>

I

1 11 I

I

0.5

0.0

Billions of yea rs Figure 1.2 A possible time line for biochemical evolution. Selected key events are indicated. Not e t hat life on Earth began approximately 3.5 billion years ago. whe reas human beings emerged quite recently.

organisms. F urthermore, proteins that play similar roles in different organisms often have very similar three-dimensional structures (see F igure 1.1 ). Key metaboli c processes also are common to many organisms . For example, the set of chemical transformations that converts glucose and oxygen into carbon dioxide and water is essentiall y identical in simple bacteria such as Escherichia coli (E. coli) and human beings. Even processes that appear to be quite distinct often have common features at the biochemical level. Remarkably, the biochemical processes by which plants capture light en ergy and convert it into more-useful form s are strikingly similar to steps used in animals to capture energy released from the breakdown of glucose. These observations overw helmingly suggest that all living things on Earth have a common ancestor and that modern organisms have evolved from this ancestor into their present forms. Geological and biochemical findings support a time line for this evolutionary path (Figure 1.2). On the basis of their biochemical characteristics, the diverse organisms of the modern world can be divided into three fundamental groups cal led d()mains: Eukarya (eukaryotes), Bacteria, and Archaea. E ukarya comprise all multicellular organisms, incl uding human beings as well as many microscopic, unicellular organisms such as yeast . The defining characteristic of eukaryotes is the presence of a well-defined nucleus within each cell. Unicellular organisms such as bacteria, which lack a nucleus, are referred to as prokaryotes. The prokaryotes were reclassified as two separate domains in response to Carl Woese's discovery in 1977 that certain bacteria -like organisms are biochemically quite distinct from other previously characARCHAEA BACTERIA EUKARYA terized bacterial species. These organisms, now recognized as having diverged from bacteria early in evolution, ~ '" E '" -c 8 .'2 ~ " are the archaea . Evolutionary paths from a common an .9 EO 80 0> ~ e cestor to modern organisms can be deduced on the basis of c " 0 -c ' " " " biochemi cal inform ation. O ne such path is shown in "§ -<: -S ... ~ EO u 0 S; '" :t: ~ ~ ~ figure 1.3. Much of thi s book will explore the chemical reactions and the associated biological macromolecules and metabolites that are found in biological processes common to all organisms. The unity oflife at the biochemical level makes this approach possible. At the same time, different organisms have specific needs, d epending on th e particular biological niche in which they evol ved and live. lly comparin g and contrastin g details of particular biochemical pathways in different organisms, we can learn how biological challenges are sol ved at the biochemical level. In most cases, these chall enges are addressed by the Figure 1.3 The tree of life. A possible evo lutionary path from a adaptation of existing macromolecules to new roles rather commo n ancestor approximately 3.5 billio n years ago at the bottom o f the t ree to o rganisms fo und in the modern world at th e t o p. than by the evolution of entirely new ones . ~

"

"

~

"

Biochemistry has been greatly enrich ed by our ability to examin e the three-dimensional stru ctures of biological macromol ecules in great detail. Some of these structures are simple and elegant, whereas others are incredibly complicated but, in any case, these structures provide an essen tial framework for und erstanding function. W e begin our exploration of the in terplay between structure and function with the genetic material, DNA.

4 CHAPTER 1 Biochemistry : An Evolving Science

1.2

DNA Illustrates the Interplay Between Form and Function

A fundamental biochemical feature common to all cellul ar organi sms is the use of DNA for the storage of genetic information. The discovery that DNA plays thi s central role was first made in studies of bacteria in the 1940s. This discovery was foll owed by the elucidation of the three-d imensional structure of DNA in 19 53, an event that set the stage for m any of the advances in biochemi stry and many other fields, extending to the present. T he structure of DNA powerfully illustrates a basic principle common to all biological macromolecules: the intimate relation between structure and function . The remarkable properties of th is chemical substance allow it to function as a very efficient and robust vehicle for storing information . We start with an examination of the covalent structure of DNA and its extension into three dimensions. DNA Is Constructed from Four Building Blocks

DNA is a linear polymer made up of four different types of monomers. It has a fixed backbone from which protrude variable substituents (Figure 1.4), The backbone is built of repeatin g sugar- phosphate units. The sugars are molecules of deoxyribose from which DNA receives its name. Each sugar is connected to two phosphate groups through different linkages. Moreover, each sugar is oriented in th e same way, and so each DNA strand is polar, with one end distinguishable from the other. Joined to each deoxyribose is one of four possible bases: adenine (A) , cytosin e (C), guanine (G), and thymine (T ). NH,

NH, N____.. H

f

0 H

N "'"

"" N

N H

N

/

.c N

H

0

/ H

"-

CH,

H' N

0

NH,

/

Cytosine (C)

Adenine (A)

N

f N

H

0

"""

I N, /

H

Thymine (T)

Guanine (G)

T hese bases are conn ected to the sugar components in the DNA backbone through the bonds shown in black in Figure 1.4. All four bases are planar but differ significantly in other respects. Thus, each monom er of DNA con sists of a sugar- phosphate unit and one of four bases attached to the sugar. T hese bases can be arranged in any order along a strand of D NA. base,

Figure 1.4 Covalent structure of DNA. Each unit of the po lymeric structure is composed of a sugar (deoxyribose), a phosphat e, and a variable base that protrudes from th e sugar- phosphate backbo ne.

base,

q

o 0,

Sugar

Phosphate

'"

5 1.2 DNA: Fo rm and Functi o n

Figure 1.5 The double helix. The double-helical structure of DNA proposed by Watson and Crick. The sugar-phosphate back bones of the two chains are shown in red and blue, and the bases are shown in green, purple, orange. and yellow. The two strands are anti parallel, running in opposi t e directions with respect to the axis of the double helix, as indicated by the arrows.

Two Single Strands of DNA Combine to Form a Double Helix

Most DNA molecules consist of not one but two strands (Figure 1. 5). In 1953, James Watson and Francis Crick ded uced the arrangement of these strand s and proposed a three-dim ensional structu re for DNA m olecules. This structure is a double helix composed of two intertwined strands arranged such that th e sugar- phosphate backbone lies on the outside and the bases on the inside. The key to th is structure is that the bases form specific base pairs (bp ) held togeth er by hydrogen bonds (Section 1.3 ): adenine pairs with thymine (A- T) and guanine pairs with cytosine (G - C), as shown in Figure 1.6. Iiydrogen bonds are much weaker th an covalent bonds such as the carbon-carbon or carbon-nitrogen bonds th at define the structures of the bases themselves. Such weak bonds are crucial to biochemical systems; they are weak enough to be reversibly broken in b iochemical processes, yet they are strong enough, when many form simultaneously, to help stabilize specific structures such as the double helix.

DNA Structure Explains Heredity and the Storage of Information The structure proposed by Watson and C rick has two properties of central importan ce to the role of DNA as the hereditary material. First, the structure is compatible with any sequence of bases. The base pairs have essentially the same shape (see Figure 1.6) and th us fit equall y well into the center of the double -helical structure of any sequen ce. Without any constraints, the sequence of bases along a DNA strand can act as an efficient means of storing information. Indeed, the sequence of bases along DNA strands is how genetic information is stored. T he DNA sequence determines the sequences of the ribonucleic acid (RNA) and protein molecules that carry out most of the activities within cells. Second, because of base-pairing, the seq uence of bases along one strand compl etely determ ines the sequence along the other strand. As Watson and Crick so coyly wrote: "It has not escaped our notice that the specific pairing we have postulated immediately suggests a possible copying mechanism for the genetic material. " Thus, if the DNA double helix is separated into two single strands , each strand can act as a template for the generation of its H I

o__-- -- H -

N

\---"

N

N

/

/'

o

NI=--=Z

_-0

N- H- - -- -

, H

Adenine (A)

Thymine (T)

Guanine (G)

CytOSine (C)

Figure 1.6 Watson-Crick base pairs. Ad enine pairs with thymine (A - T). and guani ne wit h cytosine (G-C). The dashed green lines represent hydrogen bonds.

7--~~~--

Newly synthesized strands

.)

CAT ,

G

T

A

T ," A

C G

(.,

"

partner strand through specifi c base- pair formati on (Figure 1.7). The threedimen sional structure of DNA beautifully illustrates the close connection between molecular form and function.

'y/

Figure 1.7 DNA replication. If a DNA

molecule is separated into two strands, eac h strand can act as t he te mplate fo r the generation of its partner strand .

1.3

Concepts from Chemistry Explain the Properties of Biological Molecules

W e have seen how a chemical insight, into the hydrogen -bonding capabiliti es of the bases of D N A , led to a deep und erstanding of a fundamental biological process. To lay the groundwork for the rest of the book, we begin our stud y of biochemistry by examining selected concepts from chemistry and showing how these concepts apply to biological system s. T he concepts include the types of chemical bond s; the structure of water, th e solvent in which most biochemical processes take place; the First and Second Laws of Th ermodyn amics; and the principles of acid- base chemistry. We will use these concepts to examine a archetypi cal bi ochemi cal process nam ely, the formation of a DNA double helix from its two component strands. Th e p rocess is bu t one of m any examples that could have been chosen to illustrate these topics. Keep in mind that , although the specific discussion is about D NA and double- helix formation, the concepts considered are quite general and will apply to many other classes of molecules and processes that we will di scuss in the remainder of the book. The Double Helix Can Form from Its Component Strands The discovery that DNA from natural sources ex ists in a doubl e- heli cal form with Watson- Crick base pairs suggested, but did not prove, that such doubl e heli ces would form spontaneou sly outside biological system s. Suppose that two short strands of D NA were chemi call y synth es ized to have complementary sequences so that they could , in principl e, form a double hel ix with Watson- C rick base pairs. Two such sequences are C GAT T A AT and ATTAATCG . Th e structures of th ese molecul es in solution can be examined by a variety of techniques. In isolation, each sequence exists almost exclu sively as a single -stranded molecule. H owever, when the two seq uences are mixed, a double helix with W atson- C rick base pairs does form (Figure 1. 8). This reaction proceeds nearly to completion. If each of the strands are initially present at equal concentration s of 1 mM , then more than 99.99% of the strands are in th e doubl e heli x at 25°C and in the pres ence of 1 M N aCI.

..··· .. G

Figure 1.8 Formation of a double helix. When two DNA strand s with appro priate,

complementary sequences are mixed, t hey spontaneously assemble to form a double hel ix.

G A T T A A T

c; ...... C T

+

<

"

A...... · T T ...... ·A T .. .. ·.. A A ....... T

T A

A...... · T I ·.. ·· ..A

What forces cause t he two strand s of D N A to bind to each other? To analyze this binding reaction, we must consider several factors: the types of interactions and bond s in biochemical system s and the energetic favorabil ity of the reacti on. W e also mu st consider the influ ence of the solu tion conditions in particular, the consequences of ac id- base react ions. 6

7

Covalent and Noncovalent Bonds Are Important for the Structure and Stability of Biological Molecules

1.3 Chemical Concepts

Atoms interact with one another through chemical bonds. These bond s in clude the covalent bonds that define the structure of molecules as well as a variety of noncovalent bonds that are of great importance to biochemistry.

Covalent Bonds. The strongest bonds are covalent bonds, such as the bonds that hold the atoms together within the individual bases shown on page 4. A covalent bond is form ed by the sharin g of a pair of electrons between adjacent atoms. A typi cal carbon-carbon (C-C) covalent bond has a bond length of 1.54 A and bond energy of 356 kJ mol I (8 5 kcal mol- I). Because covalent bonds are so strong, considerable energy must be expended to break them. M ore than one electron pair can be shared between two atoms to form a multiple covalent bond. For example, three of the bases in Figure 1.6 include carbon- oxygen (C 0) double bonds. Th ese bonds are even stron ger than C C sin gle bonds, with energies near 730 kJ mol - I (175 kcal mol- I) and are somewhat shorter. For some m olecules, more than one pattern of covalent bonding can be written. For example, adenine can be written in two equivalent ways called resonance structures. NH2 ;

__ 5

""" N (

/

•

/

N

Interat o mic distances and bo nd lengths are " units: usual ly mea sured in angstrom (A) 1 A - 10 -

10

m = lo- Hem = 0.1 nm.

Several energy units are in common use.

O ne jo ul e (1) is the amount of energy required t o move 1 meter against a force of 1 newton. A kilo joule (kJ) is 1000 joules. One cal orie is the amount of energy required t o

raise the temperature of 1 gram o f water 1 deglee Celsius. A kil oca lorie (kcat) is 1000 calories. One joule is equal to

0 .239 ca l.

NH2

H N

Distance and energy units

H

,

N H N~

/

."'"

H

N

T hese adenine structures depi ct altern ative arrangements of single and dou ble bonds that are possible within the same structural framework . Resonance structures are shown connected by a double- headed arrow. Adenine's true structure is a composite of its two resonan ce structures. The composite structure is manifested in the bond lengths such as that for the bond joining carbon atoms C- 4 and C- 5. The observed bond length of 1.40 A is between that expected for a C C single bond (1.54 A) and a C C double bond ( 1.34 A). A molecul e that can be written as several resonance structures of ap prox imately eq ual energies has greater stability than does a molecule without multiple resonance structures. o

No ncova lent Bond s. Noncovalent bond s are weaker than covalent bond s but are crucial for biochemical processes such as the formation of a double heli x. Four fundam ental noncovalent bond types are electrostatic interactions, hydrogen bonds, van der Waals interactions, and hydrophobic interactions. T hey differ in geometry, strength, and specificity. Furthermore, these bond s are affected in vastly different ways by the presence of water. Let us consider the characteristics of each:

Electrostatic Interactions. A charged group on one molecule can attract an oppositely charged group on another molecule . T he energy of an electrostati c interacti on is given by Coulomb 's law: 1.

r

where E is the energy, q, and q2 are th e charges on the two atoms (in units of the electronic charge), r is th e distan ce between the two atoms (in angstroms), 0 is the dielectric constant (which accounts for the effects of the

in tervening m edium ), and k is a p roportionality constant (k = 1389 , for en ergies in units of kilojoules mol - I, or 332 for energi es in kcal mol- I). By conventi on , an attractive interaction has a negati ve energy. The electrostatic interacti on between two ions bearing single opposite charges sepao rated by 3 A in water (which has a dielectri c constant of 80 ) has an energy of - 5.8 kJ mol 1 (-1 .4 kcal mol - I). Note how important th e dielectric con stant of the medium is. For the sam e ion s separated by 3 A in a nonpolar solvent such as hexane (which has a di electri c constant of 2), the energy of this interacti on is - 231 kJ mol- I (- 55 kcal m ol- ' ).

8 CHAPTER 1 Biochemistry: An Evolving Science

Hydrogenbond do no r

Hydrogenbond acce ptor

N 8

H -- - -- - - N 8+ 8-

N

H-- - -- - -O

o

H --- -- - -N

o

H------- O

Figure 1.9 Hydrogen bonds. Hydrogen bonds are depicted by dashed green lines. The posi t ions of th e parti al charges (0 + and 8- ) are shown.

2. Hydrogen Bonds. These interactions are fundam entall y electrostatic interaction s. H ydrogen bonds are responsible for specif ic base- pair formation in the DNA double helix. The hydrogen atom in a hyd rogen bond is shared by two electronegative atom s such as nitrogen or oxygen. The hydrogen-bond donor is the group th at includes both the atom to which the hydrogen is more tightly linked and the hydrogen atom itself, whereas the hydrogen-bond acceptor is the atom less tightly linked to the hyd rogen atom (Figure 1.9). The electronegati ve atom to which the hydrogen atom is covalently bonded pulls electron density away from the hydrogen atom , which thus develop s a partial positi ve charge (8 +) . T hus, the hydrogen atom can interact with an atom having a partial n egative charge (8- ) through an electrostatic interaction. H yd rogen bonds are much weaker than covalent bond s. They have en ergies ranging from 4 to 20 kJ mol- I (1- 5 kcal mol-I). H ydrogen bonds are also som ewhat lon ger than coval ent bonds; their bond lengths (measured o from the hydrogen atom) range [rom 1.5 A to 2.6 A; hence, a di stance ranging fro m 2.4 A to 3.5 A separates th e two n onhydrogen atoms in a hydrogen bond . Th e strongest hyd rogen bonds have a tendency to be approximately straight, such that the hydrogen -bond donor, the hydrogen atom , and the hydrogen -bond acceptor lie along a strai ght line. Hydrogen-bonding interactions are respon sible for many of the properti es of water that make it such a special solvent, as will be described shortl y. 0

Hydrogenbond donor

\ 09

N

Hydrogen-bond acceptor

A

2.0

A

!

H- -- - -- ·O

I

.-"0

-:J ~

va n der Waa ls contact distance

0-

>1)1) ~

'""

'"'"

Distance

0

LU

"-0

v

~

t::

'"

I I I I I

\. :

I

Figure 1.10 Energy of a van der Waals interaction as two atoms approach each other. The energy is most favorabl e at the van der Waals contact distance. Due to elect ron-electron repulSion, the energy rises rap idly as the ato ms approach cl o ser than t his d istance.

3. van der Waals fnteractions. The basis of a van der W aals interaction is that the distribution of electronic charge aro und an atom flu ctuates with time. At an y instant, the charge distributi on is not perfectly symmetric. This transient asy mmetry in the electronic charge about an atom acts throu gh electrostatic interaction s to induce a complementary asymm etry in the electron distribution within its neighboring atoms. T he atom and its neighbors then attract one another. This attracti on in creases as two atom s come closer to each other, until they are separated by the van der W aals contact distance (Figure 1.10). At di stan ces shorter than the van der Waals con tact di stance, very strong repul sive forces hecome dominant because the outer electron clouds of the two atoms overlap. E nergies associated with van der Waal s interactions are quite small; typical interactions con tribute from 2 to 4 kJ m ol 1 (0.5 to 1 kcal mol- I) per atom pair. Wh en th e surfaces of two large molecules come together, however , a large number of atoms are in van cler Waals contact, and th e net effect, summed over man y atom pairs, can be substantiaL Properties of Water_ Water is the solven t in which most biochemi cal reactiOl's take place, an d its properties are essential to the formation of macromolecul ar structures and the progress of chemi ca l reaction s. T wo properties of water are especiall y relevant:

1. Water is a polar molecule. The water molecul e is bent , not linear, and so the distribution of charge is asymmetric . The oxygen nucleus draws el ec tron s away from the two hydrogen nucl ei, which leaves the region around

each hydrogen nucleus with a net positive charge. The water molecul e is thus an electrically polar structure.

2. Water is highly cohesive. Water molecules interact strongly with one another through hydrogen bonds. These interactions are apparent in tl~e structure of ice (Figure 1.11 ). Networks of hydrogen bond s hold th e structure together; similar interactions link molecules in liquid water and account for the cohesion of liquid water , although, in the liquid state, approximately one-fourth of the hydrogen bonds present in ice are broken. The polar nature of water is responsible for its high dielectric constant of 80. Molecules in aqueous solution interact with water molecules through the formation of hydrogen bonds and through ionic interactions. These in teractions make water a versatile solvent, able to readily dissolve many species, especially polar and charged compounds that can participate in these interactIOns.

9 1.3 Chemical Concepts

Electric dipole

H.....O'- H

I:

•

I

~

Figure 1.11 Structure of ice. Hydrogen bonds (shown as dashed green lines) are formed between water molecu les to produce a highly ordered and open structure.

The Hydrophobic Effect. A final fundam ental interaction called the hydrophobic eff ect is a manifestation of the properties of water. Some molecul es (termed nonpolar molewles) cannot participate in hydrogen bonding or ionic interactions. The interactions of nonpolar molecul es with water molecules are not as favorabl e as are interactions between the water molecules themselves. The water molecules in contact with these n onpolar m olecules form "cages" around them, becoming more well ordered than water mole cules free in solution. However, when two such nonpolar molecules com e together, some of the water molecules are released , allowing them to interact freely with bulk water (Figure 1.12). The release of water from such cages is favorable for reasons to be considered shortly. The resuit is that

¥ Nonpolar molecule

Nonpolar molecule Nonpolar molecule Nonpolar molecule

»

t'

t'

Figure 1.12 The hydrophobic effect. The aggregation o f nonpolar groups in water leads t o the release o f water molecules, initially int eracting w ith th e nonpolar surface, into bulk wate r. The release of water molecules into solution makes th e aggregation of nonpolar groups favo rable,

nonpolar m olecul es show an increased tendency to associate with one another in water compared with other, less polar and less self-associating, solvents. This tend ency is called the hydrophobic effect and the associated interactions are called hydrophobic interactions.

The Double Helix Is an Expression of the Rules of Chemistry

Figure 1.13 Electrostatic interactions in DNA . Each un it within the double helix includes a phosphate group (the phosphorus atom being shown in purple) that bears a negative charge. The unfavorable interactions of one phosphate (also known as a phosphoryl group) with several others are shown by red lines. These repulsive interactions oppose the formation of a double helix.

L et us now see how these four noncova) ent interactions work together in driving the association of two strand s of DNA to form a d ouble h elix. First, each phosphate group in a DNA strand carries a negative charge. These negatively charged groups interact unfavorably with one another over di stan ces. T hu s, unfavo rable electrostatic interaction s take place when two strands of DNA come together. T hese phosphate groups are far apart in the double helix with di stances greater than loA, but many such interactions take place (Figure 1.13). Thus, electrostatic interactions oppose the formation of the double helix . The strength of these repul sive electrostatic interactions is diminished by the high dielectric con stant of water and the presence of io nic species such as Na + or Mg2 I ions in solution. These positively charged species interact with the phosphate groups and partly neutralize their negative charges . Second, we noted the importan ce of hydrogen bond s in determining the formation of specific base pairs in the double helix. H owever, in singlestranded DNA, the hydrogen -bond donors and acceptors are exposed to so lution and can form h ydrogen bonds with water molecules. ~ c/

~ c/

II

° • • ••

•

H I O- H

van der Waals

Figure 1.14 Base stacking. In the DNA double helix. adjacent base pairs are stacked nearly on top o f one ano ther. and so many atoms in each base pai r are separated by their van der Waals contact distance. Th e ce ntral base pair is shown in dark blue and the two adjacent base pairs in l ight blue. Several va n der Waa ls contacts are shown in red.

10

II

H, , H +

0 •• • • •

H I

/N~

0

•

H.. . ' H

0• •• • •

H ,H '0

H I /N~

When two single strands come together, these h ydrogen bonds with water are broken and new hydrogen bonds between the bases are formed . Because th e num ber of hydrogen bonds broken is the sam e as th e number formed , these hydrogen bonds do not contribute substantially to driving the overall process of d ouble -helix formati on. H owever, they contribute greatly to the spec ifi city of binding. Suppose two bases that cannot form Watson- Crick base pairs are brought together. Hydrogen bonds with water mu st be broken as the bases come into contact. Because the bases are not complementary in structure, n ot all of these bonds can be simulta neously replaced by hydrogen bonds between the bases. T hus. the formation of a d oubl e helix between noncomplementary sequences is disfavored . Third , within a double helix, the base pairs are parallel and stacked nearly on top of one another. The typical separation between the planes of adjacent base pairs is 3.4 A, and the di stances between the m ost closely approaching atoms are approximately 3.6 A. Thi s separation distance corresponds ni cely to the van der Waals contact distance (Figure 1.14). Bases tend to stack even in single-stranded DNA molecules. However, the base stacking and associated van der Waal s interactions are nearly optimal in a d ou b le- heli cal structure. Fourth, the hydrophobic effect also contributes to the favorability of base stacking. More- complete base stacking moves the nonpolar surfaces of the bases out of water into contact with each other. The principles of double -helix formation between two strands of DNA apply to many other biochemical processes. Many weak interactions con tribute to the overall energetics of the process, some favorably and some

unfavorably. F urthermore, surface complementarity is a key feature : when complementary surfaces meet, hydrogen-bond donors align with hydrogen bond acceptors and nonpolar surfaces come together to maximize van der Waals interactions and minimize nonpolar surface area exposed to the aqueous environment. The properties of water playa major role in determining the importance of these interactions.

The Laws of Thermodynamics Govern the Behavior of Biochemical Systems We can look at the formation of the double helix from a different perspective by examining the laws ' of thermodynami cs. These laws are general principles that apply to all physical (and biological ) processes. They are of great importance because th ey determine the conditions under which specific processes can or cannot take place. We will consider these laws from a general perspective first and then apply the principl es that we have devel oped to the formation of the double helix. The laws of thermodynamics distinguish between a system and its surroundings. A system refers to the matter within a d efined region of space. The matter in the rest of the universe is called the surroundings. The First Law oj Thermodynamics states that the total energy oj a system and its surroundings is constant. In other words, the energy content of the universe is constant; energy can be neither created nor destroyed . Energy can take different forms, however. Heat, for example, is one form of energy. Heat is a manifestation of the kinetic energy associated with the random motion of molecules. Alternatively, energy can be present as potential energy energy that wilJ be released on the occurrence of some process. Consider, for exam pie, a ball held at the top of a tower. The ball has considerabl e potential energy because, when it is released, the ball will develop kinetic energy associated with its motion as it fall s. Within chemical systems, potential energy is related to the likelihood that atoms can react with one another. For instance, a mixture of gasoline and oxygen has a large potential energy because these molecules may react to form carbon dioxide and water and release energy as heat. The First Law requires that any energy released in the formation of chemical bonds must be used to break other bonds, released as heat, or stored in some other form. Another important thermodynamic concept is that of entropy, a measure of the degree of randomness or disorder in a system. The Second Law of Thermodynamics states that the total entropy of a system plus that of its surroundings always increases. For example, the release of water from nonpolar surfaces responsible for the hydrophobic effect is favorable because water molecules free in solution are more disordered than they are when they are associated with nonpolar surfaces. At first glance, the Second Law appears to contradict much common experience, particularly about biological systems. Many biological processes, such as the generation of a well-defined structure such as a leaf from carbon dioxide gas and other nutrients , clearly increase the level of order and hence decrease entropy. Entropy may be decreased locally in the formation of such ordered structures only if the entropy of other parts of the universe is in creased by an equal or greater amount. The local decrease in entropy is often accomplished by a release of heat, which increases the entropy of the environment. We can analyze this process in quantitative terms. First, consider the system. The entropy (S) of the system may change in the course of a chemical reaction by an amount tlSsys,cm' If heat flows from the system to its surroundings, then the heat content, often referred to as the enthalpy (H), of the system will be reduced by an amount tlHsystcnr To apply the Second Law, we

11 1.3 Chemical Concepts

12 CHAPTER 1 Biochemistry: An Evolving Science

must determine the change in entropy of the surroundings. If heat fl ows from the system to the surroundings, then the entropy of the surroundings will increase. T he precise change in the entropy of the surroundin gs depends on the temperature; the change in entropy is greater when heat is added to relatively cold surroundings than when heat is added to surroundings at high temperatures that are already in a high degree of disorder. To be even more specific, the change in the entropy of the surroundings will be proportional to the amount of heat transferred fro m the system and inversely proportional to the temperature (T) of the surroundi ngs. In biological systems, T [in kelvins (K), absolute temp erature] is usually assumed to be constant. Thus, a change in the entropy of the surroundings is given by (1)

The total entropy change is given by the expression ~ Stotal = ~Ssystem

+

A S s ulTo unJings

(2)

Substituting equation 1 in to equation 2 yields (3)

Multiplying by

~T

gives (4 )

The function -T!lS has units of energy and is referred to asfree energy or Gibbs free energy, after Josiah Willard G ibbs, who developed this function in 187R: (5)

The free-energy change, !l G, will be used throughout thi s book to describe the energetics of b iochemical reactions. Recall that the Second Law of Thermodynamics states that, for a process to take place, the entropy of th e universe must increase. Examination of equation 3 shows that the total entropy will increase if and onl y if (6)

Rearranging gives T !lS ,ystem crease if and only if

>

!lH, or, in other words, entropy will in-

(7)

Thus, the free-energy change must be negative for a process to occur spontaneously. Th ere is negative free -energy change when and only when the overall entropy of the universe is increased. The free energy represents a sin gle term that takes into account both the entropy of the system and the entropy of the surroundin gs. Heat Is Released in the Formation of the Double Helix

L et us see how the principles of thermodynamics apply to th e formation of the double helix (figure 1.15). Suppose solutions containing each of the two single strands are mixed. Before the double helix forms, each of the single strands is free to translate and rotate in solution, whereas each matched pair

13 1.3 Chemical Concepts

( 11111111

c c

I,)UI-<
,,• ,••

Mixing

c c

,, •, 1

•

Reacting

c·.. c

G" . C

A· •. T

r "' A r" 'A A •• , T A·· , T T .•• "

( '1'11111

. . . . -1. "0 .• .

..... >O . .... .... > C'lr'I

:• •• •• •• •• > .... .... »

• •

•

•

•

' "11111)

c" ' Co

G"' C A. •• • T { , •• II> 1 .. . ,. " . .• 1

,. " . 1 1 '" A

of strands in the double helix mu st move together. F urthermore, the free single strands ex ist in more conformations than possible when bound to gether in a d ouble helix. T hus. the formation of a double helix from two single strands appears to result in an in crease in order for the system. On the basis of this analysis. we expect that the double helix cannot form without vi olating the Second Law of Thermodynamics unless heat is released

Figure 1.15 Double-helix formation and entropy. When solutions contai ning DNA strands with com plem entary sequences are mixed, the stra nds react t o fo rm double helices. This process results in a loss of entropy from the system, indicating that heat must be released to the surroundings to avoid vio lating the Second Law of Thermodynamics.

14

-------

CHAPTER 1 Biochemistry: An Evolving Science

to increase the entropy of the surroundings. Experimentally, we can measure the heat released by allowing the solutions containing the two single strands to come together within a water bath, which here corresponds to the surroundings. We then determine how much heat mu st be absorbed by the water bath or released from it to maintain it at a constant temperature. For these sin gle DNA strands at 25 °C and at pH 7.0 in 1 M NaCl, this experiment reveals that a substantial amount of heat is released namely, approximately 250 kJ m ol - I (60 kcal mol t). This experimental result reveals that the change in enthalpy for the process is quite large, -2 50 kJ mol - t, consistent with our expectation that significant heat would have to be released to the surroundings for the process not to violate the Second Law. We see in q uantitative terms how order within a system can be increased by releasing sufficient heat to the surroundings to ensure that the entropy of the universe increases. We will encounter this general theme again and again throughout this book. Acid-Base Reactions Are Central in Many Biochemical Processes

Throughout our consideration of the formation of the double helix, we have dealt only with the noncovalent bonds that are formed or broken in this process. Many biochemical processes entail the formation and cleavage of covalent bonds. Of these, a particularly important class of reactions prominent in biochemistry is acid-base Teactions. In acid and base reactions, hydrogen ions are added to molecules or removed from them. A hydrogen ion, often written as H + , corresponds to a bare proton. In fact, hydrogen ions ex ist in solution bound to water molecules, thus forming what are known as hydmnium ions, H 3 0 + . For simplicity, we will continue to write H + , but we should keep in mind tbat H + is shorthand for the actual species present. The concentration of hydrogen ion s in solution is expressed as the pH. Specifically, the pH of a solution is defined as

where [H +] is in units of molarity. T hu s, pH 7.0 refers to a solution for which - log[H '] = 7.0, and so 10g[H +] = - 7.0 and [H +] = 10'og[I-I '1 = 10 - 70 = 1.0 X 10 - 7 M . The pH al so indirectly expresses the concentrati on of hydroxide ions, [OH - ], in solution. To see how, we must realize that water molecules dissoI ciate to form H - and O H - ions in an equilibrium process.

The equilibrium constant (K ) for the dissociation of water is defined as

°.

and has a value of K = 1.8 X 10 - 1 Note that an eq uilibrium constant does not formally have units. Nonetheless, the value of the equilibrium constant given assu mes that particular units are used for concentration; in this case and in most others, units of molarity (M) are assumed. T he concentration of water, [H 2 0) , in pure water is 55. 5 M, and thi s concentration is constant under most conditions. Thus, we can define a new con stant, K w: Kw

K[H :!O]

=

K[HzO)

= 1. 8 X = 1.0 X

=

10 10 -

[H +][OH - ) 16 14

X 55.5

BecauseK w = [H +)[OH ) = 1.0 X 10 [OH - )

= 1O -

14

/[H +)

and

14

,

1.0

we can calculate

[H+)

= 10 -

14

.-c

~

/[OH - ).

-

E 0.8

-::J 0 u - '" Q)

Q) -

With these relations in hand, we can easily calculate the concentration of hydroxide ions in an aqu eous solution given the pH. For example, at pH = 7.0, we know that [H +) = 10 - 7 M and so [OH - ] = 10 - 14 110 - 7 = 10 - 7 M. 7 In acidic solutions, the concentration of hydrogen ions is higher t h an 10 and, hence , the pH is below 7. For example, in 0.1 M Hel, [H + ) = 10- 1 M 14 and so pH = 1.0 and [OH ) = 10 - / 10 - 1 = 10 - 13 M .

o.~

0.6

E Qi - .<: o Q)' c - 0.4

.0 _ -" ::J

t ",o ,,

-

u.

0.2

8

10

11

pH

Acid-Base Reactions Can Disrupt the Double Heli x The reaction that we have been considering between two strands of DNA to form a double helix takes place readily at pH 7.0. Suppose that we take the solution containing the double -helical DNA and t reat it with a solution of concentrated base (w ith a high concentration of OH ). As the base is added, we monitor the pH and the fraction of DNA in doubl e-helical form (Figure 1.16). When the first additions of base are made, the pH rises, but the concentration of the double- helical DNA d oes not ch an ge signifi cantly. However, as the pH approaches 9, the DNA dou ble helix begin s to dissoci ate into its component single strands . As the pH continu es to rise from 9 to 10, this dissociation becomes essentiall y complete. Why do the two strands dissociate? The hydrox ide ions can react with bases in DNA base pairs to remove certain protons. The most susceptib le proton is t he on e bound to the N -1 n itrogen atom in a guanine base.

o

9

Figure 1.16 DNA denaturation by the addition of a base. The addition of base to a solution of double-helical DNA initially at pH 7 causes the double helix t o separate into single strand s. The process is half complete at slightly abo ve pH 9.

o 'N

+

H

+

Guanine (G)

Proton dissociation for a substance HA has an equilibrium constant defined by the expression

The susceptibility of a proton to removal by reaction with base is described by its pKa value: pKa = - log( Ka) When the pH is equal to the pK. , we have pH = pK. and so

and

Dividing by [H + ] reveals that

1 = [A - ]/ [HA ]

15

and so

16 CHAPTER 1 Biochemistry: An Evolving Science

[A - l

=

[HA l

Thus, when the pH equals the pKa' the concentration of the deprotonated form of the group or molecule is equal to the concentration of the proto nated form; the deprotonation process is halfway to completion. The pKa for the proton on N -1 of guanine is typically 9.7 . When the pH approaches this value, the proton on N -1 is lost (see Figure 1.16). Because this proton participates in an important hydrogen bond, its loss substantially destabilizes the DNA double helix . The DNA double helix is also destabilized by low pH. Below pH 5, some of the hydrogen bond acceptors that participate in base-pairing become protonated. In their protonated forms, these bases can no longer form hydrogen bonds and the double helix separates. Thus, acid- base reactions that remove or donate protons at specific positions on the DNA bases can disrupt the double helix. Buffers Regulate pH in Organisms and in the Laboratory

12 10 0.1 mM Na+CH, COO8 I

0.

6

Gradual pH change

4

2

Water

00"---:1'=-0--=20

30

40

50

60

Number of drops Figure 1.17 Buffer action. The addition of a st rong acid, 1 M HCl, to pure water results in an immediate drop in pH to near 2. In contra st, the add iti o n of the acid t o a 0.1 M sodi um acetate (Na I CHJCO(T) sol ution results in a much more gradual change in pH until the pH dro ps below 3.5.

A significant change in pH can disrupt molecular structure and initiate harmful reactions. Thus, systems have evolved to mitigate changes in pH in biological systems. Solutions that resist such changes are called buffers. Specifically, when acid is added to an unbuffered aqueous solution, the pH drops in proportion to the amount of acid added. In contrast, when acid is added to a buffered solution, the pH drops more gradually. Buffers also mitigate the pH increase caused by the addition of base. Compare the result of adding aiM solu tion of the strong acid HCl drop by drop to pure water with adding it to a solution containing 100 mM of the buffer sodium acetate (Na +CH 3 COO - ; Figure 1.17). The process of gradually adding known amounts of reagent to a solution with. which the reagent reacts while monitoring the results is called a titration. For pure water, the pH drops from 7 to close to 2 on the addition of the first few drops of acid . However, for the sodium acetate solution, the pH first falls rapidly from its initial value near 10, then changes more gradually until the pH reaches 3.5, and then falls more rapidly again. Why does the pH decrease gradually in the middle of the titration? The answer is that, when hydrogen ions are added to this solution, they react with acetate ions to form acetic acid . This reaction consumes some of the added hydrogen ions so that the pH does not drop. Hydrogen ions continue reacting with acetate ions until essentially all the acetate ion is converted into acetic acid. After this point, added protons remain free in solution and the pH begins to fall sharp Iy again. We can analyze the effect of the buffer in quantitative terms. Th.e equilibrium constant for the deprotonation of an acid is

Taking logarithms of both sides yields 10g(K. )

10g([H + ]) + 10g([A - J/[ HAJ) .

=

Recalling the definitions of pKa and pH and rearranging gives pH

=

pKa

+ 10g([A-J/[ HAJ).

This expression is referred to as the Henderson- Hasselbalch equation.

We can apply the eq uation to our titration of sodium acetate. The pKa of acetic acid is 4.75. We can calcu late the ratio of the concentration of acetate ion to the concentration of acetic acid as a function of pH by using the Henderson- Hasselbach eq uation, slightly rearranged. [Acetate ion )l [acetic acid ) = [A -)1 [HA ) = 1OPH -

17 1.4 The Genomic Revolution

12

pK.,

10

At pH 9, thi s ratio is approximately 18,000; very little acetic acid has been formed. At pH 4.75 (when the pH equals the pK,l, the ratio is 1. At pH 3, the ratio is approx imately 0.02; almost all of the acetate ion has been con verted into acetic acid. We can follow the conversion of acetate ion into acetic acid over the entire titration (figure 1.18). The graph shows that the region of relatively constant pH corresponds precisely to the region in which acetate ion is being protonated to form acetic acid. From this discussion, we see that buffers function best close to the pKa values of their acid component. Physiological pH is typically about 7.4. An important buffer in biological systems is based on phosphoric acid (H 3 P0 4 ). The acid can be deprotonated in three steps to form a phosphate

r---1100%

8 I

"'-

6 4

2

o '----- _ o

10

--"---_L-_ ------'----' 0%

20

30

40

50

60

Number of drops

•

Ion . H+

/

' HPO 4 2 -

pK. = 7.2 1

Figure 1.18 Buffer protonation. When acid is added to sodium acetate, the added hydrogen ions are used to convert

H+

--:==:::/==' PO 3 -

, -

4

pK. - 12.67

acetate ion into aceti c acid. Because the proton con centrat ion does not increase

significaotly, the pH remains relatively constant until all of the acetate has been converted into acetic acid.

At about pH 7.4, inorganic phosphate exists primarily as a nearly equal mixture of H 2 P0 4 - and I-IPO/ . Thus, phosphate solutions function as effective buffers near pH 7.4. Th e concentration of inorganic phosphate in blood is typica ll y approx im ately 1 mM, providing a useful buffer against processes that produce either acid or base.

1.4

The Genomic Revolution Is Transforming Biochemistry and Medicine

Watson and Crick's di scovery of the structu re of DNA suggested the hypothesis th at h ereditary information is stored as a sequence of bases along long strands of DNA. This remarkable insight prov ided an entirely new way of thinking about biology. However, at the time that it was made, their discovery was full of potential but the practical consequences were unclear. Tremendously fundam ental q uestions remained to be addressed. Is the hypothesis correct? How is the sequ ence information read and translated into action? What are the sequences of naturally occurring DNA molecules and how can such sequen ces be experimentally deter mined? Through advances in biochemistry and related sciences, we now have essent iall y complete answers to these q uest ion s. Tn deed, in the past decade, scientists have determined th e comp lete genome sequences of hundreds of different o rganisms, including simple microorgani sms , plants, animals of varying degrees of complexity, and hum an beings. Compari son s of these genome sequen ces with the use of methods intro duced in Chapter 6 have been sources of insight into many aspects of biochemistry. Because of these achievements, biochemistry has been transformed. In addition to its experimental and clini cal aspects, biochemi stry has now become an information science.

18 CHAPTER 1 Biochemistry: An Evolving Science

The Sequencing of t he Human Genome Is a Landmark in Human History

The sequenci ng of the human genome was a daunting task because it contain s 9 approximately 3 billion (3 X 10 ) base pairs. For example, the sequence ACATTTGCTTCTGACACAACTGTGTTCACTAGCAACCTC AAACAGACACCATGGTGCATCTGACTCCTGAGGAGAAGT CTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGA. .. is a part of one of the genes that encodes hemoglobin, the oxygen carri er in our blood . This gene is found on the end of chromosome 9 among our 24 distinct chromosomes. If we were to include the complete sequence of our entire geno me, this chapter would run for more than 500,000 pages. The sequencing of our genom e is truly a landmark in human history. This seq uence contains a vast amount of information, some of which we can now extract and interpret, but much of which we are only beg innin g to understand . for example, some hum an diseases have been linked to particular variations in genomic seq uence. Sickle -cell anemi a, discussed in detail in C hapter 7, is ca used by a single base change of an A (noted in boldface type in the preceding sequence) to a T. W e will encounter many other examples of diseases that have been linked to specific DNA sequence changes. In addition to the implications for understanding human health and d isease, the genome seq uence is a source of deep insight into other aspects of human biol ogy and culture. for example, by comparin g the sequences of different individual persons and populations, we can learn a great deal about human history. On the basis of such analysis, a co mpelling case can be made that the human species originated in Africa, and the occurrence and even the timing of important migrations of groups of human beings can be demonstrated. finally, compar iso ns of the human genome with the genomes of other organisms are confirming the tremendous unity that exists at the level of biochemistry and are revealing key steps that have been taken in the course of evolution from relatively simple, single-celled organisms to complex, multicellular organisms such as human beings. For example, many genes key to the function of the human brain and nervou s system have evolutionary and fu nctional relatives that can be recognized in the genomes of bacteria. Because many studies that are possible in model organisms are difficult or unethical to conduct in human bei ngs, these discov eries have many practical implica tions. Cumparative genomics has become a powerful science, linking evol uti on and biochemistry. Genome Sequences Encode Proteins and Patterns of Expression

The structure of DNA revealed how information is stored in the base sequence along a DNA strand . But what information is stored and how is this information expressed? The most fundam ental role of D A is to encode the sequences of proteins. Like DNA , proteins are linear polymers. However, proteins differ from DNA in two important ways. First, proteins are built from 20 building blocks, called amino acids, rather than just four , as are present in DN A . The chemical co mpl exity provided by this variety of building blocks enables proteins to perform a wide range of functions. Second, proteins spontaneously fold up into elaborate three-dimensional stru ctures, determined only by their amino acid seq uences (Figure 1.19). W e have explored in depth how solutions containing two appropriate

19 1.4 The Genomic Revolution

;)

I Amino acid sequence 1

Figure 1.19 Protein folding. Proteins are linear polymers of am ino aci ds that fold ;)

1

into elaborate structures. The sequence

of amino acids determines the threeAmino acid sequence 2

dimensional structure. Thus amino acid sequence 1 gives rise only to a protein

w ith the shape depicted in blue. not the shape depicted in red.

strands of DNA come togeth er to form a sol ution of double -helical mole cules. A similar spontaneous folding process gives proteins their threedimensional structure. A balance of hydrogen bonding, van der Waals in teraction s, and hydrophobic interactions overcome the entropy lost in going from an unfolded ensemble of proteins to a homogenous set of well-folded molecules. Proteins and protein folding will be di scussed extensively in Chapter 2. The fundam ental unit of hereditary information , the gene, is becoming increasingly difficult to precisely define as our knowledge of the complexities_of genetics and genomi cs increases. The genes that are simplest to de fine encode the sequences of proteins. For these protein-encoding ge nes, a block of DNA bases encod es the amino acid seq uence of a specific protein molecule. A set of three bases along the DNA strand , called a cudun, determines the ident ity of one amino acid within the protein sequ en ce. The rela tion that links the DNA seq uence to the encoded protein sequence is called the genetic cude . O ne of the biggest surprises from the sequencing of the human genome is the small number of protein -encoding genes. Before the genome-seq uencing proj ect began, the consensus view was that the human genome would include approximately 100,000 protein-encoding genes. T he current analysis suggests that the actual number is between 20, 000 and 25,000. W e shall use an estimate of25,000 throughout this book. However, additional mechanism s allow many genes to encode more than one protein. For example, the genetic information in some genes is translated in more than one way to produce a set of proteins that differ from one another in parts of their amino acid sequences. I n other cases, proteins are modified after they are synthesized throu gh the addition of accessory chemical groups . Through these indirect mechanism s, much more complexity is en coded in our genomes than would be expected from the number of protein encoding genes alone. O n the basis of current knowledge, the protein-encoding regions ac count for onl y about 3% of the human genome. Wh at is th e function of the rest of the DNA? Som e of it contains information that regulates the expression of specific genes (i.e., the production of specific proteins) in pa rti cul ar cell types and physiological conditions. Essentiall y every cell contains the same D A genome, yet cell types differ considerably in the proteins that they produce . For example, hemoglobin is expressed only in precursors of red b lood cells, even though the genes for hemoglobin are

-

20 CHAPTER 1 Biochemistry: An Evolving Science

present in essentially every cell. Specific sets of genes are expressed in response to hormones, even though these genes are not expressed in the same cell in the absence of the hormon es. The control regions that regulate such differences account for only a small amount of the remainder of our genomes. The truth is that we do not yet understand all of the fun ction of much of the remainder of the DNA. Some of it appears to be "junk," stretches of DNA that were inserted at some stage of evolution and have remained. In some cases, this DNA may, in fact , serve important functions. In others, it may serve no function but, because it does not cause significant harm, it has remained. Individuality Depends on the Interplay Between Genes and Environment

With the exception of monozygotic ("identical" ) twins, each person has a unique sequence of DNA base pairs. How different are we from one another at the genomic level? An examination of variation across the genome reveals that, on average, each pair of individual people has a different base in one position per thousand; that is, the difference is approximately 0.1 %. This person -to -person variation is quite substantial compared with differences in populations. Thus, the average difference between two people within one ethnic group is greater than the difference between the averages of two different ethnic groups. The significance of much of this genetic variation is not understood. As noted earlier, variation in a single base within the ge nome can lead to a disease such as sickle-cell anemia. Scientists have now identified the genetic variations associated with more than 100 diseases for which the cause can be traced to a single gene. for other diseases and traits , we know that variation in many different genes contributes in significant and often complex ways. Many of the most prevalent human ailments such as heart disease are linked to variations in many genes. Furthermore, in most cases, the presence of a particular variation or set of variations does not inevitably result in the onset of a disease but, instead, leads to a predisposition to the development of the disease. In addition to these genetic differences, epigenetic factors are important. Th ey are factors associated with the genome but not simpl y represented in the sequen ce of DNA. For example, th e consequences of some of this genetic variation depend, often dramatically, on whether the unusual gene sequence is inherited from the mother or [rom the fat her. This phenomenon, known as genetic imprinting, depends on the covalent modification of DNA, particularly the addition of methyl groups to particular bases. Epigenetics is a very active field of study and many novel discoveries can be expected . Although our genetic makeup and associated epigenetic characteristics are important factors that contribute to disease susceptibility and to other traits, factors in a person's environment also are signifi cant. What are these environmental factors? Perhaps the most obvious are chemicals that we eat or are exposed to in some other way. The adage "you are what you eat" has considerable validity; it applies both to substances that we ingest in significant quantities and to those that we ingest in only trace amounts. Throughout our study of biochemistry, we will encounter vitamins and trace elements and their derivatives that play cru cial roles in many processes. In many cases, the roles of these chemicals were first revealed through investigation of deficiency disorders observed in people who do not take in a sufficient quantity of a particular vitamin or trace element. Despite the fact that the most important vitamins and trace elements have

Figure 1.20 Food pyramid. A healthful diet includes a balance of food groups to supply an appropriat e supply of calories and an appropriate mixture of biochemica l build ing blocks. [Courtesy of the U. s. Department of Agriculture.]

Grains

Vegetables Fruits Oils Milk

Meats and beans

been known for some time, new roles for these essential dietary factors con ti nue to be di scovered. A healthful diet requires a balance of major food groups (Figure 1. 20). In addition to providing vitamins and trace elements, food provides calories in the form of substances that can be broken down to release energy to drive other biochemical processes. Proteins, fats, and carbohydrates provide the building blocks used to construct the molecules oflife. Finally, it is possible to get too much of a good thing . Human beings evolved under circum stances in which food , particularly rich foods such as meat, was scarce. With the devel opment of agriculture and modern economies, rich foods are now plentiful in parts of the world . Some of the most prevalent diseases in the so-call ed devel oped world, such as heart disease and diabetes, can be attributed to the large quantities of fats and carbohydrates that are present in modern diets. We are now developing a deeper understanding of the biochemical consequences of these diets and the interplay between diet and genetic factors. Chemicals are only one important class of environmental factors. The behaviors in which we engage also have b iochemical consequences. Through physical activity, we consume the calories that we take in, ensuring an appropriate balance between food intake and energy expenditure. Activities ranging from exercise to emotional responses such as fear and love may activate specifi c biochemical pathways, leading to changes in levels of gene expression, the release of hormones, and other consequences. For example, recent discoveries reveal that high stress levels are associated with the shortening of telomeres, structures at the ends of chromosomes. Furthermore, the interplay between biochemistry and behavior is bidirectional. Just as our biochemistry is affected by our behavior, so, too, our behavior is affected, although certainly not completely determined, by our genetic makeup and other aspects of our biochemistry. Genetic factors associated with a range of behavioral characteristics have been at least tentatively identified. Just as vitamin deficiencies and genetic diseases revealed fundamental pri ncipl es of bioch emistry and biology, investigations of variations in behavior and their linkage to genetic and biochemical fac tors are potential sources of great insight into mechani sms within the brain . For example, stud ies of drug addiction have revealed neural circuits and biochemical pathways that greatly influence aspects of behavior. Unraveling the interplay between biology and behavior is one of the great challenges in modern science, and biochemistry is providing some of the most important concepts and tools for this endeavor.

21 1.4 The Genomic Revolution

22

CHAPTER 1 Biochemistry: An Evolving Science

APPENDIX: Visualizing Molecular Structures I: Small Molecules T he authors of a biochemistry textbook face the problem of trying to present three- dlmensional molecules in the two dimensions avail able on the printed page. The in terplay between the three-dimension al structures ofbiomolecul es and their biological functions will be discussed extensively throughout this book. Toward this end , we wi ll frequ entl y use representations that, although of necessity are rendered in two dimensions, emphasize the three-dimensional structures of molecules. Stereochemical Renderings

Most of the chemical formu las in this book are drawn to depict the geometric arrangement of atoms, crucial to chemical bonding and reactivity, as accurately as possible. For exampl e, the carbon atom of methane is sp' hybridized and tetrahedral, with H-C-H angles of 109 .5 degrees, where.c'ls the carbon atom in formaldehyde is sp2 hybridized with bond angles of 120 degrees.

,

o

H ".. H H

/ c" H

H/

Methane

Formaldehyde

To illustrate the correct stereochemistry about tetrahedral carbon atoms, wedges will be used to depict the direction of a bond into or out of the plane of the page. A solid wedge with the broad end away from the carbon atom denotes a bond comin g toward the viewer out of the plane. A dashed wedge, with its broad end at th e carbon atom, represents a bond going away from the viewer behind the pl ane of the page. The remaining two bonds are depicted as strai ght lines. Fischer Projections

Although representative of the actual structure of a com pound, stereochemical stru ctures are often diffi cult to draw quickl y. An alternative, less -representative method of depicting structures with tetrahedral carbon centers reiies on tl,e use of Fischer projections. W

z

W

x_ y

;

z-~-x

•

Z =

y/

W

\/ '/-

"x

y

Fischer

projection

Molecular Models for Small Molecules

For depicting the molecular architecture of small molecul es in more d etail, two types of models will often be used : space filling and ball and stick. These models show structures at the atomic level. 1. Space -Pilling Models. The space-filling models are

~" H

,

In a Fischer projection, the bonds to the central carbon are represented by horizontal and vertical lines from the substituent atoms to the carbon atom, which is assumed to be at the center of the cross. By convention, the hori zontal bonds are assumed to proj ect out of the page toward the viewer, whereas the vertical bonds are assumed to project behind the page away from the viewer. The G lossary of C ompounds found at the back of the book is a structural glossary of the key molecules in biochemistry, each presented in two forms: with stereochemi cally accu rate bond angles and as a Fisher projection.

Stereochemical rendering

the most realistic. The size and position of an atom in a space-filling model are determined by its bonding properties and van der Waals radius, or contact dis tance. A van der Waal s radiu s describes how closely two atoms can approach each other when they are not linked by a covalent bond. The colors of the model are set by conventi on. C arbon , black H ydrogen, white N itrogen, blue Oxygen, red Sulfur , yellow Phosphorus, purple Space-fi lling model s of several simple mol ecul es are shown in Figure 1. 21.

2. Ball-and -Stick Models. Ball -al1d -stick models are not as realistic as space- fillin g models, beca use the atoms are depicted as spheres of radii smaller than their van der Waals radii. However, the bonding arrange m ent is easier to see because the bonds are ex plicitly represented as sticks. in an illustration, the taper of a stick, representing parallax , tell s which of a pair of bonded atoms is closer to the reader. A ball -and -stick m odel reveals a complex structure more clearly than a space-filling mod el does. Ball-and -stick models of several simple molecules are shown in Figure 1.21 . Molecular models for depicting large mol ecul es will be discussed in the appendix to C hapter 2.

Key Terms biological mac romolecule (p . 2)

protein (p . 2)

Archaea (p . 3)

metabol ite (p . 2)

Eukarya (p ..1)

eukaryote (p. 3)

deoxyri bonucleic acid (D N A ) (p . 2)

Hacteria (p . 3)

prokaryote (p. 3)

Problems Water

Acetate

Formamide

23

Cysteine

SH

o

H, C -

H20

;,'

C'

~,

/

H

H

H2N - - C

~

o

o

Ii o

Figure 1.21 Molecular representations. Structural formulas (bottom), ball ~and~stick models (middle), and space~filling representations (top) of selected molecules are shown. Black carbon, red 0 oxygen, white 0 hydrogen, yellow 0 sulfur, blue 0 nitrogen. 0

double helix (p. 5)

hydrophobic effect (p . 9)

b uffer (p. 1 (,)

covalent bond (p . 7)

entropy (p. 11 )

amino acid (p. 18 )

resonance structure (p . 7)

enthalpy (p . 11 )

genetic code (p . 19 )

electrostatic interaction (p . 7)

free energy (p . 12)

pred isposition (p. 20 )

hydrogen bond (p. 8)

pH (p . 14)

van de r Waals interaction (p. 8)

pK, (p, 15)

Problems 1. Donors and acceplors. Identify lhe hydrogen -bond donors and accepto rs in each of the fo ur bases on page 4.

2. Resonance structures. The structure of an amino acid, tyrosine, is shown here. Draw an alternative reso nance s tru cture. H

4 . Don't break the law . G iven the following val ues for the changes in enthalpy (611) and en tropy (6 S), which of the fol ~ lowing processes can occur at 298 K without violating the Second Law of Th ermod ynami cs? (a)

H H

3. It takes all types. What types of noncovalent bonds hold together the following solids'

(a) Table salt (NaCI), which conta ins Na + and CI - ions. (b) Graphite (e) , which consists of sheets of covalentl y bonded carbon atom s.

MI = - 84 kJm ol- ' ( -2 0 kcalmol- ' ),

6S = + 125 .111101- 1 K- I (+ 30caII1101(h ) 6H = - H4 kJ mol 1 ( - 20 kcal mol ') , 6 S = - 125 JI1101 - 1 K- I ( - 30caI I1101(c) 6H = +84kJmol I (+ 20kcalmol ' ), 6 8 = + 125 Jmol - ' K - ' ( + 30calmol(d) 6[-[ = +84 kJ mol- I (+20 kcal mol- I), 6 S = - 125 J 11101- 1 K - I ( - 30 cal 11101-

1

K- ' )

1

K- ' )

' K- ' ) 1

K - I)

24

CHAPTER 1 Biochemistry: An Evolving Science

5. Double- helix-formation entropy. For double- helix formation , {)'C can be measured to be -54 kJ mol - ' (- 13 kcal mol- 1 ) at pH 7.0 in 1 M N aC I at 25°C(29R K). The heat released indicates an enthalp y change of -25 1 kJ mol - ' ( - 60 kcal mol - '). For thi s process, calculate t he entropy change for the system and the entropy change ror the universe.

6. Find the pi I . What are the pH values for the following solutions?

(a) (b) (e) (d )

0.1 M HCI 0.1 M NaOH 0.05 M H e l Il.OS M NaO H

7. A weak acid . What is the pH of a O. 1 M solution of acetic acid (pK. = 4.75)? (Hint: Let x be the concentration ofH + ions released [rom acetic acid when it dissoc iates. The solutions to a quadratic c uation of the form ax2 + bx + c = 0 are x = (- b :!: h2 - 4ac)/2a.

8. fi nd the pK,. r or oJl acid HA , t he concentrations of HA and 11 - are 0.075 and 0.025, respectively, at pI! 6.0. What is the pK, value for H A ?

9. pH indicator. A dye that is an acid and that appears as differ-

ent colors in its protonated and deprotonated forms can be used as a pH indicator. Suppose that yo u have an 0.00 1 M solution of a dye with a pK. of 7.2. From the color, the concentration of the protonated form is found to be 0.0002 M . Assume that the remainder of the dye is in the d eprotonated form. What is the pH ofthe solution ? 10. What 's the ratio ? An acid with a pK. of8.0 is present in a so-

lution with a pH of 6.0. What is the ratio of the proton"ted to the deprotonated form of the acid? 11. Phosphate buffer. What is the ratio of the concentration s of

H 2 PO, - and HPO / - at (a) pH 7.0; (b ) pH 7.5; (c) pH 8.0? 12. Buffer capacity . Two solutions of sodium acetate are prepared, one with a concentration of 0. 1 M and one with a concentration of 0.0 1 M. Calcul ate the pH values when the following concentration s of H C I have been added to each of these solu tions : 0.0025 M, 0.005 M , 0.0 1 M, and 0.05 M. 13. Viva Ie diffeTence. On average, how many base differences are there between two human heings?

C h apter

2

Protein Composition

and Structure Crystals of human insulin. In sulin is a protein hormone, crucial for

maintaining blood sugar at appropriate levels. (Below) Chain s of amino acid s in a specific sequence (the primary struct ure) define a protein such as insulin. These chains fold into well-defined structures (the tertiary structure)- in this case, a single insulin molecule. Such structures assemble with other chain s to form arrays such as the complex of six insulin

molecules shown at the far right (the quarternary structure). These arrays can often be induced to form well -defined crystals (photograph at left), which allows determinati on of these structu res in detail. [Phot ograp h from Alfred Pasieka / Peter Arnold.]

C Primary

Secondary

structure

structure

Tertiary structure

roteins are the most versatile macromolecules in living systems and serve crucial functions in essentially all biological processes . They func tion as catalysts, transport and store other m olecules such as oxygen, pro vide mechanical support and immune protection, generate movement, transmit nerve impulses, and control growth and differentiation. Indeed, much of this book will focus on understanding what proteins do and how they perform these functions. Several key properties enable proteins to participate in a wide range of functions.

Quarternary structure

Outl i n e 2.1

Proteins Are Built from a Repertoire

of 20 Amino Acids 2.2 Prima ry Structure: Amino Acids Are Linked by Peptide Bonds to Form Polypeptide Chains 2.3 Secondary Structure: Polypeptide Chains Can Fo ld into Regular Structures Such As the Alpha Helix,

1.

Proteins are linear polymers built of monomer units called amino acids, which are linked end to end. Remarkably, proteins spontaneously fo ld up into three-dimensional structures that are determined by the sequence of amino acids in the protein polymer. Protein fu nction is directly dependent on this three -dimensional structure (Figure 2.1). Thus, proteins are the embodiment of the transition from the one -dimensional world of sequences to the three- dimensional world of molecules capable of diverse activities. 2. Proteins contain a wide range offunctional groups. These functional groups include alcohols, thiols, thioethers, carboxylic acids, carboxamides, and a variety of basic groups. Most of these groups are chemically reactive. When

the Beta Sheet, and Turns and Loops 2.4 Tertiary Structure: Water-Sol uble Protein s Fo ld into Compact Structu res

w ith Nonpolar Cores 2.5 Q uaternary Struct ure: Polypeptide Chains Ca n Assem ble into Multis ubunit Stru cture s 2.6 The Amino Aci d Sequence of a Prot ei n Determines Its ThreeDim ensional Structure

25

26 CHAPTER 2 Protein Composition and Structure

~ Figure 2.1 Structure dictates

function . A protein component of the DNA replication machinery surrounds a section of DNA doub le he lix dep icted as a cyl inder. Th e prot ein, which consist s of two identical subunits (shown in red and

yellow), acts as a clamp that allows large segments of DNA t o be copied without the rep licat ion machinery dissociating from the DNA. [Drawn from 2POL.pdb.]

combined in various sequences, this array of functional groups accounts for the broad spectrum of protein function. For in stance, their reactive properties are essential to the function of enzymes, the proteins that catalyze specific chemical reactions in biological systems (see C hapters 8 through 10). 3. Proteins can interact with one another and with other biological macro molecules to form complex assemblies. The proteins within these assemblies can act synergistically to generate capabilities that individual proteins may lack (Figure 2.2). Examples of these assemblies include macromol ecular machines that replicate DNA, transmit signals within cells, and carry out many other essenti al processes. Figure 2.2 A complex protein assembly. An electron micrograph of insect flight tissue in cross section shows a hexagonal array of two kinds of protein filaments. [Courtesy of Dr. Michae l Reedy.]

4. Some proteins are quite rigid , whereas others display a considerable flexibility. Rigid units can function as structural elements in the cytoskeleton (the internal scaffolding within cells) or in connective tissue. Proteins with some flexibi li ty may act as hinges, springs, or levers that are crucial to pro tein function , to the assembly of protein s wi th one another and with other molecules into complex units, and to the transmission of information within and between cells (Figure 2.3).

Iron )

'11,

Figure 2.3 Flexibility and function . O n binding iron, the protein lact oferrin undergoes a substantial change in confo rmation that allows other mo lecu les t o dist inguish

between the Iron-free and the iron-bound forms. [Drawn from ILFM.pdp and ILFG.pdb.]

2.1

27

Proteins Are Built from a Repertoire of 20 Amino Acids

2.1 A Repertoire of 20 Amino Acids

Amino acids are the building blocks of proteins. An a -amino acid consists of a central carbon atom, called the 0' carbon, linked to an amino group, a carboxylic acid group, a hydrogen atom, and a distinctive R group. The l{ group is often referred to as the side chain. With four different groups connected to the tetrahedral a-carbon atom, a-amino acids are chiml: they may exist in one or the other of two mirror-image forms, called the L isomer and the [) isomer (Figure 2.4).

Notation for distinguishing stereoisomers The four different subst ituents of an asymmetric carbon atom are assigned a pr iority according to ato mi c number. The lowestpriority substituent, often hydrogen, is pointed away from the viewer. The configuration about the ca rbon atom is ca lled 5 (from the Lati n sinister, "left") if the progression from the highest to the lowest priority is counterclockwi se. The configuration is call ed

R (from the Latin rectus, "right ") if

the progression is clockwise.

•

L isomer

D Isomer

Figure 2.4 The L and 0 isomers of amino acids. The letter R refers to the side cha in. The L and D isomers are mirro r images of each other.

(3)

Only L amino acids are constituents of proteins. For almost all amino acids, the L isomer has S (rather than R) absolute configuration (Figure 2. 5). Although considerable effort has gone into understanding why amino acids in proteins have the L absolute configuration, no satisfactory explanation has been reached. It seems plausible that the selection ofL over 0 was arbi trary but. once made. the selection was fixed early in evolutionary history. Amino acids in solution at neutral pH exist predominantly as dipolar ions (also called zwitterions). In the dipolar form , the amino group is protonated ( NH3+) and the carboxyl group is deprotonated ( COO- ). The ionization state of an amino acid varies with pH (Figure 2.6). In acid solution (e.g., pH 1), the amino gro up is protonated ( NH1+) and the carboxyl

V}~ H (4)

Figure 2.5 Only L amino acids are found in proteins. A lmost al l L amino acids have an 5 absolute configuration. The counterclock wi se directi on of the arrow from highest- to lowest-priority substituents indicates that th e chiral cent er is of the 5 configurati on .

•

Zwitteri onic form

Both groups deprotonated

c .-o

-1l ~

Both groups protonated

c

c o

Figure 2.6 Ionization state as a function

U

a

2

4

6

8

pH

10

12

14

of pH. The ion ization state of amino acids is al tered by a change in pH. The zwitter ionic form predominates near physiological pH.

28 CHAPTER 2 Protein Composition and Structure

group is not dissociated ( COOH). As the pH is rai sed, the carboxylic acid is the first group to give up a proton, inasmuch as its pK, is near 2. The dipolar form persists until the pH approaches 9, when the protonated am ino group loses a proton. Twenty kinds of side chains varying in size, shape, charge, hydrogenbonding capacity, hydrophobic character, and chemical reactivity are commonly found in proteins. Indeed, all proteins in all species bacterial, archaeal, and eukaryoti c are constructed from the same set of20 amino acids with on ly a few exceptions. This fundamental alphabet for the construction of proteins is several billion years old . The remarkable range of functions med iated by proteins results from the diversity and versatility of these 20 building blocks. Understanding how this alphabet is used to create the intricate three-dimensional structures that enabl e proteins to carry out so many biological processes is an excitin g area of biochemistry and one that we will return to in Section 2.6. Let us look at this set of amino acids. The si mplest one is glycine, which has a single hydrogen atom as its side chain . With two hydrogen atoms bonded to the a-carbon atom, glycine is unique in being achiral. Alanine, the next simplest amino acid, has a methyl group ( CH 3) as its side chain (Figure 2.7).

Glycine (Gly, G)

Figure 2.7 Structures of glycine and alanine. (Top) Ball-and-stick models show the arrangement of ato ms and bonds in space. (Middle) Stereochemi cally realistic formulas show the geometric arrangement of bonds around atoms (see p. 21). (Bottom) Fischer projections show all bonds as being perpendicular for a simpli fied representation (see p. 22).

H

Alanine (Ala, A)

CH,

H Glycine (Gly. G)

Alanine (Ala. A)

Larger hydrocarbon sid e chains are found in valine, leucine, and isoleucine (Figure 2.8). Methionine contains a largely aliphatic side chain that includes a thioether ( S ) group. The side chain of isoleucin e includes an additional ch iral center; on ly the isomer shown in Figure 2.8 is found in proteins. The larger aliphatic side chains are hydrophobic that is, they tend to cluster together rather than contact waler. The three -dimensional structures of water -soluble proteins are stabi li zed by thi s tendency ofhydrophobic groups to come together, which is called the hydrophobic effect (p. 9). The different sizes and shapes of these hydrocarbon side chains enable them to pack together to form com pact structures with little empty space. Proline also has an aliphatic side chain , but it differs from other members of the set of 20 in that its side chain is bonded to both the nitrogen and the a -carbon

Valine

Leucine (Leu, L)

(Val, V)

Isoleucine

Methionine (Met, M)

(lie, I)

CH ,

I

CH, CH, H +H,N

H

C

CH,

C

COO-

C

5

CH,

CH, H

C

CH,

+H,N

C

COO-

H

H

H

H

Valine

leucine (Leu, L)

Isoleucine

Methionine (Met, M)

(Val, V)

(lie, I)

Figure 2.8 Amino acids w ith aliphatic side chains. The additional chiral center of isoleucine is indicated by an asterisk.

atoms (Figure 2.9). Proline markedly influences protein architecture because its ring structure makes it more conformat ionally restri cted than the other amin o acids. Three amino acids with relatively simple aromatic side chains are part of the fundamental repertoire (Figure 2.10). Phenylalanine, as its name indicates, contains a phenyl rin g attached in place of one of the hydrogens of ala nine. The aromatic ring of tyrosine contains a hydroxyl group. This hy droxyl group is reactive, in contrast with the rather inert sid e chains of the other amino acids discussed thus far. Tryptophan has an indole group joined to a methylene ( C H 2 ) group; the indole group comprises two

H, /

H,

_.H-"CH,

",J

H, C" N+/

'--..COO-

H, Proline (Pro, P)

C

H C....... " CH

,\

/ '

W- C

H,

I

(00-

H

Figure 2.9 Cyclic structure of proline. The side chain is joined to both the Of-carbon atom and t he amino group.

29

30

Tyrosine (Tyr. Y)

Phenylalanine (Ph •• F)

CHAPTER 2 Protei n Composition and Structure

Tryptophan (Trp. W)

H v- H

H

H

H

H

H ~ H

H

H

H

C

HC~

'

CH

I

HC ~ / C, C H +H, N

CH 2

I CIH

Phenylalanine (Ph •• F)

C ' OO-

+H, N-

C-

IH Tyrosine (Tyr. Y)

COO-

+H, N-

C

COO-

IH

Tryptophan (Trp. W)

Figure 2.10 Amino acids with aromatic side chains. Ph enylalanine. t y rosi ne. and trypt ophan have hydrophob ic character. Tyros ine and t ryptophan also have hydrophi lic properties because of their - O H and - NH- groups. respectively.

fused rin gs containing an N H group . Phenylalanine is purely hydrophobic, whereas tyrosine and tryptophan are less so because of their hydroxyl and NH groups. F ive amino acids are polar but uncharged. Two amino acids, serine and threonine, contain hydroxyl groups ( OH) attached to an aliphatic side chain (figure 2.1 1). Serine can be thought of as a version of alanine with a hydroxyl group attached, whereas threonine resembles valine with a hydroxyl group in place of one of the valine methyl groups. The hydroxyl groups on serine and threonine mak e them mu ch more hydrophilic (water lov ing) and reactive than alanine and valine. Threonine, like isoleucine, contain s an additional asymmetric center; again only one isomer is present in • protems. In addition, the set includes asparagine and glutamine, un charged deri vatives of th e acidic amino acids aspartate and glutamate (see F igure 2. 15). Each of these two amino acids contain s a terminal carboxamide in place of a carboxylic acid (f igure 2. 12). The side chain of glutamine is one methylene group longer than that of asparagine.

OH H

IC

CH,

H

IC IH

Serine (Ser. 5)

Threonine (Thr. T)

C

Glutamine (Gin. Q)

OH

CH, +H3N

Asparagine (Asn. N)

Threonine (Thr. T)

Serine (Ser. 5)

COO-

+H3 N

I

0 "", / NH,

C

COO-

JH,

N- ,J

+H3

Figure 2.11 Amino acids containing aliphatic hydroxyl groups. Serine and threonine contain hydroxyl groups that render them hydrophilic. The add itional chiral center in threonine is indicated by an asterisk.

Cysteine is structurally similar to serine but contains a sulfhydryl, or thiol (-SH), group in place of the hydroxyl ( OR) group (Figure 2. 13). The sulfhydryl group is much more reactive. Pairs of su lfhydryl groups may come together to form disulfide bonds, which are particularly important in stabi lizi ng some proteins, as will be discussed shortly.

COO-

~

Asparagine (Asn. N)

I

CH,

N- ,J

+H3

COO-

I

H Glutamine (Gin. Q)

Figure 2.12 Structure of asparagine and glutamine. carboxamide-containing polar amino acids.

SH

w\

H./H, 'c

+H3N/

ICH,

0"", / NH, C

'-....COO-

Cysteine (Cy" C)

CH, +H, N-

C

COO-

I

H

Figure 2.13 Structure of cysteine.

We turn now to amino acids with complete charges that render them high ly hydrophilic. Lysine and arginine have relatively long side chain s that terminate with groups that are positively charged at neutral pH. Lysine is capped by a primary amino group and arginine by a guanidinium group . Histidine contains an imidazole group, an aromatic ring that also can be positively charged (Figure 2. 14). With a pKa valu e near 6, the imidazole group can be uncharged or positively charged near neutral pH, depending on its local environment (Figure 2.15). Histidine is often found in the active sites of enzymes, where the imidazole ring can bind and release protons in the course of enzymatic • reactIons .

Guanidinium

Imidazole

31

Lysine (ly•• K)

31 CHAPTER 2 Protein Composition and Stru cture

Arginine

Histidine

(Arg. R)

(Hi •• H)

H

t

HzN",,{ / NH z • • •

•

CH 2

NH H

CH 2

CH z

ICH

CH z

HC/

N"

CH

~

z

II

N-

C" CH z

CH z t H, N

C

IH

Lysine (ly•• K)

COO-

t H, N

C

COO-

H

t H, N

C

COO-

H

Arginine

Histidine

(A.g. R)

(His. H)

Figure 2.14 The basic amino acids lysine. arginine, and histidine.

w ,

I ,

"\

H+

Figure 2.15 Histidine ionization. Histi dine can bind or release proton s near physiological pH.

This set of amino acids also contains two with acidic side chains: aspartic acid and glutamic acid (Figure 2.16). These amino acids are often called aspartate and glutamate to emphasize th at at physiological pH their sid e chains usually lack a proton present in the acid form and hence are negatively charged. N onetheless, in some proteins these side chains do accept protons, and thi s ability is often fun ctionally important. Seven of the 20 amin o acids have readil y ionizable side chain s. Th ese 7 amino acids are able to donate or accept protons to facilitate reactions as well as to form ionic bonds. Table 2.1 gives equilibria and typical pK. values for ionization of the side chains of tyrosine, cysteine, arginine, lysine, histi dine, and aspartic and glutamic acids in proteins. Two other groups in proteins the terminal a -amino group and the terminal a -carboxy l group,can be ionized , and typical pKa values also are included in Table 2.1. Amino acids are often des ignated by either a three-letter abbreviation or a one-letter symbol (Table 2.2). The abbreviations for an1ino acids are the first three letters of their names, except for asparagine (Asn), glutamine (G In), isoleucine (lle), and tryptophan (Trp). The sy mbols for many amino

TABLE 2.1 Typical pK. va lues of ionizable groups in proteins Acid

Group

•

33

•

Base

0

0

Ii / C

II

Telmlnal ('(-carboxyl group

/ C ' O, H

0

II

•

/ C ' O, H

\1

11

/ C

•

\

4.1

6.0

N 'H

'H

B.O

N' H \ H

,H

Cysteine

-

Ni

•

+,H - N, \H H

Terminal a -am ino group

•

- 5

•

0

'\

Lysine

Glutamate

(Asp. 0 )

(Glu. E)

10.9

•

•

Aspartate

8.3

- 5-

,H

/

Tyrosine

•

"" 0

H I N Histidine

3.1

"' 0

0 Aspartic acid Glutamic acid

2.1 A Reperto ire of 20 Ami no Acids

Typ ical pK:

•

10.8

- N\H H

Arginine

•

•

H \ N

H,

II

/

\

N- (

H'

12.5 1

N- H

'pK, values depend on temperature. ionic strength, nnd the microenvironment of the ionizable group.

acids are the first letters of their names (e.g .. G for glycine and L for leucine); the oth er symbols have been agreed on by convention. These abb rev iations and symbols are an integral part of the vocabulary of biochemists.

->
T

blocks of proteins? First, as a set, they are diverse; their structural and chemical properties span a wide range, endowing proteins with the versatility to assume many functional roles. Second, many of these amino acids were probably available from pre biotic reactions, that is, [rom reaction s that occurred before the origin of life. Finally, other possibl e amino acids may TABLE 2.2 Abbreviations for amino acids Am ino acid Al anine Arginine Asparagine AspartiC add Cysteine Glutamine Glutamic acid Glycine Histidine Isoleucine Leucine Lysine

Three-letter abbreviation

Ala Arg Asn Asp Cys Gin Glu Gly His lie Leu Ly,

Three- letter

One-le tte r abbreviation

Amino acid

abbreviation

A R

Methionine Phenylalanine Proline Serine Threon ine

Met Phe Pro Ser Thr Trp Tyr Val Asx

N

D

C

Q E

G H I

L K

Tryptophan Tyrosine Valine Asparagine or aspartic acid Glutamine or glutamiC acid

GI,

One- letter abbreviation

M F P S T W Y

V B

l

0 ..

-

"'-c /

+H3N-

.0

I CH, IC-

~

COO -

Aspartate

Glutamate

(Asp. 0)

(Glu. E)

Figure 2.16 Amino acids with side-chain carboxylates.

34 CHAPTER 2 Protein Composition and Structure

Figure 2.17 Undesirable reactivity in amino acids.

Hz

C

amino acid or another potential leaving gro up.

C

H' \

Some amino acids are unsuitable fo r proteins because of undesi rab le cycl izati on. Homoserine can cyclize to form a stable, fivemembered ring, potentially resulting in peptide-bond cleavage. Cyel ization of seri ne wou ld form a strained, fourmembered ring and thus is di sfavored. X can be an amino group from a neighboring

Hz

'-

_ , ~ . /C,-- /

-

N I

H

0

C II

+ HX

0

Homoserine

Serine

have simply been too reactive. For exampl e, amino acids such as homoserine and homocysteine tend to form five- membered cyclic forms that limit their use in proteins; the alternative amino acids that are found in proteins serine and cysteine do not readily cyclize, because the rings in their cyclic forms are too small (Figure 2.17).

2_2

Primary Structure: Amino Acids Are Linked by Peptide Bonds to Form Polypeptide Chains

Proteins are linear polymers formed by linking the (l( -carboxyl group of one amino acid to the (l( -amino group of another amino acid. This type of linkage is called a peptide bond (or an amide bond). The form ation o f a dipeptide from two amino acids is acco mpanied by the loss of a water mol ecule (Figure 2.18). The equilibrium of this reaction lies on the side of hydrolysis rather than synthesis under most conditions. Hence, the biosynthesis of peptide bonds requires an input of fr ee energy. Nonetheless, peptide bonds are quite stable kinetically because the rate of hydrol ysis is extremely slow; the lifetime of a peptide bond in aqueous solution in the absence of a catalyst approaches 1000 years.

~ ? /t / N, / c".o +H N / " c/ ,. ' c/ 3 II , '"

tt /,

o

+

HzO

H "Rz

Peptide bond Figure 2.18 Peptide-bond fo rmation. The linking o f two amino acids is accompanied by the loss of a mo lecule o f water.

A series of amino acids joined b y peptide bonds form a polypeptide chain, and each amino acid unit in a polypeptide is called a residue. A polypeptide chain has polarity because its ends are different: an (l( -amino group is present at one end and an (l( -carboxyl group at the other. By con vention, the amino end is taken to be the beginning of a polypeptide chain, and so the sequen ce of amino acids in a polypeptide chain is written starting with the amino -terminal residue. Thus, in the pentapeptide T yr-G ly-G lyPhe-L eu (YG GFL ), tyrosine is the amino-terminal (N- termin al) residu e

35

OH

2.2 Primary Structure

CH, / - CH, HC -

Tyr Aminoterminal residue

Gly

Gly

Phe

Leu Carboxylterminal residue

f igure 2.19 Amino acid sequences have direction . Thi s illustratio n of the penta peptide TryGly-Gly-Phe- Leu (YGGFL) shows the sequence from the amino terminus to the carboxyl term inus. This pentapeptide. Leu -enkephalin. is an opioid peptide that modulates th e perception of pain. The reverse penta peptide. Leu- Ph e-Gly-G ly-Tyr (LFGGY). is a different molecule and shows no such effects.

and leucine is the carboxyl-terminal (C -terminal) residue (Figure 2. 19). Leu-Phe-Gly-Gly -Tyr (LFGGY) is a different pentapeptide. with different chemical properties. A polypeptide chain consists of a regularly repeating part, called the main chain or backbone, and a variable part, comprising the distinctive side chains (Figure 2.20). The polypeptide backbone is rich in hydrogen bonding potential. Each residue contains a carbonyl group (C 0 ), which isa good hydrogen-bond acceptor and, with the exception of proline, an NH group, which is a good hydrogen-bond donor. These groups interact with each other and with functional groups from side chains to stabilize particu1ar structures, as will be discussed in Section 2.3. Most natural polypeptide chains co ntain between 50 and 2000 amino acid residues and are commonly referred to as proteins. The largest protein known is the muscle protein titin , which consists of more than 27,000 amino acids. Peptides made of small numbers of amino acids are called oligopeptides or simply peptides. The mean mol ecular weight of an amino acid residue is about 110 gm mol - I, and so the molecular weights of most protei ns are between 5500 and 220,000 gm mol-I . We can also refer to the mass of a protein, which is expressed in units of daltons; one dalton is equal to one atomic mass unit. A protein with a molecular weight of 50,000 gm mol -I has a mass of 50,000 daltons, or 50 kd (kilodaltons). In some proteins, the linear polypeptide chain is cross-linked. The most common cross-links are disulfide bonds, formed by the oxidation of a pair of

figure 2.20 Components of a polypeptide chain. A polypeptide chain consists o f a constant backbone (shown in black) and var iable side chai ns (shown in green).

Dalton

A unit of

mass very nearly

equal to that of a

hydrogen atom. Named after John Dalton (1766-1844). who developed the atomic theory of matter.

Kilodalton (kdJ A uni t of mass equal to 1000 daltons.

36 CHAPTER 2 Protein Composition and Structure

Cysteine

,

Oxidation

•

Reduction

Cysteine

Cystine

Figure 2.21 Cross-links. The formation of a disulfide bond from two cysteine residues is an oxidation reactio n.

cysteine residues (Figure 2.2 1). T he resulting unit of two linked cysteines is called cystine . Extracellul ar proteins often have several disulfide bonds, whereas intracellular proteins usually lack them . Rarely, non disulfide crosslinks derived from other side chains are present in proteins. For example, collagen fibers in conn ective tissue are strengthened in this way, as are fibrin blood clots. Proteins Have Unique Amino Acid Sequences Specified by Genes

Tn 19 53, Frederick Sanger determined the amino acid sequence of insulin , a protein hormone (Figure 2.22). This work is a landmark in biochemistry be cause it showed for the first time that a protein has a precisely defined amino acid sequence consisting onl y of L amino acids linked by peptide bonds. This accomplishment stimulated other scientists to carry out sequence studies of a wide variety of proteins. C urrentl y, the complete amino acid sequences of more than 2,000,000 proteins are known . The striking fact is that each protein has a unique, precisely defined amino acid sequence. Th e amino acid sequence of a protein is referred to as its primary structure . 5

A chain

5

I

I

Gly-lie-Val-Glu-Gln-Cys-Cys-Ala -Ser-Val-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn

I

5

10

I

15

5

I

5

B chain

5

/

21

5

I

I

Phe-Va I-Asn-Gin -Hi s-Le u-Cys- GIy-Ser-Hi5- Leu -Va 1-GIu-AI a-Leu -Tyr-Leu-Va 1-Cys- GIy-GIu-Arg- GIy-P he-Phe-Tyr -Th r-Pro-Lys-Ala 5

10

15

20

25

30

Figure 2.22 Amino add sequence of bovine insulin.

A series of incisive studies in the late 1950s and early 1960s revealed that the amino acid sequences of proteins are determin ed by the nucleotide seq uences of gel1es. The sequence of nucleotides in DNA specifies a comple mentary sequence of nucleotides in RNA , which in turn specifies the amino acid sequence of a protein. In particul ar, each of the 20 amino acids of the repertoire is encoded by one or more specific sequences of three nucleotides (Section 5.5).

Knowing amino acid seq uences is important fo r several reasons . First, knowledge of the sequence of a protein is usually essential to elucidating its mechanism of action (e.g., the catalytic mechanism of an enzyme). In fact, proteins with novel properties can be generated by varying the sequence of known proteins. Second, amino acid seq uences determine th e threedimensional structures of proteins. Amino acid sequence is the link between the genetic message in DNA and the three -dimensional structure that per[arms a protein's biological functio n . Analyses of relations between amino acid sequences and three-dimensional structures of proteins are uncovering the rules that govern the folding of polypeptide chains . Thi rd , sequence de termination is a component of molecul ar pathology, a rapidly growing area of medicine. A lterations in amino acid seq uence can produce abnormal func tion and disease. Severe and sometimes fatal diseases, such as sickle-cell anemia (195) and cystic fibrosis, can result from a change in a single amino acid within a protein . Fourth , the sequence of a protein reveals much about its evolutionary hi story (Chapter 6). Proteins resemble one another in ami no acid sequence only if they have a common ancestor. Consequently, molecular events in evolu tion can be traced from amino acid sequences; molecular paleontology is a flouris hing area of research.

Polypeptide Chains Are Flexible Yet Conformationally Restricted Examin ation of the geometry of the protein backbone reveals several important features. First, the peptide bond is essentia lly planar (Figure 2.23). Thus, for a pair of amino acids lin ked by a peptide bond , six atoms lie in the same plane: the a-carbon atom and CO group of the first am ino acid and the NH group and a -carbon atom of the second amino acid. The nature of the chemical bonding within a peptide accounts for the bond's planarity. The peptide bond has con siderable double-bond character, which prevents rotation about this bond and thus constrains the conformati on of the peptide backbone.

Peptide-bond resonance structures

The double -bond ch aracter is also expressed in the length of the bond between the CO and the NH groups. The C N distance in a peptide bond is typical ly 1.32 A, which is between the values expected for a C N single

H

Figure 2.23 Peptide bonds are planar. In a pair of linked amino acids, six atoms (Co' C, 0, N, H, and CJ lie in a plane. Side chains are shown as green ball s.

-

37 2.2 Primary Structure

38

H

CHAPTER 2 Protein Composition and Structure

1.0 A

1.24

A

Figure 2.24 Typical bond lengths within a peptide un it . The peptide unit is shown in the trans configuration.

bond (1.49 A) and a C N double bond (1.27 A), as shown in Figure 2.2 4. Finally, the peptide bond is uncharged, allowing polymers of amino acids linked by peptide bonds to form tightly packed globular structures. Two configurations are possible for a planar peptide bond . In the trans configuration, the two ex -carbon atoms are on opposite sides of the peptide bond . In the cis confi guration, these groups are on the same side of the peptide bond. Almost all peptide bonds in proteins are trans. This preference for trans over cis can be explained by the fact that steric clashes between groups attached to the ex -carbon atoms hinder formation of the cis form but do not occur in the trans confi guration (Figure 2.25). By far the most common cis peptide bonds are X Pro linkages. Such bonds show less preference for the trans configuration because the nitrogen of proline is bonded to two tetra hedral carbon atoms, limiting the steric differences between the trans and cis forms (Figure 2.26).

Cis

Trans

Figure 2.25 Trans and cis peptide bonds. The tran s form is st rongly favored because of steric clashes that occur in the cis fo rm .

Trans

Cis

Figure 2.26 Trans and cis X-Pro bonds. The energies of these fo rms are simi lar to one another because steric clashes occur in both forms.

(6)

IA)

39

(C)

2.2 Primary Structure

Figure 2.27 Rotation about bonds in a polypeptide. The structure of each amino acid in a polypeptide can be adjusted by rotation about t wo single bonds. (A) phi Iq,) is the angle of rotation about the bond between the nitrogen and the a -carbon atoms, whereas psi ('/1) is the angle of rotation about the bond between the a -carbon and the carbonyl ca rbon atoms. IB) A view down the bond between the nitrogen and the a-carbon atoms, showing how q, is measured. (e) A view down the bond between the a-carbon and t he carbonyl ca rbon atoms, showing how II' is measured.

In contrast with the peptide bond, the bonds between the amino group and the ex -carbon atom and between the ex -carbon atom and th e carbonyl group are pure single bonds. The two adjacent rigid peptide units may ro tate about these bonds, takin g on various orientations. This freedom of rotation about two bonds of each amino acid allows proteins to fold in many different ways. The rotations about these bonds can be specified by torsion angles (Figure 2.27) . The angle of rotation about the bond between the nitrogen and the ex-carbon atoms is call ed phi (4) ). The angle of rotation about the bond between the a.-carbon and the carbonyl carbon atoms is called psi (1/1). A clockwise rotation about either bond as viewed from the nitrogen atom toward the a. -carbon atom or from the carbonyl group toward the ex-carbon atom corresponds to a positive value. The 4> and 1/1 angles determine the path of the polypeptide chain. Are all combinations of

Biochemistry (Sixth edition)

Read more

Clinical Biochemistry of Domestic Animals, Sixth Edition

Read more

Gems, Sixth Edition

Read more

World Civilizations. Sixth Edition

Read more

Engineering Mathematics, Sixth Edition

Read more

Interacciones , Sixth Edition

Read more

Gems, Sixth Edition

Read more

College Trigonometry, Sixth Edition

Read more

Criminal Law , Sixth Edition

Read more

Building Surveys, Sixth Edition

Read more

Building Surveys, Sixth Edition

Read more

Criminal Law , Sixth Edition

Read more

Organic Chemistry, Sixth Edition

Read more

Engineering Mathematics, Sixth Edition

Read more

Introductory Chemistry , Sixth Edition

Read more

Postgraduate Haematology, Sixth Edition

Read more

Ship Construction Sixth Edition

Read more

Engineering Mathematics, Sixth Edition

Read more

Plant Biochemistry, Fourth Edition

Read more

Biochemistry (Seventh Edition)

Read more

Biochemistry (Fourth Edition, 2008)

Read more

Microbial Biochemistry, 2nd Edition

Read more

Biochemistry (Fourth Edition)

Read more

Biochemistry, 6th Edition

Read more

Biochemistry (Seventh Edition)

Read more

Plant Biochemistry, 4th Edition

Read more

Plant Biochemistry, Third Edition

Read more

Learning Perl, Sixth Edition

Read more

Advanced Photography, Sixth Edition

Read more

MATLAB Primer, Sixth Edition

Read more

Recommend Documents

Biochemistry (Sixth edition)

LIST OF ABBREVIATIONS A ACAT Adenine Acyl-CoA cholesterol acyl transferase ACP Acyl carrier protein ADP Adenosine ...

Clinical Biochemistry of Domestic Animals, Sixth Edition

Preface It has been more than a decade since the publication of the fifth edition, and understandably numerous changes ...

Gems, Sixth Edition

Gems This Page is Intentionally Left Blank Gems Their Sources, Descriptions and Identification Sixth Edition Edite...

World Civilizations. Sixth Edition

This is an electronic version of the print textbook. Due to electronic rights restrictions, some third party content ma...

Engineering Mathematics, Sixth Edition

Engineering Mathematics In memory of Elizabeth Engineering Mathematics Sixth edition John Bird, BSc (Hons), CEng, C...

Interacciones , Sixth Edition

80° 70° 60° 50° 40° BELICE MAR CARIBE HONDURAS OCÉANO ATLÁNTICO EL SALVADOR Magdalena Río NICARAGUA Lago de ...

Gems, Sixth Edition

Gems This Page is Intentionally Left Blank Gems Their Sources, Descriptions and Identification Sixth Edition Edite...

College Trigonometry, Sixth Edition

College Trigonometry This page intentionally left blank College Trigonometry SIXTH EDITION Richard N. Aufmann Vern...

Criminal Law , Sixth Edition

Criminal Law SI XTH E DI TI ON JOHN M. SCHEB, J.D., LL.M. Judge, Florida Court of Appeal, Second District (Ret.) Dist...

Building Surveys, Sixth Edition

Building Surveys This page intentionally left blank Building Surveys Sixth edition Peter Glover AMSTERDAM • BOSTO...