PASCAL user manual and report

Lecture Notes in Computer Science Edited by G. Goos and J. Hartmanis 18 Kathleen Jensen Niklaus Wirth PASCAL User Ma...

Author: K. Jensen | N. Wirth

665 downloads 2532 Views 6MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

Lecture Notes in Computer Science Edited by G. Goos and J. Hartmanis

18

Kathleen Jensen Niklaus Wirth

PASCAL User Manual and Report

Editorial Board: P. Brinch Hansen - D. Gries C. Moler • G. SeegmQIler • N. Wirth Ms.,Kathleen Jensen Prof. Dr. Niklaus Wirth Institut fLir Informatik ETH Zerich Clausiusstra6e 55 C H - 8 0 0 6 Z0rich

Library of Congress Cataloging in Publication Data

Jensen, Kathleen, 1949PASCAL: user manual and report. (Lecture notes in computer science, v. 18) Bibliography: p. 1. PASCAL (Computer program language). I. Wirth, Niklaus, joint author. II. Title. IIIo Series. QA76.73 .P35J46 001.6'424 74-16327

AMS Subject Classifications (1970): 68-02, 68 A 0 5 CR Subject Classifications (1974): 4.2, 4.22

ISBN 3-540-06950-X Springer-Verlag Berlin • Heidelberg • New York ISBN 0-387-06950-X Springer-Verlag New York • Heidelberg • Berlin This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically those of translation, reprinting, re-use of illustrations, broadcasting, reproduction by photocopying machine or similar means, and storage in data banks. Under § 54 of the German Copyright Law where copies are made for other than private use, a fee is payable to the publisher, the amount of the fee to be determined by agreement with the publisher. © by Springer-Verlag Berlin • Heidelberg 1974. Printed in Germany.

PREFACE

This manual is directed to those who h a v e p r e v i o u s l y acquired some programming s k i l l . The i n t e n t i o n is to p r o v i d e a m e a n s of learning Pascal without outside guidance. It is based on The ~rparammino Lanauaae Eascal (~evised ~eaort) Ill--the basic definition of Pascal and concise reference manual for t h e experienced Pascal programmer.

The linear structure of a book is by no means ideal for introducing a language, whether it be a formal or natural one. Nevertheless, it is recommended to follow the given organization, paying particular attention to the example programs, and then t o reread those sections causing dlfficultes. O n e m a y w i s h , h o w e v e r , t o r e f e r e n c e c h a p t e r 12 if t r o u b l e s a r i s e concerning the input and output conventions of the programs. The manual was prepared as e file on a c o m p u t e r , that is . as a sequence of characters of a single type font. This is very convenient for the p u r p o s e s of u p d a t i n g : u n f o r t u n a t e l Y ~ it is sometimes a bit awkward to read. T h e r e a d e r is a s k e d to be indulgent with the absence of s u b - and s u p e r s c r i p t s (e.g. m r a i s e d t o t h e p o w e r n is d e n o t e d by m * * n ) . Chapters 0--12 define the language Pascal and serve as a standard for both the implementor and the programmer. The implementer must regard the task of recognizing Standard Pascal as the minmum requirement of his system, while the programmer who intends his programs to be transferable from one installation to another s h o u l d u s e o n l y f e a t u r e s d e s c r i b e d as Standard P a s c a l . On the o t h e r h a n d , any i m p l e m e n t a t i o n may (and u s u a l l y d o e s ) go b e y o n d t h e m i n i m u m . C h a p t e r s 13 a n d 14 d o c u m e n t t h e i m p l e m e n t a t i o n of P a s c a l on t h e CDC 6 0 0 0 m a c h i n e . C h a p t e r 13 describes the a d d i t i o n a l f e a t u r e s of t h e l a n g u a g e P A S C A L 6 0 0 0 , w h e r e a s c h a p t e r 14 is d e v o t e d to t h e u s e of t h e c o m p i l e r and t h e system under the operating system SCOPE.

The efforts o f m a n y go i n t o this manual, e n d we e s p e c i a l l y thank the members of the Institut fuer Informatik, ETH Z u r i c h . end John Larmouth, Rudy Schild, Olivier Lecerme, and Pierre Desjardins for their criticism, suggestions, and encouragement. Our implementation of Pascsl--which made this manual both possible and necessary--is the work of Urs A m m a n n , a i d e d by Helmut Sandmayr.

June

1974

Kathleen densen Niklaus Wirth ETH Zurich Switzerland

Table

of Contents

U S E R M A N U A L by K. Jensen and N. Wirth O.

Introduction

I. N o t a t i o n

and

. . . . . . . . . . . . . . . . . . . . . . . .

3

Vocabulary . . . . . . . . . . . . . . . . . . .

9

2. T h e A. B. C. D.

Concept The type The type The type T h e type

of D a t a Boolean integer real . char .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . . . . . . . ..... . . . . . . . . . .

3. T h e A. B. C. D. E. F.

Program Heading and the Declaration Part . . . . . . . . The program heading . . . . . . . . . . . . . . . . . . . The label declaration part . . . . . . . . . . . . . . . . The constant definition part . . . . . . . . . . . . . . . The type definition part . . . . . . . . . . . . . . . . . The variable declaration part . . . . . . . . . . . . . . The procedure and function declaration part . . . . . . .

12 12 13 14 14 16 16 16 16 18 18 19

4. T h e C o n c e p t of A c t i o n . . . . . . . . . . . . . . . . . . . . . A. T h e a s s i g n m e n t s t a t e m e n t . . . . . . . . . . . . . . . . . B. T h e c o m p o u n d s t a t e m e n t . . . . . . . . . . . . . . . . . . . C. R e p e t i t i v e statements . . . . . . . . . . . . . . . . . . C.I T h e w h i l e s t a t e m e n t . . . . . . . . . . . . . . . . . C.2 The repeat statement . . . . . . . . . . . . . . . . . C.3 The for statement . . . . . . . . . . . . . . . . . . D. C o n d i t i o n a l statements . . . . . . . . . . . . . . . . . . D.I T h e if s t a t e m e n t . . . . . . . . . . . . . . . . . . . D.2 The case statement . . . . . . . . . . . . . . . . . . E. T h e g o t o s t a t e m e n t . . . . . . . . . . . . . . . . . . . .

2O 20 21 22 22 23 23 26 26 31 31

5. S c a l a r a n d S u b r a n g e T y p e s . . . . . . . . . . . . . . . . . . A. S c a l a r t y p e s . . . . . . . . . . . . . . . . . . . . . . . B. S u b r a n g e t y p e s . . . . . . . . . . . . . . . . . . . . . .

34 34 35

6.

Structured

7. R e c o r d A. T h e 8.

Set

Types

in G e n e r a l

-- T h e

Array

in Particular

.

Types . . . . . . . . . . . . . . . . . . . . . . . . with statement . . . . . . . . . . . . . . . . . . . .

Types . . . . . . . . . . . . . . . . . . . . . . . . . .

9. ~ i l e T y p e s . . . . . . . . . . . . . . . . . . . . . . . . . A. T e x t f i l e s . . . . . . . . . . . . . . . . . . . . . . . . B. T h e s t a n d a r d f i l e s " i n p u t " a n d " o u t p u t " . . . . . . . . . 10.

Pointer

11.

Procedures and A. P r o c e d u r e s . B. F u n c t i o n s . C. R e m a r k s . .

Types . . . . . . . . . . . . . . . . . . . . . . . .

12.

Input and Output A. T h e p r o c e d u r e B. T h e p r o c e d u r e

13.

PASCAL 6000-3.4 . . . . . . . . . . . . . . . . . . . . . . . A. E x t e n s i o n s to t h e l a n g u a g e P a s c a l . . . . . . . . . . . . A.I S e g m e n t e d f i l e s . . . . . . . . . . . . . . . . . . .

36 42 47 5O 55 57 59 62

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

67 67 78 82

. . . . . . . . . . . . . . . . . . . . . . read . . . . . . . . . . . . . . . . . . . . write . . . . . . . . . . . . . . . . . . .

84 84 86

Functions . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . .

. . . .

. . . .

. . . .

88 88 88

VI

A.2 External procedures . . . . . . . . . . . . . . . . . Specifications left undefined in the preceding chapters B.I T h e p r o g r a m h e a d i n g a n d e x t e r n a l f i l e s . . . . . . . . B.2 Representation of files . . . . . . . . . . . . . . . B.3 The standard types . . . . . . . . . . . . . . . . . . B.4 The standard procedure write . . . . . . . . . . . . . C. R e s t r i c t i o n s . . . . . . . . . . . . . . . . . . . . . . . D. A d d i t i o n a l p r e d e f i n e d t y p e s , p r o c e d u r e s , a n d f u n c t i o n s . D.I A d d i t i o n a l p r e d e f i n e d t y p e s . . . . . . . . . . . . . D.2 Additional predefined procedures and functions.. B.

14.

How to Use the PASCAL 6000-3.4 System . . . . . A. C o n t r o l s t a t e m e n t s . . . . . . . . . . . . . . B. C o m p i l e r o p t i o n s . . . . . . . . . . . . . . . C. E r r o r m e s s a g e s . . . . . . . . . . . . . . . . C.I C o m p i l e r . . . . . . . . . . . . . . . . . C.2 Run-time . . . . . . . . . . . . . . . . . References Appendix Appendix Appendix Appendix Appendix Appendix

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . . . . . . . . . . . . . . . . . Standard Procedures S u m m a r y of O p e r a t o r s Tables . . . . . . . . Syntax . . . . . . . . Error Number Summary Programming Examples

A B C D E F

and . . . . . . . . . .

Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . .

. . . . . .

. . . . . .

.

. . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . .

9O 91 91 92 93 96 97 97 97 98 I00 100 100 I O2 102 I02 I04 I05 I O8 I09 110 119 122 126

Index . . . . . . . . . . . . . . . . . . . . . . . . . . . .

by N. Wirth Preface

to

the

1. I n t r o d u c t i o n 2.

Summary

3. N o t a t i o n ,

Revised

Report

of the

language . . . . . . . . . . . . . . . . . . .

terminology,

4.

Identifiers,

5.

Constant

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . .

Numbers

definitions

and and

vocabulary . . . . . . . . . . . .

Strings

. . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

133 136 137 140 140 141

. . . .

. . . .

. . . .

. . . .

. . . .

142 142 143 145

7. D e c l a r a t i o n s a n d d e n o t a t i o n s of v a r i a b l e s . . . . . 7.1. E n t i r e v a r i a b l e s . . . . . . . . . . . . . . . . 7.2. C o m p o n e n t v a r i a b l e s . . . . . . . . . . . . . 7.3. R e f e r e n c e d v a r i a b l e s . . . . . . . . . . . . . .

. . . .

. . . .

. . . .

. . . . .

146 147 147 148

6. D a t a 6.1. 6.2. 6.3.

type definitions . Simple types . . . Structured types . Pointer types . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

8.

Expressions . . . . . . . . . . . . . . . . . . . . . . . . . 8.1. O p e r a t o r s . . . . . . . . . . . . . . . . . . . . . . . 8.2. F u n c t i o n d e s i g n a t o r s . . . . . . . . . . . . . . . . . .

148 149 151

9.

Statements . . . . . . . . . . . . . . . . . . . . . . . . . 9.1. S i m p l e s t a t e m e n t s . . . . . . . . . . . . . . . . . . . 9.2. S t r u c t u r e d s t a t e m e n t s . . . . . . . . . . . . . . . . .

151 152 153

10.

Procedure declarations . . . . . . . . . . . . . . . . . . . . 10.1.Standard procedures . . . . . . . . . . . . . . . . . .

158 160

11.

Function declarations . . . . . . . . . . . . . . . . . . . . 11.1.Standard functions . . . . . . . . . . . . . . . . . . .

163

162

VII

12. Input and Output 13. P r o g r a m s

. . . . . . . . . . . . . . . . . . . . .

164 167

. . . . . . . . . . . . . . . . . . . . . . . . .

14. A s t a n d a r d for i m p l e m e n t a t i o n and p r o g r a m

interchange.

15. I n d e x . . . . . . . . . . . . . . . . . . . . . . . . . . .

.

168 169

0 ~NTRODUCTION

Much

of

grasp of e

the

following

text

of computer terminology program. The purpose

assumes

the

reader

and a "feeling" of this section

for is

has

a minimal

the structure to spark that

intuition.

{ program assuming find the sterling, 1. 2 . . . . ~rocram

0.1 annual inflation rates o f 9 , 8 , a n d 10 p e r c e n t . factor by which the frank, dollar, pound m a r k , or g u i l d e r will have been devalued in n years.} inflation(output);

#onst n = 10 ; var i : integer; wl.w2~oW3 becin i := O; w l := 1.0; r_@~.~at i := i + I ; w I

:=

w I

*

: real; w 2 := 1.0;

w3

:=

1.0;

1.07;

w2 w3

:= w2 * 1.08; := W3 * 1 . 1 0 ; w r i t e l n (i .w l , w 2 . w 3 ) i=n end.

1 2 3 4 5 6

1.090000000000e+00 1.144900000000e+00 1.225043000000e+00 1.310796010000e+00 1.402551730700e+00 1.500730351849e+00

1.080000000000e+00 1.166400000000e+00 1.2597t2000000e+00 1.360488960000e+00 1.469328076UOOe+00 1.586874322944e+00

1.100000000000e+00 1.210000000000e+00 1.331000000000e+00 1.464100000000e+00 1.610510000000e+00 1.771561000000e+00

7

1.bObTU1476478e+O0

1.713824268779e+00

1.948717100000e+00

9 10

1.718186179832e+00 1.838459212420e+00 1.96915t359290e+00

1.850930210282e+00 1.999004627104e+00 2.158924997293e+00

2.14358~810000e+00 2.357947691000e+00 2.593742460t00e+00

An &3ooritbj~ or computer program consists of t w o e s s e n t i a l parts, a description of ~ctions which ere to be performed, and a description o f t h e ~@_~&o w h i c h is m a n i p u l a t e d by t h e s e a c t i o n s .

Actions described The

are described by so-called

program

is

by so-called ~tatements, ~cl~ra~L~zns end ~L~finitions.

divided

into

a

~eadin~

and

end

a body,

data

is

called

b3ock. The heading gives the program s name a n d l i s t s parameters. (These ere (file) variables and represent arguments and results of the computation. See chapter 13.) file "output" is e compulsory parameter. The b l o c k consists six sections, where any except the lest may be e m p t y . I n required order they ere:

a

its the The of the

<procedure and f u n c t i o n d e c l a r a t i o n p a r t > <statement p a r t >

The first section l i s t s a l l l a b e l s d e f i n e d in t h i s b l o c k . The second section defines synonyms for constants: i.e. it introduces i d e n t i f i e r s t h a t m a y l a t e r be u s e d in p l a c e of t h o s e constants. The third contains type definitions: end the fourth. variable definitions. The fifth section defines subordinate program parts (i.e. procedures and functions). The s t a t e m e n t part specifies t h e a c t i o n s to be t a k e n . The above program outline is more precisely e x p r e s s e d in a syntax ~ a p r a m . Starting a t t h e diagram named p r o g r a m , a p a t h through the diagram defines a syntactically correct proqram. Each box r e f e r e n c e s a d i a g r a m by t h a t n a m e . w h i c h is t h e n u s e d to define its meaning. Terminal symbols (those actually written in a P a s c a l p r o g r a m ) a r e in r o u n d e d e n c l o s u r e s . (See a p p e n d i x D for t h e f u l l s y n t a x d i a g r a m of P a s c a l . )

program

block ~1 .... i~.di,l,~o~ I-

9

j ~ I ~~G-o~ ~-~PROCEDUII~~--I

Figure O.a

p..... ter list I

Syntax diagrams defining the general structure of a program

"i

An alternative formulation of a syntax is the traditional Backus-Naur ~orm. where syntactic constructs a r e d e n o t e d by English words enclosed between the angular brackets < a n d >. These words are suggestive of the n a t u r e or m e a n i n g of t h e construct. A sequence of constructs (I or more elements) enclosed by the mete-brackets { and } imply their repetition zero or m o r e t i m e s . (For t h e B N F of P a s c a l . see appendix D . ) As an example, the construct <program> of f i g u r e O.a is d e f i n e d by the following formulas, called "productions": <program> ::= . <program heading> ::~ ~ r o m r a ~ ( { . } ) ; ::= 

Each procedure (function) has a structure similar to a program; i.e. each consists of a heading and a block. Hence. procedures may be declared (nested) within other procedures. Labels, constant synonyms, type, variable, and procedure declarations are local to the procedure in w h i c h t h e y a r e d e c l a r e d , That i s , their identifiers have significance only within the program text which constitutes the procedure declaration a n d w h i c h is c a l l e d the ~eooe of these identifiers. Since procedures m a y be n e s t e d , so may scopes. Objects which are declared in t h e m a i n p r o g r a m , i.e. not local to some procedrue, are called ~Iobal and have significance throughout the entire program. Since blocks m a y be n e s t e d w i t h i n o t h e r b l o c k s by p r o c e d u r e and function declarations, o n e is a b l e to a s s i g n a l e v e l of n e s t i n g to each. If the outermost program-defined block (e.g. the main program) is called level O, t h e n a b l o c k d e f i n e d within this block w o u l d be of l e v e l I; in g e n e r a l , a block defined in l e v e l i would be of level (i+I). Figure O.b i l l u s t r a t e s a block structure,

where

QF--q

I

Figure

O.b

Block

structure

level level level level

0 I 2 3

: = : :

M P, A, B

Q R,

S

In t e r m s of t h i s f o r m u l a t i o n , t h e s c o p e or r a n g e of v a l i d i t y of an identifier x is the entire b l o c k in w h i c h x is d e f i n e d , i n c l u d i n g t h o s e b l o c k s d e f i n e d in t h e s a m e b l o c k as x. (For t h i s example, note that all identifiers m u s t be d i s t i n c t . S e c t i o n 3.e discusses the case where identifiers are not necessarily distinct,) objects

defined

in b l o c k

are

accessible

in

blocks

M

M ,P ,A .B ,Q .R ,S

P A B

P ,A ,B A.B B

Q

Q ,R ,S

R S

R S

For programmers acquainted with ALGOL, PL/I, or FORTRAN, it may prove helpful to glance at Pascal in t e r m s of t h e s e o t h e r languages. For this purpose, we list the following characteristics of P a s c a l : 1. 2.

3. 4.

5. 6.

7.

8. 9.

10.

Declaration of variables is mandatory, Certain key words (e.g. ~eain, ~nd, ~eoeat) ere "reserved" and cannot be used as identifiers. In this manual they are underlined. The semicolon (;) is c o n s i d e r e d as a s t a t e m e n t s e p a r a t o r , not a s t a t e m e n t t e r m i n a t o r (as e . g . in PL /I). The standard data types are those of whole and real numbers, the logical values, and the (printable) characters. The b a s i c d a t a s t r u c t u r i n g facilities include the array, the record (corresponding to COBOL's and PL / I ' s "structure"). the set, and the (sequential) file. These structures can be combined end nested to form arrays of sets, files of records, etc. D a t a may b e allocated dynamically and accessed via pointers. These pointers allow the full generality of l l s t p r o c e s s i n g . T h e m e is a f a c i l i t y to d e c l a r e n e w , b a s i c d a t a t y p e s w i t h symbolic constants. The let data s t r u c t u r e offers facilities s i m i l a r to t h e P L / I "bit s t r i n g " . Arrays may be of arbitrary dimension with arbitrary bounds; the array bounds are constant. (i.e. There are no dynamic arrays.) As in FORTRAN. ALGOL, and PL/I, there is a go to statement. Labels are unsigned integers and m u s t be declared. The compound statement is that of ALGOL, and corresponds t o t h e DO g r o u p i n P L / I . The facilities o f t h e ALGOL s w i t c h and the computed go to o f FORTRAN a r e r e p r e s e n t e d by the case statement.

The

for

statement,

corresponding

F O R T R A N . may o n l y have steps is executed only as long variable lles within the controlled s t a t e m e n t m a y not

to

the

DO

loop of

of 1 (~g) or -1 (~ownto) and as t h e v a l u e o f t h e c o n t r o l limits. Consequently. the be e x e c u t e d at a l l .

11. 12. 13. t4. 15.

16.

There are no conditional expressions a n d no m u l t i p l e assignments. Procedures e n d f u n c t i o n s m a y be c a l l e d r e c u r s i v e l y . T h e r e is no " o w n " a t t r i b u t e for v a r i a b l e s (as in A L G O L ) . Parameters are called either by value or by reference: there i s no c a l l by name. The "block structure" differs from that o f ALGOL a n d P L / I insofar as there are no anonymous blocks, i.e. each block is given a name, and thereby i s made i n t o a procedure. All objects--constants, variables, etc.--must be declared ~e£o~_~ they are referenced. The following two exceptions ere however allowed: I) the type identifier in a pointer type definition (chaptem 10) 2) procedure and function calls when there is a forward reference (chapter 11).

Upon f i r s t c o n t a c t w i t h P a s c a l , m a n y t e n d to b e m o a n the a b s e n c e of certain "favorite features", Examples include an exponentiation operator, concatenation of strings, dynamic arrays, arithmetic operations on B o o l e a n v a l u e s , a u t o m a t i c t y p e conversions, and default declarations. These were not oversights, but deliberate omissions. In some cases their presence would be primarily an invitation to inefficient programming solutions; in o t h e r s , it w a s felt t h a t t h e y w o u l d be Contrary to the aim of clarity and reliability a n d "good programming style". Finally, a rigorous selection among the immense variety of programming facilities available had to be made in order to keep the compiler relatively compact and efficient--efficient and economical for both the user who writes o n l y s m a l l p r o g r a m s u s i n g f e w c o n s t r u c t s of t h e l a n g u a g e a n d t h e u s e r w h o w r i t e s l a r g e p r o g r a m s a n d t e n d s to m a k e u s e of the full language.

I

__NTAT~.~.~.!~T.J~A N D

The basic wocabularv consists letters, digits, and special operators and delimiters:

VOCABULARY

of b a s i c symbols.

symbols classified into The ~ . ~ ~ymbols are

+ -

: "

( )

and Drrav

and _f..il~

ail aot

/

<>

]

case

function

~

• ,

<: >: >

} % ..

~iv ~a downto 91se

iZ in /~l mod

£rocedure ~r_Qar.am ~ Keoea~

~hen

~ar ahila ~ith

~9~d-delim~ (or reserved words) ere normally underlined in the hand-wrltten program to e m p h a s i z e t h e i r i n t e r p r e t a t i o n as single symbols with fixed meaning. The orogrammer m a y not u s e these words in a context other than that e x p l i c i t in t h e d e f i n i t i o n of P a s c a l : in p a r t i c u l a r , t h e s e w o r d s m a y not be u s e d as identifiers. They are written as a s e q u e n c e of l e t t e r s (without surrounding escape characters). The

construct: {
sequence

of s y m b o l s

not

containing

"} ">}

m a y be i n s e r t e d b e t w e e n a n y t w o i d e n t i f i e r s , n u m b e r s , or s p e c i a l symbols. It is called a ~ o m m e n £ a n d m a y be r e m o v e d from t h e program text without altering its m e a n i n g . T h e s y m b o l s { a n d } do not occur o t h e r w i s e in t h e l a n g u a g e , a n d w h e n a p p e a r i n g in syntactic descriptions, they denote meta-symbols l i k e I and ::=. (On systems where the curly brackets are unavailable, the c h a r a c t e r p a i r s (* and *) a r e u s e d in t h e i r p l a c e . ) ~de~if~ers are names denoting constants, types, variables. Procedures. and functions. They must begin with a letter, which may be followed by any combination and number of letters and d i g i t s . A l t h o u g h an i d e n t i f i e r m a y be v e r y l o n g . i m p l e m e n t a t i o n s may impose a limit as to how m a n y of t h e s e c h a r a c t e r s a r e significant. Implementations of Standard Pascal will always recognise the first ~ characters of an identifier as significant. That is. identifiers denoting distinct objects s h o u l d d i f f e r in t h e i r f i r s t U c h a r a c t e r s .

10

~

Figure

l

1.a

e

t

t

e

r

~

Identifier

examples of legal identifiers: sum root3 pi h4g x thisisaverylongbutneverthelesslegalidentifier thisissverylongbutprobablythesameidentifierasabove illegal 3rd

identifiers: array

level.4

root-3

Certain i d e n t i f i e r s , called ~ e n d a r d i d e n t i f i e r s , ere predefined (e.g. sin, cos). In contrast to the w o r d - d e l i m l t e r s (e.g. QrraM), one is not r e s t r i c t e d to this d e f i n i t i o n and may elect to redefine any standard identifier, as they are assumed to be declared in a hypothetical block s u r r o u n d i n g the entire program block. Decimal n o t a t i o n is used for ~ u m b e r s . The letter E preceding the scale factor is pronounced as "times 10 to the power of". The syntax of u n s i g n e d numbers is s u m m a r i z e d in figure 1.b.

Figure

1.b

Unsigned

number

Note that if the number c o n t a i n s digit must precede and succeed occur in a number.

unsigned 3

numbers : 03 6272844

0.6

a decimal point, the point. Also.

5E-8

49.22E +08

at least one no comma may

IE 10

incorrectly written numbers: 3,487.159 XII .6

El0

5.E-16

Blanks. ends of lines, and comments ere considered as semara~rs. An a r b i t r a r y n u m b e r of s e p a r a t o r s may occur between any two consecutive Pascal symbols with the following exception: no separators may o c c u r w i t h i n i d e n t i f i e r s , n u m b e r s , or s p e c i a l symbols. H o w e v e r , at l e a s t o n e s e p a r a t o r m u s t o c c u r b e t w e e n a n y p a i r of c o n s e c u t i v e identifiers, n u m b e r s , or w o r d s y m b o l s .

Sequences of celled ~nos. the quote mark

characters enclosed To i n c l u d e a q u o t e twice.

examples of strings: "a' ";" "3 ° * t h i s s t r i n g has

by mark

single quote marks are in s s t r i n g , o n e w r i t e s

'begln" "don'°t " 33 c h a r a c t e r s "

2 --THE _C~.~,CF,,.P_-TBE_ D A T A

Data is the general expression d e s c r i b i n g a l l t h a t is o p e r a t e d on by t h e c o m p u t e r . At t h e h a r d w a r e a n d m a c h i n e c o d e l e v e l s , a l l data are represented as sequences of binary digits (bits). Higher level languages allow one to u s e a b s t r a c t i o n s a n d to ignore t h e d e t a i l s of r e p r e s e n t a t i o n - - l a r g e l y by d e v e l o p i n g t h e C o n c e p t of d a t a ~ v o e . A data type defines t h e s e t of v a l u e s a v a r i a b l e m a y a s s u m e . Every variable occurring in a p r o g r a m m u s t be a s s o c i a t e d w i t h One and only one type. A l t h o u g h data t y p e s in P a s c a l can be quite sophisticated, each must be ultimately built from unstructured types. An unstructured t y p e is e i t h e r d e f i n e d by the programmer, a n d t h e n c a l l e d a d e c l a r e d s c a l a r t y p e . or o n e of the four standard scalar types--integer, r e a l , B o o l e a n . or char. A scalar type is characterized by the s e t of its d i s t i n c t values, u p o n w h i c h a l i n e a r o r d e r i n g is d e f i n e d . The v a l u e s a r e denoted by identifiers in the definition of t h e t y p e (see chapter 5).

A. T h e

type

Boolean

A Boolean value is one of t h e the predefined identifiers false The following applied to Operators.)

and or no__t

logical Boolean

logical logical logical

operators operands:

logical truth and true. yield e (Appendix

values

denoted

Boolean value B summarizes

by

when all

conjunction disjunction negetion

Each of the relational operators (=, <>. <=, <, >, >=, ~) yields a Boolean value. Furthermore. the type Boolean is defined such that false < true. Hence, it is possible to define each of the 16 B o o l e a n operations using the above logical and relational operators. For example, if p and q are Boolean values, one can express implication equivalence e x c l u s i v e OR

as as as

p <= q p = q p <> q

13

Standard Boolean functions--i.e, standard functions which yield a Boolean result--ere: (Appendix A summarizes all s t a n d a r d functions.)

odd(x) e o l n (f) eof(f)

B.

The

t r u e if t h e i n t e g e r x is odd, f a l s e o t h e r w i s e e n d of a l i n e , e x p l a i n e d in c h a p t e r 9 e n d of f i l e . e x p l a i n e d in c h a p t e r 9

type i n t e g e r of

A value of type integer is an element implementation-defined s u b s e t of w h o l e n u m b e r s . The following arithmetic operators a p p l i e d to i n t e g e r o p e r a n d s : *

rag_d_ + -

multiply divide and truncate a mw3t b : a - ((a ~ add subtract

(i.e. v a l u e b)*b)

The relational operators =, r e s u l t w h e n a p p l i e d to i n t e g e r Four

important

abs (x) sqr(x) trunc(x)

round(x)

standard

yield

is

integer

not

when

rounded)

<>. <, <=. >=. > y i e l d a B o o l e a n opersnds. <> d e n o t e s i n e q u a l i t y .

functions

yielding

sqr yield of type

integer

results

are:

an integer result only when their i n t e g e r . If i is a v a r i a b l e of t y p e

the " n e x t " i n t e g e r , a n d the preceding integer m o r e c l e a r l y e x p r e s s e d by i-I

There exists an implementation-dependent maxint, if a a n d b are i n t e g e r e x p r e s s i o n s ,

the

to

be c o r r e c t l y

implemented

expressions

standard identifier the operation:

b

is g u a r a n t e e d

value

t h e r e s u l t is t h e a b s o l u t e v a l u e of x . t h e r e s u l t is x s q u a r e d . x is a r e a l v a l u e : t h e r e s u l t is its w h o l e p a r t . (The fractional part is discarded. Hence trunc(J.7)=3 and trunc(-J.7)=-3) x is a real value; the result is t h e r o u n d e d integer, round(x) m e a n s for x > = 0 t r u n c ( x + 0 . 5 ) , and for x < 0 t r u n c ( x - 0 . 5 )

Notes: abs and argument is also integer, then succ(1) yields pred(i) yields This is, h o w e v e r , i+1 and

a go

an

the

when:

14

a b s (a o@. b ) abs (a) abs (b)

C . The

type

<: <: <:

maxint maxint maxint

, , and

real

A v a l u e of t y p e r e a l implementation-defined

is

an e l e m e n t of subset of r e a l

As l o n g as at l e a s t o n e of t h e real (the other possibly being operators yield a real value: * / + -

multiply divide (both operands the result is add subtrcct

Standard result:

functions

abs (x) sqr (x) Standard result:

when

the numbers.

operands is of t y p e of type integer) the

may be always

accepting

integers, real)

a reel

following

but

argument

yield

a real

absolute value x squared

functions

sin (x) cos (x) amctan (x) ln(x) e x p (x) sqrt (x)

with

trigonometric

real

or

integer

argument

and

meal

functions

natural logarithm exponential function square root

Warninq: although r e a l is i n c l u d e d as a s c a l a r t y p e , it c a n n o t a l w a y s be u s e d in t h e s a m e c o n t e x t as t h e o t h e r s c a l a r t y p e s . I n Particular. the functions pred and succ cannot take real arguments, a n d v a l u e s of t y p e m e a l c a n n o t be u s e d w h e n i n d e x i n g arrays, n o r in c o n t r o l l i n g for statements, nor for defining the base type of a set.

D.

The

type

char

A value of t y p e c h a r is an e l e m e n t of a f i n i t e a n d o r d e r e d s e t of characters. Every computer s y s t e m d e f i n e s s u c h a s e t for t h e purpose of c o m m u n i c a t i o n . These characters are then available on the input and output equipment. Unfortunately. there does not

15

exist one standard character set; the elements and their ordering dependent.

therefore, the definition of is strictly implementation

The following minimal assumptions hold independent of t h e u n d e r l y i n g impementation:

The c h a r a c t e r set includes 1. t h e alphabetically ordered A...Z 2. the numerically ordered and digits 0...9 3. t h e b l a n k c h a r a c t e r .

A character enclosed c o n s t a n t of t h i s t y p e .

examples: '~" (To represent

'G"

set

"°°'

an apostrophe,

one

the

capital

contiguous

in a p o s t r o p h e s

'3 °

of

for

set

(single

type

Latin of

letters

the

quotes)

char,

decimal

denotes

a

°X ° writes

it

twice.)

The two standard f u n c t i o n s ~r.~[ a n d ~ h r a l l o w t h e m a p p i n g o f t h e g i v e n c h a r a c t e r s e t o n t o a s u b s e t of n a t u r a l n u m b e r s - - c a l l e d the ordinal numbers of the character set--and vice verse; ord end chr e r e c a l l e d ~ r a n s f e r ~ u n c t i o n s . ord(c)

chr(i)

is the ordinal number of the character c in t h e underlying ordered c h a r a c t e r s e t . (also s e e s e c t i o n 5.A) is the character value with the ordinal number i.

One sees immediately i.e. chr(ord(c)) = c

Furthermore, ci < e 2

the

that

ord

-and-

ordering iff

and

chr

are

ord(chr(i))

of a given character ord(cl) < ord(c2)

inverse

functions.

= i

set

is

defined

by

This definition can be extended to each of t h e r e l a t i o n a l operators: =. <>. <, <=, >=. >. If R d e n o t e s o n e of t h e s e operators, then ci

When type

R e2

iff

ord(cl)

R ord(c2)

t h e a r g u m e n t of t h e s t a n d a r d functions c h a r , t h e f u n c t i o n s c a n be d e f i n e d as:

pred(c) succ(c)

= =

pred

and

succ

is

of

c h r (ord (c)-I) chr (ord (c)+I)

Note: The predecessor ( s u c c e s s o r ) of a c h a r a c t e r is d e p e n d e n t upon the underlying c h a r a c t e r s e t a n d is u n d e f i n e d if one d o e s not e x i s t .

TH~ PR0~RAM H E A ~ N G

3 AN_E THE DECLARAT~]~N EAR_T_

Every program consists of a heading and a block. The block contains a declaration part, in which all objects local to the program are defined, and a statement part. which specifies the actions to be executed upon these objects. <program> ::= ::: < l a b e l d e c l a r a t i o n part> <procedure and function declaration <statement part>

A.

Program

part>

heading

The heading gives the program a n a m e (not o t h e r w i s e significant inside the program) a n d l i s t s its p a r a m e t e r s , through which the program communicates with the environment (see c h a p t e r 13.B.I). <program

B.

Label

heading>

::= & ~ l i g ~ i m { ,

declaration

(
identifier>

part

Any statement in a program may be m a r k e d by p r e f i x i n g the statement with a label followed by a c o l o n (making possible a reference by a goto statement). However, t h e l a b e l m u s t be defined in t h e j a b e l ~ c l a r ~ & i ~ A &~i~ before its u s e . T h e s y m b o l label heads this part, which has the general form: label

A l a b e l is d e f i n e d most 4 digits. example: i~el

C.

}; to

be

an

unsigned

integer,

end

consists

of

at

~,1~;

Constant

A ~&tao~ a constant. part, which f,Ji~&~

{,

definition

part

~efinition introduces an i d e n t i f i e r as a s y n o n y m for The symbol ~ introduces the constant definition has t h e g e n e r a l form:

= ;

{

= ;}

17

where a (possibly

constant signed),

is either" or a s t r i n g .

a

number,

a constant

identifier

The use of c o n s t a n t i d e n t i f i e r s g e n e r a l l y m a k e s a p r o g r a m m o r e readable and acts as a c o n v e n i e n t d o c u m e n t a t i o n a i d . It a l s o allows the programmer to group machine or example dependent quantities at the beginning of the p r o g r a m w h e r e t h e y can be easily noted and/or changed. (Thereby aiding the portability and m o d u l a r i t y of t h e p r o g r a m . ) As an

{

example,

program example

~roaram #onst

consider

3.1 of constant convert

addin

the

following

definition

pert

program:

}

(output);

= 32;

mulby

=

1.8;

low

= O;

high

sesarator = * °; var degree : low..high; J~eain writeln (separator): f o r d e g r e e := low J&o h i g h d o beQi~ write(degree°'c',round(degree*mulby if odd(degree) then writeln end : writeln ; writeln (separator) and.

0c 2c 4c 6c 8c I0c 12c 14c 16c IBC 20C 22c 24c 26c 28c 30c 32c 34c 36c 38c

32f 36f 39f 43f 46f 50f 54f 57f 61f 64f 68f 72f 75f 79f 82f 86f 90f 93f 97f 100f

Ic 3c 5c 7c 9c 11c 13c 15c 17e 19c 21c 23c 25c 27c 29c 31c 33c 35c 37c 39c

34f 37f 41f 45f 48f 52f 55f 59f 63f 66f 70f 73f 77f 81f 84f 88f 91f 95f 99f I02f

= 39;

+ addin),'f')"

18

D.

Type

definition

part

A data type in P a s c a l m a y be e i t h e r d i r e c t l y described in t h e variable declaration or referenced by a ~ypQ ~dentifier. Provided are not only several standard type identifiers, but also e mechanism, t h e ~Z#,~ ~ e f i n i t i o o , for creating new types. The symbol ~.@ introduces a program part containing type definitions, The definition itself determines a set of values and associates an i d e n t i f i e r with the set. The g e n e r a l form is: t~oe

Examples chapters.

E.

Variable

of

type

= : definitions

declaration

Every variable ~ar!ab!e ~ ~ a n y u s e of t h e

[ ere

found

in

the

subsequent

part

occurring in a s t a t e m e n t declaration . This variable.

A variable declaration associates en with a new variable by s i m p l y listing its type. The symbol ~&~ heads the The general f o r m is: {

= :}

{, } { , }

example : v~ rootl,root2.root3: count,i: integer ; found : Boolean; filler : chari

m u s t be d e c l a r e d in a must textually Precede

identifier and a data type the identifier followed by variable declaration part.

: : : i}

reali

This identifier/type association is v a l i d t h r o u g h o u t the entire block containing the declaration, unless the identifier is redefined in a subordinate block. Suppose a b l o c k B is n e s t e d within block A. (i.e. declared w i t h i n t h e s c o p e of a n d h e n c e subordinate t o A . as in f i g u r e 0 . b ) It is p o s s i b l e t o d e c l a r e an identifier in B that is already declared in A . T h i s has t h e effect of associating that identifier with a variable local to B--not available to A--which may be o f a n y t y p e . T h e l a t t e r definition is then valid throughout the s c o p e o f Bo u n l e s s redeclared in a block subordinate to B . It is n e t a l l o w e d to declare a single identifier more than once within the same level and scope. Hence the following is a l w a y s i n c o r r e c t . ~J~ a : integer; a : real;

19

F.

Procedure

and

function

declaration

part

Every procedure or function must be defined (or a n n o u n c e d ) before its use. Procedure end function declarations are treated in chapter 11. P r o c e d u r e s are subroutines end are activated by procedure statements. Functions are subroutines that yield a result value, and therefore can be u s e d as c o n s t i t u e n t s of

expressions.

4 THE

~QNCEPT

OE A C T I O N

Essential to a computer program is action. That is, a program must do something with its data--even if that action is the choice of doing nothing! ~ ~ describe these actions. Statements are either &imole (e.g. the assignement statement) or structured.

A.

The

assignment

statement

The most fundamental of s t a t e m e n t s is t h e ~ s s i ~ d 3 ~ wtatement. It specifies that a newly computed value be assigned to a Variable. T h e f o r m o f an a s s i g n m e n t is:

:= < e x p r e s s i o n >

w h e r e := is t h e & s s i c n m e n t ~ p e r a t o r . relational operator =. The statement current value of a is replaced with ~ecomes 5".

not t o be c o n f u s e d w i t h t h e "a := 5" is pronounced "the the value 5", or simply, "a

The n e w v a l u e is o b t a i n e d by e v a l u a t i n g an ~ x m r e s s i o n consisting of constant or variable operands, operators, and function designators, (A f u n c t i o n designator specifies the activation of a function. Standard functions are listed in Appendix A; user defined functions are explained in chapter 1 1 . ) An e x p r e s s i o n is e rule for calculating e value where the conventional rules of left to right evaluation and ~Eerator erecedence are observed. The operator ~ot (applied to a Boolean operand) has the highest precedence, followed by the multiplying operators (*. /, ~iv, ~_#J~, ~U~), then the adding operators (+, -, ~), and of lowest precedence, the relational operators (=, <>, <, <=. >=, >. &~). Any expression enclosed within parentheses is evaluated independent of preceding or succeeding operators. examp les : 2 * 3-4 * 5 15 ~ 4 * 4 B0/5/3 4/2 *3 sqrt (sqr(3)+11*5) The syntax The reader

of is

= = = =

(2*3) (15 ~iv (80/5)/3 (4/2)*3

(4*5) 4)*4

Appendix D reflects the recommended to reference

= -14 = 12 = 5.333 = 6.000 = 8.000

exact rules of precedence. it whenever in doubt.

Boolean expressions have the property thst their value known before the entire expression has been evaluated. for example, that x=O. Then (x>O) is

~

already

may b e Assume

(x
to

be

false

after

computation

of

the

first

21

factor, and the second need not be e v a l u a t e d . T h e r u l e s of Pascal neither require nor forbid the evaluation of t h e s e c o n d part in such cases. This means that the programmer must assure that the second f a c t o r is w e l l - d e f i n e d , independent of t h e v a l u e of the first factor. Hence. if o n e a s s u m e s that the array a has an index ranging f r o m I t o 10. t h e n t h e f o l l o w i n g example is in error~ x

:=

O:

~eoeat (Note

a[11] .)

x

that

:=

x+1

if

no

~ntil a[i]

=

(x>lO)

~

O,

program

the

(a[x]=O) will

refer

to

an

element

Direct assignment is p o s s i b l e to variables of any type, except files. However. the variable (or the function) and the expression m u s t be of i d e n t i c a l type. with the exception t h a t if the type of the variable is r e a l , t h e t y p e o f t h e e x p r e s s i o n may be integer. (If a subrange t y p e is i n v o l v e d , its associated scalar type determines the validity of the assignment; see section 5 . B .)

examples of root I root 1 root3 found

assignments : := p i * x / y := - r o o t I := ( r o o t l + r o o t 2 ) * ( 1 . 0 := y>z count := c o u n t + I degree := d e g r e e + 10 sqrpr := s q r ( p r ) y := s i n ( x ) + cos(y )

B.

The

compound

+ y)

statement

The ~pmpound ~tatement specifies t h a t its c o m p o n e n t statements be executed in the same sequence as they are written. The symbols ~egin a n d ~ n d a c t as s t a t e m e n t brackets. Note that the "body" of a Program has t h e f o r m o f a c o m p o u n d statement.

{ program 4.1 the compound ~ro~ram

statement

beginend(output);

~j~ sum : integer; ~e~in s u m := 3 + 5 ; w r i t e l n (sum , - s u m )

e_o~. B

-8

}

22

Pascal uses the semicolon to ~eoarete statements, not to terminate statements; i.e. the semicolon is N O T p a r t of t h e statement. The explicit rules regarding semicolons are reflected in the syntax of Appendix D . If one h a d w r i t t e n a s e m i c o l o n a f t e r t h e s e c o n d s t a t e m e n t , t h e n an ~ m o t y ~ t a t e m e n t ( i m p l y i n g no action) would have been assumed between the semicolon and the symbol ~nd. This does no harm, for an e m p t y s t a t e m e n t is allowable at this point. Misplaced semicolons can. h o w e v e r , cause troubles--note t h e e x a m p l e in s e c t i o n 4.D .

C. R e p e t i t i v e statements

Reoetltive ~tatmments specify that certain statements be repeatedly executed. If the number of repetitions is k n o w n beforehand (before the repetitions are b e g u n ) , t h e for s t a t e m e n t is usually the appropriate c o n s t r u c t to e x p r e s s t h e s i t u a t i o n ; o t h e r w i s e , t h e r e p e a t or w h i l e s t a t e m e n t s h o u l d be u s e d .

C .1 T h e The

while

while while

statement

statement

has

<expression>

the

~g

form:

<statement>

The expression controlling the repetition must Boolean. It is e v a l u a t e d b e f o r e e a c h i t e r a t i o n , so t a k e n t o k e e p t h e e x p r e s s i o n as s i m p l e es p o s s i b l e .

{ program 4.2 compute h(n) Qroeram

=

I +

1/2 +

egwhile(input,

I/3

+

o..

+ I/n

be care

of t y p e must be

}

output):

varn ~ealn

: integer; h : real; read(n); write(n); h :: O; while n>O ~ ~p.~ h := h + I/n; nnd; writeln(h) end.

10

n

:= n-1

2.928968253968e+00

The statement executed statement in the above becomes false. If its

by the while statement (a c o m p o u n d c a s e ) is r e p e a t e d u n t i l t h e e x p r e s s i o n value is f a l s e at t h e b e g i n n i n g ° t h e

23

statement

C.2

The

is

not

repeat

The repeat

has

<statement>

The sequence is executed the Boolean iteration.

{ program 4.3 compute h(n)

at

all.

statement

statement

reoeat

orooram

executed

the

form:

{ ; <statement>}

until

<expression>

of at

statements between the symbols ~eat and until least once. Repeated execution is c o n t r o l l e d by expression, which is evaluated after every

=

I +

I/2

egrepeat(input,

+

I/3

+

...

+

I/n

}

output);

vat n : integer ; h : real; beoin read(n); write(n); h := O; repeat h := h + I/n; n

:= n - 1

n=O; writeln

(h)

am~. 10

The above happens if correct for

2.928968253968e+00

program performs correctly n<=O. The while-verslon a l l n. i n c l u d i n g n=O.

Note that it is a sequence of statement executes; a bracketing redundant (but n o t i n c o r r e c t ) .

C.3

The

for

of

for n>O, Consider what the same program is

statements that pair ~e~!~...~nd

the repeat w o u l d be

statement

The for statement indicates that a statement be r e p e a t e d l y executed while a progression of values is assigned to t h e &pn~ zariable of t h e f o r s t a t e m e n t . It h a s t h e g e n e r a l form: for

variable>

:= do < s t a t e m e n t >

to

variable>

:= do < s t a t e m e n t >

downto

value>

(or)
value>

24

{

program 4.4 compute h(n)

=

1 +

1/2

+

1/3

+

...

+

1/n

}

o r o o r ~ eGfor(input, output); yam i,n : integer; h : real; beain r e a d (n) ; w r i t e (n) ; h := O; i := n d Q ~ n t o I d ~ h := writeln(h) end.

10

+

1/i;

2.928968253968e+00

{ program 4.5 compute the cosine using cos(x) = 1 -x*'2/(2"I) ~rQoram

h

cosine

(input,

the expansion: + x*'4/(4"3"2"I)

-

...

}

output);

const vat

eps = Ie-14; x , s x , s , t :real; i .k ,n :integer ; b#ain read(n); for i := I t o n ~L~ beoin read(x); t := I; k :-- O; s := I; s x ~DJ~ abs(t) > eps*abs(s) do ~e~D k :~ k + 2; t := -t*sx/(k*(k-1)); :=

S

end

:~ s q r ( x ) ;

s+t

;

writeln

(x ,s ,k d i v

2)

end end.

1.534622222233e-01 3.333333333333e-01 5.000000000000e-01 1.000000000000e+O0 3.141592653590e+00

9.882477647614e-01 9.449569463147e-01 8.V75825618904e-01 5.403023058681e-01 -1.000000000000e+O0

5 6 ? 9 14

The control variable, the initial value, and the final value must be of t h e s a m e s c a l a r t y p e (excluding type real), and must not be altered by the for statement. The initial and final values are evaluated o n l y o n c e . If in t h e c a s e of ~ (~nw~to) the initial value is g r e a t e r (less) than the final value, the for statement is not e x e c u t e d . The final value of the control variable is left undefined upon normal exit from the for statement.

25

A

for

statement

for is

v

of t h e

:= el & g

equivalent

to

e2 ~g the

j~ e1<=e2 &hen ~eain v := el; {at

and

is

this

a

for

~or

v

el

equivalent

As

v

statement

v

is

the

statements:

succ(v);

S:

...;

v

:=

S;

...;

v

:= e2;

form:

e2 ~ g

S

statement:

v

a final

consider

S; is

v

:=

pred(v);

summing

S

undefined}

the

following

program.

{ program 4.6 compute I - I/2 + 1 / 3 - . . . + I / 9 9 9 9 - 1/10000 1) l e f t to r i g h t , in s u c c e s s i o n 2) l e f t to r i g h t , a l l p o s a n d n e g t e r m s , 3) r i g h t t o l e f t in s u c c e s s i o n 4) r i g h t t o l e f t , a l l p o s a n d n a g t e r m s , ~roqram

e2;

undefined}

if el>=e2 then beain v := el; ea~ {at this point,

example

of

:=

of t h e

#ownto to

S

sequence

S;

point,

:=

form:

4 ways. then

subtract

then

subtract}

(output);

v_ar

s 1.s 2p ,s 2n ,s 3,s4p ,s 4n .lrp .lrn ,rlp ,rln : real; i : integer ; _be.qin s 1 := 0; s2p := O; s 2 n := O; s 3 := O; s 4 p := O; s 4 n fgj& i := I J2o 5 0 0 0 ~L~ begin l r p := I / ( 2 - i - I ) ; { pos terms, left to right } l r n := I / ( 2 - i ) ; { n e g t e r m s , left to right} r l p := I / ( I 0 0 0 1 - 2 - i ) ; { p o s t e r m s , r i g h t to l e f t } r l n := I / ( I 0 0 0 2 - 2 - i ) ; { n e g t e r m s , r i g h t to l e f t } sl := s l + lrp - l r n : s 2 p := s 2 p + I r p ; s 2 n := s 2 n + I r n : s 3 := s 3 + r l p - r l n ; s 4 p := s 4 p + r l p ; s 4 n := s 4 n + r l n gnU; w r i t e l n (s I ,s 2p ~s 2n ) ; writeln(s3,s4p-s4n) end.

~.930991830595e-01 6.930971830599e-01

6.930971830612e-01 6.930991830601e-01

:=

O;

26

Why

do t h e

four

D. Conditional

"identical"

sums

differ?

statements

A ~qgo~L~t±onal ~tatement an if or case statement, selects a single statement of its component statements for execution. The if ~Lt~tement specifies that a statement be executed only if a certain condition (Boolean expression) is true. If it is false, then either no statement or the statement following the symbol e l s ~ is e x e c u t e d .

D .I The

if s t a t e m e n t

The t w o

forms

if

for an

if s t a t e m e n t

are:

<expression>

~hen

<statement>

<expression>

~hen

<statement>

(or) if

else

<statement>

The expression b e t w e e n t h e s y m b o l s if a n d ~ h e n must be of t y p e Boolean. Note that the first form may be regarded as an abbreviation of the second when the alternative statement is the empty statement. Caution: there is never a semicolon before an else1 Hence, the text: if

p J&heI3 . ~ e a i n

is i n c o r r e c t . if

p then;

$1;

Perhaps ~#ain

Here. the statement between the then statement following

S2;

even

$1;

S2;

$3 end;

more

else

$4

deceptive

is t h e

text:

$3 ~nd

c o n t r o l l e d by the if is t h e e m p t y s t a t e m e n t . and the semicolon; hence, the compound the if s t a t e m e n t will a l w a y s be e x e c u t e d .

The s y n t a c t i c a m b i g u i t y a r i s i n g f r o m the c o n s t r u c t : if < e x p r e s s i o n - l > J~Q2.J3 i f < e x p r e s s i o n - 2 > ~ h e n < s t a t e m e n t - l > rise <statement-2> is r e s o l v e d by i n t e r p r e t i n g t h e c o n s t r u c t as e q u i v a l e n t iL <expression-l> ~hen ~eain if <expression-2> ~hen <statement-l> else <statement-2>

The r e a d e r is f u r t h e r c a u t i o n e d t h a t e c a r e l e s s l y statement can be very c o s t l y . Take t h e e x a m p l e n-mutually exclusive conditions, c1.,.Cno each

to

f o r m u l a t e d if w h e r e one has instigating a

27

distinct action, si. Let P ( c i ) be t h e true, and say that P(ci)>=P(cj) for i < j . s e q u e n c e o f if c l a u s e s is : if

probability of ci b e i n g Then the most efficient

ci t h e n sl e l s e i f c2 S b e n s 2 else ... else

The fulfillment of statement completes remaining tests.

if

c(n-1)

~hen

s(n-1)

e!se

a condition and the execution of the if statement, thereby bypassing

If " f o u n d " is a v a r i a b l e of t y p e B o o l e a n , a n o t h e r of t h e if s t a t e m e n t can be i l l u s t r a t e d by: if e = b A much

J~b~.J3 f o u n d

simpler

found

:= a = b

sn

:= t r u e

statement

is:

:lse

found

:= f a l s e

frequent

its the

abuse

28

{ program 4.7 write roman numerals ~ro~ram

}

roman (output):

~ar x oy : integer ; bgqio y := I; ~j~p~&t_ x := y: write(x," w_hile x > = 1 0 0 0 do bw_qin write(°m°): if x>=500 then bea~j3 w r i t e ( ' d ") ;

"): x

:= x - l O 0 0

~n~:

x

:= x - 5 0 0

end :

x

:= x - l O 0

end:

x

:= x - 5 0

~nd:

x

:= x - t 0

eZld;

x

:= x - 5

end;

x

:= x - 1

end;

w_h_Ale x>=lO0 d o if

b~gin write('c°): x > = 5 0 t, hen beEi_.on w r i t e ( ' l ' ) :

w_hi!e x>=10 d~ be_~i~n

write('x'):

if x > : 5 ~hsn beoln x >= be~i[z writeln;

~h_~le

write('v'); 1 ~l write('i'): y : : 2*y

until y>5000 and.

I 2 4 U 16 32 64 128 256 512 1024 2048 4096

i ii iiii viii xvi xxxii lxiiii cxxviii cclvi dxii mxxiiii mmxxxxviii mmmmlxxxxvi

N o t i c e a g a i n t h a t it is o n l y o n e s t a t e m e n t t h a t is c o n t r o l l e d by an if c l a u s e . Therefore, when more than one action is i n t e n d e d , a compound statement is n e c e s s a r y .

The next program r a i s e s a r e e l v a l u e x t o t h e p o w e r y. w h e r e y is a non-negative integer. A simpler, and evidently correct version is obtained by o m i t t i n g the inner while statement: the result z is t h e n o b t a i n e d through y multiplications by x. N o t e the loop invariant: z*(u**e)=x**y. The inner while statement leaves z and u**e invariant, and obviously improves the efficiency of t h e a l g o r i t h m .

29

{ program 4.B exponentiation #j~oera~

with

natural

exponent

exponentiatiom(input,

}

output);

v~/~ e , y : i n t e g e r ; u , x , z : r e a l ; #eQin read(x,y); write(x oy)' Z := 1; u :: X ; e := y; Wb~

e>O

b ~

{z*u**e

do

while ~_~ b~_~in

en~. ; := e - l ;

e a~; writeln(z)

= x**y,

z {z

e>O}

odd(e) do e :-- e d i v

2:

u

:= s q r ( u )

:= u * z

= x*~y}

2.000000000000e+O0

7

I .2UOOOOOOOOOOe+02

The following program plots a reel-valued function f ( x ) by letting the X~xis run vertically and then printing an a s t e r i s k in positions corresponding to t h e c o o r d i n a t e s . The position of the asterisk is o b t a i n e d by c o m p u t i n g y=f(x), multiplying by a scale factor s, rounding the product to t h e n e x t i n t e g e r , a n d then adding a constant h end letting the asterisk be p r e c e d e d by that many blank spaces.

30

{

orogram 4.9 graphic representation of e f(x) = exp(-x) * sin(2*pi*x)

&re,ram

function }

graph 1(cutout) ;

const

d = 0.0625: {3/16o 16 l i n e s for i n t e r v a l [x,x+1]} s = 31: {31 c h a r a c t e r widths for i n t e r v a l [y,y+1]} h = 34: { c h a r a c t e r position of x-axis} c = 6.28318: {2*pi} lim = 32: var XoY : reel: i,n : integer: beain x := O; f_.gor_ i := I J2~ l i m d o b~eqin y := e x p ( - x ) * s i n ( c * x ) ; n := r o u n d ( s ' y ) + h: r_epe_a~ w r i t e ( ° "); n := n - 1 until n=O: writeln('**) : X

:=

x+d

end_ en_dd.

95

31

O.2

T he

case

statement

The case statement c o n s i s t s of an e x p r e s s i o n (the s e l e c t o r ) a n d a llst of s t a t e m e n t s , e a c h b e i n g l a b e l l e d by a c o n s t a n t of t h e type of t h e s e l e c t o r . T h e s e l e c t o r t y p e m u s t b9 a s c a l a r t y p e , excluding the type real. The case statement selects for execution that statement whose l a b e l is e q u a l t o t h e c u r r e n t v a l u e of t h e s e l e c t o r ; if no s u c h l a b e l is l i s t e d , t h e e f f e c t is undefined. Upon completion of t h e s e l e c t e d s t a t e m e n t , control goes t o t h e e n d of t h e c a s e s t a t e m e n t . T h e f o r m is: cas~ <expression> ~Z
label'list>

: <statement>; : <statement>

end

examples: (assume var gase i of O: x := O; I: x := s i n ( x ) ; 2: x := c o s ( x ) ; 3: x := e x p ( x ) ; 4: x := I n ( x ) end

i:

i n t e g e r ; ch: c h a r ; ) c a s e ch of ' a ' , ' g ' . ' d ' : ch := s u c c ( e h ) ; "z°,*e°: ch := p r e d ( c h ) ; "f" °S°: {null case} end

Notes: "Case labels" are ~ o r d i n a r y l a b e l (see s e c t i o n 4 . E ) and c a n n o t be r e f e r e n c e d by a g o t o s t a t e m e n t . T h e i r o r d e r i n g is arbitrary; however, labels m u s t be u n i q u e w i t h i n a g i v e n c c s e statement. Although the efficiency of the case statement d e p e n d s on t h e implementation, the general rule is to u s e it w h e n o n e has several mutually exclusive statements with similar probability of s e l e c t i o n .

E.

The

goto

statement

A qoto ~tatement is a s i m p l e s t a t e m e n t indicating that further processing s h o u l d c o n t i n u e at a n o t h e r p a r t of t h e p r o g r a m t e x t , n a m e l y at t h e p l a c e of t h e l a b e l . ~ot~

Each label (an u n s i g n e d i n t e g e r t h a t is at m o s t 4 d i g i t s ) m u s t appear in a label declaration p r i o r to its o c c u r r e n c e in t h e program body. The s c o p e of a l a b e l L d e c l a r e d in a b l o c k A is the entire text of block A. That is, ~ i ~ s t a t e m e n t in t h e statement oart of A may be p r e f i x e d w i t h L : . T h e n any o t h e r statement w i t h i n t h e ~ b o l e of b l o c k A m a y r e f e r e n c e L in a g o t o statement.

32

example

(program

label

I:

fragment):

{block

A}

* o ,

g r o c e d u r e B; { b l o c k B} l a b e l 3; :eoin 3: w r i t e l n ( ' e r r o r ' ) ; . , °

got o 3

end ; { b l o c k ~in

{block

B}

A}

I: w r l t e l n ( ° t e s t f a i l s ' ) {a ~'~oto 3" is not a l l o w e d sod

in b l o c k

A}

Warning: The effect of jumps from outside of a s t r u c t u r e d statement into that statement is not defined . Hence. the following examples are i n c o r r e c t . (Note t h a t c o m p i l e r s do not n e c e s s a r i l y i n d i c a t e an e r r o r . ) examples : a)

for

i:= I ~ 10 do _beoin S I; 3:S2 and ;

b)

if P then woto

3;

o

if c)

a'~bed~ 3: S

oroc@dure beoin

P

...

3: S eo~ ; b=_q~i~z ... ~oto 3 end.

A goto statement should be r e s e r v e d for u n u s u a l or u n c o m m o n s i t u a t i o n s w h e r e the n a t u r a l s t r u c t u r e of an a l g o r i t h m has to be broken. A good rule is to a v o i d the u s e of j u m p s to e x p r e s s regular i t e r a t i o n s and c o n d i t i o n a l e x e c u t i o n of s t a t e m e n t s , for such jumps destroy the reflection of the structure of computation in the t e x t u a l ( s t a t i c ) s t r u c t u r e of t h e p r o g r a m . Moreover, the lack of correspondence between textual and computational (static and dynamic) structure is extremely detrimental to the c l a r i t y of t h e p r o g r a m and m a k e s the task of

33

verification much more difficult. The Pascal program is o f t e n an i n d i c e t i o n not yet learned "to think" in P a s c a l

construct

in

other

programming

lenguages).

presence that the (as t h i s

o £ g o t o ° s in a programmer has is a n e c e s s a r y

SCALAR

A.

Scalar

AND

5 SUBRANG~

TYPE~

types

The basic definition identifiers

data types in Pascal indicates an o r d e r e d set which denote the values.

identifier>

example: type color

=

are the ~calar ~YPeS. Their of values by enumerating the

(

{.

}

) ;

=

(white,red,blue,yellow.purple.green, orange,black); (male,female);

sex = day = (mon,tues.wed,thur,fri,sat,sun); operators = (plus.minus,times,divide): illegal ~voe

example: workday = (mon,tues,wed,thur,fri.sat): free = (sat,sun): the type of sat is ambiguous)

(for

The reader defined as: type This true

is

already

Boolean

=

automatically and specifies

acquainted

(false,

with

the

standard

type

Boolean

true);

implies the standard that false<true.

identifiers

false

and

The relational operators on all scalar types types. The order is constants are listed.

=, <>, <, <=. >=, and >. are applicable provided both comparands are of the same determined by t h e s e q u e n c e in which the

Standard

arguments

functions

succ(×)

e.g.

pred (x)

ord ( x ) The ordinal ord(pred(x))

Assuming Boolean, following

+

with

succ(blue) pred(blue) ord(blue)

number I.

of t h e

that c and cl and sl...sn are meaningful

of

scalar

= yellow = red = 2 first

constant

are of type color are arbitrary statements:

types

the the the

are:

successor of x predecessor of x ordinal n u m b e r of x

listed

is

(above), statements,

O,

ord(x)

b is of then

=

type the

35

for

c

while if

:= b l a c k

(cl<>c)

c>white

#ownto

and

then

c

red

b do

~

sl

sl

:= p r e d ( c )

case c of red .blue ,yellow: s I; p u r p l e : s2: g r e e n , o r a n g e : s3; white.black: s4 and

B.

Subrange

types

A t y p e may b e d e f i n e d as a a~bran~e of any other already defined scalar type--called its ~ss#_~iated ~calar ~voa. The definition o£ a subrange simply indicates the least and the largest constant value in the subrange, w h e r e t h e l o w e r b o u n d m u s t be less then the upper bound. A subrange of t h e t y p e r e a l in ~#_~ allowed. i&t#_~ < t y p e

identifier>

=

., < c o n s t a n t >

;

Semantically, a subrange type is an appropriate substitution for the associated scclar type in all definitions. Furthermore. it is the associated scalar type which determines the validity of all operations involving values of subrange types. For example, given the declaration; vat

a:

1..10;

The associated assignments a

:= b;

c

b:

0..30;

scalar

:= b;

b:=

type

c:20..30; for

a.

b,

and

e is

integer.

Hence

the

c;

are all valid statements, although their execution may sometimes be infeasible. The phrase "or s u b r a n g e thereof" is t h e r e f o r e assumed to be implied throughout t h i s t e x t a n d is not a l w a y s mentioned (as it is in t h e R e v i s e d Report.)

example : type days = (mon,tues ,wed,thur,fri,sat.sun): workd = mon..fri: {subrange of days} index = 0..63; {subrange of inteqer} letter = a .. z ; { s u b r a n g e of c h a r }

{scalar

type}

Subrange types provide the means for a more explanatory statement of the problem. To the implementor they also suggest an opportunity to conserve memory space and to introduce validity checks upon assignment at run-time. (For an example with subrange types, see program 6.3.)

STRUCTURED TYPES

IN

6 GN~NJ~_~AL--THE

ARRAY --IN PA_~T~CDLA,B,

Scaler and subrange types are unstructured types, The other types in Pascal are &~Wcture~ ~vees. As s t r u c t u r e d statements were compositions of other statements, structured types are compositions of o t h e r t y p e s . It is t h e t y p e ( s ) of the ~ommo~nt& and--most importantly--the structuring method that characterize a structured type, An option available to e a c h of t h e s t r u c t u r i n g methods is an indication of t h e p r e f e r r e d internal data representation. A type definition prefixed with the symbol ~_~ed signals the compiler to economize storage requirements, even at the e x p e n s e of additional execution time and a possible expansion of t h e c o d e . due to the necessary packing and unpacking operations. It is t h e user's responsibility to realize if he w a n t s t h i s t r a d e of efficiency for space, (The a c t u a l e f f e c t s upon efficiency and savings in s t o r a g e space are implementation dependent, and may. in f a c t , be n i l . )

The array

type

An array type consists of e f i x e d n u m b e r of c o m p o n e n t s (defined when the array is i n t r o d u c e d ) w h e r e a l l a r e of t h e s a m e t y p e , called the ~mooneOJ2 or ~se ~voe. Each component can be explicitly denoted and directly accessed by t h e n a m e of t h e array variable followed by the so-called io_~ex in square brackets. Indices are computable; t h e i r t y p e is c a l l e d t h e & B ~ . ~ , Furthermore, the time required to select (access) a component does not depend upon the value of the selector (index); hence the array is termed a £_Qndom-~cces& &~uctur~. The the

definition of an a r r a y s p e c i f i e s i n d e x t y p e . The g e n e r a l f o r m is: ~&e

A

= array[T1]

both

the

component

of

variable

declarations

memory : ~.~L~[O..max] sick : ~r_~[days] £~ (Of c o u r s e these identifiers.)

and

o_f T2;

where A is a n e w t y p e i d e n t i f i e r ; TI is t h e i n d e x scalar type (where typos integer and real are index types); a n d T 2 is a n y t y p e .

examples

type

examples

-and-

~L integer Boolean assume

the

sample

t y p e a n d is a not allowable

assignments

memory[i+j] := x sick[men] := t r u e definition

of

the

auxiliary

37

{ program find the mrooram

part 6.1 largest and

minmax(input,

smallest

number

in a g i v e n

output);

const n = 20; vat i.u .v.min.max : integer; a : arrav[1..n] of integer; beoin [ a s s u m e t h a t at t h i s p o i n t in t h e p r o g r a m , contains the values: 35 6 8 94 7 88 -5 -3 - 6 3 0 -2 74 88 52 4 3 5 4} rain := a[ while i < beoiQ u _if u > v

I] ; m a x := rain; i := 2; n do := a]~i] ; v := a [ i + 1 ] ; then beqin ~ u > m a x J.hen max := u ; i f v < m i n t h e n rain := v en_ d e l s e be_~i__nn ~ v>max then m a x : = v: if u < m i n t h e n rain := u en~ : i := i + 2 e_O~: if i = n t h e n i f a [ n ] >max ~Lh~o m a x :: a [ n ] _ei~_~ i ~ e [ n ] <~nin t h e n rain := a [ n ] ; writeIn(max,min) erld .

94

-6

list

array a 12 3 5 9

}

38

[ program 0.2 extend program ~rocr~

4.9

to

print

x-axls

}

graph2(output);

const

d = 0.0625: { 1 / 1 6 , 16 l i n e s for interval s = 31; |31 character widths for interval maxl = 67; {max length of line} hl = 34; {character position of x-axls} c = 6.2U318; |2*pi} lim = 32; vat x.y : real, i.n : integer: a : ~rr~l~[O..maxl] of char" b@ain x := 0 ; f o r i := 0 t o m a x l d o a [ i ] := " ", j~ i := I t o l l m ~-G beef in

a[h]

:=

[x.x+1]~ [y.y+1]}

° : ° :

y := e x p ( - x ) * s i n ( c * x ) : a[n] := " * ' ; writeln(a); a[n] := " "; x := x

n

:= r o u n d ( s ' y )

+ d

en~ end.

!

+ hl;

39

(Consider how one would extend program 6.2 to print more one function--both with and without t h e u s e of an a r r a y . )

Since T2 may be of a n y t y p e , structured. In particular, if original array A is said to declamation of a multidimentional vat

M

: grrav[a..b]

than

the components of a r r a y s m a y b e T 2 is a g a i n an a r r a y , t h e n t h e be m u l t i d i m e n s i o n ~ . Hence. the a r r a y M c a n be s o f o r m u l a t e d :

of a r r e v [ c . . d ]

of

T;

(of t y p e

T)

and M [ i ] [j] then of M.

denotes

the

jth

component

For multidimensional arrays, convenient abbreviations: va____rr M

: ~r_~av[a..b,c.,d]

~

it

is

of

the

customary

ith

component

to

make

the

T;

and M[i.j]

We may regard M component (in t h e i t h r o w o f NI).

as jth

a matrix column) of

and the

say ith

that M[i.j] is t h e j t h component of M (of t h e

T h i s is not l i m i t e d to t w o d i m e n s i o n s , f o r T c a n a g a i n be structured t y p e . In g e n e r a l the (abbreviated) f o r m is: ~vo~ &r_~2~&~[ If O index o~dimensional, expressions .

types and

are a

= [ ,
type>}] the is

gL

a

type>

is said array by n denoted

;

t o be index

40

{ program 6.3 matrix multiplication EroRr~j~

1

matrixmul(input,

output):

const vat i s a b c beain fan ~

m = 4; p : 3; n : 2; : 1.,m; j : 1..n; k : 1..p; : integer ; : arr_@~t[1..m/1..p] of integer; : ~r_~[1..p °1..n] o f i n t e g e r ; : ~rrav[1..m,1.,n] of integer; {assign ~nitial values t o a a n d b} i "= I t o m d o fan k := I t o p d o beoio read(s); write(s); eli,k] := s end : writeln end : writeln ; ~ k := 1 ~j& p ~ ~eoin f~L~ J := I • o n d o bf,~ read(s): write(s); b[k°j] := s end : writeln end : writ eln ; {multiply e * b} ? o r i := 1 k s m d a O~t~l f a n J := I t a n do b~_qUia s := O: f o r k := I t o p d o s := s + a [ i , k ] * b [ k ° j ] c[i~,O] := s ; w r i t e ( s ) and : writeln n___dd e ; writeln #_O_~ •

~trinc&

I -2 1 -I

2 0 0 2

-I -2 2

3 2 I

I 6 I -9

10 -4 4 -2

were

defined

;

3 2 I -3

earlier

as

sequences

of

characters

enclosed

41

in single quote marks (chapter I). S t r i n g s c o n s i s t i n g of e single character are the c o n s t a n t s of t h e s t a n d a r d t y p e c h a r (chapter 2); those of n characters (n>l), are d e f i n e d as c o n s t a n t s of t h e t y p e d e f i n e d by: oacked

~r~[

Assignment types.

1..n] ~

(:=)

The

char

is p o s s i b l e

relational

between

operators

operands =,

<>,

of ~ d e n t i c a l <,

<=,

array

and >= a r e

applicable on operends of identical packed character arrays, where the underlying character set determines the ordering,

Access to individual components of packed arrays is o f t e n c o s t l y , a n d t h e p r o g r a m m e r is a d v i s e d to p a c k or u n p a c k e p e e k e d array in e single operation. This is possible through the standard

variable

procedures

~j~rey[m..n] and

where

end

unpack.

Letting

A be an a r r a y

o~ T

Z be a v a r i a b l e Eacked

pack

of t y p e

~rrav[u..v]

(n-m)

>=

of t y p e

~L T

(v-u), t h e n

p a e k (A ,i .Z )

means

~

j := u ~ v do Z[J] := A[j--u+i]

u n p a c k ( Z ,A ,i )

means

~or

j := u ~ £ v ~ A[j--u+i] := Z[j]

and

where J denotes the p r o g r a m .

an a u x i l i a r y

variable

not

occurring

elsewhere

in

7 RE&gAE TYPES

The record types are perhaps the most flexible of data constructs. Conceptually, a record type is a template for a structure whose parts may h a v e q u i t e distinct characteristics. For example, assume one wishes to record information about a person. Known are the name, the social security number, sex. date of birth, number of dependents, and marital status. Furthermore, if the person is married or widowed, the date of the (last) marriaqe is given; if divorced, one knows the date of the (most recent) divorce and whether this is the first divorce or not; and if single, given is whether an independent residency is established. All of this information can be expressed in a single "record". More formally, a record is a s t r u c t u r e consisting of a f i x e d number of components, celled field&. Unlike the array. components are not constrained to be of identical type and c a n n o t be d i r e c t l y indexed. A type definition specifies for e a c h component its t y p e a n d an i d e n t i f i e r , t h e 2ijild i # @ n t i f i e r , to denote it. The scope of a field identifier is t h e s m a l l e s t record in which it is defined. In o r d e r t h a t t h e t y p e o f a selected component be evident from the program text (without executing the program), the record selector consists of constant field identifiers rather than a computable value. To take a simple example, assume one wishes to compute with complex numbers of the form a+bi. where a and b are real numbers and i is the square root of - 1 . T h e m e i s no s t a n d a r d type "complex", However, the programmer can easily define a record type to represent complex numbers. This record would need two fields, both of type real. for the real and imaginary parts. The syntax necessary to express this is: ::= ~ e c o r ~ ~nd ::= < f i x e d p a r t > I ; ::= < r e c o r d s e c t i o n > {; < r e c o r d s e c t i o n > } ::= < f i e l d i d e n t i f i e r > { , } I <empty>

Applying these declaration: ~Yoe vat

complex x

rules,

one

= ~ecord ~nd ; : complex;

can

state

re.im

the

following

definition

l

:

and

: real

where complex is e t y p e i d e n t i f i e r ° r e a n d im fields, a n d x is a v a r i a b l e of t y p e c o m p l e x . a r e c o r d m o d e up o f t w o c o m p o n e n t s or f i e l d s .

are identifiers Consequently. x

of is

43

Likewise, date

a toy toy

or

a variable :

representing

a date

can

be

defined

as:

~ecord

mo:(jan.feb,mar,apr,may,june. july ,aug ,sept ,oct ,nov ,dec ) ; day: 1..31; year : integer

as : = ~ecord

kind

: (ball ,too ,boat ,doll ,blocks game , m o d e l , b o o k ) ; cost : real; received: date; enjoyed: (alot,some,alittle.none); b r o k e n ,lost : B o o l e a n

a homework

assignment

assignment

= ~eeord

as: subject

:(history.language,lit, math,psych.science); assigned: date; grade: 0..4: weight: 1..10

end

To reference a record component, the followed by a point, and the respective example, the following assigns 5+3i to x: x,re :: 5; x.im :: 3

name o f t h e r e c o r d field identifier.

is For

If the record is itself nested within another structure, the naming of the record variable reflects this structure. For example, assume one wishes to record the most recent small pox vaccination for each member in t h e f a m i l y . A p o s s i b i l i t y is t o define the m e m b e r s as a s c a l a r , a n d t h e n t h e d a t e s in an a r r a y of r e c o r d s : family= ~L~d~ v a c c i n e :

An u p d a t e

(father,mother.childl,child2,child3); g~[family] £L date:

might

then

vaccine[child3] vaccine[child3] vaccine[child3] Note:

the

type

be recorded

as:

.mo := a p r ; .day := 23; .year := 1 9 7 3

"date"

also

i~icludes,

for

instance,

e

31st

April.

44

{

program operations

&re,ram

7.1 on

complex

complex

numbers

(output);

const fac = 4; tree complex = record re.im vat x,y : complex; n : integer ; be~in x.re := 2; x .Ira := 7; y.re := 6 ; y.im : = 3; for n := 1 &o 4 d~

writeln("

{x

x

=

:

integer

",x.re:3,x.im:3,

end;

'

y

=

".y.re:3,y.im:3);

+ y}

writeln(

{x

}

° sum

=

",x.re x .im

+ y.re:3, + y.im:3);

* y}

writeln

(" p r o d u c t

writ eln : x .re := x . r e

=

+ fac;

",x . r e * y . r e x.re*y.im x .im

:= x . i m

- x . i m * y .ira:3, + x.im*y.re:3); - fac;

end end.

x = 2 sum = product

7

y

=

6

3

x = 6 3 y = sum = 12 6 product = 27 36

6

3

x = 10 - 1 sum = 16 product =

6

3

6

3

b =

10 -9

48

y 2 63

= 24

x = 14 - 5 y = sum = 20 -2 product = 9 9 12

The syntax for a record type also makes provisions for ~art. implying that a record type may be specified as of several ~#j~_~. This means that different although said to be of the same type. may assume which differ in a certain manner. The differences may a different number and different types of components. Each variant declarations by one or case clause

a ~arisnt consisting variables. structures consist of

is charaeterised by a list. in parentheses, of of its pertinent comoonents. E a c h l i s t is l a b e l l e d more labels~, and the set of lists is p r e c e d e d by a specifying the data type of these labels (i.e. the

45

type according exsmple, assume type Then

to w h i c h t h e the existence

maritalstatus

one

t~De

can

=

describe

variants of a

(married.

persons

by

are

widowed, data

discriminated).

divorced,

As

an

sinRle)

of t h e

person = ~ecord #ase maritalstatus ~ married: ( < f i e l d s of m a r r i e d p e r s o n s o n l y > ) ; single: ( < f i e l d s of s i n g l e p e r s o n s o n l y > ) ;

;

end

Usually, currently r e c o r d is ms

e component ( f i e l d ) of t h e r e c o r d valid variant. For example, the l i k e l y to c o n t a i n a c o m m o n f i e l d

i t s e l f i n d i c a t e s its above defined person

: maritelstetus

This frequent situation can be abbreviated by declaration of the discriminating component--the f i e l d - - i n t h e c a s e c l a u s e i t s e l f , i.e. by w r i t i n g case

ms:

The s y n t a x
maritalstatus

defining

the

including so-called

aL

variant

part

is:

: := c a s e < t a g f i e l d > < t y p e i d e n t i f i e r > { ; } ::= : ( < f i e l d l i s t > ) I <emp t y > : : = { , } <ease l a b e l > : : = ::= : I <empty>

It is before

Dart>

helpful defining

the ~a~

to "outline" it as a v a r i a n t

the information about record structure.

I , Person A. n a m e (last, f i r s t ) B, s o c i a l s e c u r i t y n u m b e r ( i n t e g e r ) C . s e x (male, f e m a l e ) D. d a t e of b i r t h ( m o n t h , day. y e a r ) E . n u m b e r of d e p e n d e n t s (integer) F. marital status if m a r r i e d , w i d o w e d a. d a t e of m a r r i a g e (month. day, year) if d i v o r c e d a. d a t e of d i v o r c e (month. day, year) b. f i r s t d i v o r c e (false, true) if s i n g I e a. i n d e p e n d e n t r e s i d e n c y (false,true)

DJE

a person.

46

Figure 7.a with different

is

a corresponding attributes.

woodyard

picture

of

two

"sample"

people

A)

edward i i lliH

845680539

B) iiiiii i

c)

male

~ug i

30

i1941

1

t

D)

E)

H,,,,, ,,

single

F)

true Figure

A record ~ype

defining

7.a

Two

"person"

sample

can

people

now

be

formulated

as:

alfa = ~acked ~r_~,~[1..lO] ~L char; status = (married,widowed,divorced,single); date = ~eeord mo : ( J a n . f e b . m a r , a p r , m a y , j u n , july.aug.sept.oet,nov,dec); day : I..31;

person

year : integer and; = ~eeord name : E e c o ~ end;

first.last:

alfa

ss : i n t e g e r ; sex : (male ,female): birth : date; depdts : integer; c a s e ms : s t a t u s ~ married~widowed : (mdete: date); divorced : (ddate: date; firstd: Boolean); single : (indepdt : Boolean)

end;

{person}

I. A l l field names must be distlnct--even if t h e y o c c u r in different variants. 2. If t h e f i e l d l i s t f o r a l a b e l L is e m p t y , t h e f o r m is: L : () 3. A field list can have only one variant p e r t a n d it m u s t succeed the fixed pert(s). (However, a variant part may itself contain variants. Hence, it is p o s s i b l e to h a v e n e s t e d variants.)

47

Referencing a record component is e s s e n t i a l l y a simple linear reconstruction of the outline. As e x a m p l e , assume a variable p of t y p e p e r s o n a n d " c r e a t e " t h e f i r s t of t h e m o d e l p e o p l e . p.name.last := " w o o d y a r d p.name.first := " e d w a r d p.ss := 8 4 5 6 8 0 5 3 9 ; p .sex := m a l e : p.birth.mo := a u g : p.birth.day := 30; p .birth.year := 1941; p.depdts : = 1; p .ms := s i n g l e ;

p.indepdt

A.

The

with

:= t r u e

statement

The above notation can be a b i t t e d i o u s , and the user may wish to abbreviate it using the ~ith ~tatement. The with clause effectively opens the scope containing the field identifiers of the specified record variable, so that the field identifiers may o c c u r as v a r i a b l e identifiers. (Thereby providing an o p p o r t u n i t y for the compiler to optimize the qualified statement.) The general f o r m is: ~tik__h < r e c o r d

variable>

{.

variable>}

do

<statement>

Within the component statement of t h e w i t h s t a t e m e n t one denotes a field of a record variable by d e s i g n a t i n g o n l y its f i e l d identifier (without preceding it w i t h t h e n o t a t i o n of t h e e n t i r e record variable). The with statement of a s s i g n m e n t s :

below

is

~ith p.name.birth ~L~ ~J~l l a s t := " w o o d y a r d "; first := " e d w a r d "; ss := 8 4 5 6 8 0 5 3 9 ; s e x := m a l e ; mo := a u g ; d a y := 30; year := 1941; depdts := I; ms := s i n g l e ; indepdt := t r u e end {with}

equivalent

to

the

preceding

series

48

Likewise. vat

currentdate . .

: date:

°

_with c u r r e n t d a t e do i ~ too=dec t h e n b e u i n mo := j a n :

en~ ~ I s e mo is

equivalent var

:= y e a r + 1

:= s u c c ( m o )

to

currentdate if

year

: date;

currentdate.mo=dec then b ~ eurrentdate.mo := j a n : currentdate.year := e u r r e n t d a t e . y e a r + 1

grid else And the earlier :

currentdate.mo following

:= s u c c ( c u r r e n t d a t e . m o )

accomplishes

with vaccine[child3] b e Q i ~ mo := a p t :

d~ day

:= 23:

the

year

vaccine

:=

update

exampled

1973

grid

No assignments elements of the w__ith r

do

may be m a d e by t h e record variable list.

contain

any

variables

with a[i] dn b_eoin ... i := i+1 end is _nw~ a l l o w e d . form : w_ith r I, r 2 . . . . . . is

to

any

S;

for

S

r must not example :

The

qualified statement That is, given:

equivalent

r n do

to

k i t h r I do ~ith r 2 w~h

rn

do

S

S

subject

to

change

by

49

Whereas :

is

yam

a a

NOT

allowed,

y_~

a b

IS

: array[2..8] : 2..8; for

the

of

integer:

definition

: integer ; : ~eoo~_~[ a: r e a l ; end ;

allowed,

for

distinguishable

the

from

variable b is Furthermore, within

a is

ambiguous,

b : Boolean

notation

the

of

real

for

the

"'b.a '°.

integer

Likewise.

distinguishable from the qualified statement

the S of

wi~h b ~ g S both

"b"

and

"b.b"

reference

the

Boolean

"b.b",

a

is

the

Boolean

easily

record "b.b*

8

--THE ~ET T~PE~

A s e t t y p e d e f i n e s t h e s e t of v a l u e s t h a t is t h e p o w e r s e t of its base type, i.e. the s e t of a l l s u b s e t s of values of t h e b a s e type. including t h e e m p t y s e t . T h e b a s e t y p e m u s t be s s c a l a r or subrange type. ~y_#_~ 

The base type implementations sets, which can

= &~,JL ~L~ ;

of a set must be e scalar of Pascal may define limits be quite small (e.g. the number

type; however, for the size of bits in a word).

Sets are built up from their elements by s e t c o n s t r u c t o r s (denoted by <set> in the syntax). They consist of the enumeration of the set elements, i . e . of e x p r e s s i o n s of t h e b a s e type. which are separated by c o m m a s a n d e n c l o s e d by s e t b r a c k e t s [ and ] . Accordingly. [] d e n o t e s the empty set, <set> ::= [ < e l e m e n t llst> ] <element l i s t > ::= < e l e m e n t > {, < e l e m e n t > } <element> ::= < e x p r e s s i o n > I <expression>

I <empty> .. < e x p r e s s i o n >

The form m..n denotes the set of all elements such that m<=i<=n. If m > n , [ m . . n ] d e n o t e s t h e Examples

of

[13] [ i + j .i-J] ['A'..'Z"

set

The following structure:

Relational

±a

<>

type

:

"0"..'9"] operators

union intersection set difference of A t h a t a r e

+

=

constructors

i of the base empty set.

operators

are

applicable

on

all

(e.g. A-B denotes the set not also eIements of B . )

applicable

to

set

operands

objects

of

all

with

set

elements

are:

t e s t on ( i n ) e q u a l i t y t e s t on s e t i n c l u s i o n set membership. The first operand is a s c a l a r t y p e , the s e c o n d is of its a s s o c i a t e d set type; the result is true w h e n t h e f i r s t is an e l e m e n t of the second. otherwise false.

51

examples tyoe vat

of

primary = (red . y e l l o w .blue); color = ~ £J~ p r i m a r y ; hue1o'hue2 : color:

ch: char: chsetl,chset2: var

Set more

-and -

declarations

opeode add

~8_~ ~ Z

"a'..'z';

: &at g~ 0..7; : Boolean:

operations complicated

assignments hue1 hue2

:= [red] ; h u e 2 := []7 := h u e 2 + [ s u c c ( r e d )]

chsetl chset2 add

are relatively fast and can tests. A simpler test for:

:= := :=

[ "d'.'a'°'g'] [ "a'..°z°]-[ch] [2",3]

be u s e d

~d~ ( c h = ' a ' ) o n ( e h = ° b ' ) o ~ ( e h = ' c ' ) o ~ ( c h = ' d ° ) g ~ ( c h = ' z is : i_f£ ch i n [ "a°..'d °,'z "] &hen s

{ program example program

8.1 of set

operations

setop(output

;

<= o p c o d e

to

")

eliminate

~hen

s

}

)

type

d a y s = (m , t ,w , t h . f r , s a , s u ) ; w e e k = se__t_t o f d a y s ; varr wk,work,free : week; d : days ; ~roce~w~ check(s : week): {procedures introduced ~ d : days; ~eoin write(" "); f o ~ d := m & ~ su ~ i ~ d i ~ s & h e n w r i t e ( ° x ") ~ I s e w r i t e ( ' o ' ) ; writeln end; { c h e c k } beoin w o r k := [] ; f r e e := [] ; wk := [ m . . s u ] : d := sa; f r e e := [d] + f r e e +[su] ; check (free) ; w o r k := w k - f r e e ; check(work); /j[ f r e e <= w k t h e n w r i t e ( ' o ° ) 7 ~/[ w k >= w o r k t h e n w r i t e ( ' k ' ) ; i f D o t (work >= f r e e ) the[l w r i t e ( " j a c k °): i ~ [sa] <= w o r k ~ h e n w r i t e ( ' forget it'); writeln end.

O0000XX xxxxxoe ok j a c k

in

chapter

11}

52

On

program

development

Prooramming--in the sense of designing and formulating algorithms--is in general a complicated process requiring the mastery of numerous details and specific techniques. O n l y in exceptional c a s e s w i l l t h e r e be a s i n g l e g o o d s o l u t i o n . Usually, so many solutions e x i s t t h a t t h e c h o i c e of an o p t i m a l p r o g r a m requires a thorough analysis not only of the available algorithms and computers but also of the w a y in w h i c h t h e program will most frequently be u s e d . Consequently, the construction of an a l g o r i t h m should c o n s i s t of a sequence of deliberations, investigations, and design decisions. In the early stages, attention is b e s t c o n c e n t r a t e d on the global problems, and the first draft of a solution may pay little attention to details. As the design process progresses, the problem can be split into subproblems, and gradually more consideration given to the details of p r o b l e m smecification a n d to t h e c h a r a c t e r i s t i c s of the availmb~e tools. The terms ~ i s e ~efin_~ment [2] a n d ~ f ~ ~ m m i n ~ [4] are associated with this approach. The remainder of t h i s c h a p t e r illustrates the development of an algorithm by rewording (to be c o n s i s t e n t with Pascal notation) an example C .A.R. Hoare presents in ~ r u ~ ~rog_r_~in~ [4."Notes on O a t a S t r u c t u r i n g " ] . The assignment is to g e n e r a t e the prime numbers range 2..n, where n>=2. After a comparison algorithms, t h a t of E r a t o a t h e n e s " s i e v e is c h o s e n simplicity (no m u l t i p l i c a t i o n s or d i v i s i o n s ) . The

first I. 2. 3. 4. 5.

formulation

is

f a l l i n g in t h e of t h e v a r i o u s b e c a u s e of i t s

verbal.

Put a l l t h e n u m b e r s Select and remove sieve. Include this number Step through the number. I f t h e s i e v e is not

between 2 and the smallest

n into number

in t h e sieve,

"primes" removing

empty,

repeat

all

steps

t h e "'s~eve" remainino in

multiples

of

the

this

2--5.

Although initialization of variables is t h e f i r s t s t e m in t h e execution of a p r o g r a m , it is o f t e n t h e l a s t in t h e d e v e l o p m e n t process. Full comprehension of t h e a l g o r i t h m is a p r e r e q u i s i t e for making the proper initializations; updating of these initializations with each program modification is n e c e s s a r y t o keep the program runnino. (Unfortunately. updating is not a l w a y s sufficient!) Hoare chooses a set t y p e w i t h e l e m e n t s 2 , . n to r e p r e s e n t both the sieve and the primes. The following is a s l i g h t v a r i a t i o n of the program s k e t c h he p r e s e n t s .

53

const n = 10000: va~ sieve,primes : set next ,j : integer ;

of

2..n;

{ initialize} sieve := [2..n] : p r i m e s := [] : n e x t ~eoeat {find next prime} while Dot(next in sieve) do n e x t primes := p r i m e s + [next] : j := n e x t ; ~hile j < = n d~l { e l i m i n a t e } beoin sieve := s i e v e - [j] ; j

:= 2; := s u c c ( n e x t ) :

:=

J + next

and until

sieve=[]

As an exercise Hoare makes the assignment to rewrite program, so that the sets only represent the odd numbers. following is one proposal. Note the close correlation with first solution.

cons~

n = 5000; sieve,primes next ,j,c :

the The the

{n' = n di~A 2} : set o~ 2..n; integer ;

DJ&W~LJ3 { i n i t i a l i z e } sieve := [2..n] : p r i m e s := [] ; n e x t r_em~at { f i n d n e x t p r i m e } ~ h i l e DD_t(next in s i e v e ) d o n e x t primes := p r i m e s + [next] ; c := 2 * n e x t - I; {c = n e w p r i m e } j := n e x t ; mile j < = n ~LQ { e l i m i n a t e } ~eoin sieve := s i e v e - [J] ; j

:= 2: := s u c c ( n e x t ) ;

:=

j+c

and unt$~

sieve=[]

end .

It is desirable that all basic set operations ere relatively fast. Many implementations restrict the maximum s i z e of s e t s according to their " w o r d l e n g t h ~. so t h a t e a c h e l e m e n t of t h e b a s e s e t is r e p r e s e n t e d b y o n e b i t (0 m e a n i n g absence, I meaning presence). Most implementations would therefore not accept a set with 1Oo000 elements. These considerations lead to an adjustment in t h e d a t a r e p r e s e n t a t i o n , as s h o w n in p r o g r a m U.2. A large set can be represented as an array of smaller sets such that each "fits" into one word (implementation dependent). The following program uses the second sketch as an abstract model of the algorithm. The sieve and the primes are redefined as arrays

of sets: next undeveloped.

is

defined

as

a

record.

The

output

is

left

54

{ program 8.2 generate the primes between 3..I0000 using a sieve containing o d d i n t e g e r s in t h i s r a n g e . } oroQram

primes (output);

const

wdlenoth = 59; { i m p l e m e n t a t i o n dependent} m a x b i t = 58; w = 634; {w : n div w d l e n g t h div 2} sieve ,primes : ~rrav[O..w] ~J~ : e t ~ f O . . m a x b i t next : rec0rd word,bit :inteoer end ; j , k . t ,e : i n t e g e r ; empty : boolean; beoin {initialize} ZDJZ t

:-- 0 ~j~ w

beein sieve[O] next.bit

;

do

sieve[t] := [ O . o m a x b i t ] ; p r i m e s [ t ] := := s i e v e [ O ] - [0] ; n e x t . w o r d := O; := I; e m p t y := f a l s e ;

[] e n d ;

~ h next do r_~3J~ { find next prime } ~h~ n o t ( b i t in s i e v e [ w o r d ] ) d o bit := s u c e ( b i t ) ; primes[word] := p r i m e s [ w o r d ] + [bit] ; c := 2 * b i t + 1; j := b i t ; k := w o r d ; ~b.ile k < = w do { e l i m i n a t e } b~liil s i e v e [ k ] := s i e v e [ k ] - [j] ; k := k + w o r d * 2 ; j := j + c: while j>maxbit do begin k := k + 1 ; j := J - w d l e n g t h end end ; if sieve[word]=[] then be__qin e m p t y := t r u e ; bit := 0 --end ; while empty ~ (word<w) d~ _b-e.q~ w o r d := w o r d + l ; e m p t y := s i e v e [ w o r d ] = [ ]

and up~il and.

empty ; {ends

with}

9

EZLE T~_EE~

In m a n y w a y s t h e s i m p l e s t s t r u c t u r i n g m e t h o d is t h e s e q u e n c e , In the data processing profession t h e g e n e r a l l y a c c e p t e d t e r m to d e s c r i b e a s e q u e n c e is a ~_~ouential ~ i l e . P a s c a l u s e s s i m p l y t h e word LJ,~J: to specify a structure consisting of a s e q u e n c e of components--all of w h i c h a r e of t h e s a m e t y p e . A natural ordering of the components is d e f i n e d t h r o u g h t h e sequence, and at any i n s t a n c e o n l y one c o m p o n e n t is d i r e c t l y accessible. The other components are a c c e s s i b l e by p r o g r e s s i n g sequentially t h r o u g h t h e f i l e . T h e n u m b e r of c o m p o n e n t s , called the I en at~h of the file, is not fixed by the file type definition, T h i s is a c h a r a c t e r i s t i c which clearly distinguishes t h e f i l e f r o m t h e a r r a y . A f i l e w i t h no c o m p o n e n t s is s a i d to be _em_~. type

: ~i1~

~

;

The declaration of every file variable f automatically introduces a buffer wariablg, d e n o t e d by ft, of t h e c o m p o n e n t type. It can be considered as a ~ i n d o w t h r o u g h w h i c h o n e c a n either inspect (read) e x i s t i n g c o m p o n e n t s or a p p e n d (write) n e w components, and which is automatically m o v e d by c e r t a i n f i l e operators. The s e q u e n t i a l p r o c e s s i n g a n d t h e e x i s t e n c e of a b u f f e r v a r i a b l e suggest t h a t f i l e s may be a s s o c i a t e d with ~condarv ~to=a~e and ~erioheral~. Exactly how the components are allocated is implementation dependent, but w e a s s u m e t h a t o n l y s o m e of t h e components are present in p r i m a r y s t o r e at a n y o n e t i m e , a n d o n l y t h e c o m p o n e n t i n d i c a t e d by ft is d i r e c t l y a c c e s s i b l e . When the standard otherwise

window ft is moved beyond the gnd gf Boolean function eof(f) returns the f a l s e . The b a s i c f i l e - h a n d l i n g operators

a ~ile value are:

f, t h e true,

r e s e t (f)

resets the f i l e w i n d o w to t h e b e g i n n i n g for t h e purpose of r e a d i n G , i.e. a s s i g n s to ft t h e v a l u e of the f i r s t e l e m e n t of f. e o f ( f ) b e c o m e s f a l s e if f is not e m p t y ; o t h e r w i s e , ft is not d e f i n e d , and eof(f) remains true.

r e w r i t e (f)

precedes the rewriting of t h e f i l e f. The c u r r e n t value of f is replaced with the empty file. eof(f) becomes true, and a new file may be written.

get(f)

advances the file window to the next component; i.e. assigns t h e v a l u e of t h i s c o m p o n e n t to t h e buffer variable ft . If no n e x t c o m p o n e n t e x i s t s , then eof(f) becomes true. and the resulting value of ft is not d e f i n e d . T h e e f f e c t of g e t ( f ) is

56

defined only execution.

put(f)

examples

if

eof(f)

is

false

prior

to

its

appends the value of the buffer variable ft t o the file f. T h e e f f e c t is d e f i n e d o n l y if p r i o r to execution the predicate eof(f) is t r u e . e o f ( f ) r e m a i n s t r u e , a n d ft b e c o m e s u n d e f i n e d ,

of

declarations

-and-

statements

with

yam d a t a : ~J~J~ ~L~ i n t e g e r ; a : integer;

a := s q r ( d a t a ~ get (data)

~ar

clubt := p : put (club)

club : ~ile p : person;

Program

parts

I. R e e d

a

file

~L~ p e r s o n ;

files );

: of r e a l

numbers

and

compute

their

sum

S.

{ program part c o m p u t e sum } S := O; reset(f); w_j31]e n o t e o f (f) d ~ be~ S := S + ft ; g e t ( f )

en_g 2,

The following program fragment operates ordered sequences of i n t e g e r s fl,f2 ..... fm end g l g2 . . . . . gn s u c h t h a t f ( i + 1 ) >= fi and g(i+1) a n d _merges t h e m i n t o o n e o r d e r e d file h(k+1) >= h ( k ) f o r k = 1,2 . . . . . It

uses the following variables: endfg : Boolean; fog,h : ~ile ~ integer

on

two

>= gi, f o r a l l h such that (re+n-l).

files

i.j

of

57

{ program part merge f and g into

h }

bea~3 reset(f): reset(g); rewrite(h); endfg := s o l ( f ) or eof(g); ~hile D o t e n d f g do bw~in if ft
not ht (g)

eof(g) do := g t ; p u t ( h ) ;

not eof(f) do h t := f t ; p u t ( h ) ; (f)

_end

--end

Files may be local to a p r o g r a m (or l o c a l t o a p r o c e d u r e ) or they may already exist outside the program. The latter are called e~ternal ~les. External files e r e p a s s e d as p a r a m e t e r s in the program heading (see chapter 13) i n t o the program.

A.

Textfiles

Files whose Accordingly, type

text

components the standard =

~ile

D~

are type

characters are t e x t is d e f i n e d

called ~extfil~. as f o l l o w s :

char;

Texts are usually subdivided into l ~ . A straight-forward method of indicating the separation of two consecutive lines is by using control characters. For instance, in the ASCII character set the two characters ~ (carriage return) and l~ (line feed) are used to mark the end of a line. H o w e v e r . many computer installations use a character set devoid of such control characters; this implies that other methods for indicating t h e end o f a l i n e m u s t be e m p l o y e d . We may consider the t y p e t e x t as b e i n g d e f i n e d over the base type char (containing printable characters only) extended by a (hypothetical) line separator character. This control character cannot be assigned to v a r i a b l e s of t y p e c h a r , b u t c a n be b o t h

58

recognized operators:

and

generated

of

the

textfile

skip to the beginning textfile x (xt b e c o m e s next line)

eoln(x)

a Boolean function indicating whether t h e e n d of the current line in the textfile x has b e e n reached. (If t r u e . xt c o r r e s p o n d s to the position of a line separator, b u t xt is a b l a n k . )

write

line

special

readln(x)

abbreviated

current

following

terminate

a textfile notation

the

the

writeln(x)

If f is abbreviated operators°

of the the first

textfile next line character

x of of

the the

a n d ch a c h a r a c t e r variable, the following may be u s e d in p l a c e of t h e g e n e r a l f i l e

form

(f .eh )

read (f .ch )

The following program demonstrate some typical I.

by

expanded

form

ft

: = ch;

ch

:= ft : g e t (f)

schemata operations

p u t (f)

use the above conventions performed on t e x t f i l e s .

to

Writing a text y. Assume that P(c) computes a (next) character and assigns it t o p a r a m e t e r c. I f t h e c u r r e n t l i n e is to be terminated, a Boolean variable p is s e t to t r u e ; a n d if t h e t e x t is t o be t e r m i n a t e d , q is s e t t o t r u e .

rewrite(y); ~eeeat £eo eat P(c); wntil p ; wr it e l n (y) until q

2.

w r i t e (y~.'h)

Reading a text x. Assume that Q(c) denotes the processing of a (next) character c. R denotes an action to be executed upon encountering the end of a line. r e s e t (x) ; while opt eof(x) do be~in wb_~ ~ eoln(x) do b#_q/J3 r e a d (x .'c ) ; Q ( c ) end ; R ; readln(x) ~nd

59

3.

Copying a text s t r u c t u r e of x.

x

to

a text

y, w h i l e

preserving

the

line

reset(x); rewrite(y); w h i l e #3&J2 e o f ( x ) d o b e q i n {copy a line} while not eoln(x) do beain read(x,c); write(y,c) end ; readln(x); writeln(y) end

B . The s t a n d a r d

files

"input"

and

"output"

The textfiles "input" and "output" usually represent the standard I/O media of a computer installation (such as the card reader and the line printer). Hence, they are the principal communication line between the computer and its human user.

Because these two files are used very frequently, considered as "default v a l u e s " in t e x t f i l e o p e r a t i o n s t e x t f i l e f is not e x p l i c i t e l y i n d i c a t e d . That is is e q u i v a l e n t

write

read(ch)

read (input,ch)

writeln

writeln (output)

r eadln

r e a d l n (input)

eof

e a r (input)

eoln

eoln (input)

Note: The s t a n d a r d f u n c t i o n s to the f i l e i n p u t (output).

~se&

Accordingly, for the case where x the first two of the program follows: (assume ~ar sh: char) a text

on

file

are the

to

write(ch)

Writing

they when

"output";

~leo eat ~ e o e a ~ P (ch); w r i t e ( c h ) unti~, p ; writeln uDtil q

(output

,eh)

(rewrite)

must

not

is "input" schemata

and y can be

be a p p l i e d

is "output" expressed as

60

Reading

a text

from

file

"input":

wh~ no#- e o f d o b~,q~& { p r o c e s s a line} w h i l £ no#_ e o l n d o beain read(ch); Q (oh) ond ; R ; readln end

Further extensions convenient handling described in c h a p t e r

of the procedures of legible input 12.

write and

and read (for output date)

the are

The next two examples of programs show the use of the textfiles input and output, (Consider what changes would be necessary if only get and put° not read and write° are to be used.)

{ program orosr~d~

9.1 fcount

--

frequency

count

of

letters

in

input

file

}

(input .output);

~ar

ch: char ; count : ~r_~av[ "a'..'z "] of integer: letter : set of "a'..°z'; beRin letter : = [ °a ° . . ' z ' ] ; f o e c h : = "a ° J&£ "z ° d o c o u n t [ c h ] := wh~ not eof do D#qia ~hile no& eoln do b~g~3 read (ch) : write(ch) ; if ch &n letter J2h£.J3 c o u n t [ c h ] end ; wrtteln; readln

O;

:=

count[ch]+l

e~ end.

In some installations when a textfile is s e n t t o a o r i n t e r , the first character of each line is used as a p r i n t e r control character; i.e. this first character is n o t o r i n t e o , but i n s t e a d interpreted as controlling the paper feed mechanism of t h e printer. The following conventions a r e in w i d e u s e : blank

: :

r I "

:

:

feed one llne space before printing feed double space before printing s k i p to t o p of n e x t p a g e b e f o r e p r i n t i n g no l i n e f e e d ( o v e r p r i n t )

61

The following llne, resulting

{ program oroeram

9.2 insert

program inserts a b l a n k at t h e b e g i n n i n # in n o r m a l s i n g l e space printing.

--insert

leading

blank

of

each

}

(input ,output):

vat ch: char; begin @D~i~J& D o t e o f d o beoin write(" '); ~ h ~ l e n o t e o l n do be~in r e a d (ch) : w r i t e ( c h ) eoj~ ; wrlteln ; readln end .

If read and parameter, the and output are the Paramenter

write are used without indication of a file default convention specifies that the files input assumed: in w h i c h e a s e , t h e y m u s ~ be m e n t i o n e d in l i s t of t h e p r o g r a m heading.

10 POINTER

~YPES

A ~ _ ~ ~#riable (staticly allocated) is o n e t h a t is d e c l a r e d in a program and subsequently d e n o t e d by its i d e n t i f i e r . It is called static, for it e x i s t s ( i . e . m e m o r y is a l l o c a t e d for it) during the entire execution of the block to which it is l o c a l . A variable may, on the other hand, be generated dynamically (without any correlation to the static structure of the program) by the procedure new. Such a variable is c o n s e q u e n t l y called a

Dynamic variables do not occur in an explicit variable declaration and cannot be r e f e r e n c e d directly by i d e n t i f i e r s . Instead, generation of a dynamic variable introduces a ~oi~_E~ Value (which is nothing other than the storage address of t h e newly allocated variable). Hence, a pointer t y m e P c o n s i s t s of an unbounded s e t of v a l u e s p o i n t i n g to elements of a given type T. P is t h e n s a i d to be b o u n d to T . T h e v a l u e n i l is a l w a y s an element o f P a n d p o i n t s t o no e l e m e n t at a l l , tvo~

If p is reference

a to

a

= ~

pointer variable

identifier>;

variable of type

bound T, and

to a type pf denotes

T. then p is that variable.

a

Pointers are a simple tool for the construction of complicated and flexible data structures. If the type T is a record structure that contains one or more fields of type fT. then structures equivalent to arbitrary f i n i t e g r a p h s m a y be b u i l t . where the T's represent the nodes, and the pointers are the edges. As an example, consider the construction of a "data bank""for a given ~roup of people. Assume the persons are represented by records as defined in chapter 7. One may then form a chain or linked list of such records by adding a field of a pointer type as shown below.

~

link person

= fperson; = ~ecord . . ,

next

: link:

~nd:

A

linked

list

of

n persons

can

be

represented

as

in

figure

10.a.

63

first Figure

A variable element of

10.a

Linked

list

of type llnk. the list. The link

called of t h e

If we assume that the file numbers, then the following construct the above chain. Va~, f i r s t ,

first for_

p : link;

i:

"first" points te t h e last person is n i l .

"input" cede

contains n social could have been

first

security used to

integer;

:= n i l : i

:=

I ~o

n do

be~zln mead(s); new(p); Pt .next := f i r s t ; pt .ss := s ; first := p a~

For purposes of access, ene intreduces anether variable, say pt. of type l i n k a n d a l l o w s it t o m o v e f r e e l y t h r o u g h t h e l i s t . To demenstrate selection, assume there is a person with social security number equal to n and access this person. Th e s t r a t e g y is te advance pt via link until the desired member is located: pt := ~j~le

first; ptt.ss

<>

n do

pt

:= p t t . n e x t

In words this s a y s , "'Let pt p o i n t t o t h e f i r s t e l e m e n t . While the social security number of the member pointed to (referenced) by pt d o e s n e t e q u a l n, a d v a n c e pt to t h e v a r i a b l e indicated by the link (also a pointer variable) of t h e r e c o r d w h i c h pt currently references." N o t e in p a s s i n g that f i r s t t .nextt .next accesses the third person. Note that this simple search statement w o r k s o n l y , if o n e is s u r e t h a t t h e r e is at l e a s t o n e p e r s o n w i t h s e c u r i t y n u m b e r n on the list. But is this realistic? A check against failing te recognize t h e e n d of t h e l l s t is t h e r e f o r e mandatory. One might first try the following solution: pt := f i r s t ; wh~l& (pt <> But recall referenced

nil)

section in the

and

(ptt .ss

4.A. second

<>

If pt factor

n) d O

pt

:= p t t .next

= nil, the variable ptt , of t h e t e r m i n a t i o n condition,

64

does which

(1)

not ~ x i s t at a l l . T h e f o l l o w i n g treat this situation correctly: pt

:= f i r s t ;

~hile ~[ (2)

b

two

possible

solutions

:= t r u e ;

(pt <> nil) P t t .ss

are

and

b ~

= n &h#~3 b

pt := f i r s t ; ~ h i l e pt <> ~ i l d ~ b ~ if p t f .ss = n ~ pt := pt~ .next

:= f a l s e

goto

else

pt

:= p t t .next

13;

end To p o s e a n o t h e r problem, say one wishes to add the sample to the bank. First a s p a c e i n memory m u s t be a l l o c a t e d , reference created by m e a n s o f t h e s t a n d a r d procedure Dew. new ( p )

allocates a new variable v and assigns the pointer reference of v to the pointer variable p. If t h e t y p e of v is e record type with variants, then new(p) allocates enough storage to accommodate all variants. The form

n e w ( p ot 1 . . . . .

Warning: of new, program allowed;

person and a

tn)

can be used to a l l o c a t e a variable of the appropriate size for the variant with tag field values equal to the constants t 1...tn. The tag field values must be listed contiguously a n d in t h e order of their declaration. Any trailing t a g f i e l d s m a y be o m i t t e d . T h i s ~W~,~& ~ o t imply assignment to the tag fields.

if a record variable pt is c r e a t e d by t h e s e c o n d f o r m then this variable m u s t not c h a n g e its v a r i a n t during execution. Assignment to the entire variable is n o t however one can assign to the components of pf.

I f n e w p is a v a r i a b l e o f t y p e l i n k (as d e f i n e d a b o v e ) , new(newp) creates a new variable of t y p e p e r s o n e n d a s s i g n s the reference to newp. The value of the new variable is u n d e f i n e d upon allocation. A c c e s s is v i a t h e p o i n t e r . examples: newp~ . s s

:=

newpf

paul

:=

The procedure either the form d i s p o s e (p) or"

845680539

assignes assigns

~isoose

is

the

a social the

record

"inverse"

security paul

of

i1~

number to

newpf

and

may h a v e

65

d l s p o s e ( p ,t I .... in) to which form of new was u s e d to c r e a t e the v a r i a b l e to by p. D i s p o s e t h e n " e r a s e s " t h e v a r i a b l e r e f e r e n c e d

relative pointed by p.

In t h e next step the new m e m b e r , r e f e r e n c e d by t h e p o i n t e r n e w p , must be i n s e r t e d after t h e m e m b e r r e f e r e n c e d by pt. See f i g u r e 10.b.

newp

Figure

Insertion

is a s i m p l e

newpt .next pt~

Figure

.next

10.e

::

10.b

matter

Before

of

changing

:= p i t . n e x t : newp

illustrates

the

result.

newp ~ I p n e e w #

Figure

I0.c

After

the

pointers:

66

Deletion of accomplished ptf .next

the member in t h e s i n g l e

following the instruction;

auxiliary

pointer

pt

is

:= p t t . n e x t t . n e x t

It is often p r a c t i c a l t o p r o c e s s a list u s i n g 2 p o i n t e r s - - o n e following t h e o t h e r . In t h e c a s e of d e l e t i o n , it is t h e n l i k e l y that one pointer--say p l - - p r e c e d e s t h e m e m b e r to be d e l e t e o , a n d p2 p o i n t s t o t h a t m e m b e r . D e l e t i o n can t h e n be e x p r e s s e d in t h e single instruction:

p It ,next

: = p2t . n e x t

One is, h o w e v e r , w a r n e d t h a t d e l e t i o n s in t h i s most installations, r e s u l t in t h e l o s s of u s a b l e p o s s i b l e r e m e d y is t h e f o l l o w i n g :

manner (free)

w i l l . in store. A

p l t . n e x t := p 2 t . n e x t ; d i s p o s e (p 2 ) This provides the implementor with the opportunity to m a r k t h e store defining the variable referenced by p 2 as f r e e . What a c t u a l l y is done may g r e a t l y v a r y a m o n g i n s t a l l a t i o n s ; elaborate "garbage collections" may be implemented, or the dispose

instruction

may s i m p l y

be i g n o r e d .

Li_~ ~liQcation is the most efficient representation for inserting and deleting e l e m e n t s . A r r a y s r e q u i r e s h i f t i n g down (up) of e v e r y e l e m e n t b e l o w t h e i n d e x in t h e c a s e of i n s e r t i o n (deletion). and files require complete rewriting. For an example involving a tree l i s t . r e f e r to c h a p t e r 11 ( p r o g r a m

A word

to

the

structure 11.5).

instead

of

a linear

wise

Pascal provides a w i d e v a r i e t y of date s t r u c t u r e s . It is left to the programmer to e v a l u a t e his p r o b l e m in d e t a i l s u f f i c i e n t to d e t e r m i n e t h e s t r u c t u r e b e s t s u i t e d to e x p r e s s t h e s i t u a t i o n and to evaluate the algorithm. As indicated by t h e "data b a n k " example, linked allocation is e s p e c i a l l y n i c e for i n s e r t i o n a n d d e l e t i o n . If. h o w e v e r , t h e s e o p e r a t i o n s happen infrequently, but instead efficient a c c e s s is m a n d a t o r y , t h e n t h e r e p r e s e n t a t i o n of t h e data as an a r r a y of r e c o r d s is u s u a l l y m o r e a p p r o p r i a t e .

PROCE~RUS

11 AND f U N C T I O N S

As one grows in the art of computer programming, one constructs programs in a sequence of refinement ~tes&o At each step the Programmer breaks his task into a number of subtaskso thereby defining a number of partial programs. Although it is possible to camouflage this structure, this is undesirable. The concegt of the ~K~_~edure (or &wbroutine) allows the display o£ the subtasks as explicit subprograms.

A,

Procedures

The &roce~ur~ ~eclaration serves to d e f i n e a p r o g r a m p a r t a n d to associate it w i t h an i d e n t i f i e r , s o t h a t it c a n b e a c t i v a t e d by a orocedu~A statement. The declaration h a s t h e s a m e f o r m as a program, e x c e p t it is i n t r o d u c e d by a & r _ ~ z e d u r e b e a d i D _ q i n s t e a d of a p r o g r a m heading. Recall the program part that found the minimum and maximum values in a list of integers. As an extension, say that increments of j 1 . . j n a r e a d d e d t o a[ I] . . . a [ n ] , t h e n m i n a n d m a x are again computed. The resulting program, which employs a Procedure to d e t e r m i n e min and max, follows.

68

{ program 11.1 extend program program

6.1

}

minmax2(input.output);

c o n s t n = 20; vat a : ~rrav[1..n] ~j[ i n t e g e r ; i,j : i n t e g e r ; ~rocedure minmax; va~ i :1..n; u,v,min.max :integer; ~e~in rain := a[ I] : m a x := rain; i := 2; wh i ] . ~ iv ~h~n b~aio i f u > m a x th~d3 m a x := u ; i f v < m i n t h e n rain := v e~l~ @ l s e h ~ ~ v > m a X t/3F,/1 m a x := v; ~C[ u < m i n tll@Xl rain := u and : i :: i + 2 and ; i f i=n theo i f a [ n ] >max t h e n m a x := a [ n ] e l s e i f a[n]
{read

for

i

array}

:=

I ~a

n

do

be~i__~n r e a d ( a [ i ] )" w r i t e ( a [ i ] :3) e~u3 : writeln ; minmax , f o r i := I t o n do bw_~in r e a d ( j ) ; a [ i ] := a[i] +j" write(a[i] eo_~ : wrlteln ; minmax an~.

-1

-3

44 40

Although I.

4 -6

7

7 15 -7

8 54 23 - 5 79

3

9

9

9 -6

45

79

79

3

1

1

9 88

7 43

12

17 - 7

48

59

39

9

?

7 12

simple, The

:3)

15 - 4 88

this

simplest ~rocedure

program

form

illustrates

of the

PROCEDURE

:

many

points:

HEADING,

namely:

5

69

2.

LOCAL VARIABLES. Local variables i, u, v , m i n , only within the scope variables h a v e no e f f e c t o£ m i n m a x .

to procedure minmax are the a n d m a x . T h e s e m a y be r e f e r e n c e d of minmax; assignments to these on t h e p r o g r a m outside the scope

3.

GLOBAL VARIABLES. Global variables are a, i. and j: They may b e r e f e r e n c e d throughout the program. (e.g. The first assignment in m i n m a x is m i n := a [ i ] .)

4.

NAME PRECEDENCE. Note that i is the name for both a global and a local variable. These a r e not t h e s a m e variable! A procedure may reference any variable global to it, or it may choose to redefine t h e n a m e . If a variable n a m e is r e d e f i n e d , the new name/tyoe association is then valid f o r t h e s c o p e of t h e d e f i n i n g procedure, and the global variable of t h a t n a m e ( u n l e s s p a s s e d as a parameter) is no longer available within the procedure scope. Assignment to t h e l o c a l i ( e . g . i := i + 2 ) h a s no effect upon t h e g l o b a l i; a n d s i n c e i d e n o t e s the local variable', the global variable i is effectively inaccessible. It is a good programming practice to declare every identifier w h i c h is n o t r e f e r e n c e d outside the procedure. as s t r i c l y l o c a l to t h a t p r o c e d u r e . N o t o n l y is t h i s g o o d documentation, but it a l s o p r o v i d e s added security. For example, i c o u l d h a v e b e e n l e f t as a g l o b a l v a r i a b l e ; but then a later extension to the program which called procedure mlnmax within a loop controlled by i w o u l d cause incorrect computation.

5.

The PROCEDURE "minmax" in the

STATEMENT. In this example main program activates the

the statement, procedure.

Examining the last example in m o r e d e t a i l , o n e n o t e s t h a t m i n m a x is called twice. By formulating the program part as a procedure--i.e, by not explicitly writing this program part twice--the programmer conserves not o n l y his t y p i n g t i m e , b u t also space in m e m o r y . T h e s t a t i c c o d e is s t o r e d only once, and space defining local variables is activated only during the execution of the procedure.

One should not hesitate, however, from formulating an action as a procedure--even when called only once--if doing so enhances the readability. Defining development s t e p s as p r o c e d u r e s makes a more communicable and verifiable program. Often necessary with the decomposition of a problem into subroutines is the introduction of n e w v a r i a b l e s to represent the arguments and the results of t h e s u b r o u t i n e s . The purpose of such variables s h o u l d be c l e a r f r o m t h e p r o g r a m text. The following program extends m i n i m u m a n d m a x i m u m v a l u e of an

the above example to compute the a r r a y in a m o r e g e n e r a l sense.

70

{ program 11.2 extend program groqram

11.1

}

minmax3(input

,output)

const n = 20; & y o e l i s t = aF_~av[ 1..n] yam a . b : list ; i.j,minl,min2,maxl.max2

of

;

integer; :

integer;

~ocedure m i n m a x ( v _ a r g : l i s t ; liar j,k : i n t e g e r ) ; v a ~ i : 1..n; u ,v : i n t e g e r ; beain j := g[ I] ; k := j; i := 2; ~hile i v ~_hen beajd3 if u > k J~h#O k := u ; if v < j J 2 ~ J := V end e ~ ~.e_q.~.n i f v > k ..i~hen k := v ; .i.f" u < j & h e n j : = u .~ o.#. ;

i := i + 2 end : if i =n ~b#d& i f g [ n ] > k t h e n k := g[n] e ~ & e if g[n] <j J&h~,d3 J := g[n] ; end; {minmax} ~e~in {read array} f o ~ i := 1 t o n do beQi~ r e a d ( e l i ] ); write(a[i] :3) end; writeln ; minmex (a,minl.maxl); writeln(minl,mexl,maxl-minl) ; writelm; fort i := I & O n d o be.qin r e a d ( b i l l ); write(b[i] :3) end: writeln : m i n m a x (b , m i n 2 . m a x 2 ) ; writeln(min2,max2,mex2-min2) : writeln(abs (mini-rain2) ,abs (max 1 - m a x 2 ) ) • w r i t e l n ; ~a i := 1 J2a n d ~ begicL e[i] := a[i] + b [ ~ ] : w r i t e ( a [ i ] :5) e~&d; writeln ; m i n m a x (a ,rain 1 , m a x l ) ; wr it e l n (rain I ,max I .max l-rain I ) end.

-1

-3

4 -6

?

U 54

23 79

-5

45

43

3 ~ -U 2

I 34

-8 45 34

4.4 40

2 15 -7

9 U8

15 - 4 ~d8

I

3

9 8b

9

9 -6

4

34 53

3

7 43 9.5

12

45

79

79

3

1

1

5

d -I

3 -2

-4

6

6

6

7

17 - 7

4U 72

75

9

'7

7

12

71

In p r o g r a m heading:

11.2

procedure

The formal parameter declaration procedure.

one

encounters

(

the

second

form

of

the

procedure

{ ; }

parameter section lists followed by its type. part, which introduces

);

the n a m e of each f o r m a l It is followed by the the o b j e c t s l o c a l to t h e

The labels in the label definition part and all identifiers introduced in t h e f o r m a l p a r a m e t e r part, the constant definition part, the type definition part, the variable, procedure, or function declaration parts are i~cal to the procedure declaration w h i c h is c a l l e d t h e ~ c o o e of t h e s e o b j e c t s . T h e y a r e not known o u t s i d e t h e i r s c o p e . In t h e c a s e of l o c a l v a r i a b l e s , their values are undefined at t h e b e g i n n i n g of t h e s t a t e m e n t part. Parameters provide a substitution mechanism that allows a process to be r e p e a t e d w i t h a v a r i a t i o n of its a r g u m e n t s . (e.g. minmex is called t w i c e t o s c a n a r r a y a a n d o n c e to s c a n a r r a y b.) One n o t e s a correspondence between the procedure heeding and the procedure statement. The latter contains a list of ~ctu~J, &arame~@~r_&, which are s u b s t i t u t e d for t h e c o r r e s p o n d i n g ~ormal ~arameters that are d e f i n e d in t h e p r o c e d u r e declaration. The correspondence is established by the positioning of the parameters in t h e l i s t s of a c t u a l and f o r m a l p a r a m e t e r s . There exist four kinds of parameters: so-called value parameters, variable parameters, procedure parameters (the a c t u a l p a r a m e t e r is a procedure identifier), and function parameters (the a c t u a l parameter is a f u n c t i o n i d e n t i f i e r ) . Program 11.2 shows the case of the E#j~bl~ ~arameter. The actual parcmeter ~ust ~ ~ E~riable; the corresponding formal parameter m u s t be p r e c e d e d by t h e s y m b o l E#32 a n d r e p r e s e n t s this actual variable during t h e e n t i r e e x e c u t i o n of t h e p r o c e d u r e . Furthermore, if x 1 . . x n e r e t h e a c t u a l v a r i a b l e s that correspond to the formal variable parameters v l . . v n , t h e n x l..xn s h o u l d be ~latincj~ v a r i a b l e s . All address calculations a r e d o n e at t h e t i m e of t h e c a l l , H e n c e . if a v a r i a b l e is a c o m p o n e n t o f an a r r a y , expression is e v a l u a t e d w h e n t h e p r o c e d u r e is c a l l e d . To d e s c r i b e arrow for Oarameter parameter. performed parameter with j and When

procedure its i n d e x

the memory allocation pictorially, o n e c o u l d d r a w en each variable parameter f r o m t h e n a m e of t h e f o r m a l to the memory l o c a t i o n of t h e c o r r e s p o n d i n g actual Any operation involving the formal parameter is t h e n directly upon the actual parameter. Whenever the represents a ~ e s u l t of t h e p r o c e d u r e - - a s is t h e c a s e k a b o v e - - i t m u s t be d e f i n e d as a v a r i a b l e p a r a m e t e r .

no s y m b o l

heads

the

parameter

section,

the

parameter(s)

of

72

t h i s s e c t i o n a r e s a i d to be M~llw_~ ~i@J~eJr~J&~/~(s). In t h i s case t h e actual parameter Wjistha an ~KD/~/~JJ3 (of w h i c h a v a r i a b l e is a simple case). The corresponding formal parameter represents a local variable in the called procedure. As its i n i t i a l v a l u e , this variable receives t h e c u r r e n t v a l u e of t h e c o r r e s p o n d i n g actual parameter (i.e. t h e v a l u e of t h e e x p r e s s i o n at the t i m e of the procedure call). The procedure may t h e n c h a n g e t h e v a l u e of this variable by assigning to it; t h i s c a n n o t , h o w e v e r , affect the value of the actual parameter. Hence. a value parameter can never represent a r e s u l t of a c o m p u t a t i o n , The difference in is shown in program

{ program procedure ~roora~

the effects 11.3.

11.3 Parameters

parameters

of

value

and

variable

parameters

}

(output);

va~ a,b : integer; ~roced~r_e h(x : integer; ~/~ beqin x := x + 1 ; y := y+1; writeln(x ,y) end ; ~e~in e := 0; b := 0; h(a.b); writeln(a,b) e~u~,

1

1

0

1

y : integer);

In program 11.2 none of the values in array g are altered; i.e. g is not a result. Consequently g could have been defined as a value parameter without affecting the end result. To understand why this was not done. it is helpful to look at the implementation. A procedure c a l l a l l o c a t e s a n e w a r e a for e a c h v a l u e p a r a m e t e r ; this represents the local v a r i a b l e . T h e c u r r e n t v a l u e of t h e actual parameter is " c o p i e d " i n t o this l o c a t i o n ; exit from t h e procedure simply releases this storage. If a parameter is not used to transfer a result of the procedure, a value parameter is generally preferred. The referencing is then quicker, and one is protected against mistakenly altering the data. However in the case where a parameter is of a structured type (e.g. an array), one should be cautious, for the copying operation is relatively expensive, and the amount of storage needed to allocate t h e c o p y may b e l a r g e . Because referencing of each element in the array occurs only once, it is desirable to define the parameter as a variable parameter.

73

One

may

change

the d i m e n s i o n

of the

array

simply

by r e d e f i n i n g

n. To make the program applicable for an array of reals, needs only to change the type and variable definitions: statements are not dependent upon integer data.

one the

The use of the procedure identifier w i t h i n the text of the procedure itself implies Cenur@_~ve e x e c u t i o n of the p r o c e d u r e . Problems whose definition is n a t u r a l l y r e c u r s i v e , often lend themselves to r e c u r s i v e s o l u t i o n s . An e x a m p l e is the f o l l o w i n g p r o g r a m . G i v e n as data ere the s y m b o l i c e x p r e s s i o n s : (a+b) ~ (c-d) a +b *c -d

( a + b)* a+b* (c-d) b+c*(d+c*a*a

which period

c-d

)*b+a

ere formed according t e r m i n a t e s the i n p u t .

to

the

expression

term J

factor I if actor

factor

identifier

d v I letter I Figure

11.a

Expressions

syntax

of

figure

11.a.

A

74

The task is to c o n s t r u c t a p r o g r a m to c o n v e r t t h e e x p r e s s i o n s into postfix form (Polish notation). This is done by constructing an individual conversion ~rocedure for each syntactic construct (expression, term, factor). As these syntactic constructs are defined recursively, their corresponding procedures may activate themselves recursively,

75

{

program 11.4 conversion to

mrooram ~ar

ch

postfix

form

}

postfix(input,output); : char ;

orocedure find; beoin ~eoent read(oh) until ( c h < > ° °) a n d end ; orooedure vat op

expression : char:

or_~ure

t erm :

~ot

eoln(input)

:

oroce,~ure factor, beoiQ i f o h = °(* t h e n beoin find; expression; end #lse write(oh) : find end: {factor}

~nd;

factor while benin end {term}

: oh= °*° find:

do factor:

beoin term ; ~hile (ch='+')oz(ch =°-') beqin op : = c h ; f i n d : end ~.d331: { e x p r e s s i o n }

{oh

=

) }

write('*')

do term:

write(op)

beoin find ; ~eoeat write(" "); expression ; writeln until oh='."

end. a b +cd - * abc~+dab+c*dabcd -*+ a a ~a {a ~

bcdca*a

~+*b*+a +

The binary r eeursive consists consists

JEr e e i s a d a t a s t r u c t u r e that is naturally defined in terms and processed by recursive algorithms. It of a finite set of nodes t h a t is e i t h e r e m p t y or of a n o d e (the r o o t ) w i t h t w o d i s j o i n t binary trees.

76

c a l l e d t h e left and r i g h t s u b t r e e s (6). R e c u r s i v e p r o c e d u r e s for generating and traversing binary trees naturally reflect this m o d e of d e f i n i t i o n . Program 11.5 builds a binary t r e e and t r a v e r s e s it in p r e - , posta n d e n d o r d e r . The t r e e is s p e c i f i e d in p r e o r d e r , i.e. by listing t h e n o d e s ( s i n g l e l e t t e r s in this c a s e ) s t a r t i n g at t h e root a n d f o l l o w i n g first t h e left and t h e n t h e r i g h t s u b t r e e s so t h a t t h e i n p u t c o r r e s p o n d i n g to f i g u r e 11.b is:

abc..deo,fg...hi..jkt..m..n.. where

a point

signifies

Figure

an e m p t y

11.b

subtree.

Binary

tree s t r u c t u r e

77 { program 11.5 binary tree traversal

}

~ro0ra~ traversal(input.output): tvae ptr = tnode; node = ~ecord info : char: llink.rlink : ptr ~nd; ~ar root : ptr; ch : char; ~roced~ preorder(p : ptr); ~eoin if p<>~3il then ~e~iE write (p~ .info ) ; preorder(pf .llink): preorder(pt .rlink) e_~:

{preorder}

~ro_~~ postorder(p : ptr); beoin if p<>~il then ~eoin postorder(p~ ,llink): write(pt .info); postorder(p~ ~ l i n k ) and

~d3~; {postorder} orocedw~ endorder(p : ptr): ~eoin i f p <>13~Ll J~hen be~in endorder (p~ ,llink); endorder(p~ .rlink); write(p~ .info) and

~nd:

{ endorder }

~ r o e e d u r ~ enter(yam p :ptr); beoin read(oh); write(oh) : if

_~P.O. new(p);

ch<>'.'

be~i~

p? .info := ch; e n t e r ( p t .llink); e n t e r (pt .rlink) : end ~dL~a p

:= n i l

end ;{ enter} beain write(" write(" write(" write("

"); "); ") : ");

enter(root); writeln : preorder(root); writeln; postorder(root); writeln; endorder (root); writeln

en__d.

abc.ode..fg...hi.,jkl..m..n.. abodefghijklmn cbedgfaihlkmjn cegfdbilmknjha

78

The reader is cautioned against applying reeursive techniques indiscriminately. Although appearing "clever". they do not always produce the most efficient solutions.

If a procedure P calls a procedure Q a n d Q a l s o c a l l s P. t h e n either P or Q m u s t be " p r e - e n n o u n c e d " by a ~ o r w a r d ~eclaratioo (section 11.C).

The &&andard ~rgcedures in A p p e n d i x A are predeclared in e v e r y implementation of Pascal. Any implementation may feature additional predeclared procedures. Since they are, as all standard objects, a s s u m e d t o be d e c l a r e d in a scope surrounding the user program, no conflict arises from a declaration redefining the same identifier within the program. The standard procedures get, put, read, write, reset, and rewrite were introduced in c h a p t e r 9. R e e d a n d w r i t e a r e f u r t h e r discussed in chapter 12.

B.

Functions

Functions are program parts (in t h e s a m e s e n s e as p r o c e d u r e s ) which compute a single scalar or p o i n t e r v a l u e f o r u s e in t h e evaluation o£ an e x p r e s s i o n . A ~wnction ~ . s ~ specifies the activation of a function and consists of the identifier designating the function a n d a l i s t of a c t u a l p a r a m e t e r s . The parameters are variables, exoressions, procedures, or f u n c t i o n s end are substituted for t h e c o r r e s p o n d i n g formal parameters. The the

function exception

L~Oct~gJ3 -orL W ~

declaration has the same form of t h e _f_wj3c_t~DJ& ~ e a d i n o which

:
type>

as t h e p r o g r a m , has the form:

with

:

( })

section> :
type>

;

As in t h e c a s e of p r o c e d u r e s , t h e l a b e l s in t h e l a b e l d e f i n i t i o n part and all identifiers introduced in t h e f o r m a l p a r a m e t e r part. the constant definition part. the type definition part, the variable, procedure, or f u n c t i o n declaration parts are i~cal to the function declaration, w h i c h is c a l l e d t h e ~ of t h e s e objects. They a r e not k n o w n o u t s i d e their scope. T h e v a l u e s of local variables are undefined at t h e b e g i n n l n B of the statement pert. The identifier specified in the function heading names the function. T h e r e s u l t t y p e m u s t be a s c a l a r , subrange, or p o i n t e r type. Within the function declaration t h e r e m u s t be an e x e c u t e d assignment (of t h e r e s u l t t y p e ) to t h e f u n c t i o n identifier. This assignment "returns" the result of t h e f u n c t i o n .

79

The examples to date have only dealt with variable and value parameters. Also possible are procedure and function parameters. Both must be introduced by a special symbol: the symbol ~rocedun~ signals a farmal procedure parameter: the symbol function, a formal function parameter. The following program finds a zero of a function by bisection: the function is specified at t h e t i m e of t h e c e l l .

{ program 11.6 find zero of a

Pro~ra~

bisect

function

(input,

by

bisection

}

output):

const eps =1a-14; v__ar_ x ,y :reaI; function zero(J~unction f': real; v a t x ,z : r e e l ; e :boalean; b#oin s := f(a)<0; ~eoeat

z if

x :=

:=

a,b:

real):

real:

(a+b)/2.0;

f(x):

(z
then

a

:: x w l s e

b

:= x

WJ3~Jil a b s ( a - b ) < e p s ; zero :--- X e a ~ : {zero} beqin

{main}

read(x,y); r e a d ( x ,y ) ; ~n~.

writeln(x.y,zero(sin,x ,y)); writeln(x ,y ,z e r o ( c o s , x , y ) )

-l.000O00000000e+00 1.00O000000000e+00

1.00O000000000e+00 2.000000000000e+00

-7.105427357601e-15 1.570796326795e+00

An assignment (occurring in a function declaration) to a non-local variable or taa variable parameter is called a ~i9_~ #f£ec~. Such occurrences often disguise the intent of the program and greatly complicate the task of verification. (Some implementations may e v e n a t t e m p t to forbid side effects.) Hence, the use of functions producing side effects is strongly discouraged. As an

example,

consider

program

11.7.

80

{ program 11.7 test side effect mro~ram

sideffect

} (output);

v~ a,z : integer : ~wnction sneaky(x : integer): integer; ~e~in z := z - x ; { s i d e e f f e c t on z} sneaky := s q r ( x ) end; j~e~in z := 10; a := s n e a k y ( z ) ; writeln(a,z); z := I0; a := s n e a k y ( t 0 ) * sneaky(z); z := 10; a := s n e a k y ( z ) * sneaky(t0); end.

100 0 10000

writeln(aoZ), writeln(a,z)

0 0 -10

The next example formulates the exponentiation program 4 . 6 as a f u n c t i o n declaration.

a3gorithm

of

81

{ p r o g r a m 11.8 extent p r o g r a m proora~

4.8

}

expon2(output);

~AE pi,spi:

real;

~unetioIl p o w e r ( x :real; z: r e a l ; benin z := I; while y>O do ,be.q i_a

y :integer):

w_1~i~ n o t b~_~n y

y

end ; := y-l:

:= y

z

div

2:

x

real;

{y>=O}

o dd ( y ) d o

:= s q r ( x )

:= x*z

end ;

power := ,@Pd: { p o w e r }

Z

_be~In pi := 3.14159; writeln(2.0o7,power(2.0,7)); spi := P o w e r ( p i , 2 ) : w r i t e l n (pi ,2.spi ); writeln(spi~,~2,power(spl,2)); w r i t e l n ( p i , 4 , p o w e r (pi ,4) )

2.000000000000e+O0 3.141590000000e+00 9.86958772U100e+O0 3.141590000000e+00

7 2 2 4

1.2UOOOOOOOOOOe+02 9.869587728100e+00 9.740876192266e+01 9.740876192266e+01

The appearance of the function identifier within the function itself implies ~eeursive function.

in an e x p r e s s i o n e x e c u t i o n of the

82

{ p r o g r a m 11.9 recursive formulation

of g c d }

_Pro@ram r e c u r s i v e g c d ( o u t p u t ) ; Mar x,y,n

: integer;

~unction gcd(m,n: integer):integer; h eoin i f n=0 ~b~J3 god := m ~ I s e god := g c d ( n , m m o d n) end; {gcd}

&~#.~J~LwZ.~ t r y ( a , b :integer): b~J3 w r i t e l n (a ,b ,god (e ,b ) ) end ; beoin

try(18,27); try(312,2142); t r y (61 ,53); try (98,868)

end.

18 312

27

9

2142

61 98

53 868

6 1 14

F u n c t i o n c a l l s may o c c u r b e f o r e is a ~ D J 2 j ~ ~eference (section

The & J ~ predeclared implementation

~unction8 in every may feature

the function 11.C).

definition

if t h e r e

of

Appendix A ere assumed to be implementation of Pascal. Any additional predeclared functions.

C. Remarks I. P r o c e d u r e (function) calls may o c c u r b e f o r e the p r o c e d u r e (function) definition if t h e r e is a ~9J&_~J&~ ~ e f e r e n c e . The form is as follows: (Notice that t h e p a r a m e t e r list and eventual result type are written ~l~llv in the forward reference.)

83 p r o g d#_~L~ Q (x : T ) ; ~ro#edure P (y : T ) ,

forward;

Q(a)

~nd : &~ocedure Q; ~e~in P(b)

{parameters

are

not

repeated}

end : b~zj~ P(a): Q(b) end.

2.

Procedures and functions which are used other procedures and functions must have only. (Consequently. it is not necessary to whether a parameter is called by value or by

as parameters to value 9arameters test at run time address.)

3.

A component of a packed structure must not appear as an actual variable parameter. (Consequently. there is no need to pass addresses of partwords, and to test at run time £or the internal representation of the actual variable.)

12 ~N PU T _AND # U T P U T

The problem of communication between man and c o m p u t e r was a l r e a d y m e n t i o n e d in c h a p t e r 9. B o t h l e a r n to w ~ e r s t a n d through what is t e r m e d p a t t e r n [ e c 0 ~ n i t ~ . Unfortunately, the patterns recognized most e a s i l y by men ( d o m i n a n t l y t h o s e of p i c t u r e and sound) are very d i f f e r e n t from t h o s e a c c e p t a b l e t o a c o m p u t e r (electrical impulses). In fact, the expense of physically t r a n s m i t t i n g d a t a - - i m p l y i n g a t r a n s l a t i o n of p a t t e r n s l e g i b l e to man into o n e s l e g i b l e to a c o m p u t e r , a n d v i c e v e r s a - - c a n be as Costly as the processing of the transmitted information. (Consequently, much r e s e a r c h is d e v o t e d t o m i n i m i z i n g the cost by "automatizing" or "'automating" m o r e of t h e t r a n s l a t i o n process.) T h i s t a s k of c o m m u n i c a t i o n is c a l l e d i n p u t a n d o u t p u t h a n d l ~ n g (I /O). The h u m a n can s u b m i t his i n f o r m a t i o n v i a ~ O o u ~ ~ e v i c e & (e.g. key punches, card readers, paper tapes, magnetic tapes, terminals) and r e c e i v e his r e s u l t s v i a ~ A W ~ d e v i c e s (e,g. l l n e p r i n t e r s , card and paper tape punches, terminals, visual display units), Common to b o t h - - a n d d e f i n e d by each i n d i v i d u a l i n s t a l l a t i o n - - I s a set of legible characters (chapter 2). It is over t h i s character set that Pascal defines the two standard textfile variables (program parameters) ~OD~t and ~ (also see c h a p t e r 9). Textfiles may be a c c e s s e d t h r o u g h the s t a n d a r d f i l e p r o c e d u r e s get and put. This can, of c o u r s e be q u i t e c u m b e r s o m e as t h e s e procedures are defined for single character"manipulation. To illustrate, consider one has a natural number s t o r e d in a variable x a n d w i s h e s to p r i n t it on t h e file o u t p u t . Note that t h e p a t t e r n of c h a r a c t e r s d e n o t i n g t h e d e c i m a l r e p r e s e n t a t i o n of the value w i l l be q u i t e d i f f e r e n t from t h a t d e n o t i n g the v a l u e written as a Roman numeral (see p r o o r a m 4 , 7 ) ° g u t as one is usually interested in d e c i m a l n o t a t i o n , it a p p e a r s s e n s i b l e to offer built-in standard transformation procedures that translate a b s t r a c t n u m b e r s (from w h a t e v e r c o m p u t e r - i n t e r n a l representation is u s e d ) i n t o s e q u e n c e s of d e c i m a l d i g i t s a n d v i c e v e r s a . The two standard procedures to facilitate the analysis syntax for calling these can be used with a variable not fixed.

A.

The procedure

Let real,

I,

~ead and ~rite are thereby extended and the formation of textfiles. The procedures is non-standard, for they number of parameters whose types are

read

v l.v2 . . . . . vn denote variables and let f denote a textfile.

read(v I ..... vn) stands r e a d (input,v I . . . . . vn)

for

of

type

char,

integer,

or

85

2.

read(f ,v 1 . . . . . vn) be~in r e a d ( f ,v l ) ;

3.

readln(vl ..... readln(inPut.v

4.

readln(f,v be~in

.,.

vn) 1 .....

stands for ; read(f,vn) stands vn)

for

I ..... vn) stands for r e a d ( f ,v l ) ; ... ; r e a d ( f . v n ) :

The effect is that after vn the remainder of the current values of v l...vn may s t r e t c h 5.

end

If ch is read(f,ch)

a variable stands

begin

oh

of type for

:= ff ; g e t (f)

readln(f)

eild

is read (from the textfile f), line is skipped, (However ° the over several lines .)

char.

then

~nd

If a parameter v is o f t y p e i n t e g e r (or m s u b r e n g e thereof) or real° a sequence of characters, which represents an integer or a r e e l n u m b e r a c c o r d i n g to the Pascal syntax, is read. (Consecutive numbers m u s t be s e p a r a t e d by b l a n k s or e n d s of l i n e s . )

examples:

Read and process a sequence of numbers where the last value is immediately followed by an asterisk. Assume f to be a textfile, x and ch to be variables of types integer (or real) and char respectively. reset (f) ; reoe~& read(f,x,ch): P(x) ~t~_~l ch :°*"

Perhaps knowing special schemata

a more common situation is w h e n o n e has no w a y o f how m a n y d a t a i t e m s a r e to be r e a d ° a n d t h e r e is no symbol that terminates the list. Two convenient f o l l o w . In t h e f i r s t , s i n g l e i t e m s a r e p r o c e s s e d .

reset(f); read(f,x): ~ n o t e o f ( f ) do .b~:~d.a P ( x ) : r e a d ( f ,x ) ~nd

Notice become The

that true,

second

the does schema

last call not return processes

of read, a defined n-tuples

which value of

causes for x.

numbers :

eof(f)

to

86

reset(f);

~la

read(f

not

beojJ3

eof(f)

,xl);

do

read(f ,x2 ..... P(xl ..... xn); read(f ,xl)

xn);

~nd (For the of s i n g l e

B.

above items

The p r o c e d u r e

schema to function m u s t be a m u l t i p l e

properly, of n.)

the

total

number

write

The procedure write appends character strings (one or m o r e characters) to a textfile. L e t pl p 2 . . . . . p n be p a r a m e t e r s of the form defined below (see 5), e n d let f be a t e x t f i l e . Then, when writing o n t o t h e f i l e f:

1.

w r i t e (p 1 . . . . . pn) stands for write (output,p 1 ..... pn)

2.

wrtte(f,p be~i~

1 ..... pn) write(f,pl):

3.

wrlteln(p I ..... writeln(output,p

4.

writeln b~in

Every

e e e where

stands for ; write(f.pn)

the the

parameter

: el : el

e,

e~

pn) stands for I ..... pn)

(f.p 1 ..... pn) write(f ,pl): ...

This has terminating 5.

...

stands for ; write(f,pn):

effect current

of writing l i n e of t h e

pi

be

must

of

one

of

writeln(f)

pl . . . . . textfile f. the

eo~ pn

end

then

forms:

: e2

el,

and

e2 a r e

expressions,

6.

e is the valw_E t o be written a n d may be o f t y o e integer, real, Boolean, o r i t may be a s t r i n g . In the case, write(f,c) stands for ff := c; p u t ( f )

char, first

7.

el--called the minimum field wldth--is an o p t i o n a l control. It m u s t be a n a t u r a l n u m b e r and indicates the minimum number of characters to be written. In g e n e r a l , t h e v a l u e e is written w i t h el c h a r a c t e r s (with preceding blanks). If el is "too small", m o r e s p a c e is a l l o c a t e d , ( R e a l s m u s t be w r i t t e n w i t h at l e a s t o n e p r e c e d i n g blank; however, this restriction does not apply to integer values.) I f no f i e l d l e n g t h is specified, a default value (implementation dependent) is assumed according to t h e t y p e of t h e e x p r e s s i o n e.

87

8.

e2--called the ~raction i~o_q~--is an o p t i o n a l control and is applicable only when e is of t y p e r e a l . It m u s t be a natural n u m b e r and s p e c i f i e s t h e n u m b e r of d i g i t s to f o l l o w t h e d e c i m a l p o i n t . (The n u m b e r is t h e n s a i d to be w r i t t e n in fi×ed-~oint notation.) If no f r a c t i o n l e n g t h is s p e c i f i e d , t h e v a l u e is p r i n t e d in d e c i m a l f l o a t i n g - p o i n t form.

I0. If the value identifier true

e or

is of type Booleen, f a l s e is w r i t t e n .

then

the

standard

13

~ASOAL &000-3.4

The p u r p o s e of t h i s c h a p t e r is to i n t r o d u c e t h o s e f e a t u r e s t h a t are peculiar to the implementation on t h e C o n t r o l D a t a 6 0 0 0 computers. The reader is w a r n e d t h a t r e l i a n c e u p o n a n y of t h e characteristics peculiar to PASCAL 6000-3.4 may r e n d e r his programs unacceptable to other implementations of P a s c a l . O n e is, therefore, advised to use only features described as ~tandard £ a s c a l in t h e p r e v i o u s chapters whenever possible, and certainly when writing "portable" programs. The topics A) B) C) D)

A.

of t h i s

chapter

fall

into

four

categories:

Extensions to the l a n g u a g e Specifications left u n d e f i n e d in t h e p r e c e d i n g chapters Restrictions Additional predefined procedures, functions, and types

Extensions

to

the

language

Pascal

This section defines non-standard language constructs available on the Pascal 6000-3,4 system. Although they may b e o r i e n t e d toward the particular environment provided by the given operating system, they a r e d e s c r i b e d a n d can be u n d e r s t o o d in machine independent terms.

A.1

Segmented

files

A file can be regarded as being subdivided into so-called ~e~ments, i.e. as a sequence of segments, e a c h of w h i c h is itself a s e q u e n c e . P A S C A L 6 0 0 0 - 3 . 4 o f f e r s e f a c i l i t y to d e c l a r e a file as b e i n g ~ e ~ m e n t L d , a n d to r e c o g n i z e s e g m e n t s a n d t h e i r boundaries. E a c h s e g m e n t of s u c h a file is a " ~ w e i c a l r e c o r d " in CDC S C O P E t e r m i n o l o g y . declaration:

an

example: ~vm~ T = Qeomented

The p r e d i c a t e

eas (x)

The

::= & e ~ m e n t e ~

following

~ile

Lile

~

~

char;

function returns the value positioned st the false.

two

standard

true when the file x is end of a segment, otherwise

procedures

are

introduced:

89

putseg(x)

m u s t be c a l l e d the file x has

getseg(x)

is called in o r d e r to i n i t i a t e t h e r e a d i n g of t h e next segment of the file x . It a s s i g n s to t h e buffer variable ~ the f i r s t c o m p o n e n t of t h a t next segment. If no next s e g m e n t e x i s t s , e o f ( x ) becomes true; if a next s e g m e n t e x i s t s but is empty, then eos(x) becomes true and xt is undefined. S u b s e q u e n t c a l l s of w e t ~ x ) w i l l e i t h e r step on t o t h e next c o m p o n e n t or, if it does not exist, cause eos(x) to b e c o m e t r u e .

Get(x) eof(x)

must always

when been

the generation completed, and

not be c a l l e d if e i t h e r i m p l i e s e o s ( x ).

eos(x)

of

a segment

or e o f ( f )

of

is t r u e :

The advantages of a segmented file lie in the possibility positioning the reading or w r i t i n g head (relatively) quickly any. segment in the file. For the purposes of reading (re)writing a segmented file, the standard procedures getseg rewrite are extended to accept two arguments.

of to and and

g e t s e g (x ,n )

initiates the reading of the nth segment c o u n t i n g f r o m t h e c u r r e n t p o s i t i o n of t h e f i l e , n>O implies counting s e g m e n t s in t h e f o r w a r d direction: n
rewrite(x ,n)

i n i t i a t e s t h e ( r e ) w r i t i n g of x at t h e b e g i n n i n g of the nth s e g m e n t c o u n t i n g f r o m the c u r r e n t position. N o t e : r e w r i t e ( x , 1 ) is n £ & e q u i v a l e n t to r e w r i t e ( x ) . The l a t t e r c a u s e s i n i t i a t i o n of ( r e ) w r i t i n g at t h e v e r y b e g i n n i n g of t h e e n t i r e file .

Since files are organized for sequentfal one should not expect getseg and rewrite n<=0 as they are for n>0.

to

(forward) processing, b e as e f f i c i e n t for

The following two program schemas, with the statements W, R, and S, show the operations of writing and reading of a segmented file.

Writing

a segmented

file x :

r e w r i t e (x); ~eat {generate a segment} ~eo~ {generate a component}

W(xt): un~2 p : putseg w~til q

(x)

put(x)

parametric sequential

90

Note: this segment.

Reading

schem~

will

a segmented

reset

(x):

~hila

not

abila

file

eof(x)

bm.oin { p r o c e s s

never

generate

an

empty

file

nor

an

empty

x :

m~ a segment}

Dot eos (x) d s

be_%i[1 { p r o c e s s

S ( x t );

a component}

set(x)

en~ : S: getseg(x) end The next example shows textfile f and copies the file output.

a procedure first n lines

that reads a segmented of e a c h s e g m e n t onto the

orocedure list ; 3&ar i.s : i n t e g e r ; h~iO s := 0; reset(f):

~ b _ i l a zl#-~ e o f ( f )

do

hf..qin s

i

:= s + 1 :

:=

O;

writeln( ° segment 'os); w__hil~ n o t e o s ( f ) a n d ( i < n ) berlin

i

:='i+I;

{copy

w_hi/J~ D o t e o l n ( f ) b,#oin write(ff ):

a

d~

line}

do get(f)

{next

character}

eOJ~ :

writeln; readln(f) {next eo_~ : getseg(f) {next segment}

line}

end

oo~ The s t a n d a r d procedures segmented textfiles.

A.2

External

~£~q.d and ~ r i t e

can a l s o

be a p p l i e d

to

procedures

PASCAL 6000-3.4 provides a facility to access oxt~oal oroeedures, i.e. procedures (functions) that exist outside the user program and have been separately compi3ed. This enables the Pascal programmer to a c c e s s p r o g r a m libraries. The declaration of such a procedure consists of a p r o c e d u r e heading followed by

the

word

"extern"

or

"Fortran".

91

B.

8.1

Specifications

The

left

program

undefined

heading

and

in

the

external

preceding

chapters

files

A PASCAL file variable is implemented as a f i l e in t h e C D C operating system. Local files are allocated on d i s c s t o r e or in the Extended Core Store (ECS). Storage is a l l o c a t e d when they are generated and automatically released when the block to which t h e y a r e l o c a l is t e r m i n a t e d . Files that exist outside the program ( i . e . b e f o r e or a f t e r program execution) m a y be m a d e a v a i l a b l e to t h e p r o g r a m if t h e y are specified as ~ c t u a I ~ a r a m e t e r s in t h e p r o g r a m ceil statement (EXECUTE) of the control card record. They are called external files and ere subsituted for the ~ l ~#ramE~WJ2~ specified in t h e ~ro~j~am ~ a d i n o . The heeding has the following form: ~£~Oram

where

a program

( <program parameter> { , <program parameter>}

parameter

identifier>

is

) ;

either:

-or-

The parameters are formal declared as file variables same w a y a s a c t u a l local file

identifier>

file identifiers; in t h e m a l n p r o g r a m varfables.

*

they must in e x a c t l y

be the

Files denoted by t h e f o r m a l p a r a m e t e r s inowJ2 a n d ~ u t o u ~ have somewhat special status. The following r u l e s m u s t be n o t e d : I .

2,

The program heading ~L~S~ contain the formal parameter output. Contrary to all other external files, the two formal file identifiers ~nout and ~-9_~ ~st Dot be defined in a declaration, because their declaration is automatically assumed t o be: 3&&~ i n p u t ,

3.

a

output:

text;

The procedures reset and rewrite have to the actual files INPUT and OUTPUT.

examp le : Proora~ ~_~_ x ,y

P (output, text

If an actual card record is

x,

no

effect

if

applied

y);

;

parameter in t h e E X E C U T E statement of the control left empty, the corresponding formal parameter in

92

the p r o g r a m h e a d i n g i s t h e n a s s u m e d as t h e a c t u a l "logical file n a m e " . For e x a m p l e , w h e n c e i l i n g a p r o g r a m w i t h t h e h e a d i n g : p r or ~

p (output,f

,g);

then EXECUTE,(,X,) Ss e q u i v a l e n t t o E X E C U T E , P ( O U T P U T , X . G ) . The full s p e c i f i c a t i o n of t h e f i l e p a r a m e t e r s is r e c o m m e n d e d b e c a u s e reliance on default v a l u e s o f t e n l e a d s t o m i s t a k e s that c o u l d easily have been avoided.

B.2

Representation

of f i l e s

In the case of external files it is i m p o r t a n t t o know t h e representation of files chosen by the P A S C A L c o m p i l e r . E v e r y component of a P A S C A L - 6 0 0 0 file o c c u p i e s an i n t e g r a l n u m b e r of ~O-bit words, with t h e e x c e p t i o n of files w i t h c o m p o n e n t t y p e ~har (~extfiles). In t h i s c a s e P A S C A L f i l e s u s e t h e " s t a n d a r d " representation imDosed by CDC°s text file conventions: 10 characters are packed into each word, implying that the procedures put and get i n c l u d e p a c k i n g and u n p a c k i n g o p e r a t i o n s when applied to t e x t f i l e s . The end of a l i n e is r e p r e s e n t e d by at least 12, right-adjusted zero-bits in a word. Files originating from ~ard ~ f o l l o w the s a m e g e n e r a l t e x t f i l e conventions. Note that the operating system ~moves most (but not necessary all) trailing blanks when reading cards, Hence, such files do Dot necessarily c o n s i s t of 8 0 - c h a r a c t e r "card images" Files that are not s e g m e n t e d a r e w r i t t e n as a s i n g l e " l o o i c a l record" (in SCOPE terminology). W h i l e r e a d i n g an u n s e g m e n t e d external file, end-of-record marks are ignored [for an exception, s e e p o i n t 3 b e l o w ] . In s e g m e n t e d f i l e s , each s e g m e n t corresponds to a "logical record". T h e r e is no p r o v i s i o n to specify a "record level".

Use of e x t e r n a l I.

files

If an external file is t o be r e a d ( w r i t t e n ) , t h e n in t h e Case of non~egmented files, reading (writing) must be i n i t i a t e d by t h e s t a t e m e n t (rewrite(x)

r es et (x)

and

in t h e

case

reset(x) getseg(x.n)

of s e g m e n t e d

) files

by

(rewrite(x) > or (rewrite(x.n) )

(This statement is automatically implied for the f i l e s denoted by the f o r m a l p a r a m e t e r s j~omut a n d ~ u t n u t , and must not be s p e c i f i e d by the p r o g r a m m e r . )

2.

Every

external

file

is a u t o m a t i c a l l y

"opened"

by a call

of

93

the OPE r o u t i n e of t h e o p e r a t i n g s y s t e m . I f t h i s o p e n i n g is to be r e s t r i c t e d to t h e r e a d f u n c t i o n - - e . g , in t h e c a s e of a permanent file without write permission--then t h i s has t o be i n d i c a t e d by an a s t e r i s k f o l l o w i n g t h e f i l e p a r a m e t e r in t h e program heading. The asterisk itself constitutes no protection a g a i n s t w r i t i n g on t h e f i l e . example: ~r_Q_gram ~i

testdata (output.

d a t a : "f_ila ~ f r: r e a l ; r

:='data~ ;

data*) :

real ;

get(date)

o . .

3.

If the a c t u a l file n a m e I N P U T is s u b s t i t u t e d corresponding to a formal program parameter, s a y f, t h e n f is t h e c u r r e n t s i n g l e l o g i c a l r e c o r d of t h e f i l e I N P U T .

B.3

The

standard

types

INTEGER

The

standard ~on~

identifier

maxint

=

~axint

is

281474976710655;

defined {

as

= 2**48

-

1 }

The r e a d e r is c a u t i o n e d , h o w e v e r , that t h e CDC c o m p u t e r p r o v i d e s no indication of o v e r f l o w . It is, t h e r e f o r e , the programmer's responsibility to p r o v i d e a c h e c k w h e n e v e r t h i s m i g h t o c c u r . Actually. the machine is c a p a b l e of s t o r i n g i n t e g e r s up t o an absolute value of 2**59, but then ~ the operations of addition (+), subtraction (-), taking the absolute value, multiplication and division by powers of 2 ( i m p l e m e n t e d as shifts), and comparisons a r e c o r r e c t l y e x e c u t e d in t h i s r a n g e (as l o n g as no o v e r f l o w o c c u r s ) . In p a r t i c u l a r , one cannot even p r i n t an i n t e g e r v a l u e i w h e n a b s ( i ) > m a x i n t . This does, however, allow the following test: if a b s ( i )

> maxint

the•

write("

too

big')

REAL

The type ~al is d e f i n e d a c c o r d i n g to CDC 6 0 0 0 f l o a t i n g f o r m a t . P r o v i d e d is a m a n t i s s a w i t h 48 b i t s , c o r r e s m o n d i n g d e c i m a l d i g i t s . The m a x i m u m a b s o l u t e m a g n i t u d e is 1 0 - ' 3 2 2 .

point to 14

94

CHAR

A v a l u e of t y p e ~ h a r is an e l e m e n t in t h e c h a r a c t e r set p r o v i d e d by t h e p a r t i c u l a r i n s t a l l a t i o n . The f o l l o w i n g 3 v e r s i o n s e x i s t : I) The C D C 2) T h e C D C 3) The CDC

Scientific 64-character Scientific 63-character A S C I I 6 4 - c h a r a c t e r set

set set

Table 13.a lists the a v a i l a b l e c h a r a c t e r s and i n d i c a t e s t h e i r o r d e r i n g : ~ o t e : t h e CDC s p e c i f i c a t i o n i m p l i e s an o r d e r i n g of the ASCII characters w h i c h d i f f e r s from the I n t e r n a t i o n a l S t a n d a r d (ISO)!

95

CDC

Scientific

Character

Set

with

64

elements

j~

1

2

3

4

5

6

7

8

9

:

A

a

C

O

E

F

G

H

I

J

K

L

M

N

0

P

q

R

S

T

U

V

W

X

Y

Z

~

I

2

3

4

5

6

7

8

9

+

-

*

/

(

)

$

=

-~

[

]

~

~

m

v

~

<

>

not

used

set

version

, ^

~

Comments: -

~

- 51

:

in

the

- 48

'

at

ETH

-

53

t

at

ETH

-

57

t

at

ETH

ASCII

Character

1

in

63-character

5et

2

63-character

with

set

CDC's

version

ordering

3

4

5

6

7

8

9

:

A

B

C

D

E

F

G

H

I

J

K

L

M

N

0

P

q

R

5

T

U

V

W

X

Y

Z

~

I

2

3

4

5

6

7

8

9

+

/

(

)

,

=

J

Figure

~

13.a

:

,,

CDC

, &

character

'

sets

?

*

~

[

<

>

96

Based upon the above character sets, the following characters are accepted as s y n o n y m s for t h e s t a n d a r d s y m b o l g i v e n in t h e left c o l u m n :

5tandard

Pascal

CDC

scientific -7

not and o__r <> <= >= Figure

B.4

The

13.b:

standard

_< > Alternative representation standard symbols

of

"'write"

If no m i n i m u m f i e l d l e n g t h p a r a m e t e r default values are assumed.

is s p e c i f i e d ,

the

following

default

integer real

10 22

Boolean

10

char

a string a l f a (see

&

A V

procedure

type

ASCII

( w h e r e t h e e x p o n e n t is a l w a y s e x p r e s s e d in t h e f o r m : E~999

)

1

length D.I)

of t h e

string

10

The end of each line in a textfile f m u s t be e x p l i c i t l y indicated by writeln(f)o where writeln(output) m a y be w r i t t e n simply as w r i t e l n . If a t e x t f i l e is to be s e n t to a p r i n t e r ~ no line may c o n t a i n m o r e t h a n 136 c h a r a c t e r s . The f i r s t c h a r a c t e r of each line is interpreted by the printer as a control character and ~s not printed. The following characters are interpreted to mean °+"

blank "0"

no llne f e e d (overprinting) single spacing double spacin~ triple spacing s k i p to top of next p a o e b e f o r e

printing

The procedure writeln(x) is u s e d to m a r k t h e end of a line on fi&e x, The conventions of t h e C D C o p e r a t i n g s y s t e m r e o a r d i n g textfile representation a r e s u c h that t h i s p r o c e d u r e is f o r c e d to emit some extra blanks under certain circumstances. Hence,

97

upon that

reading, a textfile were never explicitly

(as

may c o n t a i n blanks at the end written. (Sorry about this!)

C.

Restrictions

of

1.

The word

2.

T h e b a s e t y p e of a set m u s t be e i t h e r a) a s c a l a r w i t h at m o s t 58 e l e m e n t s (or a s u b r a n g e

se.qmented

March

of

lines

1974)

is r e s e r v e d .

thereof)

OF

b)

c)

3.

a subrange with a minimum element greater than or equal to zero, and a maximum element less than or equal to 58, or a subrange of the type chat with the maximum element less than or equal to the value chr(58).

Standard (built-in) proeedrues or functions are not accepted as actual parameters. For example, in order to run program 11.6 in PASCAL 6 0 0 0 - 3 . 4 , one would have to write auxiliary functions as follows:

functi~233 s i n e ( x : r e e l ) : r e a l ; beqin s i n e := s i n ( x ) ~U~: fw43ction c o s i n e ( x : r e a l ) : r e a l : b#~in c o s i n e := cos (x) ~3~; fu~£~

zero(function ... e~ :

be~in

f: r e a l ;

e,b:

real):

real;

..•

be~in read(x,y); r e a d ( x oy ) ;

4.

It is records

not and

D. Additional

D,I

Additional

The t y p e ~ i f & ~y~

ella

writeln writeln

( x .y , z e r o ( s i n e , x , y ) ) ; (x ,y , z e r o ( c o s i n e , x , y )

p o s s i b l e to c o n s t r u c t a file of f i l e s : h o w e v e r , a r r a y s w i t h f i l e s as c o m p o n e n t s a r e a l l o w e d .

predefined

types,

predefined

procedures,

by:

= ~acked ~rrav[1..10] type of

functions

types

is p r e d e f i n e d

(Hence, a value of word.) The constants characters.

and

ella this

~

char;

is representable type are &trinqs

in exactly of exactly

one 10

98

Applicable comparison, test order values may

on operands of type ella are assignment where = a n d <> t e s t equality and <, <=, according to the underlying character be p r i n t e d by t h e p r o c e d u r e write.

(:=) a n d >=. and > set. Alfa

{ program 13.1 al£a values } ~roeram

egalfa

(output);

vat nl.n2: alfa; beoin write(" names: '); nl := "raymond ": n2 := "debby if n2 < nl then writeln(n2.nl) else writeln (nl,n2) end.

nameS:

debby

Note: It is the following vi~

raymond

not possible to is s u g g e s t e d :

read

alfa

values

directly;

instead,

b u r : a r r a y [ I..10] of c h a r ; a: a l f a ; i: i n t e g e r ;

E e l i :: I t o 10 d o r e a d ( b u f [ i ] ); pack(buf01.a ) {accomplishes read(a)}

D .2 A d d i t i o n a l

predefined

procedures

and

functions

~roce~ures date (a)

assigns

halt

terminates the execution issues a poat-mortem dump.

linellmit(f,x)

f is a t e × t f i l e a n d x is an i n t e g e r expression, The effect is to cause the program to be terminated, if m o r e t h a n x l i n e s a r e a s k e d t o be written on f i l e f.

message (x)

the string x is written (Hence. x should contain at

time (a)

assigns

putseg

.

getseg,

and

to

to the

the

the

alfa

alfa

extensions

variable

variable to

the of

current the

date.

program

and

into the dayfile. most 40 characters.) a the rewrite

current and

reset

time. are

99

discussed

in s e c t i o n

13.A.I.

~unctlons

card(x)

equals number

clock

a function, without parameters, yielding an i n t e g e r v a l u e e q u a l to the c e n t r a l p r o c e s s o r t i m e , expressed in milliseconds, already u s e d by t h e job.

expo(x)

yields an integer the floating~oint value x.

undefined(x)

a Boolean function. Its value real value x is "out of r a n g e " o t h e r w i s e f a l s e [ 7] .

cos(x)

(discussed

the cerdinal±ty of the s e t x (i.e. t h e of e l e m e n t s c o n t a i n e d in t h e s e t x.)

v a l u e e q u a l to t h e e x p o n e n t of representation of the real

in s e c t i o n

13.A.1)

is t r u e w h e n t h e or " i n d e f i n i t e " ,

14 Use ~he ~ASCAL ~ 0 0 0 - 3 . 6 ~ v s t e m

Uow ~ A. C o n t r o l

statements

(for

SCOPE

3.4)

A Pascal job u s u a l l y c o n s i s t s of f o u r s t e p s . F i r s t , the P a s c a l compiler is loaded. The s e c o n d s t e p is t h e c o m p i l a t i o n step, which yields a l i s t i n g of t h e s o u r c e p r o g r a m a n d t h e c o m p i l e d code. In the third step t h e c o m p i l e d c o d e , d e p o s i t e d by t h e compiler on secondary store, is loaded and linked with precompiled routines for input and output handling, which are provided on a "program l i b r a r y " f i l e . F i n a l l y , t h e c o m p i l e d a n d loaded code is executed. These four s t e p s a r e i n i t i a t e d by appropriate orders to the operating system in t h e f o r m of ~ontro~ Qtetements. The e x a c t f o r m of t h e s e s t a t e m e n t s and t h e i r abbreviated f o r m s ( l o a d i n g a n d e x e c u t i o n c a n o f t e n be o r d e r e d by a single statement) depend entirely upon the available operating system, and must therefore be specified by the particular installation. The a c t u a l f i l e p a r a m e t e r s , w h i c h c o r r e s p o n d t o t h e f o r m a l file identifiers l i s t e d in t h e p r o g r a m h e a d i n g , m u s t be s p e c i f i e d in the statement initiating execution of the compiled program ( u s u a l l y an E X E C U T E c o m m a n d ) . The

compiler ~r_~wJE~

itself

is a l s o

a Pascal

program.

Its

heading

is

Pascal(input.output,lgo);

The first formal source p r o g r a m to end t h e t h i r d , the

parameter d e n o t e s t h e file r e p r e s e n t i n g the be c o m p i l e d ; t h e s e c o n d , t h e p r o g r a m l i s t i n g ; compiled "binary", relocatable code.

The COC operating systems allow the omission of a c t u a l parameters in t h e c o n t r o l s t a t e m e n t s . If an a c t u a l f i l e name is omitted, the Pascal convention on p r o g r a m p a r a m e t e r s specifies t h a t t h e f o r m a l file i d e n t i f i e r be u s e d as t h e a c t u a l file n a m e . Hence, the standard files INPUT, OUTPUT, and LGO are automatically a s s u m e d as t h e d e f c u l t ~ i l e s for t h e s o u r c e file, the listing, and the relooatable binary code respectively. Note, however, that these roles m a y be a s s u m e d by o t h e r f i l e s w h e n their names are entered as actual parameters. Note: actual parameters m u s t c o n s i s t of at m o s t 7 c h a r a c t e r s .

B . Compiler

options

The compiler may be i n s t r u c t e d to g e n e r a t e c o d e a c c o r d i n g to c e r t a i n o p t i o n s ; in p a r t i c u l a r , it may be r e q u e s t e d to insert or omit r u n - t i m e t e s t i n s t r u c t i o n s . Compiler directives are written as comments and a r e d e s i g n a t e d as s u c h by a S - c h a r a c t e r as the first c h a r a c t e r of t h e c o m m e n t :

101

{$