The instruction set (Mac-1a) introduced in chapter 6 was severely limited - particularly in its inability to call subprograms. We now extend our coverage to the remainder of the Mac-1 instruction set, and attempt some more ambitious programs.
Firstly, we will introduce the additional instructions - especially those based on the stack; we will explain the purpose of a stack and its use for handling subprograms.
In addition, we will describe the memory addressing modes found on most general purpose computers. Next, we will show how some more real programs can be constructed, from simple three or four liners, to subroutines, input-output and interrupts.
Although this chapter is entirely about Mac-1, the presentation is such that the principles of general-purpose computers are emphasised. Thus, someone who follows this chapter will have little difficulty in understanding Motorola 68000, Intel 80x86 series, or virtually any conventional computer.
In a computer, memory locations can hold instructions or data. In addition, as we shall see, the data can be interpreted either as a plain value, e.g. 100, or as an address or reference to another data item. Those of you who are familiar with C/C++ will recognise pointers; Java people will recognise references.
In general, machine instructions take zero, one, or two operands; e.g. in `lodd a0', a0 is the single operand and lodd is the operation.
Actually Mac-1 has no multi-operand instructions. For a start, it is an accumulator machine, i.e. in instructions like addd and lodd, the second - implicit - operand is AC, the accumulator.
Operands can be data, or can refer to data - i.e. the address of data - or can be labels - which translate to addresses of instructions, e.g. for jumps.
The question of addressing is concerned with how operands are interpreted. In the case of data the operand can be:
Immediate - where the operand is the data itself:

    LOCO 5    ; 0111 0000 0000 0101

and it is the `5' in the instruction that gets loaded into the AC.
Direct - where the operand is the address of the data:

    lodd 501    ; 0000 0101 0000 0001

If memory cell 501 contains `7'

          +--------------+
    501:  |    ...0111   |
          +--------------+

it is the `7' that gets loaded into the AC.
Indirect - where the operand is the address of the address of the data:

    lodi 501    ; xxxx 0101 0000 0001

If memory cell 501 contains `50a', and memory cell 50a contains `3', we have:

          +--------------+
    501:  |      50a     |
          +--------------+
          ...
          +--------------+
    50a:  |       3      |
          +--------------+

and it is the `3' that gets loaded into the AC.
          lodd 1230
    loop: addi 1230+SI    ;ac <- [ac] + m[1230+si]
          inc SI
          ...test for end
          jump loop
Of course, because Mac-1 has no index registers, I have had to invent one (SI), and an addressing mode for add: addi - add direct using SI as an index; I also had to invent an `increment' instruction. Indexing is a bit like running your finger along a table of figures. The instructions 'lodl'...'subl' introduced below are a bit like indexed instructions - but they use a very special index register called the stack-pointer (SP).
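Since Mac-1 itself cannot run this loop, here is a sketch of the same indexing idea in Python; the names (indexed_sum, mem) are invented for illustration, and memory is modelled as a dict of address -> word.

```python
# SI runs a "finger" along a table starting at `base';
# each element is added into the accumulator AC.
def indexed_sum(memory, base, length):
    ac = 0   # accumulator
    si = 0   # the invented index register SI
    while si < length:
        ac = ac + memory[base + si]   # addi base+SI : ac <- ac + m[base+si]
        si = si + 1                   # inc SI
    return ac

mem = {1230: 5, 1231: 7, 1232: 3}
print(indexed_sum(mem, 1230, 3))   # 15
```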
Addressing modes look complicated, but if you are careful to analyse what a construct means - by drawing a diagram, if necessary - then there are no real pitfalls.
Also, for those who are not specialists in assembly programming, you should keep to the simple modes and only use the complex modes when they are absolutely essential.
Figure 7.1 shows the Mac-1 instruction set extended to the full repertoire given in [Tanenbaum, 1990]; we do not bother with the binary version of the instruction - as in Figure 5.2 and Figure 6.1, since we will not be assembling programs using the additional instructions, nor writing them in machine code.
jneg x    Jump on negative    if ac < 0 : pc <- x
jnze x    Jump on nonzero     if ac != 0 : pc <- x
These save some of the trouble encountered using just jpos and jzer; however, as we have seen, they are not essential.
In Mac-1, the register SP is the stack-pointer and is dedicated to maintaining the stack; the stack itself - the data pointed-to - is actually part of main-memory.
The stack is a memory into which values can be stored and from which they can be retrieved on a last-in-first-out (LIFO) basis. Ideally, you store with a PUSH and retrieve with a POP. It may help to think of an analogy such as a spring-loaded canteen tray dispenser, or a bus conductor's coin dispenser; the main point is that you can only put a value on the top (PUSH), read the value at the top (TOP), or remove the value at the top (POP). In spite of its simplicity, this device has a remarkably large impact on the computational capability of a computer. A stack gives us a sort-of indirect addressing and also a sort-of indexed addressing via the stack pointer; but a stack does much more than that: it is the basis of the implementation of functions, procedures, and blocks in block-structured high-level languages. SP points to the top of the stack - i.e. to the memory location where the last value was pushed.
In the case of Mac-1, the stack grows from high memory towards low memory. PUSH increases the size of the stack by one and places a value in the new memory cell (at the top). POP exactly reverses the process, i.e. it retrieves the last value written (the top) and decreases the size of the stack; PUSH followed by POP has no net effect. And, as usual with Mac-1, most things are done through the accumulator (AC): PUSH pushes the number in the AC, and POP retrieves the top of the stack into the AC.
PUSH operates as follows:
PUSH    ;sp <- sp - 1    ;SP decremented; NB. this INCREASES the size of the stack
        ;m[sp] <- ac     ;put contents of AC into the memory cell that SP POINTS TO
POP:
POP     ;ac <- m[sp]     ;get contents of cell pointed to by SP into AC
        ;sp <- sp + 1    ;decrease size of stack
Note carefully again that the stack actually grows downwards, one word at a time - actually this is the case on a great many machines. Normally, in Mac-1 programs, we will assume that SP starts off pointing at memory cell 4020.
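The push/pop behaviour, and the downward growth, can be sketched in Python; the Machine class and its names are invented for illustration, with memory modelled as a dict.

```python
# Minimal sketch of Mac-1's downward-growing stack. SP starts at 4020H;
# PUSH decrements SP then stores AC; POP loads AC then increments SP.
class Machine:
    def __init__(self):
        self.mem = {}       # main memory: address -> word
        self.sp = 0x4020    # stack pointer
        self.ac = 0         # accumulator

    def push(self):
        self.sp -= 1                  # sp <- sp - 1 (stack GROWS downwards)
        self.mem[self.sp] = self.ac   # m[sp] <- ac

    def pop(self):
        self.ac = self.mem[self.sp]   # ac <- m[sp]
        self.sp += 1                  # sp <- sp + 1 (stack shrinks)

m = Machine()
m.ac = 30; m.push()   # 30 now at 401fH
m.ac = 91; m.push()   # 91 now at 401eH
m.pop()               # ac == 91: last in, first out
```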
/(a)
        lodd a0    /ac <- [a0] (=30)
        push
/(b)
        lodd a1    /ac <- [a1] (=91)
        push
/(c)
        pop        /ac <- m[sp]; sp <- sp + 1
        stod a0
/(d)
        pop
        stod a1
/(e)
In examples like this, to show the address of a memory cell and what it contains, we use the notation address: contents, e.g.

    500: 30
(a)  a0  500: 30
     a1  501: 91
     AC: ?
                                  4018: ?
                                  4019: ?
     SP: 4020 ---points to--->    4020: ?

(b)  a0  500: 30
     a1  501: 91
     AC: 30
                                  4018: ?
     SP: 4019 ---points to--->    4019: 30
                                  4020: ?

(c)  a0  500: 30
     a1  501: 91
     AC: 91
     SP: 4018 ---points to--->    4018: 91
                                  4019: 30
                                  4020: ?

(d)  a0  500: 91
     a1  501: 91
     AC: 91
                                  4018: ?
     SP: 4019 ---points to--->    4019: 30
                                  4020: ?

(e)  a0  500: 91
     a1  501: 30
     AC: 30
                                  4018: ?
                                  4019: ?
     SP: 4020 ---points to--->    4020: ?
Comments:
push    Push onto stack    sp <- sp-1; m[sp] <- ac
pop     Pop off stack      ac <- m[sp]; sp <- sp+1
See section 7.4.
pshi    Push indirect    sp <- sp-1; m[sp] <- m[ac]
popi    Pop indirect     m[ac] <- m[sp]; sp <- sp+1
Thus, the AC is used `indirectly' - see the indirect addressing mode, section 7.2; i.e. the value that is the contents of the memory cell pointed to by [ac] is pushed or popped.
CALL and RETN are used for CALLing subprograms (methods or functions in Java) and RETurning from them.
call x    Call procedure    sp <- sp-1;         m[sp] <- pc;    pc <- x
                            (get stack ready)   (save pc)       (put jump address in pc)
In effect, CALL is a JUMP label, but with the important difference that we remember where we were in the calling program, i.e. we remember where we came from, so that we can get back there again.
retn    Return from proc.    pc <- m[sp]; sp <- sp + 1
RETN reverses the process: the saved return address is popped off the stack into the PC, so execution resumes at the instruction after the CALL.
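The way CALL and RETN co-operate through the stack can be sketched in Python with a toy program counter stepping through numbered instruction addresses; all names are invented for illustration.

```python
# CALL saves the address of the next instruction on the stack and jumps;
# RETN pops that address back into the PC.
stack = []

def call(pc, target):
    stack.append(pc + 1)   # m[sp] <- pc : remember where to come back to
    return target          # pc <- x     : jump into the subprogram

def retn():
    return stack.pop()     # pc <- m[sp] : jump back to the saved address

pc = 204
pc = call(pc, 100)   # CALL: pc becomes 100, and 205 is saved on the stack
# ... the subprogram runs at 100, 101, ... then:
pc = retn()          # RETN: pc becomes 205, the instruction after the CALL
```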
We have now introduced most of the new instructions; we wait for later sections to introduce the others. Also, we shall see in later sections more detail on how subprograms work.
This is mostly a repeat of section 5.6.
As stated earlier, there are no direct instructions for input-output; instead Mac-1a uses memory-mapped input-output, whereby some memory cells are mapped to input-output ports; for simplicity we assume that there are only two ports, one connected to a standard-input device, the other connected to a standard-output device:
We assume that each device works with bytes (i.e. 8-bits).
A read from address 0FFCH yields a 16-bit word, with the actual data byte in the low-order byte. There is no use in reading the input port until the connected device has put the data there, so 0FFDH is used to read the input status register; the top bit (sign) of 0FFDH is set when the input data is available (DAV). Thus, a read routine should go into a tight loop, continuously reading 0FFDH, until it goes negative; then 0FFCH can be read to get the data. Reading 0FFCH clears 0FFDH again.
Output, to 0FFEH, runs along the same lines as input. A write to 0FFEH will send the low-order byte to the standard-output device. The sign bit of 0FFFH signifies that the device is in a ready-to-receive (RDY) state; again, there is no use writing data to the output port until the device is ready to read it.
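The busy-wait convention can be sketched in Python, modelling the two ports as entries in a memory dict; a negative status word means the device is ready, mirroring the sign-bit convention above. The function name and the one-shot "ready" simulation are invented for illustration.

```python
# Polled (busy-wait) output: spin on the status register until its
# sign bit is set (negative), then write the low-order byte to the port.
def polled_write(mem, byte):
    while mem[0x0FFF] >= 0:      # loop while status not negative (not ready)
        pass                     # busy-wait
    mem[0x0FFE] = byte & 0xFF    # write low-order byte to the output port
    mem[0x0FFF] = 0              # device busy again until it catches up

mem = {0x0FFF: -1}               # pretend the device is ready (sign bit set)
polled_write(mem, 0x41)          # send 'A'
```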
test:   lodd fff    /read status
        jpos test   /not ready
        jzer test   /not ready
out:    lodd 500
        stod ffe    /output
The polling code given above is unsatisfactory for many purposes; what happens, for example, if the output device is broken, or is switched off, and, as a consequence, never becomes ready? The program would stay in the tight loop, and the only way to stop it would be to reset/reboot.
Change the code to count its `not-ready' failures and, if this count ever reaches `maxcount' (e.g. maxcount = 100) to put -1 in AC and JUMP to label `exit'.
This section further illustrates the use of assembly language with some simple examples.
Note: in none of the following examples have I tried to optimise the code or to be clever; assembly coding is difficult enough without that.
Mac-1 has no multiply or divide - not even for integers - so we have to do it all with adds and subtracts.
Note: we will assume that the result will fit into 16 bits. Q. What general check on the two operands will ensure this? Ans. If the multiplication is z = x * y, then if x <= 127 and y <= 127, the result will certainly be less than 32767, which is the largest positive number that can be stored in 16-bit twos complement.
We will also assume, without any loss of generality that both numbers are positive; working with negative numbers adds a little housekeeping work that need not concern us now.
To implement z = x * y, we use the algorithm shown in Figure 7.2, i.e. we zero z, then add y into it x times. Therefore, we can use most of the for-loop mechanism described in chapter 6. The assembly code is shown in Figure 7.3.
At the end (end:) the result is in z.
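The Figure 7.2 algorithm is easy to check in a high-level language; here is a Python sketch (Python chosen for brevity - on Mac-1 there are only adds and jumps).

```python
# Multiply by repeated addition: zero z, then add y into it x times.
# Assumes x and y are non-negative, as in the text.
def mult(x, y):
    z = 0
    for i in range(x):   # repeat x times
        z = z + y        # z <- z + y (add, since Mac-1 has no multiply)
    return z
```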
Again, we assume that both numbers are positive, and that it is sensible to divide them - again we will leave out some of the bullet-proofing details.
z = x / y;
We use the algorithm in Figure 7.4, i.e. we count how many times we have to subtract 'y' from 'x' to get to a remainder less than 'y'. The assembly code is shown in Figure 7.5.
At the end (end:) the result is in `z'. Of course, the result is an `integer' divide, e.g. 7 / 2 = 3. You could modify this code to give you % (modulo - or `remainder after division').
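The repeated-subtraction algorithm of Figure 7.4 can be sketched in Python; note that the leftover x at the end is exactly the remainder (%). The function name is invented for illustration.

```python
# Divide by repeated subtraction: count how many times y can be taken
# from x before the remainder drops below y. Assumes both positive and
# y != 0, as in the text.
def div(x, y):
    z = 0
    while x >= y:
        x = x - y    # subtract y once more...
        z = z + 1    # ...and count it
    return z, x      # quotient, remainder
```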
Assume that a0, a1, a2, ... a9 are stored contiguously in 500, 501, ... 509, and we want to treat them as an array arr[10]; we require to sum them and put the result in sum. We will assume that all the numbers are sensible, i.e. that overflow isn't a problem. We could address each of them individually:
init:   loco 0
        addd a0
        addd a1
        ...
        addd a9
        stod sum
but that wouldn't be too clever for very long arrays.
We'll use a loop, see Figure 7.6. The assembly program is shown in Figure 7.7.
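The loop strategy can be sketched in Python; the pointer-bumping mimics what indirect addressing (lodi) makes possible on a machine with no index registers. All names are invented for illustration.

```python
# Sum an array by keeping a pointer to the current element and a counter,
# bumping the pointer each time round the loop.
def array_sum(mem, base, n):
    total = 0       # plays the role of `sum'
    ptr = base      # pointer to the current element (what lodi would use)
    count = n       # loop counter
    while count > 0:
        total += mem[ptr]   # load the word the pointer refers to
        ptr += 1            # advance the pointer
        count -= 1
    return total

mem = {addr: val for addr, val in zip(range(500, 510), [1, 2, 3, 4, 5, 6, 7, 8, 9, 10])}
print(array_sum(mem, 500, 10))   # 55
```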
As mentioned above, CALL and RETN are used for CALLing subroutines and RETurNing from them. One objective of subroutines is to avoid having to repeat large chunks of code.
Side-Note: On modern machines and particularly when block-structured languages like Pascal, C, or Modula-2 are used, subroutines via a stack are essential. Older machines often used a JUMP-and-STORE type of call: the return address was stored in a memory address somewhere amongst the code of the subroutine.
That is, we use global variables to pass values to and from the subprogram.
Recall the multiplication program above:
z = x * y;
using the following algorithm:
z = 0;
for (i = 1; i <= x; i++) {
    z = z + y;
}
This is fine, but have a look at the assembly code; it is not the sort of thing you want to be bothered with each time you want to multiply two numbers. The code for the multiply will never change, only the names of the variables (x, y, z).
Say you wanted to do the following:
c = a * b;
d = e * f;
g = c + d;
A simplistic solution would be:
startprog:  lodd a      /(assume at location 200)
            stod x
            lodd b
            stod y
            jump mult   /204
**          lodd z      /205
            stod c      /206
But, without subprograms, there is a major problem: the program never gets to `**', because when it has finished the multiply program, it continues on to the next instruction after `mult' (end:, i.e. 115H), not 205H as desired.
Note, however, there is nothing to stop you (at 115H) JUMPing to 205 - but this means that the `mult' program is only useful for this one multiply; later, at 20aH (say), when we want to multiply `e' and `f', we want to JUMP back to a different place.
This gets us to one of the crucial differences between JUMP and CALL (subprogram). With JUMP, it's a one way ticket, you don't ever come back! With CALL you can remember where you came from and JUMP back there (using RETN) when you're finished in the subprogram.
Figure 7.8 shows how we can make the multiply program into a procedure `mult' (just add RETN at the end) and how to use it.
Note: there is no explicit use of the stack, all PUSHes and POPs are done implicitly by CALL, RETN.
The procedure `mult' above has no parameters; effectively we used global variables x, y, z for the parameters and return value. In C++ this would look like Figure 7.9.
This is alright, but I'm sure your Programming courses up to now have warned you against the evils of global variables.
So, we now write `mult1' that looks like this:

int mult1(int x, int y)
{
    return x * y;
}

and is called like: c = mult1(a, b);
To do this we need to make explicit use of the stack: we PUSH onto the stack the two values to be multiplied - the arguments, then CALL, finally when we RETN we retrieve (POP) the result from the stack. We can make the multiply program into a proper procedure `mult1' (just add RETN at the end) with parameters and return values, see Figure 7.10. We will also change `mult' slightly to ensure that it uses only two local variables - the only reason for this is to make it a bit more like the one in [Tanenbaum, 1990].
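To see the convention in action, here is a Python sketch that models the pushes, the CALL, and the result coming back in the AC; the addresses and stack offsets are illustrative, not the exact Figure 7.10 layout, and all names are invented.

```python
# Caller pushes the two arguments, CALL saves the return address,
# the callee finds the arguments just above the saved pc on the stack,
# multiplies by repeated addition, and RETN pops the return address.
def simulate_mult1_call(a, b):
    mem = {}
    sp = 0x4020
    sp -= 1; mem[sp] = a        # push a
    sp -= 1; mem[sp] = b        # push b
    sp -= 1; mem[sp] = 0x210    # call mult1: save the return address
    # callee: arguments sit just above the saved pc
    x = mem[sp + 2]             # first argument (a)
    y = mem[sp + 1]             # second argument (b)
    ac = 0
    for _ in range(x):
        ac += y                 # the repeated-addition multiply
    pc = mem[sp]; sp += 1       # retn: pop the return address
    return ac, pc               # result comes back in AC

result_ac, return_pc = simulate_mult1_call(25, 159)   # 25 * 159; return to 210H
```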
Above, we used the trick of passing the result back in the AC. This is alright for a single 16-bit result, but, if the result was more than one word (e.g. 32-bit floating point stored in two words, or a record), we would have to use the stack.
The usual solution is to allocate space for the return values just before the return address.
Most computers (e.g. 80X86, 680X0, VAX) have a multitude of general-purpose CPU registers - just as if Mac-1 allowed the assembly programmer access to registers A, B, ... F.
Therefore, as well as storing the return address - which we could term `storing the caller's PC' - it may be necessary to store these registers as well before going off to a procedure. This is because, inevitably, the procedure will overwrite some or all of the registers, and on return we want to be able to `restore' the registers and carry on where we left off. This pair of processes is called `saving' and `restoring' the `environment'.
Alternatively, a more common practice is to leave it up to the procedure to save whatever registers it will overwrite - and restore them just before returning.
Here, of course, we are concentrating on subprograms; there is nothing to stop a programmer using the stack as a general temporary storage area, or for a purpose like the swapping mentioned above.
In connection with procedures, there are four uses for the stack: saving the return address; passing parameters; holding local variables; and saving and restoring registers (the environment).
int a, b, c;
a = 25;
b = 159;
c = mult1(a, b);
a = 0; /* ...etc */
See section 7.9.2 for `mult1' in assembly:

int mult1(int i, int j);
The interesting action happens after `b = 159;'.
[sp] <- 401f, [401f] <- 25     ; push a
[sp] <- 401e, [401e] <- 159    ; push b
[sp] <- 401d, [401d] <- 210    ; call: save the return address
Addr.   Contents
-----   --------
4020       ?
401f      25
401e     159
401d     210    <-- SP   (SP now points at 401d)
But we're still not done. We must allocate space for the two local variables on the stack. We could do this by:
loco 0
push
push
i.e. push two zeros onto the stack, but a quicker way is
desp 2
i.e. decrement the SP by 2, in which case the stack looks like
Addr.   Contents
-----   --------
4020       ?
401f      25
401e     159
401d     210
401c      ?*
401b      ?*    <-- SP   (SP now points at 401b)
Note that the space for the local variables is now uninitialised, i.e. 401c and 401b contain whatever was last put in them. This is why you can never assume that a local variable is initialised to 0.
Thus, in general, the stack looks like Figure 7.11. This is called the subprogram's stack frame.
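The frame built up above can be reproduced in a few lines of Python, with the addresses as in the diagrams; the variable names are invented for illustration.

```python
# Build the stack frame: arguments pushed, CALL saving the return
# address, then `desp 2' reserving two UNinitialised local slots.
mem = {}
sp = 0x4020
sp -= 1; mem[sp] = 25      # push a         -> 401f
sp -= 1; mem[sp] = 159     # push b         -> 401e
sp -= 1; mem[sp] = 0x210   # call: save pc  -> 401d
sp -= 2                    # desp 2: locals at 401c, 401b - NOT initialised!

assert sp == 0x401b           # SP now points at 401b
assert 0x401c not in mem      # locals contain "whatever was there before"
```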
If you called the procedure mult1 recursively three times - or, indeed, there were three nested calls of different procedures - the situation would look like Figure 7.12.
In this way a procedure can call itself again and again, without one call interfering with the other; the only limit is the size of the stack.
Earlier machines, and languages like FORTRAN, did not use a stack. Parameter passing, return addresses, and local storage were all mixed in with the procedure code area - and there was only one copy - so any recursive calls, if they were allowed, would have overwritten the local data area etc. of the previous call.
If i - the counter in mult1 - was declared globally, you cannot allow recursive calls to the procedure. Explain why - use a stack-frame diagram if necessary.
(a) Write pow - see above - recursively. Express the algorithm in Java and get it working; it should take no more than three or four lines.

(b) Express the recursive pow algorithm in Mac-1 assembly code.
In the scheme mentioned in the previous two subsections, parameters are passed by value, or by copy. Thus, procedure mult1 can do whatever it likes to the memory location that contains the 25; the copy of a (the parameter) and a itself (the argument) are separate, and so the argument a - in the caller - will never change.
In Java, parameters are, by default, passed by value; in C++ you have reference parameters, which are passed by reference, or by access, so that if you change the parameter inside the procedure, that change is reflected outside the procedure. See section 7.2.
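The distinction can be sketched in Python: rebinding an int parameter behaves like pass by value, and a one-element list is used here to emulate a reference parameter (an analogy only, not C++ semantics); both function names are invented.

```python
# Pass by value: the procedure works on a copy; the caller never sees it.
def by_value(x):
    x = x * 2          # changes only the local copy
    return x

# Emulated pass by reference: the procedure changes the caller's storage.
def by_reference(cell):
    cell[0] = cell[0] * 2

a = 25
by_value(a)            # a is copied; the caller's a is untouched
b = [25]
by_reference(b)        # b[0] is now doubled - the change is visible outside
```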
The following - from [Tanenbaum, 1990], pages 179-183 - shows a much more realistic example. It shows a complete program, in Pascal. In particular, in contrast to some of our examples, it shows that all local variables reside on the stack. It gives a further example of how to handle arrays.
[Insert p. 179 here]

[Insert pp. 180-81 here]

[Insert pp. 182-83 here]
As indicated above, a non-procedure solution could have been used: repeat the mult code as many times as a multiply was required. As well as making the program bigger, this would have introduced the greater problem of three pieces of code to debug/maintain, instead of just one.
Now, if we are content to accept the increase in program size, use of a macro avoids the second problem (i.e. maintenance of separate copies of the same code).
Essentially, you declare a macro containing the working bits of the subroutine (no need for the housekeeping bits at the top and bottom) and then insert the macro code wherever the CALL appears.
Macros are used whenever you want to trade memory for speed - you waste no time PUSHing and POPping the stack; however, avoid if possible - they make a program difficult to read, think about and, consequently, hard to test.
[NB. Mac-1 has no interrupts - the following is modelled on interrupts on the 80X86].
The scheme of polled input and output indicated in sections 7.6.2 and 7.6.3 is fine for many cases, especially if the computer is just reading data from a single source.
However, consider the case of the keyboard and GUI interfaces with a mouse. In Windows and other operating systems the keyboard is read even when the computer is away running another part of the program. This is done with a special type of subroutine call - an interrupt.
When you hit a key on the keyboard, something like the following happens: the CPU finishes the instruction it is currently executing; the PC is pushed onto the stack - just as in a CALL; and the CPU loads the PC with the address of the keyboard's interrupt service routine.

[We are now in the interrupt service routine]

The service routine reads the character from the input port, puts it in a buffer, and finishes with an IRET, which pops the saved PC so that the interrupted program carries on.
A key factor is the transparency of the interrupt. The interrupt causes the service routine to run, but when that routine is finished, and IRET executed, the executing program should be none the wiser - except, maybe, it notices that an instruction took 20 or 30 microsecs to run, instead of just 1 microsec; and, of course, there will be another character in the input buffer.
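Transparency can be sketched with a toy fetch-execute loop in Python that checks for a pending interrupt between instructions; the single-interrupt model and all names are invented for illustration.

```python
# Between instructions the CPU checks for a pending interrupt; if one is
# pending it saves the PC, runs the service routine (which buffers the
# key), and "IRET" restores the PC - the program is none the wiser.
def run(program, interrupt_at, key):
    pc, acc, buffer, stack = 0, 0, [], []
    while pc < len(program):
        if pc == interrupt_at:      # a key was hit: interrupt!
            stack.append(pc)        # save the PC (like CALL)
            buffer.append(key)      # service routine reads the character
            pc = stack.pop()        # IRET: restore the PC, carry on
        acc += program[pc]          # "execute" one instruction
        pc += 1
    return acc, buffer

result = run([1, 2, 3, 4], interrupt_at=2, key='A')
```

Whether the interrupt fires or not, the program computes the same sum; the only visible difference is the extra character in the buffer.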
(a) what is the total time for each interrupt?
(b) estimate the highest interrupt frequency that may occur?
Assume that there are no other generators of interrupts.
Mass storage devices like disk or tape require very rapid transfer of data from the device to memory and vice-versa. A typical hard disk transfers data at about 1 byte per 2 microsecs (1993); neither polled nor interrupt-initiated input-output could cope with this.
In addition, if you look at the system diagram in Figure 7.13, you will see that both the disk (for example) and memory are connected to the system bus. Hence, there may be little requirement for the CPU to get involved in a data transfer, except to initiate it. Typically, a third device will be connected to the bus - a DMA controller - which mediates between the two data transfer devices and ensures an orderly use of the bus.
In DMA, the only effect on the CPU and executing program is a slight slowing down, because the DMA must steal bus cycles - so-called cycle-stealing - for the data transfer. For example, recalling Mac-1a and memory read/write: during a LODD, Mac-1a would place the address to be read in MAR and issue an RD; if a DMA byte transfer was happening, the DMA would have to be able to tell the CPU to wait for a cycle or two.
addr. 20 contains 40
addr. 30 contains 50
addr. 40 contains 60
addr. 50 contains 70

(1) load immediate 20
(2) load direct 20
(3) load indirect 20
(4) load immediate 30
(5) load direct 30
(6) load indirect 30
Assume that there are other generators of interrupts.
6. Explain how an INDEX register may assist in the handling of arrays; use as an example the case where you wish to add 3 to the array of 10 numbers starting at address 500.
7. Multiplication etc. Write a subprogram `mpyadd' that evaluates (x + y * z) and returns the result in the AC; assume you can CALL `mult1', as described in the text, to do the multiplication. It would be called as:

w = mpyadd(x, y, z);
c = 45H; error = outch(c,1000);
error = write4(a,b,c,d);