This chapter gives the rules for mixing assembly language with Dynamic C code. A reference guide to the Rabbit Instruction Set is available from the Help menu of Dynamic C and is also documented in the Rabbit 2000 Microprocessor User's Manual.
11.1 Mixing Assembly and C
Dynamic C permits assembly language statements to be embedded in C functions and/or entire functions to be written in assembly language. C statements may also be embedded in assembly code and refer to C-language variables in the assembly code.
11.1.1 Embedded Assembly Syntax
Use the
#asmand#endasmdirectives to place assembly code in Dynamic C programs. For example, the following function will add two 64-bit numbers together. The same program could be written in C, but it would be many times slower because C does not provide an add-with-carry operation (adc).The keywords
debugandnodebugcan be placed on the same line as#asm. Assembly code blocks arenodebugby default. This saves space and unnecessary calls to the debugger kernel.All blocks of assembly code within a C function are assembled in nodebug mode. The only exception to this is when a block of assembly code is explicitly marked with
debug. Any blocks markeddebugwill be assembled in debug mode even if the enclosing C function is markednodebug.11.1.2 Embedded C Syntax
A C statement may be placed within assembly code by placing a C in column 1. For example, initialize global variables.
#asm
InitValues::
ld hl,0xa0;
c start_time = 0;
c counter = 256;
ret
#endasm11.1.3 Setting a Breakpoint in an Assembly Block
Starting with Dynamic C version 7.20, there are two ways to enable breakpoint support in a block of assembly code.
One way is to explicity mark the assembly block as
debug(the default condition isnodebug). This causes the insertion of "rst 0x28" instructions between each assembly instruction. These rst 0x28 instructions may cause jump relative (i.e.,jr) instructions to go out of range, but this problem can be solved by changing the relative jump (jr) to an absolute jump (jp).The other way to enable breakpoint support in a block of assembly code is to add a C statement before the desired assembly instruction. Note that the assembly code must be contained in a debug C function in order to enable C code debugging. Below is an example.
debug dummyfunction() {
#asm
function::
...
label:
...
c ; // add line of C code to permit a breakpoint before jump relative
jr nc, label
ret
#endasm}
NOTE Single-stepping through assembly code is always allowed if the assembly window is open. 11.2 The Assembler and the Preprocessor
The assembler parses most C language constant expressions. A C language constant expression is one whose value is known at compile time. All operators except the following are supported:
Table 13. Operators Not Supported By The Assembler ?: conditional [ ] array index . dot -> points to * dereference 11.2.1 Comments
C-style comments are allowed in embedded assembly code. The assembler will ignore comments beginning with
;-- text from the semicolon to the end of line is ignored.//-- text from the double forward slashes to the end of line is ignored./* ... */-- text between slash-asterisk and asterisk-slash is ignored.11.2.2 Defining Constants
Constants may be created and defined in assembly code. The assembly language keyword
db("define byte") places bytes at the current code segment address. The keyworddbshould be followed immediately by numerical values and strings separated by commas as shown here.Example
Each of the following defines a string "ABC" in code space.
db 'A', 'B', 'C'
db "ABC"
db 0x41, 0x42, 0x43The numerical values and characters in strings are used to initialize sequential byte locations.
The assembly language keyword
dwdefines 16-bit words, least significant byte first. The keyworddwshould be followed immediately by numerical values, as shown in the following example.Example
This example defines three constants. The first two constants are literals, and the third constant is the address of variable xyz.
dw 0x0123, 0xFFFF, xyzThe numerical values initialize sequential word locations, starting at the current code address.
11.2.3 Multiline Macros
The Dynamic C preprocessor has a special feature to allow multiline macros in assembly code. The preprocessor expands macros before the assembler parses any text. Putting a
$\at the end of a line inserts a new line in the text. This only works in assembly code. Labels and comments are not allowed in multiline macros.
#define SAVEFLAG $\
ld a,b $\
push af $\
pop bc#asm
...
ld b,0x32
SAVEFLAG
...
#endasm11.2.4 Labels
A label is a name followed by one or two colons. A label followed by a single colon is local, whereas one followed by two colons is global. A local label is not visible to the code out of the current embedded assembly segment (i.e., code before the
#asmor after the#endasmdirective).Unless it is followed immediately by the assembly language keyword
equ, the label identifies the current code segment address. If the label is followed byequ, the label "equates" to the value of the expression after the keywordequ.Because C preprocessor macros are expanded in embedded assembly code, Z-World recommends that preprocessor macros be used instead of
equwhenever possible.11.2.5 Special Symbols
This table lists special symbols that can be used in an assembly language expression.
Table 14. Special Assembly-Language Symbols @SPIndicates the amount of stack space (in bytes) used for stack-based variables. This does not include arguments. @RETVALEvaluates the offset from the frame reference point to the stack space reserved for the struct function returns. See Section 11.4.1.1 on page 121 for more information. @LENGTHDetermines the next reference address of a variable plus it size. 11.2.6 C Variables
C variable names may be used in assembly language. What a variable name represents (the value associated with the name) depends on the variable. For a global or static local variable, the name represents the address of the variable in root memory. For an
autovariable or formal argument, the variable name represents its own offset from the frame reference point.The name of a structure element represents the offset of the element from the beginning of the structure. In the following structure, for example,
struct s {
int x;
int y;
int z;
};the embedded assembly expression
s+xevaluates to 0,s+yevaluates to 2, ands+zevaluates to 4, regardless of where structuresmay be.In nested structures, offsets can be composite, as shown here.
struct s {
int x; // s + x = 0
struct a{ // s + a = 2
int b; // a + b = 0 s + a + b = 2
int c; // a + c = 2 s + a + c = 4
};
};11.3 Stand-Alone Assembly Code
A stand-alone assembly function is one that is defined outside the context of a C language function. Before Dynamic C version 7.25, stand-alone assembly functions were always placed in root memory.
A stand-alone assembly function has no
autovariables and no formal parameters. It can, however, have arguments passed to it by the calling function. When a program calls a function from C, it puts the first argument into a primary register. If the first argument has one or two bytes (int, unsigned int, char, pointer), the primary register is HL (with register H containing the most significant byte). If the first argument has four bytes (long, unsigned long, float), the primary register is BC:DE (with register B containing the most significant byte). Assembly-language code can use the first argument very efficiently. Only the first argument is put into the primary register, while all arguments--including the first, pushed last--are pushed on the stack.C function values return in the primary register, if they have four or fewer bytes, either in HL or BC:DE.
Assembly language allows assumptions to be made about arguments passed on the stack, and auto variables can be defined by reserving locations on the stack for them. However, the offsets of such implicit arguments and variables must be kept track of. If a function expects arguments or needs to use stack-based variables, Z-World recommends using the embedded assembly techniques described in the next section.
11.3.1 Stand-Alone Assembly Code in Extended Memory
Starting with Dynamic C 7.25, stand-alone assembly functions may be placed in extended memory by adding the
xmemkeyword as a qualifier to#asm, as shown below. Care needs be taken to make sure that branch instructions do not jump beyond the current xmem window. To help prevent such bad jumps, the compiler limits xmem assembly blocks to 4096 bytes. Code that branches to other assembly blocks in xmem should always useljporlcall.#asm xmem
main::
...
lcall fcn_in_xmem
...
lret
#endasm#asm xmem
fcn_in_xmem::
...
lret
#endasm11.3.2 Example of Stand-Alone Assembly Code
The stand-alone assembly function
foo()can be called from a Dynamic C function.int foo ( int ); // A function prototype can be declared for stand-alone
// assembly functions, which will cause the compiler
// to perform the appropriate type-checking.
main(){
int i,j;
i=1;
j=foo(i);
}#asm
foo::
...
ld hl,2 // The return value expected by main() is put
ret // in HL just before foo() returns
#endasmThe entire program can be written in assembly.
#asm
main::
...
ret
#endasm11.4 Embedded Assembly Code
When embedded in a C function, assembly code can access arguments and local variables (either
autoorstatic) by name. Furthermore, the assembly code does not need to manipulate the stack because the functionsprologandepilogalready do so.11.4.1 The Stack Frame
The purpose and structure of a stack frame should be understood before writing embedded assembly code. A stack frame is a run-time structure on the stack that provides the storage for all
autovariables, function arguments and the return address for a particular function. If the IX register is used for a frame reference pointer, the previous value of IX is also kept in the stack frame. The following figure shows the general appearance of a stack frame.![]()
The return address is always necessary. The presence of auto variables depends on the function definition. The presence of arguments and structure return space depends on the function call. (The stack pointer may actually point lower than the indicated mark temporarily because of temporary information pushed on the stack.)
The shaded area in the stack frame is the stack storage allocated for
autovariables. The assembler symbol@SPrepresents the size of this area.11.4.1.1 The Frame Reference Point
The frame reference point is a location in the stack frame that immediately follows the function's return address. The IX register may be used as a pointer to this location by putting the keyword
useixbefore the function, or the request can be specified globally by the compiler directive#useix. The default is#nouseix. If the IX register is used as a frame reference pointer, its previous value is pushed on the stack after the function's return address. The frame reference point moves to encompass the saved IX value.11.4.2 Example of Embedded Assembly Code
The purpose of the following sample program,
asm1.c, is to show the different ways to access stack-based variables from assembly code.void func(char ch, int i, long lg);
main(){
char ch;
int i;
long lg;
ch = 0x11;
i = 0x2233;
lg = 0x44556677L;func(ch,i,lg);
}void func(char ch, int i, long lg){
auto int x;
auto int z;x = 0x8888;
z = 0x9999;#asm
// @SP+igives the offset ofifrom the stack frame on entry.
// On the Z180, this is how HL is loaded with the value ini.
// (The assembler combinesiand @SP into one constant.)
ld hl,@SP+i
add hl,sp
ld hl,(hl)// On the Rabbit, this code does the same:
ld hl,(sp+@SP+i)
// This works iffunc()is useix, however, if the IX register
// has been changed by the user code, this code will fail.
ld hl,(ix+i)// This method works in either case because the assembler
// adjusts the constant @SP, so changing the function to
// nouseix with the keywordnouseix, or the compiler
// directive#nouseixwill not break the code. But, if SP has
// been changed by user code,(e.g. a push) it won't work.
ld hl,(sp+@SP+lg+2)
ld b,h
ld c,L
ld hl,(sp+@SP+lg)
ex de,hl
#endasm
}11.4.2.1 The Disassembled Code Window
A program may be debugged at the assembly level by clicking the Assemb radio button on Dynamic C's toolbar to open the Disassembled Code window. Single-stepping and breakpoints are supported in this window. When the Disassembled Code window is open, single-stepping occurs instruction by instruction rather than statement by statement. The figure below shows the Registers, Stack and Disassembled Code windows for the example code,
asm1.c, just before the function call.![]()
11.4.2.2 Instruction Cycle Time
The Disassembled Code window shows the memory address on the far left, followed by the code bytes for the instruction at the address, followed by the mnemonics for the instruction. The last column shows the number of cycles for the instruction, assuming no wait states. The total cycle time for a block of instructions will be shown at the lowest row in the block in the cycle-time column, if that block is selected and highlighted with the mouse. The total assumes one execution per instruction, so the user must take looping and branching into consideration when evaluating execution times.
11.4.3 Local Variable Access
Accessing static local variables is simple because the symbol evaluates to the address directly. The following code shows, for example, how to load static variable
yinto HL.
ld hl,(y) ; load hl with contents of y11.4.3.1 Using the IX Register
Access to stack-based local variables is fairly inefficient. The efficiency improves if IX is used as a frame pointer. The arguments will have slightly different offsets because of the additional two bytes for the saved IX register value.
Now, access to stack variables is easier. Consider, for example, how to load
chinto register A.
ld a,(ix+ch) ; ach
The IX+offset load instruction takes 9 clock cycles and opcode is three bytes. If the program needs to load a four-byte variable such as lg, the IX+offset instructions are as follows.
ld hl,(ix+lg+2) ; load LSB of lg
ld b,h ; longs are normally stored in BC:DE
ld c,L
ld hl,(ix+lg) ; load MSB of lg
ex de,hlThis takes a total of 24 cycles.
The offset from IX is a signed 8-bit integer. To use IX+offset, the variable must be within +127 or -128 bytes of the frame reference point. The
@SPmethod is the only method for accessing variables out of this range. The@SPsymbol may be used even if IX is the frame reference pointer, .11.4.3.2 Functions in Extended Memory
If the
xmemkeyword is present, Dynamic C compiles the function to extended memory. Otherwise, Dynamic C determines where to compile the function. Functions compiled to extended memory have a 3-byte return address instead of a 2-byte return address.Because the compiler maintains the offsets automatically, there is no need to worry about the change of offsets. The
@SPapproach discussed previously as a means of accessing stack-based variables works whether a function is compiled to extended memory or not, as long as the C-language names of local variables and arguments are used.A function compiled to extended memory can use IX as a frame reference pointer as well. This adds an additional two bytes to argument offsets because of the saved IX value. Again, the IX+offset approach discussed previously.can be used because the compiler maintains the offsets automatically.
11.5 C Functions Calling Assembly Code
Dynamic C does not assume that registers are preserved in function calls. In other words, the function being called need not save and restore registers.
11.5.1 Passing Parameters
When a program calls a function from C, it puts the first argument into HL (if it has one or two bytes) with register H containing the most significant byte. If the first argument has four bytes, it goes in BC:DE (with register B containing the most significant byte). Only the first argument is put into the primary register, while all arguments--including the first, pushed last--are pushed on the stack.
11.5.2 Location of Return Results
If a C-callable assembly function is expected to return a result (of primitive type), the function must pass the result in the "primary register." If the result is an
int,unsigned int,char, or a pointer, return the result in HL (register H contains the most significant byte). If the result is along, unsigned long,orfloat, return the result in BCDE (register B contains the most significant byte). A C function containing embedded assembly code may, of course, use a Creturnstatement to return a value. A stand-alone assembly routine, however, must load the primary register with the return value before theretinstruction.11.5.2.1 Returning a Structure
In contrast, if a function returns a structure (of any size), the calling function reserves space on the stack for the return value before pushing the last argument (if any). Dynamic C functions containing embedded assembly code may use a C
returnstatement to return a value. A stand-alone assembly routine, however, must store the return value in the structure return space on the stack before returning.Inline assembly code may access the stack area reserved for structure return values by the symbol
@RETVAL, which is an offset from the frame reference point.The following code shows how to clear field
f1of a structure (as a returned value) of typestruct s.It is crucial that @SP be added to
@RETVALbecause@RETVALis an offset from the frame reference point, not from the current SP.11.6 Assembly Code Calling C Functions
A program may call a C function from assembly code. To make this happen, set up part of the stack frame prior to the call and "unwind" the stack after the call. The procedure to set up the stack frame is described here.
Save all registers that the calling function wants to preserve. A called C function may change the value of any register. (Pushing registers values on the stack is a good way to save their values.)
If the function return is a
struct, reserve space on the stack for the returned structure. Most functions do not return structures.
Compute and push the last argument, if any.
Compute and push the second to last argument, if any.
Continue to push arguments, if there are more.
Compute and push the first argument, if any. Also load the first argument into the primary register (HL for
int, unsigned int, char, and pointers, or BCDE for long, unsigned long,andfloat) if it is of a primitive type.
Issue the call instruction.
The caller must unwind the stack after the function returns.
Recover the stack storage allocated to arguments. With no more than 6 bytes of arguments, the program may pop data (2 bytes at time) from the stack. Otherwise, it is more efficient to compute a new SP instead. The following code demonstrates how to unwind arguments totaling 36 bytes of stack storage.
If the function returns a
struct, unload the returned structure.
Restore registers previously saved. Pop them off if they were stored on the stack.
If the function return was not a
struct, obtain the returned value from HL or BCDE.11.7 Interrupt Routines in Assembly
Dynamic C allows Interrupt Service Routines (ISRs) to be written in C (declared with the keyword
interrupt). However, the efficiency of one interrupt routine affects the latency of other interrupt routines. Assembly routines can be more efficient than the equivalent C functions, and therefore more suitable for ISRs.Either stand-alone assembly code or embedded assembly code may be used for interrupt routines. The benefit of embedding assembly code in a C-language ISR is that there is no need to worry about saving and restoring registers or reenabling interrupts. The drawback is that the C interrupt function does save all registers, which takes some amount of time. A stand-alone assembly routine needs to save and restore only the registers it uses.
Interrupts are turned off by the CPU before the ISR is called. Generally, the ISR performs the following actions:
Save all registers (that will be used) on the stack. Interrupt routines written in C save all registers on the stack automatically. Stand-alone assembly routines must push the registers explicitly.
Determine the cause of the interrupt. Some devices map multiple causes to the same interrupt vector. An interrupt handler must determine what actually caused the interrupt.
Remove the cause of the interrupt.
If an interrupt has more than one possible cause, check for all the causes and remove all the causes at the same time.
When finished, restore registers saved on the stack. Naturally, this code must match the code that saved the registers. Interrupt routines written in C perform this automatically. Stand-alone assembly routines must pop the registers explicitly.
Reenable interrupts. Interrupts are disabled for the entire duration of the interrupt routine (unless they are enabled explicitly). The interrupt handler must reenable the interrupt so that other interrupts can get the attention of the CPU. Interrupt routines written in C reenable interrupts automatically when the function returns. Stand-alone assembly interrupt routines, however, must reenable the interrupt (ipres) explicitly.
The interrupts should be reenabled immediately before the return instructionsretorreti. If the interrupts are enabled earlier, the system can stack up the interrupts. This may or may not be acceptable because there is the potential to overflow the stack.
Return. There are three types of interrupt returns:
ret, reti, andretn.11.8 Common Problems
Unbalanced stack. Ensure the stack is "balanced" when a routine returns. In other words, the SP must be same on exit as it was on entry. From the caller's point of view, the SP register must be identical before and after the call instruction.
Using the @SP approach after pushing temporary information on the stack. The
@SPapproach for inline assembly code assumes that SP points to the low boundary of the stack frame. This might not be the case if the routine pushes temporary information onto the stack. The space taken by temporary information on the stack must be compensated for.The following code illustrates the concept.
Registers not preserved. In Dynamic C, the caller is responsible for saving and restoring all registers. An assembly routine that calls a C function must assume that all registers will be changed.
Unpreserved registers in interrupt routines cause unpredictable and unrepeatable problems. In contrast to normal functions, interrupt functions are responsible for saving and restoring all registers themselves.
| Z-World http://www.zworld.com Voice: (530) 757-3737 FAX: (530) 757-3792 sales@zworld.com |