Awesome
armips assembler v0.11
- Author: Kingcom
- Source: https://github.com/Kingcom/armips
- Automated builds: http://buildbot.orphis.net/armips
1. Introduction
Note: This file is still incomplete, some information is missing or may be outdated.
1.1 Usage
The assembler is called from the command line. The usage is as follows:
armips code.asm [optional parameters]
code.asm
is the main file of your assembly code, which can open and include other files.
The following optional command line parameters are supported:
-temp <filename>
Specifies the output name for temporary assembly data. Example output:
; 1 file included
; test.asm
00000000 .open "SLPM_870.50",0x8000F800 ; test.asm line 1
8000F800 .org 0x800362DC ; test.asm line 5
800362DC jal 0x801EBA3C ; test.asm line 7
800362E0 .Close ; test.asm line 9
-sym <filename>
Specifies the output name for symbol data in the sym format. This format is supported by the debuggers in NO$PSX and NO$GBA. Example output:
00000000 0
80000000 .dbl:0010
80000010 main
8000002C subroutine
80240000 newblock
-sym2 <filename>
Specifies the output name for symbol data in the sym2 format. This format is supported by the debuggers in PCSX2 and PPSSPP. Example output:
00000000 0
80000000 .dbl:0010
80000010 Main
8000002C Subroutine,0000001C
80240000 NewBlock,00000014
-erroronwarning
Specifies that any warnings shall be treated like errors, preventing assembling. This has the same effect as the .erroronwarning
directive.
-equ <name> <replacement>
Equivalent to using name equ replacement
in the assembly code.
-strequ <name> <replacement>
Equivalent to using name equ "replacement"
in the assembly code.
-definelabel <name> <replacement>
Equivalent to using .definelabel name, replacement
in the assembly code.
-root <directory>
Specifies the working directory to be used during execution.
-stat
Outputs statistics for bytes used within areas after completion. Example output:
Total areas and regions: 5342 / 7934
Total regions: 916 / 1624
Largest area or region: 0x0806E80C, 564 / 1156
Most free area or region: 0x0806E80C, 564 / 1156 (free at 0x0806EA40)
Most free region: 0x0806E80C, 564 / 1156 (free at 0x0806EA40)
2. Installation
2.1 Download binary
Download the latest Windows 32-bit binary from the Automated armips builds site. You will need the Microsoft Visual Studio 2015 x86 Redistributable.
2.2 Building from source
The latest code is available at the armips GitHub repository. Make sure to also initialize and update submodules. This can be accomplished with one command:
$ git clone --recursive https://github.com/Kingcom/armips.git
You will need CMake and a C++17 compliant compiler (recent versions of Visual Studio, GCC and Clang have been tested). All CMake generators should be supported, but Ninja is recommended and the most well tested. Create a build directory, invoke CMake from there, and then simply run the chosen build tool. E.g. on Unix platforms:
$ mkdir build && cd build
$ cmake -DCMAKE_BUILD_TYPE=Release ..
$ cmake --build .
Or on Windows using Visual Studio:
$ mkdir build && cd build
$ cmake ..
$ cmake --build . --config Release
Please refer to the CMake documentation for further information.
3. Overview
The assembler includes full support for the MIPS R3000, MIPS R4000, Allegrex and RSP instruction sets, partial support for the EmotionEngine instruction set, as well as complete support for the ARM7 and ARM9 instruction sets, both THUMB and ARM mode. Among the other features of the assembler are:
- a full fledged C-like expression parser. It should behave exactly like in any C/C++ code, including all the weirdness. All immediate values can be specified by an expression, though some directives can't use variable addresses including labels
- you can open several files in a row, but only one output file can be open at any time. You can specify its address in memory to allow overlay support. Any file can cross-reference any other included file
- local, static, and global labels (see 4.3 Labels)
- table support for user defined text encodings (see 4.7 String encoding)
- several MIPS macros to make writing code easier and faster (see 5.1 General directives)
- user defined macros (see 6.3 User defined macros)
- built-in checks for possible load delay problems (see 4.6 Load delay detection)
- optional automatic fix for said problems by inserting a nop between the instructions
- output of the assembled code to a text file, with memory addresses and origin (see 1.1 Usage)
- a directive to ensure that data is not bigger than a user defined size (see 4.8 Areas)
4. Features
4.1 Files
Unlike other assemblers, you don't specify the input/output file as a command line argument. You have to open the file in the source code, and also close it yourself. This was done in order to support overlays, which are very common in PSX and NDS games. Instead of only having one output file, you can have as many as you need - each with its own address in memory. The files can cross-reference each other without any problems, so you can call code from other files that are currently not opened as well.
.Open "SLPS_035.71", 0x8000F800
; ...
.Close
.Open "System\0007.dat", 0x800CC000
; ...
.Close
4.2 Syntax
Comments
Both ;
and //
style single-line comments are supported.
/* */
style block comments are also accepted.
Statement separator
Statements are separated by newlines or ::
can be used between statements on the same line. For example, to insert four nop
instructions, this could be written on one line:
nop :: nop :: nop :: nop
Statement line spanning
Single statements can continue on to the next line by inserting a \
at the end of a line. Comments and whitespace can follow. For example:
.ascii "NSM", (VERSION == "us") ? "E" : \
(VERSION == "jp") ? "J" : \
(VERSION == "eu") ? "P" : \
"X"
4.3 Labels
A label is defined by writing its name followed by a colon. It creates a symbol with that name as its identifier, and with the current memory address as its value. There is support for both local, global and static labels. Local labels are only valid in the scope between the previous and the next global or static label. Specific directives, like .org
, will also terminate the scope. All labels can be used before the point where they are defined.
GlobalLabel: ; This is a global label.
@StaticLabel: ; This is a static label.
@@LocalLabel: ; This is a local label, it is only valid
; until the next global or static one.
OtherGlobalLabel: ; this will terminate the scope where
; @@LocalLabel can be used.
b @@LocalLabel ; as a result, this will cause an error.
Static labels behave like global labels, but are only valid in the very file they were defined. Any included files or files that include it cannot reference it. They can, however, contain another static label with the same name. When a static label is defined insde a (nested) macro, it is treated as being defined in the file where the top-level macro call occurred, rather than the file holding the macro definition where the static label is created.
A label name can contain all characters from A-Z, numbers, and underscores. However, it cannot start with a digit. All label names are case insensitive.
Additionally, .
is a special label and can be used to reference the current memory address; it is equivalent to calling the expression function org()
.
A label can also be defined using the .func
/.function
directive. The example below will create a label MyLabel
pointing to the current memory address. In addition, if the -sym2
command line flag is used to output a sym2 file, the size of the function block (from .func
to .endfunc
) will also be written to the symfile.
.func MyLabel
; assembly code
.endfunc
4.4 equ
The equ
directive works as a direct text replacement on the assembly source level and is defined as follows. Unlike labels, an equ
must be defined before it can be used.
GlobalEqu equ 1
@StaticEqu equ 2
@@LocalEqu equ 3
There has to be at least one whitespace character before and after equ
. The assembler will replace any occurrence of GlobalEqu
, @StaticEqu
and @@LocalEqu
with 1
, 2
and 3
respectively. Similarly to labels, a global equ
is valid anywhere, a static equ
is only valid in the file it was defined, and a local equ
is only valid in the current section, which is terminated by any global or static label or specific directives. The replacement value can be any sequence of valid tokens. Any usage of the equ name identifier is replaced by the replacement tokens in-place, before any parsing is done. The replacement can therefore also contain partial commands or expressions. For example, this code:
@@StringPointer equ 0x20(r29)
lw a0,@@StringPointer
nop
sw a1,@@StringPointer
will assemble to this:
lw a0,0x20(r29)
nop
sw a1,0x20(r29)
4.5 Expression parser
A standard expression parser with operator precedence and bracket support has been implemented. It is intended to behave exactly like any C/C++ parser and supports all unary, binary and ternary operators of the C language. Every numeral argument can be given as an expression, including label names. However, some directives do not support variable addresses, so labels cannot be used in expressions for them. The following bases are supported:
0xA
and0Ah
for hexadecimal numbers0o12
and12o
for octal numbers1010b
and0b1010
for binary numbers
Everything else is interpreted as a decimal numbers, so a leading zero does not indicate an octal number. Be aware that every number has to actually start with a decimal digit. For example, as FFh
is a perfectly valid label name, you have to write 0FFh
or 0xFF
in this case. Labels, on the other hand, cannot start with a digit.
A few examples:
mov r0,10+0xA+0Ah+0o12+12o+1010b
ldr r1,=ThumbFunction+1
li v0,Structure+(3*StructureSize)
Value types
Three value types are supported: integers, floats and strings. Integers are defined by writing just a number in one of the supported bases. Writing a character surrounded by single quotation marks will also give its Unicode value as an integer. For example, 'a'
is equivalent to 97
.
A float is defined by an integer numerator, followed by a period, followed by the denominator, e.g. 2.5
. Floats can also use a different base prefix; in this case, both the numerator and denominator are evaluated using that base. For example, 11.5
is equivalent to 0xB.8
. Alternatively, a float can also be defined by exponential notation. This is formatted as an integer, followed by the letter e
, followed by (optionally) a plus or minus representing the exponent sign, followed by an integer representing the exponent. For example, 314e-2
is equivalent to 3.14
; 5e3
and 5e+3
are equivalent to 5000.0
.
Strings are defined by text wrapped in double quotation marks (e.g. "text"
). Double quotation marks can be escaped by prefixing them with a backslash (\
). Any backslash not followed by a double quotation mark is kept as-is. If you want to use a backslash at the end of a string, prefix it by another backslash.
For example, to write a double quotation mark followed by a backslash:
.ascii "\"\\"
String concatenation is possible with the +
binary operator. Concatenating integers or floats with a string will convert those integers or floats to a string representation.
Strings can also be compared to other strings using the standard comparison operators or compared to numbers using the ==
and !=
operators.
Built-in functions
Below is a table of functions built into the assembler that can be used with the expression parser for runtime computation.
Function | Description |
---|---|
version() | armips version encoded as int, e.g. armips v3.4.5 returns 3045 (3*1000 + 4*10 + 5 ) |
endianness() | current endianness as string, e.g. "big" or "little" |
outputname() | currently opened output filename, exactly as written in .create or .open directive |
org() | current memory address (like . ) |
org(label) | memory address of label |
orga() | current absolute file address |
orga(label) | absolute file address of label |
headersize() | current header size (displacement of memory address against absolute file address) |
headersize(label) | header size of label (displacement of memory address against absolute file address) |
defined(symbol) | 1 if symbol is a defined symbol, 0 otherwise |
fileexists(file) | 1 if file exists, 0 otherwise |
filesize(file) | size of file in bytes |
tostring(val) | string representation of int or float val |
tohex(val, optional digits = 8) | hex string representaion of int val |
round(val) | float val rounded to nearest int |
int(val) | cast float val to int, dropping fractional part |
float(val) | cast int val to float |
frac(val) | fractional part of float val |
abs(val) | absolute value of int or float val |
hi(val) | High half of 32-bit value val , adjusted for sign extension of low half (only available in MIPS) |
lo(val) | Sign-extended low half of 32-bit value val (only available in MIPS) |
min(a, b, ...) | minimum of int or float parameters a , b , ...; result type is int if all parameters are int, float otherwise |
max(a, b, ...) | maximum of int or float parameters a , b , ...; result type is int if all parameters are int, float otherwise |
strlen(str) | number of characters in str |
substr(str, start, count) | substring of str from start , length count |
regex_match(source, regex) | 1 if regex matched entire source , 0 otherwise |
regex_search(source, regex) | 1 if regex matched subsequence of source , 0 otherwise |
regex_extract(source, regex, optional index = 0) | string of regex matched in source |
find(source, substr, optional start = 0) | lowest index of substr in source from start , else -1 |
rfind(source, substr, optional start = -1) | highest index of substr in source from start , else -1 |
readbyte(file, optional pos = 0) | read unsigned 8-bit value from file at position pos |
readu8(file, optional pos = 0) | read unsigned 8-bit value from file at position pos |
readu16(file, optional pos = 0) | read unsigned 16-bit value from file at position pos |
readu32(file, optional pos = 0) | read unsigned 32-bit value from file at position pos |
readu64(file, optional pos = 0) | read unsigned 64-bit value from file at position pos |
reads8(file, optional pos = 0) | read signed 8-bit value from file at position pos |
reads16(file, optional pos = 0) | read signed 16-bit value from file at position pos |
reads32(file, optional pos = 0) | read signed 32-bit value from file at position pos |
reads64(file, optional pos = 0) | read signed 64-bit value from file at position pos |
readascii(file, optional start = 0, optional len = 0) | read ASCII string from file at start length len until null terminator |
isarm() | 1 if in ARM mode, 0 otherwise (only available in ARM/THUMB) |
isthumb() | 1 if in THUMB mode, 0 otherwise (only available in ARM/THUMB) |
User defined functions
It is possible to define additional expression functions. These can contain any number of parameters, but their content is limited in scope to a single arbitrarily long expression. This expression can contain calls to other functions or user defined functions - including recursive calls to the function itself. User defined functions are defined as follows:
.expfunc name(parameters), content
name
is the name of the function and must be unique. parameters
has to be a comma separated list of one or more identifiers. content
has to be a single expression and encodes the return value of the function. The result of evaluating content
is returned to the caller.
As an example, the following will define a function to calculate a fibonacci number and then use it to print the 10th such number:
.expfunc fib(n), n <= 2 ? 1 : fib(n-1)+fib(n-2)
.notice "The 10th fibonacci number is " + fib(10)
The ternary operator can be useful to chain various conditions in a single expression.
4.6 Load delay detection
This feature is still unfinished and experimental. It works in most cases, though. On certain MIPS platforms (most notably the PlayStation 1), any load is asynchronously delayed by one cycle and the CPU won't stall if you attempt to use it before. Attempts to use it will return the old value on an actual system (emulators usually do not emulate this, which makes spotting these mistakes even more difficult). Therefore, the assembler will attempt to detect when such a case happens. The following code would result in a warning:
lw a0,0x10(r29)
lbu a1,(a0)
This code doesn't take the load delay into account and will therefore only work on emulators. The assembler detects it and warns the user. In order to work correctly, the code should look like this:
lw a0,0x10(r29)
nop
lbu a1,(a0)
The assembler can optionally automatically insert a nop
when it detects such an issue. This can be enabled with the .fixloaddelay
directive.
However, as there is no control flow analysis, there is a chance of false positives. For example, a branch delay slot may cause a warning for the opcode that follows it, even if there is no chance that they will be executed sequentially. The following example illustrates this:
bnez a0,@@branch1
nop
j @@branch2
lw a0,(a1)
@@branch1:
lbu a2,(a0)
You can fix the false warning by using the .resetdelay
directive before the last instruction.
bnez a0,@@branch1
nop
j @@branch2
lw a0,(a1)
.resetdelay
@@branch1:
lbu a2,(a0)
4.7 String encoding
You can write ASCII text by simply using the .db
/.ascii
directive followed by the string to write. Using .asciiz
will insert a zero byte after the string.
You can also write text with custom encodings. In order to do that, you first have to load a table using .loadtable <tablefile>
, and then use the .string
directive to write the text. It behaves exactly like the .db
instruction (so you can also specify immediate values as arguments), with the exception that it uses the table to encode the text, and appends a termination sequence after the last argument. This has to be specified inside the table, otherwise 0 is used. The termination sequence can also be omitted with .stringn
.
.loadtable "custom.tbl"
.string "Custom text",0xA,"and more."
The first and third arguments ("Custom test"
, "and more."
) are encoded according to the table, while the second one (0xA
) is written as-is.
4.8 Areas
If you overwrite existing data, it is critical that you don't overwrite too much. The area directive will take care of checking if all the data is within a given space. In order to do that, you just have to specify the maximum size allowed.
.area 10h
.word 1,2,3,4,5
.endarea
This would cause an error on assembling, because the word directive takes up 20 bytes instead of the 16 that the area is allowed to have. This, on the other hand, would assemble without problems:
.org 8000000h
.area 8000020h-.
.word 1,2,3,4,5
.endarea
Here, the area is 32 bytes, which is sufficient for the 20 bytes used by .word.
Optionally, a second parameter can be given. The remaining free size of the area will then be completely filled with bytes of that value. For example, the following code writes 01 02 03 04 05 05 05 05
:
.area 8,5
.byte 1,2,3,4
.endarea
Regions
To help manage allocating new data in existing space, you can use .region
and .autoregion
for armips to automatically find an area with enough space.
.region
uses the same parameters as .area
, but creates a space that shared for future .autoregion
usage. You can still use code in the region, and the remaining space is considered free (with or without fill.)
Example .autoregion
usage:
.org @FreeSpace
.region 0x4000
.byte 0x12, 0x34
.endregion
.autoregion
@TheAnswer:
.byte 42
.endautoregion
Auto region content will be allocated as if it was placed after the content of the region it's allocated to (potentially after other auto regions.)
A shortcut is available for regions without content at a specific location, to quickly define pools. These are equivalent:
.defineregion @FreeSpace,0x4000,0x00
.org @FreeSpace :: .region 0x4000,0x00 :: .endregion
By default, .autoregion
will allocate to any region with sufficient space. It can be limited to a specific range of start addresses if necessary:
.autoregion @TextStart,@TextStart+@TextEnd
@CoverAdvisory:
.asciiz "Don't Panic"
.endautoregion
For example, this might be used to ensure the code is reachable by bl
.
If only the first parameter is given, it will simply require allocation after that virtual address.
Note that after .endautoregion
, the output position will be reset to what it was before the .autoregion
directive.
4.9 Symbol files
Functions.
4.10 C/C++ importer
You can link object files or static libraries in ELF format. The code and data is relocated to the current output position and all of its symbols are exported. You can in turn use armips symbols inside of your compiled code by declaring them as extern
. Note: As armips labels are case insensitive, the exported symbols are treated the same way. Be aware of name mangling when trying to reference C++ functions, and consider declaring them as extern "C"
.
.importobj "code.o"
You can optionally supply names for constructor and destructor functions. Functions with those names will be generated that call of the global constructors/destructors of the imported files.
.importlib "code.a",globalConstructor,globalDestructor
5. Assembler directives
These commands tell the assembler to do various things like opening the output file or opening another source file.
5.1 General directives
Set the architecture
These directives can be used to set the architecture that the following assembly code should be parsed and output for. The architecture can be changed at any time without affecting the preceding code.
Directive | System | Architecture | Comment |
---|---|---|---|
.psx | PlayStation 1 | MIPS R3000 | - |
.ps2 | PlayStation 2 | EmotionEngine | - |
.psp | PlayStation Portable | Allegrex | - |
.n64 | Nintendo 64 | MIPS R4000 | - |
.rsp | Nintendo 64 | RSP | - |
.gba | Game Boy Advance | ARM7 | Defaults to THUMB mode |
.nds | Nintendo DS | ARM9 | Defaults to ARM mode |
.3ds | Nintendo 3DS | ARM11 | Defaults to ARM mode, incomplete |
.arm.big | - | ARM | Output in big endian |
.arm.little | - | ARM | Output in little endian |
.saturn | SEGA Saturn | SuperH | - |
.32x | SEGA 32x | SuperH | Alias to .saturn |
Architecture | Word size (in bits) |
---|---|
MIPS | 32 |
ARM | 32 |
SuperH | 16 |
Open a generic file
.open FileName,HeaderSize
.openfile FileName,HeaderSize
.open OldFileName,NewFileName,HeaderSize
.openfile OldFileName,NewFileName,HeaderSize
Opens the specified file FileName
for output. This directive terminates the scope for local labels and equ
s. If two file names are specified, the assembler will first copy the file from OldFileName
to NewFileName
, then open NewFileName
. In this case, if the copy operation fails, e.g. because the two paths point to the same file, an error is thrown.
HeaderSize
specifies the header size, which is the difference between the first byte of the file and its position in memory. So if file position 0x800 is loaded at position 0x80010000 in memory, the header size is 0x80010000-0x800=0x8000F800. It can be changed later with the .headersize
directive.
If relative include is off, all paths are relative to the current working directory. Otherwise the path is relative to the including assembly file.
Only the changes specified by the assembly code will be inserted, the rest of the file remains untouched.
The following copies the file input.bin
to output.bin
, then opens output.bin
with a header size of 0x8000000
.
.open "input.bin","output.bin",0x8000000
Create a new file
.create FileName,HeaderSize
.createfile FileName,HeaderSize
Creates the specified file for output. If the file already exists, it will be overwritten. This directive terminates the scope for local labels and equ
s.
If relative include is off, all paths are relative to the current working directory. Otherwise the path is relative to the including assembly file.
HeaderSize
specifies the difference between the first byte of the file and its position in memory. So if file position 0x800 is loaded at position 0x80010000 in memory, the header size is 0x80010000-0x800=0x8000F800. It can be changed later with the .headersize
directive.
The following creates and opens the file output.bin
with a header size of 0x8000000
.
.create "output.bin",0x8000000
Close a file
.close
.closefile
Closes the currently opened output file. This directive terminates the scope for local labels and equ
s.
Set the output position
.org RamAddress
org RamAddress
.orga FileAddress
orga FileAddress
Sets the output pointer to the specified address. .org
/org
specifies a memory address, which is automatically converted to the file address for the current output file. .orga
/orga
directly specifies the absolute file address. This directive terminates the scope for local labels and equ
s.
Change the header size
.headersize HeaderSize
Sets the header size to the given value which is the difference between the file position of a byte and its address in memory. This is used to calculate all addresses up until the next .headersize
or .open
/.create
directive. The current memory address will be updated, but the absolute file offset will remain the same. The header size can be negative so long as the resulting memory address remains positive.
Include another assembly file
.include FileName[,encoding]
Opens the file called FileName
to assemble its content. If relative include is off, all paths are relative to the current working directory. Otherwise the path is relative to the including assembly file. You can include other files up to a depth level of 64. This limit was added to prevent the assembler from getting stuck in an infinite loop due to two files including each other recursively. If the included file has an Unicode Byte Order Mark then the encoding will be automatically detected. If no Byte Order Mark is present it will default to UTF-8. This can be overwritten by manually specifying the file encoding as a second parameter.
The following values are supported:
SJIS
/Shift-JIS
UTF8
/UTF-8
UTF16
/UTF-16
UTF16-BE
/UTF-16-BE
ASCII
Text and data directives
Align the output position
.align [num[,value]]
.aligna [num[,value]]
Writes bytes of value
into the output file until the memory position is a multiple of num
. num
has to be a power of two. If num
isn't specified, then the alignment will be 4. If value
isn't specified, zeros are inserted. Only the lowest 8 bits of value
are inserted. .align
aligns the memory address (i.e. org()
), whereas .aligna
aligns the file address (i.e. orga()
).
Fill space with a value
.fill length[,value]
defs length[,value]
Inserts length
amount of bytes of value
. If value
isn't specified, zeros are inserted. Only the lowest 8 bits of value
are inserted.
Skip bytes
.skip length
Skips length
amount of bytes without overwriting them. This is equivalent to .org .+length
.
Include a binary file
.incbin FileName[,start[,size]]
.import FileName[,start[,size]]
Inserts the file specified by FileName
into the currently opened output file. If relative include is off, all paths are relative to the current working directory. Otherwise the path is relative to the including assembly file. Optionally, start
can specify the start position in the file from it should be imported, and size
can specify the number of bytes to read.
Write bytes
.byte value[,...]
.db value[,...]
.dcb value[,...]
.d8 value[,...]
.ascii value[,...]
.asciiz value[,...]
db value[,...]
dcb value[,...]
Inserts the specified sequence of bytes. Each parameter can be any expression that evaluates to an integer or a string. If it evaluates to an integer or float, only the lowest 8 bits are inserted. If it evaluates to a string, every character is inserted as a byte using ASCII encoding.
Write 16/32/64-bit integers
.d16 value[,...]
.d32 value[,...]
.d64 value[,...]
Inserts the specified sequence of 16, 32, or 64-bit integers, depending on the directive used. If it evaluates to an integer or float, only the lowest 16/32/64 bits are inserted. If it evaluates to a string, every character is inserted as a 16/32/64-bit values using ASCII encoding.
Write halfwords
.halfword value[,...]
.hword value[,...]
.dh value[,...]
.dcw value[,...]
dh value[,...]
dcw value[,...]
Inserts the specified sequence of halfwords. The size of a halfword is defined by architecture's word size. Each parameter can be any expression that evaluates to an integer or a string. If it evaluates to an integer, only the lowest 16/8 bits are inserted. If it evaluates to a string, every character is inserted as a halfword using ASCII encoding.
If No$gba semantics are enabled, then dh
and .dh
are treated as invalid directives and will return an error.
Write words
.word value[,...]
.dw value[,...]
.dcd value[,...]
dw value[,...]
dcd value[,...]
Inserts the specified sequence of words. The size of a word is defined by architecture. Each parameter can be any expression that evaluates to an integer, a string, or a floating point number. If it evaluates to an integer, only the lowest 32/16 bits are inserted. If it evaluates to a string, every character is inserted as a word using ASCII encoding. Floats are inserted using an integer representation of the single-precision float's encoding.
If No$gba semantics are enabled, then dw
and .dw
are treated as inserting 16-bit values instead (i.e. equivalent to .d16
).
Write doublewords
.doubleword value[,...]
.dword value[,...]
.dd value[,...]
.dcq value[,...]
dd value[,...]
dcq value[,...]
Inserts the specified sequence of doublewords. The size of a doubleword is defined by architecture's word size. Each parameter can be any expression that evaluates to an integer, a string, or a floating point number. If it evaluates to a string, every character is inserted as a doubleword using ASCII encoding. Floats are inserted using an integer representation of the double-precision float's encoding.
If No$gba semantics are enabled, then dd
and .dd
are treated as inserting 32-bit values instead (i.e. equivalent to .d32
).
Write floating point numbers
.float value[,...]
.double value[,...]
.float
inserts the specified sequence of single-precision floats and .double
inserts double-precision floats. Each parameter can be any expression that evaluates to an integer or a floating point number. If it evaluates to an integer, it will be converted to a floating point number of that value.
Load a table specifying a custom encoding
.loadtable TableName[,encoding]
.table TableName[,encoding]
Loads TableName
for using it with the .string
directive. The encoding can be specified in the same way as for .include
.
The table file format is a line-separated list of key values specified by hexbyte=string
and optional termination byte sequence by /hexbytes
02=a
1D=the
2F=you
/FF
FF
will be used as the termination sequence. If it is not given, zero is used instead. Strings are matched using the longest prefix found in the table.
Write text with custom encoding
.string "String"[,...]
.stringn "String"[,...]
.str "String"[,...]
.strn "String"[,...]
Inserts the given string using the encoding from the currently loaded table. .string
and .str
insert the termination sequence specified by the table after the string, while .stringn
and .strn
omit it.
Write text with Shift-JIS encoding
.sjis "String"[,...]
.sjisn "String"[,...]
Inserts the given string using the Shift-JIS encoding. .sjis
inserts a null byte after the string, while .sjisn
omits it.
Conditional directives
Begin a conditional block
.if cond
.ifdef symbol
.ifndef symbol
The content of a conditional block will only be used if the condition is met. In the case of .if
, it is met if cond
evaluates to a non-zero integer. .ifdef
is met if the given symbol (such as a label) is defined anywhere in the code, and .ifndef
if it is not.
Else case of a conditional block
.else
.elseif cond
.elseifdef symbol
.elseifndef symbol
The else block is used if the condition of the condition of the if block was not met. .else
unconditionally inserts the content of the else block, while the others start a new if block and work as described before.
End a conditional block
.endif
Ends the last open if or else block.
Define labels
.definelabel Label,value
Defines Label
with a given value, creating a symbol for it. This can be used similar to equ
, but symbols can be used before labels are defined and can be used in conjunction with the .ifdef/.ifndef
conditionals. These can also be useful for declaring symbols for existing code and data when inserting new code.
Unlike Label:
, note that .definelabel Label,value
is evaluated only once, thus using any expressions that refer to the current state of the assembler (e.g. org()
, .
) in combination with .definelabel
leads to undefined behavior.
Function labels
.func Label
.function Label
Creates a symbol Label
with the current memory address as its value. This is equivalent to Label:
. However, used in conjunction with the -sym2
command line flag, the size of the function block will also be written to the symfile along with its memory location. A function block must be terminated with .endfunc/.endfunction
. This is also implicitly invoked when starting another function block.
.func Function1
; assembly code
.func Function2
; assembly code
.endfunc
Areas
.area SizeEquation[,fill]
.endarea
Opens a new area with the maximum size of SizeEquation
. If the data inside the area is longer than this maximum size, the assembler will output an error and refuse to assemble the code. The area is closed with the .endarea
directive and if the fill
parameter is provided, the remaining free space in the area will be filled with bytes of that value.
Messages
.warning "Message"
.error "Message"
.notice "Message"
Prints the message and sets warning/error flags. Useful with conditionals.
Error on warning
.erroronwarning on
.erroronwarning off
By specifying .erroronwarning on
, any warnings emitted by the assembler will be promoted to errors. Errors cause armips to abort the assembly process return a nonzero exit code. This property can also be enabled from the command line with the -erroronwarning
flag, and can be turned off again with .erroronwarning off
. By default, this feature is off.
Relative paths
.relativeinclude on
.relativeinclude off
By default, any paths used in assembly files (such as for .open
, .include
, etc.) are treated as relative to the current working directory. By specifying .relativeinclude on
, any paths specified after it will instead be treated as relative to the path of the current assembly file that uses the path. This can be turned off again with .relativeinclude off
. By default, this feature is off.
No$gba semantics
.nocash on
.nocash off
By specifying .nocash on
, No$gba semantics will be enabled for data directives. This has the effect that dh
/.dh
will fail, dw
/.dw
will write 16-bit halfwords instead of 32-bit words, and dd
/.dd
will write 32-bit words instead of 64-bit doublewords. It can be turned off again with .nocash off
. By default, this feature is off.
Enable/disable symfile writing
.sym on
.sym off
By specifying .sym off
, any symbols (e.g. labels) defined after it will not be written to the symfile (if specified with the -sym
/-sym2
command line flag). This can be useful when using labels to define enum values that should not be interpreted as memory addresses. Writing to the symfile can be enabled again with .sym on
. By default, this feature is on.
5.2 MIPS directives
Load delay
.resetdelay
Resets the current load delay status. This can be useful if the instruction after a delay slot access the delayed register, as the assembler can't detect that yet.
.fixloaddelay
Automatically fixes any load delay problems by inserting a nop
between the instructions. Best used in combination with .resetdelay
.
.loadelf name[,outputname]
Opens the specified ELF file for output. If two file names are specified, then the assembler will copy the first file to the second path. If relative include is off, all paths are relative to the current working directory, so from where the assembler was called. Otherwise the path is relative to the including assembly file. All segments are accessible by their virtual addresses, and all unmapped sections can be accessed by their physical position (through .orga
).
Currently this is only supported for the PSP architecture, and only for non-relocateable files. The internal structure of the file may be changed during the process, but this should not affect its behavior.
5.3 ARM Directives
Change instruction set
.arm
.thumb
These directives can be used to select the ARM or THUMB instruction set. .arm
tells the assembler to use the full 32 bit ARM instruction set, while .thumb
uses the cut-down 16 bit THUMB instruction set.
Pools
.pool
This directive works together with the pseudo opcode ldr rx,=value
. The immediate is added to the nearest pool that follows it, and the instruction is turned into a PC relative load. The range is limited, so you may have to define several pools.
Example:
ldr r0,=0xFFEEDDCC
; ...
.pool
.pool
will automatically align the memory position to a multiple of 4 before writing the pool.
Debug messages
.msg
Inserts a no$gba debug message as described by GBATEK.
6. Macros
6.1 Assembler-defined MIPS macros
There are various macros built into the assembler for ease of use. They are intended to make using some of the assembly simpler and faster. At the moment, these are all the MIPS macros included:
Immediate macros
li reg,Immediate
la reg,Immediate
Loads Immediate into the specified register by using a combination of lui
/ori
, a simple addiu
, or a simple ori
, depending on the value of the Immediate.
Immediate float macros
li.s reg,Immediate
Loads float value Immediate into the specified FP register by using a combination of li
and mtc1
.
Memory macros
lb reg,Address
lbu reg,Address
lh reg,Address
lhu reg,Address
lw reg,Address
lwu reg,Address
ld reg,Address
lwc1 reg,Address
lwc2 reg,Address
ldc1 reg,Address
ldc2 reg,Address
Loads a byte/halfword/word from the given address into the specified register by using a combination of lui
and lb
/lbu
/lh
/lhu
/lw
/ld
/lwc1
/lwc2
/ldc1
/ldc2
.
ulh destreg,imm(sourcereg)
ulh destreg,(sourcereg)
ulhu destreg,imm(sourcereg)
ulhu destreg,(sourcereg)
ulw destreg,imm(sourcereg)
ulw destreg,(sourcereg)
uld destreg,imm(sourcereg)
uld destreg,(sourcereg)
Loads an unaligned halfword/word/doubleword from the address in sourcereg by using a combination of several lb
/lbu
and ori
or lwl
/lwr
or ldl
/ldr
instructions.
sb reg,Address
sh reg,Address
sw reg,Address
sd reg,Address
swc1 reg,Address
swc2 reg,Address
sdc1 reg,Address
sdc2 reg,Address
Stores a byte/halfword/word/doubleword to the given address by using a combination of lui
and sb
/sh
/sw
/sd
/swc1
/swc2
/sdc1
/sdc2
.
ush destreg,imm(sourcereg)
ush destreg,(sourcereg)
usw destreg,imm(sourcereg)
usw destreg,(sourcereg)
usd destreg,imm(sourcereg)
usd destreg,(sourcereg)
Stores an unaligned halfword/word/doubleword to the address in sourcereg using a combination of several sb
/sbu
and shifts or swl
/swr
/sdl
/sdr
instructions.
Branch macros
blt reg1,reg2,Dest
bltu reg1,reg2,Dest
bgt reg1,reg2,Dest
bgtu reg1,reg2,Dest
bge reg1,reg2,Dest
bgeu reg1,reg2,Dest
ble reg1,reg2,Dest
bleu reg1,reg2,Dest
bltl reg1,reg2,Dest
bltul reg1,reg2,Dest
bgtl reg1,reg2,Dest
bgtul reg1,reg2,Dest
bgel reg1,reg2,Dest
bgeul reg1,reg2,Dest
blel reg1,reg2,Dest
bleul reg1,reg2,Dest
blt reg,Imm,Dest
bltu reg,Imm,Dest
bgt reg,Imm,Dest
bgtu reg,Imm,Dest
bge reg,Imm,Dest
bgeu reg,Imm,Dest
ble reg,Imm,Dest
bleu reg,Imm,Dest
bne reg,Imm,Dest
beq reg,Imm,Dest
bltl reg,Imm,Dest
bltul reg,Imm,Dest
bgtl reg,Imm,Dest
bgtul reg,Imm,Dest
bgel reg,Imm,Dest
bgeul reg,Imm,Dest
blel reg,Imm,Dest
bleul reg,Imm,Dest
bnel reg,Imm,Dest
beql reg,Imm,Dest
If reg/reg1 is less than/greater than or equal to/equal to/not equal to reg2/Imm, branches to the given address. A combination of sltu
and beq
/bne
or li
, sltu
and beq
/bne
is used.
Set macros
slt reg1,reg2,Imm
sltu reg1,reg2,Imm
sgt reg1,reg2,Imm
sgtu reg1,reg2,Imm
sge reg1,reg2,Imm
sgeu reg1,reg2,Imm
sle reg1,reg2,Imm
sleu reg1,reg2,Imm
sne reg1,reg2,Imm
seq reg1,reg2,Imm
sge reg1,reg2,reg3
sgeu reg1,reg2,reg3
sle reg1,reg2,reg3
sleu reg1,reg2,reg3
sne reg1,reg2,reg3
seq reg1,reg2,reg3
If reg2 is less than/greater than or equal to/equal to/not equal to reg3/Imm, sets reg1 to 1
, otherwise sets reg1 to 0
. Various combinations of li
, slt
/sltu
/slti
/sltiu
and xor
/xori
are used.
Rotate macros
rol reg1,reg2,reg3
ror reg1,reg2,reg3
rol reg1,reg2,Imm
ror reg1,reg2,Imm
Rotates reg2 left/right by the value of the lower 5 bits of reg3/Imm and stores the result in reg1. A combination of sll
, srl
and or
is used.
Absolute value macros
abs reg1,reg2
dabs reg1,reg2
Stores absolute value of word/doubleword in reg2 into reg1 using a combination of sra
/dsra32
, xor
, and subu
/dsubu
.
Upper/lower versions
Additionally, there are upper and lower versions for many two opcode macros. They have the same names and parameters as the normal versions, but .u
or .l
is appended at the end of the name.
For example, li.u
will output the upper half of the li
macro, and li.l
will output the lower half. The following macros support this: li
,la
,lb
,lbu
,lh
,lhu
,lw
,lwu
,ld
,lwc1
,lwc2
,ldc1
,ldc2
,sb
,sh
,sw
,sd
,swc1
,swc2
,sdc1
,sdc2
This can be used when the two halves of the macros need to be used in nonconsecutive positions, for example:
li.u a0,address
jal function
li.l a0,address
6.2 Assembler-defined ARM macros
The assembler will automatically convert the arguments between the following opcodes if possible:
mov <-> mvn
bic <-> and
cmp <-> cmn
E.g., mov r0,-1
will be assembled as mvn r0,0
Additionally, ldr rx,=immediate
can be used to load a 32-bit immediate. The assembler will try to convert it into a mov/mvn instruction if possible. Otherwise, it will be stored in the nearest pool (see the .pool directive). add rx,=immediate
can be used as a PC-relative add and will be assembled as add rx,r15,(immediate-.-8)
6.3 User defined macros
The assembler allows the creation of custom macros. This is an example macro, a recreation of the builtin MIPS macro li
:
.macro myli,dest,value
.if value & ~0xFFFF
ori dest,r0,value
.elseif (value & 0xFFFF8000) == 0xFFFF8000
addiu dest,r0,value & 0xFFFF
.elseif (value & 0xFFFF) == 0
lui dest,value >> 16
.else
lui dest,value >> 16 + (value & 0x8000 != 0)
addiu dest,dest,value & 0xFFFF
.endif
.endmacro
The macro has to be initiated by a .macro
directive. The first argument is the macro name, followed by a variable amount of arguments. The code inside the macro can be anything, and it can even call other macros (up to a nesting level of 128 calls). The macro is terminated by a .endmacro
directive. It is not assembled when it is defined, but other code can call it from then on. All arguments are simple text replacements, so they can be anything from a number to a whole instruction parameter list. The macro is then invoked like this:
myli a0,0xFFEEDDCC
In this case, the code will assemble to the following:
lui a0,0xFFEF
addiu a0,a0,0xDDCC
Like all the other code, any equs are inserted before they are resolved.
Macros can also contain global, static and local labels that are changed to an unique name. The label name is prefixed by the macro name and a counter is appended. This label:
.macro Test
@@MainLoop:
.endmacro
will therefore be changed to the following (note that label names are case insensitive):
@@test_mainloop_00000000:
Each call of the macro will increase the counter. The counter is output as a hexadecimal number, e.g. the eleventh call of the test
macro will create a label named:
@@test_mainloop_000000a
Static labels defined inside a (nested) macro are treated as if they were defined inside the file that called the macro.
It is possible to pass an as-of-yet undefined symbol identifier to a macro and define the symbol as a label inside the macro. For example, the following:
.macro function,name
.align 4
name:
.endmacro
.org 0x2000002
function Main
will align the memory address to a multiple of 4, then create a label named Main
, which will have value 0x2000004
as a result.
7. Meta
7.1 Change log
- Version 0.11
- new
.aligna
directive for absolute address alignment - new expression functions:
org(label)
,orga(label)
,headersize(label)
- new expression functions:
min
andmax
- fixed major bug in MIPS LO/HI ELF symbol relocation
- COP2, TLB*, RFE instructions added to PSX
- fixed output of RSP VMOV/VRSQ*/VRCP* instructions
- RSP CTC2/CFC2 control register name support added
- fixed edge case bugs in ARM shift handling
- new
-definelabel
command line argument for defining label -equ
command line option now normalizes the case of the name- additional validations of command line arguments
- relativeinclude settings now respected in table directives
- fixed bugs in float exponential notation parsing
- fixed NaN and string comparisons with
<
,<=
,>
,>=
- negative initial header sizes now allowed (with warnings)
- correct line and column numbers for equ invocations
- other bugfixes and enhancements
- new
- Version 0.10
- many bugfixes and enhancements
- several new MIPS macros and pseudo-ops
- improved command argument handling, allows the input file argument to be after flag arguments and detects errors better
- C-style block comments supported
- expression values are now signed
- 64-bit data defines now written in generated symbol files, same command as 32-bit for compatibility with current emulators
- ELF relocator now checks object file machine and endianness before linking
- new directives:
.asciiz
,.skip
- new expression functions:
hi
(MIPS only),lo
(MIPS only),reads{8,16,32,64}
- float division by zero in expression now has standard float behaviour (returns
±∞
orNaN
), while integer divisions by zero returns dummy value-1
- exponential notation for floats supported
- Version 0.9
- huge rewrite with many enhancements and fixes
- can now read from UTF8, UTF16, and Shift-JIS files and convert the input correctly
- several new MIPS pseudo-ops, new COP0 and FPU control register types
- Nintendo 64 CPU + RSP support
- PSP support, load ELFs with
.loadelf
- able to import and relocate static C/C++ libraries
- new
-sym2
format for use with PPSSPP and PCSX2 - new directives:
.sym
,.stringn
,.sjis
,.sjisn
,.function
,.endfunction
,.importlib
,.loadelf
,.float
,.dd
,.double
- removed directives:
.ifarm
,.ifthumb
,.radix
- added support for floats in data directives
- added expression functions
- variable expressions supported in
.org
/.orga
/.headersize
- new statement syntax with
::
as separator and\
as line continuation.
- Version 0.7d
- added automatic optimizations for several ARM opcodes
- many bugfixes and internal changes
- added static labels
- new directives:
.warning
,.error
,.notice
,.relativeinclude
,.erroronwarning
,.ifarm
,.ifthumb
- quotation marks can now be escaped in strings using
\"
.
- Version 0.7c
- Macros can now contain unique local labels
.area
directive added- countless bugfixes
- no$gba debug message support
- full no$gba sym support
- Version 0.7b
- ARM/THUMB support
- fixed break/syscall MIPS opcodes
- added check if a MIPS instruction is valid inside a delay slot
- fixed and extended base detection
- added
.
dummy label to the math parser to get the current memory address - added
dcb
/dcw
/dcd
directives
- Version 0.5b
- Initial release
7.2 Migration from older versions
There are several changes after version 0.7d that may break compatibility with code written for older versions. These are as follows:
- String literals now require quotation marks, e.g. for file names
$XX
is no longer supported for hexadecimal literals
7.3 License
MIT Copyright (c) 2009-2020 Kingcom: LICENSE.txt