1333 lines
49 KiB
Plaintext
1333 lines
49 KiB
Plaintext
This is Info file gcc.info, produced by Makeinfo version 1.68 from the
|
||
input file ../../gcc-2.95.2/gcc/gcc.texi.
|
||
|
||
INFO-DIR-SECTION Programming
|
||
START-INFO-DIR-ENTRY
|
||
* gcc: (gcc). The GNU Compiler Collection.
|
||
END-INFO-DIR-ENTRY
|
||
This file documents the use and the internals of the GNU compiler.
|
||
|
||
Published by the Free Software Foundation 59 Temple Place - Suite 330
|
||
Boston, MA 02111-1307 USA
|
||
|
||
Copyright (C) 1988, 1989, 1992, 1993, 1994, 1995, 1996, 1997, 1998,
|
||
1999 Free Software Foundation, Inc.
|
||
|
||
Permission is granted to make and distribute verbatim copies of this
|
||
manual provided the copyright notice and this permission notice are
|
||
preserved on all copies.
|
||
|
||
Permission is granted to copy and distribute modified versions of
|
||
this manual under the conditions for verbatim copying, provided also
|
||
that the sections entitled "GNU General Public License" and "Funding
|
||
for Free Software" are included exactly as in the original, and
|
||
provided that the entire resulting derived work is distributed under
|
||
the terms of a permission notice identical to this one.
|
||
|
||
Permission is granted to copy and distribute translations of this
|
||
manual into another language, under the above conditions for modified
|
||
versions, except that the sections entitled "GNU General Public
|
||
License" and "Funding for Free Software", and this permission notice,
|
||
may be included in translations approved by the Free Software Foundation
|
||
instead of in the original English.
|
||
|
||
|
||
File: gcc.info, Node: Machine Constraints, Next: No Constraints, Prev: Modifiers, Up: Constraints
|
||
|
||
Constraints for Particular Machines
|
||
-----------------------------------
|
||
|
||
Whenever possible, you should use the general-purpose constraint
|
||
letters in `asm' arguments, since they will convey meaning more readily
|
||
to people reading your code. Failing that, use the constraint letters
|
||
that usually have very similar meanings across architectures. The most
|
||
commonly used constraints are `m' and `r' (for memory and
|
||
general-purpose registers respectively; *note Simple Constraints::.),
|
||
and `I', usually the letter indicating the most common
|
||
immediate-constant format.
|
||
|
||
For each machine architecture, the `config/MACHINE.h' file defines
|
||
additional constraints. These constraints are used by the compiler
|
||
itself for instruction generation, as well as for `asm' statements;
|
||
therefore, some of the constraints are not particularly interesting for
|
||
`asm'. The constraints are defined through these macros:
|
||
|
||
`REG_CLASS_FROM_LETTER'
|
||
Register class constraints (usually lower case).
|
||
|
||
`CONST_OK_FOR_LETTER_P'
|
||
Immediate constant constraints, for non-floating point constants of
|
||
word size or smaller precision (usually upper case).
|
||
|
||
`CONST_DOUBLE_OK_FOR_LETTER_P'
|
||
Immediate constant constraints, for all floating point constants
|
||
and for constants of greater than word size precision (usually
|
||
upper case).
|
||
|
||
`EXTRA_CONSTRAINT'
|
||
Special cases of registers or memory. This macro is not required,
|
||
and is only defined for some machines.
|
||
|
||
Inspecting these macro definitions in the compiler source for your
|
||
machine is the best way to be certain you have the right constraints.
|
||
However, here is a summary of the machine-dependent constraints
|
||
available on some particular machines.
|
||
|
||
*ARM family--`arm.h'*
|
||
|
||
`f'
|
||
Floating-point register
|
||
|
||
`F'
|
||
One of the floating-point constants 0.0, 0.5, 1.0, 2.0, 3.0,
|
||
4.0, 5.0 or 10.0
|
||
|
||
`G'
|
||
Floating-point constant that would satisfy the constraint `F'
|
||
if it were negated
|
||
|
||
`I'
|
||
Integer that is valid as an immediate operand in a data
|
||
processing instruction. That is, an integer in the range 0
|
||
to 255 rotated by a multiple of 2
|
||
|
||
`J'
|
||
Integer in the range -4095 to 4095
|
||
|
||
`K'
|
||
Integer that satisfies constraint `I' when inverted (ones
|
||
complement)
|
||
|
||
`L'
|
||
Integer that satisfies constraint `I' when negated (twos
|
||
complement)
|
||
|
||
`M'
|
||
Integer in the range 0 to 32
|
||
|
||
`Q'
|
||
A memory reference where the exact address is in a single
|
||
register (``m'' is preferable for `asm' statements)
|
||
|
||
`R'
|
||
An item in the constant pool
|
||
|
||
`S'
|
||
A symbol in the text segment of the current file
|
||
|
||
*AMD 29000 family--`a29k.h'*
|
||
|
||
`l'
|
||
Local register 0
|
||
|
||
`b'
|
||
Byte Pointer (`BP') register
|
||
|
||
`q'
|
||
`Q' register
|
||
|
||
`h'
|
||
Special purpose register
|
||
|
||
`A'
|
||
First accumulator register
|
||
|
||
`a'
|
||
Other accumulator register
|
||
|
||
`f'
|
||
Floating point register
|
||
|
||
`I'
|
||
Constant greater than 0, less than 0x100
|
||
|
||
`J'
|
||
Constant greater than 0, less than 0x10000
|
||
|
||
`K'
|
||
Constant whose high 24 bits are on (1)
|
||
|
||
`L'
|
||
16 bit constant whose high 8 bits are on (1)
|
||
|
||
`M'
|
||
32 bit constant whose high 16 bits are on (1)
|
||
|
||
`N'
|
||
32 bit negative constant that fits in 8 bits
|
||
|
||
`O'
|
||
The constant 0x80000000 or, on the 29050, any 32 bit constant
|
||
whose low 16 bits are 0.
|
||
|
||
`P'
|
||
16 bit negative constant that fits in 8 bits
|
||
|
||
`G'
|
||
`H'
|
||
A floating point constant (in `asm' statements, use the
|
||
machine independent `E' or `F' instead)
|
||
|
||
*IBM RS6000--`rs6000.h'*
|
||
|
||
`b'
|
||
Address base register
|
||
|
||
`f'
|
||
Floating point register
|
||
|
||
`h'
|
||
`MQ', `CTR', or `LINK' register
|
||
|
||
`q'
|
||
`MQ' register
|
||
|
||
`c'
|
||
`CTR' register
|
||
|
||
`l'
|
||
`LINK' register
|
||
|
||
`x'
|
||
`CR' register (condition register) number 0
|
||
|
||
`y'
|
||
`CR' register (condition register)
|
||
|
||
`z'
|
||
`FPMEM' stack memory for FPR-GPR transfers
|
||
|
||
`I'
|
||
Signed 16 bit constant
|
||
|
||
`J'
|
||
Constant whose low 16 bits are 0
|
||
|
||
`K'
|
||
Constant whose high 16 bits are 0
|
||
|
||
`L'
|
||
Constant suitable as a mask operand
|
||
|
||
`M'
|
||
Constant larger than 31
|
||
|
||
`N'
|
||
Exact power of 2
|
||
|
||
`O'
|
||
Zero
|
||
|
||
`P'
|
||
Constant whose negation is a signed 16 bit constant
|
||
|
||
`G'
|
||
Floating point constant that can be loaded into a register
|
||
with one instruction per word
|
||
|
||
`Q'
|
||
Memory operand that is an offset from a register (`m' is
|
||
preferable for `asm' statements)
|
||
|
||
`R'
|
||
AIX TOC entry
|
||
|
||
`S'
|
||
Constant suitable as a 64-bit mask operand
|
||
|
||
`U'
|
||
System V Release 4 small data area reference
|
||
|
||
*Intel 386--`i386.h'*
|
||
|
||
`q'
|
||
`a', `b', `c', or `d' register
|
||
|
||
`A'
|
||
`a', or `d' register (for 64-bit ints)
|
||
|
||
`f'
|
||
Floating point register
|
||
|
||
`t'
|
||
First (top of stack) floating point register
|
||
|
||
`u'
|
||
Second floating point register
|
||
|
||
`a'
|
||
`a' register
|
||
|
||
`b'
|
||
`b' register
|
||
|
||
`c'
|
||
`c' register
|
||
|
||
`d'
|
||
`d' register
|
||
|
||
`D'
|
||
`di' register
|
||
|
||
`S'
|
||
`si' register
|
||
|
||
`I'
|
||
Constant in range 0 to 31 (for 32 bit shifts)
|
||
|
||
`J'
|
||
Constant in range 0 to 63 (for 64 bit shifts)
|
||
|
||
`K'
|
||
`0xff'
|
||
|
||
`L'
|
||
`0xffff'
|
||
|
||
`M'
|
||
0, 1, 2, or 3 (shifts for `lea' instruction)
|
||
|
||
`N'
|
||
Constant in range 0 to 255 (for `out' instruction)
|
||
|
||
`G'
|
||
Standard 80387 floating point constant
|
||
|
||
*Intel 960--`i960.h'*
|
||
|
||
`f'
|
||
Floating point register (`fp0' to `fp3')
|
||
|
||
`l'
|
||
Local register (`r0' to `r15')
|
||
|
||
`b'
|
||
Global register (`g0' to `g15')
|
||
|
||
`d'
|
||
Any local or global register
|
||
|
||
`I'
|
||
Integers from 0 to 31
|
||
|
||
`J'
|
||
0
|
||
|
||
`K'
|
||
Integers from -31 to 0
|
||
|
||
`G'
|
||
Floating point 0
|
||
|
||
`H'
|
||
Floating point 1
|
||
|
||
*MIPS--`mips.h'*
|
||
|
||
`d'
|
||
General-purpose integer register
|
||
|
||
`f'
|
||
Floating-point register (if available)
|
||
|
||
`h'
|
||
`Hi' register
|
||
|
||
`l'
|
||
`Lo' register
|
||
|
||
`x'
|
||
`Hi' or `Lo' register
|
||
|
||
`y'
|
||
General-purpose integer register
|
||
|
||
`z'
|
||
Floating-point status register
|
||
|
||
`I'
|
||
Signed 16 bit constant (for arithmetic instructions)
|
||
|
||
`J'
|
||
Zero
|
||
|
||
`K'
|
||
Zero-extended 16-bit constant (for logic instructions)
|
||
|
||
`L'
|
||
Constant with low 16 bits zero (can be loaded with `lui')
|
||
|
||
`M'
|
||
32 bit constant which requires two instructions to load (a
|
||
constant which is not `I', `K', or `L')
|
||
|
||
`N'
|
||
Negative 16 bit constant
|
||
|
||
`O'
|
||
Exact power of two
|
||
|
||
`P'
|
||
Positive 16 bit constant
|
||
|
||
`G'
|
||
Floating point zero
|
||
|
||
`Q'
|
||
Memory reference that can be loaded with more than one
|
||
instruction (`m' is preferable for `asm' statements)
|
||
|
||
`R'
|
||
Memory reference that can be loaded with one instruction (`m'
|
||
is preferable for `asm' statements)
|
||
|
||
`S'
|
||
Memory reference in external OSF/rose PIC format (`m' is
|
||
preferable for `asm' statements)
|
||
|
||
*Motorola 680x0--`m68k.h'*
|
||
|
||
`a'
|
||
Address register
|
||
|
||
`d'
|
||
Data register
|
||
|
||
`f'
|
||
68881 floating-point register, if available
|
||
|
||
`x'
|
||
Sun FPA (floating-point) register, if available
|
||
|
||
`y'
|
||
First 16 Sun FPA registers, if available
|
||
|
||
`I'
|
||
Integer in the range 1 to 8
|
||
|
||
`J'
|
||
16 bit signed number
|
||
|
||
`K'
|
||
Signed number whose magnitude is greater than 0x80
|
||
|
||
`L'
|
||
Integer in the range -8 to -1
|
||
|
||
`M'
|
||
Signed number whose magnitude is greater than 0x100
|
||
|
||
`G'
|
||
Floating point constant that is not a 68881 constant
|
||
|
||
`H'
|
||
Floating point constant that can be used by Sun FPA
|
||
|
||
*SPARC--`sparc.h'*
|
||
|
||
`f'
|
||
Floating-point register that can hold 32 or 64 bit values.
|
||
|
||
`e'
|
||
Floating-point register that can hold 64 or 128 bit values.
|
||
|
||
`I'
|
||
Signed 13 bit constant
|
||
|
||
`J'
|
||
Zero
|
||
|
||
`K'
|
||
32 bit constant with the low 12 bits clear (a constant that
|
||
can be loaded with the `sethi' instruction)
|
||
|
||
`G'
|
||
Floating-point zero
|
||
|
||
`H'
|
||
Signed 13 bit constant, sign-extended to 32 or 64 bits
|
||
|
||
`Q'
|
||
Memory reference that can be loaded with one instruction
|
||
(`m' is more appropriate for `asm' statements)
|
||
|
||
`S'
|
||
Constant, or memory address
|
||
|
||
`T'
|
||
Memory address aligned to an 8-byte boundary
|
||
|
||
`U'
|
||
Even register
|
||
|
||
|
||
File: gcc.info, Node: No Constraints, Prev: Machine Constraints, Up: Constraints
|
||
|
||
Not Using Constraints
|
||
---------------------
|
||
|
||
Some machines are so clean that operand constraints are not
|
||
required. For example, on the Vax, an operand valid in one context is
|
||
valid in any other context. On such a machine, every operand
|
||
constraint would be `g', excepting only operands of "load address"
|
||
instructions which are written as if they referred to a memory
|
||
location's contents but actual refer to its address. They would have
|
||
constraint `p'.
|
||
|
||
For such machines, instead of writing `g' and `p' for all the
|
||
constraints, you can choose to write a description with empty
|
||
constraints. Then you write `""' for the constraint in every
|
||
`match_operand'. Address operands are identified by writing an
|
||
`address' expression around the `match_operand', not by their
|
||
constraints.
|
||
|
||
When the machine description has just empty constraints, certain
|
||
parts of compilation are skipped, making the compiler faster. However,
|
||
few machines actually do not need constraints; all machine descriptions
|
||
now in existence use constraints.
|
||
|
||
|
||
File: gcc.info, Node: Standard Names, Next: Pattern Ordering, Prev: Constraints, Up: Machine Desc
|
||
|
||
Standard Pattern Names For Generation
|
||
=====================================
|
||
|
||
Here is a table of the instruction names that are meaningful in the
|
||
RTL generation pass of the compiler. Giving one of these names to an
|
||
instruction pattern tells the RTL generation pass that it can use the
|
||
pattern to accomplish a certain task.
|
||
|
||
`movM'
|
||
Here M stands for a two-letter machine mode name, in lower case.
|
||
This instruction pattern moves data with that machine mode from
|
||
operand 1 to operand 0. For example, `movsi' moves full-word data.
|
||
|
||
If operand 0 is a `subreg' with mode M of a register whose own
|
||
mode is wider than M, the effect of this instruction is to store
|
||
the specified value in the part of the register that corresponds
|
||
to mode M. The effect on the rest of the register is undefined.
|
||
|
||
This class of patterns is special in several ways. First of all,
|
||
each of these names *must* be defined, because there is no other
|
||
way to copy a datum from one place to another.
|
||
|
||
Second, these patterns are not used solely in the RTL generation
|
||
pass. Even the reload pass can generate move insns to copy values
|
||
from stack slots into temporary registers. When it does so, one
|
||
of the operands is a hard register and the other is an operand
|
||
that can need to be reloaded into a register.
|
||
|
||
Therefore, when given such a pair of operands, the pattern must
|
||
generate RTL which needs no reloading and needs no temporary
|
||
registers--no registers other than the operands. For example, if
|
||
you support the pattern with a `define_expand', then in such a
|
||
case the `define_expand' mustn't call `force_reg' or any other such
|
||
function which might generate new pseudo registers.
|
||
|
||
This requirement exists even for subword modes on a RISC machine
|
||
where fetching those modes from memory normally requires several
|
||
insns and some temporary registers. Look in `spur.md' to see how
|
||
the requirement can be satisfied.
|
||
|
||
During reload a memory reference with an invalid address may be
|
||
passed as an operand. Such an address will be replaced with a
|
||
valid address later in the reload pass. In this case, nothing may
|
||
be done with the address except to use it as it stands. If it is
|
||
copied, it will not be replaced with a valid address. No attempt
|
||
should be made to make such an address into a valid address and no
|
||
routine (such as `change_address') that will do so may be called.
|
||
Note that `general_operand' will fail when applied to such an
|
||
address.
|
||
|
||
The global variable `reload_in_progress' (which must be explicitly
|
||
declared if required) can be used to determine whether such special
|
||
handling is required.
|
||
|
||
The variety of operands that have reloads depends on the rest of
|
||
the machine description, but typically on a RISC machine these can
|
||
only be pseudo registers that did not get hard registers, while on
|
||
other machines explicit memory references will get optional
|
||
reloads.
|
||
|
||
If a scratch register is required to move an object to or from
|
||
memory, it can be allocated using `gen_reg_rtx' prior to life
|
||
analysis.
|
||
|
||
If there are cases needing scratch registers after reload, you
|
||
must define `SECONDARY_INPUT_RELOAD_CLASS' and perhaps also
|
||
`SECONDARY_OUTPUT_RELOAD_CLASS' to detect them, and provide
|
||
patterns `reload_inM' or `reload_outM' to handle them. *Note
|
||
Register Classes::.
|
||
|
||
The global variable `no_new_pseudos' can be used to determine if it
|
||
is unsafe to create new pseudo registers. If this variable is
|
||
nonzero, then it is unsafe to call `gen_reg_rtx' to allocate a new
|
||
pseudo.
|
||
|
||
The constraints on a `movM' must permit moving any hard register
|
||
to any other hard register provided that `HARD_REGNO_MODE_OK'
|
||
permits mode M in both registers and `REGISTER_MOVE_COST' applied
|
||
to their classes returns a value of 2.
|
||
|
||
It is obligatory to support floating point `movM' instructions
|
||
into and out of any registers that can hold fixed point values,
|
||
because unions and structures (which have modes `SImode' or
|
||
`DImode') can be in those registers and they may have floating
|
||
point members.
|
||
|
||
There may also be a need to support fixed point `movM'
|
||
instructions in and out of floating point registers.
|
||
Unfortunately, I have forgotten why this was so, and I don't know
|
||
whether it is still true. If `HARD_REGNO_MODE_OK' rejects fixed
|
||
point values in floating point registers, then the constraints of
|
||
the fixed point `movM' instructions must be designed to avoid ever
|
||
trying to reload into a floating point register.
|
||
|
||
`reload_inM'
|
||
`reload_outM'
|
||
Like `movM', but used when a scratch register is required to move
|
||
between operand 0 and operand 1. Operand 2 describes the scratch
|
||
register. See the discussion of the `SECONDARY_RELOAD_CLASS'
|
||
macro in *note Register Classes::..
|
||
|
||
`movstrictM'
|
||
Like `movM' except that if operand 0 is a `subreg' with mode M of
|
||
a register whose natural mode is wider, the `movstrictM'
|
||
instruction is guaranteed not to alter any of the register except
|
||
the part which belongs to mode M.
|
||
|
||
`load_multiple'
|
||
Load several consecutive memory locations into consecutive
|
||
registers. Operand 0 is the first of the consecutive registers,
|
||
operand 1 is the first memory location, and operand 2 is a
|
||
constant: the number of consecutive registers.
|
||
|
||
Define this only if the target machine really has such an
|
||
instruction; do not define this if the most efficient way of
|
||
loading consecutive registers from memory is to do them one at a
|
||
time.
|
||
|
||
On some machines, there are restrictions as to which consecutive
|
||
registers can be stored into memory, such as particular starting or
|
||
ending register numbers or only a range of valid counts. For those
|
||
machines, use a `define_expand' (*note Expander Definitions::.)
|
||
and make the pattern fail if the restrictions are not met.
|
||
|
||
Write the generated insn as a `parallel' with elements being a
|
||
`set' of one register from the appropriate memory location (you may
|
||
also need `use' or `clobber' elements). Use a `match_parallel'
|
||
(*note RTL Template::.) to recognize the insn. See `a29k.md' and
|
||
`rs6000.md' for examples of the use of this insn pattern.
|
||
|
||
`store_multiple'
|
||
Similar to `load_multiple', but store several consecutive registers
|
||
into consecutive memory locations. Operand 0 is the first of the
|
||
consecutive memory locations, operand 1 is the first register, and
|
||
operand 2 is a constant: the number of consecutive registers.
|
||
|
||
`addM3'
|
||
Add operand 2 and operand 1, storing the result in operand 0. All
|
||
operands must have mode M. This can be used even on two-address
|
||
machines, by means of constraints requiring operands 1 and 0 to be
|
||
the same location.
|
||
|
||
`subM3', `mulM3'
|
||
`divM3', `udivM3', `modM3', `umodM3'
|
||
`sminM3', `smaxM3', `uminM3', `umaxM3'
|
||
`andM3', `iorM3', `xorM3'
|
||
Similar, for other arithmetic operations.
|
||
|
||
`mulhisi3'
|
||
Multiply operands 1 and 2, which have mode `HImode', and store a
|
||
`SImode' product in operand 0.
|
||
|
||
`mulqihi3', `mulsidi3'
|
||
Similar widening-multiplication instructions of other widths.
|
||
|
||
`umulqihi3', `umulhisi3', `umulsidi3'
|
||
Similar widening-multiplication instructions that do unsigned
|
||
multiplication.
|
||
|
||
`mulM3_highpart'
|
||
Perform a signed multiplication of operands 1 and 2, which have
|
||
mode M, and store the most significant half of the product in
|
||
operand 0. The least significant half of the product is discarded.
|
||
|
||
`umulM3_highpart'
|
||
Similar, but the multiplication is unsigned.
|
||
|
||
`divmodM4'
|
||
Signed division that produces both a quotient and a remainder.
|
||
Operand 1 is divided by operand 2 to produce a quotient stored in
|
||
operand 0 and a remainder stored in operand 3.
|
||
|
||
For machines with an instruction that produces both a quotient and
|
||
a remainder, provide a pattern for `divmodM4' but do not provide
|
||
patterns for `divM3' and `modM3'. This allows optimization in the
|
||
relatively common case when both the quotient and remainder are
|
||
computed.
|
||
|
||
If an instruction that just produces a quotient or just a remainder
|
||
exists and is more efficient than the instruction that produces
|
||
both, write the output routine of `divmodM4' to call
|
||
`find_reg_note' and look for a `REG_UNUSED' note on the quotient
|
||
or remainder and generate the appropriate instruction.
|
||
|
||
`udivmodM4'
|
||
Similar, but does unsigned division.
|
||
|
||
`ashlM3'
|
||
Arithmetic-shift operand 1 left by a number of bits specified by
|
||
operand 2, and store the result in operand 0. Here M is the mode
|
||
of operand 0 and operand 1; operand 2's mode is specified by the
|
||
instruction pattern, and the compiler will convert the operand to
|
||
that mode before generating the instruction.
|
||
|
||
`ashrM3', `lshrM3', `rotlM3', `rotrM3'
|
||
Other shift and rotate instructions, analogous to the `ashlM3'
|
||
instructions.
|
||
|
||
`negM2'
|
||
Negate operand 1 and store the result in operand 0.
|
||
|
||
`absM2'
|
||
Store the absolute value of operand 1 into operand 0.
|
||
|
||
`sqrtM2'
|
||
Store the square root of operand 1 into operand 0.
|
||
|
||
The `sqrt' built-in function of C always uses the mode which
|
||
corresponds to the C data type `double'.
|
||
|
||
`ffsM2'
|
||
Store into operand 0 one plus the index of the least significant
|
||
1-bit of operand 1. If operand 1 is zero, store zero. M is the
|
||
mode of operand 0; operand 1's mode is specified by the instruction
|
||
pattern, and the compiler will convert the operand to that mode
|
||
before generating the instruction.
|
||
|
||
The `ffs' built-in function of C always uses the mode which
|
||
corresponds to the C data type `int'.
|
||
|
||
`one_cmplM2'
|
||
Store the bitwise-complement of operand 1 into operand 0.
|
||
|
||
`cmpM'
|
||
Compare operand 0 and operand 1, and set the condition codes. The
|
||
RTL pattern should look like this:
|
||
|
||
(set (cc0) (compare (match_operand:M 0 ...)
|
||
(match_operand:M 1 ...)))
|
||
|
||
`tstM'
|
||
Compare operand 0 against zero, and set the condition codes. The
|
||
RTL pattern should look like this:
|
||
|
||
(set (cc0) (match_operand:M 0 ...))
|
||
|
||
`tstM' patterns should not be defined for machines that do not use
|
||
`(cc0)'. Doing so would confuse the optimizer since it would no
|
||
longer be clear which `set' operations were comparisons. The
|
||
`cmpM' patterns should be used instead.
|
||
|
||
`movstrM'
|
||
Block move instruction. The addresses of the destination and
|
||
source strings are the first two operands, and both are in mode
|
||
`Pmode'.
|
||
|
||
The number of bytes to move is the third operand, in mode M.
|
||
Usually, you specify `word_mode' for M. However, if you can
|
||
generate better code knowing the range of valid lengths is smaller
|
||
than those representable in a full word, you should provide a
|
||
pattern with a mode corresponding to the range of values you can
|
||
handle efficiently (e.g., `QImode' for values in the range 0-127;
|
||
note we avoid numbers that appear negative) and also a pattern
|
||
with `word_mode'.
|
||
|
||
The fourth operand is the known shared alignment of the source and
|
||
destination, in the form of a `const_int' rtx. Thus, if the
|
||
compiler knows that both source and destination are word-aligned,
|
||
it may provide the value 4 for this operand.
|
||
|
||
Descriptions of multiple `movstrM' patterns can only be beneficial
|
||
if the patterns for smaller modes have fewer restrictions on their
|
||
first, second and fourth operands. Note that the mode M in
|
||
`movstrM' does not impose any restriction on the mode of
|
||
individually moved data units in the block.
|
||
|
||
These patterns need not give special consideration to the
|
||
possibility that the source and destination strings might overlap.
|
||
|
||
`clrstrM'
|
||
Block clear instruction. The addresses of the destination string
|
||
is the first operand, in mode `Pmode'. The number of bytes to
|
||
clear is the second operand, in mode M. See `movstrM' for a
|
||
discussion of the choice of mode.
|
||
|
||
The third operand is the known alignment of the destination, in
|
||
the form of a `const_int' rtx. Thus, if the compiler knows that
|
||
the destination is word-aligned, it may provide the value 4 for
|
||
this operand.
|
||
|
||
The use for multiple `clrstrM' is as for `movstrM'.
|
||
|
||
`cmpstrM'
|
||
Block compare instruction, with five operands. Operand 0 is the
|
||
output; it has mode M. The remaining four operands are like the
|
||
operands of `movstrM'. The two memory blocks specified are
|
||
compared byte by byte in lexicographic order. The effect of the
|
||
instruction is to store a value in operand 0 whose sign indicates
|
||
the result of the comparison.
|
||
|
||
`strlenM'
|
||
Compute the length of a string, with three operands. Operand 0 is
|
||
the result (of mode M), operand 1 is a `mem' referring to the
|
||
first character of the string, operand 2 is the character to
|
||
search for (normally zero), and operand 3 is a constant describing
|
||
the known alignment of the beginning of the string.
|
||
|
||
`floatMN2'
|
||
Convert signed integer operand 1 (valid for fixed point mode M) to
|
||
floating point mode N and store in operand 0 (which has mode N).
|
||
|
||
`floatunsMN2'
|
||
Convert unsigned integer operand 1 (valid for fixed point mode M)
|
||
to floating point mode N and store in operand 0 (which has mode N).
|
||
|
||
`fixMN2'
|
||
Convert operand 1 (valid for floating point mode M) to fixed point
|
||
mode N as a signed number and store in operand 0 (which has mode
|
||
N). This instruction's result is defined only when the value of
|
||
operand 1 is an integer.
|
||
|
||
`fixunsMN2'
|
||
Convert operand 1 (valid for floating point mode M) to fixed point
|
||
mode N as an unsigned number and store in operand 0 (which has
|
||
mode N). This instruction's result is defined only when the value
|
||
of operand 1 is an integer.
|
||
|
||
`ftruncM2'
|
||
Convert operand 1 (valid for floating point mode M) to an integer
|
||
value, still represented in floating point mode M, and store it in
|
||
operand 0 (valid for floating point mode M).
|
||
|
||
`fix_truncMN2'
|
||
Like `fixMN2' but works for any floating point value of mode M by
|
||
converting the value to an integer.
|
||
|
||
`fixuns_truncMN2'
|
||
Like `fixunsMN2' but works for any floating point value of mode M
|
||
by converting the value to an integer.
|
||
|
||
`truncMN2'
|
||
Truncate operand 1 (valid for mode M) to mode N and store in
|
||
operand 0 (which has mode N). Both modes must be fixed point or
|
||
both floating point.
|
||
|
||
`extendMN2'
|
||
Sign-extend operand 1 (valid for mode M) to mode N and store in
|
||
operand 0 (which has mode N). Both modes must be fixed point or
|
||
both floating point.
|
||
|
||
`zero_extendMN2'
|
||
Zero-extend operand 1 (valid for mode M) to mode N and store in
|
||
operand 0 (which has mode N). Both modes must be fixed point.
|
||
|
||
`extv'
|
||
Extract a bit field from operand 1 (a register or memory operand),
|
||
where operand 2 specifies the width in bits and operand 3 the
|
||
starting bit, and store it in operand 0. Operand 0 must have mode
|
||
`word_mode'. Operand 1 may have mode `byte_mode' or `word_mode';
|
||
often `word_mode' is allowed only for registers. Operands 2 and 3
|
||
must be valid for `word_mode'.
|
||
|
||
The RTL generation pass generates this instruction only with
|
||
constants for operands 2 and 3.
|
||
|
||
The bit-field value is sign-extended to a full word integer before
|
||
it is stored in operand 0.
|
||
|
||
`extzv'
|
||
Like `extv' except that the bit-field value is zero-extended.
|
||
|
||
`insv'
|
||
Store operand 3 (which must be valid for `word_mode') into a bit
|
||
field in operand 0, where operand 1 specifies the width in bits and
|
||
operand 2 the starting bit. Operand 0 may have mode `byte_mode' or
|
||
`word_mode'; often `word_mode' is allowed only for registers.
|
||
Operands 1 and 2 must be valid for `word_mode'.
|
||
|
||
The RTL generation pass generates this instruction only with
|
||
constants for operands 1 and 2.
|
||
|
||
`movMODEcc'
|
||
Conditionally move operand 2 or operand 3 into operand 0 according
|
||
to the comparison in operand 1. If the comparison is true,
|
||
operand 2 is moved into operand 0, otherwise operand 3 is moved.
|
||
|
||
The mode of the operands being compared need not be the same as
|
||
the operands being moved. Some machines, sparc64 for example,
|
||
have instructions that conditionally move an integer value based
|
||
on the floating point condition codes and vice versa.
|
||
|
||
If the machine does not have conditional move instructions, do not
|
||
define these patterns.
|
||
|
||
`sCOND'
|
||
Store zero or nonzero in the operand according to the condition
|
||
codes. Value stored is nonzero iff the condition COND is true.
|
||
COND is the name of a comparison operation expression code, such
|
||
as `eq', `lt' or `leu'.
|
||
|
||
You specify the mode that the operand must have when you write the
|
||
`match_operand' expression. The compiler automatically sees which
|
||
mode you have used and supplies an operand of that mode.
|
||
|
||
The value stored for a true condition must have 1 as its low bit,
|
||
or else must be negative. Otherwise the instruction is not
|
||
suitable and you should omit it from the machine description. You
|
||
describe to the compiler exactly which value is stored by defining
|
||
the macro `STORE_FLAG_VALUE' (*note Misc::.). If a description
|
||
cannot be found that can be used for all the `sCOND' patterns, you
|
||
should omit those operations from the machine description.
|
||
|
||
These operations may fail, but should do so only in relatively
|
||
uncommon cases; if they would fail for common cases involving
|
||
integer comparisons, it is best to omit these patterns.
|
||
|
||
If these operations are omitted, the compiler will usually
|
||
generate code that copies the constant one to the target and
|
||
branches around an assignment of zero to the target. If this code
|
||
is more efficient than the potential instructions used for the
|
||
`sCOND' pattern followed by those required to convert the result
|
||
into a 1 or a zero in `SImode', you should omit the `sCOND'
|
||
operations from the machine description.
|
||
|
||
`bCOND'
|
||
Conditional branch instruction. Operand 0 is a `label_ref' that
|
||
refers to the label to jump to. Jump if the condition codes meet
|
||
condition COND.
|
||
|
||
Some machines do not follow the model assumed here where a
|
||
comparison instruction is followed by a conditional branch
|
||
instruction. In that case, the `cmpM' (and `tstM') patterns should
|
||
simply store the operands away and generate all the required insns
|
||
in a `define_expand' (*note Expander Definitions::.) for the
|
||
conditional branch operations. All calls to expand `bCOND'
|
||
patterns are immediately preceded by calls to expand either a
|
||
`cmpM' pattern or a `tstM' pattern.
|
||
|
||
Machines that use a pseudo register for the condition code value,
|
||
or where the mode used for the comparison depends on the condition
|
||
being tested, should also use the above mechanism. *Note Jump
|
||
Patterns::.
|
||
|
||
The above discussion also applies to the `movMODEcc' and `sCOND'
|
||
patterns.
|
||
|
||
`call'
|
||
Subroutine call instruction returning no value. Operand 0 is the
|
||
function to call; operand 1 is the number of bytes of arguments
|
||
pushed as a `const_int'; operand 2 is the number of registers used
|
||
as operands.
|
||
|
||
On most machines, operand 2 is not actually stored into the RTL
|
||
pattern. It is supplied for the sake of some RISC machines which
|
||
need to put this information into the assembler code; they can put
|
||
it in the RTL instead of operand 1.
|
||
|
||
Operand 0 should be a `mem' RTX whose address is the address of the
|
||
function. Note, however, that this address can be a `symbol_ref'
|
||
expression even if it would not be a legitimate memory address on
|
||
the target machine. If it is also not a valid argument for a call
|
||
instruction, the pattern for this operation should be a
|
||
`define_expand' (*note Expander Definitions::.) that places the
|
||
address into a register and uses that register in the call
|
||
instruction.
|
||
|
||
`call_value'
|
||
Subroutine call instruction returning a value. Operand 0 is the
|
||
hard register in which the value is returned. There are three more
|
||
operands, the same as the three operands of the `call' instruction
|
||
(but with numbers increased by one).
|
||
|
||
Subroutines that return `BLKmode' objects use the `call' insn.
|
||
|
||
`call_pop', `call_value_pop'
|
||
Similar to `call' and `call_value', except used if defined and if
|
||
`RETURN_POPS_ARGS' is non-zero. They should emit a `parallel'
|
||
that contains both the function call and a `set' to indicate the
|
||
adjustment made to the frame pointer.
|
||
|
||
For machines where `RETURN_POPS_ARGS' can be non-zero, the use of
|
||
these patterns increases the number of functions for which the
|
||
frame pointer can be eliminated, if desired.
|
||
|
||
`untyped_call'
|
||
Subroutine call instruction returning a value of any type.
|
||
Operand 0 is the function to call; operand 1 is a memory location
|
||
where the result of calling the function is to be stored; operand
|
||
2 is a `parallel' expression where each element is a `set'
|
||
expression that indicates the saving of a function return value
|
||
into the result block.
|
||
|
||
This instruction pattern should be defined to support
|
||
`__builtin_apply' on machines where special instructions are needed
|
||
to call a subroutine with arbitrary arguments or to save the value
|
||
returned. This instruction pattern is required on machines that
|
||
have multiple registers that can hold a return value (i.e.
|
||
`FUNCTION_VALUE_REGNO_P' is true for more than one register).
|
||
|
||
`return'
|
||
Subroutine return instruction. This instruction pattern name
|
||
should be defined only if a single instruction can do all the work
|
||
of returning from a function.
|
||
|
||
Like the `movM' patterns, this pattern is also used after the RTL
|
||
generation phase. In this case it is to support machines where
|
||
multiple instructions are usually needed to return from a
|
||
function, but some class of functions only requires one
|
||
instruction to implement a return. Normally, the applicable
|
||
functions are those which do not need to save any registers or
|
||
allocate stack space.
|
||
|
||
For such machines, the condition specified in this pattern should
|
||
only be true when `reload_completed' is non-zero and the function's
|
||
epilogue would only be a single instruction. For machines with
|
||
register windows, the routine `leaf_function_p' may be used to
|
||
determine if a register window push is required.
|
||
|
||
Machines that have conditional return instructions should define
|
||
patterns such as
|
||
|
||
(define_insn ""
|
||
[(set (pc)
|
||
(if_then_else (match_operator
|
||
0 "comparison_operator"
|
||
[(cc0) (const_int 0)])
|
||
(return)
|
||
(pc)))]
|
||
"CONDITION"
|
||
"...")
|
||
|
||
where CONDITION would normally be the same condition specified on
|
||
the named `return' pattern.
|
||
|
||
`untyped_return'
|
||
Untyped subroutine return instruction. This instruction pattern
|
||
should be defined to support `__builtin_return' on machines where
|
||
special instructions are needed to return a value of any type.
|
||
|
||
Operand 0 is a memory location where the result of calling a
|
||
function with `__builtin_apply' is stored; operand 1 is a
|
||
`parallel' expression where each element is a `set' expression
|
||
that indicates the restoring of a function return value from the
|
||
result block.
|
||
|
||
`nop'
|
||
No-op instruction. This instruction pattern name should always be
|
||
defined to output a no-op in assembler code. `(const_int 0)' will
|
||
do as an RTL pattern.
|
||
|
||
`indirect_jump'
|
||
An instruction to jump to an address which is operand zero. This
|
||
pattern name is mandatory on all machines.
|
||
|
||
`casesi'
|
||
Instruction to jump through a dispatch table, including bounds
|
||
checking. This instruction takes five operands:
|
||
|
||
1. The index to dispatch on, which has mode `SImode'.
|
||
|
||
2. The lower bound for indices in the table, an integer constant.
|
||
|
||
3. The total range of indices in the table--the largest index
|
||
minus the smallest one (both inclusive).
|
||
|
||
4. A label that precedes the table itself.
|
||
|
||
5. A label to jump to if the index has a value outside the
|
||
bounds. (If the machine-description macro
|
||
`CASE_DROPS_THROUGH' is defined, then an out-of-bounds index
|
||
drops through to the code following the jump table instead of
|
||
jumping to this label. In that case, this label is not
|
||
actually used by the `casesi' instruction, but it is always
|
||
provided as an operand.)
|
||
|
||
The table is a `addr_vec' or `addr_diff_vec' inside of a
|
||
`jump_insn'. The number of elements in the table is one plus the
|
||
difference between the upper bound and the lower bound.
|
||
|
||
`tablejump'
|
||
Instruction to jump to a variable address. This is a low-level
|
||
capability which can be used to implement a dispatch table when
|
||
there is no `casesi' pattern.
|
||
|
||
This pattern requires two operands: the address or offset, and a
|
||
label which should immediately precede the jump table. If the
|
||
macro `CASE_VECTOR_PC_RELATIVE' evaluates to a nonzero value then
|
||
the first operand is an offset which counts from the address of
|
||
the table; otherwise, it is an absolute address to jump to. In
|
||
either case, the first operand has mode `Pmode'.
|
||
|
||
The `tablejump' insn is always the last insn before the jump table
|
||
it uses. Its assembler code normally has no need to use the
|
||
second operand, but you should incorporate it in the RTL pattern so
|
||
that the jump optimizer will not delete the table as unreachable
|
||
code.
|
||
|
||
`canonicalize_funcptr_for_compare'
|
||
Canonicalize the function pointer in operand 1 and store the result
|
||
into operand 0.
|
||
|
||
Operand 0 is always a `reg' and has mode `Pmode'; operand 1 may be
|
||
a `reg', `mem', `symbol_ref', `const_int', etc and also has mode
|
||
`Pmode'.
|
||
|
||
Canonicalization of a function pointer usually involves computing
|
||
the address of the function which would be called if the function
|
||
pointer were used in an indirect call.
|
||
|
||
Only define this pattern if function pointers on the target machine
|
||
can have different values but still call the same function when
|
||
used in an indirect call.
|
||
|
||
`save_stack_block'
|
||
`save_stack_function'
|
||
`save_stack_nonlocal'
|
||
`restore_stack_block'
|
||
`restore_stack_function'
|
||
`restore_stack_nonlocal'
|
||
Most machines save and restore the stack pointer by copying it to
|
||
or from an object of mode `Pmode'. Do not define these patterns on
|
||
such machines.
|
||
|
||
Some machines require special handling for stack pointer saves and
|
||
restores. On those machines, define the patterns corresponding to
|
||
the non-standard cases by using a `define_expand' (*note Expander
|
||
Definitions::.) that produces the required insns. The three types
|
||
of saves and restores are:
|
||
|
||
1. `save_stack_block' saves the stack pointer at the start of a
|
||
block that allocates a variable-sized object, and
|
||
`restore_stack_block' restores the stack pointer when the
|
||
block is exited.
|
||
|
||
2. `save_stack_function' and `restore_stack_function' do a
|
||
similar job for the outermost block of a function and are
|
||
used when the function allocates variable-sized objects or
|
||
calls `alloca'. Only the epilogue uses the restored stack
|
||
pointer, allowing a simpler save or restore sequence on some
|
||
machines.
|
||
|
||
3. `save_stack_nonlocal' is used in functions that contain labels
|
||
branched to by nested functions. It saves the stack pointer
|
||
in such a way that the inner function can use
|
||
`restore_stack_nonlocal' to restore the stack pointer. The
|
||
compiler generates code to restore the frame and argument
|
||
pointer registers, but some machines require saving and
|
||
restoring additional data such as register window information
|
||
or stack backchains. Place insns in these patterns to save
|
||
and restore any such required data.
|
||
|
||
When saving the stack pointer, operand 0 is the save area and
|
||
operand 1 is the stack pointer. The mode used to allocate the
|
||
save area defaults to `Pmode' but you can override that choice by
|
||
defining the `STACK_SAVEAREA_MODE' macro (*note Storage
|
||
Layout::.). You must specify an integral mode, or `VOIDmode' if
|
||
no save area is needed for a particular type of save (either
|
||
because no save is needed or because a machine-specific save area
|
||
can be used). Operand 0 is the stack pointer and operand 1 is the
|
||
save area for restore operations. If `save_stack_block' is
|
||
defined, operand 0 must not be `VOIDmode' since these saves can be
|
||
arbitrarily nested.
|
||
|
||
A save area is a `mem' that is at a constant offset from
|
||
`virtual_stack_vars_rtx' when the stack pointer is saved for use by
|
||
nonlocal gotos and a `reg' in the other two cases.
|
||
|
||
`allocate_stack'
|
||
Subtract (or add if `STACK_GROWS_DOWNWARD' is undefined) operand 1
|
||
from the stack pointer to create space for dynamically allocated
|
||
data.
|
||
|
||
Store the resultant pointer to this space into operand 0. If you
|
||
are allocating space from the main stack, do this by emitting a
|
||
move insn to copy `virtual_stack_dynamic_rtx' to operand 0. If
|
||
you are allocating the space elsewhere, generate code to copy the
|
||
location of the space to operand 0. In the latter case, you must
|
||
ensure this space gets freed when the corresponding space on the
|
||
main stack is free.
|
||
|
||
Do not define this pattern if all that must be done is the
|
||
subtraction. Some machines require other operations such as stack
|
||
probes or maintaining the back chain. Define this pattern to emit
|
||
those operations in addition to updating the stack pointer.
|
||
|
||
`probe'
|
||
Some machines require instructions to be executed after space is
|
||
allocated from the stack, for example to generate a reference at
|
||
the bottom of the stack.
|
||
|
||
If you need to emit instructions before the stack has been
|
||
adjusted, put them into the `allocate_stack' pattern. Otherwise,
|
||
define this pattern to emit the required instructions.
|
||
|
||
No operands are provided.
|
||
|
||
`check_stack'
|
||
If stack checking cannot be done on your system by probing the
|
||
stack with a load or store instruction (*note Stack Checking::.),
|
||
define this pattern to perform the needed check and signaling an
|
||
error if the stack has overflowed. The single operand is the
|
||
location in the stack furthest from the current stack pointer that
|
||
you need to validate. Normally, on machines where this pattern is
|
||
needed, you would obtain the stack limit from a global or
|
||
thread-specific variable or register.
|
||
|
||
`nonlocal_goto'
|
||
Emit code to generate a non-local goto, e.g., a jump from one
|
||
function to a label in an outer function. This pattern has four
|
||
arguments, each representing a value to be used in the jump. The
|
||
first argument is to be loaded into the frame pointer, the second
|
||
is the address to branch to (code to dispatch to the actual label),
|
||
the third is the address of a location where the stack is saved,
|
||
and the last is the address of the label, to be placed in the
|
||
location for the incoming static chain.
|
||
|
||
On most machines you need not define this pattern, since GNU CC
|
||
will already generate the correct code, which is to load the frame
|
||
pointer and static chain, restore the stack (using the
|
||
`restore_stack_nonlocal' pattern, if defined), and jump indirectly
|
||
to the dispatcher. You need only define this pattern if this code
|
||
will not work on your machine.
|
||
|
||
`nonlocal_goto_receiver'
|
||
This pattern, if defined, contains code needed at the target of a
|
||
nonlocal goto after the code already generated by GNU CC. You
|
||
will not normally need to define this pattern. A typical reason
|
||
why you might need this pattern is if some value, such as a
|
||
pointer to a global table, must be restored when the frame pointer
|
||
is restored. Note that a nonlocal goto only ocurrs within a
|
||
unit-of-translation, so a global table pointer that is shared by
|
||
all functions of a given module need not be restored. There are
|
||
no arguments.
|
||
|
||
`exception_receiver'
|
||
This pattern, if defined, contains code needed at the site of an
|
||
exception handler that isn't needed at the site of a nonlocal
|
||
goto. You will not normally need to define this pattern. A
|
||
typical reason why you might need this pattern is if some value,
|
||
such as a pointer to a global table, must be restored after
|
||
control flow is branched to the handler of an exception. There
|
||
are no arguments.
|
||
|
||
`builtin_setjmp_setup'
|
||
This pattern, if defined, contains additional code needed to
|
||
initialize the `jmp_buf'. You will not normally need to define
|
||
this pattern. A typical reason why you might need this pattern is
|
||
if some value, such as a pointer to a global table, must be
|
||
restored. Though it is preferred that the pointer value be
|
||
recalculated if possible (given the address of a label for
|
||
instance). The single argument is a pointer to the `jmp_buf'.
|
||
Note that the buffer is five words long and that the first three
|
||
are normally used by the generic mechanism.
|
||
|
||
`builtin_setjmp_receiver'
|
||
This pattern, if defined, contains code needed at the site of an
|
||
builtin setjmp that isn't needed at the site of a nonlocal goto.
|
||
You will not normally need to define this pattern. A typical
|
||
reason why you might need this pattern is if some value, such as a
|
||
pointer to a global table, must be restored. It takes one
|
||
argument, which is the label to which builtin_longjmp transfered
|
||
control; this pattern may be emitted at a small offset from that
|
||
label.
|
||
|
||
`builtin_longjmp'
|
||
This pattern, if defined, performs the entire action of the
|
||
longjmp. You will not normally need to define this pattern unless
|
||
you also define `builtin_setjmp_setup'. The single argument is a
|
||
pointer to the `jmp_buf'.
|
||
|
||
`eh_epilogue'
|
||
This pattern, if defined, affects the way `__builtin_eh_return',
|
||
and thence `__throw' are built. It is intended to allow
|
||
communication between the exception handling machinery and the
|
||
normal epilogue code for the target.
|
||
|
||
The pattern takes three arguments. The first is the exception
|
||
context pointer. This will have already been copied to the
|
||
function return register appropriate for a pointer; normally this
|
||
can be ignored. The second argument is an offset to be added to
|
||
the stack pointer. It will have been copied to some arbitrary
|
||
call-clobbered hard reg so that it will survive until after reload
|
||
to when the normal epilogue is generated. The final argument is
|
||
the address of the exception handler to which the function should
|
||
return. This will normally need to copied by the pattern to some
|
||
special register.
|
||
|
||
This pattern must be defined if `RETURN_ADDR_RTX' does not yield
|
||
something that can be reliably and permanently modified, i.e. a
|
||
fixed hard register or a stack memory reference.
|
||
|
||
`prologue'
|
||
This pattern, if defined, emits RTL for entry to a function. The
|
||
function entry is resposible for setting up the stack frame,
|
||
initializing the frame pointer register, saving callee saved
|
||
registers, etc.
|
||
|
||
Using a prologue pattern is generally preferred over defining
|
||
`FUNCTION_PROLOGUE' to emit assembly code for the prologue.
|
||
|
||
The `prologue' pattern is particularly useful for targets which
|
||
perform instruction scheduling.
|
||
|
||
`epilogue'
|
||
This pattern, if defined, emits RTL for exit from a function. The
|
||
function exit is resposible for deallocating the stack frame,
|
||
restoring callee saved registers and emitting the return
|
||
instruction.
|
||
|
||
Using an epilogue pattern is generally preferred over defining
|
||
`FUNCTION_EPILOGUE' to emit assembly code for the prologue.
|
||
|
||
The `epilogue' pattern is particularly useful for targets which
|
||
perform instruction scheduling or which have delay slots for their
|
||
return instruction.
|
||
|
||
`sibcall_epilogue'
|
||
This pattern, if defined, emits RTL for exit from a function
|
||
without the final branch back to the calling function. This
|
||
pattern will be emitted before any sibling call (aka tail call)
|
||
sites.
|
||
|
||
The `sibcall_epilogue' pattern must not clobber any arguments used
|
||
for parameter passing or any stack slots for arguments passed to
|
||
the current function.
|
||
|
||
|
||
File: gcc.info, Node: Pattern Ordering, Next: Dependent Patterns, Prev: Standard Names, Up: Machine Desc
|
||
|
||
When the Order of Patterns Matters
|
||
==================================
|
||
|
||
Sometimes an insn can match more than one instruction pattern. Then
|
||
the pattern that appears first in the machine description is the one
|
||
used. Therefore, more specific patterns (patterns that will match
|
||
fewer things) and faster instructions (those that will produce better
|
||
code when they do match) should usually go first in the description.
|
||
|
||
In some cases the effect of ordering the patterns can be used to hide
|
||
a pattern when it is not valid. For example, the 68000 has an
|
||
instruction for converting a fullword to floating point and another for
|
||
converting a byte to floating point. An instruction converting an
|
||
integer to floating point could match either one. We put the pattern
|
||
to convert the fullword first to make sure that one will be used rather
|
||
than the other. (Otherwise a large integer might be generated as a
|
||
single-byte immediate quantity, which would not work.) Instead of
|
||
using this pattern ordering it would be possible to make the pattern
|
||
for convert-a-byte smart enough to deal properly with any constant
|
||
value.
|
||
|