You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

210 lines
9.4 KiB
HTML

<html lang="en">
<head>
<title>define_peephole - GNU Compiler Collection (GCC) Internals</title>
<meta http-equiv="Content-Type" content="text/html">
<meta name="description" content="GNU Compiler Collection (GCC) Internals">
<meta name="generator" content="makeinfo 4.13">
<link title="Top" rel="start" href="index.html#Top">
<link rel="up" href="Peephole-Definitions.html#Peephole-Definitions" title="Peephole Definitions">
<link rel="next" href="define_005fpeephole2.html#define_005fpeephole2" title="define_peephole2">
<link href="http://www.gnu.org/software/texinfo/" rel="generator-home" title="Texinfo Homepage">
<!--
Copyright (C) 1988-2015 Free Software Foundation, Inc.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License, Version 1.3 or
any later version published by the Free Software Foundation; with the
Invariant Sections being ``Funding Free Software'', the Front-Cover
Texts being (a) (see below), and with the Back-Cover Texts being (b)
(see below). A copy of the license is included in the section entitled
``GNU Free Documentation License''.
(a) The FSF's Front-Cover Text is:
A GNU Manual
(b) The FSF's Back-Cover Text is:
You have freedom to copy and modify this GNU Manual, like GNU
software. Copies published by the Free Software Foundation raise
funds for GNU development.-->
<meta http-equiv="Content-Style-Type" content="text/css">
<style type="text/css"><!--
pre.display { font-family:inherit }
pre.format { font-family:inherit }
pre.smalldisplay { font-family:inherit; font-size:smaller }
pre.smallformat { font-family:inherit; font-size:smaller }
pre.smallexample { font-size:smaller }
pre.smalllisp { font-size:smaller }
span.sc { font-variant:small-caps }
span.roman { font-family:serif; font-weight:normal; }
span.sansserif { font-family:sans-serif; font-weight:normal; }
--></style>
</head>
<body>
<div class="node">
<a name="define_peephole"></a>
<a name="define_005fpeephole"></a>
<p>
Next:&nbsp;<a rel="next" accesskey="n" href="define_005fpeephole2.html#define_005fpeephole2">define_peephole2</a>,
Up:&nbsp;<a rel="up" accesskey="u" href="Peephole-Definitions.html#Peephole-Definitions">Peephole Definitions</a>
<hr>
</div>
<h4 class="subsection">16.18.1 RTL to Text Peephole Optimizers</h4>
<p><a name="index-define_005fpeephole-3713"></a>
A definition looks like this:
<pre class="smallexample"> (define_peephole
[<var>insn-pattern-1</var>
<var>insn-pattern-2</var>
...]
"<var>condition</var>"
"<var>template</var>"
"<var>optional-insn-attributes</var>")
</pre>
<p class="noindent">The last string operand may be omitted if you are not using any
machine-specific information in this machine description. If present,
it must obey the same rules as in a <code>define_insn</code>.
<p>In this skeleton, <var>insn-pattern-1</var> and so on are patterns to match
consecutive insns. The optimization applies to a sequence of insns when
<var>insn-pattern-1</var> matches the first one, <var>insn-pattern-2</var> matches
the next, and so on.
<p>Each of the insns matched by a peephole must also match a
<code>define_insn</code>. Peepholes are checked only at the last stage just
before code generation, and only optionally. Therefore, any insn which
would match a peephole but no <code>define_insn</code> will cause a crash in code
generation in an unoptimized compilation, or at various optimization
stages.
<p>The operands of the insns are matched with <code>match_operands</code>,
<code>match_operator</code>, and <code>match_dup</code>, as usual. What is not
usual is that the operand numbers apply to all the insn patterns in the
definition. So, you can check for identical operands in two insns by
using <code>match_operand</code> in one insn and <code>match_dup</code> in the
other.
<p>The operand constraints used in <code>match_operand</code> patterns do not have
any direct effect on the applicability of the peephole, but they will
be validated afterward, so make sure your constraints are general enough
to apply whenever the peephole matches. If the peephole matches
but the constraints are not satisfied, the compiler will crash.
<p>It is safe to omit constraints in all the operands of the peephole; or
you can write constraints which serve as a double-check on the criteria
previously tested.
<p>Once a sequence of insns matches the patterns, the <var>condition</var> is
checked. This is a C expression which makes the final decision whether to
perform the optimization (we do so if the expression is nonzero). If
<var>condition</var> is omitted (in other words, the string is empty) then the
optimization is applied to every sequence of insns that matches the
patterns.
<p>The defined peephole optimizations are applied after register allocation
is complete. Therefore, the peephole definition can check which
operands have ended up in which kinds of registers, just by looking at
the operands.
<p><a name="index-prev_005factive_005finsn-3714"></a>The way to refer to the operands in <var>condition</var> is to write
<code>operands[</code><var>i</var><code>]</code> for operand number <var>i</var> (as matched by
<code>(match_operand </code><var>i</var><code> ...)</code>). Use the variable <code>insn</code>
to refer to the last of the insns being matched; use
<code>prev_active_insn</code> to find the preceding insns.
<p><a name="index-dead_005for_005fset_005fp-3715"></a>When optimizing computations with intermediate results, you can use
<var>condition</var> to match only when the intermediate results are not used
elsewhere. Use the C expression <code>dead_or_set_p (</code><var>insn</var><code>,
</code><var>op</var><code>)</code>, where <var>insn</var> is the insn in which you expect the value
to be used for the last time (from the value of <code>insn</code>, together
with use of <code>prev_nonnote_insn</code>), and <var>op</var> is the intermediate
value (from <code>operands[</code><var>i</var><code>]</code>).
<p>Applying the optimization means replacing the sequence of insns with one
new insn. The <var>template</var> controls ultimate output of assembler code
for this combined insn. It works exactly like the template of a
<code>define_insn</code>. Operand numbers in this template are the same ones
used in matching the original sequence of insns.
<p>The result of a defined peephole optimizer does not need to match any of
the insn patterns in the machine description; it does not even have an
opportunity to match them. The peephole optimizer definition itself serves
as the insn pattern to control how the insn is output.
<p>Defined peephole optimizers are run as assembler code is being output,
so the insns they produce are never combined or rearranged in any way.
<p>Here is an example, taken from the 68000 machine description:
<pre class="smallexample"> (define_peephole
[(set (reg:SI 15) (plus:SI (reg:SI 15) (const_int 4)))
(set (match_operand:DF 0 "register_operand" "=f")
(match_operand:DF 1 "register_operand" "ad"))]
"FP_REG_P (operands[0]) &amp;&amp; ! FP_REG_P (operands[1])"
{
rtx xoperands[2];
xoperands[1] = gen_rtx_REG (SImode, REGNO (operands[1]) + 1);
#ifdef MOTOROLA
output_asm_insn ("move.l %1,(sp)", xoperands);
output_asm_insn ("move.l %1,-(sp)", operands);
return "fmove.d (sp)+,%0";
#else
output_asm_insn ("movel %1,sp@", xoperands);
output_asm_insn ("movel %1,sp@-", operands);
return "fmoved sp@+,%0";
#endif
})
</pre>
<p>The effect of this optimization is to change
<pre class="smallexample"> jbsr _foobar
addql #4,sp
movel d1,sp@-
movel d0,sp@-
fmoved sp@+,fp0
</pre>
<p class="noindent">into
<pre class="smallexample"> jbsr _foobar
movel d1,sp@
movel d0,sp@-
fmoved sp@+,fp0
</pre>
<p><var>insn-pattern-1</var> and so on look <em>almost</em> like the second
operand of <code>define_insn</code>. There is one important difference: the
second operand of <code>define_insn</code> consists of one or more RTX's
enclosed in square brackets. Usually, there is only one: then the same
action can be written as an element of a <code>define_peephole</code>. But
when there are multiple actions in a <code>define_insn</code>, they are
implicitly enclosed in a <code>parallel</code>. Then you must explicitly
write the <code>parallel</code>, and the square brackets within it, in the
<code>define_peephole</code>. Thus, if an insn pattern looks like this,
<pre class="smallexample"> (define_insn "divmodsi4"
[(set (match_operand:SI 0 "general_operand" "=d")
(div:SI (match_operand:SI 1 "general_operand" "0")
(match_operand:SI 2 "general_operand" "dmsK")))
(set (match_operand:SI 3 "general_operand" "=d")
(mod:SI (match_dup 1) (match_dup 2)))]
"TARGET_68020"
"divsl%.l %2,%3:%0")
</pre>
<p class="noindent">then the way to mention this insn in a peephole is as follows:
<pre class="smallexample"> (define_peephole
[...
(parallel
[(set (match_operand:SI 0 "general_operand" "=d")
(div:SI (match_operand:SI 1 "general_operand" "0")
(match_operand:SI 2 "general_operand" "dmsK")))
(set (match_operand:SI 3 "general_operand" "=d")
(mod:SI (match_dup 1) (match_dup 2)))])
...]
...)
</pre>
</body></html>