2005-08-26 17:39:27 +00:00
|
|
|
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
|
|
|
|
<html>
|
|
|
|
|
|
|
|
<head>
|
|
|
|
<link HREF="mailto:drh@microsoft.com" REV="made" TITLE="David R. Hanson">
|
|
|
|
<title>The lcc 4.1 Code-Generation Interface</title>
|
|
|
|
</head>
|
|
|
|
|
|
|
|
<body>
|
|
|
|
|
|
|
|
<h1>The lcc 4.1 Code-Generation Interface</h1>
|
|
|
|
|
|
|
|
<p ALIGN="LEFT"><strong><a HREF="http://www.research.microsoft.com/~cwfraser/">Christopher
|
|
|
|
W. Fraser</a> and <a HREF="http://www.research.microsoft.com/~drh/">David R. Hanson</a>, <a
|
|
|
|
HREF="http://www.research.microsoft.com/">Microsoft Research</a></strong></p>
|
|
|
|
|
|
|
|
<h2>Contents</h2>
|
|
|
|
|
|
|
|
<dir>
|
|
|
|
<li><a HREF="#intro">Introduction</a> </li>
|
|
|
|
<li><a HREF="#metrics">5.1 Type Metrics</a></li>
|
|
|
|
<li><a HREF="#symbols">5.3 Symbols</a> </li>
|
|
|
|
<li><a HREF="#operators">5.5 Dag Operators</a></li>
|
|
|
|
<li><a HREF="#flags">5.6 Interface Flags</a></li>
|
|
|
|
<li><a HREF="#definitions">5.8 Definitions</a></li>
|
|
|
|
<li><a HREF="#constants">5.9 Constants</a></li>
|
|
|
|
<li><a HREF="#upcalls">5.12 Upcalls</a></li>
|
|
|
|
</dir>
|
|
|
|
|
|
|
|
<h2><a NAME="intro">Introduction</a></h2>
|
|
|
|
|
|
|
|
<p>Version 4.1 is the latest release of <a
|
|
|
|
HREF="http://www.cs.princeton.edu/software/lcc/">lcc</a>, the ANSI C compiler described in
|
|
|
|
our book <cite>A Retargetable C Compiler: Design and Implementation</cite>
|
|
|
|
(Addison-Wesley, 1995, ISBN 0-8053-1670-1). This document summarizes the differences
|
|
|
|
between the 4.1 code-generation interface and the 3.x interface described in Chap. 5 of <cite>A
|
|
|
|
Retargetable C Compiler</cite>.</p>
|
|
|
|
|
|
|
|
<p>Previous versions of lcc supported only three sizes of integers, two sizes of floats,
|
|
|
|
and insisted that pointers fit in unsigned integers (see Sec. 5.1 of <cite>A Retargetable
|
|
|
|
C Compiler</cite>). These assumptions simplified the compiler, and were suitable for
|
|
|
|
32-bit architectures. But on 64-bit architectures, such as the DEC ALPHA, it's natural to
|
|
|
|
have four sizes of integers and perhaps three sizes of floats, and on 16-bit
|
|
|
|
architectures, 32-bit pointers don't fit in unsigned integers. Also, the 3.x constaints
|
|
|
|
limited the use of lcc's back ends for other languages, such as Java.</p>
|
|
|
|
|
|
|
|
<p>Version 4.x removes all of these restrictions: It supports any number of sizes for
|
|
|
|
integers and floats, and the size of pointers need not be related to the size of any of
|
|
|
|
the integer types. The major changes in the code-generation interface are:
|
|
|
|
|
|
|
|
<ul>
|
|
|
|
<li>The number of type suffixes has been reduced to 6.</li>
|
|
|
|
<li>Dag operators are composed of a generic operator, a type suffix, and a size.</li>
|
|
|
|
<li>Unsigned variants of several operators have been added.</li>
|
|
|
|
<li>Several interface functions have new signatures.</li>
|
|
|
|
</ul>
|
|
|
|
|
|
|
|
<p>In addition, version 4.x is written in ANSI C and uses the standard I/O library and
|
|
|
|
other standard C functions.</p>
|
|
|
|
|
|
|
|
<p>The sections below parallel the subsections of Chap. 5 of <cite>A Retargetable C
|
|
|
|
Compiler</cite> and summarize the differences between the 3.x and 4.x code-generation
|
|
|
|
interface. Unaffected subsections are omitted. Page citations refer to pages in <cite>A
|
|
|
|
Retargetable C Compiler</cite>.</p>
|
|
|
|
|
|
|
|
<h2><a NAME="metrics">5.1 Type Metrics</a></h2>
|
|
|
|
|
|
|
|
<p>There are now 10 metrics in an interface record:</p>
|
|
|
|
|
|
|
|
<pre>Metrics charmetric;
|
|
|
|
Metrics shortmetric;
|
|
|
|
Metrics intmetric;
|
|
|
|
Metrics longmetric;
|
|
|
|
Metrics longlongmetric;
|
|
|
|
Metrics floatmetric;
|
|
|
|
Metrics doublemetric;
|
|
|
|
Metrics longdoublemetric;
|
|
|
|
Metrics ptrmetric;
|
|
|
|
Metrics structmetric;</pre>
|
|
|
|
|
|
|
|
<p>Each of these specifies the size and alignment of the corresponding type. <code>ptrmetric</code>
|
|
|
|
describes all pointers.</p>
|
|
|
|
|
|
|
|
<h2><a NAME="symbols">5.3 Symbols</a></h2>
|
|
|
|
|
|
|
|
<p>The actual value of a constant is stored in the <code>u.c.v</code> field of a symbol,
|
|
|
|
which holds a <code>Value</code>:</p>
|
|
|
|
|
|
|
|
<pre>typedef union value {
|
|
|
|
long i;
|
|
|
|
unsigned long u;
|
|
|
|
long double d;
|
|
|
|
void *p;
|
|
|
|
void (*g)(void);
|
|
|
|
} Value;</pre>
|
|
|
|
|
|
|
|
<p>The value is stored in the appropriate field according to its type, which is given by
|
|
|
|
the symbol's <code>type</code> field.</p>
|
|
|
|
|
|
|
|
<h2><a NAME="operators">5.5 Dag Operators</a></h2>
|
|
|
|
|
|
|
|
<p>The <code>op</code> field a of <code>node</code> structure holds a dag operator, which
|
|
|
|
consists of a generic operator, a type suffix, and a size indicator. The type suffixes
|
|
|
|
are:</p>
|
|
|
|
|
|
|
|
<pre>enum {
|
|
|
|
F=FLOAT,
|
|
|
|
I=INT,
|
|
|
|
U=UNSIGNED,
|
|
|
|
P=POINTER,
|
|
|
|
V=VOID,
|
|
|
|
B=STRUCT
|
|
|
|
};
|
|
|
|
|
|
|
|
#define sizeop(n) ((n)<<10)</pre>
|
|
|
|
|
|
|
|
<p>Given a generic operator <code>o</code>, a type suffix <code>t</code>, and a size <code>s</code>,
|
|
|
|
a type- and size-specific operator is formed by <code>o+t+sizeop(s)</code>. For example, <code>ADD+F+sizeop(4)</code>
|
|
|
|
forms the operator <code>ADDF4</code>, which denotes the sum of two 4-byte floats.
|
|
|
|
Similarly, <code>ADD+F+sizeop(8)</code> forms <code>ADDF8</code>, which denotes 8-byte
|
|
|
|
floating addition. In the 3.x code-generation interface, <code>ADDF</code> and <code>ADDD</code>
|
|
|
|
denoted these operations. There was no size indicator in the 3.x operators because the
|
|
|
|
type suffix supplied both a type and a size.</p>
|
|
|
|
|
|
|
|
<p>Table 5.1 lists each generic operator, its valid type suffixes, and the number of <code>kids</code>
|
|
|
|
and <code>syms</code> that it uses; multiple values for <code>kids</code> indicate
|
|
|
|
type-specific variants. The notations in the <strong>syms</strong> column give the number
|
|
|
|
of <code>syms</code> values and a one-letter code that suggests their uses: 1V indicates
|
|
|
|
that <code>syms[0]</code> points to a symbol for a variable, 1C indicates that <code>syms[0]</code>
|
|
|
|
is a constant, and 1L indicates that <code>syms[0]</code> is a label. For 1S, <code>syms[0]</code>
|
|
|
|
is a constant whose value is a size in bytes; 2S adds <code>syms[1]</code>, which is a
|
|
|
|
constant whose value is an alignment. For most operators, the type suffix and size
|
|
|
|
indicator denote the type and size of operation to perform and the type and size of the
|
|
|
|
result.</p>
|
|
|
|
|
|
|
|
<table WIDTH="100%" BORDER="0" CELLPADDING="0" CELLSPACING="0">
|
|
|
|
<tr>
|
|
|
|
<td COLSPAN="6" ALIGN="CENTER"><strong>Table 5.1<img SRC="/~drh/resources/dot_clear.gif"
|
|
|
|
ALT="|" WIDTH="18" HEIGHT="1">Node Operators.</strong></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><strong>syms</strong></td>
|
|
|
|
<td><strong>kids</strong></td>
|
|
|
|
<td><strong>Operator</strong></td>
|
|
|
|
<td><strong>Type Suffixes</strong></td>
|
|
|
|
<td><strong>Sizes</strong></td>
|
|
|
|
<td><strong>Operation</strong></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1V</td>
|
|
|
|
<td>0</td>
|
|
|
|
<td><code>ADDRF</code></td>
|
|
|
|
<td><code>...P..</code></td>
|
|
|
|
<td>p</td>
|
|
|
|
<td>address of a parameter</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1V</td>
|
|
|
|
<td>0</td>
|
|
|
|
<td><code>ADDRG</code></td>
|
|
|
|
<td><code>...P..</code></td>
|
|
|
|
<td>p</td>
|
|
|
|
<td>address of a global</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1V</td>
|
|
|
|
<td>0</td>
|
|
|
|
<td><code>ADDRL</code></td>
|
|
|
|
<td><code>...P..</code></td>
|
|
|
|
<td>p</td>
|
|
|
|
<td>address of a local</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1C</td>
|
|
|
|
<td>0</td>
|
|
|
|
<td><code>CNST</code></td>
|
|
|
|
<td><code>FIUP..</code></td>
|
|
|
|
<td>fdx csilh p</td>
|
|
|
|
<td>constant</td>
|
|
|
|
</tr>
|
|
|
|
<tr ALIGN="LEFT" VALIGN="TOP">
|
|
|
|
<td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="1" HEIGHT="12"></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>BCOM</code></td>
|
|
|
|
<td><code>.IU...</code></td>
|
|
|
|
<td>ilh</td>
|
|
|
|
<td>bitwise complement</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1S</td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>CVF</code></td>
|
|
|
|
<td><code>FI....</code></td>
|
|
|
|
<td>fdx ilh</td>
|
|
|
|
<td>convert from float</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1S</td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>CVI</code></td>
|
|
|
|
<td><code>FIU...</code></td>
|
|
|
|
<td>fdx csilh csilhp</td>
|
|
|
|
<td>convert from signed integer</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1S</td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>CVP</code></td>
|
|
|
|
<td><code>..U..</code></td>
|
|
|
|
<td>p</td>
|
|
|
|
<td>convert from pointer</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1S</td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>CVU</code></td>
|
|
|
|
<td><code>.IUP..</code></td>
|
|
|
|
<td>csilh p</td>
|
|
|
|
<td>convert from unsigned integer</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>INDIR</code></td>
|
|
|
|
<td><code>FIUP.B</code></td>
|
|
|
|
<td>fdx csilh p</td>
|
|
|
|
<td>fetch</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>NEG</code></td>
|
|
|
|
<td><code>FI....</code></td>
|
|
|
|
<td>fdx ilh</td>
|
|
|
|
<td>negation</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="1" HEIGHT="12"></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>ADD</code></td>
|
|
|
|
<td><code>FIUP..</code></td>
|
|
|
|
<td>fdx ilh ilhp p</td>
|
|
|
|
<td>addition</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>BAND</code></td>
|
|
|
|
<td><code>.IU...</code></td>
|
|
|
|
<td>ilh</td>
|
|
|
|
<td>bitwise AND</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>BOR</code></td>
|
|
|
|
<td><code>.IU...</code></td>
|
|
|
|
<td>ilh</td>
|
|
|
|
<td>bitwise inclusive OR</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>BXOR</code></td>
|
|
|
|
<td><code>.IU...</code></td>
|
|
|
|
<td>ilh</td>
|
|
|
|
<td>bitwise exclusive OR</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>DIV</code></td>
|
|
|
|
<td><code>FIU...</code></td>
|
|
|
|
<td>fdx ilh</td>
|
|
|
|
<td>division</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>LSH</code></td>
|
|
|
|
<td><code>.IU...</code></td>
|
|
|
|
<td>ilh</td>
|
|
|
|
<td>left shift</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>MOD</code></td>
|
|
|
|
<td><code>.IU...</code></td>
|
|
|
|
<td>ilh</td>
|
|
|
|
<td>modulus</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>MUL</code></td>
|
|
|
|
<td><code>FIU...</code></td>
|
|
|
|
<td>fdx ilh</td>
|
|
|
|
<td>multiplication</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>RSH</code></td>
|
|
|
|
<td><code>.IU...</code></td>
|
|
|
|
<td>ilh</td>
|
|
|
|
<td>right shift</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>SUB</code></td>
|
|
|
|
<td><code>FIUP..</code></td>
|
|
|
|
<td>fdx ilh ilhp p</td>
|
|
|
|
<td>subtraction</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="1" HEIGHT="12"></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>2S</td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>ASGN</code></td>
|
|
|
|
<td><code>FIUP.B</code></td>
|
|
|
|
<td>fdx csilh p</td>
|
|
|
|
<td>assignment</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1L</td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>EQ</code></td>
|
|
|
|
<td><code>FIU...</code></td>
|
|
|
|
<td>fdx ilh ilhp</td>
|
|
|
|
<td>jump if equal</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1L</td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>GE</code></td>
|
|
|
|
<td><code>FIU...</code></td>
|
|
|
|
<td>fdx ilh ilhp</td>
|
|
|
|
<td>jump if greater than or equal</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1L</td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>GT</code></td>
|
|
|
|
<td><code>FIU...</code></td>
|
|
|
|
<td>fdx ilh ilhp</td>
|
|
|
|
<td>jump if greater than</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1L</td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>LE</code></td>
|
|
|
|
<td><code>FIU...</code></td>
|
|
|
|
<td>fdx ilh ilhp</td>
|
|
|
|
<td>jump if less than or equal</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1L</td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>LT</code></td>
|
|
|
|
<td><code>FIU...</code></td>
|
|
|
|
<td>fdx ilh ilhp</td>
|
|
|
|
<td>jump if less than</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1L</td>
|
|
|
|
<td>2</td>
|
|
|
|
<td><code>NE</code></td>
|
|
|
|
<td><code>FIU...</code></td>
|
|
|
|
<td>fdx ilh ilhp</td>
|
|
|
|
<td>jump if not equal</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>2S</td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>ARG</code></td>
|
|
|
|
<td><code>FIUP.B</code></td>
|
|
|
|
<td>fdx ilh p</td>
|
|
|
|
<td>argument</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1</td>
|
|
|
|
<td>1 or 2</td>
|
|
|
|
<td><code>CALL</code></td>
|
|
|
|
<td><code>FIUPVB</code></td>
|
|
|
|
<td>fdx ilh p</td>
|
|
|
|
<td>function call</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>RET</code></td>
|
|
|
|
<td><code>FIUPV.</code></td>
|
|
|
|
<td>fdx ilh p</td>
|
|
|
|
<td>return from function</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="1" HEIGHT="12"></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
<td></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td></td>
|
|
|
|
<td>1</td>
|
|
|
|
<td><code>JUMP</code></td>
|
|
|
|
<td><code>....V.</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td>unconditional jump</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>1L</td>
|
|
|
|
<td>0</td>
|
|
|
|
<td><code>LABEL</code></td>
|
|
|
|
<td><code>....V.</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td>label definition</td>
|
|
|
|
</tr>
|
|
|
|
</table>
|
|
|
|
|
|
|
|
<p>The entries in the <strong>Sizes</strong> column indicate sizes of the operators that
|
|
|
|
back ends must implement. Letters denote the size of float (f), double (d), long double
|
|
|
|
(x), character (c), short integer (s), integer (i), long integer (l), "long
|
|
|
|
long" integer (h) , and pointer (p). These sizes are separated into sets for each
|
|
|
|
type suffix, except that a single set is used for both I and U when the set for I is
|
|
|
|
identical to the set for U.</p>
|
|
|
|
|
|
|
|
<p>The actual values for the size indicators, fdxcsilhp, depend on the target. A
|
|
|
|
specification like <code>ADDF</code>f denotes the operator <code>ADD+F+sizeop(</code>f<code>)</code>,
|
|
|
|
where "f" is replaced by a target-dependent value, e.g., <code>ADDF4</code> and <code>ADDF8</code>.
|
|
|
|
For example, back ends must implement the following <code>CVI</code> and <code>MUL</code>
|
|
|
|
operators.</p>
|
|
|
|
|
|
|
|
<blockquote>
|
|
|
|
<p><code>CVIF</code>f <code>CVIF</code>d <code>CVIF</code>x<br>
|
|
|
|
<code>CVII</code>c <code>CVII</code>s <code>CVII</code>i <code>CVII</code>l <code>CVII</code>h<br>
|
|
|
|
<code>CVIU</code>c <code>CVIU</code>s <code>CVIU</code>i <code>CVIU</code>l <code>CVIU</code>h
|
|
|
|
<code>CVIU</code>p<br>
|
|
|
|
<br>
|
|
|
|
<code>MULF</code>f <code>MULF</code>d <code>MULF</code>x<br>
|
|
|
|
<code>MULI</code>i <code>MULI</code>l <code>MULI</code>h<br>
|
|
|
|
<code>MULU</code>i <code>MULU</code>l <code>MULU</code>h</p>
|
|
|
|
</blockquote>
|
|
|
|
|
|
|
|
<p>On most platforms, there are fewer than three sizes of floats and six sizes of
|
|
|
|
integers, and pointers are usually the same size as one of the integers. And lcc doesn't
|
|
|
|
support the "long long" type, so h is not currently used. So the set of
|
|
|
|
platform-specific operators is usually smaller than the list above suggests. For example,
|
|
|
|
the X86, SPARC, and MIPS back ends implement the following <code>CVI</code> and <code>MUL</code>
|
|
|
|
operators.</p>
|
|
|
|
|
|
|
|
<blockquote>
|
|
|
|
<p><code>CVIF</code>4 <code>CVIF</code>8<br>
|
|
|
|
<code>CVII</code>1 <code>CVII</code>2 <code>CVII</code>4<br>
|
|
|
|
<code>CVIU</code>1 <code>CVIU</code>2 <code>CVIU</code>4 <br>
|
|
|
|
<br>
|
|
|
|
<code>MULF</code>4 <code>MULF</code>8<br>
|
|
|
|
<code>MULI</code>4<br>
|
|
|
|
<code>MULU</code>4</p>
|
|
|
|
</blockquote>
|
|
|
|
|
|
|
|
<p>The set of operators is thus target-dependent; for example, <code>ADDI8</code> appears
|
|
|
|
only if the target supports an 8-byte integer type. <a
|
|
|
|
HREF="ftp://ftp.cs.princeton.edu/pub/packages/lcc/contrib/ops.c"><code>ops.c</code></a> is
|
|
|
|
a program that, given a set of sizes, prints the required operators and their values,
|
|
|
|
e.g.,</p>
|
|
|
|
|
|
|
|
<blockquote>
|
|
|
|
<pre>% <em>ops c=1 s=2 i=4 l=4 h=4 f=4 d=8 x=8 p=4</em>
|
|
|
|
...
|
|
|
|
CVIF4=4225 CVIF8=8321
|
|
|
|
CVII1=1157 CVII2=2181 CVII4=4229
|
|
|
|
CVIU1=1158 CVIU2=2182 CVIU4=4230
|
|
|
|
...
|
|
|
|
MULF4=4561 MULF8=8657
|
|
|
|
MULI4=4565
|
|
|
|
MULU4=4566
|
|
|
|
...
|
|
|
|
131 operators</pre>
|
|
|
|
</blockquote>
|
|
|
|
|
|
|
|
<p>The type suffix for a conversion operator denotes the type of the result and the size
|
|
|
|
indicator gives the size of the result. For example, <code>CVUI4</code> converts an
|
|
|
|
unsigned (<code>U</code>) to a 4-byte signed integer (<code>I4</code>). The <code>syms[0]</code>
|
|
|
|
field points to a symbol-table entry for a integer constant that gives the size of the
|
|
|
|
source operand. For example, if <code>syms[0]</code> in a <code>CVUI4</code> points to a
|
|
|
|
symbol-table entry for 2, the conversion widens a 2-byte unsigned integer to a 4-byte
|
|
|
|
signed integer. Conversions that widen unsigned integers zero-extend; those that widen
|
|
|
|
signed integers sign-extend.</p>
|
|
|
|
|
|
|
|
<p>The front end composes conversions between types <em>T</em><sub>1</sub> and <em>T</em><sub>2</sub>
|
|
|
|
by widening <em>T</em><sub>1</sub> to it's "supertype", if necessary, converting
|
|
|
|
that result to <em>T</em><sub>2</sub>'s supertype, then narrowing the result to <em>T</em><sub>2</sub>,
|
|
|
|
if necessary. The following table lists the supertypes; omitted entries are their own
|
|
|
|
supertypes.</p>
|
|
|
|
|
|
|
|
<blockquote>
|
|
|
|
<table BORDER="0" CELLPADDING="0" CELLSPACING="0">
|
|
|
|
<tr>
|
|
|
|
<td><strong>Type</strong></td>
|
|
|
|
<td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="24" HEIGHT="1"></td>
|
|
|
|
<td><strong>Supertype</strong></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>signed char</td>
|
|
|
|
<td></td>
|
|
|
|
<td>int</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td>signed short</td>
|
|
|
|
<td></td>
|
|
|
|
<td>int</td>
|
|
|
|
</tr>
|
|
|
|
<tr ALIGN="LEFT" VALIGN="TOP">
|
|
|
|
<td>unsigned char</td>
|
|
|
|
<td></td>
|
|
|
|
<td>int, if sizeof (char) < sizeof (int)<br>
|
|
|
|
unsigned, otherwise</td>
|
|
|
|
</tr>
|
|
|
|
<tr ALIGN="LEFT" VALIGN="TOP">
|
|
|
|
<td>unsigned short</td>
|
|
|
|
<td></td>
|
|
|
|
<td>int, if sizeof (short) < sizeof (int)<br>
|
|
|
|
unsigned, otherwise</td>
|
|
|
|
</tr>
|
|
|
|
<tr ALIGN="LEFT" VALIGN="TOP">
|
|
|
|
<td>void *</td>
|
|
|
|
<td></td>
|
|
|
|
<td>an unsigned type as large as a pointer</td>
|
|
|
|
</tr>
|
|
|
|
</table>
|
|
|
|
</blockquote>
|
|
|
|
|
|
|
|
<p>Pointers are converted to an unsigned type of the same size, even when that type is not
|
|
|
|
one of the integer types.</p>
|
|
|
|
|
|
|
|
<p>For example, the front end converts a signed short to a float by first converting it to
|
|
|
|
an int and then to a float. It converts an unsigned short to an int with a single <code>CVUI</code>i
|
|
|
|
conversion, when shorts are smaller than ints.</p>
|
|
|
|
|
|
|
|
<p>There are now signed and unsigned variants of <code>ASGN</code>, <code>INDIR</code>, <code>BCOM</code>,
|
|
|
|
<code>BOR</code>, <code>BXOR</code>, <code>BAND</code>, <code>ARG</code>, <code>CALL</code>,
|
|
|
|
and <code>RET</code> to simplify code generation on platforms that use different
|
|
|
|
instructions or register set for signed and unsigned operations. Likewise there are now
|
|
|
|
pointer variants of <code>ASGN</code>, <code>INDIR</code>, <code>ARG</code>, <code>CALL</code>,
|
|
|
|
and <code>RET</code>.</p>
|
|
|
|
|
|
|
|
<h2><a NAME="flags">5.6 Interface Flags</a></h2>
|
|
|
|
|
|
|
|
<pre>unsigned unsigned_char:1;</pre>
|
|
|
|
|
|
|
|
<p>tells the front end whether plain characters are signed or unsigned. If it's zero, char
|
|
|
|
is a signed type; otherwise, char is an unsigned type.</p>
|
|
|
|
|
|
|
|
<p>All the interface flags can be set by command-line options, e.g., <code>-Wf-unsigned_char=1</code>
|
|
|
|
causes plain characters to be unsigned.</p>
|
|
|
|
|
|
|
|
<h2><a NAME="definitions">5.8 Definitions</a></h2>
|
|
|
|
|
|
|
|
<p>The front end announces local variables by calling</p>
|
|
|
|
|
|
|
|
<pre>void (*local)(Symbol);</pre>
|
|
|
|
|
|
|
|
<p>It announces temporaries likewise; these have the symbol's <code>temporary</code> flag
|
|
|
|
set, which indicates that the symbol will be used only in the next call to <code>gen</code>.
|
|
|
|
If a temporary's <code>u.t.cse</code> field is nonnull, it points to the node that
|
|
|
|
computes the value assigned to the temporary; see page 346.</p>
|
|
|
|
|
|
|
|
<p>The front end calls</p>
|
|
|
|
|
|
|
|
<pre>void (*address)(Symbol p, Symbol q, long n);</pre>
|
|
|
|
|
|
|
|
<p>to initialize <code>q</code> to a symbol that represents an address of the form <em>x</em>+<code>n</code>,
|
|
|
|
where <em>x</em> is the address represented by <code>p</code> and the long integer <code>n</code>
|
|
|
|
is positive or negative.</p>
|
|
|
|
|
|
|
|
<h2><a NAME="constants">5.9 Constants</a></h2>
|
|
|
|
|
|
|
|
<p>The interface function</p>
|
|
|
|
|
|
|
|
<pre>void (*defconst)(int suffix, int size, Value v);</pre>
|
|
|
|
|
|
|
|
<p>initializes constants. defconst emits directives to define a cell and initialize it to
|
|
|
|
a constant value. v is the constant value, suffix identifies the type of the value, and
|
|
|
|
size is the size of the value in bytes. The value of suffix indicates which field of v
|
|
|
|
holds the value, as shown in the following table.</p>
|
|
|
|
|
|
|
|
<blockquote>
|
|
|
|
<table BORDER="0" CELLPADDING="1" CELLSPACING="1">
|
|
|
|
<tr>
|
|
|
|
<td><strong>suffix</strong></td>
|
|
|
|
<td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="24" HEIGHT="1"></td>
|
|
|
|
<td><strong>v Field</strong></td>
|
|
|
|
<td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="24" HEIGHT="1"></td>
|
|
|
|
<td><strong>size</strong></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><code>F</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td><code>v.d</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td>float, double, long double</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><code>I</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td><code>v.i</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td>signed char, signed short, signed int, signed long</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><code>U</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td><code>v.u</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td>unsigned char, unsigned short, unsigned int, unsigned long</td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><code>P</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td><code>v.p</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td>void *</td>
|
|
|
|
</tr>
|
|
|
|
</table>
|
|
|
|
</blockquote>
|
|
|
|
|
|
|
|
<p><code>defconst</code> must narrow <code>v.</code>x when <code>size</code> is less than <code>sizeof</code>
|
|
|
|
<code>v.</code>x; e.g., to emit an unsigned char, <code>defconst</code> should emit <code>(unsigned
|
|
|
|
char)v.i</code>.</p>
|
|
|
|
|
|
|
|
<h2><a NAME="upcalls">5.12 Upcalls</a></h2>
|
|
|
|
|
|
|
|
<p>lcc 4.x uses standard I/O and its I/O functions have been changed accordingly. lcc
|
|
|
|
reads input from the standard input, emits code to the standard output, and writes
|
|
|
|
diagnostics to the standard error output. It uses <code>freopen</code> to redirect these
|
|
|
|
streams to explicit files, when necessary.</p>
|
|
|
|
|
|
|
|
<p><code>bp</code>, <code>outflush</code>, and <code>outs</code> have been eliminated.</p>
|
|
|
|
|
|
|
|
<pre>extern void fprint(FILE *f, const char *fmt, ...);
|
|
|
|
extern void print(const char *fmt, ...);</pre>
|
|
|
|
|
|
|
|
<p>print formatted data to file <code>f</code> (<code>fprint</code>) or the standard
|
|
|
|
output (<code>print</code>). These functions are like standard C's <code>printf</code> and
|
|
|
|
<code>fprintf</code>, but support only some of the standard conversion specifiers and do
|
|
|
|
not support flags, precision, and field-width specifications. They support the following
|
|
|
|
new conversion specifiers in addition to those described on page 99.</p>
|
|
|
|
|
|
|
|
<blockquote>
|
|
|
|
<table BORDER="0" CELLPADDING="0" CELLSPACING="0">
|
|
|
|
<tr>
|
|
|
|
<td><strong>Specifiers</strong></td>
|
|
|
|
<td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="24" HEIGHT="1"></td>
|
|
|
|
<td><strong>Corresponding printf Specifiers</strong></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><code>%c</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td><code>%c</code></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><code>%d %D</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td><code>%d %ld</code></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><code>%u %U</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td><code>%u %lu</code></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><code>%x %X</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td><code>%x %lx</code></td>
|
|
|
|
</tr>
|
|
|
|
<tr>
|
|
|
|
<td><code>%f %e %g</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td><code>%e %f %g</code></td>
|
|
|
|
</tr>
|
|
|
|
<tr ALIGN="LEFT" VALIGN="TOP">
|
|
|
|
<td><code>%p</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td>Converts the corresponding void * argument to unsigned long and prints it with the <code>printf</code>
|
|
|
|
<code>%#x</code> specifier or just <code>%x</code> when the argument is null.</td>
|
|
|
|
</tr>
|
|
|
|
<tr ALIGN="LEFT" VALIGN="TOP">
|
|
|
|
<td><code>%I</code></td>
|
|
|
|
<td></td>
|
|
|
|
<td>Prints the number of spaces given by the corresponding argument.</td>
|
|
|
|
</tr>
|
|
|
|
</table>
|
|
|
|
</blockquote>
|
|
|
|
|
|
|
|
<pre>#define generic(op) ((op)&0x3F0)
|
|
|
|
#define specific(op) ((op)&0x3FF)</pre>
|
|
|
|
|
|
|
|
<p><code>generic(op)</code> returns the generic variant of <code>op</code>; that is,
|
|
|
|
without its type suffix and size indicator. <code>specific(op)</code> returns the
|
|
|
|
type-specific variant of <code>op</code>; that is, without its size indicator.</p>
|
|
|
|
|
|
|
|
<p><code>newconst</code> has been replaced by</p>
|
|
|
|
|
|
|
|
<pre>extern Symbol intconst(int n);</pre>
|
|
|
|
|
|
|
|
<p>which installs the integer constant <code>n</code> in the symbol table, if necessary,
|
|
|
|
and returns a pointer to the symbol-table entry.</p>
|
|
|
|
|
|
|
|
<hr>
|
|
|
|
|
|
|
|
<address>
|
|
|
|
<a HREF="http://www.research.microsoft.com/~cwfraser/">Chris Fraser</a> / <a
|
|
|
|
HREF="mailto:cwfraser@microsoft.com">cwfraser@microsoft.com</a><br>
|
|
|
|
<a HREF="http://www.research.microsoft.com/~drh/">David Hanson</a> / <a
|
|
|
|
HREF="mailto:drh@microsoft.com">drh@microsoft.com</a><br>
|
|
|
|
$Revision: 145 $ $Date: 2001-10-17 16:53:10 -0500 (Wed, 17 Oct 2001) $
|
|
|
|
</address>
|
|
|
|
</body>
|
|
|
|
</html>
|