source: public/doc/gnu-c/Floating-Representations.html@ 02598c2

Last change on this file since 02598c2 was 02598c2, checked in by Mikhail Kirillov <w96k@…>, on Oct 6, 2022 at 12:36:29 PM

Add gnu-c

  • Property mode set to 100644
File size: 4.8 KB
Line 
1<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
2<html>
3<!-- Copyright (C) 2022 Richard Stallman and Free Software Foundation, Inc.
4
5(The work of Trevis Rothwell and Nelson Beebe has been assigned or
6licensed to the FSF.)
7
8Permission is granted to copy, distribute and/or modify this document
9under the terms of the GNU Free Documentation License, Version 1.3 or
10any later version published by the Free Software Foundation; with the
11Invariant Sections being "GNU General Public License," with the
12Front-Cover Texts being "A GNU Manual," and with the Back-Cover
13Texts as in (a) below. A copy of the license is included in the
14section entitled "GNU Free Documentation License."
15
16(a) The FSF's Back-Cover Text is: "You have the freedom to copy and
17modify this GNU manual. Buying copies from the FSF supports it in
18developing GNU and promoting software freedom." -->
19<!-- Created by GNU Texinfo 6.7, http://www.gnu.org/software/texinfo/ -->
20<head>
21<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
22<title>Floating Representations (GNU C Language Manual)</title>
23
24<meta name="description" content="Floating Representations (GNU C Language Manual)">
25<meta name="keywords" content="Floating Representations (GNU C Language Manual)">
26<meta name="resource-type" content="document">
27<meta name="distribution" content="global">
28<meta name="Generator" content="makeinfo">
29<link href="index.html" rel="start" title="Top">
30<link href="Symbol-Index.html" rel="index" title="Symbol Index">
31<link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
32<link href="Floating-Point-in-Depth.html" rel="up" title="Floating Point in Depth">
33<link href="Floating-Type-Specs.html" rel="next" title="Floating Type Specs">
34<link href="Floating-Point-in-Depth.html" rel="prev" title="Floating Point in Depth">
35<style type="text/css">
36<!--
37a.summary-letter {text-decoration: none}
38blockquote.indentedblock {margin-right: 0em}
39div.display {margin-left: 3.2em}
40div.example {margin-left: 3.2em}
41div.lisp {margin-left: 3.2em}
42kbd {font-style: oblique}
43pre.display {font-family: inherit}
44pre.format {font-family: inherit}
45pre.menu-comment {font-family: serif}
46pre.menu-preformatted {font-family: serif}
47span.nolinebreak {white-space: nowrap}
48span.roman {font-family: initial; font-weight: normal}
49span.sansserif {font-family: sans-serif; font-weight: normal}
50ul.no-bullet {list-style: none}
51-->
52</style>
53
54
55</head>
56
57<body lang="en">
58<span id="Floating-Representations"></span><div class="header">
59<p>
60Next: <a href="Floating-Type-Specs.html" accesskey="n" rel="next">Floating Type Specs</a>, Up: <a href="Floating-Point-in-Depth.html" accesskey="u" rel="up">Floating Point in Depth</a> &nbsp; [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Symbol-Index.html" title="Index" rel="index">Index</a>]</p>
61</div>
62<hr>
63<span id="Floating_002dPoint-Representations"></span><h3 class="section">28.1 Floating-Point Representations</h3>
64<span id="index-floating_002dpoint-representations"></span>
65<span id="index-representation-of-floating_002dpoint-numbers"></span>
66
67<span id="index-IEEE-754_002d2008-Standard"></span>
68<p>Storing numbers as <em>floating point</em> allows representation of
69numbers with fractional values, in a range larger than that of
70hardware integers. A floating-point number consists of a sign bit, a
71<em>significand</em> (also called the <em>mantissa</em>), and a power of a
72fixed base. GNU C uses the floating-point representations specified by
73the <cite>IEEE 754-2008 Standard for Floating-Point Arithmetic</cite>.
74</p>
75<p>The IEEE 754-2008 specification defines basic binary floating-point
76formats of five different sizes: 16-bit, 32-bit, 64-bit, 128-bit, and
77256-bit. The formats of 32, 64, and 128 bits are used for the
78standard C types <code>float</code>, <code>double</code>, and <code>long double</code>.
79GNU C supports the 16-bit floating point type <code>_Float16</code> on some
80platforms, but does not support the 256-bit floating point type.
81</p>
82<p>Each of the formats encodes the floating-point number as a sign bit.
83After this comes an exponent that specifies a power of 2 (with a fixed
84offset). Then comes the significand.
85</p>
86<p>The first bit of the significand, before the binary point, is always
871, so there is no need to store it in memory. It is called the
88<em>hidden bit</em> because it doesn&rsquo;t appear in the floating-point
89number as used in the computer itself.
90</p>
91<p>All of those floating-point formats are sign-magnitude representations,
92so +0 and -0 are different values.
93</p>
94<p>Besides the IEEE 754 format 128-bit float, GNU C also offers a format
95consisting of a pair of 64-bit floating point numbers. This lacks the
96full exponent range of the IEEE 128-bit format, but is useful when the
97underlying hardware platform does not support that.
98</p>
99
100
101
102</body>
103</html>
Note: See TracBrowser for help on using the repository browser.