Quadruple precision

Quadruple precision

In computing, quadruple precision (also commonly shortened to quad precision) is a computer numbering format that occupies four storage locations in computer memory at address, address+1, address+2, and address+3. A quad-precision number, sometimes simply a quad, may be defined to be an integer, fixed point, or floating point.

In IEEE 754-2008 this 128-bit format is officially referred to as binary128. It is the fourth basic binary floating point interchange format, together with the 64-bit double precision, the 32-bit single precision, and the 16-bit half precision formats.

Quadruple precision memory format

Sign bit: 1 Exponent width: 15 Significand precision: 112 (113 implicit)

The format is written with an implicit integer bit with value 1 unless the written exponent is all zeros. Thus only 112 bits of the fraction appear in the memory format, but the total precision is 113 bits (approximately 34 decimal digits, log_{10}(2^{113}) approx 34.016).

syyy yyyy yyyy yyyy xxxx xxxx xxxx xxxx … xxxx xxxx (112 xs)

Exponent encodings

Emin (0x0001) = -16382 Emax (0x7ffe) = 16383 Exponent bias (0x3fff) = 16383The true exponent = written exponent - exponent bias

0x0000 and 0x7fff are reserved exponents 0x0000 is used to represent zero and denormals 0x7fff is used to represent infinity and NaNs

All bit patterns are valid encodings.

Quadruple precision examples in hexadecimal

3fff 0000 0000 0000 0000 0000 0000 0000 = 1

c000 0000 0000 0000 0000 0000 0000 0000 = -2

7ffe ffff ffff ffff ffff ffff ffff ffff ~ 1.189731495357231765085759326628007 x 104932 (Max Quad) 3ffd 5555 5555 5555 5555 5555 5555 5555 ~ 1/3By default, 1/3 rounds down like double precision, because of the odd number of bits in the significand.So the bits beyond the rounding point are 0101... which is less than 1/2 of a unit in the last place.

0000 0000 0000 0000 0000 0000 0000 0000 = 0 8000 0000 0000 0000 0000 0000 0000 0000 = -0

7fff 0000 0000 0000 0000 0000 0000 0000 = Infinity ffff 0000 0000 0000 0000 0000 0000 0000 = -Infinity

See also

* half precision – single precision – double precision – quadruple precision
* Floating point
* long double

External links

* [http://crd.lbl.gov/~dhbailey/mpdist/ High-Precision Software Directory]


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать курсовую

Look at other dictionaries:

  • Quadruple-precision floating-point format — In computing, quadruple precision (also commonly shortened to quad precision) is a binary floating point computer number format that occupies 16 bytes (128 bits) in computer memory. In IEEE 754 2008 the 128 bit base 2 format is officially… …   Wikipedia

  • Extended precision — The term extended precision refers to storage formats for floating point numbers taking advantage of an opportunity not falling in to a regular sequence of single, double and quadruple precision such as 32 bit, 64 bit and 128 bit occupying two,… …   Wikipedia

  • Double precision — In computing, double precision is a computer numbering format that occupies two adjacent storage locations in computer memory. A double precision number, sometimes simply called a double, may be defined to be an integer, fixed point, or floating… …   Wikipedia

  • Single precision — In computing, single precision is a computer numbering format that occupies one storage location in computer memory at a given address. A single precision number, sometimes simply a single, may be defined to be an integer, fixed point, or… …   Wikipedia

  • Half precision — In computing, half precision is a computer numbering format that occupies only half of one storage location (word) in computer memory at some address. A half precision number may be defined to be a binary or decimal integer, fixed point, or… …   Wikipedia

  • Double-precision floating-point format — In computing, double precision is a computer number format that occupies two adjacent storage locations in computer memory. A double precision number, sometimes simply called a double, may be defined to be an integer, fixed point, or floating… …   Wikipedia

  • Floating point — In computing, floating point describes a method of representing real numbers in a way that can support a wide range of values. Numbers are, in general, represented approximately to a fixed number of significant digits and scaled using an exponent …   Wikipedia

  • Kahan summation algorithm — In numerical analysis, the Kahan summation algorithm (also known as compensated summation) significantly reduces the numerical error in the total obtained by adding a sequence of finite precision floating point numbers, compared to the obvious… …   Wikipedia

  • Comparison of programming languages (basic instructions) — Programming language comparisons General comparison Basic syntax Basic instructions Arrays Associative arrays String operations …   Wikipedia

  • C data types — C Standard Library Data types Character classification Strings Mathematics File input/output Date/time Memory allocation …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”