Multiplication ALU

In digital design, a multiplier or multiplication ALU is a hardware circuit dedicated to multiplying two binary values.

A variety of techniques can be used to implement a digital multiplier. Most techniques involve computing a set of "partial products", and then summing the partial products together. This process is similar to the method taught to primary schoolchildren for conducting long multiplication on base-10 integers, but has been modified here for application to a base-2 (binary) numeral system.

Multiplication basics

In basic school we learn a multiplication method for multiplying decimal numbers, which is based on calculating partial products, shifting them to the left and then adding them together. The most difficult part is to obtain the partial products, as that involves multiplying a long number by one digit (from 0 to 9):

123 x 456 =

738 (this is 123 x 6) 615 (this is 123 x 5, shifted one position to the left) + 492 (this is 123 x 4, shifted two positions to the left) =

56088

A binary computer does exactly the same, but with binary numbers. In binary encoding each long number is multiplied by one digit (either 0 or 1), and that is much easier than in decimal, as the product by 0 or 1 is just 0 or the same number. Therefore, the multiplication of two binary numbers comes down to calculating partial products (which are 0 or the first number), shifting them left, and then adding them together (a binary addition, of course):

1011 (this is 11 in decimal) x 1110 (this is 14 in decimal) =

0000 (this is 1011 x 0) 1011 (this is 1011 x 1, shifted one position to the left) 1011 (this is 1011 x 1, shifted two positions to the left) + 1011 (this is 1011 x 1, shifted three positions to the left) =

10011010 (this is 154 in decimal)

This is much simpler than in the decimal system, as there is no table of multiplication to remember: just shifts and adds.

This method is mathematically correct, but it has two serious engineering problems. The first is that it involves 32 intermediate adds in a 32 bits computer, or 64 intermediate adds in a 64 bits computer. These additions take a lot of time. The engineering implementation of binary multiplication consists, really, of taking a very simple mathematical process and complicating it a lot, in order to do fewer additions; a modern processor can multiply two 64 bit numbers with 16 additions (rather than 64), and can do several steps in parallel -- but at a cost of making the process almost unreadable.

The second problem is that the basic school method handles the sign with a separate rule ("+ with + yields +", "+ with - yields -", etc..). Modern computers embed the sign of the number in the number itself, usually in the two's complement representation. That forces the multiplication process to be adapted to handle two's complement numbers, and that complicates the process a bit more. Similarly, processors that use one's complement, sign-and-magnitude, IEEE-754 and other binary representations require specific adjustments to the multiplication process.

Engineering approach: an unsigned example

For example, suppose we want to multiply two unsigned eight bit integers together: "a" [7:0] and "b" [7:0] . We can produce eight partial products by performing eight one-bit multiplications, one for each bit in multiplicand "a": p0 [7:0] = a [0] × b [7:0] = {8{a [0] & b [7:0] p1 [7:0] = a [1] × b [7:0] = {8{a [1] & b [7:0] p2 [7:0] = a [2] × b [7:0] = {8{a [2] & b [7:0] p3 [7:0] = a [3] × b [7:0] = {8{a [3] & b [7:0] p4 [7:0] = a [4] × b [7:0] = {8{a [4] & b [7:0] p5 [7:0] = a [5] × b [7:0] = {8{a [5] & b [7:0] p6 [7:0] = a [6] × b [7:0] = {8{a [6] & b [7:0] p7 [7:0] = a [7] × b [7:0] = {8{a [7] & b [7:0]

where {8{a [0] means repeating a [0] (the 0th bit of a) 8 times (Verilog notation).

To produce our product, we then need to add up all eight of our partial products, as shown here: p0 [7] p0 [6] p0 [5] p0 [4] p0 [3] p0 [2] p0 [1] p0 [0] + p1 [7] p1 [6] p1 [5] p1 [4] p1 [3] p1 [2] p1 [1] p1 [0] 0 + p2 [7] p2 [6] p2 [5] p2 [4] p2 [3] p2 [2] p2 [1] p2 [0] 0 0 + p3 [7] p3 [6] p3 [5] p3 [4] p3 [3] p3 [2] p3 [1] p3 [0] 0 0 0 + p4 [7] p4 [6] p4 [5] p4 [4] p4 [3] p4 [2] p4 [1] p4 [0] 0 0 0 0 + p5 [7] p5 [6] p5 [5] p5 [4] p5 [3] p5 [2] p5 [1] p5 [0] 0 0 0 0 0 + p6 [7] p6 [6] p6 [5] p6 [4] p6 [3] p6 [2] p6 [1] p6 [0] 0 0 0 0 0 0 + p7 [7] p7 [6] p7 [5] p7 [4] p7 [3] p7 [2] p7 [1] p7 [0] 0 0 0 0 0 0 0 ------------------------------------------------------------------------------------------- P [15] P [14] P [13] P [12] P [11] P [10] P [9] P [8] P [7] P [6] P [5] P [4] P [3] P [2] P [1] P [0]

In other words, "P" [15:0] is produced by summing "p"0, "p"1 << 1, "p"2 << 2, and so forth, to produce our final unsigned 16-bit product.

Engineering approach: signed integers

If "b" had been a signed integer instead of an unsigned integer, then the partial products would need to have been sign-extended up to the width of the product before summing. If "a" had been a signed integer, then partial product "p7" would need to be subtracted from the final sum, rather than added to it.

The above array multiplier can be modified to support two's complement notation signed numbers by inverting several of the product terms and inserting a one to the left of the first partial product term:

1 -p0 [7] p0 [6] p0 [5] p0 [4] p0 [3] p0 [2] p0 [1] p0 [0] -p1 [7] +p1 [6] +p1 [5] +p1 [4] +p1 [3] +p1 [2] +p1 [1] +p1 [0] 0 -p2 [7] +p2 [6] +p2 [5] +p2 [4] +p2 [3] +p2 [2] +p2 [1] +p2 [0] 0 0 -p3 [7] +p3 [6] +p3 [5] +p3 [4] +p3 [3] +p3 [2] +p3 [1] +p3 [0] 0 0 0 -p4 [7] +p4 [6] +p4 [5] +p4 [4] +p4 [3] +p4 [2] +p4 [1] +p4 [0] 0 0 0 0 -p5 [7] +p5 [6] +p5 [5] +p5 [4] +p5 [3] +p5 [2] +p5 [1] +p5 [0] 0 0 0 0 0 -p6 [7] +p6 [6] +p6 [5] +p6 [4] +p6 [3] +p6 [2] +p6 [1] +p6 [0] 0 0 0 0 0 0 1 +p7 [7] -p7 [6] -p7 [5] -p7 [4] -p7 [3] -p7 [2] -p7 [1] -p7 [0] 0 0 0 0 0 0 0 ------------------------------------------------------------------------------------------------------------ P [15] P [14] P [13] P [12] P [11] P [10] P [9] P [8] P [7] P [6] P [5] P [4] P [3] P [2] P [1] P [0]

Do not be misled by the minus sign (-) notation above. It does not mean arithmetic negation ( -(7) = -7), but instead binary complementation, more commonly denoted xprime (read x prime), ar x (x bar) or ~x (tilde x), achieved by flipping all the bits. In two's complement to get the negation of a number, you complement the number then add 1, so they are NOT equivalent.

There are a lot of simplifications in the bit array above that are not shown and are not obvious, so I'll try to explain a few. The sequences of one complemented bit followed by noncomplemented bits are just implementing a two's complement trick to avoid sign extension. The sequence of p7 (noncomplemented bit followed by all complemented bits) is because we're subtracting this term so they were all negated to start out with (and a 1 was added in the least significant position). For both types of sequences, the last bit is flipped and an implicit -1 should be added directly below the MSB. When the +1 from the two's complement negation for p7 in bit position 0 (LSB) and all the -1's in bit columns 7 through 15 (where each of the MSBs are located) are added together, they can be simplified to the single 1 that "magically" is floating out to the left. For an explanation and proof of why flipping the MSB saves us the sign extension, see a computer arithmetic book.

Implementations

Older multiplier architectures employed a shifter and accumulator to sum each partial product, often one partial product per cycle, trading off speed for die area. Modern multiplier architectures use the Baugh-Wooley algorithm, Wallace trees, or Dadda multipliers to add the partial products together in a single cycle. The performance of the Wallace tree implementation is sometimes improved by "modified" Booth encoding one of the two multiplicands, which reduces the number of partial products that must be summed.

See also

* Booth's multiplication algorithm
* Multiply-accumulate (Fused multiply-add)
* Wallace tree
* binary multiplier

External links

* [http://www.andraka.com/multipli.htm Multiplier Designs] targeted at FPGAs

Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать реферат

Look at other dictionaries:

Multiplication — Multiply redirects here. For other uses, see Multiplication (disambiguation). For methods of computing products, including those of very large numbers, see Multiplication algorithm. Four bags of three marbles gives twelve marbles. There are also… … Wikipedia
Booth's multiplication algorithm — is a multiplication algorithm that multiplies two signed binary numbers in two s complement notation.ProcedureBooth s algorithm involves repeatedly adding one of two predetermined values A and S to a product P , then performing a rightward… … Wikipedia
Arithmetic logic unit — schematic symbol Cascadable 8 … Wikipedia
List of mathematics articles (M) — NOTOC M M estimator M group M matrix M separation M set M. C. Escher s legacy M. Riesz extension theorem M/M/1 model Maass wave form Mac Lane s planarity criterion Macaulay brackets Macbeath surface MacCormack method Macdonald polynomial Machin… … Wikipedia
Chris Wallace (computer scientist) — Professor Christopher Stewart Wallace (26 October 1933 7 August 2004) was an Australian computer scientist (and physicist, who also contributed to a variety of other areas) notable for having devised: The minimum message length principle (Wallace … Wikipedia
List of electronics topics — Alphabetization has been neglected in some parts of this article (the b section in particular). You can help by editing it. This is a list of communications, computers, electronic circuits, fiberoptics, microelectronics, medical electronics,… … Wikipedia
computer — computerlike, adj. /keuhm pyooh teuhr/, n. 1. Also called processor. an electronic device designed to accept data, perform prescribed mathematical and logical operations at high speed, and display the results of these operations. Cf. analog… … Universalium
Calculator — For mechanical precursors to the modern calculator, see mechanical calculator. For other uses, see Calculator (disambiguation). An electronic pocket calculator with a 7‑segment LCD display, that can perform basic arithmetic operations … Wikipedia
computer science — computer scientist. the science that deals with the theory and methods of processing information in digital computers, the design of computer hardware and software, and the applications of computers. [1970 75] * * * Study of computers, their… … Universalium
Intel Pentium 4 — Pour les articles homonymes, voir P4. Pentium 4 Processeur Pentium 4 (Northwood) / 2,40 GHz. A gauche le die est visible, à droite s … Wikipédia en Français

Academic Dictionaries and Encyclopedias

Multiplication ALU

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Multiplication ALU

Look at other dictionaries:

Share the article and excerpts

Direct link