Signed number representations

In computing, signed number representations are required to encode negative numbers in binary number systems.

In mathematics, negative numbers in any base are represented by prefixing them with a minus ("−") sign. However, in computer hardware, numbers are represented only as sequences of bits, without extra symbols. The four best-known methods of extending the binary numeral system to represent signed numbers are: sign-and-magnitude, ones' complement, two's complement, and excess-K. Some of the alternative methods use implicit instead of explicit signs, such as negative binary, using the base −2. Corresponding methods can be devised for other bases, whether positive, negative, fractional, or other elaborations on such themes.

There is no definitive criterion by which any of the representations is universally superior. The representation used in most current computing devices is two's complement, although the Unisys ClearPath Dorado series mainframes use ones' complement.

History

The early days of digital computing were marked by a lot of competing ideas about both hardware technology and mathematics technology (numbering systems). One of the great debates was the format of negative numbers, with some of the era's most expert people having very strong and different opinions. One camp supported two's complement, the system that is dominant today. Another camp supported ones' complement, where any positive value is made into its negative equivalent by inverting all of the bits in a word. A third group supported "sign & magnitude" (sign-magnitude), where a value is changed from positive to negative simply by toggling the word's sign (high-order) bit.

There were arguments for and against each of the systems. Sign & magnitude allowed for easier tracing of memory dumps (a common process 40 years ago) as small numeric values use fewer 1 bits. Internally, these systems did ones' complement math so numbers would have to be converted to ones' complement values when they were transmitted from a register to the math unit and then converted back to sign-magnitude when the result was transmitted back to the register. The electronics required more gates than the other systems – a key concern when the cost and packaging of discrete transistors was critical. IBM was one of the early supporters of sign-magnitude, with their 704, 709 and 709x series computers being perhaps the best known systems to use it.

Ones' complement allowed for somewhat simpler hardware designs as there was no need to convert values when passed to and from the math unit. But it also shared an undesirable characteristic with sign-magnitude – the ability to represent negative zero (−0). Negative zero behaves exactly like positive zero; when used as an operand in any calculation, the result will be the same whether an operand is positive or negative zero. The disadvantage, however, is that the existence of two forms of the same value necessitates two rather than a single comparison when checking for equality with zero. Ones' complement subtraction can also result in an end-around borrow (described below). It can be argued that this makes the addition/subtraction logic more complicated or that it makes it simpler as a subtraction requires simply inverting the bits of the second operand as it is passed to the adder. The PDP-1, CDC 160 series, CDC 3000 series, CDC 6000 series, UNIVAC 1100 series, and the LINC computer use ones' complement representation.

Two's complement is the easiest to implement in hardware, which may be the ultimate reason for its widespread popularity. Processors on the early mainframes often consisted of thousands of transistors – eliminating a significant number of transistors was a significant cost savings. Mainframes such as the IBM System/360, the GE-600 series,^[1] and the PDP-6 and PDP-10 use two's complement, as did minicomputers such as the PDP-5 and PDP-8 and the PDP-11 and VAX. The architects of the early integrated circuit-based CPUs (Intel 8080, etc.) chose to use two's complement math. As IC technology advanced, virtually all adopted two's complement technology. x86,^[2] m68k, Power Architecture,^[3] MIPS, SPARC, ARM, Itanium, PA-RISC, and DEC Alpha processors are all two's complement.

Signed magnitude representation

This representation is also called "sign–magnitude" or "sign and magnitude" representation. In this approach, the problem of representing a number's sign can be to allocate one sign bit to represent the sign: setting that bit (often the most significant bit) to 0 is for a positive number or positive zero, and setting it to 1 is for a negative number or negative zero. The remaining bits in the number indicate the magnitude (or absolute value). Hence, in a byte with only seven bits (apart from the sign bit), the magnitude can range from 0000000 (0) to 1111111 (127). Thus numbers ranging from −127₁₀ to +127₁₀ can be represented once the sign bit (the eighth bit) is added. A consequence of this representation is that there are two ways to represent zero, 00000000 (0) and 10000000 (−0). This way, −43₁₀ encoded in an eight-bit byte is 10101011.

This approach is directly comparable to the common way of showing a sign (placing a "+" or "−" next to the number's magnitude). Some early binary computers (e.g., IBM 7090) use this representation, perhaps because of its natural relation to common usage. Signed magnitude is the most common way of representing the significand in floating point values.

Ones' complement

Eight-bit ones' complement
Binary value	Ones' complement interpretation	Unsigned interpretation
00000000	+0	0
00000001	1	1
⋮	⋮	⋮
01111101	125	125
01111110	126	126
01111111	127	127
10000000	−127	128
10000001	−126	129
10000010	−125	130
⋮	⋮	⋮
11111101	−2	253
11111110	−1	254
11111111	−0	255

Main article: Ones' complement

Alternatively, a system known as ones' complement can be used to represent negative numbers. The ones' complement form of a negative binary number is the bitwise NOT applied to it — the "complement" of its positive counterpart. Like sign-and-magnitude representation, ones' complement has two representations of 0: 00000000 (+0) and 11111111 (−0).

As an example, the ones' complement form of 00101011 (43₁₀) becomes 11010100 (−43₁₀). The range of signed numbers using ones' complement is represented by −(2^N−1 − 1) to (2^N−1 − 1) and ±0. A conventional eight-bit byte is −127₁₀ to +127₁₀ with zero being either 00000000 (+0) or 11111111 (−0).

To add two numbers represented in this system, one does a conventional binary addition, but it is then necessary to do an end-around carry: that is, add any resulting carry back into the resulting sum. To see why this is necessary, consider the following example showing the case of the addition of −1 (11111110) to +2 (00000010):

          binary    decimal
        11111110     −1
     +  00000010     +2
     ───────────     ──
      1 00000000      0   ← Not the correct answer
               1     +1   ← Add carry
     ───────────     ──
        00000001      1   ← Correct answer

In the previous example, the first binary addition gives 00000000, which is incorrect. The correct result (00000001) only appears when the carry is added back in.

A remark on terminology: The system is referred to as "ones' complement" because the negation of a positive value x (represented as the bitwise NOT of x) can also be formed by subtracting x from the ones' complement representation of zero that is a long sequence of ones (−0). Two's complement arithmetic, on the other hand, forms the negation of x by subtracting x from a single large power of two that is congruent to +0.^[4] Therefore, ones' complement and two's complement representations of the same negative value will differ by one.

Note that the ones' complement representation of a negative number can be obtained from the sign-magnitude representation merely by bitwise complementing the magnitude.

Two's complement

Eight-bit two's complement
Binary value	Two's complement interpretation	Unsigned interpretation
00000000	0	0
00000001	1	1
⋮	⋮	⋮
01111110	126	126
01111111	127	127
10000000	−128	128
10000001	−127	129
10000010	−126	130
⋮	⋮	⋮
11111110	−2	254
11111111	−1	255

Main article: Two's complement

The problems of multiple representations of 0 and the need for the end-around carry are circumvented by a system called two's complement. In two's complement, negative numbers are represented by the bit pattern which is one greater (in an unsigned sense) than the ones' complement of the positive value.

In two's-complement, there is only one zero, represented as 00000000. Negating a number (whether negative or positive) is done by inverting all the bits and then adding one to that result.^[5] This actually reflects the ring structure on all integers modulo 2^N: $\mathbb {Z} /2^{N}\mathbb {Z}$ . Addition of a pair of two's-complement integers is the same as addition of a pair of unsigned numbers (except for detection of overflow, if that is done); the same is true for subtraction and even for N lowest significant bits of a product (value of multiplication). For instance, a two's-complement addition of 127 and −128 gives the same binary bit pattern as an unsigned addition of 127 and 128, as can be seen from the 8-bit two's complement table.

An easier method to get the negation of a number in two's complement is as follows:

	Example 1	Example 2
1. Starting from the right, find the first "1"	00101001	00101100
2. Invert all of the bits to the left of that one	11010111	11010100

Method two:

Invert all the bits through the number
Add one

Example: for +2, which is 00000010 in binary (the ~ character is the C bitwise NOT operator, so ~X means "invert all the bits in X"):

~00000010 → 11111101
11111101 + 1 → 11111110 (−2 in two's complement)

Excess-`K`

Eight-bit excess-128
Binary value	Excess-128 interpretation	Unsigned interpretation
00000000	−128	0
00000001	−127	1
⋮	⋮	⋮
01111111	-1	127
10000000	0	128
10000001	1	129
⋮	⋮	⋮
11111111	+127	255

Main article: Excess-K

Excess-K, also called offset binary or biased representation, uses a pre-specified number K as a biasing value. A value is represented by the unsigned number which is K greater than the intended value. Thus 0 is represented by K, and −K is represented by the all-zeros bit pattern. This can be seen as a slight modification and generalization of the aforementioned two's-complement, which is virtually the excess-(2^N−1 − 1) representation with negated most significant bit.

Biased representations are now primarily used for the exponent of floating-point numbers. The IEEE floating-point standard defines the exponent field of a single-precision (32-bit) number as an 8-bit excess-127 field. The double-precision (64-bit) exponent field is an 11-bit excess-1023 field; see exponent bias. It also had use for binary-coded decimal numbers as excess-3.

Base −2

Comparison table

The following table shows the positive and negative integers that can be represented using four bits.

Four-bit integer representations
Decimal	Unsigned	Sign and magnitude	Ones' complement	Two's complement	Excess-8 (biased)	Base −2
+16	N/A	N/A	N/A	N/A	N/A	N/A
+15	1111	N/A	N/A	N/A	N/A	N/A
+14	1110	N/A	N/A	N/A	N/A	N/A
+13	1101	N/A	N/A	N/A	N/A	N/A
+12	1100	N/A	N/A	N/A	N/A	N/A
+11	1011	N/A	N/A	N/A	N/A	N/A
+10	1010	N/A	N/A	N/A	N/A	N/A
+9	1001	N/A	N/A	N/A	N/A	N/A
+8	1000	N/A	N/A	N/A	N/A	N/A
+7	0111	0111	0111	0111	1111	N/A
+6	0110	0110	0110	0110	1110	N/A
+5	0101	0101	0101	0101	1101	0101
+4	0100	0100	0100	0100	1100	0100
+3	0011	0011	0011	0011	1011	0111
+2	0010	0010	0010	0010	1010	0110
+1	0001	0001	0001	0001	1001	0001
+0	0000	0000	0000	0000	1000	0000
−0	0000	1000	1111	0000	1000	0000
−1	N/A	1001	1110	1111	0111	0011
−2	N/A	1010	1101	1110	0110	0010
−3	N/A	1011	1100	1101	0101	1101
−4	N/A	1100	1011	1100	0100	1100
−5	N/A	1101	1010	1011	0011	1111
−6	N/A	1110	1001	1010	0010	1110
−7	N/A	1111	1000	1001	0001	1001
−8	N/A	N/A	N/A	1000	0000	1000
−9	N/A	N/A	N/A	N/A	N/A	1011
−10	N/A	N/A	N/A	N/A	N/A	1010
−11	N/A	N/A	N/A	N/A	N/A	N/A

Same table, as viewed from "given these binary bits, what is the number as interpreted by the representation system":

Binary	Unsigned	Sign and magnitude	Ones' complement	Two's complement	Excess-8	Base −2
0000	0	0	0	0	−8	0
0001	1	1	1	1	−7	1
0010	2	2	2	2	−6	−2
0011	3	3	3	3	−5	−1
0100	4	4	4	4	−4	4
0101	5	5	5	5	−3	5
0110	6	6	6	6	−2	2
0111	7	7	7	7	−1	3
1000	8	−0	−7	−8	0	−8
1001	9	−1	−6	−7	1	−7
1010	10	−2	−5	−6	2	−10
1011	11	−3	−4	−5	3	−9
1100	12	−4	−3	−4	4	−4
1101	13	−5	−2	−3	5	−3
1110	14	−6	−1	−2	6	−6
1111	15	−7	−0	−1	7	−5

Other systems

Google's Protocol Buffers "zig-zag encoding" is a system similar to sign-and-magnitude, but uses the least significant bit to represent the sign and has a single representation of zero. This has the advantage to make variable-length quantity encoding efficient with signed integers.^[6]

Another approach is to give each digit a sign, yielding the signed-digit representation. For instance, in 1726, John Colson advocated reducing expressions to "small numbers", numerals 1, 2, 3, 4, and 5. In 1840, Augustin Cauchy also expressed preference for such modified decimal numbers to reduce errors in computation.

References

↑ GE-625 / 635 Programming Reference Manual. General Electric. January 1966. Retrieved August 15, 2013.
↑ Intel 64 and IA-32 Architectures Software Developer’s Manual (PDF). Intel. Section 4.2.1. Retrieved August 6, 2013.
↑ Power ISA Version 2.07. Power.org. Section 1.4. Retrieved August 6, 2013. ,
↑ Donald Knuth: The Art of Computer Programming, Volume 2: Seminumerical Algorithms, chapter 4.1
↑ Thomas Finley (April 2000). "Two's Complement". Cornell University. Retrieved 15 September 2015.
↑ Protocol Buffers: Signed Integers

Ivan Flores, The Logic of Computer Arithmetic, Prentice-Hall (1963)
Israel Koren, Computer Arithmetic Algorithms, A.K. Peters (2002), ISBN 1-56881-160-8

This article is issued from Wikipedia - version of the 10/14/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.