Fixed-Point Numbers

Fixed-point numbers are a way of representing real numbers by fixing the position of the decimal point in advance and using a positional numeral system. Given $n$ places for digits, the idea is to write a real number $v$ in the form

v = (d_{n - 1} \cdot b^{n - 1} + \dots + d_{1} \cdot b^{1} + d_{0} \cdot b^{0}) \cdot b^{r},

where $d_{k}$ is the digits at the $k$ -th place going right to left, $b$ is the chosen base and $r \in Z$ is the radix which determines the position of the decimal point.

NOTE

Despite the name, $r$ has nothing to do with the base $b$ .

The radix $r$ determines the position of the decimal point in the following way:

When $r = 0$ , the number is multiplied with $b^{0} = 1$ , i.e. it remains just $v = d_{n - 1} \cdot b^{n - 1} + \dots + d_{1} \cdot b^{1} + d_{0} \cdot b^{0}$ , meaning that $v$ is an integer.
When $r > 0$ , the number is multiplied with $b^{r} > 1$ . Its digits are shifted $r$ places towards the most significant digit and the gaps are filled with zeros. The decimal point disappears, which means that we sacrifice precision but can represent larger numbers.
When $r < 0$ , the number is multiplied with $b^{r} < 1$ . The decimal point moves $r$ places towards the most significant digit. We increase the precision, but the largest representable number becomes smaller.

The distance between consecutive representable numbers using this format is constant and is equal to $b^{r}$ .

Algebraic Sign

So far, the fixed-point numbers format can only represent non-negative (unsigned) numbers. We thus need to find a way to extend it to allow for negative (signed) numbers. There are three main ways to do this when $b = 2$ :

sign and absolute value;
ones’ complement;
twos’ complement.

Sign and Absolute Value

In the sign and absolute value system, we treat the first bit as the sign: $1$ indicates a negative sign, while $0$ indicates a positive sign. The rest of the bits are treated as the absolute value of the number.

For example, $001 0_{2}$ represents $2$ , while $101 0_{2}$ represents $- 2$ .

The main advantage of this format is its simplicity. However, it comes with the problem that there are always two ways to represent zero, namely $10 \dots 0_{2}$ and $00 \dots 0_{2}$ . This makes computations more difficult for computers.

Ones’ Complement

In the ones’ complement system, the negative of a number is constructed by inverting all its bits. For example, $001 0_{2}$ represents $2$ , while $110 1_{2}$ represents $- 2$ . This has the advantage that addition and subtraction work in (mostly) the same way as usual:

001 1_{2} 3 + + 001 0_{2} 2 = = 010 1_{2} 5

101 1_{2} - 4 + + 001 0_{2} 2_{10} = = 110 1_{2} - 2

However, this format still has the problem that there are two ways to represent zero, namely $1 \dots 1$ and $0 \dots 0$ . Moreover, adding $1 \dots 1_{2}$ ( $- 0$ ) to $00 \dots 1_{2}$ ( $1$ ) results in $0 \dots 0_{2}$ ( $0$ ) instead of $00 \dots 1_{2}$ ( $1$ ).

Two’s Complement

In the two’s complement system, the negative of a number is built by first constructing its ones’ complement and then adding the $1_{2}$ to it. For example, $001 0_{2}$ represents $2$ , while $111 0_{2}$ represents $- 2$ .

This solves all problems with the double representation of zero and so addition and subtraction always work as possible. The only problem is that it makes the positive and negative numbers asymmetric: for any fixed number of bits, the total number of representable negative numbers is always greater by one than the total number of representable positive numbers.

IMPORTANT

All modern computers use two’s complement.

Razon

Explorer

Fixed-Point Numbers

Fixed-Point Numbers

Algebraic Sign

Sign and Absolute Value

Ones’ Complement

Two’s Complement

Graph View

Table of Contents