ERROR CHECKING AND THE STRUCTURE OF BINARY ADDITION

By

Harvey Louis Garner

A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the University of Michigan

1958

July, 1958

IP-305
Doctoral Committee:

Professor Alan B. Macnee, Co-Chairman
Professor Norman R. Scott, Co-Chairman
Professor Arthur W. Burks
Professor Harry H. Goode
Professor Gunnar Hok
ACKNOWLEDGMENT

The author wishes to express his gratitude to all the members of his doctoral committee. In particular, the author is indebted to the committee for the critical review afforded the first draft copy of this dissertation.

The author wishes to express his gratitude for the encouragement and aid given by his wife during the period in which this dissertation was being written and prepared for publication.

Last, an expression of thanks is due to the Industry Program of the College of Engineering for the excellent services rendered in the preparation and reproduction of this dissertation.
# TABLE OF CONTENTS

<table>
<thead>
<tr>
<th>Section</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>ACKNOWLEDGMENTS</td>
<td>11</td>
</tr>
<tr>
<td>LIST OF TABLES</td>
<td>v</td>
</tr>
<tr>
<td>LIST OF FIGURES</td>
<td>vi</td>
</tr>
<tr>
<td>NOMENCLATURE</td>
<td>vii</td>
</tr>
<tr>
<td>I.  INTRODUCTION</td>
<td>1</td>
</tr>
<tr>
<td>II. GENERALIZED PARITY CHECKING</td>
<td>7</td>
</tr>
<tr>
<td>Introduction</td>
<td>7</td>
</tr>
<tr>
<td>Congruences</td>
<td>8</td>
</tr>
<tr>
<td>Generalization of Parity</td>
<td>9</td>
</tr>
<tr>
<td>Arithmetic Invariance of the Digital Parity Check</td>
<td>12</td>
</tr>
<tr>
<td>Examples of Arithmetic Checking (m=b=10)</td>
<td>18</td>
</tr>
<tr>
<td>Numerical Checking, Mod m(\cdot)b, of Arithmetic Operations</td>
<td>20</td>
</tr>
<tr>
<td>The Diminished Base Numerical Check</td>
<td>21</td>
</tr>
<tr>
<td>The Augmented Base Numerical Check</td>
<td>23</td>
</tr>
<tr>
<td>Examples of the Augmented Base Check</td>
<td>25</td>
</tr>
<tr>
<td>Conclusion</td>
<td>26</td>
</tr>
<tr>
<td>III. VECTOR SPACES AND BINARY ADDITION</td>
<td>27</td>
</tr>
<tr>
<td>Introduction</td>
<td>27</td>
</tr>
<tr>
<td>The Elementary Concepts of Vector Spaces</td>
<td>28</td>
</tr>
<tr>
<td>Vector Representation of Binary Numbers</td>
<td>34</td>
</tr>
<tr>
<td>Vector Length and Parity</td>
<td>37</td>
</tr>
<tr>
<td>The Vector Space of Binary Addition</td>
<td>39</td>
</tr>
<tr>
<td>The Carry Vector</td>
<td>42</td>
</tr>
<tr>
<td>Methods of Generation of the Carry Vector</td>
<td>46</td>
</tr>
<tr>
<td>Synthesis by Vector Space Operations</td>
<td>54</td>
</tr>
<tr>
<td>Numerical Relationships</td>
<td>57</td>
</tr>
<tr>
<td>Addition Algorithms</td>
<td>60</td>
</tr>
<tr>
<td>Constraints and Some Properties of the Addition Algorithms</td>
<td>62</td>
</tr>
<tr>
<td>Evaluation of the kp and the Vp Logic</td>
<td>69</td>
</tr>
<tr>
<td>IV.  THE ERROR STRUCTURE OF BINARY ADDITION AND THE EFFECTIVENESS OF THE PARITY CHECK</td>
<td>73</td>
</tr>
<tr>
<td>Introduction</td>
<td>73</td>
</tr>
<tr>
<td>The Error Structure of the Carry Logic of Standard Addition</td>
<td>75</td>
</tr>
</tbody>
</table>

iii
# TABLE OF CONTENTS (CONT'D)

<table>
<thead>
<tr>
<th>Section</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>The Nature of the Propagated Error</td>
<td>82</td>
</tr>
<tr>
<td>The Average Length of Carry Propagation</td>
<td>86</td>
</tr>
<tr>
<td>The Error Structure of the k, p and V Generating Logic for Standard Addition</td>
<td>89</td>
</tr>
<tr>
<td>The Error Structure of the rc Sum Logic</td>
<td>96</td>
</tr>
<tr>
<td>The Effectiveness of Parity Checking</td>
<td>98</td>
</tr>
<tr>
<td><strong>V. THE RESIDUE CODE</strong></td>
<td>105</td>
</tr>
<tr>
<td>An Extension of Numerical Checking</td>
<td>105</td>
</tr>
<tr>
<td>Residue Addition and Multiplication</td>
<td>110</td>
</tr>
<tr>
<td>Subtraction and the Representation of Negative Numbers</td>
<td>114</td>
</tr>
<tr>
<td>Conversion From a Residue Code to a Normal Number Representation</td>
<td>117</td>
</tr>
<tr>
<td>Greater Than or Less Than Relations for Residue Codes</td>
<td>120</td>
</tr>
<tr>
<td>Redundancy</td>
<td>125</td>
</tr>
<tr>
<td>Division</td>
<td>135</td>
</tr>
<tr>
<td>Conclusions</td>
<td>139</td>
</tr>
<tr>
<td><strong>VI. CONCLUSIONS AND RECOMMENDATIONS</strong></td>
<td>141</td>
</tr>
<tr>
<td><strong>APPENDIX</strong></td>
<td>148</td>
</tr>
<tr>
<td>Sign and Overflow Convention for One's and Two's Complement Arithmetic</td>
<td>148</td>
</tr>
<tr>
<td><strong>BIBLIOGRAPHY</strong></td>
<td>153</td>
</tr>
</tbody>
</table>
LIST OF TABLES

<table>
<thead>
<tr>
<th>Table</th>
<th>Description</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>I</td>
<td>Relation Between $a_j$, $b_j$, and $v_j$, $k_j$</td>
<td>57</td>
</tr>
<tr>
<td>II</td>
<td>Frequency of Occurrence of Different Types of Carry Logic Malfunctions</td>
<td>79</td>
</tr>
<tr>
<td>III</td>
<td>Summary of Errors</td>
<td>99</td>
</tr>
<tr>
<td>IV</td>
<td>Error Type</td>
<td>100</td>
</tr>
<tr>
<td>V</td>
<td>Error Distribution (Normalized)</td>
<td>100</td>
</tr>
<tr>
<td>VI</td>
<td>Redundancy of a Non-Relatively Primed Base Representation</td>
<td>106</td>
</tr>
<tr>
<td>VII</td>
<td>Natural Numbers and Corresponding Residue Numbers</td>
<td>108</td>
</tr>
<tr>
<td>VIII</td>
<td>Number of States and Digits Associated with a Residue Representation</td>
<td>109</td>
</tr>
<tr>
<td>IX</td>
<td>Mod 2 and Mod 3 Sums and Products</td>
<td>111</td>
</tr>
<tr>
<td>X</td>
<td>Residue Code Efficiency</td>
<td>133</td>
</tr>
<tr>
<td>Number</td>
<td>Description</td>
<td>Page</td>
</tr>
<tr>
<td>--------</td>
<td>-----------------------------------------------------------------------------</td>
<td>------</td>
</tr>
<tr>
<td>1</td>
<td>Venn Diagram.</td>
<td>32</td>
</tr>
<tr>
<td>2</td>
<td>Carry Vector.</td>
<td>43</td>
</tr>
<tr>
<td>3</td>
<td>Logic for Independent, Sequential Generation of One Carry Element.</td>
<td>49</td>
</tr>
<tr>
<td>4</td>
<td>Standard Carry Logic.</td>
<td>53</td>
</tr>
<tr>
<td>5</td>
<td>Sum Dependent Carry Logic.</td>
<td>56</td>
</tr>
<tr>
<td>6</td>
<td>Half Adder.</td>
<td>70</td>
</tr>
<tr>
<td>7</td>
<td>Realization of the kp Addition Algorithm.</td>
<td>70</td>
</tr>
<tr>
<td>8</td>
<td>Carry Logic for the Standard Addition Algorithm.</td>
<td>71</td>
</tr>
<tr>
<td>9</td>
<td>Carry Logic for the vp Addition Algorithm.</td>
<td>71</td>
</tr>
<tr>
<td>10</td>
<td>Adder Logic.</td>
<td>74</td>
</tr>
<tr>
<td>11</td>
<td>Probability that c₁ = 1.</td>
<td>78</td>
</tr>
<tr>
<td>12</td>
<td>Carry Logic.</td>
<td>80</td>
</tr>
<tr>
<td>13</td>
<td>Flow Diagram for the Average Length of Carry Propagation.</td>
<td>88</td>
</tr>
<tr>
<td>14</td>
<td>Logic for the Determination of the Greater or Less Than Relationship.</td>
<td>124</td>
</tr>
<tr>
<td>15</td>
<td>Relative Efficiency of the Residue Code.</td>
<td>134</td>
</tr>
</tbody>
</table>
NOMENCLATURE

[A] - the matrix A

[A^{-1}] - the inverse matrix of A

[A^t] - the transpose matrix of A

D(x) - the dimension of x

d_{10} - diode number one, open

d_{1S} - diode number one, short

eq - is equivalent to

[G] - the metric matrix

\varepsilon_{ij} - an element of the metric matrix

k - the result of X \otimes Y

k_j - an element of k

p(\alpha) - the probability of n one digits in a vector \alpha

p(\overline{r}_j) - the probability that r_j, an element of the vector is zero

p\left(\frac{c_{j+1}}{r_{j+1}}\right) - the probability that c_{j+1}, an element of the carry vector, is unity when r_j is constrained to a value of unity

p\left(\frac{\triangle c_{j+1}}{r_{j+1}}\right) - the probability that c_{j+1}, an element of the carry vector is changed due to a constraint r_{j+1}

p(\Delta^* c_j) - the probability that the last element altered by a propagated error is c_j

\overline{p}\left(\frac{k_1}{r_1}\right) - the probability that k_1 = 0 if r_1 = 0

r_j - an element of \rho

s_j - an element of \sigma
NOMENCLATURE (CONT'D)

- mean (statistical)
- the result of $X \lor Y$
- the connective for inclusive disjunction
- an element of $V$
- variance (statistical)
- diminished radix complement, also the bar is used to indicate logical negation
- radix complement or additive inverse
- zero or null vector
- Kronecker delta
- "an element of"
- the result of $X \circ Y$
- the sum vector
- congruence relation $A \equiv a \mod b$ reads $A$ is congruent to $a$ modulo $b$
- modulo addition sign, also the symbol is used to indicate the exclusive disjunction which is addition modulo 2
- modulo multiplication or logic conjunction which is multiplication modulo 2
- is replaced by
- is in one to one correspondence with
- material implication connective; $A \Rightarrow B$ is read if $A$ then $B$
- and gate
- or gate
- exclusive or gate
- negation element
CHAPTER I

INTRODUCTION

The modern electronic digital computing machine owes its origin to the concepts presented by Charles Babbage in 1835.\(^1\) However, it was not until 1946 that technology reached a sufficiently advanced state to permit the development of a large scale digital electronic computer. The ENIAC,\(^2\) completed in 1946, consisted, in part, of 18,000 vacuum tubes. Successful computation was performed on the ENIAC during an era in which some difficulty was encountered in obtaining consistent operation of radar and communication circuitry consisting of two hundred or so vacuum tubes.

The state of the art in regard to computer reliability has made tremendous advances in recent years. The efforts to obtain more reliable computing systems have followed three general lines of attack.\(^1\) The past decade has witnessed the introduction of new devices which offer increased component reliability. Furthermore, the operational components of a computer such as the flip-flops have been subjected to a thorough engineering study. The results of such detailed engineering studies have led to optimal design techniques in terms of known tolerance levels.\(^2\) A second basic approach to the problem of increased reliability is through the use of redundant components. The work in this field ranges from consideration of increased reliability obtained by duplication up to n-tuplication considered by Von Neumann.\(^3\) The efforts in this area are divided roughly into two general classes, those which employ circuit redundancy with no error.
detection capability and those which use redundant circuitry employing error detection schemes. The error detecting schemes frequently utilize a majority element which must function perfectly. The majority element determines the correct output on the basis of the majority output of the redundant circuits. It has been shown that the redundancy is most effective if applied at the lowest level. (3) The third approach to increased computer reliability is the utilization of redundant codes. The subject of redundant information coding has received considerable attention in recent years.\(^{(4-9)}\) However, most of the effort in this field has been directed toward error detecting and correcting schemes pertinent to communication systems. Error detection and correction for arithmetic operations in a digital computer require a code which is arithmetically invariant. This is not a characteristic of most of the error correcting and detecting codes developed for communication applications. One exception is the Hamming code.\(^{(5)}\) Error correction by means of the Hamming code for a general purpose computer of the Princeton class has been studied by Robertson.\(^{(10)}\)

Another class of arithmetically invariant codes, the residue code, was used in the RAYDAC.\(^*\) The checking procedure used in this machine has been described by Block,\(^{(11,12)}\) but effectiveness of the check is not considered in this paper. The material of this thesis provides further information on the utility of the residue code.

\(^*\) (RAYDAC) Raytheon Digital Automatic Computer.
There have been many computers which have incorporated a simple parity checking system. Without exception the parity check has been used only to determine the validity of the computer storage mechanism and no attempts have been made to utilize the arithmetic invariant properties of the digital parity check. In fact, it appears that the true nature of the digital parity check has not been clearly understood. This is largely due to a cumbersome definition involving the oddness or evenness of the number of one digits of the representation.

The main problem attacked in this thesis is concerned with the effectiveness of parity checking procedures in the detection of errors in the arithmetic operations of a binary computer. In the course of the investigation certain topics pertinent to the process of binary arithmetic were discovered and this material is also included. Particular emphasis is given to an extension of the numerical parity check which leads to an arithmetic system without carry.

In Chapter II two types of parity checks are defined. The checks are the digital check which is a function of the digit symbols and the numerical check which may be a function of both the digit symbols and the positional weights. In this chapter congruence notation is introduced and the arithmetic invariance of the digital check is proved for the operation of addition. The results are then extended to the other arithmetic operations. The class of errors detectable by either type of check is discussed and examples are presented. Also in Chapter II congruences are used to generalize the concept of the digital parity check to include non-base-two number systems.
It is necessary to determine the error structure of the binary addition process before the effectiveness of the parity checking schemes can be evaluated. The binary addition process is complicated by the fact that the sum digits are not independent. The lack of independence is due to the propagated carry which is a feature of all weighted number systems. In Chapter III the structure of binary addition is investigated with particular emphasis given to the carry process. The mathematics of vector spaces was found to be particularly appropriate to the study of this problem. The general concept and properties of vector spaces are introduced and the mechanisms of binary addition are defined in terms of vector space concepts. Considerable use is made of matrix notation. The carry process is completely defined by a particular matrix. A study of the carry matrix is made and various circuits yielding different degrees of carry element independence are presented. Specific configurations are considered and it is shown that in general the price that must be paid in terms of time and equipment to obtain independence of the carry digits is too high. All of the different carry schemes which have been used in binary arithmetic units are derivable from the carry matrix presented in this chapter. The use of vector space concepts in connection with binary number representations sheds new light on certain problems. In particular, the conversion from one number code to another may be regarded simply as a change of basis. The length of a vector in an n dimensional space which corresponds to an n digit binary number is shown to be equivalent to the magnitude of the digital parity check.
The concepts of dimensionality of vector spaces are used extensively to define possible checking procedures and also to provide alternative definitions of the addition algorithm. In particular, it is shown that identical sums are obtained from: (1) the binary addition of two operands (2) the binary addition of the conjunction of the two operands and the inclusive disjunction of the two operands. (3) the binary addition of the exclusive disjunction of the two operands and two times the conjunction of the two operands (4) the binary subtraction of the exclusive disjunction of the two operands from two times the inclusive disjunction of the two operands. It is shown that the carry process associated with the last two definitions has different characteristics than the carry process associated with the standard addition algorithm, but performance-wise the new algorithms offer no advantages not associated with the standard algorithms.

In Chapter IV the results of Chapter III are applied to the detailed error structure of the binary addition process. The error structure is considered for the standard addition algorithm. The effects of logical malfunctions of the components which comprise the arithmetic circuitry are considered and classified. The probability of success of specific digital and numerical parity checks is determined in respect to the different classes of errors due to circuit malfunctions. It is found that the numerical check is 100% effective for single addition operations.

The concept of the numerical check is extended in Chapter V. If a sufficient number of numerical parity checks are employed, it is
possible to obtain a representation consisting of the numerical parity check digits which may be substituted for the original representation. In fact, the use of an excess number of parity check digits leads to an alternative representation containing redundancy. A number representation so derived is termed a residue code. The arithmetic properties and the utilization of redundant information are considered.

In Chapter VI conclusions and recommendations are presented. Appendix I is a description of a binary arithmetic sign and overflow convention pertinent to the material of Chapter IV.
CHAPTER II

GENERALIZED PARITY CHECKING

Introduction

The concept of the parity check(s) is widely known and has frequently been used to detect and in some cases to correct errors in digital systems. The parity check is obtained by associating with each representation of information an auxiliary digit or group of digits known as the parity check digit(s). The proper choice of the relationship between the check digits and the information digits permits the establishment of mathematical operations, which may indicate erroneous digits or provide correction data. Parity checking is usually defined for the binary system in terms of the odd or evenness of the number of ONE digits in a specified block of binary digits. If the number of ONE digits is even the parity check digit is a zero. If the number of ONE digits is odd the parity check digit is a one. This definition is sufficient for most practical applications of the parity concept. However, the definition gives no insight into the fundamental principles of parity. A restatement of parity in terms of linear congruences provides a mathematical basis for the parity concept and permits a generalization which includes number systems of radix \( r \neq 2 \). Congruence notation also provides the mathematical tool required to prove the parity check invariant to addition for any consistently weighted number system if the carries are processed correctly.
Congruences

The remainder of this paper will make considerable use of congruence notation. A short presentation of pertinent concepts and relationships is included at this point. Additional material on the subject may be found in any standard text on Number Theory.∗

The congruence

\[ A \equiv \alpha \pmod{b} \]

is read "\( A \) is congruent to \( \alpha \) Modulo \( b \)." The congruence relationship states that the equation

\[ A = \alpha + bt \]

is valid for some value of \( t \), where \( A, \alpha, b \) and \( t \) are integers. \( \alpha \) is called the residue and \( b \) the base or modulus of the number \( A \).

As examples of congruences consider:

\[ 10 \equiv 7 \pmod{3} \]
\[ 10 \equiv 4 \pmod{3} \]
\[ 10 \equiv 1 \pmod{3} \]

In these examples the integers 7, 4 and 1 form a residue class of 10 Mod 3. Of particular importance is the least positive residue of the class which in this example is one. The least positive residue is that residue for which \( 0 \leq \alpha \leq b \).**

∗ Hardy and Wright\((13)\) gives an excellent presentation on congruences.

** The equality sign may exist on only one side of the inequality.
Unless otherwise specified the term residue as used in the remainder of this paper will mean the least positive residue.

Congruences for the most part may be treated in the same manner as the equality sign in ordinary arithmetic. The main exception pertains to division. The following properties of congruences will be used in this paper and are presented below without proof.

1. Congruences to the same modulus may be added and the result is a valid congruence.

\[ \sum_{i=1}^{n} A_i \equiv (\sum_{i=1}^{n} \alpha_i) \mod b \]

2. Congruences to the same modulus may be multiplied and the result is a valid congruence.

\[ \prod_{i=1}^{n} A_i \equiv (\prod_{i=1}^{n} \alpha_i) \mod b \]

3. Congruences are transitive. If \( A \equiv B \) and \( B \equiv C \) then \( A \equiv C \).

4. The following congruence relationships provide a basis for numerical checking procedures

   (a) \( \sigma b^n \equiv \sigma \mod (b-1) \)

   (b) \( \sigma b^n \equiv +\sigma \mod (b+1) \) n even

\[ \equiv -\sigma \mod (b+1) \) n odd

**Generalization of Parity**

A parity check need not necessarily include every digit of the number. In the general case multiple parity checks may be employed,
each check concerning only specific digits. However, for purposes of
simplification, only the case of a single parity check over all digits
will be considered here. The parity of a number from a consistently
weighted number system is now defined. Consider the number

\[ \sigma_n \sigma_{n-1} \ldots \sigma_2 \sigma_1 \]

This is the usual shorthand notation for the polynomial representation
of a number \( N \) in a consistently weighted number system, base \( b \):

\[ N = \sigma_n b^n - \sigma_{n-1} b^{n-2} + \ldots + \sigma_2 b^1 + \sigma_1 b^0 \]

The parity check digit \( p \) associated with number \( N \) is defined by:

\[ F(N) \equiv p \mod m \]

where \( p \) is the least positive residue.

In this paper two different functions of \( N \) will be considered.
The first function is

\[ F(N) = N \]

and the parity check digit \( p \) is given by

\[ N \equiv p \mod m \quad m \neq b \]

This type of check is based directly on the numerical properties of
linear congruences and the check digit is a function of the magnitude
of the number \( N \). This type of check is called a numerical parity check.

If the check base \( m \) is identical to the number base \( b \) then

\[ N \equiv \sigma_1 \mod b \]

The check is unsatisfactory because it is a function of only the low
order digit of the number representation. This is due to the fact that

\[ \sigma_1 b^n \equiv 0 \mod b \quad n > 0 \]
A check for the case \( m = b \) is obtained if
\[
F(N) = σ_n + σ_{n-1} + ... + σ_2 + σ_1
\]
The parity check digit, \( p \), for this type of check which is termed digital checking is defined by the relation
\[
\sum_{i=1}^{n} σ_i \equiv p \mod b
\]
An equivalent definition may be given in terms of ring addition. Ring addition modulo \( b \) is specified by the symbol \( ⊕ \). Ring addition is simply normal addition without carry, for example
\[
(b-1) ⊕ 1 = 0
\]
In terms of ring addition modulo \( b \) the parity digit \( p \) is given as
\[
p = σ_n ⊕ σ_{n-1} ⊕ ... ⊕ σ_2 ⊕ σ_1
\]
If the number system is binary, and the check base is two then the previous definition for the digital check provides a parity digit which makes the total number of one digits of the number plus check representation even in correspondence with the usual parity procedures.
The definition is easily extended to an odd parity check. For this case the parity check digit is given by \( \bar{p} \), the diminished radix complement of \( p \) defined as
\[
\bar{p} ⊕ p = b - 1
\]
As an example of the parity definition for \( m = b \), consider the following:

Given 12894, a number in base 10
then \((1 + 2 + 8 + 9 + 4) \equiv p \mod 10\)
or \( p = 1 ⊕ 2 ⊕ 8 ⊕ 9 ⊕ 4 \equiv 4 \)
Both types of parity checks are restricted in application to the detection of certain classes of errors in the number representation. Error correction is not possible unless multiple parity checks are employed. The digital parity check, \( m = b \), detects all alterations except the alterations in which the sum of the digit perturbations is congruent to 0 modulo \( b \). Let the decimal number of the previous example be corrupted by 5 in the first column and by -5 in the third column. The result is an erroneous representation but the error is obviously undetectable.

\[
1 \times 2 \times 3 \times 9 \times 9 \times 0 = p = 4
\]

A numerical parity check \( \text{Mod} \ m, \ m \neq b \), is insensitive to an error if the magnitude of the error is congruent to zero \( \text{Mod} \ m \). Thus the error is detectable only if the check base \( m \) is not a divisor of the magnitude of the error. The magnitude of the error in the previous example is 495. The error is not detected by \( \text{Mod} 3, 5, 9 \) or \( 11 \) checks.

Parity checks are particularly effective in detecting errors for which only one digit has been modified. For this class of errors a parity check \( m = b \) is always effective. If \( m \) is less than \( b \) then perturbations congruent to zero modulo \( m \) are undetectable.

**Arithmetic Invariance of the Digital Parity Check**

In the arithmetic system employed in most digital machines a one-to-one correspondence is established between the machine digit representation and a set of positive and negative fractions. The rules of machine arithmetic operations are set up in such a way as to preserve
this one-to one correspondence under the operation of addition. Addition is the fundamental arithmetic operation of the machine and the other arithmetic operations, subtraction, multiplication and division are defined in terms of addition and the auxiliary operations, shift and complement.

It is obvious that the digital parity check is invariant to the shift operation if no digits are deleted or added to the representation. This is true because the digital parity check is a check function of the modulo sum of the individual digit rather than a function of magnitude. Also, the parity check is independent of the position of the radix point. For this reason the remaining discussion will not explicitly consider whether the representations are in correspondence with fractions or integers.

Consider now the addition of two consistently weighted numbers in base be representation. The check base is, of course, also equal to b.

\[ A + D = S \]

or

\[
\begin{array}{c}
\scriptstyle a_n \ldots a_1 \\
+ \scriptstyle d_n \ldots d_1 \\
\hline
\scriptstyle c_{n+1} s_n \ldots s_1
\end{array}
\]

now

\[ s_i = a_i \oplus d_i \oplus c_i \]

and

\[ (a_i + d_i + c_i) = s_i + c_{i+1} b \]

\[ c_1 = 0 \]

\[ c_l = 0 \text{ or } c_l = 1 \text{ if } i \neq l \]
The parity check digit \( p \) for the sum is given as

\[ c_{n+1} \oplus s_n \oplus s_{n-1} \oplus \ldots \oplus s_1 = p_S \]

or

\[ c_{n+1} \oplus a_n \oplus d_n \oplus c_n \oplus \ldots \oplus a_1 \oplus d_1 = p_S \]

but

\[ a_n \oplus \ldots \oplus a_1 = p_A \]

and

\[ d_n \oplus \ldots \oplus d_1 = p_D \]

therefore

\[ c_{n+1} \oplus \ldots \oplus c_2 \oplus p_A \oplus p_D = p_S \]

The last expression provides a means of checking the addition operation in terms of the parity check digits of the sum, the addend, and the augend and the generated carries. Alternatively the parity check may be in terms \( p' \), the radix complement or additive inverse of \( p \) defined as

\[ p' \oplus p = 0 \]

If \( p' \) is used as the check digit rather than \( p \) the check procedure for addition is given as

\[ c_n \oplus \ldots \oplus c_1 \oplus p'_S = p'_A \oplus p'_D \]

The class of addition errors detected by the digital parity check is the same as the class of representation errors defined in the previous section. However, it must be remembered that the logic of the usual addition circuits is such that the various carry digits are not independent. The parity checking procedure is valid only if the carry digits are correct.
Consider the decimal addition of the following numbers:

\[
\begin{align*}
\begin{array}{c}
6 & 6 & 9 & 4 & 1 & 6 \\
\hline
3 & 2 & 4 & 5 & 1 & 5 \\
\hline
9 & 9 & 3 & 9 & 2 & 2
\end{array}
\end{align*}
\]

Due to the occurrence of one carry, the check of addition is

\[6 \oplus 5 \oplus 1 = 2\]
\[2 = 2\]

The check of the complement operation is straightforward.

The diminished radix complement of a representation \(d\) is obtained by substituting for each digit \(\sigma_i\) the diminished radix complement, \(\bar{\sigma_i}\).

\[\sigma_i \oplus \bar{\sigma_i} = b-1\]

given:

\[\sigma_n \oplus \ldots \oplus \sigma_1 = p_s\]

then

\[\sigma_n \oplus \bar{\sigma_n} \oplus \ldots \oplus \bar{\sigma_1} = p_s \oplus \bar{\sigma_n} \oplus \ldots \oplus \bar{\sigma_1}\]

let

\[\bar{\sigma_n} \oplus \ldots \oplus \bar{\sigma_1} \oplus p_s = p_s \oplus p_s = f\]

then

\[f \equiv n(b-1) \mod b\]
\[(f+n) \equiv nb \mod b\]
\[(f+n) \equiv 0 \mod b\]

then

\[p_s + p_s + n \equiv 0 \mod b\]

or

\[p_s \oplus p_s \oplus n \mod b = 0\]

A check of the diminished radix complement consists of the modulo \(b\) sum of the parity check of the normal representation, the parity check of the complemented representation and the number of digits comprising the representation. In the special case where \(b = 2\), the
check of the complemented representation is equal to the check of the normal representation if \( n \) is even and opposite if \( n \) is odd.

A check of the radix complement operation is somewhat more complex than that required for the diminished radix complement. The complications are due to the definition of the radix complement rather than to the nature of the parity check procedure. Two checking procedures are possible. The first is based on the fact that the radix complement \( s' \) is obtained from the diminished radix complement \( s \) by the addition of one to the lowest order digit. This method requires cognizance of the carries produced when the one is added to the lowest order digit. The check is given by the relationship

\[
p_{s'} = 1 \oplus p_s \oplus \sum_{i=2}^{n+1} c_i \text{ Mod } b
\]

The second procedure for obtaining the radix complement check is based upon the well-known rule for obtaining the digitwise radix complement: "Starting with the low order digit, radix complement the first non-zero digit; the remaining higher order digits are changed by diminished radix complementation." This method requires a knowledge of the number of zeros preceding the first non-zero digit. Let this number be \( q \).

Given \( \sigma_n \oplus \dots \oplus \sigma_1 = p_s \)

then

\[
\sigma_n \oplus \bar{\sigma}_n \oplus \dots \oplus \sigma_{q+1} \oplus \sigma_{q+1}' \oplus \sigma_q \oplus \dots = \\
p_s \oplus \sigma_n \oplus \dots \oplus \sigma_{q+1}'
\]

\( i = 0 \) if \( i = q \)
\[ p_s' = \bar{\sigma}_n \otimes \ldots \otimes \bar{\sigma}_{q+2} \otimes \sigma_{q+1} \]

\[ p_s + p_s' \equiv [(n-q-1)(b-1) + b] \mod b \]

\[ p_s + p_s' + (n - q) \equiv 1 \mod b \]

or

\[ p_s \otimes p_s' \otimes (n - q) \mod b = 1 \]

Multiplication consists of repetitive addition with appropriate shift operations. The result is a double-length product. The digital parity of the double-length product is found from consideration of the addition operations since parity is not affected by the shift operation. The linear congruence is multiplicative. The check procedure for the product \( P = A \times D \) is

\[ p_P = (p_A \otimes p_D) \otimes \sum c_1 \mod b \]

or

\[ (p_A \times p_D) + \sum c_1 \equiv p_P \mod b \]

The symbol \( \otimes \) indicates multiplication, modulo \( b \). It is observed that the digital parity check provides no protection against faulty shift operations. Parity protection against this type of error is obtained if multiple checks are employed.

The division algorithm commonly employed is easily explained in terms of the remainder theorem.

Consider

\[ \frac{X}{Y} = Q + \frac{R}{Y} \]

or

\[ X = QY + R \]

\[ R = X - QY \]
where \[ Q = q_n b^{n-1} + \ldots q_j b^{j-1} \]
then \[ R = X - (q_n b^{n-1} + \ldots q_j b^{j-1}) Y \]

The subtraction of \( Y \) from \( X \), \( q_i \) times is accomplished by the addition of \( \overline{Y} \), \( q_i \) times. Carries which are generated must be considered. The parity check expression is given as

\[ P_R = P_X \oplus (P_Q \oplus P_{\overline{Y}}) \oplus \sum c_i \mod b \]

This check may be employed for each division operation or only as a check of the overall division process. In any case the check is insensitive to shift errors.

**Examples of Arithmetic Checking (m = b = 10)**

**Addition**

\[
\begin{array}{c}
3 & 4 & 6 \\
+ & 4 & 5 & 8 \\
\hline
8 & 0 & 4
\end{array}
\]

○ the circled digits are the check digits

**Check**

\[ P_A \oplus P_D \oplus \sum c_i \mod b = P_S \]

\[ 3 \oplus 7 \oplus 2 = 2 \]

**Complement**

nines complement

\[
\begin{align*}
A &= 0345 \quad \circ \\
\overline{A} &= 9654 \quad \circ
\end{align*}
\]

**Check**

\[ P_{\overline{A}} \oplus P_A \oplus n \mod b = 0 \]

\[ 4 \oplus 2 \oplus 4 = 0 \]
ten's complement

A = 034500 ②
A' = 965500 ③

Check

P_A' Θ P_A Θ (n-q) Mod b = 1
5 Θ 2 Θ 4 = 1

Product

\[
\begin{array}{c}
346 \\
213 \\
\hline
346 \\
346 \\
\hline
692 \\
346 \\
\hline
1038 \\
\hline
346 \\
\hline
4498 \\
346 \\
\hline
39098 \\
\hline
73698 ③
\end{array}
\]

5 carries generated

Check

P_p = (P_A Θ P_D) Θ \sum c_i Mod b
3 = (3 Θ 6) Θ 5
3 = 8 Θ 5 = 3

Division

Q = \frac{073699}{0346} ④ = \frac{X}{Y}

the ten's complement of 0346 ② is 9645 ④
### Numerical Checking, Mod m (m ≠ b), of Arithmetic Operations

The modulus of the numerical parity check is not the same magnitude as the radix of the number representation. Congruence relationships are valid for any modulus. However, only certain moduli yield procedures which permit straightforward computation of the parity digits. There exist in particular two interesting and useful checks based on procedures congruent modulo (b-1) and congruent modulo (b+1). These checks will be called the diminished base numerical check and the augmented base numerical check respectively. The simplest checks of the numerical class depend directly on certain congruence relationships. Numerical checking has one very desirable feature not associated with digital checking procedures. Numerical checking procedures do not require a tally or cognizance of the carries generated by the addition process.

<table>
<thead>
<tr>
<th>Carry</th>
<th>Calculation</th>
<th>Result</th>
</tr>
</thead>
<tbody>
<tr>
<td>073699</td>
<td>2</td>
<td>0347</td>
</tr>
<tr>
<td>9654</td>
<td></td>
<td>9654</td>
</tr>
<tr>
<td>039099</td>
<td>2</td>
<td>0001</td>
</tr>
<tr>
<td>9654</td>
<td></td>
<td>9654</td>
</tr>
<tr>
<td>004499</td>
<td>3</td>
<td>9655</td>
</tr>
<tr>
<td>9654</td>
<td></td>
<td>001</td>
</tr>
<tr>
<td>04499</td>
<td>3</td>
<td></td>
</tr>
<tr>
<td>9654</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Q = 213</td>
<td>6</td>
<td></td>
</tr>
<tr>
<td>R = 1</td>
<td></td>
<td></td>
</tr>
<tr>
<td>∑c₁ = 13</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Check</td>
<td>P₀ = P₀ (P₀ ⊕ P₁) ⊕ ∑c₁ Mod b</td>
<td></td>
</tr>
<tr>
<td>l = 4 ⊕ (6 ⊕ 4) ⊕ 3</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
**The Diminished Base Numerical Check**

The diminished base numerical check is dependent on the following properties of linear congruences:

$$\sigma_{ib} \equiv \sigma_{i} \mod (b-1)$$

This relationship permits parity calculation independent of the digit position or weight. Consider the sum

$$X + Y = S$$

$$X = x_n b^{n-1} + \ldots + x_1 b^0$$

$$Y = y_n b^{n-1} + \ldots + y_1 b^0$$

$$S = c_{n+1} b^n + s_n b^{n-1} + \ldots + s_1 b^0$$

Since congruences are additive we have

$$X = \sum_{i=1}^{n} x_i b^{i-1} \equiv \sum_{i=1}^{n} x_i \mod (b-1)$$

The parity check digit \( p_X \) associated with the representation \( X \) is defined by the term \( \sum x_i \) reduced to the least positive residue of the class. The parity check digit \( p_X \) may also be defined by

$$p_X = x_n \oplus \ldots \oplus x_1$$

for either definition

$$X = \sum_{i=1}^{n} x_i b^{i-1} \equiv p_X \mod (b-1)$$

The check digit \( p_Y \) is given as

$$Y = \sum_{i=1}^{n} y_i b^{i-1} \equiv p_Y \mod (b-1)$$

Linear congruences are additive, therefore:

$$X + Y = \sum_{i=1}^{n} (x_i + y_i) b^{i-1} \equiv (p_X + p_Y) \mod (b-1)$$

and

$$p_S = p_X \oplus p_Y$$
The modulo \((b-1)\) check is numerical and hence does not require
cognizance of the generated carries. The check is, nevertheless, sensi-
tive to errors in carry generation and propagation. The check detects
all digit alterations for which the modulo \((b-1)\) sum of the changes
does not equal zero. This is equivalent to the statement that an
alteration is detectable only if \(b-1\) is not a factor of the magnitude
of the arithmetic error. A single carry failure causes an alteration
of minus one and is therefore detectable. However, the error is not
correctable on the basis of the parity information.

A check of the complement operation is obtained from the
basic definition of the complemented representation. The definitions
which follow are for representations of \(n + m + 1\) digits. The repre-
sentation has \(n\) digits to the left of the radix point, \(m\) digits to the
right of the radix point and one sign digit. The diminished radix
complement of \(X\) is \(\bar{X}\) where

\[ X + \bar{X} = b^{n+1} - b^{-m} \]

The radix complement \(X'\) is given as

\[ X + X' = b^{n+1} \]

The usual congruence relationships are defined for integer values. It
is, therefore, convenient to consider the radix point on the extreme
right of the representation. The check is obtained by observing that

\[ b^m(b^{n+1} - b^{-m}) \equiv 0 \mod (b-1) \]

and

\[ b^m b^{n+1} \equiv 1 \mod (b-1) \]

Then the check for the diminished radix complement is

\[ p_X \oplus p_{\bar{X}} = 0 \]
and the check for the radix complement is

\[ P_X \ominus P_X' = 1 \]

Congruences are multiplicative. Therefore, a check of the product

\[ P = R \times S \]

is given simply as

\[ P_P = P_R \ominus P_S \]

with no cognizance of carries required.

A check involving the division of congruences is not practical since the correct division procedure is dependent upon whether the divisor and \((b-1)\) have any common factors. However, this is of little consequence since machine division is usually obtained by a process consisting of repetitive add and shift operations.

\[
\begin{align*}
\frac{X}{Y} &= Q + \frac{R}{Y} \\
X &= QY + R
\end{align*}
\]

The check of the division process is independent of carries and must be true at every step in the division process. The check is

\[ P_X = (P_Q \ominus P_Y) \ominus P_R \]

The Augmented Base Numerical Check

The diminished base numerical check avoids the need for carry cognizance and for that reason offers a simpler check procedure than is obtainable by means of digital checking. However, for the binary number system the diminished base check is meaningless. One solution is the
augmented base check. This system is also carry independent and
parity check digits are relatively easy to obtain. The augmented
base check is dependent on the following property of congruences.

\[ \sigma_1 b^j \equiv + \sigma_1 \pmod{(b+1)} \text{ if } j \text{ is even} \]

\[ \equiv - \sigma_1 \pmod{(b+1)} \text{ if } j \text{ is odd} \]

The check digit for representation A is given as

\[ (\ldots + a_5 b^4 + a_4 b^3 + a_3 b^2 + a_2 b + a_1) \equiv p \pmod{(b+1)} \]

where

\[ A = \ldots + a_5 b^4 + a_4 b^3 + a_3 b^2 + a_2 b + a_1 b^0 \]

The checking procedures employed for the arithmetic processes
are identical to the procedures for the diminished radix except that
ring addition 0 and ring multiplication 0 are defined modulo (b+1).

In the binary case the number base is two and the check
base is three. The parity of a representation A is given as

\[ (\ldots + a_5 + 2a_4 + a_3 + 2a_2 + a_1) \equiv p \pmod{3} \]

or

\[ \ldots \ominus a_5 \ominus 2a_4 \ominus a_3 \ominus 2a_2 \ominus a_1 = p \]

Direct subtraction is avoided by using the radix complement of one
with respect to the base three.

Note that for the binary case the augmented base check is
identical to the diminished base check, base four.
Examples of the Augmented Base Check

Let \( b = 2 \) then check is modulo 3

Addition \( X + Y = S \)

\[
\begin{array}{ccccccc}
0 & 0 & 0 & 1 & 1 & 1 & 0 \\
0 & 1 & 0 & 1 & 0 & 1 & 0 \\
0 & 1 & 1 & 1 & 0 & 0 & 1 \\
\end{array}
\]

Check

\[
P_X \oplus P_Y = P_S \\
10 \oplus 01 = 00
\]

Complement

\( A = 00011101 \)

\( \bar{A} = 11100010 \)

Check

\[
P_A \oplus P_{\bar{A}} = 0 \\
10 \oplus 01 = 00
\]

Diminished Radix Complement

\( A' = 11100011 \)

Check

\[
P_A \oplus P_{A'} = 1 \\
10 \oplus 10 = 00
\]

Radix Complement

\( A' = 11100011 \)

Check

\[
P_A \oplus P_{A'} = 1 \\
10 \oplus 10 = 00
\]

Multiplication \( R \times S = P \)

\[
\begin{array}{c}
1011 \\
\times \ 1001 \\
\end{array}
\]

Check

\[
P_R \oplus P_S = P_P \\
10 \oplus 00 = 00
\]

Division \( \frac{X}{Y} = Q + \frac{R}{Y} \)

\[
\begin{array}{c}
0 \ 110011 \\
1 \ 0101 \\
0 \ 000111 \\
\end{array}
\]

Check

\[
P_X = (P_Q \oplus P_Y) \oplus P_R \\
00 = (01 \oplus 10) \oplus 01 \\
= 10 \oplus 10 = 00
\]

The above is a check of the first step in the division process.
Conclusion

The congruence notation used in this paper provides a fundamental basis for the concept of both the digital and the numerical type of parity check. Congruence notation also provides the mathematical tools needed to consider the arithmetic properties of the parity checks and the extension of the parity check concept to non-binary systems.

In the final analysis the effectiveness of a parity check is dependent upon the error structure of the carry process. It is necessary to consider in detail the mechanism of binary addition before the effectiveness of the parity check may be evaluated. The process of binary addition is investigated in the next chapter and the effectiveness of the parity check is discussed in Chapter IV.
CHAPTER III

VECTOR SPACES AND BINARY ADDITION

Introduction

The concepts, representations and operations associated with vector spaces are particularly appropriate to the study of the process of binary addition. Vectors or n-tuples correspond to binary numbers. The vector addition of two vectors corresponds to the logical addition of two binary numbers. The length of a vector is shown to correspond to the digital definition (m=b) of parity. A study of the dimensionality properties of vectors facilitates the establishment of various numerical relationships pertinent to checking procedures. These and other relationships are discussed in the material which follows.

The application of the vector space concepts and the use of matrix notation represent a basic and unified approach to the subject of binary addition. The vector space approach to binary addition shows clearly that the basic problem of binary addition from the viewpoint of both error structure and speed of addition is the generation of the carry. The different logical configurations employed to generate the carry may be obtained from the matrix representation of the carry process. This particular approach also facilitates the study of the differences in the expected performance of the various known methods of carry generation.

The representation of binary numbers as vectors is not a new concept. Extensive use of vector space concepts has been made by
Muller\(^{(22)}\) in the problem of logical design and coding. Reed\(^{(8)}\) has used vector space concepts to discuss a particular code. More recently Campeau\(^{(21)}\) has made extensive use of Boolean matrices as a logical design tool. The earliest application of matrices to the problem of synthesis of different logic configurations is due to Lunts\(^{(19)}\). The study of binary addition by means of vector spaces presented in this chapter is new as are the applications of the concepts of dimensionality and length.

In the next section the basic properties of vector spaces are considered. The application of these concepts to the problem of binary addition is considered in the remaining sections.

**The Elementary Concepts of Vector Spaces**

Ordered n-tuples of scalar functions, \(a_1\) of a field \(F\), are termed vectors of \(n\) dimensions if the operations of scalar multiplication and vector addition are closed. A set of n-tuples in which vector addition and scalar multiplication are closed is called a vector space. The dimension of the space is equal to the number of elements of an n-tuple.

Given n-tuples \(\alpha\) and \(\beta\) and a scalar \(k\)

\[
\alpha = (a_1, a_2, a_3 \ldots a_n) \\
\beta = (b_1, b_2, b_3 \ldots b_n) \\
k \in F
\]

Scalar multiplication is defined in terms of the multiplication operation, \(\theta\), of the field.

\[
k \theta \alpha = (k \theta a_1, k \theta a_2, k \theta a_3, \ldots k \theta a_n)
\]
In order to secure compactness of notation, multiplication appropriate to the field under consideration will be assumed and the notation for scalar multiplication abbreviated to the form

\[ k \mathbf{\alpha} = (ka_1, ka_2, ka_3, \ldots ka_n) \]

Vector addition is defined in terms of the rules of addition of the field elements (scalars).

\[ \mathbf{\alpha} \oplus \mathbf{\beta} = (a_1 \oplus b_1, a_2 \oplus b_2, \ldots a_n \oplus b_n) \]

Algebraically the vectors and the scalars of a vector field are associative, commutative, and distributive with respect to the operations of vector addition and scalar multiplication. This follows from the definition of the operations and the fact that the field has the associative, commutative and distributive properties.

A vector of an \( n \) dimensional space is usually specified as a function of the basis of the vector space. The basis may be any set of \( n \) linearly independent vectors. A set of \( n \) vectors \( \mathbf{\alpha}_1, \ldots, \mathbf{\alpha}_n \), is linearly dependent if there exists in \( F \), \( n \) scalars \( k_1 \ldots k_n \), not all zero, such that:

\[ k_1 \mathbf{\alpha}_1 \oplus k_2 \mathbf{\alpha}_2 \oplus \ldots \oplus k_n \mathbf{\alpha}_n = 0 \]

or

\[ k_1 a_{11} \oplus k_2 a_{21} \oplus \ldots \oplus k_n a_{n1} = 0 \]
\[ k_1 a_{12} \oplus k_2 a_{22} \oplus \ldots \oplus k_n a_{n2} = 0 \]
\[ k_1 a_{13} \oplus k_2 a_{23} \oplus \ldots \oplus k_n a_{n3} = 0 \]
\[ \ldots \ldots \ldots \ldots \]
\[ k_1 a_{1n} \oplus k_2 a_{2n} \oplus \ldots \oplus k_n a_{nn} = 0 \]
Note, in particular, that a suitable basis requires only linearly independent base vectors. The base vectors need not be orthogonal.

If the inner product is defined, then the conditions for linear dependence may be written in matrix notation as

\[ [A] k = 0, \quad Z \text{ denotes the null vector} \]

The convention used in this paper for matrices is as follows: An \( m \times n \) matrix \( A \) is bracketed, \([A]\). A row or column matrix or vector is not bracketed. This convention eliminates needless brackets and leads to more readable equations. The expression \([A] k\) always represents a matrix product and should not be taken as a scalar product. In general vectors are represented by letters from the Greek alphabet. The inner product of two vectors, \( \alpha \) and \( \beta \), with respect to a basis \( \gamma \) is defined as

\[
= (a_1 \gamma_1 \oplus \ldots \oplus a_n \gamma_n) (b_1 \gamma_1 \oplus \ldots \oplus b_n \gamma_n) = \\
\sum_{i,j=1}^{n} a_i b_j (\gamma_i, \gamma_j) = \sum_{i,j=1}^{n} a_i b_j g_{ij}
\]

The result is a scalar function. If the basis is orthonormal then

\[ g_{ij} = \delta_{ij} \]

and

\[ a_1 b_1 \oplus \ldots \oplus a_n b_n \]

The inner product is associative, commutative and also distributive with respect to addition.

The inner product of a vector \( \alpha \) with itself in vector space spanned by an orthonormal basis defines the square of the vector length.
The inner product gives the sum of each orthogonal component squared.

\[ \langle \alpha | \alpha \rangle = a_1^* a_1 + a_2^* a_2 + \ldots + a_n^* a_n \]

The basis may be changed by a not necessarily orthogonal linear transformation.

\[ \alpha_1 = [A] \alpha \]

then

\[ \alpha = [A^{-1}]^t \alpha_1 \]

\[ \alpha^t \alpha = \alpha_1^t [A^{-1}]^t [A^{-1}] \alpha_1 \]

let

\[ G = [A^{-1}]^t [A^{-1}] \]

The square of the vector length defined by the inner product remains invariant if

\[ \langle \alpha | \alpha \rangle = \sum_{i,j=1}^{n} a_i^* a_j \xi_{ij} \quad \text{where } \xi_{ij} \in [G] \]

The length as such may not be obtainable since this requires the inclusion of the square root in the field.

The inner product may be used to determine the orthogonality of a pair of vectors. A vector pair is orthogonal if and only if the inner product vanishes. This follows from the definition of the inner product given as

\[ \langle \alpha | \beta \rangle = |\alpha| \cdot |\beta| \cos \theta \]

A theorem concerning the dimensionality of vector spaces which will be used later in this paper is now considered. The dimension of a vector space or subspace is equal the minimum number of base vectors required to span the space or subspace.
Theorem: If \( S \) and \( T \) are any two subspaces of a vector space \( Q \), then the subspace dimensions satisfy
\[
D(S) + D(T) = D(S \cap T) + D(S \cap T^c)
\]

![Venn Diagram](image)

Figure 1. Venn Diagram.

This theorem is readily verified. Consider the Venn Diagram shown in Figure 1. The objects of class \( s_j \) are contained by the left-hand circle, the objects of class \( t_j \) by the right-hand circle. The objects common to both classes are contained by the conjunction of \( s_j \) and \( t_j \) designated as \( s_j \cap t_j \). The objects of either class \( s_j \) or \( t_j \), written \( s_j \cup t_j \), and called the inclusive disjunction may be calculated by summing the number of objects in class \( s_j \) and the number in class \( t_j \) and subtracting the number objects contained by the conjunction. The subtraction of the conjunction is required since it has been summed twice. The result is
\[
N(s_j \cup t_j) = N(s_j) + N(t_j) - N(s_j \cap t_j)
\]

A formula of the above form could be written for every base vector under consideration. A given base vector either spans or does not span a
particular vector subspace. Therefore, $N(x)$ may have only two values, zero and one. If the resulting set of equations are summed the result would be

$$\sum_{j=1}^{m} N(s_j v t_j) = \sum_{j=1}^{m} N(s_j) + \sum_{j=1}^{m} N(t_j) - \sum_{j=1}^{m} N(s_j o t_j)$$

The dimension of a vector subspace is defined as

$$D(x) = \sum_{j=1}^{m} N(x_j)$$

so

$$D(S v T) = D(S) + D(T) - D(S o T)$$

A single vector is a vector subspace. Therefore, the fundamental dimensionality relationship, which involves two vector subspaces, must be valid for vector pairs. The dimension of a single vector, with elements from the binary field is equal to the number of elements having the value one. The dimension concept can be extended to include the vector addition connective

$$D(S o T) = D(S v T) - D(S o T)$$

A simple and rapid proof of the above relationship is obtained from consideration of the definition of the connectives or the Venn diagram and by virtue of the fact that the dimension relationships must be valid for any set of bases including a single base. Two additional dimensional relationships are obtained by linear combination of the previous dimension formula.

$$D(S) + D(T) = D(S o T) + 2D(S o T) = 2D(S v T) - D(S o T)$$

The preceding material has presented informally the concepts of the vector spaces necessary for an understanding of the remainder of
this paper. For a formal presentation of this material the reader is referred to any standard text on the subject of vector spaces. *

Vector Representation of Binary Numbers

The binary number system as employed in the digital computer is closed. The digit symbols are the elements of the modulo two field. Vector addition and scalar multiplication are obviously closed. Therefore, the closed set of binary numbers may be represented by a set of vectors in a vector space of finite dimensions.

The basis of the vector space may be any set of linearly independent vectors. The basis employed in the usual binary number representation is orthogonal. The base vectors are the set of n-tuples or binary numbers representing the powers of two from zero to m-1, where m is the dimension of the vector space. The number of binary digits is also equal to m. Let the base vectors be designated \( \mu_1 \). Then the familiar polynomial representation of a binary number \( A \)

\[ A = a_n 2^{n-1} + a_{n-1} 2^{n-2} + \ldots + a_2 b_1 + a_1 b_0 \]

is in one to one correspondence to the vector

\[ = a_n \mu_n \oplus a_{n-1} \mu_{n-1} \oplus \ldots \oplus a_2 \mu_1 \oplus a_1 \mu_1 \]

* See for example Birkhoff and MacLane. \(^{(14)}\)
where
\[ \mathbf{\mu}_1 = (0 \ 0 \ 0 \ldots \ 0 \ 0 \ 0 \ 0 \ 1) \]
\[ \mathbf{\mu}_2 = (0 \ 0 \ 0 \ldots \ 0 \ 0 \ 0 \ 1 \ 0) \]
\[ \mathbf{\mu}_3 = (0 \ 0 \ 0 \ldots \ 0 \ 0 \ 1 \ 0 \ 0) \]
\[ \vdots \]
\[ \mathbf{\mu}_{n-1} = (0 \ 1 \ 0 \ldots \ 0 \ 0 \ 0 \ 0 \ 0) \]
\[ \mathbf{\mu}_n = (1 \ 0 \ 0 \ldots \ 0 \ 0 \ 0 \ 0 \ 0) \]

In both expressions the \( a_i \) coefficients are elements of the modulo 2 field.

The cyclically permuted binary code is now considered as an illustration of a non-orthogonal basis.

\[ \mathbf{c} = [\mathbf{T}] \mathbf{\alpha} \]

Each column in the \( \mathbf{T} \) matrix is one of the base vectors. The vector representation of the cyclically permuted number is

\[ \mathbf{c} = a_n \delta_n \oplus a_{n-1} \delta_{n-1} \oplus \ldots \oplus a_2 \delta_2 \oplus a_1 \delta_1 \]
where
\[ \delta_1 = (0 \ 0 \ 0 \ 0 \ \ldots \ 0 \ 0 \ 0 \ 1) \]
\[ \delta_2 = (0 \ 0 \ 0 \ 0 \ \ldots \ 0 \ 0 \ 1 \ 1) \]
\[ \delta_3 = (0 \ 0 \ 0 \ 0 \ \ldots \ 0 \ 1 \ 1 \ 0) \]
\[ \delta_{n-2} = (0 \ 0 \ 1 \ 1 \ \ldots \ 0 \ 0 \ 0 \ 0) \]
\[ \delta_{n-1} = (0 \ 1 \ 1 \ 0 \ \ldots \ 0 \ 0 \ 0 \ 0) \]
\[ \delta_n = (1 \ 1 \ 0 \ 0 \ \ldots \ 0 \ 0 \ 0 \ 0) \]

The conversion from a cyclically permuted code to a binary code is defined by
\[ \gamma = [T^{-1}] \ c \]
\[ \begin{bmatrix}
1 & 1 & 1 & 1 & \ldots & 1 & 1 & 1 \\
0 & 1 & 1 & 1 & \ldots & 1 & 1 & 1 \\
\vdots & \vdots & \ddots & \vdots & \ddots & \vdots & \vdots & \vdots \\
0 & 0 & 0 & 0 & \ldots & 0 & 1 & 1 \\
0 & 0 & 0 & 0 & \ldots & 0 & 0 & 1 \\
\end{bmatrix} \]
where \[ [T^{-1}] = \begin{bmatrix}
0 & 1 & 1 & 1 & \ldots & 1 & 1 & 1 \\
0 & 0 & 1 & 1 & \ldots & 1 & 1 & 1 \\
\vdots & \vdots & \vdots & \ddots & \vdots & \vdots & \vdots & \vdots \\
0 & 0 & 0 & 0 & \ldots & 0 & 1 & 1 \\
0 & 0 & 0 & 0 & \ldots & 0 & 0 & 1 \\
\end{bmatrix} \]

Thus the binary number \( A \) is also in one to one correspondence with
\[ \gamma = c_n \epsilon_n \oplus c_{n-1} \epsilon_{n-1} \oplus \ldots \oplus c_2 \epsilon_2 \oplus c_1 \epsilon_1 \]
where
\[ \epsilon_1 = (0 \ 0 \ 0 \ \ldots \ 0 \ 0 \ 1) \]
\[ \epsilon_2 = (0 \ 0 \ 0 \ \ldots \ 0 \ 1 \ 1) \]
\[ \vdots \]
\[ \epsilon_{n-1} = (0 \ 1 \ 1 \ \ldots \ 1 \ 1 \ 1) \]
\[ \epsilon_n = (1 \ 1 \ 1 \ \ldots \ 1 \ 1 \ 1) \]

The vector representation of the binary number is unique. Therefore,
\[ A \leftrightarrow \alpha \]
and
\[ A \leftrightarrow \gamma \]
\[ \alpha = \gamma \]
The $\alpha$ and $\gamma$ vectors are identical and are representations of the same number $A$ with respect to different bases.

**Vector Length and Parity**

The square of the length of a vector $\alpha$ is given by the inner product of $\alpha$ with itself. Let us consider the vector representation, $\alpha$, of a binary number, $A$, with respect to an orthogonal set of bases, $\mu$.

Let

$$A = a_n \ldots a_1$$

then

$$= a_n \mu_n \Theta \ldots \Theta a_1 \mu_n$$

and

$$= L^2 = a_n a_n \Theta \ldots \Theta a_1 a_1$$

The last equation expressed in congruence notation is

$$\sum_{k=1}^{n} a_k a_k \equiv L^2 \pmod{2}$$

The only elements of the mod 2 field are zero and one. Therefore, the length and the square of the length are identical. The length, $L$, of the vector representing the binary number, $A$, is identical to the parity of $A$ modulo 2.

$$(\alpha \alpha \alpha) \equiv (L^2) \equiv (L) \equiv p$$

The square of the length of a vector, $\gamma$, with respect to a non-orthogonal basis is

$$\gamma \gamma = \sum_{i,j=1}^{n} a_i a_j \theta_{ij}$$
As an example the length of the vector with respect to the cyclically permuted code basis is determined.

\[
\begin{bmatrix}
G \\
\end{bmatrix} = \begin{bmatrix}
T^{-1} \\
T^{-1}
\end{bmatrix} = \begin{bmatrix}
1 & 1 & 1 & 1 & 1 & \ldots \\
1 & 0 & 0 & 0 & 0 & \ldots \\
1 & 0 & 1 & 1 & 1 & \ldots \\
1 & 0 & 1 & 0 & 0 & \ldots \\
1 & 0 & 1 & 0 & 1 & 1 & \ldots \\
1 & 0 & 1 & 0 & 1 & 0 & \ldots \\
\vdots & \vdots & \vdots & \vdots & \vdots & \ddots & \vdots \\
\end{bmatrix}
\]

The \( G \) matrix\(^*\) is symmetrical, \( g_{ij} = g_{ji} \). It follows from the symmetry of the \( G \) matrix and the identity

\( k \oplus k = 0 \)

that the off diagonal matrix elements do not contribute to \( L^2 \). The diagonal elements have a value

\( 1 = g_{ii} \mod 2 \)

The length of the vector in terms of the cyclically permuted code basis is

\( L = c_1 \oplus c_3 \oplus c_5 \oplus \ldots \)

which must be equal to

\( L = a_1 \oplus a_2 \oplus a_3 \oplus \ldots \oplus a_n = p \)

\(^*\) The \( G \) matrix is the metric matrix. A good reference is Brillouin [15]
The invariance of the length permits a check of the operation of conversion from one number system to another. A check procedure involving the even digits of the cyclically permuted code may be obtained if \( c_1 \) and \( a_1 \) are deleted from the respective representations by a replacement of the form.

\[
a_i \leftarrow a_{i+1} \\
c_i \leftarrow c_{i+1}
\]

\( a_2 \) and \( c_2 \) are the low order elements of \( A \) and \( C \) respectively. Therefore, the following relations must be valid

\[
L' = c_2 \oplus c_4 \oplus c_6 \oplus \ldots \\
L' = a_2 \oplus a_3 \oplus a_4 \oplus \ldots \oplus a_n
\]

**The Vector Space of Binary Addition**

The standard procedure for the addition of two \( m \) digit binary numbers may be described in a vector space in terms of the vector addition of three vectors. The three vectors are the augend vector, \( \alpha \), the addend vector, \( \beta \), and the carry vector, \( \gamma \). The sum vector is denoted by

\[
\sigma = \alpha \oplus \beta \oplus \gamma = \rho \oplus \gamma
\]

If the basis of the vector space is orthogonal and in one to one correspondence with the powers of two from zero to \( m + 1 \), then the carry vector may be defined recursively as

\[
c_1 = 0 \\
c_2 = a_1 b_1 \\
\vdots \\
c_{j+1} = c_j (a_j \oplus b_j) \oplus a_j b_j = c_j r_j \oplus k_j
\]
where \[ r_j = a_j \oplus b_j \]

and \[ k_j = a_j b_j \]

In column form the addition process is

\[
\begin{array}{cccc}
\text{r}_m & \ldots & \text{r}_2 & \text{r}_1 \\
\oplus & c_{m+1} & c_m & \ldots & c_2 & c_1 \\
\hline
s_{m+1} & s_m & \ldots & s_2 & s_1 \\
\end{array}
\]

Practical arithmetic circuitry is often designed to accept a \( c_1 \) element of unity. For example the insertion of a one digit in the low order column facilitates two complement arithmetic in a parallel machine. However, the arithmetic structures considered in this paper will be the type for which \( c_1 \) is always equal to zero.

Let the augend and the addend vectors have a maximum dimension of \( m \). The vector addition of the addend and augend vector is closed and the partial sum, \( \xi \), has a maximum dimension of \( m \). The carry vector as defined above has \( m + 1 \) elements but the first element of the carry vector, \( c_1 \), is always equal to zero. Therefore, the maximum dimension of the carry vector is \( m \). The sum is obtained from the vector addition of the partial sum vector and the carry vector. The actual dimension of the vector addition of the partial sum vector and the carry vector is at most \( m - 1 \) since \( c_1 = 0 \) and \( r_{m+1} = 0 \). The sum vector, \( \sigma \), has a maximum dimension of \( m + 1 \). The elements of the
sum vector are

\[ s_{m+1} = c_{m+1} \]
\[ s_i = r_i \quad i \neq 1 \]
\[ s_j = r_j \oplus c_j \quad j \neq m + 1 \]

A vector space of dimension \( m + 1 \) is required if the augend vector, the addend vector and the carry vector are to be considered simultaneously. It is seldom necessary to consider all three vectors at once. The vector space usually considered in the remaining sections will be a subspace and have a dimension of \( m \) or \( m - 1 \). The need for a dimension of \( m + 1 \) to represent the sum is in exact correspondence with the binary addition of two \( m \) digit binary numbers. The \( m + 1 \) dimension corresponds to the \( m + 1 \) digit which is the overflow digit.

The actual mechanization of the process of binary addition is relatively straightforward except for the generation of the carry vector. The logic required for the formation of the partial sum \( \xi \) by the vector addition of the addend and augend vector, and the vector addition of \( \xi \) and the carry vector \( \gamma \) is essentially singular. The absence of intercoordinate relationships is characteristic of the process of vector addition. The carry vector elements on the other hand have a high degree of intercoordinate dependence for

\[ c_{j+1} = F(a_j, b_j, \ldots, a_1, b_1) \]

The lack of independence of the carry elements is the basic factor to be considered in the design of a binary adder. The logic of various
digital parallel adders differs only in the means used to generate the carry. Furthermore, the error structure of the adder is determined primarily by the characteristics of the carry vector.

The Carry Vector

The recursive definition of the carry vector is adequate for many applications including the design of certain binary adders. However, for the purposes of this paper the expression of the carry vector in matrix notation has considerable utility and constitutes a unified approach to the problem of carry generation. In matrix notation the carry vector \( \mathbf{y} \) is given as

\[
\mathbf{y} = [D] \mathbf{k}
\]

The matrix equation in expanded form is shown in Figure 2.

The fact that the \( \mathbf{k} \) vector and \( \mathbf{c} \) vector are always orthogonal is an important characteristic of the process of carry vector generation. The proof of this orthogonality is as follows: By the definition of \( \mathbf{c} \) and \( \mathbf{k} \) we have

\[
\begin{align*}
    r_j &= 1 \Rightarrow k_j = 0 \\
    k_j &= 1 \Rightarrow r_j = 0
\end{align*}
\]

Both statements, by the definition of implication, are equivalent to,

\[
\bar{r}_j \lor \bar{k}_j = 1
\]

The proof is completed by application of De Morgan's Theorem.

\[
\begin{align*}
    \bar{r}_j \lor \bar{k}_j &= 1 \\
    r_j k_j &= 0
\end{align*}
\]
\[
\begin{bmatrix}
1 \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\n\end{bmatrix}
\begin{bmatrix}
k_1 \\
k_2 \\
k_3 \\
k_4 \\
k_5 \\
k_6 \\
k_7 \\
k_8 \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot \\
\cdot 
\end{bmatrix}
\]

where \( r_j = a_j \oplus b_j \)

and \( k_j = a_j \cdot b_j \)

**Figure 2.** Carry Vector.
Therefore:

\[ \varphi_{k} = 0 \]

The form of \( c_j \), the \( j \)th element of the carry vector is

\[ c_j = k_{j-1} \oplus k_{j-2} r_{j-1} \oplus k_{j-3} r_{j-2} r_{j-1} \oplus k_{j-4} r_{j-3} r_{j-2} r_{j-1} \oplus \ldots \]

It may be concluded from the orthogonality of the \( k \) and \( \varphi \) vectors that at most only one term in the above expression for \( c_j \) may be different from zero. This fact is seen immediately if \( c_j \) is expressed in the following form.

\[ c_j = k_{j-1} \oplus r_{j-1} \left[ k_{j-2} \oplus r_{j-2} \left[ k_{j-3} \oplus r_{j-3} \left[ k_{j-4} \oplus \ldots \right] \right] \right] \]

In addition to giving information of the structure of the carry vector, the orthogonality of the \( \varphi \) and \( k \) vectors has very practical consequences. In particular, since the conjunction of \( \varphi \) and \( k \) never occurs, the convenient mathematical connective, ring addition or exclusive disjunction \( \oplus \), may be replaced by the physically simpler connective, inclusive disjunction. The recursive definition is then

\[ c_{j+1} = (a_j \oplus b_j) \cdot c_j \cdot a_j b_j \]

The recursive definition may be further simplified by the use of the identities

\[ a_j \oplus b_j = (a_j \lor b_j) \overline{a_j b_j} \]

and

\[ a_j \lor b_j = a_j \lor b_j \]

The result is

\[ c_{j+1} = (a_j \lor b_j) \cdot c_j \lor a_j b_j \]
The physical logic associated with this definition is somewhat simpler than that associated with the definition of the carry using exclusive disjunction connectives. In fact the carry definition in terms of the inclusive disjunction is the most frequently used definition and might have been chosen as the starting point for this work. This was not done for two reasons. First the definition in terms of exclusive disjunction is mathematically more convenient. The convenience is primarily due to the fact that a variable is self inverse for the exclusive disjunction connective. Secondly, the definition in terms of the exclusive disjunction is more restrictive than the definition in terms of inclusive disjunction. Thus, more is learned about the carry vector structure if the most restricted case is considered first. Later the restrictions may be removed one by one and the effects observed.

Let \( v_j \) denote the inclusive disjunction of the \( j \)th augend and addend elements. If \( r_j \) is replaced by \( v_j \) then any particular row of the carry matrix may contain more than one, ONE element since

\[
c_j = k_{j-1} \lor k_{j-2}v_{j-1} \lor k_{j-3}v_{j-2}v_{j-1} \lor j_{j-4}v_{j-3}v_{j-2}v_{j-1} \lor \ldots
\]

and

\[
k_n \Rightarrow v_n
\]

In the discussion which follows \( \mathcal{E} \) and \( V \) may usually be used interchangeably. Special note will be given to situations where interchangeability does not exist.
Methods of Generation of the Carry Vector

The matrix relationship

\[ y = [D]k \]

completely defines the generation of the carry vector in terms of the k and the \( \mathcal{P} \) or \( V \) vectors. Essentially instantaneous generation of the carry vector is achieved if the elements of the physical logic correspond one by one to the logical connectives found in the matrix definition of the carry vector. This particular realization of the carry logic is characterized by the fact that the elements of the carry vector are independent. The elements of the carry vectors are independent in the sense that if k and \( \mathcal{P} \) or V are correct then a single error in the structure of the carry logic will cause an error in at most one element of \( y \). Unfortunately, for all practical purposes the number of logical elements required to realize an instantaneous and independent type of carry is prohibitive. A study of the carry vector reveals that, in addition to the logic required to generate the elements of \( \mathcal{P} \) or V and k, the carry logic required of each \( c_j \) element is one OR gate and \( j-2 \) AND gates; \( j \geq 3 \). Let \( m \) be the number of elements comprising the augend and addend vectors then \( k_j \) must drive \( m + 1 - j \) gate inputs and \( r_j \) or \( v_j \) must drive \((j-1) \cdot (m+1-j)\) inputs. The last formula is not valid for \( j = 1 \). \( r_1 \) does not influence the carry process and is not considered. \( r_1 \) is identical to the sum element \( s_1 \). The total carry logic of the instantaneous and independent type of carry requires
m - 1 OR gates
with \( \frac{m(m+1)}{2} \) - 1 inputs

\( \frac{m(m-1)}{2} \) AND gates
with \( \frac{1}{2} \sum_{j=3}^{m+1} (j - j - 2) = (m+1)(m+2) \frac{m}{6} - m \) inputs

The number of the logical elements required for the instantaneous and independent carry can best be illustrated by a numerical example. The equipment needed for the carry logic of a 30 bit binary adder is

29 OR gates with 464 inputs
435 AND gates with 4930 inputs

\( k_1 \) drives 30 gate inputs
\( k_2 \) drives 29 gate inputs
\( \vdots \)
\( k_{30} \) drives 1 gate input

\( *r_2 \) drives 29 gate inputs
\( r_3 \quad " \quad 56 \quad " \quad " \)
\( r_4 \quad " \quad 81 \quad " \quad " \)
\( r_5 \quad " \quad 104 \quad " \quad " \)
\( \vdots \)
\( r_{15} \quad " \quad 224 \quad " \quad " \)
\( r_{16} \quad " \quad 225 \quad " \quad " \)
\( r_{17} \quad " \quad 224 \quad " \quad " \)
\( \vdots \)

* \( r_j \) or \( v_j \)
\( r_{29} \) drives 56 gate inputs
\( r_{30} \)

The independence of the elements of the carry vector may be maintained and the number of logical elements reduced if the instantaneous feature of the previous carry logic is not required. The elements of any row of the D matrix may be generated sequentially. The logic for one carry digit \( c_j \) is shown in Figure 3. The logic required for each \( c_j \) is \( 2j-5 \) AND gates and one OR gate. For \( m \) digits the total carry logic consists of \( m-1 \) OR gates with a total of \( \frac{m(m+1)}{2} \) inputs and

\[
\sum_{j=3}^{m+1} 2j - 5 = (m+1)(m-3) + 4
\]

AND gates. Each AND gate has exactly two inputs. An element of \( k, k_j \), and an element of \( \rho, r_j \), must each drive \( m-j+1 \) inputs. This scheme of carry generation requires the following components if \( m \) is equal to thirty as in the previous example.

29 OR gates with 464 inputs
841 AND gates with 1682 inputs

\( k_1 \) drives 30 gate inputs
\( \vdots \)
\( k_{30} \)
\( \vdots \)

\( r_2 \)
\( \vdots \)

\( r_j \) or \( v_j \)

\( r_{30} \)
Figure 3. Logic for Independent, Sequential Generation of One Carry Element.

It is observed that realization of a carry, with independent elements, by the above procedure is by no means economical. Furthermore, the method suffers considerable time delay due to the sequential method employed to generate the elements of the D matrix. The sacrifice in time does permit a significant reduction in the required drive capabilities of the \( r_1 \) or \( v_1 \) elements. All in all a high price is paid in terms of either components, or time, or both, to achieve independence of the carry elements.

The generation of the carry vector in a synchronous logical structure in general leads to more extensive logic or requires more time than a corresponding asynchronous circuit which permits carry
rippling. This problem of carry generation for synchronous circuitry has been studied by Weinberger and Smith. The paper details a particular technique used to simplify the logical configuration of the D matrix. There is no indication that the technique used by Weinberger and Smith is optimal though the results are reasonable.

The carry vector may be considered to be composed of a set of sub-carry vectors according to the following relationship

\[ \gamma = \gamma_m \oplus \gamma_{m-1} \oplus \ldots \oplus \gamma_2 \oplus \gamma_1 \]

In particular let \( \gamma_j \) be chosen as the \( j \)th set of diagonal elements from the matrix definition of the carry. Then

\[ \gamma_1 = (k_m, k_{m-1}, \ldots, k_2, k_1, 0) \]
\[ \gamma_2 = (k_{m-1} r_m, k_{m-2} r_{m-1}, \ldots, k_2 r_2, 0, 0) \]
\[ \gamma_3 = (k_{m-2} r_{m-1} r_m, k_{m-3} r_{m-2} r_{m-1}, k_1 r_2 r_3, 0, 0, 0) \]
\[ \vdots \]
\[ \gamma_m = (k_1 r_2 r_3 \ldots, r_m, 0, 0, \ldots, 0, 0, 0) \]

The set of carry sub-vectors as defined above form an orthogonal set and therefore the sum of the sub-vector may be obtained by the inclusive disjunction connective.

\[ \gamma = \gamma_m \lor \gamma_{m-1} \lor \ldots \lor \gamma_2 \lor \gamma_1 \]

\( \lor \) is replaceable by \( \lor \) if and only if the inclusive disjunctive connective is used to sum the sub-carry vectors. The sub-carry vectors may be defined recursively. The recursive definition of the sub-carry vectors corresponds to a carry realization procedure economically feasible in equipment but time consuming. The recursive definition is
\[ y_1 = \begin{bmatrix} 2 \end{bmatrix} k \]
\[ y_2 = \begin{bmatrix} 2 \end{bmatrix} (\begin{bmatrix} 2 \end{bmatrix} k) = \begin{bmatrix} 2 \end{bmatrix} (y_{1\rho}) \]
\[ \vdots \]
\[ y_j = \begin{bmatrix} 2 \end{bmatrix} (y_{j-1\rho}) \]

The notation \( \begin{bmatrix} 2 \end{bmatrix} \) corresponds to a left shift of one element. In matrix notation

\[
\begin{bmatrix} 0 & \ldots & \ldots & \ldots \\
1 & 0 & \ldots & \ldots \\
0 & 1 & 0 & \ldots \\
\vdots & \begin{bmatrix} 0 & 1 & 0 \\
\vdots & \begin{bmatrix} 0 & 1 & 0 \\
\vdots & \begin{bmatrix} \end{bmatrix} \end{bmatrix} \end{bmatrix} \\
\vdots & \begin{bmatrix} \end{bmatrix} \end{bmatrix}
\]

The multiplications specified in the recursive definition of the sub-carry vectors are non-associative. The product \( y_{j-1} \) is formed first, followed by the left shift of one element. An addition process which obtains a carry vector by means of operations utilizing the sub-vectors is sometimes termed "programmed addition" because the process can easily be executed on a basic automatic computer in the absence of an addition instruction. The generation of the carry vector from the sub-vectors was employed by Robertson\(^{10}\) as a means of preventing the propagation of errors. Error propagation is prevented if suitable error detection and correction procedures are applied after the creation of each partial sum of the carry vector. Robertson employed a Hamming
Code\(^{(5)}\) to obtain the redundancy needed to detect and correct single errors which might occur at each step in the carry generation procedure.

The simplest logic which generates the carry vector is obtained if the logic corresponds to the recursive definition of the carry vector.

\[ c_2 = k_1 \]
\[ c_j = c_{j-1} \cdot k_{j-1} \]
\[ c_j = c_{j-1} \cdot v_{j-1} 
\]

or

This definition may be represented by the matrix equation

\[ k = [D^{-1}]y \]

\(D^{-1}\) is the inverse of the D matrix which has been used previously to define the carry process. The inverse matrix exists only for the definition of D in terms of \(\varphi\),

\[
[D^{-1}] =
\begin{bmatrix}
1 & 0 & . & . & . \\
r_2 & 1 & 0 & . & . \\
0 & r_3 & 1 & 0 & . \\
. & 0 & r_4 & 1 & . \\
. & . & 0 & r_5 & . \\
. & . & . & . & .
\end{bmatrix}
\]

The logic of this type of carry generation is shown in Figure 4. The logic is characterized by the interdependence of the carry digits and by the fact that different carry sequences may propagate simultaneously. The carry digit interdependence is completely characterized by the k and
\( \rho \) vectors. In all cases a propagation of carries is triggered by a \( k_j \) element equal to one. The carry is propagated through the successive elements only if successive elements of \( \rho \) are equal to one. The carry propagation sequence stops at \( r_{j+n} \), the first non-unity element of the \( \rho \) vector succeeding \( r_j \). Of course, a carry sequence may have been initiated by \( k_{j+n} \) but this constitutes a new carry sequence and not a continued propagation of a carry sequence. Due to the orthogonality of the \( \rho \) and \( k \) vectors no carry sequences overlap. If \( k_i \) initiates a carry then \( r_i \) must stop the propagation of any previous sequence.

If \( V \) is substituted for \( \rho \) then the sequence may propagate through points of carry generation. This will affect the error structure of the adder but will not affect the ordinary addition process. It has been shown in Burks et al.\((17)\) that if \( m \) is equal to 40 then the average length of the maximum carry propagation is 4.62 elements.

The characteristics of the carry propagation and generation is considered in a later section.

Figure 4. Standard Carry Logic.
Synthesis by Vector Space Operations

The process of binary addition for all sum digits except the first is completely defined by the matrix equation

$$\sigma = \rho \Theta [D] k$$

In the language of vector spaces the binary addition process consists of an affine transformation of the vector $k$. The vector $k$ is rotated as a result of the multiplication of $k$ by the $D$ matrix. The result of the multiplication is then translated by the vector addition of the $\rho$ vector. The whole process is complicated by the fact that the $D$ matrix is a function of the $\rho$ vector.

New logical configurations for the process of binary addition may be synthesised by either of two procedures: (1) by the factorization of the $D$ matrix and (2) by a change of basis transformation over the whole addition process. The different logical configurations discussed in the previous section were obtained as a result of different factorizations of the $D$ matrix.

One factorization of the $D$ matrix follows from the definition of the $D$ matrix as

$$D = [D^{-1}]^{n-1} [D]^n$$

The resulting carry logic has a logical circuit depth of two. If a value of $n$ equal to 5 is chosen the resulting logic is considerably simpler than instantaneous carry logic but not as simple as the multiple depth logic proposed by Weinberger and Smith.\(^{(16)}\) A factorization in this manner can lead to fast carry logic which is economical in terms
of the number of logical components. However, the factorization does not lead to a carry logic particularly appropriate for error checking.

The difficulties encountered when a change of basis transformation on the whole addition process is attempted is due to the fact that the $D$ matrix is a function of the $\rho$ vector. Consider the following transformation

$$[T] \sigma = [T] \rho \Theta [T] [D] k$$

While $\sigma$, $\rho$ and $k$ may be considered in terms of a new basis the elements of the $D$ matrix are given with respect to the original basis. Thus the transformed addition process defined as

$$\sigma' = \rho' \Theta [D] k'$$

cannot be expected to lead to a simpler definition of the carry process because of the functional dependence of the $D$ matrix on the $\rho$ vector.

Thus the transformed addition process is to be regarded in terms of the original basis. A simple transformation of this type and in all probability the only practical transformation of this type is obtained if the transformation matrix is identical to the inverse $D$ matrix. The multiplication of the matrix definition of the sum by the inverse $D$ matrix yields

$$[D^{-1}] \sigma = [D^{-1}] \rho \Theta k$$

The indices have the same range as specified above. The matrix equation involving the inverse $D$ matrix specifies a recursive definition for the sum elements.
\[ s_2 = r_2 \Theta k_1 \]
\[ s_3 = r_2 (1 \Theta s_2) \Theta r_3 \Theta k_2 \]
\[ \vdots \]
\[ s_j = r_{j-1} (1 \Theta s_{j-1}) \Theta r_j \Theta k_{j-1} \]

As a result of the orthogonality of \( k \) and \( \rho \) the equations are reduced to the form

\[ s_j = (r_{j-1} \bar{s}_{j-1} \lor k_{j-1}) \Theta r_j \]

The logic corresponding to this particular method is shown in Figure 5.

The circuit has a very prominent weakness. A propagated carry must pass through four logical elements per stage rather than the two elements per stage required in the conventional scheme of Figure 4. The count is four logical elements per stage rather than three because the usual realization of the exclusive disjunction requires a cascade of two logical elements. The circuit was included here to illustrate a particular method of generating alternative logical structures by matrix manipulation.

![Figure 5. Sum Dependent Carry Logic.](image-url)
Numerical Relationships

In this section the dimensionality concept is used to establish particular numerical relationships which exist between the results of specific logical operations. One important consequence of the existence of the numerical relationships is the feasibility of using numerical checking and correcting procedures to control the errors produced by faulty elements in a logical structure. The numerical relationships will be used in the next section to specify alternate algorithms of the addition process.

Consider the dimensionality relationship

\[ D(\alpha) + D(\beta) = D(V) + D(k) \]

where \( V = \alpha \lor \beta \)

and \( k = \alpha \beta \)

The dimensionality relationship must be valid for any set of coordinates. If a single coordinate is considered then the following relationships exist between the values of \( a_j, b_j, v_j, \) and \( k_j \).

| TABLE I |
| --- | --- | --- | --- |
| RELATION BETWEEN \( a_j, b_j, v_j, k_j \) |
| \( a_j \) | \( b_j \) | \( v_j \) | \( k_j \) |
| 0 | 0 | 0 | 0 |
| 0 | 1 | 1 | 0 |
| 1 | 0 | 1 | 0 |
| 1 | 1 | 1 | 1 |
Digitally the difference between \( a_i \), \( b_i \) and \( v_i \), \( k_i \) is a possible zero and one interchange between columns as shown in row two of Table I. The interchange occurs if \( v_j \) and \( k_j \) are substituted for \( a_j \) and \( b_j \) but there is no numerical effect since

\[
0 + 1 = 1 + 0
\]

Therefore, the dimensionality relationship implies the numerical relationship

\[
\sigma = \alpha + \beta = V + k
\]

A second numerical relationship is obtained from the dimensionality equation.

\[
D(\rho) + D(k) = D(V)
\]

where

\[
\rho = \alpha \times \beta
\]

The numerical relationship is

\[
\rho + k = V
\]

The validity of this equation is established by the following argument. The \( \rho \) and \( k \) vectors are orthogonal and therefore the magnitude of the sum of a particular coordinate, \( r_j + k_j \), cannot exceed unity. The addition of \( \rho \) and \( k \) can never produce a carry. This fact and the implication

\[
k_j \geq v_j
\]

assure the validity of the numerical relationship between \( \rho \), \( k \) and \( v \).

The dimension relationships may be expanded to include more than a pair of variables. In particular, the following dimension equation is obtained by considering a Venn diagram with three overlapping classes or by application of the previously derived dimension
formula of two variables.

\[ D(\alpha \lor \beta \lor \gamma) = D(\alpha) + D(\beta) + D(\gamma) - D(\alpha \beta) - D(\alpha \gamma) - D(\beta \gamma) + D(\alpha \beta \gamma) \]

The corresponding numerical relationship is

\[ (\alpha \lor \beta \lor \gamma) = \alpha + \beta + \gamma - \alpha \beta - \alpha \gamma - \beta \gamma + \alpha \beta \gamma \]

The following examples are given to illustrate the numerical relationship previously described.

Let \( \alpha = (1 \ 0 \ 1 \ 1 \ 0 \ 1) = 45 \)

and \( \beta = (1 \ 1 \ 1 \ 0 \ 0 \ 1) = 57 \)

then \( V = (1 \ 1 \ 1 \ 1 \ 0 \ 1) = 61 \)

\( k = (1 \ 0 \ 1 \ 0 \ 0 \ 1) = 41 \)

\( \rho = (0 \ 1 \ 0 \ 1 \ 0 \ 0) = 20 \)

now \( \alpha + \beta = V + k \)

\[ 45 + 57 = 61 + 41 = 102 \]

and \( \rho + k = V \)

\[ 20 + 41 = 61 \]

Now let us consider the extension to three variables.

Let \( \delta = (1 \ 1 \ 0 \ 0 \ 1 \ 0) = 50 \)

then

\[ (\alpha \lor \beta \lor \delta) = \alpha + \beta + \delta - \alpha \beta - \alpha \delta - \beta \delta + \alpha \beta \delta \]

\[ 63 = 45 + 57 + 50 - 41 - 32 - 48 + 32 \]

\[ 63 = 63 \]

In this section we have shown the existence of two relationships which can be employed to check the generation of the \( \rho \), \( k \) and \( V \) elements which are functions of the addend and augend elements. The most important characteristic of the derived checks is the fact that
the checks are numerical. This means that the checks may be applied on a digit by digit basis, as a check on certain parts of a representation or as a check of all elements of the representation. Both check processes are of the type which detect the occurrence of a single error. The check
\[ \alpha + \beta = V + k \]
will detect malfunctions in the V and the k elements if the \( \alpha \) and \( \beta \) elements are correct. Likewise the check
\[ \rho + k = V \]
may be used to check errors in the \( \rho \) elements only when the k and the V elements are correct.

It is logical to ask whether similar check procedures exist for the carry generation process. Unfortunately the answer is no. The checks given above owe their existence to either the absence of carries or the presence of identical carry processes appearing on opposite sides of the equality sign. The carry generation process is characterized by the presence of constraints and the dependent nature of the elements of the carry vector. A check of the carry generation process can only be made by either duplicating the carry generation or by partitioning the carry generation process into checkable parts as is done for the carry generation process used in programmed addition.

**Addition Algorithms**

The numerical relationships derived from the dimension concept in the previous section may be used to obtain alternate algorithms for binary addition.
The relationship
\[ \sigma = \alpha + \beta = V + k \]
\[ V = \alpha \lor \beta \]
\[ k = \alpha \beta \]
states that the sum of the augend and addend is identical to the sum
of the disjunction and the conjunction of the augend and the addend.

The same carry vector is associated with either algorithm for
\[ \rho = \alpha \cdot \beta = V \cdot k \]
\[ k = \alpha \beta = kV \]

The numerical relationships
\[ \sigma = \alpha + \beta = V + k \]
and
\[ \sigma + k = V \]
\[ \rho = \alpha \cdot \beta \]
may be combined linearly to give two binary addition algorithms of
the form
\[ \sigma = k + k + \rho = 2k + \rho \]
\[ \sigma = V + V - \rho = 2V - \rho \]

In the remaining part of this paper the following terminology is
adopted. The addition algorithm described by the first equation
above is called \( k\rho \) addition. The second process is called \( V\rho \)
addition. The usual addition procedure is called standard addition
and defined by \( \sigma = \alpha + \beta \). The process \( \sigma = V + k \) is termed \( Vk \) addition.

The concept of dimensionality has led to the definition of
alternative algorithms for addition. The important aspect of the \( k\rho \)
and \( V \rho \) algorithms is that the carry or borrow vectors associated with these processes differ from the carry vector associated with the standard and the \( V k \) addition processes. The fact that the \( k \rho \) carry vector and the \( V \rho \) borrow vector are different from the standard carry vector is sufficient reason to investigate in detail the addition algorithms. The characteristics of the carry or borrow vector determine both the propagation of errors and the average addition time of a parallel type binary adder with asynchronous carry.

**Constraints and Some Properties of the Addition Algorithms**

Constraints play an important role in the determination of the characteristics of the \( k \rho \) and \( V \rho \) addition processes. The constraints and some of the effects of the constraints are discussed in this section.

The constraint of the \( k \rho \) addition process is

\[
k_j \rho_j = 0
\]

Consider the addition process as expressed in column form

\[
\begin{array}{ccc}
\ldots & k_3 & k_2 & k_1 \\
+ & \ldots & r_4 & r_3 & r_2 & r_1 \\
\ldots & s_4 & s_3 & s_2 & s_1
\end{array}
\]

The sum of any column is given by the vector sum of the \( k \), and carry elements.

\[
\sigma = 2k \oplus \rho \oplus \delta
\]
The effect of the constraint is to limit the number of elements with a value of one in any column. The number of ones in any column cannot exceed two. The proof of this statement is obtained by considering the following situation. Let \( k_{j-1} \) and \( r_j \) both have a value of one so that a carry is generated. Then \( r_{j-1} \) must be equal to zero and no carry from a coordinate or column less than \( j-1 \) is propagated through \( j-1 \).

It is also obvious that a carry cannot be generated at column \( j-1 \). The number of ones in columns \( j-1 \) and \( j \) is at most two. In column \( j+1 \), \( k_j \) is constrained to a value of zero and \( c_{j+1} \) the carry element of the column has a value of one. If \( r_{j+1} \) is equal to zero the \( j+1 \) column has only one digit of value one. If \( r_{j+1} \) is equal to one there are two one digits in the column. Thus, three one digits never occur in any column and a two input type of adder is sufficient. However, no overall saving of components results since a two input adder is required to form each \( \rho \), \( k \) element pair. One other effect of the constraint in \( \rho \) type addition is that carry propagation is determined solely by the elements of \( \rho \). If the carry is propagated then \( r_{j+1} \) must be equal to one. But if this is true, then \( k_{j+1} \) is equal to zero and \( r_{j+2} \) must equal one to propagate the carry. The carry continues to propagate until \( r_n \) is equal zero.
The following example is presented as an illustration of the \( \kappa \rho \) algorithm.

Given:
\[
\begin{align*}
\alpha &= (1 \ 1 \ 0 \ 0 \ 1 \ 1 \ 1 \ 0 \ 1) \\
\beta &= (1 \ 1 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 1) \\
\rho &= (0 \ 0 \ 0 \ 0 \ 1 \ 1 \ 0 \ 0 \ 0) \\
k &= (1 \ 1 \ 0 \ 0 \ 0 \ 0 \ 0 \ 1 \ 0)
\end{align*}
\]

then:
\[
\begin{align*}
\sigma &= 2k + \rho \\
000 & \ 011 \ 000 \\
+ & \ 1100 \ 001 \ 01 \\
\hline
1100 & \ 100 \ 010
\end{align*}
\]

the sum is obtained by the addition.

the carry vector of the process is
\[
\boldsymbol{y} = (0 \ 000 \ 110 \ 00)
\]

For this example the carry vector associated with standard addition is
\[
(1 \ 100 \ 111 \ 01)
\]

The \( \kappa \rho \) algorithm uses subtraction to obtain the sum. It is therefore appropriate to discuss the generation and propagation of borrows rather than carries. A borrow is generated only if the minuend element has a value zero and the corresponding subtrahend element a value one. The borrow propagates through succeeding elements if the subtrahend and minuend elements have identical values. The borrow \( h_j \) is given the equation
\[
h_j = (x_{j-1} \ \text{eq} \ \bar{y}_{j-1}) \ h_{j-1} \lor \ x_{j-1} \ y_{j-1}
\]
The above form of the equation is appropriate for probabilistic consideration since the causes of the propagation and the generation of the borrows are separated. In practice a more convenient logic is

\[ h_j = (\bar{x}_{j-1} \lor y_{j-1}) h_{j-1} \lor \bar{x}_{j-1} y_{j-1} \]

The \( V \bar{e} \) addition process is constrained by the relations:

\[ r_j v_j \]

or

\[ \bar{r}_j v_j = 1 \]

In column form the addition process is represented by

\[
\begin{array}{cccc}
\vdots & v_3 & v_2 & v_1 \\
- & r_4 & r_3 & r_2 & r_1 \\
\vdots & s_4 & s_3 & s_2 & s_1
\end{array}
\]

The constraint has the following effects. A borrow is generated if

\[ \bar{v}_{j-1} r_j = 1 \]

The borrow propagates the succeeding column if \( r_{j+1} \) is of value unity. \( v_j \) does not influence the borrow propagations since the value of \( v_j \) is constrained to one if \( r_j \) is equal to one. Thus the constraining relationship propagates with the carry. In the normal subtraction process the borrow is propagated if equality exists between the minuend and the subtrahend elements. In the \( V \bar{a} \) algorithm a borrow is propagated only if both the minuend and the subtrahend are unity. In practice only a two input subtractor is required but a two input adder is required to generate each pair of \( \rho \) and \( V \) elements.
The previous example is considered in terms of the $V\rho$ algorithm.

\[
\alpha = (1\ 1\ 0\ 0\ 1\ 1\ 1\ 0\ 1)
\]

\[
\beta = (1\ 1\ 0\ 0\ 0\ 0\ 1\ 0\ 1)
\]

\[
\rho = (0\ 0\ 0\ 0\ 1\ 1\ 1\ 0\ 0)
\]

\[
V = (1\ 1\ 0\ 0\ 1\ 1\ 1\ 1\ 0)
\]

\[
\begin{array}{cccc}
1 & 1 & 0 & 0 \\
- & 0 & 0 & 0 \\
1 & 1 & 0 & 0 \\
\end{array}
\]

In this example the borrow vector was the null vector. Consider as a second example

\[
\alpha = (1\ 1\ 1\ 1\ 0)
\]

\[
\beta = (0\ 0\ 1\ 0\ 0\ 1)
\]

\[
\rho = (1\ 1\ 0\ 1\ 1\ 1)
\]

\[
V = (1\ 1\ 1\ 1\ 1)
\]

\[
\begin{array}{cccc}
1 & 1 & 1 & 1 \\
- & 1 & 1 & 0 \\
1 & 0 & 0 & 1 \\
\end{array}
\]

The borrow vector $H$ is

\[
H = (0\ 0\ 0\ 1\ 1\ 1)
\]

An additional algorithm for binary addition consisting of both $k\rho$ and $V\rho$ addition processes may be verified by the comparing
of the carry and borrow characteristics of $k \rho$ and $V \rho$ addition respectively. The new algorithm will be called $k \rho$ $V$ addition and is now described. The process requires the generation of vectors $c$, $2k$ and $2V$. The addition process is started with the $k \rho$ algorithm at the low order column. The $k \rho$ algorithm is applied to succeeding columns until a column is found which would generate a carry. Either the $k \rho$ sum or the $V \rho$ sum is recorded for the carry generating column but the next column is summed using the $V \rho$ algorithm. The $V \rho$ algorithm is used until a column is found which generates a borrow. The $k \rho$ or the $V \rho$ sum is recorded for this column but the sum for all succeeding columns, until a carry is generated, is found by the $k \rho$ algorithm. At the columns which generate a carry or a borrow both $k \rho$ and $V \rho$ addition give the same result. The procedure is continued until the final digit is summed. The validity of the algorithm is seen in terms of the constraints of the $k \rho$ and $V \rho$ addition processes. The first column never generates a carry for the $k \rho$ algorithm since no element of $2k$ corresponds to $r_1$. The $k \rho$ algorithm is then applied to the second column. Let us assume that column $j$ will generate a carry, i.e., $(r_j = 1, k_{j-1} = 1)$. The constraints dictate that $v_{j-1} = 1$, $r_{j-1} = 0$ and $v_j = 1$. For column $j$ the $k \rho$ and $V \rho$ algorithms give the same sums since

$$r_j \oplus k_{j-1} = r_j \oplus v_{j-1} = 0$$

The $V \rho$ algorithm may be substituted for the $k \rho$ algorithm at the $j$th column only if there is no borrow propagation into the $j$th column.
This is assured since $r_{j-1} = 0$ and $v_{j-2} = 0$ if no borrow is propagated from the $j-2$ column and $v_{j-2} = 1$ if a borrow is propagated from the $j-2$ column. The $k\rho$ algorithm may be substituted for the $V\rho$ algorithm when a borrow occurs at the $p$th column if there is no carry propagation into the $p$th column. The borrow occurs when $r_p = 1$ and $v_{p-1} = 0$. Therefore, $k_{p-1} = 0$, and $r_{p-1} = 0$. If $k_{p-2} = 1$ there can be no carry sequence into the $p-1$ or the $p$ column. Thus it has been shown that due to the constraint the specified algorithm change is valid.

At the moment the $k\rho V$ algorithm is only of academic interest. No practical logical configuration has been found for the $k\rho V$ algorithm which is not identical to some simpler algorithm. In fact the circuits examined contained unnecessary redundancy. However, the $k\rho V$ algorithm does constitute an interesting description of the addition process. An example of the $k\rho V$ algorithm is now given.

Let
\[
\alpha = (1\ 1\ 0\ 0\ 0\ 1\ 0\ 1\ 1)
\]
\[\beta = (0\ 1\ 1\ 1\ 0\ 1\ 1\ 1\ 0)\]
then
\[k = (0\ 1\ 0\ 0\ 0\ 1\ 0\ 1\ 0)\]
\[V = (1\ 1\ 1\ 1\ 0\ 1\ 1\ 1\ 0)\]
\[\rho = (1\ 0\ 1\ 1\ 0\ 0\ 1\ 0\ 1)\]

\[
\begin{array}{cccccc}
0 & 1 & 0 & 0 & 0 & 1 \\
1 & 1 & 1 & 1 & 0 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
0 & 1 & 0 & 0 & 0 & 1 \\
1 & 0 & 1 & 1 & 0 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 1 & 1 & 0 & 0 \\
1 & 0 & 0 & 1 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]

\[
\begin{array}{cccccc}
1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1
\end{array}
\]
Evaluation of the $k\rho$ and the $V\rho$ Logic

The generation of the $k$, $V$ or $\rho$ elements required by the $k\rho$ or the $V\rho$ addition processes can easily be realized by the conventional half adder circuitry shown in Figure 6. Additional logic must be provided to generate and propagate carries or borrows and also to perform the vector addition of the $k$, $\rho$, and carry elements or the vector subtraction of the $V$, $\rho$, and the borrow elements. Due to the effect of the constraints the vector addition or subtraction operations require only one modified half adder or modified half subtractor per adder stage. The logic required for the $k\rho$ algorithm is shown in Figure 7. The logic required for the $V\rho$ algorithm is shown in Figure 9.

It is necessary to compare the $k\rho$ and the $V\rho$ logical configurations with the standard adder configurations in order to determine whether the new algorithms have any merit. The logic of a standard adder configuration is shown in Figure 8. A comparison of the logic shown in Figure 8 and that of Figure 7 reveals immediately the minor differences between the $k\rho$ logic and the standard logic. The difference is in the point at which the carry is obtained from the carry logic. For the $k\rho$ logic the carry is obtained from the output of an AND gate. For the standard logic the carry is obtained from the output of an OR gate. The change in the point at which the carry is derived has no effect on the performance of the adder circuit. We conclude therefore that the $k\rho$ adder offers no advantage over the standard adder.
Figure 6. Half Adder.

Figure 7. Realization of the kp Addition Algorithm.
Figure 8. Logic for the Standard Addition Algorithm.

Figure 9. Logic for the \( \nu \) Addition Algorithm.
configurations. The same is true of the $V\rho$ adder when compared against standard half adder - half subtractor configurations. However, the change in the point of carry derivation does change the statistics of the carry generation process. An analysis of the statistics of the carry process for $k\rho$ or $V\rho$ addition would reveal that the probability of carry initiation is equal to 1/8 and the probability of carry propagation is equal to 1/2. For the standard addition algorithm the probability of carry initiation is 1/4 and the probability of carry propagation is equal to 1/2. However, these statistics can be very misleading. The probabilities associated with the inputs to the AND gate from which the sum element is obtained are identical for all the addition algorithms. In view of this we must conclude that the new addition algorithms presented here form interesting alternate descriptions of the addition process. But no logical configurations have been found which differ substantially from the standard logical configurations. In view of this the discussion in the next chapter is restricted to the standard addition algorithm.
CHAPTER IV

THE ERROR STRUCTURE OF BINARY ADDITION AND
THE EFFECTIVENESS OF THE PARITY CHECK

Introduction

The sections which follow contain a detailed investigation of the effects of logical malfunctions on the correct sum representation for standard binary addition. The carry logic is the most important single factor which determines the error structure. If it were not for the carry process, the sum elements could be completely independent and the error structure of the addition process would be extremely simple.

The logic of binary addition is divided into three parts. The first part consists of a half adder circuit which obtains the k, r and v elements from the addend and augend elements. This logic will be referred to as the generating logic. The second part of the logic of the adder is the carry logic. A standard carry logic will be considered. The carry logic requires one AND gate and one OR gate per digit. Initially the carry logic is assumed to have no logical elements in common with any other part of the adder logic. The third part of the adder logic performs the vector addition of a carry element and an element of the \( \rho \) vector to produce the sum element and is called the rc sum logic.

The form of the standard adder logic under consideration is shown in Figure 10. Conventionally gates \( g_5 \) and \( g_6 \) are combined since

-73-
the output of gate $g_6$ provides the required input to gate $g_4$. In the immediate discussion which follows the separation of $g_5$ and $g_6$ will be assumed, though consideration will be given to the possibility that the input of $g_5$ is either $r_1$ or $v_1$.

In the last sections of this chapter the effectiveness of both digital and numerical error detecting methods is considered for the different error structures associated with particular malfunctions of circuit components.

![Diagram of adder logic](image)

**Figure 10. Adder Logic.**
The Error Structure of the Carry Logic of Standard Addition

In this section the effects of various malfunctions in the standard carry logic are considered. The matrix representation of the carry logic is particularly appropriate for the study of the carry error structure. The elements of the vectors of the addition process are considered as random variables. The probability that an element \( k_i \) of the \( k \) vector equals one is \( 1/4 \). The probability that an element \( r_i \) of the \( \rho \) vector equals one is \( 1/2 \). If these values are substituted into the carry matrix shown in Figure 2 the result will be the matrix shown in Figure 11. The matrix defines the probability \( p(c_1) \) that a given element \( c_1 \) of the carry vector has a value of unity. The matrix representation of the carry vector in terms of the \( k \) and \( \rho \) elements is particularly appropriate for probabilistic studies since the elements of any row of the carry matrix are mutually exclusive. The probability that a given carry element is of value unity is obtained simply by summing the corresponding row elements and multiplying the sum by \( 1/4 \). If the carry matrix were defined in terms of the \( k \) and \( V \) elements it would be necessary to consider the fact that the matrix elements in terms of \( k \) and \( V \) vector elements are not mutually exclusive. However, the values for \( p(c_1) \) would be identical, since the same carry vector must be obtained regardless of whether the carry matrix is defined in terms of \( \rho \) and \( k \) or \( V \) and \( k \) vector elements. It is simpler to use the carry matrix in terms of \( k \) and \( \rho \) vector elements because of the mutually exclusive properties. However,
the carry vector determined by $k$ and $p$ vector elements will have a
different error structure from the carry vector defined in terms of $k$
and $V$ vector elements. Both definitions provide the same results if
there are no logical malfunctions. It is only when circuit malfunctions
are present that the difference in the error structure of the two
schemes becomes apparent. For reasons of mathematical convenience
the present studies of the error structure of the carry vector are
continued in terms of elements of the $p$ and $k$ vectors. $p(c_i)$, the
probability that $c_i = 1$, obtained from the matrix representation of
Figure 11, is

$$p(c_j) = \frac{1}{4} (2 - 2^{-1+2})$$
$$= \frac{1}{2} - 2^{-i} \quad 2 \leq i \leq m+1$$

Consider the carry logic shown in Figure 12. The logic repre-
sents that required for standard addition. The logic is implemented
using semi-conductor diodes. A study of the circuit will reveal that
the following correspondences exist between open diodes and $k_1$ or $r_1$
constraints

$$d_{40} \quad \text{corresponds to } k_1 = 0$$
$$d_{20} \quad \text{" } \quad r_1 = 1$$
$$d_{30} \quad \text{" } \quad r_1 = 0$$
$$d_{10} \quad \text{" } \quad c_1 = 1$$

But $c_1 = 1$ if $k_{i-1} = 1$

Thus the effect of a certain diode malfunction may be analyzed by a
constraint on the $k_1$ or $r_1$ elements.
The result of diode shorts is now determined for the carry logic of Figure 12. In the practical circuitry the parameters \(k_1\) and \(r_1\) are usually derived from voltage sources. The effect of a short is, therefore, somewhat dependent on clipping levels. If \(d_4\) is shorted the effect is the same as obtained for \(d_3\) open. If \(d_3\) is shorted, the result is indeterminate since the circuit will probably continue to function until \(d_2\) or \(d_4\) malfunctions. A short in \(d_2\) renders the succeeding carry elements independent of the previous carry elements and \(c_{i+1}\) becomes a function of only \(r_1\) and \(k_1\) which are orthogonal or mutually exclusive. The same effect is obtained if \(d_1\) has an open.

If \(d_1\) shorts, the carry propagation of the \(i\)th stage in the carry logic is independent of \(r_1\) and is equivalent to \(d_2\) open. The effects of shorted diodes are now summarized.

\[
\begin{align*}
d_{4s} & \quad \text{corresponds to } r_1 = 0 \\
d_{3s} & \quad \text{indeterminate} \\
d_{2s} & \quad \text{corresponds to } k_{i-1} = 1 \\
d_{1s} & \quad \text{corresponds to } r_1 = 1
\end{align*}
\]

Let the probability of a short or an open be identical and neglect the inferred result of a short in \(d_3\). The frequency of occurrence of particular values of \(k_{i-1}\), \(k_1\) and \(r_1\) in correspondence with the diode malfunctions is shown in Table II.
\[
\begin{bmatrix}
p(c_2) & 1 \\
p(c_3) & 1/2 & 1 \\
p(c_4) & 1/4 & 1/2 & 1 \\
p(c_5) & 1/8 & 1/4 & 1/2 & 1 \\
p(c_6) & 1/16 & 1/8 & 1/4 & 1/2 & 1 \\
\vdots & \vdots & \vdots & \vdots & \vdots & \vdots \\
p(c_{m+1}) & 2^{-n+2} & 2^{-n+3} & 2^{-n+4} & 2^{-n+5} & 2^{-n+6} \\
\vdots & \vdots & \vdots & \vdots & \vdots \\
\end{bmatrix}
\]

Figure 11. Probability that \( c_1 = 1 \).
<table>
<thead>
<tr>
<th>Element Value</th>
<th>Frequency</th>
</tr>
</thead>
<tbody>
<tr>
<td>$k_1 = 0$</td>
<td>$1/8$</td>
</tr>
<tr>
<td>$k_{1-1} = 1$</td>
<td>$1/4$</td>
</tr>
<tr>
<td>$r_1 = 0$</td>
<td>$1/4$</td>
</tr>
<tr>
<td>$r_1 = 1$</td>
<td>$1/4$</td>
</tr>
<tr>
<td>no effect</td>
<td>$1/8$</td>
</tr>
</tbody>
</table>

The changes in the carry error structure if $V$ is substituted for $\rho$ are simple. In reference to the diode logic of Figure 12 only one circuit malfunction is sensitive to the replacement of $\rho$ by $V$. If diode $d_4$ is open, $1/8$ of the time an error will not be observed. This is due to the fact that if $c_1 = 1$ and $k_1 = 1$ then $v_1 = 1$ but $r_1 = 0$. The value of $p(c_1) = 1/2$ and $p(k_1) = 1/4$. The probability that the malfunction does not cause an error is the product $p(k_1) \cdot p(c_1)$ and is equal to $1/8$. The result of the substitution of $V$ for $\rho$ is a change in the frequency of occurrence of the $k_1 = 0$ and the "no effect" type of error. The frequency of $k_1 = 0$ is $7/64$ and the frequency of the "no effect" condition is given as $9/64$. 
Figure 12. Carry Logic.

A malfunction in the carry logic does not necessarily cause an error in the element of the carry representation. A malfunction which does produce at least one error is termed effective. We now proceed to calculate the effectiveness of the various logical malfunctions in terms of the \( k_i \) or \( r_i \) constraints. In the following equations \( p(\Delta c_{i+1}) \) denotes the probability that the \( i + 1 \)th element of the carry vector is changed due to a malfunction represented by some constraint.
\( r_1 = 1 \) type of malfunction

\[
p(c_{i+1}) = p(k_1) + p(r_1) p(c_1)
\]

\( r_1 = 1 \) then

\[
p(c_{i+1}) = p(k_1) + p(c_1) - p(k_1) p(c_1)
\]

\( r_1 = 0 \) type of malfunction

\[
p(c_{i+1}) = p(k_1) + p(r_1) p(c_1)
\]

\( r_1 = 0 \) then

\[
p(c_{i+1}) = p(k_1)
\]

\( r_1 = 1 \) type of malfunction

\[
p(c_{i+1}) = p(k_1) + p(r_1) p(c_1)
\]

\( k_1 = 1 \) type of malfunction

\[
p(c_{i+1}) = p(k_1) + p(r_1) p(c_1)
\]

\( k_1 = 0 \) type of malfunction

\[
p(c_{i+1}) = p(r_1) p(c_1)
\]

\( r_1 = 0 \) then

\[
p(c_{i+1}) = p(k_1)
\]

\( r_1 = 1 \) type of malfunction

\[
p(c_{i+1}) = p(k_1) + p(r_1) p(c_1)
\]

\( k_1 = 1 \) type of malfunction

\[
p(c_{i+1}) = p(k_1) = 1/4
\]

\( k_1 = 0 \) type of malfunction

\[
p(c_{i+1}) = p(r_1) p(c_1)
\]

\( r_1 = 0 \) then

\[
p(c_{i+1}) = 1/4
\]
The Nature of the Propagated Error

In the remaining sections of this chapter a number of different logical malfunctions will be considered. All malfunctions of the adder logic may result in a propagated error. We shall at this time investigate the characteristics of the propagated error.

The mechanism for the propagation of an error is independent of the cause of the error. The type of malfunction does determine whether a zero or a one is propagated. Constraints of the type \( r_1 = 1 \) and \( k_1 = 1 \) may initiate erroneous one elements while the constraints \( r_1 = 0 \) and \( k_1 = 0 \) initiate erroneous zero elements in the carry vector representation. An element \( c_{i+2} \) of the carry vector succeeding an erroneous element \( c_{i+1} \) is also in error, if \( r_{i+1} = 1 \) and no carry is initiated by \( k_{i+1} \). If \( r_{i+1} = 1 \) then \( k_{i+1} \) is constrained to zero. Hence, propagation and initiation never occur simultaneously at the same element.

The condition for error propagation is simply \( r_{i+1} = 1 \) and the probability of propagation is one half. The probability of not propagating is also equal to one half. The probability that an error initiated at the \( i+1 \)th element by a \( r_1 \) or \( k_1 \) type malfunction propagates to the \( j \)th element of the carry vector and no further is

\[
p(i+1 \ldots \ldots j) = 2^{-j+1} \quad j \geq i+1
\]

An error in the \( j \)th element of the carry vector caused by a malfunction of the \( k_1 \) or \( r_1 \) type implies an error in all carry elements between and including \( i+1 \)th element and the \( j \)th element. Thus

\[
p(\Delta^* c_j) = p(\Delta c_{i+1}) 2^{-j+1} \quad j \geq i+1
\]
The asterisk is used to indicate that the propagation of the error stopped at the jth element. The probability that the error includes the j element is

\[ p(\Delta c_j) = p(\Delta c_{j+1}) \cdot 2^{-j+1+1} \]

If only effective malfunctions are considered the expected number of carry elements in error is a function of the mechanism of carry propagation and is independent of the frequency of occurrence of effective errors of each type. Then

\[ p(\Delta^* c_j) = 2^{-j+1} \]

Consider an effective malfunction in the ith stage of the carry logic. The carry element \( c_{i+1} \) is then in error. The error propagates only if \( r_{i+1} = 1 \) regardless of whether the erroneous \( c_{i+1} \) element has a value of one or zero. The propagation of the error is halted if \( r_{i+n} = 0 \). This situation is illustrated below. An erroneous carry element is denoted by \( \tilde{c} \) and an erroneous sum element by \( \tilde{s} \).

\[
\begin{array}{ccccccc}
 r_{i+5} & 0 & 1 & 1 & 1 & \tilde{r}_1 \\
 c_{i+5} & \tilde{c}_{i+4} & \tilde{s}_{i+3} & \tilde{s}_{i+2} & \tilde{c}_{i+1} & c_1 \\
 s_{i+5} & \tilde{s}_{i+4} & \tilde{s}_{i+3} & \tilde{s}_{i+2} & \tilde{s}_{i+1} & s_1 \\
\end{array}
\]

The sum elements in question may assume one of two representations where one representation is correct and the other is erroneous. Which is erroneous and which is correct depends on the correct value of the \( c_{i+1} \) element. For our purposes the question of which representation is correct is of no importance. The two representations are
\[ \ldots s_{i+5} 0 1 1 1 s_i \ldots \text{ Type A error} \]
\[ \ldots s_{i+5} 1 0 0 0 s_i \ldots \text{ Type B error} \]

If the error doesn't propagate then \( r_{i+1} = 0 \) and only one element of the sum is changed.

One other situation must be considered. This is the case where the error propagates to the last element of the representation. There exist only certain conditions for which an error may propagate to \( c_{n+1} \), the last element of the sum representation. The element \( c_{n+1} \) is usually ignored if negative numbers are represented in two's complement notation but the \( c_{n+1} \) digit may not be ignored if a numerical check is used. If the negative numbers are represented in one's complement notation, then \( c_{n+1} \) indicates the need of the end around carry associated with the one's complement addition of two negative numbers or with the addition of positive and negative operands which produce a positive sum. The \( n \)th element of the representation is the sign element. An overflow may be detected for either system of negative number representation by an examination of the sign digit of the sum. The details of the problem of an overflow representation are not elaborated at this time but are presented in Appendix I. In regard to the addition of two numbers the following obtains for two's complement arithmetic.
addition of two positive operands

normal \( r_n=0 \)  overflow \( r_n=0 \)
\( c_n=0 \)  \( c_n=1 \)
\( c_{n+1}=0 \)  \( c_{n+1}=0 \)

addition of two negative operands

normal \( r_n=0 \)  overflow \( r_n=0 \)
\( c_n=1 \)  \( c_n=0 \)
\( c_{n+1}=1 \)  \( c_{n+1}=1 \)

addition of a positive and a negative operand

positive sum \( r_n=1 \)  overflow cannot occur
\( c_n=1 \)
\( c_{n+1}=1 \)

negative sum \( r_n=1 \)
\( c_n=0 \)
\( c_{n+1}=1 \)

For one's complement arithmetic the following situations obtain before the end-around carry correction. The addition of two positive operands results in the same value of \( r_n \), \( c_n \) and \( c_{n+1} \) as specified above for two's complement arithmetic.

addition of two negative operands

normal \( r_n=0 \)  overflow \( r_n=0 \)
\( c_n=0 \)  or \( c_n=1 \)
\( d_{n+1}=1 \)  \( c_{n+1}=1 \)
addition of a positive and a negative operand

positive sum
\[ r_n=1 \]
\[ c_n=1 \]
\[ c_{n+1}=1 \]

negative sum
\[ r_n=1 \]
\[ c_n=0 \]
\[ c_{n+1}=0 \]

If an effective malfunction in the carry logic occurs in the nth stage a single carry digit \( c_{n+1} \) will be in error. A malfunction which occurs prior to the nth stage cannot propagate an error to the \( n+1 \) carry element unless \( r_n=1 \). The only time the element \( r_n=1 \) is for the addition of a negative and a positive pair of operands. This is true for both one's complement and two's complement addition. For this special case where the error may propagate to the end of the representation the resulting erroneous representation is either a Type A or Type B error. In short, if the \( c_{n+1} \) element is considered as part of the sum representation it is impossible to propagate an error consisting of all ones or zeros to the end of the representation. This fact is of little significance for the digital check but is of paramount importance for the numerical check.

The Average Length of Carry Propagation

In order to indicate the general range of effect of a propagated error we shall consider here the average length of the propagated
carries resulting from the addition of two m digit binary numbers. The carry sequence is initiated by a unity element of the k vector and is propagated if the successive p vector elements are equal to unity. The average length of carry propagation is defined by the total number of digits involved in the propagation divided by the total number of propagation sequences. Note in particular that a carry initiation not followed by propagation is not counted as either a propagation sequence or as a propagation digit. A carry initiation followed by propagation is counted as a propagating sequence, but the carry initiation digit is not counted as a propagating digit.

The calculation of the average length of carry propagation involves the consideration of various conditional probabilities. The problem is easily described in terms of a flow diagram such as that shown in Figure 13. The flow diagram may be considered as a representation of the process of examination of m augend and addend digit pairs. The examination procedure starts with the lowest order digit and proceeds consecutively toward the higher order digits. State I corresponds to carry initiation. State P corresponds to carry propagation. State 0 corresponds to digit pairs which are not a part of either the carry initiation or propagation process. The square boxes on the flow diagram which have been labeled A and B are counters. The counter A indicates the total number of propagation sequences while the counter B indicates the total number of digit places involved.
in the propagation minus the total number of propagating sequences. Thus the ratio \( \frac{A+B}{A} \) is the average length of carry propagation.

Figure 13. Flow Diagram for the Average Length of Carry Propagation.

The calculation of the average length of carry from the flow diagram is complicated by the fact that the sequence of digit pairs under consideration is finite rather than infinite. It is readily seen for an infinite sequence that the expected value of \( B \) for each unit of \( A \) is one half. Thus for infinite sequences \( \frac{A+B}{A} = \frac{1+0.5}{1} = 1.5 \). One would expect the truncation of the sequence to have very little effect on the value of the average length of carry propagation. This is indeed the case. All possible propagating sequences for \( m \) digit pairs, for \( m \)
equal to 3 and \( m \) equal to 4 were examined and the average propagation length was computed. For \( m \) equal to 3 the average propagation length is 1.33. For \( m \) equal to 4 the average propagation length is 1.43. For computer applications the range of \( m \) is typically 30 to 50 digits. The effects of truncation are certainly negligible and the average length of carry propagation is very nearly 1.5 digits.

The Error Structure of the \( k, \rho \) and \( V \) Generating Logic for Standard Addition.

In the previous sections the effect of malfunctions in the carry logic has been considered in detail. The analysis is now extended to consider the effects of errors in the logic which generate the \( k_1 \) and \( r_1 \) or \( v_1 \) elements. The input to the generating logic is, of course, the addend and augend elements. The generating logic is implemented with a basic half adder circuit. Each stage of the generating logic is independent of every other stage and the dependence of the sum elements is a function of only the carry logic. However, the various outputs of one stage of the generating logic are not necessarily independent. The generating logic may produce \( r_1, v_1 \) and \( k_1 \) and the exact dependence of the \( \rho \), \( k \) and \( V \) elements is a function of the type of half adder circuit employed.

For the moment, the possible dependence of the outputs of the generating logic is ignored and the effect of single \( \rho \) or \( k \) malfunctions on the sum representation is considered. An error in a particular \( k_1 \) element can disturb only the carry vector. Therefore,
the sum error structure resulting from a faulty \( k_i \) element is identical to the sum error structure previously associated with a malfunction in the carry logic of the \( k_i \) type. If the carry logic is a function of \( \rho \) and \( k \) then an error in a \( r_i \) element can yield an erroneous \( \rho \) vector and may also cause an error in the carry vector. An effective \( r_i \) element error results in at least one erroneous sum element \( s_i \) and the error may propagate to element \( i+n \) in precisely the same manner as the error due to a \( r_i \) type of carry logic malfunction. The sum element \( s_i \) is the vector sum of \( r_i \) and \( c_i \) while \( c_i \) is a function of \( r_{i-1} \), \( k_{i-1} \), and \( c_{i-1} \). If \( c_i = 0 \) the error in \( r_i \) affects only \( s_i \). If \( c_i = 1 \) the error affects \( s_i \) and also propagates. The propagation of an erroneous one requires \( r_i = 0 \leftrightarrow r_i = 1 \). The erroneous sum element \( s_i \) is then zero and if error propagation occurs it is of Type B. The propagation of an erroneous zero requires \( r_i = 1 \leftrightarrow r_i = 0 \) and the erroneous sum element \( s_i \) is a one. The propagated error is of Type A. The probability that an \( r_i = 0 \) element type of malfunction is effective is one half. The probability that the \( r_i = 0 \) malfunction changes \( s_{i+1} \) is given by the probability that \( c_i = 1 \), which is approximately equal to one half.

\[
\begin{align*}
p(\Delta s_i) &= p(r_i) = 1/2 \\
p(\Delta^* s_i) &= p(r_i) p(\overline{c_i}) = 1/4 \\
p(\Delta s_{i+1}) &= p(r_i) p(c_i) = 1/4 \\
p(\Delta^* s_{i+1}) &= p(r_i) p(c_i) p(\overline{r_{i+1}}) = 1/8
\end{align*}
\]
The following probabilities are obtained for an element malfunction of the type \( r_1 = 1 \).

\[
\begin{align*}
p(\Delta s_1) &= p(\overline{r}_1) = 1/2 \\
p(\Delta^* s_1) &= p(\overline{r}_1) p(\overline{c}_1) + p(c_1) p(k_1) \\
&= 1/2 \left[ 1/2 + 1/2 \left( 1/4 \right) \right] = 5/16 \\
p(\Delta s_{1+1}) &= p(\overline{r}_1) p(c_1) p(k_1) = 3/16 \\
p(\Delta^* s_{1+1}) &= p(\Delta s_{1+1}) p(\overline{r}_{1+1}) = 3/32
\end{align*}
\]

There exist half adder circuits which yield independent \( r \) and \( k \) elements. However, typical of the half adder circuits used to generate the \( r_1, v_1 \) and \( k_1 \) element is the circuit shown in Figure 6. For this circuit, errors in the \( v_1 \) and \( k_1 \) elements are independent of the \( r_1 \) element but the \( r_1 \) element is dependent on both \( v_1 \) and \( k_1 \). If \( v_1 \) is erroneous then \( r_1 \) is erroneous unless \( k_1 = 1 \). The probability that \( k_1 = 1 \) is 1/4 so 3/4 of the time an error in \( v_1 \) will also give rise to a \( r_1 \) element error. Similarly, if \( k_1 \) is in error \( r_1 \) will be in error unless \( v_1 = 0 \). \( p(\overline{v}_1) = 1/4 \). The error in \( k_1 \) affects the \( r_1 \) element three fourths of the time. The constraining relationships associated with the particular half adder under discussion are described by the following relationships.

1. \( k_1 = 1 \Rightarrow r_1 = 0 \) and \( c_{1+1} = 1 \)
2. \( k_1 = 0 \Rightarrow r_1 = v_1 \)
3. \( v_1 = 1 \Rightarrow r_1 = \overline{k}_1 \)
4. \( v_1 = 0 \Rightarrow r_1 = 0 \)
We consider first an adder which employs only $p$ and $k$ elements. The form of the sum errors resulting from an element error $k_1 = 1$ which constrains $r_1$ to a value of zero is now determined. An error occurs if the correct $r_1$ element has a value of one, $p(r_1) = 1/2$. If $c_1 = 1$, $p(c_1) \leq 1/2$, the result is an erroneous sum representation in which only one element $s_1$ is in error. For this situation the $k_1 = 1$ constraint initiates the proper carry element at $c_{i+1}$ and hence corrects the error in the carry vector due to $r_1 = 0$. If $c_1 = 0$, $p(\bar{c}_1) = 1/2$, then an error $s_1 = 0$ occurs and a faulty carry sequence is initiated at $c_{i+1}$. Error propagation, if present, is Type B. If the correct value of $r_1$ is zero the effect of a type $k_1 = 1$ malfunction is the same as that associated with a $k_1 = 1$ malfunction for the carry.

The following probabilities are associated with an element generating malfunction of the type $k_1 = 1$ which constrains $r_1$ to a value of zero.

\begin{align*}
 p(\Delta s_1) &= p(r_1) = 1/2 \\
 p(\bar{\Delta} s_1) &= p(r_1) p(c_1) \leq 1/4 \\
 p(\Delta s_{i+1}) &= 1 - p(c_{i+1}) \leq 1/2 \\
 p(\bar{\Delta} s_{i+1}) &\geq 1/4
\end{align*}

The malfunction of type $k_1 = 0 \Rightarrow r_1 = v_1$ will result in an error only when the correct value for $k_1$ is unity. $r_1$ will then have an erroneous value of unity since $k_1 \supseteq v_1$. The carry $c_{i+1}$ which is required will not be initiated unless $c_1 = 1$. So if $c_1 = 1$ only one sum element, $s_1$, is in error. If $c_1 = 0$ then $s_1$ is erroneous with a value of one and the required carry is not initiated at $c_{i+1}$. Error propagation if present, is Type A.
The error probabilities associated with the $k_1 = 0 \Rightarrow r_1 = v_1$ type of constraint are

\begin{align*}
    p(\Delta s_{i+1}) &= p(k_1) = 1/4 \\
    p(\Delta s_{i+1}) &= p(k_1) p(c_1) = 1/8 \\
    p(\Delta s_{i+1}) &= p(k_1) p(c_1) = 1/8 \\
    p(\Delta s_{i+1}) &\approx 1/16
\end{align*}

The form of the errors resulting from a malfunction of the type $v_1 = 1 \Rightarrow r_1 = \overline{k_1}$ may be described in terms of a single $r_1 = 1$ element error though the probabilities associated with the error are changed. The constraint results in an error whenever the correct value of both $r_1$ and $k_1$ is zero.

\begin{align*}
    p(\Delta s_{i+1}) &= p(r_1) p(\overline{k_1}) = 1/4 \\
    p(\Delta s_{i+1}) &= p(r_1) p(\overline{k_1}) p(c_1) = 1/8 \\
    p(\Delta s_{i+1}) &\approx 1/8 \\
    p(\Delta s_{i+1}) &\approx 1/16
\end{align*}

The errors due to the constraint $v_1 = 0 \Rightarrow r_1 = 0$ are identical with the errors resulting from a generating element malfunction of the type $r_1 = 0$.

The preceding discussion of constraints has been in terms of an adder logic employing only $k$ and $\rho$ elements. However, if the four constraining relationships are considered in terms of an adder employing $k$, $\rho$ and $v$ elements the same error structure will be obtained.
One aspect of the structure of the errors associated with the generating logic must yet be investigated. This is the error structure due to malfunctions in the input logic of the generating circuits. The input logic is identified in Figure 10 as gates \( g_1 \) and \( g_2 \). The input logic is symmetrical with respect to the input addend and augend elements \( a_1 \) and \( b_1 \). It is therefore sufficient to consider the results of constraints applied to only one of the variables. The results obtained apply equally well to the other variable. The input logic in terms of the addend and augend elements is described by the following equations:

\[
(a_1 \overline{b}_1) \ (\overline{a}_1 b_1) = r_1 \\
\overline{a}_1 b_1 = k_1
\]

If \( a_1 \) is constrained to one then

\[
r_1 = \overline{b}_1 \\
k_1 = b_1
\]

and if \( a_1 \) is constrained to zero

\[
r_1 = b_1 \\
k_1 = 0
\]

The malfunction of type \( a_1 = 1 \) is effective only when the correct value of the element \( a_1 \) is zero. Two different types of error result. The type of error is determined by the value of \( b_1 \). If \( b_1 = 0 \) then the correct value of \( k_1 \) is obtained but the result \( r_1 = 1 \) is in error. The resulting error in the sum representation consists
of only one altered element if \( c_1 = 0 \). If \( c_1 = 1 \) the error may propagate. If propagation occurs the error is Type B. The probabilities of the error process are

\[
\begin{align*}
    p(\Delta s_1) &= p(\overline{a_1}) \ p(\overline{b_1}) = \frac{1}{4} \\
    p(\Delta^* s_1) &= p(\Delta s_1) \ p(c_1) \approx \frac{1}{8} \\
    p(\Delta s_{i+1}) &= p(\Delta s_i) \ p(c_1) \approx \frac{1}{8} \\
    p(\Delta^* s_{i+1}) &= p(\Delta s_{i+1}) \ p(\overline{r_{i+1}}) \approx \frac{1}{16}
\end{align*}
\]

If \( b_1 = 1 \) then both \( k_1 \) and \( r_1 \) assume erroneous values, \( k_1 = 1 \) and \( r_1 = 0 \). When \( c_1 = 1 \) only one element \( s_1 \) is changed. When \( c_1 = 0 \) the malfunction may propagate an error of Type B. The probabilities associated with the error process are

\[
\begin{align*}
    p(\Delta s_1) &= p(\overline{a_1}) \ p(b_1) = \frac{1}{4} \\
    p(\Delta^* s_1) &= p(\Delta s_1) \ p(c_1) \approx \frac{1}{8} \\
    p(\Delta s_{i+1}) &= p(\Delta s_i) \ p(c_1) \approx \frac{1}{8} \\
    p(\Delta^* s_{i+1}) &= p(\Delta s_{i+1}) \ p(\overline{r_{i+1}}) \approx \frac{1}{16}
\end{align*}
\]

The malfunction of the type \( a_1 = 0 \) is effective only when the correct value of \( a_1 \) is one. If at the time of the constraint the value of \( b_1 \) is zero the resulting error is the generating element type \( r_1 = 0 \). If \( b_1 = 1 \) both the \( k_1 \) and the \( r_1 \) outputs will be in error. The erroneous outputs are \( k_1 = 0 \) and \( r_1 = 1 \). The resulting error may be described completely in terms of the constraint \( k_1 = 0 \Rightarrow r_1 = 1 \) which has already been discussed.
The Error Structure of the rc Sum Logic

In reference to Figure 10 the rc sum generating logic consists of gates $g_6$, $g_7$ and $g_8$. The error structure of the rc sum logic is perfectly straightforward if the carry logic and the rc sum logic are separated as shown in Figure 10. For this configuration a malfunction in the rc sum logic may cause at most one error in the sum representation. The propagation of errors is determined only by the carry logic and this precedes the rc sum logic.

In the usual logic of the binary adder the rc sum logic and the carry logic are not independent. In Figure 10 both gates $g_5$ and $g_6$ generate the conjunctive function needed for the carry logic. Thus $g_5$ may be eliminated and $g_6$ is then part of both the carry logic and the rc sum logic. Malfunctions in $g_6$ may affect both the carry vector and the rc sum process. The effect of a malfunction of the gate $g_6$ on the carry vector is completely described in terms of the results of carry logic malfunctions. We shall now consider the possible carry vector errors due to malfunctions of the components of gate $g_6$. The carry vector error and the rc sum process error combined determine the sum error.

The diodes associated with gate $g_6$ are diodes $d_1$ and $d_2$ shown in Figure 12. In a previous section the following correspondences were obtained between diode malfunctions and representative constraints of the carry logic inputs
\[ \begin{align*}
\text{d}_{18} & \Rightarrow r_1 = 1 \\
\text{d}_{2s} & \Rightarrow c_1 = 1 \\
\text{d}_{10} & \Rightarrow c_1 = 1 \\
\text{d}_{20} & \Rightarrow r_1 = 0 \\
\end{align*} \]

The malfunction represented by \( r_1 = 1 \) applied only to gate \( g_6 \) is effective only if \( r_1 = 0 \) and \( c_1 = 1 \). The resulting sum error is Type B. If \( c_1 = 0 \) no error is produced.

\[ 
\begin{align*}
p(\Delta s_1) &= p(\overline{r_1}) \ p(c_1) \cong 1/4 \\
p(\Delta^* s_1) &= 0 \\
p(\Delta s_{i+1}) &= p(\Delta s_i) \cong 1/4 \\
p(\Delta^* s_{i+1}) &= p(\Delta s_i) \cong 1/8
\end{align*} \]

Malfunctions corresponding to the constraint \( c_1 = 1 \) result in one erroneous sum element \( s_1 \) if \( r_1 = c_1 = 0 \). If \( c_1 = 0 \) and \( r_1 = 1 \) a Type B error is produced by the constraint \( c_1 = 1 \).

\[ 
\begin{align*}
p(\Delta s_1) &= p(r_1) \ p(\overline{c_1}) \cong 1/4 \\
p(\Delta^* s_1) &= 0 \\
p(\Delta s_{i+1}) &= p(\Delta s_i) \cong 1/4 \\
p(\Delta^* s_{i+1}) &= p(\Delta s_i) \ p(\overline{r_{i+1}}) \cong 1/8
\end{align*} \]

The malfunction represented by the constraint \( r_1 = 0 \) produces an error only when \( r_1 = 1 \) and \( c_1 = 1 \). The resulting propagated error is Type A.
The Effectiveness of Parity Checking

The effectiveness of the digital and numerical parity checks is found by considering the probability that a logical malfunction will cause an undetectable change in the sum representation. The effectiveness of a checking procedure can best be evaluated in terms of effective malfunctions. In this case only the form and the propagation of errors are considered and the rate of initiation of errors is ignored.

A summary of the characteristics of the logical malfunctions associated with the logic of a binary adder is given in Table III. The table was compiled from the information derived in the previous sections. The information needed to study the effectiveness of the various parity checks is the normalized error distribution. The error distribution $\Delta^*s_{i+n}$ is the probability that the sum elements $s_i$ to $s_{i+n}$ have been modified due to some effective error. For our purposes the important aspect of the error distribution is the ratio of the various $\Delta^*s_{i+n}$ values. It is convenient to normalize the error distribution so that $\sum_{n=0}^{t} \Delta^*s_{i+n} = 1$. Then $\Delta^*s_{i}$ is the probability that the malfunction produces an error in only one element. $\Delta^*s_{i+1}$ is the probability that the resulting error disturbs two and only two successive elements of the sum. $\Delta^*s_{i+2}$ corresponds to only three successive errors, etc. Four different error distributions have been found. The different distributions shown in Table IV are designated E.D. 1, E.D. 2, E.D. 3 and E.D. 4.

In the calculations which follow the error distribution series are considered infinite in length. The approximation is valid since the
<table>
<thead>
<tr>
<th>Constraint</th>
<th>Other Conditions</th>
<th>Error Type</th>
<th>Error Distribution</th>
</tr>
</thead>
<tbody>
<tr>
<td>GEM $v_1 = 0$ or CIM $r_1 = 0$</td>
<td></td>
<td>A</td>
<td>1</td>
</tr>
<tr>
<td>GEM $v_1 = 1$ or CIM $r_1 = 1$</td>
<td></td>
<td>B</td>
<td>1</td>
</tr>
<tr>
<td>GEM or CIM $k_1 = 0$</td>
<td></td>
<td>A</td>
<td>1</td>
</tr>
<tr>
<td>GEM or CIM $k_1 = 1$</td>
<td></td>
<td>B</td>
<td>1</td>
</tr>
<tr>
<td>GEM $r_1 = 0$ or $c_1 = 1$, $k_1 = 0$</td>
<td></td>
<td>A and $\bar{s}_1 = 1$</td>
<td>1</td>
</tr>
<tr>
<td>GEM $v_4 = 0 \Rightarrow r_1 = 0$, $c_1 = 0$</td>
<td></td>
<td>$\bar{s}_1 = 0$</td>
<td></td>
</tr>
<tr>
<td>GEM $r_1 = 1$ $c_1 = 1$, $k_1 = 0$</td>
<td></td>
<td>B and $\bar{s}_1 = 0$</td>
<td>2</td>
</tr>
<tr>
<td>GEM $r_1 = 1$ $c_1 = 1$, $k_1 = 1$</td>
<td></td>
<td>$\bar{s}_1 = 0$</td>
<td></td>
</tr>
<tr>
<td>GEM $k_1 = 1 \Rightarrow r_1 = 0$ $r_1 = c_1 = 1$</td>
<td>$s_1 = 1$</td>
<td></td>
<td></td>
</tr>
<tr>
<td>GEM $k_1 = 0 \Rightarrow r_1 = v_4$ $k_1 = c_1 = 1$</td>
<td>$s_1 = 0$</td>
<td></td>
<td></td>
</tr>
<tr>
<td>GEM $v_4 = 1 \Rightarrow r_1 = \bar{v}_4$ $k_1 = c_1 = 0$</td>
<td>$s_1 = 1$</td>
<td></td>
<td></td>
</tr>
<tr>
<td>GEM $a_1 = 1$ $c_1 = 0$</td>
<td>$s_1 = 1$</td>
<td></td>
<td></td>
</tr>
<tr>
<td>GEM $a_1 = 0$ $b_1 = 0$</td>
<td>same as GEM $r_1 = 0$ with $k_1 = 0$</td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>GEM $b_1 = 1$</td>
<td>same as GEM $k_1 = 0 \Rightarrow r_4 = v$</td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>CGM $r_1 = 1$ $c_1 = 1$</td>
<td>B and $\bar{s}_1 = 0$</td>
<td></td>
<td>3</td>
</tr>
<tr>
<td>CGM $r_1 = 0$ $c_1 = 1$</td>
<td>A and $s_1 = 1$</td>
<td></td>
<td>3</td>
</tr>
</tbody>
</table>

GEM generating element malfunction
CIM carry logic malfunction
CGM common gate malfunction
### TABLE IV

**ERROR TYPE**

<table>
<thead>
<tr>
<th></th>
<th>$s_{i+n}$</th>
<th>$s_{i+n-1} \ldots s_{i+2}$</th>
<th>$s_{i+1}$</th>
<th>- sum element</th>
</tr>
</thead>
<tbody>
<tr>
<td>A</td>
<td>0</td>
<td>1</td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>B</td>
<td>1</td>
<td>0</td>
<td></td>
<td>0</td>
</tr>
</tbody>
</table>

### TABLE V

**ERROR DISTRIBUTION (NORMALIZED)**

<table>
<thead>
<tr>
<th></th>
<th>$\Delta * s_{i}$</th>
<th>$\Delta * s_{i+1}$</th>
<th>$\Delta * s_{i+2}$</th>
<th>$\Delta * s_{i+3}$</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>1/2</td>
<td>1/4</td>
<td>1/8</td>
<td>1/16</td>
</tr>
<tr>
<td>2</td>
<td>5/8</td>
<td>3/16</td>
<td>3/32</td>
<td>3/64</td>
</tr>
<tr>
<td>3</td>
<td>0</td>
<td>1/2</td>
<td>1/4</td>
<td>1/8</td>
</tr>
<tr>
<td>4</td>
<td>1/3</td>
<td>1/3</td>
<td>1/6</td>
<td>1/12</td>
</tr>
</tbody>
</table>
number of elements usually involved in computer arithmetic operations is thirty to forty bits. The fact that the series is not truncated will affect most severely the error distributions resulting from the initiation of errors in the first few high order elements. The number of elements so affected is small compared to the total number of elements involved.

The effectiveness of a simple digital parity check employing one check digit is dependent on the probability that the error in the sum representation involves an odd or an even number of elements. If the number of erroneous sum digits is even the check fails. The check is successful if the number of erroneous sum digits is odd. The probability that the check succeeds \( p(S) \) or fails \( p(F) \) for each of the four error distributions is

\[
\begin{align*}
p_1(S) &= \frac{1}{2} + \frac{1}{8} + \frac{1}{32} + \ldots = \frac{2}{3} \\
p_1(F) &= \frac{1}{4} + \frac{1}{16} + \frac{1}{64} + \ldots = \frac{1}{3} \\
p_2(S) &= \frac{5}{8} + \frac{3}{32} + \frac{3}{128} + \ldots = \frac{3}{4} \\
p_2(F) &= \frac{3}{16} + \frac{3}{64} + \frac{3}{256} + \ldots = \frac{1}{4} \\
p_3(S) &= \frac{1}{4} + \frac{1}{16} + \frac{1}{64} + \ldots = \frac{1}{3} \\
p_3(F) &= \frac{1}{2} + \frac{1}{8} + \frac{1}{32} + \ldots = \frac{2}{3} \\
p_4(S) &= \frac{1}{3} + \frac{1}{16} + \frac{1}{24} + \ldots = \frac{5}{9} \\
p_4(F) &= \frac{1}{3} + \frac{1}{12} + \frac{1}{48} + \ldots = \frac{4}{9}
\end{align*}
\]

The effectiveness of a digital parity check consisting of two digits is now determined. The check is constructed so that elements in odd positions are incorporated in one check and the elements in the even positions in the other check. The check is judged successful if as a result of an effective error either check fails.
\[ p_1(S) = \frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{32} + \frac{1}{64} + \ldots = \frac{14}{15} \approx 0.92 \]

\[ p_2(F) = \frac{1}{16} + \frac{1}{256} + \frac{1}{4096} + \ldots = \frac{1}{15} \]

\[ p_2(S) = \frac{5}{8} + \frac{3}{16} + \frac{3}{32} + \frac{3}{128} + \frac{3}{256} + \frac{3}{512} + \ldots = \frac{19}{20} = 0.95 \]

\[ p_2(F) = \frac{3}{64} + \frac{3}{1024} + \ldots = \frac{1}{20} \]

\[ p_3(S) = 0 + \frac{1}{2} + \frac{1}{4} + \frac{1}{16} + \frac{1}{32} + \frac{1}{64} + \ldots = \frac{13}{15} \approx 0.87 \]

\[ p_3(F) = \frac{1}{8} + \frac{1}{128} + \ldots = \frac{2}{15} \]

\[ p_4(S) = \frac{1}{3} + \frac{1}{3} + \frac{1}{6} + \ldots = \frac{41}{45} \approx 0.91 \]

\[ p_4(F) = \frac{1}{12} + \frac{1}{192} + \ldots = \frac{4}{45} \]

The effectiveness of any simple digital parity check constructed as above and consisting of \( n \) check digits may be found by evaluating a recurring binary fraction consisting of repeated groups of \( 2^{2n-1} \) one digits and one zero digit. The magnitude of the non-terminated fraction is \( p_1(S) \). \( p_1(S) \) for a representation of the type

\[ \ldots ll \ldots 10\ldots \]

is

\[ p_1(S) = \frac{2^{2n-2}}{2^{2n-1}} \text{ for } n \text{ check digits} \]

\( p_2(S), p_3(S) \text{ and } p_4(S) \) are obtainable if \( p_1(S) \) is known.

\[ p_2(S) = \frac{1}{4} + \frac{3}{4} p_1(S) \]

\[ p_3(S) = 2 p_1(S) - 1 \]

\[ p_4(S) = \frac{4}{3} p_1(S) - \frac{1}{4} \]

Numerical checking requires the residue of the sum representation with respect to the check base. Check bases of magnitude \( 2^N - 1, n > 0 \), are particularly appropriate for binary representations.
The process of numerical checking requires the assignment of a check weight associated with each element of a representation. The check weight is the least positive residue of the normal element weight with respect to the check base. The following examples illustrate the weighting scheme for several possible check bases.

\[ \ldots 2 \ 1 \ 2 \ 1 \ 2 \ 1 \ m = 3 \]
\[ \ldots 4 \ 2 \ 1 \ 4 \ 2 \ 1 \ m = 7 \]
\[ \ldots 2 \ 1 \ 6 \ 4 \ 2 \ 1 \ m = 15 \]

It is observed that an error in a single digit is detectable by any numerical check. Furthermore, it has been shown that the errors due to logical malfunctions which change more than one sum element are restricted in form. The permitted error forms consist of all zeros except the last erroneous sum digit which is a one, Type B, or an erroneous representation consisting of all ones except the last digit which is a zero, Type A. The permitted error forms are always detectable by a numerical check with a check base of \(2^{N-1}\).

Consider the following example which employs a modulo 3 numerical check.

\[ S = 11 \ 00 \ 01 - 10 \ 11 \ 10 = x-y = x+y' \]

\[
\begin{array}{c}
0 \ 11 \ 00 \ 01 \times 1 \text{ parity} \\
1 \ 01 \ 00 \ 10 \ y' 3 \text{ digit} \\
\hline
1 \ 10 \ 00 \ 11 \ p \\
11 \ 10 \ 00 \ 0 \ c \\
10 \ 00 \ 00 \ 11 \ S 2
\end{array}
\]

Consider now an error in the carry logic such that the error propagates to \(c_{n+1}\), for example, an effective malfunction which changes \(c_n\).
It follows then that \( c_{n+1} \) is also changed since in this example \( r_n = 1 \). The erroneous carry representation is 00 10 00 0 and the erroneous sum is 01 00 00 11. The modulo 3 parity check bit associated with the erroneous sum is clearly one and the error is detected.

As a result of the various constraints which exist in the process of binary addition the simple modulo 3 numerical parity check is 100% effective if the \( c_{n+1} \) element of the sum is considered. The check requires only two bits. For operations consisting of repetitive addition such as multiplication or subtraction, the numerical parity check is 100% effective only if applied at each addition step. It is possible to obtain an accumulated error due to several addition operations which is undetectable by the numerical check.
CHAPTER V
THE RESIDUE CODE

An Extension of Numerical Checking

If a sufficient number of numerical parity checks are employed in a number system the parity check representation may ultimately contain as much information as that possessed by the number system being checked. If this is the case it then is possible to replace the original number representation by the check representation. The extended check representation is termed a residue number system. We shall now proceed to study the construction and properties of the residue number system.

The first requirement for a residue number system is that the bases of the different elements of the representation must be relatively prime. If the residue number system is considered as an evolution of the numerical checking procedures the above statement may be reinterpreted as a requirement that the different check bases be relatively prime to each other. If a pair of bases are not relatively prime the effect is the introduction of redundancy. The following example will illustrate this fact. Contrast the combination of a modulo 2 and a modulo 6 check against the combination of a modulo 3 and a modulo 4 check. In the first case the two check bases are not relatively prime while in the second case the check bases are relatively prime. The combination of a modulo 2 and a modulo 6 check is unique for only 6 states while the combination of the modulo 3 and modulo 4 check provides a unique residue representation for 12 states. This is further clarified by Table VI.
TABLE VI

REDUNDANCY OF A NON-RELATIVELY PRIMED BASE REPRESENTATION

<table>
<thead>
<tr>
<th>Number</th>
<th>Mod 2</th>
<th>Mod 6</th>
<th>Mod 3</th>
<th>Mod 4</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>0</td>
<td>2</td>
<td>2</td>
<td>2</td>
</tr>
<tr>
<td>3</td>
<td>1</td>
<td>3</td>
<td>0</td>
<td>3</td>
</tr>
<tr>
<td>4</td>
<td>0</td>
<td>4</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>5</td>
<td>1</td>
<td>5</td>
<td>2</td>
<td>1</td>
</tr>
<tr>
<td>6</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>2</td>
</tr>
<tr>
<td>7</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>3</td>
</tr>
<tr>
<td>8</td>
<td>0</td>
<td>2</td>
<td>2</td>
<td>0</td>
</tr>
<tr>
<td>9</td>
<td>1</td>
<td>3</td>
<td>0</td>
<td>1</td>
</tr>
<tr>
<td>10</td>
<td>0</td>
<td>4</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>11</td>
<td>1</td>
<td>5</td>
<td>2</td>
<td>3</td>
</tr>
<tr>
<td>12</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>13</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>14</td>
<td>0</td>
<td>2</td>
<td>2</td>
<td>2</td>
</tr>
</tbody>
</table>
If the bases of the residue number system are relatively prime then the number of states uniquely represented without redundancy is equal to the product of the magnitude of the bases. The most efficient residue number system representation is obtained by a consecutive sequence of prime numbers starting with the integer two. Since only a relatively prime relationship is desired between the bases it would be possible to employ one base equal to a non-prime integer as long as this base is relatively prime to the other bases. However, there are certain advantages which are associated with the prime number base but are not associated with the non-prime base. In particular, the residues with respect to a non-prime base form a ring, while the residues with respect to a prime base form a field. The characteristics of rings and fields which are important to the present discussion are the following: Two operations, multiplication and addition are defined for both rings and fields. Also an additive inverse exists for both rings and fields. This means that subtraction is possible as well as the defined operations of addition and multiplication. The ring does not require a multiplicative inverse. The field must contain a unique multiplicative inverse for every element. Furthermore, the field admits the solution to linear algebraic equations.

The residue number system differs from the standard consistently weighted number systems in many ways. The most important difference is that the residue number system, being based upon a number of
fields or rings, has two defined operations. It will be recalled that arithmetic operations in the consistently based number system usually employed in the digital computer are based upon the definition of one operation, that operation being addition. The other difference is that the residue representation possesses digital independence of the elements comprising the representation. There exists no carry to constrain the elements. The absence of a carry plus the fact that multiplication is defined gives rise to the expectation that the residue number system should be amenable to simple error detecting and correcting schemes. It should also be possible to execute rapid addition and multiplication.

An example of a residue number system is presented in Table VII. The number system shown in Table VII uses the prime bases 2, 3, 5 and 7. The number system therefore contains 210 states. The 210 states may correspond to the positive integers 0 to 209. Table VII shows the residue number representation corresponding to the positive integers 0 to 29. Additional integers of the number system may be found by

TABLE VII

<table>
<thead>
<tr>
<th>NATURAL NUMBERS AND CORRESPONDING RESIDUE NUMBERS</th>
</tr>
</thead>
<tbody>
<tr>
<td>N.N.</td>
</tr>
<tr>
<td>-------</td>
</tr>
<tr>
<td>0</td>
</tr>
<tr>
<td>1</td>
</tr>
<tr>
<td>2</td>
</tr>
<tr>
<td>3</td>
</tr>
<tr>
<td>4</td>
</tr>
<tr>
<td>5</td>
</tr>
<tr>
<td>6</td>
</tr>
<tr>
<td>7</td>
</tr>
<tr>
<td>8</td>
</tr>
<tr>
<td>9</td>
</tr>
</tbody>
</table>
congruence operations. Let \( a, b, c \) and \( d \) be the digits associated with the bases 2, 3, 5 and 7 respectively. The following congruences define \( a, b, c \) and \( d \) for the residue representation of the number \( N \):

\[
N \equiv a \mod 2 \\
N \equiv b \mod 3 \\
N \equiv c \mod 5 \\
N \equiv d \mod 7
\]

The residue number system is readily extended to include more states. For example, if a base11 is added to the representation it is then possible to represent 2310 states. Table VIII shows the product and sum of the first ten consecutive primes greater than or equal to 2. The product of the primes indicates the number of states of the number system while the sum of the primes is a measure of the size of the representation in terms of digits. Table VIII also includes the number of bits required to represent each prime base in the binary number system.

**TABLE VIII**

NUMBER OF STATES AND DIGITS ASSOCIATED WITH A RESIDUE REPRESENTATION

<table>
<thead>
<tr>
<th>( i )</th>
<th>( p_1 )</th>
<th>( \sum_{i=1}^n p_1 )</th>
<th>( \prod_{i=1}^n p_1 )</th>
<th>( p_1 ) bits</th>
<th>( \sum p_1 ) bits</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>2</td>
<td>2</td>
<td>2</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>5</td>
<td>6</td>
<td>2</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>5</td>
<td>10</td>
<td>30</td>
<td>3</td>
<td>6</td>
</tr>
<tr>
<td>4</td>
<td>7</td>
<td>17</td>
<td>210</td>
<td>3</td>
<td>9</td>
</tr>
<tr>
<td>5</td>
<td>11</td>
<td>28</td>
<td>2,310</td>
<td>4</td>
<td>13</td>
</tr>
<tr>
<td>6</td>
<td>13</td>
<td>41</td>
<td>30,030</td>
<td>4</td>
<td>17</td>
</tr>
<tr>
<td>7</td>
<td>17</td>
<td>58</td>
<td>510,510</td>
<td>5</td>
<td>22</td>
</tr>
<tr>
<td>8</td>
<td>19</td>
<td>77</td>
<td>9,699,699</td>
<td>5</td>
<td>27</td>
</tr>
<tr>
<td>9</td>
<td>23</td>
<td>103</td>
<td>223,092,670</td>
<td>5</td>
<td>32</td>
</tr>
</tbody>
</table>
Residue Addition and Multiplication

The residue number representation consists of several elements and is assumed to be in one to one correspondence with some positive integers of the real number system. The elements of the residue representation are the least positive residues of these real positive integers with respect to the different moduli which form the bases of the residue representation. It follows as a direct consequence of the structure of the residue number system and the properties of linear congruences that the operations of addition and multiplication are valid in the residue number system subject to one proviso. The proviso is that the residue system must possess a number of states sufficient to represent the generated sum or product. If the residue number system does not have a sufficient number of states to represent the sums and the products generated by a particular finite set of real integers then the residue system will overflow and more than one sum or product of the real number system may correspond to one residue representation.

For a residue number with a sufficient number of states an isomorphic relation exists with respect to the operations of addition and multiplication in the residue system and a finite system of real positive integers.

Each element of the residue number system is obtained with respect to a different base or modulus. It follows therefore, that the rules of arithmetic associated with each element will be different. For example, the addition and multiplication of the elements associated with the moduli 2 and 3 follow rules specified in Table IX.
- Table IX -

<table>
<thead>
<tr>
<th>MOD 2 AND MOD 3 SUMS AND PRODUCTS</th>
</tr>
</thead>
<tbody>
<tr>
<td>$\emptyset$</td>
</tr>
<tr>
<td>0</td>
</tr>
<tr>
<td>1</td>
</tr>
<tr>
<td>sum Mod 2</td>
</tr>
<tr>
<td>0</td>
</tr>
<tr>
<td>0</td>
</tr>
<tr>
<td>1</td>
</tr>
<tr>
<td>1</td>
</tr>
<tr>
<td>2</td>
</tr>
<tr>
<td>2</td>
</tr>
<tr>
<td>sum Mod 3</td>
</tr>
<tr>
<td>0</td>
</tr>
<tr>
<td>0</td>
</tr>
<tr>
<td>1</td>
</tr>
</tbody>
</table>

No carry tables are necessary since the residue number system does not have a carry mechanism. Addition of two residue representations is effected by the modulo addition of corresponding elements of the two representations. Corresponding elements must have the same base or modulus. Modulo addition of elements which have different bases is not defined. Multiplication in the residue system is effected by obtaining the modulo product of corresponding elements. The operations of addition and multiplication of two residue numbers are indicated by the following notation:

$$S = A \oplus B$$

$$p = A \cdot B$$

Consider a residue number representation with base 2, 3, 5 and 7. We assume an isomorphic relation between the residue number system
and the real positive numbers 0 to 209. An isomorphic relation then exists for the operations of multiplication and addition only if the product or sum is less than 209. The following examples employing residue numbers illustrate the addition and multiplication operations and the presence of an isomorphism or the lack of isomorphism in the case of overflow. Residue numbers will be distinguished by the use of parentheses.

\[ 29 + 27 = 56 \]

\[ 29 \leftrightarrow (1 2 4 1) \]
\[ 27 \leftrightarrow (1 0 2 6) \]
\[ 56 \leftrightarrow (0 2 1 0) \]

\[ (1 2 4 1) \]
\[ \oplus (1 0 2 6) \]
\[ (0 2 1 0) \]

The following operations are considered in performing the addition of the two residue representations.

\[ 1 + 1 \equiv 0 \mod 2 \]
\[ 2 + 0 \equiv 2 \mod 3 \]
\[ 4 + 2 \equiv 1 \mod 5 \]
\[ 1 + 6 \equiv 0 \mod 7 \]

Consider the addition of two numbers which produce a sum greater than 209.

\[ S = 100 + 200 \]

\[ (0 1 0 2) \]
\[ \oplus (0 2 0 4) \]
\[ (0 0 0 6) \]
The residue representation (0006) corresponds to the real positive number 90. In this particular example the sum has overflowed the residue representation. The resulting sum is the correct sum modulo 210.

\[ 300 \equiv 90 \mod 210 \]

Finite real number systems and residue number systems have the same overflow characteristics. The sum which remains after the overflow is the correct sum with respect to a modulus numerically equal to the number of states in the finite number system.

The following is presented as an example of the process of residue multiplication.

\[ p = 10 \times 17 = 170 \]

10 ⟷ (0 1 0 3)
17 ⟷ (1 2 2 3)
170 ⟷ (0 2 0 2)

0 (0 1 0 3)
(1 2 2 3)
(0 2 0 2)

The process of multiplication involved consideration of the following relations for each element.

\[ 1 \times 0 \equiv 0 \mod 2 \]
\[ 1 \times 2 \equiv 2 \mod 3 \]
\[ 0 \times 2 \equiv 0 \mod 5 \]
\[ 3 \times 3 \equiv 2 \mod 7 \]
An overflow resulting from a multiplication is no different than the overflow resulting from an addition. Consider the product obtained from the residue multiplication of the real numbers 10 and 100. The result in the modulo 210 number system corresponds to the real number 160 which is 1000 modulo 210.

Subtraction and the Representation of Negative Numbers

The process of subtraction is obtainable in the residue number system by employing a complement representation consisting of the additive inverses of the positive residue representation. Since the elements of the residue representation are elements of a field, the additive inverse always exists. There is no basic problem associated with the subtraction operation. There is, however, a problem associated with the representation of negative numbers. In particular, some mechanism must be included in the number system which will distinguish positive and negative differences. This problem will be discussed in detail in a later section.

The additive inverse of a residue number is defined by the following:

\[ a \oplus a' = 0 \]

The formula may be considered to apply to an element of the residue system or equally well to the whole residue representation. Consider the following examples with reference to the modulo 210 residue number system.

\[ a = (1 2 4 1) \]
then \[ a' = (1 1 1 6) \]
since \[ \oplus (1 1 1 6) \]
\[ (0 0 0 0) \]
The following examples have been chosen to illustrate the subtraction process and to some extent the difficulties associated with the sign of the difference.

\[ D = A \oplus B = A \oplus B' \]

We consider first the case where the magnitude of \( A \) is greater than \( B \).

Let \( A = 200 \quad B = 100 \)

In residue representation

then \[ B' = (0 \ 2 \ 0 \ 5) \]
and \[ \begin{array}{c}
(0 \ 2 \ 0 \ 4) \\
0 \ 2 \ 0 \ 5 \\
0 \ 1 \ 0 \ 2
\end{array} \]

The residue representation of the difference corresponds to positive 100 in the real number domain. We consider next the case where the magnitude of \( B \) is greater than the magnitude of \( A \).

\[ A' = (0 \ 1 \ 0 \ 3) \]
then \[ D = A' \oplus B \]
and \[ \begin{array}{c}
(0 \ 1 \ 0 \ 3) \\
0 \ 1 \ 0 \ 2 \\
0 \ 2 \ 0 \ 5
\end{array} \]

The difference \((0 \ 2 \ 0 \ 5)\) is the additive inverse of \((0 \ 1 \ 0 \ 2)\). Unless additional information is supplied the correct interpretation of the representation \((0 \ 2 \ 0 \ 5)\) is in doubt. \((0 \ 2 \ 0 \ 5)\) may correspond to either +110 or -100.

The difficulties associated with whether a residue representation corresponds to positive or negative integer can be partially
removed by the division of the residue number range into two parts.
This is exactly the scheme that is employed to obtain a machine repre-
sentation of positive and negative natural numbers. For the system
of natural numbers two different machine representations of the nega-
tive numbers may be obtained and are commonly designated the radix
complement representation of negative numbers and the diminished radix
complement representation of negative numbers.

The complement representation for a residue code is defined
in terms of the additive inverse. Thus the representation of negative
A is A' where A ⊕ A' = 0, and the range of A is restricted to approxi-
mately one half of the total possible range of the residue representa-
tion. This can be illustrated by consideration of a specific residue
code. The residue representation employing bases of magnitude 2, 3, 5,
and 7, is divided into two parts. The residue representations corre-
sponding to the natural numbers 0 to 104 are considered positive. The
residue representations corresponding to the natural numbers 105 to
209 are considered negative. The range of this particular number
system is from -105 to +104. The arithmetic rules pertaining to sign
and overflow conventions for this particular number system are the
same rules normally associated with radix complement arithmetic.

The complement representation does eliminate in principle any
ambiguity concerning the sign of the result of an arithmetic operation.
However, there is a practical difficulty. The determination of the
sign of a residue representation requires the determination of the
magnitude of the representation relative to the magnitude of the representation which separates the positive and negative representations. The determination of relative magnitude for a residue representation is discussed in a later section. It will be shown that the determination of relative magnitude is not a simple problem.

Conversion From a Residue Code to a Normal Number Representation

It is frequently desirable to determine the natural number associated with a particular residue representation. The need for this conversion occurs frequently when investigating the properties of the residue system. The residue representation is constructed in such a manner that magnitude is not readily obtainable. The presence of column weights in the normal number representation greatly facilitates the determination of magnitude. It is possible to assign a weight to each digit of the residue representation in such a manner that the modulo \( m \) sum of the digit-weight products is the real natural number in consistently weighted representation. \( m \) is the product of all the bases employed in the residue representation. The conversion technique is known as "The Chinese Remainder Theorem". The material which follows describes the conversion technique but omits the proof. A simple and straightforward proof is found in Dickson.\(^{(20)}\) The proof does not refer specifically to residue number systems but rather to a system of linear congruences. If so regarded, a system of congruences defines a component of a residue number system.
Consider a residue number system with bases \( m \) \( \ldots \) \( m_t \). The corresponding digits are labeled \( a_1 \) \( \ldots \) \( a_t \). The following equations define the conversion process.

\[
a_1 A_1 \frac{m}{m_1} + \ldots + a_t A_t \frac{m}{m_t} \equiv S \mod m
\]

where

\[
A_i \frac{m}{m_i} \equiv 1 \mod m_i
\]

and

\[
m = \prod_{j=1}^{t} m_j
\]

The conversion formula for a particular residue number system is now obtained.

\[
m_1 = 2 \quad m_2 = 3 \quad m_3 = 5 \quad m_4 = 7
\]

\[
105 \ A_1 \equiv 1 \mod 2 \quad \text{so} \quad A_1 = 1
\]

\[
70 \ A_2 \equiv 1 \mod 3 \quad \text{so} \quad A_2 = 1
\]

\[
42 \ A_3 \equiv 1 \mod 5 \quad \text{so} \quad A_3 = 3
\]

\[
2 \ A_4 \equiv 1 \mod 7 \quad \text{so} \quad A_4 = 4
\]

\[
105 \ a_1 + 70 \ a_2 + 126 \ a_3 + 120 \ a_4 \equiv S \mod 210
\]

The conversion formula is now used to determine the natural number corresponding to the residue representation \((1 \ 2 \ 0 \ 4)\).

\[
105 \ (1) + 70 \ (2) + 126 \ (0) + 120 \ (4) = 725
\]

\[
725 \equiv S \mod 210
\]

\[S = 95\]
Other conversion techniques exist. In particular it is possible by means of a deductive process to determine the magnitude of a particular residue representation. This requires both a knowledge of the nature of the residue system and the natural number representation associated with at least one residue representation.

Due to the deductive nature of the process it is more suitable for human computation than for machine computation. The process is explained using the residue number of the previous example (1 2 0 4). The knowledge of the residue representation for unity which is (1 1 1 1) is assumed. Consider the effect of changing the second digit from one to two. The change adds the product \( m_1m_3m_4 = 70 \) to the number since 70 is congruent 1 modulo 3. The resulting residue representation (1 2 1 1) corresponds to 71. The effect of changing the third digit is to change the magnitude by some multiple of the product \( m_1m_2m_4 = 42 \). The correct change in magnitude is 42\( x \) where 42\( x \equiv 4 \text{ Mod } 5 \). So 42\( x = 84 \) and the residue representation 1 2 0 1 corresponds to 155. The fourth digit is modified by the addition of a three. The effect of this change is determined by 30\( x \equiv 3 \text{ Mod } 7 \). The magnitude change is 150. The sum of 150 and 155 modulo 210 yields the correct result 95, in correspondence with (1 2 0 4).

The conversion formulae provide a possible means of achieving conversion from a residue code to an analog representation. The conversion would involve first the transformation of the residue code to a normal weighted representation and then a transformation to analog representation by means of standard techniques. Alternatively the conversion
process need not involve the normal representation if the analog equipment were capable of precise subtraction of multiples of \( m \) where \( m \) is the product of the bases. A third conversion process would employ counters for both the residue system and the normal number system. Comparison would be employed to determine the accumulation of the desired number of counts. At the time of comparison one set of counters would contain the residue representation and the other the desired normal representation.

**Greater Than or Less Than Relations for Residue Codes**

Some of the arithmetic operations for the residue code are dependent on the determination of a greater than or less than relationship. A possible technique might involve the conversion techniques described in the previous section. Such a scheme would involve the standard comparison techniques associated with the determination of the relative magnitude of two numbers represented in a weighted representation. The procedure described in this section does not require an explicit conversion from residue code to normal representation. Consider a residue code consisting of \( t \) digits. The \( t \) digits of the residue code are associated with \( t \) congruence relationships as follows:

\[
S = a_i \mod m_i \quad 1 \leq i \leq t
\]
S is the magnitude of the number expressed in normal representation.

It is also possible to express the number S as

\[ S = a_1 + A_1 m_1 \]

\( A_1 \) is the integer part of the quotient of \( S \) divided by \( m_1 \). In regard to a greater or less than relationship, the determination of \( A_1 \) divides the range of the residue representation into \( m/m_1 \) parts. We proceed to calculate \( A_1 \) from the set of \( t \) equations given above.

Let

\[ S = a_t + A_t m_t \] \[ A_t < \frac{m}{m_t} \]

This equation is then used to replace \( S \) in the remaining \( t-1 \) equations, yielding \( t-1 \) equations of the form

\[ A_t m_t = (a_1 + a'_t) \mod m_1 \quad 1 \leq i \leq t-1 \]

or

\[ A_t = (a_1 + a'_t) / m_t \mod m_1 \] \[ A_t = d_t \mod m_1 \]

where \( /m_t \) is the multiplicative inverse of \( m_t \) with respect to base \( m_1 \).

The multiplicative inverse is defined as

\[ x_t / x_t^i \equiv 1 \mod m_1 \]

\( d_t \) is the least positive residue of \( (a_1 + a'_t) / m_t \) with respect to base \( i \).

\( a'_t \) is the additive inverse of \( a_t \).

Let \( A_t \) be expressed as

\[ A_t = d_{t-1} + A_{t-1} m_{t-1} \] \[ A_{t-1} < \frac{m}{m_t m_{t-1}} \]
If this expression is substituted for $A_t$ a set of $t-2$ equations remain. The equations are of the form

$$A_{t-1} = \left[ d_t^i + \left( d_{t-1}^i \right) \right] / m_{t-1} \mod m_1 \quad 1 \leq i \leq t - 2$$

$$A_{t-1} = d_{t-1}^1 \mod m_1$$

The system of equations shown below is generated by repetition of the above substitution process until no equations remain.

$$S = a_t + A_t m_t$$
$$A_t = d_t^{t-1} + A_{t-1} m_{t-1}$$
$$A_{t-1} = d_{t-1}^{t-2} + A_{t-2} m_{t-2}$$
$$\vdots$$
$$\vdots$$
$$A_3 = d_3^2 + A_2 m_2$$
$$A_2 = d_2^1 \mod m_1$$

The equations are combined to yield:
\[ S = a_t + m_t \left\{ d_{t-1}^{t-1} + m_{t-1} \left[ d_{t-2}^{t-2} + m_{t-2} (d_{t-3}^{t-3} + \ldots) \right] \right\} \\
= a_t + m_t d_{t-1}^{t-1} + m_t m_{t-1} d_{t-2}^{t-2} + m_t m_{t-1} m_{t-2} d_{t-3}^{t-3} + \ldots + \frac{m}{m_1} d_2^1 \]

where \( A_t < m_t \)

\[ d_{t-n-1}^{t-n-1} < m_{t-n-1} \]

Therefore, \( S \) is never equal to or greater than \( m \) and \( d_2^1 \) divides the range into \( m_1 \) parts, \( d_3^2 \) divides each of the \( m_1 \) parts into \( m_2 \) parts, \( d_4^3 \) divides each of the \( m_2 \) parts into \( m_3 \) parts, etc.

The determination of the less than or greater than relationship consists of the successive comparison of the \( d_{t-n-1}^{t-n-1} \) constants corresponding to two residue representations. Let the representations be designated \( E \) and \( F \). The first step of the greater than or less than determination is the comparison of \( d_2^1(E) \) and \( d_2^1(F) \). If the two constants are different the process may be terminated and the larger number is associated with the larger constant. If the constants are equal in value the comparison process must consider the pair of constants \( d_3^2(E) \) and \( d_3^2(F) \). The process is continued in this manner until a set of non-identical constants are found. If all of the \( d \) constants are identical a final comparison is made on the basis of the pair of \( t \) th digits of the two residue representations.

The formulae which define the greater than, less than process may be applied recursively to obtain a formula for a greater number of digits. The process has been extended to five variables and the results are shown in Figure 14.
Figure 14. Logic for the Determination of the Greater or Less Than Relationship.
Admittedly the process required to obtain a greater than or less than relationship leaves much to be desired. One presumed advantage of the residue number was the absence of a carry process. The greater or less than process is essentially sequential and is in many ways similar to the carry process.

Redundancy

A residue code contains redundant information if the product of the bases exceeds the number of states required to represent a specific range of numbers. In particular, if sufficient redundancy is introduced the residue code may be made independent of one or more digits. Redundancy is obtained by adding one or more additional digits to the structure of a residue code sufficient to represent the desired number range. Consider the addition of a prime base \( m_i \) to a residue code with bases \( m_1, \ldots, m_t \). The resulting code is redundant. Any one digit of the code may be deleted and the code will still contain the required amount of information. Furthermore, any number of digits may be deleted if the product of the remaining bases is greater than the magnitude of the represented number. This means that normally it is possible to delete more than one low order digit but in general only one high order digit may be deleted. Any degree of redundancy is obtainable by the addition of more redundant digits.

Redundancy in the residue code is easily obtained. The real problem concerns the realization of the procedures for error correction and detection. This problem may be considered from the viewpoint of
the Chinese Remainder Theorem. A residue representation of $t+1$ digits has one redundant digit. The Chinese Remainder Theorem may be applied using only $t$ digits of the residue representation. In fact the theorem may be applied once for each of the $t+1$ combinations. Each combination consists of $t$ digits. $t+1$ solutions are obtained and if the redundant residue representation contains one erroneous digit then only one of the $t+1$ solutions is correct. The correct solution is, of course, the solution which does not involve the erroneous digit. Unfortunately, there is no way in which the correct and the incorrect solution may be distinguished within the framework of the residue code with a single redundant digit. We now consider the question of whether two of the incorrect solutions may be identical. Given a residue code with bases $m_1, \ldots, m_{t+1}$, we assume the $j$th digit $a_j$ is erroneous by $d$ units. The $t$ erroneous solutions obtained by means of the Chinese Remainder Theorem deviate from the correct solution by $\Delta$ where

$$\Delta = d \mod m_j$$

$$\Delta_i = \frac{x_i m}{m_j m_1}$$

Let the correct solution be $S$ and a pair of erroneous solutions $S_i$ and $S_k$, then

$$S + \frac{x_i m}{m_j m_1} \equiv S_i \mod \frac{m}{m_1}$$

$$S + \frac{x_k m}{m_j m_k} \equiv S_k \mod \frac{m}{m_k}$$

$$S + \frac{x_i m}{m_j m_1} = S_i + s_i m \mod \frac{m}{m_1}$$

$$S + \frac{x_k m}{m_j m_k} = S_k + s_k m \mod \frac{m}{m_k}$$

$$S < \frac{m}{m_1}$$

$$S < \frac{m}{m_k}$$
\( a < 2; \ b < 2 \)
\( a = 0, 1; \ b = 0, 1 \)

Assume \( S_i = S_k \) then
\[
\frac{x_i}{m_i} - \frac{x_k}{m_k} = \left[ \frac{a}{m_i} - \frac{b}{m_k} \right] m_j
\]

We consider the four combinations of values of \( a \) and \( b \)

1. \( \frac{x_i}{m_i} - \frac{x_k}{m_k} = 0 \) if \( x_i = m_i f \) \( x_k = m_k f \)
   \( f \) is an integer

2. \( \neq \frac{m_i}{m_i} \)

3. \( \neq \frac{m_j}{m_k} \)

4. \( \frac{x_i}{m_i} - \frac{x_k}{m_k} = \frac{m_j}{m_i} - \frac{m_j}{m_k} \)
   \( x_i m_k - x_k m_i = m_j m_k - m_j m_i \)
   \( m_k (x_i - m_j) = m_i (x_k - m_j) \)

The equality is satisfied if
\[
x_i = m_j + m_i f \quad f \text{ is an integer}
\]
\[
x_k = m_j + m_k f
\]
\[
m \left( \frac{m_i + m_i f}{m_j m_i} \right) \equiv d \text{ Mod } m_j
\]
\[
m \left( \frac{m_i + m_k f}{m_j m_k} \right) \equiv d \text{ Mod } m_j
\]
\[
\frac{m}{m_i} \equiv 0 \text{ Mod } m_j
\]
\[
\frac{m}{m_k} \equiv 0 \text{ Mod } m_j
\]
Thus both equations reduce to
\[
\frac{m_i^f}{m_j} \equiv d \mod m_j
\]

The same solution is obtained for \( x_i = m_i^f \) and \( x_k = m_k^f \) (combination one) therefore:
\[
\frac{x_i}{m_i} - \frac{x_k}{m_k} = \left( \frac{m_i}{m_i} - \frac{m_k}{m_k} \right) \text{ or } 0
\]

implies a single value of \( d \) and hence \( S_i = S_k \) only if \( d_i = d_k \). It follows that none of the solutions are identical. However, unless some auxiliary error detecting scheme is employed it is impossible to separate the correct from the incorrect solutions.

If two redundant digits are employed it becomes possible to distinguish between the correct and the incorrect solutions. The representation consists of \( t+2 \) digits and the number of solutions is the number of combinations of \( t+2 \) digits taken \( t \) at a time. When only one digit is in error the number of correct solutions will be equal to the number of combinations of \( t+1 \) things taken \( t \) at a time. The number of correct solutions is \( t+1 \). Since the total number of solutions is \( \frac{(t+1)(t+2)}{2} \) the ratio of correct to incorrect solutions is given as \( \frac{2}{t+2} \). We shall now show for a residue code with two redundant digits that none of the erroneous solutions are identical. Typically \( \Delta \) is of the form
\[
\Delta = \frac{x_{ik} m}{m_j m_i m_k} \equiv d \mod m_j
\]
A pair of erroneous solutions \( S_{1k} \) and \( S_{rs} \) are defined by

\[
S + \frac{x_{ik}m}{mjm_1m_k} = S_{1k} + a \frac{m}{m_1m_k}
\]

\[
S + \frac{x_{rs}m}{m_jm_rm_s} = S_{rs} + b \frac{m}{m_rm_s}
\]

\[a = 0, 1; \quad b = 0, 1\]

Let us assume \( S_{1k} = S_{rs} \) then

\[
\frac{x_{ik}}{m_1m_k} - \frac{x_{rs}}{m_rm_s} = \frac{am}{m_jm_k} - \frac{bm}{m_rm_s}
\]

If \( i = r \) then the relationship becomes

\[
\frac{x_{ik}}{m_k} - \frac{x_{rs}}{m_s} = \frac{am_j}{m_k} - \frac{bm_j}{m_s}
\]

This situation has already been considered for the case of one redundant digit. We need, therefore, only consider the case where \( i \neq r \). Four combinations exist since the possible values of \( a \) and \( b \) are zero and one.

1. \( \frac{x_{ik}}{m_1m_k} - \frac{x_{rs}}{m_rm_s} = 0 \)
2. \( \neq \frac{m_j}{m_1m_k} \)
3. \( \neq - \frac{m_j}{m_rm_s} \)
4. \( = \frac{m_j}{m_1m_k} - \frac{m_j}{m_rm_s} \)

The first combination is satisfied only if

\[
x_{ik} = f \frac{m_1m_k}{m}
\]

\[
x_{rs} = f \frac{m_rm_s}{m}
\]
The fourth combination is satisfied only if
\[ x_{ik} = m_j + f \cdot m_k \]
\[ x_{rs} = m_j + f \cdot m_s \]

The results indicate that identical erroneous solutions must correspond to the same d. Erroneous solutions are not identical.

Since the erroneous solutions are not identical, the presence of two identical solutions is sufficient to indicate the correct solution. It is then possible to use a smaller set of solutions involving less computation. The set of solutions is chosen so that for every base there exist at least two solutions which are not functions of that base. Let the total number of digits of a residue code be designated as \( n \). This includes both information and redundant digits. If \( n \) is even the above requirement may be met with \( n \) properly chosen solutions. If \( n \) is odd then \( n+1 \) solutions are required. There exists considerable flexibility in the choice of the solutions. If the number of solutions used is somewhat more than the required minimum the range of the representation is extended. This fact may be illustrated by an example. Consider a residue representation of six digits. The bases are consecutive primes. The lowest base is two. Six solutions are chosen with the following base pairs deleted: \( a_1 a_6, a_2 a_6, a_1 a_5, a_2 a_4, a_3 a_5, a_3 a_4 \). The range of the representation is given by the expression
\[ 0 \leq R < \frac{m}{m_1 m_j} \]
where \( m_1m_j \) is the largest product of bases corresponding to the deleted pairs of digits. For this example the largest product of deleted base digits corresponds to \( m_3m_5 \) and is equal to 55. The range of the representation if six solutions are used is zero to 551. If the code does not have the property of single error correction the largest number represented is 30030. The range of the representation can easily be extended by the addition of only one solution. The solution which did not involve \( a_3 \) and \( a_5 \) is replaced by a solution deleting \( a_5 \) and \( a_1 \) or \( a_2 \) and a solution deleting \( a_3 \) and \( a_1 \) or \( a_2 \) is added. By the addition of one solution the largest number represented for the example above is \( m/m_2m_6 \)-1 which is equal to 770.

The maximum range of the representation of an \( n \) digit residue code with single error correction is given as

\[
0 \leq R_m \leq \frac{m}{m_2m_n}
\]

An upper limit for the number of solutions required to realize this range is obtained by considering the deletion of the digits \( a_3 \) to \( a_n \) with the deletion of \( a_1 \) and \( a_2 \). This scheme will yield \( 2n-4 \) solutions and represents the absolute maximum number of solutions required to realize the above range. Usually the required number of solutions will be less than the maximum.

The efficiency of the residue code for the correction and detection of single digit errors is now compared with the theoretical efficiency predicted by Shannon's theory.\(^{(4)}\) It is known that a single error detecting and correcting code of the Hamming type obtains the
maximum theoretical efficiency if the total number of digits of the representation is $2^m - 1$, where $m$ is any non zero positive integer. The Hamming code is least efficient when the total number of digits is equal to $2^m$. The residue code will be considered from two viewpoints. First, the number of represented states will be translated to bits without regard to the form of the final code. Secondly the residue code will be considered in terms of a corresponding binary code for each residue digit. The binary coded residue representation does not use the information capacity of the binary code in the most efficient manner. However, the binary coded residue representation is one form in which the residue code would be used in practice.

The data of Table $X$ indicates the efficiency of a residue code constructed from consecutive prime bases starting from two. The code has error detecting and correcting properties and the efficiency is defined as the ratio of the logarithm base two of the number of states of the representation corresponding to information to the logarithm base two of the total number of states of the representation.

The theoretical maximum efficiency for single error correction and detection is obtained for the assumption that for an $m$ digit code $m+1$ events may occur with equal probability. The possible events are any single digit in error or no error.
<table>
<thead>
<tr>
<th>Number of Residue Digits</th>
<th>Efficiency of the Residue Code</th>
<th>Efficiency of the Binary Encoded Residue Representation</th>
</tr>
</thead>
<tbody>
<tr>
<td>4</td>
<td>.43</td>
<td>.37</td>
</tr>
<tr>
<td>5</td>
<td>.55</td>
<td>.47</td>
</tr>
<tr>
<td>6</td>
<td>.65</td>
<td>.56</td>
</tr>
<tr>
<td>7</td>
<td>.70</td>
<td>.60</td>
</tr>
<tr>
<td>8</td>
<td>.75</td>
<td>.64</td>
</tr>
<tr>
<td>9</td>
<td>.78</td>
<td>.68</td>
</tr>
<tr>
<td>10</td>
<td>.80</td>
<td>.71</td>
</tr>
<tr>
<td>11</td>
<td>.83</td>
<td>.74</td>
</tr>
</tbody>
</table>

The following formula due to Shannon specifies the maximum efficiency. \[ C = \log_2 N - \sum_{i=1}^{m+1} p_i \log_2 p_i \]

The bits of information per binary symbol is given by \[ \frac{C}{m} = 1 - \frac{1}{m} \log_2 p_1 \]

The results of Table X and the above formula are compared graphically in Figure 15. The graphical presentation permits a rapid comparison of the relative efficiencies of the various codes in spite of the fact that the total number of binary digits varies. The variation in the number of binary digits associated with a particular number of residue digits is due to the inefficient use of binary digits when each digit of the residue code is expressed by a binary code.
Figure 15. Relative Efficiency of the Residue Code.
**Division**

In any number system the operation of division is a problem. This is particularly true of the residue system. Every element of a field except zero is associated with a unique multiplicative inverse. This is one of the requirements which must be met by a mathematical system if it is to be a field. The multiplicative inverse of a field element \( x \) is denoted by \( /x \) and is defined by the following relationship:

\[
x/x = 1 \mod m_X
\]

The definition also requires closure.

The division process for residue codes is complicated by two factors. The first is the absence of a multiplicative inverse for the zero element. The second difficulty is the fact that residue division and the normal division process are in one to one correspondence only when the resulting quotient is an integer value. We shall consider first the problem of residue division of the elements of a single field and shall consider later the elements of several fields considered as a residue code. The division process represented in equation form as

\[
\frac{a}{b} = q
\]

implies the following equation:

\[
a = bq
\]

The difference between normal arithmetic and residue arithmetic is that in residue arithmetic the product \( bq \) need not necessarily be equal to \( a \), only the congruence of \( a \) and \( bq \) is required.

\[
bq \equiv a \mod m_n
\]
Multiplication by the multiplicative inverse of $b$ obtains

$$q \equiv \frac{a}{b} \text{ Mod } m_n$$

The correct interpretation of $q$ in the above equation is that the number $a$ is obtained by forming the sum consisting of $b$, $q$ representations. The sum is carried out in a closed number system of base $m_n$. Thus $q$ corresponds to the quotient only when the quotient has an integer value. Examples may be obtained from the consideration of a modulo 5 field.

$$\frac{4}{2} = 2$$

$$2q \equiv 4 \text{ Mod } 5$$
$$q \equiv 2 \text{ Mod } 5$$

$$\frac{4}{3} = q$$

$$3q \equiv 4 \text{ Mod } 5$$
$$q \equiv 3 \text{ Mod } 5$$

Note: $3 \times 3 \equiv 4 \text{ Mod } 5$

$$\frac{3}{4} = q$$

$$4q \equiv 3 \text{ Mod } 5$$
$$q \equiv 2 \text{ Mod } 5$$

In the above examples $q$ corresponds to the quotient only in the first example.
The residue code representation of a number consists of many elements. Each digit of the representation is associated with a different prime base. In other words each digit of the representation is an element of a different field. The division of two numbers in residue code may be expressed by a system of congruences. The solution \( A = (q_1, q_2, \ldots, q_n) \) must satisfy all the congruence relationships of the system. A zero digit in the divisor \( B = (b_1, b_2, \ldots, b_n) \) must be given special consideration. The congruence corresponding to any \( b_1 = 0 \) must be removed from the system. The deletion of the congruences for which \( b_1 = 0 \) is consistent with the residue definition of division given above. In general, if some \( b_1 \) equal zero then

\[
QB \not\equiv A \mod m
\]

For the special case in which \( b_1 = 0 \) and

\[
QB \equiv A \mod m
\]

it is also true that \( Q \) has an integer value. The deletion of the congruence associated with \( b_1 = 0 \) results in a relationship of the following form if there are no elements \( b_j = 0 \).

\[
QB_1 \equiv A_1 \mod \left( \frac{m}{\sum_{i=1}^{n} m_i} \right)
\]

The examples which follow are presented to clarify the process of residue division. Consider a residue code with bases 2, 3, 5 and 7. Since some confusion might exist between the normal number representation and the residue representation the residue representations
are enclosed in parenthesis.

\[ q = \frac{9}{4} \quad \frac{(1042)}{(0144)} = (q) \quad (1042) = q(0144) \]

\[ q = (x014) \]

in terms of congruences

\[ 4q \equiv 9 \text{ Mod } 105 \]
\[ 324 \equiv 9 \text{ Mod } 105 \]
\[ \frac{29}{11} = (q) = \frac{(1241)}{(1214)} = (1142) \leftrightarrow 79 \]
\[ 79 \times 11 \equiv 29 \text{ Mod } 210 \]
\[ \frac{7}{6} = (q) = \frac{(1120)}{(0005)} = (xx \ 20) \]
\[ 6 \times 7 \equiv 7 \text{ Mod } 35 \]
\[ \frac{24}{8} = (q) = \frac{(0043)}{(0231)} = (x \ 223) \]
\[ 8 \times 3 \equiv 24 \text{ Mod } 105 \]

The process of residue division has certain interesting properties and quite possibly has applications in respect to special problems. Unfortunately, the residue division process is not a substitute for normal division. It appears that the only way in which division can be effected in the residue code is by the utilization of techniques similar to those employed for division in a consistently weighted number system. The division process then requires trial and error subtraction or addition and the greater than or less than relationship. The division algorithm could also include trial multiplication since in the residue system addition and multiplication require the same period of time.
Conclusions

The material of this chapter forms a preliminary investigation of the applicability of residue number systems to the arithmetic operations of digital computers. The residue system has been found attractive in terms of the operations of multiplication and addition. It is possible to realize practical logical circuitry to yield the product in the same operation time as for the sum since the product is not obtained by the usual procedure of repetitive addition. The main disadvantages of the residue number system are associated with the necessity of determining absolute magnitude. Thus the division process, the detection of an overflow and the determination of the correct sign of a subtraction operation are operations which at this stage of the investigation seem to involve considerable complexity. Nevertheless there are certainly many special purpose applications well suited to the residue code. In particular there exists a class of control problems characterized by the absence of the need for division, the existence of a well defined range for the variables and also by the fact that the sign of the variables is known. For the problems of this class, the use of the residue code should result in a reduction of the overall computation period and give a computer with a higher bandwidth than obtainable with the conventional number system. It seems that one appropriate application of the residue code would be in the design of a concurrent or parallel digital differential analyzer.
It has been shown that the residue code with redundancy can never obtain the theoretical efficiency predicted by Shannon. The code does have the obvious advantage that an erroneous digit does not result in a propagated error. It is not unlikely that the residue code could have important applications in the field of communications. This statement is based on the fact that a sine wave may be residue encoded by phase modulation.

The ultimate usefulness of the residue code will probably be determined largely by the success of the circuit designer in perfecting circuitry ideally suited for residue code operations. In this respect there is one recent development in permanent storage devices which should revolutionize some of the standard concepts of computer design. It is not unlikely that the computer of the future will consist in part of a large number of addressable permanent tables. The standard computer operations would be executed by a series of table look-ups. The designer of such a computer should give serious consideration to the use of a residue code. It appears that the combination of residue coding and permanent addressable tables obtains numerous advantages. The combination would permit arithmetic without carry and the inclusion of the redundancy needed for error detection and correction. The permanent storage tables would greatly simplify the problems associated with the determination of signs and relative magnitude and the error correcting and detecting routine.
CHAPTER VI

CONCLUSIONS AND RECOMMENDATIONS

This thesis has dealt primarily with techniques appropriate to error checking in a computer arithmetic unit. Particular attention has been given to the logic and error structure associated with the binary adder. The conventional digital parity check has been shown to have the properties of arithmetic invariance required to check arithmetic operations. The digital parity check has been applied previously to storage and transmission systems but to the author's knowledge the check has never been applied to an arithmetic system. Previous applications of the digital parity check in computer systems have determined the parity of the result of an arithmetic operation by a means independent of the parity of the operands. A parity checking scheme which provides a check of the arithmetic operations requires cognizance of the number of generated carries. This fact is one decided disadvantage of the digital checking system. The digital parity check is not necessarily restricted to binary number systems. The digital parity check may be defined and applied to check arithmetic operations for any consistently weighted number system regardless of the base. The essential features of the digital parity check are the dependence of the check on the magnitude of the symbols of a number representation and the independence of the check on the magnitude of the weight associated with the symbols. The
definition of the digital parity check in terms of the sum of the symbols modulo b where b is the base of the number system is a much more fundamental definition than the usual definition of the digital parity check given for the binary number system. The usual definition of the digital parity check is given in terms of the oddness or the evenness of the number of ones contained in the representation.

The numerical check is based on the concept of linear congruences. The check base may therefore be any number except the magnitude of the number base. The main advantage of the numerical check is that carry cognizance is not required. The check is, however, sensitive to carry malfunctions.

The question of the effectiveness of the different parity checks in the detection of errors is closely related to the logical structure of the adder. A cursory examination of the problem reveals immediately that the major difficulty is associated with the propagation of carries in the addition process. If it were not for carry propagation it would be easy to obtain independence of the elements involved in the addition process. If the elements were independent then the simple digital parity check would be one hundred percent effective against single component malfunctions. It is possible to design arithmetic structures in which each carry element is determined independently. However a severe penalty in terms of time and equipment is associated with structures of this type. For example the carry logic normally associated with a 30 bit binary adder with non-independent carry elements consists of 29 AND gates and 29 OR gates each gate
having two inputs. The generation of instantaneous and independent carry elements would require 29 OR gates with 464 inputs and 435 AND gates with 4930 inputs. One example of a non instantaneous but independent carry logic requires 29 OR gates with 464 inputs and 481 AND gates with 1682 inputs. These figures illustrate the high price paid to obtain independence of the carry elements. There may exist some carry logic design in between the extremes illustrated by the examples given. In particular it should be possible to design a carry logic for which the carry elements are not completely independent but are less dependent than the carry elements of the standard carry logic. It should be possible to synthesize carry logic of this type by an extension of the vector space concepts which have been used to investigate the structure of binary addition. The extension of the vector space concepts to the problem of the synthesis of arithmetic structures has been given only brief consideration and should be an important area for future work. Vector space concepts have been found particularly appropriate for the analysis of the process of binary addition. The author found that the study of the binary addition process by means of vectors tended to clarify the problems associated with the structure of the binary adder. In particular the introduction of the carry matrix permitted a unified approach to the study of different configurations of carry logic.

A natural extension of the vector space concept of measure has resulted in the specification of three new addition algorithms.
The algorithms provide an interesting description of the addition process but do not lead to logical circuits having any advantage over the conventional circuits.

The study of the structure of the errors for a standard parallel adder has shown that the propagated errors due to single logical element malfunctions exist in only two forms. The two forms are a series of zeros terminated by a one or a series of ones terminated by a zero. The form of the error sequence is due largely to the fact that the propagation of an error to some element $s_{i+1}$ requires that $r_i = 1$. Error propagation is halted when some element of the vector, $r_j = 0$, where $j > i$. The probability of error propagation has been found to be a function of the type of malfunction which initiated the error. The study of the error structure of the standard adder has revealed the existence of only four different classes of error propagation probabilities corresponding to all of the possible single logical malfunctions.

Thus the effectiveness of the digital parity check is a function of the type of malfunction. A simple digital parity check consisting of only one parity check bit has a probability of success of $2/3$, $3/4$, $1/3$ and $5/9$ for each of the four possible error propagation probabilities. The digital check may be considerably more effective in the detection of errors of more than one parity check digit is employed. For example if two digits are employed in such a manner that alternate digits are associated with one check bit and the other digits with the other check digit then the probabilities associated with the success of the check
become 14/15, 19/20, 13/15 and 41/45 respectively. The extension of the
digital check to include additional check digits is not particularly
fruitful since the numerical parity check has been shown to be 100%
effective. Furthermore the numerical check has the advantage of not
requiring carry cognizance. The figures which have been stated for
the effectiveness of the various checks apply only to the results of
single addition-like operations. Multiplication and division, which
consist of repetitive addition, must be checked at each addition step
in order to obtain the parity check effectiveness stated above. The
parity effectiveness for a sequence of addition operations has not
been investigated in this paper. This would be an excellent topic
for future study.

The numerical check is superior to the digital check for the
purposes of error detection. However there are certain difficulties
associated with numerical checking if error correcting properties are
desired. The extension of the numerical check to include additional
check digits will not provide the information needed to correct an
erroneous representation unless the check representation contains all
of the information of the original representation. In this case it
becomes possible to carry out the arithmetic operations in terms of
the check representation and the original representation may be ignored.
We have called the numerical check representation for this case a
residue number system. There appears to be a fundamental limitation
in regard to the application of numerical checking procedures for the
purposes of error correcting since the numerical check is dependent on
magnitude. For example a numerical check modulo 35 consisting of check bases 7 and 5 would provide error correction information for any error of magnitude less than 35. If the magnitude of the error is greater than or equal to 35 the correction data is ambiguous. It would seem that there exists an interesting possible combination of error detecting techniques used in conjunction with a reasonable amount of circuit duplication.

The residue number system has been investigated in detail. The main advantage of the number system appears to be the complete independence of the digits of a residue representation. Addition and multiplication are both defined operations and are executed without carry between the residue digits. However, complete element independence may not be obtained within a residue digit when the digit is represented in binary code unless special logical circuitry is employed. Independent addition elements must be employed within the adder logic of a single residue digit to give complete bit by bit independence. Also if standard carry logic were employed within a residue digit the addition time while still less than that of the standard adder would not be significantly less. Multiplication in a residue code would be significantly faster than that presently achieved by standard techniques even if standard carry logic were employed within the residue digits. However, it is doubtful that the residue number system will ever be used extensively for general purpose computation due to the difficulties involved in performing the division operation. The basic difficulty
pertain to the determination of magnitude. The ultimate success of the residue code for general purpose computation is very much contingent upon the development of special components suitable to the residue code. At the present time the most probable application of the residue code is in the realm of special purpose computation. The residue code is unique in the sense that the residue digits are independent. Furthermore, the code may obtain error correcting capabilities simply by the addition of more residue digits providing of course that the base of the added digits is relatively prime to the bases of the other digits. The error correcting routine required by the residue code is moderately complicated when considered in terms of equipment. The success of the residue code as a means of error correction is dependent on the development of components suited to the properties of the residue code.

It appears that the control of computation errors due to circuit malfunctions may be obtained by means of error detecting and correcting codes which are arithmetically invariant. In this thesis the simple digital parity check and the numerical parity check have been considered in detail. While the numerical check is very effective in error detection it is not particularly suited for error correction in a general computing system. It is suggested that perhaps the most effective control of errors is to be obtained by the use of a hybrid system employing both component redundancy and error coding.
APPENDIX

SIGN AND OVERFLOW CONVENTION FOR
ONE’S AND TWO’S COMPLEMENT ARITHMETIC

The following material is a detailed description of one possible sign and overflow convention for binary arithmetic. The material is pertinent to the discussion of the form of errors which occur in the process of binary addition. This subject was discussed in Chapter IV. We consider here the addition and the subtraction processes for both one’s complement and two’s complement arithmetic.

It is conventional to speak of a binary representation to the left of the radix point as consisting of \( n \) digits plus a sign. In this case the sign would be the \( n \)th plus one digit and the last digit or element of the carry vector would be the \( n \)th plus two digit. However, the convention adopted in Chapter III is somewhat different than this. The convention of Chapter III will be used in the remaining discussion. In this convention the number including sign is represented by \( n \) digits or elements to the left of the radix point and \( m \) digits to the right of the radix point.

For two’s complement arithmetic the complement representation of a negative number \(-A\) is defined by the following relationship

\[
A' = 2^n - A \quad \text{where} \quad |A| < 2^{n-1}
\]

In two’s complement arithmetic zero is represented as a positive entity. The largest negative number is \(-2^{n-1}\) and is represented by \(2^{n-1}\) in the two's complement representation. The largest positive number representable by the \( n+m \) elements is \(2^{n-1} - 2^{-m}\). We now proceed to examine
the results of the addition process for four different augend, addend combinations.

Consider first the case where both the augend and addend are positive. The resulting sum must also be positive. Therefore, any sum which carries to the nth element constitutes an overflow of the representation. It is readily verified that every sum of two positive operands which overflows will cause a one to appear in the nth element of the sum. The smallest overflow is $2^{n-1}$ while the largest overflow obtained by the addition of the two largest positive numbers representable by n digits is $2^n - 2^m + 1$. Both of these overflow sums contain a one in the nth digit place.

We consider next the addition of a positive and a negative operand. In regard to the overflow problem there is no difficulty since an overflow is not possible. An important question is whether the last carry element may assume a value of unity. The last carry element corresponds to the nth + 1 element of the sum. It is obvious that if the resulting sum is negative then the nth + 1 element cannot equal zero for the nth element of the sum is a one and $r_n$ equals one and $c_n$ must equal zero. It is therefore, impossible for $c_{n+1}$ to equal unity. On the other hand if the resulting sum is positive it is possible for the $c_{n+1}$ digits to have a value of unity. Normally in two's complement arithmetic this digit is ignored. The positive sum obtained by the addition of a pair of positive and negative operands extends over a range defined by the addition of the largest positive
number and, in absolute magnitude, the smallest negative number to the
number zero. Zero is obtained by the addition of a positive number and
a negative number identical in absolute magnitude.

\[ 2^n \leq R \leq 2^n + 2^{n-1} - 2^{-m} \]

It is observed that the \( n \)th + 1 element is always a one over the range
of the positive sum. Furthermore, the \( n \)th element has a value of zero.

The last case is the addition of two negative operands. In
this case the \( n \)th element of both operands has a value of unity and hence,
the \( n \)th + 1 element of the sum and the carry vector will always be
unity. The investigation here concerns the behavior of the \( n \)th digit.
The sum of two negative operands can overflow. We consider first the
range of the non-overflow sum obtained by the addition of the two
smallest negative numbers (absolute magnitude) and the sum of any two
negative numbers which produces the largest (absolute magnitude) nega-
tive number which can be represented. This range is given as

\[ 2^{n-1} \leq R \leq 2^n - 2^{-m} \]

If the sum of the two negative operands overflows the result must lie
in the range specified by the sum of the two negative numbers largest
in absolute magnitude and the sum of the largest negative number in
absolute magnitude and minus one.

\[ 2^n \leq 0 \leq 2^n + 2^{n-1} - 2^{-m} \]

It is observed that if no overflow occurs the \( n \)th digit of the sum is
correct. The sum is negative and the \( n \)th digit is a one. If an over-
flow occurs the \( n \)th digit becomes a zero which is the incorrect sign.
The \( n \)th + 1 digit is a one for both situations.
In one's complement arithmetic the minimum negative number in absolute magnitude is zero given as $2^n - 2^{-m}$. The magnitude of the largest negative number is $2^{n-1} - 2^{-m}$ and is represented as $2^n - 2^{n-1}$. The largest and smallest positive numbers are $2^{n-1} - 2^{-m}$ and $2^{-m}$.

The overflow representation, due to the addition of two positive numbers, lies in the range

$$2^{n-1} \leq 0 \leq 2^n - 2^{-m+1}$$

The overflow does not affect the $n+1$ digit but does produce a one in the $n$th digit. The overflow is detected by the erroneous sign. The normal addition of two positive operands produces a sum in the range

$$2^{-m+1} \leq R \leq 2^{n-1} - 2^{-m}$$

The addition of two negative numbers always requires an end around carry correction. Before correction the range of the correct sum and the overflow sum are

$$2^{n+1} - 2^{n-1} - 2^{-m} \leq R \leq 2^{n+1} - 2^{-m+1}$$

$$2^{n+1} - 2^n \leq 0 \leq 2^{n+1} - 2^{n-1} - 2^{-m+1}$$

Before correction the overflow representation and the normal representation will always have a one value for the correction digit. The sign digit for the correct representation is either a zero or a one. The sign digit of the overflow representation is always zero. After correction the overflow and normal range of the sum is

$$2^{n+1} - 2^{n-1} \leq R \leq 2^{n+1} - 2^{-m}$$

$$2^{n+1} - 2^n + 2^{-m} \leq 0 \leq 2^{n+1} - 2^{n-1} - 2^{-m}$$

After correction the correction digit remains a one for both the normal and the overflow representation. The sign digit for the normal sum is
a one while the sign for the overflow representation is a zero which is incorrect.

The addition of a positive and a negative operand can never produce an overflow but the sum may be either positive or negative. If the sum is positive an end around correction is required. This is verified by

\[ S = A + B \quad \text{where} \quad |A| > |B| \]

\[ = 2^n - B - 2^{-m} + A \quad A \leq 2^{n-1} - 2^{-m} \]

\[ 2^n \leq S \leq 2^n + 2^{n-1} - 2^{-m+1} \]

Thus an end around correction is always required for a positive sum and the sign is correct both before and after the correction. After correction the representation is in the range.

\[ 2^n + 2^{-m} \leq S \leq 2^n + 2^{n-1} - 2^{-m} \]

The negative sum is obtained for \( S = A + B \), if \( |A| \leq |B| \)

\[ 2^{n-1} \leq S \leq 2^n - 2^{-m} \]

The negative sum obtained has a zero value for the correction digit and a one value for the sign digit. End around carry correction is not required.


