JPS6029406B2

JPS6029406B2 - Arithmetic control method

Info

Publication number: JPS6029406B2
Application number: JP54060720A
Authority: JP
Inventors: 眞宏橋本; 健一和田
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1979-05-17
Filing date: 1979-05-17
Publication date: 1985-07-10
Also published as: JPS55153051A

Description

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to an arithmetic control method, and more particularly to a method for adders in which the propagation delay due to printed wiring patterns is large.

In information processing devices that require high-speed processing, the most common way to raise processing speed is to make the data length that can be processed within one machine cycle as large as possible.

An ordinary high-speed processor handles a data width of about 8 bytes within one machine cycle. For a binary carry look-ahead adder, a 4-byte-wide adder and an 8-byte-wide adder differ by only about one or two gate stages, so as far as the adder itself is concerned, the difference in gate delay has traditionally not been very large.

Recently, however, advances in highly integrated semiconductor technology have dramatically increased the speed per gate stage, so that the propagation delay of the printed wiring patterns that interconnect semiconductor elements now accounts for a very large share of the machine cycle of a processing device. Comparing a 4-byte adder with an 8-byte adder, the former need only propagate a carry from the least significant bit across the 31 higher bit positions, whereas the latter must propagate it across the 63 higher bit positions, so the inter-element delay for carry propagation is inevitably larger in the latter.

Furthermore, a conventional processing device has only one adder, so all arithmetic instructions are handled by that single adder, and the number of data paths feeding the adder becomes large.

Consequently, when such a logic device is implemented in actual semiconductor integrated circuits, the physical limit on the number of input/output pins of each integrated circuit forces the elements to be placed apart from one another.

To escape these physical constraints and improve the processing capability of the device, a method is used in which several arithmetic units are provided and each is assigned a group of instructions to process.

FIG. 1 is an explanatory diagram of a function-distributed arithmetic unit having a plurality of arithmetic units.

In FIG. 1, an arithmetic unit 1 is divided into a first arithmetic unit 2 (hereinafter, the F unit) and a second arithmetic unit 3 (hereinafter, the G unit).

The F unit 2 executes floating-point instructions and fixed-point multiplication and division, while the G unit 3 executes fixed-point addition and subtraction, logical operations, decimal operations, privileged instructions, interrupt processing, and the like. A divided configuration such as that of FIG. 1 increases the amount of hardware in the device as a whole, but because each of the arithmetic units 2 and 3 is improved in the functions for which its structure is best suited, the processing capability of the whole device can be raised. Specifically, in FIG. 1 the adder of the G unit 3 performs fixed-point addition and subtraction, decimal operations, and logical operations, whereas the adder of the F unit 2 can execute floating-point instructions and multiplication and division as long as it can perform binary addition and subtraction. It therefore needs to be designed for binary addition and subtraction only, and requires less hardware than the adder of the G unit 3.

The adder of the G unit 3, on the other hand, must support more kinds of operations than the adder of the F unit 2. However, because instructions that form operation loops, such as multiplication and division, are executed in the F unit 2, the G unit 3 can allot a longer time to its adder within one machine cycle.

Turning to the input data of the arithmetic unit 1, the F unit 2 only needs interfaces to the general-purpose registers, floating-point registers, and storage for inputting the first and second operands of floating-point and multiply/divide instructions. The G unit 3, in contrast, must communicate with many resources within the device: in addition to the two operand inputs of general instructions, it handles control information within the device (program status word, control register contents, and the like) and the sending of channel addresses for input/output channels in order to process privileged instructions and the like.

Therefore, in the G unit 3, because a semiconductor integrated circuit is physically limited in its number of input/output pins as described above, it becomes difficult to mount the upper and lower bit positions of the adder close to each other. Compared with the F unit 2, the adder of the G unit 3 in the configuration of FIG. 1 has more input data paths into the arithmetic unit itself and must support more kinds of operations, so it is harder to mount its parts close together.

It therefore becomes difficult to propagate the carry within the prescribed time in one machine cycle. The object of the present invention is to solve this problem by providing an arithmetic control method that, when it is difficult to propagate the adder carry within the prescribed machine cycle, executes addition and subtraction without degrading processing performance within the machine cycle of the device.

The arithmetic control method of the present invention is characterized by having: two adders, namely an upper adder that operates on the upper part of the input data length and a lower adder that operates on the lower part, each outputting its result in one cycle; a first flip-flop that holds the carry into the upper part generated by the lower adder; a second flip-flop that supplies the carry held in the first flip-flop to the upper adder in the next cycle; and condition determination means that operates for a compare instruction and makes its determination from the equality and carry indications of the upper adder and the equality and carry indications of the lower adder. For an add instruction, the lower part is input and added by the lower adder in the first cycle, and the upper part is then input and added by the upper adder in the second cycle. For a compare instruction, the upper and lower parts are subtracted simultaneously, and the result is determined by the condition determination means without propagating a carry from the lower part to the upper part.

An embodiment of the present invention is described below with reference to the drawings.

FIG. 2 is a block diagram of an adder according to the present invention.

For convenience of explanation, the basic data width is assumed below to be 8 bytes.

The X register 4 and the Y register 5 are each 8-byte registers and supply the X-side and Y-side inputs of the adder.

Although the X-side input register 4 and the Y-side input register 5 are each shown with one input line, each may have several. The adder 6 is the upper adder, which takes the upper 4 bytes of the X register 4 and the upper 4 bytes of the Y register 5 and performs a 4-byte addition; the adder 7 is the lower adder, which takes the lower 4 bytes of the X register 4 and the lower 4 bytes of the Y register 5 and performs a 4-byte addition. Output registers 8 and 9, which store the results of the adders 6 and 7, can return the results to the X register 4 or the Y register 5 via signal line 104. Flip-flop 10 stores the carry out of the most significant bit of the lower 4 bytes, and flip-flop 11 is a delay flip-flop that passes the lower carry to the upper carry input when an 8-byte addition or subtraction is performed. The condition determination unit 12 evaluates the result of the subtraction performed for a compare instruction.

Appropriate data paths are provided so that the X register 4 and Y register 5 can exchange data with the general-purpose registers or storage, or with other arithmetic units in the arithmetic unit such as a shifter, but these are omitted because they are not directly relevant. In FIG. 2, one machine cycle is allotted for data to pass from the X register 4 and Y register 5 into the adders 6 and 7, undergo the required operation, and return through the output registers 8 and 9 and signal line 104 to the X register 4 and Y register 5.

The adders 6 and 7 can perform logical operations such as OR, AND, and exclusive OR, as well as addition and subtraction of binary or decimal numbers.

In an adder, the signal that takes longest to propagate is the carry from the least significant bit to the most significant bit. Logical operations require no signal propagation between upper and lower bits, so they pass through fewer gate stages and complete in less time than addition and subtraction. The adder of FIG. 2 is therefore divided into the adder 6 for the upper 4 bytes and the adder 7 for the lower 4 bytes. An 8-byte addition or subtraction is performed by adding the lower 4 bytes in the first cycle and the upper 4 bytes in the next cycle, so that the 8-byte result is obtained in two cycles. This relaxes the timing constraint that the machine cycle places on the adder. To avoid degrading effective performance at the same time, the following methods are applied to logical operations, compare instructions, and continuous operations.

Namely, logical operations are processed 8 bytes at a time in one machine cycle. For compare instructions, the comparison of the 8-byte input data is computed in one machine cycle without propagating a carry from the lower 4 bytes to the upper 4 bytes. For continuous operations, the upper 4 bytes and lower 4 bytes are operated on successively at a one-cycle pitch, which limits the performance loss relative to an 8-byte adder to a single additional machine cycle. The logical operations, 8-byte addition and subtraction, compare instruction processing, and continuous operations are described below with reference to FIGS. 3 to 6.
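The two-cycle 8-byte addition outlined above can be sketched in software as follows. This is only an illustrative model, not the circuit of the patent: the function names, the 32-bit mask, and the representation of the flip-flops as ordinary variables `ff10` and `ff11` are assumptions made for the example.

```python
MASK32 = 0xFFFFFFFF  # width of each 4-byte adder (illustrative constant)


def split(value):
    """Split an 8-byte value into (upper 4 bytes, lower 4 bytes)."""
    return (value >> 32) & MASK32, value & MASK32


def add64_two_cycles(x, y):
    """Model an 8-byte add done as two 4-byte adds in successive cycles."""
    xh, xl = split(x)
    yh, yl = split(y)

    # First cycle: only the lower adder result is used; its carry-out
    # is captured in the first flip-flop (modelled here as ff10).
    low_sum = xl + yl
    out_low = low_sum & MASK32
    ff10 = low_sum >> 32

    # Cycle boundary: the carry moves to the delay flip-flop (ff11).
    ff11 = ff10

    # Second cycle: the upper adder adds the upper halves plus ff11.
    high_sum = xh + yh + ff11
    out_high = high_sum & MASK32

    return (out_high << 32) | out_low


def xor64_one_cycle(x, y):
    """A logical operation needs no carry, so both halves finish together."""
    xh, xl = split(x)
    yh, yl = split(y)
    return ((xh ^ yh) << 32) | (xl ^ yl)
```

For example, `add64_two_cycles(0x00000000FFFFFFFF, 1)` carries out of the lower half in the first cycle and yields `0x0000000100000000` after the second. The carry never has to cross both halves within a single cycle, which is the point of the split.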

In FIG. 2, when a logical operation is requested of the adders 6 and 7 with two 8-byte input operands set in the X register 4 and Y register 5, no carry is propagated, so the adders 6 and 7 complete the 8-byte logical operation within one machine cycle. FIG. 3 is a time chart of the logical operation in FIG. 2.

One machine cycle of the device is divided by the time pulses t0 to t3. When the data in the X register (X-REG) 4 and Y register (Y-REG) 5 are applied simultaneously to the adders 6 and 7, the result is set in the output registers (O-REG) 8 and 9 at a time pulse within the cycle and is then returned to the X register 4 and Y register 5 via signal line 104, so that the 8-byte logical operation is executed in one machine cycle (E0 in FIG. 3).

FIG. 4 is a time chart of the 8-byte addition/subtraction operation in FIG. 2.

When 8 bytes of input data are loaded into each of the X register 4 and Y register 5 at the start of the E0 cycle, the result for the lower 4 bytes is first set in the lower 4-byte output register (O-REG(L)) 9 at a time pulse within the E0 cycle, and at the same time the carry from the lower 4 bytes into the upper part is set in flip-flop FF10.

The contents of flip-flop 10 are transferred to flip-flop FF11 at time t0. Then, at a time pulse within the next cycle E1, the result of adding the upper 4 bytes of the X register 4, the upper 4 bytes of the Y register 5, and the carry from the lower part held in flip-flop FF11 is set in the upper output register (O-REG(U)) 8. At that point in the E1 cycle the 8-byte result is held in the upper 4-byte output register (O-REG(U)) 8 and the lower 4-byte output register (O-REG(L)) 9, and the 8-byte addition or subtraction is completed when the result is transferred to the X register 4 and Y register 5 at time t0 at the end of the E1 cycle.

FIG. 5 is a detailed block diagram of the condition determination unit for the compare instruction in FIG. 2.

When two 8-byte operands are compared, with X-REG and Y-REG denoting the contents of the X register 4 and Y register 5, the condition determination unit evaluates each of the following conditions and sets the condition code.

$$X\text{-REG} = Y\text{-REG}, \qquad X\text{-REG} < Y\text{-REG}, \qquad X\text{-REG} > Y\text{-REG} \tag{1}$$

When a compare instruction is requested, the adders 6 and 7 subtract Y-REG from X-REG, and this 8-byte subtraction completes in one machine cycle.

Normally, a compare instruction does not need the operation result itself; it needs only the carry out of the most significant position of the result and whether the result is all zeros. Therefore, when a compare instruction is executed, the carry input to the upper 4-byte adder 6 is forced to "1", independently of flip-flop 11, in order to form the two's complement of the Y input, and at the end of the operation the upper output register holds the result for the upper 4 bytes with the carry from the lower bytes ignored.

Consider first the magnitude comparison within the upper 4 bytes alone, writing X-REG(U) for the upper 4 bytes of the X register 4 and Y-REG(U) for the upper 4 bytes of the Y register 5. If X-REG(U) = Y-REG(U), the subtraction result is "0". If X-REG(U) < Y-REG(U), the result is not "0" and the carry out of its most significant position is "0". If X-REG(U) > Y-REG(U), the carry out of the most significant position is "1" (with a nonzero result, since equal operands also produce a carry of "1"). Each case can be determined accordingly.

In the 8-byte comparison, the adders 6 and 7 subtract the upper 4 bytes and the lower 4 bytes independently. If the upper 4 bytes yield X-REG(U) > Y-REG(U) or X-REG(U) < Y-REG(U), the comparison result for the two operands clearly follows from the upper 4 bytes alone, regardless of the result for the lower 4 bytes.

If, on the other hand, the subtraction result for the upper 4 bytes is "0", this shows only that the upper 4 bytes of the X register 4 and Y register 5 are identical, and the magnitude relation of the whole 8-byte data is determined by the subtraction result for the lower 4 bytes.

In FIG. 5, signal line 104 is "1" when the result for the upper 4 bytes is all zeros; signal line 105 is "1" when the upper 4-byte operation produces a carry out of its most significant bit; signal line 106 is "1" when the lower 4 bytes from the adder 7 are all zeros; and signal line 107, which is connected to flip-flop 10, is "1" when there is a carry from the lower 4 bytes into the upper part. When X-REG = Y-REG, signal lines 104 and 106 are input to AND circuit 19 and signal line 108 becomes "1". When X-REG < Y-REG, signal line 110 becomes "1" through OR circuit 23, driven by AND circuit 18 if the upper 4 bytes are not equal or by AND circuit 21 if they are equal. When X-REG > Y-REG, signal line 109 becomes "1" through OR circuit 22, driven by AND circuit 17 if the upper 4 bytes are not equal or by AND circuit 20 if they are equal.
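The condition determination just described can be modelled with a short sketch. It assumes an unsigned comparison and 4-byte halves, and it represents the four inputs (upper result all zeros, upper carry, lower result all zeros, lower carry) as plain Python values; the function names and the returned strings are hypothetical and chosen only for illustration.

```python
MASK32 = 0xFFFFFFFF  # width of each 4-byte adder (illustrative constant)


def sub32_flags(a, b):
    """Compute a - b as a + ~b + 1 on 4 bytes.

    Returns (is_zero, carry_out); the forced carry-in of 1 completes
    the two's complement of b, independent of any other adder.
    """
    total = a + ((~b) & MASK32) + 1
    return (total & MASK32) == 0, total >> 32


def compare64(x, y):
    """Compare two 8-byte unsigned values with two independent 4-byte
    subtractions and no carry passed from the lower to the upper half."""
    zero_u, carry_u = sub32_flags((x >> 32) & MASK32, (y >> 32) & MASK32)
    zero_l, carry_l = sub32_flags(x & MASK32, y & MASK32)

    # Equal: both halves produced an all-zero result.
    if zero_u and zero_l:
        return "EQ"
    # Less: decided by the upper half if it differs, else by the lower half.
    if (not zero_u and carry_u == 0) or (zero_u and carry_l == 0):
        return "LT"
    # Greater: the remaining case (carry of 1 with a nonzero result).
    return "GT"
```

Because each half is subtracted on its own, both sets of flags are available in the same cycle, and the final decision is a small amount of combinational logic over four signals rather than a full 8-byte carry chain.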

By providing the condition determination unit 12 configured in this way, a compare instruction can ignore the carry from the lower 4 bytes to the upper 4 bytes and perform two independent 4-byte subtractions, so that an 8-byte comparison can be executed in one machine cycle.

Next, the continuous addition operation, which is the third feature of the present invention, is described with reference to FIG. 6.

In the present invention, two machine cycles are normally required to obtain an 8-byte addition result. In a continuous operation such as the repeated addition X = X + Y, however, where the result of the previous cycle is added to again in the next cycle, the operation can be carried out with a loss of only one cycle in loop performance compared with an adder that performs an 8-byte addition in one cycle.

If the operation loop X = X + Y is repeated n times, the final result X is given by the following equation, where X_0 is the initial value of the variable X.

$$X = X_0 + \sum_{i=0}^{n-1} Y_i \tag{2}$$

Writing XH_i, YH_i and XL_i, YL_i for the upper and lower 4 bytes of X and Y, the lower 4 bytes at the i-th loop are given by

$$XL_i = XL_0 + \sum_{r=0}^{i-1} YL_r \tag{3}$$

If C_i denotes the carry propagated from the lower 4 bytes into the upper part, the upper 4-byte result that should be obtained at the i-th loop is given by

$$XH_i = XH_0 + \sum_{r=0}^{i-1} YH_r + \sum_{r=0}^{i-1} C_r \tag{4}$$

Therefore, at the end of the n-th addition, the final result is obtained as

$$XH_n = XH_0 + \sum_{i=0}^{n-1} YH_i + \sum_{i=0}^{n-1} C_i \tag{5}$$

In the present invention, the use of flip-flops 10 and 11, which transfer the carry from the lower 4 bytes to the upper 4 bytes, allows 8-byte operations to be executed without incurring overhead in continuous addition and subtraction.

That is, in the operation loop above, the carry from the lower part that is applied in the i-th loop is taken to be the carry C_{i-1} of one cycle earlier. Equation (6) survives in the source text only as its summation limit j-1; reconstructed from the surrounding equations, it presumably reads

$$XH_j = XH_0 + \sum_{i=0}^{j-1} YH_i + \sum_{i=0}^{j-2} C_i \tag{6}$$

Therefore, at the end of the n-th loop,

$$XH_n = XH_0 + \sum_{i=0}^{n-1} YH_i + \sum_{i=0}^{n-2} C_i \tag{7}$$

is obtained. The only difference between equation (5) and equation (7) is that equation (7) is smaller by C_{n-1}, the carry from the lower part in the final loop.

Therefore, if the operation

$$XH_{n+1} = XH_n + C_{n-1} \tag{8}$$

is executed after the n loops, a result equal to the upper 4 bytes of the result of the operation loop described above is obtained.

In the first operation cycle (1CYL) of FIG. 6, the operation is executed by the upper 4-byte adder 6 and the lower 4-byte adder 7 within one machine cycle, and the result is set in the X register 4.

In this case, the carry from the lower 4 bytes into the upper 4-byte adder 6 is forced to "0".

At the end of the first operation cycle (1CYL), the carry from the lower 4 bytes into the upper part is set in the latch (FF10), and in the second operation cycle (2CYL) it is added as the carry from the lower 4 bytes into the upper 4 bytes. In the second operation cycle (2CYL), the result is again set in the X register 4 and the carry from the lower 4 bytes to the upper 4 bytes is set in flip-flops 10 and 11. Proceeding in the same way, at the end of the n-th operation loop the lower 4 bytes of the X register 4 hold the correct value given by

$$XL_n = XL_0 + \sum_{i=0}^{n-1} YL_i \tag{9}$$

The upper 4 bytes of the X register 4, on the other hand, hold the value of equation (7), which is short by the carry from the lower 4 bytes into the upper bytes in the n-th cycle.

Therefore, in the (n+1)-th operation cycle, the carry of the n-th cycle is added to the upper 4 bytes of the X register 4 to obtain the final result for the upper 4 bytes. That is, in the final addition the Y-side inputs of the upper and lower 4-byte adders 6 and 7 are forced to "0" so that X + 0 is computed, and the contents of flip-flop 11 are supplied as the carry into the upper 4-byte adder 6. In this way, the adders 6 and 7 realize n operation loops by repeating the addition n + 1 times.
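The continuous addition scheme, in which each cycle's lower carry is consumed one cycle late and a final X + 0 cycle absorbs the last carry, can be checked with a small model. This is a sketch under simplifying assumptions: the two flip-flops are merged into one variable `carry_ff`, the halves are 4 bytes wide, and the function name is hypothetical.

```python
MASK32 = 0xFFFFFFFF            # width of each 4-byte adder (illustrative)
MASK64 = 0xFFFFFFFFFFFFFFFF    # full 8-byte width


def accumulate_pipelined(x0, ys):
    """Model X = X + Y repeated over ys using n + 1 adder cycles."""
    xh, xl = (x0 >> 32) & MASK32, x0 & MASK32
    carry_ff = 0  # first cycle: carry into the upper adder forced to 0

    for y in ys:
        yh, yl = (y >> 32) & MASK32, y & MASK32
        # Both halves add in the same cycle; the upper adder uses the
        # carry produced by the lower adder in the PREVIOUS cycle.
        low = xl + yl
        high = xh + yh + carry_ff
        xl = low & MASK32
        xh = high & MASK32
        carry_ff = low >> 32  # held for use in the next cycle

    # Final (n+1)-th cycle: Y inputs forced to 0, so X + 0 is computed
    # and the last held carry is added into the upper half.
    xh = (xh + 0 + carry_ff) & MASK32
    return (xh << 32) | xl
```

For instance, `accumulate_pipelined(0xFFFFFFFF, [1, 1])` returns `0x100000001`, the same value as adding the operands directly at full width. The model takes n + 1 adder cycles for n additions, which matches the single extra cycle stated in the text.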

This requires control means that forces the carry from the lower part to "0" in the first cycle, and control means that, in the final cycle, forces the Y-side input to "0" and feeds the contents of the X register 4, together with the contents of flip-flop 11 as the carry from the lower part, into the upper 4-byte adder 6. In addition, by providing a left pre-shifter of several bits and a decimal correction circuit on the X-side or Y-side input of the adders 6 and 7, binary numbers can be converted to decimal numbers.

In this case, the conversion can be realized simply by providing the flip-flops 10 and 11 shown in FIG. 2 on the pre-shifter outputs that run from the lower part to the upper part and applying the control described for FIG. 6. Thus, in a function-distributed arithmetic unit such as that shown in FIG. 1, when mounting constraints make it difficult to place the adder of the G unit 3 within a compact area, the present invention is applied to the adder of the G unit 3, while the adder of the F unit 2 is designed to propagate the carry across the required data width in one machine cycle.

If multiplication, division, and floating-point instructions are assigned to the F unit 2, the binary additions and subtractions to be executed in the G unit 3 are expected to be mostly about 4 bytes wide. As described above, according to the present invention, when it is difficult to keep the delay time of an adder whose width matches the data width of the information processing device within the prescribed machine cycle, an adder as wide as the device data width can be realized without degrading the performance of the processing device. The present invention is particularly effective in function-distributed arithmetic units but is not limited to them; it is generally effective when applied to adders in which the propagation delay due to printed wiring patterns is large.

[Brief description of the drawings]

FIG. 1 is an explanatory diagram of a function-distributed arithmetic unit; FIG. 2 is a block diagram of an adder showing an embodiment of the present invention; FIGS. 3 and 4 are operation time charts of the logical operation and of addition/subtraction in FIG. 2; FIG. 5 is a block diagram of the condition determination unit for the compare instruction in FIG. 2; and FIG. 6 is an explanatory diagram of the continuous addition operation in FIG. 2.

1: arithmetic unit; 2: first arithmetic unit (F unit); 3: second arithmetic unit (G unit); 4: X register; 5: Y register; 6: upper 4-byte adder; 7: lower 4-byte adder; 8, 9: upper and lower output registers; 10: carry propagation flip-flop; 11: delay flip-flop; 12: condition determination unit; 13 to 16: inverter circuits; 17 to 21: AND circuits; 22, 23: OR circuits.

Claims

1. An arithmetic control method characterized by having: two adders, namely an upper adder that operates on the upper part of the input data length and a lower adder that operates on the lower part, each outputting its result in one cycle; a first flip-flop that holds the carry into the upper part generated by the lower adder; a second flip-flop that supplies the carry held in the first flip-flop to the upper adder in the next cycle; and condition determination means that operates for a compare instruction and makes its determination from the equality and the presence or absence of a carry in the upper adder and the equality and the presence or absence of a carry in the lower adder; wherein, for an add instruction, the lower part is input and added by the lower adder in a first cycle and the upper part is then input and added by the upper adder in a second cycle, and, for a compare instruction, the upper and lower parts are subtracted simultaneously and the result is determined by the condition determination means without propagating a carry from the lower part to the upper part.