JPS595941B2

JPS595941B2 - Data array engineering

Info

Publication number: JPS595941B2
Application number: JP49117553A
Authority: JP
Inventors: 真理雄所; 秀夫相磯; 俊一内田; 信夫林
Original assignee: Takeda Riken Industries Co Ltd
Current assignee: Advantest Corp
Priority date: 1974-10-11
Filing date: 1974-10-11
Publication date: 1984-02-08
Also published as: JPS5162635A

Description

【発明の詳細な説明】この発明は複数要素の列を構成するデータアレイについ
て行なう演算処理装置に関する。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to an arithmetic processing device that performs arithmetic processing on a data array that constitutes a column of a plurality of elements.

従来データアレイについての演算は一般的な要求は少な
く、特定のものに限られていた。Conventionally, operations on data arrays have had few general requirements and have been limited to specific operations.

このためその演算のための専用の装置が使用されていた
。その場合、その演算を高速度に処理するため、必要な
演算処理を最小演算単位に分け、その演算単位が順次、
即ち縦続的に行なわれる。いわゆるパイプライン構成と
されていた。この演算装置は予め決められた一つの演算
機能の他の演算は行なうことができなかつた。一方マイ
クロプログラム制御装置を利用すればそのプログラムを
入れ替えることにより各種の機能を行なうことができ、
融通性に富む。しかしその動作は記憶装置からデータを
読出し、一つの最小演算単位の演算を行なうと、記憶装
置に一度格納することを繰返して演算を行なうものであ
り、上記データアレイのようなものについて各データに
つき複数の最小演算単位を必要とする演算を行なうには
非常に長い時間を必要ハとする。For this reason, a dedicated device was used for the calculation. In that case, in order to process the calculation at high speed, the necessary calculation processing is divided into the minimum calculation units, and the calculation units are sequentially
That is, they are carried out in series. It had a so-called pipeline configuration. This arithmetic device was unable to perform other arithmetic operations than one predetermined arithmetic function. On the other hand, if you use a microprogram control device, you can perform various functions by replacing the program.
Full of flexibility. However, this operation involves reading data from the storage device, performing an operation on one minimum unit of operation, storing it once in the storage device, and performing the operation repeatedly. It takes a very long time to perform an operation that requires a plurality of minimum operation units.

この発明の目的は融通性に富み、しかも高速度の演算を
行なうことができるデータアレイ醐算装置を提供するに
ある。SUMMARY OF THE INVENTION An object of the present invention is to provide a data array calculation device that is highly flexible and capable of performing high-speed calculations.

この発明によれば読み書き可能な少なくとも一つの記憶
装置を設け、これにデータアレイがその構成要素につき
連続的に番地付けられて格納される。According to the invention, at least one readable/writable memory device is provided in which a data array is stored sequentially addressed with respect to its constituent elements.

この記憶装置はアドレス発生装置から発生された番地に
て指定されてデータが読出され、そのデータは読出バス
を通じて演算装置へ供給される。この演算装置は上記記
憶装置の読出し、書込み周期を最小演算単位とし、その
演算単位の複数倍１をもつて演算処理が完了するパイプ
ライン構成とされる。この演算装置からの演算結果は書
込バスを通じて記憶装置へ書込データとして供給される
。上記読出し書込制御はマイクロプログラム制御装置に
より行なわれる。各種の演算が行なえるようにマイクロ
プログラム制御装置に対し、各種のプログラムを書替え
ることかできる。また演算装置内にスイツチが挿入され
、そのスイツチの制御により演算機能を変更できるよう
にされ、又は演算機能が異なる演算装置が予め複数設け
られる。演算機能の指定、使用記憶装置の指定などは演
算操作に入る前に予めセツトアツブレジスタにセツトす
ることにより、演算操作中のマイクロ命令ではこれら演
算機能の指定や記憶装置の指定のためのビツトを使用し
ない。また演算装置をパイプライン構成とし、適当に制
御用レジスタを設けることにより、マイクロ命令の融通
性を増しても、そのためにマイクロ命令の構成ビツト数
を比較的少なくでき、処理速度が低下しない。また動作
始め及び終りにおいて演算装置におけるパイプラインが
詰まるまで、及びパイプラインからデータが出てしまう
までを除き、記憶装置に対し、データの読出し書込みが
各サイクル毎に行なわれ、その毎に演算装置の各パイプ
ラインのステツプにおいて最小演算単位の処理が行なわ
れてその内部のデータは１ステツプ移動される。記憶装
置の１回の読出しごとに１つの演算結果が得られ、高速
度の処理が行なわれる。次に図面を参照してこの発明に
よるデータアレイ演算装置を詳細に説明しよう。Data is read out from this storage device by being specified by an address generated by an address generator, and the data is supplied to an arithmetic unit through a read bus. This arithmetic device has a pipeline configuration in which the read/write cycle of the storage device is the minimum arithmetic unit, and arithmetic processing is completed in multiples of the arithmetic unit. The calculation results from this calculation device are supplied as write data to the storage device via the write bus. The above read/write control is performed by a microprogram control device. Various programs can be rewritten to the microprogram control device so that various calculations can be performed. Further, a switch is inserted into the arithmetic device, and the arithmetic function can be changed by controlling the switch, or a plurality of arithmetic devices with different arithmetic functions are provided in advance. By setting the arithmetic function specification, storage device specification, etc. in the set-up register before starting the arithmetic operation, the microinstruction during the arithmetic operation can use the bits for specifying the arithmetic function and storage device. Don't use. Further, even if the flexibility of microinstructions is increased by configuring the arithmetic unit in a pipeline and providing appropriate control registers, the number of bits constituting the microinstructions can be made relatively small, so that the processing speed does not decrease. In addition, data is read from and written to the storage device every cycle, except until the pipeline in the arithmetic unit becomes clogged at the beginning and end of the operation, and until data is output from the pipeline. At each step of the pipeline, the minimum unit of operation is processed, and the data within it is moved one step. One calculation result is obtained each time the storage device is read, and high-speed processing is performed. Next, the data array arithmetic device according to the present invention will be explained in detail with reference to the drawings.

第１図に示すようにこの例ではランダムアクセス型の読
出し書込みができる主記憶装置ＭＭ，，ＭＭ２，ＭＭ３
の３個が設けられ、これ等主記憶装置ＭＭ，〜ＭＭ３は
書込みレジスタＷＲｌ〜ＷＲ３内のデータがそれぞれ書
込まれ、ＭＭｌ〜ＭＭ３から読出されたデータは読出レ
ジスタＲＲｌ〜ＲＲ３にそれぞれ蓄えられる。主記憶装
置ＭＭ，〜ＭＭ３はそれぞれ各番地は左右の１６ビツト
づつに分けられ、これ等各主記憶装置にはデータアレイ
の各要素が１つの番地に記憶され、そのアレイの順に各
要素は連続する番地に順次記憶される。これ等主記憶装
置ＭＭｌ〜ＭＭ３に対するアドレスがアドレス発生装置
ＭＡＵにて作られる。アドレス発生装置ＭＡＵは主記憶
装置ＭＭが読出されるごとにデータカウンタＤＣが１加
算され、そのカウンタの出力が直接、基底アドレスレジ
スタＢＡＲｌ，ＢＡＲ２の内容と第６図の回路１０及び
１１でそれぞれ加算、又は減算されてアドレスシフトレ
ジスタＡＳＲｌ及びＡＳＲ２をそれぞれ通じて主記憶装
置ＭＭｌ及びＭＭ２、更に必要に応じてＭＭ３にアドレ
ス情報として与えられる。As shown in FIG. 1, in this example, the main memories MM, MM2, MM3 are capable of random access type reading and writing.
The data in write registers WR1-WR3 is written into these main memories MM, -MM3, respectively, and the data read from MM1-MM3 are stored in read registers RR1-RR3, respectively. Each address in the main memory devices MM, ~MM3 is divided into 16 bits on the left and right, and each element of the data array is stored in one address in each of these main memory devices, and each element is consecutive in the order of the array. The data is stored sequentially at the address. Addresses for these main memories MM1 to MM3 are generated by address generator MAU. In the address generator MAU, the data counter DC is incremented by 1 each time the main memory MM is read, and the output of the counter is directly added to the contents of the base address registers BARl and BAR2 in circuits 10 and 11 of FIG. 6, respectively. , or subtracted and provided as address information to main memories MM1 and MM2, and further to MM3 as necessary, through address shift registers ASR1 and ASR2, respectively.

或いは後で詳細に説明するようにデータカウンタＤＣの
出力はそのビツトの順位がビツト入替回路１Ｘにて変更
され、更に基底アドレスが加算又は減算された後アドレ
スレジスタＡＳＲを通じて主記憶装置ＭＭ，〜ＭＭ３（
以下ＭＭ，，ＭＭ２，ＭＭ３を代表してＭＭと記す）に
与えられる。データカウンタＤＣの出力及びそのビツト
順位が変更されたものが選択的にアドレス補正回路ＴＷ
Ａを通じて主記憶装置ＭＭ３に与えられる。書込バス１
２，１３はそれぞれ３２ビツトのバスであり、このバス
のデータが主記憶装置ＭＭｌ〜ＭＭ３の上記アドレス発
生装置ＭＡＵにて指定された番地に書込まれ、また記憶
装置ＭＭｌ〜ＭＭ３のその指定された番地から読出され
たデータはそれぞれ３２ビツトのアウトバス１４，１５
に伝送される。Alternatively, as will be explained in detail later, the bit order of the output of the data counter DC is changed by the bit switching circuit 1X, and the base address is added or subtracted, and then the output is sent to the main memories MM, to MM3 through the address register ASR. (
MM, MM2, and MM3 are hereinafter referred to as MM). The output of the data counter DC and its bit order changed are selectively sent to the address correction circuit TW.
A is given to the main memory device MM3 through A. writing bus 1
2 and 13 are 32-bit buses, respectively, and the data on these buses is written to the address specified by the address generator MAU in the main memory devices MM1 to MM3, and is written to the specified address in the memory devices MM1 to MM3. The data read from the address is sent to the 32-bit outbus 14, 15, respectively.
transmitted to.

バス１２〜１５はこれを通過するデータの幅が１ワード
（１６ビツト）であるか、２ワード（３２ビツト）であ
るかにより、バススキユニツトＢＳＵにてそれぞれ切替
えられる。またバスと記憶装置ＭＭとの間において切替
ユニツトＢＳＳにて接続されるバスの選択を行なうこと
ができる。主記憶装置ＭＭの読出し、書込みサイクルを
最小演算単位とし、その演算単位の複数倍の演算により
一つの演算機能を完了する。Buses 12 to 15 are switched by bus skiing unit BSU depending on whether the width of the data passing through them is 1 word (16 bits) or 2 words (32 bits). Furthermore, the switching unit BSS can select the bus to be connected between the bus and the storage device MM. A read/write cycle of the main memory device MM is taken as the minimum calculation unit, and one calculation function is completed by multiple times the calculation unit.

いわゆるパイプライン構成の演算装置ＦＡＬＵが設けら
れる。この演算装置内にスイツチが挿入され、その制御
により複数の演算機能が得られる。記憶装置ＭＭ，〜Ｍ
Ｍ３から読出されたデータはバス１４又は１５を通じて
演算装置ＦＡＬＵへ供給され、演算装置ＦＡＬＵにて演
算され、その結果はバス１２又は１３を通じて記憶装置
ＭＭ，〜ＭＭ３の何れかに書込まれる。後の説明から理
解されるように記憶装置ＭＭ３から読出されたデータは
読出バスを通じることなく演算装置ＦＡＬＵに直接供給
することもできる。演算装置ＦＡＬＵにおいてどのよう
な演算を行なうかの演算モードの設定、アドレス発生装
置ＭＡＵの初期アドレス設定、即ち基底アドレスレジス
タＢＡＲｌ，ＢＡＲ２の設定や記憶装置ＭＭｌ〜ＭＭ３
の書込み、読出しなどの制御をマイクロプログラム制御
装置ＣＭにて行なう、このマイクロプログラム制御装置
ＣＭに対し、処理装置（図示せず）からインターフエー
ス装置ＩＮＴを通じて制御プログラムが記憶される。次
に信号ｆ（ｔ）をデジタル的にフーリエ変換する場合の
例として各部の詳細を説明する。先ずフーリエ変換につ
き述べると、一般に信号ｆ（ｔ）はとフーリエ変換でき
る。Ｔはｆ（ｔ）の周期であり、ｊ− −１である。フ
ーリエ係数Ｄ（研まである。An arithmetic unit FALU having a so-called pipeline configuration is provided. A switch is inserted into this arithmetic device, and its control provides a plurality of arithmetic functions. Storage device MM, ~M
The data read from M3 is supplied to the arithmetic unit FALU via the bus 14 or 15, and is operated on by the arithmetic unit FALU, and the result is written to one of the memory devices MM, -MM3 via the bus 12 or 13. As will be understood from the following description, the data read from the memory device MM3 can also be directly supplied to the arithmetic unit FALU without passing through the read bus. Setting the calculation mode for what kind of calculation is to be performed in the calculation unit FALU, setting the initial address of the address generator MAU, that is, setting the base address registers BARl, BAR2, and storing the memory devices MMl to MM3.
A control program is stored in the microprogram control device CM from a processing device (not shown) through an interface device INT to the microprogram control device CM, which controls writing and reading of the data. Next, details of each part will be explained as an example of digitally Fourier transforming the signal f(t). First, regarding Fourier transformation, generally the signal f(t) can be Fourier transformed. T is the period of f(t) and is j--1. Fourier coefficient D

周期Ｔにおけるサンプリング点の数をＮとすると（２）
式は次のように近似できとなる。If the number of sampling points in period T is N, (2)
The formula can be approximated as follows.

この演算を行なえば任意の信号をデジタル的にフーリエ
級数に展開でき、即ち信号解析をデジタル的に行なうこ
とができる。しかし上記演算をいちいち行なうことはそ
の演算量が莫大なものとなり、従来の電子計算機によれ
ば１０２４点のサンプリング点の場合に１時間もの演算
時間がかＸり、実用的でなかつた。この点より信号のサ
ンプリング点の値を適当に組合せ、その和と、その組合
せの差に重みを掛けたものとを順次取出し、得られたデ
ータアレイにつき同様のことを施すことを繰返して比較
的少ない演算回数でデジタルフーリエ変換が行なわれる
ことが知られている。これは例えば、米国特許第３５１
７１７３号ＤｉｇｉｔａｌＰｒＯｃｅｓｓＯｒｆＯｒＰ
ｅｒｆＯｒｍｉｎｇｆａｓｔＦＯｕｒｉｅｒＴｒａｎｓ
ｆＯｒｍｓに述べられているｏこのフーリエ変換手法と
してはクーリーチユキイ法、及びサンデーチユーキイ法
が知られているが、ここでは後者につき説明する。By performing this calculation, any signal can be digitally developed into a Fourier series, that is, signal analysis can be performed digitally. However, performing the above calculations one by one requires a huge amount of calculations, and using a conventional electronic computer, the calculation time for 1024 sampling points is 1 hour, making it impractical. From this point, the values of the sampling points of the signal are appropriately combined, the sum and the weighted difference between the combinations are sequentially extracted, and the same process is repeated for the obtained data array to compare the results. It is known that digital Fourier transform can be performed with a small number of calculations. For example, U.S. Patent No. 351
No.7173DigitalPrOcessOrfOrP
erfOrmingfastFourierTrans
As the Fourier transform method described in fOrms, the Coolichi-Yukiy method and the Sunday Chikyuki method are known, and the latter will be explained here.

このサンデーチユーキイ法は例えば第２図に示すように
（３）式においてＮ＝１６の場合、入力アレイはＸ。For example, as shown in FIG. 2, when N=16 in equation (3), the input array is X.

−Ｘ，５の１６点の各値の時系列となり、第１ステージ
においてはＸ。−Ｘ７とＸ８〜Ｘ，５とがそれぞれ加算
されてＥ。−Ｅ７が得られ、ＸＯ〜Ｘ７からＸ８〜Ｘｌ
５がそれぞれ減算され、その結果の対応するものに対し
てそれぞれ三角関数値Ｗ。〜Ｗ７が乗算されてＥ８〜Ｅ
ｌ５が得られる。Ｗｎは２πＸ７ｅｘｐ（−ｊ？）であ
る。-X, 5 is a time series of each value of 16 points, and in the first stage, X. -X7 and X8 to X,5 are added to each other to obtain E. -E7 is obtained, X8~Xl is obtained from XO~X7
5 are respectively subtracted, and the corresponding trigonometric function values W are respectively subtracted. ~W7 is multiplied to E8~E
l5 is obtained. Wn is 2πX7exp(-j?).

第２ステージにお１ＧいてはＥ。E at 1G in the second stage.

−Ｅ３とＥ４〜Ｅ７とのそれぞれの和Ｄ。〜Ｄ３が、ま
たＥ。−Ｅ３からＥ４〜Ｅ７をそれぞれ減算した値の対
応するものにＷ。．Ｗ２、Ｗ４、Ｗ６がそれぞれ乗算さ
れてＤ４〜Ｄ７が得られる。以下同様のことが繰返され
、第４ステージの演算の結果Ｆ。−Ｆ，５が求まり、こ
れらをＮ＝１６で割算するとフーリエ係数Ｄ（ｏ）〜Ｄ
Ｏ５）が求まる。以上の演算を第３図に示す。この計算
に必要なプログラムがインターフエイス装置１ＮＴを通
じて電子計算機からマイクロプログラム制御装置ＣＭに
取入れられる。- Sum D of each of E3 and E4 to E7. ~D3 is E again. -W to the corresponding value obtained by subtracting each of E4 to E7 from E3. ．． D4 to D7 are obtained by multiplying W2, W4, and W6, respectively. The same process is repeated thereafter, and the result of the fourth stage calculation is F. -F,5 is found, and by dividing these by N=16, the Fourier coefficient D(o) ~ D
O5) can be found. The above calculations are shown in FIG. A program necessary for this calculation is imported from the electronic computer to the microprogram control device CM through the interface device 1NT.

これよりマイクロプログラム匍脚装置ＣＭが起動して、
その演算に必要なデータを電子計算機からインターフエ
イス装置１ＮＴを通じて主記憶装置に格納される。例え
ばＸ。−Ｘ７は主記憶装置ＭＭｌのＯ〜７番地に、Ｘ７
〜Ｘｌ．が主記憶装置ＭＭ２の８〜１５番地にそれぞれ
記憶される。三角関数値Ｗｎは実用的見地から例えば２
πを８１９２の点に分けた場合の値を予め記憶しておき
、サンプリング点数に応じて必要なものを取出す。しか
しＯ〜π／４までの等間隔の１０２４点の各余弦値と正
弦値とが判れば他の各点はこれ等の値の何れか又はそれ
に負号を付けた値となる。よつて主記憶装置ＭＭ３には
第４図に示すようにＯ〜１０２４のｎ πｎ番
地に、ＣＯｓ（？Ｘ−）及び１０２４４ｎ π Ｓｉｎ（？Ｘ−）（但し、ｎはＯ〜１０２４の正整数）
がそれぞれ記憶される。From this, the micro-programmed leg device CM will start,
Data necessary for the calculation is stored in the main memory from the electronic computer through the interface device 1NT. For example, X. -X7 is located at addresses O to 7 of the main memory device MMl;
~Xl. are respectively stored in addresses 8 to 15 of the main memory device MM2. From a practical standpoint, the trigonometric function value Wn is, for example, 2.
Values obtained when π is divided into 8192 points are stored in advance, and necessary values are extracted according to the number of sampling points. However, if the cosine and sine values of 1024 equally spaced points from O to π/4 are known, each of the other points will be one of these values or a value with a negative sign attached to it. Therefore, as shown in FIG. 4, in the main memory device MM3, COs (? )
are respectively memorized.

この状態で第５図のフローチヤートにおける準備処理に
示すように、アドレス初期値が第６図に示すように基底
アドレスレジスタＢＡＲｌ，ＢＡＲ２にセツトされる。In this state, as shown in the preparation process in the flowchart of FIG. 5, initial address values are set in the base address registers BAR1 and BAR2 as shown in FIG.

主記憶装置ＭＭ，には０番地からデータアレイが入つて
いるため、その最初の要素Ｘ。が入つたＯ番地がＢＡＲ
ｌにセツトされ、主記憶装置ＭＭ２には８番地からデー
タアレイが入つているから、その最初の要素Ｘ８が入つ
ている８番地がＢＡＲ２にセツトされる。次にデータア
レイのサイズ即ちデータアレイＸ。〜Ｘｌ５の構成要素
数Ｎの２分の１、この例では８がデータサイズ指示レジ
スタＤＳＩにセツトされる。更に演算ステージの数を示
すステージ数計数用のシフトレジスタＳＣＳＲがセツト
される。この数ｒはデータアレイの構成要素、即ちサン
プリング点の数Ｎに対しＮ＝２ｒで決り、この例ではＮ
＝１６であるからｒ＝４となり、このシフトレジスタＳ
ＣＳＲは１ステージ毎に１シフトする。また演算装置Ｆ
ＡＬＵにおけるパイプラインの長さ、即ち最小演算単位
のいくつで演算が終了するかを示す値をパイプライン指
示レジスタＰＬｌにセツトされる。この例ではパイプラ
イン長は４段であり、従つて初期状態から演算装置ＦＡ
ＬＵに４回データが入力され、演算装置内でのデータの
流れが４段進むと始めて演算結果が演算装置から出、こ
の時から主記憶装置に対し、演算結果を書込む操作が行
なわれるようになる。有効アドレス、即ち基底アドレス
レジスタＢＡＲｌの内容及びデータカウンタＤＣの内容
が第６図の回路１０で加算されたものと、基底アドレス
レジスタＢＡＲ２の内容及びデータカウンタＤＣの内容
が第６図の回路１１で加算されたものとがそれぞれアド
レスレジスタＡＳＲｌ，ＡＳＲ２の初段にそれぞれセツ
トされる。アドレスシフトレジスタＡＳＲｌ及びＡＳＲ
２の第１段目のアドレスにより第６図の，駆動回路１８
及び１９を通じて主記憶装置ＭＭｌ及びＭＭ２の内容が
それぞれ読出され、アドレスシフトレジスタＡＳＲｌ及
びＡＳＲ２とパイプライン指示レジスタＰＬＩにセツト
された値の段のアドレスにより駆動回路１８，１９を通
じて主記憶装置ＭＭｌ，ＭＭ２に書込みが行なわれる。Since the main memory device MM contains a data array starting from address 0, its first element is X. The address O that contains is the BAR.
Since the main memory device MM2 contains a data array starting from address 8, address 8 containing the first element X8 is set to BAR2. Next, the size of the data array, that is, the data array X. 1/2 of the number N of constituent elements of ~Xl5, 8 in this example, is set in the data size instruction register DSI. Furthermore, a shift register SCSR for counting the number of stages indicating the number of arithmetic stages is set. This number r is determined by N=2r for the number N of the constituent elements of the data array, that is, sampling points, and in this example, N
= 16, so r = 4, and this shift register S
The CSR shifts by one for each stage. Also, calculation device F
A value indicating the length of the pipeline in the ALU, ie, the number of minimum units of operation in which the operation ends, is set in the pipeline instruction register PLl. In this example, the pipeline length is 4 stages, so from the initial state the processing unit FA
When data is input to the LU four times and the flow of data within the arithmetic unit advances four stages, the arithmetic result is output from the arithmetic unit, and from this time on, the operation to write the arithmetic result to the main memory is performed. become. The effective address, that is, the contents of the base address register BARl and the contents of the data counter DC added in circuit 10 of FIG. 6, and the contents of the base address register BAR2 and the contents of data counter DC added in circuit 11 of FIG. The added values are set in the first stages of address registers ASR1 and ASR2, respectively. Address shift registers ASRl and ASR
2, the drive circuit 18 in FIG.
The contents of the main memories MMl and MM2 are read out through the drive circuits 18 and 19, respectively, and the contents of the main memories MMl and MM2 are read out through the drive circuits 18 and 19 according to the stage address of the value set in the address shift registers ASRl and ASR2 and the pipeline instruction register PLI. Writing is performed to.

演算装置ＦＡＬＵにおける演算のステツプ状態を示すパ
イプラインシフトレジスタＰＬＳＲが設けられる。デー
タカウンタＤＣの出力１Ｘ１又はそのビツト順位を入れ
替えた出力１Ｘ２の何れを選択するかを示すインデツク
ス選択レジスタＩＸＭＣＲがセツトされる。またバスス
キユーユニツトＢＳＵや切替ユニツトＳＫを制御設定す
るバス制御レジスタＢＣＲに対し、記憶装置ＭＭｌ，Ｍ
Ｍ，を演算装置ＦＡＬＵに接続するためのセツト、更に
記憶装置ＭＭｌ〜ＭＭ２の読出し、書込みの制御を行な
うメモリ制御レジスタＭＣＲのセツトがそれぞれ行なわ
れ、記憶装置ＭＭ３については読出しのみセツトされる
。演算装置ＦＡＬＵにおける演算機能の設定が演算制御
レジスタＦＣＲｌに、また装置ＦＡＬＵ内のスイツチの
制御が演算制御レジスタＦＣＲ２にそれぞれセツトされ
る。最後にデータカウンタＤＣがクリアされる。なお上
述の各種のセツトに必要とするデータは予め電子計算機
からマイクロプログラム制御装置ＣＭに取入れられてあ
る。次に第５図に示すように第１ステージの前段処理に
おいてパイプライン指示レジスタＰＬＩの設定値とパイ
プラインシフトレジスタＰＬＳＲが示すパイプラインの
進行値とが比較され、最初にＰＬＩは４であるがＰＬＳ
Ｒは１であつて一致せずアドレスシフトレジスタＡＳＲ
ｌ及びＡＳＲ２の１段目のアドレスにて主記憶装置ＭＭ
，，ＭＭ２が同時に読出され、即ちＸ。A pipeline shift register PLSR is provided which indicates the step state of the operation in the arithmetic unit FALU. An index selection register IXMCR is set which indicates which of the output 1X1 of the data counter DC or the output 1X2 with its bit order switched is to be selected. In addition, the bus control register BCR, which controls and sets the bus skew unit BSU and switching unit SK, is
The memory control register MCR is set for connecting the memory device MM1 to the arithmetic unit FALU, and the memory control register MCR for controlling reading and writing of the memory devices MM1 to MM2 is set.The memory device MM3 is set only for reading. The setting of the arithmetic function in the arithmetic unit FALU is set in the arithmetic control register FCR1, and the control of the switch in the device FALU is set in the arithmetic control register FCR2. Finally, the data counter DC is cleared. It should be noted that the data required for the above-mentioned various sets have been previously input into the microprogram control device CM from the electronic computer. Next, as shown in FIG. 5, in the pre-processing of the first stage, the setting value of the pipeline instruction register PLI and the pipeline progress value indicated by the pipeline shift register PLSR are compared, and initially PLI is 4, but PLS
R is 1 and does not match address shift register ASR
Main memory device MM at the first stage address of l and ASR2
,,MM2 are read simultaneously, ie, X.

及びＸ８がそれぞれ読出され、読出バスを通じて演算装
置ＦＡＬＵに与えられる。この読出しの後にデータカウ
ンタＤＣは＋１され、この出力が第６図の回路１０，１
１へ与えられる。またパイプラインシフトレジスタＰＬ
ＳＲも１シフトされる。その後ＰＬＩの内容とＰＬＳＲ
の数とが比較され、同様の操作が行なわれる。演算装置
ＦＡＬＵは例えば第７図に示すように入力端子２１，２
２は第６図の読出バス１４，１５にそれぞれ接続され、
出力端子２３，２４は第６図の書込バス１２，１３にそ
れぞれ接続される。and X8 are read out and applied to the arithmetic unit FALU via the read bus. After this reading, the data counter DC is incremented by 1, and this output is output from the circuits 10 and 1 in FIG.
given to 1. Also, pipeline shift register PL
SR is also shifted by one. After that, the contents of PLI and PLSR
, and similar operations are performed. For example, the arithmetic unit FALU has input terminals 21 and 2 as shown in FIG.
2 are respectively connected to read buses 14 and 15 in FIG.
Output terminals 23 and 24 are connected to write buses 12 and 13, respectively, in FIG. 6.

入力端子２１はスイツチＳ７−Ｓ，−ラツチ（例えばフ
リツプフロツプ、又は遅延回路）ＬｌＬ２−スイツチＳ
３一加減算回路２５−スイツチＳ２−ラツチＬ３−Ｌ４
を通じて出力端子２３に接続され、入力端子２２はスイ
ツチＳ８−ラツチＬ５−スイツチＳ１−ＳｌＯ−ラツチ
Ｌ６−スイツチＳ６−加減算回路２６−スイツチＳ４一
乗算回路２７ラツチＬ７−スイツチＳ２−ラツチＬ８を
通じて出力端子２４に接続される。ラツチＬ２の出力側
はスイツチＳ３を通じて加減算回路２６の他方の入力側
にも接続され、ラツチＬ６の出力側はスイツチＳ６を通
じて加減算回路２５の他方の入力側にも接続される。ス
イツチＳ１及びＳ２は連動とされ、上記接続のように実
線で示す切替位置の他に点線で示すようにスイツチＳ７
がスイツチＳｌＯに、ラツチＬ５の出力側がラツチＬ，
に、また加減算回路２５の出力側がラツチＬ８の入力１
１１に、ラツチＬ７の出力側がラツチＬ３の入力側にそ
れぞれ接続される。入力端子２８は主前憶装置ＭＭ３の
読出し出力側に直接接続され、端子２８はスイツチＳ５
−ラツチＬ９−スイツチＳ９−ラツチＬｌＯを通じて乗
算回路２７の他方の入力側に接続される。いま主記憶装
置ＭＭｌ，ＭＭ２はそれぞれ読出バス１４，１５に、ま
た書込バス１２，１３に接続されているとする。上記第
５図の第１ステージの前段処理において先ずＸ。及びＸ
８が読出されると、ＸＯはラツチＬ，に、Ｘ８はラツチ
Ｌ５に入る。この時主記憶装置ＭＭ３からＷ。が読出さ
れてラツチＬ９に入る。次の演算サイクルではアドレス
シフトレジスタＡＳＲｌ及びＡＳＲ２の初段は１番地及
び９番地となり、主記憶装置ＭＭｌ，ＭＭ２，ＭＭ３か
らＸｌ，Ｘ，，Ｗｌがそれぞれ読出されて、ラツチＬ，
，Ｌ５，Ｌ９に入り、ＸＯ，Ｘ８，ＷＯはラツチＬ２，
Ｌ６，ＬｌＯに入る。前記準備処理の演算モード指定に
より演算制御回路ＣＴＬｌ，ＣＴＬ２よりの制御にて加
減算回路２５は加算動作に加減算回路２６は減算動作と
なつている。よつて加減算回路２５にてＸ。＋Ｘ８＝Ｅ
Ｏが加減算回路２６にてＸ。−Ｘ８＝Ｅｌが計算され、
更にＥｉＸＷＯＥ８が乗算回路２７で行なわれる。次の
演算サイ）クルにおいて同様にしてＸ２，ＸｌＯ，Ｗ２
が読出されてラツチＬ１？Ｌ５２Ｌ９に入り、Ｘ盟Ｘ９
７ＷｌはラツチＬ２，Ｌ６，ＬｌＯに移り、Ｘ，＋Ｘ９
＝Ｅ１、Ｘ，−Ｘ，＝Ｅｄ．ＥｄＸＷｌ＝Ｅ，が演算さ
れ、ＥＯ，Ｅ８はラツチＬ３，Ｌ７に入る。Input terminal 21 is connected to switch S7-S, -latch (for example, flip-flop or delay circuit) LlL2-switch S.
3-addition/subtraction circuit 25-switch S2-latch L3-L4
The input terminal 22 is connected to the output terminal 23 through switch S8 - latch L5 - switch S1 - SlO - latch L6 - switch S6 - addition/subtraction circuit 26 - switch S4 - multiplication circuit 27 latch L7 - switch S2 - latch L8. 24. The output of latch L2 is also connected to the other input of addition/subtraction circuit 26 through switch S3, and the output of latch L6 is also connected to the other input of addition/subtraction circuit 25 through switch S6. The switches S1 and S2 are interlocked, and in addition to the switching position shown by the solid line as in the above connection, the switch S7 is shown as the dotted line.
is connected to switch SlO, the output side of latch L5 is connected to latch L,
Also, the output side of the adder/subtracter circuit 25 is connected to the input 1 of the latch L8.
11, the outputs of latch L7 are respectively connected to the inputs of latch L3. The input terminal 28 is directly connected to the read output side of the main memory device MM3, and the terminal 28 is connected directly to the read output side of the main memory device MM3.
It is connected to the other input side of the multiplier circuit 27 through - latch L9 - switch S9 - latch L1O. It is now assumed that main memories MMl and MM2 are connected to read buses 14 and 15 and to write buses 12 and 13, respectively. In the pre-processing of the first stage in FIG. 5, first, X. and X
When 8 is read, XO goes into latch L, and X8 goes into latch L5. At this time, W from main memory device MM3. is read out and entered into latch L9. In the next operation cycle, the first stages of address shift registers ASRl and ASR2 are addresses 1 and 9, and Xl, X, , Wl are read from main memories MMl, MM2, MM3, respectively, and latches L,
, L5, enter L9, XO, X8, WO enter latch L2,
Enter L6, LIO. According to the arithmetic mode designation of the preparatory process, the adder/subtracter circuit 25 performs an addition operation and the adder/subtracter circuit 26 performs a subtraction operation under the control of the arithmetic control circuits CTLl and CTL2. Therefore, the addition/subtraction circuit 25 outputs "X". +X8=E
O becomes X in the addition/subtraction circuit 26. −X8=El is calculated,
Furthermore, EiXWOE8 is performed in the multiplication circuit 27. Similarly, in the next calculation cycle, X2, XlO, W2
is read and the latch L1? Enter L52L9, X Alliance X9
7Wl moves to latches L2, L6, LlO, X, +X9
=E1,X,-X,=Ed. EdXWl=E, is calculated, and EO and E8 enter latches L3 and L7.

更に次に演算サイクルにはＸ３，Ｘｌｌ，Ｗ３が読出さ
れ、ラツチＬｌ，Ｌ５，Ｌ９に入り、その他のデータは
順次出力側に移り、ラツチＬ４，Ｌ８にはＥ。，Ｅ８が
それぞれ入力される。この時パイプラインシフトレジス
タＰＬＳＲは４を計数し、パイプライン指定レジスタＰ
ＬＩの値と一致し、演算装置ＦＡＬＵのパイプラインが
全て詰まつたことが検出される。よつてこれまでは各演
算サイクルにおいて、記憶装置についての読出し、書込
みモード沖の書込みモードが阻止されていたが、この書
込みモードも有効になり、アドレスシフトレジスタＡＳ
Ｒｌ，ＡＳＲ２のパイプライン指示レジスタＰＬＩが示
す値、即ち４段目に在るアドレスＯ番地及び８番地にＥ
。及びＥ８がそれぞれ書込まれる。この状態から第５図
における第１ステージ処理に入る。Furthermore, in the next operation cycle, X3, Xll, and W3 are read out and entered into latches L1, L5, and L9, and other data are sequentially transferred to the output side, and E is input to latches L4 and L8. , E8 are respectively input. At this time, the pipeline shift register PLSR counts 4, and the pipeline specification register P
This matches the value of LI, and it is detected that the pipeline of the arithmetic unit FALU is completely clogged. Therefore, up until now, the read mode and write mode for the storage device were blocked in each calculation cycle, but this write mode is now also enabled, and the address shift register AS
The value indicated by the pipeline instruction register PLI of Rl, ASR2, that is, the address O and 8 in the fourth stage are set to E.
. and E8 are written respectively. From this state, the first stage processing shown in FIG. 5 begins.

この段階ではデータカウンタＤＣの内容とデータサイズ
指示レジスタＤＳＩの内容Ｎ／２とが比較され、一致し
ない場合は主記憶装置ＭＭ，，ＭＭ２からアドレスシフ
トレジスタＡＳＲｌ，ＡＳＲ２の初段内のアドレスで同
時に読出されて演算装置ＦＡＬＵへ供給され、つづいて
演算装置ＦＡＬＵの演算結果がその出力端子２３，２４
からそれぞれ主記憶装置ＭＭｌ，ＭＭ２のＡＳＲｌ，Ａ
ＳＲ２の４段目内のアドレスにそれぞれ書込まれる。そ
の書込の後にデータカウンタＤＣは１加算され、その値
がＤＳＩの設定値と一致するか否か調べられ、以下同様
のことが繰返されて、第３図の第１ステージの演算が次
々に行なわれる。乗算回路２７において乗算する重み、
即ち三角関数値、いわゆる回転因子Ｗｎは上述したよう
に、ＣＯｓＯ〜ＣＯｓπ／４及びＳｉｎＯ−Ｓｉｎπ／
４の値１ πが？Ｘ−ごとに主記憶装置ＭＭ３に
その番地順に記憶されている。At this stage, the contents of the data counter DC and the contents N/2 of the data size instruction register DSI are compared, and if they do not match, they are read out simultaneously from the main memories MM, MM2 at the addresses in the first stage of address shift registers ASRl and ASR2. The calculation result of the calculation unit FALU is then supplied to the output terminals 23 and 24.
ASRl and A of main memories MMl and MM2, respectively.
Each address is written in the fourth stage of SR2. After writing, the data counter DC is incremented by 1, and it is checked whether the value matches the set value of DSI, and the same process is repeated, and the calculations in the first stage in Fig. 3 are performed one after another. It is done. weights to be multiplied in the multiplication circuit 27;
That is, the trigonometric function value, the so-called twiddle factor Wn, is COsO~COsπ/4 and SinO−Sinπ/4, as described above.
The value of 4 is 1 π? Each X- is stored in the main memory device MM3 in the order of its address.

入力データアレイのサイズＮ−２ｍが１０２４Ｘ４＝１
０１２以下の場合は、その指数ｍに応じて必要な回転因
子だけを飛越して読出して利用する。このため第６図の
アドレス補正部ＴＷＡにおいて第ｒ番目のステージにお
いて（ｒ−１）＋（１３−ｍ）だけ、データカウンタＤ
Ｃの出力１Ｘ１又はそのビツト位置を変更した出力１Ｘ
２を左へシフトし、下位ビツトを０とし、即ちそのＯと
した番地を読出さないようにする。但しアドレス補正部
ＴＷＡは１２ビツトの容量であつて、これよりオーバー
フローしたものは無視する。従つて上記例のようにＮ＝
１６＝２４で第１ステージｒ＝１の場合は、１−１＋１
３４＝９ビツトだけアドレスカウンタからのＩＸｌは左
にシフトされ、主記憶装置ＭＭ３の２９ごとの番地を読
出す。このアドレス補正部ＴＷＡは例えばインデクスＩ
Ｘｌ又はＩＸ２の各ビツトが一組のゲートに入力され、
そのゲート出力は所定数ビツトシフトされたものとなり
、このゲートをステージ計数用シフトレジスタＳＣＳＲ
の対応する出力段にて開し、各ステージに対応して上記
１Ｘ１又はＩＸ２が入力され、所定のビツトシフトされ
た出力が得られるゲートが設けられ、このゲートが各ス
テージごとに一組だけ開けられるように論理回路で構成
される。第８図に示すようにＯ〜πの間はＳｉｎＯは正
であり、ＣＯｓθはＯ〜π／２では正であるがπ／２〜
πの間は負である。The size of the input data array N-2m is 1024X4=1
012 or less, only the necessary twiddle factors are skipped and read and used according to the index m. Therefore, in the address correction unit TWA shown in FIG.
Output 1X1 of C or output 1X with changed bit position
2 is shifted to the left and the lower bit is set to 0, that is, the address set to O is not read out. However, the address correction unit TWA has a capacity of 12 bits, and any overflow beyond this capacity is ignored. Therefore, as in the above example, N=
If 16=24 and first stage r=1, then 1-1+1
IXl from the address counter is shifted to the left by 34=9 bits to read every 29th address in main memory MM3. For example, this address correction unit TWA
Each bit of Xl or IX2 is input to a set of gates,
The gate output is shifted by a predetermined number of bits, and this gate is transferred to the stage counting shift register SCSR.
A gate is provided which is opened at the corresponding output stage, inputs the above 1X1 or IX2 corresponding to each stage, and obtains a predetermined bit-shifted output, and only one set of these gates is opened for each stage. It is composed of logic circuits like this. As shown in Figure 8, SinO is positive between O and π, and COsθ is positive between O and π/2, but between π/2 and
It is negative between π.

よつてアドレス補正部ＴＷＡの出力アドレス中の１１ビ
ツト目及び１２ビツト目が共に６１″又は共に″Ｏ゛の
場合はＯ〜π／２の間であつてＣＯｓθは正とし、１１
ビツト目及び１２ビツトの一方が″ｒ”で、他方が６０
゛の場合はπ／２〜πの間であつて、ＣＯｓθは負とさ
れる。また第８図においてＣＯｓθの曲線３０とＳｉｎ
θの曲線３１とを比較すれば理解されるようにＣＯｓθ
のπ／４〜π／２の絶対値ぱ８１ｎθのπ／４〜０と等
しく、従つてＣＯｓθについてはＯ〜π／４を２等〜２
１０番地まで読出し、そのＣＯｓθの値を使用し、π／
４〜π／２の間は２１０〜２ｓ番地と逆に読出してその
時のＳｉｎθの値を使用する。更にＣＯｓθはπ／２〜
３π／４では２使〜２１０番地を順次読出してその時の
Ｓｉｎθの値に負符号を付け、３π／４〜πでは２１０
〜２θ番地を読出してそのＣＯｓθの値に負符号を付け
ればよい。Ｓｉｎθについてもπ／４〜π／２では２１
０〜２。番地と逆に読出してその時のＣＯｓθの値を使
用し、π／２〜３π／４では２の〜２１０番地のＣＯｓ
θを、３π／４〜πでは２１０〜２２番地のＳｉｎθ値
をそれぞれ使用すればよい。ＣＯｓθ、Ｓｉｎθの何れ
についても、２す〜２１０番地を読出したら、次にそれ
を逆順に読出すことを繰返せばよいことになり、またそ
の時、記憶中のＣＯｓθとＳｉｎθの値を、実際の回転
因子Ｗｎｆ）ＣＯｓとするかＳｉｎとするかの関係は上
述の関係から、記憶装置ＭＭ３に対するアドレスビツト
中の１０ビツト目及び１１ビツト目が共に１ｒ”又は共
に″０”の場合は記憶装置ＭＭ３のＣＯｓθ及びＳｉｎ
θをそれぞれ回転因子Ｗｎの実数部及び虚数部とし、一
方が゛１―他方が”Ｏ゛の場合はＣＯｓθ及びＳｉｎθ
をそれぞれＷｎの虚数部及び実数部とする。主記憶装置
ＭＭ３から読出されたＣＯｓθ及びＳｉｎθは回転因子
入替制御部ＴＷＳＣにおいて、アドレスの１０ビツト目
及び１１ビツト目の排他的論理和の結果に応じてＣＯｓ
θ、Ｓｉｎθを実数部にするか虚数部にするかの入替え
が行なわれると共に、１１ビツト目及び１２ビツト目の
排他的論理和の結果により、その実数部の正負符号が決
定される。その入替匍智部ＴＷＳＣの出力が演算装置Ｆ
ＡＬＵの入力端子２８に与えられる。上述のようにして
主記憶装置ＭＭｌ〜ＭＭ３を読出し、演算を行ない、ま
た書込み操作を行なうが、その場合の第１ステージの処
理及びその前段処理におけるステツプ及びデータカウン
タＤＣの内容、アドレスシフトレジスタＡＳＲｌ，ＡＳ
Ｒ２に与えられる内容（以下インデクスと称す）、記憶
装置ＭＭ，，ＭＭ２の読出し番地、アドレス補正部ＴＷ
Ａの出力（以下ＴＷインデクスと呼ぶ）、主記憶装置Ｍ
Ｍ３の読出番地、回転因子入替制御部ＴＷＳＣの出力は
第９図の第１ステージの表のようになる。Therefore, if the 11th and 12th bits of the output address of the address correction unit TWA are both 61'' or both are ``O'', it is assumed that COsθ is between O and π/2 and is positive, and 11
One of the 1st and 12th bits is "r" and the other is 60
In the case of ', it is between π/2 and π, and COsθ is negative. In addition, in FIG. 8, the curve 30 of COsθ and the curve 30 of Sin
As can be understood by comparing the curve 31 of θ, COsθ
The absolute value of π/4 to π/2 is equal to π/4 to 0 of 81nθ, so for COsθ, O
Read up to address 10, use the value of COsθ, and calculate π/
4 to π/2 is read in the opposite way to addresses 210 to 2s, and the value of Sin θ at that time is used. Furthermore, COsθ is π/2~
For 3π/4, read addresses 2 to 210 sequentially and add a negative sign to the value of Sinθ at that time, and for 3π/4 to π, 210
It is sufficient to read the address ˜2θ and add a negative sign to the value of COsθ. Regarding Sinθ, it is also 21 for π/4 to π/2.
0-2. Read the address in reverse and use the value of COsθ at that time, and for π/2 to 3π/4, COs at addresses 2 to 210
For θ, the sin θ values at addresses 210 to 22 may be used for 3π/4 to π, respectively. For both COsθ and Sinθ, once addresses 2 to 210 have been read, all that is required is to read them in reverse order, and at that time, the stored values of COsθ and Sinθ can be changed to the actual values. The relationship between the twiddle factor Wnf) COs and Sin is based on the above relationship. If the 10th and 11th bits of the address bits for the storage device MM3 are both 1r" or both "0", the storage device MM3 COsθ and Sin
Let θ be the real part and imaginary part of the twiddle factor Wn, respectively, and if one is "1" and the other is "O", COsθ and Sinθ
Let be the imaginary part and the real part of Wn, respectively. COsθ and Sinθ read from the main memory device MM3 are stored in the twiddle factor exchange control unit TWSC according to the exclusive OR result of the 10th and 11th bits of the address.
θ and Sin θ are switched between real and imaginary parts, and the sign of the real part is determined based on the result of the exclusive OR of the 11th and 12th bits. The output of the replacement part TWSC is the arithmetic unit F.
It is applied to the input terminal 28 of the ALU. As described above, the main memories MMl to MM3 are read, arithmetic operations are performed, and write operations are performed, but in this case, the steps in the first stage processing and the preceding stage processing, the contents of the data counter DC, and the address shift register ASRl are ,AS
Contents given to R2 (hereinafter referred to as index), read addresses of memory devices MM, MM2, address correction unit TW
Output of A (hereinafter referred to as TW index), main memory M
The read address of M3 and the output of the twiddle factor exchange control unit TWSC are as shown in the table of the first stage in FIG.

第５図のフローチヤートにおける第１ステージの処理を
行ない、データカウンタＤＣの内容がデータサイズ指示
レジスタＤＳＩの設定値Ｎ／２になつた時は、主記憶装
置ＭＭｌ，ＭＭ２からデータＸ７，Ｘｌ５がそれぞれ演
算装置ＦＡＬＵに送られ、第１ステージのデータは全部
読出されたことになる。このデータＸ７，Ｘｌ５に対す
る演算結果が記瞳装置ＭＭ，，ＭＭ２に書込まれるまで
はＭＭｌ，ＭＭ２の読出しは中止される。即ち第５図の
第１ステージ後段処理に示すようにＤＳ一ＤＣがＹＥＳ
になると、パイプラインシフトレジスタＰＬＳＲはクリ
アされ、これとパイプライン指示レジスタＰＬＩの内容
とが一致するまでは演算装置ＦＡＬＵの演算結果がアド
レスシフトレジスタＡＳＲｌ，ＡＳＲ２の４段目のアド
レスで指定された記憶装置ＭＭ，，ＭＭ２内にそれぞれ
書込まれ、次にパイプラインシフトレジスタＰＬＳＲが
１段シフトされ、これとパイプライン指示レジスタＰＬ
Ｉの内容とが比較されることが繰返される。このように
してデータＸ７，Ｘ，５に対する演算結果が記憶装置Ｍ
Ｍｌ，ＭＭ２に書込まれた時は、ＰＬＩ＝ＰＬＳＲがＹ
ＥＳとなる。これより第２ステージに入るがそのための
準備処理が行なわれる。When the first stage processing in the flowchart of FIG. 5 is performed and the content of the data counter DC reaches the set value N/2 of the data size instruction register DSI, data X7 and Xl5 are transferred from the main memories MMl and MM2. Each of the data is sent to the arithmetic unit FALU, and all the data of the first stage has been read out. Reading of MMl and MM2 is stopped until the calculation results for the data X7 and Xl5 are written into the pupil recording devices MM, MM2. That is, as shown in the first stage post-processing of FIG. 5, DS-DC is YES.
When this happens, the pipeline shift register PLSR is cleared, and until this matches the contents of the pipeline instruction register PLI, the operation result of the arithmetic unit FALU is specified by the fourth stage address of the address shift registers ASR1 and ASR2. are written into the storage devices MM, , MM2, and then the pipeline shift register PLSR is shifted by one stage, and this and the pipeline instruction register PLSR are written.
The comparison with the contents of I is repeated. In this way, the calculation results for data X7, X, 5 are stored in the storage device M.
When written to Ml, MM2, PLI=PLSR is Y
It becomes ES. The second stage is now entered, and preparation processing for that stage is performed.

まず第６図におけるインデクス選択レジスタＩＸＭＣＲ
にＩＸｌに代つてビツト順変更回路Ｘの出力１Ｘ２がイ
ンデクスとしてアドレスシフトレジスタＡＳＲｌ，ＡＳ
Ｒ２アドレス補正部ＴＷＡに供給されるようにセツトさ
れる。このＩＸ２と基底アドレスレジスタＢＡＲｌ，Ｂ
ＡＲ２の内容とが回路１０，１１にてそれぞれ加算され
て、アドレスシフトレジスタＡＳＲｌ，ＡＳＲ２の初段
にそれぞれセツトされる。また演算装置ＦＡＬＵのモー
ド及びスイツチ制御が指定され、この場合はモードは上
述と同時にフーリエ演算であり、スイツチＳｌ，Ｓ２の
切替が各演算ステツプごとに省なわれるモードになるよ
うにレジスタＦＣＲ２が設定される。この第２ステージ
における演算は、第２図及び第３図から理解されるよう
に、ＥＯ＋Ｅ４、（ＥＯ一Ｅ４）ＷＯ，．Ｅ８＋Ｅｌ２
、（Ｅ８−Ｅ，２）ＷＯなどである。First, the index selection register IXMCR in FIG.
Then, instead of IXl, the output 1X2 of the bit order change circuit
It is set so that it is supplied to the R2 address correction section TWA. This IX2 and base address register BARl,B
The contents of AR2 are added in circuits 10 and 11, respectively, and set in the first stages of address shift registers ASR1 and ASR2, respectively. In addition, the mode and switch control of the arithmetic unit FALU are specified; in this case, the mode is Fourier computation as described above, and register FCR2 is set so that switching of switches Sl and S2 is omitted for each computation step. be done. As can be understood from FIGS. 2 and 3, the calculations in this second stage are EO+E4, (EO-E4)WO, . E8+El2
, (E8-E,2)WO, etc.

所で上述したように第１ステージにおける演算結果中の
Ｅ。−Ｅ７を記憶装置ＭＭｌのＯ〜７番地にＥ８〜Ｅ，
５を記憶装置ＭＭ２の８〜１５番地にそれぞれ記憶して
おくと、第２ステージにおける主記憶装置ＭＭｌ，ＭＭ
２に対するアドレスの発生が極めて容易になる。即ち一
般にデータサイズＮ＝２ｒの場合に第ｒ番目のステージ
にはビツト入替回路１ＸにおいてデータカウンタＤＣの
出力中の下位の（ｍ−ｒ＋１）ビツトのビツト配列を反
転させればよい。このビツト入替回路１Ｘは例えばデー
タカウンタＤＣの各ビツトがそれぞれ一組のゲートに供
給され、そのゲート中の所定の下位ビツトはビツト順位
が入替えられるようにゲートの出力側が接続され、この
一組のゲートはステージカウンタ用シフトレジスタＳＣ
ＳＲの所定の段の出力で．開かれ、同様に各ステージに
対応して下位ビツトが入替えられたゲートが設けられ、
レジスタＳＣＳＲの状態により、一組のゲートだけが自
動的に開けられる。上記Ｎ＝１６においてはｍ＝４であ
り、その第２ステージでは下位の４−２＋１＝３だけビ
ツト順位を反転させることになり、その反転結果は第９
・図における第２ステージのインデツクスＩＸとして示
され、これよりこの第２ステージの演算ステツプの第１
番目においてＯ番地及び８番地が記憶装置ＭＭ，，ＭＭ
２からそれぞれＥ。By the way, as mentioned above, E in the calculation result in the first stage. −E7 to addresses O to 7 of the storage device MMl, E8 to E,
5 in addresses 8 to 15 of the storage device MM2, the main storage devices MMl and MM in the second stage
2 becomes extremely easy to generate. That is, in general, when the data size N=2r, the bit arrangement of the lower (m-r+1) bits in the output of the data counter DC may be inverted in the r-th stage in the bit switching circuit 1X. In this bit switching circuit 1X, for example, each bit of a data counter DC is supplied to a set of gates, and the output side of the gate is connected so that the bit order of a predetermined lower bit in the gate is switched. Gate is shift register SC for stage counter
At the output of a given stage of SR. Gates are opened and the lower bits are replaced corresponding to each stage.
Depending on the state of register SCSR, only one set of gates is automatically opened. In the case of N=16 above, m=4, and in the second stage, the bit order is inverted by the lower order by 4-2+1=3, and the result of the inversion is the 9th bit order.
・It is shown as index IX of the second stage in the figure, and from this, the first calculation step of this second stage is
At the th address O and 8 are the storage devices MM, MM
2 to E respectively.

及びＥ８が読出されて、演算装置のラツチＬｌ，Ｌ５に
与えられる。次に４番地及び１２番地からＥ４及びＥｌ
２がそれぞれ読出され、この時演算装置のスイツチＳｌ
，Ｓ２は点線のように切替つているからラツチＬ６及び
Ｌ５に入り、ＥＯ，Ｅ８はＬ２，Ｌｌにそれぞれ入る。
よつてＬ２のＥ。とＬ６のＥ４とが演算回路２５，２６
，２７において演算され、ＥＯ＋Ｅ４＝ＤＯ、（ＥＯ−
Ｅ４）ＷＯ−Ｄ４がそれぞれ演算される。次に２番地及
び１０番地からＥ２及びＥｌＯがそれぞれ読出され、ス
イツチＳｌ，Ｓ２は実線となつているためＬｌ，Ｌ５に
入り、Ｅ８，Ｅ，２がＬ２，Ｌ６にそれぞれ入り、Ｅ８
＋Ｅｌ２−Ｄ８、（Ｅ８一Ｅｌ２）ＷＯ−Ｄ２が演算さ
れ、先のＤ。，Ｄ４はＬ３，Ｌ７に入る。次に６番地及
び１４番地からＥ６及びＥ，４がそれぞれ読出され、Ｌ
６，Ｌ５にそれぞれ入り、Ｅ２＋Ｅ６＝Ｄ２、（Ｅ２−
Ｅ６）Ｗ４＝Ｄ６が演算され、ＤＯ，Ｄ４はＬ４，Ｌ３
に、Ｄ８，Ｄ，２はＬ８，Ｌ７にそれぞれ入る。この時
、パイプライン指示レジスタＰＬＩの内容とパイプライ
ンシフトレジスタＰＬＳＲの計数値とが一致し、即ち演
算装置ＦＡＬＵのパイプラインが全部詰まり、第２ステ
ージの前段処理が終る。よつて記憶装置ＭＭｌ，ＭＭ２
に対してはそれまでは読出のみ行なわれたが、次の演算
ステツプからＭＭｌ，ＭＭ２に対する書込みも行なわれ
る。この場合、，上記スイツチＳｌ，Ｓ２の操作により
端子２３，２４にはＤ２，Ｄ８が現われ、アドレスシフ
トレジスタＡＳＲｌＡＳＲ２の４段目の内容が示す記憶
装置ＭＭｌ，ＭＭ２のＯ番地及び８番地にそれぞれ書込
まれる。このようにして互に演算されるべきものでない
が同時に二つのデータを読出し、同時に二つの演算結果
を得込むことができ、高速度で演算が行なわれる。第２
ステージのデータＥｌｌ及びＥｌ５が読出され、データ
サイズ指示レジスタＤＳＩの内容とデータカウンタＤＣ
の内容とが一致すると、ステージシフトレジスタＳＣＳ
Ｒが１シフトされて第３ステージの演算処理に移る。こ
の場合第６図のステージシフトレジスタＳＣＳＲの出力
によりビツト入替回路１Ｘ及びアドレス補正回路ＴＷＡ
のｒが変化される。よつて第３ステージにおいては回路
１Ｘでは下２ビツトだけがビツト逆順とされ、ＴＷＡで
は１１ビツト左シフトされる。and E8 are read out and applied to latches L1 and L5 of the arithmetic unit. Next, from addresses 4 and 12, E4 and El
2 are respectively read out, and at this time the switch Sl of the arithmetic unit is
, S2 are switched as shown by the dotted lines, so they enter latches L6 and L5, and EO and E8 enter latches L2 and Ll, respectively.
Therefore, E of L2. and E4 of L6 are arithmetic circuits 25 and 26
, 27, EO+E4=DO, (EO−
E4) WO-D4 are calculated respectively. Next, E2 and ElO are read from addresses 2 and 10, respectively, switches Sl and S2 are solid lines, so they go into Ll and L5, E8, E, and 2 go into L2 and L6, respectively, and E8
+El2-D8, (E8-El2)WO-D2 is calculated, and the previous D is calculated. , D4 enters L3 and L7. Next, E6 and E,4 are read from addresses 6 and 14, respectively, and L
6 and L5 respectively, E2+E6=D2, (E2-
E6) W4=D6 is calculated, DO, D4 are L4, L3
Then, D8, D, and 2 enter L8 and L7, respectively. At this time, the contents of the pipeline instruction register PLI and the count value of the pipeline shift register PLSR match, that is, the pipeline of the arithmetic unit FALU is completely clogged, and the pre-processing of the second stage ends. Therefore, storage devices MMl, MM2
Until then, only reading was performed for MM1 and MM2, but from the next calculation step, writing to MM1 and MM2 will also be performed. In this case, D2 and D8 appear on the terminals 23 and 24 by operating the switches Sl and S2, and are written to addresses O and 8 of the storage devices MMl and MM2 indicated by the contents of the fourth stage of the address shift register ASRlASR2, respectively. be included. In this way, two pieces of data, which should not be calculated on each other, can be read out at the same time and two calculation results can be obtained at the same time, and calculations can be performed at high speed. Second
Stage data Ell and El5 are read out, and the contents of the data size instruction register DSI and the data counter DC are read out.
If the contents of the stage shift register SCS match, the stage shift register SCS
R is shifted by 1 and the process moves to the third stage of arithmetic processing. In this case, the bit switching circuit 1X and the address correction circuit TWA are activated by the output of the stage shift register SCSR shown in FIG.
r is changed. Therefore, in the third stage, only the lower two bits in circuit 1X are reversed in bit order, and in TWA they are shifted to the left by 11 bits.

この状態で第２ステージと同様の動作が行なわれる。こ
の場合の各種のアドレスなどは第９図の第３ステージの
ようになる。同様にして第４ステージも処理される。こ
の例ではｍ＝４であり，、ステージ数の最大は４である
から、第４ステージにおいてデータＣｌ４，Ｃｌ５が読
出されると、ステージシフトレジスタＳＣＳＲがシフト
され、よつてこれがオーバーフローする。その後は演算
装置ＦＡＬＵ内に残つているデータの処理を行なう最終
処理に移り、これは第１ステージの後段処理と同様であ
るがパイプラインシフトレジスタＰＬＳＲがパイプライ
ン指示レジスタＰＬＩの内容と一致するとすべての演算
が終了する。このようにして１６個のサンプリング点Ｘ
。−Ｘ，５にて代表された信号ｆ（ｔ）のフーリエ変換
におけるフーリエ係数Ｆ。−Ｆｌ５が得られる。フーリ
エ逆変換を行なうには演算装置ＦＡＬＵにおけるスイツ
チＳ９を虚数部符号反転切替回路５０側にセツトしてラ
ツチＬ９の出力はこの回路５０を通じて虚数部の符号が
反転されてラツチＬ，Ｏへ供給されるようにモード設定
を行なえばよい。In this state, the same operation as in the second stage is performed. In this case, various addresses etc. are as shown in the third stage of FIG. The fourth stage is processed in the same way. In this example, m=4, and the maximum number of stages is 4, so when data Cl4 and Cl5 are read in the fourth stage, the stage shift register SCSR is shifted, and thus overflows. After that, the process moves to the final process of processing the data remaining in the arithmetic unit FALU, which is similar to the post-processing of the first stage, but when the pipeline shift register PLSR matches the contents of the pipeline instruction register PLI, all data are processed. The calculation ends. In this way, 16 sampling points
. -Fourier coefficient F in the Fourier transform of the signal f(t) represented by X,5. -Fl5 is obtained. To perform inverse Fourier transform, switch S9 in arithmetic unit FALU is set to the imaginary part sign inversion switching circuit 50 side, and the output of latch L9 is passed through this circuit 50, with the sign of the imaginary part inverted and supplied to latches L and O. All you have to do is set the mode so that

更にフーリエ変換は連続波形の有限区間を取出し、その
区間の周期関数であることを前提として解析している。
しかし必ずしもそのようになつていない。そのため解析
値に誤差が生じる。この誤差がなるべく小さくなるよう
に従来のデジタルフーリエ変換装置において、いわゆる
ウインドウ処理が行なわれていた。このウインドウ処理
はハミング法、ハニング法、バートレツト法などがある
が、このようなウインドウ処理もこの発明装置において
行なうことができる。例えばハニング法は１ｍ（１−Ｃ
Ｏｓ−２π）なる補償関数をデータに乗２Ｎｍじるが、
このＣＯｓ−２πなる値は上記記憶装置ＮＭＭ３に回転
因子として記憶されている点より、これを利用できる。Furthermore, Fourier transform extracts a finite section of a continuous waveform and analyzes it on the assumption that it is a periodic function of that section.
However, this is not necessarily the case. Therefore, an error occurs in the analytical value. In order to reduce this error as much as possible, so-called window processing is performed in conventional digital Fourier transform devices. This window processing includes the Hamming method, Hanning method, Bartlett method, etc., and such window processing can also be performed in the apparatus of the present invention. For example, the Hanning method uses 1m (1-C
The data is multiplied by 2Nm by the compensation function Os-2π),
Since this value COs-2π is stored as a twiddle factor in the storage device NMM3, it can be used.

このためには第７図の演算装置ＦＡＬＵにおいて、スイ
ツチＳｌ，Ｓ２は実線のまＸとし、スイツチＳ７，Ｓ８
Ｏ一方端子５１又は５２側に切替えてＯのみが入力され
、他方からデータＸｎが入力される。またスイツチＳ９
は回路５３側に切替えられ、この回路５３は１から入力
を減算する回路５４とその出力を右へ１ビツトシフトし
て２で割る回路５５よりなり、ラツチＬ９２π １
２πｍからのＣＯｓ−ｍは一（１−ＣＯｓ？）の演算Ｎ
２Ｎが回路５３で行なわれる。For this purpose, in the arithmetic unit FALU shown in FIG.
O is switched to one terminal 51 or 52 and only O is input, and data Xn is input from the other side. Also Switch S9
is switched to the circuit 53 side, and this circuit 53 consists of a circuit 54 that subtracts the input from 1 and a circuit 55 that shifts the output by 1 bit to the right and divides it by 2.
COs-m from 2πm is the operation N of 1 (1-COs?)
2N is performed in circuit 53.

記憶装置ＭＭｌ，ＭＭ２から読出されたデータＸｎを端
子２１及び２２の一方、スィツチＳ７，Ｓ８が接続され
ている方に入力し、記憶装置ＭＭ３から回転因子を読出
して端子２８を通じて回路５３へ供給する。例えばデー
タを端子２２に与えると、ラツチＬ，の入力は常にＯに
なり、加減算回路２６を加算モードとし、端子２２より
のデータに回路５３からの一（１２πＭｃＯｓ？）が回
路２７で乗算され、スイツチＮＳ２の状態により、端子
２３，２４の一方から書込バスへ出力され、更に記憶装
置ＭＭ，，ＭＭ２に書込まれる。The data Xn read from the storage devices MMl and MM2 is inputted to one of the terminals 21 and 22, which is connected to the switches S7 and S8, and the twiddle factor is read from the storage device MM3 and supplied to the circuit 53 through the terminal 28. . For example, when data is applied to the terminal 22, the input of the latch L is always O, the addition/subtraction circuit 26 is set to addition mode, and the data from the terminal 22 is multiplied by 1 (12πMcOs?) from the circuit 53 in the circuit 27. Depending on the state of the switch NS2, the data is output from one of the terminals 23 and 24 to the write bus, and further written to the memory devices MM, MM2.

上述したこの発明装置においてはフーリエ変換のみなら
ず、各種のデータアレイの演算を行なうことができる。The device of the present invention described above can perform not only Fourier transform but also various data array operations.

以下その例を述べる。二つのデータアレイ相互の加減算
は第７図においてスイツチＳｌ，Ｓ２を実線とし、スイ
ツチＳ３をラツチＬ２側、スイツチＳ４をラツチＬ７側
、スイツチＳ６をラツチＬ６側とし、加減算回路２５を
加算又は減算動作とし、加減算回路２６を減算又は加算
動作の何れか一方を指定する。端子２１，２２の入力デ
ータＡｎ，Ｂｎに対し、端子２３，２４からデータアレ
イＡｎ＋Ｂｎ，．Ａｎ−Ｂｎ又はＡｎ−Ｂｎ，．Ａｎ＋
Ｂｎが得られる。データアレイＡｎ−Ｂｎの乗算は端子
２１，２２の一方と端子２８にゼータＡｎＢｎを入れ、
スイツチＳ７，Ｓ８のデータが供給されない側は端子５
１，５２に接続してＯを入力とし、スイツチＳ５を端子
２８に、スイツチＳ９はラツチＬ，に接続し、その他は
ウインドウ処理と同一とする。これにより端子２３，２
４の一方に乗算結果のアレイが得られる。データアレイ
Ａｎに任意の定数Ｋを加算・減算或いは乗算する。加減
算は端子２１，２２の一方にゼータＡｎを他方に定数Ｋ
を入力し、上記データアレイＡｎＢｎの加減算と同様に
すればよい。またデータアレイＡｎを端子２１，２２の
一方に定数Ｋを端子２８に入力して上記乗算と同様にす
れば、データアレイＫＡｎが得られる。更にデータアレ
イからデータアレイへの変換としては例えば積分の場合
はスイツチＳ，を点線、Ｓ２は実線とし、スイツチＳ３
，Ｓ４はそれぞれラツチＬ７側とし、スイツチＳ７は端
子２１に、Ｓ８は端子５２に、Ｓ５はスイツチＳ８に、
ＳｌＯをスイツチＳ１にそれぞれ接続し、加減算回路２
６を加算動作にする。An example will be described below. Addition and subtraction between the two data arrays is performed by setting the switches S1 and S2 as solid lines in FIG. 7, setting the switch S3 to the latch L2 side, setting the switch S4 to the latch L7 side, and setting the switch S6 to the latch L6 side, and adding or subtracting the addition/subtraction circuit 25. The addition/subtraction circuit 26 is designated to perform either subtraction or addition. In response to input data An, Bn at terminals 21, 22, data arrays An+Bn, . An-Bn or An-Bn, . An+
Bn is obtained. For multiplication of data array An-Bn, zeta AnBn is put into one of terminals 21 and 22 and terminal 28,
The side to which data of switches S7 and S8 is not supplied is terminal 5.
1 and 52 and O is input, switch S5 is connected to terminal 28, switch S9 is connected to latch L, and the rest is the same as the window processing. As a result, terminals 23, 2
An array of multiplication results is obtained on one side of 4. An arbitrary constant K is added to, subtracted from, or multiplied by the data array An. For addition and subtraction, set zeta An to one of terminals 21 and 22 and constant K to the other.
, and perform the addition and subtraction of the data array AnBn described above. Further, by inputting the data array An to one of the terminals 21 and 22 and the constant K to the terminal 28 and performing the same multiplication as described above, a data array KAn is obtained. Furthermore, for conversion from data array to data array, for example, in the case of integration, switch S is set as a dotted line, S2 is set as a solid line, switch S3 is set as a dotted line, S2 is set as a solid line,
, S4 are respectively on the latch L7 side, switch S7 is connected to terminal 21, S8 is connected to terminal 52, S5 is connected to switch S8,
Connect each SlO to switch S1 and add/subtract circuit 2.
6 to add operation.

データアレイＡｎを端子２１に与える。データＡＯがラ
ツチＬ７に入ると回路２６においてＡ，と加算され、Ａ
Ｏ＋Ａ１となり、次のステツプでＡ。＋Ａ１＋Ａ２が得
られ、積分されたアレイが端子２４に得られる。微分動
作の場合はスイツチＳ，，Ｓ２は実線、スイツチＳ３は
ラツチＬ２、Ｓ６はラツチＬ６、Ｓ，ＯはラツチＬ２に
それぞれ接続され、その他は積分の場合と同様であり、
回路２６は減算動作とされる。その結果、ラツチＬ７か
らの前の結果と、ラツチＬ６からの新たなデータとの差
が回路２６でとられ、その結果がＬ７に入力されて微分
データアレイが端子２４に得られる。更にデータアレイ
の各要素の総和を求めるには、上記積分動作と同一とす
るが、その結果を記憶装置に入れるのは最終値だけを書
込む。或いは第１図において１ワードのレジスタＧＲを
１つ又は複数設けておき、例えば、ＧＲｌ〜ＧＲ４が設
けられ、これ等はそれぞれ切替回路を通じて書込バス及
び読出バスにそれぞれ接続される。よつて上記サンメー
シヨンをとる場合に、演算出力をレジスタＧＲの１つに
書込めば、このレジスタには常に最も新しい演算結果が
記憶される。上記実施例は以上の各種のデータアレイの
演算を行なうことができるが、そのための制御は上述し
たようにマイクロプログラム匍脚装置ＣＭで行なう。A data array An is applied to terminal 21. When data AO enters latch L7, it is added to A in circuit 26, and A
O+A1, then A in the next step. +A1+A2 is obtained and the integrated array is obtained at terminal 24. In the case of differential operation, switches S,, S2 are connected to solid lines, switch S3 is connected to latch L2, S6 is connected to latch L6, and S and O are connected to latch L2, respectively, and the rest is the same as in the case of integral operation.
The circuit 26 is operated as a subtractor. As a result, the difference between the previous result from latch L7 and the new data from latch L6 is taken in circuit 26, and the result is input to L7 to provide a differential data array at terminal 24. Furthermore, to obtain the sum of each element of the data array, the same integration operation as described above is used, but only the final value is written into the storage device. Alternatively, in FIG. 1, one or more 1-word registers GR are provided, for example GR1 to GR4, which are connected to the write bus and the read bus through switching circuits, respectively. Therefore, when performing the above-mentioned sunmation, if the calculation output is written to one of the registers GR, the newest calculation result is always stored in this register. The above-mentioned embodiment is capable of performing the above-described calculations on the various data arrays, but the control for this is performed by the microprogram pedestal device CM as described above.

その場合すべての制御をその都度マイクロ命令で設定す
るにはマイクロ命令のビツト数が非常に多くなる。よつ
て１つの演算の間中変化しないようなもの、即ちデータ
バス、ビツト幅、演算の種類、発生するアドレスパター
ンの各設定はマイクロプログラム制御によりセツトアツ
プレジスタに設定し、その演算中はそのレジスタの内容
は固定とされる。記憶装置の読出し、書込み、アドノレ
スの更新、演算の進行などのタイミングの制御、ステイ
タスセンスなどはマイクロ命令として与えられる。In that case, the number of microinstruction bits would be extremely large if all controls were to be set each time using microinstructions. Therefore, settings that do not change during one operation, such as the data bus, bit width, type of operation, and address pattern to be generated, are set in the setup register under microprogram control, and the settings in that register are kept unchanged during the operation. The contents of are fixed. Control of the timing of reading and writing of the storage device, address updating, progress of calculations, status sense, etc. are given as microinstructions.

以下代表的なマイクロ命令の例を述べる。先ずムーブ又
はＭＤ命令（ＭＯｖｅＤａｔａ命令）は第１０図Ａに示
すように３２ビツトよりなり、第１図におけるバス制御
レジスタＢＣＲ、メモリ御御レジスタＭＣＲの内容に従
つて主記憶装置ＭＭｌ，ＭＭ２より演算装置ＦＡＬＵ，
これより主記憶装置ＭＭｌ，ＭＭ２へのデータ転送を行
なう。この場合、読出された記憶装置と、書込み記憶装
置とは別のものでもよい。制御フイールドは第０〜第５
ビツトであり、第０ビツトは１０゛で割込みを受付け、
“１゛で割込みを禁止するＩＮＴＤＳ（Ｉｎｔｅｒｒｕ
ｐｔＤｉｓａｂｌｅ）であり、第１ビツトＲＥＴ，ＳＥ
Ｔ（ＲｅｔｕｒｎＡｄｄｒｅｓｓＳｅｔ）は６０″はＮ
ＯＯｐｅｒａｔｉＯｎであり、１ビはリターンアドレス
レジスタに現在実行中のアドレス＋１を格納する。第２
ビツトＲＥＴ（Ｒｅｔｕｒｎ）はＯでＮＯＯｐｅｒａｔ
ｉＯｎ（ＮＯＰ）、１１″ではリターンアドレスレジス
タの内容番地へジャンプする。た〜し割込トラツプ発生
時は行なわない。第３ビツトＤＡＲ−ＨＩＴ（ＨｉｔＤ
ｉｒｅｃｔＡｄｄｒｅｓｓＲｅｇｉｓｔｅｒ）は″Ｏ゛
でＮＯＰ，．″Ｒ５でダイレクトアドレスレジスタの内
容を＋１する。第４ビツトＩＸＨＩＴ（ＨｉｔＩｎｄｅ
ｘＣＯｕｎｔｅｒ）は゛０”でＮＯＰ．６ｌ”でデータ
カウンタＤＣの内容を＋１する。第５ビツトＡＳＲ−Ｓ
ＥＴ（ＳｅｔＡｄｄｒｅｓｓＳｈｉｆｔＲｅｇｉｓｔｅ
ｒ）は１０゛でＮＯＰ．．８ｌ゛でインデツクスをアド
レスシフトレジスタセツトする。第６及び第７ビツトは
６０゛でＮＯＰ、゛１゛でメモリ制御フイールドでメモ
リ読出制御レジスタＭＲＣＲ及びメモリ書込制御レジス
タＭＷＣＲの内容に従つて、主記憶装置の読出し、書込
みを制御する。第６ビツトが６ｒ”なら記憶装置から読
出し、第７ビツトが１１゛なら記憶装置へ書込む。なお
１命令で第６ビツト及び第７ビツトを同時に６１゛にす
ることはできない。第８、第９ビツトはＢＣＲフイール
ドで主記憶装置ＭＭ又は一般レジスタＧＲとバス及び演
算装置ＦＡＬＵ間の転送をバス制御レジスタＢＣＲの内
容に従つて制御する。第８ビツトの“１゛はその書込み
側のバス制御レジスタＢＣＲに従つてデータを転送し、
第９ビツトの６１゛は読出し側のバス制御レジスタＢＣ
Ｒに従つてデ一夕を転送する。第８、第９ビツトが同時
に”１゛に指令することはできない。第１０〜第１８ビ
ツトはトラツプフイールドであつてインデツクスダイレ
クトテイテクダブルビツト（ＩＸ−ＤＤＢ）を見て、ト
ラツプを受付けるか否かを決め、受付けの時は次の実行
アドレスを指定する。その第１０〜第１３ビツトはＩＸ
ＤＤＢＣＯＮＤ（ＩｎｄｅｘＤＤＢｃＯｎｄｉｔｉＯｎ
）のフイールドで゛１゛であるビツトに対応するＩＸＤ
ＤＢのビツトがすべて″１′２の時にトラツプ条件が満
足したとする。Examples of typical microinstructions are described below. First, the move or MD instruction (MOveData instruction) consists of 32 bits as shown in FIG. device FALU,
Data is then transferred to the main memories MM1 and MM2. In this case, the read storage device and the write storage device may be separate devices. Control fields are 0th to 5th
bit, the 0th bit is 10゛, interrupt is accepted,
“INTDS (Interru
ptDisable), and the first bit RET,SE
T (ReturnAddressSet) is 60'' is N
OOperatiOn, and 1 bit stores the address currently being executed +1 in the return address register. Second
Bit RET (Return) is O and NO Operat
iOn (NOP), 11'' jumps to the content address of the return address register. However, this is not performed when an interrupt trap occurs. The third bit DAR-HIT (HitD
directAddressRegister) is NOP with "O", and the content of the direct address register is incremented by 1 with "."R5. 4th bit IXHIT (HitInde
xCounter) is ``0'' and NOP.6l'' increases the contents of the data counter DC by 1. 5th bit ASR-S
ET(SetAddressShiftRegister
r) is 10゛ and NOP. ．． At 8l, the index is set in the address shift register. The 6th and 7th bits are NOP at 60', and the memory control field at 1' controls reading and writing of the main memory according to the contents of the memory read control register MRCR and the memory write control register MWCR. If the 6th bit is 6r'', it is read from the storage device, and if the 7th bit is 11'', it is written to the storage device. Note that the 6th and 7th bits cannot be set to 61'' at the same time in one instruction. The 9th bit is the BCR field, which controls the transfer between the main memory device MM or general register GR, the bus, and the arithmetic unit FALU according to the contents of the bus control register BCR.The 8th bit "1" controls the bus on the write side. Transfer data according to register BCR,
The 9th bit 61 is the bus control register BC on the read side.
Transfer data according to R. The 8th and 9th bits cannot be set to ``1'' at the same time.The 10th to 18th bits are trap fields, and they read the index direct data bit (IX-DDB) and accept traps. Decide whether or not to execute, and specify the next execution address at the time of acceptance.The 10th to 13th bits are IX
DDBCOND (IndexDDBcOnditiOn
) corresponding to the bit that is ``1'' in the field
Assume that the trap condition is satisfied when all bits of DB are "1"2.

第１４ビツトＩＸＤＤＢＣＬＲ（ＮｄｃｘＤＤＢｃｌｅ
ａｒ）は１１゛５でトラツプが受付けられた時に限り、
ＩＸ−ＤＤＢＣＯＮＤフイールドの゛１゛に対応するＩ
Ｘ−ＤＤＢのビツトをすべて″０゛にする。14th bit IXDDBCLR (NdcxDDBcle
ar) only when a trap is accepted at 11゛5,
I corresponding to "1" in the IX-DDBCOND field
Set all bits of X-DDB to "0".

トラツプが受付けられた時に次に実行する命令のアドレ
スは第１５〜１８ビツトＴＲＡＰＡＤＤＲＥＳＳで決め
る。この第１９〜２４ビツトにパルスフイールドで、そ
の第１９ビツトＸ−ＣＴＬ（ＩｎｄｅｘｃＯｎｔｒＯｌ
）はインデクスモードコントロールレジスタ（Ｘ−ＭＣ
Ｒ）のＣＴＬビツトが″ｒ”の時だけ有効であり、６０
”でアドレスシフトレジスタＡＳＲ２のアドレス源とし
てＩＸｌを選択し、゛１゛の場合はＡＳＲ２のアドレス
源としてＩＸ２を選択する。第２０ビツトＰＵＳＨばｒ
′でバス制御レジスタＢＣＲで指定した演算装置へ演算
実行パルスを送る。第２１〜２４ビツトＴ１〜Ｔ７は″
ｒ”で特定時刻Ｔ，，Ｔ３，Ｔ５，Ｔ７にパルスを演算
装置へ送る。第２７ビツトＮＯＴＲＡＰば０゛でトラツ
プ受付可能であり、６１゛でトラツプ受付禁止である。
命令フイールドに書かれているデータを結合された演算
装置のレジスタへ転送するＥＭＩＴ命令は第１０図Ｂに
示すように第０〜第５ビツトはＭＤ命令と同一であり、
第６〜第９ビツトにてデータを受取るべき演算装置を指
令し、第１０〜第１３ビツトでその演算装置内のレジス
タを指定し、第１４〜第２９ビツトで転送されるデータ
が表示される。The address of the next instruction to be executed when a trap is accepted is determined by the 15th to 18th bits TRAPADDRESS. The 19th to 24th bits are pulse fields, and the 19th bit X-CTL (IndexcOntrOl
) is the index mode control register (X-MC
R) is valid only when the CTL bit is "r", and 60
” selects IXl as the address source of address shift register ASR2, and when “1” selects IX2 as the address source of ASR2. 20th bit PUSH bar
' sends an arithmetic execution pulse to the arithmetic unit specified by the bus control register BCR. The 21st to 24th bits T1 to T7 are "
r'' sends pulses to the arithmetic unit at specific times T, T3, T5, and T7.If the 27th bit NOTRAP is 0, it is possible to accept a trap, and when it is 61, it is prohibited to accept a trap.
The EMIT instruction, which transfers the data written in the instruction field to the register of the connected arithmetic unit, has the 0th to 5th bits the same as the MD instruction, as shown in FIG. 10B.
The 6th to 9th bits instruct the arithmetic unit that should receive the data, the 10th to 13th bits specify the register within that arithmetic unit, and the 14th to 29th bits display the data to be transferred. .

主記憶装置の読出しバツフアレジスタやジエネラルレジ
スタの内容を指定された演算装置内のレジスタへ転送す
るＳＳ命令（ＳｅｔＳｔａｔｕｓ）は第１０図Ｃのよう
に第１０図Ｂと対応する部分は同符号を示し、データ源
となる記憶装置の指令を第１４〜第１６ビツトＲＥＧで
行なう。そのＬ／Ｒば０゛は左半分からの転送、”１゛
は右半分からの転送とする。演算装置内の制御レジスタ
の内容を記憶装置やジエネラルレジスタへ転送するＦＳ
命令（ＦｅｔｃｈＳｔａｔｕｓ）は第１０図Ｄのように
、第１４〜第１７ビツトは格納されるべき記憶装置やレ
ジスタを示す。その他ジエネラルレジスタ内の算術論理
演算を行なうＡＬＯＰ命令、記憶装置中のデータをジエ
ネラルレジスタに格納するＬＲ命令、逆にジエネラルレ
ジスタのデータを記憶装置へ書込むＳＲ命令、割込処理
ルーチンから復帰するＲＦＩ命令Ｓｔａｔｕｓセンス用
条件ジアップＪＯＴＪＮＴ命令、Ｓｔａｔｕｓをセンス
し、命令に続く５つのアドレスの１つへジヤンプするＳ
ＫＰ命令、記憶装置の読出し、書込みのみを行なうＡＭ
命令などがある。The SS instruction (SetStatus) that transfers the contents of the read buffer register or general register of the main memory to the register in the specified arithmetic unit is shown in Figure 10C, and the parts corresponding to Figure 10B have the same symbols. The 14th to 16th bits REG are used to command the storage device serving as the data source. For L/R, 0゛ is transfer from the left half, and ``1'' is transfer from right half. FS transfers the contents of the control register in the arithmetic unit to the storage device or general register.
For the instruction (FetchStatus), as shown in FIG. 10D, the 14th to 17th bits indicate the storage device or register to be stored. Other ALOP instructions that perform arithmetic and logical operations in general registers, LR instructions that store data in a storage device to general registers, SR instructions that write data in general registers to storage devices, and interrupt processing routines. Return RFI instructionStatus sensing conditionsJOTJNT instruction, senses the Status and jumps to one of the five addresses following the instructionS
AM that only performs KP command, reading and writing of storage device
There are commands, etc.

上述においては演算装置ＦＡＬＵ内のスイツチの切替な
どにより演算機能を変化させ、各種のアレイ演算を可能
にしたが、先にも述べたように読出バス及び書込バス間
に互に機能の異なる複数の演算装置を接続し、その必要
なものを使用する。例えば第１１図に示すようにデイジ
タルフーリエ変換を行なう演算装置ＦＡＬＵ、割算用演
算装置ＤＡＬＵ及び平方根用演算装置ＳＡＬＵなどが書
込みバス１２，１３、読出バス１４，１５間にそれぞれ
接続される。これら演算装置は何れもパイプライン構成
であり、割算用演算装置ＤＡＬＵにおいては例えば第１
２図に示すように４段のパイプラインＰＬｌ〜ＰＬ４よ
りなる。各パイプラインは３２ビツトの被除数が蓄えら
れるレジスタＬＲと、１６ビツトの除数が入力されるレ
ジスタＬＲ″と、レジスタＬＲ内の数からレジスタＬＲ
′内の数を減算する加減算器ＡＳＵと、その残りが入力
され、そのピツトを左へ１ビツトシフトする桁移し回路
ＳＬＵと、その桁移し回路の出力を被除数レジスタＬＲ
及び次段ステツプの被除数レジスタＬＲへ切替え接続す
るＳＷと、加減算器ＡＳＵより桁上げが生じたか否かを
回路Ｃにて検出し、その検出状態により制御する制御回
路ＬＣと、その回路ＬＣにより加減算器ＡＳＵの１回の
演算毎に１ビツト加算された割算結果が入力されるレジ
スタＲＲとよりなる。読出バスからデータがレジスタＬ
Ｒ及びＬＲ′に入力されると、これ等が減算器ＡＳＵに
て互に減算され、レジスタＲＲに１が入力されると共に
引算結果は１ビツトだけ左にシフ卜されて、レジスタＬ
Ｒに入れられる。このレジスタＬＲの内容が除数レジス
タＬＲ′の内容にて弓かれ、その結果レジスタＲＲは１
加算され、引算結果が１ビツト左へシフトされてレジス
タＬＲに入力され、以下同様のことが繰返され、４回引
算が行なわれると、上記の各引算の回数がカウンタＬＣ
Ｕにて計数され、スイツチＳＷが次段の被除数レジスタ
ＬＲ側へ接続され、引算結果を１ビツト左シフトされた
値が次段の被除数レジスタＬＲへ供給される。次に記憶
装置から新しいデータが演算装置ＤＡＬＵへ入力される
と、それまでの第１ステツプＰＬｌにおける除数レジス
タＬＲ′の内容は第２ステツプＰＬ２の除数レジスタＬ
Ｒ／に、第１ステツプの割算結果のレジスタＲＲの内容
は第２ステツプＰＬ２の結果レジスタＲＲへそれぞれ移
される。以下同様に各ステツプの除数レジスタＬＲ７、
結果レジスタＲＲの各内容はそれぞれその次段の対応す
るものに移される。このようにして第４ステツプＰＬ４
の演算が行なわれると、始めて３２ビツトの被除数は１
６ビツト左へシフトされ、１６ビツトの除数による割算
が完了し、書込みバスへ供給される。このようにして主
記憶装置の読出しサイクルよりも割算の演算速度が遅い
場合でも主記憶装置の読出し速度を下げることなく割算
することができる。以上述べたようにこの発明データア
レイ演算装′置によれば演算装置をパイプライン構成と
し、その高速性をいかし、しかもマイクロプログラム制
御の融通性もあり、各種の演算機能を同一装置で行なう
ことができる。In the above, various array operations were made possible by changing the operation function by switching the switch in the operation unit FALU, but as mentioned earlier, there are multiple array operations with different functions between the read bus and the write bus. Connect your computing devices and use what you need. For example, as shown in FIG. 11, an arithmetic unit FALU for performing digital Fourier transform, a division arithmetic unit DALU, a square root arithmetic unit SALU, and the like are connected between write buses 12 and 13 and read buses 14 and 15, respectively. All of these arithmetic units have a pipeline configuration, and in the division arithmetic unit DALU, for example, the first
As shown in FIG. 2, it consists of four stages of pipelines PLl to PL4. Each pipeline has a register LR where a 32-bit dividend is stored, a register LR'' where a 16-bit divisor is input, and a register LR from the number in register LR.
an adder/subtracter ASU that subtracts the number in ', a shift circuit SLU that receives the remainder and shifts the pit one bit to the left, and an output of the shift circuit that is sent to the dividend register LR.
and a SW that is switched and connected to the dividend register LR of the next step, a control circuit LC that detects whether or not a carry has occurred from the adder/subtractor ASU in a circuit C, and controls based on the detection state, and a control circuit LC that performs addition/subtraction using the circuit LC. It consists of a register RR into which the division result obtained by adding one bit for each operation of the unit ASU is input. Data is transferred from the read bus to register L.
When input to R and LR', they are subtracted from each other in a subtracter ASU, 1 is input to register RR, and the subtraction result is shifted to the left by 1 bit, and is stored in register L.
It can be placed in R. The contents of this register LR are rounded by the contents of the divisor register LR', and as a result, the register RR becomes 1.
The result of the subtraction is shifted one bit to the left and input into the register LR, and the same process is repeated. When subtraction is performed four times, the number of times of each subtraction is stored in the counter LC.
The switch SW is connected to the dividend register LR of the next stage, and the value obtained by shifting the subtraction result to the left by one bit is supplied to the dividend register LR of the next stage. Next, when new data is input from the storage device to the arithmetic unit DALU, the contents of the divisor register LR' in the first step PLl are changed to the divisor register L of the second step PL2.
At R/, the contents of the register RR of the division result of the first step are respectively transferred to the result register RR of the second step PL2. Similarly, the divisor register LR7 of each step,
Each content of the result register RR is transferred to its corresponding one in the next stage. In this way, the fourth step PL4
When the calculation is performed, the 32-bit dividend becomes 1 for the first time.
Shifted to the left by 6 bits, division by the 16 bit divisor is completed and provided to the write bus. In this way, even if the calculation speed of division is slower than the read cycle of the main memory, the division can be performed without reducing the read speed of the main memory. As described above, according to the data array arithmetic device of the present invention, the arithmetic device has a pipeline configuration, taking advantage of its high speed and flexibility in microprogram control, allowing various arithmetic functions to be performed by the same device. I can do it.

その場合適当に制御用レジスタを設けてマイクロ命令の
構成ビツト桁数を少なくして、各種の制御を可能として
いる。なおデータアレイでないデータについての演算を
パイプライン構成の演算装置で行なうことは時間が長く
なる。この点から通常の演算装置ＡＬＵを読出しバス及
び書込みバス間に接続することもできる。In this case, appropriate control registers are provided to reduce the number of bits constituting the microinstruction, thereby making various controls possible. Note that it takes a long time to perform an operation on data other than a data array using a pipeline-configured arithmetic unit. From this point on, a conventional arithmetic unit ALU can also be connected between the read bus and the write bus.

[Brief explanation of the drawing]

第１図はこの発明によるアレイ演算装置の一例を示すプ
ロツク図、第２図はサンデーチユーキ法によりデジタル
フーリエ変換のアルゴリズムを説明するための図、第３
図はその演算を示す表、第４図は回転因子の記憶状態を
示す図、第５図はデジタルフーリエ変換のフローチヤー
ト、第６図はアドレス発生部の例を示すプロツク図、第
７図は演算装置の一例を示すプロツク図、第８図は正弦
値及び余弦値の関係を示す図、第９図は各ステージにお
けるカウンタ、インデクス、回転因子の関係を示す表、
第１０図は各種マイクロ命令の例を示す図、第１１図は
この発明アレイ演算装置の他の例を示すプロツク図、第
１２図はその割算用演算装置の一例を示すプロツク図で
ある。FIG. 1 is a block diagram showing an example of an array calculation device according to the present invention, FIG. 2 is a diagram for explaining a digital Fourier transform algorithm using the Sunday-Chiyuki method, and FIG.
Figure 4 is a table showing the calculation, Figure 4 is a diagram showing the storage status of twiddle factors, Figure 5 is a flowchart of digital Fourier transform, Figure 6 is a block diagram showing an example of the address generation section, and Figure 7 is a diagram showing the storage state of twiddle factors. A block diagram showing an example of an arithmetic device; FIG. 8 is a diagram showing the relationship between sine values and cosine values; FIG. 9 is a table showing the relationship between counters, indexes, and twiddle factors at each stage;
FIG. 10 is a diagram showing examples of various microinstructions, FIG. 11 is a block diagram showing another example of the array arithmetic device of the present invention, and FIG. 12 is a block diagram showing an example of the division arithmetic device.

Claims

[Claims]

1 At least one readable and writable storage device in which a data array is stored in a sequentially addressed manner, an address generator that generates read and write addresses for the storage device, and a read and write cycle of the storage device that minimizes read and write cycles. The unit of operation is multiple times the unit of operation to complete one operation, two data array input terminals, at least one output terminal to which the data array of the operation result is output, and the amount of delay of the minimum operation unit. at least one delay element, at least one arithmetic element, and at least one switch for switching the connection of these input terminals, delay element, and arithmetic element, and the arithmetic function to be processed can be changed by controlling the switch. a readout bus that supplies two data arrays read from the storage device to two input terminals of the calculation device; and a readout bus that supplies the calculation results from the output terminals of the calculation device to the storage device. controls a write bus to be supplied as write data to the memory device and the switch to designate the arithmetic function of the arithmetic device, read the data array from the storage device, and write the arithmetic results of the arithmetic device to the storage device. A data array arithmetic processing device comprising a microprogram control device.