JP4597640B2

JP4597640B2 - FDTD computing device and FDTD computing method

Info

Publication number: JP4597640B2
Application number: JP2004331127A
Authority: JP
Inventors: 秀俊鈴木; 良山口; 真司上林; 雄太高儀
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2004-11-15
Filing date: 2004-11-15
Publication date: 2010-12-15
Anticipated expiration: 2024-11-15
Also published as: JP2006139723A

Description

本発明はＦＤＴＤ演算装置、ＦＤＴＤ演算方法に関し、特にアンテナの特性や、人体を模擬した誘電体に電波を照射した時の吸収量をシミュレータで評価する等の電磁界シミュレーション全般に用いるＦＤＴＤ演算装置、ＦＤＴＤ演算方法に関する。 The present invention relates to an FDTD arithmetic apparatus and an FDTD arithmetic method, and more particularly to an FDTD arithmetic apparatus used in general electromagnetic field simulation such as evaluating characteristics of an antenna and absorption amount when a radio wave is irradiated on a dielectric simulating a human body, The present invention relates to an FDTD calculation method.

ＦＤＴＤ（ＦｉｎｉｔＤｅｆｆｅｒｅｎｃｅＴｉｍｅＤｏｍａｉｎＭｅｔｈｏｄ）法とは、解析する領域をセルと呼ばれる格子状の立方体のブロックで分割し、各辺上に電界を、各面の中心に垂直方向に磁界を割り当ててマクスウェルの方程式を解くことにより、解析領域内における電界、磁界の空間的・時間的な変化を表現する電磁界シミュレーション手法である。本手法はＦｏｒｔｒａｎやＣ等のプログラミング言語を用いて実現されている。 The FDTD (Finite Difference Time Domain Method) method divides a region to be analyzed into blocks of lattice cubes called cells, assigns an electric field on each side, and assigns a magnetic field in the vertical direction to the center of each surface. It is an electromagnetic field simulation technique that expresses spatial and temporal changes of electric and magnetic fields in the analysis region by solving equations. This method is implemented using programming languages such as Fortran and C.

ＦＤＴＤ法の概要を図１３に示す。同図（ａ）は解析対象となる領域を示す図である。解析対象となる領域を格子状のセルで分割した場合において、その分割した１つのセルが同図中の破線円内に示されている。このセルの１つに着目した拡大図が同図（ｂ）である。同図（ｂ）において、白抜き矢印は電界、黒塗り矢印は磁界を示しており、電界によって磁界が発生している。同図（ｃ）を参照すると、面Ｓにおいて、電界Ｅｙ及び電界Ｅｘによって、磁界Ｈｚが発生している。セルを構成する他の面についても同様に、電界によって磁界が発生することになる。また、この発生した磁界によって、電界が発生する。その様子が同図（ｄ）に示されている。同図（ｄ）では、磁界Ｈｘ及びＨｙによって、電界Ｅｚが発生している。
このように、電界によって磁界が発生し、磁界によって電界が発生することに鑑み、ＦＤＴＤ法では次に示すマクスウェルの方程式を差分計算し、電界・磁界の値を順次計算することによって電磁界の変化を計算する。 An outline of the FDTD method is shown in FIG. FIG. 4A shows a region to be analyzed. When a region to be analyzed is divided by lattice-like cells, one divided cell is shown in a broken-line circle in FIG. An enlarged view focusing on one of the cells is FIG. In FIG. 2B, the white arrow indicates an electric field, and the black arrow indicates a magnetic field, and a magnetic field is generated by the electric field. Referring to FIG. 3C, a magnetic field Hz is generated on the surface S by the electric field Ey and the electric field Ex. Similarly, a magnetic field is generated by the electric field on the other surfaces constituting the cell. An electric field is generated by the generated magnetic field. This is shown in FIG. In FIG. 4D, an electric field Ez is generated by the magnetic fields Hx and Hy.
Thus, in view of the fact that a magnetic field is generated by an electric field and the electric field is generated by a magnetic field, the FDTD method calculates the difference of the following Maxwell's equations and sequentially calculates the electric field and magnetic field values to change the electromagnetic field. Calculate

磁界から電界を計算する式および電界から磁界を計算する式を示す。磁界から電界を計算する際には、１時間ステップ前の同じ位置の電界値とその周りを囲むように位置する磁界(４パラメータ)および３係数(Cex、Cey、Cez等)から計算され、電界から磁界を計算する際には、１時間ステップ前の同じ位置の磁界値とその周りを囲むように位置する電界(４パラメータ)および３係数(Chx、Chy、Chz等)から計算される。
電界の演算式は以下の通りである。 An expression for calculating an electric field from a magnetic field and an expression for calculating a magnetic field from the electric field are shown. When calculating the electric field from the magnetic field, the electric field is calculated from the electric field value at the same position one hour before and the surrounding magnetic field (four parameters) and three coefficients (Cex, Cey, Cez, etc.) The magnetic field is calculated from the magnetic field value at the same position one hour before and the electric field (four parameters) and the three coefficients (Chx, Chy, Chz, etc.) positioned so as to surround it.
The calculation formula of the electric field is as follows.

ただし、以下に示すＣｅｘ、Ｃｅｙ、Ｃｅｚ、Ｃｅｘｌｙ、Ｃｅｘｌｚ、Ｃｅｙｌｘ、Ｃｅｙｌｚ、Ｃｅｚｌｘ、および、Ｃｅｚｌｙはセルの位置に依存する定数である。 However, Cex, Cey, Cez, Cexly, Cexlz, Ceylx, Ceylz, Cezlx, and Cezly shown below are constants depending on the position of the cell.

磁界の演算式は以下の通りである。 The calculation formula of the magnetic field is as follows.

ただし、以下に示すＣｈｘ、Ｃｈｙ、Ｃｈｚ、Ｃｈｘｌｙ、Ｃｈｘｌｚ、Ｃｈｙｌｘ、Ｃｈｙｌｚ、Ｃｈｚｌｘ、および、Ｃｈｚｌｙはセルの位置に依存する定数である。 However, Chx, Chy, Chz, Chxly, Chxlz, Chylx, Chylz, Chzlx, and Chzly shown below are constants depending on the position of the cell.

ここで、括弧内の数値は、ｘ座標、ｙ座標、ｚ座標、の位置を示し、それぞれの変数の意味は次表のとおりである。 Here, the numerical values in parentheses indicate the positions of the x coordinate, the y coordinate, and the z coordinate, and the meaning of each variable is as shown in the following table.

なお、特許文献１には、プリント基板を設計する際に、ＦＤＴＤ法による電磁界シミュレータを用いて放射パターンを求めた旨が記載されており、様々な製品の設計の際にＦＤＴＤ法が用いられる。
特開２００２−２４３１２号公報（段落[００６６]） Patent Document 1 describes that a radiation pattern was obtained using an electromagnetic field simulator based on the FDTD method when designing a printed circuit board, and the FDTD method is used when designing various products. .
JP 2002-24312 A (paragraph [0066])

ＦＤＴＤ法を、プログラム言語で記述して実現する場合、コンピュータの自動制御により計算が進行し、メモリ読込みおよび書き込みに関する制御は不要である。その一方、並列計算ではないため各セルを１つ１つ順番に計算することになる。
しかし、ＦＤＴＤ法は、解析領域を細かく分割して、１セル毎に逐次計算を行うため、計算機資源(メモリ)を多く必要とし、計算には長時間を要することが判っている。 When the FDTD method is described and realized in a program language, calculation proceeds by automatic control of a computer, and control regarding memory reading and writing is unnecessary. On the other hand, since it is not parallel calculation, each cell is calculated one by one.
However, since the FDTD method divides the analysis region into fine pieces and sequentially performs calculation for each cell, it is known that a lot of computer resources (memory) are required and the calculation takes a long time.

そのため、高速化することの一手法として、計算機で実現していたアルゴリズムをＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）に実装して並列計算を行うことによる高速化の検討がなされている。
ＦＰＧＡは大規模データの高速処理用および科学技術演算のアクセラレータとして使用可能な汎用演算器である。処理データの保存用として外部メモリを有しており、並列演算時の一時保存用としてバッファメモリを有している。ＦＰＧＡ並列演算器の構成の概要を図１４に、ＦＰＧＡにＦＤＴＤ法を適用した場合の処理フローを図１５に示す。 Therefore, as one method for speeding up, studies have been made on speeding up by implementing an algorithm implemented by a computer in an FPGA (Field Programmable Gate Array) and performing parallel computation.
The FPGA is a general-purpose computing unit that can be used for high-speed processing of large-scale data and as an accelerator for scientific and technical computation. An external memory is provided for storing processing data, and a buffer memory is provided for temporary storage during parallel computation. FIG. 14 shows an outline of the configuration of the FPGA parallel computing unit, and FIG. 15 shows a processing flow when the FDTD method is applied to the FPGA.

図１４を参照すると、ＦＰＧＡ並列演算器は、演算対象となるデータ全体を記憶しておくための外部メモリ１と、この外部メモリ１に記憶されているデータのうち１回の演算対象となるデータを読み出して記憶するためのバッファメモリ２と、バッファメモリ２に記憶されているデータについて並列演算を行うＣＰＵとして機能するＦＰＧＡ部３とを含んで構成されている。 Referring to FIG. 14, the FPGA parallel computing unit includes an external memory 1 for storing the entire data to be computed, and data to be computed once among the data stored in the external memory 1. Are read and stored, and an FPGA unit 3 that functions as a CPU that performs a parallel operation on the data stored in the buffer memory 2 is configured.

図１５を参照すると、最初に、初期データを外部メモリ１に書き込む（ステップＳ１）。次に、外部メモリから演算対象となるデータを読み出し（ステップＳ２）、そのデータをバッファメモリ２に一時保存する（ステップＳ３）。
この状態において、バッファメモリ２に記憶されているデータを用いて、ＦＰＧＡで並列演算を行う（ステップＳ４）。この演算結果を外部メモリ１に書き込む（ステップＳ５）。その後、演算が終了したかどうか判断し（ステップＳ６）、演算終了まで、ステップＳ２からステップＳ５までを繰返し実行する。 Referring to FIG. 15, first, initial data is written in the external memory 1 (step S1). Next, data to be calculated is read from the external memory (step S2), and the data is temporarily stored in the buffer memory 2 (step S3).
In this state, using the data stored in the buffer memory 2, parallel operation is performed by the FPGA (step S4). This calculation result is written in the external memory 1 (step S5). Thereafter, it is determined whether or not the calculation is completed (step S6), and steps S2 to S5 are repeatedly executed until the calculation is completed.

ところで、図１４に示されているようにＦＰＧＡ並列演算器を構成した場合、ＦＰＧＡ部３から外部メモリ１へのアクセス時間は、ＦＰＧＡ部３からバッファメモリ２へのアクセス時間よりも長いので、この外部メモリ１へのアクセス時間が演算処理時間の大勢を占めることになる。例えば、演算処理全体にかかる処理時間の割合は、図１５中のステップＳ２：ステップＳ３：ステップＳ４において、約２０：３：５の割合である。このため、演算処理時間の短縮が困難である、という問題点がある。
本発明は上述した従来技術の問題点を解決するためになされたものであり、その目的は演算処理時間を短縮することのできるＦＤＴＤ演算装置、ＦＤＴＤ演算方法を提供することである。 By the way, when the FPGA parallel computing unit is configured as shown in FIG. 14, the access time from the FPGA unit 3 to the external memory 1 is longer than the access time from the FPGA unit 3 to the buffer memory 2. The access time to the external memory 1 occupies most of the processing time. For example, the ratio of the processing time for the entire calculation process is approximately 20: 3: 5 in step S2: step S3: step S4 in FIG. For this reason, there is a problem that it is difficult to shorten the processing time.
The present invention has been made to solve the above-described problems of the prior art, and an object of the present invention is to provide an FDTD arithmetic apparatus and an FDTD arithmetic method that can reduce the arithmetic processing time.

本発明の請求項１によるＦＤＴＤ演算装置は、解析対象となる領域を市松模様状に第１ブロック群と第２ブロック群とからなる格子状の複数のブロックによって分割し、この分割した各ブロックの各辺に電界及び磁界の一方を、前記各ブロックの各面に他方を割り当ててマクスウェルの方程式を解く演算を行うことにより、前記領域内における電界及び磁界の空間的及び時間的な変化を得るためのＦＤＴＤ演算装置であって、
外部メモリから読み込んだデータを記憶するバッファ記憶手段と、
前記解析対象となる領域を構成する複数のブロックについて前記第１ブロック群に対応するデータのうち、前記バッファ記憶手段に既に記憶されており次の回の演算において前記第１ブロック群の各辺のうち、それらにそれぞれ隣接する前記第２ブロック群の各辺に接する辺に対応するデータを再利用データとし、前記再利用データ以外のデータを読み込むデータ読み込み手段と、前記バッファ記憶手段に記憶されているデータを用いて前記演算を行う演算手段とを含むことを特徴とする。解析対象となる領域を構成する複数のブロックについてその配列の１つおきのブロックに対応するデータのうち、バッファメモリに既に記憶されており次の回の演算において用いるデータ以外のデータを読み込むようにしたので、外部メモリへのアクセス回数が少なくなり、演算時間を短縮できる。 The FDTD arithmetic unit according to claim 1 of the present invention divides an area to be analyzed into a checkered pattern by a plurality of grid-like blocks including a first block group and a second block group, and each of the divided blocks is divided. one of electric and magnetic fields to each side, by performing an operation to solve Maxwell's assigning the other on each side of the blocks, for obtaining a spatial and temporal changes in the electric field and magnetic field in the region FDTD arithmetic unit,
Buffer storage means for storing data read from external memory;
Of the data corresponding to the first block group for a plurality of blocks constituting the region to be analyzed, the data is already stored in the buffer storage means and is calculated for each side of the first block group in the next calculation. Among them, data corresponding to the sides in contact with each side of the second block group adjacent to each other is used as reuse data, and is stored in the data storage means for reading data other than the reuse data and the buffer storage means. And calculating means for performing the calculation using existing data. Among the data corresponding to every other block in the array for the plurality of blocks constituting the analysis target area, data other than the data already stored in the buffer memory and used in the next calculation is read. As a result, the number of accesses to the external memory is reduced and the computation time can be shortened.

本発明の請求項２によるＦＤＴＤ演算装置は、請求項１において、前記データ読み込み手段は、前記複数のブロックのうち前記領域の端部に位置するブロックについて奇数番目のもの及び偶数番目のもののいずれか一方に対応するデータを読み込み、次の回は他方に対応するデータを読み込むことを特徴とする。奇数番目のデータ、偶数番目のデータを交互に読み込むことにより、重複して用いるデータ以外のデータを読み込んで演算することができ、外部メモリへのアクセス回数を少なくすることができ、演算時間を短縮できる。 According to a second aspect of the present invention, in the FDTD arithmetic apparatus according to the first aspect, the data reading means is either an odd-numbered one or an even-numbered one of the plurality of blocks located at the end of the region. Data corresponding to one is read, and data corresponding to the other is read the next time. By alternately reading odd-numbered data and even-numbered data, data other than redundantly used data can be read and calculated, the number of accesses to the external memory can be reduced, and calculation time is reduced. it can.

本発明の請求項３によるＦＤＴＤ演算装置は、請求項１又は２において、前記演算において固定的に使用するパラメータを、該演算の間、前記バッファ記憶手段に固定的に記憶させておくことを特徴とする。固定的に使用するパラメータを、バッファメモリに固定的に記憶させておくことにより、外部メモリへのアクセス回数を少なくすることができ、演算時間をより短縮できる。 According to a third aspect of the present invention, the FDTD arithmetic apparatus according to the first or second aspect is characterized in that, in the first or second aspect, the parameters that are used fixedly in the calculation are fixedly stored in the buffer storage means during the calculation. And By storing fixedly used parameters in the buffer memory, the number of accesses to the external memory can be reduced, and the calculation time can be further shortened.

本発明の請求項４によるＦＤＴＤ演算方法は、解析対象となる領域を市松模様状に第１ブロック群と第２ブロック群とからなる格子状の複数のブロックによって分割し、この分割した各ブロックの各辺に電界及び磁界の一方を、前記各ブロックの各面に他方を割り当ててマクスウェルの方程式を解く演算を、バッファメモリと該バッファメモリよりもアクセス時間が長い外部メモリとに対するデータ読み込みを制御する並列演算機によって行うことにより、前記領域内における電界及び磁界の空間的及び時間的な変化を得るためのＦＤＴＤ演算方法であって、
前記並列演算機によって、前記外部メモリから読み込んだデータをバッファメモリに記憶する記憶ステップと、
前記並列演算機によって、前記解析対象となる領域を構成する複数のブロックについて前記第１ブロック群に対応するデータのうち、前記バッファメモリに既に記憶されており次の回の演算において前記第１ブロック群の各辺のうち、それらにそれぞれ隣接する前記第２ブロック群の各辺に接する辺に対応するデータを再利用データとし、前記再利用データ以外のデータを、前記外部メモリから前記バッファメモリに読み込むデータ読み込みステップと、前記並列演算機によって、前記バッファメモリに記憶されているデータを用いて前記演算を行う演算ステップとを含むことを特徴とする。解析対象となる領域を構成する複数のブロックについてその配列の１つおきのブロックに対応するデータのうち、バッファメモリに既に記憶されており次の回の演算において用いるデータ以外のデータを読み込むようにしたので、外部メモリ１へのアクセス回数が少なくなり、演算時間を短縮できる。 The FDTD calculation method according to claim 4 of the present invention divides a region to be analyzed into a checkered pattern by a plurality of grid-like blocks including a first block group and a second block group, and each of the divided blocks is divided. Controls reading of data into the buffer memory and an external memory having a longer access time than the buffer memory, by assigning one of an electric field and a magnetic field to each side and assigning the other to each side of each block to solve Maxwell's equations An FDTD calculation method for obtaining a spatial and temporal change of an electric field and a magnetic field in the region by performing by a parallel calculator ,
A storage step of storing data read from the external memory in the buffer memory by the parallel computing unit ;
Of the data corresponding to the first block group for the plurality of blocks constituting the region to be analyzed by the parallel computing device, the data is already stored in the buffer memory, and the first block in the next computation Among the sides of the group, the data corresponding to the sides in contact with the sides of the second block group adjacent to each other is used as reuse data, and data other than the reuse data is transferred from the external memory to the buffer memory. A data reading step to be read; and a calculation step of performing the calculation using the data stored in the buffer memory by the parallel calculator . Among the data corresponding to every other block in the array for the plurality of blocks constituting the analysis target area, data other than the data already stored in the buffer memory and used in the next calculation is read. As a result, the number of accesses to the external memory 1 is reduced, and the calculation time can be shortened.

以上説明したように本発明は、解析対象となる領域を構成する複数のブロックについてその配列の１つおきのブロックに対応するデータのうち、バッファメモリに既に記憶されており次の回の演算において用いるデータ以外のデータを読み込むようにしたので、演算時間を短縮することができるという効果がある。 As described above, according to the present invention, among the data corresponding to every other block in the array, a plurality of blocks constituting the region to be analyzed are already stored in the buffer memory and are calculated in the next calculation. Since data other than the data to be used is read, there is an effect that the calculation time can be shortened.

以下、本発明の実施の形態を、図面を参照して説明する。なお、以下の説明において参照する各図では、他の図と同等部分は同一符号によって示されている。
本実施の形態では、４行４列（Ｎ行Ｍ列のＮ＝４、Ｍ＝４）のセルブロック構成の場合について説明する。
（実施の形態）
図１は電界Ｅｚを計算するためのセルブロック構成例を、図２は電界Ｅｘを計算するためのセルブロック構成例を、図３は電界Ｅｙを計算するためのセルブロック構成例を、図４は磁界Ｈｚを計算するためのセルブロック構成例を、図５は磁界Ｈｘを計算するためのセルブロック構成例を、図６は磁界Ｈｙを計算するためのセルブロック構成例を、それぞれ示す図である。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. In the drawings referred to in the following description, the same parts as those in the other drawings are denoted by the same reference numerals.
In this embodiment, a case of a cell block configuration of 4 rows and 4 columns (N rows and M columns, N = 4, M = 4) will be described.
(Embodiment)
1 shows a cell block configuration example for calculating the electric field Ez, FIG. 2 shows a cell block configuration example for calculating the electric field Ex, FIG. 3 shows a cell block configuration example for calculating the electric field Ey, and FIG. Is a diagram showing a cell block configuration example for calculating the magnetic field Hz, FIG. 5 is a diagram showing a cell block configuration example for calculating the magnetic field Hx, and FIG. 6 is a diagram showing a cell block configuration example for calculating the magnetic field Hy. is there.

図１において、黒塗り矢印は電界Ｅｚ、白抜き矢印及び網掛け矢印は、磁界Ｈｘ又は磁界Ｈｙ，を示している。図２において、黒塗り矢印は電界Ｅｘ、白抜き矢印及び網掛け矢印は、磁界Ｈｙ又は磁界Ｈｚ，を示している。図３において、黒塗り矢印は電界Ｅｙ、白抜き矢印及び網掛け矢印は、磁界Ｈｘ又は磁界Ｈｚ，を示している。
図４において、黒塗り矢印は磁界Ｈｚ、白抜き矢印及び網掛け矢印は、電界Ｅｘ又は電界Ｅｙ，を示している。図５において、黒塗り矢印は磁界Ｈｘ、白抜き矢印及び網掛け矢印は、電界Ｅｙ又は電界Ｅｚ，を示している。図６において、黒塗り矢印は磁界Ｈｙ、白抜き矢印及び網掛け矢印は、電界Ｅｘ又は電界Ｅｚ，を示している。 In FIG. 1, black arrows indicate the electric field Ez, white arrows and shaded arrows indicate the magnetic field Hx or the magnetic field Hy. In FIG. 2, black arrows indicate the electric field Ex, white arrows and shaded arrows indicate the magnetic field Hy or the magnetic field Hz. In FIG. 3, black arrows indicate the electric field Ey, white arrows and shaded arrows indicate the magnetic field Hx or the magnetic field Hz.
In FIG. 4, the black arrow indicates the magnetic field Hz, and the white arrow and the shaded arrow indicate the electric field Ex or the electric field Ey. In FIG. 5, black arrows indicate the magnetic field Hx, white arrows and shaded arrows indicate the electric field Ey or the electric field Ez. In FIG. 6, the black arrow indicates the magnetic field Hy, and the white arrow and the shaded arrow indicate the electric field Ex or the electric field Ez.

（重複データ以外のデータの読み込み）
ところで、図１〜図６にそれぞれ示されているセルブロック構成においては、白抜きで示されているブロックと網掛けで示されているブロックとが交互に市松模様状に配列されている。ここで、図３に注目すると、白抜きで示されているブロック全てを構成する辺に対応するデータを用いて第１回目の演算を行った後、網掛けで示されているブロック全てを構成する辺に対応するデータを用いて第２回目の演算を行い、合計２回の演算を１セットとして、同図のセルブロック構成について演算を行う場合を考える。 (Reading data other than duplicate data)
By the way, in the cell block configuration shown in FIGS. 1 to 6, the blocks shown in white and the blocks shown in shaded are alternately arranged in a checkered pattern. Here, paying attention to FIG. 3, after performing the first calculation using data corresponding to the sides constituting all the blocks shown in white, all the blocks shown in shaded are constituted. Consider a case where the second calculation is performed using data corresponding to the edge to be performed, and the calculation is performed on the cell block configuration of FIG.

その場合、第１回目の演算と第２回目の演算とで重複して利用するデータが存在する。この重複するデータについては、第１回目の演算の前に外部メモリからバッファメモリに読み込んでいるので、それを第２回目の演算に用いる（１時間ステップ前に使用した値を用いる）。このため、第２回目の演算の前には重複して利用するデータ以外のデータをバッファメモリに読み込むようにすれば、外部メモリへのアクセス回数が少なくなり、演算時間を短縮できる。
第１回目の演算の前に外部メモリからバッファメモリに読み込むデータについて図７を、第２回目の演算の前に外部メモリからバッファメモリに読み込むデータについて図８を、それぞれ参照して説明する。 In that case, there is data that is used redundantly in the first calculation and the second calculation. Since this duplicated data is read from the external memory into the buffer memory before the first calculation, it is used for the second calculation (the value used before one time step is used). Therefore, if data other than redundantly used data is read into the buffer memory before the second calculation, the number of accesses to the external memory is reduced, and the calculation time can be shortened.
The data read from the external memory to the buffer memory before the first calculation will be described with reference to FIG. 7, and the data read from the external memory to the buffer memory before the second calculation will be described with reference to FIG.

図７を参照すると、第１回目の演算においては、白抜きで示されているブロックを演算対象とするので、その演算の前に外部メモリからバッファメモリに読み込むデータは、
磁界Ｈｘ（１）、Ｈｘ（３）、Ｈｘ（５）〜Ｈｘ（８）、Ｈｘ（９）〜Ｈｘ（１２）、Ｈｘ（１３）〜Ｈｘ（１６）、Ｈｘ（１８）、及び、Ｈｘ（２０）、
磁界Ｈｚ（１）〜Ｈｚ（４）、Ｈｚ（７）〜Ｈｚ（１０）、Ｈｚ（１１）〜Ｈｚ（１４）、及び、Ｈｚ（１７）〜Ｈｚ（２０）、
電界Ｅｙ（１）、Ｅｙ（３）、Ｅｙ（６）、Ｅｙ（８）、Ｅｙ（９）、Ｅｙ（１１）、Ｅｙ（１４）、及び、Ｅｙ（１６）、
である。これらに対応するデータ全てを外部メモリからバッファメモリに読み込むことになる。ここで、第２回目の演算に用いるデータは、同図中の網掛けで示されているブロックである。この網掛けで示されているブロックに注目すると、第１回目の演算に用いるデータと大部分が共通する。すなわち、
磁界Ｈｘ（５）〜Ｈｘ（８）、Ｈｘ（９）〜Ｈｘ（１２）、Ｈｘ（１３）〜Ｈｘ（１６）、
磁界Ｈｚ（２）〜Ｈｚ（４）、Ｈｚ（７）〜Ｈｚ（９）、Ｈｚ（１２）〜Ｈｚ（１４）、Ｈｚ（１７）〜Ｈｚ（１９）、
については、第１回目の演算で用いた後、同じデータを第２回目の演算でも共通して用いることになる。そこで、この共通して用いるデータについては、外部メモリから再度読み込むのではなく、バッファメモリに既に記憶されているデータをそのまま用いる。 Referring to FIG. 7, in the first calculation, since the blocks shown in white are the calculation target, the data read from the external memory to the buffer memory before the calculation is
Magnetic fields Hx (1), Hx (3), Hx (5) to Hx (8), Hx (9) to Hx (12), Hx (13) to Hx (16), Hx (18), and Hx ( 20),
Magnetic fields Hz (1) to Hz (4), Hz (7) to Hz (10), Hz (11) to Hz (14), and Hz (17) to Hz (20),
Electric fields Ey (1), Ey (3), Ey (6), Ey (8), Ey (9), Ey (11), Ey (14), and Ey (16),
It is. All data corresponding to these are read from the external memory into the buffer memory. Here, the data used for the second calculation is a block indicated by shading in FIG. When attention is paid to the block indicated by the shaded area, most of the data is the same as the data used for the first calculation. That is,
Magnetic fields Hx (5) to Hx (8), Hx (9) to Hx (12), Hx (13) to Hx (16),
Magnetic fields Hz (2) to Hz (4), Hz (7) to Hz (9), Hz (12) to Hz (14), Hz (17) to Hz (19),
As for, after being used in the first calculation, the same data is commonly used in the second calculation. Therefore, for the data used in common, the data already stored in the buffer memory is used as it is, instead of reading from the external memory again.

この結果、図８を参照すると、第２回目の演算においては、網掛けで示されているブロックを演算対象とするので、その演算の前に外部メモリからバッファメモリに読み込むデータは、
磁界Ｈｘ（２）、Ｈｘ（４）、Ｈｘ（１７）、及び、Ｈｘ（１９）、
磁界Ｈｚ（５）、Ｈｚ（６）、Ｈｚ（１５）、及び、Ｈｚ（１６）、
電界Ｅｙ（２）、Ｅｙ（４）、Ｅｙ（５）、Ｅｙ（７）、Ｅｙ（１０）、Ｅｙ（１２）、Ｅｙ（１３）、及び、Ｅｙ（１５）、
である。したがって、第２回目の演算においては、第１回目の演算よりもはるかに少ないデータを外部メモリからバッファメモリに読み込むだけで済むことになる。よって、外部メモリへのアクセス回数を少なくすることができる。 As a result, referring to FIG. 8, in the second calculation, since the block indicated by shading is the calculation target, the data read from the external memory to the buffer memory before the calculation is
Magnetic fields Hx (2), Hx (4), Hx (17), and Hx (19),
Magnetic fields Hz (5), Hz (6), Hz (15), and Hz (16),
Electric fields Ey (2), Ey (4), Ey (5), Ey (7), Ey (10), Ey (12), Ey (13), and Ey (15),
It is. Therefore, in the second calculation, it is only necessary to read much less data from the external memory into the buffer memory than in the first calculation. Therefore, the number of accesses to the external memory can be reduced.

（演算処理の処理フロー）
図７及び図８のように、白抜きで示されているブロックに対応するデータについて第１回目の演算を行い、その後網掛けで示されているブロックに対応するデータについて第２回目の演算を行う場合の演算処理の処理フローについて、図９を参照して説明する。
同図において、ステップＳ２からＳ５までの処理が上記の第１回目の演算に対応し、ステップＳ６〜Ｓ９までの処理が上記の第２回目の演算に対応する。
同図を参照すると、最初に、初期データを外部メモリ１に書込む（ステップＳ１）。次に、演算対象となる、上記の白抜きで示されているブロックに対応するデータを外部メモリから読み出し（ステップＳ２）、そのデータをバッファメモリ２に一時保存する（ステップＳ３）。 (Processing flow of arithmetic processing)
As shown in FIG. 7 and FIG. 8, the first calculation is performed on the data corresponding to the blocks shown in white, and then the second calculation is performed on the data corresponding to the blocks shown in shading. A processing flow of the arithmetic processing in the case of performing will be described with reference to FIG.
In the figure, the processing from step S2 to S5 corresponds to the first calculation, and the processing from step S6 to S9 corresponds to the second calculation.
Referring to the figure, first, initial data is written into the external memory 1 (step S1). Next, the data corresponding to the above-described blocks shown in white are read from the external memory (step S2), and the data is temporarily stored in the buffer memory 2 (step S3).

この状態において、バッファメモリ２に記憶されているデータを用いて、ＦＰＧＡで並列演算を行う（ステップＳ４）。この演算結果を外部メモリ１に書き込む（ステップＳ５）。
次に、上記の網掛けで示されているブロックに対応するデータのうち、上記ステップＳ２において読み出した以外のデータ（重複データ以外のデータ）を外部メモリから読み出し（ステップＳ６）、そのデータをバッファメモリ２に一時保存する（ステップＳ７）。 In this state, using the data stored in the buffer memory 2, parallel operation is performed by the FPGA (step S4). This calculation result is written in the external memory 1 (step S5).
Next, among the data corresponding to the blocks indicated by the above shaded data, data other than the data read in step S2 (data other than duplicate data) is read from the external memory (step S6), and the data is buffered. Temporarily stored in the memory 2 (step S7).

この状態において、バッファメモリ２に記憶されているデータを用いて、ＦＰＧＡで並列演算を行う（ステップＳ８）。この演算結果を外部メモリ１に書き込む（ステップＳ９）。その後、演算が終了したかどうか判断する（ステップＳ１０）。演算終了でなければ、ステップＳ２に戻り、処理を続行する。
以上のように、ステップＳ２からＳ５まで、及び、ステップＳ６からＳ９まで、の２回の演算を１セットとして、図３のセルブロック構成についての演算が完了することになる。 In this state, using the data stored in the buffer memory 2, parallel operation is performed by the FPGA (step S8). This calculation result is written in the external memory 1 (step S9). Thereafter, it is determined whether or not the calculation is completed (step S10). If the calculation is not finished, the process returns to step S2 to continue the process.
As described above, the calculation for the cell block configuration of FIG. 3 is completed by setting two calculations of steps S2 to S5 and steps S6 to S9 as one set.

次に、外部メモリに記憶されているデータの例について図１０を、バッファメモリへ読み込まれたデータの例について図１１及び図１２を、それぞれ参照して説明する。
（外部メモリの記憶内容）
図１０を参照すると、外部メモリ１には、図１から図８までの各図に示されているＨｘ、Ｈｙ、Ｈｚ、Ｅｘ、Ｅｙ、Ｅｚに対応するデータが書き込まれ、記憶されている。ＦＰＧＡは、演算に先立ち、外部メモリ１にアクセスし、上記の演算に必要なデータを読み出し、バッファメモリに書き込む。外部メモリ１に記憶されているデータについては、本例では、最大４個単位ブロックの読み出し又は書き込みが可能であるものとする。例えば、Ｈｘ（１，２，１）、Ｈｘ（２，２，１）、Ｈｘ（３，２，１）、Ｈｘ（４，２，１）をまとめて読み出して、Ｈｘ（５）〜Ｈｘ（８）としてバッファメモリに書き込む。 Next, an example of data stored in the external memory will be described with reference to FIG. 10, and an example of data read into the buffer memory will be described with reference to FIGS. 11 and 12.
(Contents stored in external memory)
Referring to FIG. 10, data corresponding to Hx, Hy, Hz, Ex, Ey, Ez shown in each of FIGS. 1 to 8 is written and stored in the external memory 1. Prior to the calculation, the FPGA accesses the external memory 1 to read out data necessary for the above calculation and write it into the buffer memory. With respect to data stored in the external memory 1, in this example, it is assumed that a maximum of four unit blocks can be read or written. For example, Hx (1, 2, 1), Hx (2, 2, 1), Hx (3, 2, 1), and Hx (4, 2, 1) are collectively read and Hx (5) to Hx ( 8) Write to the buffer memory.

ここで、図７及び図８を再び参照すると、Ｈｘ（１）からＨｘ（４）までの４個のデータについては、第１回目の演算に対応している図７では奇数番目のＨｘ（１）及びＨｘ（３）を用い、第２回目の演算に対応している図８では偶数番目のＨｘ（２）及びＨｘ（４）を用いている。したがって、最初の４個のデータについては、第１回目の演算では奇数番目の２個のデータをバッファメモリに読み込み、第２回目の演算では偶数番目の２個のデータをバッファメモリに読み込めばよいことになる。 Here, referring again to FIG. 7 and FIG. 8, for the four data from Hx (1) to Hx (4), the odd-numbered Hx (1) in FIG. 7 corresponding to the first calculation is shown. ) And Hx (3), and even-numbered Hx (2) and Hx (4) are used in FIG. 8 corresponding to the second calculation. Therefore, for the first four data, the odd number of two data is read into the buffer memory in the first calculation, and the even number of two data is read into the buffer memory in the second calculation. It will be.

また、図７及び図８を参照すると、Ｈｘ（１７）からＨｘ（２０）までの４個のデータについては、第１回目の演算に対応している図７では奇数番目のＨｘ（１８）及びＨｘ（２０）を用い、第２回目の演算に対応している図８では偶数番目のＨｘ（１７）及びＨｘ（１９）を用いている。したがって、最初の４個のデータについては、第１回目の演算では偶数番目の２個のデータをバッファメモリに読み込み（奇数番目は読み込まない）、第２回目の演算では奇数番目の２個のデータをバッファメモリに読み込めば（偶数番目は読み込まない）よいことになる。 Also, referring to FIG. 7 and FIG. 8, for the four data from Hx (17) to Hx (20), the odd-numbered Hx (18) and Hx (18) in FIG. 7 corresponding to the first calculation are shown. In FIG. 8, which corresponds to the second calculation using Hx (20), even-numbered Hx (17) and Hx (19) are used. Therefore, for the first four data, the even numbered two data are read into the buffer memory in the first calculation (the odd number is not read), and the odd number of the two data in the second calculation. Can be read into the buffer memory (even numbers are not read).

図１０に戻り、上述したＨｘ（１）からＨｘ（４）までの４個のデータとして用いるＨｘ（１，１，１）、Ｈｘ（２，１，１）、Ｈｘ（３，１，１）、Ｈｘ（４，１，１）のデータＤ１３については、第１回目の演算では奇数番目の２個のデータＨｘ（１，１，１）、Ｈｘ（３，１，１）を読み出し、Ｈｘ（１）、Ｈｘ（３）としてバッファメモリに書き込み、第２回目の演算では偶数番目の２個のデータＨｘ（２，１，１）、Ｈｘ（４，１，１）を読み出し、Ｈｘ（２）、Ｈｘ（４）としてバッファメモリに書き込めばよい。同様に、上述したＨｘ（１７）からＨｘ（２０）までの４個のデータとして用いるデータについては、第１回目の演算では偶数番目の２個のデータを読み出し、Ｈｘ（１８）、Ｈｘ（２０）としてバッファメモリに書き込み、第２回目の演算では奇数番目の２個のデータを読み出し、Ｈｘ（１７）、Ｈｘ（１９）としてバッファメモリに書き込めばよい。 Returning to FIG. 10, Hx (1,1,1), Hx (2,1,1), Hx (3,1,1) used as the four data from Hx (1) to Hx (4) described above. , Hx (4,1,1) data D13 is read in the first calculation by reading odd-numbered two pieces of data Hx (1,1,1) and Hx (3,1,1). 1), Hx (3) is written in the buffer memory, and even-numbered two pieces of data Hx (2,1,1) and Hx (4,1,1) are read in the second calculation, and Hx (2) , Hx (4) may be written to the buffer memory. Similarly, for the data used as the four data from Hx (17) to Hx (20) described above, the even-numbered two data are read in the first calculation, and Hx (18), Hx (20 ) In the buffer memory, and in the second calculation, the odd-numbered two pieces of data are read out and written into the buffer memory as Hx (17) and Hx (19).

Ｈｚについても同様に、上述したＨｚ（１）からＨｚ（４）までの４個のデータとして用いるデータについては、第１回目の演算では奇数番目の２個のデータを読み出し、Ｈｚ（１）、Ｈｚ（３）としてバッファメモリに書き込み、第２回目の演算では偶数番目の２個のデータを読み出し、Ｈｘ（２）、Ｈｘ（４）としてバッファメモリに書き込めばよい。そして、上述したＨｚ（１７）からＨｚ（２０）までの４個のデータとして用いるデータについては、第１回目の演算では偶数番目の２個のデータを読み出し、Ｈｚ（１８）、Ｈｚ（２０）としてバッファメモリに書き込み、第２回目の演算では奇数番目の２個のデータを読み出し、Ｈｚ（１７）、Ｈｚ（１９）としてバッファメモリに書き込めばよい。 Similarly for Hz, for the data used as the four data from Hz (1) to Hz (4) described above, the odd-numbered two data are read out in the first calculation, and Hz (1), What is necessary is just to write to the buffer memory as Hz (3), to read two even-numbered data in the second calculation, and to write to the buffer memory as Hx (2) and Hx (4). And about the data used as four data from the above-mentioned Hz (17) to Hz (20), even number two data are read in the 1st calculation, and Hz (18), Hz (20) Is written into the buffer memory, and in the second calculation, the odd-numbered two pieces of data are read out and written into the buffer memory as Hz (17) and Hz (19).

（バッファメモリの記憶内容）
ここで、図１１には第１回目の演算の前にバッファメモリに読み込んだデータの内容が示されている。同図に示されているように、Ｈｘ（１）からＨｘ（４）までの４個のデータとして用いるデータについては奇数番目の２個のデータが読み出され、Ｈｘ（１）、Ｈｘ（３）としてバッファメモリに記憶され、Ｈｘ（１７）からＨｘ（２０）までの４個のデータとして用いるデータについては偶数番目の２個のデータが読み出され、Ｈｘ（１８）、Ｈｘ（２０）としてバッファメモリに記憶されている。また、Ｈｚ（１）からＨｚ（４）までの４個のデータとして用いるデータについては奇数番目の２個のデータが読み出され、Ｈｚ（１）、Ｈｚ（３）としてバッファメモリに記憶され、Ｈｚ（１７）からＨｚ（２０）までの４個のデータとして用いるデータについては偶数番目の２個のデータが読み出され、Ｈｚ（１８）、Ｈｚ（２０）としてバッファメモリに記憶されている。なお、図７中の白抜きで示されているブロックについて演算するため、Ｅｙ（１）、Ｅｙ（３）、Ｅｙ（６）、Ｅｙ（８）、Ｅｙ（９）、Ｅｙ（１１）、Ｅｙ（１４）、Ｅｙ（１６）がバッファメモリに記憶されている。 (Contents stored in buffer memory)
Here, FIG. 11 shows the contents of data read into the buffer memory before the first calculation. As shown in the figure, for data used as four data from Hx (1) to Hx (4), two odd-numbered data are read out, and Hx (1), Hx (3 ) Are stored in the buffer memory and used as the four data from Hx (17) to Hx (20), the even-numbered two data are read out as Hx (18) and Hx (20). Stored in buffer memory. For data used as four data from Hz (1) to Hz (4), odd-numbered two data are read and stored in the buffer memory as Hz (1) and Hz (3). For the data used as four data from Hz (17) to Hz (20), two even-numbered data are read and stored in the buffer memory as Hz (18) and Hz (20). In addition, since it calculates about the block shown with the outline in FIG. 7, Ey (1), Ey (3), Ey (6), Ey (8), Ey (9), Ey (11), Ey (14) and Ey (16) are stored in the buffer memory.

図１２には第２回目の演算の前にバッファメモリに読み込んだデータの内容が示されている。同図に示されているように、Ｈｘ（１）からＨｘ（４）までの４個のデータとして用いるデータについては偶数番目の２個のデータが読み出され、Ｈｘ（２）、Ｈｘ（４）としてバッファメモリに記憶され、Ｈｘ（１７）からＨｘ（２０）までの４個のデータとして用いるデータについては奇数番目の２個のデータが読み出され、Ｈｘ（１７）、Ｈｘ（１９）としてバッファメモリに記憶されている。また、Ｈｚ（１）からＨｚ（４）までの４個のデータとして用いるデータについては偶数番目の２個のデータが読み出され、Ｈｚ（２）、Ｈｚ（４）としてバッファメモリに記憶され、Ｈｚ（１７）からＨｚ（２０）までの４個のデータとして用いるデータについては奇数番目の２個のデータが読み出され、Ｈｚ（１７）、Ｈｚ（１９）としてバッファメモリに記憶されている。なお、図８中の網掛けで示されているブロックについて演算するため、Ｅｙ（２）、Ｅｙ（４）、Ｅｙ（５）、Ｅｙ（７）、Ｅｙ（１０）、Ｅｙ（１２）、Ｅｙ（１３）、Ｅｙ（１５）がバッファメモリに記憶されている。 FIG. 12 shows the contents of data read into the buffer memory before the second calculation. As shown in the figure, for the data used as four data from Hx (1) to Hx (4), two even-numbered data are read out, and Hx (2), Hx (4 ) Are stored in the buffer memory and used as the four data from Hx (17) to Hx (20), the odd-numbered two data are read out as Hx (17) and Hx (19). Stored in buffer memory. For data used as four data from Hz (1) to Hz (4), even-numbered two data are read and stored in the buffer memory as Hz (2) and Hz (4). For the data used as four data from Hz (17) to Hz (20), the odd-numbered two data are read out and stored in the buffer memory as Hz (17) and Hz (19). In order to calculate the blocks indicated by shading in FIG. 8 , Ey (2), Ey (4), Ey (5), Ey (7), Ey (10), Ey (12), Ey (13) and Ey (15) are stored in the buffer memory.

以上のことから、図１０に示されている外部メモリ１からバッファメモリへの読み込みは、以下のように行われる。
（１）各４個単位ブロックの先頭ビットのアドレスを探す。
（２）磁界Ｈｘ、磁界Ｈｚについては、５単位ブロックを読み出すため、最初のブロックと最後のブロックについては、４個のデータを奇数番目と偶数番目とに分ける。
（３）最初のブロックの奇数番目のデータと、２〜４番目の単位ブロックの全データと、最後のブロックの偶数番目のデータとを読み出す。
（４）電界Ｅｙについては、市松模様状に用いるため、各単位ブロック毎に、奇数番目、奇数番目、偶数番目、偶数番目、奇数番目、奇数番目、偶数番目、偶数番目、の順に読み出す。 From the above, reading from the external memory 1 shown in FIG. 10 to the buffer memory is performed as follows.
(1) Search for the address of the first bit of each four unit block.
(2) For the magnetic field Hx and the magnetic field Hz, since 5 unit blocks are read out, for the first block and the last block, the four data are divided into odd and even numbers.
(3) Read odd-numbered data of the first block, all data of the second to fourth unit blocks, and even-numbered data of the last block.
(4) The electric field Ey is read in the order of odd-numbered, odd-numbered, even-numbered, even-numbered, odd-numbered, odd-numbered, even-numbered, even-numbered for each unit block because it is used in a checkered pattern.

（５）第１回目の演算を行う。
（６）磁界Ｈｘ、磁界Ｈｚについては、最初のブロックの偶数番目、及び、最後のブロックの奇数番目のみを読み出す。
（７）電界Ｅｙについては、市松模様状に用いるため、各単位ブロック毎に、偶数番目、偶数番目、奇数番目、奇数番目、偶数番目、偶数番目、奇数番目、奇数番目、の順に読み出す。
（８）第２回目の演算を行う。
以上のように、第２回目の演算の前に外部メモリからバッファメモリに読み込むデータは非常に少なくなるので、外部メモリへのアクセス回数を少なくすることができ、演算時間を短縮できる。 (5) Perform the first calculation.
(6) For the magnetic field Hx and the magnetic field Hz, only the even number of the first block and the odd number of the last block are read.
(7) The electric field Ey is read in the order of even-numbered, even-numbered, odd-numbered, odd-numbered, even-numbered, even-numbered, odd-numbered, odd-numbered for each unit block because it is used in a checkered pattern.
(8) The second calculation is performed.
As described above, since the data read from the external memory to the buffer memory before the second calculation is very small, the number of accesses to the external memory can be reduced and the calculation time can be shortened.

本例では、電界Ｅｙ(電界のｙ方向成分)を算出する場合、かつ計算単位としてセルのブロックを４×４×１とした場合について説明する。
前出の計算式より、電界Ｅｙを計算するために使用するパラメータは電界Ｅｙが１個、磁界Ｈｘが２個、磁界Ｈｚが２個、Ｃｅｙ、ＣｅｙｌｘおよびＣｅｙｌｚが各１個の合計８個のパラメータである。 In this example, a case where the electric field Ey (y-direction component of the electric field) is calculated and a block of cells is set to 4 × 4 × 1 as a calculation unit will be described.
From the above formula, the parameters used to calculate the electric field Ey are one electric field Ey, two magnetic fields Hx, two magnetic fields Hz, and one each of Cey, Ceylx and Ceylz. It is a parameter.

８個のパラメータを毎回外部メモリから読み出し、計算を実行して、計算結果の電界Ｅｚを外部メモリに書き込む処理を実施すると、８パラメータ×１６セル＝１２８回の外部メモリアクセスが発生する。
ここで、図３を参照して既に説明したように、セルを順番に計算すると網掛け矢印は隣接するセルにおける計算で重複して使用することが判る。 If 8 parameters are read from the external memory each time, the calculation is executed, and the electric field Ez of the calculation result is written in the external memory, 8 parameters × 16 cells = 128 times of external memory access occurs.
Here, as already described with reference to FIG. 3, when cells are calculated in order, it can be seen that the shaded arrows are used redundantly in calculations in adjacent cells.

そこで、１つ置きの間隔で配置された網掛けの面の計算を同時に実行し（第１回目の演算）、次に白抜きの面の計算を同時に実行する（第２回目の演算）処理を施すことにより、重複するパラメータをバッファメモリに蓄えておくことができ、外部メモリへのアクセス回数を削減することで計算時間の短縮を図る。本実施例においては、８パラメータ×１６セル−２４＝１０４回のアクセスで済み、速度として１．２３倍の高速化が見込める。すなわち、セルのブロック単位をＮ行×Ｍ列×１とすると、
８ＮＭ／（５ＮＭ−（Ｎ・（Ｍ−１）＋Ｍ・（Ｎ−１）））…（１）
であり、式（１）にＮ＝４、Ｍ＝４を代入すると、
仮に、本発明を採用しない場合（式（１）の分子）には８パラメータ×１６セル＝１２８回の外部メモリへのアクセスが発生することになる。これに対し、本実施例においては、８パラメータ×１６セル−２４＝１０４回（式（１）の分母）のアクセスで済み、速度として１．２３倍の高速化が見込める。 Therefore, the calculation of the shaded surfaces arranged at every other interval is executed simultaneously (first calculation), and the calculation of the white surface is executed simultaneously (second calculation). As a result, duplicate parameters can be stored in the buffer memory, and the calculation time can be shortened by reducing the number of accesses to the external memory. In this embodiment, 8 parameters × 16 cells−24 = 104 accesses are required, and a speed increase of 1.23 times can be expected. That is, if the cell block unit is N rows × M columns × 1,
8NM / (5NM- (N. (M-1) + M. (N-1))) (1)
And substituting N = 4 and M = 4 into equation (1),
If the present invention is not adopted (numerator of formula (1)), 8 parameters × 16 cells = 128 accesses to the external memory occur. On the other hand, in this embodiment, it is only necessary to access 8 parameters × 16 cells−24 = 104 times (the denominator of Expression (1)), and a speed increase of 1.23 times can be expected.

さらに、Ｃｅｙ、ＣｅｙｌｘおよびＣｅｙｌｚの３個のパラメータは、セルの位置に依存する定数であることから、バッファメモリ内にテーブルとして固定的に保持することが可能である。これらを演算の間、バッファメモリに固定的に記憶させておくことにより、読込むパラメータを５個に削減することが可能である。したがって、上記処理と組み合わせることにより、５パラメータ×１６セル−２４＝５６回まで外部メモリへのアクセスを削減できる。この場合の処理速度は、８個のパラメータを外部メモリからバッファメモリに毎回読み込む場合に比べて２.２８倍の高速化が図れる。セルのブロック単位をＮ×Ｎ×１とすることにより、８Ｎ／(３Ｎ＋２)倍の高速化が図れる。 Further, since the three parameters of Cey, Ceylx, and Ceylz are constants depending on the position of the cell, they can be fixedly held as a table in the buffer memory. It is possible to reduce the number of parameters to be read to five by storing them in the buffer memory in a fixed manner during the calculation. Therefore, by combining with the above processing, access to the external memory can be reduced up to 5 parameters × 16 cells−24 = 56 times. In this case, the processing speed can be increased by 2.28 times as compared with the case where the eight parameters are read from the external memory into the buffer memory each time. By setting the cell block unit to N × N × 1, the speed can be increased by 8N / (3N + 2) times.

ところで、上記Ｎ、Ｍの値の少なくとも一方は偶数であることが望ましい。Ｎ、Ｍが共に奇数の場合には、演算に用いるために読み込むデータの数が奇数になるので、第１回目の演算データ量と第２回目の演算データ量とが異なり、ＦＰＧＡの演算が行われない期間が発生し、演算効率が低下するからである。 By the way, it is desirable that at least one of the values of N and M is an even number. When N and M are both odd numbers, the number of data to be read for use in the calculation is an odd number. Therefore, the first calculation data amount is different from the second calculation data amount, and the FPGA calculation is performed. This is because a period that is not broken occurs and the calculation efficiency decreases.

（重複パラメータの識別手法）
これまで示した、重複して使用されるパラメータを複数回読込むことのないよう識別する手法について、以下説明する。
（１）メモリアドレス設定法
バッファメモリの領域内で、重複パラメータが書き込まれるメモリアドレスにフラグを付加し、フラグの付加されたメモリアドレスのパラメータは２回の計算で使用された後に次のパラメータを読込む制御を行う。つまり、２回の演算で使用するパラメータの内、バッファメモリに既に記憶されており次の回の演算において用いるパラメータ以外のパラメータのみを外部メモリより読込む。 (Duplicate parameter identification method)
The technique for identifying the parameters used so far as being duplicated so as not to be read multiple times will be described below.
(1) Memory address setting method A flag is added to the memory address where the duplicate parameter is written in the buffer memory area. The parameter of the memory address to which the flag is added is used in two calculations, and then the next parameter is set. Control reading. That is, of the parameters used in the two calculations, only the parameters other than those already stored in the buffer memory and used in the next calculation are read from the external memory.

（２）端部読込み法
図３を参照すれば判るように、重複しないパラメータは、Ｎ個×Ｍ個×１個のセルブロックの内、端部に位置するブロックのパラメータおよび計算面に垂直のパラメータ(図３の実施例ではＥｙ)である。これより、端部に位置するパラメータおよび計算面に垂直向きのパラメータのみを識別して外部メモリより読込む。 (2) Edge reading method As can be seen with reference to FIG. 3, non-overlapping parameters are perpendicular to the parameter and calculation plane of the block located at the edge of the N × M × 1 cell blocks. Parameter (Ey in the embodiment of FIG. 3). As a result, only the parameters located at the end and the parameters perpendicular to the calculation plane are identified and read from the external memory.

本発明によれば、ＦＤＴＤ法をＦＰＧＡのような並列演算機に適用した際に外部メモリへのアクセス回数の削減により電磁界演算時間を短縮できる。このため、人体近傍の電磁環境、携帯電話機の輻射電力、及び、屋内電磁環境等を高速に推定でき、低消費電力・電磁界環境設計効率の向上を実現できる。また、各種製品の開発効率の飛躍的な向上を実現できる。 According to the present invention, when the FDTD method is applied to a parallel computing machine such as an FPGA, the electromagnetic field computation time can be shortened by reducing the number of accesses to the external memory. For this reason, the electromagnetic environment near the human body, the radiated power of the mobile phone, the indoor electromagnetic environment, and the like can be estimated at high speed, and low power consumption and electromagnetic field environment design efficiency can be improved. In addition, the development efficiency of various products can be dramatically improved.

電界Ｅｚを計算するためのセルブロック構成例を示す図である。It is a figure which shows the cell block structural example for calculating the electric field Ez. 電界Ｅｘを計算するためのセルブロック構成例を示す図である。It is a figure which shows the example of a cell block structure for calculating the electric field Ex. 電界Ｅｙを計算するためのセルブロック構成例を示す図である。It is a figure which shows the example of a cell block structure for calculating the electric field Ey. 磁界Ｈｚを計算するためのセルブロック構成例を示す図である。It is a figure which shows the example of a cell block structure for calculating the magnetic field Hz. 磁界Ｈｘを計算するためのセルブロック構成例を示す図である。It is a figure which shows the cell block structural example for calculating the magnetic field Hx. 磁界Ｈｙを計算するためのセルブロック構成例を示す図である。It is a figure which shows the cell block structural example for calculating the magnetic field Hy. 第１回目の演算の前に外部メモリからバッファメモリに読み込むデータを示す図である。It is a figure which shows the data read from the external memory to the buffer memory before the 1st calculation. 第２回目の演算の前に外部メモリからバッファメモリに読み込むデータを示す図である。It is a figure which shows the data read from the external memory to the buffer memory before the 2nd calculation. 本発明によるＦＰＧＡ演算装置の処理フローを示す図である。It is a figure which shows the processing flow of the FPGA arithmetic unit by this invention. 外部メモリに記憶されているデータの例を示す図である。It is a figure which shows the example of the data memorize | stored in the external memory. 第１回目の演算の前にバッファメモリに読み込んだデータの内容を示す図である。It is a figure which shows the content of the data read into the buffer memory before the 1st calculation. 第２回目の演算の前にバッファメモリに読み込んだデータの内容を示す図である。It is a figure which shows the content of the data read into the buffer memory before the 2nd calculation. ＦＤＴＤ法の概要を示す図である。It is a figure which shows the outline | summary of the FDTD method. ＦＰＧＡ並列演算器の構成の概要を示す図である。It is a figure which shows the outline | summary of a structure of a FPGA parallel computing unit. ＦＰＧＡ並列演算器の処理フローを示す図である。It is a figure which shows the processing flow of an FPGA parallel computing unit.

Explanation of symbols

１外部メモリ
２バッファメモリ
３ＦＰＧＡ 1 External memory 2 Buffer memory 3 FPGA

Claims

The region to be analyzed is divided into a checkered pattern by a plurality of grid-like blocks including the first block group and the second block group, and one of the electric field and the magnetic field is applied to each side of each of the divided blocks. An FDTD arithmetic unit for obtaining a spatial and temporal change of an electric field and a magnetic field in the region by assigning the other to each surface of a block and performing an operation to solve Maxwell's equation,
Buffer storage means for storing data read from external memory;
Of the data corresponding to the first block group for a plurality of blocks constituting the region to be analyzed, the data is already stored in the buffer storage means and is calculated for each side of the first block group in the next calculation. Among them, data corresponding to the sides in contact with each side of the second block group adjacent to each other is used as reuse data, and is stored in the data storage means for reading data other than the reuse data and the buffer storage means. And an arithmetic means for performing the arithmetic operation using the existing data.

The data reading means reads data corresponding to one of an odd-numbered block and an even-numbered block for a block located at an end of the region among the plurality of blocks, and the next time, data corresponding to the other block The FDTD arithmetic unit according to claim 1, wherein:

3. The FDTD arithmetic apparatus according to claim 1, wherein parameters used in the calculation are fixedly stored in the buffer storage means during the calculation.

The region to be analyzed is divided into a checkered pattern by a plurality of grid-like blocks including the first block group and the second block group, and one of the electric field and the magnetic field is applied to each side of each of the divided blocks. An operation for allocating the other to each face of the block and solving Maxwell's equation is performed by a parallel computing unit that controls reading of data into the buffer memory and an external memory having a longer access time than the buffer memory . An FDTD calculation method for obtaining spatial and temporal changes in electric and magnetic fields,
By the parallel operation machine, a storage step of storing the data read from the external memory to the buffer memory,
Of the data corresponding to the first block group for the plurality of blocks constituting the region to be analyzed by the parallel computing device, the data is already stored in the buffer memory, and the first block in the next computation Among the sides of the group, the data corresponding to the sides in contact with the sides of the second block group adjacent to each other is used as reuse data, and data other than the reuse data is transferred from the external memory to the buffer memory. An FDTD calculation method comprising: a data reading step to be read; and a calculation step of performing the calculation using the data stored in the buffer memory by the parallel calculator .