JP7530231B2

JP7530231B2 - Prediction device, learning device and program

Info

Publication number: JP7530231B2
Application number: JP2020134499A
Authority: JP
Inventors: 剛司深沢
Original assignee: Mitsubishi FBR Systems Inc
Current assignee: Mitsubishi FBR Systems Inc
Priority date: 2020-08-07
Filing date: 2020-08-07
Publication date: 2024-08-07
Anticipated expiration: 2040-08-07
Also published as: JP2022030449A

Description

特許法第３０条第２項適用刊行物名：２０１９年度年次大会まんず，あべ！機械学会さ！－秋田に集うしったげおもしぇ技術－サスティナビリティ，ＡＩ社会の機械工学少子高齢化・人手不足を支えるテクノロジー，発行日：令和元年（２０１９年）９月２日，発行者：一般社団法人日本機械学会（東京都新宿区信濃町３５番地信濃町煉瓦館５階），公開者：深沢剛司（三菱ＦＢＲシステムズ株式会社），公開された発明の内容：人工知能を活用した減衰要素のモデル化に関する発明〔刊行物等〕集会名：２０１９年度年次大会，開催日：令和元年（２０１９年）９月８日～１１日，主催者：一般社団法人日本機械学会（東京都新宿区信濃町３５番地信濃町煉瓦館５階），開催場所：秋田大学手形キャンパス（秋田県秋田市手形学園町１番１号），公開者：深沢剛司（三菱ＦＢＲシステムズ株式会社），公開日：令和元年９月９日，公開された発明の内容：人工知能を活用した減衰要素のモデル化に関する発明Article 30, paragraph 2 of the Patent Act applies Publication name: 2019 Annual Conference Manzu, Abe! The Society of Mechanical Engineers! -Shittage Omoshie Technology Gathered in Akita- Sustainability, Mechanical Engineering in an AI Society Technology to Support the Low Birthrate, Aging Population, and Labor Shortage, Published: September 2, 2019, Publisher: The Japan Society of Mechanical Engineers (5F Shinanomachi Rengakan, 35 Shinanomachi, Shinjuku-ku, Tokyo), Disclosed by: Takeshi Fukazawa (Mitsubishi FBR Systems Co., Ltd.), Disclosed invention: Invention related to modeling of damping elements using artificial intelligence [Publications, etc.] Meeting name: 2019 Annual Conference, Date held: September 8-11, 2019, Organizer: The Japan Society of Mechanical Engineers (5F Shinanomachi Rengakan, 35 Shinanomachi, Shinjuku-ku, Tokyo), Venue: Akita University Tegata Campus (1-1 Tegata Gakuen-cho, Akita City, Akita Prefecture), Disclosed by: Takeshi Fukazawa (Mitsubishi FBR Systems Co., Ltd.), Publication date: September 9, 2019. Details of the disclosed invention: An invention related to modeling of damping elements using artificial intelligence.

本発明は、人工知能を利用した予測装置、学習装置及びプログラムに関し、特に、力学モデルにおける荷重、速度及び変位の時系列データのうち何れか２つの入力時系列データから、もう１つの出力時系列データを予測あるいは学習する技術に関する。 The present invention relates to a prediction device, learning device, and program that utilize artificial intelligence, and in particular to a technology for predicting or learning output time series data from any two of input time series data of load, velocity, and displacement in a mechanical model.

建築構造物等の耐震性を検証するために時刻歴応答解析が活用されている（例えば、非特許文献１参照）。時刻歴応答解析を行う場合、構造物の弾性域から終局域に至る応答を把握する必要がある。 Time history response analysis is used to verify the earthquake resistance of architectural structures and the like (see, for example, Non-Patent Document 1). When performing time history response analysis, it is necessary to understand the response of the structure from its elastic region to its ultimate region.

深沢ほか，“高振動・低振幅に着目したオイルダンパーの解析モデルに関する検討”，日本建築学会構造計論文集，一般社団法人日本建築学会，2018 年 12 月，第 83 巻，第 754 号，p.1777－1787Fukazawa et al., “Study on analytical model of oil damper focusing on high vibration and low amplitude”, Proceedings of the Architectural Institute of Japan, Structural Design Journal, Architectural Institute of Japan, December 2018, Vol. 83, No. 754, pp. 1777-1787

ところが、一般に終局域の挙動は強い非線形性を伴うため、終局域の挙動を数式でモデル化することが難しい。たとえこれをモデル化することができたものとしても、その構成式は複雑なものなる。 However, since the behavior in the ultimate region is generally highly nonlinear, it is difficult to model the behavior in the ultimate region mathematically. Even if it were possible to model it, the constitutive equation would be complicated.

本発明は、上記事情に鑑みてなされたものであって、数式でモデル化することができない力学モデルであっても時刻歴応答を算出することを目的とする。 The present invention was made in consideration of the above circumstances, and aims to calculate the time history response even for mechanical models that cannot be modeled mathematically.

以上の課題を解決するために、土木建築構造物をモデル化した力学モデルにおける荷重、速度及び変位の時系列データのうち何れか２つの入力時系列データから、もう１つの出力時系列データを予測する予測装置は、入力から出力に向けて順に入力層、隠れ層及び出力層を有するリカレントニューラルネットワークを構築するＲＮＮ構築部と、前記２つの入力時系列データを前記入力層に入力して、予めの学習により得られた複数の重みからなる学習済みパラメータ群を前記隠れ層に割り当てて前記隠れ層を活性化させることによって前記出力層における前記出力時系列データを算出する予測部と、を備え、前記ＲＮＮ構築部は複数の長短期記憶ブロックが時系列で結合されてなる層を前記隠れ層として構築する。 In order to solve the above problems, a prediction device that predicts output time series data from any two of input time series data of time series data of load, velocity, and displacement in a mechanical model that models a civil engineering and architectural structure includes an RNN construction unit that constructs a recurrent neural network having an input layer, a hidden layer, and an output layer in that order from input to output, and a prediction unit that inputs the two input time series data to the input layer, assigns a learned parameter group consisting of a plurality of weights obtained by prior learning to the hidden layer, and activates the hidden layer to calculate the output time series data in the output layer , and the RNN construction unit constructs a layer in which a plurality of long and short-term memory blocks are connected in time series as the hidden layer .

また、土木建築構造物をモデル化した力学モデルにおける荷重、速度及び変位の時系列データのうち何れか２つの入力時系列データともう１つの正解時系列データとを学習させる学習装置は、入力から出力に向けて順に入力層、隠れ層及び出力層を有するリカレントニューラルネットワークを構築するＲＮＮ構築部と、前記２つの入力時系列データを前記入力層に入力して、複数の重みからなる学習済みパラメータ群を前記隠れ層に割り当てて前記隠れ層を活性化させることによって前記出力層における出力時系列データを算出し、前記出力時系列データと前記正解時系列データの損失を算出し、バックプロパゲーション処理により前記損失を前記複数の重みでそれぞれ微分することによって微分パラメータを算出し、勾配降下法により前記微分パラメータと前記複数の重みとに基づいて前記複数の重みを更新する学習部と、を備え、前記ＲＮＮ構築部は複数の長短期記憶ブロックが時系列で結合されてなる層を前記隠れ層として構築する。
Further, a learning device for learning any two input time series data of time series data of load, velocity, and displacement in a mechanical model that models a civil engineering and architectural structure and another ground truth time series data includes an RNN construction unit that constructs a recurrent neural network having an input layer, a hidden layer, and an output layer in that order from input to output, and a learning unit that inputs the two input time series data to the input layer, assigns a learned parameter group consisting of a plurality of weights to the hidden layer and activates the hidden layer to calculate output time series data in the output layer, calculates losses of the output time series data and the ground truth time series data, calculates differential parameters by differentiating the losses with the plurality of weights by backpropagation processing, and updates the plurality of weights based on the differential parameters and the plurality of weights by a gradient descent method , and the RNN construction unit constructs a layer in which a plurality of long and short-term memory blocks are connected in time series as the hidden layer .

本発明によれば、力学モデルが数式でモデル化することができなくても、荷重、速度及び変位の時系列データのうち何れか２つの入力時系列データからもう１つの出力時系列データを算出できる。出力時系列データは、時間経過に伴って変化する荷重、速度及び変位の何れか２つが力学モデルに与えられた場合の応答である。 According to the present invention, even if the mechanical model cannot be modeled by a formula, it is possible to calculate another output time series data from any two of the input time series data of the load, velocity, and displacement. The output time series data is the response when any two of the load, velocity, and displacement that change over time are given to the mechanical model.

モデル化された減衰装置の概略図である。FIG. 2 is a schematic diagram of a modeled damping device. 学習装置及び予測装置を兼ねた人工知能装置のブロック図である。FIG. 1 is a block diagram of an artificial intelligence device that functions as both a learning device and a prediction device. リカレントニューラルネットワークの模式図である。FIG. 1 is a schematic diagram of a recurrent neural network. 長短期記憶ブロックの模式図である。FIG. 1 is a schematic diagram of long short-term memory blocks.

〔１．人工知能装置の構成〕
図１は、力学的にモデル化された減衰装置９０の概略図である。図２は、人工知能装置１のブロック図である。 [1. Configuration of the artificial intelligence device]
1 is a schematic diagram of a dynamically modelled damping system 90. FIG. 2 is a block diagram of an artificial intelligence system 1.

減衰装置９０は例えばオイルダンパーである。減衰装置９０を力学的にモデル化すると、減衰装置９０がダッシュポット９１、弾性バネ９２、弾性バネ９３及び摩擦抵抗９４から構成され、ダッシュポット９１、弾性バネ９２及び弾性バネ９３が直列され、これらの直列体と摩擦抵抗９４が並列されている。ダッシュポット９１及び弾性バネ９２が減衰装置９０の粘弾性要素であり、弾性バネ９３が減衰装置９０の弾性要素であり、摩擦抵抗９４が減衰装置９０の可動部（例えば、ロッド）に生じるものである。地震、振動又は衝突等が生じると、時間経過に伴い変化する荷重及び速度が減衰装置９０に与えられ、それに伴って減衰装置９０が応答して変位する。図２に示す人工知能装置１は、減衰装置９０の応答、つまり時間経過に伴う減衰装置９０の変位の変化を予測する。つまり、人工知能装置１は、人工知能を利用して、荷重の時系列データと速度の時系列データから、変位の時系列データを算出する。 The damping device 90 is, for example, an oil damper. When the damping device 90 is mechanically modeled, it is composed of a dashpot 91, an elastic spring 92, an elastic spring 93, and a friction resistance 94, and the dashpot 91, the elastic spring 92, and the elastic spring 93 are connected in series, and the series is connected in parallel with the friction resistance 94. The dashpot 91 and the elastic spring 92 are viscoelastic elements of the damping device 90, the elastic spring 93 is an elastic element of the damping device 90, and the friction resistance 94 is generated in a moving part (e.g., a rod) of the damping device 90. When an earthquake, vibration, collision, or the like occurs, a load and a speed that change over time are applied to the damping device 90, and the damping device 90 responds and displaces accordingly. The artificial intelligence device 1 shown in FIG. 2 predicts the response of the damping device 90, that is, the change in the displacement of the damping device 90 over time. In other words, the artificial intelligence device 1 uses artificial intelligence to calculate the time series data of the displacement from the time series data of the load and the time series data of the speed.

人工知能装置１は、予測装置と学習装置を兼ねたものである。人工知能装置１は、演算処理装置１０、表示デバイス３０、入力デバイス４０及びストレージ５０等を備える。
演算処理装置１０は、ＣＰＵ、ＧＰＵ、ＲＡＭ、システムバス及び各種インターフェース等を備える。 The artificial intelligence device 1 functions as both a prediction device and a learning device, and includes a processor 10, a display device 30, an input device 40, and a storage 50.
The arithmetic processing device 10 includes a CPU, a GPU, a RAM, a system bus, and various interfaces.

表示デバイス３０は例えば液晶ディスプレイデバイス、有機ＥＬディスプレイデバイス又はプロジェクタである。演算処理装置１０が映像信号を生成し、その映像信号を表示デバイス１２に出力する。そうすると、映像信号に従った画面が表示デバイス３０によって表示される。 The display device 30 is, for example, a liquid crystal display device, an organic EL display device, or a projector. The arithmetic processing device 10 generates a video signal and outputs the video signal to the display device 12. Then, a screen according to the video signal is displayed by the display device 30.

入力デバイス４０は例えばスイッチ、キーボード、ポインティングである。入力デバイス４０は、操作されると操作内容に応じた信号を演算処理装置１０に出力する。 The input device 40 is, for example, a switch, a keyboard, or a pointing device. When the input device 40 is operated, it outputs a signal corresponding to the operation to the arithmetic processing device 10.

ストレージ５０には、プログラム５１、教師データ５２、学習済みパラメータ群５６及び分析対象データ５７が格納されている。 The storage 50 stores a program 51, training data 52, a learned parameter set 56, and data to be analyzed 57.

教師データ５２は、後述のリカレントニューラルネットワーク２００（図３参照）の学習に利用される。教師データ５２は、互いに対応付けされた荷重の時系列データ５３と速度の時系列データ５４と変位の時系列データ５５とから構成される。これら時系列データ５３，５４，５５は、それぞれ荷重、速度、変位の値が時系列で配列された離散的なデータ列である。変位の時系列データ５５は、正解として利用される。 The teacher data 52 is used for learning the recurrent neural network 200 (see FIG. 3) described below. The teacher data 52 is composed of load time series data 53, velocity time series data 54, and displacement time series data 55, which are associated with each other. These time series data 53, 54, and 55 are discrete data strings in which the load, velocity, and displacement values are arranged in time series, respectively. The displacement time series data 55 is used as the correct answer.

これら時系列データ５３、５４，５５は予め行われたシミュレーション又は実験により求められたものである。非日常的な地震或いは衝撃から日常的な微振動までを想定して、シミュレーション又は実験ごとの荷重、速度及び変位の条件を変化させて、多数回のシミュレーション又は実験により得られた多数の教師データ５２がストレージ５０に格納されている。 These time series data 53, 54, 55 were obtained by simulations or experiments carried out in advance. Assuming a range of events from extraordinary earthquakes or shocks to everyday micro-vibrations, the load, speed, and displacement conditions were changed for each simulation or experiment, and a large number of teacher data 52 obtained by performing multiple simulations or experiments are stored in storage 50.

学習済みパラメータ群５６は、リカレントニューラルネットワーク２００の多数のニューロンにおける重み及びバイアスから構成されている。学習済みパラメータ群５６は、リカレントニューラルネットワーク２００が学習する度に更新される。なお、リカレントニューラルネットワーク２００が未学習状態である場合、学習済みパラメータ群５６の各重み及びバイアスの初期値は乱数である。 The trained parameter group 56 is composed of weights and biases for a large number of neurons in the recurrent neural network 200. The trained parameter group 56 is updated each time the recurrent neural network 200 trains. When the recurrent neural network 200 is in an untrained state, the initial values of each weight and bias in the trained parameter group 56 are random numbers.

分析データ５７は、リカレントニューラルネットワーク２００を用いた変位の時系列データの予測に利用される。分析データ５７は、互いに対応付けされた荷重の時系列データ５８と速度の時系列データ５９とから構成される。これら時系列データ５８，５９は、それぞれ荷重、速度の値が時系列で配列された離散的なデータ列である。 The analysis data 57 is used to predict the time series data of displacement using the recurrent neural network 200. The analysis data 57 is composed of time series data 58 of load and time series data 59 of velocity, which are associated with each other. These time series data 58 and 59 are discrete data strings in which the load and velocity values are respectively arranged in time series.

プログラム５１は、演算処理装置１０にとって読取可能及び実行可能なものである。演算処理装置１０が、プログラム５１を実行することによって、ＲＮＮモデル構築部１１、学習部１２及び予測部１３として機能する。ＲＮＮモデル構築部１１及び学習部１２は学習装置の機能であり、ＲＮＮモデル構築部１１及び予測部１３は予測装置の機能である。以下、ＲＮＮモデル構築部１１、学習部１２及び予測部１３の機能について詳細に説明する。 The program 51 is readable and executable by the arithmetic processing device 10. By executing the program 51, the arithmetic processing device 10 functions as an RNN model construction unit 11, a learning unit 12, and a prediction unit 13. The RNN model construction unit 11 and the learning unit 12 are functions of a learning device, and the RNN model construction unit 11 and the prediction unit 13 are functions of a prediction device. The functions of the RNN model construction unit 11, the learning unit 12, and the prediction unit 13 are described in detail below.

〔２．ＲＮＮモデル構築部〕
ＲＮＮモデル構築部１１は、図３に示すように、積層された長短期記憶（Long short-term memory：LSTM）のアーキテクチャを利用したリカレントニューラルネットワーク２００を構築する。リカレントニューラルネットワーク２００は再帰型ニューラルネットワークともいう。リカレントニューラルネットワーク２００は、入力層２１０、４層の隠れ層２２０、２３０，２４０，２５０、全結合層２６０、ドロップアウト層２７０及び出力層２８０から構成されている。 [2. RNN model construction section]
The RNN model construction unit 11 constructs a recurrent neural network 200 using a stacked long short-term memory (LSTM) architecture, as shown in Fig. 3. The recurrent neural network 200 is also called a recurrent neural network. The recurrent neural network 200 is composed of an input layer 210, four hidden layers 220, 230, 240, and 250, a fully connected layer 260, a dropout layer 270, and an output layer 280.

入力層２１０は、時系列の複数の入力ユニット２１１から構成されている。入力層２１０には、２種類の時系列データ、具体的には変位の時系列データと速度の時系列データが入力される。つまり、入力層２１０の各入力ユニット２１１には２つのニューロンが用いられ、変位データと速度データがこれらニューロンにそれぞれ入力される。従って、各入力ユニット２１１に入力される入力データｘ_ｔはベクトルであり、入力データｘ_ｔの要素は同一ステップ（つまり、同一時刻）の変位データと速度データからなる。ここで、添字の「ｔ」は時系列のステップ順を表す。 The input layer 210 is composed of a plurality of time-series input units 211. Two types of time-series data, specifically, displacement time-series data and velocity time-series data, are input to the input layer 210. That is, two neurons are used for each input unit 211 of the input layer 210, and displacement data and velocity data are input to these neurons, respectively. Therefore, the input data _xt input to each input unit 211 is a vector, and the elements of the input data _xt consist of displacement data and velocity data of the same step (i.e., the same time). Here, the subscript "t" represents the order of the steps in the time series.

１層目の隠れ層２２０は、時系列の長短期記憶ブロック２２１から構成されている。２層目～４層目の隠れ層２３０，２４０，２５０もそれぞれ時系列の長短期記憶ブロック２３１，２４１，２５１から構成されている。ここで、ｊを１から４の任意の数とした場合、添字の「（ｔ，ｊ）」の「ｊ」は隠れ層２１０，２３０，２４０，２５０の層順を表す。 The first hidden layer 220 is composed of a time-series long short-term memory block 221. The second to fourth hidden layers 230, 240, and 250 are also composed of time-series long short-term memory blocks 231, 241, and 251, respectively. Here, if j is an arbitrary number from 1 to 4, the "j" in the subscript "(t,j)" represents the layer order of the hidden layers 210, 230, 240, and 250.

隠れ層２１０，２３０，２４０，２５０の各々では、或るステップにおけるニューロンの出力が、次のステップのニューロンの入力と次の層のニューロンの入力とに結合されている。具体的には、１層目の各長短期記憶ブロック２２１は、同一ステップの入力ユニット２１１から入力データｘ_ｔを入力するとともに前のステップの長短期記憶ブロック２２１から隠れ状態データｈ_{（ｔ－１，１）}を入力すると、隠れ状態データｈ_{（ｔ，１）}を次のステップの長短期記憶ブロック２２１に出力するとともにデータｙ_{（ｔ，１）}を次の層の長短期記憶ブロック２３１に出力する。また、２層目の長短期記憶ブロック２３１は、１層目の長短期記憶ブロック２２１からデータｙ_{（ｔ，１）}を入力するとともに前のステップの長短期記憶ブロック２３１から隠れ状態データｈ_{（ｔ－１，２）}を入力すると、隠れ状態データｈ_{（ｔ，２）}を次のステップの長短期記憶ブロック２３１に出力するとともにデータｙ_{（ｔ，２）}を次の層の長短期記憶ブロック２４１に出力する。また、３層目の長短期記憶ブロック２４１は、２層目の長短期記憶ブロック２３１からデータｙ_{（ｔ，２）}を入力するとともに前のステップの長短期記憶ブロック２４１から隠れ状態データｈ_{（ｔ－１，３）}を入力すると、隠れ状態データｈ_{（ｔ，３）}を次のステップの長短期記憶ブロック２４１に出力するとともにデータｙ_{（ｔ，３）}を次の層の長短期記憶ブロック２５１に出力する。また、４層目の長短期記憶ブロック２５１は、３層目の長短期記憶ブロック２４１からデータｙ_{（ｔ，３）}を入力するとともに前のステップの長短期記憶ブロック２５１から隠れ状態データｈ_{（ｔ－１，４）}を入力すると、隠れ状態データｈ_{（ｔ，４）}を次のステップの長短期記憶ブロック２５１に出力するとともにデータｙ_{（ｔ，４）}を全結合層２６０の同一ステップの全結合ブロック２６１に出力する。１ステップの長短期記憶ブロック２２１，２３１，２４１，２５１の数は長短期記憶ブロック２２１，２３１，２４１，２５１の順に多く、例えば１ステップの長短期記憶ブロック２２１，２３１，２４１，２５１の数がそれぞれ２００，１２５，７８，４９である。 In each of the hidden layers 210, 230, 240, and 250, the output of a neuron in a certain step is connected to the input of a neuron in the next step and the input of a neuron in the next layer. Specifically, when each long short-term memory block 221 in the first layer inputs input data _xt from the input unit 211 of the same step and inputs hidden state data h _(t-1,1) from the long short-term memory block 221 of the previous step, it outputs the hidden state data h _(t,1) to the long short-term memory block 221 of the next step and outputs data y _(t,1) to the long short-term memory block 231 of the next layer. Furthermore, when the second-layer long short-term memory block 231 inputs data y _(t,1) from the first-layer long short-term memory block 221 and hidden state data h _(t-1,2) from the previous step long short-term memory block 231, it outputs the hidden state data h _(t,2) to the next step long short-term memory block 231 and outputs data y _(t,2) to the next layer long short-term memory block 241. Furthermore, when the third-layer long short-term memory block 241 inputs data y( _t,2) from the second-layer long short-term memory block 231 and hidden state data h _(t-1,3) from the previous step long short-term memory block 241, it outputs hidden state data h _(t,3) to the next step long short-term memory block 241 and outputs data y _(t,3) to the next layer long short-term memory block 251. Furthermore, when the long short-term memory block 251 in the fourth layer inputs data y _(t,3) from the long short-term memory block 241 in the third layer and inputs hidden state data h _(t-1,4) from the long short-term memory block 251 in the previous step, it outputs the hidden state data h _(t,4) to the long short-term memory block 251 in the next step and outputs data y _(t,4) to the fully connected block 261 of the same step in the fully connected layer 260. The numbers of long short-term memory blocks 221, 231, 241, 251 in one step are greatest in the order of long short-term memory blocks 221, 231, 241, 251. For example, the numbers of long short-term memory blocks 221, 231, 241, 251 in one step are 200, 125, 78, 49, respectively.

全結合層２６０は、時系列の全結合ブロック２６１から構成されている。全結合ブロック２６１は複数、例えば３０のニューロンを有し、全結合ブロック２６１のニューロンが同一ステップの４層目の長短期記憶ブロック２５１の出力に全結合されている。全結合ブロック２６１は、同一ステップの長短期記憶ブロック２５１の出力データｙ_{（ｔ，４）}を入力すると、それらに重みを乗算して、その積にバイアスを加算する。 The fully connected layer 260 is composed of a time-series fully connected block 261. The fully connected block 261 has a plurality of neurons, for example 30 neurons, and the neurons of the fully connected block 261 are fully connected to the output of the long short-term memory block 251 in the fourth layer of the same step. When the fully connected block 261 receives the output data y _(t,4) of the long short-term memory block 251 in the same step, it multiplies them by weights and adds a bias to the product.

ドロップアウト層２７０は、時系列のドロップアウトブロック２７１から構成されている。ドロップアウトブロック２７１は複数のニューロンを有し、ドロップアウトブロック２７１のニューロンが同一ステップの全結合ブロック２６１のニューロンに全結合されている。学習の際に幾つかのニューロンを無効にするように更新し、予測の際に同一ステップの全結合ブロック２６１のニューロンからこのドロップアウトブロック２７１の無効ニューロンへの入力をゼロに設定する。これにより、ドロップアウトブロック２７１は、過学習による精度低下を低減させるものである。 The dropout layer 270 is composed of a time-series dropout block 271. The dropout block 271 has multiple neurons, and the neurons of the dropout block 271 are fully connected to the neurons of the fully connected block 261 of the same step. During learning, some neurons are updated to be disabled, and during prediction, the input from the neurons of the fully connected block 261 of the same step to the disabled neurons of this dropout block 271 is set to zero. In this way, the dropout block 271 reduces the deterioration of accuracy due to overlearning.

出力層２８０は、時系列の複数の出力ユニット２８１から構成されている。入力層２８０の各出力ユニット２８１には１つのニューロンが用いられ、出力ユニット２８１のニューロンが同一ステップのドロップアウトブロック２７１のニューロンに結合され、出力ユニット２８１は、ドロップアウトブロック２７１の有効なニューロンの出力を入力すると、出力データｚ_ｔを出力する。よって、各出力ユニット２８１に出力される出力データｚ_ｔはスカラーの荷重データであり、出力層２８０から１種類の時系列データ、具体的には荷重の時系列データが出力される。 The output layer 280 is composed of a plurality of time-series output units 281. One neuron is used for each output unit 281 in the input layer 280, and the neuron of the output unit 281 is connected to the neuron of the dropout block 271 of the same step, and when the output unit 281 receives the output of a valid neuron of the dropout block 271, it outputs output data _zt . Therefore, the output data _zt output to each output unit 281 is scalar weight data, and one type of time-series data, specifically, weight time-series data, is output from the output layer 280.

図４を参照して、長短期記憶ブロック２２１，２３１，２４１，２５１について説明する。
長短期記憶ブロック２２１，２３１，２４１，２５１は、入力ブロック３１０、入力ゲートブロック３２０、忘却ゲートブロック３３０、出力ゲートブロック３４０、乗算ブロック３５０、乗算ブロック３６０、加算ブロック３７０、メモリセル３７５、活性化関数ブロック３８０及び乗算ブロック３９０から構成されている。ここで、１層目の長短期記憶ブロック２２１の場合には、図４における入力データｖ_{（ｔ，ｊ－１）}が図３における入力データｘ_ｔであり、図４における出力データｖ_{（ｔ，ｊ）}が図３におけるデータｙ_{（ｔ，１）}である。２層目の長短期記憶ブロック２３１の場合には、図４における入力データｖ_{（ｔ，ｊ－１）}が図３におけるデータｙ_{（ｔ，１）}であり、図４における出力データｖ_{（ｔ，ｊ）}が図３におけるデータｙ_{（ｔ，２）}である。３層目の長短期記憶ブロック２４１の場合には、図４における入力データｖ_{（ｔ，ｊ－１）}が図３におけるデータｙ_{（ｔ，２）}であり、図４における出力データｖ_{（ｔ，ｊ）}が図３におけるデータｙ_{（ｔ，３）}である。４層目の長短期記憶ブロック２５１の場合には、図４における入力データｖ_{（ｔ，ｊ－１）}が図３におけるデータｙ_{（ｔ，３）}であり、図４における出力データｖ_{（ｔ，ｊ）}が図３におけるデータｙ_{（ｔ，４）}である。 The long short-term memory blocks 221, 231, 241, and 251 will now be described with reference to FIG.
The long short-term memory blocks 221, 231, 241, and 251 are composed of an input block 310, an input gate block 320, a forget gate block 330, an output gate block 340, a multiplication block 350, a multiplication block 360, an addition block 370, a memory cell 375, an activation function block 380, and a multiplication block 390. Here, in the case of the first-layer long short-term memory block 221, the input data v _(t,j-1) in Fig. 4 is the input data _xt in Fig. 3, and the output data v _(t,j) in Fig. 4 is the data y _(t,1) in Fig. 3. In the case of the second-layer long short-term memory block 231, the input data v _(t,j-1) in Fig. 4 is the data y _(t,1) in Fig. 3, and the output data v _(t,j) in Fig. 4 is the data y _(t,2) in Fig. 3. In the case of the third-layer long short-term memory block 241, the input data v _(t,j-1) in Fig. 4 is the data y _(t,2) in Fig. 3, and the output data v _(t,j) in Fig. 4 is the data y _(t,3) in Fig. 3. In the case of the fourth-layer long short-term memory block 251, the input data v _(t,j-1) in Fig. 4 is the data y _(t,3) in Fig. 3, and the output data v _(t,j) in Fig. 4 is the data y _(t,4) in Fig. 3.

入力ブロック３１０は、入力データｖ_{（ｔ，ｊ－１）}及び隠れ状態データｈ_{（ｔ－１，ｊ）}を入力すると、次式（１）により入力ブロック３１０の状態データｓ_{（ｔ，ｊ）}を算出する。ここで、入力ブロック３１０は重み付け和ブロック３１１及び活性化関数ブロック３１２から構成されている。重み付け和ブロック３１１は、入力データｖ_{（ｔ，ｊ－１）}と重みＵ_{ｓ（ｔ，ｊ）}の積と、隠れ状態データｈ_{（ｔ－１，ｊ）}と重みＷ_{ｓ（ｔ，ｊ）}の積と、バイアスｂ_{ｓ（ｔ，ｊ）}とを加算することによって、それらの和をネット値として算出する。活性化関数ブロック３１２は、重み付け和ブロック３１１によって算出されたネット値を活性化関数（具体的には、双曲線正接関数）に当て嵌めることによって、入力ブロック３１０の状態データｓ_{（ｔ，ｊ）}を算出する。 When the input data v _(t,j-1) and the hidden state data h _(t-1,j) are input to the input block 310, the input block 310 calculates the state data s _(t,j) of the input block 310 by the following formula (1). Here, the input block 310 is composed of a weighted sum block 311 and an activation function block 312. The weighted sum block 311 calculates the sum of the product of the input data v _(t,j-1) and the weight Us _(t, j), the product of the hidden state data h _(t-1,j) and the weight Ws _(t, _{j), and the bias bs(t,j)} as a net value by adding them. The activation function block 312 calculates the state data s _(t,j) of the input block 310 by fitting the net value calculated by the weighted sum block 311 to an activation function (specifically, a hyperbolic tangent function).

ここで、Ｕ_{ｓ（ｔ，ｊ）}及びＷ_{ｓ（ｔ，ｊ）}は入力ブロック３１０における重みであり、ｂ_{ｓ（ｔ，ｊ）}は入力ブロック３１０におけるバイアスである。 where U _{s (t,j)} and W _{s (t,j)} are the weights at the input block 310 and b _{s (t,j)} is the bias at the input block 310 .

入力ゲートブロック３２０は、入力データｖ_{（ｔ，ｊ－１）}及び隠れ状態データｈ_{（ｔ－１，ｊ）}を入力すると、次式（２）により入力ゲートブロック３２０の状態データｉ_{（ｔ，ｊ）}を算出する。ここで、入力ゲートブロック３２０は重み付け和ブロック３２１及び活性化関数ブロック３２２から構成されている。重み付け和ブロック３２１は、入力データｖ_{（ｔ，ｊ－１）}と重みＵ_{ｉ（ｔ，ｊ）}の積と、隠れ状態データｈ_{（ｔ－１，ｊ）}と重みＷ_{ｉ（ｔ，ｊ）}の積と、バイアスｂ_{ｉ（ｔ，ｊ）}とを加算することによって、その和をネット値として算出する。活性化関数ブロック３２２は、重み付け和ブロック３２１によって算出されたネット値を活性化関数（具体的には、シグモイド関数）に当て嵌めることによって、入力ゲートブロック３２０の状態データｉ_{（ｔ，ｊ）}を算出する。 When the input data v _(t,j-1) and the hidden state data h _(t-1,j) are input, the input gate block 320 calculates the state data i _(t,j) of the input gate block 320 by the following formula (2). Here, the input gate block 320 is composed of a weighted sum block 321 and an activation function block 322. The weighted sum block 321 calculates the sum as a net value by adding the product of the input data v _(t,j-1) _and the weight U i(t,j), the product of the hidden state data h _(t-1,j ₎ and the weight W _{i(t,j), and the bias b i(} t,j). The activation function block 322 calculates the state data i _(t,j) of the input gate block 320 by fitting the net value calculated by the weighted sum block 321 to an activation function (specifically, a sigmoid function).

ここで、Ｕ_{ｉ（ｔ，ｊ）}及びＷ_{ｉ（ｔ，ｊ）}は入力ゲートブロック３２０における重みであり、ｂ_{ｉ（ｔ，ｊ）}は入力ゲートブロック３２０におけるバイアスである。 where U _{i (t,j)} and W _{i (t,j)} are the weights at the input gate block 320 and b _{i (t,j)} is the bias at the input gate block 320 .

忘却ゲートブロック３３０は、入力データｖ_{（ｔ，ｊ－１）}及び隠れ状態データｈ_{（ｔ－１，ｊ）}を入力すると、次式（３）により忘却ゲートブロック３３０の状態データｆ_{（ｔ，ｊ）}を算出する。ここで、忘却ゲートブロック３３０は重み付け和ブロック３３１及び活性化関数ブロック３３２から構成されている。重み付け和ブロック３３１は、入力データｖ_{（ｔ，ｊ－１）}と重みＵ_{ｆ（ｔ，ｊ）}の積と、隠れ状態データｈ_{（ｔ－１，ｊ）}と重みＷ_{ｆ（ｔ，ｊ）}の積と、バイアスｂ_{ｆ（ｔ，ｊ）}とを加算して、その和をネット値として算出する。活性化関数ブロック３３２は、重み付け和ブロック３３１によって算出されたネット値を活性化関数（具体的には、シグモイド関数）に当て嵌めることによって、入力ゲートブロック３２０の状態データｉ_{（ｔ，ｊ）}を算出する。 When the forgetting gate block 330 receives the input data v _(t,j-1) and the hidden state data h _(t-1,j) , it calculates the state data f _(t,j) of the forgetting gate block 330 by the following formula (3). Here, the forgetting gate block 330 is composed of a weighted sum block 331 and an activation function block 332. The weighted sum block 331 adds the product of the input data v _(t,j-1) and the weight Uf _(t,j) , the product of the hidden state data h _(t-1,j) and the weight Wf _(t,j) , and a bias bf _(t,j) , and calculates the sum as a net value. The activation function block 332 calculates the state data i _(t,j) of the input gate block 320 by fitting the net value calculated by the weighted sum block 331 to an activation function (specifically, a sigmoid function).

ここで、Ｕ_{ｆ（ｔ，ｊ）}及びＷ_{ｆ（ｔ，ｊ）}は忘却ゲートブロック３３０における重みであり、ｂ_{ｆ（ｔ，ｊ）}は忘却ゲートブロック３３０におけるバイアスである。 where U _f(t,j) and W _f(t,j) are the weights in the forget gate block 330 and b _f(t,j) is the bias in the forget gate block 330.

出力ゲートブロック３４０は、入力データｖ_{（ｔ，ｊ－１）}及び隠れ状態データｈ_{（ｔ－１，ｊ）}を入力すると、次式により出力ゲートブロック３４０の状態データｏ_{（ｔ，ｊ）}を算出する。ここで、出力ゲートブロック３４０は重み付け和ブロック３４１及び活性化関数ブロック３４２から構成され、重み付け和ブロック３４１は、入力データｖ_{（ｔ，ｊ－１）}と重みＵ_{ｏ（ｔ，ｊ）}の積と、隠れ状態データｈ_{（ｔ－１，ｊ）}と重みＷ_{ｏ（ｔ，ｊ）}の積と、バイアスｂ_{ｏ（ｔ，ｊ）}とを加算することによって、それら和をネット値として算出する。活性化関数ブロック３４２は、重み付け和ブロック３４１によって算出されたネット値を活性化関数（具体的には、シグモイド関数）に当て嵌めることによって、入力ゲートブロック３２０の状態データｏ_{（ｔ，ｊ）}を算出する。 When the output gate block 340 receives input data v _(t,j-1) and hidden state data h _(t-1,j) , it calculates state data o _(t,j) of the output gate block 340 by the following formula. Here, the output gate block 340 is composed of a weighted sum block 341 and an activation function block 342. The weighted sum block 341 adds the product of the input data v _(t,j-1) and the weight _Uo(t,j) , the product of the hidden state data h _(t-1,j) and the weight Wo _(t,j) , and a bias b _o(t,j) to calculate the sum as a net value. The activation function block 342 calculates state data o( _{t,j) of the input gate block 320 by fitting the net value calculated by the weighted sum block 341 to an activation function (specifically, a sigmoid function)} .

ここで、Ｕ_{ｏ（ｔ，ｊ）}及びＷ_{ｏ（ｔ，ｊ）}は出力ゲートブロック３４０における重みであり、ｂ_{ｏ（ｔ，ｊ）}は出力ゲートブロック３４０におけるバイアスである。 where U _{o (t, j)} and W _{o (t, j)} are the weights at the output gate block 340 and bo _{(t, j)} is the bias at the output gate block 340 .

乗算ブロック３５０は、入力ブロック３１０によって算出された状態データｉ_{（ｔ，ｊ）}と入力ゲートブロック３２０によって算出された状態データｉ_{（ｔ，ｊ）}を入力すると、これらを次式（５）のようにアダマール乗算することによってアダマール積を算出する。アダマール積とは、同じサイズの行列の要素ごとの積をいう。 The multiplication block 350 receives the state data i _(t,j) calculated by the input block 310 and the state data i _(t,j) calculated by the input gate block 320, and calculates the Hadamard product by Hadamard multiplication of these data as shown in the following equation (5). The Hadamard product is the product of each element of matrices of the same size.

乗算ブロック３６０は、忘却ゲートブロック３３０によって算出された状態データｆ_{（ｔ，ｊ）}と、メモリセル３７５に保持された前のステップの加算ブロック３７０の計算結果（記憶データＣ_{（ｔ－１，ｊ）}）とを入力すると、これらを次式（６）のようにアダマール乗算することによってアダマール積を算出する。 Multiplication block 360 receives state data f _(t,j) calculated by forget gate block 330 and the calculation result of addition block 370 in the previous step (stored data C _(t-1,j) ) held in memory cell 375, and calculates the Hadamard product by Hadamard multiplication of these data as shown in the following equation (6).

加算ブロック３７０は、乗算ブロック３５０によって算出されたアダマール積と、乗算ブロック３６０によって算出されたアダマール積とを入力すると、これらを次式（７）のように加算して、その計算結果（記憶データＣ_{（ｔ，ｊ）}）をメモリセル３７５に格納する。 Addition block 370 receives the Hadamard product calculated by multiplication block 350 and the Hadamard product calculated by multiplication block 360, adds them together as shown in the following equation (7), and stores the calculation result (storage data C _{(t, j)} ) in memory cell 375.

式（７）はメモリセル３７５の更新式である。つまり、メモリセル３７５は、保持していた前のステップの加算ブロック３７０の計算結果（記憶データＣ_{（ｔ－１，ｊ）}）を本ステップの加算ブロック３７０の計算結果（記憶データＣ_{（ｔ，ｊ）}）に更新して、それを保持する。 Equation (7) is an update equation for memory cell 375. That is, memory cell 375 updates the calculation result (stored data C _(t-1,j) ) of addition block 370 in the previous step that it has held to the calculation result (stored data C _(t,j) ) of addition block 370 in the current step, and holds it.

活性化関数ブロック３８０は、メモリセル３７５に更新保持された加算ブロック３７０の計算結果（記憶データＣ_{（ｔ，ｊ）}）を活性化関数（具体的には、双曲線正接関数）に当て嵌めることによって、活性化状態データを算出する。 The activation function block 380 calculates activation state data by fitting the calculation result (stored data C _(t,j) ) of the addition block 370 updated and stored in the memory cell 375 to an activation function (specifically, a hyperbolic tangent function).

乗算ブロック３３０は、忘却ゲートブロック３３０によって算出された状態データｆ_{（ｔ，ｊ）}と、活性化関数ブロック３８０によって算出された活性化状態データとを入力すると、これらを次式（８）のようにアダマール乗算することによって隠れ状態データｈ_{（ｔ，ｊ）}を算出する。 The multiplication block 330 inputs the state data f _{(t, j)} calculated by the forget gate block 330 and the activation state data calculated by the activation function block 380, and calculates hidden state data h _{(t, j)} by Hadamard multiplication of these as shown in the following equation (8).

図３及び図４に示すように、乗算ブロック３３０によって算出された隠れ状態データｈ_{（ｔ，ｊ）}は、同一層の次のステップに長短期記憶ブロックに出力される。乗算ブロック３３０によって算出された隠れ状態データｈ_{（ｔ，ｊ）}に等しいデータｙ_{（ｔ，ｊ）}が次の層の長短期ブロック又は全結合ブロック２６１に出力される。 3 and 4, the hidden state data h _(t,j) calculated by the multiplication block 330 is output to the long short-term memory block in the next step of the same layer. Data y _{(t,j) equal to the hidden state data h(t,j} ₎ calculated by the multiplication block 330 is output to the long short-term block or the full connection block 261 in the next layer.

上述のように構成されたリカレントニューラルネットワーク２００の各長短期記憶ブロック２２１，２３１，２４１，２５１における重みＵ_{ｓ（ｔ，ｊ）}，Ｗ_{ｓ（ｔ，ｊ）}，ｂ_{ｓ（ｔ，ｊ）}，Ｕ_{ｉ（ｔ，ｊ）}，Ｗ_{ｉ（ｔ，ｊ）}，ｂ_{ｉ（ｔ，ｊ）}，Ｕ_{ｆ（ｔ，ｊ）}，Ｗ_{ｆ（ｔ，ｊ）}，ｂ_{ｆ（ｔ，ｊ）}，Ｕ_{ｏ（ｔ，ｊ）}，Ｗ_{ｏ（ｔ，ｊ）}，ｂ_{ｏ（ｔ，ｊ）}は学習済みパラメータ群５６としてストレージ５０に記憶されている。未学習の場合、学習済みパラメータ群５６としてストレージ５０に予め記憶された重みＵ_{ｓ（ｔ，ｊ）}，Ｗ_{ｓ（ｔ，ｊ）}，ｂ_{ｓ（ｔ，ｊ）}，Ｕ_{ｉ（ｔ，ｊ）}，Ｗ_{ｉ（ｔ，ｊ）}，ｂ_{ｉ（ｔ，ｊ）}，Ｕ_{ｆ（ｔ，ｊ）}，Ｗ_{ｆ（ｔ，ｊ）}，ｂ_{ｆ（ｔ，ｊ）}，Ｕ_{ｏ（ｔ，ｊ）}，Ｗ_{ｏ（ｔ，ｊ）}，ｂ_{ｏ（ｔ，ｊ）}の値は乱数である。 The weights Us _(t,j) , Ws _{(t,j), bs(t,j)} , Ui(t,j), Wi _{(t,j), bi(t,j)} , _Uf(t,j) , Wf _(t,j) , bf _(t,j) , Uo _(t, _{j), Wo(t,j)} _, and bo _{(t,j) in each long and short-} _term _{memory block 221, 231, 241, and} 251 of the recurrent neural network 200 configured as described above are stored in the storage 50 as a learned parameter set 56. In the unlearned state, the values of weights _Us(t,j) , Ws _{(t,j), bs(t,j)} , Ui _{(t,j), Wi(t,j} ₎ , _bi(t,j) , Uf(t,j), _Wf(t,j) , _bf( _t,j) , _Uo( _t,j) , Wo _(t,j) , and bo _(t,j) pre-stored in storage 50 as learned parameter set 56 are random numbers.

〔３．学習部〕
学習部１２は、ＲＮＮモデル構築部１１によって構築されたリカレントニューラルネットワーク２００に多数の教師データ５２を学習させて、学習済みパラメータ群５６を更新する。以下、具体的に説明する。 [3. Learning Section]
The learning unit 12 causes the recurrent neural network 200 constructed by the RNN model construction unit 11 to learn a large amount of teacher data 52, and updates the learned parameter group 56. A specific description will be given below.

まず、学習部１２は、ストレージ５０から現在の学習済みパラメータ群５６を取得して、学習済みパラメータ群５６の重みＵ_{ｓ（ｔ，ｊ）}，Ｗ_{ｓ（ｔ，ｊ）}，ｂ_{ｓ（ｔ，ｊ）}，Ｕ_{ｉ（ｔ，ｊ）}，Ｗ_{ｉ（ｔ，ｊ）}，，ｂ_{ｉ（ｔ，ｊ）}，Ｕ_{ｆ（ｔ，ｊ）}，Ｗ_{ｆ（ｔ，ｊ）}，ｂ_{ｆ（ｔ，ｊ）}，Ｕ_{ｏ（ｔ，ｊ）}，Ｗ_{ｏ（ｔ，ｊ）}，ｂ_{ｏ（ｔ，ｊ）}をリカレントニューラルネットワーク２００の各長短期記憶ブロック２２１，２３１，２４１，２５１に割り当てる。 First, the learning unit 12 retrieves the current learned parameter set 56 from the storage 50 and assigns weights Us _{(t,j), Ws(t,j)} , bs _(t,j) _, Ui _{(t,j), Wi(t,j)} , bi _(t,j) , Uf(t,j ₎ , _Wf(t,j) , _bf(t,j) , Uo _(t,j) , Wo _(t _,j) , _bo(t,j) of the learned parameter set 56 to each long and short-term memory block 221, 231, 241, 251 of the recurrent neural network 200.

次に、学習部５６は、ストレージ５０から何れかの教師データ５２を取得して、教師データ５２の荷重及び速度の時系列データ５３、５４をリカレントニューラルネットワーク２００の入力層２１０に入力して、リカレントニューラルネットワーク２００のフォワード処理を実行する。これにより、学習部５６は、リカレントニューラルネットワーク２００の隠れ層２２０，２３０，２４０，２５０、全結合層２６０及びドロップアウト層２７０を活性化させることによって出力層２８０の各出力ユニット２８１における出力データｚ_ｔを算出して、出力データｚ_ｔを時系列で配列した出力時系列データを取得する。 Next, the learning unit 56 acquires any one of the teacher data 52 from the storage 50, and inputs the load and speed time series data 53, 54 of the teacher data 52 to the input layer 210 of the recurrent neural network 200 to execute forward processing of the recurrent neural network 200. As a result, the learning unit 56 activates the hidden layers 220, 230, 240, 250, the fully connected layer 260, and the dropout layer 270 of the recurrent neural network 200 to calculate output data _zt in each output unit 281 of the output layer 280, and acquires output time series data in which the output data _zt is arranged in time series.

次に、学習部５６は、各ステップにつき、教師データ５２の変位時系列データ５３における変位データと出力データｚ_ｔとを損失関数に当て嵌めて、教師データ５２の変位時系列データ５３における変位データと出力データｚ_ｔとの損失を算出する。損失関数は半平均二乗誤差関数であることが好ましいが、平均二乗誤差関数、平均絶対誤差関数、平均対数誤差関数又は交差エントロピー誤差関数であってもよい。 Next, the learning unit 56 fits the displacement data in the displacement time-series data 53 of the teacher data 52 and the output data _zt to a loss function for each step, and calculates the loss between the displacement data in the displacement time-series data 53 of the teacher data 52 and the output data _zt . The loss function is preferably a half mean squared error function, but may also be a mean squared error function, a mean absolute error function, a mean logarithmic error function, or a cross entropy error function.

次に、学習部５６は、バックプロパゲーション処理により、損失を各長短期記憶ブロック２２１，２３１，２４１，２５１における現在の重みＵ_{ｓ（ｔ，ｊ）}，Ｗ_{ｓ（ｔ，ｊ）}，ｂ_{ｓ（ｔ，ｊ）}，Ｕ_{ｉ（ｔ，ｊ）}，Ｗ_{ｉ（ｔ，ｊ）}，ｂ_{ｉ（ｔ，ｊ）}，Ｕ_{ｆ（ｔ，ｊ）}，Ｗ_{ｆ（ｔ，ｊ）}，ｂ_{ｆ（ｔ，ｊ）}，Ｕ_{ｏ（ｔ，ｊ）}，Ｗ_{ｏ（ｔ，ｊ）}，ｂ_{ｏ（ｔ，ｊ）}でそれぞれ微分した微分パラメータ（勾配）を算出する。ここで、微分パラメータの算出の際には、勾配クリッピング法が採用される。つまり、微分パラメータが所定の閾値を超える場合には、その微分パラメータは所定の閾値（例えば、1.0）で正規化される。 Next, the learning unit 56 calculates differential parameters (gradients) by differentiating the losses with the current weights Us _(t,j) , Ws _(t,j) , bs _(t, j), Ui _(t,j) , _Wi(t,j) , bi _(t,j) , Uf( _t, j), _Wf(t,j) , bf _(t,j) , Uo _(t,j) , _Wo(t,j) , and _bo(t,j) in each long short-term memory block 221, 231, 241, and 251 through backpropagation processing. Here, a gradient clipping method is used when calculating the differential parameters. In other words, if the differential parameters exceed a predetermined threshold, the differential parameters are normalized with a predetermined threshold (for example, 1.0).

次に、学習部５６は、勾配降下法、特にＡｄａｍの最適化アルゴリズムを利用した勾配降下法により、各長短期記憶ブロック２２１，２３１，２４１，２５１における現在の重みＵ_{ｓ（ｔ，ｊ）}とその微分パラメータとに基づいて新たな重みＵ_{ｓ（ｔ，ｊ）}を算出する。同様に、学習部５６は、各長短期記憶ブロック２２１，２３１，２４１，２５１における新たな重みＷ_{ｓ（ｔ，ｊ）}，ｂ_{ｓ（ｔ，ｊ）}，Ｕ_{ｉ（ｔ，ｊ）}，Ｗ_{ｉ（ｔ，ｊ）}，ｂ_{ｉ（ｔ，ｊ）}，Ｕ_{ｆ（ｔ，ｊ）}，Ｗ_{ｆ（ｔ，ｊ）}，ｂ_{ｆ（ｔ，ｊ）}，Ｕ_{ｏ（ｔ，ｊ）}，Ｗ_{ｏ（ｔ，ｊ）}，ｂ_{ｏ（ｔ，ｊ）}を算出する。Ａｄａｍの最適化アルゴリズムの場合、微分パラメータの１次モーメント（平均値）に係わる係数の値は例えば0.900であり、微分パラメータの２次モーメント（分散）に係わる係数の値は例えば0.999であり、初期学習率の値は0.005であるが、これらの数値に限るものではない。 Next, the learning unit 56 calculates new weights Us(t,j) based on the current weights Us _(t,j) and their differential parameters in each of the long short-term memory blocks 221, 231, 241, and 251 by using a gradient descent method, particularly a gradient descent method using Adam's optimization algorithm. Similarly, the learning unit 56 calculates new weights Ws _(t,j) _, _bs(t,j) , Ui _(t,j) , _{Wi(t,j), bi(t,j)} , Uf _(t,j) , Wf(t _,j) , _bf(t,j) , Uo _(t,j) _, Wo _(t,j) , _{and bo(t,j) in each of the long short-term memory blocks 221, 231, 241, and 251} . In the case of Adam's optimization algorithm, the value of the coefficient related to the first moment (average value) of the differential parameter is, for example, 0.900, the value of the coefficient related to the second moment (variance) of the differential parameter is, for example, 0.999, and the value of the initial learning rate is 0.005, but these values are not limited to these.

次に、学習部５６は、ストレージ５０に格納された学習済みパラメータ群５６を、算出した新たな重みＵ_{ｓ（ｔ，ｊ）}，Ｗ_{ｓ（ｔ，ｊ）}，ｂ_{ｓ（ｔ，ｊ）}，Ｕ_{ｉ（ｔ，ｊ）}，Ｗ_{ｉ（ｔ，ｊ）}，ｂ_{ｉ（ｔ，ｊ）}，Ｕ_{ｆ（ｔ，ｊ）}，Ｗ_{ｆ（ｔ，ｊ）}，ｂ_{ｆ（ｔ，ｊ）}，Ｕ_{ｏ（ｔ，ｊ）}，Ｗ_{ｏ（ｔ，ｊ）}，ｂ_{ｏ（ｔ，ｊ）}に更新する。 Next, the learning unit 56 updates the learned parameter set 56 stored in the storage 50 to the calculated new weights Us _(t,j) , Ws _{(t,j), bs(t,j)} _, Ui _(t,j) , _Wi(t,j) , bi _(t,j ), Uf _(t,j) , _Wf(t,j) , bf _(t,j) , _Uo(t,j) , Wo _(t,j) , bo _(t,j) .

以上が１回の学習処理であり、学習部５６が他の教師データ５２を利用して同様の処理を繰り返し実行すると、学習済みパラメータ群５６が最適化される。ここで、上述したように勾配クリッピング法が採用されているため、学習が繰り返されても、勾配消失及び勾配爆発が抑えられる。 The above is one learning process, and when the learning unit 56 repeatedly executes the same process using other teacher data 52, the learned parameter set 56 is optimized. Here, because the gradient clipping method is adopted as described above, gradient vanishing and gradient explosion are suppressed even when learning is repeated.

〔４．予測部〕
予測部１３は、ＲＮＮモデル構築部１１によって構築されたリカレントニューラルネットワーク２００を用いて、分析データ５７から変位の時系列データを予測する。以下、具体的に説明する。 [4. Prediction Section]
The prediction unit 13 predicts the time series data of the displacement from the analysis data 57 using the recurrent neural network 200 constructed by the RNN model construction unit 11. This will be specifically described below.

まず、予測部１３は、ストレージ５０から学習済みパラメータ群５６を取得して、学習済みパラメータ群５６の重みＵ_{ｓ（ｔ，ｊ）}，Ｗ_{ｓ（ｔ，ｊ）}，ｂ_{ｓ（ｔ，ｊ）}，Ｕ_{ｉ（ｔ，ｊ）}，Ｗ_{ｉ（ｔ，ｊ）}，，ｂ_{ｉ（ｔ，ｊ）}，Ｕ_{ｆ（ｔ，ｊ）}，Ｗ_{ｆ（ｔ，ｊ）}，ｂ_{ｆ（ｔ，ｊ）}，Ｕ_{ｏ（ｔ，ｊ）}，Ｗ_{ｏ（ｔ，ｊ）}，ｂ_{ｏ（ｔ，ｊ）}をリカレントニューラルネットワーク２００の各長短期記憶ブロック２２１，２３１，２４１，２５１に割り当てる。 First, the prediction unit 13 retrieves the learned parameter group 56 from the storage 50 and assigns weights Us _{(t,j), Ws(t,j)} , bs _(t,j) , Ui _(t _,j) , Wi(t,j), bi _(t,j) , Uf _(t,j) , Wf _(t, j), _bf(t,j) , Uo _(t,j) , _Wo(t,j) _, _bo(t,j) of the learned parameter group 56 to each long and short-term memory block 221, 231, 241, 251 of the recurrent neural network 200.

次に、予測部１３は、ストレージ５０から分析データ５７を取得して、分析データ５７の荷重及び速度の時系列データ５８、５９をリカレントニューラルネットワーク２００の入力層２１０に入力する。これにより、予測部５６は、リカレントニューラルネットワーク２００の隠れ層２２０，２３０，２４０，２５０、全結合層２６０及びドロップアウト層２７０を活性化させることによって出力層２８０の各出力ユニット２８１における出力データｚ_ｔを算出して、出力データｚ_ｔを時系列で配列した予測変位の時系列データを取得する。この時系列データは、予測した変位の値が時系列で配列された離散的なデータ列である。 Next, the prediction unit 13 acquires the analysis data 57 from the storage 50, and inputs the time series data 58, 59 of the load and speed of the analysis data 57 to the input layer 210 of the recurrent neural network 200. As a result, the prediction unit 56 activates the hidden layers 220, 230, 240, 250, the fully connected layer 260, and the dropout layer 270 of the recurrent neural network 200 to calculate the output data _zt in each output unit 281 of the output layer 280, and acquires time series data of predicted displacement in which the output data _zt is arranged in time series. This time series data is a discrete data string in which the predicted displacement values are arranged in time series.

次に、予測部１３は、算出した予測変位の時系列データを予測結果の時系列データ６０としてストレージ５０に記録する。 Next, the prediction unit 13 records the time series data of the calculated predicted displacement in the storage 50 as time series data 60 of the prediction result.

また、予測部１３は、予測変位の時系列データからチャートを生成し、そのチャートを画面に含む映像信号を表示デバイス３０に出力する。そうすると、そのチャートが表示デバイス３０に表示される。チャートは、例えば、時間を横軸で表し、予測変位を縦軸で表し、予測変位の値が時系列でプロットされたものである。 The prediction unit 13 also generates a chart from the time series data of the predicted displacement, and outputs a video signal including the chart on the screen to the display device 30. The chart is then displayed on the display device 30. The chart, for example, represents time on the horizontal axis and predicted displacement on the vertical axis, with the values of the predicted displacement plotted in time series.

また、予測部１３は、予測変位の時系列データの各値と、荷重の時系列データ５８の各値とを同一時刻で対応付けして、予測変位の時系列データと荷重の時系列データ５８との対応関係を表したチャートを生成し、そのチャートを画面に含む映像信号を表示デバイス３０に出力する。そうすると、そのチャートが表示デバイス３０に表示される。チャートは、例えば、予測変位を横軸で表し、荷重を縦軸で表し、対応付けられた荷重の値と予測変位の値とがプロットされたものである。 The prediction unit 13 also associates each value of the time series data of the predicted displacement with each value of the time series data of the load 58 at the same time, generates a chart showing the correspondence between the time series data of the predicted displacement and the time series data of the load 58, and outputs a video signal including the chart on the screen to the display device 30. The chart is then displayed on the display device 30. The chart, for example, shows the predicted displacement on the horizontal axis and the load on the vertical axis, with the associated load values and predicted displacement values plotted.

また、予測部１３は、予測変位の時系列データの各値と、速度の時系列データ５８の各値とを同一時刻で対応付けして、予測変位の時系列データと速度の時系列データ５８との対応関係を表したチャートを生成し、そのチャートを画面に含む映像信号を表示デバイス３０に出力する。そうすると、そのチャートが表示デバイス３０に表示される。チャートは、例えば、予測変位を横軸で表し、速度を縦軸で表し、対応付けられた速度の値と予測変位の値とがプロットされたものである。 The prediction unit 13 also associates each value of the time series data of the predicted displacement with each value of the time series data of the speed 58 at the same time, generates a chart showing the correspondence between the time series data of the predicted displacement and the time series data of the speed 58, and outputs a video signal including the chart on the screen to the display device 30. The chart is then displayed on the display device 30. For example, the chart shows the predicted displacement on the horizontal axis and the speed on the vertical axis, with the associated speed values and predicted displacement values plotted.

〔５．有利な効果〕
減衰装置９０を数式でモデル化しなくても、学習済みのリカレントニューラルネットワーク２００を用いて荷重及び速度の時系列データ５８，５９から変位の時系列データを算出することができる。つまり、時間の経過に伴って変化する荷重及び速度が減衰装置９０に与えられた場合の応答を変位の時系列データとして算出することができる。特に日常に生じる小さな荷重であっても、また終局に至る大きな荷重であっても、応答変位を算出することができる。 5. Advantageous Effects
Even without modeling the damping device 90 with a mathematical formula, it is possible to calculate the time series data of the displacement from the time series data 58, 59 of the load and the velocity by using the trained recurrent neural network 200. In other words, it is possible to calculate the response when the load and the velocity that change with time are applied to the damping device 90 as the time series data of the displacement. In particular, it is possible to calculate the response displacement even for small loads that occur in daily life and large loads that reach an ultimate state.

リカレントニューラルネットワーク２００を用いたため、時系列の回帰型のモデル化が可能となる。つまり、隠れ層２１０，２３０，２４０，２５０の各々では、或るステップにおけるニューロンの入力が前のステップのニューロンの出力に結合されているため、前のステップの計算結果が次のステップの計算に回帰されて影響を及ぼすため、リカレントニューラルネットワーク２００は時系列データの算出に適している。 The use of recurrent neural network 200 enables regression modeling of time series. In other words, in each of hidden layers 210, 230, 240, and 250, the input of a neuron at a certain step is connected to the output of a neuron at the previous step, so that the calculation result of the previous step regresses and affects the calculation of the next step, making recurrent neural network 200 suitable for calculating time series data.

リカレントニューラルネットワーク２００が長短期記憶（Long short-term memory：LSTM）のアーキテクチャを利用したものであるため、時系列における長期にわたる記憶が実現され、勾配消失が生じにくい。従って、長短期記憶のアーキテクチャを利用したリカレントニューラルネットワーク２００は、地震等のように大小様々な荷重が長期にわたって励起するような場合の応答解析に適している。 Because the recurrent neural network 200 uses a long short-term memory (LSTM) architecture, it realizes long-term memory in time series and is less susceptible to gradient vanishing. Therefore, the recurrent neural network 200 using the long short-term memory architecture is suitable for response analysis in cases where various loads of different sizes are excited over a long period of time, such as an earthquake.

また、リカレントニューラルネットワーク２００は複数の隠れ層２１０，２３０，２４０，２５０を有するため、予測精度が向上するが、複数の隠れ層２１０，２３０，２４０，２５０は勾配消失等の要因になり得る。そのような問題点は長短期記憶のアーキテクチャによって解消される。 In addition, because the recurrent neural network 200 has multiple hidden layers 210, 230, 240, and 250, prediction accuracy is improved, but the multiple hidden layers 210, 230, 240, and 250 can cause gradient vanishing and other problems. Such problems are solved by the long short-term memory architecture.

〔６．変形例〕
上記実施形態では、分析データ５７が荷重の時系列データ５８と速度の時系列データ５９からなり、予測時には、荷重の時系列データ５８と速度の時系列データ５９がリカレントニューラルネットワーク２００の入力層２１０に入力され、変位の時系列データが出力層２８０に出力される。それに対して、分析データ５７が荷重の時系列データ５８と変位の時系列データからなり、荷重の時系列データ５８と変位の時系列データがリカレントニューラルネットワーク２００の入力層２１０に入力され、速度の時系列データが出力層２８０に出力されてもよい。この場合、教師データ５２における正解は速度の時系列データ５４であり、学習時には、荷重及び変位の時系列データ５３、５５がリカレントニューラルネットワーク２００の入力層２１０に入力され、教師データ５２の速度時系列データ５４における速度データと出力データｚ_ｔとの損失が算出される。 6. Modifications
In the above embodiment, the analysis data 57 is composed of the time series data 58 of the load and the time series data 59 of the speed, and at the time of prediction, the time series data 58 of the load and the time series data 59 of the speed are input to the input layer 210 of the recurrent neural network 200, and the time series data of the displacement is output to the output layer 280. On the other hand, the analysis data 57 may be composed of the time series data 58 of the load and the time series data of the displacement, and the time series data 58 of the load and the time series data of the displacement are input to the input layer 210 of the recurrent neural network 200, and the time series data of the speed is output to the output layer 280. In this case, the correct answer in the teacher data 52 is the time series data 54 of the speed, and at the time of learning, the time series data 53 and 55 of the load and the displacement are input to the input layer 210 of the recurrent neural network 200, and the loss between the speed data in the speed time series data 54 of the teacher data 52 and the output data _zt is calculated.

また、分析データ５７が速度の時系列データ５９と変位の時系列データからなり、速度の時系列データ５９と変位の時系列データがリカレントニューラルネットワーク２００の入力層２１０に入力され、荷重の時系列データが出力層２８０に出力されてもよい。この場合、教師データ５２における正解は荷重の時系列データ５３であり、学習時には、速度及び変位の時系列データ５４、５５がリカレントニューラルネットワーク２００の入力層２１０に入力され、教師データ５２の荷重時系列データ５３における荷重データと出力データｚ_ｔとの損失が算出される。 Alternatively, the analysis data 57 may be composed of time series data 59 of velocity and time series data of displacement, and the time series data 59 of velocity and the time series data of displacement may be input to the input layer 210 of the recurrent neural network 200, and the time series data of load may be output to the output layer 280. In this case, the correct answer in the teacher data 52 is the time series data 53 of load, and during learning, the time series data 54 and 55 of velocity and displacement are input to the input layer 210 of the recurrent neural network 200, and the loss between the load data in the load time series data 53 of the teacher data 52 and the output data _zt is calculated.

上記実施形態では、力学モデルとして減衰装置９０の例を挙げ、減衰装置９０についてのシミュレーション又は実験により求められた時系列データ５３、５４，５５が教師データ５２として利用される。力学モデルは減衰装置９０に限らず、土木建築構造物（例えば、建物、橋梁、梁、柱等）又は機械構造物（例えば、自動車の車体、弾性装置、ばね要素等）であり、土木建築構造物又は機械構造物についてのシミュレーション又は実験により求められた速度、荷重及び変位の時系列データが教師データ５２として利用されてもよい。 In the above embodiment, a damping device 90 is given as an example of a mechanical model, and time series data 53, 54, and 55 obtained by simulation or experiment for the damping device 90 are used as teacher data 52. The mechanical model is not limited to the damping device 90, but may be a civil engineering or architectural structure (e.g., a building, a bridge, a beam, a column, etc.) or a mechanical structure (e.g., an automobile body, an elastic device, a spring element, etc.), and time series data of the speed, load, and displacement obtained by simulation or experiment for the civil engineering or architectural structure or the mechanical structure may be used as teacher data 52.

１０…演算処理装置
１１…ＲＮＮモデル構築部
１２…学習部
１３…予測部
５１…プログラム
５２…教師データ
５７…分析データ
２００…リカレントニューラルネットワーク
２１０…入力層
２２０，２３０，２４０，２５０…隠れ層
２６０…全結合層
２７０…ドロップアウト層
２８０…出力層
２２１，２３１，２４１，２５１…長短期記憶ブロック 10: Processing device 11: RNN model construction unit 12: Learning unit 13: Prediction unit 51: Program 52: Teacher data 57: Analysis data 200: Recurrent neural network 210: Input layer 220, 230, 240, 250: Hidden layer 260: Fully connected layer 270: Dropout layer 280: Output layer 221, 231, 241, 251: Long short-term memory block

Claims

A prediction device for predicting output time series data from any two of input time series data of load, velocity, and displacement in a mechanical model of a civil engineering structure, comprising:
An RNN construction unit that constructs a recurrent neural network having an input layer, a hidden layer, and an output layer in this order from the input to the output;
a prediction unit that inputs the two input time series data to the input layer, assigns a learned parameter group consisting of a plurality of weights obtained by prior learning to the hidden layer, and activates the hidden layer to calculate the output time series data in the output layer ,
The RNN construction unit is a prediction device that constructs a layer in which a plurality of long-term short-term memory blocks are connected in a time series as the hidden layer .

The prediction device according to claim 1 , wherein the RNN construction unit constructs the hidden layer into a plurality of layers.

A learning device for learning two input time series data of time series data of load, velocity, and displacement in a mechanical model of a civil engineering and architectural structure and another correct time series data,
An RNN construction unit that constructs a recurrent neural network having an input layer, a hidden layer, and an output layer in this order from the input to the output;
a learning unit that inputs the two input time series data to the input layer, assigns a learned parameter group consisting of a plurality of weights to the hidden layer and activates the hidden layer to calculate output time series data in the output layer, calculates losses of the output time series data and the ground truth time series data, calculates differential parameters by differentiating the losses with the plurality of weights by a backpropagation process, and updates the plurality of weights based on the differential parameters and the plurality of weights by a gradient descent method ,
The RNN construction unit is a learning device that constructs a layer in which a plurality of long-term short-term memory blocks are connected in a time series as the hidden layer .

The learning device according to claim 3 , wherein the RNN construction unit constructs the hidden layer in a plurality of layers.

A calculation processing device of a prediction device for predicting output time series data from any two of input time series data of load, velocity, and displacement in a mechanical model of a civil engineering and architectural structure, comprising :
an RNN constructing unit that constructs a recurrent neural network having an input layer, a hidden layer, and an output layer in this order from input to output , and constructs a layer in which a plurality of long short-term memory blocks are connected in time series as the hidden layer ;
a prediction unit that inputs the two input time series data to the input layer, assigns a learned parameter group consisting of a plurality of weights obtained by prior learning to the hidden layer, and activates the hidden layer to calculate the output time series data in the output layer.

A processing unit of a learning device that learns two input time series data of time series data of load, velocity, and displacement in a mechanical model that models a civil engineering and architectural structure and another correct time series data,
an RNN constructing unit that constructs a recurrent neural network having an input layer, a hidden layer, and an output layer in this order from input to output , and constructs a layer in which a plurality of long short-term memory blocks are connected in time series as the hidden layer ;
a learning unit that inputs the two input time series data to the input layer, calculates output time series data in the output layer by assigning a learned parameter group consisting of a plurality of weights to the hidden layer and activating the hidden layer, calculates losses of the output time series data and the ground truth time series data, calculates differential parameters by differentiating the losses with the plurality of weights by backpropagation processing, and updates the plurality of weights based on the differential parameters and the plurality of weights by gradient descent processing.