JP7614020B2

JP7614020B2 - PLANT CONTROL SYSTEM, ROLLING MILL CONTROL DEVICE, PLANT CONTROL METHOD, AND PLANT CONTROL PROGRAM

Info

Publication number: JP7614020B2
Application number: JP2021091321A
Authority: JP
Inventors: 正剛綿島; 敬規高田; 哲服部; 佑樹田内; 大輝黒川; 隆阿部
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2021-05-31
Filing date: 2021-05-31
Publication date: 2025-01-15
Anticipated expiration: 2041-05-31
Also published as: JP2022183827A

Description

本発明は、ニューラルネット等の人工知能技術を用いて行う実時間のフィードバック制御を行うと共に、人工知能の学習の自動更新機能を有するプラント制御システム、圧延機制御装置、プラント制御方法、及びプラント制御プログラムに関する。 The present invention relates to a plant control system, a rolling mill control device, a plant control method, and a plant control program that perform real-time feedback control using artificial intelligence technology such as neural networks and have an automatic update function for the learning of the artificial intelligence.

従来から、各種のプラントにおいてはその制御により所望の制御結果を得るために各種制御理論に基づいたプラント制御が実施されている。 Traditionally, in various plants, plant control based on various control theories has been implemented to obtain the desired control results.

プラントの一例として例えば圧延機制御においては、制御の一例として板の波打ち状態を制御する形状制御を対象とした制御理論として、ファジィ制御やニューロ・ファジィ制御が適用されてきた。ファジィ制御は、クーラントを利用した形状制御に、また、ニューロ・ファジィ制御は、ゼンジミア圧延機の形状制御に適用されている。このうちニューロ・ファジィ制御を適用した形状制御は、特許文献１に示されるように、形状検出器で検出された実績形状パターンと目標形状パターンの差と、予め設定された基準形状パターンとの類似割合を求め、その類似割合からこれも予め設定された基準形状パターンに対する制御操作端操作量によって表現された制御ルールにより、操作端に対する制御出力量を求めることにより行われている。以下、従来技術として、ニューロ・ファジィ制御を用いたゼンジミア圧延機の形状制御を用いるものとする。 In the control of a rolling mill as an example of a plant, fuzzy control and neuro-fuzzy control have been applied as control theories for shape control, which controls the waviness of a plate as an example of control. Fuzzy control is applied to shape control using coolant, and neuro-fuzzy control is applied to shape control of a Sendzimir rolling mill. As shown in Patent Document 1, shape control using neuro-fuzzy control is performed by determining the similarity ratio between the difference between the actual shape pattern detected by a shape detector and the target shape pattern, and a preset reference shape pattern, and then determining the control output amount for the control terminal from the similarity ratio according to a control rule expressed by the control operation amount for the reference shape pattern, which is also set in advance. Hereinafter, shape control of a Sendzimir rolling mill using neuro-fuzzy control will be used as the conventional technology.

また制御装置の管理装置としては、特許文献３に示されるように油圧シリンダーのピストン位置を制御する油圧圧下装置と、複数の前記油圧圧下制御装置の管理装置とを含む油圧圧下制御装置の管理システムが知られている。 As another example of a control device management system, a hydraulic roll-down control device management system is known that includes a hydraulic roll-down device that controls the piston position of a hydraulic cylinder, as shown in Patent Document 3, and a management device for a plurality of the hydraulic roll-down control devices.

この管理システムでは、油圧圧下制御装置の制御モデルのパラメータの更新タイミングを判断し、制御モデルから油圧圧下制御装置への指令値とその実測値によってパラメータを調整する機能がある。 This management system has the function of determining the timing for updating the parameters of the control model of the hydraulic roll-down control device, and adjusting the parameters based on the command value from the control model to the hydraulic roll-down control device and its actual measured value.

図５に、特許文献１の図１に記述されたゼンジミア圧延機の形状制御を示す。ゼンジミア圧延機の形状制御では、ニューロ・ファジィ制御が用いられる。この例では、パターン認識機構５１で、形状検出器５２にて検出した実形状より形状のパターン認識を行い、実形状が予め設定された基準形状パターンのどれに最も近いかを演算する。制御演算機構５３では、図６で示すような予め設定された形状パターンに対する制御操作端操作量で構成される制御ルールを用いて制御を実施する。図６についてより具体的に述べると、パターン認識機構５１では、形状検出器５２にて検出した形状実績と目標形状（εｒｅｆ）との差分（Δε）が、１から８の形状パターン（ε）のどれに最も近いかを演算し、制御演算機構５３では、１から８の制御方法のいずれかを選択し実行する。 Figure 5 shows the shape control of the Sendzimir rolling mill described in Figure 1 of Patent Document 1. Neuro-fuzzy control is used in the shape control of the Sendzimir rolling mill. In this example, the pattern recognition mechanism 51 recognizes the shape pattern from the actual shape detected by the shape detector 52, and calculates which of the preset reference shape patterns the actual shape is closest to. The control calculation mechanism 53 performs control using a control rule consisting of the control operation terminal operation amount for the preset shape pattern as shown in Figure 6. More specifically, in Figure 6, the pattern recognition mechanism 51 calculates which of the shape patterns (ε) from 1 to 8 the difference (Δε) between the shape actual result detected by the shape detector 52 and the target shape (εref) is closest to, and the control calculation mechanism 53 selects and executes one of the control methods from 1 to 8.

ところが特許文献１の手法では、制御ルールの検証のために、圧延中にオペレータに手動操作を行ってもらい制御ルールの検証等行う場合が有るが、予想に反した形状変化を示す場合がある。つまり、上記の様にして決定した制御ルールが現実に則していない場合が発生する。これは、機械的特性の検討不足や圧延機の操業状態や機械条件の変化が原因であるが、予め設定した制御ルールが最も良いルールかどうかを１つ１つ検証するのは、考慮すべき条件が多く困難である。そのため、制御ルールを一度設定してしまうと、不具合が無い限りそのままとしてしまう場合が多い。 However, in the method of Patent Document 1, in order to verify the control rules, an operator may be asked to perform manual operations during rolling, but this may result in unexpected changes in shape. In other words, there are cases where the control rules determined in the above manner do not match reality. This is caused by insufficient consideration of mechanical properties or changes in the operating state and machine conditions of the rolling mill, but verifying each and every preset control rule to see if it is the best rule is difficult, as there are many conditions that must be considered. For this reason, once a control rule has been set, it is often left as it is unless there is a problem.

操業条件の変化等で、制御ルールが現実に則したものでなくなってくると、制御ルールが固定されているため、ある程度以上の制御精度を出すことは困難となってくる。また、一旦形状制御が動作してしまうと、オペレータは手動操作をしなくなる（制御にとって外乱となってしまう）ため、新たな制御ルールをオペレータの手動介入により見つけていくのも困難である。さらに、新しい規格の圧延材を圧延する場合も制御ルールをその材料にあわせて設定するのは困難である。 When the control rules no longer correspond to reality due to changes in operating conditions, etc., it becomes difficult to achieve a certain level of control precision because the control rules are fixed. Also, once shape control is in operation, the operator will no longer perform manual operations (which will become a disturbance to the control), so it is difficult to find new control rules through manual intervention by the operator. Furthermore, when rolling new standards of rolled material, it is difficult to set control rules to suit that material.

以上のように、従来の形状制御においては、予め設定された制御ルールを用いて制御するため、制御ルールを修正するのが困難であるという問題が有った。 As described above, conventional shape control uses preset control rules, which makes it difficult to modify the control rules.

この問題を解決するために、特許文献２に示すような、形状制御を行いながら制御ルールをランダムに変化させ、形状が良くなるルールを学習して行くことで、
１）圧延中に形状制御を実施しながら新たな制御ルールを発見していく。
２）新たな制御ルールは、予め予想できるものでは無く、全く予測できなかった制御ルールが最適となる場合も有る事から、ランダムに制御操作端を動作させ、それに対する制御結果を見ながら見つけていく。
ことを実現している。 In order to solve this problem, as shown in Patent Document 2, the control rules are randomly changed while performing shape control, and the rules that improve the shape are learned.
1) Discover new control rules while controlling shape during rolling.
2) Since new control rules cannot be predicted in advance, and a completely unpredictable control rule may turn out to be optimal, the optimal control rule is found by randomly operating the control operation terminals and observing the control results.
This has been achieved.

特許第２８０４１６１号明細書Patent No. 2804161 specification 特許第４００３７３３号明細書Patent No. 4003733 specification 特許第５３６３３８０号明細書Patent No. 5363380 specification

上記従来技術は、予め代表的な形状を基準形状パターンとして設定し、基準波形パターンに対する制御操作端操作量との関係を示す制御ルールを基に制御を行っている。制御ルールの学習についても、基準波形パターンに対する制御操作端操作量に関するものであり、予め定めている代表的な基準形状パターンはそのまま用いている。そのため、特定の形状パターンにしか反応しない形状制御となってしまう問題がある。 In the above conventional technology, a representative shape is set in advance as a reference shape pattern, and control is performed based on a control rule that indicates the relationship between the control operation terminal operation amount and the reference waveform pattern. The learning of the control rule also relates to the control operation terminal operation amount for the reference waveform pattern, and the predetermined representative reference shape pattern is used as is. This results in a problem of shape control that only responds to specific shape patterns.

基準形状パターンは、人間が予め対象となる圧延機に関する知識や、形状実績と手動介入操作を蓄積した経験より定めたものであるが、対象となる圧延機および被圧延材で発生する全ての形状を網羅する事は困難である。そのため、基準形状パターンとは異なる形状が発生した場合、形状制御による制御が実施されず、形状偏差が抑制されずに残ってしまい、あるいは似たような基準形状パターンと誤認識し、誤った制御操作を行って、逆に形状を悪化させてしまう場合も有る。 The reference shape pattern is determined by humans based on their knowledge of the target rolling mill and their accumulated experience with previous shape records and manual intervention operations, but it is difficult to cover all shapes that can occur in the target rolling mill and the material being rolled. As a result, if a shape different from the reference shape pattern occurs, shape control is not implemented and the shape deviation remains unsuppressed, or it may be mistaken for a similar reference shape pattern and incorrect control operations are performed, which may actually worsen the shape.

以上のように、従来の形状制御においては、予め設定された基準形状パターンとそれに対する制御ルールを用いて制御ルールの学習をし、制御を実施するため、制御精度の向上に限界があるという問題が有った。 As described above, conventional shape control involves learning control rules using preset reference shape patterns and the corresponding control rules, and then carrying out control, which has the problem of limiting the improvement of control accuracy.

また、Deep Learning適用形状制御で用いる制御モデルは、更新直後において高い制御効果を発揮できるものの、一定期間が経過し、プラント環境の経年変化や操業状況の変化などが発生すると、プラントへの適合度が低下し、制御効果を十分に発揮できなくなる。そのため制御モデルをプラントの状態によって逐次最適化しなければならないが、時々刻々と生成される圧延実績データを用いて、教師データの作成や制御モデル構築処理をリアルタイムで実行することは計算機負荷が大きい為、常に制御モデルを更新し続けることは困難である。 In addition, although the control model used in deep learning-based shape control can provide a high level of control immediately after an update, after a certain period of time, as the plant environment ages and operational conditions change, its suitability to the plant decreases and it is no longer able to provide sufficient control effectiveness. For this reason, the control model must be optimized successively depending on the plant's condition, but creating training data and building the control model in real time using rolling performance data that is generated every moment places a heavy burden on the computer, making it difficult to keep updating the control model.

以上のことから本発明においては、制御対象プラントの制御を実施するプラント制御システムは、前記制御対象プラントの実績データと制御操作の組合せに基づいて制御ルールを学習する制御方法学習装置と、前記制御方法学習装置によって学習された前記制御ルールに基づいて前記制御対象プラントの制御を実施する制御実行装置と、前記制御ルールに基づいて前記制御対象プラントの制御を実施した際の前記実績データに基づいて前記制御対象プラントに対する該制御ルールの適合度を演算し、該適合度に基づいて該制御ルールを更新する制御ルール更新判断装置とを備えたことを特徴とする。 In view of the above, in the present invention, a plant control system that controls a controlled plant is characterized by comprising a control method learning device that learns control rules based on a combination of performance data and control operations of the controlled plant, a control execution device that controls the controlled plant based on the control rules learned by the control method learning device, and a control rule update determination device that calculates the suitability of the control rules for the controlled plant based on the performance data when control of the controlled plant is performed based on the control rules, and updates the control rules based on the suitability.

本発明を用いることにより、制御中に形状制御で使用する、形状パターンと操作方法の制御ルールを自動的に修正し最適なものとすることが可能となる。そのため、制御精度の向上、制御部の立上げ期間の短縮、経年変化に対する対応が可能となる等の効果が有る。 By using this invention, it is possible to automatically correct and optimize the control rules for the shape patterns and operation methods used in shape control during control. This has the effect of improving control accuracy, shortening the start-up period of the control unit, and making it possible to respond to changes over time.

さらに本発明によると、制御ルールと実績データの適合度を評価し、制御ルールの更新タイミングを判定し、自動で再学習することにより、適切な頻度で制御モデルを更新することで計算機負荷を低減し、制御ルールの性能維持を実現する。 Furthermore, according to the present invention, the degree of compatibility between the control rules and actual data is evaluated, the timing for updating the control rules is determined, and automatic re-learning is performed, thereby updating the control model at an appropriate frequency, reducing the load on the computer and maintaining the performance of the control rules.

本発明の実施例に係るプラント制御システムの概要を示す図。1 is a diagram showing an overview of a plant control system according to an embodiment of the present invention. 本発明の実施例に係る制御ルール実行部１０の具体的な構成事例を示す図。FIG. 2 is a diagram showing a specific configuration example of the control rule enforcement unit 10 according to the embodiment of the present invention. 本発明の実施例に係る制御ルール学習部１１の具体的な構成事例を示す図。FIG. 2 is a diagram showing a specific configuration example of a control rule learning unit 11 according to an embodiment of the present invention. 本発明をゼンジミア圧延機の形状制御に用いる場合のニューラルネット構成を示す図。FIG. 13 is a diagram showing a neural network configuration when the present invention is used for shape control of a Sendzimir rolling mill. 特許文献１の図１に記述されたゼンジミア圧延機の形状制御を示す図。FIG. 2 is a diagram showing shape control of the Sendzimir rolling mill described in FIG. 1 of Patent Document 1. 特許文献１の図１に記述されたゼンジミア圧延機の形状制御における制御ルールを示す図。FIG. 2 is a diagram showing a control rule for shape control of the Sendzimir rolling mill described in FIG. 1 of Patent Document 1. 制御入力データ作成部２の概要を示す図。FIG. 2 is a diagram showing an overview of a control input data generating unit 2. 制御出力演算部３の概要を示す図。FIG. 2 is a diagram showing an overview of a control output calculation unit 3. 制御出力判定部５の概要を示す図。FIG. 2 is a diagram showing an overview of a control output determination unit 5. 形状偏差と制御方法について示す図。13A and 13B are diagrams showing shape deviation and a control method. 制御結果良否判定部６の概要を示す図。FIG. 2 is a diagram showing an overview of a control result quality determination unit 6. 制御出力演算部３における各部データや記号の関係を整理して示す図。FIG. 2 is a diagram showing the relationship between data and symbols in the control output calculation unit 3. 学習データ作成部７における処理段階と処理内容を示す図。3A to 3C are diagrams showing processing stages and processing contents in a learning data creation unit 7. 学習データデータベースＤＢ２に保存されたデータ例を示す図。FIG. 4 is a diagram showing an example of data stored in a learning data database DB2. データベース管理テーブルＴＢの例を示す図。FIG. 4 is a diagram showing an example of a database management table TB. 学習データデータベースＤＢ２の例を示す図。FIG. 4 shows an example of a learning data database DB2. 制御ルール適合度評価部２５の概要を示す図。FIG. 2 is a diagram showing an overview of a control rule conformity evaluation unit 25. 制御ルール更新評価部２６の概要を示す図。FIG. 2 is a diagram showing an overview of a control rule update evaluation unit 26. 制御ルール更新処理管理部２４の概要を示す図。FIG. 2 is a diagram showing an overview of a control rule update processing management unit 24. 制御ルール評価値データベースＤＢ５の概要を示す図。FIG. 4 is a diagram showing an overview of a control rule evaluation value database DB5. コンピュータ５００のハードウェアの概要を示す図。FIG. 5 is a diagram showing an outline of the hardware of a computer 500.

以下本発明の実施例について、図面を用いて詳細に説明するが、その前に本発明における知見、並びに本発明に至る経緯について圧延機の形状制御を例にして説明をしておく。 Below, we will explain the embodiments of the present invention in detail using the drawings. Before that, we will explain the findings of the present invention and how it came about, using shape control of a rolling mill as an example.

まず、本発明における上記課題を解決するためには、
１）基準形状パターンと、それに対する制御操作を予め別々に設定し、制御操作方法を学習していくのではなく、形状パターンと制御操作の組合せを学習し、それを用いて制御操作を実施する。
２）新たな制御ルールは、予め予想できるものでは無く、全く予測できなかった制御ルールが最適となる場合も有る事から、ランダムに制御操作端を動作させ、それに対する制御結果を見ながら見つけていく。
ことが必要となる。 First, in order to solve the above problems in the present invention,
1) A reference shape pattern and a control operation for it are separately set in advance, and a control operation method is not learned. Instead, a combination of a shape pattern and a control operation is learned and used to carry out the control operation.
2) Since new control rules cannot be predicted in advance, and a completely unpredictable control rule may turn out to be optimal, the optimal control rule is found by randomly operating the control operation terminals and observing the control results.
It becomes necessary to do so.

これを実現するためには、形状制御に使用する形状パターンと制御操作の組合せを変化させて、制御結果が良くなるように制御操作を変更していく必要がある。そのためには、形状パターンと制御操作の組合せを学習可能なニューラルネットを構成し、圧延機で発生した形状パターンに対する、ニューラルネットの制御操作の出力を、制御結果の良否に応じて変更していく事が必要である。 To achieve this, it is necessary to change the combination of shape patterns and control operations used for shape control, and modify the control operations to improve the control results. To do this, it is necessary to construct a neural network that can learn combinations of shape patterns and control operations, and to change the output of the neural network's control operations for the shape patterns generated by the rolling mill depending on whether the control results are good or bad.

上記を、操業中の圧延機に対して形状制御を実施しながら、実施すると、誤った制御出力を出す場合もあることから、形状が悪化し、板破断等の操業異常が発生する事がある。板破断が発生すると、圧延機で使用するロールの交換に時間を要したり、圧延中の被圧延材が無駄になったりと、ダメージが大きい。そのため、可能な限り誤った制御出力を圧延機に対して出力しないようにする事が必要である。 If the above is carried out while performing shape control on an operating rolling mill, erroneous control output may be output, causing deterioration of the shape and operational abnormalities such as plate breakage. If plate breakage occurs, it will take time to replace the rolls used in the rolling mill and the material being rolled will be wasted, causing great damage. For this reason, it is necessary to avoid outputting erroneous control output to the rolling mill as much as possible.

以上のことから本発明においては、これを実現するため、ニューラルネットが出力した制御操作の良否を、例えば圧延機の簡易モデル等を用いて検証し、明らかに形状が悪化すると考えられる出力は、圧延機の制御操作端に対して出力しないようにし、形状悪化を防止する。この時、ニューラルネットに関しては、その形状パターンに対する制御操作は誤りであるとして学習を実施する。 In order to achieve this, the present invention verifies the quality of the control operations output by the neural network, for example, using a simple model of a rolling mill, and prevents shape deterioration by not outputting any output that is thought to clearly deteriorate the shape to the control operation terminal of the rolling mill. At this time, the neural network is trained by assuming that the control operation for that shape pattern is incorrect.

制御操作の良否の検証方法自体が誤っている可能性が有るため、ある確率で誤っていると判断されたニューラルネットの制御操作量出力についても、圧延機の制御操作端に出力することで、想定外の形状パターンと制御操作の組合せについても学習していく事が可能となる。 Because there is a possibility that the method for verifying whether the control operation is correct or not may itself be incorrect, even if the neural network's control operation output is judged to be incorrect with a certain probability, by outputting it to the control operation terminal of the rolling mill, it becomes possible to learn about combinations of unexpected shape patterns and control operations.

また経年変化によるプラントの環境変化や操業条件の変化により制御ルールがプラントに対して最適でなくなった場合、計算機負荷を監視し、適切なタイミングで制御ルールをプラントの状態によって逐次最適化することで、制御性能の低下を回避することが可能になる。 In addition, if the control rules are no longer optimal for the plant due to changes in the plant's environment or operating conditions over time, it is possible to avoid a deterioration in control performance by monitoring the computer load and successively optimizing the control rules at the appropriate time based on the plant's condition.

プラントに対する制御ルールの適合度は、プラントの操業中に逐次作成される実績データに含まれる形状をニューラルネットに入力することにより出力された値と、実績データに含まれる制御出力との誤差を元に評価することが可能になる。 The suitability of the control rules for the plant can be evaluated based on the error between the value output by inputting the shapes contained in the actual data generated sequentially during plant operation into a neural network and the control output contained in the actual data.

図１に、本発明の実施例に係るプラント制御システムの概要を示す。図１のプラント制御システムは、制御対象プラント１と、制御対象プラント１からの実績データＳｉを入力して図６に例示したような制御ルール（ニューラルネット）に従い定めた制御操作量出力ＳＯを制御対象プラント１に与えて制御する制御実行装置２０と、制御対象プラント１からの実績データＳｉなどを入力して学習を行い、学習した制御ルールを制御実行装置２０における制御ルールに反映させる制御方法学習装置２１と、プラントの操業中に逐次作成される実績データＳｉと制御ルールの適合度を評価し、適切なタイミングで制御ルール学習指示を制御ルール学習部１１へ与える制御ルール更新判断装置２２と、複数のデータベースＤＢ（ＤＢ１からＤＢ５）、並びにデータベースＤＢのデータベース管理テーブルＴＢから構成されている。 Figure 1 shows an overview of a plant control system according to an embodiment of the present invention. The plant control system in Figure 1 is composed of a controlled plant 1, a control execution device 20 that inputs performance data Si from the controlled plant 1 and controls the controlled plant 1 by providing a control operation output SO determined according to a control rule (neural network) such as that shown in Figure 6, a control method learning device 21 that inputs performance data Si from the controlled plant 1 and performs learning and reflects the learned control rule in the control execution device 20, a control rule update judgment device 22 that evaluates the compatibility of the performance data Si created sequentially during plant operation with the control rule and provides a control rule learning instruction to the control rule learning unit 11 at an appropriate timing, multiple databases DB (DB1 to DB5), and a database management table TB for the database DB.

制御実行装置２０は、制御入力データ作成部２、制御ルール実行部１０、制御出力演算部３、制御出力抑制部４、制御出力判定部５、制御操作外乱発生部１６を主たる要素として構成されている。 The control execution device 20 is composed of the following main elements: a control input data creation unit 2, a control rule execution unit 10, a control output calculation unit 3, a control output suppression unit 4, a control output determination unit 5, and a control operation disturbance generation unit 16.

このうち制御実行装置２０においては、まず制御対象プラント１である圧延機の実績データＳｉより、制御入力データ作成部２を用いて、制御ルール実行部１０の入力データＳ１を作成する。制御ルール実行部１０は、制御対象の実績データＳｉと制御操作端操作指令Ｓ２の関係を表現するニューラルネット（制御ルール）を用いて、制御対象の実績データＳｉから制御操作端操作指令Ｓ２を作成する。制御出力演算部３においては、制御操作端操作指令Ｓ２をもとに、制御操作端への制御操作量Ｓ３を演算する。これにより、制御対象プラント１の実績データＳｉに応じて、ニューラルネットを用いて制御操作量Ｓ３を作成する。 In the control execution device 20, first, input data S1 for the control rule execution unit 10 is created using the control input data creation unit 2 from the performance data Si of the rolling mill, which is the controlled plant 1. The control rule execution unit 10 creates a control operation terminal operation command S2 from the controlled plant's performance data Si using a neural network (control rule) that expresses the relationship between the controlled plant's performance data Si and the control operation terminal operation command S2. The control output calculation unit 3 calculates the control operation amount S3 for the control operation terminal based on the control operation terminal operation command S2. As a result, the control operation amount S3 is created using the neural network according to the performance data Si of the controlled plant 1.

また制御実行装置２０内の制御出力判定部５においては、制御対象プラント１からの実績データＳｉおよび制御出力演算部３からの制御操作量Ｓ３を用いて、制御操作端への制御操作量出力可否データＳ４を決定する。制御出力抑制部４においては、制御操作量出力可否データＳ４に応じて制御操作端への制御操作量Ｓ３の出力可否を決定し、可とされた制御操作量Ｓ３を、制御対象プラント１に与える制御操作量出力ＳＯとして出力する。これにより、異常と判断される制御操作量Ｓ３は、制御対象プラント１に出力されなくなる。なお制御操作外乱発生部１６は、プラント制御システムを検証する目的のために、外乱を生成し、制御対象プラント１に与えるものである。 The control output determination unit 5 in the control execution device 20 uses the performance data Si from the controlled plant 1 and the control operation amount S3 from the control output calculation unit 3 to determine the control operation amount output feasibility data S4 to the control operation end. The control output suppression unit 4 determines whether or not to output the control operation amount S3 to the control operation end according to the control operation amount output feasibility data S4, and outputs the control operation amount S3 that is determined to be feasible as the control operation amount output SO to be given to the controlled plant 1. As a result, the control operation amount S3 that is determined to be abnormal is not output to the controlled plant 1. The control operation disturbance generation unit 16 generates a disturbance and gives it to the controlled plant 1 for the purpose of verifying the plant control system.

以上のように構成された制御実行装置２０は、その処理実行のために、さらに後述するように、制御ルール評価値データベースＤＢ１および出力判定データベースＤＢ３を参照する。制御ルール評価値データベースＤＢ１は、制御実行装置２０内の制御ルール実行部１０と、後述する制御方法学習装置２１内の制御ルール学習部１１の双方にアクセス可能に接続されている。制御ルール学習部１１における学習結果としての制御ルール（ニューラルネット）が制御ルール評価値データベースＤＢ１に格納されており、制御ルール実行部１０は制御ルール評価値データベースＤＢ１に格納された制御ルールを参照する。出力判定データベースＤＢ３は、制御実行装置２０内の制御出力判定部５にアクセス可能に接続されている。 The control execution device 20 configured as described above refers to a control rule evaluation value database DB1 and an output judgment database DB3 for executing its processing, as described further below. The control rule evaluation value database DB1 is connected so as to be accessible to both the control rule execution unit 10 in the control execution device 20 and the control rule learning unit 11 in the control method learning device 21, which will be described later. The control rules (neural nets) as the learning results in the control rule learning unit 11 are stored in the control rule evaluation value database DB1, and the control rule execution unit 10 refers to the control rules stored in the control rule evaluation value database DB1. The output judgment database DB3 is connected so as to be accessible to the control output judgment unit 5 in the control execution device 20.

図２は、本発明の実施例に係る制御ルール実行部１０の具体的な構成事例を示している。制御ルール実行部１０は、制御入力データ作成部２で作成した入力データＳ１を入力して、制御出力演算部３に制御操作端操作指令Ｓ２を与える。制御ルール実行部１０はニューラルネット１０１を備えており、ニューラルネット１０１では基本的には図６に例示したような特許文献１の手法により制御操作端操作指令Ｓ２を定めている。本発明においては、制御ルール実行部１０はさらにニューラルネット選択部１０２を備えており、制御ルール評価値データベースＤＢ１に格納された制御ルールを参照することで、ニューラルネット１０１における制御ルールとして、最適な制御ルールを選択し、実行せしめる。このように図２の制御ルール実行部１０においては、オペレータ班や制御目的で分けられた複数のニューラルネットから、必要なニューラルネットを選択し、使用している。制御ルール評価値データベースＤＢ１には、制御対象プラント１からのデータとして、ニューラルネットおよび良否判定基準を選択できるような実績データ（操業班のデータ等）Ｓｉも含むのがよい。なお、ニューラルネットを実行すると制御ルールになるという関係にあることから、本明細書においてはニューラルネットと制御ルールを区別せず、同義の意味で使用している。 Figure 2 shows a specific example of the configuration of the control rule execution unit 10 according to the embodiment of the present invention. The control rule execution unit 10 inputs the input data S1 created by the control input data creation unit 2 and gives the control operation terminal operation command S2 to the control output calculation unit 3. The control rule execution unit 10 is equipped with a neural network 101, which basically determines the control operation terminal operation command S2 by the method of Patent Document 1 as illustrated in Figure 6. In the present invention, the control rule execution unit 10 further includes a neural network selection unit 102, which refers to the control rules stored in the control rule evaluation value database DB1 and selects and executes the optimal control rule as the control rule in the neural network 101. In this way, the control rule execution unit 10 in Figure 2 selects and uses the necessary neural network from multiple neural networks divided by operator teams and control purposes. It is preferable that the control rule evaluation value database DB1 also includes performance data (operation team data, etc.) Si from the controlled plant 1, from which a neural network and a pass/fail judgment criterion can be selected. In addition, since the neural network becomes a control rule when it is executed, in this specification there is no distinction between the neural network and the control rule, but they are used synonymously.

図１に戻り、制御方法学習装置２１においては、制御実行装置２０で使用するニューラルネット１０１の学習を実施する。制御実行装置２０が制御対象プラント１に対して、制御操作量出力ＳＯを出力した場合、実際に制御効果が実績データＳｉの変化となって現れるには時間を要する。このため、その時間だけ時間遅れさせたデータを用いて学習を実施する。図１において、Ｚ^－１は、各データに対する適宜の時間遅れ機能を表している。 Returning to Fig. 1, the control method learning device 21 performs learning of the neural network 101 used in the control execution device 20. When the control execution device 20 outputs a control manipulated variable output SO to the controlled plant 1, it takes time for the control effect to actually appear as a change in the performance data Si. For this reason, learning is performed using data that has been delayed by that time. In Fig. 1, Z ^-1 represents an appropriate time delay function for each piece of data.

制御方法学習装置２１は、制御結果良否判定部６、学習データ作成部７、制御ルール学習部１１、良否判定データベースＤＢ４を主たる要素として構成されている。 The control method learning device 21 is composed of the following main elements: a control result pass/fail judgment unit 6, a learning data creation unit 7, a control rule learning unit 11, and a pass/fail judgment database DB4.

このうち、制御結果良否判定部６は、制御対象プラント１からの実績データＳｉおよび実績データ前回値Ｓｉ０、並びに良否判定データベースＤＢ４に記憶された良否判定データＳ５を用いて、実績データＳｉが良くなる方向に変化したか、悪くなる方向に変化したか判定し、制御結果良否データＳ６を出力する。 The control result pass/fail judgment unit 6 uses the performance data Si and previous performance data value Si0 from the controlled plant 1, as well as the pass/fail judgment data S5 stored in the pass/fail judgment database DB4, to judge whether the performance data Si has changed for the better or for the worse, and outputs the control result pass/fail data S6.

制御方法学習装置２１内の学習データ作成部７においては、制御実行装置２０にて作成した制御操作端操作指令Ｓ２、制御操作量Ｓ３、制御操作量出力可否データＳ４などの入力データをそれぞれ同じ時間だけ時間遅れさせたデータと、制御結果良否判定部６よりの制御結果良否データＳ６を用いて、ニューラルネットの学習に使用する新規の教師データＳ７ａを作成し、制御ルール学習部１１に与える。なお、教師データＳ７ａは、制御ルール実行部１０が出力する制御操作端操作指令Ｓ２に対応するものであり、学習データ作成部７は、制御結果良否判定部６が与える制御結果良否データＳ６を用いて制御ルール実行部１０が出力する制御操作端操作指令Ｓ２を推定して得たデータを、新規の教師データＳ７ａとして求めたものということができる。 The learning data creation unit 7 in the control method learning device 21 creates new teacher data S7a to be used for learning the neural network using data, such as the control operation terminal operation command S2, the control operation amount S3, and the control operation amount output availability data S4, created by the control execution device 20, delayed by the same time, and the control result quality data S6 from the control result quality determination unit 6, and provides this to the control rule learning unit 11. Note that the teacher data S7a corresponds to the control operation terminal operation command S2 output by the control rule execution unit 10, and the learning data creation unit 7 can be said to have obtained the data obtained by estimating the control operation terminal operation command S2 output by the control rule execution unit 10 using the control result quality data S6 provided by the control result quality determination unit 6 as the new teacher data S7a.

制御ルール更新判断装置２２は、実績データＳｉと制御ルールから制御ルールのプラントへの適合度を評価し、制御処理計算機２３が高負荷になることなく処理できるタイミングで新たな教師データＳ７ａから制御ルールを学習し、制御ルールを更新する。制御処理計算機２３は、制御実行装置２０および制御ルール更新判断装置２２を実現する計算機である。 The control rule update judgment device 22 evaluates the suitability of the control rules to the plant based on the performance data Si and the control rules, and learns the control rules from new teacher data S7a at a timing that allows the control processing computer 23 to process without placing a high load, and updates the control rules. The control processing computer 23 is a computer that realizes the control execution device 20 and the control rule update judgment device 22.

図１の説明に戻ると、制御ルール更新判断装置２２は、制御実行装置２０で使用するニューラルネット１０１の更新を実行する。制御ルール更新判断装置２２は、制御ルール更新処理管理部２４、制御ルール適合度評価部２５、制御ルール更新評価部２６、および制御ルール評価値データベースＤＢ５を備える。 Returning to the explanation of FIG. 1, the control rule update determination device 22 executes updates to the neural network 101 used in the control execution device 20. The control rule update determination device 22 includes a control rule update processing management unit 24, a control rule conformance evaluation unit 25, a control rule update evaluation unit 26, and a control rule evaluation value database DB5.

制御ルール適合度評価部２５は、プラントの操業中に逐次作成される実績データＳｉに対して、データベース管理テーブルＴＢから実績データＳｉに対応するニューラルネット（制御ルール）Ｎｏ．Ｓ９を取得し、制御ルール評価値データベースＤＢ１から該当する制御ルールＳ１０を選択する。制御ルール適合度評価部２５は、選択した制御ルールへ実績データＳｉの形状を入力した際の出力と実績データＳｉに含まれる制御出力との差分（誤差）または差分に基づく指標をプラントに対する制御ルールの適合度として算出し、制御ルール評価値データベースＤＢ５へ格納する。制御ルール適合度評価部２５は、制御ルール更新評価部２６に対して、評価したニューラルネット（制御ルール）のＮｏ．と制御ルール更新評価の実行指示Ｓ１２を出力する。 The control rule conformance evaluation unit 25 obtains the neural network (control rule) No. S9 corresponding to the performance data Si from the database management table TB for the performance data Si that is created sequentially during the operation of the plant, and selects the corresponding control rule S10 from the control rule evaluation value database DB1. The control rule conformance evaluation unit 25 calculates the difference (error) between the output when the shape of the performance data Si is input to the selected control rule and the control output included in the performance data Si, or an index based on the difference, as the conformance of the control rule to the plant, and stores it in the control rule evaluation value database DB5. The control rule conformance evaluation unit 25 outputs the No. of the evaluated neural network (control rule) and an instruction S12 to execute a control rule update evaluation to the control rule update evaluation unit 26.

制御ルール更新評価部２６は、制御ルール適合度評価部２５から制御ルール更新評価の実行指示を受けた後、制御ルール適合度評価部２５にて評価されたニューラルネット（制御ルール）Ｎｏ．の制御ルールを取得し（Ｓ１３）、その更新要否を評価し、評価結果を制御ルール評価値データベースＤＢ５に登録し、制御ルール評価値データベースＤＢ５に登録されている制御ルールの更新優先度を更新する（Ｓ１４）。制御ルールは、各ニューラルネットＮｏ．について、最新の所定数の制御ルール適合度評価値の平均の低い順番で高い更新優先度とする。 After receiving an instruction to execute a control rule update evaluation from the control rule compatibility evaluation unit 25, the control rule update evaluation unit 26 acquires the control rule of the neural network (control rule) No. evaluated by the control rule compatibility evaluation unit 25 (S13), evaluates whether or not it needs to be updated, registers the evaluation result in the control rule evaluation value database DB5, and updates the update priority of the control rule registered in the control rule evaluation value database DB5 (S14). For each neural network No., the control rule is assigned a higher update priority in the order of lowest average of the latest predetermined number of control rule compatibility evaluation values.

制御ルール更新処理管理部２４は、制御ルール評価値データベースＤＢ５から更新優先度が最も高い制御ルールを選択し（Ｓ１５）、制御ルール学習部１１へ制御ルールの学習を指示する処理実行指示Ｓ１６を与えることで、制御ルールの更新を実行させる。制御ルール学習の実行は計算機負荷が大きいため、制御ルール更新処理管理部２４は、制御処理計算機２３のＣＰＵ負荷とメモリ使用率等のリソース使用状況を監視し（Ｓ１７）、実績データＳｉから操業中か否かを監視し、制御プラントで更新対象の制御モデルが圧延に使用されていない、および／または、制御ルール学習処理を実行しても制御実行装置２０の処理が遅延しない場合に、処理実行指示Ｓ１６を出力する。その後、制御ルール更新処理管理部２４は、処理実行指示Ｓ１６を出力した制御ルールに関する情報を制御ルール評価値データベースＤＢ５から削除する削除指示Ｓ１８を出力する。 The control rule update processing management unit 24 selects the control rule with the highest update priority from the control rule evaluation value database DB5 (S15), and issues a processing execution instruction S16 to the control rule learning unit 11 to instruct learning of the control rule, thereby updating the control rule. Since the execution of control rule learning places a large load on the computer, the control rule update processing management unit 24 monitors the resource usage status such as the CPU load and memory usage rate of the control processing computer 23 (S17), monitors whether or not the computer is in operation from the performance data Si, and outputs the processing execution instruction S16 when the control model to be updated in the control plant is not used for rolling and/or the processing of the control execution device 20 is not delayed even if the control rule learning processing is executed. After that, the control rule update processing management unit 24 outputs a deletion instruction S18 to delete information on the control rule for which the processing execution instruction S16 was output from the control rule evaluation value database DB5.

図３は、本発明の実施例に係る制御ルール学習部１１の具体的な構成事例を示している。制御ルール学習部１１は、入力データ作成部１１４、教師データ作成部１１５、ニューラルネット処理部１１０、ニューラルネット選択部１１３を主たる構成要素として構成されている。また制御ルール学習部１１は、外部からの入力として制御入力データ作成部２からの入力データＳ１を時間遅れさせたデータＳ８ａを、学習データ作成部７からの新規の教師データＳ７ａを得、また制御ルール評価値データベースＤＢ１および学習データデータベースＤＢ２に蓄積されたデータを参照する。 Figure 3 shows a specific example of the configuration of the control rule learning unit 11 according to an embodiment of the present invention. The control rule learning unit 11 is composed of an input data creation unit 114, a teacher data creation unit 115, a neural network processing unit 110, and a neural network selection unit 113 as main components. The control rule learning unit 11 also obtains data S8a, which is a time-delayed version of the input data S1 from the control input data creation unit 2 as an external input, new teacher data S7a from the learning data creation unit 7, and also refers to the data stored in the control rule evaluation value database DB1 and the learning data database DB2.

制御ルール学習部１１において、入力データＳ１は適宜の時間遅れ補償後に入力データ作成部１１４を介してニューラルネット処理部１１０に取り込まれる。 In the control rule learning unit 11, the input data S1 is input to the neural network processing unit 110 via the input data creation unit 114 after appropriate time delay compensation.

また制御ルール学習部１１において、学習データ作成部７からの新規の教師データＳ７ａは、教師データ作成部１１５において学習データデータベースＤＢ２に記憶されている過去の教師データＳ７ｂも含めた合計の教師データＳ７ｃとして、ニューラルネット処理部１１０に与えられる。これらの教師データＳ７ａ、Ｓ７ｂは、適宜、学習データデータベースＤＢ２に記憶されて、利用される。 In addition, in the control rule learning unit 11, new teacher data S7a from the learning data creation unit 7 is provided to the neural network processing unit 110 as total teacher data S7c including past teacher data S7b stored in the learning data database DB2 in the teacher data creation unit 115. These teacher data S7a and S7b are stored in the learning data database DB2 as appropriate and used.

同様に、制御入力データ作成部２からの入力データＳ８ａは、入力データ作成部１１４において学習データデータベースＤＢ２に記憶されている過去の入力データＳ８ｂも含めた合計の入力データＳ８ｃとして、ニューラルネット処理部１１０に与えられる。これらの入力データＳ８ａ、Ｓ８ｂは、適宜、学習データデータベースＤＢ２に記憶されて、利用される。 Similarly, input data S8a from the control input data creation unit 2 is provided to the neural network processing unit 110 as total input data S8c including past input data S8b stored in the learning data database DB2 in the input data creation unit 114. These input data S8a and S8b are stored in the learning data database DB2 as appropriate and used.

ニューラルネット処理部１１０は、ニューラルネット１１１とニューラルネット学習制御部１１２により構成されており、ニューラルネット１１１は、入力データ作成部１１４からの入力データＳ８ｃ、教師データ作成部１１５からの教師データＳ７ｃ、ニューラルネット選択部１１３が選択した制御ルール（ニューラルネット）を取り込み、最終的に決定したニューラルネットを制御ルール評価値データベースＤＢ１に格納する。 The neural network processing unit 110 is composed of a neural network 111 and a neural network learning control unit 112. The neural network 111 takes in input data S8c from the input data creation unit 114, teacher data S7c from the teacher data creation unit 115, and the control rule (neural network) selected by the neural network selection unit 113, and stores the finally determined neural network in the control rule evaluation value database DB1.

ニューラルネット学習制御部１１２は、入力データ作成部１１４、教師データ作成部１１５、ニューラルネット選択部１１３に対して、適宜のタイミングでこれらを制御し、ニューラルネット１１１の入力を得、また処理結果を制御ルール評価値データベースＤＢ１に格納すべく制御している。 The neural network learning control unit 112 controls the input data creation unit 114, the teacher data creation unit 115, and the neural network selection unit 113 at appropriate times to obtain input for the neural network 111, and also controls the processing results to be stored in the control rule evaluation value database DB1.

ここで、図２の制御実行装置２０におけるニューラルネット１０１と、図３の制御方法学習装置２１におけるニューラルネット１１１は、いずれも同じ概念のニューラルネットであるが、利用するうえでの基本概念上の相違について説明をしておくと、以下のようである。まず制御実行装置２０におけるニューラルネット１０１は、予め定められた内容のニューラルネットであり、入力データＳ１を与えたときに対応する出力としての制御操作端操作指令Ｓ２を求めるものであり、いわば一方方向の処理に利用されるニューラルネットである。これに対し、制御方法学習装置２１におけるニューラルネット１１１は、入力データＳ１と制御操作端操作指令Ｓ２についての入力データＳ８ｃ、教師データＳ７ｃを学習データとして設定したときに、この入出力関係を満足するニューラルネットを学習により求めるためのものである。 The neural network 101 in the control execution device 20 in FIG. 2 and the neural network 111 in the control method learning device 21 in FIG. 3 are both neural networks with the same concept, but the basic conceptual differences in their use will be explained as follows. First, the neural network 101 in the control execution device 20 is a neural network with predetermined contents, which obtains a control operation terminal operation command S2 as a corresponding output when input data S1 is given, and is a neural network used for one-way processing, so to speak. In contrast, the neural network 111 in the control method learning device 21 is used to obtain, by learning, a neural network that satisfies the input/output relationship when input data S1 and input data S8c for the control operation terminal operation command S2 and teacher data S7c are set as learning data.

上記のように構成された制御方法学習装置２１における基本的な処理の考え方は、以下のようである。まず、制御操作量出力可否データＳ４の内容が「可」の場合、制御対象プラント１に制御操作量出力ＳＯを出力し、制御結果良否データＳ６の内容が「良」（実績データＳｉが良くなる方向に変化）の場合、制御ルール実行部１０が出力した制御操作端操作指令Ｓ２は正しいと判断し、ニューラルネットの出力が制御操作端操作指令Ｓ２となるように学習データを作成する。 The basic processing concept of the control method learning device 21 configured as above is as follows. First, if the content of the control operation amount output feasibility data S4 is "yes", the control operation amount output SO is output to the controlled plant 1, and if the content of the control result pass/fail data S6 is "good" (the performance data Si changes in a better direction), it is determined that the control operation terminal operation command S2 output by the control rule execution unit 10 is correct, and learning data is created so that the output of the neural network becomes the control operation terminal operation command S2.

一方、制御操作量出力可否データＳ４の内容が「否」、または、制御対象プラント１に制御操作量出力ＳＯを出力し、制御結果良否データＳ６の内容が「否」（実績データＳｉが悪くなる方向に変化）の場合、制御ルール実行部１０が出力した制御操作端操作指令Ｓ２は誤っていると判断し、ニューラルネットの出力が出ないように学習データを作成する。このとき、制御出力として、同じ制御操作端に対して＋方向、－方向の２種類の出力が出るようにニューラルネット出力を構成しておき、出力した側の制御操作端操作指令Ｓ２が出力されないように学習データを作成する。 On the other hand, if the content of the control operation amount output feasibility data S4 is "no", or if the control operation amount output SO is output to the controlled plant 1 and the content of the control result pass/fail data S6 is "no" (the performance data Si changes in a worsening direction), it is determined that the control operation terminal operation command S2 output by the control rule execution unit 10 is incorrect, and learning data is created so that no neural network output is generated. At this time, the neural network output is configured so that two types of output, a positive output and a negative output, are generated for the same control operation terminal, and learning data is created so that the control operation terminal operation command S2 on the output side is not output.

また図３に例示する制御ルール学習部１１においては、ニューラルネット学習制御部１１２によるデータ処理の結果として、以下のように処理している。ここでは、まず制御実行装置２０への入力データＳ１を時間遅れさせたＳ８ｃと、教師データ作成部１１５にて作成した教師データＳ７ｃの組合せである学習データを用いて、制御ルール実行部１０にて用いたニューラルネット１０１の学習を実施する。実際には、制御ルール実行部１０のニューラルネット１０１と同じニューラルネット１１１を制御ルール学習部１１内に備えて、各種条件で運用テストしてその時の応答を学習し、学習の結果としてより良い結果を生じることが確認された制御ルールを得るものである。学習は、複数個の学習データを用いて行わせる必要があるため、過去に作成された学習データを蓄積している学習データデータベースＤＢ２より、過去の学習データを複数個取り出して、学習し処理を実施するとともに、今回の学習データを学習データデータベースＤＢ２に格納する。また、学習したニューラルネットは、制御ルール実行部１０にて利用するために、制御ルール評価値データベースＤＢ１に格納される。 In the control rule learning unit 11 illustrated in FIG. 3, the following processing is performed as a result of data processing by the neural network learning control unit 112. Here, the neural network 101 used in the control rule execution unit 10 is first learned using learning data that is a combination of S8c, which is the input data S1 to the control execution device 20 delayed in time, and teacher data S7c created by the teacher data creation unit 115. In reality, the same neural network 111 as the neural network 101 of the control rule execution unit 10 is provided in the control rule learning unit 11, and operation tests are performed under various conditions to learn the responses at that time, and a control rule that has been confirmed to produce better results as a result of learning is obtained. Since learning must be performed using multiple learning data, multiple past learning data are taken out from the learning data database DB2, which accumulates learning data created in the past, and learning and processing are performed, and the current learning data is stored in the learning data database DB2. In addition, the learned neural network is stored in the control rule evaluation value database DB1 for use by the control rule execution unit 10.

ニューラルネットの学習は、新しい学習データが作成される毎に、過去の学習データを一緒に用いて学習しても良いし、学習データがある程度（例えば１００個分）蓄積されてから、過去の学習データを一緒に用いて学習しても良い。 When training a neural network, new training data may be created each time new training data is generated, and the new data may be used together with past training data, or after a certain amount of training data (e.g., 100 pieces) has been accumulated, the new data may be used together with past training data.

また、制御結果良否判定部６においては、良否判定データベースＤＢ４からの良否判定基準をもとに良否判定を実施する。制御結果の良否判定は、制御目的に応じて判断結果が異なるため、複数の制御目的に応じたニューラルネットを複数作成し、入力データが同じでも制御目的によりそれぞれ教師データを作成し、学習することで、１回分の入力データに対して複数の教師データを作成し、それぞれの教師データに対応するニューラルネットの学習に用いることで、同時に複数の制御目的に対応したニューラルネットを学習していくことが可能である。ここで、複数の制御目的とは、例えば形状制御の場合、板幅方向でどの部分（板端部、センター部、非対称部等）を優先的に制御したいか、複数の制御対象項目（例えば、板厚と張力、圧延荷重等）のいずれを優先的に制御したいか、等のことである。 In addition, the control result quality judgment unit 6 performs quality judgment based on quality judgment criteria from the quality judgment database DB4. Since the quality judgment of the control result differs depending on the control purpose, multiple neural networks corresponding to multiple control purposes are created, and even if the input data is the same, teacher data is created for each control purpose and learned. By creating multiple teacher data for one input data and using it to learn the neural network corresponding to each teacher data, it is possible to simultaneously learn neural networks corresponding to multiple control purposes. Here, multiple control purposes refer to, for example, in the case of shape control, which part in the plate width direction (plate end, center part, asymmetric part, etc.) is to be controlled with priority, which of multiple control target items (for example, plate thickness and tension, rolling load, etc.) is to be controlled with priority, etc.

上記の様な構成とした場合、一旦制御ルール実行部１０で用いられるニューラルネット１０１が学習してしまうと、新たな制御操作が実施されなくなる。そのため、制御操作外乱発生部１６により、適時新たな操作方法を乱数的に発生させ、制御操作量Ｓ３に加えて制御操作を実行する事で、新たな制御方法を学習していく。 In the above configuration, once the neural network 101 used in the control rule execution unit 10 has learned, new control operations will no longer be performed. Therefore, the control operation disturbance generation unit 16 randomly generates new operation methods at appropriate times, and learns new control methods by executing control operations in addition to the control operation amount S3.

以下、特許文献１に示すようなゼンジミア圧延機における形状制御を対象に、本プラント制御方法の詳細を説明する。なお形状制御に関しては、下記のような仕様Ａ、Ｂを採用するものとして説明する。 The details of this plant control method will be described below, focusing on shape control in a Sendzimir rolling mill as shown in Patent Document 1. Note that the shape control will be described assuming that the following specifications A and B are adopted.

仕様Ａは、優先度についての仕様であり、板幅方向の優先度の情報を持つものとする。例えば形状制御においては、板幅方向全域にわたって目標値に制御する事が、機械特性上困難な場合が多い。そのため、板幅方向で下記２つの優先度についての仕様Ａ１、Ａ２を設ける。このうち優先度についての仕様Ａ１は「板端部を優先する」、優先度についての仕様Ａ２は「中央部を優先する」であり、Ａ１、Ａ２という２つの優先順位に従った制御を実施する。制御を実施する場合は優先度についての仕様Ａ１またはＡ２のいずれかを考慮する。 Specifications A are specifications for priority, and contain priority information in the strip width direction. For example, in shape control, it is often difficult to control to the target value across the entire strip width direction due to machine characteristics. For this reason, the following two priority specifications A1 and A2 are provided in the strip width direction. Of these, priority specification A1 is "give priority to strip ends," and priority specification A2 is "give priority to center," and control is implemented according to the two priorities A1 and A2. When implementing control, either priority specification A1 or A2 is taken into consideration.

仕様Ｂは、予め判明している条件への対応についての仕様である。一例をあげると、形状パターンと制御方法の関係は、種々の条件で変化することから、例えば、仕様Ｂ１を板幅、仕様Ｂ２を鋼種とする区分で分ける必要がある事が考えられる。上記それぞれが変化することで、形状操作端の形状への影響度合が変化する。 Specification B is a specification for dealing with conditions that are known in advance. As an example, since the relationship between the shape pattern and the control method changes under various conditions, it may be necessary to classify them by, for example, specification B1 by plate width and specification B2 by steel type. Changes in each of the above will change the degree of influence on the shape of the shape control terminal.

この事例では制御対象プラント１は、ゼンジミア圧延機であり、実績データは形状実績となる。なおゼンジミア圧延機は、ステンレスなどの硬い材料を冷間圧延するためのクラスターロールを持つ圧延機である。ゼンジミア圧延機では、硬い材料に強圧下を与える目的で、小径のワークロールを用いる。このため、平坦な鋼板を得ることが難しい。この対策として、クラスターロールの構造やさまざまな形状制御部を採用している。ゼンジミア圧延機は一般には、上下の第１中間ロールが片テーパを持ち、シフトできるようになっているほか、上下に６個の分割ロールと２個のＡＳ－Ｕと呼ばれるロールを備えている。以下に説明する事例では、形状の実績データＳｉとしては、形状検出器の検出データを用い、さらに入力データＳ１としては、目標形状との差である、形状偏差を用いる。また制御操作量Ｓ３としては、＃１～＃ｎのＡＳ－Ｕ、上下の第１中間ロールのロールシフト量とする。 In this example, the plant 1 to be controlled is a Sendzimir rolling mill, and the actual data is the shape actual data. The Sendzimir rolling mill is a rolling mill with cluster rolls for cold rolling hard materials such as stainless steel. In the Sendzimir rolling mill, small diameter work rolls are used to apply strong reduction to hard materials. For this reason, it is difficult to obtain flat steel sheets. To address this issue, the structure of cluster rolls and various shape control devices are adopted. In general, the upper and lower first intermediate rolls have a single taper and can be shifted, and the Sendzimir rolling mill is equipped with six split rolls and two rolls called AS-U on the upper and lower sides. In the example described below, the detection data of the shape detector is used as the actual shape data Si, and the shape deviation, which is the difference from the target shape, is used as the input data S1. The control operation amount S3 is the AS-U of #1 to #n and the roll shift amount of the upper and lower first intermediate rolls.

図４に、ゼンジミア圧延機の形状制御に用いる場合のニューラルネット構成を示す。ここでニューラルネットとは、制御ルール実行部１０用ではニューラルネット１０１のことであり、制御ルール学習部１１用ではニューラルネット１１１に示したニューラルネットを示しているが、いずも構造は同じである。 Figure 4 shows the neural network configuration when used for shape control of a Sendzimir rolling mill. Here, the neural network refers to neural network 101 for the control rule execution unit 10, and to neural network 111 for the control rule learning unit 11, but both have the same structure.

図４に示すゼンジミア圧延機の形状制御の事例では、制御対象プラント１からの実績データＳｉは形状検出器のデータ（ここでは、実績形状と目標形状との差である形状偏差が出力されるものとする）を含むゼンジミア圧延機の実績データであり、制御入力データ作成部２では、入力データＳ１として規格化形状偏差２０１、形状偏差段階２０２を得る。これによりニューラルネット１０１、１１１の入力層は、規格化形状偏差２０１、形状偏差段階２０２により構成される。なお図４では、形状偏差段階２０２をニューラルネット入力層への入力としているが、段階に応じてニューラルネットを切替てもよい。 In the example of shape control of a Sendzimir rolling mill shown in Figure 4, actual data Si from the controlled plant 1 is actual data of the Sendzimir rolling mill including data from a shape detector (here, the shape deviation, which is the difference between the actual shape and the target shape, is assumed to be output), and the control input data creation unit 2 obtains a standardized shape deviation 201 and a shape deviation stage 202 as input data S1. As a result, the input layer of the neural networks 101, 111 is composed of the standardized shape deviation 201 and the shape deviation stage 202. Note that in Figure 4, the shape deviation stage 202 is input to the neural network input layer, but the neural network may be switched depending on the stage.

また、出力層は、ゼンジミア圧延機の形状制御操作端である、ＡＳ－Ｕ、第１中間ロールに合わせて、ＡＳ－Ｕ操作度合３０１と第１中間操作度合３０２により構成される。それぞれの操作度合は、ＡＳ－Ｕについては、ＡＳ－Ｕ開方向（ロールギャップ（圧延機の上下作業ロール間の間隔）が開く方向）、ＡＳ－Ｕ閉方向（ロールギャップが閉じる方向）を各ＡＳ－Ｕについて持つ。また、第１中間ロールについては、第１中間ロール開方向（第１中間ロールが圧延機中心より外側に向かって動作する方向）、第１中間ロール閉方向（第１中間ロールが圧延機中心側に向かって動作する方向）を上下第１中間ロールについて持つ。例えば、形状検出器が２０ゾーンで、形状偏差段階２０２を３段階（大、中、小）とした場合、入力層は２３個の入力となる。また、ＡＳ－Ｕのサドルが７本、上下第１中間ロールが板幅方向でシフト可能とすると、出力層はＡＳ－Ｕ操作度合３０１が１４個、１中間操作度合が４個の計１８個となる。中間層の層数および各層のニューロン数については、適時設定する。なお図８を参照して後述するが、出力層であるゼンジミア圧延機の形状制御操作端について、個々の制御操作端に対して＋方向、－方向の２種類の出力が出るようにニューラルネット出力を構成している。 The output layer is also composed of an AS-U operation degree 301 and a first intermediate operation degree 302 according to the AS-U and first intermediate roll, which are the shape control operation terminals of the Sendzimir rolling mill. For the AS-U, each operation degree has an AS-U opening direction (the direction in which the roll gap (the gap between the upper and lower work rolls of the rolling mill) opens) and an AS-U closing direction (the direction in which the roll gap closes). For the first intermediate roll, each has a first intermediate roll opening direction (the direction in which the first intermediate roll moves toward the outside of the center of the rolling mill) and a first intermediate roll closing direction (the direction in which the first intermediate roll moves toward the center of the rolling mill) for the upper and lower first intermediate rolls. For example, if the shape detector has 20 zones and the shape deviation stage 202 has three stages (large, medium, small), the input layer will have 23 inputs. In addition, if there are seven AS-U saddles and the upper and lower first intermediate rolls can be shifted in the strip width direction, the output layer will have 14 AS-U operation degrees 301 and four 1-intermediate operation degrees, for a total of 18. The number of layers in the intermediate layer and the number of neurons in each layer are set appropriately. As will be described later with reference to Figure 8, the neural network output is configured so that two types of output, a positive output and a negative output, are output for each shape control operation end of the Sendzimir rolling mill, which is the output layer.

図１０に形状偏差と制御方法について示している。ここでは図１０上部に、形状偏差が大きい場合の制御方法を示し、図１０の下部に形状偏差が小さい場合の制御方法を示している。なお高さ方向は形状偏差の大きさ、横軸方向は板幅方向であり、板幅の両側が板端部、中央が板中央部を表している。この図１０の上部に示すように、形状偏差が大きい場合は、板幅方向の局部的な形状偏差よりも全体的な形状を修正することを優先する。一方図１０の下部に示すように、形状偏差が小さい場合は、局部的な形状偏差を小さくすることを優先する。 Figure 10 shows shape deviation and control methods. The upper part of Figure 10 shows the control method when the shape deviation is large, and the lower part of Figure 10 shows the control method when the shape deviation is small. Note that the height direction is the magnitude of the shape deviation, the horizontal axis direction is the strip width direction, with both sides of the strip width representing the strip ends and the center representing the strip center. As shown in the upper part of Figure 10, when the shape deviation is large, priority is given to correcting the overall shape over the local shape deviation in the strip width direction. On the other hand, as shown in the lower part of Figure 10, when the shape deviation is small, priority is given to reducing the local shape deviation.

このように、形状偏差の大きさに応じて制御方法を変える必要があるため、図４に示すように形状偏差段階２０２を設けてニューラルネット１０１、１１１に与え、形状偏差の大きさを判定する。形状偏差については形状偏差の大小にかかわらず、例えば０～１に規格化したものを用いるのがよい。これは、一例であって、形状偏差を規格化せずにそのままニューラルネットの入力層へ入力することも考えられるし、形状偏差の大小に応じて、ニューラルネット自体を変える（例えば、２つのニューラルネットを準備し、形状偏差が大きい場合に使用するニューラルネットと、小さい場合に使用するニューラルネットを分ける）事も考えられる。 As such, since it is necessary to change the control method depending on the magnitude of the shape deviation, a shape deviation stage 202 is provided as shown in FIG. 4 and is provided to the neural networks 101, 111 to determine the magnitude of the shape deviation. It is preferable to use a shape deviation that is normalized, for example, between 0 and 1, regardless of the magnitude of the shape deviation. This is just one example, and it is also possible to input the shape deviation directly to the input layer of the neural network without normalizing it, or to change the neural network itself depending on the magnitude of the shape deviation (for example, prepare two neural networks and use one neural network when the shape deviation is large and another neural network when it is small).

以上説明した図４のような構成のニューラルネット１０１、１１１に対して、形状パターンに対する操作方法を学習させ、学習させたニューラルネットを用いて形状制御を実施する。同じ構成のニューラルネットでも、学習の条件により異なった特性となり、同じ形状パターンに対して異なった制御出力を出すようにすることができる。 The neural networks 101 and 111 configured as shown in FIG. 4 described above are trained to learn the operation method for the shape pattern, and the trained neural network is used to perform shape control. Even neural networks with the same configuration can have different characteristics depending on the learning conditions, and can be made to output different control outputs for the same shape pattern.

そのため、形状実績の他の条件に応じて、複数のニューラルネットを使い分けることで、多様な条件に対して最適な制御を構成することができる。これは仕様Ｂへの対応である。先に説明した図２の構成は、係る仕様を行う場合の具体例を示している。図２の構成事例では、制御ルール実行部１０において使用するニューラルネット１０１を、圧延実績や、圧延機オペレータ名、被圧延材の鋼種、板幅等により別個のニューラルネットを準備し、制御ルール評価値データベースＤＢ１に登録しておく。ニューラルネット選択部１０２においては、その時点の条件に合致するニューラルネットを選択し、制御ルール実行部１０のニューラルネット１０１に設定する。なおニューラルネット選択部１０２における、その時点の条件としては、制御対象プラント１における実績データＳｉの中から板幅のデータを取り込み、これに応じてニューラルネットを選択するのがよい。また、ここで使用する複数のニューラルネットは、図４に示すような入力層、出力層を持てば、中間層の層数、各層のユニット数は異なっても良い。 Therefore, by using multiple neural networks according to other conditions of the shape record, it is possible to configure optimal control for various conditions. This corresponds to specification B. The configuration of FIG. 2 described above shows a specific example of the case where such a specification is performed. In the configuration example of FIG. 2, separate neural networks are prepared for the neural network 101 used in the control rule execution unit 10 according to rolling records, the name of the rolling mill operator, the steel type of the material to be rolled, the plate width, etc., and are registered in the control rule evaluation value database DB1. In the neural network selection unit 102, a neural network that matches the conditions at that time is selected and set as the neural network 101 of the control rule execution unit 10. Note that, as the conditions at that time in the neural network selection unit 102, it is preferable to import plate width data from the actual data Si of the controlled plant 1 and select a neural network according to this. In addition, the multiple neural networks used here may have different numbers of layers in the intermediate layer and the number of units in each layer as long as they have input layers and output layers as shown in FIG. 4.

図７に、ニューラルネット１０１、１１１の入力層へ入力するためのデータＳ１（規格化形状偏差２０１、形状偏差段階２０２）を作成する、制御入力データ作成部２の概要を示す。ここでは実績データＳｉとして、制御対象プラント１であるゼンジミア圧延機における圧延時の板形状を検出する、形状検出器の形状検出器データを入力とし、まず、形状偏差ＰＰ値演算装置２１０にて各形状検出器ゾーンの検出結果の最大値と最小値の差である形状偏差ＰＰ値（ＰｅａｋＴｏＰｅａｋ値）Ｓ_ＰＰを求める。形状偏差段階演算装置２１１では、形状偏差ＰＰ値Ｓ_ＰＰにより、形状偏差を大、中、小の３段階に分類する。形状は、被圧延材の伸び率の板幅方向分布であり、伸び率を１０－５単位で表すＩ－ＵＮＩＴが単位として用いられる。例えば、下式のように分類する。 7 shows an outline of the control input data creation unit 2 that creates data S1 (normalized shape deviation 201, shape deviation stage 202) to be input to the input layer of the neural network 101, 111. Here, as the actual data Si, shape detector data of a shape detector that detects the strip shape during rolling in a Sendzimir rolling mill, which is the controlled plant 1, is input, and first, a shape deviation PP value (peak to peak value) S PP, which is the difference between the maximum value and the minimum value of the detection result of each shape detector zone, is calculated in a shape deviation _{PP value calculation device 210. In the shape deviation stage calculation device 211, the shape deviation is classified into three stages, large, medium, and small, according to the shape deviation PP value S PP} _. The shape is the distribution of the elongation rate of the rolled material in the strip width direction, and I-UNIT, which expresses the elongation rate in 10-5 units, is used as the unit. For example, the classification is performed as shown in the following formula.

ここでは、（１）式の成立により形状偏差段階が（大＝１、中＝０、小＝０）とし、（２）式の成立により形状偏差段階が（大＝０、中＝１、小＝０）とし、（３）式の成立により形状偏差段階が（大＝０、中＝０、小＝１）とするように分類している。なおここでは、各ゾーンの形状偏差については、Ｓ_ＰＭ＝Ｓ_ＰＰとした、Ｓ_ＰＭを用いて規格化を実施する。 Here, the shape deviation stages are classified as follows: (large=1, medium=0, small=0) when formula (1) is satisfied, (large=0, medium=1, small=0) when formula (2) is satisfied, and (large=0, medium=0, small=1) when formula (3) is satisfied. Note that the shape deviations of each zone are normalized using _SPM , where _SPM = _SPP .

以上のようにして、ニューラルネット１０１への入力データである規格化形状偏差２０１および形状偏差段階２０２を作成する。規格化形状偏差２０１および形状偏差段階２０２は、制御ルール実行部１０の入力データＳ１である。 In this manner, the normalized shape deviation 201 and shape deviation stage 202 are created, which are input data to the neural network 101. The normalized shape deviation 201 and shape deviation stage 202 are input data S1 to the control rule execution unit 10.

図８に、制御出力演算部３の概要を示す。制御出力演算部３は、制御ルール実行部１０内の、ニューラルネット１０１からの出力である制御操作端操作指令Ｓ２（ゼンジミア圧延機の形状制御の事例では、ＡＳ－Ｕ操作度合３０１、第１中間操作度合３０２がこれに相当する）より、各形状制御操作端への操作指令である制御操作量Ｓ３を作成する。なおここでは、複数個数が存在するＡＳ－Ｕ操作度合３０１、第１中間操作度合３０２について、各１つのデータ例を示しており、各データは開方向度合と閉方向度合の一対のデータで構成されている。 Figure 8 shows an overview of the control output calculation unit 3. The control output calculation unit 3 creates a control operation amount S3, which is an operation command to each shape control operation end, from the control operation end operation command S2 (in the case of shape control of a Sendzimir rolling mill, this corresponds to the AS-U operation degree 301 and the first intermediate operation degree 302), which is the output from the neural network 101 in the control rule execution unit 10. Note that here, one data example is shown for each of the AS-U operation degree 301 and the first intermediate operation degree 302, of which there are multiple values, and each data is composed of a pair of data for the opening direction degree and the closing direction degree.

制御出力演算部３内では、入力されたＡＳ－Ｕ操作度合３０１は、各ＡＳ－Ｕ開方向、閉方向の出力をもつため、それらの差に変換ゲインＧ_ＡＳＵを掛ける事で、各ＡＳ－Ｕへの操作指令を出力する。変換ゲインＧ_ＡＳＵは、各ＡＳ－Ｕへの制御出力がＡＳ－Ｕ位置変更量（単位は長さ）となることから、度合から位置変更量への変換ゲインとなる。 In the control output calculation unit 3, the input AS-U operation degree 301 has outputs for the open and closed directions of each AS-U, so by multiplying the difference between them by the conversion gain _{G_ASU} , an operation command is output to each AS-U. Since the control output to each AS-U is the AS-U position change amount (unit is length), the conversion gain _{G_ASU} becomes the conversion gain from degree to position change amount.

また同じく入力された第１中間操作度合３０２は、第１中間外側、内側の出力をもつため、それらの差に変換ゲインＧ_１ＳＴを掛ける事で、各第１中間ロールシフトへの操作指令を出力する。変換ゲインＧ_１ＳＴは、各第１中間ロールへの制御出力が第１中間ロールシフト位置変更量（単位は長さ）となることから、度合から位置変更量への変換ゲインとなる。 The first intermediate operation degree 302, which is also input, has first intermediate outer and inner outputs, and the difference between them is multiplied by the conversion gain _G1ST to output an operation command for each first intermediate roll shift. The conversion gain _G1ST is a conversion gain from degree to position change amount, since the control output to each first intermediate roll is the first intermediate roll shift position change amount (unit is length).

以上により、制御操作量Ｓ３を演算することができる。制御操作量Ｓ３は、＃１～＃ｎＡＳ－Ｕ位置変更量（ｎはＡＳ－Ｕロールのサドル数による）と、上第１中間シフト位置変更量、下第１中間シフト位置変更量から構成されている。なお、図８には、制御操作外乱発生部１６からの外乱データを制御操作端操作指令Ｓ２に加算する系統が図示されている。 The above allows the control operation amount S3 to be calculated. The control operation amount S3 is composed of the #1 to #n AS-U position change amount (n depends on the number of saddles of the AS-U roll), the upper first intermediate shift position change amount, and the lower first intermediate shift position change amount. Note that FIG. 8 shows the system that adds the disturbance data from the control operation disturbance generating unit 16 to the control operation terminal operation command S2.

図９に、制御出力判定部５の概要を示す。制御出力判定部５は、圧延現象モデル５０１と形状修正良否判定部５０２から構成されており、制御対象プラント１よりの実績データＳｉ、制御出力演算部３からの制御操作量Ｓ３、および出力判定データベースＤＢ３の情報を得て、制御操作端への制御操作量出力可否データＳ４を与える。係る構成により制御出力判定部５においては、制御出力演算部３にて演算した制御操作量Ｓ３を制御対象プラント１である圧延機に出力した場合の形状の変化を、既知の制御対象プラント１のモデル（図９の実施例の場合は、圧延現象モデル５０１）に入力することで予測し、形状が悪化すると予想される場合は制御操作量出力ＳＯを抑制し、形状が大きく悪化する事を防止する。 Figure 9 shows an overview of the control output judgment unit 5. The control output judgment unit 5 is composed of a rolling phenomenon model 501 and a shape correction pass/fail judgment unit 502, and obtains actual data Si from the controlled plant 1, the control operation amount S3 from the control output calculation unit 3, and information from the output judgment database DB3, and provides control operation amount output availability data S4 to the control operation end. With this configuration, the control output judgment unit 5 predicts the change in shape when the control operation amount S3 calculated by the control output calculation unit 3 is output to the rolling mill, which is the controlled plant 1, by inputting it into a known model of the controlled plant 1 (in the case of the embodiment of Figure 9, the rolling phenomenon model 501), and if it is predicted that the shape will deteriorate, the control operation amount output SO is suppressed to prevent the shape from deteriorating significantly.

より詳細に述べると、制御操作量Ｓ３を圧延現象モデル５０１に入力し制御操作量Ｓ３による形状変化を予測し、形状偏差修正量予測データ５０３を演算する。他方、制御対象プラント１からの形状検出器データＳｉ（現時点での形状偏差実績データ５０４）に、形状偏差修正量予測データ５０３を加算する事で形状偏差予測データ５０５を得、形状偏差予測データ５０５を評価することで、制御操作量Ｓ３を制御対象プラント１に出力したときに、形状がどのように変化するかが予測できる。現状の形状偏差実績データ５０４と形状偏差予測データ５０５より、形状修正良否判定部５０２においては、形状が良くなる方向に変化するのか、悪くなる方向に変化するのか判定し、制御操作量出力可否データＳ４を得る。 In more detail, the control operation amount S3 is input to the rolling phenomenon model 501 to predict the shape change due to the control operation amount S3, and shape deviation correction amount prediction data 503 is calculated. On the other hand, the shape deviation correction amount prediction data 503 is added to the shape detector data Si (current shape deviation actual data 504) from the controlled plant 1 to obtain shape deviation prediction data 505, and by evaluating the shape deviation prediction data 505, it is possible to predict how the shape will change when the control operation amount S3 is output to the controlled plant 1. From the current shape deviation actual data 504 and the shape deviation prediction data 505, the shape correction pass/fail judgment unit 502 judges whether the shape will change in a better or worse direction, and obtains the control operation amount output pass/fail data S4.

形状修正良否判定部５０２では、具体的には以下のようにして形状修正の良否判定を行う。まず形状制御の優先度についての仕様Ａ１、Ａ２で示したように、板幅方向での制御優先度を考慮するため、出力判定データベースＤＢ３には、板幅方向の重み係数ｗ（ｉ）を仕様Ａ１、仕様Ａ２の各仕様に対して設定しておく。それを用いて、例えば下記の（４）式のような評価関数Ｊを用いて形状変化の良否を判定する。なお（４）式において、ｗ（ｉ）は重み係数、εｆｂ（ｉ）は形状偏差実績データ５０４、εｅｓｔ（ｉ）は形状偏差予測データ５０５、ｉは形状検出器ゾーン、ｒａｎｄは乱数項である。 Specifically, the shape correction pass/fail judgment unit 502 judges whether the shape correction is pass/fail as follows. First, as shown in specifications A1 and A2 regarding the shape control priority, in order to take into account the control priority in the strip width direction, a weighting coefficient w(i) in the strip width direction is set in the output judgment database DB3 for each of specifications A1 and A2. Using this, the pass/fail of the shape change is judged using an evaluation function J such as the following equation (4). In equation (4), w(i) is the weighting coefficient, εfb(i) is the shape deviation actual data 504, εest(i) is the shape deviation predicted data 505, i is the shape detector zone, and rand is a random number term.

（４）式の評価関数Ｊを用いた場合、形状が良くなるときは評価関数Ｊが正、悪くなるときは評価関数Ｊが負となる。また、ｒａｎｄは乱数項であり、評価関数Ｊの評価結果を乱数的に変化させる。これにより、形状が悪化する場合であっても、評価関数Ｊとしては正になる場合が発生するため、圧延現象モデル５０１が正しくない場合についても形状パターンと制御方法の関係を学習していく事が可能である。ここでｒａｎｄは、試運転当初の様に、制御対象プラント１のモデルが不確実の場合は最大値を大きくし、ある程度制御方法を学習し安定した制御を実施したい場合は０とするように、適時変更する。 When the evaluation function J in equation (4) is used, the evaluation function J is positive when the shape improves, and negative when the shape deteriorates. Furthermore, rand is a random term that randomly changes the evaluation result of the evaluation function J. As a result, even if the shape deteriorates, the evaluation function J may become positive, so it is possible to learn the relationship between the shape pattern and the control method even when the rolling phenomenon model 501 is incorrect. Here, rand is changed as appropriate, by increasing the maximum value when the model of the controlled plant 1 is uncertain, such as at the beginning of a trial run, and setting it to 0 when it is desired to learn the control method to a certain extent and implement stable control.

形状修正良否判定部５０２においては、評価関数Ｊを演算し、Ｊ≧０のとき制御操作量出力可否データＳ４＝１（可）とし、Ｊ＜０のとき制御操作量出力可否データＳ４＝０（否）のように制御操作量出力可否データＳ４を出力する。 The shape modification pass/fail judgment unit 502 calculates the evaluation function J, and outputs the control operation amount output feasibility data S4 such that when J≧0, the control operation amount output feasibility data S4=1 (passive), and when J<0, the control operation amount output feasibility data S4=0 (failed).

制御出力抑制部４においては、制御出力判定部５の判定結果である制御操作量出力可否データＳ４に応じて、制御対象プラント１への制御操作量出力ＳＯの出力有無を決定する。制御操作量出力可否データＳ４は、＃１～＃ｎＡＳ－Ｕ位置変更量出力、上第１中間シフト位置変更量出力、下第１中間シフト位置変更量出力であり、
ＩＦ（制御操作量出力可否データＳ４＝０）ＴＨＥＮ
＃１～＃ｎＡＳ－Ｕ位置変更量出力＝０
上第１中間シフト位置変更量出力＝０
下第１中間シフト位置変更量出力＝０
ＥＬＳＥ
＃１～＃ｎＡＳ－Ｕ位置変更量出力＝＃１～＃ｎＡＳ－Ｕ位置変更量
上第１中間シフト位置変更量出力＝上第１中間シフト位置変更量
下第１中間シフト位置変更量出力＝下第１中間シフト位置変更量
ＥＮＤＩＦ
により決定される。 The control output suppression unit 4 determines whether or not to output the control operation amount output SO to the controlled plant 1 according to the control operation amount output feasibility data S4, which is the determination result of the control output determination unit 5. The control operation amount output feasibility data S4 is the #1 to #nAS-U position change amount output, the upper first middle shift position change amount output, and the lower first middle shift position change amount output,
IF (control operation amount output possibility data S4 = 0) THEN
#1 to #nAS-U position change amount output = 0
Upper first intermediate shift position change amount output=0
Lower first intermediate shift position change amount output=0
ELSE
#1 to #nAS-U position change amount output = #1 to #nAS-U position change amount Upper first intermediate shift position change amount output = upper first intermediate shift position change amount Lower first intermediate shift position change amount output = lower first intermediate shift position change amount ENDIF
is determined by.

制御実行装置２０においては、制御対象プラント１（圧延機）からの実績データＳｉより、上記の演算を実行し、制御操作量出力ＳＯを制御対象プラント１（圧延機）に出力する事により形状制御を実施する。 The control execution device 20 executes the above calculations using the performance data Si from the controlled plant 1 (rolling mill) and performs shape control by outputting the control operation amount output SO to the controlled plant 1 (rolling mill).

次に、制御方法学習装置２１の動作概要について説明する。制御方法学習装置２１においては、制御実行装置２０で用いたデータの時間遅れデータを使用する。時間遅れＺ^－１は、ｅ^－ＴＳを意味し、予め設定した時間Ｔだけ遅延させる事を示す。制御対象プラント１は、時間応答を持つため、制御操作量出力ＳＯにより、実績データが変化するまで時間遅れが存在する。そのため、学習は、制御操作実行後、遅延時間Ｔだけ経過した時点での実績データを用いて実施する。形状制御においては、ＡＳ－Ｕや第１中間ロールに対する操作指令出力後、形状計が形状変化を検出するまで数秒要するため、Ｔ＝２から３秒程度に設定するのがよい（形状検出器の種類や圧延速度によっても、遅れ時間は変化するため、制御操作端の変更が形状変化となるまでの最適な時間をＴとして設定すればよい。）。 Next, an outline of the operation of the control method learning device 21 will be described. In the control method learning device 21, time-delayed data of the data used in the control execution device 20 is used. The time delay Z ^-1 means e ^-TS , and indicates a delay of a preset time T. Since the controlled plant 1 has a time response, there is a time delay until the actual data changes due to the control operation output SO. Therefore, learning is performed using the actual data at the time when the delay time T has elapsed after the control operation is executed. In shape control, since it takes several seconds for the shape gauge to detect a shape change after the operation command is output to the AS-U or the first intermediate roll, it is preferable to set T to about 2 to 3 seconds (the delay time also changes depending on the type of shape detector and the rolling speed, so it is sufficient to set T as the optimal time until the change in the control operation terminal results in a shape change).

図１１に、制御結果良否判定部６の動作概要を示す。形状変化良否判定部６０２においては、下式のような良否判定評価関数Ｊｃを用いる。 Figure 11 shows an overview of the operation of the control result pass/fail judgment unit 6. The shape change pass/fail judgment unit 602 uses the pass/fail judgment evaluation function Jc as shown in the following formula.

なお（５）式において、εｆｂ（ｉ）は実績データＳｉに含まれる形状偏差実績データ、εｌａｓｔ（ｉ）は形状偏差実績データ前回値であり、ｗＣ（ｉ）は良否判定用の板幅方向重み係数である。ここで、良否判定用の重み係数ｗＣ（ｉ）は、良否判定データベースＤＢ４より、制御の優先度についての仕様Ａ１、Ａ２に応じて設定する。良否判定評価関数Ｊｃにより、制御結果の良否を判定する。また、制御出力判定部５の判定結果である制御操作量出力可否データＳ４が０（制御出力不可）の場合についても、実際に制御対象プラント１へ制御操作量出力＝０であるが、形状が悪くなったと判断する。 In equation (5), εfb(i) is the shape deviation actual data included in the actual data Si, εlast(i) is the previous value of the shape deviation actual data, and wC(i) is the plate width direction weighting coefficient for pass/fail judgment. Here, the weighting coefficient wC(i) for pass/fail judgment is set according to the specifications A1 and A2 for the control priority from the pass/fail judgment database DB4. The pass/fail judgment evaluation function Jc judges whether the control result is good or bad. Also, when the control operation amount output feasibility data S4, which is the judgment result of the control output judgment unit 5, is 0 (control output not possible), it is judged that the shape has deteriorated even though the control operation amount output to the controlled plant 1 is actually 0.

ここでは、制御操作量出力可否データＳ４＝０の場合、制御結果良否データＳ６＝－１とする。また閾値上限ＬＣＵと閾値加減ＬＣＬを、閾値条件（ＬＣＵ≧０≧ＬＣＬ）のもとで予め設定しておく。このときに、良否判定評価関数Ｊｃとの比較の結果が、Ｊｃ＞ＬＣＵであれば、制御結果良否データＳ６＝－１（形状が悪くなった）とし、ＬＣＵ≧Ｊｃ≧０であれば、制御結果良否データＳ６＝０（形状が悪くなる方向に変化）とし、０＞Ｊｃ≧ＬＣＬであれば、制御結果良否データＳ６＝１（形状が良くなる方向に変化）とし、Ｊｃ＜ＬＣＬであれば、制御結果良否データＳ６＝０（形状が良くなった）とする。 Here, when the control operation amount output feasibility data S4 = 0, the control result pass/fail data S6 = -1. The threshold upper limit LCU and threshold adjustment LCL are set in advance under the threshold condition (LCU ≥ 0 ≥ LCL). At this time, if the result of the comparison with the pass/fail judgment evaluation function Jc is Jc > LCU, the control result pass/fail data S6 = -1 (the shape has worsened), if LCU ≥ Jc ≥ 0, the control result pass/fail data S6 = 0 (the shape has changed in a worsening direction), if 0 > Jc ≥ LCL, the control result pass/fail data S6 = 1 (the shape has changed in a better direction), and if Jc < LCL, the control result pass/fail data S6 = 0 (the shape has improved).

ここで、制御結果良否データＳ６＝－１は、形状が悪くなったので、出力した制御出力を抑制する場合、制御結果良否データＳ６＝０は、形状変化無し、または形状が良くなったので出力した制御出力を保持する場合、制御結果良否データＳ６＝１は、形状が良くなる方向に変化したが、更に良くなる可能性が有るので、出力した制御量を増大させる場合である。 Here, control result pass/fail data S6 = -1 means that the shape has worsened and the output control output is suppressed, control result pass/fail data S6 = 0 means that there has been no change in the shape or that the shape has improved and the output control output is maintained, and control result pass/fail data S6 = 1 means that the shape has improved but there is a possibility that it could improve further and so the output control amount is increased.

このように、制御の優先度についての仕様Ａ１、Ａ２に応じて、板幅方向の重み係数ｗＣ（ｉ）が変わるため、良否判定評価関数Ｊｃは異なる。そのため、制御結果良否データＳ６の判定結果も異なる事が考えられる。そのため、制御方法学習装置２１においては、制御の優先度についての仕様Ａ１、Ａ２の２種類について、制御結果良否データＳ６の判定を実施する。 In this way, the weighting coefficient wC(i) in the plate width direction changes depending on the specifications A1 and A2 for the control priority, and the pass/fail judgment evaluation function Jc differs. Therefore, it is conceivable that the judgment results of the control result pass/fail data S6 will also differ. Therefore, in the control method learning device 21, judgment is made on the control result pass/fail data S6 for the two types of specifications A1 and A2 for the control priority.

次に、学習データ作成部７の概要について説明する。図１に示したように、学習データ作成部７においては、制御結果良否判定部６からの判定結果（制御結果良否データＳ６）を基にして、制御操作端操作指令Ｓ２、制御操作量Ｓ３、制御出力抑制部の判定結果（制御操作量出力可否データＳ４）より、制御ルール学習部１１で使用するニューラルネット１１１に対する教師データＳ７ａを作成する。 Next, an overview of the learning data creation unit 7 will be described. As shown in FIG. 1, the learning data creation unit 7 creates teacher data S7a for the neural network 111 used in the control rule learning unit 11 based on the judgment result from the control result judgment unit 6 (control result pass/fail data S6), the control operation terminal operation command S2, the control operation amount S3, and the judgment result from the control output suppression unit (control operation amount output feasibility data S4).

この場合の教師データＳ７ａは、図４に示す、ニューラルネット１１１の出力層からの出力である、ＡＳ－Ｕ操作度合３０１、第１中間操作度合３０２となる。学習データ作成部７は、ニューラルネット１０１の出力である制御操作端操作指令Ｓ２（ＡＳ－Ｕ操作度合３０１、第１中間操作度合３０２）と、制御操作量出力ＳＯである＃１～＃ｎＡＳ－Ｕ位置変更量出力、上第１中間シフト位置変更量出力、下第１中間シフト位置変更量出力を用いて、制御ルール学習部１１で使用するニューラルネット１１１に対する教師データＳ７ａを作成する。 The teacher data S7a in this case is the AS-U operation degree 301 and the first intermediate operation degree 302, which are the outputs from the output layer of the neural network 111, as shown in Figure 4. The learning data creation unit 7 creates teacher data S7a for the neural network 111 to be used in the control rule learning unit 11, using the control operation end operation command S2 (AS-U operation degree 301, first intermediate operation degree 302), which is the output of the neural network 101, and the control operation amount output SO, which is the #1 to #n AS-U position change amount output, the upper first intermediate shift position change amount output, and the lower first intermediate shift position change amount output.

学習データ作成部７の動作概要を説明するにあたり、図８の制御出力演算部３における各部データや記号の関係を図１２に整理している。ここでは、ニューラルネット１０１の出力である制御操作端操作指令Ｓ２についてＡＳ－Ｕ操作度合３０１を代表的に示しており、操作度合正側のデータをＯＰｒｅｆ、操作度合負側のデータをＯＭｒｅｆ、制御操作外乱発生部１６からの乱数的に発生する操作度合を操作度合乱数Ｏｒｅｆ、変換ゲインをＧ、制御操作量出力ＳＯをＣｒｅｆとして説明する。このように、ここでは、簡単のため、制御ルール実行部１０のニューラルネット１０１の出力層からの出力として、操作度合正側および操作度合負側、制御操作外乱発生部１６からの乱数的に発生する操作度合を操作度合乱数としている。また、制御操作端に対する制御操作量出力ＳＯを操作指令値としている。 In explaining the operation overview of the learning data creation unit 7, the relationship between the data of each unit and the symbols in the control output calculation unit 3 in FIG. 8 is summarized in FIG. 12. Here, the AS-U operation degree 301 is shown as a representative for the control operation terminal operation command S2, which is the output of the neural network 101, and the operation degree positive side data is OPref, the operation degree negative side data is OMref, the operation degree randomly generated from the control operation disturbance generation unit 16 is the operation degree random number Oref, the conversion gain is G, and the control operation amount output SO is Cref. Thus, for simplicity, the operation degree positive side and operation degree negative side, and the operation degree randomly generated from the control operation disturbance generation unit 16 are the operation degree random number as outputs from the output layer of the neural network 101 of the control rule execution unit 10. Also, the control operation amount output SO for the control operation terminal is the operation command value.

図１３は、学習データ作成部７における処理段階と処理内容を示している。ここで、図１２の記号の約束に則り説明すると、最初の処理段階７１では、操作指令値Ｃｒｅｆを（６）式により求めている。 Figure 13 shows the processing steps and processing contents in the learning data creation unit 7. Here, to explain according to the symbol conventions in Figure 12, in the first processing step 71, the operation command value Cref is calculated by equation (6).

次の処理段階７２では、制御結果良否データＳ６に応じて操作指令値Ｃｒｅｆを修正しＣ´ｒｅｆとする。具体的には制御結果良否データＳ６＝－１のとき（７）式、制御結果良否データＳ６＝０のとき（８）式、制御結果良否データＳ６＝１のとき（９）式により、操作指令値Ｃｒｅｆの修正値Ｃ´ｒｅｆとする。 In the next processing step 72, the operation command value Cref is corrected to C'ref according to the control result pass/fail data S6. Specifically, when the control result pass/fail data S6 = -1, the corrected value C'ref of the operation command value Cref is calculated according to formula (7), when the control result pass/fail data S6 = 0, the formula (8), and when the control result pass/fail data S6 = 1, the formula (9).

処理段階７３では、修正された操作指令値Ｃ´ｒｅｆより、（１０）、（１１）式により操作度合修正量ΔＯｒｅｆを求める。 In processing step 73, the operation degree correction amount ΔOref is calculated from the corrected operation command value C'ref using equations (10) and (11).

処理段階７４では、ニューラルネット１１１への教師データＯＰ´ｒｅｆ、ＯＭ´ｒｅｆを（１２）式により求める。 In processing step 74, the training data OP'ref and OM'ref for the neural network 111 are calculated using equation (12).

このように学習データ作成部７では、図１２に示すように、実際に制御対象プラント１に対して出力した操作指令値Ｃｒｅｆを、制御結果良否判定部６における判定結果である制御結果良否データＳ６に応じて、操作指令値修正値Ｃ´ｒｅｆを演算する。具体的には、制御結果良否データＳ６＝１の場合は、制御方向はＯＫであるが、制御出力が不足していると判断された場合で、操作指令値を同じ方向にΔＣｒｅｆだけ増加するようにする。逆に制御結果良否データＳ６＝－１の場合は、制御方向が間違っていると判断された場合で、操作指令値を逆方向にΔＣｒｅｆだけ減少するようにする。変換ゲインＧは、予め設定したものであるから既知である事から、操作度合正側および操作度合負側の値が判れば、修正量ΔＯｒｅｆを求める事が可能である。ここでΔＣｒｅｆは、予め適当な値をシミュレーション等で求めておき、設定する。以上の手順により、制御ルール学習部１１にて使用する教師データＯＰ´ｒｅｆ、ＯＭ´ｒｅｆは上記の（１２）式により求める事ができる。 In this way, as shown in FIG. 12, the learning data creation unit 7 calculates the operation command value correction value C'ref for the operation command value Cref actually output to the controlled plant 1 according to the control result pass/fail data S6, which is the judgment result in the control result pass/fail judgment unit 6. Specifically, when the control result pass/fail data S6 = 1, the control direction is OK, but the control output is judged to be insufficient, and the operation command value is increased in the same direction by ΔCref. Conversely, when the control result pass/fail data S6 = -1, the control direction is judged to be incorrect, and the operation command value is decreased in the opposite direction by ΔCref. Since the conversion gain G is previously set and therefore known, if the values of the operation degree positive side and the operation degree negative side are known, it is possible to find the correction amount ΔOref. Here, ΔCref is set by finding an appropriate value in advance by simulation or the like. By following the above procedure, the teacher data OP'ref and OM'ref used by the control rule learning unit 11 can be obtained using the above formula (12).

なお図１３では簡便な事例で説明を行っているが、実際には、＃１～＃ｎＡＳ－Ｕに対するＡＳ－Ｕ操作度合３０１および、上第１中間ロールシフト、下第１中間ロールシフトに対する第１中間操作度合３０２についてその全てを実施し、制御ルール学習部１１で用いるニューラルネット１１１の教師データ（ＡＳ－Ｕ操作度合教師データ、１中間操作度合教師データ）とする。 Note that while FIG. 13 uses a simple example for explanation, in reality, all of the AS-U operation degrees 301 for #1 to #n AS-U and the first intermediate operation degrees 302 for the upper first intermediate roll shift and the lower first intermediate roll shift are performed, and these are used as teacher data (AS-U operation degree teacher data, 1 intermediate operation degree teacher data) for the neural network 111 used in the control rule learning unit 11.

図１４は学習データデータベースＤＢ２に保存されたデータ例を示している。ニューラルネット１１１を学習するためには、多数の入力データＳ８ａと教師データＳ７ａの組合せが必要である。従って、学習データ作成部７で作成した教師データＳ７ａ（ＡＳ－Ｕ操作度合教師データ、第１中間操作度合）は、制御実行装置２０にて制御ルール実行部１０に入力された入力データＳ１（規格化形状偏差２０１および形状偏差段階）の時間遅れデータＳ８ａと組み合わせて一組の学習データとして、学習データデータベースＤＢ２に保存される。 Figure 14 shows an example of data stored in the learning data database DB2. In order to train the neural network 111, a combination of a large number of input data S8a and teacher data S7a is required. Therefore, the teacher data S7a (AS-U operation degree teacher data, first intermediate operation degree) created by the learning data creation unit 7 is combined with the time-delay data S8a of the input data S1 (standardized shape deviation 201 and shape deviation stage) input to the control rule execution unit 10 by the control execution device 20, and is stored as a set of learning data in the learning data database DB2.

なお図１のプラント制御システムにおいては、各種のデータベースＤＢ１、ＤＢ２、ＤＢ３、ＤＢ４を使用しているが、図１４に各データベースＤＢ１、ＤＢ２、ＤＢ３、ＤＢ４を連系的に管理運用するためのデータベース管理テーブルＴＢの構成を示す。データベース管理テーブルＴＢは、仕様の管理テーブルを備えている。具体的には、データベース管理テーブルＴＢは、仕様について（Ｂ１）板幅、（Ｂ２）鋼種、および制御の優先度についての仕様Ａ１、Ａ２に応じて区分けされる。（Ｂ１）板幅としては、例えば、３フィート幅、メータ幅、４フィート幅、５フィート幅の４区分が、鋼種としては、鋼種（１）～鋼種（１０）の１０区分程度を用いる。また、制御の優先度についての仕様Ａについては、Ａ１およびＡ２の２種類とする。この場合、８０区分となり、８０個のニューラルネットを、圧延条件に応じて使い分けて使用する事となる。 In the plant control system of FIG. 1, various databases DB1, DB2, DB3, and DB4 are used. FIG. 14 shows the configuration of the database management table TB for managing and operating each database DB1, DB2, DB3, and DB4 in an interconnected manner. The database management table TB has a specification management table. Specifically, the database management table TB classifies the specifications according to (B1) plate width, (B2) steel type, and specifications A1 and A2 for control priority. For (B1) plate width, for example, four categories are used: 3 feet width, meter width, 4 feet width, and 5 feet width, and for steel type, about 10 categories are used: steel type (1) to steel type (10). For specification A for control priority, two types are used: A1 and A2. In this case, there are 80 categories, and 80 neural networks are used according to the rolling conditions.

ニューラルネット学習制御部１１２は、図１４に示すような、入力データおよび教師データの組合せである学習データを、図１５のデータベース管理テーブルＴＢに従って、該当するニューラルネットＮｏ．と紐付けて、図１６に示すような学習データデータベースＤＢ２に格納する。 The neural network learning control unit 112 associates the learning data, which is a combination of input data and teacher data as shown in FIG. 14, with the corresponding neural network number according to the database management table TB in FIG. 15, and stores it in the learning data database DB2 as shown in FIG. 16.

制御実行装置２０が、制御対象プラント１に対して、形状制御を実行するたびに、学習データが２組作成される。これは、同じ入力データ、制御出力に対して、制御結果良否判定が制御の優先度についての仕様Ａ１および仕様Ａ２の２つの評価基準を用いて行われるため、教師データが２種類作成されるためである。教師データがある程度（例えば２００組）蓄積されたら、または新たに学習データデータベースＤＢ２に蓄積されたら、ニューラルネット学習制御部１１２は、ニューラルネット１１１の学習を指示する。 Each time the control execution device 20 executes shape control on the controlled plant 1, two sets of learning data are created. This is because two types of teacher data are created because the control result pass/fail judgment is performed using two evaluation criteria for control priority, specification A1 and specification A2, for the same input data and control output. Once a certain amount of teacher data (e.g., 200 sets) has been accumulated, or new data has been accumulated in the learning data database DB2, the neural network learning control unit 112 instructs the neural network 111 to learn.

制御ルールデータベースＤＢ１には、図１５に示すようなデータベース管理テーブルＴＢに従って、複数のニューラルネットが格納されており、ニューラルネット学習制御部１１２においては、学習が必要なニューラルネットＮｏ．を指定して、ニューラルネット選択部１１３が制御ルール評価値データベースＤＢ１より当該ニューラルネットを取り出し、ニューラルネット１１１に設定する。ニューラルネット学習制御部１１２は、学習データデータベースＤＢ２より、当該ニューラルネットに対応する、入力データおよび教師データの取り出しを、入力データ作成部１１４および教師データ作成部１１５に指示し、それらを用いてニューラルネット１１１の学習を実施する。なおニューラルネットの学習方法は手法が種々提案されており、いずれの手法を用いても良い。 The control rule database DB1 stores multiple neural nets according to a database management table TB as shown in FIG. 15, and in the neural net learning control unit 112, the neural net number that needs to be learned is specified, and the neural net selection unit 113 retrieves the neural net from the control rule evaluation value database DB1 and sets it as the neural net 111. The neural net learning control unit 112 instructs the input data creation unit 114 and the teacher data creation unit 115 to retrieve input data and teacher data corresponding to the neural net from the learning data database DB2, and uses them to learn the neural net 111. Note that various methods have been proposed for learning neural nets, and any of these methods may be used.

ニューラルネット１１１の学習が完了すると、ニューラルネット学習制御部１１２は、学習結果であるニューラルネット１１１を、制御ルール評価値データベースＤＢ１の当該ニューラルネットＮｏ．の位置に書き戻すことで、学習が完了する。 When the learning of the neural network 111 is completed, the neural network learning control unit 112 writes the neural network 111, which is the learning result, back to the position of the corresponding neural network number in the control rule evaluation value database DB1, thereby completing the learning.

学習は、図１５にて定義された全てのニューラルネットに対して定時間間隔（例えば１日毎）で一斉に実施しても良いし、新しい学習データがある程度（例えば１００組）蓄積されたニューラルネットＮｏ．のニューラルネットのみ、その時点で学習させても良い。 The learning may be performed simultaneously for all neural networks defined in FIG. 15 at regular intervals (e.g., once a day), or only the neural network with the neural network number for which a certain amount of new learning data has been accumulated (e.g., 100 sets) may be trained at that point.

図１７は制御ルール適合度評価部２５の構成を示す。制御ルール適合度評価部２５は、実績データＳｉと制御結果良否判定部６からの制御結果良否データＳ６を処理実行判断部２５４へ入力する。制御ルール適合度評価部２５は、形状が良くなる有効な操作が行われたことを確認できた場合に、実績データＳｉで用いられた制御ルールＳ１０を制御ルール評価値データベースＤＢ５から、この制御ルールＳ１０のニューラルネットＮｏ．Ｓ９をデータベース管理テーブルＴＢからそれぞれ取得する。処理実行判断部２５４は、実績データＳｉと制御結果良否データＳ６から、制御出力取得部２５１の処理実行の要否を判断し、制御出力取得部２５１の処理を実行する場合に処理実行指示Ｓ２５０５を出力する。制御出力取得部２５１は、制御ルールＳ１０、実績データＳｉ、および処理実行指示Ｓ２５０５が入力され、制御ルールＳ１０へ実績データＳｉに含まれる実形状を入力し、形状制御装置に対する出力Ｓ２５０１を得る。制御出力誤差演算部２５２は、出力Ｓ２５０１と実績データＳｉに含まれる形状制御の出力の差を（１３）式にて演算する。 Figure 17 shows the configuration of the control rule conformity evaluation unit 25. The control rule conformity evaluation unit 25 inputs the performance data Si and the control result conformity data S6 from the control result conformity judgment unit 6 to the process execution judgment unit 254. When the control rule conformity evaluation unit 25 confirms that an effective operation that improves the shape has been performed, it acquires the control rule S10 used in the performance data Si from the control rule evaluation value database DB5 and the neural net No. S9 of this control rule S10 from the database management table TB. The process execution judgment unit 254 judges whether or not the process of the control output acquisition unit 251 needs to be executed from the performance data Si and the control result conformity data S6, and outputs a process execution instruction S2505 when the process of the control output acquisition unit 251 is to be executed. The control output acquisition unit 251 receives the control rule S10, performance data Si, and process execution instruction S2505, inputs the actual shape included in the performance data Si to the control rule S10, and obtains an output S2501 for the shape control device. The control output error calculation unit 252 calculates the difference between the output S2501 and the shape control output contained in the performance data Si using formula (13).

ここで制御ルールＳ１０の出力Ｓ２５０１をｒ、実績データＳｉに含まれる形状制御の出力をｇ、形状制御の機器の総数をＮ、制御ルールＳ１０と実績データＳｉの出力誤差をＳＵＶとする。 Here, the output S2501 of the control rule S10 is r, the shape control output included in the performance data Si is g, the total number of shape control devices is N, and the output error between the control rule S10 and the performance data Si is SUV.

制御出力誤差演算部２５２は、出力誤差ＳＵＶがある閾値ＴＳ以下となった場合、制御ルールＳ１０は制御対象プラントに対して適合していると判断し、適合度ＤＳＵを１とし、閾値ＴＳを上回った場合、制御ルールは制御対象プラントに対して不適合であると判断し、適合度ＤＳＵを０とする。制御出力誤差演算部２５２は、制御ルール評価値データベースＤＢ５へこの制御ルールＳ１０に対応するニューラルネットＮｏ．と適合度ＤＳＵの値を登録する。適合度ＤＳＵの演算は（１４）式にて行う。 When the output error SUV is equal to or less than a certain threshold value TS, the control output error calculation unit 252 determines that the control rule S10 is suitable for the controlled plant and sets the suitability DSU to 1. When the output error SUV exceeds the threshold value TS, the control rule is determined to be unsuitable for the controlled plant and sets the suitability DSU to 0. The control output error calculation unit 252 registers the neural network number and suitability DSU value corresponding to this control rule S10 in the control rule evaluation value database DB5. The suitability DSU is calculated using equation (14).

閾値ＴＳが大きいほど適合度が１となる割合が増え、閾値ＴＳが小さいほど適合度が０となる割合が増えるので、後述する制御ルール更新判断結果が閾値ＴＳによって大きく変化する。制御ルールを短いスパンで更新したい場合は閾値ＴＳを小さい値に設定するなど、プラントの操業に合わせて柔軟に設定する必要がある。 The larger the threshold value TS, the higher the proportion of conformance values that are 1, and the smaller the threshold value TS, the higher the proportion of conformance values that are 0. Therefore, the control rule update decision results described below vary greatly depending on the threshold value TS. If you want to update the control rules in short intervals, you need to set the threshold value TS flexibly to suit the operation of the plant, for example by setting it to a small value.

その後制御出力誤差演算部２５２は、制御ルール更新評価指示部２５３へ処理実行指示Ｓ２５０４を出力する。制御ルール更新評価指示部２５３は、制御出力誤差演算部２５２からの処理実行指示を受けて、ニューラルネットＮｏ．を含む処理実行指示Ｓ１２を制御ルール更新評価部２６へ出力する。 Then, the control output error calculation unit 252 outputs a processing execution instruction S2504 to the control rule update evaluation instruction unit 253. Upon receiving the processing execution instruction from the control output error calculation unit 252, the control rule update evaluation instruction unit 253 outputs a processing execution instruction S12 including the neural network number to the control rule update evaluation unit 26.

図１８は制御ルール更新評価部２６の構成を示す。制御ルール更新評価部２６は、制御ルール適合度評価部２５からニューラルネットＮｏ．を含む処理実行指示Ｓ１２を受け取ったタイミングで処理を実行する。制御ルール更新要否判定部２６１は、処理実行指示Ｓ１２に含まれるニューラルネットＮｏ．について、一定期間（担当オペレータ毎の操作のばらつきを抑制するために、オペレータのシフトが一巡する期間、例えば一週間）の適合度ＤＳＵを制御ルール評価値データベースＤＢ５から取得し（Ｓ１３ａ）、その平均値ＵＥＶを演算し、制御ルール評価値データベースＤＢ５へ登録する（Ｓ１４ａ）。演算式を（１５）式にて示す。 Figure 18 shows the configuration of the control rule update evaluation unit 26. The control rule update evaluation unit 26 executes processing when it receives a processing execution instruction S12 including a neural net number from the control rule compatibility evaluation unit 25. The control rule update necessity determination unit 261 obtains the compatibility DSU for the neural net number included in the processing execution instruction S12 from the control rule evaluation value database DB5 for a certain period (a period during which an operator's shift goes through one cycle, for example, one week, in order to suppress variation in operation between operators in charge) (S13a), calculates the average value UEV, and registers it in the control rule evaluation value database DB5 (S14a). The calculation formula is shown in formula (15).

ここで一定期間分のデータ数をＷとする。 Let W be the amount of data for a certain period of time.

平均値ＵＥＶが閾値ＴＵ以下となった場合、制御ルール更新要否フラグＲＵＦを１とし、それ以外の場合は制御ルール更新要否フラグＲＵＦを０として制御ルール評価値データベースＤＢ５に登録する。ただし適合度ＤＳＵが一定期間分蓄積されていない場合、制御ルール更新要否フラグＲＵＦ算出演算を実行しない。演算式を（１６）式にて示す。 When the average value UEV becomes equal to or less than the threshold value TU, the control rule update necessity flag RUF is set to 1, otherwise the control rule update necessity flag RUF is set to 0 and registered in the control rule evaluation value database DB5. However, if the conformance degree DSU has not been accumulated for a certain period of time, the calculation of the control rule update necessity flag RUF is not executed. The calculation formula is shown in formula (16).

制御ルール更新要否フラグの演算結果は閾値ＴＵの設定によって変化する。このため、制御ルールの更新頻度を下げたい場合は閾値ＴＵを小さい値に設定するなど、プラントの操業に合わせて柔軟に設定する必要がある。 The calculation result of the control rule update necessity flag changes depending on the setting of the threshold value TU. For this reason, if you want to reduce the frequency of updating the control rules, you need to set the threshold value TU to a small value, and so on, so that it can be set flexibly according to the operation of the plant.

その後制御ルール更新要否判定部２６１は、制御ルール更新優先度更新部２６２へ処理実行指示Ｓ２６０１を出力する。制御ルール更新優先度更新部２６２は、処理実行指示Ｓ２６０１を受けて、制御ルール評価値データベースＤＢ５に登録されているニューラルネットＮｏ．毎の制御ルール更新優先度を読み出し（Ｓ１３ｂ）、更新する（Ｓ１４ｂ）。制御ルール更新優先度は、制御ルール評価値データベースＤＢ５に登録されている平均値ＵＥＶが小さいニューラルネットＮｏ．から順に１，２，３，・・・と自然数を割付けていく。 Then, the control rule update necessity determination unit 261 outputs a processing execution instruction S2601 to the control rule update priority update unit 262. In response to the processing execution instruction S2601, the control rule update priority update unit 262 reads (S13b) and updates (S14b) the control rule update priority for each neural net number registered in the control rule evaluation value database DB5. The control rule update priority is assigned natural numbers 1, 2, 3, ... in order from the neural net number with the smallest average UEV registered in the control rule evaluation value database DB5.

図１９は制御ルール評価値データベースＤＢ５の詳細を示す。制御ルール評価値データベースＤＢ５は、例えばテーブル形式で、ニューラルネットＮｏ．に対して、制御ルール更新要否フラグＲＵＦ、制御ルール更新優先度、平均値ＵＥＶ、各適合度ＤＳＵおよびそれらの演算実行日時を対応付けて登録する。 Figure 19 shows details of the control rule evaluation value database DB5. The control rule evaluation value database DB5 registers, for example in table format, a control rule update necessity flag RUF, a control rule update priority, an average value UEV, each degree of conformance DSU, and the date and time of calculation thereof in association with a neural network number.

図２０は制御ルール更新処理管理部２４の概要を示す。制御ルール更新処理管理部２４は、制御ルール評価値データベースＤＢ５から制御ルール更新優先度が最も高い制御ルールに該当するニューラルネットＮｏ．Ｓ１５を取得し、制御ルール学習部１１へ処理実行指示Ｓ１６を与える。ただし処理実行指示Ｓ１６は、制御ルール学習部１１の実行処理を担う制御処理計算機２３の計算機負荷情報Ｓ１７（例えばＣＰＵ負荷やメモリ使用率等）から、制御処理計算機２３の計算機負荷が低い場合、および／または、実績データＳｉを参照して制御対象プラントが更新対象の制御モデルを圧延に使用している状況でないことを確認できた場合に限定する。これは制御処理計算機２３に過剰な負荷が掛かることによって制御実行装置２０の処理に遅延が発生し、制御対象プラントへの制御出力タイミングが遅延することを防ぎ、また、操業中に制御ルールが変更されることを防ぐためである。制御ルール更新可否判断部２４１から制御ルール学習部１１へ処理実行指示Ｓ１６が送信された後、制御ルール更新可否判断部２４１から制御ルール更新完了処理部２４２へニューラルネットＮｏ．Ｓ１５を含む処理実行指示Ｓ２４０１が出力される。制御ルール更新完了処理部２４２は、処理実行指示Ｓ２４０１を受信すると、制御ルール評価値データベースＤＢ５に登録されているニューラルネットＮｏ．Ｓ１５に紐づく情報を全て削除する。 Figure 20 shows an overview of the control rule update processing management unit 24. The control rule update processing management unit 24 acquires the neural net No. S15 corresponding to the control rule with the highest control rule update priority from the control rule evaluation value database DB5, and issues a processing execution instruction S16 to the control rule learning unit 11. However, the processing execution instruction S16 is limited to the case where the computer load of the control processing computer 23 is low from the computer load information S17 (e.g., CPU load, memory usage rate, etc.) of the control processing computer 23 that is responsible for the execution processing of the control rule learning unit 11, and/or the control model to be updated is not used for rolling in the controlled plant by referring to the actual data Si. This is to prevent a delay in the processing of the control execution device 20 due to an excessive load on the control processing computer 23, which causes a delay in the timing of the control output to the controlled plant, and also to prevent the control rule from being changed during operation. After the control rule update feasibility determination unit 241 transmits a processing execution instruction S16 to the control rule learning unit 11, the control rule update feasibility determination unit 241 outputs a processing execution instruction S2401 including the neural network number S15 to the control rule update completion processing unit 242. When the control rule update completion processing unit 242 receives the processing execution instruction S2401, it deletes all information associated with the neural network number S15 registered in the control rule evaluation value database DB5.

以上により、制御対象プラント１である圧延機の形状を大きく乱すことなく、
１）基準形状パターンと、それに対する制御操作を予め別々に設定し、制御操作方法を学習していくのではなく、形状パターンと制御操作の組合せを学習し、それを用いて制御操作を実施する。
２）新たな制御ルールは、予め予想できるものでは無く、全く予測できなかった制御ルールが最適となる場合も有る事から、ランダムに制御操作端を動作させ、それに対する制御結果を見ながら見つけていく。
事が実現できる。 As a result, the shape of the rolling mill, which is the plant 1 to be controlled, is not significantly disturbed.
1) A reference shape pattern and a control operation for it are separately set in advance, and a control operation method is not learned. Instead, a combination of a shape pattern and a control operation is learned and used to carry out the control operation.
2) Since new control rules cannot be predicted in advance, and a completely unpredictable control rule may turn out to be the optimal rule, the optimal control rule is found by randomly operating the control operation terminals and observing the control results.
Things can be made to happen.

なお、制御ルール評価値データベースＤＢ１には、制御実行装置２０で使用するニューラルネットが格納されるが、格納されるニューラルネットが、乱数でイニシャル処理を実施しただけのものだと、ニューラルネットの学習が進行し、それなりの制御が可能となるまで時間がかかる。そのため、制御対象プラント１に対して、制御部を構築した時に、その時点で判明している制御対象プラント１の制御モデルに基づき、予めシミュレーションにて、制御ルールの学習を実施し、シミュレータでの学習が完了したニューラルネットをデータベースに格納しておく事で、制御対象プラントの立上げ当初から、ある程度の性能の制御を実施する事が可能である。 The control rule evaluation value database DB1 stores the neural net used by the control execution device 20, but if the stored neural net has only been initialized with random numbers, it will take time for the neural net to learn and become capable of reasonable control. Therefore, when a control unit is constructed for the controlled plant 1, the control rules are learned in advance by simulation based on the control model of the controlled plant 1 that is known at that time, and the neural net that has completed learning in the simulator is stored in the database, making it possible to control the controlled plant to a certain degree of performance from the beginning of its startup.

図２１は、制御実行装置２０、制御方法学習装置２１、制御ルール更新判断装置２２、制御処理計算機２３、およびこれらを適宜統合したシステムの各システムを実現するコンピュータ５００のハードウェアの概要を示す図である。コンピュータ５００では、ＣＰＵなどのプロセッサ５１０、ＲＡＭ（Random Access Memory）などのメモリ５２０、ＳＳＤ（Solid State Drive）やＨＤＤ（Hard Disk Drive）などのストレージ５３０、ネットワークＩ／Ｆ（Inter/Face）５４０、入出力装置５５０（例えばキーボード、マウス、タッチパネル、ディスプレイ等）、および周辺装置５６０が、バスを介して接続されている。 Figure 21 is a diagram showing an overview of the hardware of a computer 500 that realizes each system including a control execution device 20, a control method learning device 21, a control rule update determination device 22, a control processing computer 23, and a system that appropriately integrates these systems. In the computer 500, a processor 510 such as a CPU, a memory 520 such as a RAM (Random Access Memory), a storage 530 such as an SSD (Solid State Drive) or an HDD (Hard Disk Drive), a network I/F (Inter/Face) 540, an input/output device 550 (e.g., a keyboard, a mouse, a touch panel, a display, etc.), and a peripheral device 560 are connected via a bus.

コンピュータ５００において、各システムを実現するための各プログラムがストレージ５３０から読み出されプロセッサ５１０およびメモリ５２０の協働により実行されることで、各システムが実現される。あるいは、各システムを実現するための各プログラムは、ネットワークＩ／Ｆ５４０を介した通信により外部のコンピュータから取得されてもよい。あるいは各プログラムは、非一時的記録媒体に記録され、媒体読み取り装置によって読み出されることで取得されてもよい。 In computer 500, each system is realized by reading out each program for realizing each system from storage 530 and executing it through cooperation between processor 510 and memory 520. Alternatively, each program for realizing each system may be obtained from an external computer by communication via network I/F 540. Alternatively, each program may be obtained by recording on a non-transitory recording medium and reading it out by a medium reading device.

なお本発明装置を実プラントに適用するに当たり、ニューラルネットの初期値を定めておく必要があるが、この点に関して実績データと制御操作の組合せを、制御対象プラントでの制御を実施する前に、制御対象プラントの制御モデルを用いてシミュレーションにより作成し、制御対象プラントにおける実績データと制御操作の組合せの学習期間を短縮するのがよい。 When applying the device of the present invention to an actual plant, it is necessary to determine the initial values of the neural network. In this regard, it is advisable to create a combination of performance data and control operations through simulation using a control model of the plant to be controlled before implementing control at the plant, thereby shortening the learning period for the combination of performance data and control operations at the plant to be controlled.

本発明は上述の実施形態に限定されるものではなく、様々な変形例を含む。例えば、上述の実施形態は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、矛盾しない限りにおいて、ある実施形態の構成の一部を他の実施形態の構成で置き換え、ある実施形態の構成に他の実施形態の構成を加えることも可能である。また、各実施形態の構成の一部について、構成の追加、削除、置換、統合、または分散をすることが可能である。また実施形態で示した構成および処理は、処理効率あるいは実装効率に基づいて適宜分散、統合、または入れ替えることが可能である。 The present invention is not limited to the above-described embodiments, and includes various modifications. For example, the above-described embodiments have been described in detail to clearly explain the present invention, and are not necessarily limited to those having all of the configurations described. Furthermore, as long as there is no contradiction, it is possible to replace part of the configuration of one embodiment with the configuration of another embodiment, and to add the configuration of another embodiment to the configuration of one embodiment. Furthermore, it is possible to add, delete, replace, integrate, or distribute part of the configuration of each embodiment. Furthermore, the configurations and processes shown in the embodiments can be distributed, integrated, or replaced as appropriate based on processing efficiency or implementation efficiency.

１：制御対象プラント、２０：制御実行装置、２１：制御方法学習装置、２２：制御ルール更新判断装置、２３：制御処理計算機、５００：コンピュータ 1: Plant to be controlled, 20: Control execution device, 21: Control method learning device, 22: Control rule update decision device, 23: Control processing computer, 500: Computer

Claims

A plant control system for controlling a controlled plant ,
a control method learning device that learns a control rule based on a combination of performance data of the controlled plant and a control operation;
a control execution device that controls the controlled plant based on the control rule learned by the control method learning device;
a control rule update determination device that calculates a degree of conformity of the control rule with respect to the controlled plant based on the performance data when control of the controlled plant is performed based on the control rule, and updates the control rule based on the degree of conformity,
The control rule update determination device includes:
a control rule conformance evaluation unit that calculates the conformance at each predetermined timing;
a control rule update evaluation unit that determines whether or not the control rule needs to be updated and an update priority based on the degree of conformance at each of the predetermined timings in a recent predetermined period;
a control rule update processing management unit that outputs an update instruction for the control rule based on the necessity of the update and the update priority.

2. The plant control system according to claim 1 ,
The control rule update evaluation unit
determining whether the update is necessary based on a comparison result between the degree of conformity calculated at each of the predetermined timings during the most recent predetermined period and a predetermined reference value.

2. The plant control system according to claim 1 ,
The control rule conformity evaluation unit includes:
When an output error between a control output when control of the controlled plant is performed based on the control rule and a control output included in the performance data is equal to or smaller than a threshold value, the control rule is determined to be suitable for the controlled plant and the degree of suitability is set to 1, and when the output error is greater than the threshold value, the control rule is determined to be incompatible with the controlled plant and the degree of suitability is set to 0;
The control rule update evaluation unit
determining whether the update is necessary and the update priority based on an average of the goodness of fit calculated at each of the predetermined timings during the most recent predetermined period.

2. The plant control system according to claim 1 ,
The control rule update processing management unit
a control execution device that executes the update command on condition that the control rule to be updated is not being used in the controlled plant.

5. The plant control system according to claim 4 ,
a control execution device and a control method learning device for controlling a plant, the control execution device and the control method learning device being operable to output the update instruction on the condition that the control execution device and the control method learning device are not in a predetermined high load state.

2. The plant control system according to claim 1,
The control execution device is
a control rule execution unit that gives a control output in accordance with a predetermined combination of performance data of a controlled plant and a control operation;
a control output determination unit that determines whether or not the control output output by the control rule execution unit is acceptable and notifies the control method learning device that the performance data and the control operation are incorrect;
a control output suppression unit that, when the control output determination unit determines that the performance data of the controlled plant will deteriorate if the control output is output to the controlled plant, prevents the control output from being output to the controlled plant,
The control method learning device includes:
a control result quality judging section for judging whether the performance data has improved or deteriorated compared to before the control, after a time delay until the control effect appears in the performance data, when the control execution device actually outputs a control output to a controlled plant, a learning data creating section for obtaining teacher data by using the quality of the control result judged by the control result quality judging section and the control output, and a control rule learning section for learning the performance data and the teacher data as learning data,
a control rule execution unit that executes control of the plant control method by learning the control method learning device, and that obtains a combination of actual data and control operations for a plurality of control targets in accordance with a state of the plant to be controlled, and that uses the obtained combination of actual data and control operations as a predetermined combination of actual data and control operations for the plant to be controlled in the control rule execution unit.

7. A plant control system according to claim 6 , comprising:
A plant control system that learns and controls combinations of performance data and control operations by using information related to the size of the performance data and information that standardizes the performance data to make it easier to recognize patterns, in order to change the combination of performance data and control operations depending on the size of the performance data of the plant to be controlled.

The plant control system according to claim 6 or 7 ,
a control rule execution unit that holds a predetermined combination of performance data and control operations of a controlled plant as a first neural net, and a control rule learning unit that holds the combination of performance data and control operations as a second neural net, and a second neural net obtained as a result of learning in the control method learning device is used as the first neural net in the control rule execution unit.

A plant control system according to any one of claims 6 to 8 ,
A plant control system, characterized in that the control execution device includes a control operation disturbance generating unit that applies a disturbance to the control output, and the control method learning device learns even when a disturbance is applied.

The plant control system according to any one of claims 6 to 9 ,
a control method learning device that obtains a plurality of combinations of performance data and control operations by learning under a plurality of predetermined specifications, and the control execution device that selects one of the plurality of combinations of performance data and control operations in accordance with an operating state of a plant to be controlled, and provides the control output.

9. The plant control system according to claim 8 ,
A plant control system characterized in that a neural network that learns combinations of performance data and operation methods to be used is changed according to the magnitude of the performance data.

A plant control system according to any one of claims 6 to 11 ,
A plant control system characterized by changing criteria for judging whether a control result is good or bad based on the state of the controlled plant or the experience of the operator of the controlled plant, determining the relationship between performance data and operation methods for the controlled plant, and storing the relationship in a database, thereby controlling the controlled plant using different control methods depending on the state of the controlled plant or the experience of the operator of the controlled plant, etc.

A plant control system according to any one of claims 6 to 12 ,
a control model of a plant to be controlled is used to create a combination of the performance data and the control operation through a simulation before control is performed in the plant to be controlled, thereby shortening a learning period for the combination of the performance data and the control operation in the plant to be controlled.

A rolling mill control device to which the plant control system according to any one of claims 6 to 13 is applied,
13. A rolling mill control device, comprising: a rolling mill, the controlled plant being a rolling mill; and a rolling mill control system, the rolling mill control system being characterized in that the controlled plant is a rolling mill; and the performance data is a delivery shape of the rolling mill.

A plant control method executed by a plant control system that controls a controlled plant, comprising:
a control method learning device for the plant control system learning a control rule based on a combination of performance data and a control operation of the controlled plant;
a control execution device of the plant control system controls the controlled plant based on the control rule learned by the control method learning device;
A control rule update determination device of the plant control system calculates a degree of conformity of the control rule with respect to the controlled plant based on the performance data when the controlled plant is controlled based on the control rule, and updates the control rule based on the degree of conformity.
Including each process,
In the process of updating the control rule,
a control rule conformance evaluation unit of the control rule update determination device calculates the conformance at each predetermined timing,
a control rule update evaluation unit of the control rule update determination device determines whether or not the control rule needs to be updated and an update priority based on the conformance at each of the predetermined timings in a most recent predetermined period;
A control rule update processing management unit of the control rule update decision device outputs an update instruction for the control rule based on the necessity of the update and the update priority.
A plant control method comprising the steps of:

A plant control program for causing a computer to function as the plant control system or the rolling mill control device according to any one of claims 1 to 14 .