JP7749341B2

JP7749341B2 - Modular time series data prediction device, modular time series data prediction method, and program

Info

Publication number: JP7749341B2
Application number: JP2021079844A
Authority: JP
Inventors: 理山中; 由紀夫平岡
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2021-05-10
Filing date: 2021-05-10
Publication date: 2025-10-06
Anticipated expiration: 2041-05-10
Also published as: WO2022239609A1; JP2022173863A

Description

本発明の実施形態は、モジュラー型時系列データ予測装置、モジュラー型時系列データ予測方法、および、プログラムに関する。 Embodiments of the present invention relate to a modular time series data prediction device, a modular time series data prediction method, and a program.

上下水道システム、雨水排水システム、電力システム、交通システムなどのインフラシステム、あるいは、鉄鋼プロセス、石油化学プロセス、半導体製造プロセス、などのプロセス系の産業プラントなどでは、通常、複数のプロセス状態を測定する複数のオンラインセンサが設置されている。プロセス監視制御システム（ＳＣＡＤＡ: Supervisory Control And Data Acquisition）と呼ばれるシステムは、上記のインフラシステムや産業プラントに設置されたセンサ群の計測により得られるプロセスデータ（流量、温度、水質、操作量など）を取得し、時系列データとしてサーバ上に保持している。プロセス監視制御システムは、通常、これらの時系列データをトレンドグラフとして監視員に提供することが多い。 Infrastructure systems such as water supply and sewerage systems, stormwater drainage systems, power systems, and transportation systems, as well as industrial process plants such as steel processes, petrochemical processes, and semiconductor manufacturing processes, typically have multiple online sensors installed to measure multiple process conditions. A system known as a process monitoring and control system (SCADA: Supervisory Control And Data Acquisition) acquires process data (flow rate, temperature, water quality, manipulated variables, etc.) obtained through measurements by sensors installed in the above infrastructure systems and industrial plants, and stores it on a server as time-series data. Process monitoring and control systems typically provide this time-series data to supervisors as trend graphs.

監視員は、上記のような直接的な時系列データのトレンド監視に加えて、予測情報を利用するプラント監視を行うことがある。予測情報を利用するプラント監視システムによれば、監視中のリアルタイムの実測値（計測値）だけでなく、監視対象の将来の値を予測して、予測結果を運転管理者に提供することにより、適切な運転計画の実現を支援することができる。予測情報を利用するプラント監視システムは、プラントの異常兆候検出および異常診断と並んで、アドバンスト監視の典型的な手段である。 In addition to directly monitoring trends in time-series data as described above, monitors may also monitor plants using predictive information. Plant monitoring systems that use predictive information can predict future values of the monitored object, in addition to real-time actual measurements (measured values) during monitoring, and provide the predicted results to operation managers, thereby helping them to implement appropriate operation plans. Plant monitoring systems that use predictive information are a typical means of advanced monitoring, along with abnormal sign detection and anomaly diagnosis in plants.

予測情報を利用するプラント監視が用いられるインフラシステムの例として、例えば、浄水システムにおける水需要の予測、下水処理場への汚水流入予測、雨天時の下水処理場および雨水排水ポンプ場への雨水流入予測、雨天時の河川水位やダム水位の予測、あるいは、気象レーダなどによる降雨の短期予測（ナウキャスト）、交通システムにおける交通量予測や渋滞予測、電力システムにおける電力需要の予測、あるいは、ビルなどの施設内の食堂や店舗などの顧客数予測、などのシステムがあり、インフラ領域の様々な分野で広く使われている。 Examples of infrastructure systems that use plant monitoring that utilizes predictive information include predicting water demand in water purification systems, predicting wastewater inflow into sewage treatment plants, predicting rainwater inflow into sewage treatment plants and stormwater drainage pumping stations during rainy weather, predicting river and dam water levels during rainy weather, short-term rainfall forecasts (nowcasting) using weather radar, traffic volume forecasts and congestion forecasts in transportation systems, power demand forecasts in power systems, and customer numbers for cafeterias and shops within facilities such as buildings. These systems are widely used in a variety of infrastructure fields.

予測情報を利用するプラント監視システムは、予測情報そのものを付加価値として提供するだけでなく、プラントの運転計画やプラントの制御に予測情報を利用することで、より効率の良い運用やより安全性の高い運用を実現するためにも用いられ得る。例えば、水需要や電力需要等の需要予測は、水運用計画や発電および蓄電計画（ＥＭＳ：エネルギーマネージメントシステム）などの最適化に利用することで、効率の良い運用を実現するために用いることができる。また、例えば降雨予測や流入予測は、雨水排水ポンプの運転制御のための入力情報として予測値を用いることで、浸水の回避や抑制などのリスク低減に貢献する。 Plant monitoring systems that use predictive information not only provide the predictive information itself as added value, but can also be used to achieve more efficient and safer operations by using the predictive information in plant operation plans and plant control. For example, demand forecasts for water demand, electricity demand, etc. can be used to optimize water operation plans and power generation and storage plans (EMS: Energy Management Systems) to achieve more efficient operations. Furthermore, for example, rainfall forecasts and inflow forecasts can be used as input information for controlling the operation of stormwater drainage pumps, contributing to risk reduction by preventing or suppressing flooding.

インフラシステムだけでなくプロセス系のプラントである鉄鋼プロセスや石油化学プロセスなどにおいても、製品品質や歩留まりの予測などに時系列データ解析を用いることも多く、その情報は生産効率（歩留まり）向上のために利用されることも多い。 Time series data analysis is often used not only in infrastructure systems but also in process plants such as steel and petrochemical processes to predict product quality and yield, and this information is often used to improve production efficiency (yield).

時系列データの予測は、インフラシステムやプロセス系の産業プラントで極めて重要な役割を果たし、プラントの監視データを利用したデータドリブンの（ブラックボックス的アプローチの）予測方法として、従来から様々な方法が適用されている。 Time-series data forecasting plays an extremely important role in infrastructure systems and process industrial plants, and various data-driven (black-box approach) forecasting methods have traditionally been applied using plant monitoring data.

従来からプラント監視データなどの時系列データを用いた予測技術は広く用いられているが、近年の統計的機械学習分野を中心とするＡＩ関連の理論、技術、手法の急速な進歩に伴い、時系列データ解析に対するＡＩ手法も同時並行的に進展し、分野および領域を問わず、様々なアドバンストなＡＩ手法を時系列データの予測にも適用する動きが加速している。特に、ＮＮの発展形である深層学習ネットワーク（ＤＮＮ：ディープラーニングネットワーク）などは、昨今注目を集めている代表的な方法であり、ＤＮＮはＡＩの代名詞としても使われる場合もある。 Prediction techniques using time-series data such as plant monitoring data have been widely used for some time. However, with the rapid advances in AI-related theories, technologies, and methods in recent years, particularly in the field of statistical machine learning, AI methods for time-series data analysis have also been advancing in parallel, accelerating the trend toward applying various advanced AI methods to time-series data prediction, regardless of field or area. In particular, deep learning networks (DNNs), an advanced form of neural networks, are a representative method that has been attracting attention in recent years, and DNNs are sometimes used as a synonym for AI.

ＡＩの代表的な手法であるＤＮＮの他にも、重回帰分析で問題になる多重共線性による不安定化（悪条件問題）を回避する方法として（先に述べたＰＬＳとは異なる観点で）導入された正則化に基づく各種の方法、特に、古くからあるＬ２正則化（リッジ回帰）に加え、多数の説明変数候補の中から必要な説明変数を自動選択することのできるＬ１正則化手法であるLassoなどの方法、Lassoとは異なる視点でベイズ推論的な手法で説明変数の自動選択を行うRVM（適合ベクトルマシン）に基づく回帰手法（予測手法）であるＲＶＲ（適合ベクトル回帰）などの方法、最適化を用いて識別問題（分類問題）をロバスト化したＳＶＭ（サポートベクトルマシン）を回帰問題（予測問題）に転用したＳＶＲ（サポートベクトル回帰）などの方法、従来の重回帰などのような線形回帰を非線形に拡張し、さらに確率分布を考えてノンパラメトリック回帰問題に帰着させ、点予測ではなく予測分布を出力するガウス過程回帰などの方法、あるいは、様々な予測サブシステムを組み合わせて予測性能を向上させるバギングやブースティングと呼ばれる各種の方法（ランダムフォレスト、アダブースト、ＸＧブーストなど）など、多様なアドバンストな時系列データの予測手法も用いられる様になってきている。 In addition to DNN, a representative AI method, there are various methods based on regularization that were introduced (from a different perspective than the previously mentioned PLS) as a way to avoid instability (ill-conditioned problem) caused by multicollinearity, which is a problem in multiple regression analysis. In particular, in addition to the long-standing L2 regularization (ridge regression), there are methods such as Lasso, an L1 regularization method that can automatically select the necessary explanatory variables from a large number of candidate explanatory variables, and RVR (relevant vector regression), a regression method (prediction method) based on RVM (relevant vector machine) that automatically selects explanatory variables using a Bayesian inference method from a different perspective than Lasso, and methods that use optimization to solve classification problems ( A variety of advanced time series data forecasting methods are also coming into use, such as SVR (support vector regression), which adapts SVMs (support vector machines), which have been made robust for classification problems, to regression problems (prediction problems); Gaussian process regression, which nonlinearly extends linear regression such as conventional multiple regression and further considers probability distributions to reduce it to a nonparametric regression problem, outputting a predictive distribution rather than a point prediction; and various methods known as bagging and boosting (random forest, adaboost, XG boost, etc.), which combine various prediction subsystems to improve predictive performance.

このように、アドバンストなＡＩに基づく予測手法が次々と開発され、従来から用いられる方法と比較して各段に予測精度が向上するという報告も数多くなされる様になり、ＡＩ関連技術に関する期待が高まっている。 In this way, advanced AI-based prediction methods are being developed one after another, and there have been many reports of significantly improved prediction accuracy compared to traditional methods, raising expectations for AI-related technologies.

一方で、ＡＩ関連技術が実際の現実的なシステム（インフラシステムやプロセス系のプラント）において、必ずしも容易に展開できない理由の一つとして、統計的およびＡＩ的な解析手法が持つ本質的な性質であるブラックボックス的なアプローチの欠点として指摘されている、「結果に対する説明性の欠如」の問題がある。そして、この「結果に対する説明性の欠如」の問題に対する意識が急速に高まっており、この問題を解決すべく、最近では「説明可能ＡＩ（ＸＡＩ：eXplainable AI)の概念が米国ＤＡＲＰＡにより提唱され、注目が集まっている。 On the other hand, one of the reasons why AI-related technologies are not necessarily easily deployed in actual, realistic systems (infrastructure systems or process plants) is the problem of "lack of explainability of results," which has been pointed out as a drawback of the black-box approach that is an essential characteristic of statistical and AI analysis methods. Awareness of this problem of "lack of explainability of results" is rapidly increasing, and in an effort to solve this problem, the concept of "explainable AI (XAI: eXplainable AI)" has recently been proposed by the US DARPA, which has attracted attention.

特許第６２６１９６０号公報Patent No. 6261960

インフラ系のシステムでは、物理的、化学的な知見に立脚したホワイトボックス的なモデルを用いる事も多いが、これは、「Accountability（説明責任）」が求められることに関連していると考えられる。実際、ホワイトボックス的なモデルによる予測精度がブラックボックス的なモデルによる予測精度よりも必ずしも高いわけではなく、また、ホワイトボックス的なモデルが必ずしも、実際の物理現象を正確かつ忠実に再現しているとも限らない。それにも関わらず、ホワイトボックスモデルが時として好まれ、利用される理由は、ホワイトボックスモデルに含まれる「パラメータ」には物理的および化学的な意味があるため「合理的な説明がしやすく、他者に納得してもらいやすい」ことに加え、このような物理的および化学的な意味を持つことにより、精度が十分でない場合に、どのように調整すればよいかという方針や指針をたてやすい、すなわち、調整しやすい、という利点があるためであると考えられる。実際、物理的および化学的に意味を持つパラメータは、パラメータの値自体に物理的および化学的意味があるため、理解がしやすく、そのパラメータの値の取りうる範囲を想定することや、値の増減が結果に与える影響を物理的および化学的知見に基づいて考えることができる。 Infrastructure systems often use white-box models based on physical and chemical knowledge, which is thought to be related to the need for "accountability." In fact, the predictive accuracy of white-box models is not necessarily higher than that of black-box models, and white-box models do not necessarily accurately and faithfully reproduce actual physical phenomena. Despite this, white-box models are sometimes preferred and used because the "parameters" contained in white-box models have physical and chemical meanings, making them "easy to explain rationally and easy to convince others." Furthermore, because these parameters have physical and chemical meanings, it is easy to establish policies and guidelines for how to adjust them if accuracy is insufficient. In fact, parameters with physical and chemical meanings are easy to understand because the parameter values themselves have physical and chemical meanings. It is possible to imagine the range of possible values for the parameter and consider the impact of increasing or decreasing the value on the results based on physical and chemical knowledge.

上記のように、「合理的な説明のしやすさと調整のしやすさ」がホワイトボックスモデルを用いることの大きな動機付けになっている可能性がある。もしＡＩ的な手法によるブラックボックスモデルにおいても「合理的な説明のしやすさと調整のしやすさ」という要素を付加することができれば、その応用範囲は大きく広がると考えられる。 As mentioned above, "ease of rational explanation and ease of adjustment" may be a major motivation for using white box models. If the elements of "ease of rational explanation and ease of adjustment" could be added to black box models using AI-like techniques, it is believed that their range of applications would be greatly expanded.

本発明の実施形態は、上記事情を鑑みて成されたものであって、予測結果を合理的に説明するとともに、容易に予測結果を調整可能とするモジュラー型時系列データ予測装置、モジュラー型時系列データ予測方法、および、プログラムを提供することを目的とする。 Embodiments of the present invention have been made in consideration of the above circumstances, and aim to provide a modular time series data prediction device, a modular time series data prediction method, and a program that rationally explain prediction results and allow for easy adjustment of prediction results.

実施形態によるモジュラー型時系列データ予測装置は、複数のプロセス変数を所定の周期で計測する複数のプロセスセンサを有するシステム又はプロセスに適用される装置であって、複数の前記プロセス変数の時系列データを所定の周期で収集し保存するとともに、複数の前記プロセス変数の中から予測対象となる少なくとも一つの出力変数を選択する出力変数データ選択部と、複数の前記プロセス変数の中から複数の入力変数を選択する入力変数データ選択部と、を含むデータ収集保存部と、前記時系列データから抽出された前記出力変数と複数の前記入力変数との同定用データを用いて、１入力１出力のペア毎にペアワイズ予測モデルのパラメータを同定して複数の前記ペアワイズ予測モデルを定義するペアワイズ予測モデル同定部と、複数の前記ペアワイズ予測モデルから出力されるペアワイズ予測値の合成法を定義する予測モデル合成法定義部と、時間の進行方向に所定の周期又はリアルタイムで前記時系列データから抽出された複数の前記入力変数の予測用データを、複数の前記ペアワイズ予測モデルに入力して、複数の前記入力変数のそれぞれに対応する前記ペアワイズ予測値を演算するペアワイズ出力変数予測部と、前記合成法により複数の前記ペアワイズ予測値を合成して前記出力変数の予測値を演算する合成出力変数予測部と、を有する。
A modular time-series data prediction device according to an embodiment is a device applied to a system or process having a plurality of process sensors that measure a plurality of process variables at a predetermined cycle, and includes a data collection and storage unit that collects and stores time-series data of the plurality of process variables at a predetermined cycle, and that includes an output variable data selection unit that selects at least one output variable to be predicted from the plurality of process variables, and an input variable data selection unit that selects a plurality of input variables from the plurality of process variables; and a data collection and storage unit that performs pairwise prediction for each one-input-one-output pair using identification data of the output variable and the plurality of input variables extracted from the time-series data. a pairwise prediction model identification unit that identifies parameters of a prediction model to define a plurality of pairwise prediction models; a prediction model synthesis method definition unit that defines a synthesis method for pairwise prediction values output from the plurality of pairwise prediction models; a pairwise output variable prediction unit that inputs prediction data for the plurality of input variables extracted from the time-series data in a predetermined cycle in the time progression direction or in real time to the plurality of pairwise prediction models to calculate the pairwise prediction values corresponding to each of the plurality of input variables; and a synthesis output variable prediction unit that synthesizes the plurality of pairwise prediction values by the synthesis method to calculate a prediction value of the output variable.

図１は、一実施形態のモジュラー型時系列データ予測装置を適用した雨量流入予測システムを概略的に示す図である。FIG. 1 is a diagram illustrating a rainfall inflow prediction system to which a modular time-series data prediction device according to an embodiment of the present invention is applied. 図２は、一実施形態のモジュラー型時系列データ予測装置の一構成例を概略的に示す図である。FIG. 2 is a diagram illustrating an example of the configuration of a modular time-series data prediction device according to an embodiment. 図３は、幹線水位データと流入量データとの関係の一例を概略的に示す散布図である。FIG. 3 is a scatter diagram that schematically illustrates an example of the relationship between main water level data and inflow data. 図４は、幹線水位データと流入量データとの関係の一例を概略的に示す散布図である。FIG. 4 is a scatter diagram that schematically illustrates an example of the relationship between main water level data and inflow data. 図５は、一実施形態のモジュラー型の時系列予測装置におけるペアワイズ予測モデルの第１実施例について説明するための図である。FIG. 5 is a diagram illustrating a first example of a pairwise prediction model in a modular time series prediction device according to an embodiment. 図６は、一実施形態のモジュラー型の時系列予測装置におけるペアワイズ予測モデルの第２実施例について説明するための図である。FIG. 6 is a diagram illustrating a second example of a pairwise prediction model in a modular time series prediction device according to an embodiment. 図７は、第２実施例のペアワイズ予測モデルの効果を説明するための図である。FIG. 7 is a diagram for explaining the effect of the pairwise prediction model of the second embodiment. 図８は、第２実施例のペアワイズ予測モデルの効果を説明するための図である。FIG. 8 is a diagram for explaining the effect of the pairwise prediction model of the second embodiment. 図９は、一実施形態のモジュラー型の時系列予測装置におけるペアワイズ予測モデルの第３実施例について説明するための図である。FIG. 9 is a diagram illustrating a third example of a pairwise prediction model in a modular time series prediction device according to an embodiment. 図１０は、第３実施例のペアワイズ予測モデルの効果の一例を説明するための図である。FIG. 10 is a diagram for explaining an example of the effect of the pairwise prediction model of the third embodiment. 図１１は、第３実施例のペアワイズ予測モデルの効果の一例を説明するための図である。FIG. 11 is a diagram for explaining an example of the effect of the pairwise prediction model of the third embodiment.

本実施形態では、調整可能な時系列デー予測を行うための基本的手段として、モジュラー型の時系列データ予測装置について説明する。本実施形態のモジュラー型の時系列予測装置は、予測モデル（システム）を部分モデル（サブシステム）に分解可能にして、各部分モデルを容易に着脱可能にするものである。 In this embodiment, a modular time series data prediction device is described as a basic means for performing adjustable time series prediction. The modular time series prediction device of this embodiment allows a prediction model (system) to be decomposed into sub-models (subsystems), making each sub-model easily attachable and detachable.

モジュラー型の時系列データ予測装置によれば、例えば、複数の監視変数中のいずれか一つの変数のデータが欠測していたり、複数の監視変数にアウトライア（異常データ）が含まれていたり、監視変数がノイズ等を多量に含む不安定な計測データである場合、その影響が全体に波及することを回避できる。また、モジュラー型の時系列データ予測装置によれば、例えば、説明変数が必要であるか否（不要）かを適宜判断して、部分モデルを容易に脱着したり、予測モデルを部分的に調整したりすることができる。 A modular time series data prediction device can prevent the effects of, for example, when data for one of multiple monitored variables is missing, when multiple monitored variables contain outliers (abnormal data), or when monitored variables are unstable measurement data containing a large amount of noise, from spreading to the entire system. Furthermore, a modular time series data prediction device can easily determine whether explanatory variables are necessary (or not), and can easily detach partial models or partially adjust the prediction model.

このような利点は、モジュラー型の構成によりモデルが「分解可能」であるという事によって得られる利点である。予測モデルにおいてモジュラー型の構成をとる利点は、単に分解可能ということによって得られる利点だけでなく、これに加えて、このモジュラー型の構成をとることで、物理法則に矛盾せずに合理的で納得できるモデルを構築することが可能になり、さらに、このことにより調整を容易にすることができる点である。 These advantages are achieved by the modular structure, which allows the model to be "decomposable." The advantage of using a modular structure for predictive models is not simply that it is decomposable; in addition, this modular structure makes it possible to build a rational and convincing model that does not contradict the laws of physics, and this also makes it easier to adjust.

予測モデルにおいてモジュラー型の構成をとる利点について以下に説明する。
一般に、ブラックボックス的な手法が説明可能（Explainable）でない理由は、入力および出力という外部情報だけしかわからず、内部構造が見えない不可視である事が原因であると考えられている。すなわち、ブラックボックス的な手法は、単に内部構造が見えないだけでなく、そもそも、外部の入出力関係からだけでは、内部構造を一意に決めることができず、同じ入出力関係を与える複数（無数）の内部構造がありうる。このため、内部構造の解釈が難しくなり、場合によっては物理法則に矛盾する不合理なモデルになってしまうことがある。これは、モデルの一意性の問題として、制御理論分野の中に含まれるであるシステム同定分野等で知られた事項である。このようなモデルの一意性の概念は、例えば、「ある入力データと出力データのペアに対して、その入出力関係を表すモデルを唯一に決定することができるか」という問題であり、モデルの可同定問題と呼ばれている。 The advantages of a modular approach to predictive models are explained below.
It is generally believed that black-box approaches are not explainable because they only provide external information, such as inputs and outputs, and their internal structure is invisible. In other words, black-box approaches not only hide the internal structure, but also make it impossible to uniquely determine the internal structure from the external input-output relationships alone. There are potentially multiple (or even countless) internal structures that give the same input-output relationships. This makes it difficult to interpret the internal structure, and in some cases can lead to irrational models that contradict the laws of physics. This issue is known as the model uniqueness problem in the field of system identification, which is part of the field of control theory. The concept of model uniqueness is, for example, the question of whether, for a given pair of input and output data, it is possible to uniquely determine a model that represents the input-output relationship. This is known as the model identifiability problem.

可同定問題は、モデルの構造（モデルを表現する数式の形）を予め限定した場合、例えば、モデルを線形伝達関数モデルや時系列モデルに限定した場合、そのモデルに含まれるパラメータ（例えば伝達関数に含まれる係数）を一意に決めることができるかという問題となる。この場合、可同定問題の中でも特にパラメータ可同定性の問題と呼ばれる。このパラメータ可同定性の問題は、現実のデータを扱う問題でも頻繁に現れる問題であり、以下の３つのケースは特に遭遇することが多い問題である。 The identifiability problem arises when the structure of a model (the form of the mathematical formula that represents the model) is limited in advance, for example, when the model is limited to a linear transfer function model or a time series model, and the question is whether the parameters included in that model (for example, the coefficients included in the transfer function) can be uniquely determined. In this case, the problem is specifically called a parameter identifiability problem. This parameter identifiability problem frequently arises in problems that deal with real-world data, and the following three cases are particularly commonly encountered.

一つ目は、先に述べた多重共線性の問題であり、これは、入力変数（説明変数）間に強い相関を持つ場合の問題である。これは、例えば、雨水ポンプ場への雨水流入量を予測する場合に、入力変数としてレーダ雨量情報を用いて、近接する異なる２つのメッシュの降雨量を入力変数とするような場合などに典型的に現れる問題であり、現実のデータでは、説明変数間に強い相関を持つことは極めて多い。 The first is the problem of multicollinearity mentioned earlier, which occurs when there is a strong correlation between input variables (explanatory variables). This is a problem that typically arises when, for example, predicting the amount of rainwater inflow into a stormwater pumping station, using radar rainfall information as an input variable, and using the rainfall amounts of two different adjacent meshes as input variables; in real-world data, it is extremely common for there to be a strong correlation between explanatory variables.

二つ目は、制御理論の分野でよく知られている、持続的励振条件（Persistent Exciting Condition、PE条件）と呼ばれる条件が成立しない場合の問題であり、これは、モデル化の対象プロセスに対して、十分な周波数成分を含まない入力データが適用されている場合に生じる問題である。極端ではあるが、現実にも認められる例として、入力データが一定値である場合、通常出力データも一定値になるが、このような一定値の入出力関係を表す伝達関数モデルは無数に存在するため、係数を一意に決めることができない。例えば、y(t)=b1×u(t-1)+b2×u(t-2)と表される簡単な伝達関数モデル（時系列モデル）を考える場合、u(t-1)=u(t-2)=ｋ（一定値）であるとb1とb2を一意に決めることはできない。この概念を一般化したものが持続的励振条件というパラメータ可同定条件であるが、このような条件が成立しない場合も多い。 The second problem occurs when the persistent exciting condition (PE condition), well known in the field of control theory, is not met. This occurs when input data that does not contain sufficient frequency components is applied to the process being modeled. An extreme, yet realistic, example is when the input data is constant, the output data is usually also constant. However, because there are countless transfer function models that represent this constant input-output relationship, it is not possible to uniquely determine the coefficients. For example, when considering a simple transfer function model (time series model) expressed as y(t) = b1 × u(t-1) + b2 × u(t-2), if u(t-1) = u(t-2) = k (a constant value), b1 and b2 cannot be uniquely determined. A generalization of this concept is the parameter identifiability condition known as the persistent exciting condition, but such conditions often do not hold.

三つ目は、モデルの構造の複雑さと比較して、データの得られる量が少なくデータの質が良くない場合である。典型的な例として、ある対象が非線形的な挙動を示すことが物理的な観点から予めわかっているが、得られるデータが少なかったり、ある特定の運用条件の付近のデータしか得られなかったりすることにより、非線形の係数を同定できない様な場合である。例えば、y(t)=c1×u(t)+c2×u(t)^2と書かれることがわかっているが、データとして得られるu(t)の値が小さい値の場合、u(t)^2<<u(t)となるため、c2を同定することが困難になる様な場合である。 The third case is when the amount of available data is small and the quality of the data is poor compared to the complexity of the model structure. A typical example is when it is known from a physical perspective that a certain object will exhibit nonlinear behavior, but the available data is insufficient, or only data is available around certain operating conditions, making it impossible to identify the nonlinear coefficients. For example, it is known that y(t) can be written as y(t)=c1×u(t)+c2×u(t)^2, but when the value of u(t) available as data is small, u(t)^2<<u(t), making it difficult to identify c2.

上記三つの例は、パラメータの可同定性が欠落する典型的なケースであるが、現実の問題では、このような状況に遭遇することが極めて多い。モデル化の対象が、比較的小さな機械系システムおよび電気系システムの場合は、可同定性を向上させるための実験を行い、可同定性を確保した上で、モデルを構築することが常道である。しかし、上下水道、雨水排水、電力システム、などの巨大な社会インフラシステムの場合、モデル化の対象となるデータが自然現象から得られるデータ（降雨、日射量、風速など）であったり、対象プラントが稼働している状態のデータであったりすることが多く、パラメータの可同定性を向上させるための実験などを行うことができず、モデル化に利用できるデータは与えられたものであることが多いため、可同定性の欠落を回避することが本質的に難しい場合が多い。この結果、社会インフラシステムの場合には、可同定性が欠落した状態でモデルを構築せざるを得ない。 The above three examples are typical cases where parameter identifiability is lacking, but in real-world problems, such situations are extremely common. When modeling relatively small mechanical and electrical systems, it is common practice to conduct experiments to improve identifiability and build a model only after ensuring identifiability. However, when it comes to large-scale social infrastructure systems such as water supply and sewerage, stormwater drainage, and power systems, the data to be modeled is often data obtained from natural phenomena (rainfall, solar radiation, wind speed, etc.) or data from the target plant's operating state. This makes it impossible to conduct experiments to improve parameter identifiability, and the data available for modeling is often given, making it inherently difficult to avoid a lack of identifiability. As a result, when it comes to social infrastructure systems, models must be built without identifiability.

以下、可同定性の欠落による問題の一例として「多重共線性」について、従来の技術と本実施形態のモジュラー型時系列データ予測装置とを対比する。
「多重共線性」とは、複数の説明変数（入力変数）間に相関関係（一次従属関係）があることにより、回帰モデルのパラメータを一意に決めることができなくなるという問題である。これを最も簡単な例を用いて説明する。 Hereinafter, the conventional technology and the modular time series data prediction device of this embodiment will be compared with regard to "multicollinearity," which is an example of a problem caused by a lack of identifiability.
"Multicollinearity" is a problem where the parameters of a regression model cannot be uniquely determined due to a correlation (linear dependence) between multiple explanatory variables (input variables). This will be explained using the simplest example.

一つの出力yが二つの入力ｕ１、ｕ２の重回帰で表される以下の重回帰モデルを考える。
y=a1×u1＋a2×u2 （１）
(1)式は最も簡単な重回帰モデルであり、ａ１とａ２とは回帰係数と呼ばれるパラメータである。ここで、物理的な具体的イメージを持つために、例えば、出力ｙを雨水ポンプ場への雨水流入量、入力ｕ１および入力ｕ２を各々異なる箇所の降雨量であることを想定して考えてみる。この場合、降雨量が増加すれば流入量が増加することは、物理的に考えて自明であるから、ａ１とａ２とは正の値を持たなければならないことは容易にわかる。ここで、降雨量ｕ１と降雨量ｕ２とに従属関係のある多重共線性の問題がある場合を想定する。 Consider the following multiple regression model in which one output y is expressed by multiple regression of two inputs u1 and u2.
y = a1 × u1 + a2 × u2 (1)
Equation (1) is the simplest multiple regression model, with a1 and a2 being parameters called regression coefficients. To get a concrete physical image, let's consider the output y as the amount of rainwater inflow to a stormwater pumping station, and inputs u1 and u2 as the amounts of rainfall at different locations. In this case, it is self-evident from a physical perspective that an increase in rainfall will increase the amount of inflow, so it is easy to see that a1 and a2 must have positive values. Now, let's consider a case where there is a multicollinearity problem, where rainfall u1 and rainfall u2 are dependent on each other.

この多重共線性問題は、実際の現象においても頻繁に見られることであり、この場合、例えば、入力ｕ１と入力ｕ２とをレーダ雨量データの隣接メッシュの降雨量であることを想定してみれば、この問題が特殊な問題ではなく、むしろ自然な問題であることが容易にわかる。例えば、国土交通省が配信するＸＲＡＩＮと呼ばれる気象レーダでは、１メッシュのサイズが２５０ｍ×２５０ｍであるため、隣接メッシュの降雨量とは互いに数百メートル離れた箇所の降雨量を意味しており、その降雨量がほぼ同じであることは物理的に考えて極めて自然な事である。２つの降雨量データがレーダ雨量の隣接メッシュの降雨量ではなくて、２か所の地上雨量データであったとしても、通常雨水排水区の大きさが数キロ四方程度以内であることを考えると比較的近い値を持つことは容易に想像できる。従って、このような場合ｕ１≒ｕ２となっていることは、自然な事であり、これは多重共線性問題そのものである。 This multicollinearity problem is frequently observed in real-world phenomena. In this case, if we consider inputs u1 and u2 as rainfall amounts for adjacent meshes of radar rainfall data, it becomes clear that this problem is not unique, but rather a natural one. For example, in the weather radar called XRAIN provided by the Ministry of Land, Infrastructure, Transport and Tourism, the size of each mesh is 250m x 250m, so the rainfall amounts for adjacent meshes refer to rainfall amounts at locations several hundred meters apart, and it is quite natural from a physical perspective that these rainfall amounts are nearly identical. Even if the two rainfall data are not rainfall amounts for adjacent meshes of radar rainfall but ground rainfall data from two locations, it is easy to imagine that the values will be relatively close, given that the size of a stormwater drainage district is usually within a few square kilometers. Therefore, in such cases, it is natural that u1 ≒ u2, which is a true multicollinearity problem.

そこで、ここでは簡単のため、ｕ１＝ｕ２という完全な多重共線性の状態にあった場合を想定する。そして、例えば、２０ｍｍ／ｈの強度の雨量に対して、１０ｍ^３／ｓの雨水流入があるとする。今、ｕ１＝ｕ２であるので、ｕ：＝ｕ１＝ｕ２と定義すると、（１）式は、以下の（２）式の様に書き換えられる。 For simplicity's sake, let us assume that there is perfect multicollinearity, i.e., u1 = u2. For example, suppose that there is rainfall of 20 mm/h and rainwater inflow of 10 ^m3 /s. Now, since u1 = u2, if we define u: = u1 = u2, then equation (1) can be rewritten as equation (2) below.

y=(a1+a2)×u （２）
上記(2)式は単回帰の形であり、ｕ：＝ｕ１＝ｕ２であるから、ａ１＋ａ２＝０．５となる。しかし、ａ１＋ａ２＝０．５という制約条件を満足してさえいれば、ａ１とａ２とがいかなる値であっても、（２）式は全く同じ出力結果を出すため、ａ１とａ２の各値はａ１＋ａ２＝０．５を満たす範囲で任意の値を持つことができる。そのため、例えば、ａ１＝３．５、ａ２＝－３でも、ａ１＝１００、ａ２＝－９９．５となっても問題がなく、全く同じ入出力関係を与えるため、各々のパラメータ値の相違を入出力データから識別することができなくなる。 y = (a1 + a2) × u (2)
The above equation (2) is in the form of a simple regression, and since u = u1 = u2, a1 + a2 = 0.5. However, as long as the constraint a1 + a2 = 0.5 is satisfied, equation (2) will produce the exact same output result regardless of the values of a1 and a2. Therefore, the values of a1 and a2 can have any value within the range that satisfies a1 + a2 = 0.5. Therefore, for example, it is no problem if a1 = 3.5 and a2 = -3 or if a1 = 100 and a2 = -99.5, and they give the exact same input-output relationship, so it becomes impossible to distinguish the differences in each parameter value from the input-output data.

一方、ａ１とａ２とは、各雨量に対する係数であるから、本来、すくなくとも正の値を持つべきであり、２地点の降雨量が同じであるなら、その値はほぼ等しくなるべきであることも物理的に考えれば妥当である。しかし、このように、入力変数間の相関関係（一次従属関係）により、パラメータ値を一意に決めることができず、本来のパラメータ値を正しく同定できなくなる事が多重共線性の問題の本質である。 On the other hand, since a1 and a2 are coefficients for each rainfall amount, they should essentially have at least positive values, and it is physically reasonable that if the rainfall amounts at two locations are the same, their values should be roughly equal. However, due to the correlation (linear dependence) between input variables, parameter values cannot be uniquely determined, and the true parameter values cannot be correctly identified; this is the essence of the problem of multicollinearity.

このような多重共線性などのパラメータ可同定性の欠如により、入出力関係から一意にパラメータを決めることができなくなる問題は、システム同定分野や統計分野（統計的機械学習分野≒AI分野）では、数学的な「悪条件問題」として扱われ、この悪条件を回避するための各種のアルゴリズムが開発されている。先に述べた正則化やＰＬＳなどの方法は、このような悪条件回避のための代表的な技術的手段（テクニック）であり、アドバンストなＡＩ手法の多くには、正則化や（ＰＬＳで用いられる）次元圧縮などの手法(テクニック)が採用されている。 This lack of parameter identifiability, such as multicollinearity, makes it impossible to uniquely determine parameters from input-output relationships. In the fields of system identification and statistics (statistical machine learning = AI), this problem is treated as a mathematical "ill-posed problem," and various algorithms have been developed to avoid this ill-posedness. Methods such as the regularization and PLS mentioned above are typical technical means (techniques) for avoiding such ill-posedness, and many advanced AI methods employ techniques such as regularization and dimensionality reduction (used in PLS).

しかし、例えば、正則化やＰＬＳなどのテクニックを駆使したアドバンストなアルゴリズムを（１）式の問題に適用したとしても、上記の問題に関して、ａ１＞０、ａ２＞０、かつａ１≒ａ２となるような値を推定するとは限らず、むしろ、そのような値にならない可能性の方が高い。これは、正則化や次元圧縮などは、多重共線性（やこれを一般化した可同定性の欠如）の問題を、数学的な悪条件の問題として捉え、これを回避するためのある種の“テクニック”であり、悪条件を回避してパラメータを推定（同定）できるようにしているだけであるからである。すなわち、多重共線性を持つ重回帰問題に対して古典的な最小２乗法を適用すると、「答えを求めることができない＝パラメータ値を求めることができない」という問題になるため、悪条件を回避するというテクニック（正則化や次元圧縮）をアルゴリズムに導入することで、「答えを求めることができる＝パラメータ値を同定できる」問題にしているだけであり、同定した値が物理的に合理的に納得できるものであるか否かを直接考慮しているわけではないからである。 However, even if an advanced algorithm utilizing techniques such as regularization or PLS is applied to the problem in equation (1), it is not guaranteed that it will estimate values such that a1 > 0, a2 > 0, and a1 ≒ a2. In fact, it is highly likely that such values will not be obtained. This is because regularization and dimensionality reduction are techniques for avoiding the problem of multicollinearity (or its generalized equivalent, the lack of identifiability) as a mathematically ill-conditioned problem; they merely avoid ill-conditions and enable parameter estimation (identification). In other words, applying classical least squares to a multiple regression problem with multicollinearity results in the problem of "not being able to find an answer = not being able to find parameter values." Therefore, by introducing techniques for avoiding ill-conditions (regularization and dimensionality reduction) into the algorithm, the problem becomes "being able to find an answer = being able to identify parameter values," without directly considering whether the identified values are physically reasonable.

実際、例えば、正則化のテクニックでは、悪条件を回避するために、パラメータを同定する際に、予測誤差を評価するだけでなく、「パラメータの値自身が大きくなりすぎない」というパラメータに関するノルム（大きさを図る指標）を小さくする、という評価指標を加えることで悪条件を回避しており、これはパラメータの物理的な意味づけや解釈を行うこととは直接関係していない。 In fact, for example, in regularization techniques, in order to avoid bad conditions, when identifying parameters, not only is the prediction error evaluated, but an evaluation index is added that reduces the parameter norm (an index that measures magnitude) so that "the parameter value itself does not become too large." This is not directly related to assigning physical meaning or interpretation to the parameters.

一方、このような問題に対して、合理的に納得のできるパラメータ値を得る方法について以下に説明する。はじめに、（１）式のモデルではなく、（３）式と（４）式との単回帰モデルを考える。
y=a1´×u1 （３）
y=a2´×u2 （４） On the other hand, a method for obtaining parameter values that are reasonably acceptable for such a problem will be described below. First, a simple regression model of equations (3) and (4) will be considered, instead of the model of equation (1).
y=a1´×u1 (3)
y=a2´×u2 (4)

この時、ｕ１＝ｕ２であり、ｕ１＝ｕ２＝２０ｍｍ／ｈに対して、ｙ＝１０ｍ^３／ｓを出力するモデルは、単位系をそのままで考えると、ａ１´＝ａ２´＝０．５であることは明らかである。従って、このような関係を持つ場合に、（３）式と（４）式と単回帰を適用すると、ａ１´＝ａ２´＝０．５に近い値がパラメータとして同定（学習）されることになる。この０．５という値は、（回帰モデルなので物理的な意味は明確ではないが）合理的で納得できる値である。 In this case, when u1 = u2 and u1 = u2 = 20 mm/h, the model that outputs y = 10 ^m3 /s clearly has a1' = a2' = 0.5 if we consider the unit system as it is. Therefore, when such a relationship exists, if we apply simple regression to equations (3) and (4), a value close to a1' = a2' = 0.5 will be identified (learned) as the parameter. This value of 0.5 is a reasonable and convincing value (although the physical meaning is unclear because it is a regression model).

次に、（３）式と（４）式とを用いて、（１）式の重回帰モデルを構築することを考える。
今回の例では、入力変数が２か所の降雨量となり、その和をとっているため、入力となる降雨量が倍になるので、各降雨量に対する係数のａ１とａ２とは、一つの降雨量を入力とした単回帰モデルの係数の半分となり、ａ１＝ａ２＝０．２５となると考える事が物理的に妥当な納得できるパラメータ値であると考えられる。 Next, consider constructing a multiple regression model of equation (1) using equations (3) and (4).
In this example, the input variables are the rainfall amounts at two locations, and since the sum of these is taken, the input rainfall amount doubles. Therefore, the coefficients a1 and a2 for each rainfall amount are half the coefficients of a simple regression model with one rainfall amount as input, and it is considered that a1 = a2 = 0.25 is a physically reasonable and plausible parameter value.

このようにして、（３）式と（４）式とでｕ１とｙとの関係、および、ｕ２とｙとの関係を単回帰で各々求めて、２つの予測モデルを構築する。そして、次に２つの予測モデルの出力ｙの平均を計算する。このように、構成的（Constructive）な手順を踏むと、ａ１とａ２との値は、各々０．２５と、物理的に妥当と考えられる納得のできる値として推定され、結果的に、（１）式の形の重回帰式の係数を定めることができる。これは、部分モデル（サブシステム）を結合していくモジュラー型のアプローチそのものである。 In this way, the relationship between u1 and y, and the relationship between u2 and y are determined using simple regression in equations (3) and (4), respectively, to construct two predictive models. Next, the average output y of the two predictive models is calculated. By following this constructive procedure, the values of a1 and a2 are each estimated to be 0.25, which are reasonable values that are considered physically reasonable, and as a result, the coefficients of the multiple regression equation in the form of equation (1) can be determined. This is the very essence of a modular approach, which combines partial models (subsystems).

上記重回帰モデルにより係数を定めることができる理由は、一般に、入出力関係を表すデータとして同じデータを用いる場合、モデル構造が簡単（≒モデルに含まれるパラメータ数が少ない）なモデルの方が、可同定性が高くなる事（＝パラメータ値を一意に同定可能）が知られているためである。本実施形態のモジュラー型時系列データ予測装置は、上記の特性を利用している。すなわち、同じ入出力データであったとしても、その可同定性はモデルの構造やパラメータ数に依存し、パラメータ数が多いモデルでは可同定でなくなる場合であっても、パラメータ数が少なければ可同定になる場合がある、ということを利用している。 The reason that the coefficients can be determined using the above multiple regression model is that, generally, when the same data is used to represent input/output relationships, models with simpler model structures (i.e., models with fewer parameters) have higher identifiability (i.e., parameter values can be uniquely identified). The modular time series data prediction device of this embodiment makes use of this characteristic. In other words, even when the same input/output data is used, identifiability depends on the model structure and number of parameters, and even if a model with a large number of parameters is not identifiable, it may be identifiable if the number of parameters is small.

上記の例でいえば、本実施形態のモジュラー型時系列データ予測装置では、２入力の重回帰モデルでは多重共線性により可同定でなくなるが、これを１入力の単回帰の組み合わせと考えることにより、多重共線性の問題を本質的に回避して、可同定なモデルを積み上げることによって合理的なモデル構築を可能にしている。 In the above example, in the modular time series data forecasting device of this embodiment, a two-input multiple regression model becomes non-identifiable due to multicollinearity, but by considering this as a combination of one-input simple regressions, the problem of multicollinearity is essentially avoided, and rational model construction is made possible by stacking identifiable models.

次に、本実施形態のモジュラー型時系列データ予測装置と、いわゆる「Explainable AI（ＸＡＩ）」との関係および相違について説明する。
いわゆるＸＡＩでは、全体を一つの大きなシステムとして捉えてブラックボックス的にモデルを構築して、後から、そのモデルを適切に解釈可能な形に分解して説明を加えようとするトップダウン型のアプローチ（あるいは帰納的なアプローチ）である。これに対し、本実施形態のモジュラー型時系列データ予測装置の考え方は、個別には比較的簡単に理解できる合理的な説明が可能なモデル（サブシステム）を統合していく事により、全体の予測モデルを構築していくボトムアップ型のアプローチ（あるいは演繹的なアプローチ）である。 Next, the relationship and differences between the modular time-series data prediction device of this embodiment and so-called "Explainable AI (XAI)" will be described.
So-called XAI is a top-down approach (or inductive approach) in which a black-box model is constructed by treating the whole as one large system, and then the model is broken down into an appropriately interpretable form and an explanation is added later.In contrast, the modular time-series data prediction device of this embodiment is a bottom-up approach (or deductive approach) in which an overall prediction model is constructed by integrating models (subsystems) that are individually relatively easy to understand and capable of rational explanation.

上記のようなボトムアップ型のアプローチでは、サブシステム間の相互の干渉を予め考慮することができないため、予測精度面に注目した場合には、トップダウン的（ＸＡＩ的）なアプローチに劣る可能性を否定できない。しかしながら、ボトムアップ型のアプローチは、合理的な説明性という意味では、先に説明したようにパラメータの可同定性を維持したモデル構築が可能になり、トップダウン的なアプローチよりも優れる可能性が高い。 The bottom-up approach described above cannot take into account mutual interference between subsystems in advance, so it cannot be denied that it may be inferior to a top-down (XAI) approach when focusing on prediction accuracy. However, in terms of rational explainability, a bottom-up approach makes it possible to build a model that maintains parameter identifiability, as explained above, and is therefore likely to be superior to a top-down approach.

また、ボトムアップ型のアプローチは、ホワイトボックス的なモデリングのアプローチとも類似している。ホワイトボックスモデリングは、基本的に演繹的、構成的なモデリング手法である。ホワイトボックスモデリングでは、物理的に意味のあるパラメータを用いて構成的にモデルを構築するのに対し、本実施形態のモジュラー型アプローチでは、物理的な意味を持たせることはできないものの、個々のパラメータを入出力と直接関係づけることにより解釈が容易なパラメータを同定し、それを用いて構成的にモデル構築している。 The bottom-up approach is also similar to the white-box modeling approach. White-box modeling is essentially a deductive, constructive modeling method. In white-box modeling, models are constructed constructively using parameters that have physical meaning, whereas in the modular approach of this embodiment, parameters that are easy to interpret are identified by directly relating each parameter to input and output, even though they cannot be given physical meaning, and models are constructed constructively using these parameters.

本実施形態のモジュラー型アプローチがＸＡＩ的なアプローチよりも合理的な説明性において優れていることは以下の事からも明らかである。例えば、先の例において、もしトップダウン的（ＸＡＩ的）なアプローチを行おうとすると、正則化やＰＬＳなどの方法を使って係数パラメータのａ１やａ２を求め、それに対して何等かの解釈を加える処理を施して結果を「Explainable（説明可能）」にする方法の確立を目ざす。しかしながら、上記の簡単な例からわかる様に、ａ１とａ２が０．２５という値からかけ離れた値で同定されていた場合（実際、正則化やＰＬＳを用いると、そのようになる可能性がある）には、いかなる方法を用いて「説明」を試みても、合理的な説明が原理的に困難であることは容易にわかる。例えば、ＰＬＳや正則化を用いてパラメータ値を求めることができても、ａ１かａ２のいずれかが負の値（例：ａ１＝３．５、ａ２＝－３）となっている場合には、合理的に納得のできる説明は不可能である。 The following makes it clear that the modular approach of this embodiment is superior to the XAI approach in terms of rational explainability. For example, in the previous example, if a top-down (XAI) approach were to be used, the goal would be to establish a method for determining the coefficient parameters a1 and a2 using methods such as regularization or PLS, and then to add some kind of interpretation to the results to make them "explainable." However, as can be seen from the simple example above, if a1 and a2 are identified as values far removed from 0.25 (which is indeed possible when using regularization or PLS), it is easy to see that, in principle, a rational explanation is difficult to achieve, no matter what method is used to attempt an "explanation." For example, even if parameter values can be determined using PLS or regularization, if either a1 or a2 is a negative value (e.g., a1 = 3.5, a2 = -3), a rationally convincing explanation is impossible.

つまり、本来合理的に納得できる値である0.25という値を求められていなかったとしても、可同定性が欠如している場合には、ある制約さえ満たしていればどのような内部パラメータ値であろうが予測精度だけに着目する限りは差異が見られないため、「精度の良いモデル」と見なされる。しかしながら、精度が良いモデルが、必ずしもそのモデルのパラメータ値の妥当性を補償する合理的で妥当なモデルを意味しないため、トップダウン的（ＸＡＩ的）なアプローチでは原理的に合理的な説明が困難になってしまう。 In other words, even if the value of 0.25, which is originally a reasonably acceptable value, was not obtained, in the absence of identifiability, as long as certain constraints are met, no difference will be seen as long as the focus is solely on prediction accuracy, regardless of the internal parameter values, and so the model will be considered to be of high accuracy. However, a high-accuracy model does not necessarily mean a rational and reasonable model that compensates for the validity of the model's parameter values, and so a top-down (XAI) approach makes it difficult in principle to provide a rational explanation.

なお、ＸＡＩ的なアプローチでは、ａ１とａ２とのそれぞれに対する個別の意味付けは困難であるが、ａ１＋ａ２に対する意味づけは可能であるため、説明不可能であるというわけではない。ただし、ＸＡＩ的なアプローチでは、内部に含まれる全てのパラメータに対して意味付けを行うことは原理的に困難になる可能性が高く、何等かの「特徴量」を抽出して、それに対して意味付けを求めることになる。 In an XAI approach, it is difficult to assign meaning to a1 and a2 individually, but it is possible to assign meaning to a1 + a2, so it is not impossible to explain. However, in principle, it is highly likely that an XAI approach will be difficult to assign meaning to all of the parameters contained within, so some kind of "feature" will be extracted and a meaning will be sought for that.

従って、「合理的な説明性の高いモデル」を構築することを主目的にするのであれば、そもそも、正則化やＰＬＳなどの数学的に悪条件を回避する様なアルゴリズムに頼る前に、可同定性が欠如する様な状態を回避してモデルの内部構造（内部パラメータ）の一意性を保証する様にモデルを構築しておき、合理的な説明性を担保しながらモデルを統合していくボトムアップ的、構成的、演繹的なモジュラー型アプローチの方が優れていると可能性が高い。 Therefore, if the main goal is to build a "model with high rational explainability," it is highly likely that a bottom-up, constructive, deductive, modular approach that builds models to avoid states that lack identifiability and ensure the uniqueness of the model's internal structure (internal parameters), and integrates models while ensuring rational explainability, would be superior, rather than relying on algorithms that avoid mathematically ill-conditioning such as regularization or PLS.

また、このように合理的説明性を担保したモデルを構築できることに加え、最初に述べた分離可能な構造とすることで、モデルの調整も容易にすることができ、これらを合わせて合理的説明可能かつ調整可能なモデル構築を行うことが可能になる。
本実施形態のモジュラー型時系列データ予測装置は、以上の洞察に基づいて成されたものである。 In addition to being able to build a model that ensures rational explainability in this way, the separable structure mentioned at the beginning also makes it easier to adjust the model, and by combining these, it becomes possible to build a model that is rationally explainable and adjustable.
The modular time-series data prediction device of this embodiment is based on the above insight.

以下、実施形態のモジュラー型時系列データ予測装置について、図面を参照して詳細に説明する。
図１は、一実施形態のモジュラー型時系列データ予測装置を適用した雨量流入予測システムを概略的に示す図である。 Hereinafter, a modular time-series data prediction device according to an embodiment will be described in detail with reference to the drawings.
FIG. 1 is a diagram illustrating a rainfall inflow prediction system to which a modular time-series data prediction device according to an embodiment of the present invention is applied.

以下では、本実施形態のモジュラー型時系列データ予測装置を搭載したシステムの実施イメージとその効果をより明確にするために、雨水流入予測システムを対象として説明する。なお、本実施形態のモジュラー型時系列データ予測装置が適用される対象プロセスは雨水流入予測システムに限定されるものではなく、２種類以上の計測項目の時系列データが存在する任意のプロセスに対して適用することができる。モジュラー型時系列データ予測装置は、例えば、鉄鋼プラント、石油化学プラント、食品プラント、製薬プロセス、半導体製造プロセス、発電プラント、交通監視設備、空調監視設備、などのプロセスに適用することができる。 The following describes a stormwater inflow prediction system to more clearly illustrate the implementation image and effects of a system equipped with the modular time series data prediction device of this embodiment. Note that the target process to which the modular time series data prediction device of this embodiment is applied is not limited to stormwater inflow prediction systems, but can be applied to any process in which time series data for two or more measurement items exists. The modular time series data prediction device can be applied to processes such as steel plants, petrochemical plants, food plants, pharmaceutical processes, semiconductor manufacturing processes, power plants, traffic monitoring equipment, and air conditioning monitoring equipment.

雨量流入予測システムの対象プロセス１は、流量計１１と、幹線流量計１２と、幹線水位計１３１-１３Ｋと、地上雨量計１４１-１４Ｍと、レーダ雨量計１５と、ポンプ井水位計１６１、１６２と、流入渠１７と、雨水ポンプ井１８と、雨水ポンプ１９と、流入ゲート１１０と、を含む都市雨水排水プロセス１である。レーダ雨量計１５は、Ｑ×Ｐメッシュのメッシュ毎のレーダ雨量計１５１１～１５ＱＰを含む。 The target process 1 of the rainfall inflow prediction system is an urban stormwater drainage process 1 that includes a flow meter 11, a main flow meter 12, main water level gauges 131-13K, ground rain gauges 141-14M, a radar rain gauge 15, pump well water level gauges 161 and 162, an inlet culvert 17, a stormwater pump well 18, a stormwater pump 19, and an inlet gate 110. The radar rain gauge 15 includes radar rain gauges 1511-15QP for each mesh in a QxP mesh.

都市雨水排水プロセス１の各種センサ１１－１５、１６１、１６２は、所定の周期（例えば、３０秒、６０秒）で、プロセスの状態を表す量や運転操作に関わる量などの計測を行う。各種センサ１１－１５、１６１、１６２で計測された値は、モジュラー型時系列データ予測装置で収集および保存される。 Various sensors 11-15, 161, and 162 in the urban stormwater drainage process 1 measure quantities that represent the state of the process and quantities related to operation at predetermined intervals (e.g., 30 seconds, 60 seconds). The values measured by the various sensors 11-15, 161, and 162 are collected and stored by a modular time-series data forecasting device.

流入渠１７は、下水管やポンプ場から送られてきた雨水が流れ込む渠である。
雨水ポンプ井１８は、流入渠１７から流入した雨水が貯められる貯水槽である。雨水ポンプ井１８に流入する雨水は、沈砂池で一緒に流れてきた砂などが予め取り除かれていてもよい。
雨水ポンプ１９は、雨水ポンプ井１８に貯められた雨水を強制的に河川等へ送り出すポンプである。 The inflow culvert 17 is a culvert into which rainwater sent from sewer pipes and pumping stations flows.
The stormwater pump well 18 is a water tank that stores the rainwater that flows in from the inflow conduit 17. The rainwater that flows into the stormwater pump well 18 may have sand and other contaminants removed in advance in a grit basin.
The rainwater pump 19 is a pump that forcibly pumps the rainwater stored in the rainwater pump well 18 into a river or the like.

流入ゲート１１０は、流入渠１７と雨水ポンプ井１８との間の流路に設けられ、開閉することにより流入渠１７から雨水ポンプ井１８に流入する雨水の量を調整するように動作する。流入ゲート１１０の前後には水位計１６１、１６２が設置され、例えば、水位計１６１、１６２により測定された値に応じて流入ゲート１１０の動作が制御される。 The inflow gate 110 is installed in the flow path between the inflow culvert 17 and the stormwater pump well 18, and operates to adjust the amount of stormwater flowing from the inflow culvert 17 to the stormwater pump well 18 by opening and closing it. Water level gauges 161 and 162 are installed before and after the inflow gate 110, and the operation of the inflow gate 110 is controlled, for example, according to the values measured by the water level gauges 161 and 162.

図２は、一実施形態のモジュラー型時系列データ予測装置の一構成例を概略的に示す図である。
本実施形態のモジュラー型時系列データ予測装置は、データ収集保存部２と、データ抽出部３と、ペアワイズ予測モデル同定部４と、予測モデル合成法定義部５と、ペアワイズ出力変数予測部６と、合成出力変数予測部７と、予測誤差評価部８と、ペアワイズ予測モデル修正部９と、出力予測結果観測部１０と、を備えている。
本実施形態のモジュラー型時系列データ予測装置の構成は、例えば、ＣＰＵやＭＰＵなどのプロセッサを少なくとも１つと、プロセッサにより実行されるプログラムが格納されるメモリとを備えた演算装置であって、種々の機能をソフトウェアにより、若しくは、ソフトウェアとハードウエアとの組み合わせにより実現することが出来る。 FIG. 2 is a diagram illustrating an example of the configuration of a modular time-series data prediction device according to an embodiment.
The modular time-series data prediction device of this embodiment includes a data collection and storage unit 2, a data extraction unit 3, a pairwise prediction model identification unit 4, a prediction model synthesis method definition unit 5, a pairwise output variable prediction unit 6, a synthesis output variable prediction unit 7, a prediction error evaluation unit 8, a pairwise prediction model correction unit 9, and an output prediction result observation unit 10.
The configuration of the modular time-series data prediction device of this embodiment is, for example, a computing device including at least one processor such as a CPU or an MPU, and a memory in which a program executed by the processor is stored, and various functions can be realized by software or a combination of software and hardware.

データ収集保存部２は、各種センサ１１－１５、１６１、１６２で計測されたプロセス変数の値を収集し保存する。データ収集保存部２は、出力変数データ選択部２１と、入力変数データ選択部２２とを備えている。 The data collection and storage unit 2 collects and stores the values of process variables measured by various sensors 11-15, 161, and 162. The data collection and storage unit 2 includes an output variable data selection unit 21 and an input variable data selection unit 22.

出力変数データ選択部２１は、各種センサ１１－１５、１６１、１６２で計測された値の少なくともいずれかの量を出力変数として選択する。
入力変数データ選択部２２は、各種センサ１１－１５、１６１、１６２で計測された値から、複数の入力変数として選択する。なお複数の入力変数の数は、複数のプロセス変数の数以下であればよい。 The output variable data selection unit 21 selects at least one of the quantities measured by the various sensors 11-15, 161, and 162 as an output variable.
The input variable data selection unit 22 selects a plurality of input variables from the values measured by the various sensors 11-15, 161, and 162. The number of the plurality of input variables may be equal to or less than the number of the plurality of process variables.

データ抽出部３は、データ収集保存部２に保存された時系列データの中から所定のデータを抽出する。データ抽出部３は、オフライン予測モデル同定用データ抽出部３１と、オンライン予測用データ抽出部３２と、評価用データ抽出部３３とを備えている。 The data extraction unit 3 extracts predetermined data from the time-series data stored in the data collection and storage unit 2. The data extraction unit 3 includes an offline prediction model identification data extraction unit 31, an online prediction data extraction unit 32, and an evaluation data extraction unit 33.

オフライン予測モデル同定用データ抽出部３１は、データ収集保存部２から、所定の同定周期若しくは外部からの要求により、予測モデル同定用のデータとして各種プロセスセンサ１１－１５、１６１、１６２で計測された（当該時刻よりも過去の）所定の期間のデータを抽出する。 The offline prediction model identification data extraction unit 31 extracts data measured by the various process sensors 11-15, 161, and 162 for a specified period (prior to the specified time) from the data collection and storage unit 2 at a specified identification interval or in response to an external request, as data for prediction model identification.

オンライン予測用データ抽出部３２は、データ収集保存部２から、所定の予測周期で、各種プロセスセンサ１１－１５、１６１、１６２の計測値からオンラインの予測に必要となる予測用データを、リアルタイムで抽出する。 The online prediction data extraction unit 32 extracts prediction data required for online prediction from the measurement values of various process sensors 11-15, 161, and 162 in real time from the data collection and storage unit 2 at a predetermined prediction period.

評価用データ抽出部３３は、データ収集保存部２から、所定の評価周期もしくは外部からの要求により、予測誤差評価に必要となる各種プロセスセンサ１１－１５、１６１、１６２の計測値を、リアルタイムで抽出する。 The evaluation data extraction unit 33 extracts, in real time, the measurement values of the various process sensors 11-15, 161, and 162 required for prediction error evaluation from the data collection and storage unit 2 at a predetermined evaluation interval or in response to an external request.

ペアワイズ予測モデル同定部４は、オフライン予測モデル同定用データ抽出部３１で抽出された計測値のデータを用いて、出力変数データ選択部２１で選択された出力変数および入力変数データ選択部２２で選択された入力変数を変数毎にペアワイズとし、後述する方法により予測モデルを同定する。 The pairwise prediction model identification unit 4 uses the measurement value data extracted by the offline prediction model identification data extraction unit 31 to pair the output variables selected by the output variable data selection unit 21 and the input variables selected by the input variable data selection unit 22 for each variable, and identifies a prediction model using the method described below.

予測モデル合成法定義部５は、ペアワイズ予測モデル同定部４で同定したペアワイズの予測モデルの出力値（ペアワイズ予測値）から、出力変数（出力変数が複数の場合は出力変数毎）を合成する方法を定義する。 The prediction model synthesis method definition unit 5 defines a method for synthesizing output variables (for each output variable if there are multiple output variables) from the output values (pairwise prediction values) of the pairwise prediction model identified by the pairwise prediction model identification unit 4.

ペアワイズ出力変数予測部６は、ペアワイズ予測モデル同定部４で同定したペアワイズの予測モデルに対して、時間の進行方向における所定の監視周期で、オンライン予測用データ抽出部３２から抽出した入力変数各々に対応する予測用データを入力して、入力変数毎に出力変数（出力変数が複数の場合は出力変数毎）のペアワイズの予測を行う。 The pairwise output variable prediction unit 6 inputs prediction data corresponding to each input variable extracted from the online prediction data extraction unit 32 into the pairwise prediction model identified by the pairwise prediction model identification unit 4 at a predetermined monitoring cycle in the time progression direction, and performs pairwise predictions of the output variable for each input variable (for each output variable if there are multiple output variables).

合成出力変数予測部７は、ペアワイズ出力変数予測部６の出力であるペアワイズの予測値に対して、予測モデル合成法定義部５で定義した方法に従って合成した出力変数の予測値を出力する。 The composite output variable prediction unit 7 outputs a predicted value of the output variable synthesized according to the method defined in the prediction model synthesis method definition unit 5, based on the pairwise predicted value output by the pairwise output variable prediction unit 6.

予測誤差評価部８は、ペアワイズ出力変数予測部６の出力であるペアワイズの予測値と、合成出力変数予測部７の出力であるペアワイズの予測値とを合成した予測値を保存し、所定の周期もしくは所定のタイミングで、評価用データ抽出部３３により抽出した出力変数の実績値と保存したペアワイズおよび合成予測値との誤差を、各々評価する。 The prediction error evaluation unit 8 stores a prediction value that is a composite of the pairwise prediction value output by the pairwise output variable prediction unit 6 and the pairwise prediction value output by the composite output variable prediction unit 7, and evaluates the error between the actual value of the output variable extracted by the evaluation data extraction unit 33 and the stored pairwise and composite prediction values at a predetermined interval or at a predetermined timing.

ペアワイズ予測モデル修正部９は、予測誤差評価部８によって評価した予測誤差に基づいて、ペアワイズ予測モデルの調整の必要性の有無を判断し、必要なペアワイズ予測モデルの修正を行う。ペアワイズ予測モデル修正部９は、例えば、ペアワイズ予測モデルのパラメータ調整、もしくは、ペアワイズ予測モデルの削除、もしくは、ペアワイズ予測モデルの統合、を行うことができる。ペアワイズ予測モデル修正部９は、修正したペアワイズ予測モデルをペアワイズ出力変数予測部６に供給する。 The pairwise prediction model correction unit 9 determines whether or not adjustment of the pairwise prediction model is necessary based on the prediction error evaluated by the prediction error evaluation unit 8, and performs the necessary pairwise prediction model correction. The pairwise prediction model correction unit 9 can, for example, adjust the parameters of the pairwise prediction model, delete the pairwise prediction model, or integrate the pairwise prediction models. The pairwise prediction model correction unit 9 supplies the corrected pairwise prediction model to the pairwise output variable prediction unit 6.

出力予測結果観測部１０は、例えばモニタなどの表示手段と、ユーザインタフェースとを備え、合成出力変数予測部７による合成出力変数の予測値を少なくとも観測者に提示することができる。出力予測結果観測部１０は、例えば、パーソナルコンピュータ、タブレット端末や、スマートフォンなどの携帯端末であってもよい。出力予測結果観測部１０は、必要に応じて、ペアワイズ出力変数予測部６によるペアワイズ予測モデルの複数の出力と、それから合成される予測分布情報の少なくとも一方を、観測者に更に提示してもよい。 The output prediction result observation unit 10 is equipped with a display means such as a monitor and a user interface, and is capable of presenting at least the predicted value of the composite output variable by the composite output variable prediction unit 7 to the observer. The output prediction result observation unit 10 may be, for example, a personal computer, a tablet terminal, or a mobile terminal such as a smartphone. If necessary, the output prediction result observation unit 10 may further present to the observer at least one of multiple outputs of the pairwise prediction model by the pairwise output variable prediction unit 6 and the predictive distribution information synthesized from them.

次に、本実施形態のモジュラー型時系列データ予測装置の動作について説明する。
例えば都市雨水排水プロセス１では、各種プロセスセンサ１１－１５、１６１、１６２によって、所定の周期でプロセスの情報が計測され、データ収集保存部２に計測値（プロセス変数の値）が供給される。データ収集保存部２は、収集した計測値を、予め決められたフォーマットに従って、時系列データとして保存する。 Next, the operation of the modular time-series data prediction device of this embodiment will be described.
For example, in an urban stormwater drainage process 1, various process sensors 11-15, 161, and 162 measure process information at predetermined intervals, and the measured values (values of process variables) are supplied to the data collection and storage unit 2. The data collection and storage unit 2 stores the collected measured values as time-series data in accordance with a predetermined format.

出力変数データ選択部２１は、各種プロセスセンサ１１－１５、１６１、１６２で計測されたプロセス変数の中から予測の対象となる変数を選択して、選択した変数を出力変数と設定する。
例えば都市雨水排水プロセス１では、通常雨水ポンプ井１８への流入量を予測の対象とする場合には、出力変数データ選択部２１は、流量計（雨水ポンプ井流入量計）１１で計測されている雨水ポンプ井流入量が予測対象の出力変数として選択する。 The output variable data selection unit 21 selects variables to be predicted from among the process variables measured by the various process sensors 11-15, 161, and 162, and sets the selected variables as output variables.
For example, in the urban stormwater drainage process 1, when the inflow volume into the normal stormwater pump well 18 is to be predicted, the output variable data selection unit 21 selects the stormwater pump well inflow volume measured by the flow meter (stormwater pump well inflow volume meter) 11 as the output variable to be predicted.

また、流量計（雨水ポンプ井流入量計）１１が設置されていない様な場合には、雨水ポンプ井水位計１６１、１６２の計測値と、雨水ポンプ１９および流入ゲート１１０の運用状態とから、雨水ポンプ井１８への流入量を計算によって求めて、これを予測対象として選択しても良い。この場合、出力変数データ選択部２１は、流入量の計算値を計算して保存しておき、流入量の計算値を出力変数として選択する。 Also, if a flow meter (rainwater pump well inflow meter) 11 is not installed, the inflow volume into the rainwater pump well 18 can be calculated from the measurement values of the rainwater pump well water level gauges 161, 162 and the operational status of the rainwater pump 19 and inflow gate 110, and this can be selected as the prediction target. In this case, the output variable data selection unit 21 calculates and saves the calculated value of the inflow volume, and selects the calculated value of the inflow volume as the output variable.

また、例えば都市雨水排水プロセス１の複数の項目を予測する場合には、出力変数データ選択部２１は、複数の項目を出力変数として選択してもよい。都市雨水排水プロセス１においても、例えば、雨水ポンプ井１８に複数の箇所から雨水の流入があるような場合は、出力変数データ選択部２１は、複数の流入箇所各々における流入量を出力変数として選択してもよい。 Furthermore, for example, when predicting multiple items in the urban stormwater drainage process 1, the output variable data selection unit 21 may select multiple items as output variables. Even in the urban stormwater drainage process 1, for example, when stormwater flows into the stormwater pump well 18 from multiple locations, the output variable data selection unit 21 may select the inflow volume at each of the multiple inflow locations as an output variable.

入力変数データ選択部２２は、出力変数データ選択部２１で選択した出力変数に影響を与える可能性のある測定値を入力変数の候補として選択する。影響を与えるか否かについて事前情報が全く無い場合には、入力変数データ選択部２２は、全てのプロセス変数を機械的に入力変数としておいても良い。入力変数データ選択部２２は、出力変数として選択した変数を入力変数としても選択して構わない。例えば、時系列のデータ解析では、出力変数として選択した変数自身の過去の値から未来の値を予測する所謂自己回帰を検討することも多くある。このような場合には、流量計（雨水ポンプ井流入量計）１１による雨水ポンプ井流入量は、出力変数と入力変数との両方として選択されても構わない。 The input variable data selection unit 22 selects as candidate input variables measured values that may affect the output variables selected by the output variable data selection unit 21. If there is no prior information as to whether or not a variable will have an effect, the input variable data selection unit 22 may automatically set all process variables as input variables. The input variable data selection unit 22 may also select variables selected as output variables as input variables. For example, in time series data analysis, so-called autoregression is often considered, in which future values of the variable selected as the output variable are predicted from its own past values. In such cases, the stormwater pump well inflow rate measured by the flow meter (stormwater pump well inflow rate meter) 11 may be selected as both an output variable and an input variable.

また、出力変数データ選択部２１で選択した出力変数が複数ある場合、入力変数データ選択部２２は、ある出力変数に対して別の出力変数を入力変数として選択してもよい。このようにすることで、複数の出力変数同士の相関を考慮することが可能になる。一方、複数の出力変数が異なる流入箇所の流入量であるときには、プロセスの物理的な構造上、出力変数同士の関連が無い場合が多い。この場合、入力変数データ選択部２２は、ある出力変数に対する入力変数として別の出力変数を除外しておくこともできる。 Furthermore, when there are multiple output variables selected by the output variable data selection unit 21, the input variable data selection unit 22 may select another output variable as an input variable for a given output variable. This makes it possible to consider the correlation between multiple output variables. On the other hand, when multiple output variables are inflow amounts at different inflow points, there is often no correlation between the output variables due to the physical structure of the process. In this case, the input variable data selection unit 22 can also exclude another output variable as an input variable for a given output variable.

入力変数データ選択部２２は、その他の変数に対しても、例えば、物理的な管渠の接続関係などの情報から明らかに出力変数に無関係な変数や因果関係（入力（原因）と出力（結果）との関係）が物理的に成立しない変数を予め入力変数から除外しておくこともできる。 The input variable data selection unit 22 can also exclude from the input variables in advance other variables, such as variables that are clearly unrelated to the output variables based on information such as the physical connection relationships of pipes, or variables for which the causal relationship (the relationship between input (cause) and output (result)) does not physically hold.

例えば、雨水ポンプ井水位計１６１、１６２により計測される雨水ポンプ井水位は、出力変数として選択した流量計（雨水ポンプ井流入量計）１１による雨水ポンプ井流入量の影響を受けて変化することは物理的に考えて明らかであるから、雨水ポンプ井水位を入力変数から除外しておく。 For example, it is physically clear that the rainwater pump well water level measured by the rainwater pump well water level gauges 161 and 162 will change due to the influence of the rainwater pump well inflow measured by the flow meter (rainwater pump well inflow meter) 11 selected as the output variable, so the rainwater pump well water level is excluded from the input variables.

入力変数データ選択部２２は、物理的な構造の関係が明確でない変数について予め出力変数データ選択部２１で選択した出力変数との相関解析を予備的に実施し、相関係数の絶対値が所定の値（例えば０．５）未満の変数を、予め入力変数から除外しておいてもよい。さらに、入力変数データ選択部２２は、相関解析を実施する際に、解析対象の変数が測定された時間をずらしながら相関解析を行い、最も相関の高くなる時のずらした時間Ｌを求め、時間Ｌの符号情報から、因果関係を満たさない符号を持つ変数を入力変数から予め除外しておいてもよい。なお、先の雨水ポンプ井水位の変数は、このような解析によっても入力変数から除外される可能性が高い。 The input variable data selection unit 22 may perform a preliminary correlation analysis between variables with unclear physical structural relationships and the output variables selected in advance by the output variable data selection unit 21, and may exclude variables with absolute values of correlation coefficients less than a predetermined value (e.g., 0.5) from the input variables in advance. Furthermore, when performing the correlation analysis, the input variable data selection unit 22 may perform the correlation analysis while shifting the time at which the variable being analyzed is measured, determine the shifted time L at which the correlation is highest, and, based on the sign information of time L, exclude variables with signs that do not satisfy a causal relationship from the input variables in advance. It is likely that the stormwater pump well water level variable mentioned above will also be excluded from the input variables through such an analysis.

本実施形態のモジュラー型時系列データ予測装置では、入力変数データ選択部２２は、雨水水位計１６１、１６２により計測された雨水ポンプ井水位のみを除いた流量計（雨水ポンプ井流入量計）１１による雨水ポンプ井流入量と、幹線流量計１２により計測された幹線流入量と、幹線水位計１３１-１３Ｋにより計測された幹線水位と、地上雨量計１４１-１４Ｍにより計測された地上雨量と、Ｑ×Ｐメッシュのメッシュ１５１１-１５ＱＰ各々におけるレーダ雨量とを入力変数として選択するものとする。 In the modular time series data prediction device of this embodiment, the input variable data selection unit 22 selects as input variables the rainwater pump well inflow measured by the flow meter (rainwater pump well inflow meter) 11, excluding only the rainwater pump well water level measured by the rainwater level gauges 161 and 162, the main inflow measured by the main flow meter 12, the main water level measured by the main water level gauges 131-13K, the ground rainfall measured by the ground rain gauges 141-14M, and the radar rainfall at each of the meshes 1511-15QP of the QxP mesh.

なお、本実施形態のモジュラー型時系列データ予測装置において、例えば、Ｋ個の幹線水位計１３１～１３Ｋは、最近、下水管渠内の水位情報を正確に把握しようとする行政の動向などから、Ｋの値が１０～１００程度の比較的多数の幹線水位計が設置されることも多く、また、気象レーダは国土交通省が全国にＸＲＡＩＮという気象レーダを設置していることから、多くの場所で取り入れることができ、対象とする雨水排水区の大きさにもよるが、Ｑ×Ｐメッシュのメッシュサイズは１００程度になることも珍しくないため、実際に計測される時系列データの項目数（変数の数）は、かなり多くなる。また、本実施形態のモジュラー型時系列データ予測装置が適用される都市雨水排水プロセス１に限らず、計測変数の項目数が数百～数千に及ぶことは一般的である。 In the modular time series data prediction device of this embodiment, for example, K main water level gauges 131-13K are often installed with a relatively large number of main water level gauges, with K being around 10-100, due to recent government trends toward accurately grasping water level information in sewer pipes. Furthermore, weather radar can be installed in many locations because the Ministry of Land, Infrastructure, Transport and Tourism has installed a weather radar called XRAIN nationwide. Depending on the size of the target stormwater drainage district, it is not uncommon for the mesh size of a QxP mesh to be around 100. Therefore, the number of time series data items (number of variables) actually measured is quite large. Furthermore, the number of measured variables typically ranges from hundreds to thousands, not just in urban stormwater drainage process 1, to which the modular time series data prediction device of this embodiment is applied.

本実施形態のモジュラー型時系列データ予測装置では、入力変数は、例えば、雨水ポンプ井水位計１６１、１６２による計測値のみを除いた流量計（雨水ポンプ井流入量計）１１による雨水ポンプ井流入量の計測値と、幹線流量計１２による幹線流入量の計測値と、幹線水位計１３１～１３ＫによるＫ個の幹線水位の計測値と、地上雨量計１４１～１４ＭによるＭ個の地上雨量の計測値と、Ｑ×Ｐメッシュ１５１１～１５ＱＰの各メッシュのレーダ雨量の計測値と、である。以下では、入力変数の数をｐ（正の整数）として説明する。 In the modular time series data prediction device of this embodiment, the input variables are, for example, the rainwater pump well inflow measured by the flow meter (rainwater pump well inflow meter) 11, excluding only the measurements by the rainwater pump well level gauges 161 and 162; the main inflow measured by the main flow meter 12; K main water level measurements by the main water level gauges 131-13K; M ground rainfall measurements by the ground rain gauges 141-14M; and the radar rainfall measurements for each mesh of the Q×P mesh 1511-15QP. In the following explanation, the number of input variables is assumed to be p (a positive integer).

オフライン予測モデル同定用データ抽出部３１は、予測モデルを構築する人が指定した所定期間のデータを用いて要求されたタイミングで、先に出力変数データ選択部２１と入力変数データ選択部２２との該当する所定期間分の時系列データを取得する。オフライン予測モデル同定用データ抽出部３１は、予測モデルの構築を定期的に行い、モデルを定期的に更新したい場合には、時間の進行方向において所定の周期ＴＬで所定期間の入出力変数の時系列データを定期的に抽出しても良い。このようにして抽出されたデータセットをｕｋ、ｋ＝１、２、…ｐと記載する。ｕｋは列ベクトルであるとし、各行が時系列データの各時刻のサンプルに対応する。同様に出力変数である雨水流入量の時系列データセットｙとしておく、オフライン予測モデル同定用データ抽出部３１は、データセットｕｋとデータセットｙとを所定の周期で抽出する作用を持つ。 The offline prediction model identification data extraction unit 31 first acquires time series data for the specified period from the output variable data selection unit 21 and the input variable data selection unit 22 at the requested timing using data for the specified period specified by the person building the prediction model. If the offline prediction model identification data extraction unit 31 periodically builds a prediction model and periodically updates the model, it may periodically extract time series data of input and output variables for the specified period at a specified cycle TL in the direction of time progression. The dataset extracted in this manner is represented as uk, where k = 1, 2, ... p. uk is a column vector, with each row corresponding to a sample at each time in the time series data. Similarly, the time series dataset for stormwater inflow, which is the output variable, is assumed to be y. The offline prediction model identification data extraction unit 31 extracts dataset uk and dataset y at a specified cycle.

次に、ペアワイズ予測モデル同定部４では、ｕｋとｙとを用いて、（ｕ１、ｙ）、（ｕ２、ｙ）、…（ｕｐ、ｙ）のｐ個のペア毎に予測モデルを同定する。本実施形態のモジュラー型時系列データ予測装置では、出力変数ｙは、流量計（雨水ポンプ井流入量計）１１による雨水ポンプ井流入量のみの一つであるが、複数（ｈ個とする）の出力ｙがある場合には、ｈ個の各々のｙに対して同様の処理を行えば良く、以降の処理は出力変数の数がｌ個に増えた場合でも、ｈ回繰り返して同じ処理を行えばよいだけであるので、一般性を失う事なく出力変数は１個であるとしておく。 Next, the pairwise prediction model identification unit 4 uses uk and y to identify a prediction model for each of the p pairs (u1, y), (u2, y), ... (up, y). In the modular time series data prediction device of this embodiment, the output variable y is only one, the stormwater pump well inflow rate measured by the flow meter (stormwater pump well inflow rate meter) 11. However, if there are multiple (say h) outputs y, the same processing can be performed for each of the h y's. Even if the number of output variables increases to l, the same processing can be repeated h times. Therefore, without loss of generality, we will assume that there is only one output variable.

最も典型的な予測モデルは、次式で表せる線形回帰の形をしたモデルを仮定して同定を行うものである。
y(t)=a1×uk(t-L)+a2×uk(t-L-1)+…+an×uk(t-L-n+1)+c （５）
ここで、ａｋ、ｋ＝１、２、…、ｎとｃは同定すべきパラメータである。ｃは入力ｕｋと出力ｙの平均の差を表すバイアスパラメータであるが、予めｕｋとｙから平均値を除去して平均を０としておくことにより、常に０とすることができるので、以下では簡単のためにｃは０としておく。 The most typical prediction model is one that performs identification by assuming a linear regression model that can be expressed by the following equation:
y(t)=a1×uk(tL)+a2×uk(tL-1)+…+an×uk(tL-n+1)+c (5)
Here, a, k = 1, 2, ..., n and c are parameters to be identified. c is a bias parameter that represents the difference between the averages of the input uk and the output y. However, by removing the average values from uk and y in advance and setting the average to 0, c can always be set to 0. Therefore, for simplicity, c will be set to 0 below.

上記（５）式において、ｕｋがｙ自身ではない場合を、ディジタル信号処理やシステム同定の分野では、有限インパルス応答モデル（ＦＩＲ）モデルと呼ぶ（入力変数に対する重み付き移動平均モデルと呼ぶこともできる）。一方、ｕｋがｙ自身と一致する場合は自己回帰モデル（ＡＲモデル）と呼ばれる。ここで重要な事は（５）式の線形回帰の形であっても、複数の入力変数による有限インパル応答を足し合わせ、さらに自己回帰を足し合わせた多入力１出力（複数の出力を同時に考える場合は多入力多出力）の線形回帰の形（ＡＲＸ（Autoregressive eXogenous Input）などの形）ではなくて、あくまでも１入力１出力の形の線形回帰モデルになっている点であり、これが「ペアワイズ」の意味である。 In the fields of digital signal processing and system identification, when uk in equation (5) above is not y itself, it is called a finite impulse response (FIR) model (it can also be called a weighted moving average model for input variables). On the other hand, when uk is equal to y itself, it is called an autoregressive (AR) model. What is important here is that even though it is in the form of linear regression in equation (5), it is not a multi-input, single-output (multi-input, multi-output when multiple outputs are considered simultaneously) linear regression form (such as ARX (Autoregressive eXogenous Input)) in which finite impulse responses from multiple input variables are added together and then autoregression is added, but rather it is still a linear regression model with one input and one output; this is the meaning of "pairwise."

ペアワイズで同定することの第一の利点は、「複数の入力」という概念自身が無くなるため、先に述べた（複数の入力変数間の関係から生じる）多重共線性の問題が決して生じる事はなく、この問題を本質的に回避できることを含め、パラメータの可同定性の観点では多入力１出力のシステムよりも有利であるからである。これを若干補足する。システム同定の分野では、入力信号ｕｋに含まれる周波数成分の量（数）によって同定可能なパラメータの数が変化することが持続的励振条件（ＰＥ（Persistent Exciting）条件）と呼ばれるパラメータの可同定性（＝パラメータを一意に決定可能）条件として知られており、同定（推定）すべきパラメータａｋの数が多くなるほど、入力信号ｕｋは多数の周波数成分を含まなければならない事が知られている。 The first advantage of pairwise identification is that, because the concept of "multiple inputs" itself is eliminated, the problem of multicollinearity mentioned earlier (which arises from the relationship between multiple input variables) never arises. This essentially avoids this problem, making it more advantageous than multiple-input, single-output systems in terms of parameter identifiability. To expand on this a bit, in the field of system identification, the condition for parameter identifiability (i.e., parameters can be uniquely determined) known as the persistent excitation condition (PE condition) is that the number of identifiable parameters changes depending on the amount (number) of frequency components contained in the input signal uk. It is known that the greater the number of parameters ak to be identified (estimated), the greater the number of frequency components the input signal uk must contain.

一方、本実施形態のモジュラー型時系列データ予測装置において、入力変数の値は、予測モデルの構築者が制御（調整）できるものではなく、所与の観測（計測）情報であるから、どれくらいの周波数成分を含んでいるか否かは、外部条件として与えられるものであり、パラメータの数は少なければ少ないほど可同定性が向上し、逆にパラメータの数が多ければ多いほど可同定性が劣化する可能性が高くなる。より厳密に述べると、パラメータの数が少ない場合に可同定性が悪くなる可能性は０％であり、パラメータ数が少ないものの可同定性の方が高いか、悪くてもパラメータ数が多いものの可同定性と同じである。従って、上記（５）式の１入力１出力のモデルではｎ個のパラメータを含むのに対し、ｐ入力１出力の予測モデルを考えるとｐ×ｎ個のパラメータを含むことになるため、入力数ｐが多くなればなるほどパラメータを一意に同定することが難しくなる可能性が高くなる事がわかる。 On the other hand, in the modular time series data prediction device of this embodiment, the values of the input variables cannot be controlled (adjusted) by the builder of the prediction model, but are given observation (measurement) information. Therefore, the number of frequency components included is given as an external condition. The fewer the number of parameters, the better the identifiability, and conversely, the more parameters there are, the greater the likelihood that identifiability will deteriorate. More precisely, when the number of parameters is small, the likelihood of identifiability deteriorating is 0%, and identifiability with a small number of parameters is higher, or at worst, the same as with a large number of parameters. Therefore, while the one-input, one-output model in equation (5) above contains n parameters, a p-input, one-output prediction model will contain p x n parameters. Therefore, it can be seen that the greater the number of inputs p, the greater the likelihood that it will be difficult to uniquely identify the parameters.

さらに、先に述べた入力変数間の従属関係（相関関係）があると、多重共線性の問題が生じ、同定すべきパラメータ数の多寡に関わらず可同定性が失われ、入力情報は所与のものであるから、いかなる方法を用いても原理的にパラメータ値を一意に決定することが不可能になる。これに対し、１入力１出力の（５）式のモデルでは多重共線性の問題が原理的に生じることはなく、（５）式の形でパラメータが同定できなくなる場合は、入力ｕｋに十分な周波数情報が含まれていない時に限られ、この場合は、次数ｎを調整して同定すべきパラメータ数を調整することで、パラメータａｋを必ず可同定（一意に同定可能）にすることができる。 Furthermore, if there is a dependency (correlation) between the input variables mentioned above, the problem of multicollinearity arises, and identifiability is lost regardless of the number of parameters to be identified. Because the input information is given, it is, in principle, impossible to uniquely determine parameter values no matter what method is used. In contrast, the problem of multicollinearity does not, in principle, arise in the one-input, one-output model of equation (5). The only time parameters cannot be identified in the form of equation (5) is when the input uk does not contain sufficient frequency information. In this case, by adjusting the order n and the number of parameters to be identified, it is possible to ensure that parameter ak is identifiable (uniquely identifiable).

例えばｕｋの値が一定値である場合、uk(t-L)=uk(t-L-1)=…=uk(t-L-n+1)となるが、このような場合はｎ＝１として一つのパラメータａ１だけを同定する様にすればパラメータの可同定性を維持することができる。一般的には、所与のｕｋを用いて上記の持続的励振条件を調べることで同定可能なパラメータ数ｎの上限値ｎｍａｘを求めることができるので、ｎ≦ｎｍａｘの範囲内で、例えば、ＡＩＣ（赤池情報量規範）やＢＩＣ（ベイズ情報量規範）、ＭＤＬ基準（最小符号化基準）などの各種の規範および基準を用いたり、交差検証（クロスバリデーション）を行ったりすることで、ｎの適正値を決めて、（５）式の可同定性を常に維持するこができる。 For example, if the value of uk is constant, then uk(t-L) = uk(t-L-1) = ... = uk(t-L-n+1). In such cases, parameter identifiability can be maintained by setting n = 1 and identifying only one parameter, a1. Generally, the upper limit nmax of the number of identifiable parameters n can be determined by examining the above continuous excitation conditions using a given uk. Therefore, by using various criteria and standards, such as AIC (Akaike Information Criterion), BIC (Bayesian Information Criterion), and MDL criterion (Minimum Decoding Criterion), or by performing cross-validation within the range of n≦nmax, an appropriate value for n can be determined, thereby consistently maintaining the identifiability of equation (5).

ペアワイズで同定することの第二の利点は、たとえ、出力変数を除く入力変数が１つの場合であっても、この方法では出力変数を入力とする自己回帰ＡＲモデルと入力変数（出力変数ではない入力変数）を入力とするＦＩＲモデルとの二つのモデルを分離して、同定することによる効果である。このような場合、ペアワイズでない通常の方法では、ＡＲＸモデルなどを用いて、自己回帰成分（ＡＲ成分）と重み付き移動平均成分（ＦＩＲ成分／ＭＡ成分）とを同時に同定することになるが、多くの場合、出力変数と入力変数との関係（相関）よりも出力変数自身の相関（自己相関）の影響の方が強い場合が多く、予測に対する自己回帰成分の影響が極めて強くなる場合が多い。このような場合、入力変数の値が急変した場合にその変化に追従できなくなり、予測が遅れるという現象が生じやすい。 The second advantage of pairwise identification is that even if there is only one input variable excluding the output variable, this method separates and identifies two models: an autoregressive AR model that uses the output variable as input, and an FIR model that uses the input variable (an input variable that is not an output variable) as input. In such cases, conventional non-pairwise methods would simultaneously identify the autoregressive component (AR component) and weighted moving average component (FIR component/MA component) using an ARX model or the like. However, in many cases, the correlation between the output variable itself (autocorrelation) has a stronger effect than the relationship (correlation) between the output variable and the input variable, and the autoregressive component often has an extremely strong effect on the prediction. In such cases, if the value of the input variable changes suddenly, the system will be unable to track the change, and predictions are likely to be delayed.

本実施形態のモジュラー型時系列データ予測装置の場合、例えば、出力変数はポンプ井流入量であり、入力変数として１か所の地上雨量を仮定すると、流入量に対する自己回帰成分の影響の方が地上雨量に対するＦＩＲ成分の影響よりも強くなり、降雨の変化により地上雨量の値が急変した場合でも流入量に変化が現れるまでは予測値の変化が小さく、予測値が実測値に対して遅れるという現象が生じやすい。これに対し、流入量に対する自己回帰モデル（ＡＲモデル）と地上雨量に対する有限インパルス応答モデル（ＦＩＲモデル）とを各々個別に同定するペアワイズの方法では、監視員等のユーザは、予測に対するＡＲ成分の影響とＦＩＲ成分の影響とを調整することができる。 In the case of the modular time series data forecasting device of this embodiment, for example, if the output variable is pump well inflow and the input variable is ground rainfall at one location, the influence of the autoregressive component on the inflow will be stronger than the influence of the FIR component on the ground rainfall. Even if the ground rainfall value changes suddenly due to a change in rainfall, the change in the predicted value will be small until a change in the inflow appears, and the predicted value will likely lag behind the actual measured value. In contrast, with a pairwise method that separately identifies an autoregressive model (AR model) for the inflow and a finite impulse response model (FIR model) for the ground rainfall, a user such as a supervisor can adjust the influence of the AR component and the FIR component on the forecast.

ペアワイズで同定することの第三の利点は、第二の利点と密接に関係するが、各入力変数に対する（１）式のモデルの意味が明確であることである。ここで、先と同じ様に、出力変数を除く入力変数が１つの場合である下記（２）式の単純なＡＲＸモデルを考えてみる。
y(t)=a1×y(t-1)+ a2×y(t-2)+…+any×y(t-ny)
+b1×u(t-L)+ b2×u(t-L-1)+…+bnu×u(t-L-nu+1) （６） The third advantage of pairwise identification, which is closely related to the second advantage, is that the meaning of the model in equation (1) for each input variable is clear. Here, as before, consider the simple ARX model in equation (2) below, which has one input variable excluding the output variable.
y(t)=a1×y(t-1)+ a2×y(t-2)+…+any×y(t-ny)
+b1×u(tL)+ b2×u(tL-1)+…+bnu×u(tL-nu+1) (6)

この時、上記（６）式をＡＲＸモデルとして同定すると、以下のｙに関する項（Ｙと定義する）とｕに関する項（Ｕと定義する）は、ＹとＵの意味は明確ではなく、ＹとＵとを個別に意味付けして説明することは困難である。 In this case, if the above equation (6) is identified as an ARX model, the meanings of the following terms related to y (defined as Y) and u (defined as U) are unclear, and it is difficult to explain the individual meanings of Y and U.

Y：=a1×y(t-1)+ a2×y(t-2)+…+any×y(t-ny) （７）
U：＝b1×u(t-L)+ b2×u(t-L-1)+…+bnu×u(t-L-nu+1) （８）
一方、（７）式と（８）式とは、各々（５）式の形をしているので、ペアワイズで同定を行うと、ＹとＵとは各々ｙの予測値を意味することになるので、ＹとＵとは、各々別の説明変数（入力変数）に対する出力ｙの予測値であるという解釈をすることができる。 Y:=a1×y(t-1)+ a2×y(t-2)+…+any×y(t-ny) (7)
U:=b1×u(tL)+ b2×u(tL-1)+…+bnu×u(tL-nu+1) (8)
On the other hand, since equations (7) and (8) each have the form of equation (5), when pairwise identification is performed, Y and U each mean a predicted value of y, and therefore Y and U can be interpreted as a predicted value of output y for each separate explanatory variable (input variable).

このようにペアワイズの同定を行うと、（７）式と（８）式とから、（６）式の形の予測モデルを構成的（Constructive）に構築することは容易であり、例えば（７）式のＵと（８）式のＹとの平均（＝（Ｕ＋Ｙ）÷２）として構成することで、（６）式の形の予測モデルを得る事ができ、これは、再度ｙの予測値を意味していることは明らかである。 By performing pairwise identification in this way, it is easy to construct a predictive model in the form of equation (6) constructively from equations (7) and (8). For example, by averaging U in equation (7) and Y in equation (8) (= (U + Y) ÷ 2), a predictive model in the form of equation (6) can be obtained, which clearly again represents the predicted value of y.

上記のように、予測モデルの形（構造）が同じであっても同定の手順を変える事で、その意味づけは変化する。ペアワイズで同定することの利点は、意味付けのしやすいモデルを構築することができる点である。
以上が多入力１出力ではなく１入力１出力のペアワイズで予測モデルを同定することの利点である。加えて、ペアワイズ予測モデルとして、（５）式のＦＩＲもしくはＡＲの形の回帰式の形で同定することで、ディジタル信号処理やシステム同定分野で従来から良く知られている周波数解析や安定解析などの様々な解析手法を、ペアワイズに直接流用できるため、その分野の知識があれば、ペアワイズでより詳細な解析や調整も可能になる。 As mentioned above, even if the form (structure) of a predictive model is the same, changing the identification procedure will change its meaning. The advantage of pairwise identification is that it allows for the construction of a model that is easy to interpret.
These are the advantages of identifying a prediction model pairwise with one input and one output, rather than with multiple inputs and one output. In addition, by identifying the pairwise prediction model in the form of an FIR or AR regression equation as in equation (5), various analysis methods well known in the fields of digital signal processing and system identification, such as frequency analysis and stability analysis, can be directly applied to pairwise analysis. Therefore, if you have knowledge in that field, you can perform more detailed analysis and adjustments in pairwise analysis.

ペアワイズ予測モデルとして、（５）式の様なＡＲモデル／ＦＩＲモデルをさらに分解し、時間遅れを考慮した変数を各々一つの説明変数と見なした単回帰モデルに分解して予測モデルを構築することもできる。
y(t)=a×uk(t-L)+c （９）
上記（９）式の様な単純な形にすると、（９）式に含まれるパラメータは、遅れ時間Ｌ、入力変数ｕｋに対する出力変数の倍率（ゲイン、比例係数）ａ、入力変数と出力変数の平均値の差（入力が０の場合の出力の値、バイアス）ｃの三つだけとなり、全てのパラメータの意味が明確になる。 As a pairwise prediction model, a prediction model can be constructed by further decomposing an AR model/FIR model such as equation (5) into a simple regression model in which each variable taking time delays into account is considered as a single explanatory variable.
y(t)=a×uk(tL)+c (9)
When simplified as in equation (9) above, the parameters included in equation (9) are only three: delay time L, the magnification (gain, proportionality coefficient) a of the output variable relative to the input variable uk, and the difference between the average values of the input variable and the output variable (output value when the input is 0, bias) c, and the meanings of all parameters become clear.

（５）式の形では、遅れ時間Ｌとバイアスｃの解釈は（９）式と同様であるが、ａｋ、ｋ＝１、２、…、ｎという回帰係数（パラメータ）の個々の意味までは明確でなく、システム同定やディジタル信号処理に関する知識を用いて、（５）式自身を解釈することはできるものの、個々のａｋの直感的な解釈は困難である。これに対し、（９）式の単回帰式の場合はａの解釈も容易であり、単純に、平均値を０に処理した場合の入力変数に対する出力変数の倍率を示している。これにより、全てのパラメータの意味を容易に解釈できる様になるため、（９）式を用いると、何等かのアルゴリズムを適用した場合のパラメータ同定結果が直感的な感覚とずれるような場合（例えば、異常データが混入している場合などにそのような事が起こり得る）、例えば予測モデルの構築者が直感的に手動でパラメータを調整することも容易になり、また、予測モデルの中身を他者に説明することも極めて容易になる。 In equation (5), the interpretation of delay time L and bias c is the same as in equation (9), but the individual meanings of the regression coefficients (parameters) a, k = 1, 2, ..., n are unclear. While equation (5) itself can be interpreted using knowledge of system identification and digital signal processing, intuitive interpretation of the individual a is difficult. In contrast, in the simple regression equation of equation (9), a is easy to interpret; it simply indicates the magnification of the output variable relative to the input variable when the mean value is set to 0. This makes it easy to interpret the meaning of all parameters. Therefore, using equation (9) makes it easy for predictive model builders to intuitively and manually adjust parameters when the parameter identification results from applying an algorithm deviate from intuitive perception (such as can occur when anomalous data is mixed in), and it also makes it extremely easy to explain the contents of a predictive model to others.

本実施形態のモジュラー型時系列データ予測装置において、例えば、出力変数はポンプ井流入量であり、入力変数として１か所の地上雨量であると仮定すると、（９）式のｃは地上雨量がゼロの場合の流入量を意味し、これは、汚水と雨水が同一管渠を通る合流式下水の場合は、汚水量に相当するものであるという解釈ができる。そして、（９）式のａは流入量から汚水量を引き去った場合に、地上雨量計による雨量に対する雨水流入量の倍率を示しており、地上雨量と流入量との相関が十分に高ければ、このような単純な倍率も意味を持つ。さらに、（９）式の遅れ時間Ｌは、地上雨量の観測点から雨水ポンプ井までの雨水の流れの遅れ時間を意味し、下水管に流出するまでの流出の遅れと下水管路を流れて雨水ポンプ井に到達するまでの流下時間の和を意味すると解釈することができる。 In this embodiment of the modular time series data forecasting device, for example, if the output variable is the pump well inflow and the input variable is ground rainfall at one location, then c in equation (9) represents the inflow when ground rainfall is zero. In the case of a combined sewer system in which sewage and rainwater pass through the same pipe, this can be interpreted as equivalent to the sewage volume. Furthermore, a in equation (9) represents the magnification of the rainwater inflow volume relative to the rainfall measured by the ground rain gauge when the sewage volume is subtracted from the inflow volume. If the correlation between ground rainfall and inflow volume is sufficiently high, even this simple magnification is meaningful. Furthermore, the delay time L in equation (9) represents the delay time for rainwater flow from the ground rainfall observation point to the stormwater pump well, and can be interpreted as the sum of the outflow delay before it flows into the sewer pipe and the flow time through the sewer pipe to reach the stormwater pump well.

このように、解釈や説明が極めて容易になることが（９）式の様な単純なモデルを用いることの利点であり、実際に（９）式の形だけを用いた最も簡単なモデルは、流入量予測の簡易モデルとして、特許文献１の中でも採用されている。 In this way, the advantage of using a simple model such as equation (9) is that it is extremely easy to interpret and explain. In fact, the simplest model using only equation (9) is also adopted in Patent Document 1 as a simplified model for inflow volume prediction.

上記のように直感的な解釈が可能になることにより、手動でのパラメータ調整が容易になる具体的な例について説明する。
図３および図４は、幹線水位データと流入量データとの関係の一例を概略的に示す散布図である。 A specific example will be described in which the above-described intuitive interpretation makes it easier to manually adjust parameters.
3 and 4 are scatter diagrams that schematically show an example of the relationship between main water level data and inflow data.

図３および図４において、横軸は、幹線水位計１３１～１３Ｋのいずれかに相当するある箇所の幹線水位データ、縦軸は流入量計１１に相当する予測対象となる流入量データとした散布図の一例を示している。
図３に示す直線は、最小２乗法によって、単回帰のゲインとバイアスを求めたものであり、この場合ゲインａ＝９．６０２４、バイアスｃ＝２３．３１４２となっている。これは、予測値の平均２乗誤差（ＭＳＥもしくはＲＭＳＥ）が最小になるという意味で、最適な回帰直線である。 Figures 3 and 4 show an example of a scatter plot in which the horizontal axis represents main water level data at a location corresponding to one of the main water level gauges 131 to 13K, and the vertical axis represents inflow data to be predicted corresponding to inflow meter 11.
The straight line shown in Figure 3 is the gain and bias of simple regression calculated by the least squares method, and in this case the gain a = 9.6024 and the bias c = 23.3142. This is the optimal regression line in the sense that it minimizes the mean square error (MSE or RMSE) of the predicted value.

図３において、例えば幹線水位の値が－１．５であるとき、回帰直線に従うと流入量の値はおおよそ９程度と予測されることになる。しかしながら、図３の散布図によれば幹線水位の値が－１．５であるときの流入量は、９よりも大きい値である場合が圧倒的に多いことがわかる。 In Figure 3, for example, when the main water level is -1.5, the regression line predicts that the inflow value will be approximately 9. However, the scatter plot in Figure 3 shows that when the main water level is -1.5, the inflow value is overwhelmingly greater than 9.

上記のように、回帰直線による予測値と測定値との間にズレが生じる理由は、幹線水位が－２．５乃至－２付近のデータ数が圧倒的に多いため、この付近のデータにより適合する様に係数が推定されるためである。この回帰直線は、理論上は正しいものであるが、実際の流入量予測においては、必ずしも好ましくない場合が多い。なぜなら、流入量の予測値は、雨水排水ポンプの制御や運転支援に用いられるため、浸水リスクを回避するためには、流入量が多くなる場合をできる限り正確に予測したいという暗黙の要請があるためであり、流入量が少ない場合の予測精度はそれほど重要にならない場合が多いためである。 As mentioned above, the reason for the discrepancy between the predicted value based on the regression line and the measured value is that there is an overwhelming amount of data for main water levels around -2.5 to -2, so the coefficients are estimated to better fit the data in this range. While this regression line is correct in theory, it is often not necessarily desirable for actual inflow predictions. This is because the predicted inflow values are used to control and assist the operation of stormwater drainage pumps, and in order to avoid the risk of flooding, there is an implicit desire to predict cases where inflow volumes are high as accurately as possible, and prediction accuracy when inflow volumes are low is often not as important.

しかし、このような暗黙の要請をアルゴリズムに組みこむ場合には、流入量が多い箇所の予測誤差に重みをつけるなどの処置が必要であり、その重みの調整なども含め、一般にかなりの労力を要する。 However, incorporating such implicit requirements into an algorithm requires measures such as weighting the prediction errors in areas with high inflows, which generally requires a considerable amount of effort, including adjusting those weights.

図４では、最小２乗法と、最小２乗法の他の１０種類の様々な回帰手法とを、図３と同じデータに適用した結果得られた回帰直線Ｌ１－Ｌ１１の一例を示している。この中で、１番乃至１０番の方法は、Pythonと呼ばれるソフトウェアのパッケージに組み込まれている様々なアドバンストな回帰手法であるが、いずれの方法による回帰直線Ｌ１－Ｌ１０も、流入量が多いときに適切な予測値が得られるものではなかった。 Figure 4 shows an example of regression lines L1-L11 obtained by applying the least squares method and 10 other regression methods to the same data as in Figure 3. Of these, methods 1 through 10 are various advanced regression methods built into a software package called Python, but none of the regression lines L1-L10 obtained by any of these methods were able to produce appropriate predictions when the inflow volume was large.

１１番目の方法は、幹線水位が－２．２乃至－１．５までの範囲のデータだけを抽出し、他のデータを使用せずに、通常の最小２乗法を適用した方法である。これは、本願発明者らが、幹線水位－２．５乃至－２付近のデータに適合するように回帰直線が求められることを理解した上で、その付近のデータを意図的に捨てることによって、流入量が多い場合のデータに適合するように意図的に調整した方法である。 The 11th method extracts only data from the main water level range of -2.2 to -1.5, and applies the ordinary least squares method without using any other data. The inventors of this application, understanding that a regression line can be obtained to fit data for main water levels around -2.5 to -2, intentionally discarded data in this area, and intentionally adjusted the method to fit data when inflow volumes are high.

このように、アルゴリズムに頼った調整を行おうとすると、例えばデータの範囲を限定する、あるいは、データを間引くなどの様々な工夫を施さないと、本来望むパラメータが得られないことがある。しかし、１１番目の方法により得られる直線Ｌ１１は、２点を決めれば直線が唯一に定まるという極めて単純な原理を用いれば、人間が目視で直線Ｌ１１のようなラインを引くことにはほとんど労力を要しない。上記のように、単回帰という極めてわかりやすく、また図３や図４に示す散布図上に明確に可視化できるパラメータを用いることによって、手動での調整が極めて容易にできることがわかる。
すなわち、単回帰モデルを用いると、倍率（ゲイン）とバイアスという容易に理解できるパラメータを用いて解釈できて、調整が容易になる。 As such, when attempting to perform adjustments that rely on algorithms, it may be necessary to take various measures, such as limiting the range of data or thinning out the data, to obtain the desired parameters. However, if the straight line L11 obtained by the 11th method is based on the extremely simple principle that a straight line can be uniquely determined by determining two points, it takes almost no effort for a human to visually draw a line like the straight line L11. As described above, by using simple regression, which is extremely easy to understand, and parameters that can be clearly visualized on the scatter diagrams shown in Figures 3 and 4, it can be seen that manual adjustments can be made extremely easily.
In other words, when a simple regression model is used, it can be interpreted using easily understandable parameters, namely, gain and bias, making adjustments easy.

さらに、単回帰を用いると別の観点で解釈することも可能になる。すなわち、統計分野における基本的な統計量である、平均、標準偏差、および、相関、という三つのパラメータを用いて解釈することが可能になる。 Furthermore, simple regression allows for interpretation from a different perspective. That is, it allows for interpretation using three parameters, the mean, standard deviation, and correlation, which are basic statistical quantities in the field of statistics.

単回帰モデルの回帰係数（比例係数、傾き、倍率、ゲイン）ａは、相関係数ｒと次式の関係にあることが広く知られている。
a=r×(σy/σu) （１０）
ここで、ｒ、σｙ、σｕは、各々、入力変数と出力変数の相関係数、出力変数の標準偏差、入力変数の標準偏差、である。また、回帰係数ａとバイアスｃは、単回帰モデルの予測２乗誤差が最小となる様に最小２乗法で同定した場合、以下の関係になることも広く知られている。
c=μy-a×μu （１１）
ここで、μｙ、μｕは、各々、出力変数の平均と入力変数の平均、を表す。 It is widely known that the regression coefficient (proportional coefficient, slope, magnification, gain) a of a simple regression model has the following relationship with the correlation coefficient r:
a = r × (σy/σu) (10)
where r, σy, and σu are the correlation coefficient between the input variable and the output variable, the standard deviation of the output variable, and the standard deviation of the input variable, respectively. It is also widely known that the regression coefficient a and bias c have the following relationship when identified by the least squares method so as to minimize the prediction square error of the simple regression model.
c = μy - a × μu (11)
Here, μy and μu represent the mean of the output variables and the mean of the input variables, respectively.

従って、（９）式において、入力変数と出力変数を各々、平均と標準偏差を用いて正規化しておけば、回帰係数ａと相関係数ｒの値は一致し、ｃ＝０となる。
この関係が意味することは、単回帰モデルのゲインとバイアスでモデルを説明することと、各変数（入力変数と出力変数）の平均、標準偏差と、入力と出力との相関係数、のみでモデルを説明することは等価であるということである。従って、予め、変数毎に適切な平均と標準偏差の推定値を用いて各変数のデータを正規化しておけば、単回帰の入出力関係は、次式のように、相関係数のみで、全てを説明できることになる。
y´(t)=a×uk´(t-L)=r×uk´(t-L) （１２）
ここで、y´=(y-μy)/σy、uk´=(uk-μuk)/σukで定義される正規化された出力変数と入力変数である。 Therefore, in equation (9), if the input variables and output variables are normalized using the mean and standard deviation, respectively, the values of the regression coefficient a and the correlation coefficient r will match, and c=0.
This relationship means that explaining the model using the gain and bias of a simple regression model is equivalent to explaining the model using only the mean and standard deviation of each variable (input variable and output variable) and the correlation coefficient between the input and output. Therefore, if you normalize the data for each variable in advance using appropriate estimates of the mean and standard deviation for each variable, the input-output relationship of simple regression can be explained entirely using only the correlation coefficient, as shown in the following equation.
y´(t)=a×uk´(tL)=r×uk´(tL) (12)
where y' = (y - μy)/σy and uk' = (uk - μuk)/σuk are the normalized output and input variables.

このように解釈すると、例えば、入出力の相関が無い場合、すなわちｒ＝０となる様な入出力関係を持つ場合は、その出力に対して対応する入力は全く予測能力を持たない事が（１１）式の関係から明確に理解することができる。また、現実のデータは、様々なノイズやアウトライア（外れ値）で汚染されている（コンタミされている）場合も多いが、このような時、平均、標準偏差、相関係数、の三つのパラメータを外れ値に対してロバストに推定するロバスト推定法を用いて推定することにより、外れ値に対してロバストな予測モデルを構築することもできる。もちろん、より複雑なモデルのパラメータ推定に対する様々なロバスト推定法も開発されているが、一般にロバスト推定した結果を解釈することは容易ではないのに対し、平均、標準偏差、相関係数という三つのパラメータに対するロバスト推定の場合は、何等かのロバスト推定を適用した推定結果の良否の判断や解釈も容易になるため、ノイズやアウトライアにコンタミされたデータに対する予測モデル構築を行いたい場合の解釈や調整も容易になる。 When interpreted in this way, for example, if there is no correlation between input and output, i.e., if the input-output relationship is such that r = 0, then the input corresponding to that output has no predictive ability whatsoever, as can be clearly seen from the relationship in equation (11). Furthermore, real-world data is often contaminated with various noises and outliers. In such cases, a robust estimation method that robustly estimates the three parameters (mean, standard deviation, and correlation coefficient) against outliers can be used to construct a predictive model that is robust against outliers. Of course, various robust estimation methods have been developed for estimating the parameters of more complex models, but the results of robust estimation are generally not easy to interpret. However, robust estimation of the three parameters (mean, standard deviation, and correlation coefficient) makes it easy to interpret and judge the quality of the results of any robust estimation applied. This also facilitates interpretation and adjustment when building a predictive model for data contaminated by noise or outliers.

このように、（９）式や（１０）～(１２)式は、容易に理解しやすいパラメータのみで構成されているが、（５）式と比較しても、極めて単純な構造をしており、実際の応用においては、その表現能力が十分でなく、十分な予測精度が得られない可能性があった。そこで、（９）式（あるいは（１０）～（１２）式）の形から、(５)式の形を合成することを検討する。（９）式のモデルを用いて(５)式の形を得るためには、(９)式の遅れ時間Ｌを可変にして足し合わせればよい。すなわち、まず、（９）式における遅れ時間Ｌ（（５）式のＬと区別するため以下ではＬ´とする）の最小の値を（５）式のＬと対応させてＬ´＝Ｌとし、Ｌ以上の遅れ時間について、Ｌ´＝Ｌ＋１、Ｌ´＝Ｌ＋２、…Ｌ´＝Ｌ＋ｎ－１としたｎ個の（９）式のモデルを加え合わせれば（５）式の形のモデルが得られる。 As such, equations (9) and (10) through (12) are composed only of easily understandable parameters. However, compared to equation (5), they have an extremely simple structure, and in actual applications, their expressive power may be insufficient, resulting in insufficient prediction accuracy. Therefore, we consider synthesizing equation (5) from equation (9) (or equations (10) through (12)). To obtain equation (5) using the model of equation (9), we simply make the delay time L in equation (9) variable and add them together. That is, first, we match the minimum value of the delay time L in equation (9) (hereafter referred to as L' to distinguish it from L in equation (5)) to L in equation (5), setting L' = L. Then, for delay times greater than L, add n models of equation (9) where L' = L+1, L' = L+2, ..., L' = L+n-1 to obtain a model in the form of equation (5).

すなわち、以下の（１３）式のｎ個の単回帰モデルを加え合わせれば、形式的に（５）式と等価な（１４）式のＦＩＲもしくはＡＲモデルの形のモデルが得られる。
y(t)=a1×uk(t-L)+c （１３＿１）
y(t)=a2×uk(t-L-1)+c （１３＿２）
y(t)=an×uk(t-L-n+1)+c （１３＿ｎ）
y(t)=1/n×(a1×uk(t-L)+a2×uk(t-L-1)+…+an×uk(t-L-n+1))+c （１４） That is, by adding together n simple regression models of the following equation (13), a model in the form of an FIR or AR model of equation (14) which is formally equivalent to equation (5) can be obtained.
y(t)=a1×uk(tL)+c (13_1)
y(t)=a2×uk(tL-1)+c (13_2)
y(t)=an×uk(tL-n+1)+c (13_n)
y(t)=1/n×(a1×uk(tL)+a2×uk(tL-1)+…+an×uk(tL-n+1))+c (14)

異なるｎ個の遅れ時間のｎ個の単回帰モデルを作成した上で、その平均モデルを作成すれば、形式的には（１）式と同じＦＩＲあるいはＡＲモデルの形のモデルが得られる。この際、上記の様に単純に平均化処理を行うこともできるが、先の述べた単回帰のゲインとバイアスと相関係数との関係を用いて、以下の様に重み付き平均を行うこともできる。 By creating n simple regression models for n different delay times and then creating an average model of these, a model in the form of an FIR or AR model, formally the same as equation (1), can be obtained. In this case, a simple averaging process can be performed as above, but a weighted average can also be performed as shown below, using the relationship between the gain, bias, and correlation coefficient of the simple regression described above.

まず、予め、入出力変数を正規化して、（９）式を（１２）式の形で表しておく。すると、回帰係数は相関係数と一致するので、この回帰係数（の絶対値）で重みづけした重み付き平均値として（１４）式に類似した（１５）式のＦＩＲ／ＡＲモデルの形の式を得ることができる（以下では、正規化した変数と正規化していない変数との記号を区別せず、文脈に応じてｕ（ｔ）、ｙ（ｔ）は正規化した変数を表すこととする。）
y(t)=(a1×uk(t-L)+…+an×uk(t-L-n+1))/(|a1|+|a2|+…+|an|)) （１５） First, input and output variables are normalized in advance to express equation (9) in the form of equation (12). Then, the regression coefficient coincides with the correlation coefficient, so an equation in the form of an FIR/AR model, equation (15) similar to equation (14), can be obtained as a weighted average weighted by (the absolute value of) this regression coefficient (hereinafter, no distinction will be made between the symbols for normalized and non-normalized variables, and u(t) and y(t) will represent normalized variables depending on the context).
y(t)=(a1×uk(tL)+…+an×uk(tL-n+1))/(|a1|+|a2|+…+|an|)) (15)

このようにすると、相関が強くなる遅れ時間による予測に対して重みをつけたペアワイズ予測モデルの合成が可能になる。なお、回帰係数と相関係数の関係が（１２）式の様に陽に関係つけられない場合でも相関係数絶対値もしくは回帰係数の絶対値で重みづけ平均化処理を行うことは可能であるが、（１２）式の関係があることで、重みづけを行う事の意味がより明確で説得性の高いものとなる。また、（１５）式において、相関係数の絶対値ではなく相関係数の２乗によって重みづけを行っても良い。 This makes it possible to synthesize pairwise prediction models that are weighted for predictions based on lag times where correlation is stronger. Even if the relationship between the regression coefficient and the correlation coefficient cannot be explicitly determined as in equation (12), it is possible to perform weighted averaging using the absolute value of the correlation coefficient or the absolute value of the regression coefficient. However, the existence of the relationship in equation (12) makes the significance of weighting clearer and more persuasive. Furthermore, in equation (15), weighting can also be performed using the square of the correlation coefficient rather than the absolute value of the correlation coefficient.

ペアワイズ予測モデルとして、上記の二つの合成法は、いずれの方法でも入出力間の線形の関係しか表現することができないが、入出力間に非線形の関係がある場合も多い。このような場合は、（５）式の代わりに、適当な非線形関数φ（・）を用いて、以下の（１６）式の様な非線形回帰の形として非線形変換すればよい。 As pairwise prediction models, both of the above synthesis methods can only express linear relationships between input and output, but there are often nonlinear relationships between input and output. In such cases, instead of equation (5), an appropriate nonlinear function φ(·) can be used to perform a nonlinear transformation into a nonlinear regression form such as equation (16) below.

y(t)=a1×φ（uk(t-L)）+…+an×φ（uk(t-L-n+1)）+c （１６）
これは、機械学習の分野で広く知られている非線形回帰のテクニックであり、パラメータａｋ、ｋ＝１、２、…、ｎに関する線形性さえ維持していれば、線形回帰の様々な手法を直接適用することができる。また、機械学習の分野でよく知られている様に、非線形関数φ（・）を直接指定しなくても、（１６）式を計算する際に必要となる計画行列と呼ばれる行列に対して、φ（・）を指定することと等価に変換できるカーネル関数（類似度関数）を直接指定しても良い。ただし、その場合には、直感的な解釈性が若干低下する可能性がある事には注意する必要がある。 y(t)=a1×φ(uk(tL))+…+an×φ(uk(tL-n+1))+c (16)
This is a nonlinear regression technique that is widely known in the field of machine learning, and various linear regression methods can be directly applied as long as linearity with respect to the parameters a, k = 1, 2, ..., n is maintained. Also, as is well known in the field of machine learning, instead of directly specifying the nonlinear function φ(·), it is possible to directly specify a kernel function (similarity function) that can be converted to be equivalent to specifying φ(·) for a matrix called a design matrix that is required when calculating equation (16). However, in this case, it should be noted that intuitive interpretability may be slightly reduced.

（１６）式の様な非線形回帰を用いる事で、ある入力変数と出力変数との間に非線形関係がある場合にも対応することが可能になる。なお、ペアワイズ予測モデルの同定では、入力変数毎に出力変数との回帰モデルを構築するので、ある入力変数と出力変数の関係は非線形であるが、別の入力変数と出力変数との関係は線形であるような場合には、入力変数毎に（５）式と（１６）式とを単に使い分けるだけでよく、これによって、一般的には説明性も向上することが期待される。以上の一連の作用が、本実施形態におけるペアワイズ予測モデル同定部４の作用である。 By using nonlinear regression such as equation (16), it is possible to handle cases where there is a nonlinear relationship between a certain input variable and an output variable. In identifying pairwise prediction models, a regression model with the output variable is constructed for each input variable. Therefore, if the relationship between one input variable and the output variable is nonlinear but the relationship between another input variable and the output variable is linear, it is sufficient to simply use equations (5) and (16) for each input variable, which is generally expected to improve interpretability. The above series of actions are the actions of the pairwise prediction model identification unit 4 in this embodiment.

次に予測モデル合成法定義部５では、ペアワイズ予測モデル同定部４で定義した予測モデルの合成法を定義する。
最も簡単な予測モデルの合成法は、ペアワイズ予測モデルで各入力変数に対して出力変数を予測するモデルが構築されているため、その平均により予測モデルの合成を行う方法である。この場合は、予測モデル合成法定義部５では、ペアワイズ予測モデル同定部４のｐ個の予測出力（以下では、各入力変数ｕ１、ｕ２、…、ｕｐに対する予測出力をｙ１、ｙ２、…、ｙｐとする。）を平均化する次式を定義することになる。
y(t)=mean(y1(t)、y2(t)、…、yp(t))=1/p×(y1(t)+y2(t)+…+yp(t)) （１７）
（１７）式の定義は、最も基本的な定義方法であるが、この定義を改良することにより、合成した予測出力に対する信頼性を向上させたり、意図的に予測に傾向（バイアス）を持たせた予測を行ったりすることが可能になる。これを以下に順に説明する。 Next, the prediction model synthesis method definition unit 5 defines a synthesis method for the prediction models defined by the pairwise prediction model identification unit 4 .
The simplest method for combining prediction models is to combine prediction models by averaging the p prediction outputs (hereinafter, the prediction outputs for the input variables u1, u2, ..., up) of the pairwise prediction model identification unit 4, as follows:
y(t)=mean(y1(t), y2(t),..., yp(t))=1/p×(y1(t)+y2(t)+...+yp(t)) (17)
The definition of equation (17) is the most basic definition method, but by improving this definition, it becomes possible to improve the reliability of the synthesized prediction output and to perform prediction with an intentional tendency (bias) in the prediction. This will be explained in order below.

まず、（１７）式の様な単純な平均化処理（標本平均）を行うと、各ペアワイズ予測モデルの予測値が平等に扱われているため、予測精度の良いペアワイズ予測モデルの予測値と予測精度の悪いペアワイズ予測モデルの予測値が混合されて、合成した全体の予測精度が劣化してしまう可能性がある。また、実際の運用においては、ある入力変数の計測データの信頼性が低く、別の入力変数の計測データの信頼性が高いという場合も稀ではないが、信頼性の高い入力変数に対するペアワイズ予測モデルの予測値も信頼性の低い入力変数に対するペアワイズ予測モデルの予測値も平均化してしまうことで、合成した予測出力の予測精度が劣化してしまう。極端な場合、例えば、ある入力変数のセンサの故障や不具合などにより、当該入力変数の時系列データにアウトライア（外れ値、異常値）が多量に含まれる様になる場合などには、予測精度が劣化するだけでなくアウトライアに引きずられて合成した予測自身が無意味になる（破綻してしまう）可能性がある。 First, when performing a simple averaging process (sample mean) such as in equation (17), the predicted values of each pairwise prediction model are treated equally. This means that predicted values from pairwise prediction models with good prediction accuracy may be mixed with predicted values from pairwise prediction models with poor prediction accuracy, potentially degrading the overall prediction accuracy of the combined model. Furthermore, in actual operation, it is not uncommon for measurement data for one input variable to be unreliable while measurement data for another input variable is highly reliable. However, averaging the predicted values of pairwise prediction models for both highly reliable and unreliable input variables can degrade the prediction accuracy of the combined prediction output. In extreme cases, such as when a sensor failure or malfunction for a certain input variable causes the time series data for that input variable to contain a large number of outliers (abnormal values), not only does prediction accuracy deteriorate, but the outliers can also render the combined prediction meaningless (even causing it to fail).

このような状況に対応するために予測モデル合成法定義部５で、以下の様な複数のアプローチをすることができる。
一つ目は、主に後者のアウトライアに対して合成した予測値自身が破綻しない事を重視するアプローチであり、（１７）式の標本平均処理に変えて、ロバスト推定を採用する方法である。 To deal with such a situation, the prediction model synthesis method definition unit 5 can take the following multiple approaches.
The first is an approach that places emphasis on ensuring that the synthesized predicted value itself does not break down, mainly for the latter outliers, and is a method of adopting robust estimation instead of the sample averaging process of equation (17).

（１７）式で用いる「標本平均」という処理は、ｙ１、ｙ２、…、ｙｐのｐ個の予測値の中から代表値を推定する方法の一つであり、このような代表値の推定を統計分野では位置母数の推定と呼ぶ。すなわち、「標本平均」は、位置母数の推定方法の一つであり、統計的には、標本平均は推定効率の観点では良い性質を持つことが知られているが、一方でアウトライアに対するロバスト性の面では最も脆弱（非ロバスト）であることも知られている。従って、外れ値に対するロバスト性を向上させるためには、（１７）式をロバストな位置母数推定で置き換える方が良い。 The "sample mean" process used in equation (17) is one method of estimating a representative value from the p predicted values y1, y2, ..., yp, and in the field of statistics, this type of representative value estimation is called location parameter estimation. In other words, the "sample mean" is one method of estimating a location parameter, and statistically, the sample mean is known to have good properties in terms of estimation efficiency, but it is also known to be the most vulnerable (non-robust) in terms of robustness against outliers. Therefore, in order to improve robustness against outliers, it is better to replace equation (17) with a robust location parameter estimation.

ロバスト推定の分野では外れ値に対するロバスト性を評価するいくつかの指標が知られているが、最も直感的に理解しやすい指標としてブレークダウンポイントという指標がある。これは、統計的推定に用いるデータの中に何パーセントアウトライアが混入することを許容するかという指標であり、用いるデータのＸ％を、仮想的なアウトライアを想定して∞に置き換えた場合、推定量（平均などの位置母数を推定する場合は位置母数の推定値）が∞になる（＝破綻する）かならないかの境界（∞になる直前）のＸの値をブレークダウンポイントと呼ぶ。 In the field of robust estimation, several indices are known for evaluating robustness against outliers, but the one that is easiest to understand intuitively is the breakdown point. This is an indicator of what percentage of outliers is acceptable to be mixed into the data used in statistical estimation. If X% of the data used is replaced with ∞ to assume hypothetical outliers, the value of X at the boundary (just before ∞) where the estimator (the estimated value of the location parameter, when estimating a location parameter such as the mean) reaches ∞ (i.e. breaks down) or not is called the breakdown point.

例えば、「標本平均」という位置母数の推定に対しては、ただ１点のデータを∞に置換するだけでその標本平均値も∞になるので、「標本平均」のブレークダウンポイントは０％である。ロバスト統計ではブレークダウンポイントの最大値は５０％であることが知られており、最大のブレークダウンポイントを持つ位置母数推定量として「中央値（メジアン）」が知られている。従って、予測値が外れ値に影響されないようにすることを最大の目的とする場合には、（１７）式の「平均」を「中央値」で置換して、予測モデル合成法定義部５の定義とすることもできる。 For example, when estimating the location parameter known as the "sample mean," simply replacing one point of data with ∞ also makes the sample mean value ∞, so the breakdown point of the "sample mean" is 0%. In robust statistics, it is known that the maximum breakdown point is 50%, and the "median" is known as the location parameter estimator with the largest breakdown point. Therefore, if the primary goal is to ensure that predicted values are not affected by outliers, the "mean" in equation (17) can be replaced with the "median" to form the definition in the prediction model synthesis definition unit 5.

一方、「中央値」はロバストではあるが、推定効率が悪い事が知られており、直感的にもｐ個の変数で予測している中の一つの予測値しか用いないため、精度の高い予測を行うことを重視する場合に最良の方法ではないことは容易に推測できる。そのため、推定効率の向上とロバスト性の向上を両立するための各種のロバスト推定方法が知られている。最も単純な方法は、「トリム平均（刈込平均）」と呼ばれる処理であり、ｐ個のデータの上位と下位とのα％を削除した上で平均をとる方法である。直感的に明らかな様にαを大きくするとロバスト性は向上し、その極限として中央値推定があり、αを小さくして０に近づけるとロバスト性が低下しその極限が通常の平均（標本平均）であることは明らかである。従って、このトリム平均を予測モデル合成法定義部５の定義とすることもできる。この場合、αの調整によってロバスト性を調整できるが、適切に調整する事自身が難しい場合も考えられるため、別のロバスト推定方法をとることもできる。 On the other hand, while the "median" is robust, it is known to have poor estimation efficiency. It is intuitively obvious that, since it only uses one predicted value out of p variables, it is not the best method when high-accuracy predictions are important. For this reason, various robust estimation methods are known that achieve both improved estimation efficiency and robustness. The simplest method is a process called the "trimmed mean," which removes α% of the top and bottom values of p data and then takes the average. As is intuitively clear, increasing α improves robustness, with median estimation as the limit. Reducing α toward 0 clearly decreases robustness, with the limit being the normal mean (sample mean). Therefore, this trimmed mean can also be used as the definition in the prediction model synthesis definition unit 5. In this case, robustness can be adjusted by adjusting α, but since it may be difficult to adjust it appropriately, another robust estimation method can be used.

このような方法の代表的な例として、ｐ個のｙ１、ｙ２、…、ｙｐの中からすべての組み合わせの（ｐ（ｐ－１）／２個）の、二つのｙｉとｙｊ（ｉ≠ｊ）との平均を計算した上で、その中央値を採用するホッジスレーマン推定量（ＨＬ推定）と呼ばれる位置母数推定を行うこともできる。この方法は、ブレークダウンポイントが約３０％でロバスト性が高いうえに推定効率も良いことが知られており、このＨＬ推定を予測モデル合成法定義部５の定義とすることもできる。 A typical example of such a method is to perform location parameter estimation known as the Hodges-Lehmann estimator (HL estimation), which calculates the average of two yi and yj (i ≠ j) for all combinations (p(p-1)/2) of p values y1, y2, ..., yp, and then uses the median. This method is known to be highly robust with a breakdown point of approximately 30%, as well as having good estimation efficiency, and this HL estimation can also be defined in the prediction model synthesis method definition unit 5.

他のロバスト推定法として、一般には多変量データに対して用いられる手法であるＭＣＤと呼ばれる手法が知られているがこれを適用することもできる。ＭＣＤは、ｐ個のデータの所定の割合（通常５０％～７５％）のデータを取り出し、そのすべての組み合わせの中で分散が最小になるデータを用いて推定を行う手法であり、この方法を用いて平均を推定することもできる。ＭＣＤもロバスト性が高く推定効率の良い方法として知られている。一方、ＨＬ推定やＭＣＤ推定は、ｐ個のデータの中から取り出すデータの数（ＨＬ推定場合２個、ＭＣＤの場合はｐ／２～３ｐ／４個程度）の全ての組み合わせに対する計算が必要となるため、ペアワイズモデルの入力変数の数ｐが増加すると処理時間が飛躍的に増加するため、予測モデル合成法定義部５の定義として用いても、実際に予測を行う際にリアルタイム性を確保できない可能性がある。 Another robust estimation method known as MCD, which is generally used for multivariate data, can also be applied. MCD extracts a specified percentage (usually 50% to 75%) of p data items and performs estimation using the data with the smallest variance among all combinations. This method can also be used to estimate the mean. MCD is also known as a highly robust and efficient estimation method. On the other hand, HL estimation and MCD estimation require calculations for all combinations of the number of data items extracted from p data items (2 for HL estimation, approximately p/2 to 3p/4 for MCD). Therefore, as the number of input variables p in the pairwise model increases, processing time increases dramatically. Therefore, even if they are used as the definition for the prediction model synthesis method definition unit 5, real-time performance may not be ensured when actually making predictions.

このような場合に対応するため、「全ての組み合わせ」に対して処理を行うのではなく、リアルタイムで処理が可能な回数だけ繰り返し処理を行う様にブートストラップ法を用いて代用しても良い。すなわち、ｐ個のｙ１、ｙ２、…、ｙｐの中から、所定のｋ個のデータをランダムに繰り返し抽出し、所定の回数（Ｎ回）、その平均値を求める。繰り返し求めたＮ個の平均値の中から、ＨＬ推定と同じように、例えばＮ個の平均値の中央値を採用する。このようなブートストラップ法による平均値推定を、予測モデル合成法定義部５の定義とすることで、リアルタイム性を維持しながら、推定効率が良くロバスト性の高い予測値の合成を行うことができると考えられる。
上記以外にも外れ値に重みを付けて推定を行うM推定などのロバスト推定を位置母数推定に適用した方法を予測モデル合成法定義部５の定義とすることもできる。 To address such a situation, instead of processing "all combinations," a bootstrap method may be used to repeatedly perform processing as many times as possible in real time. That is, a predetermined k number of data items are randomly and repeatedly extracted from p data items y1, y2, ..., yp, and the average value is calculated a predetermined number of times (N times). From the N average values calculated repeatedly, for example, the median of the N average values is used, as in HL estimation. By defining such average value estimation using the bootstrap method in the prediction model synthesis method definition unit 5, it is believed that it is possible to synthesize prediction values with high estimation efficiency and robustness while maintaining real-time performance.
In addition to the above, the prediction model synthesis method definition unit 5 can also define a method in which a robust estimation such as M estimation, which performs estimation by weighting outliers, is applied to the location parameter estimation.

二つ目は、先のアプローチが主に外れ値（外れた予測値）に対してロバストに予測値の代表値を推定することを目的としていたのに対し、実際に予測精度の良い予測値に重みを付けて合成した予測を行うことを目的としたアプローチである。基本的な考え方は、（１７）式の単純な標本平均に変えて、重み付き平均を行う方法であり、次式で表される式で合成予測出力を定義する。
y(t)=w1×y1(t)+w2×y2(t)+…+wp×yp(t) （１８）
ここで、ｗｋ、ｋ＝１、２、…、ｐは、重みであり、ｗ１＋ｗ２＋…＋ｗｐ＝１となる制約を満たす。予測モデル合成法定義部５の定義では、この重みの設定法を定義する必要がある。 The second approach aims to perform a composite prediction by weighting predicted values with good prediction accuracy, whereas the previous approach aimed primarily at estimating a representative value of predicted values robustly against outliers (outlier predicted values). The basic idea is to perform a weighted average instead of the simple sample average in equation (17), and the composite prediction output is defined by the following equation:
y(t)=w1×y1(t)+w2×y2(t)+…+wp×yp(t) (18)
Here, wk, k = 1, 2, ..., p are weights that satisfy the constraint that w1 + w2 + ... + wp = 1. In the definition of the prediction model synthesis method definition unit 5, it is necessary to define a method for setting these weights.

これには、ペアワイズ予測モデル同定部４で、ペアワイズ予測モデルを同定した際の予測出力と実際の出力の間の（重）相関係数、あるいは、その２乗である決定係数を用いることができる。これにより、少なくともペアワイズの予測モデル同定時のデータに対して、予測精度の高い予測能力を持つモデルの予測出力を重視した重み付き平均で予測出力を合成することができる。 For this purpose, the pairwise prediction model identification unit 4 can use the (multiple) correlation coefficient between the predicted output when the pairwise prediction model is identified and the actual output, or the coefficient of determination, which is its square. This makes it possible to synthesize predicted outputs using a weighted average that places emphasis on the predicted output of models with high prediction ability and high prediction accuracy, at least for the data used when identifying the pairwise prediction model.

また、ペアワイズの予測モデルを、例えば上述の（１５）式のように構築した場合には、各遅れ時間毎の説明変数と出力変数の間の相関係数が既に算出されているので、統合したペアワイズ予測モデル（（１４）式に相当）毎に、各相関係数の絶対値の平均値ｒｋ、ｋ＝１、２、３、…、ｐを求め、ｒｋに応じて、重みｗｋをｗｋ＝ｒｋ／（ｒ１＋ｒ２＋…＋ｒｐ）と定義することもできる。 Furthermore, if a pairwise prediction model is constructed, for example, as in equation (15) above, the correlation coefficients between the explanatory variables and output variables for each delay time have already been calculated, so for each integrated pairwise prediction model (corresponding to equation (14)), the average value rk, k = 1, 2, 3, ..., p of the absolute values of each correlation coefficient can be calculated, and the weight wk can be defined as wk = rk/(r1 + r2 + ... + rp) according to rk.

このように、ペアワイズ予測モデルの予測能力に応じて重み付き平均値の重みを決定して（１８）式により、予測モデルの合成法を定義する動作が、本実施形態における予測モデル合成法定義部５の動作の一例である。 In this way, the operation of determining the weights of the weighted average values according to the predictive capabilities of the pairwise prediction models and defining the prediction model synthesis method using equation (18) is an example of the operation of the prediction model synthesis method definition unit 5 in this embodiment.

三つ目は、二つ目と同様に（１７）式の、各ペアワイズ予測モデルの重み付き平均値により、合成した予測出力を定義するが、この重みを、データを用いて同定（推定）することで決定するものである。この際、ペアワイズ予測モデルの同定時に用いたデータと同定したパラメータ値を用いると、その同定データに対する各ペアワイズ予測モデルの予測値ｙ１、ｙ２、…、ｙｐの時系列データが得られる。 The third method, like the second method, defines the combined prediction output using the weighted average of each pairwise prediction model in equation (17), but determines the weights by identifying (estimating) them using data. In this case, by using the data used when identifying the pairwise prediction model and the identified parameter values, time series data of the predicted values y1, y2, ..., yp of each pairwise prediction model for the identified data can be obtained.

そして、（１７）式を、この予測値ｙ１、ｙ２、…、ｙｐを入力として、同定に用いた実際の出力データを出力ｙと見なすと、（１７）式自身が重みｗｋ、ｋ＝１、２、…、ｐを回帰係数とする重回帰モデルの形式になっている。従って、これらの予測時系列データと同定に用いた出力データとを用いて、重みｗｋ、ｋ＝１、２、…、ｐを重回帰の方法で同定することができる。ただし、重みは各々正でなければならないという不等式制約とｗ１＋ｗ２＋…＋ｗｐ＝１という等式制約を満たす必要があるため、通常の重回帰で用いる最小２乗法は適用できないが、混合線形推定法などの制約条件を考慮できる線形回帰の推定法を用いて、重みを推定することが可能になる。 If we consider equation (17) with the predicted values y1, y2, ..., yp as inputs and the actual output data used for identification as the output y, then equation (17) itself takes the form of a multiple regression model with weights wk, k = 1, 2, ..., p as regression coefficients. Therefore, using this predicted time series data and the output data used for identification, the weights wk, k = 1, 2, ..., p can be identified using multiple regression. However, because the weights must each satisfy the inequality constraint that they must be positive and the equality constraint that w1 + w2 + ... + wp = 1, the least squares method used in ordinary multiple regression cannot be applied. However, it is possible to estimate the weights using a linear regression estimation method that can take constraints into account, such as mixed linear estimation.

なお、この方法は、（１３）式から（１４）式を得る代わりに（１５）式を得る箇所にも同様の考え方を適用することができるので、単回帰モデル⇒ＦＩＲ／ＡＲモデル⇒多入力の伝達関数モデル、という３段階の段階的な学習（推定）によって予測モデルを合成することができる。 This method can also be applied to deriving equation (15) instead of equation (14) from equation (13), so a predictive model can be synthesized through three stages of learning (estimation): simple regression model ⇒ FIR/AR model ⇒ multi-input transfer function model.

本実施形態において、上記のようにして重みｗｋ、ｋ＝１、２、…、ｐを推定した（１７）式によって予測モデルの合成法を定義する方法が、予測モデル合成法定義部５の動作の他の例である。 In this embodiment, the method of defining the prediction model synthesis method using equation (17) in which the weights wk, k = 1, 2, ..., p are estimated as described above is another example of the operation of the prediction model synthesis method definition unit 5.

四つ目は、一つ目から三つ目とは異なり、合成した予測に傾向（バイアス）を持たせたい場合のアプローチであり、意図的に過大、あるいは、過小の予測を行う事を目的とする場合の予測出力の合成法である。 The fourth approach, which differs from the first three, is used when you want to impart a bias to the synthesized prediction, and is a method of synthesizing prediction outputs when the goal is to intentionally over- or under-predict.

上記の予測を行いたい状況は、現実の問題ではしばしば遭遇する。例えば、本実施形態で行う雨水の流入量予測の場合、この予測情報は雨水排水ポンプの起動や停止のタイミングを図るための支援情報として利用される場合が多い。このような場合、実際に流入する雨水流入量より過小な流入量の予測を行ってしまうことはリスク回避の観点から致命的になる事がある。
このような場合、もちろん、正確な流入量を予測することが好ましい事は言うまでもないが、それが現実的に困難な場合は、多少多めの量を予測しておく方が安全である。 Situations where the above prediction is required are often encountered in real-world problems. For example, in the case of the rainwater inflow prediction performed in this embodiment, the prediction information is often used as support information for determining the timing of starting and stopping a rainwater drainage pump. In such cases, predicting an inflow amount that is lower than the actual rainwater inflow amount can be fatal from the perspective of risk avoidance.
In such cases, it goes without saying that it is preferable to accurately predict the amount of inflow, but if this is practically difficult, it is safer to predict a slightly larger amount.

また、例えば雨水排水ポンプの制御において流入量予測を利用する際には、ポンプの起動タイミングを図るために過大側の予測をすることが好ましいが、停止タイミングを図るためには過小側の予測をすることが好ましいため、過小側の予測を行いたい場合もある。 Furthermore, for example, when using inflow volume predictions to control a stormwater drainage pump, it is preferable to make an over-prediction to determine the timing to start the pump, but it is preferable to make an under-prediction to determine the timing to stop the pump, so there are cases where it is necessary to make an under-prediction.

また、例えば、水や電力の需要予測などの場合は、あまり過小に予測を行ってしまうと、水や電力の供給に支障をきたす可能性が考えられるため、若干過大側に予測を行っておく方が安全な場合もある。 Also, for example, when predicting demand for water or electricity, if the prediction is too low, it could cause disruptions to the supply of water or electricity, so it may be safer to predict slightly higher.

また、このように、過大側あるいは過小側の予測を行える様にしておくと、純粋に過大な予測や過小の予測を行いたいという場合以外にも、実際に予測システムを運用して長期評価した場合に、予測値が過小に出る傾向がある場合には過大側に再調整したり、逆に予測値が過大に出る場合には過小側に再調整したりして微調整を行うことも可能になる。 Furthermore, by making it possible to make predictions on the over or under side in this way, it becomes possible not only to simply make over or under predictions, but also to make fine adjustments when actually operating the prediction system and performing long-term evaluations, by readjusting the predictions to the over side if they tend to be underestimated, or conversely, by readjusting the predictions to the under side if they tend to be overestimated.

このような動機に基づいて行う予測出力の合成法を以下に示す。これを行うためには、各ペアワイズ予測モデルの出力予測値ｙ１、ｙ２、…、ｙｐに対して、先に述べた位置母数の推定方法を定義する以外に、標準偏差や分散に対応する尺度母数の推定方法も同時に定義する。例えば、位置母数を平均とした場合には、尺度母数として、標準偏差を採用する様に定義する。また、位置母数としてロバスト推定法のメジアン（中央値）を用いた場合には、尺度母数としてＭＡＤ（中央値絶対偏差：Median Absolute Deviation）を採用する。さらに、ＨＬ推定、ＭＡＤ推定、ブートストラップ推定、Ｍ推定などを用いた場合には、各々の位置母数に対する尺度母数の推定方法を定義する。このようにして、ｙ１、ｙ２、…、ｙｐに対して適用する位置母数と尺度母数の推定法を予め定義しておく。これを以下では各々μとσと表す（通常μは平均値、σは標準偏差の意味で使われることが多いが、ここでは、必ずしも平均と標準偏差ではなく、ここで定義した位置母数と尺度母数を表すものとする）。 Based on this motivation, the method for synthesizing predicted outputs is described below. To achieve this, in addition to defining the estimation method for the location parameter described above for each pairwise prediction model output predicted value y1, y2, ..., yp, we also simultaneously define estimation methods for the scale parameters corresponding to the standard deviation and variance. For example, if the location parameter is the mean, we define the standard deviation as the scale parameter. Furthermore, if the median (center value) of the robust estimation method is used as the location parameter, we define MAD (Median Absolute Deviation) as the scale parameter. Furthermore, if HL estimation, MAD estimation, bootstrap estimation, M estimation, etc. are used, we define the estimation method for the scale parameter for each location parameter. In this way, we predefine the estimation methods for the location parameters and scale parameters to be applied to y1, y2, ..., yp. Below, we denote these as μ and σ, respectively (μ and σ are usually used to mean the mean and standard deviation, but here we will use them to refer to the location parameter and scale parameter defined here, not necessarily the mean and standard deviation).

次に、各ペアワイズ予測モデルの予測出力の集合、ｙ１、ｙ２、…、ｙｐに対して、その集合のμ±ｋσの値を合成した予測出力とするように、ｋの値と符号とを指定する。例えば、ｋ＝１としてμ＋σを指定すると、ｙ１、ｙ２、…、ｙｐの中の位置母数から尺度母数の１倍だけ過大な値を合成した予測出力とすることを意味する。逆にｋ＝－１とすると、ｙ１、ｙ２、…、ｙｐの中の位置母数から尺度母数の１倍だけ過小な値を合成した予測出力とすることを意味する。もし、位置母数と尺度母数として平均と標準偏差を用いた場合は、ｙ１、ｙ２、…、ｙｐが正規分布に従うと仮定すると、μ＋σは約６８％の値（μが５０％の値）、μ－σは約３２％の値を取り出すことに対応する。 Next, for the set of predicted outputs y1, y2, ..., yp of each pairwise prediction model, the value and sign of k are specified so that the predicted output is a combination of μ±kσ values for that set. For example, specifying μ+σ with k=1 means that the predicted output is a combination of values that are one time larger than the scale parameter and one time larger than the location parameter in y1, y2, ..., yp. Conversely, specifying k=-1 means that the predicted output is a combination of values that are one time smaller than the scale parameter and one time larger than the location parameter in y1, y2, ..., yp. If the mean and standard deviation are used as the location and scale parameters, and assuming that y1, y2, ..., yp follow a normal distribution, μ+σ corresponds to extracting approximately 68% of the value (μ is the 50% value), and μ-σ corresponds to extracting approximately 32% of the value.

なお、平均や標準偏差を用いなくても、ｙ１、ｙ２、…、ｙｐの中のＫ％にあたるＫ％分位点を取り出して合成出力とするという方法も、位置母数と尺度母数を適切に定義すれば、μ±ｋσの形式で書けるので、このように設定した分位点と直接抽出するという方法で指定することも可能である。例えば、過大側の予測を行いたい場合に予測の異常値をある程度除外する事を想定して９０％分位点にあたる予測値を採用し過大予測を行う様に合成した予測出力を定義したり、１０％分位点にあたる予測値を採用し過小予測を行う様に合成した予測出力を定義したりすることもできる。 Incidentally, even without using the mean or standard deviation, you can extract the K% quantile corresponding to K% of y1, y2, ..., yp and use it as the composite output. If the location parameter and scale parameter are defined appropriately, this can be written in the form μ±kσ, so it is also possible to specify it by directly extracting the quantile set in this way. For example, if you want to make an overprediction, you can define a composite forecast output that uses a predicted value corresponding to the 90% quantile to make an overprediction, assuming that you will exclude some outliers in the prediction, or you can define a composite forecast output that uses a predicted value corresponding to the 10% quantile to make an underprediction.

予測モデル合成法定義部５の定義方法として四つの方法を説明したが、予測モデル合成法定義部５は、これらのいずれか、あるいは、その組み合わせによって予想モデルの合成法と定義することができる。なお、以上の一連の作用は、オフラインで過去のデータを用いて定期的もしくは非定期的に実行される。 Four methods have been described as definition methods for the prediction model synthesis method definition unit 5, but the prediction model synthesis method definition unit 5 can define a prediction model synthesis method using any of these methods, or a combination of these. The above series of operations is executed offline, either periodically or irregularly, using past data.

次に、オフラインで同定および定義した定義式を用いて、オンラインで予測を行う。
オンラインの予測は、時間の進行方向における所定の周期ＴＨ（＜＜ＴＬ（オフライン同定を定期的に行う場合の周期））で行われる。 Next, prediction is performed online using the definition equations identified and defined offline.
Online prediction is performed at a predetermined period TH (<<TL (period when offline identification is performed periodically)) in the direction of time progression.

まず、オンライン予測用データ抽出部３２から周期ＴＨで、入力変数のデータを抽出する。本実施形態では、入力変数は、流量計（雨水ポンプ井流入量計）１１による雨水ポンプ井流入量と、幹線流量計１２による幹線流入量と、幹線水位計１３１～１３ＫによるＫ個の幹線水位計と、地上雨量計１４１～１４ＭによるＭ個の地上雨量と、Ｑ×Ｐメッシュ１５１１～１５ＱＰの各メッシュにおけるレーダ雨量と、を含む。 First, input variable data is extracted from the online prediction data extraction unit 32 at a period TH. In this embodiment, the input variables include the rainwater pump well inflow rate measured by the flow meter (rainwater pump well inflow rate meter) 11, the main inflow rate measured by the main flow meter 12, K main water level gauges measured by the main water level gauges 131-13K, M ground rainfall rates measured by the ground rain gauges 141-14M, and radar rainfall rates in each of the QxP meshes 1511-15QP.

次に、ペアワイズ出力変数予測部６では、ペアワイズ予測モデル同定部４で同定した各入力変数と出力変数のペアワイズ予測モデルを用いて、ｐ個の予測出力を計算する。これは、（５）式、（１４）式、（１５）式などの形で同定したパラメータ値が同定されたペアワイズ予測モデルにオンライン予測用データ抽出部３２で抽出したオンラインの入力変数データを入力することで直ちに計算できる。これがペアワイズ出力変数予測部６の作用である。 Next, the pairwise output variable prediction unit 6 calculates p predicted outputs using the pairwise prediction models of each input variable and output variable identified by the pairwise prediction model identification unit 4. This can be calculated immediately by inputting the online input variable data extracted by the online prediction data extraction unit 32 into the pairwise prediction model whose parameter values have been identified in the form of equations (5), (14), (15), etc. This is the function of the pairwise output variable prediction unit 6.

次に、合成出力変数予測部７では、予測モデル合成法定義部５で定義した（１７）式や（１８）式に相当する合成法の定義式を用いて、ペアワイズ出力変数予測部６から出力される各入力変数に対するペアワイズの予測出力を入力することで合成した予測出力が計算される。 Next, the composite output variable prediction unit 7 uses the definition equations of the synthesis method, which correspond to equations (17) and (18) defined in the prediction model synthesis method definition unit 5, to input the pairwise predicted outputs for each input variable output from the pairwise output variable prediction unit 6, and calculates a synthesized predicted output.

次に、予測誤差評価部８は、合成出力変数予測部７で計算された予測出力の予測結果とペアワイズ出力変数予測部６のペアワイズの予測出力結果とを取得し、時間の進行方向における所定の周期ＴＨで時系列データとして保存する。ここで、保存された予測結果は、誤差評価を行うタイミングまで保持される。予測誤差の評価を行うタイミングは、予測モデルのユーザなどが外部から指示しても良いし、時間の進行方向における所定の周期ＴＭ（例えばＴＨ＜ＴＭ＜ＴＬ）で実施しても良い。いずれの方法であっても、予測誤差の評価を行うタイミングで、評価用データ抽出部３３で、予測誤差評価部８に保存された予測出力の時系列データと対応する時間の計測された出力変数、すなわち、本実施形態では雨水流入量の時系列データを抽出する。 Next, the prediction error evaluation unit 8 obtains the predicted output results calculated by the composite output variable prediction unit 7 and the pairwise predicted output results from the pairwise output variable prediction unit 6, and saves them as time-series data at a predetermined cycle TH in the direction of time progression. The saved prediction results are retained until the time to perform error evaluation. The timing for evaluating the prediction error may be externally instructed by a user of the prediction model, or may be performed at a predetermined cycle TM in the direction of time progression (e.g., TH<TM<TL). In either case, at the time to evaluate the prediction error, the evaluation data extraction unit 33 extracts the output variable measured at the time corresponding to the time-series data of the predicted output saved in the prediction error evaluation unit 8, i.e., in this embodiment, the time-series data of stormwater inflow.

そして、予測誤差評価部８では、抽出された出力（雨水流入量）の時系列データとｐ個のペアワイズ予測モデルの予測出力との誤差、および、合成した予測出力の誤差を評価する。この際、誤差の評価は、通常時系列解析で行われるＭＳＥ（平方２乗誤差＝Ｌ２誤差）やその平方根であるＲＭＳＥなどであっても良いが、目的に応じて、誤差評価の評価基準を適宜設定しておいても良い。例えば、ピークの雨水流入量の予測精度が重要である場合はピーク流入量の差や誤差が最大になる場合の差（Ｌ∞誤差＝∞ノルム誤差）を誤差評価基準としても良いし、Nash-Sutcliffe係数などの評価基準を用いても良い。また、流入量が急激に増加する場合の遅れを改善する事を目的としたい場合には、予測出力と実績出力との位相差を計算し、位相差を小さくすることを誤差の評価基準としても良い。いずれにしても、予測誤差評価部８は、設定した誤差評価基準に基づいて、所定のタイミングもしくは周期ＴＭで、時系列データとｐ個のペアワイズ予測モデルの予測出力との誤差、および、合成した予測出力との誤差を評価する。 The forecast error evaluation unit 8 then evaluates the error between the time series data of the extracted output (stormwater inflow) and the predicted output of the p pairwise forecast models, as well as the error of the combined forecast output. In this case, the error evaluation can be performed using MSE (mean square error = L2 error), which is typically performed in time series analysis, or its square root, RMSE. However, appropriate error evaluation criteria can be set depending on the purpose. For example, if the prediction accuracy of peak stormwater inflow is important, the error evaluation criteria can be the difference in peak inflow or the difference at the maximum error (L∞ error = ∞ norm error). Evaluation criteria such as the Nash-Sutcliffe coefficient can also be used. Furthermore, if the goal is to improve delays when inflow increases suddenly, the phase difference between the predicted output and actual output can be calculated, and the reduction of the phase difference can be used as the error evaluation criterion. In either case, the forecast error evaluation unit 8 evaluates the error between the time series data and the predicted output of the p pairwise forecast models, as well as the error of the combined forecast output, at a predetermined timing or period TM based on the set error evaluation criteria.

次に、ペアワイズ予測モデル修正部９では、予測誤差評価部８で評価した予測誤差に基づいて、ペアワイズ予測モデルのパラメータの調整、あるいは、特定のペアワイズモデルの削除、もしくは、複数のペアワイズモデルの統合を行う。 Next, the pairwise prediction model correction unit 9 adjusts the parameters of the pairwise prediction model, deletes specific pairwise models, or integrates multiple pairwise models based on the prediction error evaluated by the prediction error evaluation unit 8.

以下に、上記のような調整機能、削除機能、および、統合機能を付加したモジュラー型予測をより具体化した方法の例について説明する。
一つ目のパラメータ調整について、ペアワイズ予測モデル修正部９は、予測誤差評価部８で評価した予測誤差に基づいて、著しく予測精度が悪い（予測誤差が所定の許容誤差より大きい）と判断される入出力ペアを抽出し、抽出された入出力ペアのペアワイズ予測モデルの再同定を行う。予測精度の悪さの判定は、（１）予測誤差に対するしきい値（許容誤差）を直接指定してしきい値を超過したペアワイズ予測モデルを予測精度の悪いモデル（あるいは入出力ペア）と判断する、（２）合成した予測出力の誤差とペアワイズ予測モデルの各予測誤差とを比較し、合成した予測誤差に対して相対的に著しく誤差が大きい（予め設定された許容範囲を超える）予測誤差を持つペアワイズ予測モデルを予測精度の悪いモデルと判断する、（３）ペアワイズ予測モデルの予測誤差（＋合成した予測誤差も含んでも良い）に対する位置母数μｅと尺度母数σｅを計算し、μｅ±ｋσｅ（ｋは設定するパラメータで２～３ぐらいで通常は設定する）から外れた予測誤差を持つペアワイズ予測モデルを予測精度の悪いモデルと判断する、などの方法で行うことができる。（１）－（３）のような判断によって、パラメータの再調整（再同定）が必要と判断された場合、ペアワイズ予測モデル修正部９は、所定の期間の直近のデータを用いてパラメータの再同定を行う。 An example of a more specific method of modular prediction to which the above-described adjustment function, deletion function, and integration function are added will be described below.
For the first parameter adjustment, the pairwise prediction model correction unit 9 extracts input/output pairs that are judged to have significantly poor prediction accuracy (prediction errors larger than a predetermined allowable error) based on the prediction errors evaluated by the prediction error evaluation unit 8, and re-identifies the pairwise prediction models of the extracted input/output pairs. The poor prediction accuracy can be determined by the following methods: (1) directly specifying a threshold (tolerance) for the prediction error and determining a pairwise prediction model (or input/output pair) that exceeds the threshold as having poor prediction accuracy; (2) comparing the error of the combined prediction output with each prediction error of the pairwise prediction model and determining a pairwise prediction model with a prediction error that is significantly larger (exceeding a preset tolerance) than the combined prediction error as having poor prediction accuracy; or (3) calculating the location parameter μe and the scale parameter σe for the prediction error of the pairwise prediction model (which may include the combined prediction error) and determining a pairwise prediction model with a prediction error outside μe±kσe (where k is a parameter that can be set and is usually set to about 2 to 3) as having poor prediction accuracy. If it is determined that parameter readjustment (re-identification) is necessary based on the determinations (1) to (3), the pairwise prediction model correction unit 9 re-identifies the parameters using the most recent data from a predetermined period.

本実施形態において、例えば、Ｍ個の地上雨量データの中のいずれかの地上雨量データと雨水流入量との間のペアワイズ予測モデルの精度が悪化したと判断された場合、ペアワイズ予測モデル修正部９は、地上雨量と雨水流入量との間のペアワイズ予測モデルのパラメータのみを再同定する。このように部分的な同定による再調整を行う事ができる点は、モジュラー型の構成（構成的なアプローチ）の大きな利点である。 In this embodiment, for example, if it is determined that the accuracy of the pairwise forecast model between any of the M pieces of ground rainfall data and rainwater inflow has deteriorated, the pairwise forecast model correction unit 9 re-identifies only the parameters of the pairwise forecast model between ground rainfall and rainwater inflow. The ability to perform readjustment through partial identification in this way is a major advantage of a modular configuration (configurative approach).

二つ目の特定のペアワイズモデルの削除は、一つ目のパラメータの調整と類似の手続きで行うことができる。基本的な手順としては、ペアワイズ予測モデル修正部９は、一つ目のパラメータの調整で述べた方法と類似の方法で誤差評価を行い、誤差が著しく大きくなり精度が劣化したペアワイズ予測モデルを抽出し、抽出したペアワイズモデルを削除する。誤差評価の方法は一つ目の方法と同じであるが、ペアワイズ予測モデル修正部９は、（１）劣化の程度を判断するしきい値を一つ目のものよりも大きくして、一つ目の調整よりも明らかに予測精度が劣化しているものを削除対象とする、（２）まず一つ目のパラメータの再調整を行い、パラメータの再調整を行っても精度劣化の判断に用いた基準をクリアできない場合に削除対象とする、などの方法で削除すべきペアワイズ予測モデルを抽出する。 The second specific pairwise model can be deleted using a procedure similar to that used for adjusting the first parameter. The basic procedure involves the pairwise prediction model correction unit 9 evaluating the error using a method similar to that described for adjusting the first parameter, extracting pairwise prediction models with significantly larger errors and degraded accuracy, and deleting the extracted pairwise prediction models. The error evaluation method is the same as the first method, but the pairwise prediction model correction unit 9 extracts pairwise prediction models to be deleted by (1) setting a higher threshold for judging the degree of degradation than the first one, and deleting models whose prediction accuracy is clearly worse than after the first adjustment, or (2) first readjusting the first parameter, and deleting models that still do not meet the criteria used to determine accuracy degradation even after parameter readjustment.

本実施形態においては、例えば、幹線水位計などは水位計が水没することなどにより、水位を正確に測ることができなくなりアウトライア（異常値）が多量に混入してしまうことなどが考えられる。この場合、水没した水位計によるペアワイズ予測モデルは分離されることになる。このように、予測精度に悪影響を与えている部分を部分的に切り離すことで予測精度の改善を図れることもモジュラー型の構成（構成的なアプローチ）の大きな利点である。 In this embodiment, for example, if a main water level meter becomes submerged, it may become unable to accurately measure water levels, resulting in the inclusion of a large number of outliers (abnormal values). In this case, the pairwise prediction model based on the submerged water level meter will be separated. In this way, a major advantage of a modular configuration (configurative approach) is that prediction accuracy can be improved by partially isolating parts that are negatively affecting prediction accuracy.

３つ目のペアワイズ予測モデルの統合は、以下の様に実施される。まず、ペアワイズ予測モデル修正部９は、ペアワイズ予測モデルの予測誤差を相互に比較評価し、その予測誤差同士の差の絶対値が所定のしきい値以下となる相互に類似の予測精度を持つペアワイズ予測モデルを、相互に類似する予測モデルとして抽出しグループ化する。 The third integration of pairwise prediction models is performed as follows: First, the pairwise prediction model correction unit 9 compares and evaluates the prediction errors of the pairwise prediction models, and extracts and groups pairwise prediction models with similar prediction accuracy, where the absolute value of the difference between their prediction errors is below a predetermined threshold, as similar prediction models.

次に、ペアワイズ予測モデル修正部９は、このようにしてグループ化されたペアワイズ予測モデルの持つパラメータの値をグループ内で相互に比較する。例えば、係数パラメータａｋ、ｋ＝１、２、…、ｎをベクトル化してＡ＝［ａ１、ａ２、…、ａｎ］とし、相互に比較する予測モデルの係数パラメータベクトル同士の類似度を例えば内積などにより算出し、所定のしきい値により類似していると判断されたペアワイズ予測モデルを統合の対象候補とする。 Next, the pairwise prediction model correction unit 9 compares the parameter values of the pairwise prediction models grouped in this way within the group. For example, the coefficient parameters a k, k = 1, 2, ..., n are vectorized to A = [a1, a2, ..., an], and the similarity between the coefficient parameter vectors of the prediction models being compared is calculated, for example, using the inner product, and pairwise prediction models that are determined to be similar according to a predetermined threshold value are selected as candidates for integration.

本実施形態のモジュラー型の構成的なアプロ―チでは、パラメータの可同定性（一意決定可能性）が基本的に担保されているので、予測結果が類似しておりパラメータ値も類似していれば、その入力変数はほぼ同じ値を持つはずであるが、念のため、ペアワイズ予測モデル修正部９は、候補として抽出されたペアワイズ予測モデルの入力変数同士の相関係数を求めて、所定のしきい値（例えば、相関係数０．９５）以上である場合に最終的に統合すべきペアワイズ予測モデルと判断する。そして、ペアワイズ予測モデル修正部９は、最終的に統合すべきと判断された複数のペアワイズ予測モデルの入力変数の平均をとった変数を、新たな一つの合成された入力変数として定義する。 In the modular constructive approach of this embodiment, parameter identifiability (unique decidability) is basically guaranteed, so if the prediction results and parameter values are similar, the input variables should have approximately the same values. However, just to be sure, the pairwise prediction model correction unit 9 calculates the correlation coefficient between the input variables of the pairwise prediction models extracted as candidates, and if it is equal to or greater than a predetermined threshold (e.g., a correlation coefficient of 0.95), it determines that the pairwise prediction models should ultimately be merged. The pairwise prediction model correction unit 9 then defines a variable that is the average of the input variables of the multiple pairwise prediction models determined to ultimately be merged as a new, combined input variable.

そして、ペアワイズ予測モデル修正部９は、（１）先に比較した係数ベクトルの平均値を採用する、あるいは、（２）新たに合成した平均化された入力変数と出力変数に対して再同定を行って推定する、のいずれかの方法を用いて、合成された入力変数に対するペアワイズ予測モデルの係数を決定する。 The pairwise prediction model correction unit 9 then determines the coefficients of the pairwise prediction model for the combined input variables using either (1) adopting the average value of the previously compared coefficient vectors, or (2) re-identifying and estimating the newly combined averaged input and output variables.

上記のような統合は、本実施形態のモジュラー型時系列データ予測装置においては、例えば次の様なケースにおいて生じると考えられる。入力変数の中にＱ×Ｐメッシュ１５１１～１５ＱＰの各々のレーダ雨量のデータが含まれているケースにおいて、レーダ雨量のメッシュのサイズは、例えばXRAINと呼ばれるＸバンドの気象レーダの場合約２５０ｍ×２５０ｍであるため、隣接するメッシュの雨量データはほとんど同じ値をとる場合が多い。このような場合、相互に隣接するメッシュの雨量データは統合の対象となる可能性が高く、Ｑ×Ｐ個の入力変数が、少数の互いに独立した合成入力変数として統合されることになる可能性が高い。このような統合が起こると、レーダ雨量のメッシュ状のデータが、雨水流入量に与える影響の類似性に応じて統合されるため、入力変数の数を減らせるだけでなく、説明性が向上する事が期待できる。 In the modular time series data forecasting device of this embodiment, the above-mentioned integration is thought to occur, for example, in the following case. In a case where the input variables include radar rainfall data for each of QxP meshes 1511 to 15QP, the size of the radar rainfall mesh is approximately 250m x 250m for an X-band weather radar called XRAIN, for example, so the rainfall data in adjacent meshes often have almost identical values. In such cases, the rainfall data in adjacent meshes is likely to be integrated, and the QxP input variables are likely to be integrated into a small number of independent composite input variables. When this integration occurs, the radar rainfall mesh data is integrated according to the similarity of its impact on rainwater inflow, which not only reduces the number of input variables but is also expected to improve interpretability.

また、このような場合は、先に述べた多重共線性の問題が生じているケースであるが、これに対し、例えば、Lassoと呼ばれる方法などを用いて入力変数を自動的に選択すると、いくつかのメッシュの雨量データが代表的な入力変数として選択され、選択された入力変数に対応するメッシュの雨量と極めて類似するメッシュの雨量データは無視されることになる。雨量データが極めて類似しているので、このように代表メッシュの雨量を入力変数として選択しても通常は問題がない。しかしながら、近年頻発している局所的な豪雨が見られる場合、例えば、たまたま選択したメッシュの雨量はあまり多くないのに、入力変数として選択されなかった近くのメッシュの雨量が非常に大きい様な局所的な降雨が生じた場合、あるメッシュを代表メッシュとして選択してしまうと、流入量が増加しないという誤った予測を行う可能性がある。 In addition, in cases like this, the problem of multicollinearity mentioned earlier occurs. To address this issue, if input variables are automatically selected using a method such as Lasso, rainfall data from several meshes will be selected as representative input variables, and rainfall data from meshes that are very similar to the rainfall in the mesh corresponding to the selected input variables will be ignored. Because the rainfall data is very similar, there is usually no problem with selecting rainfall from representative meshes in this way as input variables. However, in cases where there has been frequent localized heavy rainfall in recent years, for example, if there is localized rainfall where the rainfall in the selected mesh happens to be not very heavy, but a nearby mesh that was not selected as an input variable experiences very heavy rainfall, selecting a certain mesh as the representative mesh could result in an erroneous prediction that inflow volume will not increase.

一方、ここで述べた入力変数の統合を行うと、類似の影響を与えるメッシュの雨量が平均化され、これは統合されたメッシュの平均雨量という意味を持つため、このようなケースでも妥当な予測結果を与える事が期待できると同時に、合理的な説明性も向上する。このように統合すべき入力変数をまとめて解釈し、説明しやすくするようにできる事もモジュラー型の構成（構成的なアプローチ）の大きな利点である。 On the other hand, when the input variables described here are integrated, the rainfall in meshes with similar effects is averaged, which means the average rainfall in the integrated meshes. Therefore, even in such cases, it is expected that reasonable forecast results will be obtained, while also improving rational interpretability. The ability to collectively interpret input variables to be integrated in this way, making them easier to explain, is another major advantage of a modular structure (constructive approach).

最後に、合成出力変数予測部７で算出（合成）された予測出力は、出力予測結果観測部１０に送られ、例えばいわゆるトレンドグラフと呼ばれる時系列データとしてユーザであるプラントの管理者や運転員に提示される。この際、リアルタタイム監視している監視の現在までの値は、実際の出力値である雨水流入量とその予測値を同時に表示し、将来の予測値は予測出力のみを表示する様にしておくことが好ましい。 Finally, the predicted output calculated (synthesized) by the composite output variable prediction unit 7 is sent to the output prediction result observation unit 10 and presented to the user, i.e., the plant manager or operator, as time-series data known as a trend graph. At this time, it is preferable that the current real-time monitored values be displayed simultaneously with the actual output value (stormwater inflow volume) and its predicted value, and that future predicted values be displayed only as the predicted output.

また、合成した予測出力（予測雨水流入量）だけでなく、ペアワイズ予測モデルのｐ個の予測出力を用いて、その位置母数μｙと尺度母数σｙを計算し、μｙ±Ｋσｙ（Ｋは設定パラメータで２～３程度で設定する）の範囲を同時表示したり、あるいは、ペアワイズ予測モデルの予測出力の中から最大値（あるいはロバスト性を考えて最大値に近い９５％点にもっとも近い予測値）や最小値（あるいはロバスト性を考えて最小値に近い５％点にもっとも近い予測値）を直接予測出力の範囲として同時に表示したりしても良い。 In addition to the synthesized forecast output (forecasted stormwater inflow), the location parameter μy and scale parameter σy can be calculated using the p forecast outputs of the pairwise forecast model, and the range of μy±Kσy (K is a setting parameter set to about 2-3) can be simultaneously displayed, or the maximum value (or the forecast value closest to the 95th percentile, which is close to the maximum value, considering robustness) and minimum value (or the forecast value closest to the 5th percentile, which is close to the minimum value, considering robustness) from the forecast outputs of the pairwise forecast model can be simultaneously displayed as the range of the direct forecast output.

本実施形態のモジュラー型時系列データ予測装置によれば、一般にはブラックボックスモデルと呼ばれるモデルに対して、その内部パラメータに対して物理法則に矛盾しない合理的な解釈が可能になる様な仕組みを導入することで、ホワイトボックスモデルに近い、モデルの解釈と合理的な説明性を向上させて、部分的な調整、削除、統合などのモデルの維持管理持管理（メンテナンス）の煩雑さを大幅に低減できるという効果が得られる点である。 The modular time series data forecasting device of this embodiment introduces a mechanism that enables a rational interpretation of the internal parameters of what are generally called black box models that is consistent with the laws of physics. This improves the model's interpretability and rational explainability, making it closer to white box models, and significantly reduces the complexity of model maintenance, such as partial adjustments, deletions, and integrations.

なお、このような効果を得るための根本的なアイデアは、本来、入出力データしか与えられていない内部構造が完全に未知である状況下で、出力に関連する可能性のある要素を要素毎に出力と直接関係づけ（ペアワイズ予測モデルに対応）、各要素と出力の関係を、最終的な入出力関係の内部構造として定義される様に、構成的に予測モデルを構築していることである。このようなアイデアを用いることで、本実施形態のモジュラー型時系列データ予測装置では、本来知ることのできない内部構造に立ち入ることなく、疑似的にホワイトボックスモデルと同様の説明性と調整の容易さを実現している。 The fundamental idea behind achieving this effect is to construct a prediction model constructively, in a situation where only input and output data are given and the internal structure is completely unknown, by directly associating elements that may be related to the output with each element (corresponding to a pairwise prediction model) and defining the relationship between each element and the output as the final internal structure of the input and output relationships. By using this idea, the modular time series data prediction device of this embodiment achieves the same explainability and ease of adjustment as a white-box model, without delving into the internal structure, which would otherwise be unknowable.

また、このモジュラー型時系列予測モデルの構築方法は、統計的機械学習分野でよく知られている。先に述べたバギングやブースティングなどのアンサンブル学習と呼ばれる手法とも類似している。バギングやブースティングは、弱学習器（弱学習モデル）と呼ばれる、簡単に構成できるが、精度が必ずしも高くないモデルをたくさん集めることで、精度の高い強学習器（強学習モデル）と呼ばれる予測モデルを構築する方法であり、バギングは弱学習器に対して多数決などの処理で強学習を行う方法、ブースティングは、弱学習器の結果を用いて強学習器を学習させる方法である。ただし、バギングやブースティングでは、学習(同定)に利用データを変えることでたくさんの弱学習器を生成しているが、本実施形態では、同定データを変えるのではなく、説明変数を変えることで、弱学習器に相当するペアワイズ予測モデルを同定している点で異なっている。なお、本実施形態のモジュラー型時系列データ予測装置において、合成出力の予測方法を定義する際に、ロバスト位置母数推定や予め指定した重みを用いた重み付き平均を用いるような方法はバギングの考え方に近く、重み付き平均の重みを再同定によって求める方法はブースティングの考え方に近い。 This method of constructing a modular time series prediction model is well known in the field of statistical machine learning. It is similar to ensemble learning techniques such as bagging and boosting, mentioned above. Bagging and boosting are methods for constructing a highly accurate prediction model called a strong learner (strong learning model) by collecting many weak learners (weak learning models), which are easily constructed but not necessarily highly accurate. Bagging is a method of performing strong learning on weak learners using processes such as majority voting, while boosting is a method of training a strong learner using the results of weak learners. However, while bagging and boosting generate many weak learners by changing the data used for learning (identification), this embodiment differs in that pairwise prediction models equivalent to weak learners are identified by changing the explanatory variables rather than changing the identification data. Note that in the modular time series data prediction device of this embodiment, when defining the prediction method for the composite output, methods such as robust location parameter estimation and weighted averaging using pre-specified weights are similar to the bagging approach, while methods for determining the weights for the weighted average by re-identification are similar to the boosting approach.

以下、本実施形態のモジュラー型の時系列予測装置におけるペアワイズ予測モデルの複数の実施例について説明する。
（第１実施例）
図５は、一実施形態のモジュラー型の時系列予測装置におけるペアワイズ予測モデルの第１実施例について説明するための図である。
図５では、（１）式のＦＩＲモデルやＡＲモデルをペアワイズ予測モデルとした例を示している。 Below, several examples of pairwise prediction models in the modular time series prediction device of this embodiment will be described.
(First Example)
FIG. 5 is a diagram illustrating a first example of a pairwise prediction model in a modular time series prediction device according to an embodiment.
FIG. 5 shows an example in which the FIR model or AR model of equation (1) is used as the pairwise prediction model.

本実施例のペアワイズ予測モデルＭＡは、回帰ブロックＡ１－ＡＮと、予測出力の合成ブロックＡ５と、を備えている。
例えば回帰ブロックＡ１－Ａ３のそれぞれは、入力変数に対して重み付き移動平均成分（ＦＩＲ成分／ＭＡ成分）を予測値として出力する。例えば回帰ブロックＡＮは、入力変数（過去の出力変数）に対して自己回帰成分（ＡＲ成分）を予測値として出力する。 The pairwise prediction model MA of this embodiment comprises regression blocks A1-AN and a synthesis block A5 of the predicted output.
For example, each of the regression blocks A1-A3 outputs a weighted moving average component (FIR component/MA component) as a predicted value for the input variable. For example, the regression block AN outputs an autoregressive component (AR component) as a predicted value for the input variable (past output variable).

予測出力の合成ブロックＡ５は、回帰ブロックＡ１－ＡＮから出力された予測値を取得し、予測値を合成した合成予測値（Y(t+1)=F(Y1(t+1),Y2(t+1),…,Yn(t+1))）を演算して出力する。
例えば、通常のＡＲＸモデルなどによってパラメータを同定すると自己回帰成分（ＡＲ成分）の重みが大きくなり、ＦＩＲ成分に相当する入力変数の値が急激に変化した場合に、予測出力が計測出力に追従できなくなり予測に位相遅れが生じる事がある。しかし、本実施例の構成をとることにより、ＡＲ成分の影響を調整することが可能になり、位相遅れを改善して計測出力の値が変化する前に出力の変化を予測することが可能になる。 The prediction output synthesis block A5 acquires the prediction values output from the regression blocks A1-AN, and calculates and outputs a synthesized prediction value (Y(t+1) = F(Y1(t+1), Y2(t+1), ..., Yn(t+1))) by synthesizing the prediction values.
For example, when parameters are identified using a normal ARX model or the like, the weight of the autoregressive component (AR component) becomes large, and if the value of the input variable corresponding to the FIR component changes suddenly, the predicted output may not be able to follow the measured output, resulting in a phase lag in the prediction. However, by adopting the configuration of this embodiment, it is possible to adjust the influence of the AR component, improve the phase lag, and predict changes in the output before the value of the measured output changes.

（第２実施例）
図６は、一実施形態のモジュラー型の時系列予測装置におけるペアワイズ予測モデルの第２実施例について説明するための図である。
図６は、合成した予測に傾向（バイアス）を持たせたい場合に用いられる予測モデルの一例として、意図的に過大の予測および過小の予測を行う場合の予測モデルを示している。 (Second Example)
FIG. 6 is a diagram illustrating a second example of a pairwise prediction model in a modular time series prediction device according to an embodiment.
FIG. 6 shows a prediction model for intentionally overpredicting and underpredicting, as an example of a prediction model used when it is desired to impart a bias to the combined prediction.

本実施例のペアワイズ予測モデルＭＢは、回帰ブロックＢ１－ＢＮと、予測出力の合成ブロックＢ５と、を備え、合成ブロックＢ５が、過大推定予測値（Y₊(t+1)=F₊(Y1(t+1),Y2(t+1),…,Yn(t+1))）と、通常推定予測値（Y(t+1)=F(Y1(t+1),Y2(t+1),…,Yn(t+1))）と、過小推定予測値（Y_－(t+1)=F_－(Y1(t+1),Y2(t+1),…,Yn(t+1))）と、を出力する点において上述の第１実施例のペアワイズ予測モデルＭＡと異なっている。 The pairwise prediction model MB of this embodiment includes regression blocks B1-BN and a prediction output synthesis block B5. The synthesis block B5 outputs an overestimated prediction value (Y ₊ (t+1) = F ₊ (Y1(t+1), Y2(t+1), ..., Yn(t+1))), a normal estimation prediction value (Y(t+1) = F(Y1(t+1), Y2(t+1), ..., Yn(t+1))), and an underestimated prediction value (Y _- (t+1) = F _- (Y1(t+1), Y2(t+1), ..., Yn(t+1))).

本実施例では、ペアワイズ予測モデルの位置母数と尺度母数とを用いて、通常の予測に加えて過大な予測および過小な予測を行うことにより、安全側の予測（リスクを回避する側の予測）などのニーズ（要求）に応じて、予測結果を微調整することができる。 In this embodiment, the location parameters and scale parameters of the pairwise prediction model are used to make overpredictions and underpredictions in addition to normal predictions, allowing for fine-tuning of prediction results according to needs (requests), such as conservative predictions (predictions that avoid risk).

図７および図８は、第２実施例のペアワイズ予測モデルの効果を説明するための図である。
図７は、回帰手法の一種であるL1正則化を導入したLasso回帰による雨水流入予測値と測定値との一例を示している。
図８は、第２実施例の予測モデルによる雨水流入予測値と測定値との一例を示している。 7 and 8 are diagrams for explaining the effect of the pairwise prediction model of the second embodiment.
FIG. 7 shows an example of the rainwater inflow predicted value and the measured value obtained by Lasso regression with L1 regularization, which is a type of regression method.
FIG. 8 shows an example of the rainwater inflow predicted value and the measured value according to the prediction model of the second embodiment.

図７に示す雨水流入予測値と測定値とを比較すると、予測誤差は小さいが、若干推定値が測定値よりも小さい結果となっており、過小推定になっている事がわかる。
一方、図８に示す本実施例の手法で得られた通常の推定（平均的な推定）、過大推定、および、過少推定を行った結果と測定値とを比較すると、通常の推定値（予測流入量）では予想誤差は小さいく、ピーク付近での予測精度は良いものの流入量が増加するタイミングでの予測値が測定値を下回っている。 Comparing the predicted rainwater inflow values shown in FIG. 7 with the measured values, it can be seen that although the prediction error is small, the estimated values are slightly smaller than the measured values, resulting in an underestimation.
On the other hand, when comparing the results of normal estimation (average estimation), overestimation, and underestimation obtained by the method of this embodiment shown in Figure 8 with the measured values, the normal estimates (predicted inflow volumes) have small prediction errors and good prediction accuracy near peaks, but the predicted values at times when inflow volumes increase are lower than the measured values.

このような場合、この予測値を制御に用いる場合には、もう少し流入量急増時の予測を過大に評価しておきたい場合がある。このとき、過大推定予測を行うと、全体的にＭＳＥ（平均２乗誤差）での予測精度は劣化するが、流入量急増時に急増する可能性を示唆する予測が可能になることがわかる。また、通常の予測値よりも過小に評価をしておきたい場合には、過小推定予測を行うと、予測値が小さくなるように調整することができる。 In such cases, if this predicted value is to be used for control, it may be desirable to overestimate the prediction when there is a sudden increase in inflow. In this case, if an overestimated prediction is made, the overall prediction accuracy in terms of MSE (mean squared error) will deteriorate, but it will be possible to make a prediction that suggests the possibility of a sudden increase in inflow when it occurs. Also, if you want to underestimate the normal predicted value, you can adjust the predicted value to be smaller by making an underestimated prediction.

このような過大推定および過小推定により予測値を調整可能とすると、長期の運用の結果、通常の予測値が実測値に対して過小推定、あるいは、過大推定になっている様な傾向（バイアス）が見られる場合、通常の予測値を過大推定値、あるいは過小推定値に置き換えることで、バイアスを補正してより精度の高い予測を行うことができる。 By making it possible to adjust predicted values using such overestimation and underestimation, if, as a result of long-term operation, a tendency (bias) is observed in which normal predicted values tend to underestimate or overestimate actual values, the normal predicted values can be replaced with overestimated or underestimated values to correct the bias and make more accurate predictions.

（第３実施例）
図９は、一実施形態のモジュラー型の時系列予測装置におけるペアワイズ予測モデルの第３実施例について説明するための図である。
本実施例では、ペアワイズ予測モデルとして、（５）式のようなＡＲモデル／ＦＩＲモデルをさらに分解し、時間遅れを考慮した変数を各々一つの説明変数と見なした単回帰モデルに分解して予測モデルを構築している。 (Third Example)
FIG. 9 is a diagram illustrating a third example of a pairwise prediction model in a modular time series prediction device according to an embodiment.
In this example, as a pairwise prediction model, an AR model/FIR model such as that in equation (5) is further decomposed, and a prediction model is constructed by decomposing the variables that take time delays into account into a simple regression model in which each variable is regarded as a single explanatory variable.

本実施例のペアワイズ予測モデルＭＣは、複数の時間遅れブロックと、複数の単回帰ブロックと、相関重み付き平均ブロックと、を含む予測出力ブロックＣ１－ＣＮを入力変数１－Ｎ毎に含み、当該複数の予測出力ブロックＣ１－ＣＮから出力された予測値を合成した最終合成予測値を出力する予測出力値の合成ブロックＣ（Ｎ＋１）を更に備える。 The pairwise prediction model MC of this embodiment includes prediction output blocks C1-CN, each including multiple time delay blocks, multiple simple regression blocks, and a correlation weighted average block, for each input variable 1-N, and further includes a prediction output value synthesis block C(N+1) that outputs a final synthesis prediction value by synthesizing the prediction values output from the multiple prediction output blocks C1-CN.

予測出力ブロックＣ１は、複数（ｍ）の時間遅れブロック１１１－１１ｍと、複数（ｍ）の単回帰ブロック１２１－１２ｍと、相関重み付き平均ブロック１３と、を含む。
複数の時間遅れブロック１１１－１１ｍは、入力変数１（例えば時刻ｔにおける値）に対してそれぞれに割り当てられた遅れ時間Ｌ１１－Ｌ１ｍ１だけ遅れた変数（例えば時刻ｔ－Ｌ１１、…、ｔ－Ｌ１ｍ１における値）を出力する。 The prediction output block C1 includes a plurality (m) of time lag blocks 111-11m, a plurality (m) of simple regression blocks 121-12m, and a correlation weighted average block 13.
The plurality of time delay blocks 111-11m output variables (for example, values at times t-L11, . . . , t-L1m1) delayed by the delay times L11-L1m1 assigned to the input variable 1 (for example, value at time t).

複数の単回帰ブロック１２１－１２ｍは、単回帰係数を用いて、時間遅れブロック１１１－１１ｍから出力された変数に基づく予測値を演算する出力する。
相関重み付き平均ブロックは、複数の単回帰ブロック１２１－１２ｍから出力された複数の予測値の相関重み付き平均値を演算して出力する。 The plurality of simple regression blocks 121-12m use simple regression coefficients to calculate and output predicted values based on the variables output from the time lag blocks 111-11m.
The correlation weighted average block calculates and outputs a correlation weighted average of a plurality of predicted values output from a plurality of simple regression blocks 121-12m.

予測出力ブロックＣ２－ＣＮは、上記予測出力ブロックＣ１と同様の構成であるため、個々の説明は省略する。
予測出力値の合成ブロックＣ（Ｎ＋１）は、予測出力ブロックＣ１－ＣＮから出力された予測値を合成（例えば、相関重み付き平均）して、最終推定予測値（Y(t+1)=F(Y1(t+1),Y2(t+1),…,Yn(t+1))）を出力する。 The predicted output blocks C2-CN have the same configuration as the predicted output block C1, and therefore individual explanations will be omitted.
The synthesis block C(N+1) of the predicted output value synthesizes (for example, performs a correlation weighted average) the predicted values output from the prediction output blocks C1-CN and outputs the final estimated predicted value (Y(t+1) = F(Y1(t+1), Y2(t+1), ..., Yn(t+1))).

本実施例のペアワイズ予測モデルは、遅れ、ゲイン、バイアスの３項でモデルを説明することが可能であり、パラメータの微調整が容易である。また、単回帰係数と相関係数とには陽な関係があるため、各入力変数と各出力変数との相関係数を求めるだけで全体の予測モデルの構築ができ、予測モデルの解釈が容易である。
更に、本実施例のペアワイズ予測モデルでは、第１実施例のペアワイズ予測モデルと同様の効果を得ることができる。 The pairwise prediction model of this embodiment can be explained using three terms: delay, gain, and bias, making it easy to fine-tune the parameters. Furthermore, since there is a positive relationship between the simple regression coefficient and the correlation coefficient, the entire prediction model can be constructed simply by calculating the correlation coefficient between each input variable and each output variable, making it easy to interpret the prediction model.
Furthermore, the pairwise prediction model of this embodiment can achieve the same effects as the pairwise prediction model of the first embodiment.

図１０および図１１は、第３実施例のペアワイズ予測モデルの効果の一例を説明するための図である。
図１０および図１１では、第３実施例のペアワイズ予測モデルに対し１０個の入力変数を入力して雨水流入予測を行った例である。ここでは、各入力変数に対する遅れ時間を一つだけ考えた最も単純な場合の予測結果の例を示している。 10 and 11 are diagrams for explaining an example of the effect of the pairwise prediction model of the third embodiment.
10 and 11 show an example of rainwater inflow prediction using 10 input variables input to the pairwise prediction model of Example 3. Here, an example of the prediction results for the simplest case is shown, where only one delay time is considered for each input variable.

入力変数の一つである降雨強度（地上雨量）の遅れ時間を推定すると３３分という推定結果が得られた。図１１は、上記推定結果を用いた雨水流入量の予測結果の一例を示している。図１１の予測結果によれば、全体的には、ある程度の精度で予測ができているが、流入量が急増する部分での予測の遅れ（位相遅れ）が大きい事がわかる。 When estimating the delay time for rainfall intensity (ground rainfall), one of the input variables, an estimated result of 33 minutes was obtained. Figure 11 shows an example of the rainwater inflow forecast results using the above estimation results. The forecast results in Figure 11 show that, overall, predictions were made with a certain degree of accuracy, but there was a large delay in the forecast (phase lag) in areas where the inflow volume increased sharply.

この時、位相遅れが大きい理由として、遅れ時間３３分というものが大きすぎる可能性が推測できる。この推測に基づき、図１０に、遅れ時間を１５分と調整した場合の雨水流入量の予測結果の一例を示す。図１０に示す予測結果によれば、遅れ時間を調整することにより、全体的な予測精度には大きな影響を与えることなく、流入量急増時の位相遅れを大幅に改善できていることがわかる。 In this case, it can be assumed that the reason for the large phase lag is that the delay time of 33 minutes is too long. Based on this assumption, Figure 10 shows an example of the forecast results for rainwater inflow when the delay time is adjusted to 15 minutes. The forecast results shown in Figure 10 show that by adjusting the delay time, the phase lag during sudden increases in inflow can be significantly improved without significantly affecting the overall forecast accuracy.

このように、本実施例のペアワイズ予測モデルによれば、容易に解釈可能なパラメータのみで予測モデルを構築可能であり、説明性が向上するだけでなく予測モデルの微調整を容易に行うことができる。 In this way, the pairwise prediction model of this embodiment makes it possible to build a prediction model using only easily interpretable parameters, which not only improves interpretability but also makes it easy to fine-tune the prediction model.

また、単回帰係数と相関係数とは、入出力データを各々平均と標準偏差とで正規化すれば、一致することが知られており、これを逆に非正規化すれば、回帰係数とバイアスとに換算することができる。したがって、各入出力データの平均、標準偏差、および、入出力の相関のみを計算すれば、遅れ時間を指定するだけで、全ての予測モデルのパラメータを同定することができる。 It is also known that the simple regression coefficient and correlation coefficient will match if the input and output data are normalized by their respective means and standard deviations; conversely, if they are denormalized, they can be converted into regression coefficients and biases. Therefore, by calculating only the mean, standard deviation, and input/output correlation of each input and output data, it is possible to identify the parameters of all prediction models simply by specifying the delay time.

さらに、合成出力の計算を相関係数の重み付き平均で計算する方法とすることにより、予測モデルを構築するために必要な計算は、遅れ時間の推定（指定）と、入出力変数の平均および標準偏差と、入出力変数の相関と、のみであり、予測モデルに含まれるパラメータの全てを、この４種類の推定値（遅れ時間、平均、標準偏差、相関係数）だけを用いて説明することが可能になる。 Furthermore, by calculating the composite output using a weighted average of the correlation coefficient, the only calculations required to build a prediction model are the estimation (specification) of the delay time, the mean and standard deviation of the input and output variables, and the correlation between the input and output variables. All of the parameters included in the prediction model can be explained using only these four estimated values (delay time, mean, standard deviation, and correlation coefficient).

（効果）
上記のように、本実施形態によれば、入力変数と出力変数との相関関係が変化したり劣化したり、入力変数の時系列データに多量のアウトライアが含まれたりするなどの理由で予測精度が十分でない場合に、全体の予測モデルを再同定することなく、合成した予測出力値に影響を与えるペアワイズの予測モデルの部分的な調整や、悪影響を与える部分の削除を容易に行う事ができると同時に、これにより、予測精度劣化の理由を合理的に説明することが可能になる。 (effect)
As described above, according to this embodiment, when prediction accuracy is insufficient due to reasons such as a change or deterioration in the correlation between input variables and output variables, or the inclusion of a large number of outliers in the time-series data of the input variables, it is possible to easily perform partial adjustments of the pairwise prediction models that affect the synthesized predicted output value or delete parts that have a negative impact, without having to re-identify the entire prediction model. At the same time, this makes it possible to rationally explain the reasons for the deterioration in prediction accuracy.

また、入力変数間の多重共線性が極めて強くほぼ同じ説明能力を持つ入力変数が含まれる場合にも、パラメータの可同定(一意決定可能性)を維持することができ、物理法則に矛盾しない合理的な説明が可能な予測モデルを構築できると同時に、ほぼ同じ説明能力を持つ入力変数を統合することで、予測精度を維持しながら、説明性を向上させることができる。
すなわち、本実施形態によれば、予測結果を合理的に説明するとともに、容易に予測結果を調整可能とするモジュラー型時系列データ予測装置、モジュラー型時系列データ予測方法、および、プログラムを提供することができる。 Furthermore, even when there is extremely strong multicollinearity between input variables and the input variables have roughly the same explanatory power, it is possible to maintain the identifiability of the parameters (unique decidability), making it possible to construct a predictive model that allows for rational explanations that do not contradict the laws of physics.At the same time, by integrating input variables that have roughly the same explanatory power, it is possible to improve interpretability while maintaining prediction accuracy.
That is, according to the present embodiment, it is possible to provide a modular time-series data prediction device, a modular time-series data prediction method, and a program that can rationally explain prediction results and easily adjust prediction results.

本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。
（付記１）
複数のプロセス変数を所定の周期で計測する複数のプロセスセンサを有するシステム又はプロセスに適用される装置であって、
複数の前記プロセス変数の時系列データを所定の周期で収集し保存するとともに、複数の前記プロセス変数の中から予測対象となる少なくとも一つの出力変数を選択する出力変数データ選択部と、複数の前記プロセス変数の中から複数の入力変数の候補を選択する入力変数データ選択部と、を含むデータ収集保存部と、
前記時系列データから抽出された前記出力変数と複数の前記入力変数との同定用データを用いて、１入力１出力のペア毎にペアワイズ予測モデルのパラメータを同定して複数の前記ペアワイズ予測モデルを定義するペアワイズ予測モデル同定部と、
複数の前記ペアワイズ予測モデルから出力されるペアワイズ予測値の合成法を定義する予測モデル合成法定義部と、
時間の進行方向に所定の周期又はリアルタイムで前記時系列データから抽出された複数の前記入力変数の予測用データを、複数の前記ペアワイズ予測モデルに入力して、複数の前記入力変数のそれぞれに対応する前記ペアワイズ予測値を演算するペアワイズ出力変数予測部と、
前記合成法により複数の前記ペアワイズ予測値を合成して前記出力変数の予測値を演算する合成出力変数予測部と、
を有するモジュラー型時系列データ予測装置。
（付記２）
前記ペアワイズ予測モデル同定部は、前記出力変数以外の複数の前記入力変数の同定用データに有限インパルス応答モデルを適用し、前記出力変数に自己回帰モデルを適用する、付記１記載のモジュラー型時系列データ予測装置。
（付記３）
前記ペアワイズ予測モデル同定部は、複数の前記入力変数の各々ついて１又は複数の遅れ時間を組み込んだ単回帰モデルを用いて、１入力１出力の伝達関数モデルを構築し、前記単回帰モデルを合成したモデルを前記ペアワイズ予測モデルとする、付記１記載のモジュラー型時系列データ予測装置。
（付記４）
前記ペアワイズ予測モデル同定部は、複数の前記入力変数各々の同定用データに対して非線形変換を行い、前記ペアワイズ予測モデルを構築する、付記１記載のモジュラー型時系列データ予測装置。
（付記５）
前記予測モデル合成法定義部は、複数の前記ペアワイズ予測値の重み付き平均を演算することを前記合成法とし、前記合成法における重みを、前記ペアワイズ予測モデル同定部によって同定された複数の前記ペアワイズ予測モデル各々の予測精度を表す指標に基づいて決定する、付記１記載のモジュラー型時系列データ予測装置。
（付記６）
前記予測モデル合成法定義部は、複数の前記ペアワイズ予測値の重み付き平均を演算することを前記合成法とし、前記ペアワイズ予測モデル同定部で用いた同定用データと同じデータを用いて、前記合成法における重みの値を同定して決定する、付記１記載のモジュラー型時系列データ予測装置。
（付記７）
前記ペアワイズ出力変数予測部から出力される複数の前記ペアワイズ予測値と、前記合成出力変数予測部から出力される前記出力変数の予測値とに対して、予め指定した所定の周期毎あるいは所定期間で、前記時系列データから抽出した実績値との誤差を評価する予測誤差評価部と、
複数の前記ペアワイズ予測値の誤差と、前記出力変数の予測値の誤差とを比較することにより、調整すべき前記ペアワイズ予測モデル、統合すべき前記ペアワイズ予測モデル、および、分離すべき前記ペアワイズ予測モデルを決定して、前記ペアワイズ出力変数予測部で用いられる複数の前記ペアワイズ予測モデルを修正するペアワイズ予測モデル修正部と、
を更に有する付記１記載のモジュラー型時系列データ予測装置。
（付記８）
前記ペアワイズ予測モデル修正部は、前記予測誤差評価部によって評価された複数の誤差の少なくとも一つが所定の許容誤差を超過したときに、当該誤差に対応する前記入力変数の候補に関する前記ペアワイズ予測モデルの再同定を行い、前記ペアワイズ出力変数予測部で用いられる複数の前記ペアワイズ予測モデルを修正する付記７記載のモジュラー型時系列データ予測装置。
（付記９）
前記ペアワイズ予測モデル修正部は、前記予測誤差評価部によって評価された複数の前記誤差の少なくとも一つが所定の許容誤差を超過したときに、当該誤差に対応する前記入力変数に関する前記ペアワイズ予測モデルを削除して、前記ペアワイズ出力変数予測部で用いられる複数の前記ペアワイズ予測モデルを修正する付記７記載のモジュラー型時系列データ予測装置。
（付記１０）
前記ペアワイズ予測モデル修正部は、前記予測誤差評価部によって評価された複数の前記誤差を比較し、複数の前記誤差の差が所定の値以下であるときに、当該誤差に対応する前記入力変数の和あるいは平均値を一つの新たな入力変数として定義し、前記ペアワイズ出力変数予測部で当該新たな入力変数に関する前記ペアワイズ予測モデルが用いられるように、複数の前記ペアワイズ予測モデルを修正する付記７記載のモジュラー型時系列データ予測装置。
（付記１１）
複数のプロセス変数を所定の周期で計測する複数のプロセスセンサを有するシステム又はプロセスに適用される方法であって、
複数の前記プロセス変数の時系列データを所定の周期で収集して保存し、
複数の前記プロセス変数の中から予測対象となる少なくとも一つの出力変数を選択し、
複数の前記プロセス変数の中から複数の入力変数の候補を選択し、
前記時系列データから抽出された前記出力変数と複数の前記入力変数との同定用データを用いて、１入力１出力のペア毎にペアワイズ予測モデルのパラメータを同定して複数の前記ペアワイズ予測モデルを定義し、
複数の前記ペアワイズ予測モデルから出力されるペアワイズ予測値の合成法を定義し、
時間の進行方向に所定の周期又はリアルタイムで前記時系列データから抽出された複数の前記入力変数の予測用データを、複数の前記ペアワイズ予測モデルに入力して、複数の前記入力変数のそれぞれに対応する前記ペアワイズ予測値を演算し、
前記合成法により複数の前記ペアワイズ予測値を合成して前記出力変数の予測値を演算する、
モジュラー型時系列データ予測方法。
（付記１２）
コンピュータに、付記１１に記載の方法を実行させるモジュラー型時系列データ予測プログラム。 Although several embodiments of the present invention have been described, these embodiments are presented as examples and are not intended to limit the scope of the invention. These novel embodiments can be embodied in various other forms, and various omissions, substitutions, and modifications can be made without departing from the spirit of the invention. These embodiments and their modifications are included within the scope and spirit of the invention, and are also included in the scope of the invention and its equivalents as defined in the claims.
(Appendix 1)
An apparatus applied to a system or process having a plurality of process sensors that measure a plurality of process variables at a predetermined cycle,
a data collection and storage unit that includes an output variable data selection unit that collects and stores time series data of the plurality of process variables at a predetermined cycle, and selects at least one output variable to be predicted from the plurality of process variables, and an input variable data selection unit that selects a plurality of input variable candidates from the plurality of process variables;
a pairwise prediction model identification unit that identifies parameters of a pairwise prediction model for each one-input, one-output pair using identification data of the output variable and the plurality of input variables extracted from the time-series data, thereby defining the plurality of pairwise prediction models;
a prediction model synthesis method definition unit that defines a synthesis method for pairwise prediction values output from the plurality of pairwise prediction models;
a pairwise output variable prediction unit that inputs prediction data for the plurality of input variables extracted from the time-series data in a predetermined cycle in the time progression direction or in real time into the plurality of pairwise prediction models, and calculates the pairwise predicted values corresponding to each of the plurality of input variables;
a composite output variable prediction unit that calculates a predicted value of the output variable by combining a plurality of the pairwise predicted values by the combination method;
A modular time series data forecasting device having the above.
(Appendix 2)
2. The modular time series data prediction device according to claim 1, wherein the pairwise prediction model identification unit applies a finite impulse response model to identification data of the plurality of input variables other than the output variable, and applies an autoregressive model to the output variable.
(Appendix 3)
the pairwise prediction model identification unit constructs a one-input, one-output transfer function model using a simple regression model incorporating one or more delay times for each of the plurality of input variables, and defines a model obtained by combining the simple regression models as the pairwise prediction model.
(Appendix 4)
2. The modular time-series data prediction device according to claim 1, wherein the pairwise prediction model identification unit performs nonlinear transformation on identification data for each of the plurality of input variables to construct the pairwise prediction model.
(Appendix 5)
the prediction model synthesis method definition unit defines, as the synthesis method, a weighted average of the plurality of pairwise prediction values, and determines weights in the synthesis method based on an index representing the prediction accuracy of each of the plurality of pairwise prediction models identified by the pairwise prediction model identification unit.
(Appendix 6)
2. The modular time-series data prediction device according to claim 1, wherein the prediction model synthesis method definition unit defines the synthesis method as calculating a weighted average of the pairwise prediction values, and identifies and determines weight values in the synthesis method using the same data as identification data used by the pairwise prediction model identification unit.
(Appendix 7)
a prediction error evaluation unit that evaluates an error between the plurality of pairwise predicted values output from the pairwise output variable prediction unit and the predicted value of the output variable output from the composite output variable prediction unit and an actual value extracted from the time series data at a predetermined cycle or for a predetermined period;
a pairwise prediction model correction unit that corrects the pairwise prediction models used in the pairwise output variable prediction unit by comparing errors in the pairwise prediction values with errors in the predicted values of the output variables to determine the pairwise prediction models to be adjusted, the pairwise prediction models to be integrated, and the pairwise prediction models to be separated;
2. The modular time series data prediction device according to claim 1, further comprising:
(Appendix 8)
and wherein, when at least one of the plurality of errors evaluated by the prediction error evaluation unit exceeds a predetermined allowable error, the pairwise prediction model correction unit re-identifies the pairwise prediction model for the candidates of the input variables corresponding to the error, and corrects the plurality of pairwise prediction models used in the pairwise output variable prediction unit.
(Appendix 9)
8. The modular time-series data prediction device according to claim 7, wherein, when at least one of the plurality of errors evaluated by the prediction error evaluation unit exceeds a predetermined allowable error, the pairwise prediction model correction unit deletes the pairwise prediction model for the input variable corresponding to the error, and corrects the plurality of pairwise prediction models used in the pairwise output variable prediction unit.
(Appendix 10)
the pairwise prediction model correction unit compares the errors evaluated by the prediction error evaluation unit, and, when a difference between the errors is equal to or less than a predetermined value, defines a sum or average of the input variables corresponding to the errors as a new input variable, and corrects the pairwise prediction models so that the pairwise output variable prediction unit uses the pairwise prediction model related to the new input variable.
(Appendix 11)
1. A method for application to a system or process having a plurality of process sensors measuring a plurality of process variables at a predetermined period, comprising:
collecting and storing time series data of the plurality of process variables at a predetermined cycle;
selecting at least one output variable to be predicted from the plurality of process variables;
selecting a plurality of candidate input variables from among the plurality of process variables;
identifying parameters of a pairwise prediction model for each one-input, one-output pair using identification data of the output variable and the plurality of input variables extracted from the time-series data, thereby defining a plurality of the pairwise prediction models;
defining a method for combining pairwise prediction values output from a plurality of said pairwise prediction models;
inputting prediction data of the plurality of input variables extracted from the time-series data in a predetermined cycle or in real time in the time progression direction into the plurality of pairwise prediction models, and calculating the pairwise predicted values corresponding to each of the plurality of input variables;
a combination of the pairwise predicted values by the combination method to calculate a predicted value of the output variable;
Modular time series data forecasting method.
(Appendix 12)
A modular time series data forecasting program that causes a computer to execute the method described in Appendix 11.

１…都市雨水排水プロセス、１１…流入量計、１２…幹線流量計、１３１－１３Ｋ…幹線水位計、１４１～１４Ｍ…地上雨量計、１５、１５１１～１５ＱＰ…レーダ雨量計、１７…流入渠、１１０…流入ゲート、１８…雨水ポンプ井、１９…雨水ポンプ、１６１、１６２…雨水ポンプ井水位計、２…データ収集保存部、２１…出力変数データ選択部、２２…入力変数データ選択部、３…データ抽出部、３１…オフライン予測モデル同定用データ抽出部、３２…オンライン予測用データ抽出部、３３…評価用データ抽出部、４…ペアワイズ予測モデル同定部、５…予測モデル合成法定義部、６…ペアワイズ出力変数予測部、７…合成出力変数予測部、８…予測誤差評価部、９…ペアワイズ予測モデル修正部、１０…出力予測結果観測部、１１１－１１ｍ…時間遅れブロック、１２１－１２ｍ…単回帰ブロック、１３…相関重み付き平均ブロック、Ａ１－ＡＮ…回帰ブロック、Ａ５…合成ブロック、Ｂ１－ＢＮ…回帰ブロック、Ｂ５…合成ブロック、Ｃ１－ＣＮ…予測出力ブロック、Ｃ（Ｎ＋１）…合成ブロック。 1...Urban stormwater drainage process, 11...Inflow meter, 12...Main flow meter, 131-13K...Main water level gauge, 141-14M...Ground rain gauge, 15, 1511-15QP...Radar rain gauge, 17...Inflow culvert, 110...Inflow gate, 18...Rainwater pump well, 19...Rainwater pump, 161, 162...Rainwater pump well water level gauge, 2...Data collection and storage unit, 21...Output variable data selection unit, 22...Input variable data selection unit, 3...Data extraction unit, 31...Data extraction unit for offline prediction model identification, 32...Data extraction unit for online prediction, 33...For evaluation Data extraction unit, 4...pairwise prediction model identification unit, 5...prediction model synthesis method definition unit, 6...pairwise output variable prediction unit, 7...synthetic output variable prediction unit, 8...prediction error evaluation unit, 9...pairwise prediction model correction unit, 10...output prediction result observation unit, 111-11m...time delay block, 121-12m...simple regression block, 13...correlation weighted average block, A1-AN...regression block, A5...synthesis block, B1-BN...regression block, B5...synthesis block, C1-CN...prediction output block, C(N+1)...synthesis block.

Claims

An apparatus applied to a system or process having a plurality of process sensors that measure a plurality of process variables at a predetermined cycle,
a data collection and storage unit that includes an output variable data selection unit that collects and stores time series data of the plurality of process variables at a predetermined cycle, and selects at least one output variable to be predicted from the plurality of process variables, and an input variable data selection unit that selects a plurality of input variables from the plurality of process variables;
a pairwise prediction model identification unit that identifies parameters of a pairwise prediction model for each one-input, one-output pair using identification data of the output variable and the plurality of input variables extracted from the time-series data, thereby defining the plurality of pairwise prediction models;
a prediction model synthesis method definition unit that defines a synthesis method for pairwise prediction values output from the plurality of pairwise prediction models;
a pairwise output variable prediction unit that inputs prediction data for the plurality of input variables extracted from the time-series data in a predetermined cycle in the time progression direction or in real time into the plurality of pairwise prediction models, and calculates the pairwise predicted values corresponding to each of the plurality of input variables;
a composite output variable prediction unit that calculates a predicted value of the output variable by combining a plurality of the pairwise predicted values by the combination method;
A modular time series data forecasting device having the above.

The modular time series data prediction device of claim 1, wherein the pairwise prediction model identification unit applies a finite impulse response model to identification data for the plurality of input variables other than the output variable, and applies an autoregressive model to the output variable.

2. The modular time series data prediction device according to claim 1, wherein the pairwise prediction model identification unit constructs a one-input, one-output transfer function model using a simple regression model incorporating one or more delay times for each of the plurality of input variables, and defines a model obtained by combining the simple regression models as the pairwise prediction model.

The modular time series data prediction device of claim 1, wherein the pairwise prediction model identification unit performs nonlinear transformation on the identification data for each of the multiple input variables to construct the pairwise prediction model.

The modular time series data prediction device of claim 1, wherein the prediction model synthesis method definition unit defines the synthesis method as calculating a weighted average of the plurality of pairwise prediction values, and determines the weights in the synthesis method based on an index representing the prediction accuracy of each of the plurality of pairwise prediction models identified by the pairwise prediction model identification unit.

The modular time series data prediction device of claim 1, wherein the prediction model synthesis method definition unit defines the synthesis method as calculating a weighted average of the pairwise prediction values, and identifies and determines the weight values in the synthesis method using the same identification data as used by the pairwise prediction model identification unit.

a prediction error evaluation unit that evaluates an error between the plurality of pairwise predicted values output from the pairwise output variable prediction unit and the predicted value of the output variable output from the composite output variable prediction unit and an actual value extracted from the time series data at a predetermined cycle or for a predetermined period;
a pairwise prediction model correction unit that corrects the pairwise prediction models used in the pairwise output variable prediction unit by comparing errors in the pairwise prediction values with errors in the predicted values of the output variables to determine the pairwise prediction models to be adjusted, the pairwise prediction models to be integrated, and the pairwise prediction models to be separated;
2. The modular time series data forecasting device according to claim 1, further comprising:

8. The modular time-series data prediction device according to claim 7, wherein, when at least one of the plurality of errors evaluated by the prediction error evaluation unit exceeds a predetermined allowable error, the pairwise prediction model correction unit re-identifies the pairwise prediction model for the input variable corresponding to the error, and corrects the plurality of pairwise prediction models used in the pairwise output variable prediction unit.

A modular time series data prediction device as described in claim 7, wherein, when at least one of the multiple errors evaluated by the prediction error evaluation unit exceeds a predetermined allowable error, the pairwise prediction model correction unit deletes the pairwise prediction model for the input variable corresponding to that error, and corrects the multiple pairwise prediction models used in the pairwise output variable prediction unit.

A modular time series data prediction device as described in claim 7, wherein the pairwise prediction model correction unit compares the multiple errors evaluated by the prediction error evaluation unit, and when the difference between the multiple errors is equal to or less than a predetermined value, defines the sum or average of the input variables corresponding to the error as a new input variable, and corrects the multiple pairwise prediction models so that the pairwise output variable prediction unit uses the pairwise prediction model for the new input variable.

1. A method for application to a system or process having a plurality of process sensors measuring a plurality of process variables at a predetermined period, comprising:
The processor:
collecting and storing time series data of the plurality of process variables at a predetermined cycle;
selecting at least one output variable to be predicted from the plurality of process variables;
selecting a plurality of input variables from among said plurality of process variables;
identifying parameters of a pairwise prediction model for each one-input, one-output pair using identification data of the output variable and the plurality of input variables extracted from the time-series data, thereby defining a plurality of the pairwise prediction models;
defining a method for combining pairwise prediction values output from a plurality of said pairwise prediction models;
inputting prediction data of the plurality of input variables extracted from the time-series data in a predetermined cycle or in real time in the time progression direction into the plurality of pairwise prediction models, and calculating the pairwise predicted values corresponding to each of the plurality of input variables;
a combination of the pairwise predicted values by the combination method to calculate a predicted value of the output variable;
Modular time series data forecasting method.

A modular time-series data forecasting program that causes a computer to execute the method of claim 11.