JP6917004B2

JP6917004B2 - Evaluation device, evaluation method and its program

Info

Publication number: JP6917004B2
Application number: JP2017083501A
Authority: JP
Inventors: 安藤　丹一; 丹一安藤
Original assignee: Omron Corp
Current assignee: Omron Corp
Priority date: 2017-04-20
Filing date: 2017-04-20
Publication date: 2021-08-11
Anticipated expiration: 2037-04-20
Also published as: JP2018181184A; WO2018193934A1

Description

本発明は、評価装置、評価方法及びそのプログラムに関する。 The present invention relates to an evaluation device, an evaluation method, and a program thereof.

従来より、ニューラルネットワークの技術を用いてシステムの制御を行うことが知られている。例えば、特許文献１には、ニューラルネットワークを用いて、複数のかごの中から乗り場へ移動させるべき最適なかごを選択するエレベータシステムが開示されている。 Conventionally, it has been known to control a system by using a neural network technique. For example, Patent Document 1 discloses an elevator system that uses a neural network to select an optimum car to be moved from a plurality of cars to a landing.

また、引用文献２には、ニューラルネットワークを用いたパターン認識装置において誤認識を低減するために、ニューラルネットワークに対して追加学習を行う方法が開示されている。 Further, Cited Document 2 discloses a method of performing additional learning on a neural network in order to reduce erroneous recognition in a pattern recognition device using a neural network.

特開２００５−２５５２８９号公報Japanese Unexamined Patent Publication No. 2005-255289 特開平９−６２６４８号公報Japanese Unexamined Patent Publication No. 9-62648

ニューラルネットワークの技術を用いた制御システムでは、制御性能を向上等させるためにニューラルネットワークの追加学習を行うことがある。例えば、最初の学習時には、図１１の時刻１から時刻２の期間に得られたデータからランダムに選択した学習データＡを用い、その後、図１１の時刻２から時刻３の期間に得られたデータからランダムに選択した学習データＢを用いて追加学習を行う場合が考えられる。 In a control system using neural network technology, additional learning of the neural network may be performed in order to improve control performance and the like. For example, at the time of the first learning, the learning data A randomly selected from the data obtained in the period from time 1 to time 2 in FIG. 11 is used, and then the data obtained in the period from time 2 to time 3 in FIG. 11 is used. It is conceivable that additional learning is performed using the learning data B randomly selected from.

このような場合に、学習データの選択をランダムに行っていることに起因して、学習データＢが不適切な学習データを含む場合がある。このように不適切な学習データを含む追加学習データＢを用いて追加学習を行った場合、追加学習後のニューラルネットワークの制御性能は、追加学習以前のニューラルネットワークと比較して性能が劣化することがある。 In such a case, the learning data B may include inappropriate learning data due to the random selection of the learning data. When additional learning is performed using the additional learning data B including inappropriate learning data in this way, the control performance of the neural network after the additional learning deteriorates as compared with the neural network before the additional learning. There is.

また、時刻２から時刻３の期間において特殊なイベントが発生していた場合は、追加学習後のニューラルネットワークが、特殊なイベントに特化したものとなる可能性がある。その結果、追加学習後のニューラルネットワークの制御性能は、追加学習以前のニューラルネットワークと比較して性能が劣化してしまう恐れがある。 Further, when a special event occurs in the period from time 2 to time 3, the neural network after the additional learning may be specialized for the special event. As a result, the control performance of the neural network after the additional learning may be deteriorated as compared with the neural network before the additional learning.

そこで、本発明は、追加学習後に学習モジュールの制御性能が劣化した場合に対処し得る評価装置、評価方法及びそのプログラムを提供することを目的とする。 Therefore, an object of the present invention is to provide an evaluation device, an evaluation method, and a program thereof that can deal with a case where the control performance of the learning module deteriorates after additional learning.

本発明の一態様に係る第１の学習モジュールを追加学習させた第２の学習モジュールを評価する評価装置は、第２の学習モジュールが達成すべき学習目標を受け付ける学習目標受付部と、少なくとも学習目標に含まれる評価項目について第２の学習モジュールの評価を行い、評価データを生成する評価部と、学習目標と評価データとを用いて、第２の学習モジュールが学習目標を達成したか否か判定する判定部と、学習目標を達成しないと判定される場合、少なくとも学習目標に基づいて、第２の学習モジュールと異なる第３の学習モジュールを取得する学習モジュール選択部とを備える。 The evaluation device for evaluating the second learning module obtained by additionally learning the first learning module according to one aspect of the present invention includes a learning goal receiving unit that receives a learning goal to be achieved by the second learning module, and at least learning. Whether or not the second learning module has achieved the learning goal by using the evaluation unit that evaluates the evaluation items included in the goal and generates the evaluation data and the learning goal and the evaluation data. It includes a determination unit for determining, and a learning module selection unit for acquiring a third learning module different from the second learning module, at least based on the learning objective, when it is determined that the learning target is not achieved.

この態様によれば、追加学習後に学習モジュールの制御性能が学習目標を達成しなかった場合に、学習目標を達成しなかった学習モジュールを使用し続けることを回避することができるので、学習モジュールを用いるシステムの信頼性を高めることができる。例えば、学習目標を達成しなかった学習モジュールの使用を回避することで、システム全体の処理精度の低下を防ぐことができる。また、学習目標を達成しない場合に、少なくとも学習目標に基づいて、学習目標を達成しなかった学習モジュールと異なる学習モジュールを取得する構成により、学習モジュールを用いて制御を行う制御装置、評価を行う評価装置のいずれも複数の学習モジュールを保持する必要がないので、学習モジュールを記録するためのハードウェア資源を最小限にすることができる。 According to this aspect, if the control performance of the learning module does not achieve the learning goal after the additional learning, it is possible to avoid continuing to use the learning module that did not achieve the learning goal. The reliability of the system used can be improved. For example, by avoiding the use of a learning module that does not achieve the learning goal, it is possible to prevent a decrease in the processing accuracy of the entire system. In addition, when the learning goal is not achieved, at least based on the learning goal, a control device that controls using the learning module and an evaluation are performed by a configuration in which a learning module different from the learning module that did not achieve the learning goal is acquired. Since it is not necessary to hold a plurality of learning modules in any of the evaluation devices, the hardware resource for recording the learning modules can be minimized.

上記評価装置において、第３の学習モジュールは、第１の学習モジュールとしてもよい。この態様によれば、性能劣化前の学習モジュールを用いてシステムの制御を続行することができるので、学習モジュールを用いるシステムの安定性の低下を防ぐことができる。 In the evaluation device, the third learning module may be the first learning module. According to this aspect, since the control of the system can be continued by using the learning module before the performance deterioration, it is possible to prevent the stability of the system using the learning module from being lowered.

上記評価装置において、学習モジュールが学習目標を達成することができなかった要因を推定して、学習モジュールの追加学習に用いる学習データに関する要因改善データを生成する要因推定部をさらに備えてもよい。この態様によれば、性能劣化した学習モジュールから有益な情報を得ることができる。このような情報を活用することにより、学習モジュールの精度向上に必要となる学習データや処理を低減することができ、ＣＰＵの負荷を低減することができる。 The evaluation device may further include a factor estimation unit that estimates the factors that the learning module could not achieve the learning target and generates factor improvement data related to the learning data used for the additional learning of the learning module. According to this aspect, useful information can be obtained from the learning module whose performance has deteriorated. By utilizing such information, it is possible to reduce the learning data and processing required for improving the accuracy of the learning module, and it is possible to reduce the load on the CPU.

上記評価装置において、学習モジュール選択部は、要因改善データに基づいて学習指示を生成してもよい。この態様によれば、性能劣化した学習モジュールが有する問題を克服し得る、効率的な追加学習を学習モジュールにさせることができる。この効率的な追加学習をさせた学習モジュールをシステムに用いることで、最短時間で、学習モジュールの更新による性能劣化を改善することができる。 In the evaluation device, the learning module selection unit may generate a learning instruction based on the factor improvement data. According to this aspect, it is possible to make the learning module perform efficient additional learning that can overcome the problems of the learning module whose performance has deteriorated. By using the learning module with this efficient additional learning in the system, it is possible to improve the performance deterioration due to the update of the learning module in the shortest time.

上記評価装置において、要因推定部は、推定した要因を出力し、推定した要因に関するユーザ入力を受信して、ユーザ入力に基づいて要因改善データを生成してもよい。この態様によれば、より信頼性の高い情報に基づいて、性能劣化要因に関する情報を生成することができる。 In the evaluation device, the factor estimation unit may output the estimated factor, receive the user input regarding the estimated factor, and generate the factor improvement data based on the user input. According to this aspect, it is possible to generate information on performance deterioration factors based on more reliable information.

本発明の他の態様に係る、第１の学習モジュールを追加学習させた第２の学習モジュールを評価する評価方法は、第２の学習モジュールが達成すべき学習目標を受け付ける工程と、少なくとも学習目標に含まれる評価項目について第２の学習モジュールの評価を行い、評価データを生成する工程と、学習目標と評価データとを用いて、第２の学習モジュールが学習目標を達成したか否か判定する工程と、学習目標を達成しないと判定される場合、少なくとも学習目標に基づいて、第２の学習モジュールと異なる第３の学習モジュールを取得する工程とを含む。 The evaluation method for evaluating the second learning module obtained by additionally learning the first learning module according to another aspect of the present invention includes a step of accepting a learning goal to be achieved by the second learning module and at least a learning goal. The evaluation item included in is evaluated by the second learning module, and the process of generating the evaluation data and the learning target and the evaluation data are used to determine whether or not the second learning module has achieved the learning target. It includes a step and a step of acquiring a third learning module different from the second learning module, at least based on the learning goal, when it is determined that the learning goal is not achieved.

本発明の他の態様に係るプログラムは、コンピュータに、第１の学習モジュールを追加学習させた第２の学習モジュールを評価させるためのプログラムであって、第２の学習モジュールが達成すべき学習目標を受け付ける処理と、少なくとも学習目標に含まれる評価項目について第２の学習モジュールの評価を行い、評価データを生成する処理と、学習目標と評価データとを用いて、第２の学習モジュールが学習目標を達成したか否か判定する処理と、学習目標を達成しないと判定される場合、第２の学習モジュールと異なる第３の学習モジュールを取得する処理とを実行させる。 The program according to another aspect of the present invention is a program for causing a computer to evaluate a second learning module obtained by additionally learning a first learning module, and is a learning goal to be achieved by the second learning module. The second learning module evaluates at least the evaluation items included in the learning goal and generates the evaluation data, and the second learning module uses the learning goal and the evaluation data. When it is determined that the learning target is not achieved, the process of determining whether or not the above is achieved and the process of acquiring a third learning module different from the second learning module are executed.

本発明によれば、追加学習後に学習モジュールの制御性能が性能劣化した場合に対処し得る評価装置、評価方法及びそのプログラムを提供することができる。 According to the present invention, it is possible to provide an evaluation device, an evaluation method, and a program thereof that can deal with a case where the control performance of a learning module deteriorates after additional learning.

本発明の実施形態に係る学習システムのネットワーク構成を示す図である。It is a figure which shows the network configuration of the learning system which concerns on embodiment of this invention. 本発明の実施形態に係る学習装置の物理的構成を示す図である。It is a figure which shows the physical structure of the learning apparatus which concerns on embodiment of this invention. 本発明の実施形態に係る追加学習制御装置の物理的構成を示す図である。It is a figure which shows the physical structure of the additional learning control apparatus which concerns on embodiment of this invention. 本発明の実施形態に係る学習装置の機能ブロック図である。It is a functional block diagram of the learning apparatus which concerns on embodiment of this invention. 本発明の実施形態に係る追加学習制御装置の機能ブロック図である。It is a functional block diagram of the additional learning control device which concerns on embodiment of this invention. 本発明の実施形態に係る追加学習制御装置によって実行される追加学習処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the additional learning process executed by the additional learning control device which concerns on embodiment of this invention. 本発明の実施形態に係る追加学習制御装置によって実行される追加学習処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the additional learning process executed by the additional learning control device which concerns on embodiment of this invention. 本発明の実施形態に係る追加学習制御装置によって出力される画面の一例である。This is an example of a screen output by the additional learning control device according to the embodiment of the present invention. 本発明の別の実施形態に係る学習システムのネットワーク構成を示す図である。It is a figure which shows the network configuration of the learning system which concerns on another embodiment of this invention. 本発明の実施形態に係る評価装置の物理的構成を示す図である。It is a figure which shows the physical structure of the evaluation apparatus which concerns on embodiment of this invention. 学習データを選択する際の時系列を示す図である。It is a figure which shows the time series at the time of selecting a training data.

添付図面を参照して、本発明の実施形態について説明する。なお、以下の実施形態は、本発明の理解を容易にするためのものであり、本発明を限定して解釈するためのものではない。また、本発明は、その要旨を逸脱しない限り、さまざまな変形が可能である。さらに、当業者であれば、以下に述べる各要素を均等なものに置換した実施形態を採用することが可能であり、係る実施形態も本発明の範囲に含まれる。 Embodiments of the present invention will be described with reference to the accompanying drawings. It should be noted that the following embodiments are for facilitating the understanding of the present invention, and are not for limiting and interpreting the present invention. Further, the present invention can be modified in various ways as long as it does not deviate from the gist thereof. Further, those skilled in the art can adopt an embodiment in which each element described below is replaced with an equal one, and such an embodiment is also included in the scope of the present invention.

（ネットワーク構成）
図１を参照して、本発明の所定の実施形態に係る学習システム１のネットワーク構成について説明する。学習システム１は、学習装置１０、追加学習制御装置２０、１又は複数のセンサ３０及び記憶装置４０を備える。学習装置１０は、通信ネットワークＮを介して、追加学習制御装置２０、１又は複数のセンサ３０及び記憶装置４０に接続される。通信ネットワークＮは、有線又は無線回線により構成された有線通信網及び無線通信網のいずれであってもよく、インターネットやＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ（ＬＡＮ）であってよい。 (Network configuration)
The network configuration of the learning system 1 according to the predetermined embodiment of the present invention will be described with reference to FIG. The learning system 1 includes a learning device 10, an additional learning control device 20, one or a plurality of sensors 30, and a storage device 40. The learning device 10 is connected to the additional learning control device 20, one or a plurality of sensors 30 and the storage device 40 via the communication network N. The communication network N may be either a wired communication network or a wireless communication network configured by a wired or wireless line, and may be the Internet or a Local Area Network (LAN).

学習装置１０は、記憶装置４０に記憶された学習データに基づいて、学習モジュールの学習を行い、学習済モジュールを記憶装置４０に記憶する。本実施形態に係る学習装置１０は、学習モジュールを備えるが、学習モジュールは、学習装置１０と別体の装置に備えられてもよい。 The learning device 10 learns the learning module based on the learning data stored in the storage device 40, and stores the learned module in the storage device 40. The learning device 10 according to the present embodiment includes a learning module, but the learning module may be provided in a device separate from the learning device 10.

なお、学習モジュールとは、学習能力を備えた専用若しくは汎用のハードウェア若しくはソフトウェアの一単位、又は、当該ハードウェア若しくはソフトウェアの一単位の組合せを含む。当該学習を行う学習モジュールには、学習データによりすでに何らかの学習を行っているものもあれば、学習前のものも含む。ここで、学習能力とは、あるタスクの処理能力を、学習データから得られる経験に基づいて向上させることのできる能力をいう。 The learning module includes a unit of dedicated or general-purpose hardware or software having learning ability, or a combination of one unit of the hardware or software. The learning module that performs the learning includes those that have already learned something from the learning data and those that have not been learned. Here, the learning ability means an ability that can improve the processing ability of a certain task based on the experience obtained from the learning data.

追加学習制御装置２０は、学習済モジュールを用いて、入力データの特徴に応じた出力データを出力する。本実施形態に係る追加学習制御装置２０は、学習済モジュール又は当該学習済モジュールの複製物を学習装置１０から取得して、学習モジュールとして設定する。追加学習制御装置２０は、設定された学習モジュールを用いて出力した出力データに対する評価を行い、評価データを出力することができる。追加学習制御装置２０は、設定された学習モジュールが達成すべき学習目標と評価データとを用いて、現在設定されている学習モジュールが学習目標を達成したか否か判定することができる。学習目標を達成していないと判定される場合、追加学習制御装置２０は、例えば以前設定されていた学習モジュールを学習装置１０から取得して、学習モジュールとして設定してもよい。なお、追加学習制御装置２０は、後述する評価装置６０の機能構成を備えており、評価装置６０を実質的に含むものである。 The additional learning control device 20 uses the learned module to output output data according to the characteristics of the input data. The additional learning control device 20 according to the present embodiment acquires a learned module or a duplicate of the learned module from the learning device 10 and sets it as a learning module. The additional learning control device 20 can evaluate the output data output using the set learning module and output the evaluation data. The additional learning control device 20 can determine whether or not the currently set learning module has achieved the learning goal by using the learning goal and the evaluation data to be achieved by the set learning module. When it is determined that the learning target has not been achieved, the additional learning control device 20 may acquire, for example, a previously set learning module from the learning device 10 and set it as a learning module. The additional learning control device 20 has a functional configuration of the evaluation device 60 described later, and substantially includes the evaluation device 60.

なお、学習済モジュールの複製物とは、学習済モジュールの機能を再現することができる専用若しくは汎用のハードウェア若しくはソフトウェアの一単位、又は、当該ハードウェア若しくはソフトウェアの一単位の組合せを含む。 The duplicate of the learned module includes one unit of dedicated or general-purpose hardware or software capable of reproducing the function of the learned module, or a combination of one unit of the hardware or software.

学習済モジュールの複製物は、必ずしも学習能力を備えていなくてもよい。また、学習済モジュールの構成と、学習済モジュールの複製物の構成は、必ずしも一致していなくてもよい。また、学習済モジュールの複製物は、いわゆる蒸留によって得られる学習モジュールを含む。すなわち、学習済モジュールの複製物は、学習済モジュールの機能を保つように、学習済モジュールと構造が異なる他の学習モジュールを学習させることで得られる、学習済みの当該他の学習モジュールを含む。 A copy of the trained module does not necessarily have the ability to learn. Further, the configuration of the trained module and the configuration of the duplicate of the trained module do not necessarily have to match. A replica of the trained module also includes a learning module obtained by so-called distillation. That is, the duplicate of the trained module includes the other trained learning module obtained by training another learning module having a structure different from that of the trained module so as to maintain the function of the trained module.

ここで、当該他の学習モジュールは、学習済モジュールよりも構造が単純であってよく、よりデプロイに適したものであってよいし、当該他の学習モジュールの学習には、学習済モジュールの出力データを用いてよい。なお、学習済モジュールの複製物は、学習モジュールの学習過程において、オーバーフィッティングを防ぐ正則化の方法を変えたり、バックプロパゲーションの学習率を変えたり、重み係数の更新アルゴリズムを変えたりして得られる学習済モジュールを含む。 Here, the other learning module may have a simpler structure than the learned module and may be more suitable for deployment, and for learning the other learning module, the output of the learned module may be obtained. Data may be used. In addition, the duplicate of the trained module can be obtained by changing the regularization method to prevent overfitting, changing the learning rate of backpropagation, and changing the weight coefficient update algorithm in the learning process of the learning module. Includes trained modules to be.

また、学習済モジュール又は当該学習済モジュールの複製物を取得するとは、学習済モジュールの機能を追加学習制御装置２０において再現するために必要な情報を取得することをいう。例えば、学習モジュールがニューラルネットワークを含む場合、学習済モジュール又は当該学習済モジュールの複製物を取得するとは、少なくとも、ニューラルネットワークのレイヤ数、各レイヤに関するノード数、ノード間を繋ぐリンクの重みパラメータ、各ノードに関するバイアスパラメータ及び各ノードに関する活性化関数の関数形に関する情報を取得することをいう。 Further, acquiring a learned module or a duplicate of the learned module means acquiring information necessary for reproducing the function of the learned module in the additional learning control device 20. For example, when the training module includes a neural network, acquiring the trained module or a duplicate of the trained module means at least the number of layers of the neural network, the number of nodes for each layer, and the weight parameter of the link connecting the nodes. Acquiring information about the bias parameter for each node and the functional form of the activation function for each node.

センサ３０は、物理量を検出する物理量センサ、化学量を検出する化学量センサ、情報を検出する情報センサのいずれであってもよいが、これらに限られるものではなく、任意のセンサを含み得る。物理量センサは、例えば光を検出して画像データや動画データを出力するカメラや、人の心拍を検出して心拍データを出力する心拍センサ、人の血圧を検出して血圧データを出力する血圧センサ及び人の体温を検出して体温データを出力する体温センサ等のバイタルセンサを含み、その他任意の物理量を検出して電気的信号を出力するセンサを含む。化学量センサは、例えばガスセンサ、湿度センサ、イオンセンサを含み、その他任意の化学量を検出して電気信号を出力するセンサを含む。情報センサは、例えば統計データから特定のパターンを検出するセンサを含み、その他任意の情報を検出するセンサを含む。 The sensor 30 may be any of a physical quantity sensor for detecting a physical quantity, a chemical quantity sensor for detecting a chemical quantity, and an information sensor for detecting information, but the sensor 30 is not limited to these, and may include any sensor. Physical quantity sensors are, for example, cameras that detect light and output image data and video data, heart rate sensors that detect a person's heartbeat and output heart rate data, and blood pressure sensors that detect a person's blood pressure and output blood pressure data. It also includes vital sensors such as a body temperature sensor that detects a person's body temperature and outputs body temperature data, and also includes a sensor that detects an arbitrary physical quantity and outputs an electrical signal. The stoichiometry sensor includes, for example, a gas sensor, a humidity sensor, an ion sensor, and other sensors that detect an arbitrary stoichiometry and output an electric signal. The information sensor includes, for example, a sensor that detects a specific pattern from statistical data, and also includes a sensor that detects arbitrary information.

記憶装置４０は、センサ３０によって出力されたセンシングデータを記憶する。また、記憶装置４０は、学習装置１０によって出力された学習済モジュールを記憶する。図１では、記憶装置４０を単一の記憶部として示しているが、記憶装置４０は、１又は複数のファイルサーバによって構成されてよい。 The storage device 40 stores the sensing data output by the sensor 30. Further, the storage device 40 stores the learned module output by the learning device 10. Although the storage device 40 is shown as a single storage unit in FIG. 1, the storage device 40 may be composed of one or a plurality of file servers.

なお、図１において、学習装置１０、追加学習制御装置２０及び記憶装置４０は、それぞれ別体として構成されているが、これらを一体として構成してもよい。すなわち、学習装置１０、追加学習制御装置２０及び記憶装置４０の全てを一体として構成してもよく、学習装置１０、追加学習制御装置２０及び記憶装置４０のうちの２つを選択的に一体として構成してもよい。このとき、一体として構成された、学習装置１０、追加学習制御装置２０及び記憶装置４０の各要素間は、内部バスを介して接続される。 Although the learning device 10, the additional learning control device 20, and the storage device 40 are configured as separate bodies in FIG. 1, they may be configured as one. That is, the learning device 10, the additional learning control device 20, and the storage device 40 may all be integrally configured, and two of the learning device 10, the additional learning control device 20, and the storage device 40 may be selectively integrated. It may be configured. At this time, the elements of the learning device 10, the additional learning control device 20, and the storage device 40, which are integrally configured, are connected via an internal bus.

（物理的構成：学習装置）
図２を参照して、本発明の所定の実施形態に係る学習装置１０の物理的構成について説明する。学習装置１０は、制御部１０ａと、記憶部１０ｂと、通信部１０ｃと、入力部１０ｄと、表示部１０ｅを有する。これら各構成は、バスを介して相互にデータ送受信可能に接続される。 (Physical configuration: learning device)
The physical configuration of the learning device 10 according to the predetermined embodiment of the present invention will be described with reference to FIG. The learning device 10 includes a control unit 10a, a storage unit 10b, a communication unit 10c, an input unit 10d, and a display unit 10e. Each of these configurations is connected to each other via a bus so that data can be transmitted and received.

制御部１０ａは、ハードウェアプロセッサに相当するＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ（ＣＰＵ）、及びメモリに相当するＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ（ＲＡＭ）を含む。ＣＰＵが記憶部１０ｂに記憶されているプログラムをＲＡＭに展開し、ＲＡＭに展開された当該プログラムを解釈及び実行することにより、制御部１０ａは、後述する図４の各部として機能する。 The control unit 10a includes a Central Processing Unit (CPU) corresponding to a hardware processor and a Random Access Memory (RAM) corresponding to a memory. The CPU expands the program stored in the storage unit 10b into the RAM, interprets and executes the program expanded in the RAM, and the control unit 10a functions as each unit of FIG. 4 to be described later.

なお、ハードウェアプロセッサの種類はＣＰＵに限定されない。例えば、ハードウェアプロセッサとして、ＣＰＵ、ＧｒａｐｈｉｃｓＰｒｏｃｅｓｓｉｎｇＵｎｉｔ（ＧＰＵ）、Ｆｉｅｌｄ−ｐｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ（ＦＰＧＡ）、ＤｉｇｉｔａｌＳｉｇｎａｌＰｒｏｃｅｓｓｏｒ（ＤＳＰ）、ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ（ＡＳＩＣ）を単独で、又は、組合せて使用することができる。ＲＡＭは、データの書き換えが可能な記憶部であり、例えば半導体記憶素子で構成される。ＲＡＭは、ＣＰＵが実行するアプリケーション等のプログラムやデータを一時的に記憶する。 The type of hardware processor is not limited to the CPU. For example, as a hardware processor, CPU, Graphics Processing Unit (GPU), Field-programmable Gate Array (FPGA), Digital Signal Processor (DSP), Application Specic Can be done. The RAM is a storage unit capable of rewriting data, and is composed of, for example, a semiconductor storage element. The RAM temporarily stores programs and data such as applications executed by the CPU.

記憶部１０ｂは、例えばＨａｒｄＤｉｓｋＤｒｉｖｅ（ＨＤＤ）やＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ（ＳＤＤ）等の不揮発性の記憶媒体である。記憶部１０ｂは、ＣＰＵが実行するプログラム及びデータを記憶する。 The storage unit 10b is a non-volatile storage medium such as a Hard Disk Drive (HDD) or a Solid State Drive (SDD). The storage unit 10b stores programs and data executed by the CPU.

通信部１０ｃは、学習装置１０を通信ネットワークＮに接続するハードウェアインタフェースである。 The communication unit 10c is a hardware interface that connects the learning device 10 to the communication network N.

入力部１０ｄは、ユーザからの入力を受け付けるものであり、例えば、キーボードやマウス、タッチパネルで構成される。 The input unit 10d receives input from the user, and is composed of, for example, a keyboard, a mouse, and a touch panel.

表示部１０ｅは、ＣＰＵによる処理結果を視覚的に表示するものであり、例えば、ＬｉｑｕｉｄＣｒｙｓｔａｌＤｉｓｐｌａｙ（ＬＣＤ）により構成される。 The display unit 10e visually displays the processing result by the CPU, and is composed of, for example, a Liquid Crystal Display (LCD).

学習装置１０は、例えば一般のパーソナルコンピュータのＣＰＵによって本実施形態に係る学習プログラムを実行することで構成されてよい。学習プログラムは、ＲＡＭや記憶部１０ｂ等のコンピュータによって読み取り可能な記憶媒体に記憶されて提供されてもよいし、通信部１０ｃにより接続される通信ネットワークＮを介して提供されてもよい。これらの物理的な構成は例示であって、必ずしも独立した構成でなくてもよい。 The learning device 10 may be configured by executing the learning program according to the present embodiment by, for example, the CPU of a general personal computer. The learning program may be stored and provided in a storage medium readable by a computer such as a RAM or a storage unit 10b, or may be provided via a communication network N connected by the communication unit 10c. These physical configurations are exemplary and do not necessarily have to be independent configurations.

（物理的構成：追加学習制御装置）
図３を参照して、本発明の所定の実施形態に係る追加学習制御装置２０の物理的構成について説明する。追加学習制御装置２０も、学習装置１０と同様に、ＣＰＵ及びＲＡＭを含む制御部２０ａ、データ等を記憶する記憶部２０ｂ、ネットワークＮと接続するための通信部２０ｃ、ユーザからの入力を受け付ける入力部２０ｄ、表示部２０ｅ等を有する。これら各構成は、バスを介して相互にデータ送受信可能に接続される。ＣＰＵが記憶部２０ｂに記憶されているプログラムをＲＡＭに展開し、ＲＡＭに展開された当該プログラムを解釈及び実行することにより、制御部２０ａは、後述する図５の各部として機能する。 (Physical configuration: additional learning control device)
The physical configuration of the additional learning control device 20 according to the predetermined embodiment of the present invention will be described with reference to FIG. Like the learning device 10, the additional learning control device 20 also has a control unit 20a including a CPU and a RAM, a storage unit 20b for storing data and the like, a communication unit 20c for connecting to the network N, and an input for receiving input from a user. It has a unit 20d, a display unit 20e, and the like. Each of these configurations is connected to each other via a bus so that data can be transmitted and received. The CPU expands the program stored in the storage unit 20b into the RAM, interprets and executes the program expanded in the RAM, and the control unit 20a functions as each unit of FIG. 5 described later.

追加学習制御装置２０は、例えば一般のパーソナルコンピュータのＣＰＵによって追加学習制御プログラムを実行することで構成されてよい。追加学習制御プログラムは、ＲＡＭや記憶部２０ｂ等のコンピュータによって読み取り可能な記憶媒体に記憶されて提供されてもよいし、通信部２０ｃにより接続される通信ネットワークＮを介して提供されてもよい。 The additional learning control device 20 may be configured by executing an additional learning control program by, for example, a CPU of a general personal computer. The additional learning control program may be stored and provided in a storage medium readable by a computer such as a RAM or a storage unit 20b, or may be provided via a communication network N connected by the communication unit 20c.

（機能構成：学習装置）
図４を参照して、本発明の所定の実施形態に係る学習装置１０の機能構成について説明する。学習装置１０は、学習指示受付部１０１、学習データ取得部１０２、学習制御部１０３、学習モジュール１０４、学習済モジュール出力部１０５及び学習済モジュール抽出部１０６を備える。 (Functional configuration: learning device)
With reference to FIG. 4, the functional configuration of the learning device 10 according to the predetermined embodiment of the present invention will be described. The learning device 10 includes a learning instruction receiving unit 101, a learning data acquisition unit 102, a learning control unit 103, a learning module 104, a learned module output unit 105, and a learned module extraction unit 106.

学習指示受付部１０１は、入力部１０ｄを介したユーザからの学習指示、又は通信部１０ｃを介した追加学習制御装置２０からの学習指示を受け付けて、学習指示に含まれる情報を後述の学習データ取得部１０２に引き渡す。本実施形態では、学習指示には、学習データ取得条件、入力パラメータの指定等が含まれる。学習データ取得条件とは、学習モジュール１０４を学習させるための学習データとして使用できるデータのうちでも、ユーザからの学習指示を満たすために必要となる条件をいう。例えばセンサ３０で取得されるデータのうち、取得日時を指定したものでもよい。入力パラメータとは、学習指示に含まれる情報のうち、学習済モジュールの制御の性能に影響を与える要因をいう。 The learning instruction receiving unit 101 receives a learning instruction from the user via the input unit 10d or a learning instruction from the additional learning control device 20 via the communication unit 10c, and inputs the information included in the learning instruction to the learning data described later. It is handed over to the acquisition unit 102. In the present embodiment, the learning instruction includes learning data acquisition conditions, designation of input parameters, and the like. The learning data acquisition condition refers to a condition required to satisfy a learning instruction from a user among data that can be used as learning data for training the learning module 104. For example, among the data acquired by the sensor 30, the acquisition date and time may be specified. The input parameter refers to a factor that affects the control performance of the learned module among the information included in the learning instruction.

学習データ取得部１０２は、学習データ取得条件を受信して、受信した学習データ取得条件に基づいて記憶装置４０から学習データを取得する。 The learning data acquisition unit 102 receives the learning data acquisition condition and acquires the learning data from the storage device 40 based on the received learning data acquisition condition.

学習制御部１０３は、学習データ取得部１０２が取得した学習データを用いて、学習モジュール１０４を学習させる。学習制御部１０３は、学習指示受付部１０１で受け付けた学習指示に基づいて学習を完了させる。学習が完了したと判断する基準は、例えば所定個数の学習データによる学習をした場合でもよい。または学習済モジュールの制御の性能が後述の学習目標を満たした場合に、学習を完了してもよい。学習が完了すると、学習制御部１０３は、学習済モジュールを記憶装置４０に記憶する。この際、本実施形態では、学習制御部１０３は、学習済モジュールを一意に識別可能な学習モジュール識別子及び学習データ取得条件と関連付けて学習済モジュールを記憶する。 The learning control unit 103 trains the learning module 104 using the learning data acquired by the learning data acquisition unit 102. The learning control unit 103 completes learning based on the learning instruction received by the learning instruction receiving unit 101. The criterion for determining that the learning is completed may be, for example, the case where learning is performed using a predetermined number of learning data. Alternatively, learning may be completed when the control performance of the learned module satisfies the learning objective described later. When the learning is completed, the learning control unit 103 stores the learned module in the storage device 40. At this time, in the present embodiment, the learning control unit 103 stores the learned module in association with the learning module identifier and the learning data acquisition condition that can uniquely identify the learned module.

ここで、記憶装置４０に記憶される学習済モジュールは、その更新履歴がわかるようにバージョン管理されることが望ましい。バージョン管理は、学習モジュール識別子そのもので行ってもよいし、別途設けたバージョン情報で行ってもよい。 Here, it is desirable that the learned module stored in the storage device 40 is version-controlled so that its update history can be known. Version control may be performed by the learning module identifier itself or by the version information provided separately.

学習モジュール１０４は、機械学習を実現するためのモジュールである。ここでは、学習モジュール１０４の一例としてニューラルネットワークを適用した実施例について説明する。しかしながら、ニューラルネットワークは学習モジュール１０４の一例にすぎず、学習装置１０は、学習モジュール１０４として他の構成を適用してもよい。 The learning module 104 is a module for realizing machine learning. Here, an example in which a neural network is applied as an example of the learning module 104 will be described. However, the neural network is only an example of the learning module 104, and the learning device 10 may apply another configuration as the learning module 104.

学習済モジュール出力部１０５は、学習済モジュールと学習モジュール識別子とを例えば追加学習制御装置２０のような外部に出力する。 The learned module output unit 105 outputs the learned module and the learning module identifier to the outside such as the additional learning control device 20.

学習済モジュール抽出部１０６は、学習モジュール抽出条件を受信して、受信した学習モジュール抽出条件に基づいて記憶装置４０から学習済モジュールを取得する。本実施形態では、学習モジュール抽出条件には、現在設定されている学習モジュールの学習モジュール識別子、及び抽出ポイントが含まれる。抽出ポイントは、性能が劣化する前の日付や現在設定されている学習モジュールから遡るバージョンの指定等、抽出対象となる学習モジュールを指定可能な情報を含む。例えば、抽出ポイントは、「２０１７年１２月３１日以前」、「現在設定されている学習モジュールの１つ前のバージョン」とすることができる。 The learned module extraction unit 106 receives the learning module extraction condition, and acquires the learned module from the storage device 40 based on the received learning module extraction condition. In the present embodiment, the learning module extraction condition includes the currently set learning module identifier of the learning module and the extraction point. The extraction point includes information that can specify the learning module to be extracted, such as the date before the performance deterioration and the specification of the version that goes back from the currently set learning module. For example, the extraction point can be "before December 31, 2017" or "the version immediately before the currently set learning module".

（機能構成：追加学習制御装置）
図５を参照して、本発明の所定の実施形態に係る追加学習制御装置２０の機能構成について説明する。追加学習制御装置２０は、学習済モジュール受付部２０１、学習モジュール２０２、学習目標受付部２０３、評価部２０４、判定部２０５、学習モジュール選択部２０６、要因推定部２０７、制御部２０８及びデータベース（ＤＢ）２０９を備える。 (Functional configuration: Additional learning control device)
With reference to FIG. 5, the functional configuration of the additional learning control device 20 according to the predetermined embodiment of the present invention will be described. The additional learning control device 20 includes a learned module reception unit 201, a learning module 202, a learning target reception unit 203, an evaluation unit 204, a judgment unit 205, a learning module selection unit 206, a factor estimation unit 207, a control unit 208, and a database (DB). ) 209.

学習済モジュール受付部２０１は、学習済モジュールと学習モジュール識別子とを受け付けて、受け付けた学習済モジュールを学習モジュール２０２として設定する。本実施形態では、学習済モジュール受付部２０１は、学習装置１０の学習済モジュール出力部１０５から学習済モジュールと学習モジュール識別子とを受け付けて、受け付けた学習済モジュールを学習モジュール２０２として設定する。なお、学習済モジュール受付部２０１は、学習済モジュールを記憶装置４０から受け付けて、学習モジュール２０２として設定してもよい。ここでは、学習モジュール２０２の一例としてニューラルネットワークを適用した実施例について説明する。しかしながら、ニューラルネットワークは学習モジュール２０２の一例にすぎず、追加学習制御装置２０は、学習モジュール２０２として他の構成を適用してもよい。 The learned module reception unit 201 receives the learned module and the learning module identifier, and sets the received learned module as the learning module 202. In the present embodiment, the learned module receiving unit 201 receives the learned module and the learning module identifier from the learned module output unit 105 of the learning device 10, and sets the received learned module as the learning module 202. The learned module reception unit 201 may receive the learned module from the storage device 40 and set it as the learning module 202. Here, an example in which a neural network is applied as an example of the learning module 202 will be described. However, the neural network is only an example of the learning module 202, and the additional learning control device 20 may apply another configuration as the learning module 202.

学習目標受付部２０３は、入力部２０ｄを介して、学習モジュール２０２が達成すべき学習目標を受け付けて、受け付けた学習目標をＤＢ２０９に記憶する。本実施形態では、学習目標には、評価項目及び条件が含まれる。評価項目は、学習モジュール２０２を評価するための項目であり、例えば、学習モジュールが出力する出力データの精度を判断するために用いられる項目である。 The learning goal receiving unit 203 receives the learning goal to be achieved by the learning module 202 via the input unit 20d, and stores the received learning goal in the DB 209. In this embodiment, the learning goal includes evaluation items and conditions. The evaluation item is an item for evaluating the learning module 202, and is, for example, an item used for determining the accuracy of the output data output by the learning module.

条件は、評価項目に対する条件として、例えば、評価項目「制御部２０８が制御する装置に対する外部操作回数／日」に対して、「当該評価項目が基準値ｘ以下」、「当該評価項目が直前に設定されていた学習モジュールより小さい」、評価項目「制御部２０８が制御する装置の消費電力量／月」に対して、「当該評価項目が基準値ｙ以下」、「当該評価項目が直前に設定されていた学習モジュールより小さい」、評価項目「所定の時間内に学習モジュール２０２が算出した値が許容変化率を超える回数／月」に対して、「当該評価項目が基準値ｚ以下」、「当該評価項目が直前に設定されていた学習モジュールより小さい」等とすることができる。 The conditions are, for example, for the evaluation item "number of external operations / day for the device controlled by the control unit 208", "the evaluation item is the reference value x or less", and "the evaluation item is immediately before". For the evaluation item "power consumption of the device controlled by the control unit 208 / month", "the evaluation item is less than or equal to the reference value y", and "the evaluation item is set immediately before". For the evaluation items "the number of times the value calculated by the learning module 202 exceeds the permissible change rate / month", "the evaluation item is less than or equal to the reference value z", and " The evaluation item is smaller than the learning module set immediately before. "

評価部２０４は、記憶装置４０に記憶されたセンシングデータを用いて学習モジュール２０２の評価を行って評価データを生成し、学習済モジュール受付部２０１が受け付けた学習モジュール識別子と関連付けて、生成した評価データをＤＢ２０９に記憶する。本実施形態では、評価部２０４は、少なくとも学習目標に含まれる評価項目について評価を行い、評価データを生成することができる。評価部２０４は、学習モジュール２０２が設定されてから一定の期間の経過後に自動で評価を行うこともできるし、別途、ユーザからの評価指示を受け付けて評価を行ってもよい。例えば、評価部２０４は、学習目標を受け付けることに応答して、評価を行ってもよい。なお、評価部２０４は学習目標に含まれる評価項目の他に、予め設定された評価項目について評価を行ってもよい。 The evaluation unit 204 evaluates the learning module 202 using the sensing data stored in the storage device 40, generates evaluation data, associates it with the learning module identifier received by the learned module reception unit 201, and generates the evaluation. The data is stored in the DB 209. In the present embodiment, the evaluation unit 204 can evaluate at least the evaluation items included in the learning goal and generate evaluation data. The evaluation unit 204 may automatically perform the evaluation after a lapse of a certain period of time after the learning module 202 is set, or may separately receive an evaluation instruction from the user to perform the evaluation. For example, the evaluation unit 204 may perform the evaluation in response to accepting the learning goal. In addition to the evaluation items included in the learning goal, the evaluation unit 204 may evaluate preset evaluation items.

判定部２０５は、学習目標受付部２０３が受け付けた学習目標とＤＢ２０９に記憶された評価データとを用いて、現在設定されている学習モジュールが、学習目標を達成したか否か判定する。学習目標を達成したと判定される場合、判定部２０５は、処理を終了する。 The determination unit 205 determines whether or not the currently set learning module has achieved the learning target by using the learning target received by the learning target reception unit 203 and the evaluation data stored in the DB 209. When it is determined that the learning goal has been achieved, the determination unit 205 ends the process.

学習モジュール選択部２０６は、学習目標を達成していないと判定される場合に、学習装置１０の学習済モジュール抽出部１０６に後述するとおり学習モジュール抽出条件を送信し、その応答として得られた学習済モジュールを学習モジュール２０２として設定する。学習モジュール選択部２０６が学習済モジュール抽出部１０６に学習モジュール抽出条件を送信することにより得られた学習済モジュールは、学習モジュール２０２として設定される前に、再度学習目標を達成するか評価されてもよい。 When it is determined that the learning target has not been achieved, the learning module selection unit 206 transmits a learning module extraction condition to the learned module extraction unit 106 of the learning device 10 as described later, and the learning obtained as a response thereof. The completed module is set as the learning module 202. The learned module obtained by the learning module selection unit 206 transmitting the learning module extraction condition to the learned module extraction unit 106 is evaluated to achieve the learning goal again before being set as the learning module 202. May be good.

また、学習モジュール選択部２０６は、後述の要因推定部２０７に、学習目標を達成することができなかった要因の推定を指示して、要因改善データを得てもよい。その後、要因推定部２０７から得られた要因改善データを用いて、学習モジュール選択部２０６は、学習装置１０に再追加学習を指示してもよい。 Further, the learning module selection unit 206 may instruct the factor estimation unit 207, which will be described later, to estimate the factors for which the learning goal could not be achieved, and obtain factor improvement data. After that, the learning module selection unit 206 may instruct the learning device 10 to perform re-additional learning using the factor improvement data obtained from the factor estimation unit 207.

要因推定部２０７は、現在設定されている学習モジュールが学習目標を達成していないと判定される場合に、学習目標を達成することができなかった要因を推定する。本実施形態では、要因推定部２０７は、学習モジュール選択部２０６からの指示に応答して、学習目標と記憶装置４０のセンシングデータとを用いて、学習目標を達成することができなかった要因を推定する。学習目標を達成することができなかった要因を推定するために、記憶装置４０のセンシングデータを統計処理する。 The factor estimation unit 207 estimates the factors that could not achieve the learning goal when it is determined that the currently set learning module has not achieved the learning goal. In the present embodiment, the factor estimation unit 207 responds to the instruction from the learning module selection unit 206 and uses the learning target and the sensing data of the storage device 40 to determine the factor that could not achieve the learning target. presume. The sensing data of the storage device 40 is statistically processed in order to estimate the factors that failed to achieve the learning goal.

制御部２０８は、学習モジュール２０２が算出した値を用いて、制御を行う。本実施形態では学習モジュールを用いてシステムの制御を行う一例として制御を行う制御部を示すが、後述するように、本発明の実施形態は、学習モジュールを用いて処理を実行する様々なシステムに適用することができる。 The control unit 208 controls using the value calculated by the learning module 202. In the present embodiment, a control unit that controls the system is shown as an example of controlling the system using the learning module. However, as will be described later, the embodiment of the present invention applies to various systems that execute processing using the learning module. Can be applied.

ＤＢ２０９には、学習目標ＤＢ２０９１及び評価データＤＢ２０９２が記憶される。本実施形態では、学習目標ＤＢ２０９１には、学習目標受付部２０３が受け付けた学習目標に含まれる、評価項目及び条件が記憶される。また、評価データＤＢ２０９２には、評価部２０４が生成した評価データと評価対象である学習モジュールの学習モジュール識別子とが関連付けて記憶される。 The learning target DB2091 and the evaluation data DB2092 are stored in the DB 209. In the present embodiment, the learning goal DB 2091 stores evaluation items and conditions included in the learning goal received by the learning goal reception unit 203. Further, in the evaluation data DB 2092, the evaluation data generated by the evaluation unit 204 and the learning module identifier of the learning module to be evaluated are stored in association with each other.

（追加学習処理）
［第１実施形態］
図６のフローチャートに沿って、追加学習制御装置２０によって実行される追加学習処理の第１実施形態について説明する。本発明の実施形態は、学習モジュールを用いて処理を実行する様々なシステムに適用することができ、その分野は特に限定されないが、以下の説明においては空調制御システムを例として記載する。 (Additional learning process)
[First Embodiment]
A first embodiment of the additional learning process executed by the additional learning control device 20 will be described with reference to the flowchart of FIG. The embodiment of the present invention can be applied to various systems that execute processing using a learning module, and the field thereof is not particularly limited, but in the following description, an air conditioning control system will be described as an example.

第１実施形態では、制御部２０８は例えば、空調制御部である。学習モジュール２０２はセンサ３０によって出力された現在室温、外気温、湿度等の値や既知の日時、部屋の体積等を入力パラメータとし、室温設定値を算出する。追加学習制御装置２０の制御部２０８は、学習モジュール２０２によって算出された室温設定値を用いて空調制御を行う。ここでは、制御部２０８が学習モジュール２０２を用いて実際に空調制御を行い、評価部２０４は、制御部２０８による制御結果について評価を行う実施例について説明する。しかしながら、代替として、追加学習制御装置２０はシミュレーション部（図示せず）を備えてもよい。代替の実施例では、シミュレーション部が学習モジュール２０２を用いて空調制御を行い、評価部２０４は、シミュレーション部によるシミュレーション結果について評価を行うことができる。 In the first embodiment, the control unit 208 is, for example, an air conditioning control unit. The learning module 202 calculates the room temperature set value by using the values such as the current room temperature, the outside air temperature, and the humidity output by the sensor 30, the known date and time, the volume of the room, and the like as input parameters. The control unit 208 of the additional learning control device 20 performs air conditioning control using the room temperature set value calculated by the learning module 202. Here, an embodiment in which the control unit 208 actually performs air conditioning control using the learning module 202 and the evaluation unit 204 evaluates the control result by the control unit 208 will be described. However, as an alternative, the additional learning control device 20 may include a simulation unit (not shown). In the alternative embodiment, the simulation unit performs air conditioning control using the learning module 202, and the evaluation unit 204 can evaluate the simulation result by the simulation unit.

また、ここでは、追加学習制御装置２０が学習モジュール２０２及び制御部２０８を備えた実施例について説明するが、代替として、追加学習制御装置２０とは別体の制御装置が学習モジュール及び制御部を備えてもよい。 Further, although an embodiment in which the additional learning control device 20 includes the learning module 202 and the control unit 208 will be described here, as an alternative, a control device separate from the additional learning control device 20 provides the learning module and the control unit. You may prepare.

学習装置１０の学習制御部１０３によって事前に行われた学習の結果、記憶装置４０には、２０１０年１月１日から２０１２年１２月３１日までの期間の学習データを用いて学習したニューラルネットワーク０と、２０１０年１月１日から２０１４年１２月３１日までの期間の学習データを用いて学習したニューラルネットワーク１と、２０１０年１月１日から２０１６年１２月３１日までの期間の学習データを用いて学習したニューラルネットワーク２とが記憶されているものとする。ニューラルネットワーク０は、学習モジュール識別子０と関連付けて記憶装置４０に記憶されている。ニューラルネットワーク１は、学習モジュール識別子１と関連付けて記憶装置４０に記憶されている。ニューラルネットワーク２は、学習モジュール識別子２と関連付けて記憶装置４０に記憶されている。 As a result of learning performed in advance by the learning control unit 103 of the learning device 10, the storage device 40 is a neural network learned using the learning data for the period from January 1, 2010 to December 31, 2012. 0, the neural network 1 learned using the training data from January 1, 2010 to December 31, 2014, and the learning from January 1, 2010 to December 31, 2016. It is assumed that the neural network 2 learned using the data is stored. The neural network 0 is stored in the storage device 40 in association with the learning module identifier 0. The neural network 1 is stored in the storage device 40 in association with the learning module identifier 1. The neural network 2 is stored in the storage device 40 in association with the learning module identifier 2.

追加学習制御装置２０は、２０１７年１月３１日まで、ニューラルネットワーク１を学習モジュール２０２として設定しているものとする。記憶装置４０には、センサ３０によって出力されたセンシングデータが随時記憶されている。すなわち、記憶装置４０には、ニューラルネットワーク１を用いて空調制御を行っていた期間中の、現在室温や外気温等の入力パラメータに関するセンシングデータ、及び室温設定値に対する外部操作回数や消費電力量等の評価項目に関するセンシングデータが記憶されている。また、評価部２０４によって以前行われた評価の結果、追加学習制御装置２０の評価データＤＢ２０９２には、ニューラルネットワーク０の評価データ０、及びニューラルネットワーク１の評価データ１が記憶されている。 It is assumed that the additional learning control device 20 sets the neural network 1 as the learning module 202 until January 31, 2017. The storage device 40 stores the sensing data output by the sensor 30 at any time. That is, the storage device 40 contains sensing data regarding input parameters such as the current room temperature and outside temperature during the period during which the air conditioning control is performed using the neural network 1, the number of external operations for the room temperature set value, the amount of power consumption, and the like. Sensing data related to the evaluation items of is stored. Further, as a result of the evaluation previously performed by the evaluation unit 204, the evaluation data 0 of the neural network 0 and the evaluation data 1 of the neural network 1 are stored in the evaluation data DB 2092 of the additional learning control device 20.

Ｓ６０１において、追加学習制御装置２０の学習済モジュール受付部２０１は、ニューラルネットワークと学習モジュール識別子とを受け付けて、受け付けたニューラルネットワークを学習モジュール２０２として設定する。本実施形態では、２０１７年２月１日に、学習済モジュール受付部２０１は、学習装置１０の学習済モジュール出力部１０５から、ニューラルネットワーク２と学習モジュール識別子２と受け付けて、受け付けたニューラルネットワーク２を学習モジュール２０２として設定する。２０１７年２月１日以降、制御部２０８は、設定されたニューラルネットワーク２によって算出された室温設定値を用いて空調制御を行っている。その間、前述したように、記憶装置４０には、ニューラルネットワーク２を用いて空調制御を行っている期間中の、センサ３０によって出力されたセンシングデータが随時記憶されている。 In S601, the learned module reception unit 201 of the additional learning control device 20 receives the neural network and the learning module identifier, and sets the received neural network as the learning module 202. In the present embodiment, on February 1, 2017, the trained module reception unit 201 receives the neural network 2 and the learning module identifier 2 from the trained module output unit 105 of the learning device 10, and receives the neural network 2 Is set as the learning module 202. Since February 1, 2017, the control unit 208 has been performing air conditioning control using the room temperature set value calculated by the set neural network 2. During that time, as described above, the storage device 40 stores the sensing data output by the sensor 30 at any time during the period during which the air conditioning control is performed using the neural network 2.

Ｓ６０２において、追加学習制御装置２０の学習目標受付部２０３は、入力部２０ｄを介して、学習モジュール２０２が達成すべき学習目標を受け付けて、受け付けた学習目標をＤＢ２０９の学習目標ＤＢ２０９１に記憶する。本実施形態では、２０１７年４月１日に、入力部２０ｄを介して追加学習制御装置２０の管理者から学習目標を受け付けて、学習目標受付部２０３は、受け付けた学習目標を学習目標ＤＢ２０９１に記憶する。学習目標には、評価項目「室温設定値に対する外部操作回数／日」と、条件「直前に設定されていた学習モジュールより少ない」とが含まれるものとする。 In S602, the learning target receiving unit 203 of the additional learning control device 20 receives the learning target to be achieved by the learning module 202 via the input unit 20d, and stores the received learning target in the learning target DB2091 of the DB 209. In the present embodiment, on April 1, 2017, a learning goal is received from the administrator of the additional learning control device 20 via the input unit 20d, and the learning goal receiving unit 203 sends the received learning goal to the learning goal DB2091. Remember. The learning goal shall include the evaluation item "number of external operations / day for the room temperature set value" and the condition "less than the learning module set immediately before".

Ｓ６０３において、追加学習制御装置２０の評価部２０４は、学習モジュール２０２の評価を行って評価データを生成し、学習済モジュール受付部２０１が受け付けた学習モジュール識別子と関連付けて、生成した評価データをＤＢ２０９の評価データＤＢ２０９２に記憶する。本実施形態では、Ｓ６０２で学習目標を受け付けることに応答して、評価部２０４は、ニューラルネットワーク２を用いて空調制御が行われた期間中に記憶装置４０に記憶されたセンシングデータを用いて、ニューラルネットワーク２の評価を行う。 In S603, the evaluation unit 204 of the additional learning control device 20 evaluates the learning module 202, generates evaluation data, associates it with the learning module identifier received by the learned module reception unit 201, and associates the generated evaluation data with the DB 209. It is stored in the evaluation data DB 2092 of. In the present embodiment, in response to receiving the learning target in S602, the evaluation unit 204 uses the sensing data stored in the storage device 40 during the period when the air conditioning control is performed using the neural network 2. The neural network 2 is evaluated.

ここで、評価部２０４は、Ｓ６０２で受け付けた学習目標に含まれる評価項目のみならず、予め設定された他の評価項目について評価を行ってもよい。このような構成とすることで、今後の学習目標において、異なる評価項目について、以前設定されていた学習モジュールとの比較条件が指定される場合に対応することができる。本実施形態では、評価部２０４は、学習目標に含まれる評価項目「室温設定値に対する外部操作回数／日」のみならず、予め設定された他の評価項目について評価を行って評価データ２を生成し、学習モジュール識別子２と関連付けて、評価データ２を評価データＤＢ２０９２に記憶する。 Here, the evaluation unit 204 may evaluate not only the evaluation items included in the learning goal received in S602 but also other preset evaluation items. With such a configuration, it is possible to cope with the case where the comparison condition with the previously set learning module is specified for different evaluation items in the future learning goal. In the present embodiment, the evaluation unit 204 evaluates not only the evaluation item "number of external operations / day for the room temperature set value" included in the learning target but also other preset evaluation items to generate evaluation data 2. Then, the evaluation data 2 is stored in the evaluation data DB 2092 in association with the learning module identifier 2.

次に、Ｓ６０４において、追加学習制御装置２０の判定部２０５は、Ｓ６０２で学習目標受付部２０３が受け付けた学習目標と評価データＤＢ２０９２に記憶された評価データとを用いて、現在設定されている学習モジュールが、学習目標を達成したか否か判定する。学習目標を達成したと判定される場合（Ｓ６０４：Ｙｅｓ）、追加学習制御装置２０は、処理を終了する。 Next, in S604, the determination unit 205 of the additional learning control device 20 uses the learning target received by the learning target reception unit 203 in S602 and the evaluation data stored in the evaluation data DB 2092, and is currently set for learning. Determine if the module has achieved its learning goals. When it is determined that the learning target has been achieved (S604: Yes), the additional learning control device 20 ends the process.

本実施形態では、判定部２０５は、評価項目「室温設定値に対する外部操作回数／日」、及び、条件「直前に設定されていた学習モジュールより少ない」を含む学習目標と、評価データ１及び評価データ２とを用いて、ニューラルネットワーク２が、学習目標を達成したか否か判定する。その結果、学習目標を達成していないと判定され（Ｓ６０４：Ｎｏ）、処理はＳ６０５に進む。 In the present embodiment, the determination unit 205 includes a learning goal including the evaluation item "number of external operations / day for the room temperature set value" and the condition "less than the learning module set immediately before", the evaluation data 1, and the evaluation. Using the data 2, it is determined whether or not the neural network 2 has achieved the learning goal. As a result, it is determined that the learning goal has not been achieved (S604: No), and the process proceeds to S605.

Ｓ６０５において、追加学習制御装置２０の学習モジュール選択部２０６は、学習装置１０の学習済モジュール抽出部１０６に学習モジュール抽出条件を送信し、その応答として得られた学習モジュールを学習モジュール２０２として設定し、処理を終了する。本実施形態では、学習モジュール選択部２０６は、学習済モジュール抽出部１０６に学習モジュール抽出条件「現在設定されている学習モジュールの学習モジュール識別子：学習モジュール識別子２、抽出ポイント：現在設定されている学習モジュールの１つ前のバージョン」を送信し、応答として得たニューラルネットワーク１を学習モジュール２０２として設定する。 In S605, the learning module selection unit 206 of the additional learning control device 20 transmits the learning module extraction condition to the learned module extraction unit 106 of the learning device 10, and sets the learning module obtained as the response as the learning module 202. , End the process. In the present embodiment, the learning module selection unit 206 has the learning module extraction condition "learning module identifier of the currently set learning module: learning module identifier 2, extraction point: currently set learning" in the learned module extraction unit 106. The "previous version of the module" is transmitted, and the neural network 1 obtained as a response is set as the learning module 202.

［第２実施形態］
第１実施形態では、学習目標を達成していないと判定されたニューラルネットワーク２について、更新前のバージョンであるニューラルネットワーク１に戻す例について説明した。第２実施形態では、学習目標を達成していないと判定された場合に、ニューラルネットワークをさらに追加学習させる例について説明する。 [Second Embodiment]
In the first embodiment, an example of returning the neural network 2 determined not to achieve the learning goal to the version before the update, the neural network 1, has been described. In the second embodiment, an example in which the neural network is further trained when it is determined that the learning goal has not been achieved will be described.

図７は、追加学習制御装置２０によって実行される追加学習処理の流れを示すフローチャートである。第２実施形態では第１実施形態と共通の事柄についての記述を省略し、異なる点についてのみ説明する。図７のＳ７０１からＳ７０５は、図６のＳ６０１からＳ６０５と同じ処理であるので、これらの処理の詳細な説明は省略する。 FIG. 7 is a flowchart showing the flow of the additional learning process executed by the additional learning control device 20. In the second embodiment, the description of the matters common to the first embodiment will be omitted, and only the differences will be described. Since S701 to S705 in FIG. 7 are the same processes as S601 to S605 in FIG. 6, detailed description of these processes will be omitted.

Ｓ７０６において、学習モジュール選択部２０６は、追加学習制御装置２０の要因推定部２０７に、学習目標を達成することができなかった要因の推定を指示し、要因推定部２０７は、学習目標を達成することができなかった要因を推定する。本実施形態では、要因推定部２０７は、学習目標に含まれる評価項目「室温設定値に対する外部操作回数／日」について、記憶装置４０のセンシングデータを用いて要因を推定する。例えば、要因推定部２０７は、ニューラルネットワーク２に用いた学習データの期間である２０１０年１月１から２０１６年１２月３１日までの期間のセンシングデータを用いて、要因を推定する。 In S706, the learning module selection unit 206 instructs the factor estimation unit 207 of the additional learning control device 20 to estimate the factors for which the learning target could not be achieved, and the factor estimation unit 207 achieves the learning target. Estimate the factors that could not be achieved. In the present embodiment, the factor estimation unit 207 estimates the factor for the evaluation item “number of external operations / day with respect to the room temperature set value” included in the learning target by using the sensing data of the storage device 40. For example, the factor estimation unit 207 estimates the factor using the sensing data in the period from January 1, 2010 to December 31, 2016, which is the period of the training data used in the neural network 2.

本実施形態では、要因推定部２０７は、既知のアルゴリズムを使用して、例えば学習目標に含まれる評価項目「室温設定値に対する外部操作回数／日」が他の日の値から大きく外れて多かった日について、考えられるパターンを見出す。例えば、特定の期間「２０１６年７月１日から２０１６年７月３１日まで」において「室温設定値に対する外部操作回数／日」が多いというパターンを見出したものとする。なお、パターンについては期間によるものだけでなく、入力パラメータに含まれる外気温、湿度等の値が所定の値と異なる値が続いたときに、「入力パラメータの値」が大きい又は少ないというパターンを見出してもよい。また、入力パラメータに含まれる部屋の体積が変わっていた場合に、「部屋の体積」が変化したというパターンを見出してもよい。 In the present embodiment, the factor estimation unit 207 uses a known algorithm, and for example, the evaluation item “number of external operations / day with respect to the room temperature set value” included in the learning goal is often greatly deviated from the values of other days. Find possible patterns for the day. For example, it is assumed that a pattern is found in which "the number of external operations / day with respect to the room temperature set value" is large in a specific period "from July 1, 2016 to July 31, 2016". It should be noted that the pattern is not limited to the period, but when the values of the outside air temperature, humidity, etc. included in the input parameters continue to be different from the predetermined values, the pattern that the "input parameter value" is large or small is used. You may find it. Further, when the volume of the room included in the input parameter is changed, the pattern that the "volume of the room" is changed may be found.

Ｓ７０７において、要因推定部２０７は、外れ値のパターンを見出したか否か判定する。外れ値のパターンを見出していない場合（Ｓ７０７：Ｎｏ）、追加学習制御装置２０は処理を終了する。一方、外れ値のパターンを見出した場合（Ｓ７０７：Ｙｅｓ）、Ｓ７０８に進み、要因推定部２０７は、見出した外れ値のパターンを出力してもよい。本実施形態では、図８に示されるように、見出した外れ値のパターンを示すテキスト８０１「２０１６年７月１日から２０１６年７月３１日において、室温設定値に対する外部操作回数／日が多い。」、パターンが性能劣化要因か否かを設定する性能劣化要因ラジオボタン８０２、詳細設定ボタン８０３、及び決定ボタン８０４を含む要因確定画面を出力する。詳細設定ボタンについては後述する。 In S707, the factor estimation unit 207 determines whether or not an outlier pattern has been found. If no outlier pattern is found (S707: No), the additional learning control device 20 ends the process. On the other hand, when an outlier pattern is found (S707: Yes), the process proceeds to S708, and the factor estimation unit 207 may output the found outlier pattern. In the present embodiment, as shown in FIG. 8, the text 801 indicating the pattern of the found outliers “From July 1, 2016 to July 31, 2016, the number of external operations / day with respect to the room temperature set value is large. A factor confirmation screen including a performance deterioration factor radio button 802, a detailed setting button 803, and a decision button 804 for setting whether or not the pattern is a performance deterioration factor is output. The detailed setting button will be described later.

Ｓ７０９において、要因推定部２０７は、ユーザ入力を受信したか否か判定する。ユーザ入力を受信した場合、Ｓ７１０に進み、要因推定部２０７は、ユーザ入力に基づいて要因改善データを生成し、生成した要因改善データを学習モジュール選択部２０６に戻す。要因改善データには、Ｓ７０８において要因推定部２０７が出力した外れ値のパターンのうち、性能劣化要因ラジオボタン８０２で性能劣化要因として選択されること等により、学習データに含めることが望ましくないことが指示されたパターンが含まれる。 In S709, the factor estimation unit 207 determines whether or not the user input has been received. When the user input is received, the process proceeds to S710, and the factor estimation unit 207 generates factor improvement data based on the user input, and returns the generated factor improvement data to the learning module selection unit 206. It is not desirable to include the factor improvement data in the training data because it is selected as a performance deterioration factor by the performance deterioration factor radio button 802 among the outlier patterns output by the factor estimation unit 207 in S708. The indicated pattern is included.

本実施形態では、追加学習制御装置２０の管理者は、２０１６年７月１日から２０１６年７月３１日までの期間において室内工事が行われていたことを確認し、Ｓ７０８において出力された画面において、性能劣化要因ラジオボタン８０２で性能劣化要因として選択し、決定ボタン８０４を押下したものとする。その結果、要因推定部２０７は、受信したユーザ入力に基づいて要因改善データ「２０１６年７月１日から２０１６年７月３１日までの期間を除く」を生成し、生成した要因改善データを学習モジュール選択部２０６に戻す。 In the present embodiment, the administrator of the additional learning control device 20 confirms that the indoor construction was carried out during the period from July 1, 2016 to July 31, 2016, and the screen output in S708. In the above, it is assumed that the performance deterioration factor radio button 802 is selected as the performance deterioration factor and the enter button 804 is pressed. As a result, the factor estimation unit 207 generates factor improvement data "excluding the period from July 1, 2016 to July 31, 2016" based on the received user input, and learns the generated factor improvement data. Return to the module selection unit 206.

Ｓ７１１において、学習モジュール選択部２０６は、要因改善データに基づいて学習指示を生成し、生成した学習指示を学習装置１０の学習指示受付部１０１に送信して、処理を終了する。本実施形態では、学習モジュール選択部２０６は、「学習データ取得条件：２０１０年１月１日から２０１６年１２月３１日まで、２０１６年７月１日から２０１６年７月３１日までの期間を除く」を含む学習指示を学習指示受付部１０１に送信する。 In S711, the learning module selection unit 206 generates a learning instruction based on the factor improvement data, transmits the generated learning instruction to the learning instruction receiving unit 101 of the learning device 10, and ends the process. In the present embodiment, the learning module selection unit 206 sets the “learning data acquisition condition: from January 1, 2010 to December 31, 2016, and from July 1, 2016 to July 31, 2016. A learning instruction including "exclude" is transmitted to the learning instruction receiving unit 101.

なお、本実施形態では、Ｓ７０７で外れ値のパターンを見出したと判定される場合に、Ｓ７１０でユーザの入力に基づいて要因改善データを生成したが、Ｓ７０８からＳ７１０を省略してユーザの入力を受け付けることなく、要因推定部２０７が、見出した外れ値のパターンに基づいて要因改善データを生成してもよい。 In the present embodiment, when it is determined that the outlier pattern is found in S707, the factor improvement data is generated based on the user's input in S710, but the user's input is accepted by omitting S708 to S710. Instead, the factor estimation unit 207 may generate factor improvement data based on the found outlier pattern.

このようにすることで、学習装置１０は、受信した学習指示に基づいて、ニューラルネットワークをさらに追加学習させることができる。本実施形態では、学習装置１０は、２０１６年７月１日から２０１６年７月３１日までの期間を除く学習データを用いて、ニューラルネットワーク１をさらに追加学習させることができる。 By doing so, the learning device 10 can further train the neural network based on the received learning instruction. In the present embodiment, the learning device 10 can further train the neural network 1 by using the learning data excluding the period from July 1, 2016 to July 31, 2016.

［第３実施形態］
第２実施形態では、学習目標を達成していないと判定された場合に、学習データに含めることが望ましくないデータを除外して、ニューラルネットワークをさらに追加学習させる例について説明した。第３実施形態では、学習目標を達成していないと判定された場合に、ニューラルネットワークに対してさらに学習させるべきデータを追加して、ニューラルネットワークを追加学習させる例について説明する。 [Third Embodiment]
In the second embodiment, when it is determined that the learning goal has not been achieved, data that is not desirable to be included in the learning data is excluded, and an example in which the neural network is further trained has been described. In the third embodiment, when it is determined that the learning target has not been achieved, data to be further trained is added to the neural network, and an example in which the neural network is additionally trained will be described.

図７を参照して、第３実施形態について説明する。第３実施形態においても、第１実施形態及び第２実施形態と共通の事柄についての記述を省略し、異なる点についてのみ説明する。図７のＳ７０１からＳ７０５までは、第１実施形態と同じ処理であるので、これらの処理の詳細な説明は省略する。 A third embodiment will be described with reference to FIG. 7. Also in the third embodiment, the description of the matters common to those of the first embodiment and the second embodiment will be omitted, and only the differences will be described. Since S701 to S705 of FIG. 7 are the same processes as those in the first embodiment, detailed description of these processes will be omitted.

Ｓ７０６において、要因推定部２０７は、学習目標を達成することができなかった要因を推定する。本実施形態では、特定の時間帯「毎週土曜日の８時から９時」において「室温設定値に対する外部操作回数」が多いというパターンを見出したものとする。なお、パターンについては期間によるものだけでなく、入力パラメータに含まれる外気温、湿度等の値が所定の値と異なる値が続いたときに、「入力パラメータの値」が大きい又は少ないというパターンを見出してもよい。また、入力パラメータに含まれる部屋の体積が変わっていた場合に、「部屋の体積」が変化したというパターンを見出してもよい。 In S706, the factor estimation unit 207 estimates the factors for which the learning goal could not be achieved. In the present embodiment, it is assumed that a pattern is found in which the "number of external operations with respect to the room temperature set value" is large in a specific time zone "every Saturday from 8:00 to 9:00". It should be noted that the pattern is not limited to the period, but when the values of the outside air temperature, humidity, etc. included in the input parameters continue to be different from the predetermined values, the pattern that the "input parameter value" is large or small is used. You may find it. Further, when the volume of the room included in the input parameter is changed, the pattern that the "volume of the room" is changed may be found.

Ｓ７０７において、要因推定部２０７は、外れ値のパターンを見出したか否か判定する。本実施形態では、外れ値のパターンを見出したのでＳ７０８に進み、要因推定部２０７は、見出した外れ値のパターンを出力してもよい。本実施形態では、見出した外れ値のパターンを示すテキスト８０１「毎週土曜日の８時から９時において、室温設定値に対する外部操作が多い。」、性能劣化要因ラジオボタン８０２、詳細設定ボタン８０３、及び決定ボタン８０４を含む要因確定画面を出力する。 In S707, the factor estimation unit 207 determines whether or not an outlier pattern has been found. In the present embodiment, since the pattern of the outliers has been found, the process proceeds to S708, and the factor estimation unit 207 may output the pattern of the found outliers. In the present embodiment, the text 801 indicating the pattern of the found outliers "There are many external operations on the room temperature set value from 8:00 to 9:00 every Saturday", the performance deterioration factor radio button 802, the detailed setting button 803, and the detailed setting button 803. The factor confirmation screen including the decision button 804 is output.

本実施形態では、追加学習制御装置２０の管理者は、毎週土曜日は平日よりも少ない数のユーザが部屋を利用することから、室温設定値を平日と比較して高く設定することが望ましいことを確認し、Ｓ７０９において出力された画面において、性能劣化要因ラジオボタン８０２で非性能劣化要因として選択し、詳細設定ボタン８０３を押下したものとする。例えば詳細設定ボタン８０３を押下することにより、詳細設定画面が表示され、管理者は、この詳細設定画面において入力パラメータの調整を行ってもよい。代替として、追加学習制御装置２０が、入力パラメータの調整を行ってもよい。 In the present embodiment, the administrator of the additional learning control device 20 desires to set the room temperature setting value higher than that on weekdays because a smaller number of users use the room every Saturday than on weekdays. After confirming, on the screen output in S709, it is assumed that the performance deterioration factor radio button 802 is selected as the non-performance deterioration factor and the detailed setting button 803 is pressed. For example, by pressing the detailed setting button 803, the detailed setting screen is displayed, and the administrator may adjust the input parameters on this detailed setting screen. Alternatively, the additional learning control device 20 may adjust the input parameters.

Ｓ７０９において、要因推定部２０７は、ユーザ入力を受信したか否か判定する。本実施形態では、管理者が、詳細設定画面において入力パラメータの調整をし、その後、決定ボタン８０４を押下したものとする。その結果、処理はＳ７１０に進み、要因推定部２０７は、ユーザ入力に基づいて要因改善データを生成し、生成した要因改善データを学習モジュール選択部２０６に戻す。本実施形態では、要因改善データには、詳細設定画面で指定された入力パラメータに関する設定が含まれる。 In S709, the factor estimation unit 207 determines whether or not the user input has been received. In the present embodiment, it is assumed that the administrator adjusts the input parameters on the detailed setting screen and then presses the enter button 804. As a result, the process proceeds to S710, and the factor estimation unit 207 generates factor improvement data based on the user input, and returns the generated factor improvement data to the learning module selection unit 206. In the present embodiment, the factor improvement data includes the settings related to the input parameters specified on the detailed setting screen.

Ｓ７１１において、学習モジュール選択部２０６は、要因改善データに基づいて学習指示を生成し、生成した学習指示を学習装置１０の学習指示受付部１０１に送信して、処理を終了する。本実施形態では、学習モジュール選択部２０６は、詳細設定画面で指定された入力パラメータに関する設定を含む学習指示を学習指示受付部１０１に送信する。 In S711, the learning module selection unit 206 generates a learning instruction based on the factor improvement data, transmits the generated learning instruction to the learning instruction receiving unit 101 of the learning device 10, and ends the process. In the present embodiment, the learning module selection unit 206 transmits a learning instruction including settings related to the input parameters specified on the detailed setting screen to the learning instruction receiving unit 101.

このようにすることで、学習装置１０は、受信した学習指示に基づいて、ニューラルネットワークをさらに追加学習させることができる。本実施形態では、学習装置１０は、追加学習制御装置２０が見出したパターンが性能劣化要因ではなく、ニューラルネットワークにさらに学習させるべきパターンである場合に、追加の学習データを用いて、ニューラルネットワーク２をさらに追加学習させることができる。 By doing so, the learning device 10 can further train the neural network based on the received learning instruction. In the present embodiment, the learning device 10 uses the additional learning data when the pattern found by the additional learning control device 20 is not a performance deterioration factor but a pattern to be further trained by the neural network, and the neural network 2 Can be further learned.

［別の実施形態］
上述したとおり、別の実施形態では、図９に示されるように、学習システム１は、学習済モジュール受付部２０１と、学習モジュール２０２と、制御部２０８とを備えた制御装置５０を備えてもよい。学習システム１が制御装置５０を備える本実施形態では、学習システム１は、学習目標受付部２０３、評価部２０４、判定部２０５、学習モジュール選択部２０６、要因推定部２０７及びＤＢ２０９を備えた評価装置６０を備える。前述したように、追加学習制御装置２０は、評価装置６０を含む。 [Another Embodiment]
As described above, in another embodiment, as shown in FIG. 9, the learning system 1 may include a control device 50 including a learned module reception unit 201, a learning module 202, and a control unit 208. good. In the present embodiment in which the learning system 1 includes the control device 50, the learning system 1 is an evaluation device including a learning target reception unit 203, an evaluation unit 204, a determination unit 205, a learning module selection unit 206, a factor estimation unit 207, and a DB 209. 60 is provided. As described above, the additional learning control device 20 includes the evaluation device 60.

評価装置６０は、図１０に示されるように、ＣＰＵ及びＲＡＭを含む制御部６０ａ、ＤＢ２０９のデータ等を記憶する記憶部６０ｂ、ネットワークＮと接続するための通信部６０ｃ、ユーザからの入力を受け付ける入力部６０ｄ、表示部６０ｅ等を備えている。これら各構成は、バスを介して相互にデータ送受信可能に接続される。ＣＰＵが記憶部に記憶されているプログラムをＲＡＭに展開し、ＲＡＭに展開された当該プログラムを解釈及び実行することにより、制御部６０ａは、図９の各部として機能する。 As shown in FIG. 10, the evaluation device 60 receives input from a control unit 60a including a CPU and RAM, a storage unit 60b for storing data of DB209, a communication unit 60c for connecting to a network N, and an input from a user. It includes an input unit 60d, a display unit 60e, and the like. Each of these configurations is connected to each other via a bus so that data can be transmitted and received. The CPU expands the program stored in the storage unit into the RAM, interprets and executes the program expanded in the RAM, and the control unit 60a functions as each unit in FIG.

評価装置６０は、例えば一般のパーソナルコンピュータのＣＰＵによって追加学習プログラムを実行することで構成されてよい。追加学習プログラムは、ＲＡＭや記憶部６０ｂ等のコンピュータによって読み取り可能な記憶媒体に記憶されて提供されてもよいし、通信部により接続される通信ネットワークＮを介して提供されてもよい。 The evaluation device 60 may be configured by executing an additional learning program by, for example, a CPU of a general personal computer. The additional learning program may be stored and provided in a storage medium readable by a computer such as a RAM or a storage unit 60b, or may be provided via a communication network N connected by the communication unit.

本明細書において説明した各処理を実施するプログラムは、記録媒体に記憶させてもよい。この記録媒体を用いれば、コンピュータに上記プログラムをインストールすることにより、当該コンピュータを評価装置６０又は追加学習制御装置２０として機能させることができる。ここで、上記プログラムを記憶した記録媒体は、非一過性の記録媒体であってもよい。非一過性の記録媒体は特に限定されないが、例えば、ＣＤ−ＲＯＭ等の記録媒体であってもよい。 The program that performs each process described in the present specification may be stored in a recording medium. By using this recording medium, the computer can be made to function as the evaluation device 60 or the additional learning control device 20 by installing the above program on the computer. Here, the recording medium in which the above program is stored may be a non-transient recording medium. The non-transient recording medium is not particularly limited, but may be, for example, a recording medium such as a CD-ROM.

上記の実施形態の一部又は全部は、以下の付記のようにも記載され得るが、以下には限られない。 Some or all of the above embodiments may also be described, but not limited to:

（付記１）
少なくとも１つのメモリと、前記メモリと接続された少なくとも１つのハードウェアプロセッサとを備えた、第１の学習モジュールを追加学習させた第２の学習モジュールを評価する評価装置であって、
前記ハードウェアプロセッサが、
前記第２の学習モジュールが達成すべき学習目標を受け付け、
少なくとも前記学習目標に含まれる評価項目について前記第２の学習モジュールの評価を行い、評価データを生成し、
前記学習目標と前記評価データとを用いて、前記第２の学習モジュールが前記学習目標を達成したか否か判定し、
前記学習目標を達成しないと判定される場合、少なくとも前記学習目標に基づいて、前記第２の学習モジュールと異なる第３の学習モジュールを取得する、
評価装置。 (Appendix 1)
An evaluation device for evaluating a second learning module in which a first learning module is additionally trained, which includes at least one memory and at least one hardware processor connected to the memory.
The hardware processor
Accepting the learning goals to be achieved by the second learning module,
At least the evaluation items included in the learning goal are evaluated by the second learning module, and evaluation data is generated.
Using the learning goal and the evaluation data, it is determined whether or not the second learning module has achieved the learning goal.
If it is determined that the learning goal is not achieved, a third learning module different from the second learning module is acquired, at least based on the learning goal.
Evaluation device.

（付記２）
第１の学習モジュールを追加学習させた第２の学習モジュールを評価する評価方法であって、
少なくとも１つ以上のハードウェアプロセッサによって、前記第２の学習モジュールが達成すべき学習目標を受け付け、
前記ハードウェアプロセッサによって、少なくとも前記学習目標に含まれる評価項目について前記第２の学習モジュールの評価を行い、評価データを生成し、
前記ハードウェアプロセッサによって、前記学習目標と前記評価データとを用いて、前記第２の学習モジュールが前記学習目標を達成したか否か判定し、
前記学習目標を達成しないと判定される場合、少なくとも前記学習目標に基づいて、前記第２の学習モジュールと異なる第３の学習モジュールを取得する、
評価方法。 (Appendix 2)
This is an evaluation method for evaluating the second learning module in which the first learning module is additionally learned.
At least one or more hardware processors accept the learning goals to be achieved by the second learning module.
The hardware processor evaluates the second learning module for at least the evaluation items included in the learning goal, generates evaluation data, and generates evaluation data.
Using the learning goal and the evaluation data, the hardware processor determines whether or not the second learning module has achieved the learning goal.
If it is determined that the learning goal is not achieved, a third learning module different from the second learning module is acquired, at least based on the learning goal.
Evaluation method.

１…学習システム、１０…学習装置、１０ａ…制御部、１０ｂ…記憶部、１０ｃ…通信部、１０ｄ…入力部、１０ｅ…表示部、１０１…学習指示受付部、１０２…学習データ取得部、１０３…学習制御部、１０４…学習モジュール、１０５…学習済モジュール出力部、１０６…学習済モジュール抽出部、２０…追加学習制御装置、２０ａ…制御部、２０ｂ…記憶部、２０ｃ…通信部、２０ｄ…入力部、２０ｅ…表示部、２０１…学習済モジュール受付部、２０２…学習モジュール、２０３…学習目標受付部、２０４…評価部、２０５…判定部、２０６…学習モジュール選択部、２０７…要因推定部、２０８…制御部、２０９…ＤＢ、２０９１…学習目標ＤＢ、２０９２…評価データＤＢ、３０…センサ、４０…記憶装置、５０…制御装置、６０…評価装置、６０ａ…制御部、６０ｂ…記憶部、６０ｃ…通信部、６０ｄ…入力部、６０ｅ…表示部、８０１…テキスト、８０２…性能劣化要因ラジオボタン、８０３…詳細設定ボタン、８０４…決定ボタン 1 ... Learning system, 10 ... Learning device, 10a ... Control unit, 10b ... Storage unit, 10c ... Communication unit, 10d ... Input unit, 10e ... Display unit, 101 ... Learning instruction reception unit, 102 ... Learning data acquisition unit, 103 ... learning control unit, 104 ... learning module, 105 ... learned module output unit, 106 ... learned module extraction unit, 20 ... additional learning control device, 20a ... control unit, 20b ... storage unit, 20c ... communication unit, 20d ... Input unit, 20e ... Display unit, 201 ... Learned module reception unit, 202 ... Learning module, 203 ... Learning target reception unit, 204 ... Evaluation unit, 205 ... Judgment unit, 206 ... Learning module selection unit, 207 ... Factor estimation unit , 208 ... Control unit, 209 ... DB, 2091 ... Learning target DB, 2092 ... Evaluation data DB, 30 ... Sensor, 40 ... Storage device, 50 ... Control device, 60 ... Evaluation device, 60a ... Control unit, 60b ... Storage unit , 60c ... communication unit, 60d ... input unit, 60e ... display unit, 801 ... text, 802 ... performance deterioration factor radio button, 803 ... detailed setting button, 804 ... enter button

Claims

It is an evaluation device that evaluates the second learning module in which the first learning module is additionally learned.
A learning goal reception unit that receives learning goals to be achieved by the second learning module,
An evaluation unit that evaluates the second learning module and generates evaluation data for at least the evaluation items included in the learning goal.
Using the learning goal and the evaluation data, a determination unit for determining whether or not the second learning module has achieved the learning goal, and
Wherein when it is determined not to achieve learning objectives, evaluated and a learning module selection unit for acquiring third learning module different from the second learning module device.

The evaluation device according to claim 1, wherein the third learning module is the first learning module.

The claim further includes a factor estimation unit that estimates a factor that the second learning module could not achieve the learning goal and generates factor improvement data related to the learning data used for additional learning of the learning module. The evaluation device according to 1.

The evaluation device according to claim 3, wherein the learning module selection unit generates a learning instruction based on the factor improvement data.

The evaluation device according to claim 3, wherein the factor estimation unit outputs the estimated factor, receives a user input regarding the estimated factor, and generates the factor improvement data based on the user input.

An evaluation method for evaluating a second learning module in which the first learning module is additionally learned, and a computer provided with a control unit is used.
The process of accepting the learning goals to be achieved by the second learning module, and
A process of evaluating the second learning module for at least the evaluation items included in the learning goal and generating evaluation data, and
A step of determining whether or not the second learning module has achieved the learning goal by using the learning goal and the evaluation data, and
Wherein when it is determined not to achieve learning objectives, evaluation method comprising a step of acquiring a third learning module different from the second learning modules.

A program for causing a computer to evaluate a second learning module in which the first learning module is additionally learned.
The process of accepting the learning goal to be achieved by the second learning module,
At least the evaluation items included in the learning goal are evaluated by the second learning module, and evaluation data is generated.
A process of determining whether or not the second learning module has achieved the learning goal by using the learning goal and the evaluation data.
A program that executes a process of acquiring a third learning module different from the second learning module when it is determined that the learning goal is not achieved.