JP7700542B2

JP7700542B2 - Information processing device, information processing method, and program

Info

Publication number: JP7700542B2
Application number: JP2021111255A
Authority: JP
Inventors: 聡志川村
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2021-07-05
Filing date: 2021-07-05
Publication date: 2025-07-01
Anticipated expiration: 2041-07-05
Also published as: JP2023008028A

Description

本発明は、情報処理装置、情報処理方法およびプログラムに関する。 The present invention relates to an information processing device, an information processing method, and a program.

ニューラルネットワーク（以下、「ＮＮ」とも表記する。）は、画像認識などにおいて高い性能を有する。ＮＮの学習精度を高めるためには、膨大な入力データとそれに対応した教師ラベルが必要となることが知られている。しかし、教師ラベルは人手によって付与される場合が多い。そのため、膨大な入力データに対して教師ラベルを付与する負担がユーザに掛かってしまう。 Neural networks (hereafter referred to as "NNs") have high performance in areas such as image recognition. It is known that in order to improve the learning accuracy of NNs, a huge amount of input data and corresponding teacher labels are required. However, teacher labels are often assigned manually. As a result, the burden of assigning teacher labels to huge amounts of input data falls on the user.

近年、この課題を解決するため、収集した入力データのうち少量のデータに教師ラベルを付与し、残りのデータには教師ラベルを付与せずにＮＮの学習を行う半教師あり学習の研究が盛んになっている。半教師あり学習によれば、ユーザの負担が大きく削減され得る。一般に、半教師あり学習に用いられる損失関数は、教師ラベル付きデータセット（ラベル付きデータセット）に対応する損失関数と、教師ラベル無しデータセット（ラベル無しデータセット）に対応する損失関数との重み和によって定義される。 In recent years, in order to solve this problem, research has been actively conducted into semi-supervised learning, in which teacher labels are assigned to a small amount of collected input data and the remaining data is trained by a neural network without teacher labels. Semi-supervised learning can significantly reduce the burden on users. In general, the loss function used in semi-supervised learning is defined as the weighted sum of the loss function corresponding to the dataset with teacher labels (labeled dataset) and the loss function corresponding to the dataset without teacher labels (unlabeled dataset).

非特許文献１に記載の手法は、特に画像認識における半教師あり学習の手法の一つであり、教師ラベルが付されていない画像に対して２種類のデータ拡張を施し、２種類のデータ拡張によって得られた２種類の画像同士を比較することに基づいて学習を行う手法である。これによって、学習精度を高めることが可能となる。非特許文献１に記載の手法においては、教師ラベルが付されていない入力データ（ラベル無しデータ）に依存しない定数が個々のラベル無しデータに対応する損失関数に乗じられることによって、ラベル無しデータ全体に対応する損失関数が算出される。 The method described in Non-Patent Document 1 is one of the semi-supervised learning methods in image recognition, in which two types of data augmentation are applied to an image without teacher labels, and learning is performed based on comparing the two types of images obtained by the two types of data augmentation. This makes it possible to improve the learning accuracy. In the method described in Non-Patent Document 1, a constant that does not depend on input data without teacher labels (unlabeled data) is multiplied by the loss function corresponding to each piece of unlabeled data to calculate the loss function corresponding to the entire unlabeled data.

非特許文献２に記載の手法は、半教師あり学習の一つであり、ラベル無しデータごとに影響度を示す係数を算出し、ラベル無しデータごとの係数と損失関数との重み和によってラベル無しデータ全体に対応する損失関数を算出する手法である。ただし、係数の算出は、学習用データとは別に用意された検証用の教師ラベル付きデータセットを用いて行われる。 The method described in Non-Patent Document 2 is a type of semi-supervised learning method that calculates a coefficient indicating the degree of influence for each piece of unlabeled data, and calculates a loss function corresponding to the entire unlabeled data by a weighted sum of the coefficient for each piece of unlabeled data and the loss function. However, the coefficients are calculated using a dataset with teacher labels for validation that is prepared separately from the training data.

Kihyuk Sohn、他8名、"FixMatch:Simplifying Semi-Supervised Learning with Consistencyand Confidence"、[online]、［令和3年6月18日検索］、インターネット＜https://arxiv.org/abs/2001.07685＞Kihyuk Sohn and 8 others, "FixMatch: Simplifying Semi-Supervised Learning with Consistencyand Confidence", [online], [Retrieved June 18, 2021], Internet <https://arxiv.org/abs/2001.07685> Zhongzheng Ren、他2名、"Not AllUnlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning"、[online]、［令和3年6月18日検索］、インターネット＜https://arxiv.org/abs/2007.01293＞Zhongzheng Ren and 2 others, "Not AllUnlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning", [online], [Retrieved June 18, 2021], Internet <https://arxiv.org/abs/2007.01293>

しかしながら、非特許文献１に記載の手法によれば、個々のラベル無しデータに対応する損失関数が均等に扱われた上で、損失関数に基づくＮＮの更新が行われる。そのため、学習を妨害するラベル無しデータに対応する損失関数も均等に扱われるため、学習が不安定となりやすく、構築されるＮＮの精度も高くなりにくい。 However, according to the technique described in Non-Patent Document 1, the loss functions corresponding to each piece of unlabeled data are treated equally, and then the NN is updated based on the loss functions. As a result, the loss functions corresponding to unlabeled data that interferes with learning are also treated equally, making learning unstable and making it difficult to increase the accuracy of the constructed NN.

また、非特許文献２に記載の手法によれば、ラベル無しデータの影響度がサンプルごとに決定され得る。しかし、学習用データとは別に検証用の教師ラベル付きデータセットを用意する必要があるため、ユーザに掛かる負担が高くなる可能性がある。 In addition, according to the method described in Non-Patent Document 2, the influence of unlabeled data can be determined for each sample. However, since it is necessary to prepare a dataset with teacher labels for validation in addition to the training data, this may impose a heavy burden on users.

そこで、本発明は、これらの問題点を解決すべく提案されたものであり、学習用データとは別に検証用の教師ラベル付きデータセットを用意せずとも、高精度なＮＮを構築することを可能とする技術が提供されることが望まれる。 The present invention has been proposed to solve these problems, and it is hoped that a technology will be provided that makes it possible to build a highly accurate neural network without having to prepare a data set with teacher labels for validation purposes in addition to the training data.

上記問題を解決するために、本発明のある観点によれば、ラベル無しデータを取得するとともに、教師ラベルが付されたラベル有りデータを取得する入力部と、前記ラベル無しデータと前記ラベル有りデータとニューラルネットワークとに基づいて、前記ラベル無しデータに対応する推論結果と前記ラベル有りデータに対応する推論結果とを出力する推測部と、前記ラベル無しデータに対応する推論結果に基づいて、前記ラベル無しデータの影響度を示す係数を算出する係数生成部と、前記ラベル無しデータに対応する推論結果と前記係数とに基づいて、前記ラベル無しデータに対応する評価結果を出力するラベル無しデータ評価部と、前記ラベル有りデータに対応する推論結果と前記教師ラベルとに基づいて、前記ラベル有りデータに対応する評価結果を出力するラベル有りデータ評価部と、前記ラベル無しデータに対応する評価結果と、前記ラベル有りデータに対応する評価結果とに基づいて、前記ニューラルネットワークの重みパラメータを更新する更新部と、を備える、情報処理装置が提供される。
本発明の実施形態に係る技術においては、ラベル無しデータとラベル付きデータとが学習に用いられる。したがって、本発明の実施形態に係る技術は、半教師あり機械学習に関する技術に該当する。 In order to solve the above problem, according to an aspect of the present invention, there is provided an information processing device comprising: an input unit that acquires unlabeled data and labeled data to which a teacher label has been attached; an estimation unit that outputs an inference result corresponding to the unlabeled data and an inference result corresponding to the labeled data based on the unlabeled data, the labeled data, and a neural network; a coefficient generation unit that calculates a coefficient indicating a degree of influence of the unlabeled data based on the inference result corresponding to the unlabeled data; an unlabeled data evaluation unit that outputs an evaluation result corresponding to the unlabeled data based on the inference result corresponding to the unlabeled data and the coefficient; a labeled data evaluation unit that outputs an evaluation result corresponding to the labeled data based on the inference result corresponding to the labeled data and the teacher label; and an update unit that updates weight parameters of the neural network based on the evaluation result corresponding to the unlabeled data and the evaluation result corresponding to the labeled data.
In the technology according to the embodiment of the present invention, unlabeled data and labeled data are used for learning. Therefore, the technology according to the embodiment of the present invention corresponds to a technology related to semi-supervised machine learning.

前記ラベル無しデータに対応する推論結果は、クラスごとの推論値を含み、前記係数生成部は、ラベル無しデータごとに推論値が最大となる第１のクラスの推論値と前記第１のクラスとは異なる１または複数のクラスの推論値とに基づく差分を算出し、前記ラベル無しデータごとに前記差分に基づいて前記係数を算出してもよい。 The inference result corresponding to the unlabeled data may include an inference value for each class, and the coefficient generation unit may calculate a difference between the inference value of a first class having a maximum inference value for each unlabeled data and the inference values of one or more classes different from the first class, and calculate the coefficient for each unlabeled data based on the difference.

前記係数生成部は、前記ラベル無しデータごとに前記第１のクラスの推論値と推論値が２番目に大きい第２のクラスの推論値との差分を算出してもよい。 The coefficient generation unit may calculate the difference between the inferred value of the first class and the inferred value of the second class having the second largest inferred value for each of the unlabeled data.

前記係数生成部は、前記ラベル無しデータごとに前記第１のクラスの推論値と前記第１のクラスとは異なる複数のクラスの推論値の平均値との差分を算出してもよい。 The coefficient generation unit may calculate the difference between the inferred value of the first class and the average value of inferred values of multiple classes different from the first class for each of the unlabeled data.

前記係数生成部は、前記ラベル無しデータごとに前記差分に対して正の相関を有する数を前記係数として算出してもよい。 The coefficient generation unit may calculate, for each of the unlabeled data, a number that has a positive correlation with the difference as the coefficient.

前記ラベル無しデータに対応する推論結果は、クラスごとの推論値を含み、前記係数生成部は、ラベル無しデータごとに推論値に基づく予測クラスを特定し、前記予測クラスに対応するラベル無しデータの数を前記予測クラスの度数として算出し、前記ラベル無しデータごとに前記予測クラスの度数に基づいて前記係数を算出してもよい。 The inference result corresponding to the unlabeled data may include an inference value for each class, and the coefficient generation unit may identify a predicted class based on the inference value for each unlabeled data, calculate the number of unlabeled data corresponding to the predicted class as a frequency of the predicted class, and calculate the coefficient based on the frequency of the predicted class for each unlabeled data.

前記係数生成部は、前記ラベル無しデータごとに前記推論値が最大となるクラスを予測クラスとして特定してもよい。 The coefficient generation unit may identify, for each of the unlabeled data, the class for which the inference value is maximum as the predicted class.

前記係数生成部は、前記ラベル無しデータごとに前記予測クラスの度数に対して負の相関を有する数を前記係数として算出してもよい。 The coefficient generation unit may calculate, for each of the unlabeled data, a number that has a negative correlation with the frequency of the predicted class as the coefficient.

前記ラベル無しデータ評価部は、前記ラベル無しデータに対応する推論結果に基づく損失と前記係数とを乗算することに基づいて、前記ラベル無しデータに対応する評価結果を出力してもよい。 The unlabeled data evaluation unit may output an evaluation result corresponding to the unlabeled data based on multiplying a loss based on an inference result corresponding to the unlabeled data by the coefficient.

また、本発明の別の観点によれば、ラベル無しデータを取得するとともに、教師ラベルが付されたラベル有りデータを取得することと、前記ラベル無しデータと前記ラベル有りデータとニューラルネットワークとに基づいて、前記ラベル無しデータに対応する推論結果と前記ラベル有りデータに対応する推論結果とを出力することと、前記ラベル無しデータに対応する推論結果に基づいて、前記ラベル無しデータの影響度を示す係数を算出することと、前記ラベル無しデータに対応する推論結果と前記係数とに基づいて、前記ラベル無しデータに対応する評価結果を出力することと、前記ラベル有りデータに対応する推論結果と前記教師ラベルとに基づいて、前記ラベル有りデータに対応する評価結果を出力することと、前記ラベル無しデータに対応する評価結果と、前記ラベル有りデータに対応する評価結果とに基づいて、前記ニューラルネットワークの重みパラメータを更新することと、を備える、コンピュータが実行する情報処理方法が提供される。
According to another aspect of the present invention, there is provided an information processing method executed by a computer, comprising: acquiring unlabeled data and labeled data to which a teacher label has been attached; outputting an inference result corresponding to the unlabeled data and an inference result corresponding to the labeled data based on the unlabeled data, the labeled data, and a neural network; calculating a coefficient indicating a degree of influence of the unlabeled data based on the inference result corresponding to the unlabeled data; outputting an evaluation result corresponding to the unlabeled data based on the inference result corresponding to the unlabeled data and the coefficient; outputting an evaluation result corresponding to the labeled data based on the inference result corresponding to the labeled data and the teacher label; and updating weight parameters of the neural network based on the evaluation result corresponding to the unlabeled data and the evaluation result corresponding to the labeled data.

また、本発明の別の観点によれば、コンピュータを、ラベル無しデータを取得するとともに、教師ラベルが付されたラベル有りデータを取得する入力部と、前記ラベル無しデータと前記ラベル有りデータとニューラルネットワークとに基づいて、前記ラベル無しデータに対応する推論結果と前記ラベル有りデータに対応する推論結果とを出力する推測部と、前記ラベル無しデータに対応する推論結果に基づいて、前記ラベル無しデータの影響度を示す係数を算出する係数生成部と、前記ラベル無しデータに対応する推論結果と前記係数とに基づいて、前記ラベル無しデータに対応する評価結果を出力するラベル無しデータ評価部と、前記ラベル有りデータに対応する推論結果と前記教師ラベルとに基づいて、前記ラベル有りデータに対応する評価結果を出力するラベル有りデータ評価部と、前記ラベル無しデータに対応する評価結果と、前記ラベル有りデータに対応する評価結果とに基づいて、前記ニューラルネットワークの重みパラメータを更新する更新部と、を備える情報処理装置として機能させるプログラムが提供される。 According to another aspect of the present invention, a program is provided that causes a computer to function as an information processing device including an input unit that acquires unlabeled data and labeled data to which a teacher label has been attached, an estimation unit that outputs an inference result corresponding to the unlabeled data and an inference result corresponding to the labeled data based on the unlabeled data, the labeled data, and a neural network, a coefficient generation unit that calculates a coefficient indicating the influence of the unlabeled data based on the inference result corresponding to the unlabeled data, an unlabeled data evaluation unit that outputs an evaluation result corresponding to the unlabeled data based on the inference result corresponding to the unlabeled data and the coefficient, a labeled data evaluation unit that outputs an evaluation result corresponding to the labeled data based on the inference result corresponding to the labeled data and the teacher label, and an update unit that updates weight parameters of the neural network based on the evaluation result corresponding to the unlabeled data and the evaluation result corresponding to the labeled data.

以上説明したように本発明によれば、学習用データとは別に検証用の教師ラベル付きデータセットを用意せずとも、高精度なＮＮを構築することを可能とする技術が提供される。 As described above, the present invention provides technology that makes it possible to build a highly accurate neural network without having to prepare a dataset with teacher labels for validation in addition to the training data.

本発明の第１の実施形態に係る学習装置の機能構成例を示す図である。1 is a diagram illustrating an example of a functional configuration of a learning device according to a first embodiment of the present invention. 同実施形態に係る学習装置によって実行される学習段階の動作例を示すフローチャートである。13 is a flowchart showing an example of an operation in a learning stage executed by the learning device according to the embodiment. 学習装置の例としての情報処理装置のハードウェア構成を示す図である。FIG. 2 is a diagram illustrating a hardware configuration of an information processing device as an example of a learning device.

以下に添付図面を参照しながら、本発明の好適な実施の形態について詳細に説明する。なお、本明細書及び図面において、実質的に同一の機能構成を有する構成要素については、同一の符号を付することにより重複説明を省略する。 The preferred embodiment of the present invention will be described in detail below with reference to the attached drawings. Note that in this specification and the drawings, components having substantially the same functional configuration are designated by the same reference numerals to avoid redundant description.

また、本明細書および図面において、実質的に同一の機能構成を有する複数の構成要素を、同一の符号の後に異なる数字を付して区別する場合がある。ただし、実質的に同一の機能構成を有する複数の構成要素等の各々を特に区別する必要がない場合、同一符号のみを付する。また、異なる実施形態の類似する構成要素については、同一の符号の後に異なるアルファベットを付して区別する場合がある。ただし、異なる実施形態の類似する構成要素等の各々を特に区別する必要がない場合、同一符号のみを付する。 In addition, in this specification and drawings, multiple components having substantially the same functional configuration may be distinguished by adding different numbers after the same reference symbol. However, if there is no particular need to distinguish between multiple components having substantially the same functional configuration, only the same reference symbol will be used. In addition, similar components in different embodiments may be distinguished by adding different letters after the same reference symbol. However, if there is no particular need to distinguish between similar components in different embodiments, only the same reference symbol will be used.

（０．実施形態の概要）
本発明の実施形態の概要について説明する。本発明の実施形態では、ニューラルネットワークの学習を行う情報処理装置（以下、「学習装置」とも言う。）について説明する。学習装置においては、学習用データに基づいてニューラルネットワークの学習が行われる（学習段階）。その後、識別装置において、学習済みのニューラルネットワークと識別用データ（テストデータ）とに基づいて推定ラベルが出力される。 (0. Overview of the embodiment)
An overview of an embodiment of the present invention will be described. In the embodiment of the present invention, an information processing device that trains a neural network (hereinafter, also referred to as a "learning device") will be described. In the learning device, the neural network is trained based on training data (learning stage). After that, in the classification device, an estimated label is output based on the trained neural network and classification data (test data).

本発明の実施形態では、学習装置と識別装置とが同一のコンピュータによって実現される場合を主に想定する。しかし、学習装置と識別装置とは、別のコンピュータによって実現されてもよい。かかる場合には、学習装置によって生成された学習済みのニューラルネットワークが識別装置に提供される。例えば、学習済みのニューラルネットワークは、学習装置から識別装置に記録媒体を介して提供されてもよいし、通信を介して提供されてもよい。以下では、学習装置において実行される「学習段階」について説明する。 In the embodiment of the present invention, it is mainly assumed that the learning device and the identification device are realized by the same computer. However, the learning device and the identification device may be realized by different computers. In such a case, the trained neural network generated by the learning device is provided to the identification device. For example, the trained neural network may be provided from the learning device to the identification device via a recording medium or via communication. The "learning stage" executed in the learning device is described below.

（１．第１の実施形態）
まず、本発明の第１の実施形態について説明する。本発明の第１の実施形態においては、学習装置によって半教師あり学習が行われる。 1. First embodiment
First, a first embodiment of the present invention will be described. In the first embodiment of the present invention, semi-supervised learning is performed by a learning device.

（学習装置の構成）
図１を参照しながら、本発明の第１の実施形態に係る学習装置の構成例について説明する。図１は、本発明の第１の実施形態に係る学習装置１０の機能構成例を示す図である。図１に示されるように、本発明の第１の実施形態に係る学習装置１０は、入力部１１１と、推測部１２２と、係数生成部１３１と、ラベル無しデータ評価部１３２と、ラベル有りデータ評価部１３３と、更新部１３４とを備える。 (Configuration of the learning device)
An example of the configuration of a learning device according to a first embodiment of the present invention will be described with reference to Fig. 1. Fig. 1 is a diagram showing an example of the functional configuration of a learning device 10 according to a first embodiment of the present invention. As shown in Fig. 1, the learning device 10 according to the first embodiment of the present invention includes an input unit 111, an estimation unit 122, a coefficient generation unit 131, an unlabeled data evaluation unit 132, a labeled data evaluation unit 133, and an update unit 134.

本発明の第１の実施形態では、推測部１２２が、ニューラルネットワーク１２０に含まれる場合を主に想定する。すなわち、推測部１２２は、ニューロンによって構築される計算グラフが処理順に接続されて構成されており、全体として１つのニューラルネットワークとみなされ得る。以下では、ニューラルネットワークを「ＮＮ」とも表記する。より詳細に、推測部１２２は、畳み込み層およびプーリング層を主に含んでよい。以下では、畳み込み層として、２次元畳み込み層が用いられる場合を主に想定するが、３次元畳み込み層が用いられてもよい。 In the first embodiment of the present invention, it is mainly assumed that the estimation unit 122 is included in the neural network 120. That is, the estimation unit 122 is configured by connecting computation graphs constructed by neurons in the order of processing, and can be regarded as one neural network as a whole. In the following, the neural network is also written as "NN". In more detail, the estimation unit 122 may mainly include a convolution layer and a pooling layer. In the following, it is mainly assumed that a two-dimensional convolution layer is used as the convolution layer, but a three-dimensional convolution layer may also be used.

推測部１２２の他、入力部１１１、係数生成部１３１、ラベル無しデータ評価部１３２、ラベル有りデータ評価部１３３および更新部１３４などは、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）またはＧＰＵ（ＧｒａｐｈｉｃｓＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）などの演算装置を含み、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）により記憶されているプログラムが演算装置によりＲＡＭに展開されて実行されることにより、その機能が実現され得る。このとき、当該プログラムを記録した、コンピュータに読み取り可能な記録媒体も提供され得る。あるいは、これらのブロックは、専用のハードウェアにより構成されていてもよいし、複数のハードウェアの組み合わせにより構成されてもよい。演算装置による演算に必要なデータは、図示しない記憶部によって適宜記憶される。 In addition to the estimation unit 122, the input unit 111, the coefficient generation unit 131, the unlabeled data evaluation unit 132, the labeled data evaluation unit 133, and the update unit 134 include a calculation device such as a CPU (Central Processing Unit) or a GPU (Graphics Processing Unit), and the functions can be realized by the calculation device expanding a program stored in a ROM (Read Only Memory) into a RAM and executing it. At this time, a computer-readable recording medium on which the program is recorded can also be provided. Alternatively, these blocks may be composed of dedicated hardware or a combination of multiple hardware components. Data necessary for the calculation by the calculation device is appropriately stored in a storage unit (not shown).

ラベル無しデータセット１０１、ラベル付きデータセット１０２および重みパラメータ１２１は、図示しない記憶部によって記憶される。かかる記憶部は、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、ハードディスクドライブまたはフラッシュメモリなどのメモリによって構成されてよい。 The unlabeled dataset 101, the labeled dataset 102, and the weight parameters 121 are stored in a storage unit (not shown). Such a storage unit may be configured with a memory such as a RAM (Random Access Memory), a hard disk drive, or a flash memory.

初期状態において、重みパラメータ１２１には、初期値が設定されている。例えば、重みパラメータ１２１に設定される初期値は、ランダムな値であってよいが、どのような値であってもよい。例えば、重みパラメータ１２１に設定される初期値は、あらかじめ学習によって得られた学習済みの値であってもよい。 In the initial state, an initial value is set for the weight parameter 121. For example, the initial value set for the weight parameter 121 may be a random value, but may be any value. For example, the initial value set for the weight parameter 121 may be a learned value obtained in advance by learning.

（ラベル無しデータセット１０１）
ラベル無しデータセット１０１は、教師ラベルがそれぞれ対応付けられていない複数の学習用データ（入力データ）を含んで構成される。以下では、教師ラベルが対応付けられていない学習用データを「ラベル無しデータ」とも言う。なお、本発明の実施形態では、ラベル無しデータが画像データである場合（特に、静止画像データである場合）を主に想定する。しかし、ラベル無しデータの種類は特に限定されず、静止画像データ以外もラベル無しデータとして用いられ得る。例えば、ラベル無しデータは、複数のフレームを含んだ動画像データであってもよいし、時系列データまたは音声データであってもよい。 (Unlabeled Dataset 101)
The unlabeled dataset 101 includes a plurality of learning data (input data) each of which is not associated with a teacher label. Hereinafter, learning data that is not associated with a teacher label is also referred to as "unlabeled data". Note that in the embodiment of the present invention, it is mainly assumed that the unlabeled data is image data (particularly, still image data). However, the type of unlabeled data is not particularly limited, and data other than still image data may be used as unlabeled data. For example, the unlabeled data may be video data including a plurality of frames, or may be time-series data or audio data.

（ラベル付きデータセット１０２）
ラベル付きデータセット１０２は、複数の学習用データ（入力データ）と当該複数の学習用データそれぞれに対応付けられた教師ラベルとを含んで構成される。以下では、教師ラベルが対応付けられた学習用データを「ラベル有りデータ」とも言う。また、教師ラベルとラベル有りデータとの組み合わせを「ラベル付きデータ」とも言う。教師ラベルは、人手または図示しない機能によって付与される。なお、ラベル無しデータの種類と同様に、ラベル有りデータの種類も特に限定されない。 (Labeled Dataset 102)
The labeled data set 102 includes a plurality of pieces of learning data (input data) and teacher labels associated with each of the plurality of pieces of learning data. Hereinafter, the learning data associated with the teacher labels is also referred to as "labeled data." In addition, the combination of the teacher labels and the labeled data is also referred to as "labeled data." The teacher labels are assigned manually or by a function not shown. Note that, like the types of unlabeled data, the types of labeled data are not particularly limited.

（入力部１１１）
入力部１１１は、ラベル無しデータセット１０１からラベル無しデータを順次に取得し、取得したラベル無しデータをもとにミニバッチを作成し、作成したミニバッチをニューラルネットワーク１２０の推測部１２２に出力する。さらに、入力部１１１は、ラベル付きデータセット１０２からラベル付きデータ（教師ラベルとラベル有りデータとの組み合わせ）を順次に取得し、取得したラベル付きデータをもとにミニバッチを作成し、作成したミニバッチをニューラルネットワーク１２０の推測部１２２に出力する。ミニバッチのサイズは特に限定されない。 (Input unit 111)
The input unit 111 sequentially acquires unlabeled data from the unlabeled dataset 101, creates mini-batches based on the acquired unlabeled data, and outputs the created mini-batches to the estimation unit 122 of the neural network 120. Furthermore, the input unit 111 sequentially acquires labeled data (combinations of teacher labels and labeled data) from the labeled dataset 102, creates mini-batches based on the acquired labeled data, and outputs the created mini-batches to the estimation unit 122 of the neural network 120. The size of the mini-batches is not particularly limited.

（推測部１２２）
推測部１２２は、入力部１１１から出力されたミニバッチに含まれるラベル無しデータとニューラルネットワーク１２０とに基づいてラベル無しデータに対応する推論結果を得る。より詳細に、推測部１２２は、重みパラメータ１２１が設定されたニューラルネットワーク１２０にラベル無しデータを入力させたことに基づいて、ニューラルネットワーク１２０から出力されるデータをラベル無しデータに対応する推論結果として得る。推測部１２２は、ラベル無しデータに対応する推論結果を係数生成部１３１に出力する。 (Estimation unit 122)
The estimation unit 122 obtains an inference result corresponding to the unlabeled data based on the unlabeled data included in the mini-batch output from the input unit 111 and the neural network 120. More specifically, the estimation unit 122 obtains data output from the neural network 120 as an inference result corresponding to the unlabeled data based on inputting the unlabeled data to the neural network 120 to which the weight parameters 121 are set. The estimation unit 122 outputs the inference result corresponding to the unlabeled data to the coefficient generation unit 131.

このとき、推測部１２２は、ラベル無しデータに対応する推論結果として、半教師あり学習の枠組みに基づく２種類のラベルを係数生成部１３１に出力し得る。ここで、２種類のラベルを得るためのアルゴリズムは、特定のアルゴリズムに限定されず、半教師あり学習に用いられるアルゴリズムが用いられてよい。 At this time, the estimation unit 122 may output two types of labels based on a semi-supervised learning framework to the coefficient generation unit 131 as an inference result corresponding to the unlabeled data. Here, the algorithm for obtaining the two types of labels is not limited to a specific algorithm, and an algorithm used in semi-supervised learning may be used.

例えば、入力部１１１が、ラベル無しデータセット１０１から取得したラベル無しデータに基づいて２種類のラベル無しデータを得てもよい。一例として、入力部１１１は、ラベル無しデータに対して２種類のデータ拡張を施すことによって２種類のラベル無しデータを得てもよい。このとき、入力部１１１は、推測部１２２に対して２種類のラベル無しデータを出力し、推測部１２２は、２種類のラベル無しデータそれぞれに対応するラベルを２種類のラベルとして係数生成部１３１に出力する。 For example, the input unit 111 may obtain two types of unlabeled data based on the unlabeled data acquired from the unlabeled dataset 101. As an example, the input unit 111 may obtain two types of unlabeled data by performing two types of data extension on the unlabeled data. In this case, the input unit 111 outputs the two types of unlabeled data to the estimation unit 122, and the estimation unit 122 outputs labels corresponding to the two types of unlabeled data as two types of labels to the coefficient generation unit 131.

あるいは、入力部１１１から推測部１２２に出力されるラベル無しデータは１種類であり、推測部１２２において、２種類の重みパラメータを使用してもよい。一例として、推測部１２２は、入力部１１１から出力されるラベル無しデータに対して、重みパラメータ１２１の全部を適用して得たデータおよび重みパラメータ１２１の一部を適用して得たデータを２種類のラベルとして得てもよい。このとき、推測部１２２は、２種類のラベルを係数生成部１３１に出力する。 Alternatively, one type of unlabeled data may be output from the input unit 111 to the estimation unit 122, and two types of weight parameters may be used in the estimation unit 122. As an example, the estimation unit 122 may obtain two types of labels, namely data obtained by applying all of the weight parameters 121 to the unlabeled data output from the input unit 111, and data obtained by applying part of the weight parameters 121. In this case, the estimation unit 122 outputs the two types of labels to the coefficient generation unit 131.

例えば、推測部１２２は、２種類のラベルのうち、一方をラベル無しデータに対応する擬似的な教師ラベルとし、他方をラベル無しデータに対応する推定ラベルとして係数生成部１３１に出力する。なお、２種類のラベルのどちらを疑似的な教師ラベルとするかは限定されない。例えば、より弱いデータ拡張によって得られたラベルが疑似的な教師ラベルとされてもよい。あるいは、重みパラメータ１２１の全部の適用によって得られたラベルが疑似的な教師ラベルとされてもよい。 For example, the estimation unit 122 outputs one of the two types of labels to the coefficient generation unit 131 as a pseudo teacher label corresponding to unlabeled data, and the other as an estimated label corresponding to unlabeled data. Note that there is no limitation on which of the two types of labels is to be the pseudo teacher label. For example, the label obtained by weaker data augmentation may be the pseudo teacher label. Alternatively, the label obtained by applying all of the weight parameters 121 may be the pseudo teacher label.

さらに、推測部１２２は、入力部１１１から出力されたミニバッチに含まれるラベル有りデータとニューラルネットワーク１２０とに基づいてラベル有りデータに対応する推論結果を得る。より詳細に、推測部１２２は、重みパラメータ１２１が設定されたニューラルネットワーク１２０にラベル有りデータを入力させたことに基づいて、ニューラルネットワーク１２０から出力されるデータをラベル有りデータに対応する推論結果として得る。推測部１２２は、ラベル有りデータに対応する推論結果をラベル有りデータ評価部１３３に出力する。 Furthermore, the estimation unit 122 obtains an inference result corresponding to the labeled data based on the labeled data included in the mini-batch output from the input unit 111 and the neural network 120. More specifically, the estimation unit 122 obtains the data output from the neural network 120 as an inference result corresponding to the labeled data based on inputting the labeled data to the neural network 120 to which the weight parameter 121 is set. The estimation unit 122 outputs the inference result corresponding to the labeled data to the labeled data evaluation unit 133.

なお、推測部１２２から出力される推論結果の形式は、特に限定されない。しかし、推測部１２２から出力される推論結果の形式は、教師ラベルの形式と合わせて設定されているのがよい。例えば、教師ラベルが分類問題のクラスを示し、クラス数分の長さを有するｏｎｅ－ｈｏｔベクトルである場合、推測部１２２から出力される推論結果の形式も、クラス数分の長さを有するベクトルであってよい。このとき、推測部１２２から出力される推論結果は、クラスごとの値（以下、「推論値」とも言う。）を含み得る。 The format of the inference result output from the estimation unit 122 is not particularly limited. However, it is preferable that the format of the inference result output from the estimation unit 122 is set in accordance with the format of the teacher label. For example, if the teacher label indicates the class of the classification problem and is a one-hot vector having a length equal to the number of classes, the format of the inference result output from the estimation unit 122 may also be a vector having a length equal to the number of classes. In this case, the inference result output from the estimation unit 122 may include a value for each class (hereinafter also referred to as an "inference value").

一例として、推測部１２２によって全クラスの推論値の合計が１になるように調整される場合には、それぞれのクラスに対応する推論値は、それぞれのクラスに対応する確率に相当し得る。しかし、全クラスの推論値の合計は、推測部１２２によって１になるように調整されていなくてもよい。いずれの場合であっても、推測部１２２から出力される推論値は、そのクラスの確からしさが高いほど、大きい値であり得る。 As an example, when the inference values of all classes are adjusted by the inference unit 122 to sum to 1, the inference value corresponding to each class may correspond to the probability corresponding to each class. However, the sum of the inference values of all classes does not have to be adjusted by the inference unit 122 to sum to 1. In either case, the inference value output from the inference unit 122 may be larger the higher the likelihood of the class.

（係数生成部１３１）
係数生成部１３１は、推測部１２２から出力されたラベル無しデータに対応する推論結果に基づいて、ラベル無しデータの影響度を示す係数を算出する。より詳細に、係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに推論値に基づく予測クラスを特定する。バッチサイズＢとすると、ラベル無しデータｘ^ｕ＝｛ｘ_１ ^ｕ，…，ｘ_Ｂ ^ｕ｝の推論値に基づく予測クラスは、ｙ^ｕ＝｛ｙ_１ ^ｕ，…，ｙ_Ｂ ^ｕ｝として特定される。なお、ｙ^ｕの各要素は、予測クラスの番号であってよい。 (Coefficient Generation Unit 131)
The coefficient generation unit 131 calculates a coefficient indicating the influence of the unlabeled data based on the inference result corresponding to the unlabeled data output from the estimation unit 122. More specifically, the coefficient generation unit 131 specifies a predicted class based on the inference value for each piece of unlabeled data included in the mini-batch. If the batch size is B, the predicted class based on the inference value of the unlabeled data x ^u = {x ₁ ^u , ..., x _B ^u } is specified as y ^u = {y ₁ ^u , ..., y _B ^u }. Each element of y ^u may be a predicted class number.

ここで、予測クラスは、どのようにして特定されてもよい。一例として、推論値が最大となるクラスは、確からしさが最も高いクラスであると考えられる。そこで、係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに推論値が最大となるクラスを予測クラスとして特定してもよい。例えば、予測クラスの特定に用いられる推論値としては、２種類のラベルのいずれが用いられてもよいが、疑似的な教師ラベルが用いられるのが望ましい。 Here, the predicted class may be identified in any manner. As an example, the class with the largest inference value is considered to be the most likely class. Therefore, the coefficient generation unit 131 may identify the class with the largest inference value for each piece of unlabeled data included in the mini-batch as the predicted class. For example, either of two types of labels may be used as the inference value used to identify the predicted class, but it is preferable to use a pseudo teacher label.

そして、係数生成部１３１は、予測クラスに対応するラベル無しデータの数を予測クラスの度数として算出する。例えば、ニューラルネットワーク１２０がＮクラスへの分類問題を解く場合には（すなわち、ラベル無しデータに対応する推論結果がＮクラス分の推論値を含む場合には）、予測クラスの度数ｃ_ｉは、以下の式（１）のように表現され得る。 Then, the coefficient generating unit 131 calculates the number of unlabeled data corresponding to the predicted class as the frequency of the predicted class. For example, when the neural network 120 solves a classification problem into N classes (i.e., when the inference result corresponding to the unlabeled data includes inference values for N classes), the frequency c _i of the predicted class can be expressed as the following formula (1).

係数生成部１３１は、ミニバッチに含まれるラベル無しデータｘ^ｕごとに、予測クラスの度数ｃに基づいて係数ｔを算出する。一例として、係数生成部１３１は、ミニバッチに含まれるラベル無しデータｘ^ｕごとに予測クラスの度数ｃに対して負の相関を有する数を係数ｔとして算出するのが望ましい。これによって、度数ｃが小さい予測クラスに対応するラベル無しデータほど、影響度が高く扱われるようになる。 The coefficient generating unit 131 calculates a coefficient t for each unlabeled data x ^u included in the mini-batch based on the frequency c of the predicted class. As an example, it is preferable that the coefficient generating unit 131 calculates a number having a negative correlation with the frequency c of the predicted class for each unlabeled data x ^u included in the mini-batch as the coefficient t. As a result, the unlabeled data corresponding to a predicted class with a smaller frequency c is treated as having a higher degree of influence.

例えば、入力値に対して負の相関を示す出力値を返却する関数をｆとすると、入力値に対して負の相関を示す出力値を返却する関数ｆは、以下の式（２）のように表現され得る。 For example, if f is a function that returns an output value that shows a negative correlation with respect to an input value, then the function f that returns an output value that shows a negative correlation with respect to an input value can be expressed as in the following equation (2).

例えば、入力値に対して負の相関を示す出力値を返却する関数ｆの例として、以下の式（３）が挙げられる。 For example, the following formula (3) is an example of a function f that returns an output value that shows a negative correlation with an input value.

係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとの推論結果および係数を、ラベル無しデータ評価部１３２に出力する。 The coefficient generation unit 131 outputs the inference results and coefficients for each unlabeled data included in the mini-batch to the unlabeled data evaluation unit 132.

（ラベル無しデータ評価部１３２）
ラベル無しデータ評価部１３２は、ミニバッチに含まれるラベル無しデータごとの推論結果および係数に基づいて、ラベル無しデータに対応する評価結果を得る。より詳細に、ラベル無しデータ評価部１３２は、ミニバッチに含まれるラベル無しデータごとに、推論結果に基づいて損失を算出し、ラベル無しデータごとの損失および係数に基づいて、ラベル無しデータに対応する評価結果を得る。 (Unlabeled Data Evaluation Unit 132)
The unlabeled data evaluation unit 132 obtains an evaluation result corresponding to the unlabeled data based on the inference result and the coefficient for each piece of unlabeled data included in the mini-batch. More specifically, the unlabeled data evaluation unit 132 calculates a loss for each piece of unlabeled data included in the mini-batch based on the inference result, and obtains an evaluation result corresponding to the unlabeled data based on the loss and the coefficient for each piece of unlabeled data.

まず、ラベル無しデータ評価部１３２は、ラベル無しデータごとに、疑似的な教師ラベルに基づいて推定ラベルを評価して損失を算出する。 First, the unlabeled data evaluation unit 132 evaluates the estimated labels for each piece of unlabeled data based on the pseudo teacher labels and calculates the loss.

ここで、損失の算出に用いられる損失関数は特定の関数に限定されず、一般的なニューラルネットワークにおいて用いられる損失関数と同様の損失関数が用いられてよい。例えば、損失関数は、ラベル無しデータに対応する疑似的な教師ラベルとラベル無しデータに対応する推定ラベルとの差分に基づく平均二乗誤差であってもよいし、ラベル無しデータに対応する疑似的な教師ラベルとラベル無しデータに対応する推定ラベルとの差分に基づく交差エントロピー誤差であってもよい。 Here, the loss function used to calculate the loss is not limited to a specific function, and a loss function similar to that used in a general neural network may be used. For example, the loss function may be a mean square error based on the difference between the pseudo teacher label corresponding to the unlabeled data and the estimated label corresponding to the unlabeled data, or a cross entropy error based on the difference between the pseudo teacher label corresponding to the unlabeled data and the estimated label corresponding to the unlabeled data.

次に、ラベル無しデータ評価部１３２は、ラベル無しデータごとの損失および係数に基づいて、ラベル無しデータに対応する評価結果を得る。より詳細に、ラベル無しデータ評価部１３２は、ラベル無しデータごとに損失と係数とを乗算することに基づいて、ラベル無しデータに対応する評価結果を得る。 Next, the unlabeled data evaluation unit 132 obtains an evaluation result corresponding to the unlabeled data based on the loss and coefficient for each piece of unlabeled data. More specifically, the unlabeled data evaluation unit 132 obtains an evaluation result corresponding to the unlabeled data based on multiplying the loss and the coefficient for each piece of unlabeled data.

一例として、ラベル無しデータをｘ^ｕとし、重みパラメータ１２１をθとし、係数生成部１３１によって算出された係数をｔとすると、ラベル無しデータに対応する評価結果ｌ_ｕは、以下の式（４）に示すように、ラベル無しデータごとの損失と係数との乗算結果のミニバッチにおける合計によって算出され得る。 As an example, if the unlabeled data is x ^u , the weight parameter 121 is θ, and the coefficient calculated by the coefficient generation unit 131 is t, the evaluation result l _u corresponding to the unlabeled data can be calculated by summing up the multiplication results of the loss and the coefficient for each unlabeled data in a mini-batch, as shown in the following equation (4).

ラベル無しデータ評価部１３２は、ラベル無しデータに対応する評価結果を更新部１３４に出力する。 The unlabeled data evaluation unit 132 outputs the evaluation results corresponding to the unlabeled data to the update unit 134.

（ラベル有りデータ評価部１３３）
ラベル有りデータ評価部１３３は、ミニバッチに含まれるラベル有りデータごとに、ラベル有りデータに対応する教師ラベルに基づいて、ラベル有りデータを評価してラベル有りデータごとの評価結果を得る。より詳細に、ラベル有りデータ評価部１３３は、ラベル有りデータに対応する教師ラベルとラベル有りデータとに基づいて損失を算出し、ラベル有りデータごとの損失に基づいて、ラベル有りデータに対応する評価結果を得る。 (Labeled Data Evaluation Unit 133)
The labeled data evaluation unit 133 evaluates each piece of labeled data included in the mini-batch based on the teacher label corresponding to the labeled data to obtain an evaluation result for each piece of labeled data. More specifically, the labeled data evaluation unit 133 calculates a loss based on the teacher label corresponding to the labeled data and the labeled data, and obtains an evaluation result corresponding to the labeled data based on the loss for each piece of labeled data.

まず、ラベル有りデータ評価部１３３は、ラベル有りデータごとに、ラベル有りデータに対応する教師ラベルに基づいてラベル有りデータを評価して損失を算出する。 First, the labeled data evaluation unit 133 evaluates each piece of labeled data based on the teacher label corresponding to the labeled data and calculates the loss.

ここで、損失関数は特定の関数に限定されず、一般的なニューラルネットワークにおいて用いられる損失関数と同様の損失関数が用いられてよい。例えば、損失関数は、ラベル有りデータに対応する教師ラベルとラベル有りデータとの差分に基づく平均二乗誤差であってもよいし、ラベル有りデータに対応する教師ラベルとラベル有りデータとの差分に基づく交差エントロピー誤差であってもよい。 Here, the loss function is not limited to a specific function, and a loss function similar to that used in a general neural network may be used. For example, the loss function may be a mean square error based on the difference between the labeled data and the teacher label corresponding to the labeled data, or a cross entropy error based on the difference between the labeled data and the teacher label corresponding to the labeled data.

次に、ラベル有りデータ評価部１３３は、ラベル有りデータごとの損失に基づいて、ラベル有りデータに対応する評価結果を得る。より詳細に、ラベル有りデータ評価部１３３は、ラベル有りデータごとの損失のミニバッチにおける合計によって、ラベル有りデータに対応する評価結果を得る。 Next, the labeled data evaluation unit 133 obtains an evaluation result corresponding to the labeled data based on the loss for each labeled data. More specifically, the labeled data evaluation unit 133 obtains an evaluation result corresponding to the labeled data by summing the losses for each labeled data in a mini-batch.

一例として、ラベル有りデータをｘ^ｔとし、ラベル有りデータに対応する教師ラベルをｘ^ｔとし、重みパラメータ１２１をθとすると、ラベル有りデータに対応する評価結果ｌ_ｓは、以下の式（５）に示すように表現され得る。 As an example, if labeled data is ^xt , a teacher label corresponding to the labeled data is ^xt , and a weight parameter 121 is θ, then the evaluation result l _s corresponding to the labeled data can be expressed as shown in the following equation (5).

ラベル有りデータ評価部１３３は、ラベル有りデータに対応する評価結果を更新部１３４に出力する。 The labeled data evaluation unit 133 outputs the evaluation result corresponding to the labeled data to the update unit 134.

（更新部１３４）
更新部１３４は、ラベル無しデータ評価部１３２から出力されたラベル無しデータに対応する評価結果とラベル有りデータ評価部１３３から出力されたラベル有りデータに対応する評価結果とに基づいて、重みパラメータ１２１の更新を行う。これによって、ラベル無しデータに対応する推定ラベルがラベル無しデータに対応する疑似的な教師ラベルに近づくように、かつ、ラベル有りデータがラベル有りデータに対応する教師ラベルに近づくように、重みパラメータ１２１が訓練され得る。 (Update unit 134)
The update unit 134 updates the weighting parameters 121 based on the evaluation result corresponding to the unlabeled data output from the unlabeled data evaluation unit 132 and the evaluation result corresponding to the labeled data output from the labeled data evaluation unit 133. This allows the weighting parameters 121 to be trained so that the estimated labels corresponding to the unlabeled data approach the pseudo teacher labels corresponding to the unlabeled data and so that the labeled data approach the teacher labels corresponding to the labeled data.

例えば、更新部１３４は、ラベル有りデータに対応する評価結果とラベル無しデータに対応する評価結果との重み付き和（以下、単に「重み付き和」とも言う。）に基づいて、重みパラメータ１２１の更新を行ってよい。より詳細に、更新部１３４は、ラベル有りデータに対応する評価結果とラベル無しデータに対応する評価結果との重み付き和に基づく誤差逆伝播法（バックプロパゲーション）によって重みパラメータ１２１を更新してよい。 For example, the update unit 134 may update the weight parameter 121 based on a weighted sum of the evaluation results corresponding to the labeled data and the evaluation results corresponding to the unlabeled data (hereinafter, simply referred to as the "weighted sum"). More specifically, the update unit 134 may update the weight parameter 121 by backpropagation based on the weighted sum of the evaluation results corresponding to the labeled data and the evaluation results corresponding to the unlabeled data.

重み付き和は、どのように表現されてもよい。一例として、式（５）に示されたように、ラベル有りデータに対応する評価結果をｌ_ｓとし、式（４）に示されたように、ラベル無しデータに対応する評価結果をｌ_ｕとし、重み付き和を取るためのハイパーパラメータをλとすると、重み付き和Ｌは、以下の式（６）に示すように表現され得る。 The weighted sum may be expressed in any way. As an example, if the evaluation result corresponding to the labeled data is l _s as shown in formula (5), the evaluation result corresponding to the unlabeled data is l _u as shown in formula (4), and the hyperparameter for taking the weighted sum is λ, the weighted sum L can be expressed as shown in formula (6) below.

なお、更新部１３４は、重みパラメータ１２１の更新が終わるたびに、学習の終了条件が満たされたか否かを判断する。学習の終了条件が満たされていないと判断された場合には、入力部１１１によって次の入力データ（ラベル有りデータおよび教師ラベルの組み合わせ、および、ラベル無しデータ）が取得され、推測部１２２、係数生成部１３１、ラベル無しデータ評価部１３２、ラベル有りデータ評価部１３３および更新部１３４それぞれによって、当該次の入力データに基づく各自の処理が再度実行される。一方、学習の終了条件が満たされたと判断された場合には、学習が終了される。 Note that the update unit 134 judges whether or not the learning termination condition has been satisfied each time the weight parameter 121 is updated. If it is judged that the learning termination condition has not been satisfied, the input unit 111 acquires the next input data (a combination of labeled data and teacher label, and unlabeled data), and the estimation unit 122, the coefficient generation unit 131, the unlabeled data evaluation unit 132, the labeled data evaluation unit 133, and the update unit 134 each re-execute their own processing based on the next input data. On the other hand, if it is judged that the learning termination condition has been satisfied, the learning is terminated.

なお、学習の終了条件は特に限定されず、ニューラルネットワーク１２０の学習がある程度行われたことを示す条件であればよい。具体的に、学習の終了件は、当該重み付き和の値が閾値よりも小さいという条件を含んでもよい。あるいは、学習の終了条件は、当該重み付き和の値の変化が閾値よりも小さいという条件（当該重み付き和の値が収束状態になったという条件）を含んでもよい。あるいは、学習の終了条件は、重みパラメータ１２１の更新が所定の回数行われたという条件を含んでもよい。あるいは、ニューラルネットワーク１２０の精度（例えば、正解率など）が算出される場合、学習の終了条件は、精度が所定の割合（例えば、９０％など）を超えるという条件を含んでもよい。 The learning end condition is not particularly limited, and may be any condition that indicates that the neural network 120 has been learned to a certain extent. Specifically, the learning end condition may include a condition that the value of the weighted sum is smaller than a threshold value. Alternatively, the learning end condition may include a condition that the change in the value of the weighted sum is smaller than a threshold value (a condition that the value of the weighted sum has converged). Alternatively, the learning end condition may include a condition that the weight parameter 121 has been updated a predetermined number of times. Alternatively, when the accuracy of the neural network 120 (e.g., accuracy rate, etc.) is calculated, the learning end condition may include a condition that the accuracy exceeds a predetermined percentage (e.g., 90%).

以上、本発明の第１の実施形態に係る学習装置の構成例について説明した。 The above describes an example of the configuration of a learning device according to the first embodiment of the present invention.

（学習段階の動作）
続いて、図２を参照しながら、本発明の第１の実施形態に係る学習装置１０によって実行される「学習段階」の動作の流れについて説明する。図２は、本発明の第１の実施形態に係る学習装置１０によって実行される学習段階の動作例を示すフローチャートである。 (Learning stage operation)
Next, a flow of operations in a "learning stage" executed by the learning device 10 according to the first embodiment of the present invention will be described with reference to Fig. 2. Fig. 2 is a flowchart showing an example of operations in a learning stage executed by the learning device 10 according to the first embodiment of the present invention.

まず、入力部１１１は、ラベル無しデータセット１０１からバッチサイズのラベル無しデータを取得することによってミニバッチを作成し、作成したミニバッチをニューラルネットワーク１２０の推測部１２２に出力する（Ｓ１０１）。 First, the input unit 111 creates a mini-batch by acquiring unlabeled data of the batch size from the unlabeled dataset 101, and outputs the created mini-batch to the estimation unit 122 of the neural network 120 (S101).

続いて、推測部１２２は、入力部１１１によって作成されたミニバッチに含まれるラベル無しデータとニューラルネットワーク１２０とに基づいてラベル無しデータに対応する推論結果を得る（Ｓ１０２）。推測部１２２は、ラベル無しデータに対応する推論結果を係数生成部１３１に出力する。 Then, the estimation unit 122 obtains an inference result corresponding to the unlabeled data based on the unlabeled data included in the mini-batch created by the input unit 111 and the neural network 120 (S102). The estimation unit 122 outputs the inference result corresponding to the unlabeled data to the coefficient generation unit 131.

このとき、推測部１２２は、ラベル無しデータに対応する推論結果として、２種類のラベルを係数生成部１３１に出力し得る。例えば、推測部１２２は、２種類のラベルのうち、一方をラベル無しデータに対応する擬似的な教師ラベルとし、他方をラベル無しデータに対応する推定ラベルとして係数生成部１３１に出力する。 At this time, the estimation unit 122 may output two types of labels to the coefficient generation unit 131 as inference results corresponding to the unlabeled data. For example, the estimation unit 122 outputs one of the two types of labels to the coefficient generation unit 131 as a pseudo teacher label corresponding to the unlabeled data, and the other as an estimated label corresponding to the unlabeled data.

係数生成部１３１は、推測部１２２から出力されたラベル無しデータに対応する推論結果に基づいて、ラベル無しデータの影響度を示す係数を算出する（Ｓ１０３）。より詳細に、係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに推論値に基づく予測クラスを特定する。一例として、係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに推論値が最大となるクラスを予測クラスとして特定してもよい。例えば、予測クラスの特定に用いられる推論値としては、２種類のラベルのいずれが用いられてもよいが、疑似的な教師ラベルが用いられるのが望ましい。 The coefficient generation unit 131 calculates a coefficient indicating the influence of the unlabeled data based on the inference result corresponding to the unlabeled data output from the estimation unit 122 (S103). More specifically, the coefficient generation unit 131 identifies a predicted class based on the inference value for each piece of unlabeled data included in the mini-batch. As an example, the coefficient generation unit 131 may identify the class with the maximum inference value for each piece of unlabeled data included in the mini-batch as the predicted class. For example, either of two types of labels may be used as the inference value used to identify the predicted class, but it is preferable to use a pseudo teacher label.

そして、係数生成部１３１は、予測クラスに対応するラベル無しデータの数を予測クラスの度数として算出する。係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに、予測クラスの度数に基づいて係数を算出する。一例として、係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに予測クラスの度数に対して負の相関を有する数を係数として算出するのが望ましい。これによって、度数が小さい予測クラスに対応するラベル無しデータほど、影響度が高く扱われるようになる。 Then, the coefficient generation unit 131 calculates the number of unlabeled data corresponding to the predicted class as the frequency of the predicted class. The coefficient generation unit 131 calculates a coefficient for each unlabeled data included in the mini-batch based on the frequency of the predicted class. As an example, it is desirable for the coefficient generation unit 131 to calculate a number that has a negative correlation with the frequency of the predicted class for each unlabeled data included in the mini-batch as the coefficient. As a result, the unlabeled data corresponding to a predicted class with a smaller frequency is treated as having a higher degree of influence.

ラベル無しデータ評価部１３２は、ミニバッチに含まれるラベル無しデータごとの推論結果および係数に基づいて、ラベル無しデータに対応する評価結果を得る（Ｓ１０４）。より詳細に、ラベル無しデータ評価部１３２は、ミニバッチに含まれるラベル無しデータごとに、推論結果に基づいて損失を算出し、ラベル無しデータごとの損失および係数に基づいて、ラベル無しデータに対応する評価結果を得る。ラベル無しデータ評価部１３２は、ラベル無しデータに対応する評価結果を更新部１３４に出力する。 The unlabeled data evaluation unit 132 obtains an evaluation result corresponding to the unlabeled data based on the inference result and coefficient for each piece of unlabeled data included in the mini-batch (S104). More specifically, the unlabeled data evaluation unit 132 calculates a loss for each piece of unlabeled data included in the mini-batch based on the inference result, and obtains an evaluation result corresponding to the unlabeled data based on the loss and coefficient for each piece of unlabeled data. The unlabeled data evaluation unit 132 outputs the evaluation result corresponding to the unlabeled data to the update unit 134.

続いて、入力部１１１は、ラベル付きデータセット１０２からバッチサイズのラベル付きデータを教師ラベルとラベル有りデータとの組み合わせとして取得することによってミニバッチを作成し、作成したミニバッチをニューラルネットワーク１２０の推測部１２２に出力する（Ｓ１０５）。 Then, the input unit 111 creates a mini-batch by acquiring labeled data of the batch size from the labeled dataset 102 as a combination of teacher labels and labeled data, and outputs the created mini-batch to the estimation unit 122 of the neural network 120 (S105).

続いて、推測部１２２は、入力部１１１によって作成されたミニバッチに含まれるラベル有りデータとニューラルネットワーク１２０とに基づいてラベル有りデータに対応する推論結果を得る（Ｓ１０６）。推測部１２２は、ラベル有りデータに対応する推論結果をラベル有りデータ評価部１３３に出力する。 Then, the estimation unit 122 obtains an inference result corresponding to the labeled data based on the labeled data included in the mini-batch created by the input unit 111 and the neural network 120 (S106). The estimation unit 122 outputs the inference result corresponding to the labeled data to the labeled data evaluation unit 133.

ラベル有りデータ評価部１３３は、ミニバッチに含まれるラベル有りデータごとに、ラベル有りデータに対応する教師ラベルに基づいて、ラベル有りデータを評価してラベル有りデータごとの評価結果を得る（Ｓ１０７）。より詳細に、ラベル有りデータ評価部１３３は、ラベル有りデータに対応する教師ラベルとラベル有りデータとに基づいて損失を算出し、ラベル有りデータごとの損失に基づいて、ラベル有りデータに対応する評価結果を得る。ラベル有りデータ評価部１３３は、ラベル有りデータに対応する評価結果を更新部１３４に出力する。 For each piece of labeled data included in the mini-batch, the labeled data evaluation unit 133 evaluates the labeled data based on the teacher label corresponding to the labeled data to obtain an evaluation result for each piece of labeled data (S107). In more detail, the labeled data evaluation unit 133 calculates a loss based on the teacher label corresponding to the labeled data and the labeled data, and obtains an evaluation result corresponding to the labeled data based on the loss for each piece of labeled data. The labeled data evaluation unit 133 outputs the evaluation result corresponding to the labeled data to the update unit 134.

更新部１３４は、ラベル無しデータ評価部１３２から出力されたラベル無しデータに対応する評価結果とラベル有りデータ評価部１３３から出力されたラベル有りデータに対応する評価結果とに基づいて、重みパラメータ１２１の更新を行う（Ｓ１０８）。これによって、ラベル無しデータに対応する推定ラベルがラベル無しデータに対応する疑似的な教師ラベルに近づくように、かつ、ラベル有りデータがラベル有りデータに対応する教師ラベルに近づくように、重みパラメータ１２１が訓練され得る。 The update unit 134 updates the weighting parameters 121 based on the evaluation results corresponding to the unlabeled data output from the unlabeled data evaluation unit 132 and the evaluation results corresponding to the labeled data output from the labeled data evaluation unit 133 (S108). This allows the weighting parameters 121 to be trained so that the estimated labels corresponding to the unlabeled data approach the pseudo teacher labels corresponding to the unlabeled data, and so that the labeled data approach the teacher labels corresponding to the labeled data.

例えば、更新部１３４は、ラベル有りデータに対応する評価結果とラベル無しデータに対応する評価結果との重み付き和に基づいて、重みパラメータ１２１の更新を行ってよい。より詳細に、更新部１３４は、ラベル有りデータに対応する評価結果とラベル無しデータに対応する評価結果との重み付き和に基づく誤差逆伝播法（バックプロパゲーション）によって重みパラメータ１２１を更新してよい。 For example, the update unit 134 may update the weight parameter 121 based on a weighted sum of the evaluation result corresponding to the labeled data and the evaluation result corresponding to the unlabeled data. More specifically, the update unit 134 may update the weight parameter 121 by backpropagation based on the weighted sum of the evaluation result corresponding to the labeled data and the evaluation result corresponding to the unlabeled data.

更新部１３４は、重みパラメータ１２１の更新が終わるたびに、学習の終了条件が満たされたか否かを判断する（Ｓ１０９）。学習の終了条件が満たされていないと判断された場合には（Ｓ１０９において「ＮＯ」）、Ｓ１０１に動作が移行され、入力部１１１によって次の入力データが取得され、推測部１２２、係数生成部１３１、ラベル無しデータ評価部１３２、ラベル有りデータ評価部１３３および更新部１３４それぞれによって、当該次の入力データに基づく各自の処理が再度実行される。一方、学習の終了条件が満たされたと判断された場合には、学習が終了される。 Each time the update unit 134 finishes updating the weight parameter 121, it determines whether the learning termination condition has been satisfied (S109). If it is determined that the learning termination condition has not been satisfied ("NO" in S109), the operation proceeds to S101, the input unit 111 acquires the next input data, and the estimation unit 122, the coefficient generation unit 131, the unlabeled data evaluation unit 132, the labeled data evaluation unit 133, and the update unit 134 each execute their own processing based on the next input data again. On the other hand, if it is determined that the learning termination condition has been satisfied, the learning is terminated.

以上、本発明の第１の実施形態に係る学習装置１０によって実行される「学習段階」の動作の流れについて説明した。 The above describes the flow of operations in the "learning stage" performed by the learning device 10 according to the first embodiment of the present invention.

（第１の実施形態のまとめ）
以上に説明したように、本発明の第１の実施形態によれば、ラベル無しデータに対応する推論結果に基づいて、ラベル無しデータごとの擬似的な予測クラスが特定される。そして、疑似的な予測クラスに基づいて損失に影響する度合いがラベル無しデータごとに自動的に決定される。 (Summary of the first embodiment)
As described above, according to the first embodiment of the present invention, a pseudo predicted class is identified for each piece of unlabeled data based on the inference result corresponding to the unlabeled data. Then, the degree of influence on the loss is automatically determined for each piece of unlabeled data based on the pseudo predicted class.

これによって、学習段階（特に学習初期段階）において発生し得る現象（すなわち、推論結果が特定のクラスに集中してしまう現象）に対して、推論結果が集中してしまうクラスの損失への影響度を下げることが可能となる。その結果として、安定した学習が可能となるという効果が享受され得る。 This makes it possible to reduce the impact of a phenomenon that can occur during the learning stage (especially the early stages of learning) (i.e., the phenomenon in which inference results are concentrated in a specific class) on the loss of the class in which the inference results are concentrated. As a result, the effect of stable learning can be enjoyed.

また、本発明の第１の実施形態によれば、半教師あり学習のアルゴリズムに依存せず、ラベル無しデータの損失への影響度が決定され得る。さらに、本発明の第１の実施形態によれば、損失への影響度が比較的小さいラベル無しデータが存在する場合であっても、その影響度が係数として学習に用いられるため、閾値などを利用した人手によるデータ選別作業を不要としつつ、安定した学習が可能になる。 In addition, according to the first embodiment of the present invention, the impact of unlabeled data on loss can be determined without relying on a semi-supervised learning algorithm. Furthermore, according to the first embodiment of the present invention, even if there is unlabeled data whose impact on loss is relatively small, the impact is used as a coefficient for learning, making it possible to perform stable learning while eliminating the need for manual data selection using thresholds, etc.

以上、本発明の第１の実施形態について説明した。 The above describes the first embodiment of the present invention.

（２．第２の実施形態）
続いて、本発明の第２の実施形態について説明する。本発明の第２の実施形態においても、学習装置によって半教師あり学習が行われる。 2. Second embodiment
Next, a second embodiment of the present invention will be described. In the second embodiment of the present invention, semi-supervised learning is also performed by a learning device.

図１に示されるように、本発明の第２の実施形態に係る学習装置１０は、本発明の第１の実施形態に係る学習装置１０と同様に、入力部１１１と、推測部１２２と、係数生成部１３１と、ラベル無しデータ評価部１３２と、ラベル有りデータ評価部１３３と、更新部１３４とを備える。 As shown in FIG. 1, the learning device 10 according to the second embodiment of the present invention includes an input unit 111, an estimation unit 122, a coefficient generation unit 131, an unlabeled data evaluation unit 132, a labeled data evaluation unit 133, and an update unit 134, similar to the learning device 10 according to the first embodiment of the present invention.

本発明の第２の実施形態に係る学習装置１０は、本発明の第１の実施形態に係る学習装置１０と比較して、係数生成部１３１の機能が主に異なる。したがって、以下では、係数生成部１３１の機能について主に説明を行い、他のブロックの機能についての詳細な説明は省略する。 The learning device 10 according to the second embodiment of the present invention differs from the learning device 10 according to the first embodiment of the present invention mainly in the function of the coefficient generation unit 131. Therefore, the following mainly describes the function of the coefficient generation unit 131, and a detailed description of the functions of the other blocks is omitted.

（係数生成部１３１）
係数生成部１３１は、推測部１２２から出力されたラベル無しデータに対応する推論結果に基づいて、ラベル無しデータの影響度を示す係数を算出する。より詳細に、係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに推論値が最大となる第１のクラスを特定する。そして、係数生成部１３１は、第１のクラスの推論値と第１のクラスとは異なる１または複数のクラスの推論値とに基づく差分を算出する。例えば、差分の算出に用いられる推論値としては、２種類のラベルのいずれが用いられてもよいが、疑似的な教師ラベルが用いられるのが望ましい。係数生成部１３１は、ラベル無しデータごとに差分に基づいて係数を算出する。 (Coefficient Generation Unit 131)
The coefficient generating unit 131 calculates a coefficient indicating the influence of the unlabeled data based on the inference result corresponding to the unlabeled data output from the estimation unit 122. More specifically, the coefficient generating unit 131 identifies a first class having a maximum inference value for each piece of unlabeled data included in the mini-batch. Then, the coefficient generating unit 131 calculates a difference based on the inference value of the first class and the inference values of one or more classes different from the first class. For example, either of two types of labels may be used as the inference value used to calculate the difference, but it is preferable to use a pseudo teacher label. The coefficient generating unit 131 calculates a coefficient based on the difference for each piece of unlabeled data.

一例として、係数生成部１３１は、推論値が２番目に大きい第２のクラスを特定してもよい。このとき、係数生成部１３１は、ラベル無しデータごとに第１のクラスの推論値と第２のクラスの推論値との差分を算出してもよい。例えば、ニューラルネットワーク１２０がＮクラスへの分類問題を解く場合には（すなわち、ラベル無しデータに対応する推論結果がＮクラス分の推論値を含む場合には）、差分ｄ_ｉは、以下の式（７）のように表現され得る。 As an example, the coefficient generating unit 131 may identify a second class having the second largest inference value. In this case, the coefficient generating unit 131 may calculate the difference between the inference value of the first class and the inference value of the second class for each piece of unlabeled data. For example, when the neural network 120 solves a classification problem into N classes (i.e., when the inference result corresponding to the unlabeled data includes inference values for N classes), the difference d _i may be expressed as in the following formula (7).

式（７）において、ｖは、ラベル無しデータに対応する推論結果を示し、各クラスに対応する推論値の集合を示す。ａｒｇｍａｘは、引数として渡された推論値に対応するクラス番号を出力値として返却する関数である。 In equation (7), v represents the inference result corresponding to the unlabeled data and represents a set of inference values corresponding to each class. argmax is a function that returns the class number corresponding to the inference value passed as an argument as an output value.

なお、第１のクラスの推論値と第１のクラスとは異なる１または複数のクラスの推論値とに基づく差分を算出する手法は限定されない。例えば、係数生成部１３１は、ラベル無しデータごとに第１のクラスの推論値と第１のクラスとは異なる複数のクラスの推論値の平均値との差分を算出してもよい。このとき、平均値を取る関数をａｖｅとすると、差分ｄ_ｉは、以下の式（８）のように表現され得る。 The method of calculating the difference based on the inferred value of the first class and the inferred values of one or more classes different from the first class is not limited. For example, the coefficient generating unit 131 may calculate the difference between the inferred value of the first class and the average value of the inferred values of multiple classes different from the first class for each piece of unlabeled data. In this case, if the function that takes the average value is ave, the difference d _i can be expressed as the following formula (8).

係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに、差分ｄに基づいて係数ｔを算出する。一例として、係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに差分ｄに対して正の相関を有する数を係数ｔとして算出するのが望ましい。これによって、大きい差分に対応するラベル無しデータほど、影響度が高く扱われるようになる。 The coefficient generation unit 131 calculates a coefficient t for each piece of unlabeled data included in the mini-batch based on the difference d. As an example, it is preferable that the coefficient generation unit 131 calculates a number that has a positive correlation with the difference d as the coefficient t for each piece of unlabeled data included in the mini-batch. This allows unlabeled data that corresponds to a larger difference to be treated as having a higher degree of influence.

例えば、入力値に対して正の相関を示す出力値を返却する関数をｇとすると、入力値に対して正の相関を示す出力値を返却する関数ｇは、以下の式（９）のように表現され得る。 For example, if a function that returns an output value that shows a positive correlation with an input value is g, the function g that returns an output value that shows a positive correlation with an input value can be expressed as the following equation (9).

例えば、入力値に対して正の相関を示す出力値を返却する関数ｇの例として、以下の式（１０）が挙げられる。 For example, the following formula (10) is an example of a function g that returns an output value that shows a positive correlation with an input value.

以上、本発明の第２の実施形態に係る学習装置の構成例について説明した。 The above describes an example of the configuration of a learning device according to the second embodiment of the present invention.

（学習段階の動作）
続いて、図２を参照しながら、本発明の第２の実施形態に係る学習装置１０によって実行される「学習段階」の動作の流れについて説明する。本発明の第２の実施形態に係る学習装置１０によって実行される「学習段階」の動作は、本発明の第１の実施形態に係る学習装置１０によって実行される「学習段階」の動作と比較して、係数生成部１３１の動作が主に異なる。したがって、以下では、係数生成部１３１の動作について主に説明を行い、他の動作についての詳細な説明は省略する。 (Learning stage operation)
Next, the flow of operations in the "learning stage" performed by the learning device 10 according to the second embodiment of the present invention will be described with reference to Fig. 2. The operations in the "learning stage" performed by the learning device 10 according to the second embodiment of the present invention differ from the operations in the "learning stage" performed by the learning device 10 according to the first embodiment of the present invention mainly in the operations of the coefficient generation unit 131. Therefore, the following mainly describes the operations of the coefficient generation unit 131, and detailed descriptions of other operations are omitted.

本発明の第２の実施形態においても、本発明の第１の実施形態と同様に、Ｓ１０１～Ｓ１０２が実行される。続いて、係数生成部１３１は、推測部１２２から出力されたラベル無しデータに対応する推論結果に基づいて、ラベル無しデータの影響度を示す係数を算出する（Ｓ１０３）。 In the second embodiment of the present invention, steps S101 to S102 are executed in the same manner as in the first embodiment of the present invention. Next, the coefficient generation unit 131 calculates a coefficient indicating the degree of influence of the unlabeled data based on the inference result corresponding to the unlabeled data output from the estimation unit 122 (S103).

より詳細に、係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに推論値が最大となる第１のクラスを特定する。そして、係数生成部１３１は、第１のクラスの推論値と第１のクラスとは異なる１または複数のクラスの推論値とに基づく差分を算出する。例えば、差分の算出に用いられる推論値としては、２種類のラベルのいずれが用いられてもよいが、疑似的な教師ラベルが用いられるのが望ましい。係数生成部１３１は、ラベル無しデータごとに差分に基づいて係数を算出する。 More specifically, the coefficient generation unit 131 identifies a first class that has the maximum inference value for each piece of unlabeled data included in the mini-batch. Then, the coefficient generation unit 131 calculates a difference based on the inference value of the first class and the inference values of one or more classes different from the first class. For example, either of two types of labels may be used as the inference value used to calculate the difference, but it is preferable to use a pseudo teacher label. The coefficient generation unit 131 calculates a coefficient based on the difference for each piece of unlabeled data.

一例として、係数生成部１３１は、推論値が２番目に大きい第２のクラスを特定してもよい。このとき、係数生成部１３１は、ラベル無しデータごとに第１のクラスの推論値と第２のクラスの推論値との差分を算出してもよい。あるいは、係数生成部１３１は、ラベル無しデータごとに第１のクラスの推論値と第１のクラスとは異なる複数のクラスの推論値の平均値との差分を算出してもよい。 As an example, the coefficient generation unit 131 may identify a second class having the second largest inferred value. In this case, the coefficient generation unit 131 may calculate the difference between the inferred value of the first class and the inferred value of the second class for each piece of unlabeled data. Alternatively, the coefficient generation unit 131 may calculate the difference between the inferred value of the first class and the average value of the inferred values of multiple classes different from the first class for each piece of unlabeled data.

係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに、差分に基づいて係数を算出する。一例として、係数生成部１３１は、ミニバッチに含まれるラベル無しデータごとに差分に対して正の相関を有する数を係数として算出するのが望ましい。これによって、大きい差分に対応するラベル無しデータほど、影響度が高く扱われるようになる。 The coefficient generation unit 131 calculates a coefficient based on the difference for each piece of unlabeled data included in the mini-batch. As an example, it is preferable that the coefficient generation unit 131 calculates a number that has a positive correlation with the difference for each piece of unlabeled data included in the mini-batch as the coefficient. This allows unlabeled data that corresponds to a larger difference to be treated as having a higher degree of influence.

本発明の第２の実施形態においても、本発明の第１の実施形態と同様に、Ｓ１０４～Ｓ１０９が実行される。 In the second embodiment of the present invention, steps S104 to S109 are executed in the same manner as in the first embodiment of the present invention.

以上、本発明の第２の実施形態に係る学習装置１０によって実行される「学習段階」の動作の流れについて説明した。 The above describes the flow of operations in the "learning stage" performed by the learning device 10 according to the second embodiment of the present invention.

（第２の実施形態のまとめ）
以上に説明したように、本発明の第２の実施形態によれば、ラベル無しデータに対応する推論結果に基づいて、ラベル無しデータごとに推論値が最大となる第１のクラスの推論値と第１のクラスとは異なる１または複数のクラスの推論値とに基づく差分が算出される。そして、差分に基づいて損失に影響する度合いがラベル無しデータごとに自動的に決定される。 (Summary of the second embodiment)
As described above, according to the second embodiment of the present invention, a difference is calculated for each piece of unlabeled data based on an inference result corresponding to the unlabeled data, between an inferred value of a first class having a maximum inferred value and inferred values of one or more classes different from the first class. Then, a degree of influence on loss is automatically determined for each piece of unlabeled data based on the difference.

これによって、本発明の第１の実施形態と同様の効果が享受され得る。さらに、本発明の第２の実施形態によれば、バッチサイズが比較的小さい場合であっても、損失に影響する度合いが高精度に決定され得る。そのため、本発明の第２の実施形態によれば、バッチサイズが比較的小さい場合であっても、安定した学習が可能になる。 This allows the same effects as those of the first embodiment of the present invention to be obtained. Furthermore, according to the second embodiment of the present invention, even if the batch size is relatively small, the degree of influence on the loss can be determined with high accuracy. Therefore, according to the second embodiment of the present invention, stable learning is possible even if the batch size is relatively small.

以上、本発明の第２の実施形態について説明した。 The second embodiment of the present invention has been described above.

（３．ハードウェア構成例）
続いて、本発明の第１の実施形態に係る学習装置１０のハードウェア構成例について説明する。なお、本発明の第２の実施形態に係る学習装置１０のハードウェア構成も、本発明の第１の実施形態に係る学習装置１０のハードウェア構成と同様に実現され得る。 (3. Hardware Configuration Example)
Next, a hardware configuration example of the learning device 10 according to the first embodiment of the present invention will be described. Note that the hardware configuration of the learning device 10 according to the second embodiment of the present invention can be realized in the same manner as the hardware configuration of the learning device 10 according to the first embodiment of the present invention.

以下では、本発明の第１の実施形態に係る学習装置１０のハードウェア構成例として、情報処理装置９００のハードウェア構成例について説明する。なお、以下に説明する情報処理装置９００のハードウェア構成例は、学習装置１０のハードウェア構成の一例に過ぎない。したがって、学習装置１０のハードウェア構成は、以下に説明する情報処理装置９００のハードウェア構成から不要な構成が削除されてもよいし、新たな構成が追加されてもよい。 Below, an example of the hardware configuration of an information processing device 900 will be described as an example of the hardware configuration of the learning device 10 according to the first embodiment of the present invention. Note that the example of the hardware configuration of the information processing device 900 described below is merely one example of the hardware configuration of the learning device 10. Therefore, the hardware configuration of the learning device 10 may be such that unnecessary components are deleted from the hardware configuration of the information processing device 900 described below, or new components may be added.

図３は、本発明の第１の実施形態に係る学習装置１０の例としての情報処理装置９００のハードウェア構成を示す図である。情報処理装置９００は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）９０１と、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）９０２と、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）９０３と、ホストバス９０４と、ブリッジ９０５と、外部バス９０６と、インタフェース９０７と、入力装置９０８と、出力装置９０９と、ストレージ装置９１０と、通信装置９１１と、を備える。 Figure 3 is a diagram showing the hardware configuration of an information processing device 900 as an example of a learning device 10 according to the first embodiment of the present invention. The information processing device 900 includes a CPU (Central Processing Unit) 901, a ROM (Read Only Memory) 902, a RAM (Random Access Memory) 903, a host bus 904, a bridge 905, an external bus 906, an interface 907, an input device 908, an output device 909, a storage device 910, and a communication device 911.

ＣＰＵ９０１は、演算処理装置および制御装置として機能し、各種プログラムに従って情報処理装置９００内の動作全般を制御する。また、ＣＰＵ９０１は、マイクロプロセッサであってもよい。ＲＯＭ９０２は、ＣＰＵ９０１が使用するプログラムや演算パラメータ等を記憶する。ＲＡＭ９０３は、ＣＰＵ９０１の実行において使用するプログラムや、その実行において適宜変化するパラメータ等を一時記憶する。これらはＣＰＵバス等から構成されるホストバス９０４により相互に接続されている。 The CPU 901 functions as an arithmetic processing device and control device, and controls the overall operation of the information processing device 900 in accordance with various programs. The CPU 901 may also be a microprocessor. The ROM 902 stores programs and arithmetic parameters used by the CPU 901. The RAM 903 temporarily stores programs used in the execution of the CPU 901 and parameters that change appropriately during the execution. These are interconnected by a host bus 904 consisting of a CPU bus, etc.

ホストバス９０４は、ブリッジ９０５を介して、ＰＣＩ（ＰｅｒｉｐｈｅｒａｌＣｏｍｐｏｎｅｎｔＩｎｔｅｒｃｏｎｎｅｃｔ／Ｉｎｔｅｒｆａｃｅ）バス等の外部バス９０６に接続されている。なお、必ずしもホストバス９０４、ブリッジ９０５および外部バス９０６を分離構成する必要はなく、１つのバスにこれらの機能を実装してもよい。 The host bus 904 is connected to an external bus 906, such as a PCI (Peripheral Component Interconnect/Interface) bus, via a bridge 905. Note that the host bus 904, bridge 905, and external bus 906 do not necessarily need to be configured separately, and these functions may be implemented on a single bus.

入力装置９０８は、マウス、キーボード、タッチパネル、ボタン、マイクロフォン、スイッチおよびレバー等ユーザが情報を入力するための入力手段と、ユーザによる入力に基づいて入力信号を生成し、ＣＰＵ９０１に出力する入力制御回路等から構成されている。情報処理装置９００を操作するユーザは、この入力装置９０８を操作することにより、情報処理装置９００に対して各種のデータを入力したり処理動作を指示したりすることができる。 The input device 908 is composed of input means for the user to input information, such as a mouse, keyboard, touch panel, button, microphone, switch, and lever, and an input control circuit that generates an input signal based on the user's input and outputs it to the CPU 901. A user who operates the information processing device 900 can input various data to the information processing device 900 and instruct processing operations by operating this input device 908.

出力装置９０９は、例えば、ＣＲＴ（ＣａｔｈｏｄｅＲａｙＴｕｂｅ）ディスプレイ装置、液晶ディスプレイ（ＬＣＤ）装置、ＯＬＥＤ（ＯｒｇａｎｉｃＬｉｇｈｔＥｍｉｔｔｉｎｇＤｉｏｄｅ）装置、ランプ等の表示装置およびスピーカ等の音声出力装置を含む。 The output device 909 includes, for example, display devices such as a CRT (Cathode Ray Tube) display device, a Liquid Crystal Display (LCD) device, an OLED (Organic Light Emitting Diode) device, and a lamp, as well as audio output devices such as a speaker.

ストレージ装置９１０は、データ格納用の装置である。ストレージ装置９１０は、記憶媒体、記憶媒体にデータを記録する記録装置、記憶媒体からデータを読み出す読出し装置および記憶媒体に記録されたデータを削除する削除装置等を含んでもよい。ストレージ装置９１０は、例えば、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）で構成される。このストレージ装置９１０は、ハードディスクを駆動し、ＣＰＵ９０１が実行するプログラムや各種データを格納する。 The storage device 910 is a device for storing data. The storage device 910 may include a storage medium, a recording device for recording data on the storage medium, a reading device for reading data from the storage medium, and a deleting device for deleting data recorded on the storage medium. The storage device 910 is configured, for example, with a HDD (Hard Disk Drive). This storage device 910 drives the hard disk and stores the programs executed by the CPU 901 and various data.

通信装置９１１は、例えば、ネットワークに接続するための通信デバイス等で構成された通信インタフェースである。また、通信装置９１１は、無線通信または有線通信のどちらに対応してもよい。 The communication device 911 is, for example, a communication interface configured with a communication device for connecting to a network. The communication device 911 may support either wireless communication or wired communication.

以上、本発明の第１の実施形態に係る学習装置１０のハードウェア構成例について説明した。 The above describes an example of the hardware configuration of the learning device 10 according to the first embodiment of the present invention.

（４．まとめ）
以上、添付図面を参照しながら本発明の好適な実施形態について詳細に説明したが、本発明はかかる例に限定されない。本発明の属する技術の分野における通常の知識を有する者であれば、特許請求の範囲に記載された技術的思想の範疇内において、各種の変更例または修正例に想到し得ることは明らかであり、これらについても、当然に本発明の技術的範囲に属するものと了解される。 (4. Summary)
Although the preferred embodiment of the present invention has been described in detail above with reference to the accompanying drawings, the present invention is not limited to such an example. It is clear that a person having ordinary knowledge in the technical field to which the present invention pertains can conceive of various modified or altered examples within the scope of the technical ideas described in the claims, and it is understood that these also naturally belong to the technical scope of the present invention.

本発明の第１の実施形態および本発明の第２の実施形態では、学習用データが画像データである場合（特に、静止画像データである場合）について主に説明した。しかし、学習用データの種類は特に限定されない。例えば、学習用データの種類に合わせた特徴量が抽出されれば、静止画像データ以外も学習用データとして用いられ得る。例えば、学習用データは、複数のフレームを含んだ動画像データであってもよいし、音声データであってもよい。 In the first embodiment and the second embodiment of the present invention, the case where the learning data is image data (particularly, still image data) has been mainly described. However, the type of learning data is not particularly limited. For example, as long as features that match the type of learning data are extracted, data other than still image data may also be used as learning data. For example, the learning data may be video data including multiple frames, or may be audio data.

このとき、学習用データが静止画像データである場合には、推測部１２２に含まれる畳み込み層として２次元畳み込み層が用いられるのが一般的である。一方、推測部１２２に含まれる畳み込み層として３次元畳み込み層が用いられれば、学習用データとして動画像データが適用され得る。 In this case, when the learning data is still image data, a two-dimensional convolutional layer is generally used as the convolutional layer included in the estimation unit 122. On the other hand, when a three-dimensional convolutional layer is used as the convolutional layer included in the estimation unit 122, video image data can be applied as the learning data.

本発明の第１の実施形態では、入力値に対して負の相関を示す出力値を返却する関数ｆの例として、以下の式（３）を挙げて説明した。しかし、入力値に対して負の相関を示す出力値を返却する関数ｆは、かかる例に限定されない。例えば、入力値に対して負の相関を示す出力値を返却する関数ｆの例として、以下の式（３－Ａ）および式（３－Ｂ）なども挙げられる。 In the first embodiment of the present invention, the following formula (3) has been given as an example of a function f that returns an output value that shows a negative correlation with an input value. However, the function f that returns an output value that shows a negative correlation with an input value is not limited to this example. For example, the following formulas (3-A) and (3-B) are also examples of a function f that returns an output value that shows a negative correlation with an input value.

本発明の第２の実施形態では、入力値に対して正の相関を示す出力値を返却する関数ｇの例として、以下の式（９）を挙げて説明した。しかし、入力値に対して正の相関を示す出力値を返却する関数ｇは、かかる例に限定されない。例えば、入力値に対して正の相関を示す出力値を返却する関数ｇの例として、以下の式（９－Ａ）および式（９－Ｂ）なども挙げられる。 In the second embodiment of the present invention, the following formula (9) has been given as an example of function g that returns an output value that shows a positive correlation with an input value. However, function g that returns an output value that shows a positive correlation with an input value is not limited to this example. For example, the following formulas (9-A) and (9-B) are also examples of function g that returns an output value that shows a positive correlation with an input value.

１０学習装置
１０１ラベル無しデータセット
１０２ラベル付きデータセット
１１１入力部
１２０ニューラルネットワーク
１２１重みパラメータ
１２２推測部
１３１係数生成部
１３２ラベル無しデータ評価部
１３３ラベル有りデータ評価部
１３４更新部

REFERENCE SIGNS LIST 10 Learning device 101 Unlabeled data set 102 Labeled data set 111 Input section 120 Neural network 121 Weight parameters 122 Estimation section 131 Coefficient generation section 132 Unlabeled data evaluation section 133 Labeled data evaluation section 134 Update section

Claims

An input unit that acquires unlabeled data and labeled data to which a teacher label is attached;
an inference unit that outputs an inference result corresponding to the unlabeled data and an inference result corresponding to the labeled data based on the unlabeled data, the labeled data, and a neural network;
a coefficient generation unit that calculates a coefficient indicating an influence of the unlabeled data based on an inference result corresponding to the unlabeled data;
an unlabeled data evaluation unit that outputs an evaluation result corresponding to the unlabeled data based on an inference result corresponding to the unlabeled data and the coefficients;
a labeled data evaluation unit that outputs an evaluation result corresponding to the labeled data based on an inference result corresponding to the labeled data and the teacher label;
an update unit that updates weight parameters of the neural network based on an evaluation result corresponding to the unlabeled data and an evaluation result corresponding to the labeled data;
An information processing device comprising:

the inference results corresponding to the unlabeled data include an inference value for each class;
the coefficient generation unit calculates a difference between an inferred value of a first class having a maximum inferred value for each unlabeled data and an inferred value of one or more classes different from the first class, and calculates the coefficient for each unlabeled data based on the difference;
The information processing device according to claim 1 .

the coefficient generation unit calculates, for each of the unlabeled data, a difference between an inferred value of the first class and an inferred value of a second class having a second largest inferred value;
The information processing device according to claim 2 .

the coefficient generation unit calculates, for each of the unlabeled data, a difference between the inferred value of the first class and an average value of inferred values of a plurality of classes different from the first class;
The information processing device according to claim 2 .

the coefficient generation unit calculates, for each of the unlabeled data, a number having a positive correlation with the difference as the coefficient;
The information processing device according to any one of claims 2 to 4.

the inference results corresponding to the unlabeled data include an inference value for each class;
the coefficient generation unit specifies a predicted class based on an inference value for each unlabeled data, calculates a number of unlabeled data corresponding to the predicted class as a frequency of the predicted class, and calculates the coefficient based on the frequency of the predicted class for each of the unlabeled data.
The information processing device according to claim 1 .

The coefficient generation unit identifies a class for which the inference value is maximum as a predicted class for each of the unlabeled data.
The information processing device according to claim 6 .

the coefficient generation unit calculates, for each of the unlabeled data, a number having a negative correlation with a frequency of the predicted class as the coefficient;
8. The information processing device according to claim 6 or 7.

the unlabeled data evaluation unit outputs an evaluation result corresponding to the unlabeled data by multiplying a loss based on an inference result corresponding to the unlabeled data by the coefficient.
The information processing device according to any one of claims 2 to 8.

Obtaining unlabeled data and labeled data to which a teacher label is attached;
outputting an inference result corresponding to the unlabeled data and an inference result corresponding to the labeled data based on the unlabeled data, the labeled data, and a neural network;
calculating a coefficient indicating an influence of the unlabeled data based on an inference result corresponding to the unlabeled data;
outputting an evaluation result corresponding to the unlabeled data based on the inference result corresponding to the unlabeled data and the coefficients;
outputting an evaluation result corresponding to the labeled data based on an inference result corresponding to the labeled data and the teacher label;
updating weight parameters of the neural network based on the evaluation results corresponding to the unlabeled data and the evaluation results corresponding to the labeled data;
13. A computer-implemented information processing method comprising:

Computer,
An input unit that acquires unlabeled data and labeled data to which a teacher label is attached;
an inference unit that outputs an inference result corresponding to the unlabeled data and an inference result corresponding to the labeled data based on the unlabeled data, the labeled data, and a neural network;
a coefficient generation unit that calculates a coefficient indicating an influence of the unlabeled data based on an inference result corresponding to the unlabeled data;
an unlabeled data evaluation unit that outputs an evaluation result corresponding to the unlabeled data based on an inference result corresponding to the unlabeled data and the coefficients;
a labeled data evaluation unit that outputs an evaluation result corresponding to the labeled data based on an inference result corresponding to the labeled data and the teacher label;
an update unit that updates weight parameters of the neural network based on an evaluation result corresponding to the unlabeled data and an evaluation result corresponding to the labeled data;
A program that causes the information processing device to function as an information processing device having the above-mentioned.