JP7673757B2

JP7673757B2 - ANNOTATION DEVICE, ANNOTATION METHOD, AND ANNOTATION PROGRAM

Info

Publication number: JP7673757B2
Application number: JP2022569380A
Authority: JP
Inventors: 佑樹北岸; 岳至森; 歩相名神山
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 2020-12-15
Filing date: 2020-12-15
Publication date: 2025-05-09
Anticipated expiration: 2040-12-15
Also published as: WO2022130516A1; JPWO2022130516A1; JP7823699B2; JP2024164314A

Description

本発明は、アノテーション装置、アノテーション方法およびアノテーションプログラムに関する。 The present invention relates to an annotation device, an annotation method, and an annotation program.

従来、機械学習における教師あり学習のためには、学習データとそれに対応する正解ラベルが必要である。多くの研究では、複数名でデータを視聴等してメタデータを付与する作業（アノテーション）が行われている。Traditionally, supervised learning in machine learning requires training data and corresponding correct labels. In many studies, multiple people view the data and assign metadata (annotation).

例えば、音声や動画像に対するアノテーションの場合、作業者（適宜、「アノテータ」）は、提示された数秒～数十秒の音声や動画像を視聴し、仕様に合うようにメタデータを付与する。具体的には、音声からの感情認識の研究開発に向けたアノテーションであれば、聴取した音声に対して最も適切な感情を選択するし、画像に対するオブジェクト検出やオブジェクト認識であれば、オブジェクトの画像内における領域を選択し、オブジェクトに対する説明を付与する。For example, when annotating audio or video, a worker (sometimes called an "annotator") listens to a few to several tens of seconds of the audio or video provided and adds metadata to meet the specifications. Specifically, if the annotation is for research and development of emotion recognition from audio, the worker would select the emotion most appropriate for the audio heard, and if it is object detection or recognition for an image, the worker would select the area of the object in the image and add a description of the object.

従来のアノテーション手法は、作業の比較対象の有無に分けることができる。比較対象がない場合、アノテータは静止画もしくは数秒程度の音声や動画像を視聴して、メタデータを付与する。この手法は、データ視聴回数＝総サンプル数Ｎとなるため、時間コストが低い。また、短時間でも確実に誰もが理解できるタスク（例：文字起こし、オブジェクトへのタグ付け、誰が見聞きしても明らかに怒っている状態等）であれば、正確にアノテーションを行うことができる。Conventional annotation methods can be divided into those that have a comparison target for the task and those that do not. When there is no comparison target, the annotator watches still images or a few seconds of audio or video and assigns metadata. This method has a low time cost because the number of times data is viewed = the total number of samples N. In addition, accurate annotation can be performed for tasks that can be reliably understood by anyone even in a short time (e.g., transcription, tagging objects, or a state in which the person is clearly angry no matter who sees or hears it).

一方、比較対象がある場合、アノテータは長時間（数十秒～数分）の音声や動画像を視聴して連続的かつ相対的な事象の変化に関するメタデータを付与したり（例えば、非特許文献３参照）、複数の音声や動画像を視聴して相対的に順位やスコアを付与したりする（例えば、非特許文献４参照）。この手法は、比較する対象があるため、アノテータ間のブレを低減し、より正確なメタデータを付与できる。On the other hand, when there is something to compare, annotators can listen to audio or video for a long period of time (tens of seconds to several minutes) and assign metadata about continuous and relative changes in events (see, for example, Non-Patent Document 3), or listen to multiple audio or video clips and assign relative rankings or scores (see, for example, Non-Patent Document 4). This method reduces variation between annotators and allows for more accurate metadata to be assigned, since there is something to compare.

Mohammad Soleymani， and Martha Larson， “Crowdsourcing for Affective Annotation of Video: Development of a Viewer-reported Boredom Corpus”， 2010．Mohammad Soleymani, and Martha Larson, “Crowdsourcing for Affective Annotation of Video: Development of a Viewer-reported Boredom Corpus”, 2010. Ryutaro Tanno， Ardavan Saeedi， Swami Sankaranarayanan， Daniel C. Alexander， and Nathan Silberman， “Learning From Noisy Labels By Regularized Estimation Of Annotator Confusion”， 2019.Ryutaro Tanno, Ardavan Saeedi, Swami Sankaranarayanan, Daniel C. Alexander, and Nathan Silberman, “Learning From Noisy Labels By Regularized Estimation Of Annotator Confusion”, 2019. David Melhart， Antonios Liapis， and Georgios N. Yannakakis “PAGAN: Video Affect Annotation Made Easy”， 2019.David Melhart, Antonios Liapis, and Georgios N. Yannakakis “PAGAN: Video Affect Annotation Made Easy”, 2019. Lifang Yang， and Rui Zhu， “Subjective Evaluation of Cooling Fan Sound based on Grade Scoring and Paired Comparison”， 2016.Lifang Yang, and Rui Zhu, “Subjective Evaluation of Cooling Fan Sound based on Grade Scoring and Paired Comparison”, 2016.

しかしながら、上述した従来技術では、機械学習における教師あり学習において、より低コストかつ高精度なアノテーションを行うことができない。なぜならば、比較対象がないアノテーション手法では、短時間の視聴では理解が難しいタスクの場合、正確なアノテーションができず、アノテータ間の付与ラベルのばらつきが大きくなり、アノテーション結果の信頼性が低くなるといった問題がある。However, the above-mentioned conventional techniques cannot perform low-cost and highly accurate annotation in supervised learning in machine learning. This is because annotation methods without a comparison target cannot perform accurate annotation for tasks that are difficult to understand through short viewing, and there is a large variance in the labels assigned by annotators, resulting in low reliability of annotation results.

このような問題に対して、アノテータの品質、回答傾向の考慮、多人数アノテーションでノイズの影響を小さくする等の対応があるが、アノテーションそのものの信頼性を上げるという根本的な解決にはなっていない（例えば、非特許文献１、２参照）。例えば、集中度のアノテーションを行う場合、非常に集中している、または全く集中していない様子、つまり誰が見聞きしても明らかにわかる状態であれば複数アノテータによる投票は一致しやすいが、集中しているのかそうでないのかがわかりにくい状態の場合、正確なアノテーションは難しく、結果として微妙な違いを表現できない。 To address this issue, there are approaches such as taking into consideration the quality of the annotators, answer trends, and reducing the effects of noise by having multiple people annotate, but these do not fundamentally solve the problem of increasing the reliability of the annotation itself (see, for example, Non-Patent Documents 1 and 2). For example, when annotating the degree of concentration, if the person appears to be very focused or not focused at all, that is, if it is clear to anyone who sees or hears, then the votes of multiple annotators are likely to match, but if it is difficult to tell whether someone is focused or not, accurate annotation is difficult, and as a result, subtle differences cannot be expressed.

一方、比較対象がないアノテーション手法では、長時間もしくは大量のデータを視聴する必要があり、アノテーションに膨大なコストを要する。例えば、いくつかのデータの組み合わせを同時に視聴する場合、全Ｎサンプルのデータからｎ個ずつ選択すると最大_ＮＣ_ｎ個の組み合わせが存在し得る。心理学実験法を参考にアノテーションの品質を保ちつつ組み合わせ数を削減することは可能かもしれないが、そのためにはどの組み合わせを除外するかについては慎重な検討が必要である。 On the other hand, annotation methods without a comparison target require viewing a long time or a large amount of data, which requires huge costs for annotation. For example, when viewing several combinations of data simultaneously, if n pieces of data are selected from all N samples, a maximum of _N C _n combinations may exist. It may be possible to reduce the number of combinations while maintaining the quality of annotation by referring to psychological experiment methods, but careful consideration is required to determine which combinations to exclude.

上述した課題を解決し、目的を達成するために、本発明に係るアノテーション装置は、機械学習に用いられる第１の学習データを取得する取得部と、前記取得部によって取得された前記第１の学習データを複数のアノテータに配信する第１配信部と、各アノテータによって前記第１の学習データにそれぞれ付与された第１の正解ラベルの信頼度に基づいて、前記第１の学習データを分類する分類部と、前記分類部によって分類された前記第１の学習データの分類結果を配信する第２配信部とを備えることを特徴とする。In order to solve the above-mentioned problems and achieve the objective, the annotation device of the present invention is characterized in that it comprises an acquisition unit that acquires first learning data to be used in machine learning, a first distribution unit that distributes the first learning data acquired by the acquisition unit to a plurality of annotators, a classification unit that classifies the first learning data based on the reliability of a first correct label that is assigned to the first learning data by each annotator, and a second distribution unit that distributes the classification result of the first learning data classified by the classification unit.

また、本発明に係るアノテーション方法は、アノテーション装置によって実行されるアノテーション方法であって、機械学習に用いられる第１の学習データを取得する取得工程と、前記取得工程によって取得された前記第１の学習データを複数のアノテータに配信する第１配信工程と、各アノテータによって前記第１の学習データにそれぞれ付与された第１の正解ラベルの信頼度に基づいて、前記第１の学習データを分類する分類工程と、前記分類工程によって分類された前記第１の学習データの分類結果を配信する第２配信工程とを含むことを特徴とする。 The annotation method according to the present invention is an annotation method executed by an annotation device, and is characterized in that it includes an acquisition step of acquiring first learning data to be used in machine learning, a first distribution step of distributing the first learning data acquired by the acquisition step to a plurality of annotators, a classification step of classifying the first learning data based on the reliability of a first correct label assigned to the first learning data by each annotator, and a second distribution step of distributing the classification result of the first learning data classified by the classification step.

また、本発明に係るアノテーションプログラムは、機械学習に用いられる第１の学習データを取得する取得ステップと、前記取得ステップによって取得された前記第１の学習データを複数のアノテータに配信する第１配信ステップと、各アノテータによって前記第１の学習データにそれぞれ付与された第１の正解ラベルの信頼度に基づいて、前記第１の学習データを分類する分類ステップと、前記分類ステップによって分類された前記第１の学習データの分類結果を配信する第２配信ステップとをコンピュータに実行させることを特徴とする。 The annotation program of the present invention is characterized in that it causes a computer to execute an acquisition step of acquiring first learning data to be used in machine learning, a first distribution step of distributing the first learning data acquired by the acquisition step to a plurality of annotators, a classification step of classifying the first learning data based on the reliability of a first correct label assigned to the first learning data by each annotator, and a second distribution step of distributing the classification result of the first learning data classified by the classification step.

本発明では、機械学習における教師あり学習において、より低コストかつ高精度なアノテーションを行うことができる。 The present invention enables lower-cost and more accurate annotation in supervised learning in machine learning.

図１は、第１の実施形態に係るアノテーションシステムの構成例を示す図である。FIG. 1 is a diagram showing an example of the configuration of an annotation system according to the first embodiment. 図２は、第１の実施形態に係るアノテーション装置の構成例を示すブロック図である。FIG. 2 is a block diagram showing an example of the configuration of the annotation device according to the first embodiment. 図３は、第１の実施形態に係る学習データの一例を示す図である。FIG. 3 is a diagram illustrating an example of learning data according to the first embodiment. 図４は、第１の実施形態に係る第１の学習データと第１の正解ラベルの一例を示す図である。FIG. 4 is a diagram illustrating an example of first learning data and a first correct label according to the first embodiment. 図５は、第１の実施形態に係る第２の学習データと第２の正解ラベルの一例を示す図である。FIG. 5 is a diagram illustrating an example of second learning data and a second correct label according to the first embodiment. 図６は、第１の実施形態に係るアノテーション処理の流れの一例を示すフローチャートである。FIG. 6 is a flowchart showing an example of the flow of the annotation process according to the first embodiment. 図７は、第１の実施形態に係る第１の学習データの分類処理の流れの一例を示すフローチャートである。FIG. 7 is a flowchart showing an example of the flow of the first learning data classification process according to the first embodiment. 図８は、プログラムを実行するコンピュータを示す図である。FIG. 8 is a diagram illustrating a computer that executes a program.

以下に、本発明に係るアノテーション装置、アノテーション方法およびアノテーションプログラムの実施形態を図面に基づいて詳細に説明する。なお、本発明は、以下に説明する実施形態により限定されるものではない。 Below, an annotation device, an annotation method, and an annotation program according to the present invention will be described in detail with reference to the drawings. Note that the present invention is not limited to the embodiments described below.

〔第１の実施形態〕
以下に、本実施形態に係るアノテーションシステムの構成、アノテーション装置の構成、アノテーション処理の具体例、アノテーション処理の流れ、データの分類処理の流れを順に説明し、最後に本実施形態の効果を説明する。 First Embodiment
The configuration of the annotation system according to this embodiment, the configuration of the annotation device, a specific example of the annotation process, the flow of the annotation process, and the flow of the data classification process will be described below in this order, and finally the effects of this embodiment will be described.

［アノテーションシステムの構成］
図１を用いて、本実施形態に係るアノテーションシステム（適宜、本システム）１００の構成を詳細に説明する。図１は、第１の実施形態に係るアノテーションシステムの一例を示す図である。アノテーションシステム１００は、サーバ等のアノテーション装置１０、各種端末等のアノテータ２０（２０Ａ、２０Ｂ、２０Ｃ）および各種データベース３０（３０Ａ、３０Ｂ、３０Ｃ）を有する。 [Configuration of annotation system]
The configuration of an annotation system (or "this system" as appropriate) 100 according to this embodiment will be described in detail with reference to Fig. 1. Fig. 1 is a diagram showing an example of an annotation system according to a first embodiment. The annotation system 100 includes an annotation device 10 such as a server, annotators 20 (20A, 20B, 20C) such as various terminals, and various databases 30 (30A, 30B, 30C).

ここで、アノテーション装置１０とアノテータ２０とデータベース３０とは、図示しない所定の通信網を介して、有線または無線により通信可能に接続される。なお、図１に示したアノテーションシステム１００には、複数台のアノテーション装置１０が含まれてもよい。Here, the annotation device 10, the annotator 20, and the database 30 are connected to each other via a predetermined communication network (not shown) so as to be able to communicate with each other by wire or wirelessly. Note that the annotation system 100 shown in FIG. 1 may include multiple annotation devices 10.

まず、アノテーション装置１０は、各種データベース３０から、研究や開発に必用な学習データを第１の学習データとして取得する（ステップＳ１）。ここで、取得する学習データとは、音声、画像、動画等のデータであって、当該研究や開発の目的に応じた媒体、規模で取得される。First, the annotation device 10 acquires learning data required for research and development as first learning data from various databases 30 (step S1). Here, the acquired learning data is data such as audio, images, and videos, and is acquired in a medium and on a scale appropriate to the purpose of the research and development.

次に、アノテーション装置１０は、取得した第１の学習データをアノテータ２０に配信する（ステップＳ２）。ここで、アノテータ２０は、配信された学習データにそれぞれ正解ラベルを付与する端末および当該端末のユーザであるが、特に限定されない。アノテータ２０は、別途作成された特定の正解ラベルを付与できる機械学習モデルであってもよい。Next, the annotation device 10 distributes the acquired first learning data to the annotator 20 (step S2). Here, the annotator 20 is a terminal and a user of the terminal that respectively assign correct answer labels to the distributed learning data, but is not limited thereto. The annotator 20 may be a machine learning model that can assign a specific correct answer label that has been created separately.

続いて、アノテータ２０は、配信された第１の学習データに正解ラベル（第１の正解ラベル）を付与する（ステップＳ３）。また、アノテーション装置１０は、正解ラベルを付与された第１の学習データを取得する（ステップＳ４）。Next, the annotator 20 assigns a correct label (first correct label) to the distributed first learning data (step S3). The annotation device 10 also acquires the first learning data to which the correct label has been assigned (step S4).

その後、アノテーション装置１０は、第１の正解ラベルをもとに第１の学習データを分類する（ステップＳ５）。このとき、アノテーション装置１０は、アノテータ２０から取得した回答に基づいて、信頼できる正解データを付与された学習データを基準点（適宜、「基準データ」）Ｓとして選定する。また、アノテーション装置１０は、基準点Ｓ以外の学習データをさらに、正確に正解ラベルを付与しやすいデータ（適宜、「データＤ」）と正確に正解ラベルを付与しにくいデータ（適宜、「データＥ」）とに分類する。Thereafter, the annotation device 10 classifies the first learning data based on the first correct answer label (step S5). At this time, the annotation device 10 selects the learning data to which reliable correct answer data has been assigned based on the answer obtained from the annotator 20 as a reference point (suitably, "reference data") S. The annotation device 10 also further classifies the learning data other than the reference point S into data to which it is easy to assign an accurate correct answer label (suitably, "data D") and data to which it is difficult to assign an accurate correct answer label (suitably, "data E").

さらに、アノテーション装置１０は、分類した第１の学習データから第２の学習データを生成する（ステップＳ６）。このとき、アノテーション装置１０は、発信源が同一である基準点Ｓ、データＥおよびデータＤを含むデータ群を生成する。なお、第１の学習データの分類や第２の学習データの生成については後述する。Furthermore, the annotation device 10 generates second learning data from the classified first learning data (step S6). At this time, the annotation device 10 generates a data group including the reference point S, data E, and data D that have the same source. The classification of the first learning data and the generation of the second learning data will be described later.

そして、アノテーション装置１０は、生成した第２の学習データをアノテータ２０に配信する（ステップＳ７）。このとき、アノテーション装置１０は、第２の学習データのデータ群を配信するときに、基準点Ｓを視聴した後、データＥ、データＤを視聴するように、各データをアノテータ２０に配信する。また、アノテータ２０は、配信された第２の学習データに正解ラベル（第２の正解ラベル）を付与する（ステップＳ８）。最後に、アノテーション装置１０は、正解ラベルを付与された第２の学習データを取得する（ステップＳ９）。Then, the annotation device 10 distributes the generated second learning data to the annotator 20 (step S7). At this time, when distributing the data group of the second learning data, the annotation device 10 distributes each data to the annotator 20 so that the user views the reference point S, followed by data E and data D. The annotator 20 also assigns a correct answer label (second correct answer label) to the distributed second learning data (step S8). Finally, the annotation device 10 acquires the second learning data to which the correct answer label has been assigned (step S9).

本実施形態に係るアノテーションシステム１００では、アノテーション装置１０が、正解ラベルを付与したい事象の信頼できるデータをデータ群に含めて、かつそれを明示する。このため、アノテータ２０がそれらのデータを比較対象として活用できるようになり、より正確なアノテーションを実現することができる。In the annotation system 100 according to this embodiment, the annotation device 10 includes reliable data for the event to which the correct label is to be assigned in the data group and indicates it clearly. This allows the annotator 20 to use the data as a comparison target, enabling more accurate annotation.

［アノテーション装置の構成］
図２を用いて、本実施形態に係るアノテーション装置１０の構成を詳細に説明する。図２は、本実施形態に係るアノテーション装置の構成例を示すブロック図である。アノテーション装置１０は、入力部１１、出力部１２、通信部１３、記憶部１４および制御部１５を有する。 [Configuration of annotation device]
The configuration of the annotation device 10 according to this embodiment will be described in detail with reference to Fig. 2. Fig. 2 is a block diagram showing an example of the configuration of the annotation device according to this embodiment. The annotation device 10 has an input unit 11, an output unit 12, a communication unit 13, a storage unit 14, and a control unit 15.

入力部１１は、当該アノテーション装置１０への各種情報の入力を司る。入力部１１は、例えば、マウスやキーボード等であり、当該アノテーション装置１０への設定情報等の入力を受け付ける。また、出力部１２は、当該アノテーション装置１０からの各種情報の出力を司る。出力部１２は、例えば、ディスプレイ等であり、当該アノテーション装置１０に記憶された設定情報等を出力する。The input unit 11 is responsible for inputting various types of information to the annotation device 10. The input unit 11 is, for example, a mouse or a keyboard, and accepts input of setting information, etc. to the annotation device 10. The output unit 12 is responsible for outputting various types of information from the annotation device 10. The output unit 12 is, for example, a display, and outputs setting information, etc. stored in the annotation device 10.

通信部１３は、他の装置との間でのデータ通信を司る。例えば、通信部１３は、各通信装置との間でデータ通信を行う。また、通信部１３は、図示しないオペレータの端末との間でデータ通信を行うことができる。The communication unit 13 is responsible for data communication with other devices. For example, the communication unit 13 performs data communication with each communication device. The communication unit 13 can also perform data communication with an operator's terminal (not shown).

記憶部１４は、制御部１５が動作する際に参照する各種情報や、制御部１５が動作した際に取得した各種情報を記憶する。ここで、記憶部１４は、例えば、ＲＡＭ（Random Access Memory）、フラッシュメモリ等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置等である。なお、図２の例では、記憶部１４は、アノテーション装置１０の内部に設置されているが、アノテーション装置１０の外部に設置されてもよいし、複数の記憶部が設置されていてもよい。The memory unit 14 stores various information referenced when the control unit 15 operates and various information acquired when the control unit 15 operates. Here, the memory unit 14 is, for example, a semiconductor memory element such as a random access memory (RAM) or a flash memory, or a storage device such as a hard disk or an optical disk. Note that in the example of FIG. 2, the memory unit 14 is installed inside the annotation device 10, but it may be installed outside the annotation device 10, or multiple memory units may be installed.

記憶部１４は、後述するデータベース３０から取得した第１の学習データ、アノテータ２０から取得した第１の正解ラベルが付与された第１の学習データ、制御部１５の分類部１５ｃが分類した分類結果、生成部１５ｄが生成した第２の学習データ、アノテータ２０から取得した第２の正解ラベルが付与された第２の学習データ等の他、アノテータ２０の情報として、ユーザ名や機械学習モデルの識別番号等を記憶する。The memory unit 14 stores the first learning data obtained from the database 30 described below, the first learning data to which a first correct answer label obtained from the annotator 20 has been assigned, the classification results classified by the classification unit 15c of the control unit 15, the second learning data generated by the generation unit 15d, the second learning data to which a second correct answer label obtained from the annotator 20 has been assigned, etc., as well as information on the annotator 20, such as the user name and the identification number of the machine learning model.

制御部１５は、当該アノテーション装置１０全体の制御を司る。制御部１５は、取得部１５ａ、第１配信部１５ｂ、分類部１５ｃ、生成部１５ｄおよび第２配信部１５ｅを有する。ここで、制御部１５は、例えば、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等の電子回路やＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路である。The control unit 15 is responsible for controlling the entire annotation device 10. The control unit 15 has an acquisition unit 15a, a first distribution unit 15b, a classification unit 15c, a generation unit 15d, and a second distribution unit 15e. Here, the control unit 15 is, for example, an electronic circuit such as a CPU (Central Processing Unit) or an MPU (Micro Processing Unit), or an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

取得部１５ａは、機械学習に用いられる第１の学習データを取得する。例えば、取得部１５ａは、音声、画像または動画を含む第１の学習データを取得する。また、取得部１５ａは、データベース３０から第１の学習データを取得する。また、取得部１５ａは、アノテータ２０から正解ラベルが付与された学習データを取得する。さらに、取得部１５ａは、第１の学習データ、正解ラベルが付与された学習データ等を記憶部１４に格納する。The acquisition unit 15a acquires first learning data used for machine learning. For example, the acquisition unit 15a acquires first learning data including audio, images, or videos. The acquisition unit 15a also acquires the first learning data from the database 30. The acquisition unit 15a also acquires learning data to which a correct answer label has been assigned from the annotator 20. The acquisition unit 15a further stores the first learning data, the learning data to which a correct answer label has been assigned, etc. in the memory unit 14.

第１配信部１５ｂは、取得部１５ａによって取得された第１の学習データを複数のアノテータ２０に配信する。例えば、第１配信部１５ｂは、第１の正解ラベルとして、所定の数字を付与させる形式の第１の学習データを配信する。また、第１配信部１５ｂは、アノテータ２０として、機械学習モデルに第１の学習データを配信する。なお、第１の学習データおよび第１の正解ラベルの詳細な処理については後述する。The first distribution unit 15b distributes the first learning data acquired by the acquisition unit 15a to multiple annotators 20. For example, the first distribution unit 15b distributes the first learning data in a format in which a predetermined number is assigned as a first correct answer label. The first distribution unit 15b also distributes the first learning data to a machine learning model as an annotator 20. Detailed processing of the first learning data and the first correct answer label will be described later.

分類部１５ｃは、各アノテータによって第１の学習データにそれぞれ付与された第１の正解ラベルの信頼度に基づいて、第１の学習データを分類する。例えば、分類部１５ｃは、第１の学習データを、信頼度として第１の正解ラベルの分散に基づいて、基準データ、正確に正解ラベルを付与しやすいデータ、または正確に正解ラベルを付与しにくいデータに分類する。また、分類部１５ｃは、第１の学習データを、信頼度として第１の正解ラベルの事後確率に基づいて分類する。さらに、分類部１５ａは、第１の正解ラベルの信頼度の計算結果、信頼度に基づく分類結果を記憶部１４に格納する。The classification unit 15c classifies the first learning data based on the reliability of the first correct label assigned to the first learning data by each annotator. For example, the classification unit 15c classifies the first learning data into reference data, data to which the correct label is easily assigned accurately, and data to which the correct label is difficult to assign accurately, based on the variance of the first correct label as the reliability. The classification unit 15c also classifies the first learning data based on the posterior probability of the first correct label as the reliability. Furthermore, the classification unit 15a stores the calculation result of the reliability of the first correct label and the classification result based on the reliability in the storage unit 14.

ここで、信頼度とは、アノテータ２０が人である場合は、ある学習データに対する各アノテータの正解ラベルの数値の分散であるが、特に限定されない。信頼度に用いる指標は、数値のばらつきを表わすものであればよく、数値のばらつきが小さいほど正解ラベルの信頼度が高い。また、信頼度とは、アノテータ２０が機械学習モデルである場合は、ある学習データに対する機械学習モデルの推定結果となる数値の事後確率であるが、特に限定されない。信頼度に用いる指標は、機械学習モデルの推定結果の精度を表わすものであればよく、推定結果の精度が大きいほど正解ラベルの信頼度が高い。Here, when the annotator 20 is a human, the reliability is the variance of the numerical values of the correct labels of each annotator for certain training data, but is not particularly limited thereto. The index used for the reliability may be one that represents the variance of the numerical values, and the smaller the variance of the numerical values, the higher the reliability of the correct labels. Also, when the annotator 20 is a machine learning model, the reliability is the posterior probability of the numerical values that are the estimation results of the machine learning model for certain training data, but is not particularly limited thereto. The index used for the reliability may be one that represents the accuracy of the estimation results of the machine learning model, and the higher the accuracy of the estimation results, the higher the reliability of the correct labels.

生成部１５ｄは、分類結果として、基準データであって極値の異なる複数の基準データ、正確に正解ラベルを付与しやすいデータ、および正確に正解ラベルを付与しにくいデータを含み、かつ、各データの発生源が同一のデータ群である第２の学習データを生成する。さらに、生成部１５ｄは、第２の学習データ等の分類結果を記憶部１４に格納する。The generating unit 15d generates, as a classification result, second learning data that includes a plurality of reference data having different extreme values, data that is easy to accurately assign a correct answer label to, and data that is difficult to accurately assign a correct answer label to, and the data are a data group whose source is the same. Furthermore, the generating unit 15d stores the classification result of the second learning data, etc., in the memory unit 14.

ここで、極値とは、例えば、正解ラベルとして、集中度等の特定の状態の度合いを５段階｛１，２，３，４，５｝の数字で判定させる形式の学習データであった場合、最小の数字「１」と最大の数字「５」であるが、特に限定されない。極値は、アノテータ２０が極端な状態であると明確に判定できることを示す数字であればよく、事前に正解ラベルとして設定された数字の範囲の最小値または最大値に限られない。Here, the extreme values are, for example, the minimum number "1" and the maximum number "5" in the case of learning data in a format in which the degree of a particular state such as concentration is judged as a correct label using a number on a 5-point scale {1, 2, 3, 4, 5}, but are not particularly limited thereto. The extreme values are not limited to the minimum or maximum value of the range of numbers set in advance as the correct label, as long as they are numbers that indicate that the annotator 20 can clearly judge that the state is extreme.

第２配信部１５ｅは、分類部１５ｃによって分類された第１の学習データの分類結果を配信する。例えば、第２配信部１５ｅは、極値の異なる複数の基準データを最初に配信する。また、第２配信部１５ｅは、分類結果を第１の学習データを配信した複数のアノテータ、または第１の学習データを配信した複数のアノテータ以外の所定のアノテータに配信する。The second distribution unit 15e distributes the classification results of the first learning data classified by the classification unit 15c. For example, the second distribution unit 15e first distributes multiple reference data with different extreme values. In addition, the second distribution unit 15e distributes the classification results to multiple annotators who distributed the first learning data, or to a specified annotator other than the multiple annotators who distributed the first learning data.

ここで、分類結果とは、付与された正解ラベルの信頼度に基づいて分類部１５ｃによって分類された第１の学習データであり、例えば、基準点Ｓ（基準データ）、データＥ（正確に正解ラベルを付与しやすいデータ）およびデータＤ（正確に正解ラベルを付与しにくいデータ）の３分類がラベリングされた学習データであるが、特に限定されない。分類結果は、正解ラベルの信頼度がラベリングされた学習データであってもよいし、生成部１５ｄによって選定された学習データであってもよい。Here, the classification result is the first learning data classified by the classification unit 15c based on the reliability of the assigned correct answer label, and is, for example, learning data labeled with three categories: reference point S (reference data), data E (data to which it is easy to assign an accurate correct answer label), and data D (data to which it is difficult to assign an accurate correct answer label), but is not particularly limited thereto. The classification result may be learning data labeled with the reliability of the correct answer label, or may be learning data selected by the generation unit 15d.

［アノテーション処理の具体例］
図３～図５を用いて、本実施形態に係るアノテーション装置１０のアノテーション処理の具体例を説明する。図３は、第１の実施形態に係る学習データの一例を示す図である。図４は、第１の実施形態に係る第１の学習データと第１の正解ラベルの一例を示す図である。図５は、第１の実施形態に係る第２の学習データと第２の正解ラベルの一例を示す図である。 [Specific example of annotation processing]
A specific example of the annotation process of the annotation device 10 according to the present embodiment will be described with reference to Fig. 3 to Fig. 5. Fig. 3 is a diagram showing an example of training data according to the first embodiment. Fig. 4 is a diagram showing an example of first training data and a first correct label according to the first embodiment. Fig. 5 is a diagram showing an example of second training data and a second correct label according to the first embodiment.

（第１のアノテーション処理）
第１に、第１の学習データの取得から、第１の正解ラベルの付与された第１の学習データの取得までの、第１のアノテーション処理について説明する。まず、図３を用いて、アノテーション装置１０がデータベース３０等から取得する第１の学習データについて説明する。ここで、取得する第１の学習データは、音声、画像、動画等のデータであって、研究や開発の目的に応じた媒体、規模で取得されたデータである。例えば、音声からの集中度推定の実現に向けてアノテーションを行う場合、アノテーション装置１０は、データベース３０の中で音声データを記憶する音声データベースから音声データを取得する。 (First annotation process)
First, the first annotation process from acquisition of the first learning data to acquisition of the first learning data to which the first correct answer label is assigned will be described. First, the first learning data acquired by the annotation device 10 from the database 30 or the like will be described with reference to FIG. 3. Here, the acquired first learning data is data such as audio, images, videos, etc., and is data acquired in a medium and on a scale according to the purpose of research and development. For example, when performing annotation to realize concentration level estimation from audio, the annotation device 10 acquires audio data from an audio database that stores audio data in the database 30.

図３は、音声データを保持するデータセットＸを示した図であり、データセットＸには、｛ｘ_０，ｘ_１，ｘ_２，ｘ_３，ｘ_４，・・・ｘ_Ｎ｝の音声データが含まれる。なお、図３に示した音声データは、音声波形を時間経過と音声信号強度との関係として表したものである。 Fig. 3 shows a data set X that holds voice data, and the data set X includes voice data { _x0 , _x1 , _x2 , _x3 , _x4 , ... _xN }. The voice data shown in Fig. 3 represents a voice waveform as a relationship between the passage of time and the voice signal strength.

以降の説明では、音声データを第１の学習データとして用いたアノテーション処理について説明するが、学習データの種類は特に限定されない。第１の学習データは、音声データ以外にも、画像データ、動画データ、またはそれらの組み合わせであってもよい。さらに、第１の学習データは、上記の音声データ等を数値化、テキスト化したデータであってもよい。In the following explanation, annotation processing using audio data as the first training data will be described, but the type of training data is not particularly limited. The first training data may be image data, video data, or a combination thereof in addition to audio data. Furthermore, the first training data may be data obtained by digitizing or converting the above-mentioned audio data, etc., into text.

次に、図４を用いて、アノテーション装置１０がアノテータ２０に配信する第１の学習データと、アノテーション装置１０がアノテータ２０から取得する第１の学習データに付与された第１の正解ラベルについて説明する。例えば、音声からの集中度推定の実現に向けてアノテーションを行う場合、アノテーション装置１０は、５段階の集中度（「１」：集中していない、「２」：やや集中していない、「３」：どちらとも言えない（フラット）、「４」：やや集中している、「５」：集中している）を事前に設定し、データセットＸが保持する各音声データに対して、どの集中度が最も適しているかの正解ラベルを付与させる第１の学習データをアノテータ２０に配信する。Next, the first learning data that the annotation device 10 distributes to the annotator 20 and the first correct label assigned to the first learning data that the annotation device 10 acquires from the annotator 20 will be described with reference to FIG. 4. For example, when performing annotation to realize concentration level estimation from voice, the annotation device 10 pre-sets five levels of concentration levels ("1": not concentrated, "2": slightly not concentrated, "3": neither concentrated nor unconcentrated (flat), "4": slightly concentrated, "5": concentrated), and distributes the first learning data to the annotator 20, which assigns a correct label indicating which concentration level is most appropriate for each voice data held by the dataset X.

なお、アノテーション装置１０は、音声からの集中度推定の実現に向けてアノテーションを行う場合、例えば、授業中の教師と生徒の発問と対話に関する音声データから、発問された生徒が授業に集中していたか、集中していなかったか、また、どのくらい集中していたか等を５段階で判定するための学習データを配信する。また、アノテーション装置１０は、画像データや動画データから集中度に関する正解ラベルを付与する場合は、授業中の画像や動画から、生徒の表情等をアノテータ２０に読み取らせ、集中度を判定するための学習データを配信してもよい。When annotation device 10 performs annotation to estimate concentration level from voice, it distributes learning data for judging, for example, from voice data regarding questions and dialogue between a teacher and a student during class, whether the student to whom a question was asked was concentrating on the class, or not, and how concentrated he or she was, on a five-point scale. When annotation device 10 assigns correct answer labels regarding concentration level from image data or video data, it may distribute learning data for judging the concentration level by having annotator 20 read the facial expressions of students from images or videos during class.

そして、アノテーション装置１０は、アノテータ２０によって正解ラベルを付与された第１の学習データを取得する。図３では、「アノテータ０１」～「アノテータ０３」のデータセットＸの音声データ「ｘ_０」～「ｘ_Ｎ」に対する正解ラベルが示されている（図４「ＡＮＮＯＴ１（Ｘ）」参照）。例えば、音声データｘ_０についての「アノテータ０１」～「アノテータ０３」が付与した正解ラベルは、それぞれ、「２」、「１」、「１」である。 Then, the annotation device 10 acquires the first learning data to which the correct labels have been assigned by the annotator 20. In Fig. 3, the correct labels for the speech data " _x0 " to " _xN " of the data set X of "annotator 01" to "annotator 03" are shown (see "ANNOT1(X)" in Fig. 4). For example, the correct labels assigned by "annotator 01" to "annotator 03" to the speech data _x0 are "2", "1", and "1", respectively.

（第２のアノテーション処理）
第２に、第１の正解ラベルの付与された第１の学習データの分類から、第２の正解ラベルの付与された第２の学習データの取得までの、第２のアノテーション処理について説明する。まず、図４を用いて、アノテータ２０から取得した正解ラベルの信頼度に基づく、第１の学習データの分類処理の具体例について説明する。アノテーション装置１０は、各音声データに付与された正解ラベルの平均および分散の数値を算出する。 (Second annotation process)
Secondly, the second annotation process from classification of the first learning data assigned with the first correct label to acquisition of the second learning data assigned with the second correct label will be described. First, a specific example of classification process of the first learning data based on the reliability of the correct label acquired from the annotator 20 will be described with reference to Fig. 4. The annotation device 10 calculates the average and variance values of the correct label assigned to each piece of speech data.

図４において、ｘ_０では、平均「１．３」、分散「０．３」（分散小）である。同様にして、ｘ_１では、平均「５」、分散「０」（全アノテータの回答が一致）、ｘ_２では、平均「１」、分散「０」（全アノテータの回答が一致）、ｘ_３では、平均「３．３」、分散「０．３」（分散小）、ｘ_４では、平均「４．０」、分散「１．０」（分散大）、ｘ_Ｎでは、平均「１．６」、分散「１．３」（分散大）である。 In Fig. 4, _x0 has a mean of 1.3 and a variance of 0.3 (small variance). Similarly, _x1 has a mean of 5 and a variance of 0 (all annotators' answers match), _x2 has a mean of 1 and a variance of 0 (all annotators' answers match), _x3 has a mean of 3.3 and a variance of 0.3 (small variance), _x4 has a mean of 4.0 and a variance of 1.0 (large variance), and _xN has a mean of 1.6 and a variance of 1.3 (large variance).

このとき、アノテーション装置１０は、アノテータ２０から取得した回答から、信頼できる正解データを付与された学習データを基準点Ｓとして選定する。図４の例では、全アノテータの回答が一致していて、かつ、付与された正解ラベルの数値が極値であるものとして、ｘ_１（極値「５」）およびｘ_２（極値「１」）を選定する。 At this time, the annotation device 10 selects, from the answers acquired from the annotator 20, learning data to which reliable correct answer data has been assigned as a reference point S. In the example of Fig. 4, _x1 (extreme value "5") and _x2 (extreme value "1") are selected as the learning data in which the answers of all annotators match and the numerical values of the assigned correct answer labels are extreme values.

また、アノテーション装置１０は、基準点Ｓ以外の学習データをさらに、正確に正解ラベルを付与しやすいデータをデータＤとして、また正確に正解ラベルを付与しにくいデータをデータＥとして分類する。例えば、アノテーション装置１０は、信頼度の閾値を設けて、分散１．０以上であればデータＤに分類し、そうでなければデータＥに分類する。図４の例では、アノテーション装置１０は、ｘ_０とｘ_３は分散１．０未満なのでデータＥに分類し、ｘ_４とｘ_Ｎは分散１．０以上なのでデータＤに分類する。 Furthermore, the annotation device 10 further classifies the learning data other than the reference point S into data D, which is easy to accurately assign a correct label to, and data E, which is difficult to accurately assign a correct label to. For example, the annotation device 10 sets a reliability threshold, and classifies data D if the variance is 1.0 or more, and classifies data E if not. In the example of Fig. 4, the annotation device 10 classifies _x0 and _x3 into data E because their variances are less than 1.0, and classifies _x4 and _xN into data D because their variances are 1.0 or more.

なお、アノテーション装置１０は、正解ラベルとして、別途作成された機械学習モデルによる推定結果を用いる場合は、例えば、推定結果となる数値の事後確率が８０％以上である学習データを基準点Ｓ、事後確率が５０％以上で８０％未満である学習データをデータＥ、事後確率が５０％未満である学習データをデータＤとして分類する。また、アノテーション装置１０は、学習データの分類数や閾値等の分類方式を、静的に、または動的に変更することができる。When the annotation device 10 uses the estimation results from a separately created machine learning model as the correct label, it classifies, for example, learning data in which the posterior probability of the estimated numerical value is 80% or more as reference point S, learning data in which the posterior probability is 50% or more but less than 80% as data E, and learning data in which the posterior probability is less than 50% as data D. Furthermore, the annotation device 10 can statically or dynamically change the classification method, such as the number of classifications and thresholds, of the learning data.

続いて、図５を用いて、第１の学習データの分類に基づく、第２の学習データの生成処理および配信処理の具体例について説明する。アノテーション装置１０は、基準点Ｓ、データＥおよびデータＤの３種類のデータを含むデータ群を第２の学習データとして生成する。このとき、基準点Ｓについてはそれぞれの極値である「１」および「５」のデータが必ず含まれるようにする。また、各データ群のデータの発生源（話者、動画像に映る人物、オブジェクト等）は同一のものとする。 Next, a specific example of the generation and distribution process of the second learning data based on the classification of the first learning data will be described with reference to FIG. 5. The annotation device 10 generates a data group including three types of data, namely, a reference point S, data E, and data D, as the second learning data. At this time, the reference point S is always ensured to include data of "1" and "5", which are the respective extreme values. In addition, the source of the data for each data group (speaker, person appearing in a video, object, etc.) is assumed to be the same.

例えば、アノテーション装置１０は、データ群｛ｐ_０，ｐ_１，・・・ｐ_Ｍ｝を要素とするデータ群集合Ｐを生成する。ここで、データ群ｐ_０には、｛ｘ_０，ｘ_１，ｘ_２，ｘ_３，ｘ_４，ｘ_Ｎ｝（図３、図４参照）が要素として含まれ、データ群ｐ_Ｍには、｛ｘ_ａ，ｘ_ｂ，ｘ_ｃ，ｘ_ｄ，ｘ_ｅ，ｘ_ｆ｝（図３、図４では図示せず）が要素として含まれるものとする。図５の例では、アノテーション装置１０は、データ群ｐ_０として、基準点Ｓ｛ｘ_１，ｘ_２｝、データＥ｛ｘ_０，ｘ_３｝、データＤ｛ｘ_４，ｘ_Ｎ｝を選定し、データ群ｐ_Ｍとして、基準点Ｓ｛ｘ_ａ，ｘ_ｂ｝、データＥ｛ｘ_ｃ，ｘ_ｄ｝、データＤ｛ｘ_ｅ，ｘ_ｆ｝を選定している。 For example, the annotation device 10 generates a data group set P having data groups { _p0 , _p1 , ..., _pM } as elements. Here, the data group _p0 includes { _x0 , _x1 , _x2 , _x3 , _x4 , _xN } (see Figs. 3 and 4) as elements, and the data group _pM includes { _xa , xb, _xc , _xd , _xe , _xf _} (not shown in Figs. 3 and 4) as elements. In the example of Figure 5, the annotation device 10 selects reference point S{ _x1 , _x2 }, data E{ _x0 , _x3 }, and data D{ _x4 , _xN } as data group _p0 , and selects reference point S{ _xa , _xb }, data E{ _xc , _xd }, and data D{ _xe , _xf } as data group _pM .

なお、各データ群内のデータの個数や選定方法は、上述したように異なる極値である基準点Ｓを含み、かつ同一の発生源である条件を満たせば、任意に変更することができる。例えば、データの個数は、一定の範囲内でランダムな個数としてもよい。また、含まれるデータは、データＥとデータＤを２つずつ用意して、それぞれのアノテーションの結果の平均値が「１」または「５」寄りになるようにしてもよい。 The number of data in each data group and the selection method can be changed arbitrarily as long as they include reference points S that are different extreme values as described above and satisfy the condition of being from the same source. For example, the number of data may be a random number within a certain range. Also, the included data may be two each of data E and data D, and the average value of each annotation result may be closer to "1" or "5".

その後、アノテーション装置１０は、上記のように選定したデータ群を第２の学習データとしてアノテータ２０に配信する。このとき、アノテーション装置１０は、第２の学習データのデータ群を配信するときに、データ群ごとに、最初に基準点Ｓを配信し、その後、データＥ、データＤをアノテータ２０に配信する。図５の例では、アノテーション装置１０は、データ群ｐ_０の配信に際して、基準点Ｓ｛ｘ_１，ｘ_２｝、データＥ｛ｘ_０，ｘ_３｝、データＤ｛ｘ_４，ｘ_Ｎ｝の順に配信し、データ群ｐ_Ｍの配信に際して、基準点Ｓ｛ｘ_ａ，ｘ_ｂ｝、データＥ｛ｘ_ｃ，ｘ_ｄ｝、データＤ｛ｘ_ｅ，ｘ_ｆ｝の順に配信している。 Thereafter, the annotation device 10 distributes the data group selected as described above to the annotator 20 as the second learning data. At this time, when distributing the data group of the second learning data, the annotation device 10 distributes the reference point S for each data group first, and then distributes the data E and the data D to the annotator _20. In the example of Fig. 5, the annotation device 10 distributes the reference point S{ _x1 , _x2 }, the data E{ _x0 , _x3 }, and the data D{ _x4 , _xN } in this order when distributing the data group _p0 , and distributes the reference point S{ _xa , _xb }, the data E{ _xc , _xd }, and the data D{ _xe , _xf } in this order when distributing the data group pM.

なお、アノテーション装置１０は、アノテータ２０にデータＥ、データＤの順にデータを視聴するようにデータ視聴順を指示してもよい。また、アノテーション装置１０は、データＥ、データＤをランダムに視聴するように配信してもよい。さらに、アノテーション装置１０は、最初に配信した基準点Ｓについては、配信と同時に正解ラベルを提示し、アノテータ２０に正解ラベルを付与しないように指示してもよいし、学習データの分類に関わらず、全ての学習データに正解ラベルを付与するように指示してもよい。The annotation device 10 may instruct the annotator 20 to view the data in the order of data E and data D. The annotation device 10 may also distribute data E and data D to be viewed randomly. Furthermore, the annotation device 10 may present a correct label for the reference point S that was distributed first at the same time as distribution and instruct the annotator 20 not to assign a correct label, or may instruct the annotator 20 to assign a correct label to all learning data regardless of the classification of the learning data.

最後に、アノテーション装置１０は、アノテータ２０によって正解ラベルを付与された第２の学習データを取得する。図５の例では、アノテーション装置１０は、各データ群について、基準点Ｓを除くデータＥ、データＤの正解ラベルをアノテータごとに取得しており、例えば、「アノテータ０１」について、データ群ｐ_０の｛ｘ_０，ｘ_３，ｘ_４，ｘ_Ｎ｝の学習データに対して、順に｛１，４_，３，２｝の正解ラベルを取得し、データ群ｐ_Ｍの｛ｘ_ｃ，ｘ_ｄ，ｘ_ｅ，ｘ_ｆ｝の学習データに対して、順に｛２，４_，３，３｝の正解ラベルを取得している（図５「ＡＮＮＯＴ２（Ｘ）」参照）。 Finally, the annotation device 10 acquires the second learning data to which the correct labels have been assigned by the annotator 20. In the example of Fig. 5, the annotation device 10 acquires the correct labels of data E and data D excluding the reference point S for each data group for each annotator, and for example, for "annotator 01", the annotation device ₁₀ acquires the correct labels of { ₁ , _4, ₃ , 2} for the learning data of {x0, x3, x4 _, _xN } in the data group p0, and acquires the correct labels of {2, 4 _, 3, 3} for the learning data of { _xc , _xd , _xe , _xf } in the data group _pM (see "ANNOT2(X)" in Fig. 5).

なお、第２の学習データに付与された正解ラベルの最終的な処理については、特に限定されない。アノテーション装置１０は、学習データごとに多数決をとり、最も多い正解ラベルを最終的な正解ラベルとして決定してもよいし、数値の平均点を計算し、その数値を最終的な正解ラベルとして決定してもよい。Note that there is no particular limitation on the final processing of the correct labels assigned to the second training data. The annotation device 10 may take a majority vote for each training data and determine the most common correct label as the final correct label, or may calculate the average score of the numerical values and determine the numerical value as the final correct label.

［アノテーション処理の流れ］
図６を用いて、本実施形態に係るアノテーション処理の流れを詳細に説明する。図６は、第１の実施形態に係るアノテーション処理の流れの一例を示すフローチャートである。 [Annotation process flow]
The flow of annotation processing according to this embodiment will be described in detail with reference to Fig. 6. Fig. 6 is a flowchart showing an example of the flow of annotation processing according to the first embodiment.

まず、アノテーション装置１０の取得部１５ａは、データベース３０等から音声、画像、動画等を含む第１の学習データを取得する（ステップＳ１０１）。このとき、取得部１５ａは、記憶部１４から第１の学習データを取得してもよい。また、取得部１５ａは、データベース３０や記憶部１４から取得した音声データ等の元データを加工し、学習データとして適切なサイズに分割したり、適切な分類をしたりしてもよい。さらに、取得部１５ａは、入力部１１を介して外部から音声データ等を取得してもよい。First, the acquisition unit 15a of the annotation device 10 acquires first learning data including audio, images, videos, etc. from the database 30, etc. (step S101). At this time, the acquisition unit 15a may acquire the first learning data from the memory unit 14. The acquisition unit 15a may also process the original data, such as audio data, acquired from the database 30 or the memory unit 14, and divide the original data into an appropriate size as learning data or perform appropriate classification. Furthermore, the acquisition unit 15a may acquire audio data, etc. from the outside via the input unit 11.

次に、第１配信部１５ｂは、第１の学習データをアノテータ２０に配信する（ステップＳ１０２）。このとき、第１配信部１５ｂは、第１の学習データに応じて配信するアノテータ２０を選定してもよい。また、取得部１５ａは、アノテータ２０によって第１の正解ラベルを付与された第１の学習データを取得する（ステップＳ１０３）。Next, the first distribution unit 15b distributes the first learning data to the annotator 20 (step S102). At this time, the first distribution unit 15b may select the annotator 20 to distribute the first learning data according to the first learning data. In addition, the acquisition unit 15a acquires the first learning data to which the first correct answer label has been assigned by the annotator 20 (step S103).

そして、分類部１５ｃは、第１の正解ラベルの信頼度をもとに第１の学習データを分類する（ステップＳ１０４）。また、生成部１５ｄは、分類された第１の学習データから第２の学習データを生成する（ステップＳ１０５）。続いて、第２配信部１５ｅは、第２の学習データをアノテータ２０に配信する（ステップＳ１０６）。Then, the classification unit 15c classifies the first learning data based on the reliability of the first correct label (step S104). The generation unit 15d generates second learning data from the classified first learning data (step S105). Next, the second distribution unit 15e distributes the second learning data to the annotator 20 (step S106).

なお、第２配信部１５ｅは、第１の学習データを配信したアノテータ２０以外のアノテータに第２の学習データを配信することもできる。例えば、第２配信部１５ｅは、第１の学習データを人であるアノテータに配信し、第２の学習データを機械学習モデルであるアノテータに配信することもできる。In addition, the second distribution unit 15e can also distribute the second learning data to an annotator other than the annotator 20 that distributed the first learning data. For example, the second distribution unit 15e can distribute the first learning data to a human annotator and distribute the second learning data to an annotator that is a machine learning model.

最後に、取得部１５ａは、アノテータ２０によって第２の正解ラベルを付与された第２の学習データを取得し（ステップＳ１０７）、処理が終了する。なお、取得された第２の正解ラベルの精度が十分でない場合は、ステップＳ１０４～Ｓ１０７の処理を再度行ってもよい。Finally, the acquisition unit 15a acquires the second learning data to which the second correct answer label has been assigned by the annotator 20 (step S107), and the process ends. Note that if the accuracy of the acquired second correct answer label is not sufficient, the processes of steps S104 to S107 may be performed again.

［第１の学習データの分類処理の流れ］
図７を用いて、本実施形態に係る第１の学習データの分類処理の流れを詳細に説明する。図７は、第１の実施形態に係る第１の学習データの分類処理の流れの一例を示すフローチャートである。まず、アノテーション装置１０の取得部１５ａは、アノテータ２０から第１の学習データに付与された第１の正解ラベルを取得する（ステップＳ２０１）。次に、分類部１５ｃは、アノテータ２０が人である場合（ステップＳ２０２：アノテータは人）、ステップＳ２０３～Ｓ２０５の処理に基づいて、ステップＳ２０８～Ｓ２１０の分類処理を行う。 [Flow of classification process of first learning data]
The flow of the classification process of the first learning data according to the present embodiment will be described in detail with reference to Fig. 7. Fig. 7 is a flowchart showing an example of the flow of the classification process of the first learning data according to the first embodiment. First, the acquisition unit 15a of the annotation device 10 acquires the first correct answer label assigned to the first learning data from the annotator 20 (step S201). Next, when the annotator 20 is a human (step S202: annotator is a human), the classification unit 15c performs the classification process of steps S208 to S210 based on the processes of steps S203 to S205.

分類部１５ｃは、全アノテータの回答が一致し（ステップＳ２０３：肯定）、その回答が極値である場合（ステップＳ２０４：肯定）、その正解ラベルを付与された第１の学習データを基準点Ｓに分類する（ステップＳ２０８）。また、分類部１５ｃは、アノテータ２０の回答に一致しないものが含まれる場合（ステップＳ２０３：否定）、またアノテータ２０の回答が極値ではない場合（ステップＳ２０４：否定）、ステップＳ２０５の処理を行う。If the answers of all annotators match (step S203: YES) and the answers are extreme values (step S204: YES), the classification unit 15c classifies the first learning data to which the correct answer label has been assigned as the reference point S (step S208). If the answers of the annotator 20 include ones that do not match (step S203: NO) or if the answers of the annotator 20 are not extreme values (step S204: NO), the classification unit 15c performs the process of step S205.

分類部１５ｃは、アノテータ２０の回答の分散が１．０以上である場合（ステップＳ２０５：肯定）、その正解ラベルを付与された第１の学習データをデータＥに分類する（ステップＳ２０９）。また、アノテータ２０の回答の分散が１．０未満である場合（ステップＳ２０５：否定）、その正解ラベルを付与された第１の学習データをデータＤに分類する（ステップＳ２１０）。分類部１５ｃは、ステップＳ２０８～Ｓ２１０の分類処理が終了した場合、処理を終了する。If the variance of the annotator 20's answer is 1.0 or more (step S205: Yes), the classification unit 15c classifies the first learning data to which the correct answer label has been assigned as data E (step S209). If the variance of the annotator 20's answer is less than 1.0 (step S205: No), the classification unit 15c classifies the first learning data to which the correct answer label has been assigned as data D (step S210). When the classification process of steps S208 to S210 is completed, the classification unit 15c ends the process.

一方、分類部１５ｃは、アノテータ２０が機械学習モデルである場合（ステップＳ２０２：アノテータは機械学習モデル）、ステップＳ２０６～Ｓ２０７の処理に基づいて、ステップＳ２０８～Ｓ２１０の分類処理を行う。分類部１５ｃは、アノテータ２０の推定結果となる値の事後確率が８０％以上である場合（ステップＳ２０６：肯定）、その正解ラベルを付与された第１の学習データを基準点Ｓに分類する（ステップＳ２０８）。On the other hand, if the annotator 20 is a machine learning model (step S202: annotator is a machine learning model), the classification unit 15c performs classification processing of steps S208 to S210 based on the processing of steps S206 to S207. If the posterior probability of the value that is the estimated result of the annotator 20 is 80% or more (step S206: Yes), the classification unit 15c classifies the first learning data to which the correct label has been assigned as the reference point S (step S208).

また、分類部１５ｃは、アノテータ２０の推定結果となる値の事後確率が８０％未満であり（ステップＳ２０６：否定）、その事後確率が５０％以上である場合（ステップＳ２０７：肯定）、その正解ラベルを付与された第１の学習データをデータＥに分類する（ステップＳ２０９）。また、分類部１５ｃは、アノテータ２０の推定結果となる値の事後確率が５０％未満である場合（ステップＳ２０７：否定）、その正解ラベルを付与された第１の学習データをデータＤに分類する（ステップＳ２１０）。分類部１５ｃは、ステップＳ２０８～Ｓ２１０の分類処理が終了した場合、処理を終了する。Furthermore, if the posterior probability of the value that is the estimation result of the annotator 20 is less than 80% (step S206: negative) and the posterior probability is 50% or more (step S207: positive), the classification unit 15c classifies the first learning data to which the correct label has been assigned as data E (step S209). Furthermore, if the posterior probability of the value that is the estimation result of the annotator 20 is less than 50% (step S207: negative), the classification unit 15c classifies the first learning data to which the correct label has been assigned as data D (step S210). When the classification process of steps S208 to S210 is completed, the classification unit 15c ends the process.

［第１の実施形態の効果］
第１に、上述した本実施形態に係るアノテーション処理では、機械学習に用いられる第１の学習データを取得し、取得した第１の学習データを複数のアノテータに配信し、各アノテータによって第１の学習データにそれぞれ付与された第１の正解ラベルの信頼度に基づいて、前記第１の学習データを分類し、分類した第１の学習データの分類結果を配信する。このため、本処理では、機械学習における教師あり学習において、より低コストかつ高精度なアノテーションを行うことができる。 [Effects of the First Embodiment]
First, in the annotation process according to the present embodiment described above, first learning data used in machine learning is acquired, the acquired first learning data is distributed to a plurality of annotators, the first learning data is classified based on the reliability of the first correct answer labels respectively assigned to the first learning data by each annotator, and the classification results of the classified first learning data are distributed. Therefore, in this process, annotation can be performed at a lower cost and with higher accuracy in supervised learning in machine learning.

第２に、上述した本実施形態に係るアノテーション処理では、音声、画像または動画を含む第１の学習データを取得し、第１の正解ラベルとして、所定の数字を付与させる形式の第１の学習データを配信し、第１の学習データを、信頼度として第１の正解ラベルの分散に基づいて、基準データ、正確に正解ラベルを付与しやすいデータ、または正確に正解ラベルを付与しにくいデータに分類する。このため、本処理では、機械学習における教師あり学習において、比較対象がない場合であっても信頼性の高い正解ラベルの付与を可能とし、より低コストかつ高精度なアノテーションを行うことができる。 Secondly, in the annotation process according to the present embodiment described above, first learning data including audio, images or videos is acquired, the first learning data is distributed in a format in which a predetermined number is assigned as a first correct label, and the first learning data is classified into reference data, data that is easy to accurately assign a correct label to, or data that is difficult to accurately assign a correct label to, based on the variance of the first correct label as the reliability. Therefore, in the present process, in supervised learning in machine learning, it is possible to assign a highly reliable correct label even in the absence of a comparison target, and annotation can be performed at a lower cost and with higher accuracy.

第３に、上述した本実施形態に係るアノテーション処理では、アノテータとして、機械学習モデルに第１の学習データを配信し、第１の学習データを、信頼度として第１の正解ラベルの事後確率に基づいて分類する。このため、本処理では、機械学習における教師あり学習において、アノテータが人でない場合であっても信頼性の高い正解ラベルの付与を可能とし、より低コストかつ高精度なアノテーションを行うことができる。 Thirdly, in the annotation process according to the present embodiment described above, the annotator distributes first learning data to the machine learning model, and classifies the first learning data based on the posterior probability of the first correct label as the reliability. Therefore, in this process, in supervised learning in machine learning, even if the annotator is not human, it is possible to assign a highly reliable correct label, and annotation can be performed at a lower cost and with higher accuracy.

第４に、上述した本実施形態に係るアノテーション処理では、分類結果として、基準データであって極値の異なる複数の基準データ、正確に正解ラベルを付与しやすいデータ、および正確に正解ラベルを付与しにくいデータを含み、かつ、各データの発生源が同一のデータ群である第２の学習データを生成し、複数の基準データを最初に配信する。このため、本処理では、機械学習における教師あり学習において、比較対象がない場合であっても信頼性が高く、効率的な正解ラベルの付与を可能とし、より低コストかつ高精度なアノテーションを行うことができる。 Fourth, in the annotation process according to the present embodiment described above, as a classification result, second learning data is generated that is a data group that includes multiple reference data with different extreme values, data that is easy to accurately assign a correct answer label, and data that is difficult to accurately assign a correct answer label to, and each data source is the same, and the multiple reference data are distributed first. Therefore, in supervised learning in machine learning, this process enables reliable and efficient assignment of correct answer labels even in the absence of a comparison target, and can perform annotation at a lower cost and with higher accuracy.

第５に、上述した本実施形態に係るアノテーション処理では、分類結果を第１の学習データを配信した複数のアノテータ、または第１の学習データを配信した複数のアノテータ以外の所定のアノテータに配信する。このため、本処理では、機械学習における教師あり学習において、比較対象がない場合であっても信頼性が高く、効率的で、より柔軟な正解ラベルの付与を可能とし、より低コストかつ高精度なアノテーションを行うことができる。Fifth, in the annotation process according to the present embodiment described above, the classification results are distributed to the multiple annotators who distributed the first learning data, or to a specific annotator other than the multiple annotators who distributed the first learning data. Therefore, in supervised learning in machine learning, this process enables reliable, efficient, and more flexible assignment of correct answer labels even in the absence of a comparison target, and enables annotation to be performed at lower cost and with higher accuracy.

〔システム構成等〕
上記実施形態に係る図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示のごとく構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。さらに、各装置にて行なわれる各処理機能は、その全部または任意の一部が、ＣＰＵおよび当該ＣＰＵにて解析実行されるプログラムにて実現され、あるいは、ワイヤードロジックによるハードウェアとして実現され得る。 [System configuration, etc.]
Each component of each device shown in the figures according to the above embodiment is a functional concept, and does not necessarily have to be physically configured as shown in the figures. In other words, the specific form of distribution and integration of each device is not limited to that shown in the figures, and all or a part of them can be functionally or physically distributed and integrated in any unit according to various loads, usage conditions, etc. Furthermore, each processing function performed by each device can be realized in whole or in any part by a CPU and a program analyzed and executed by the CPU, or can be realized as hardware using wired logic.

また、上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文書中や図面中で示した処理手順、制御手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。 Furthermore, among the processes described in the above embodiments, all or part of the processes described as being performed automatically can be performed manually, or all or part of the processes described as being performed manually can be performed automatically by a known method. In addition, the information including the processing procedures, control procedures, specific names, various data and parameters shown in the above documents and drawings can be changed as desired unless otherwise specified.

〔プログラム〕
また、上記実施形態において説明したアノテーション装置１０が実行する処理をコンピュータが実行可能な言語で記述したプログラムを作成することもできる。この場合、コンピュータがプログラムを実行することにより、上記実施形態と同様の効果を得ることができる。さらに、かかるプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータに読み込ませて実行することにより上記実施形態と同様の処理を実現してもよい。〔program〕
It is also possible to create a program in which the processing executed by the annotation device 10 described in the above embodiment is written in a language executable by a computer. In this case, the same effect as in the above embodiment can be obtained by the computer executing the program. Furthermore, such a program may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read and executed by a computer to realize the same processing as in the above embodiment.

図８は、プログラムを実行するコンピュータを示す図である。図８に例示するように、コンピュータ１０００は、例えば、メモリ１０１０と、ＣＰＵ（Central Processing Unit）１０２０と、ハードディスクドライブインタフェース１０３０と、ディスクドライブインタフェース１０４０と、シリアルポートインタフェース１０５０と、ビデオアダプタ１０６０と、ネットワークインタフェース１０７０とを有し、これらの各部はバス１０８０によって接続される。 Figure 8 is a diagram showing a computer that executes a program. As illustrated in Figure 8, the computer 1000 has, for example, a memory 1010, a CPU (Central Processing Unit) 1020, a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface 1070, and each of these components is connected by a bus 1080.

メモリ１０１０は、図８に例示するように、ＲＯＭ（Read Only Memory）１０１１及びＲＡＭ１０１２を含む。ＲＯＭ１０１１は、例えば、ＢＩＯＳ（Basic Input Output System）等のブートプログラムを記憶する。ハードディスクドライブインタフェース１０３０は、図８に例示するように、ハードディスクドライブ１０９０に接続される。ディスクドライブインタフェース１０４０は、図８に例示するように、ディスクドライブ１１００に接続される。例えば、磁気ディスクや光ディスク等の着脱可能な記憶媒体が、ディスクドライブ１１００に挿入される。シリアルポートインタフェース１０５０は、図８に例示するように、例えば、マウス１１１０、キーボード１１２０に接続される。ビデオアダプタ１０６０は、図８に例示するように、例えばディスプレイ１１３０に接続される。 The memory 1010 includes a ROM (Read Only Memory) 1011 and a RAM 1012, as illustrated in FIG. 8. The ROM 1011 stores a boot program such as a BIOS (Basic Input Output System). The hard disk drive interface 1030 is connected to a hard disk drive 1090, as illustrated in FIG. 8. The disk drive interface 1040 is connected to a disk drive 1100, as illustrated in FIG. 8. For example, a removable storage medium such as a magnetic disk or an optical disk is inserted into the disk drive 1100. The serial port interface 1050 is connected to a mouse 1110 and a keyboard 1120, as illustrated in FIG. 8. The video adapter 1060 is connected to a display 1130, as illustrated in FIG. 8.

ここで、図８に例示するように、ハードディスクドライブ１０９０は、例えば、ＯＳ１０９１、アプリケーションプログラム１０９２、プログラムモジュール１０９３、プログラムデータ１０９４を記憶する。すなわち、上記のプログラムは、コンピュータ１０００によって実行される指令が記述されたプログラムモジュールとして、例えば、ハードディスクドライブ１０９０に記憶される。8, the hard disk drive 1090 stores, for example, an OS 1091, an application program 1092, a program module 1093, and program data 1094. That is, the above programs are stored, for example, in the hard disk drive 1090 as program modules in which instructions to be executed by the computer 1000 are written.

また、上記実施形態で説明した各種データは、プログラムデータとして、例えば、メモリ１０１０やハードディスクドライブ１０９０に記憶される。そして、ＣＰＵ１０２０が、メモリ１０１０やハードディスクドライブ１０９０に記憶されたプログラムモジュール１０９３やプログラムデータ１０９４を必要に応じてＲＡＭ１０１２に読み出し、各種処理手順を実行する。In addition, the various data described in the above embodiment are stored as program data, for example, in memory 1010 or hard disk drive 1090. Then, CPU 1020 reads out program module 1093 and program data 1094 stored in memory 1010 or hard disk drive 1090 into RAM 1012 as necessary, and executes various processing procedures.

なお、プログラムに係るプログラムモジュール１０９３やプログラムデータ１０９４は、ハードディスクドライブ１０９０に記憶される場合に限られず、例えば着脱可能な記憶媒体に記憶され、ディスクドライブ等を介してＣＰＵ１０２０によって読み出されてもよい。あるいは、プログラムに係るプログラムモジュール１０９３やプログラムデータ１０９４は、ネットワーク（ＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）等）を介して接続された他のコンピュータに記憶され、ネットワークインタフェース１０７０を介してＣＰＵ１０２０によって読み出されてもよい。In addition, the program module 1093 and program data 1094 related to the program are not limited to being stored in the hard disk drive 1090, but may be stored in, for example, a removable storage medium and read by the CPU 1020 via a disk drive or the like. Alternatively, the program module 1093 and program data 1094 related to the program may be stored in another computer connected via a network (such as a LAN (Local Area Network), WAN (Wide Area Network)), and read by the CPU 1020 via the network interface 1070.

上記の実施形態やその変形は、本願が開示する技術に含まれると同様に、請求の範囲に記載された発明とその均等の範囲に含まれるものである。The above embodiments and their variations are included in the scope of the invention and its equivalents described in the claims, as well as in the technology disclosed in this application.

１０アノテーション装置
１１入力部
１２出力部
１３通信部
１４記憶部
１５制御部
１５ａ取得部
１５ｂ第１配信部
１５ｃ分類部
１５ｄ生成部
１５ｅ第２配信部
２０、２０Ａ、２０Ｂ、２０Ｃアノテータ
３０、３０Ａ、３０Ｂ、３０Ｃデータベース
１００アノテーションシステム REFERENCE SIGNS LIST 10 Annotation device 11 Input unit 12 Output unit 13 Communication unit 14 Storage unit 15 Control unit 15a Acquisition unit 15b First distribution unit 15c Classification unit 15d Generation unit 15e Second distribution unit 20, 20A, 20B, 20C Annotator 30, 30A, 30B, 30C Database 100 Annotation system

Claims

an acquisition unit that acquires first learning data to be used in machine learning;
a first distribution unit that distributes the first learning data acquired by the acquisition unit to a plurality of annotators;
a classification unit that classifies the first training data based on a reliability of a first correct label that is assigned to each of the first training data by each annotator;
a second distribution unit that distributes a classification result of the first learning data classified by the classification unit to the plurality of annotators or a predetermined annotator other than the plurality of annotators ;
The acquisition unit acquires the first learning data including audio, images, or videos,
The first distribution unit distributes the first learning data in a format in which a predetermined number is assigned as the first correct label;
The classification unit classifies the first learning data into reference data, data that is easy to accurately assign a correct label, or data that is difficult to accurately assign a correct label based on the variance of the first correct label as the reliability.

The first distribution unit distributes the first training data to a machine learning model as the annotator;
The annotation device according to claim 1 , wherein the classification unit classifies the first learning data based on a posterior probability of the first correct label as the reliability.

a generation unit that generates second learning data as the classification result, the second learning data including a plurality of reference data having different extreme values, the data to which the correct label is easily assigned accurately, and the data to which the correct label is difficult to assign accurately, the second learning data being generated from a same data group;
The annotation device according to claim 1 , wherein the second distribution unit distributes the plurality of reference data first.

An annotation method performed by an annotation device, comprising:
An acquisition step of acquiring first learning data to be used in machine learning;
a first distribution step of distributing the first training data acquired by the acquisition step to a plurality of annotators;
a classification step of classifying the first training data based on the reliability of a first correct label assigned to each of the first training data by each annotator;
a second distribution step of distributing the classification result of the first learning data classified by the classification step to the plurality of annotators or a predetermined annotator other than the plurality of annotators ,
The acquiring step acquires the first learning data including audio, an image, or a video;
The first distribution step distributes the first learning data in a format in which a predetermined number is assigned as the first correct label;
The annotation method is characterized in that the classification step classifies the first learning data into reference data, data that is easy to accurately assign a correct label, or data that is difficult to accurately assign a correct label based on the variance of the first correct label as the reliability.

An acquisition step of acquiring first learning data used in machine learning;
a first distribution step of distributing the first training data acquired by the acquisition step to a plurality of annotators;
a classification step of classifying the first training data based on a reliability of a first correct label assigned to each of the first training data by each annotator;
a second distribution step of distributing the classification result of the first learning data classified by the classification step to the plurality of annotators or a predetermined annotator other than the plurality of annotators ;
The acquiring step acquires the first learning data including audio, an image, or a video;
The first distribution step distributes the first learning data in a format in which a predetermined number is assigned as the first correct label;
The classification step classifies the first learning data into reference data, data that is easy to accurately assign a correct label, or data that is difficult to accurately assign a correct label based on the variance of the first correct label as the reliability.

a classification unit that classifies first learning data used in machine learning based on the reliability of first correct labels that are respectively assigned by a plurality of annotators to the first learning data;
a distribution unit that distributes a classification result of the first learning data classified by the classification unit to the plurality of annotators or a predetermined annotator other than the plurality of annotators ,
The annotation device, wherein the reliability represents a variance of the first correct label.

a classification unit that classifies first learning data used in machine learning based on the reliability of first correct labels that are respectively assigned by a plurality of annotators to the first learning data;
a distribution unit that distributes a classification result of the first learning data classified by the classification unit to the plurality of annotators or a predetermined annotator other than the plurality of annotators ,
An annotation device characterized in that the classification result includes reference data selected based on the reliability.

a classification unit that classifies first learning data used in machine learning based on the reliability of first correct labels that are respectively assigned by a plurality of annotators to the first learning data;
a distribution unit that distributes a classification result of the first learning data classified by the classification unit to the plurality of annotators or a predetermined annotator other than the plurality of annotators ,
The annotation device, characterized in that the classification result includes classification of the first learning data into data that is easy to accurately assign a correct label to, or data that is difficult to accurately assign a correct label to, based on the reliability.

An annotation method performed by an annotation device, comprising:
a classification step of classifying first learning data used for machine learning based on the reliability of first correct labels respectively assigned by a plurality of annotators to the first learning data;
a distribution step of distributing the classification result of the first learning data classified by the classification step to the plurality of annotators or a predetermined annotator other than the plurality of annotators ,
The annotation method, wherein the reliability represents a variance of the first correct label.

An annotation method performed by an annotation device, comprising:
a classification step of classifying first learning data used for machine learning based on the reliability of first correct labels respectively assigned by a plurality of annotators to the first learning data;
a distribution step of distributing the classification result of the first learning data classified by the classification step to the plurality of annotators or a predetermined annotator other than the plurality of annotators ,
An annotation method, characterized in that the classification result includes reference data selected based on the reliability.

An annotation method performed by an annotation device, comprising:
a classification step of classifying first learning data used for machine learning based on the reliability of first correct labels respectively assigned by a plurality of annotators to the first learning data;
a distribution step of distributing the classification result of the first learning data classified by the classification step to the plurality of annotators or a predetermined annotator other than the plurality of annotators ,
The annotation method, characterized in that the classification result includes classification of the first learning data into data that is easy to accurately assign a correct label to, or data that is difficult to accurately assign a correct label to, based on the reliability.

a classification step of classifying first learning data used for machine learning based on the reliability of first correct labels respectively assigned by a plurality of annotators to the first learning data;
a distribution step of distributing the classification result of the first learning data classified by the classification step to the plurality of annotators or a predetermined annotator other than the plurality of annotators ;
The annotation program, wherein the reliability represents a variance of the first correct label.

a classification step of classifying first learning data used for machine learning based on the reliability of first correct labels respectively assigned by a plurality of annotators to the first learning data;
a distribution step of distributing the classification result of the first learning data classified by the classification step to the plurality of annotators or a predetermined annotator other than the plurality of annotators ;
The annotation program, wherein the classification result includes reference data selected based on the reliability.

a classification step of classifying first learning data used for machine learning based on the reliability of first correct labels respectively assigned by a plurality of annotators to the first learning data;
a distribution step of distributing the classification result of the first learning data classified by the classification step to the plurality of annotators or a predetermined annotator other than the plurality of annotators ;
The annotation program, characterized in that the classification result includes classification of the first learning data into data that is easy to accurately assign a correct label to, or data that is difficult to accurately assign a correct label to, based on the reliability.